BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] (374 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 776 bits (2004), Expect = 0.0, Method: Compositional matrix adjust. Identities = 374/374 (100%), Positives = 374/374 (100%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH Sbjct: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV Sbjct: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV Sbjct: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV Sbjct: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL Sbjct: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH Sbjct: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 Query: 361 IGKDIVTKRIWYDK 374 IGKDIVTKRIWYDK Sbjct: 361 IGKDIVTKRIWYDK 374 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 408 bits (1048), Expect = e-112, Method: Compositional matrix adjust. Identities = 203/370 (54%), Positives = 273/370 (73%), Gaps = 3/370 (0%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 LNIRNFFYN KG ++ILTAI LP+IF+V+G++IE SH FF+K LH ++D SL++ AT+I Sbjct: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +N+ NGNN KK K IKN W FRNELR+NGF DI++I RSTSL I++ Q++ Sbjct: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y++SA+SRY++P FCTF PW NS H + ITSSVK++S++D LDMM+VLDVS SM Sbjct: 124 DYNLSAVSRYEMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 F + K+ +A +SI ML+ +K IPDVNNVV+SGLVTFS+KI + F L WGV H+Q Sbjct: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-E 304 KI L FG +T STPGL+YAYN+IFD + +H +YKK I+F+TDGEN S + Sbjct: 243 KINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 +++SL+YCNEAK+RGAIVYAIG++ + +FL+ CASP+ FY V+N ++DAF IGK+ Sbjct: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 Query: 365 IVTKRIWYDK 374 +V +RI Y+K Sbjct: 362 MVKQRILYNK 371 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 206 bits (525), Expect = 4e-51, Method: Compositional matrix adjust. Identities = 118/370 (31%), Positives = 215/370 (58%), Gaps = 29/370 (7%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + IRNFF N +G +TI +AI P+I +++ ++ E+S+I+ K L ++IDR+L+ T I Sbjct: 4 IKIRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTVTMI 63 Query: 66 MNEGNGNNRKKLKGGDILCR----IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 KLK + + + + W + + EL + F +D+ +++ TS+ + Sbjct: 64 ----------KLKNIEDVVKNVGPVNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKLESD 113 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + SI+AIS+YK+P K C ++++ +P+ SS+K+ + +D+MIVLDVS Sbjct: 114 SNFKTLSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVLDVS 173 Query: 182 RSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 SM+ F ++ +++++A KSI MLE+ + +P+ NV ++G V F++ ++ L+ Sbjct: 174 SSMDDNFMKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLK 233 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-NYKKIIVFMTD 296 G+ + IK FG STNS G+KYA+ Q++ N +D + KKI++F+TD Sbjct: 234 RGLKRIYNDIKKYRAFG-STNSYVGMKYAWEQLYG--------NPQDTKDRKKIVIFLTD 284 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 GEN+ ++++ CN+ KK+ A++Y+I + V + E L+ C+S + Y ++ S+ Sbjct: 285 GENMIINATRKTIELCNDMKKKKAVIYSIALAV-DNKEVLQGCSSSGNVYAADDAQSLVQ 343 Query: 357 AFSHIGKDIV 366 A+S IGKD++ Sbjct: 344 AYSLIGKDVM 353 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 84.3 bits (207), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 77/367 (20%), Positives = 154/367 (41%), Gaps = 37/367 (10%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR N G ++ A +P++F+ + ++ ++ MK L + +D + + A ++ Sbjct: 10 IRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALATAARLSE 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRN-ELRD-----NGFVNDIDDIVRSTSLDIVVV 121 E N + + + F N ++++ NGF + + ++D V Sbjct: 70 EEN-----------LTAAQAQAFALKFVNGQVKEDFGAFNGF-----SVTPTVNIDPV-- 111 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + G ++ ++ + T + + + + + + M +VLD S Sbjct: 112 -ETGGRTVWKVAVSMEGSQSLTPMARIMGKDKLTVSVVGKSESAGEAQGAFSMALVLDRS 170 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM+ + KI++ ++ ++E+ + V+ G ++++K+ L W Sbjct: 171 GSMDWNLNGQ-KKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPG 229 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY----KKIIVFMTDG 297 + + L G T+ST +AY + ++ NT DA KK IVFMTDG Sbjct: 230 KTKEFVDALPASG-GTDSTDAFDWAYTAVT----HKRENNTHDAKSGQVPKKFIVFMTDG 284 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNS-FYLVENPHSMY 355 +N + D + + C++AK G VY + R + L CAS F+ +N + Sbjct: 285 DNNYSSADSSTKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLI 344 Query: 356 DAFSHIG 362 +AF +IG Sbjct: 345 EAFKNIG 351 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 62/210 (29%), Positives = 100/210 (47%), Gaps = 21/210 (10%) Query: 178 LDVSRSMESFFDSS-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 LD S S E F DSS I K + ++ +++ +K I +VN+ V+ G F++++ Sbjct: 249 LDPSLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVIS 308 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNS------TPGLKYAYNQIFDMQGMRQHCNTEDAN 286 WGV L R I + F + N ++ AY+ I H + Sbjct: 309 DPSFSWGVHKLIRTI--VKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLE 366 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-----FLRACAS 341 KK IV +TDGEN T+++++ + CN+AK +G + I V ++ + FL CAS Sbjct: 367 AKKYIVLLTDGEN--TQDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS 424 Query: 342 PNSFYLVENPHSMYDAF-SHIGKDIVTKRI 370 PNSF+ + H + F IG +I + I Sbjct: 425 PNSFFEANSTHELNKIFRDRIGNEIFERVI 454 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 28/130 (21%), Positives = 58/130 (44%), Gaps = 23/130 (17%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G I+TA+ +P++ V GM+++V + + L +++ A+ ++ Sbjct: 20 GHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQSLE------ 73 Query: 77 LKGGDILCRIKNTWNM-----------SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 ++ R KN++ +F N L+ N ++ DIVR T++++ P+ Sbjct: 74 ----EVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEM--NPRKS 127 Query: 126 GYSISAISRY 135 Y + SRY Sbjct: 128 AYQVVLSSRY 137 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 75.9 bits (185), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 47/174 (27%), Positives = 88/174 (50%), Gaps = 7/174 (4%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 TKI+ ++ + E+ + V++G V+++ ++++ L+WG +H+ KY++K Sbjct: 218 TKIEALKLAVGTLTGELDAVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHV---TKYVNK 274 Query: 253 FGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T+S K AYN++ D + H + K IVFMTDG+N T D ++ Sbjct: 275 LTATDGTDSGEAFKTAYNKLADAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSADTETKT 334 Query: 311 YCNEAKKRGAIVYAIG-IRVIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIG 362 +C++A+ VY I + R L CA +P +++ + ++ AF IG Sbjct: 335 WCDKARDAKMQVYTIAFMAPARGQALLSYCATAPGNYFPAGDMTALLKAFKEIG 388 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 44/174 (25%), Positives = 83/174 (47%), Gaps = 4/174 (2%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK-IKYL 250 I KI ++ +L+++ + V++ + +S++++ L WG + + I L Sbjct: 217 IKKIAALKTAVGTLLDQLDSADPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRSNVISGL 276 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + G + +S P + AY + + ++KIIV MTDGEN +T D ++L Sbjct: 277 NANGGTESSAP-MALAYKNVSASSEATAQAAKGNTTFQKIIVLMTDGENNATSSDTKTLA 335 Query: 311 YCNEAKKRGAIVYAIGIRVI-RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIG 362 C AK G ++Y++ R L+ CA SP++++ + + AF IG Sbjct: 336 TCKAAKDAGVLIYSVAFMAPDRGQTLLKNCASSPSNYFDAQQMSDLIAAFKTIG 389 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 86/407 (21%), Positives = 162/407 (39%), Gaps = 54/407 (13%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 L+ + G ++TA+ P++ V G+ ++V+++ K L D + + AA+ Sbjct: 8 GLSFMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASA 67 Query: 65 IMNEGNGNNRKKLKGGDILCR--IKNTWNMSFRNELRDNGFV-------NDIDDI----- 110 ++++ R ++ L R +K + ++L D G N D++ Sbjct: 68 LVSDA----RPDIEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPR 123 Query: 111 ---VRSTSLDIVVVPQN-EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V +T +DI P +G S K L+F + I + S+ + + Sbjct: 124 WEDVNATEIDITATPNGAKGKSFQVTVANKHLLQF-NAMTRLLGPESIEIETRSTAESAT 182 Query: 167 QTDARLDMMIVLDVSRSM----------------------------ESFFDSSITKIDMA 198 ++ L M +VLD S SM ++ +TKID Sbjct: 183 ESKNALSMYLVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDAL 242 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 ++ +L ++ + V++G +++++ + L WG + L G T Sbjct: 243 KTAVGDLLAQLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAIG-GTA 301 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 S K A+ ++ + +H K IVFMTDGEN +D + +C+ AK Sbjct: 302 SGNAFKTAFQKVTNAAEDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDTAKAS 361 Query: 319 GAIVYAIGIRVI-RSHEFLRACASPNSFYL-VENPHSMYDAFSHIGK 363 +Y++ R + L++CAS +S Y E + AF IG+ Sbjct: 362 KVQIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 63.5 bits (153), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 43/180 (23%), Positives = 86/180 (47%), Gaps = 10/180 (5%) Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKF-GVST 257 ++ ++ V+ I ++ + ++ + F+ +I+ +F + WG+ ++++ Y K +T Sbjct: 280 ALATFIKRVRKIDNLKDKLRMSFMYFNERIDHYFPMTWGIKEFKQEVSSHYKRKHENTAT 339 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAK 316 + P L+ AYN++ +H KK IV +TDG +N L C+ AK Sbjct: 340 DIHPILQEAYNKLHSKNEDDEHKKKNSVEVKKFIVLLTDGAQNEGVHSVDSVLKICDAAK 399 Query: 317 KRGAIVYAIGIRV-----IRSHEFLRACASPNSFYLVENPHSMYDAF-SHIGKDIVTKRI 370 + G ++ I V ++++FL CASP+ F+ + + F HIG I + + Sbjct: 400 EEGIKIFTISYSVDSSERKKANDFLSRCASPDKFFEAYDADKLNMIFKEHIGDAIFERLV 459 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 46/201 (22%), Positives = 90/201 (44%), Gaps = 20/201 (9%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 SL N + Y G +++A+ LP+IF+V+G++I++ + L ++ + + A+ Sbjct: 6 SLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSAS 65 Query: 63 TQIMNEGNGNNRKK-----LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 Q++N +++K L +I + N +S N + D I++ T ++ Sbjct: 66 VQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGE----MDSQRIIQHTKVN 121 Query: 118 IVVVPQNEGYSISAISRYKIPLK-FCTFIPWYTNSRHIVMPITS------SVKVNSQTDA 170 I + + I+ S Y +PL F F N + PIT+ + K N + Sbjct: 122 I--YNRKGTHIINVYSHYNLPLNPFSLFFMNLINIKS--WPITTVGEAEVTSKKNYHKEE 177 Query: 171 RLDMMIVLDVSRSMESFFDSS 191 + + ++D S SM S D + Sbjct: 178 GVSVQWLIDDSGSMGSIIDRA 198 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 77/376 (20%), Positives = 148/376 (39%), Gaps = 30/376 (7%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 I N G + L I + ++ V+ + I++S F +T L + D + V ++ Sbjct: 70 GIPNAHKERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALL 129 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 E + D RI + +S + + + + + V Sbjct: 130 AEEITIAQANAYAKDAFNRIAS--GLSASGDGSSGTSIFGTMTVKPAVQITETVDGNTTN 187 Query: 127 YSISAISRYKIPLKFCTFI--PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y + KIP +F+ T I + S ++ A L M +VLD S SM Sbjct: 188 YVVKVNGTAKIPASPLSFMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGSM 247 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SNKIEEFFLLEWGV 240 S ++++ A++S +++E++ + + + G + + K E L W Sbjct: 248 GWERPSRMSELKKAVRS---LIKELQTVDPDDQFTRLGAYAYHWYYAGKKE----LTWNK 300 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 + ++ + L G T + P ++ A N + + H N + I++MTDG Sbjct: 301 NSVRSWVNSLPASG-GTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDG 359 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASPNS-FYLVENPHSM 354 N + +E C AK G +Y + + S L+ACA+ ++ +Y +N + + Sbjct: 360 DPNWAKRE-------CTSAKNAGITIYTVAFKAPASGRNLLKACATSDAHYYDAKNANEL 412 Query: 355 YDAFSHIGKDIVTKRI 370 F I ++ TK I Sbjct: 413 NKVFKDIARE-TTKSI 427 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 86/400 (21%), Positives = 169/400 (42%), Gaps = 51/400 (12%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN-------- 67 K +I+ A+ + L++G +I V + K + S + +++ A+++++ Sbjct: 19 KANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDR 78 Query: 68 -EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDIVRSTSLDIVVVPQNE 125 E N+ K+ D IKN S F N +I +IV S+ + + + N Sbjct: 79 FESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQNIVNSSRISMTHMANNR 136 Query: 126 --------GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Y++ ++ Y L+F + ++ IV I + +++ + +V Sbjct: 137 LDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELV 196 Query: 178 LDVSRSMESFF--------------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +D+S SM D TK+ ++ L+ + L+ V V GL Sbjct: 197 VDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHVKEDVYMGL 256 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-----TNSTPGLKYAYN-------QIF 271 + ++ ++E+ WG ++ +Y+++ S T+STP +K AY + F Sbjct: 257 IGYTTRVEKNIEPSWGTEKVR---QYVTRDMDSLILKPTDSTPAMKQAYQILTSDKKRSF 313 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-I 330 RQ ++K I+F+TDGEN + K + ++ C++AK+ + I I Sbjct: 314 FTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTISINASP 373 Query: 331 RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 L+ C +SP Y V N S+ F +I + +V ++ Sbjct: 374 NGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 46/175 (26%), Positives = 78/175 (44%), Gaps = 5/175 (2%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 +TKID ++ +L ++ L+ D + + V++ +++++ + L WG S + L Sbjct: 230 VTKIDALKTAVTDLLAQL-LVADPDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYVNAL 288 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G T S K AY ++ H K +VFMTDGEN +D + Sbjct: 289 VATG-GTASAGAFKTAYQKVIAATENTAHAAKNGQVPSKYMVFMTDGENNYANDDTVTKQ 347 Query: 311 YCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYL-VENPHSMYDAFSHIGK 363 +C+ AK +Y++ R L+ CAS +S Y E + AF IG+ Sbjct: 348 WCDTAKANKVEIYSVAFMAPERGQALLKYCASSSSHYFEAEEVTDLVAAFKAIGE 402 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 81/404 (20%), Positives = 164/404 (40%), Gaps = 66/404 (16%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N G I++A + + + + +I+++H+ MK + S +D +++ + ++++ Sbjct: 16 FSKNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPK 75 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N+ N R+ ++ N +VN + + + I+ +N S S Sbjct: 76 INDL-------------NPQEERIRDVIKKNAYVNMVQNFPAEHAAYII---ENANISFS 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIV----MP--------ITSSVKVNSQTDARLDMMIVL 178 K K + ++ + MP I++ + + +VL Sbjct: 120 KDLTNKYEYKITMEAKHQLSGKNFILGFLMPNVITHISSISTGIIQKPSDKKAFSVEMVL 179 Query: 179 DVSRSMESFFDSSITKIDM----------------------AIKS-----INAMLEEVKL 211 D S SM DS D+ A+K+ +N + E V+ Sbjct: 180 DCSGSM---LDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQT 236 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 P ++ + GL+TF++ I + L + +++ I + G + P + AY + Sbjct: 237 FPQIS--ARIGLITFNHYIMQDSKLSNNFNVIKKTISRMKPKGGTDTFLP-MNAAYEYLN 293 Query: 272 DMQG-MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ + H +++ K+ I+ MTDGEN D +++ C+ A+K G I+Y+I + Sbjct: 294 NIPNETKAHNISDNVPLKRYIILMTDGENNHPSYDLKTINVCDNARKNGIIIYSIFLNYY 353 Query: 331 R---SHEFLRACASPNS-FYLVENPHSMYDAFSHIGKDIVTKRI 370 +E R CAS F+ N ++ D+F I I K + Sbjct: 354 EYTDGYELARKCASSEKHFFYANNTKALLDSFKSIAHAIQDKAV 397 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 79/402 (19%), Positives = 171/402 (42%), Gaps = 63/402 (15%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 +TAI + + FL + I+++HI +++ + S +D +++ I+++ + K Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD-- 58 Query: 82 ILCRIKNTWNMSFRNELRDNGFVND-IDDIVRSTSLDIVVVPQNE-GYSISAISRYKIPL 139 + + + L+ ++ + DI + ++I N Y + ++Y+IP Sbjct: 59 ---QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 Query: 140 K---FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----------- 185 + IP ++ + T ++ +S+ A + + +VLDVSRSME Sbjct: 116 ENLFLKGLIPSALT--NLSLRSTGIIERSSENLA-ISICMVLDVSRSMEDLYLQKHNDNN 172 Query: 186 ---------------SFFDSSITK-------------IDMAIKSINAMLEEV-KLIPDVN 216 SF+ + TK ID+ I+S ++ + K I + Sbjct: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 Query: 217 NV-VQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 N+ V+ G + ++ I + L ++ ++ ++ L+ + +TN+ P + +AY ++++ Sbjct: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN- 290 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ---QSLYYCNEAKKRGAIVYAIGIRV- 329 + H KK ++F+TDGEN Q +L C + G +Y++ + Sbjct: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 Query: 330 IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + LR C S F+ V + + ++F I I + + Sbjct: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 90/416 (21%), Positives = 171/416 (41%), Gaps = 67/416 (16%) Query: 8 IRNFFYNYK-------GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 +R F +N+K +IL ++ L I L +G++I V + K + + +++ Sbjct: 4 LRTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILS 63 Query: 61 AATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD---IVRSTS 115 A++I++ + G+N I+ + + L + V DI + I +++ Sbjct: 64 GASKIISRISYFGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNIISQNSK 123 Query: 116 LDIVVVP--------------QNEG--YSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + I P QN+ Y IS + Y +KF + + I+ + Sbjct: 124 VSITREPHPNVFHEFNNQSILQNKKTFYHISVETFYDYHIKFFDNLLNKKINSKIISFVP 183 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---------------TKIDMAIKSINA 204 + VK+++ + +V+D+S SM +S +K+D K++ Sbjct: 184 ALVKIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDALKKAVLL 243 Query: 205 MLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---- 256 L+ V K D + + GL ++ ++E+ WG +++ I + + V+ Sbjct: 244 FLDSVDRGSKTQKDTHYI---GLTGYTTRVEKNIEPSWGTGKVRKYI--VEEIDVNMLGQ 298 Query: 257 TNSTPGLKYAYNQIFDMQGMR--------QHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T+STP +K AY QI R + ++K ++F+TDGEN K D ++ Sbjct: 299 TDSTPAMKKAY-QILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKT 357 Query: 309 LYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIG 362 + C +AKK + I I L+ C S P +Y V + S+ F I Sbjct: 358 IKICEKAKKNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDIS 413 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 40/167 (23%), Positives = 72/167 (43%), Gaps = 14/167 (8%) Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 K+ P N +++G +++++ + L WG + ++ L G T+S+ A Sbjct: 235 AKMDPG-NEYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLDATG-GTDSSSAFAAAVE 292 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS----------TKEDQQSLYYCNEAKKR 318 ++ H +K IVFMTDGEN S K D + C AK Sbjct: 293 ELLLDGENEAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNN 352 Query: 319 GAIVYAIG-IRVIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 G ++ + + R + L+ACA SP+ + ++ ++ F IG+ Sbjct: 353 GIAIFTVAFMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQ 399 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 89/407 (21%), Positives = 154/407 (37%), Gaps = 69/407 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ GM I+V + K L D + + AT + N Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAADSAALATATALAN 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 G I + +F N ++ DI +TS+++ ++ Sbjct: 70 ------------GTIQTTEAEAFARNFVAGQMAN-YLQSGTDIKSTTSVNVQTTTSGKST 116 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM- 184 Y ++ Y + + +T ++H+ T+ + +SQT + M + LD S SM Sbjct: 117 SYQVTVSPAYVLTVNPLMQAVGFT-TQHLSTSGTT-IGGHSQTQGSISMFLALDKSGSMG 174 Query: 185 ------------ESF-FDSSI----------------------TKIDMAIKSINAMLEEV 209 ES+ +D ++ TKI+ + + ++ Sbjct: 175 EDTATVNEESPTESYTYDCNLHYNTKNNKWVYDKCTGSRTNYYTKIEALKMAAGNLFSQL 234 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V++G V++ L WG++ + + L G TNS+ + AY Sbjct: 235 NSADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNALQANG-GTNSSGAMNTAYTS 293 Query: 270 IF-------DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKK 317 + D++ T KK IVFMTDG+N + D + C++AK Sbjct: 294 LTAKNAAGNDVENSAHQQKTGQVP-KKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKS 352 Query: 318 RGAIVYAIG-IRVIRSHEFLRACASPNSFYL-VENPHSMYDAFSHIG 362 +G +Y I + L CAS +S Y E + AF IG Sbjct: 353 KGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQAIG 399 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 91/410 (22%), Positives = 147/410 (35%), Gaps = 76/410 (18%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ GM I+V + K L D + + AT + N Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAADSAALATATALAN 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG- 126 I+ + +F N + V S V V N Sbjct: 70 G----------------TIQTSQAEAFARNFVAGQMANYLQSGVDIKSATGVTVQTNTSG 113 Query: 127 ----YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ Y + + +T ++H+ T+ + +SQT + M + LD S Sbjct: 114 NSTSYQVTVSPSYDLTVNPLMQAVGFT-TQHLSTSGTT-IGGHSQTQGSISMYLALDKSG 171 Query: 183 SM-------------ESF---------------FDSSI-------TKIDMAIKSINAMLE 207 SM ES+ +D+ TKI+ + + Sbjct: 172 SMGEDTATVNEEDPTESYTYDCNGHYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFG 231 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ V++G V++ L WG S + + L + G TNS+ + AY Sbjct: 232 QLSSADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTYVNAL-QAGGGTNSSGAMSTAY 290 Query: 268 NQIFDMQGMRQHCNTEDANYK--------KIIVFMTDGENLSTKEDQQSL-----YYCNE 314 + + + EDA +K K IVFMTDG+N +S C+ Sbjct: 291 SSLTAKNAAGN--DAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDT 348 Query: 315 AKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYL-VENPHSMYDAFSHIG 362 AK +G +Y I + L CAS +S Y E + AF IG Sbjct: 349 AKSKGIEIYTIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIG 398 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 96/418 (22%), Positives = 158/418 (37%), Gaps = 87/418 (20%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ G+ I+V + K L + + A Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQ---EAADSAALATATA 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG- 126 GNG I + +F N N +D I +T++++ N G Sbjct: 67 LGNGT---------IQTSQAEAFARNFVAGQMANYLQNGVD-IKNATAVNVQT--SNSGK 114 Query: 127 ---YSISAISRYKI---PL-KFCTFIPWY--TNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Y ++ Y + PL + F + T+S + P S S + + M + Sbjct: 115 SASYQVTVTPSYDLTVNPLMQAVGFSTQHLSTSSTTVSGP---SQTPGSNSQGSVSMFLA 171 Query: 178 LDVSRSM-------------ESF---------------FDSSI-------TKID-MAIKS 201 LD S SM E+F +D+ TKI+ + + + Sbjct: 172 LDKSGSMGDPTETVNKDQPTETFTYDCNPHLNKKGKWVYDTCTGSRTNYYTKIEALKMAA 231 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEWGVSHLQRKIKYLSKFGVSTNS 259 N + PD V++G V++ I+++ L WG S + + L + G TNS Sbjct: 232 GNLFGQLTSADPDAQ-YVRTGAVSY--DIDQYTPSTLAWGTSGVSSYVNAL-QAGGGTNS 287 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYK--------KIIVFMTDGENLSTKEDQQSL-- 309 + + AY+ + + EDA +K K IVFMTDG+N + +S Sbjct: 288 SGAMGTAYSSLTAKNAAGN--DAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDT 345 Query: 310 ---YYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYL-VENPHSMYDAFSHIG 362 C+ AK +G +Y I + L+ CAS + Y E + AF IG Sbjct: 346 LTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIG 403 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 13/139 (9%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 S L I ++ G +TN+T GL + +N + G AN K+IVF+TDG+N Sbjct: 300 SALNSHIDAMASAG-NTNTTIGLAWGWN-MLTQGGPLSSAAAPAANLDKVIVFLTDGDNT 357 Query: 301 -------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACAS-PNSFYLVENP 351 S + ++ CN K G VY+ +RVI + L R CA+ P +Y V Sbjct: 358 RNRWSNNSNTINARTTLICNNIKAAGIKVYS--VRVIEGNATLIRNCATEPGMYYSVTTA 415 Query: 352 HSMYDAFSHIGKDIVTKRI 370 + F+ I + + RI Sbjct: 416 SELTSVFASIAQSLSNLRI 434 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 56/201 (27%), Positives = 93/201 (46%), Gaps = 29/201 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NK 229 +LD++ ++D S S+ S S+ T +K I V N V+ G+ +S N Sbjct: 1401 QLDLVFLIDGSASITS---SNFTSAKTFMKEI------VDSFTISENRVRIGVAQYSANP 1451 Query: 230 IEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +EFFL E+ S ++++I +S+ +T + GL++ Q FD N N Sbjct: 1452 KKEFFLNEYYSSSDMKKQIDSISQLKATTYTGKGLRFV-KQFFD------PANGGRKNVP 1504 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + ++ MTDG N S ED +L + G +++IGI + S E + SP + Y Sbjct: 1505 QYLIVMTDGMSNDSVNEDAAAL------RSSGVKIFSIGIGLRNSFELVMIAGSPKNVYE 1558 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 VE + A I + IV + Sbjct: 1559 VET----FQALDSIKRQIVAQ 1575 Score = 39.7 bits (91), Expect = 0.80, Method: Compositional matrix adjust. Identities = 39/168 (23%), Positives = 76/168 (45%), Gaps = 23/168 (13%) Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNS 259 + +M+++ ++ PD VQ GL+ FS++ +E F L +Q I+ + + T Sbjct: 1048 MESMVKQAEIGPDR---VQIGLIQFSSETKEEFPLNRYKRKDEIQSAIRGIQQLSQGTLM 1104 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKR 318 LKY +G R N K+ ++ +TDGE Q ++ +A + Sbjct: 1105 GEALKYTLPYFSASKGGR-------VNTKQYLIVITDGEA------QDAVGNPAKAIRDH 1151 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 G I+YAIG++ + + L Y ++ +D+ + + K+I+ Sbjct: 1152 GVIIYAIGVQQANNTQLLEIAGKQEQVYYEDS----FDSLAFLNKNIM 1195 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 84/408 (20%), Positives = 159/408 (38%), Gaps = 70/408 (17%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI-MN 67 R F + G ++TAI LP++ G +E++++ +K L + D + + AAT+ + Sbjct: 11 RRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAALAAATEARLK 70 Query: 68 EGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVN--DIDDIVRSTSLDIVVVPQ 123 EG + K++ I +++ T + L N VN DD T Sbjct: 71 EGALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARGKT--------- 121 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y+I Y++ L + + ++ + + T + A + M +VLD S S Sbjct: 122 ---YTIQTTINYQMQLNP---LLGFFGAKTLDLAATGTAVSTVNKGAPISMYLVLDRSGS 175 Query: 184 MESFFDS-------------------------------SITKIDMAIKSINAMLEEVKLI 212 M D+ T + A+ + A L + Sbjct: 176 MSFKTDTLNTKKTSCQNYTVDNWGSYPNLKNTSPCYVNKATSLKTAVGYLVATLNKADPT 235 Query: 213 PDVNN---VVQSGLVTFSNKIEEFFLLEWGVS----HLQRKIKYLSKFGVSTNSTPGLKY 265 N +V++G ++++ + WG S ++ ++I G S+ L Sbjct: 236 YTANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGGTDARSS--LNA 293 Query: 266 AYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKK 317 AYN + + ++H + + ++++ IV MTDGE DQ C+ AKK Sbjct: 294 AYNALKKANTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKK 353 Query: 318 RGAIVYAIGIRVI-RSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 G ++++ + L+ CAS +++Y EN + AF I + Sbjct: 354 DGIKIFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIAR 401 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 66/249 (26%), Positives = 104/249 (41%), Gaps = 32/249 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 F N T +IV P+ +I AI P K N+ + + +T + Sbjct: 532 FTNTATATGTDTDGNIVNSPE-ASVTIKAIKPLGAPEKHKRIKKNADNTYTVNVDVTGAA 590 Query: 163 KVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNN 217 ++ T + +D +VLDVS SM S+TK+ ++N L+E I + Sbjct: 591 SSSTITTTQPVDFTLVLDVSGSMRENM-GSVTKLQALQSAVNNFLDEAAKINKGAQSGSE 649 Query: 218 VVQSGLVTF----SNKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V+ GLV F + KI + + +W + + +K L T T GLK N++ Sbjct: 650 PVRVGLVKFAGNATKKIGNKTYQDKWNTYNYSQIVKKL------TADTDGLKNEVNKLTA 703 Query: 273 MQGMR-----QHCNT----EDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKR 318 R QH T KK+++F TDG+ S K ++ Y + K Sbjct: 704 GGATRADYGFQHAFTVMSEARTEAKKVVIFFTDGKPTSEKTFDGKVANDAVEYAKQLKDS 763 Query: 319 GAIVYAIGI 327 GAIVY+IG+ Sbjct: 764 GAIVYSIGV 772 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 92/413 (22%), Positives = 154/413 (37%), Gaps = 81/413 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ V G+ I+V + K L D + + AT + N Sbjct: 10 LRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAADSAALATATALAN 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 G I + +F N +V D TS+++ ++ Sbjct: 70 ------------GTIQTSQAEAFAQNFVAGQMAN-YVQSGVDFKSGTSVNVQTSTSGKST 116 Query: 126 GYSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ Y + PL ++H+ T+ V +SQT + M + LD S Sbjct: 117 SYQVTVSPSYDLTVNPLMQAVGF----KTQHLSTSGTT-VGGHSQTQGSISMFLALDKSG 171 Query: 183 SM-------------ESF-FDSSI----------------------TKIDMAIKSINAML 206 SM ES+ +D ++ TKI+ + + Sbjct: 172 SMGEATATVNADDPTESYTYDCNLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLF 231 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 ++ V++G V++ I ++ L WG + + + L G TNS+ + Sbjct: 232 GQLNSADPNAEYVRTGAVSY--DINQYTPSNLAWGTAGVTSYVNALQANG-GTNSSGAMS 288 Query: 265 YAYNQIFDMQGMRQHCNTEDANYK--------KIIVFMTDGENLSTKEDQQSL-----YY 311 AY+ + + ED+ +K K IVFMTDG+N +S Sbjct: 289 TAYSSLTAKNAAGN--DAEDSAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKAT 346 Query: 312 CNEAKKRGAIVYAIG-IRVIRSHEFLRACASPNSFYL-VENPHSMYDAFSHIG 362 C+ AK +G +Y I + L CAS +S Y E + AF IG Sbjct: 347 CDTAKSKGIEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIG 399 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 55/207 (26%), Positives = 96/207 (46%), Gaps = 27/207 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q D +L +MIVLD S SM + IT++ ++ K LI +N+ Q G+V Sbjct: 318 TQDDFQLQVMIVLDYSASMYEK-NGDITRMALSAKD---------LIESLNDTHQVGVVE 367 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD--MQGMRQHCNTE 283 F E +L+ ++ I+ +S+F S+ + ++ +D ++G++Q Sbjct: 368 FHRPDEPPAILQDFTTYKNAAIEAVSQF-----SSGKIYRDFSSCWDAVLKGLKQFPEKP 422 Query: 284 DANYKKIIVFMTDG---ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--A 338 D + K +VF++DG + ST + SL AK+R +Y +GI E L+ A Sbjct: 423 DPDIFKTLVFLSDGFDNSSFSTPGNVISL-----AKERDVHIYILGIGRGSEEEVLKNIA 477 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ EN + F KDI Sbjct: 478 LETGGTYVHAENIAVFRERFKQTIKDI 504 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 44/160 (27%), Positives = 80/160 (50%), Gaps = 21/160 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RLD++ V+D SRS+ K+ I +I L+ + + PDV V GL+ + + + Sbjct: 9 RLDLIFVIDSSRSVRPH---DFEKVKEFIVTI---LQFLDVAPDVTRV---GLIQYGSTV 59 Query: 231 E-EFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANY 287 + EF L +G S ++R +K + + G T + ++YA N F + +G R N Sbjct: 60 KNEFSLKTYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIAFSESEGAR----PLRENV 115 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +II+ +TDG + + +A+ G +++AIG+ Sbjct: 116 PRIIMIVTDGRPQDPVAEVAA-----KARNSGILIFAIGV 150 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust. Identities = 62/220 (28%), Positives = 94/220 (42%), Gaps = 37/220 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-N 217 S+ V LD+ VLD S ++ E F + + V L D+N + Sbjct: 520 SIDVQGCQAQPLDLAFVLDASTAVGQEKF------------NRLKNFVTMVSLQFDINRD 567 Query: 218 VVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V Q GLVT+S++ E F L+ G S LQ I S G S ++ L YN + +Q Sbjct: 568 VTQIGLVTYSSRPETVFGLDTHDSGSSLLQ-GIGRASYMGGSASTGSALLRVYNDVMTVQ 626 Query: 275 -GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R N K +V +TDG + + + + G +VY IGI I+ + Sbjct: 627 KGARPGVN-------KAVVVITDGRGA-----EDAAVPAQKLRDNGIMVYVIGIGNIQRN 674 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 LR S +L+ P Y++ H +D V +R+ D Sbjct: 675 SLLRLAGSEK--FLISVPS--YESLGHY-EDSVVQRVCED 709 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust. Identities = 53/234 (22%), Positives = 100/234 (42%), Gaps = 38/234 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW T + P SS LD++ V+D SRS+ F + + + + + Sbjct: 18 PWETQL-QLAGPRCSS--------GPLDLVFVIDSSRSVRPF------EFETMRQFLVGL 62 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 L + + P+ V G++ +S++++ F L ++R I+ + T + + Sbjct: 63 LRSLDVGPNATRV---GVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAI 119 Query: 264 KYAYNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 +YA N F + +G R +A+ ++ V +TDG + + +A+ RG + Sbjct: 120 QYAMNVAFSVAEGAR----PPEAHVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEI 170 Query: 323 YAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 YA+G++ LRA ASP +LVE+ +D G+ + D Sbjct: 171 YAVGVQRADVGS-LRAMASPPLNEHVFLVES----FDLIQEFGRQFQGRLCTLD 219 Score = 36.6 bits (83), Expect = 6.0, Method: Compositional matrix adjust. Identities = 42/212 (19%), Positives = 97/212 (45%), Gaps = 32/212 (15%) Query: 151 SRHIVMPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 S V P ++ + ++ +R +D+++++D S+S+ ++ + +N + Sbjct: 320 SYRCVCPEGRQLQADGKSCSRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQI 373 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNST 260 ++ + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 374 VDFLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTG 427 Query: 261 PGLKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L++ F + QG R N ++ + TDG + +D S++ AK+ G Sbjct: 428 LALRHMVEHSFSEAQGAR----PRALNVPRVGLVFTDGRS----QDNISVWA-ARAKEEG 478 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 ++YA+G+ E + P ++ +P Sbjct: 479 IVMYAVGVGKAVEEELREIASEPAELHVSYSP 510 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust. Identities = 53/234 (22%), Positives = 100/234 (42%), Gaps = 38/234 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW T + P SS LD++ V+D SRS+ F + + + + + Sbjct: 18 PWETQL-QLAGPRCSS--------GPLDLVFVIDSSRSVRPF------EFETMRQFLVGL 62 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 L + + P+ V G++ +S++++ F L ++R I+ + T + + Sbjct: 63 LRSLDVGPNATRV---GVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAI 119 Query: 264 KYAYNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 +YA N F + +G R +A+ ++ V +TDG + + +A+ RG + Sbjct: 120 QYAMNVAFSVAEGAR----PPEAHVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEI 170 Query: 323 YAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 YA+G++ LRA ASP +LVE+ +D G+ + D Sbjct: 171 YAVGVQRADVGS-LRAMASPPLNEHVFLVES----FDLIQEFGRQFQGRLCTLD 219 Score = 36.6 bits (83), Expect = 6.0, Method: Compositional matrix adjust. Identities = 42/212 (19%), Positives = 97/212 (45%), Gaps = 32/212 (15%) Query: 151 SRHIVMPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 S V P ++ + ++ +R +D+++++D S+S+ ++ + +N + Sbjct: 320 SYRCVCPEGRQLQADGKSCSRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQI 373 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNST 260 ++ + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 374 VDFLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTG 427 Query: 261 PGLKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L++ F + QG R N ++ + TDG + +D S++ AK+ G Sbjct: 428 LALRHMVEHSFSEAQGAR----PRALNVPRVGLVFTDGRS----QDNISVWA-ARAKEEG 478 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 ++YA+G+ E + P ++ +P Sbjct: 479 IVMYAVGVGKAVEEELREIASEPAELHVSYSP 510 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust. Identities = 83/408 (20%), Positives = 160/408 (39%), Gaps = 70/408 (17%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI-MN 67 R F + G ++TAI LP++ V G +E++++ +K + + D + + AAT+ + Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTADSAALAAATEARLR 70 Query: 68 EGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 EG ++ K++ I +++ + EL N T + + + Sbjct: 71 EGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKN----------SPTRVTTTENARGK 120 Query: 126 GYSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y++ +++I L FI ++ + + +T + K A + M + LD S SM Sbjct: 121 TYAVETTIKHQIQLNPMLGFI----GAKTLDLSVTGTAKSTINKGAPISMYLALDRSGSM 176 Query: 185 ESF--------------------------------FDSSITKIDMAIKSINAMLEEVKLI 212 SF + + + A+ + A L + Sbjct: 177 -SFKTDTVDTTKTSCQNYTSDNWSKYPNLAKTSPCYVNKAASLKTAVGFLVATLNKADPT 235 Query: 213 PDVNN---VVQSGLVTFSNKIEEFFLLEWGVS----HLQRKIKYLSKFGVSTNSTPGLKY 265 VN +V++G ++++ + WG S ++ ++I G S+ L Sbjct: 236 YTVNGGSELVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFPSGGTDARSS--LNA 293 Query: 266 AYNQI--FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKK 317 AYN + + R H ++++ IV MTDGE DQ C AKK Sbjct: 294 AYNALKKANPDEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKK 353 Query: 318 RGAIVYAIGIRVI-RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 G ++++ + L+ CA S +++Y EN + AF I + Sbjct: 354 DGIKIFSVAFMAPDKGKSLLQYCASSADNYYAPENMEQIVTAFGEIAR 401 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust. Identities = 50/208 (24%), Positives = 88/208 (42%), Gaps = 37/208 (17%) Query: 171 RLDMMIVLDVSRS--------MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 RLD++ V+D S S M+ F S + K D++ ++ VQ G Sbjct: 798 RLDIVFVIDGSGSIDPKEYDIMKEFMISLVKKSDVS-----------------HDRVQFG 840 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V +S + E FF L S + R I+ G +T + L+++ + G R+H Sbjct: 841 AVKYSAEPETFFYLNRYTTKSAIIRAIQNDKSIGETTYTAKALRHSEGLFSEEHGSRKH- 899 Query: 281 NTEDANYKKIIVFMTDGENLSTKE-DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++++ +TDG++ E D+ S + + G I+YAIGI R E L Sbjct: 900 ----RGVPQVLIVITDGDSHDAAELDEVS----KKLRANGIIIYAIGIERARPDELLTMA 951 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 S + ++ V + + I + I + Sbjct: 952 GSEDKYFYVNTFEGLKHLYPRISEKICS 979 Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust. Identities = 51/235 (21%), Positives = 101/235 (42%), Gaps = 46/235 (19%) Query: 144 FIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRS--------MESFFDSSITK 194 F+ + + +HI + + V + + + D++ ++D SRS M+ F + K Sbjct: 583 FVNDFASLKHIRNEVVRDICAVEACKEKKSDILFLVDSSRSISPDNFLKMKDFMSELVNK 642 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGV-SHLQRKIKYLSK 252 D+ + ++ G + FS++ EEF L ++ S + R I +S Sbjct: 643 SDIGLDRMHV-----------------GAIQFSSRNKEEFRLSQYATKSDIIRAIGRMSL 685 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G ST + L++ + ++G R + KKI++ +TDGE Q + Sbjct: 686 MGQSTLTGGALQFVLDYFRPIKGSRPYV-------KKILILITDGEA------QDDVKTP 732 Query: 313 NEA-KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 EA ++ G IVY++G+ + + P + VE+ +D H+ +I+ Sbjct: 733 AEALRQEGIIVYSVGVFNANRTQLVEISGKPEMVFYVED----FDILKHLENEIL 783 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 31/118 (26%), Positives = 52/118 (44%), Gaps = 11/118 (9%) Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 GV +Q + LS F + N+ L N ++G Q T+ ++++ +TDG Sbjct: 79 GVKLIQAPTQSLSTFNTAVNTVSPLNGGTNIFRGLRGCYQQLKTKPMT-DRVLILVTDGF 137 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL----VENPH 352 Q + YCN K +G ++ +GI + FL+ CA+ FY+ PH Sbjct: 138 G------GQPINYCNFIKSKGILLVTVGIGTSINQNFLKNCATSEEFYINVKDTGRPH 189 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 85/188 (45%), Gaps = 31/188 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSN 228 LD++ V+D SRS+ F ++ + M L+P +N N + G++ +S+ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLMG------------LLPGLNVGPNATRVGVIQYSS 80 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDA 285 +++ F L ++R I+ L T + ++YA N F + +G R + Sbjct: 81 QVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEE 136 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--- 342 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 137 RVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLD 190 Query: 343 NSFYLVEN 350 +LVE+ Sbjct: 191 EHVFLVES 198 Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust. Identities = 40/193 (20%), Positives = 88/193 (45%), Gaps = 27/193 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GLV Sbjct: 255 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GLV 305 Query: 225 TFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQ 278 FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 306 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGARP 362 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 N ++ + TDG + +D S++ AK+ G +VYA+G+ E Sbjct: 363 RA----LNVPRVGLVFTDGRS----QDDISVWAAR-AKEEGIVVYAVGVGKAVEAELREI 413 Query: 339 CASPNSFYLVENP 351 + P ++ P Sbjct: 414 ASEPAELHVSYAP 426 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust. Identities = 47/187 (25%), Positives = 87/187 (46%), Gaps = 29/187 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 35 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLDVGPNATRV---GVIQYSSQVQ 85 Query: 232 EFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDAN 286 F L G ++R I+ L T + ++YA N F + +G R +A Sbjct: 86 SVFPL--GAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEAR 139 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 +I V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 VPRIAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDE 193 Query: 344 SFYLVEN 350 +LVE+ Sbjct: 194 HVFLVES 200 Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust. Identities = 39/194 (20%), Positives = 90/194 (46%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 379 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 429 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 430 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 486 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G ++YA+G+ E + Sbjct: 487 ----PRALNVPRVGLVFTDGRS----QDDISVWA-RRAKEEGIVMYAVGVGKAVEEELRQ 537 Query: 338 ACASPNSFYLVENP 351 + P ++ +P Sbjct: 538 IASEPAELHVSYSP 551 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 85/188 (45%), Gaps = 31/188 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSN 228 LD++ V+D SRS+ F ++ + M L+P +N N + G++ +S+ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLMG------------LLPGLNVGPNATRVGVIQYSS 80 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDA 285 +++ F L ++R I+ L T + ++YA N F + +G R + Sbjct: 81 QVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEE 136 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--- 342 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 137 RVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLD 190 Query: 343 NSFYLVEN 350 +LVE+ Sbjct: 191 EHVFLVES 198 Score = 37.0 bits (84), Expect = 4.6, Method: Compositional matrix adjust. Identities = 40/194 (20%), Positives = 88/194 (45%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 295 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 345 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 346 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 402 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G +VYA+G+ E Sbjct: 403 ----PRALNVPRVGLVFTDGRS----QDDISVWAAR-AKEEGIVVYAVGVGKAVEAELRE 453 Query: 338 ACASPNSFYLVENP 351 + P ++ P Sbjct: 454 IASEPAELHVSYAP 467 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust. Identities = 41/184 (22%), Positives = 88/184 (47%), Gaps = 21/184 (11%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + RH+ M +++ +S + R D+ ++D SRS+ ++ + + + + I L Sbjct: 31 FPSRGRHVRMYPQTALLESSCENKRADLAFIIDSSRSVNTYHYAKVKEFILDI------L 84 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++ Sbjct: 85 QFLDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQ 141 Query: 265 YAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 YA N F + +G R N +II+ +TDG + + +A+ G +++ Sbjct: 142 YALNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAE-----VAAKARNTGILIF 192 Query: 324 AIGI 327 AIG+ Sbjct: 193 AIGV 196 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust. Identities = 42/142 (29%), Positives = 59/142 (41%), Gaps = 18/142 (12%) Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK------- 288 L WG S + + L + G TNS+ + AY+ + + EDA +K Sbjct: 179 LAWGTSGVSTYVNAL-QAGGGTNSSGAMSTAYSSLTAKNAAGN--DAEDAAHKLKTGQTP 235 Query: 289 -KIIVFMTDGENLSTKEDQQSL-----YYCNEAKKRGAIVYAIGIRVIRSHE-FLRACAS 341 K IVFMTDG+N +S C+ AK +G +Y I + L CAS Sbjct: 236 KKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYCAS 295 Query: 342 PNSFYL-VENPHSMYDAFSHIG 362 +S Y E + AF IG Sbjct: 296 DDSHYFQAEKMEDLLAAFKAIG 317 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust. Identities = 42/174 (24%), Positives = 81/174 (46%), Gaps = 22/174 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 35 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGPNATRV---GVIQYSSQVQ 85 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R +A Sbjct: 86 SVFPLSAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEARVP 141 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 142 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASP 189 Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust. Identities = 42/212 (19%), Positives = 95/212 (44%), Gaps = 32/212 (15%) Query: 151 SRHIVMP-----ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 S H + P + N + +D+++++D S+S+ ++ + +N + Sbjct: 279 SYHCLCPEGRQLLADGKSCNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQI 332 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNST 260 ++ + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 333 VDFLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTG 386 Query: 261 PGLKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L++ F + QG R N ++ + TDG + +D S++ AK+ G Sbjct: 387 LALRHMVEHSFSEAQGAR----PRALNVPRVGLVFTDGRS----QDDISVWAAR-AKEEG 437 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 ++YA+G+ E + + P ++ +P Sbjct: 438 IVMYAVGVGKAVEEELRKIASEPAELHVSYSP 469 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 85/188 (45%), Gaps = 31/188 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSN 228 LD++ V+D SRS+ F ++ + M L+P +N N + G++ +S+ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLMG------------LLPGLNVGPNATRVGVIQYSS 80 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDA 285 +++ F L ++R I+ L T + ++YA N F + +G R + Sbjct: 81 QVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEE 136 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--- 342 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 137 RVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLD 190 Query: 343 NSFYLVEN 350 +LVE+ Sbjct: 191 EHVFLVES 198 Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust. Identities = 40/194 (20%), Positives = 88/194 (45%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 336 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 386 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 387 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 443 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G +VYA+G+ E Sbjct: 444 ----PRALNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIVVYAVGVGKAVEAELRE 494 Query: 338 ACASPNSFYLVENP 351 + P ++ P Sbjct: 495 IASEPAELHVSYAP 508 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 59 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 112 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 113 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 169 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 170 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 220 Query: 326 GI 327 G+ Sbjct: 221 GV 222 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 46.2 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 53/212 (25%), Positives = 91/212 (42%), Gaps = 25/212 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITK-----IDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 +D++ V+D S SM+ D K + +N L+E+ IPD+ + +Q Sbjct: 1524 VDLVFVIDKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEI--IPDLCSQQYDIQIAG 1581 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS +L+W Q+ + L S N+ P A DM + Sbjct: 1582 YQFSGSSTR--VLDWSREE-QQVLNNLKISNTSYNTEPSQALA--DALDMLKTGSQAHQN 1636 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL---RACA 340 +N KK ++FMTDGE T+ ++ S Y ++ GA +Y IG+ S + + R+ A Sbjct: 1637 QSNVKKYLIFMTDGE--PTESEELSYYAISKNPVPGASIYTIGVSSDASTDLMEGIRSTA 1694 Query: 341 SPNSF-----YLVENPHSMYDAFSHIGKDIVT 367 N + + + DAF+ I +I++ Sbjct: 1695 EGNGMTAPATFKGTSAQLIKDAFTQIKDEIIS 1726 Score = 38.5 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 46/186 (24%), Positives = 78/186 (41%), Gaps = 22/186 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 +M V+D S SM+ F S + D + +N+ LE + D + +Q G FS+ E Sbjct: 1106 VMFVIDKSGSMDQSFGSGNS--DARREVVNSALELFFNQLSDGDYNIQFGGYKFSDSGER 1163 Query: 233 FFLLEWGVSHLQRKIKY--------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +WG KY LS +++ T G Y + ++ + Sbjct: 1164 VNFNDWGWQD-----KYWETDTSNALSHLKLTSWETDGSTYPSQTLRSAISALENVELGE 1218 Query: 285 ANYKKIIVFMTDGENLS-----TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 N K+ ++F+TDGE +KE+ ++ Y + G YAI + SH F+ + Sbjct: 1219 -NGKRYLIFLTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQVANSDSHGFMESM 1277 Query: 340 ASPNSF 345 S +F Sbjct: 1278 VSNANF 1283 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETVRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust. Identities = 26/71 (36%), Positives = 37/71 (52%), Gaps = 1/71 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 ST +D ++L C AK +G +V+ IG R E L+ACAS S Y + + DAF+ Sbjct: 510 STTKDARTLDICEAAKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDVDGLEISDAFA 569 Query: 360 HIGKDIVTKRI 370 I I R+ Sbjct: 570 SIASAIRQLRL 580 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust. Identities = 54/203 (26%), Positives = 96/203 (47%), Gaps = 39/203 (19%) Query: 162 VKVNS-----QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 VK++S Q A LD++ LD S S+ + F+ S AIK M + + + PD Sbjct: 35 VKISSASQLMQCSAALDVLFALDGSHSIGKGSFERSKY---FAIK----MCDALAIYPDR 87 Query: 216 NNVVQSGLVTFSN------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V+ G++ FS+ ++ FF E ++ KIK + G ST + LKY ++ Sbjct: 88 ---VRVGVLQFSSVPQLEFPLDSFFTRE----EVKEKIKKIVFKGGSTETGLALKYLLHK 140 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F G R +++ ++++ +TDG++ + N+ K+RG +V+A+G+R Sbjct: 141 GFP--GGR------NSSVPQLLIIVTDGKSQGNID-----LPANQLKERGVMVFAVGVRF 187 Query: 330 IRSHEFLRACASPNSFYLVENPH 352 R E + P Y++ H Sbjct: 188 PRWTELHTLASEPKDQYVLFAEH 210 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust. Identities = 85/400 (21%), Positives = 161/400 (40%), Gaps = 53/400 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G I+TAI +P++ G+ I+ S++ K L D + + AAT + + Sbjct: 10 LRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEATDSAALAAATALAS 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV--VPQNE 125 D K+ + N + + I I TS+DI V ++ Sbjct: 70 GAASTT------ADAEAIAKDFVSGQMANYVD----TDAISSIKAGTSVDIDVSATATSK 119 Query: 126 GYSISAISRYKIPLK-FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y ++ + Y I F + + + T + + TS+ S T L M +VLD S SM Sbjct: 120 SYKVTVATSYGIAATPFMSVLGYKTLN---IGASTSTSSGTSDTKTALSMELVLDQSGSM 176 Query: 185 ESFFDSS-----------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 + +TKID K+ +A+ + + +++V++G +++N Sbjct: 177 GEKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTADPDHSLVRTGAYSYNNGLIYN 236 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF------DMQGMRQ 278 +I+ + WG + + ++ G T++T ++ A I D++ + Sbjct: 237 SQKTQIKSMSGMAWGTATTATYVSGITASG-GTDATEPMRQATLSIAKASDGSDVE-TQA 294 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKKRGAIVYAIGIRVI-R 331 H + + I+ MTDGE DQ C+ K G ++ + + Sbjct: 295 HAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFTVAFMAPDK 354 Query: 332 SHEFLRACASPNS-FYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ CASP +Y E + +F+ I K+ TK + Sbjct: 355 GKQLLQYCASPGGNYYEAETMEKLVASFTSIAKE-ATKAV 393 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust. Identities = 50/222 (22%), Positives = 96/222 (43%), Gaps = 34/222 (15%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 + +P+ PW T + + P + LD++ V+D SRS+ F + Sbjct: 7 WPLPMLLFLLQPWETQLQ-LAGPRCRT--------GPLDLVFVIDSSRSVRPF------E 51 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + + + +L + + P+ V G++ +S++++ F L ++ I+ L Sbjct: 52 FETMRQFLVGLLRGLDVGPNATRV---GVIQYSSQVQSVFPLGAFARREDMEHAIRALVP 108 Query: 253 FGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 T + ++YA N F + +G R +A +I V +TDG + + Sbjct: 109 LAQGTMTGLAIQYAMNVAFSVAEGAR----PPEARVPRIAVIVTDG-----RPQDRVAEV 159 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVEN 350 +A+ RG +YA+G++ LRA ASP +LVE+ Sbjct: 160 AAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES 200 Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust. Identities = 39/194 (20%), Positives = 90/194 (46%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 338 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 388 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 389 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 445 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G ++YA+G+ E + Sbjct: 446 ----PRALNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIVMYAVGVGKAVEEELRQ 496 Query: 338 ACASPNSFYLVENP 351 + P ++ +P Sbjct: 497 IASEPAELHVSYSP 510 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRSEDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 45.4 bits (106), Expect = 0.013, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 45.4 bits (106), Expect = 0.014, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 89/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ ++ + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGLQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 45.1 bits (105), Expect = 0.018, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 11 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLDVGPNATRV---GVIQYSSQVQ 61 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++ I+ L T + ++YA N F + +G R +A Sbjct: 62 SVFPLGAFARREDMEHAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEARVP 117 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 +I V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 118 RIAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 171 Query: 346 YLVEN 350 +LVE+ Sbjct: 172 FLVES 176 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 39/194 (20%), Positives = 90/194 (46%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 355 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 405 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 406 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 462 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G ++YA+G+ E + Sbjct: 463 ----PRALNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIVMYAVGVGKAVEEELRQ 513 Query: 338 ACASPNSFYLVENP 351 + P ++ +P Sbjct: 514 IASEPAELHVSYSP 527 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust. Identities = 52/184 (28%), Positives = 83/184 (45%), Gaps = 30/184 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS-----INAMLEEVKLIPDVNNVVQSG 222 T+ LD+++VLDVS SM KID A+K+ I+A +E I D + + Sbjct: 119 TNQPLDIVLVLDVSGSMADNLSGGPKKID-ALKTAVNGFIDATADENAKITDQSQRNRIA 177 Query: 223 LVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 LV F+ + +F+ W VS+L + L+ G+S + YA+N Sbjct: 178 LVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFN 237 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAIVY 323 + + AN KK+++F TDGE N + D + N+AK G +Y Sbjct: 238 RAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIY 292 Query: 324 AIGI 327 +IG+ Sbjct: 293 SIGV 296 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust. Identities = 57/221 (25%), Positives = 96/221 (43%), Gaps = 38/221 (17%) Query: 148 YTNS-RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 Y+NS ++P+ +K N + + +D+M+VLD S SM D+ I K+++A + M+ Sbjct: 382 YSNSVLEKMLPVKMELK-NKEKEKNIDVMLVLDHSGSMADTEDAGIPKLEIAKSASAKMV 440 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK---IKYLS--KFGVSTNSTP 261 E ++ V G++ F + + + L RK I+ +S + G T P Sbjct: 441 EHLESSDGV------GVIAFDHN----YYWAYKFGKLVRKEDVIESISSIEVGGGTAIIP 490 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY--CNEAKKRG 319 L A + + K++V +TDG +QS Y +EAK+ Sbjct: 491 PLSEAVKTLKKSKAKN-----------KLVVLLTDGMG------EQSGYEIPADEAKRNN 533 Query: 320 AIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + IG+ + L A+ S FYLV NP + D F Sbjct: 534 IKITTIGVGKFVNASVLSWIAAYTSGRFYLVSNPSELVDVF 574 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 44.3 bits (103), Expect = 0.028, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 50/199 (25%), Positives = 86/199 (43%), Gaps = 30/199 (15%) Query: 173 DMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 D+ ++D S S+ F+D K I L+ ++ P N V+ G+V +++ Sbjct: 459 DIFFLIDQSGSIHPPDFYDMK--------KFILEFLQTFRVGP---NHVRIGVVKYADSP 507 Query: 230 IEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 EF L + V L++ I + + G T + L + Q FD + T K Sbjct: 508 TLEFDLHTYTDVKSLEKAITNIHQVGGGTETGKALDFMRPQ-FD-----RAVTTRGHKVK 561 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +V +TDG + +D ++ + +G +VYAIG++ E L P + V Sbjct: 562 EYLVVITDGNSTDKVKDP-----ADKLRAQGVVVYAIGVKDAVEKELLEISGEPQRTFYV 616 Query: 349 ENPHSMYDAFSHIGKDIVT 367 N +DA I DI+T Sbjct: 617 NN----FDALKPIKDDIIT 631 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 44.3 bits (103), Expect = 0.029, Method: Compositional matrix adjust. Identities = 45/182 (24%), Positives = 77/182 (42%), Gaps = 14/182 (7%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 I + +A+ + A L+ K P+ + V++G +++ + + WG +++ + LS Sbjct: 249 IESLKLAVADLAAQLK--KADPN-SEYVRTGADSYNASADTAQAMSWGTANVVTYVNALS 305 Query: 252 KFGVSTNSTPGLKYAYN--QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 G T++ L AY+ Q + + H + + + IVFMTDGE S Sbjct: 306 ATG-GTDARGALSAAYSALQTSNKTEITAHNVSSVSKIGRYIVFMTDGEMTGNSSSWSSS 364 Query: 310 Y------YCNEAKKRGAIVYAIG-IRVIRSHEFLRACASPNS-FYLVENPHSMYDAFSHI 361 C K G +Y + + L ACAS S +Y + S+ AF I Sbjct: 365 IDSAVRSQCTSIKADGIQIYTVAFMAPANGKSLLSACASDASHYYEATDAASLVAAFGEI 424 Query: 362 GK 363 GK Sbjct: 425 GK 426 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 43.9 bits (102), Expect = 0.037, Method: Compositional matrix adjust. Identities = 45/187 (24%), Positives = 86/187 (45%), Gaps = 29/187 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L + + P+ V G++ +S++++ Sbjct: 173 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGPNATRV---GVIQYSSQVQ 223 Query: 232 EFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDAN 286 F L G ++ I+ L T + ++YA N F + +G R +A Sbjct: 224 SVFPL--GAFSRREDMEGAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEAR 277 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 278 VPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDE 331 Query: 344 SFYLVEN 350 +LVE+ Sbjct: 332 HVFLVES 338 Score = 36.2 bits (82), Expect = 7.9, Method: Compositional matrix adjust. Identities = 39/194 (20%), Positives = 89/194 (45%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 476 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 526 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 527 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 583 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G ++YA+G+ E Sbjct: 584 ----PRALNVPRVGLVFTDGRS----QDDVSVWA-ARAKEEGIVMYAVGVGKAVEEELRE 634 Query: 338 ACASPNSFYLVENP 351 + P ++ +P Sbjct: 635 IASEPAELHVSYSP 648 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 43.9 bits (102), Expect = 0.039, Method: Compositional matrix adjust. Identities = 41/182 (22%), Positives = 88/182 (48%), Gaps = 21/182 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + RH+ M +++ +S + R D++ ++D SRS+ + + + + + I L+ Sbjct: 33 SRGRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTHDYAKVKEFILDI------LQF 86 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + PDV V GL+ + + ++ F L+ S ++R +K + T + ++YA Sbjct: 87 LDIGPDVTRV---GLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYA 143 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N F + +G R N +II+ +TDG + + + +A+ G +++AI Sbjct: 144 LNIAFSEAEGAR----PLRENVPRIIMIVTDGRPQDSVAEVAA-----KARNTGILIFAI 194 Query: 326 GI 327 G+ Sbjct: 195 GV 196 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 43.5 bits (101), Expect = 0.054, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 57/142 (40%), Gaps = 18/142 (12%) Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK------- 288 L WG++ + + L G TNS+ + AY + + EDA +K Sbjct: 119 LAWGITAVTSYVNALESGG-GTNSSGAVNTAYTSLTAKNAAGN--DAEDAAHKLKTGQLP 175 Query: 289 -KIIVFMTDGENLSTKEDQQSL-----YYCNEAKKRGAIVYAIGIRVIRSHE-FLRACAS 341 K IVFMTDG+N +S C+ AK +G Y I + L CAS Sbjct: 176 KKYIVFMTDGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCAS 235 Query: 342 PNSFYL-VENPHSMYDAFSHIG 362 ++ Y E + AF IG Sbjct: 236 DDAHYFQAEKMEDLLAAFKAIG 257 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 43.5 bits (101), Expect = 0.054, Method: Compositional matrix adjust. Identities = 48/202 (23%), Positives = 90/202 (44%), Gaps = 32/202 (15%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 +S+Y+ +KFC + T ++ K++ + ++++I LD+S SM Sbjct: 50 VSKYRPTIKFCLLLSAITILIFMIARPQVGSKISHEKREGIEVLIALDISNSM------- 102 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVV------QSGLVTFSNKIEEFFLLEWGVSHLQR 245 MA I + LE+ KL+ + ++V + GLV F+ + F L ++ Sbjct: 103 -----MAQDVIPSRLEKSKLL--IEDLVDHFTNDKVGLVVFAG--DAFVQLPITNDYVSA 153 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 K+ +L S +T G A + Q C T+ + + I+ +TDGE+ + Sbjct: 154 KM-FLQNINPSLITTQGTDLAR-----AISLSQSCFTQREHIGRAIIVITDGED----HE 203 Query: 306 QQSLYYCNEAKKRGAIVYAIGI 327 +L EA K+G V+ +G+ Sbjct: 204 GGALEAAREAYKKGINVFILGV 225 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 43.1 bits (100), Expect = 0.064, Method: Compositional matrix adjust. Identities = 43/166 (25%), Positives = 82/166 (49%), Gaps = 21/166 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 NS + LD++ ++D SRS+ S +D K+ I +I L+ + + PD +V GL+ Sbjct: 7 NSCNNKHLDLVFIIDSSRSV-SHYD--FEKVKEFILTI---LQFLDISPDATHV---GLI 57 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCN 281 + + +++ F L+ ++R +K + G T + L+YA N F + +G R Sbjct: 58 QYGSTVKQEFSLKTFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIAFSETEGAR---- 113 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N +II+ +TDG + + +A+ G +++AIG+ Sbjct: 114 PLRQNVPRIIMIVTDGRPQDPVAEIAA-----KARNSGILIFAIGV 154 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 43.1 bits (100), Expect = 0.066, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + ++ + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 43.1 bits (100), Expect = 0.071, Method: Compositional matrix adjust. Identities = 45/208 (21%), Positives = 81/208 (38%), Gaps = 43/208 (20%) Query: 169 DARLDMMIVLDVSRS--------MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 D + D+M ++D SRS M++F + + K + + VQ Sbjct: 624 DVKADVMFLVDSSRSIGLENFGKMKTFMKNLVNKSQIG-----------------EDQVQ 666 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V FS+ +E F L W + I +S T + LK+ + +G R Sbjct: 667 VGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRETLTGSALKFVSDYFHPSKGAR- 725 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +K ++ +TDGE+ +D ++ G I+Y++G+ + + Sbjct: 726 ------PGVRKFLILITDGESQDPVKDPAMAL-----RQDGVIIYSVGVYGANETQLVEI 774 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIV 366 P + VE +D HI D+V Sbjct: 775 SGKPEMIFYVET----FDILKHIEDDLV 798 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 43.1 bits (100), Expect = 0.072, Method: Compositional matrix adjust. Identities = 41/184 (22%), Positives = 82/184 (44%), Gaps = 23/184 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + +L +K+ N + G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLKV---GANATRVGVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L ++R I+ L T + ++Y N F + + + + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYVMNVAFSVA---EGARPPEERVPR 140 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFY 346 + V +TDG + + +A+ RG +YA+G++ LRA ASP + Sbjct: 141 VAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDEHVF 194 Query: 347 LVEN 350 LVE+ Sbjct: 195 LVES 198 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 43.1 bits (100), Expect = 0.074, Method: Compositional matrix adjust. Identities = 35/160 (21%), Positives = 74/160 (46%), Gaps = 21/160 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + + +L + + P+ V G++ +S++++ Sbjct: 35 LDLVFVIDSSRSVRPFEFETVRQF------LVGLLHGLNVGPNATRV---GVIQYSSQVQ 85 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 86 SVFPLGAFSRPEDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 141 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ V +TDG + + +A+ RG +YA+G++ Sbjct: 142 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQ 176 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 40/194 (20%), Positives = 89/194 (45%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + PD V GL Sbjct: 338 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPDGTRV---GL 388 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 389 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 445 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G ++YA+G+ E Sbjct: 446 ----PRALNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIVMYAVGVGKAVEEELRE 496 Query: 338 ACASPNSFYLVENP 351 + P ++ +P Sbjct: 497 IASEPAELHVSYSP 510 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 43.1 bits (100), Expect = 0.077, Method: Compositional matrix adjust. Identities = 52/207 (25%), Positives = 83/207 (40%), Gaps = 21/207 (10%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V D + + + I + MA ++ A L P+ V++G V++ L Sbjct: 183 VWDTCTGSRANYYTKIEALKMAAGNLFAQLSGAD--PNAQ-YVRTGAVSYDIVQYAPSSL 239 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-------- 288 WG + + L + G TNS+ + AY + + ED+ +K Sbjct: 240 AWGAIGVSSYVNAL-QAGGGTNSSGAMSTAYLSLTAKNAAGN--DAEDSAHKLKSGQIPQ 296 Query: 289 KIIVFMTDGENLSTKEDQQSL-----YYCNEAKKRGAIVYAIG-IRVIRSHEFLRACASP 342 K IVFMTDG+N + +S C+ AK +G +Y I + L+ CAS Sbjct: 297 KYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPGGQALLQYCASD 356 Query: 343 NSFYL-VENPHSMYDAFSHIGKDIVTK 368 S Y E ++ AF IG T+ Sbjct: 357 ASHYFQAEKMEDLFAAFKAIGAKASTQ 383 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 42.7 bits (99), Expect = 0.080, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 84/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ G +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARASGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 42.7 bits (99), Expect = 0.100, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 84/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M +L + + P+ V G++ +S++++ Sbjct: 33 LDLVFVIDSSRSVRPFEFETMRQFLM------GLLRGLNVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I+ L T + ++YA N F + +G R + Sbjct: 84 SVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ G +YA+G++ LRA ASP Sbjct: 140 RVAVIVTDG-----RPQDRVAEVAAQARASGIEIYAVGVQRADVGS-LRAMASPPLDEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 48/199 (24%), Positives = 88/199 (44%), Gaps = 23/199 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ S+ IN + VK N VQ G + +SN+ + Sbjct: 811 LDVVFVLDHSSSIQPRDQESM---------INLTMHLVKKSDVGPNRVQFGALRYSNEPD 861 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L S + ++ LS G T + L+ A N +F +H + N K+++ Sbjct: 862 IIFYLNSNRSAIMEYLRSLSAKGGDTYTAKALERA-NILF----TEEHGSRIKQNVKQML 916 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +TDG++ ++ ++A K +G I+YA+G+ E + + V Sbjct: 917 IIITDGKS------HDHIHLSDKASKLRAKGIIIYAVGVGEANQEELETMAGNKHYTIHV 970 Query: 349 ENPHSMYDAFSHIGKDIVT 367 N S+ D + + + + T Sbjct: 971 SNFDSLKDVYQPLQESMCT 989 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 42/220 (19%), Positives = 104/220 (47%), Gaps = 34/220 (15%) Query: 153 HIVMPITSSVKVNSQT-----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H + P ++ + ++ + +D+++++D S+S+ ++ + +N +++ Sbjct: 376 HCLCPEGRRLQADGKSCDRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVD 429 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPG 262 + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 430 FLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLA 483 Query: 263 LKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++ F ++QG R D N ++ + TDG + +D S++ AK+ G + Sbjct: 484 LRHMVEHSFSEVQGAR----PRDLNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIV 534 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +YA+G+ E + P+ ++ +P ++ +H+ Sbjct: 535 MYAVGVGKAVEEELREIASEPSELHVSYSPD--FNTMTHL 572 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 43/186 (23%), Positives = 86/186 (46%), Gaps = 27/186 (14%) Query: 172 LDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ ++D SRS+ F F++ + + S++ L N + G++ +S+++ Sbjct: 48 LDLVFMIDSSRSVRPFEFETMRQFLVGLLHSLDVGL----------NATRVGVIQYSSQV 97 Query: 231 EEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANY 287 + F L ++R I+ + T + ++YA N F + +G R + Sbjct: 98 QSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAEGAR----PSEERV 153 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 +++V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 154 PRVLVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDQH 207 Query: 345 FYLVEN 350 +LVE+ Sbjct: 208 VFLVES 213 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 119 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 178 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 179 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 238 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 239 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 293 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 294 IYSIGV 299 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 81/172 (47%), Gaps = 21/172 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + S+ N+ + RLD++ ++D SRS+ + + + + I L+ + + PD V Sbjct: 18 SQSLLENTCNNKRLDLVFIIDSSRSVRPYDFEKVKEFILTI------LQFLDVSPDATRV 71 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQG 275 GL+ + + ++ F L+ ++R ++ + T + ++YA N F + +G Sbjct: 72 ---GLIQYGSTVKHEFSLKTFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFSESEG 128 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R + N +II+ +TDG + + +A+ G +++AIG+ Sbjct: 129 AR----PLNQNVPRIIMIVTDGRPQDPVGEIAA-----KARNSGILIFAIGV 171 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 119 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 178 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 179 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 238 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 239 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 293 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 294 IYSIGV 299 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 97 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 216 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 217 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 271 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 272 IYSIGV 277 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 42/220 (19%), Positives = 104/220 (47%), Gaps = 34/220 (15%) Query: 153 HIVMPITSSVKVNSQT-----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H + P ++ + ++ + +D+++++D S+S+ ++ + +N +++ Sbjct: 363 HCLCPEGRRLQADGKSCDRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVD 416 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPG 262 + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 417 FLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLA 470 Query: 263 LKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++ F ++QG R D N ++ + TDG + +D S++ AK+ G + Sbjct: 471 LRHMVEHSFSEVQGAR----PRDLNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIV 521 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +YA+G+ E + P+ ++ +P ++ +H+ Sbjct: 522 MYAVGVGKAVEEELREIASEPSELHVSYSPD--FNTMTHL 559 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 43/186 (23%), Positives = 86/186 (46%), Gaps = 27/186 (14%) Query: 172 LDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ ++D SRS+ F F++ + + S++ L N + G++ +S+++ Sbjct: 35 LDLVFMIDSSRSVRPFEFETMRQFLVGLLHSLDVGL----------NATRVGVIQYSSQV 84 Query: 231 EEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANY 287 + F L ++R I+ + T + ++YA N F + +G R + Sbjct: 85 QSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAEGAR----PSEERV 140 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 +++V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 141 PRVLVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDQH 194 Query: 345 FYLVEN 350 +LVE+ Sbjct: 195 VFLVES 200 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 97 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 216 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 217 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 271 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 272 IYSIGV 277 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 50/212 (23%), Positives = 89/212 (41%), Gaps = 33/212 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q L M++ +D S S+ + F T+ D A + + ML E + L++F Sbjct: 72 QATTPLSMVMAIDTSESVITQFQ---TERDAAKRFVKQMLREQD---------EMDLISF 119 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S+ ++E + I L K G +T+ Y+ I+ + R DA Sbjct: 120 SDTVDEIVPFTNDAGRMNAGIGNLHK-GDATS-------LYDAIY-LASQRLTEAKRDAT 170 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI---------GIRVIRSHEFLR 337 +KI+V +TDG N T + + A++ GA +Y I G H ++ Sbjct: 171 RRKILVIVTDGGN--TTKGMRYQQAVEAAERAGAAIYPIIMVPIEADAGRNTGGEHALIQ 228 Query: 338 -ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + ++ V + H + AF+H+ D+ T+ Sbjct: 229 MAQDTGGKYFYVLDKHDLDKAFAHLSDDLRTQ 260 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 42.0 bits (97), Expect = 0.14, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 97 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 216 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 217 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 271 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 272 IYSIGV 277 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 36/135 (26%), Positives = 54/135 (40%), Gaps = 17/135 (12%) Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 L+ ++ LS G TN L+ +Y + ++G + K+IIV M+DGE Sbjct: 440 LKAAVRDLSAGG-GTNIEDALRVSYER---LEGS--------GSDKRIIVLMSDGEANEG 487 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGI-----RVIRSHEFLRACASPNSFYLVENPHSMYDA 357 + Y NE K G +Y +G + ASP Y V++ + Sbjct: 488 LVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIASPGCHYEVDDASQLRYF 547 Query: 358 FSHIGKDIVTKRIWY 372 F IG DI R Y Sbjct: 548 FGDIGDDINGTRFIY 562 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 83/186 (44%), Gaps = 31/186 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS---ITKID---MAI-KSINAMLEEVKLIPDVNNVVQ 220 T+ LD+++VLDVS SM S TKID A+ K INA E I D + + Sbjct: 97 TNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLKTAVNKFINATAAENAKITDQSQRNR 156 Query: 221 SGLVTFSNKIE-----EFFLLEWG-------VSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 LV F+ + +F+ W VS+L + L+ G+S + YA Sbjct: 157 IALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYA 216 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ-SLYYCNEAKK---RGAI 321 +N+ + AN KK+++F TDGE N + D + N+AK G Sbjct: 217 FNRAQAALTYQPR-----ANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTT 271 Query: 322 VYAIGI 327 +Y+IG+ Sbjct: 272 IYSIGV 277 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 38/182 (20%), Positives = 71/182 (39%), Gaps = 38/182 (20%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F ++E L S ++ +K L G STN G+ + + + D + Q DA Sbjct: 359 FGCEMEPLVPLTTDFSKIRTTVKALEANG-STNMLEGVMWGWRVLSDREPFAQGAPKSDA 417 Query: 286 NYKKIIVFMTDGEN----------------------------------LSTKEDQQSLYY 311 + +KI++F+TDG+N + D+++ Sbjct: 418 SVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKAA 477 Query: 312 CNEAKKRGAIVYAIGIRV--IRSHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 C AK+ G +Y I + + + + L CA+ ++ Y + + F I K +V Sbjct: 478 CENAKEDGVTIYTIRLEEADVGTGKMLEECATSSAHYFDAPSRQQLTPIFDAIKKGVVKL 537 Query: 369 RI 370 R+ Sbjct: 538 RL 539 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 42.0 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 42/132 (31%), Positives = 65/132 (49%), Gaps = 20/132 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q + LD+++V+D S SME ++ A K+ N ++ KL+P NN V G+V+F Sbjct: 82 QNNKILDIVLVMDTSGSMEG------KSLENAKKAANNFVD--KLLPQ-NNNVNIGIVSF 132 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + K E L V+ L+ IK L G T + GL+ A + N A Sbjct: 133 AEKGEIKSGLTRNVTTLKNAIKGLKADG-GTYTQQGLEKAATVL----------NGAPAE 181 Query: 287 YKKIIVFMTDGE 298 +KK++V + DGE Sbjct: 182 HKKVMVVIGDGE 193 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 42.0 bits (97), Expect = 0.15, Method: Compositional matrix adjust. Identities = 33/138 (23%), Positives = 56/138 (40%), Gaps = 10/138 (7%) Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV----- 292 WG R + F +S PG + + IF G + + +Y + Sbjct: 489 WG----GRLLSQYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRW 544 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 T L+ +++ L+ CNE KK GA V+ + +++ ++ CA ++ N Sbjct: 545 TQTSSLTLAQTVEERFLFACNEVKKLGATVWVVAFGTA-ANDKMKTCAGSGRYFEAANAS 603 Query: 353 SMYDAFSHIGKDIVTKRI 370 + DAFS I K RI Sbjct: 604 QLNDAFSTIAKSTGDLRI 621 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust. Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 1/71 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASPNSFYLVENPHSMYDAFS 359 ++ +DQ++ + C+ AK G IVY +G S L+ CAS +S Y + + DAF+ Sbjct: 486 ASAKDQRTDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDADGLEISDAFT 545 Query: 360 HIGKDIVTKRI 370 I I R+ Sbjct: 546 SIASSIRKLRL 556 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust. Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 6/85 (7%) Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSF 345 Y ++ ++ D STK D ++L C+ AK++G +V+ IG R + L+ CAS S Sbjct: 502 YNRLYSYVGD----STK-DSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASH 556 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 Y + + DAF+ I I R+ Sbjct: 557 YYDVDGLEISDAFASIASAIRQLRL 581 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust. Identities = 58/221 (26%), Positives = 97/221 (43%), Gaps = 38/221 (17%) Query: 148 YTNS-RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 Y+NS ++P+ +K N + + +D+++VLD S SM D+ I+K+++A + M+ Sbjct: 382 YSNSVLEKMLPVKMEIK-NKEKEKNIDVVLVLDHSGSMADTEDAGISKLEIAKSASAKMI 440 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK---IKYLS--KFGVSTNSTP 261 E ++ V G++ F + + + S L RK I+ +S + G T P Sbjct: 441 EHLESSDGV------GVIAFDHN----YYWAYEFSKLVRKKDVIESISSIEVGGGTAIIP 490 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY--CNEAKKRG 319 L A + + + IV +TDG +Q Y NEAK+ Sbjct: 491 PLSEAVKTLKKSKAKSKL-----------IVLLTDGMG------EQGGYEIPANEAKRNN 533 Query: 320 AIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + IG+ + L AS S FYLV NP+ + D F Sbjct: 534 IKITTIGVGKFVNLPVLSWIASFTSGRFYLVSNPYELVDVF 574 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 42.0 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 57/213 (26%), Positives = 91/213 (42%), Gaps = 27/213 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKI-DMAIKS----INAMLEEVKLIPDVNNV---VQSGL 223 +D++ V+D S SM+ D + K D +S +N LEE+ IPD+ + +Q Sbjct: 1475 VDLVFVIDKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEI--IPDLCSQQYDIQIAG 1532 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS +L+W Q LS ++ T+S+ A DM + Sbjct: 1533 YQFSGSSTR--VLDWSREEQQ----VLSGLKIARTSSSTEPSQALADALDMLKTGSPAHR 1586 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRAC 339 +N KK ++FMTDGE T+ + S GA +Y IG+ S E +R+ Sbjct: 1587 NQSNVKKYLIFMTDGE--PTEPEDWSYNAVRNHAVPGASIYTIGVSSDASTNLMEGIRST 1644 Query: 340 ASPNSFYLV-----ENPHSMYDAFSHIGKDIVT 367 A N Y + + DAF+ I +I++ Sbjct: 1645 ALSNGMYAPATFKGTSAQLIRDAFTQIKDEIIS 1677 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust. Identities = 56/221 (25%), Positives = 95/221 (42%), Gaps = 38/221 (17%) Query: 148 YTNS-RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 Y+NS ++P+ +K N + + +D+++VLD S SM D+ I K+++A + M+ Sbjct: 382 YSNSVLEKMLPVKMELK-NKEKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMV 440 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK---IKYLS--KFGVSTNSTP 261 E ++ V G++ F + + + L RK I+ +S + G T P Sbjct: 441 EHLESSDGV------GVIAFDHN----YYWAYKFGKLVRKEDVIESISSIEVGGGTAIIP 490 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY--CNEAKKRG 319 L A + + K++V +TDG +QS Y +EAK+ Sbjct: 491 PLSEAVKTLKKSKAKN-----------KLVVLLTDGMG------EQSGYEIPADEAKRNN 533 Query: 320 AIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + IG+ + L A S FYLV NP + D F Sbjct: 534 IKITTIGVGKFVNASVLSWIADYTSGRFYLVSNPSELVDVF 574 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 42.0 bits (97), Expect = 0.18, Method: Compositional matrix adjust. Identities = 49/202 (24%), Positives = 89/202 (44%), Gaps = 31/202 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + D+M ++D S S+ E+F +K+ M +K+ ++ + ++ D VQ G+V F Sbjct: 617 DMKADIMFLVDSSGSIGPENF-----SKMKMFMKN---LVSKSQIGADR---VQIGVVQF 665 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ +E F L +S + I ++ G +T + L + +G R Sbjct: 666 SHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR------- 718 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 N +K ++ +TDGE D +K G I+Y++G+ + P Sbjct: 719 PNVRKFLILITDGEAQDIVRDPALAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEM 773 Query: 345 FYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 774 VFYVEN----FDILQHIEDDLV 791 Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNATAKALRDKGILVLAVGIAGANSWELLAMAGSGDK 964 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 965 YYFVETFGGLKGIFSDVSASV 985 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 39/141 (27%), Positives = 62/141 (43%), Gaps = 14/141 (9%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 +S + IK L G TN T G+++ + Q +K ++ +TDG+N Sbjct: 274 ISSARNFIKTLQPGGY-TNVTMGVQWGMEVLSPNQPFSDATEFGSTKARKFMIVVTDGDN 332 Query: 300 LSTKE---------DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVE 349 TK D+++ C AK +G VY + I + + LR CAS P FY + Sbjct: 333 --TKSFTSWSASVIDKRTALACENAKAKGITVYTVKI-IQGNSNMLRKCASAPEYFYDLT 389 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + + + A S I K I R+ Sbjct: 390 SANQLNAAMSGIFKSINKTRL 410 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 49/202 (24%), Positives = 89/202 (44%), Gaps = 31/202 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + D+M ++D S S+ E+F +K+ M +K+ ++ + ++ D VQ G+V F Sbjct: 617 DMKADIMFLVDSSGSIGPENF-----SKMKMFMKN---LVSKSQIGADR---VQIGVVQF 665 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ +E F L +S + I ++ G +T + L + +G R Sbjct: 666 SHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR------- 718 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 N +K ++ +TDGE D +K G I+Y++G+ + P Sbjct: 719 PNVRKFLILITDGEAQDIVRDPAIAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEM 773 Query: 345 FYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 774 VFYVEN----FDILQHIEDDLV 791 Score = 40.0 bits (92), Expect = 0.59, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 965 YYFVETFGGLKGIFSDVSASV 985 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 49/202 (24%), Positives = 89/202 (44%), Gaps = 31/202 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + D+M ++D S S+ E+F +K+ M +K+ ++ + ++ D VQ G+V F Sbjct: 617 DMKADIMFLVDSSGSIGPENF-----SKMKMFMKN---LVSKSQIGADR---VQIGVVQF 665 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ +E F L +S + I ++ G +T + L + +G R Sbjct: 666 SHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR------- 718 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 N +K ++ +TDGE D +K G I+Y++G+ + P Sbjct: 719 PNVRKFLILITDGEAQDIVRDPAIAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEM 773 Query: 345 FYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 774 VFYVEN----FDILQHIEDDLV 791 Score = 39.7 bits (91), Expect = 0.68, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 909 GVPQVLIVITDGES----NDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 965 YYFVETFGGLKGIFSDVSASV 985 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust. Identities = 49/202 (24%), Positives = 88/202 (43%), Gaps = 31/202 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + D+M ++D S S+ E+F +K+ M +K+ ++ + ++ D VQ G+V F Sbjct: 597 DMKADIMFLVDSSGSIGPENF-----SKMKMFMKN---LVSKSQIGADR---VQIGVVQF 645 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ +E F L S + I ++ G +T + L + +G R Sbjct: 646 SHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR------- 698 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 N +K ++ +TDGE D +K G I+Y++G+ + P Sbjct: 699 PNVRKFLILITDGEAQDIVRDPAIAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEM 753 Query: 345 FYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 754 VFYVEN----FDILQHIEDDLV 771 Score = 40.0 bits (92), Expect = 0.64, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 787 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 837 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 838 VLFYLD----ELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 888 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 889 GVPQVLIVITDGES----HDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDK 944 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 945 YYFVETFGGLKGIFSDVSASV 965 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust. Identities = 28/104 (26%), Positives = 44/104 (42%), Gaps = 21/104 (20%) Query: 288 KKIIVFMTDGENL-------------------STKEDQQSLYYCNEAKK-RGAIVYAIGI 327 KK++V MTDG N ++K D + CN AKK ++Y I Sbjct: 351 KKVLVLMTDGVNHIAGRNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLYTITY 410 Query: 328 RVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + CAS P+ Y P + AF+ +G D+ T ++ Sbjct: 411 GSDTDEQQMSDCASDPSKHYHAALPQDLAKAFTQVGTDLTTMKL 454 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust. Identities = 41/210 (19%), Positives = 98/210 (46%), Gaps = 32/210 (15%) Query: 153 HIVMPITSSVKVNSQT-----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H + P ++ + ++ + +D+++++D S+S+ ++ + +N +++ Sbjct: 363 HCLCPEGRRLQADGKSCDRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVD 416 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPG 262 + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 417 FLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLA 470 Query: 263 LKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++ F + QG R D N ++ + TDG + +D S++ AK+ G + Sbjct: 471 LRHMVEHSFSEAQGAR----PRDLNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIV 521 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +YA+G+ E + P+ ++ +P Sbjct: 522 MYAVGVGKAVEEELREIASEPSELHVSYSP 551 Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust. Identities = 43/188 (22%), Positives = 87/188 (46%), Gaps = 31/188 (16%) Query: 172 LDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ ++D SRS+ F F++ + ++S++ L N + G++ +S+++ Sbjct: 35 LDLVFMIDSSRSVRPFEFETMRQFLVGLLRSLDVGL----------NATRVGVIQYSSQV 84 Query: 231 EEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDA 285 + F L G ++R I+ + T + ++YA N F + +G R + Sbjct: 85 QSVFPL--GAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAEGAR----PSEE 138 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--- 342 +++V +TDG + + +A+ RG +YA+G++ LR ASP Sbjct: 139 RVPRVLVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLD 192 Query: 343 NSFYLVEN 350 +LVE+ Sbjct: 193 QHVFLVES 200 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust. Identities = 41/210 (19%), Positives = 98/210 (46%), Gaps = 32/210 (15%) Query: 153 HIVMPITSSVKVNSQT-----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H + P ++ + ++ + +D+++++D S+S+ ++ + +N +++ Sbjct: 363 HCLCPEGRRLQADGKSCDRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVD 416 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPG 262 + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 417 FLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLA 470 Query: 263 LKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++ F + QG R D N ++ + TDG + +D S++ AK+ G + Sbjct: 471 LRHMVEHSFSEAQGAR----PRDLNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIV 521 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +YA+G+ E + P+ ++ +P Sbjct: 522 MYAVGVGKAVEEELREIASEPSELHVSYSP 551 Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust. Identities = 43/188 (22%), Positives = 87/188 (46%), Gaps = 31/188 (16%) Query: 172 LDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ ++D SRS+ F F++ + ++S++ L N + G++ +S+++ Sbjct: 35 LDLVFMIDSSRSVRPFEFETMRQFLVGLLRSLDVGL----------NATRVGVIQYSSQV 84 Query: 231 EEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDA 285 + F L G ++R I+ + T + ++YA N F + +G R + Sbjct: 85 QSVFPL--GAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAEGAR----PSEE 138 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--- 342 +++V +TDG + + +A+ RG +YA+G++ LR ASP Sbjct: 139 RVPRVLVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLD 192 Query: 343 NSFYLVEN 350 +LVE+ Sbjct: 193 QHVFLVES 200 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 41.6 bits (96), Expect = 0.23, Method: Compositional matrix adjust. Identities = 41/210 (19%), Positives = 98/210 (46%), Gaps = 32/210 (15%) Query: 153 HIVMPITSSVKVNSQT-----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H + P ++ + ++ + +D+++++D S+S+ ++ + +N +++ Sbjct: 173 HCLCPEGRRLQADGKSCDRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVD 226 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPG 262 + + P+ V GLV FS+++ EF L +G + ++ ++Y+ + T + Sbjct: 227 FLDVSPEGTRV---GLVQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLA 280 Query: 263 LKYAYNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++ F + QG R D N ++ + TDG + +D S++ AK+ G + Sbjct: 281 LRHMVEHSFSEAQGAR----PRDLNVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIV 331 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +YA+G+ E + P+ ++ +P Sbjct: 332 MYAVGVGKAVEEELREIASEPSELHVSYSP 361 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 41.2 bits (95), Expect = 0.25, Method: Compositional matrix adjust. Identities = 47/200 (23%), Positives = 86/200 (43%), Gaps = 27/200 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D + D+M ++D S S+ + +K+ M +K+ ++ + ++ D VQ G+V FS+ Sbjct: 617 DMKADIMFLVDSSGSIGP---ENFSKMKMFMKN---LVSKSQIGADR---VQIGVVQFSH 667 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +E F L S + I ++ G +T + L + +G R N Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR-------PN 720 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +K ++ +TDGE D +K G I+Y++G+ + P + Sbjct: 721 VRKFLILITDGEAQDIVRDPALAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEMVF 775 Query: 347 LVENPHSMYDAFSHIGKDIV 366 VEN +D HI D+V Sbjct: 776 YVEN----FDILQHIEDDLV 791 Score = 40.8 bits (94), Expect = 0.38, Method: Compositional matrix adjust. Identities = 47/201 (23%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNATAKALRDKGILVLAVGIAGANSWELLAMAGSGDK 964 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 965 YYFVETFGGLKGIFSDVSASV 985 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 41.2 bits (95), Expect = 0.27, Method: Compositional matrix adjust. Identities = 44/185 (23%), Positives = 84/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + + I + + + P+ V G++ +S++++ Sbjct: 35 LDLVFVIDSSRSVRPFEFETMRRFLVNI------IRGLDIGPNATRV---GVIQYSSQVQ 85 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L ++R I + T + ++YA N F + +G R A Sbjct: 86 SVFPLGAFSRREDMERAIHAIVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PSQARVP 141 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 142 RVAVIVTDG-----RPQDRVTEVAAQARNRGIEIYAVGVQRADVGS-LRAMASPPLDEHV 195 Query: 346 YLVEN 350 +LVE+ Sbjct: 196 FLVES 200 Score = 37.0 bits (84), Expect = 5.3, Method: Compositional matrix adjust. Identities = 40/186 (21%), Positives = 87/186 (46%), Gaps = 27/186 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V+D S+S+ ++ + +N +++ + + P+ V GLV +S+++ Sbjct: 387 VDLVLVIDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GLVQYSSRVR 437 Query: 232 -EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDA 285 EF L +G + ++ ++Y+ K T + L++ F + QG R Sbjct: 438 TEFPLGRYGTADEVKQAVLAVEYMEK---GTMTGLALRHLVEHSFSEAQGARPRAQ---- 490 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 N ++ + TDG + +D S++ AK+ G I+YA+G+ E + P Sbjct: 491 NVPRVGLVFTDGRS----QDDISVWA-ARAKEEGIIMYAVGVGKAVEEELREIASDPPEQ 545 Query: 346 YLVENP 351 ++ +P Sbjct: 546 HVSYSP 551 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 84/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + ++ + + P+ V G++ +S++++ Sbjct: 32 LDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNATRV---GVIQYSSQVQ 82 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L + R I+ L T + ++YA N F + +G R + Sbjct: 83 SVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 138 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 139 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLEEHV 192 Query: 346 YLVEN 350 +LVE+ Sbjct: 193 FLVES 197 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust. Identities = 41/155 (26%), Positives = 73/155 (47%), Gaps = 25/155 (16%) Query: 219 VQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQG 275 VQ LV +S F L ++ + R ++ G STN+ +KY ++IF +G Sbjct: 432 VQISLVQYSRDPHTEFALNTHHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKIFVASRG 491 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHE 334 RQ N +++V +TDG++ + +D + ++A+G++ +RS Sbjct: 492 ARQ-------NVPRVMVLITDGKSSDSFKDA-----ATNLRNIDVEIFAVGVKDAVRSE- 538 Query: 335 FLRACASP---NSFYLVENPHSMYDAFSHIGKDIV 366 L A A+P N + VE+ +DAF I K++ Sbjct: 539 -LEAIANPPADNHVFEVED----FDAFQRISKELT 568 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 41.2 bits (95), Expect = 0.30, Method: Compositional matrix adjust. Identities = 14/60 (23%), Positives = 36/60 (60%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 N + +G + ++TA+ LP++ G+ I++ +++ +KT + S +D ++ ++ N+G Sbjct: 4 NLGRHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQG 63 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 41.2 bits (95), Expect = 0.30, Method: Compositional matrix adjust. Identities = 43/185 (23%), Positives = 84/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F + + + + ++ + + P+ V G++ +S++++ Sbjct: 32 LDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNATRV---GVIQYSSQVQ 82 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 F L + R I+ L T + ++YA N F + +G R + Sbjct: 83 SVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEERVP 138 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 139 RVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLEEHV 192 Query: 346 YLVEN 350 +LVE+ Sbjct: 193 FLVES 197 >gi|126662673|ref|ZP_01733672.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] gi|126626052|gb|EAZ96741.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] Length = 288 Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust. Identities = 55/171 (32%), Positives = 77/171 (45%), Gaps = 33/171 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+D+S S ESF + K DM + I A L N + GL+ F Sbjct: 72 EEERELTMMLVVDISGS-ESFGTKNQQKRDM-VTEIAATL----AFSATQNNDKIGLLLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST---PGLKYAYNQIFDMQGMRQHCNTE 283 S++IE F + G SH+ R I+ L +F +N T LKY + G+ + Sbjct: 126 SDQIELFIPPKKGKSHVLRIIRELIEFQPKSNKTDLAQALKY-------LSGVLK----- 173 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KK IVFM + K+ QQ+L AKK GIRV E Sbjct: 174 ----KKAIVFMIS--DFMVKDYQQTLKIA--AKKHD----VTGIRVYDQRE 212 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 40.8 bits (94), Expect = 0.35, Method: Compositional matrix adjust. Identities = 42/187 (22%), Positives = 85/187 (45%), Gaps = 29/187 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ + + + + ++ + + P+ V G++ +S++++ Sbjct: 35 LDLVFVIDSSRSVRPL------EFETMRRFLVGLVRSLDVGPNATRV---GVIQYSSQVQ 85 Query: 232 EFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDAN 286 F L G ++R ++ L T + ++YA N F + +G R + Sbjct: 86 SVFPL--GAFSRREDMERALRTLVPLAQGTMTGLAIQYAMNVAFSVAEGAR----PPEER 139 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 ++ V +TDG + + +A+ RG +YA+G++ LRA ASP Sbjct: 140 VPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPLDE 193 Query: 344 SFYLVEN 350 +LVE+ Sbjct: 194 HVFLVES 200 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 40.8 bits (94), Expect = 0.38, Method: Compositional matrix adjust. Identities = 24/94 (25%), Positives = 46/94 (48%), Gaps = 12/94 (12%) Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI---------GIRVIRSHEF 335 A ++++V +TDGEN + + +A++ GA++YA+ G H Sbjct: 182 AGQRRVLVLITDGENTTHHGSYDAA--LEQAQRAGAMIYALIIVPVSADAGRNTGGEHAL 239 Query: 336 LR-ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 ++ A + +Y VE+ H + AF H+ D+ T+ Sbjct: 240 IQLARDTGGKYYYVEDKHDLAPAFQHVSDDLRTQ 273 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 40.4 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 37/82 (45%), Gaps = 7/82 (8%) Query: 288 KKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS 341 KK IVFMTDG+N + D + C++AK +G +Y I L CAS Sbjct: 27 KKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKGIEIYTIAFMAPAGGQALLHYCAS 86 Query: 342 PNSFYL-VENPHSMYDAFSHIG 362 +S Y E + AF IG Sbjct: 87 DDSHYFQAEKMEDLLAAFEAIG 108 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 40.4 bits (93), Expect = 0.42, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 84/203 (41%), Gaps = 33/203 (16%) Query: 169 DARLDMMIVLDVSRS--MESFFDSSITKIDMAIKS-INAMLEEVKLIPDVNNVVQSGLVT 225 D + D+M ++D S S +E+F + ++ KS I A + VQ G+V Sbjct: 617 DMKADIMFLVDSSGSIGLENFSKMKVFMKNLVSKSQIGA------------DRVQIGVVQ 664 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 665 FSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR------ 718 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N +K ++ +TDGE +D +K G I+Y++G+ + P Sbjct: 719 -PNVRKFLILITDGEAQDIVKDPAVAL-----RKDGVIIYSVGVFGSNVTQLEEISGKPE 772 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 773 MVFYVEN----FDILQHIEDDLV 791 Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust. Identities = 46/201 (22%), Positives = 90/201 (44%), Gaps = 29/201 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI + E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNATAKALRDKGILVLAVGIAGANTWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 +Y VE + FS + + Sbjct: 965 YYFVETFGGLKGIFSDVSASV 985 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 40.4 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 60/257 (23%), Positives = 115/257 (44%), Gaps = 36/257 (14%) Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 ++T+L++ + + Y IS + + T + T + P+ V+ + D + Sbjct: 385 QTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQPK--TQPMKVQVECSRGVDIK 442 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKI 230 D++ ++D S S+ + ++ K+ ++ + E + P N VQ LV +S + Sbjct: 443 ADIVFLVDGSYSIGT---ANFVKVRAFLEVLAKSFE---ISP---NRVQISLVQYSRDPH 493 Query: 231 EEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANYK 288 EF L E+ V + + I G STN+ + Y +IF +G R +N Sbjct: 494 TEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPNKGSR-------SNVP 546 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNS--- 344 K+++ +TDGE+ D + + ++A+G++ +RS L A ASP + Sbjct: 547 KVMILITDGESSDAFRDPAI-----KLRNSDVEIFAVGVKDAVRSE--LEAIASPPAETH 599 Query: 345 FYLVENPHSMYDAFSHI 361 + VE+ +DAF I Sbjct: 600 VFTVED----FDAFQRI 612 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust. Identities = 37/149 (24%), Positives = 69/149 (46%), Gaps = 19/149 (12%) Query: 211 LIPDVN---NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 L+P +N N + G++ +S++++ F L ++R I+ L T + ++Y Sbjct: 8 LLPGLNVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQY 67 Query: 266 AYNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A N F + +G R + ++ V +TDG + + +A+ RG +YA Sbjct: 68 AMNVAFSVAEGAR----PPEERVPRVAVIVTDG-----RPQDRVAEVAAQARARGIEIYA 118 Query: 325 IGIRVIRSHEFLRACASP---NSFYLVEN 350 +G++ LRA ASP +LVE+ Sbjct: 119 VGVQRADVGS-LRAMASPPLDEHVFLVES 146 Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust. Identities = 40/194 (20%), Positives = 88/194 (45%), Gaps = 27/194 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N + +D+++++D S+S+ ++ + +N +++ + + P+ V GL Sbjct: 243 CNRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEGTRV---GL 293 Query: 224 VTFSNKIE-EFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMR 277 V FS+++ EF L +G + ++ ++Y+ + T + L++ F + QG R Sbjct: 294 VQFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMER---GTMTGLALRHMVEHSFSEAQGAR 350 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++ + TDG + +D S++ AK+ G +VYA+G+ E Sbjct: 351 ----PRALNVPRVGLVFTDGRS----QDDISVWAAR-AKEEGIVVYAVGVGKAVEAELRE 401 Query: 338 ACASPNSFYLVENP 351 + P ++ P Sbjct: 402 IASEPAELHVSYAP 415 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 40.4 bits (93), Expect = 0.46, Method: Compositional matrix adjust. Identities = 47/200 (23%), Positives = 86/200 (43%), Gaps = 27/200 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D + D+M ++D S S+ + +K+ M +K+ ++ + ++ D VQ G+V FS+ Sbjct: 617 DMKADIMFLVDSSGSIGP---ENFSKMKMFMKN---LVSKSQIGADR---VQIGVVQFSH 667 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +E F L S + I ++ G +T + L + +G R N Sbjct: 668 ENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR-------PN 720 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +K ++ +TDGE D +K G I+Y++G+ + P + Sbjct: 721 VRKFLILITDGEAQDIVRDPAIAL-----RKEGVIIYSVGVFGSNVTQLEEISGKPEMVF 775 Query: 347 LVENPHSMYDAFSHIGKDIV 366 VEN +D HI D+V Sbjct: 776 YVEN----FDILQHIEDDLV 791 Score = 39.7 bits (91), Expect = 0.78, Method: Compositional matrix adjust. Identities = 47/197 (23%), Positives = 89/197 (45%), Gaps = 29/197 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI S E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNTTAKALRDKGILVLAVGIAGANSWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHI 361 +Y VE + FS + Sbjct: 965 YYFVETFGGLKGIFSDV 981 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 5/90 (5%) Query: 285 ANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACA 340 AN+ F+ D N S K D C AK G +V+ IG V S H+ +R+CA Sbjct: 505 ANWYDQWNFLADAHDYFNYSEKNDNLD-EICTAAKNAGMVVFTIGFEVSGSQHDIMRSCA 563 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S ++Y + + AF+ I ++I R+ Sbjct: 564 SAPAYYFDVDGLDISAAFAAIAREISKLRL 593 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 40.4 bits (93), Expect = 0.51, Method: Compositional matrix adjust. Identities = 51/223 (22%), Positives = 99/223 (44%), Gaps = 32/223 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P +++K + LD++ V+D SRS+ F ++ + M++ + + Sbjct: 21 PKPAALKCRT---GPLDIVFVIDSSRSVRPFEFETMRRF---------MIDIIGNLDVGP 68 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N + G++ +S++++ F L+ + + ++R I + T + ++YA N F +Q Sbjct: 69 NATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQ 128 Query: 275 -GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R +I + +TDG +D+ S + A+ G +YA+GI+ + Sbjct: 129 EGAR----PPHKKIPRIAIIVTDGR----PQDRVSEVAAH-ARNAGIEIYAVGIQRADMN 179 Query: 334 EFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 LRA ASP +LVE+ ++ GK K D Sbjct: 180 S-LRAMASPPLEEHVFLVES----FELIQQFGKQFQDKLCAID 217 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 40.4 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 31/183 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDM--AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 D++ +LD S SS+ K D + ++ ++E ++ PD V G+V +S++ Sbjct: 38 DLVFILDAS--------SSVGKEDFEKVRQWVSNLVETFEIGPDKTRV---GVVRYSDRP 86 Query: 230 IEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTED 284 EF L ++ + RKI+Y +G +TN+ L+Y F + G R D Sbjct: 87 TTEFDLGKYKTREEIKEAARKIRY---YGGNTNTGDALRYINTYSFSKEAGGR----LSD 139 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK+ + +TDG + L N A++ G ++A+G+ E + P S Sbjct: 140 RTVKKVAILLTDG-----RSQDYVLDPANAARQAGIRIFAVGVGEALKEELDEIASEPKS 194 Query: 345 FYL 347 ++ Sbjct: 195 AHV 197 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 40.4 bits (93), Expect = 0.51, Method: Composition-based stats. Identities = 45/183 (24%), Positives = 82/183 (44%), Gaps = 31/183 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDM--AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 D++ +LD S SS+ K D + ++ ++E ++ PD V G+V +S++ Sbjct: 38 DLVFILDAS--------SSVGKEDFEKVRQWVSNLVETFEIGPDKTRV---GVVRYSDRP 86 Query: 230 IEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTED 284 EF L ++ + RKI+Y +G +TN+ L+Y F + G R D Sbjct: 87 TTEFDLGKYKTCEEIKEAARKIRY---YGGNTNTGDALRYINTYSFSKEAGGR----LSD 139 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK+ + +TDG + L N A++ G ++A+G+ E + P S Sbjct: 140 RTVKKVAILLTDG-----RSQDYVLDPANAARQAGIRIFAVGVGEALKEELDEIASEPKS 194 Query: 345 FYL 347 ++ Sbjct: 195 AHV 197 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 40.0 bits (92), Expect = 0.53, Method: Compositional matrix adjust. Identities = 30/120 (25%), Positives = 51/120 (42%), Gaps = 17/120 (14%) Query: 241 SHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDM-QGMRQHCNTE------DANYK 288 +HL + Y +TN+ G+ + + + QG Q E A+ + Sbjct: 316 AHLSTRAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQWQGRWQQGAAELPRPYGQADNR 375 Query: 289 KIIVFMTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 KI+V +DGE++ + D++ L C E K++G VY + F+ CAS S Sbjct: 376 KILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEG--DARFVAQCASERSL 433 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 40.0 bits (92), Expect = 0.54, Method: Compositional matrix adjust. Identities = 33/143 (23%), Positives = 52/143 (36%), Gaps = 35/143 (24%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-------STKEDQ 306 G TN + GL + + + KI++ +TDGEN+ TK D Sbjct: 395 GSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKIVLLLTDGENVVYGASEQPTKSDY 454 Query: 307 QSLYY-------------------------CNEAKKRGAIVYAIGIR--VIRSHEFLRAC 339 S Y C + K +G +Y + ++ + AC Sbjct: 455 TSYGYLAGGRFGSDDQTAAARNVDGWTKSVCTQLKNQGVQIYTMVLQSDTAANRALYSAC 514 Query: 340 AS-PNSFYLVENPHSMYDAFSHI 361 AS P+ +Y V +P + D F HI Sbjct: 515 ASDPSGYYAVNDPAKLPDVFQHI 537 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 40.0 bits (92), Expect = 0.58, Method: Compositional matrix adjust. Identities = 36/134 (26%), Positives = 64/134 (47%), Gaps = 18/134 (13%) Query: 219 VQSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 Q G+V +S+ + F + G S L + ++ + G +TN+ +K+A +++F + Sbjct: 78 TQVGVVQYSD--DPFLHIPLGKHFSSSDLIKAMESIEYMGGNTNTGRAIKFANDKLFALS 135 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + KI V +TDG K + L A+K+G I++AIG+ Sbjct: 136 ERGPN------GIAKIAVVLTDG-----KSQDEVLAAAEAARKKGIILFAIGVGSETEEA 184 Query: 335 FLRACAS-PNSFYL 347 LRA A+ P+S Y+ Sbjct: 185 QLRAIANKPSSTYV 198 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 40.0 bits (92), Expect = 0.58, Method: Compositional matrix adjust. Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 34/219 (15%) Query: 148 YTNS-RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 Y+NS ++P+ +K N + + +D+++VLD S SM D+ I K+++A + M+ Sbjct: 382 YSNSVLEKMLPVKMEIK-NKEKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMI 440 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-VSHLQRKIKYLS--KFGVSTNSTPGL 263 E ++ V G++ F + ++ ++G +S + I+ +S + G T P L Sbjct: 441 EHLESSDGV------GVIAFDHNY--YWAYKFGKISKKEDVIESISSIEVGGGTAIIPPL 492 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY--CNEAKKRGAI 321 A + + + IV +TDG +Q Y NEAK+ Sbjct: 493 SEAVKTLKKSKAKSKL-----------IVLLTDGMG------EQGGYEIPANEAKRNNIK 535 Query: 322 VYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + IG+ + L AS S FYLV NP + D F Sbjct: 536 ITTIGVGKYVNATVLSWIASFTSGRFYLVSNPSELVDVF 574 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 40.0 bits (92), Expect = 0.59, Method: Compositional matrix adjust. Identities = 51/179 (28%), Positives = 80/179 (44%), Gaps = 23/179 (12%) Query: 172 LDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVK----LIPDVNNVVQSGLVT 225 +D+ +VLDVS SM F S +KI ++N+ L+E I D NN V+ LV Sbjct: 244 IDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALVK 303 Query: 226 FSNKIEEFFLLEW---------GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFD--M 273 ++N+I + + + ++ L+ G+ S GL+ A D M Sbjct: 304 YANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLLKTSVNGLQAAGATYADAAM 363 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQ---QSLYYCNEAKKRGAIVYAIGI 327 + +Q A KK ++F TDGE + S +D ++ E K G VY+IGI Sbjct: 364 EVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDDDVANAAIKKSQELKNAGTTVYSIGI 422 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 40.0 bits (92), Expect = 0.62, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 84/203 (41%), Gaps = 33/203 (16%) Query: 169 DARLDMMIVLDVSRS--MESFFDSSITKIDMAIKS-INAMLEEVKLIPDVNNVVQSGLVT 225 D + D+M ++D S S +E+F + ++ KS I A + VQ G+V Sbjct: 617 DMKADIMFLVDSSGSIGLENFSKMKVFMKNLVSKSQIGA------------DRVQIGVVQ 664 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 665 FSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR------ 718 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N +K ++ +TDGE +D +K G I+Y++G+ + P Sbjct: 719 -PNVRKFLILITDGEAQDIVKDPAVAL-----RKDGVIIYSVGVFGSNVTQLEEISGKPE 772 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 773 MVFYVEN----FDILQHIEDDLV 791 Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust. Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 29/197 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI + E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNATAKALRDKGILVLAVGIAGANTWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHI 361 +Y VE + FS + Sbjct: 965 YYFVETFGGLKGIFSDV 981 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 40.0 bits (92), Expect = 0.63, Method: Compositional matrix adjust. Identities = 49/208 (23%), Positives = 93/208 (44%), Gaps = 29/208 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M++ + + N + G++ +S++++ Sbjct: 33 LDIVFVIDSSRSVRPFEFETMRRF---------MIDIIGNLDVGPNATRVGVIQYSSQVQ 83 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYK 288 F L+ + + ++R I + T + ++YA N F +Q G R Sbjct: 84 NIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQEGAR----PPHKKIP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 +I + +TDG +D+ S + A+ G +YA+GI+ + LRA ASP Sbjct: 140 RIAIIVTDGR----PQDRVSEVAAH-ARNAGIEIYAVGIQRADMNS-LRAMASPPLEEHV 193 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ ++ GK K D Sbjct: 194 FLVES----FELIQQFGKQFQDKLCAID 217 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 40.0 bits (92), Expect = 0.64, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 84/203 (41%), Gaps = 33/203 (16%) Query: 169 DARLDMMIVLDVSRS--MESFFDSSITKIDMAIKS-INAMLEEVKLIPDVNNVVQSGLVT 225 D + D+M ++D S S +E+F + ++ KS I A + VQ G+V Sbjct: 617 DMKADIMFLVDSSGSIGLENFSKMKVFMKNLVSKSQIGA------------DRVQIGVVQ 664 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 665 FSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR------ 718 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N +K ++ +TDGE +D +K G I+Y++G+ + P Sbjct: 719 -PNVRKFLILITDGEAQDIVKDPAVAL-----RKDGVIIYSVGVFGSNVTQLEEISGKPE 772 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 773 MVFYVEN----FDILQHIEDDLV 791 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 46/197 (23%), Positives = 89/197 (45%), Gaps = 29/197 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + +I K + M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSID-YQEYNIMK--------DFMIGLVKKADVGKNQVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ L K++ +S G +T + L ++ + + +G R H Sbjct: 858 VLFYLD----ELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHMFTEARGSRLH-----K 908 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDGE+ D + L +A + +G +V A+GI + E L S + Sbjct: 909 GVPQVLIVITDGES----HDAEKLNATAKALRDKGILVLAVGIAGANTWELLAMAGSSDK 964 Query: 345 FYLVENPHSMYDAFSHI 361 +Y VE + FS + Sbjct: 965 YYFVETFGGLKGIFSDV 981 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 39.7 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 13/132 (9%) Query: 219 VQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 VQ GLV +S+ I F L+ S + +I+ L STN+ L++ N + +M G Sbjct: 272 VQVGLVQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIRANYLTEMSG 331 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +D +I++ +TDGE S E Q + ++ K+ G VY +GI + E Sbjct: 332 SR----AKDG-VPQIVILVTDGE--SNDEVQDA---ADQLKRDGVFVYVVGINIQDVQEL 381 Query: 336 LRACASPNSFYL 347 + + P +L Sbjct: 382 QKIASEPFEEFL 393 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 39.7 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 13/132 (9%) Query: 219 VQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 VQ GLV +S+ I F L+ S + +I+ L STN+ L++ N + +M G Sbjct: 272 VQVGLVQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIRANYLTEMSG 331 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +D +I++ +TDGE S E Q + ++ K+ G VY +GI + E Sbjct: 332 SR----AKDG-VPQIVILVTDGE--SNDEVQDA---ADQLKRDGVFVYVVGINIQDVQEL 381 Query: 336 LRACASPNSFYL 347 + + P +L Sbjct: 382 QKIASEPFEEFL 393 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 39.7 bits (91), Expect = 0.69, Method: Compositional matrix adjust. Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 4/70 (5%) Query: 283 EDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + N +K+++ TDG +L K D++ + C E KK+G + I I + +++CA Sbjct: 379 QHPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQG--IEIISIDFNNRSQVMKSCA 436 Query: 341 SPNSFYLVEN 350 S +Y+ +N Sbjct: 437 SAGQYYIADN 446 >gi|311105413|ref|YP_003978266.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] gi|310760102|gb|ADP15551.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] Length = 2061 Score = 39.7 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 45/153 (29%), Positives = 69/153 (45%), Gaps = 37/153 (24%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 N A ++ +VLD+S SM+ ++ S T+++ A K++ ++LE D + Sbjct: 1436 NVTPGASYNIALVLDLSGSMDYYWGSGSNQETRLETAKKALKSLLENQLATHD--GTINV 1493 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKY--------AYNQI- 270 L+TF++ S LQ+ I L+ V N GLK A+N+ Sbjct: 1494 SLITFADS----------SSKLQKAISGLTPDNVDDMVNILLGLKAGGGTPYGAAFNETK 1543 Query: 271 --FDMQGMRQHCNTEDAN---YKKIIVFMTDGE 298 FD Q TEDAN YK + F+TDGE Sbjct: 1544 SWFDGQ------PTEDANGNAYKNLTFFLTDGE 1570 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 39.7 bits (91), Expect = 0.80, Method: Compositional matrix adjust. Identities = 45/185 (24%), Positives = 85/185 (45%), Gaps = 25/185 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SRS+ F ++ + M I + + + P+ V G++ +S++++ Sbjct: 33 LDIVFVIDSSRSVRPFEFETMRRFMMDI------IGNLDVGPNATRV---GVIQYSSQVQ 83 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYK 288 F L+ + + ++R I + T + ++YA N F Q G R Sbjct: 84 NIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQEGAR----PLHKRIP 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 +I + +TDG + + +A+ G +YA+GI+ + LRA ASP Sbjct: 140 RIAIVVTDG-----RPQDRVTEVATQARNAGIEIYAVGIQRADMNS-LRAMASPPLEEHV 193 Query: 346 YLVEN 350 +LVE+ Sbjct: 194 FLVES 198 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 39.3 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 45/202 (22%), Positives = 91/202 (45%), Gaps = 22/202 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D D+ LD S SF++ + IKS+ +++++L ++ V+ G++T+S+ Sbjct: 1836 DIPTDIAFALDAS---ASFYEEGFQQEKDFIKSV---IDKIEL---SSSGVRVGVLTYSD 1886 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 KI F + +++ I + + T GL+ A + G R + Sbjct: 1887 EAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLEAAKELFLEKSGGR-------GS 1939 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSF 345 KK+++ +TDG+ + + + Y E + G ++AIGI I + + P Sbjct: 1940 SKKVLILLTDGQQTYIPDAKDPVDYAKELAEYGVDIFAIGIGDEINKVDLEDLISKPQHI 1999 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 +L ++ +S+ + + KDI T Sbjct: 2000 FLSDDINSL---ITDLSKDIST 2018 >gi|89091868|ref|ZP_01164823.1| sensory box sensor/GGDEF/EAL domain protein [Oceanospirillum sp. MED92] gi|89083603|gb|EAR62820.1| sensory box sensor/GGDEF/EAL domain protein [Oceanospirillum sp. MED92] Length = 735 Score = 39.3 bits (90), Expect = 1.00, Method: Compositional matrix adjust. Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 15/78 (19%) Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN-----E 314 T L A+NQ+F++QG+R C+T + +F D +N Q L+ C E Sbjct: 403 TQSLVGAFNQVFELQGLRLRCSTSIG----LSLFPHDADN------SQELFKCADQALYE 452 Query: 315 AKKRGAIVYAIGIRVIRS 332 AKK G YA+ +R+ Sbjct: 453 AKKNGGACYAVFTPRLRT 470 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 39.3 bits (90), Expect = 1.0, Method: Compositional matrix adjust. Identities = 30/135 (22%), Positives = 64/135 (47%), Gaps = 17/135 (12%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI--MNE 68 F++ KG +++AI +P + L+LG+++ S+ K + S + +L H + I N+ Sbjct: 30 LFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEALSHGMSLICYQND 89 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN---- 124 +N K+ D++ +K ++N + D + +++ +DI + + Sbjct: 90 IERDNLAKIILNDLIVSLK-----------KNNFTKQEADLVAKNSKIDITTLINDSTNV 138 Query: 125 EGYSISAISRYKIPL 139 + Y S YK+PL Sbjct: 139 KSYHFYIKSVYKMPL 153 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 50/215 (23%), Positives = 98/215 (45%), Gaps = 38/215 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV----QSGLVTFS 227 +D++IVLD S S+ +++ I++ L++ + P + Q G++ + Sbjct: 110 MDIVIVLDGSNSIYPWYEVQAFLINI--------LQKFYIGPGQIQFLWFTGQVGVLQYG 161 Query: 228 NKI-EEFFLLEW-GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 K+ EF L ++ V + ++ + ++ + G TN+ G+ A +Q F G R Sbjct: 162 EKVVHEFKLSDYKSVEEVVKRARSINQRGGEETNTALGINVACSQAFKHGGRR------- 214 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLRA 338 KK+++ +TDGE+ + + QQ + C +K G YAI + R I FL Sbjct: 215 -GAKKVMIVITDGESHDSADLQQVIKDC---EKDGITRYAIAVLGYYNRRGINPEAFLNE 270 Query: 339 ----CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G+ I + Sbjct: 271 IKYIASDPDDKHFFNVTDEAALKDIVDALGERIFS 305 >gi|39936736|ref|NP_949012.1| hypothetical protein RPA3674 [Rhodopseudomonas palustris CGA009] gi|192292562|ref|YP_001993167.1| TadE family protein [Rhodopseudomonas palustris TIE-1] gi|39650592|emb|CAE29115.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286311|gb|ACF02692.1| TadE family protein [Rhodopseudomonas palustris TIE-1] Length = 177 Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust. Identities = 21/49 (42%), Positives = 26/49 (53%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRS 57 R F N KG + AI PI F +L IIEV+ IFF VL + + S Sbjct: 13 RKFRRNRKGSAAVEFAIVAPIFFALLFAIIEVAMIFFASQVLETAVQDS 61 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 3/64 (4%) Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNS 344 N +II+F+TDG + E+ Q++ NE + GA+VY IG+ + S LR AS P S Sbjct: 348 NVSQIIIFVTDGGVVDDSEESQTIL--NELRFSGALVYTIGVGRMVSRPQLRMIASRPAS 405 Query: 345 FYLV 348 ++ Sbjct: 406 HHVT 409 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 46/187 (24%), Positives = 75/187 (40%), Gaps = 33/187 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D R D+ I+ D S S+E F +K ++E++ + P+ V ++ F+ Sbjct: 386 DCRYDIGIIFDSSGSLEKNFQKQ-------LKFATTLVEQMPISPNATRVA---IIQFAG 435 Query: 229 KIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 K + L ++ ++K + S F T T G +F Q DA Sbjct: 436 KTKLRVLADFAQKKSAAELKTIIGRSHFFSGTTFTNGALKTMADLF------QKSKRADA 489 Query: 286 NYKKIIVFMTDGENLS-TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRAC 339 K +V TDG + T E ++L K +G +VY +GI +S E Sbjct: 490 KLK--VVLFTDGYSAEDTSEGAEAL------KSQGVVVYTVGISTEKSTGLNMKELHGMA 541 Query: 340 ASPNSFY 346 SPN F+ Sbjct: 542 TSPNHFF 548 >gi|73998866|ref|XP_535021.2| PREDICTED: similar to A-domain containing protein similar to matrilin and collagen [Canis familiaris] Length = 788 Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust. Identities = 45/201 (22%), Positives = 88/201 (43%), Gaps = 27/201 (13%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEV 209 SR + I+++ K+ Q A +D++ ++D S S+ + F+ S M +++ E V Sbjct: 48 SRETIGKISAASKMM-QCSAAVDILFLIDGSHSVGKGSFERSKHFAIMVCDALDINPERV 106 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ G + FS+ F L+ S ++ KIK + G T + LKY Sbjct: 107 RV----------GAIQFSSAPHLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLL 156 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + F +A+ +I++ +TDG + + K+RG ++A+G+ Sbjct: 157 RRGFP--------GGRNASVPQILIVITDG-----RSQGHVAVPTKQLKERGVTIFAVGV 203 Query: 328 RVIRSHEFLRACASPNSFYLV 348 R R E R + P +++ Sbjct: 204 RFPRWEELHRLASEPREQHVL 224 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust. Identities = 40/184 (21%), Positives = 85/184 (46%), Gaps = 23/184 (12%) Query: 168 TDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T+ LD++ V+D S+S+ E F+ + + + A+++ + + P V GL+ + Sbjct: 803 TEGPLDLVFVIDGSKSLGEDNFE-------IVKQFLTAIIDSLAVSPKAARV---GLLQY 852 Query: 227 SNKIE-EFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTE 283 S ++ EF L + + +++ + ++ G + + LK+ + + F ++G R Sbjct: 853 STQVRAEFTLRSFSTAKDMKKAVAHMKYMGKGSMTGQALKHMFERSFTQLEGAR----PL 908 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 A ++ + TDG +D S + N+AK G +YA+G+ E + P Sbjct: 909 SARVPRVAIVFTDGR----AQDDVSEWA-NKAKANGITMYAVGVGKAIEEELQEIASEPT 963 Query: 344 SFYL 347 +L Sbjct: 964 DKHL 967 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust. Identities = 22/96 (22%), Positives = 46/96 (47%), Gaps = 8/96 (8%) Query: 283 EDANYKKIIVFMTDGENLSTKEDQ-------QSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ + K+++ M+DG+ T+ + Q+ C++ K++ ++Y +G + Sbjct: 363 DEPDATKVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDAGYGQDV 422 Query: 336 LRACASPNSF-YLVENPHSMYDAFSHIGKDIVTKRI 370 L CAS +F Y N + +A+ I + I RI Sbjct: 423 LNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRI 458 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 35/145 (24%), Positives = 62/145 (42%), Gaps = 16/145 (11%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN-YKKIIVFMTDGEN 299 S L+ +I +S G STN GL + + + G + A Y+ +V ++DG N Sbjct: 460 STLKNQIDSMSPSG-STNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYLVLLSDGLN 518 Query: 300 L-----------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACASPNSF 345 S + D + C + K G +++ + + V + L+ CAS +F Sbjct: 519 TRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGNRDPLSQVLQDCASNGNF 578 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 ++ + + DAF +I I RI Sbjct: 579 QMITSANQTADAFQNILTQISQLRI 603 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 38.5 bits (88), Expect = 1.6, Method: Compositional matrix adjust. Identities = 41/181 (22%), Positives = 84/181 (46%), Gaps = 23/181 (12%) Query: 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D S S+ E+ F + T + + S + ++V+L G+V +S + + Sbjct: 1211 DLIFLIDGSESISENSFSTMKTFMKDVVDSFDISRDKVRL----------GVVQYSQEPQ 1260 Query: 232 -EFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 EF+L E + + ++ +I + + ST + GL++ + G R++ + Sbjct: 1261 REFYLNEFYSDTIIKEQINRIEQLRSSTFTGKGLRFVQSLFESANGGRKN-----QGVSQ 1315 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +V +TDG + + +D N G V+A+G+ ++ S E LR + VE Sbjct: 1316 NLVVITDGYSADSVDDAAMALRSN-----GIHVFAVGVGIVNSFELLRIAGDARRVFTVE 1370 Query: 350 N 350 N Sbjct: 1371 N 1371 Score = 36.2 bits (82), Expect = 8.7, Method: Compositional matrix adjust. Identities = 44/194 (22%), Positives = 84/194 (43%), Gaps = 42/194 (21%) Query: 144 FIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSMES--------FFDSSITK 194 F+ + + +HI I V N + R D++ ++D S S+ + F S + K Sbjct: 803 FVNDFDSLKHIKHEIVQEVCSTNVCKNVRADIVFLVDSSNSIRAAEFQKIKDFMQSFVIK 862 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSK 252 +D+ + + V+ GL+ FS++I E F L+ ++ +QR I+ + + Sbjct: 863 VDVGLDN-----------------VRIGLIQFSSEIREEFQLDRYSTIADVQRAIQEMQQ 905 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 + T + L +A + +G R K+ ++ +TDGE Q S+ Sbjct: 906 IKLGTLTGKALTFAASYFDRPKGGR-------PELKQYLIVITDGEA------QDSVKSP 952 Query: 313 NEA-KKRGAIVYAI 325 A + +G +YAI Sbjct: 953 ARAIRDKGITIYAI 966 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust. Identities = 38/163 (23%), Positives = 79/163 (48%), Gaps = 21/163 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D RLD++ ++D SRS+ + K+ I +I L+ + + PDV V GL+ + Sbjct: 14 SDKRLDLVFIIDSSRSVRPY---DFEKVKEFILNI---LQFLDIRPDVTRV---GLIQYG 64 Query: 228 NKIE-EFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTED 284 + ++ EF L + ++R ++ + T + +++A N F + +G R Sbjct: 65 STVKNEFSLKTFARKQDMERAVRRMMYLSTGTMTGLAIQFAVNIAFSETEGAR----PLS 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N ++I+ +TDG + + +A+ G +++AIG+ Sbjct: 121 QNVPRVIMIVTDGRPQDPVAEIAA-----KARNSGILIFAIGV 158 Score = 37.4 bits (85), Expect = 4.3, Method: Compositional matrix adjust. Identities = 39/183 (21%), Positives = 83/183 (45%), Gaps = 21/183 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T+ +D++ V+D S+S+ + ++ + + +L+ + + P + GL+ +S Sbjct: 572 TEGPVDLVFVIDGSKSL------GVNNFEIVKEFVLGILDSLTISPKAARI---GLLQYS 622 Query: 228 NKIE-EFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTED 284 ++ EF L ++ + +++ + + G + + LK + F + +G R Sbjct: 623 TQVRTEFTLKQFSTATDMKKAVSQMKYMGKGSMTGLALKQMTERSFTEAEGARHL----S 678 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A ++ V TDG +D+ S + +AK+RG +YAIGI E + P Sbjct: 679 AKVPRVCVVFTDGR----AQDEVSEWA-AKAKQRGITMYAIGIGKAIEEELREIASDPPE 733 Query: 345 FYL 347 +L Sbjct: 734 KHL 736 >gi|118096903|ref|XP_425157.2| PREDICTED: similar to type VII collagen [Gallus gallus] Length = 1645 Score = 38.5 bits (88), Expect = 1.9, Method: Composition-based stats. Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 19/119 (15%) Query: 236 LEWGVSH------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +E+G S +Q+ I+ LS G +T + GL+Y + F +R K Sbjct: 8 VEFGFSQHINGTGIQKAIQQLSYKGGNTRTGAGLRYISDNFFGPTQLR-------PGVPK 60 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-NSFYL 347 I + +TDG++ E Q +L K G V+A+GI+ E +R + P +SF+ Sbjct: 61 ICILITDGKSQDDAE-QPAL----RLKSLGIKVFAVGIKNADRQELIRVASLPTDSFFF 114 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust. Identities = 40/167 (23%), Positives = 77/167 (46%), Gaps = 24/167 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 S +DA +D+ +LD S S+ + K++ L+ + D+ N Q+G++ Sbjct: 17 SCSDAIVDVGFILDSSGSLRRDY-----------KNLKEFLKTIASFFDIKINGSQAGVI 65 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCN 281 TFS++ E L ++ + + G +T L+++ N +F Q G R Sbjct: 66 TFSHRSEHSIKLNDFSDGDSFEKAVDKIPLMGSTTRIDKALRHSKNVMFTNQNGGRLEAT 125 Query: 282 TEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 K+++ +TDG + S K++ S+ +E + G ++ AIGI Sbjct: 126 -------KLLILLTDGSQTFSAKQEDPSI-IADEIRNDGVLIIAIGI 164 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 44/172 (25%), Positives = 77/172 (44%), Gaps = 31/172 (18%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS--GL 223 S T + LD++ VLD S SM+ I E +K+ D++ V + GL Sbjct: 30 SNTSSNLDVVFVLDSSGSMKESDPEEI------------RTEAIKMFLDMSQVQGNKFGL 77 Query: 224 VTFS-NKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V +S N + E L + + +IK ++ G T++ G+ A N + Sbjct: 78 VAYSDNVVREHNLDTINSNDDKERIKNMALNIPLGQKTDTGAGILEAVNLM--------- 128 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGI 327 + D N+K +I+ ++DG+N ++ + SL + K +G VY IG+ Sbjct: 129 NSGHDKNHKPVIILLSDGKNDPQRKTEDSLKDLKSSISTCKDKGYPVYTIGL 180 >gi|319952787|ref|YP_004164054.1| hypothetical protein Celal_1241 [Cellulophaga algicola DSM 14237] gi|319421447|gb|ADV48556.1| protein of unknown function DUF58 [Cellulophaga algicola DSM 14237] Length = 288 Score = 38.1 bits (87), Expect = 2.0, Method: Compositional matrix adjust. Identities = 35/102 (34%), Positives = 51/102 (50%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+DVS S F SS + I I+A L + N + GL+ F Sbjct: 72 EEERELTMMLVVDVSGS--ELFGSSNQFKNEIITEISATL----AFSALQNNDKIGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKY 265 S++IE F + G SH+ R I+ L +F +N T LKY Sbjct: 126 SDQIELFIPPKKGKSHVLRIIRELIEFTPKSNKTDLAVALKY 167 >gi|325286049|ref|YP_004261839.1| hypothetical protein Celly_1140 [Cellulophaga lytica DSM 7489] gi|324321503|gb|ADY28968.1| protein of unknown function DUF58 [Cellulophaga lytica DSM 7489] Length = 288 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 35/105 (33%), Positives = 55/105 (52%), Gaps = 9/105 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S E+F ++ K D+ I I+A L + N + GL+ F Sbjct: 72 EEERELTMMLMVDVSGS-ENFGTTNQFKKDI-ITEISATL----AFSALQNNDKVGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYN 268 S+++E F + G SH R I+ L +F TN + LKY N Sbjct: 126 SDEVELFIPPKKGKSHALRIIRELLEFTPKSTKTNLSEALKYLTN 170 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 45/199 (22%), Positives = 87/199 (43%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK N VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRNGVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G ST + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGSTYTAKALKHA-NALF----TEEHGSRIKENVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+ + + Q +E + +G + A+G+ E + N+ V+ Sbjct: 919 MLIVITDGK---SHDHDQLNDTASELRDKGITILAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQESMCTE 994 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 44/199 (22%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK N VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRNRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALF----TEKHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+ + + Q +E + +G ++A+G+ E + N+ V+ Sbjct: 919 VLIVITDGK---SHDHDQLNDTASELRDKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust. Identities = 67/265 (25%), Positives = 107/265 (40%), Gaps = 46/265 (17%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 +NE +D+ N+ D VR + + G SA S P + I + N +I Sbjct: 172 QNEAKDDETKNNADKTVR------LGIASYRGMLKSASSGLSTP-EHTKSIEYQGNGAYI 224 Query: 155 VM--PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVK 210 + I ++ +D+ +VLDVS SM F S +KI ++N+ L+E Sbjct: 225 LKLNVIGKDASTSTTDTTPIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETA 284 Query: 211 ----LIPDVNNVVQSGLVTFSNKIEEFFLLEW---------GVSHLQRKIKYLS-KFGVS 256 I D N+ V+ LV ++N+I + + + ++ L+ G+ Sbjct: 285 KTNDTIEDDNDKVKVALVKYANQIGTATGADGCRISNSRQSDTGNCTQIVQELTTDAGLL 344 Query: 257 TNSTPGLKYAYNQIFD--MQGMRQHCNTEDANYKKIIVFMTDGE------------NLST 302 S GL+ A D M+ +Q A KK ++F TDGE N + Sbjct: 345 KTSVNGLQAAGATYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDGDVANAAI 404 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGI 327 K+ Q E K G VY+IGI Sbjct: 405 KKSQ-------ELKNAGTTVYSIGI 422 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust. Identities = 32/127 (25%), Positives = 55/127 (43%), Gaps = 13/127 (10%) Query: 219 VQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 Q G++ +SN+ + LL + LQ+ I ++ TN+ L+Y N F + Sbjct: 38 TQIGVIQYSNQPQSEILLNDHQDAASLQQAISSINYLQGGTNTGKALRYLANNAFSGK-- 95 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 N A K+ + +TDG + + A K G ++YA+GI ++ L Sbjct: 96 ----NGARAGVSKVAIVVTDG-----RSSDDVVRPALNAGKEGIVLYAVGIGGSVDYQEL 146 Query: 337 RACASPN 343 R AS + Sbjct: 147 RDIASSD 153 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 37.7 bits (86), Expect = 2.7, Method: Compositional matrix adjust. Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDA 357 N STK D ++ CN AK +G +VY IG S L+ CAS ++ + + DA Sbjct: 485 NTSTK-DARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDA 543 Query: 358 FSHIGKDIVTKRI 370 F+ I I R+ Sbjct: 544 FASIATSIRQLRL 556 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 25/73 (34%), Positives = 36/73 (49%), Gaps = 2/73 (2%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDA 357 N STK D ++ CN AK +G +VY IG S L+ CAS ++ + + DA Sbjct: 487 NTSTK-DARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDA 545 Query: 358 FSHIGKDIVTKRI 370 F+ I I R+ Sbjct: 546 FASIATSIRQLRL 558 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust. Identities = 43/172 (25%), Positives = 77/172 (44%), Gaps = 31/172 (18%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGL 223 S + +D++ VLD S SM+ I E +K+ D+ V ++GL Sbjct: 30 STSSPNVDVVFVLDSSGSMKQSDPEEI------------RTEAIKMFLDMGQVQGNKAGL 77 Query: 224 VTFSNKI-EEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V +S+ I +E L + +I+ ++ G T+ GL ++G + Sbjct: 78 VAYSDTIIKEHNLDAINSESDKERIEEMASNIPLGQKTDIGRGL---------LEGAKVL 128 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGI 327 + D+N + +I+ ++DG+N S + +SL N A K +G VY IG+ Sbjct: 129 DSGHDSNNRPLIILLSDGKNDSQRSASESLKDLNSAISTCKSKGYPVYTIGL 180 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 37.7 bits (86), Expect = 2.9, Method: Compositional matrix adjust. Identities = 43/199 (21%), Positives = 89/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALF----TEEHGSRINQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+ + + Q +E + +G ++A+G+ E + N+ V+ Sbjct: 919 MLIVITDGK---SHDHDQLNDTASELRDKGITIFAVGVGKANQKELEGMAGNKNNAIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQESMCTE 994 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 37.7 bits (86), Expect = 3.0, Method: Compositional matrix adjust. Identities = 42/158 (26%), Positives = 72/158 (45%), Gaps = 23/158 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+VLD S SM++ D T+ + A ++ A +E V++ V GL+ F+ + Sbjct: 51 IDIMMVLDASTSMQA-EDFQPTRFEAAREAAGAFVEG-----RVSDRV--GLIVFAAEAY 102 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S LQR ++ + V + G L A N++ D + K Sbjct: 103 TQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEA-----------ESK 151 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + +TDG N + D ++ A+ G VYAIG+ Sbjct: 152 VAILLTDGRNNRGQIDPRTA--AEVARTMGVRVYAIGV 187 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 37.7 bits (86), Expect = 3.0, Method: Compositional matrix adjust. Identities = 47/202 (23%), Positives = 85/202 (42%), Gaps = 26/202 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +D S SM+S S + K A KS ++ + +G+V++ + I+ Sbjct: 81 MDVVFAIDSSGSMQSNDPSGLRK--TAAKSFVDKMDSSRDT--------AGVVSWDDSID 130 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L ++ I + G STN GL+ A + I D N N ++I Sbjct: 131 FSLPLTNDFPLVKTNIDSVDSSG-STNLNVGLEEAID-ILD-------ANPRTENSVEVI 181 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-ACASPNSFYLVEN 350 +F+TDG+ Q EA +G ++Y+IG+ + A + ++Y + Sbjct: 182 IFLTDGQGTYLHSTAQ------EAADKGYVIYSIGLGGVNPTPLQDMATTTGGAYYSSPD 235 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 S+ F I ++ T I Y Sbjct: 236 ATSLQAIFDDIFSEVTTSTIPY 257 >gi|295132196|ref|YP_003582872.1| hypothetical protein ZPR_0316 [Zunongwangia profunda SM-A87] gi|294980211|gb|ADF50676.1| conserved hypothetical protein [Zunongwangia profunda SM-A87] Length = 288 Score = 37.7 bits (86), Expect = 3.1, Method: Compositional matrix adjust. Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 9/105 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+DVS S FF + I I A L N + GL+ F Sbjct: 72 EEERELTMMLVVDVSGS--EFFGTQSQFKKEVITEIAATL----AFSATQNNDKIGLLMF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYN 268 S++IE + + G SH+ R I+ L +F + T GLKY N Sbjct: 126 SDQIENYIPPKKGKSHVLRIIRELLEFQPKSKKTDIGLGLKYLSN 170 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 37.4 bits (85), Expect = 3.5, Method: Compositional matrix adjust. Identities = 39/170 (22%), Positives = 69/170 (40%), Gaps = 15/170 (8%) Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKY 265 +V + P +VV S + S + S YLS G +TN T G+++ Sbjct: 346 DVSVAPYATDVVDSNYIAASCSTTALAKVLDLTSDFTSVNTYLSSLSPGGNTNITLGVQF 405 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-------DQQSLYYCNEAKKR 318 + + + D + KK ++ +TDG N + + ++ C AK + Sbjct: 406 GMEMLSPAEPYTKATAFGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQ 465 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFY-----LVENPHSMYDAFSHIGK 363 G ++ + + S L ACAS +S+Y + +M D F+ I K Sbjct: 466 GITLFVVRVEDGDS-SLLEACASQSSYYYDLSQASDLTKTMQDIFATINK 514 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 37.4 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 52/187 (27%), Positives = 81/187 (43%), Gaps = 29/187 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N+ RLD+ IV+D S S+ +S D + + L + D N V+ GL Sbjct: 1234 NTTISGRLDIQIVIDTSGSL-----TSAPNKDQVLMNFTNNLANMY---DTINQVKIGLT 1285 Query: 225 TFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +FS + + E L + LQ + ++ G TN T G++ A N DM +T Sbjct: 1286 SFSESSVLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALN---DM-------DTS 1335 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 DA +++ +TDG + ++AK G + A+G EF A SPN Sbjct: 1336 DA-VDDVMILITDG--FQSTNTTLMFQMIDQAKAEGVRLIALGF----FGEF--AFYSPN 1386 Query: 344 SFYLVEN 350 YL+ N Sbjct: 1387 -LYLMTN 1392 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 14/98 (14%) Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---IIVFMTDGENLSTKEDQQSLYYC 312 +T+ GL+ A QI E+A KK +I+ +TDG L ++ + Sbjct: 74 ATHMQEGLRKANEQI----------EQENAGEKKAPIVILALTDGTLLPFPFEETKME-A 122 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 E+++ GA VY IG++ R + L SP+ + V+N Sbjct: 123 EESRRLGATVYCIGVKDYRKDQLLDIADSPDHMFGVDN 160 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust. Identities = 42/158 (26%), Positives = 72/158 (45%), Gaps = 23/158 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+VLD S SM++ D T+ + A ++ A +E V++ V GL+ F+ + Sbjct: 79 IDIMMVLDASTSMQA-EDFQPTRFEAAREAAGAFVEG-----RVSDRV--GLIVFAAEAY 130 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S LQR ++ + V + G L A N++ D + K Sbjct: 131 TQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEA-----------ESK 179 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + +TDG N + D ++ A+ G VYAIG+ Sbjct: 180 VAILLTDGRNNRGQIDPRTA--AEVAQTMGVRVYAIGV 215 >gi|290979997|ref|XP_002672719.1| predicted protein [Naegleria gruberi] gi|284086298|gb|EFC39975.1| predicted protein [Naegleria gruberi] Length = 859 Score = 37.0 bits (84), Expect = 4.4, Method: Compositional matrix adjust. Identities = 26/94 (27%), Positives = 45/94 (47%), Gaps = 12/94 (12%) Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 SK G+ST+ TP + Y + + D +HC K+ ++F+ N S K ++ L Sbjct: 597 SKTGISTSQTPKISYLHLLLDDFDP--KHCYFTAEQMKESVLFI----NQSLKNNENILI 650 Query: 311 YCNEAKKRGAIVYA------IGIRVIRSHEFLRA 338 +C E + R + IG ++ S EF+R+ Sbjct: 651 HCQEGRSRSVTIICLYMMINIGWKLSYSLEFMRS 684 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust. Identities = 29/119 (24%), Positives = 51/119 (42%), Gaps = 17/119 (14%) Query: 241 SHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDM-QGMRQHCNTE------DANYK 288 +HL + +Y +TN+ G+ + + + QG E A+ + Sbjct: 316 AHLSTRAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQWQGRWGQGAAELPRPYGQADNR 375 Query: 289 KIIVFMTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KI+V +DGE++ + D++ L C E K++G VY + F+ CAS S Sbjct: 376 KIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFE--GDARFVAQCASDRS 432 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 37.0 bits (84), Expect = 5.1, Method: Compositional matrix adjust. Identities = 47/171 (27%), Positives = 73/171 (42%), Gaps = 37/171 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS--GLVTFSNK 229 +D ++V+DVS SM + ++ E +K+ D+ ++ + G+V +++K Sbjct: 39 MDAVLVVDVSNSMTQSDKNKVSN------------EAMKMFVDMTSIQANKVGVVAYTDK 86 Query: 230 IE-EFFLLEWGVSHLQRKIKY----LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 IE E LLE + IK L K G T+ G+ A +I D + Sbjct: 87 IEREKALLEINSEEDKNDIKAFIDSLQK-GAYTDIAVGVTEAV-KILDA--------GRN 136 Query: 285 ANYKKIIVFMTDGENL--------STKEDQQSLYYCNEAKKRGAIVYAIGI 327 N IIV + DG N K DQ+ EAK +G VY IG+ Sbjct: 137 PNNAPIIVLLADGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGL 187 >gi|330995096|ref|ZP_08319013.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332879554|ref|ZP_08447249.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|329576672|gb|EGG58175.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332682520|gb|EGJ55422.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 289 Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust. Identities = 31/94 (32%), Positives = 50/94 (53%), Gaps = 6/94 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+V+DVS S+ SF S TK +M + I A L + N + G++ F Sbjct: 72 EEERELTVMLVIDVSGSL-SFGTVSQTKREM-VAEIAATL----AFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 S++IE+F + G H+ R I+ L F + ST Sbjct: 126 SDRIEKFIPPKKGRKHILRIIRELLDFTPQSKST 159 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 36.6 bits (83), Expect = 6.1, Method: Compositional matrix adjust. Identities = 18/52 (34%), Positives = 31/52 (59%), Gaps = 4/52 (7%) Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + +H +E AN+ K I+ +TDGEN ++ + EA+K+G V+ +GI Sbjct: 178 LSEHSFSEKANFGKAIIIITDGENHEKGAEEMA----REAQKQGIRVFILGI 225 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust. Identities = 46/199 (23%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 758 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 808 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 809 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 863 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE S DQ + E + +G ++A+G+ E + N+ V+ Sbjct: 864 MLIVITDGE--SHDHDQLNDTAL-ELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 920 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 921 NFDKLKDVFTLVQERMCTE 939 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 36.6 bits (83), Expect = 6.2, Method: Compositional matrix adjust. Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 3/78 (3%) Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K+ I+ +TDG + + D L ++AK +G ++ IG+ + + + P +Y Sbjct: 988 KQYIIVLTDGYSQYPEFD---LIEADKAKAKGYTIFTIGMGMADEDTLKKIASKPEYYYR 1044 Query: 348 VENPHSMYDAFSHIGKDI 365 V +P + A+ IG++I Sbjct: 1045 VLSPEQLEAAYYDIGQEI 1062 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust. Identities = 46/199 (23%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE S DQ + E + +G ++A+G+ E + N+ V+ Sbjct: 919 MLIVITDGE--SHDHDQLNDTAL-ELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 36.6 bits (83), Expect = 6.9, Method: Compositional matrix adjust. Identities = 46/199 (23%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE S DQ + E + +G ++A+G+ E + N+ V+ Sbjct: 919 MLIVITDGE--SHDHDQLNDTAL-ELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 36.6 bits (83), Expect = 6.9, Method: Compositional matrix adjust. Identities = 46/199 (23%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE S DQ + E + +G ++A+G+ E + N+ V+ Sbjct: 919 MLIVITDGE--SHDHDQLNDTAL-ELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 36.6 bits (83), Expect = 7.0, Method: Compositional matrix adjust. Identities = 46/199 (23%), Positives = 88/199 (44%), Gaps = 19/199 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK + VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRDRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALF----TEEHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE S DQ + E + +G ++A+G+ E + N+ V+ Sbjct: 919 MLIVITDGE--SHDHDQLNDTAL-ELRNKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTK 368 N + D F+ + + + T+ Sbjct: 976 NFDKLKDVFTLVQERMCTE 994 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 36.6 bits (83), Expect = 7.2, Method: Compositional matrix adjust. Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 3/62 (4%) Query: 312 CNEAKKRGAIVYAIGIRVIR---SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 C K G +V++IG + + + + L+ CAS + Y ++ DAFS I ++V Sbjct: 487 CTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGININDAFSAIASNVVNL 546 Query: 369 RI 370 R+ Sbjct: 547 RL 548 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 36.6 bits (83), Expect = 7.3, Method: Compositional matrix adjust. Identities = 40/159 (25%), Positives = 71/159 (44%), Gaps = 23/159 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T + D+++ LD+S SM+ K A K ++++L + I GLV++S Sbjct: 221 TSKKRDIVLTLDISASMDGIPLDETKK--AAAKFVDSILNKNSNI---------GLVSYS 269 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + L+ I LS +TN GL AY+ + Q Sbjct: 270 DEATSLSGICSNDVFLKNTITSLSS-AENTNIEDGLSRAYSMLQLGQSK----------- 317 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 KK+IV M+DG K+ ++ + Y + K +G ++Y +G Sbjct: 318 KKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLG 356 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 36.6 bits (83), Expect = 7.4, Method: Compositional matrix adjust. Identities = 49/197 (24%), Positives = 83/197 (42%), Gaps = 21/197 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ DS+ I A M++ VK N VQ G + +S+ E Sbjct: 806 LDIVFVIDSSGSI----DSNEYNIMKAF-----MIDLVKKADVGKNQVQFGALKYSDFPE 856 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + I+ G ST + L ++ + + G R H + Sbjct: 857 VLFNLNEFSSKSEIISFIQNDHPRGGSTYTAKALAHSAHLFSESLGSRMH-----RGVPQ 911 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++ +TDGE+ D L A + +G +V A+GI E L S + ++ V Sbjct: 912 VLIVITDGES----HDAHLLNATARALRDKGILVLAVGIEGANHEELLSMAGSTDRYFFV 967 Query: 349 ENPHSMYDAFSHIGKDI 365 EN + F ++ + Sbjct: 968 ENFEGLKGIFENVSASV 984 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 36.2 bits (82), Expect = 7.5, Method: Compositional matrix adjust. Identities = 58/242 (23%), Positives = 101/242 (41%), Gaps = 40/242 (16%) Query: 141 FCTFIPWYTNSRHIV--MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 F TF+ +Y + + +PI + +Q +D++ VLD S SM S+ D + + A Sbjct: 465 FTTFVLFYIPTWKAIWEVPINKGEREINQQVNYIDLVFVLDSSGSM-SWNDPNGYRKIAA 523 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 ++A+++ + + +V F N L ++ I + +G TN Sbjct: 524 KSFVDALIQGDR----------AAVVDFDNFGYLLQPLTTDFQAVKNAIDRIDSWG-GTN 572 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN----E 314 G++ A Q+ ++ED K+I+ +TDGE YY N E Sbjct: 573 IAEGIRIANQQLISR-------SSEDR--IKVIILLTDGEG----------YYDNNLTTE 613 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 AK G +Y IG+ LR A+ ++ V + + F I +IVT+ I Sbjct: 614 AKNNGITIYTIGLGTSVDENLLRDIATQTGGMYFPVSSASQLPQVFKRI-TEIVTEPIDT 672 Query: 373 DK 374 D+ Sbjct: 673 DE 674 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 36.2 bits (82), Expect = 7.6, Method: Composition-based stats. Identities = 49/236 (20%), Positives = 96/236 (40%), Gaps = 44/236 (18%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVS--------RSMESFFDSSI 192 F+ + + I +++SV +S+ D ++D++ ++D S R M+ F S I Sbjct: 964 FVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVI 1023 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYL 250 D++ NN V+ G FS+ + F L G + +I+ + Sbjct: 1024 QDFDIS-----------------NNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENI 1066 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T+ L+ + G R H T ++++ +TDG+ S E Q+ Sbjct: 1067 KQIFGYTHIGAALRQVGHYFRPDMGSRIHAGT-----PQVLLVLTDGQ--SQDEVAQA-- 1117 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 E + +G +Y++GI + + ++ + N V N +D + K IV Sbjct: 1118 -AEELRHKGIDIYSVGIGDVDDQQLVQITGTANKKLTVHN----FDELKKVKKRIV 1168 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 36.2 bits (82), Expect = 7.6, Method: Composition-based stats. Identities = 49/236 (20%), Positives = 96/236 (40%), Gaps = 44/236 (18%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVS--------RSMESFFDSSI 192 F+ + + I +++SV +S+ D ++D++ ++D S R M+ F S I Sbjct: 964 FVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFRKMKEFLASVI 1023 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYL 250 D++ NN V+ G FS+ + F L G + +I+ + Sbjct: 1024 QDFDIS-----------------NNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENI 1066 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T+ L+ + G R H T ++++ +TDG+ S E Q+ Sbjct: 1067 KQIFGYTHIGAALRQVGHYFRPDMGSRIHAGT-----PQVLLVLTDGQ--SQDEVAQA-- 1117 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 E + +G +Y++GI + + ++ + N V N +D + K IV Sbjct: 1118 -AEELRHKGIDIYSVGIGDVDDQQLVQITGTANKKLTVHN----FDELKKVKKRIV 1168 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 36.2 bits (82), Expect = 7.8, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 83/193 (43%), Gaps = 27/193 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEE 232 + +VLD S SM S IT++ + ++ E+ LI + GLVTF + I + Sbjct: 309 VCLVLDKSGSMSS--GDPITRLTLMNQAA-----ELYLIQILEKESLVGLVTFDSIAIVQ 361 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L+ S+L+ K + T+ GL+ + I + C +E Sbjct: 362 NNLIRMINDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSD--QSTCGSE------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSHEFLRACASPNSFYLV 348 IV +TDGE+ DQ S + E K GA+++ A+G R E L FY Sbjct: 413 IVLLTDGED-----DQISSCF-EEVKHSGAVIHTIALGPDAARELETLSDMTGGRRFYAS 466 Query: 349 ENPHSMYDAFSHI 361 E + + DAFS I Sbjct: 467 EGINGLIDAFSGI 479 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 36.2 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 47/200 (23%), Positives = 89/200 (44%), Gaps = 35/200 (17%) Query: 171 RLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+ ++D S S+ E+F D + +N +++ ++ PD VQ G++ +S+ Sbjct: 841 KADIYFLIDGSGSIYPEAFLDMKVF--------MNEVIKMFQIGPDR---VQFGVIQYSD 889 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +I+ F+L V+ L+ I + + G T + L Q+F +T N Sbjct: 890 EIKSKFVLSQYPTVAELKVAIDNIQQGGGGTTTGEALN-NMTQVF--------ADTARIN 940 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 + ++ +TDG++ D + G I+YAIG+R E L+ A F+ Sbjct: 941 VARYLIVITDGKSSDPVADA-----AEGLRASGVIIYAIGVREANIDE-LKEIAKDKIFF 994 Query: 347 LVENPHSMYDAFSHIGKDIV 366 + E +D I K++V Sbjct: 995 VYE-----FDLLKDIQKEVV 1009 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 83/193 (43%), Gaps = 27/193 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEE 232 + +VLD S SM S IT++ + ++ E+ LI + GLVTF + I + Sbjct: 309 VCLVLDKSGSMSS--GDPITRLTLMNQAA-----ELYLIQILEKESLVGLVTFDSIAIVQ 361 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L+ S+L+ K + T+ GL+ + I + C +E Sbjct: 362 NNLIRMINDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSD--QSTCGSE------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSHEFLRACASPNSFYLV 348 IV +TDGE+ DQ S + E K GA+++ A+G R E L FY Sbjct: 413 IVLLTDGED-----DQISSCF-EEVKHSGAVIHTIALGPDAARELETLSDMTGGRRFYAS 466 Query: 349 ENPHSMYDAFSHI 361 E + + DAFS I Sbjct: 467 EGINGLIDAFSGI 479 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 36.2 bits (82), Expect = 9.7, Method: Compositional matrix adjust. Identities = 38/184 (20%), Positives = 87/184 (47%), Gaps = 23/184 (12%) Query: 169 DARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +D++ V+D S+S+ E+ F+ + + +N +L+ +++ +V GL+ +S Sbjct: 567 EGPVDLVFVIDGSKSLGENNFE-------IVKQFVNGILDSLEISQKAAHV---GLIQYS 616 Query: 228 NKIE-EFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTED 284 I EF + ++ + +++ + + G + + LK + + F ++QG R Sbjct: 617 THIRTEFTMAQYSSAKDVKKAVSQIKYMGRGSMTGLALKLMHEKSFSEVQGAR----PRA 672 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ + TDG +D+ S Y +AK+ G +YAIG+ E ++P Sbjct: 673 MGVPRVAIVFTDGR----AQDEVSEYA-KKAKQSGITIYAIGVGKAIDEELQEIASAPQE 727 Query: 345 FYLV 348 +++ Sbjct: 728 KHVI 731 Searching..................................................done Results from round 2 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 448 bits (1151), Expect = e-123, Method: Composition-based stats. Identities = 374/374 (100%), Positives = 374/374 (100%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH Sbjct: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV Sbjct: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV Sbjct: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV Sbjct: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL Sbjct: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH Sbjct: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 Query: 361 IGKDIVTKRIWYDK 374 IGKDIVTKRIWYDK Sbjct: 361 IGKDIVTKRIWYDK 374 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 384 bits (987), Expect = e-104, Method: Composition-based stats. Identities = 203/370 (54%), Positives = 273/370 (73%), Gaps = 3/370 (0%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 LNIRNFFYN KG ++ILTAI LP+IF+V+G++IE SH FF+K LH ++D SL++ AT+I Sbjct: 4 LNIRNFFYNCKGSISILTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKI 63 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +N+ NGNN KK K IKN W FRNELR+NGF DI++I RSTSL I++ Q++ Sbjct: 64 LNQENGNNGKKQKNDFSYRIIKNIWQTDFRNELRENGFAQDINNIERSTSLSIIIDDQHK 123 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y++SA+SRY++P FCTF PW NS H + ITSSVK++S++D LDMM+VLDVS SM Sbjct: 124 DYNLSAVSRYEMPFIFCTF-PWCANSSHAPLLITSSVKISSKSDIGLDMMMVLDVSLSMN 182 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 F + K+ +A +SI ML+ +K IPDVNNVV+SGLVTFS+KI + F L WGV H+Q Sbjct: 183 DHFGPGMDKLGVATRSIREMLDIIKSIPDVNNVVRSGLVTFSSKIVQTFPLAWGVQHIQE 242 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-E 304 KI L FG +T STPGL+YAYN+IFD + +H +YKK I+F+TDGEN S + Sbjct: 243 KINRLI-FGSTTKSTPGLEYAYNKIFDAKEKLEHIAKGHDDYKKYIIFLTDGENSSPNID 301 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 +++SL+YCNEAK+RGAIVYAIG++ + +FL+ CASP+ FY V+N ++DAF IGK+ Sbjct: 302 NKESLFYCNEAKRRGAIVYAIGVQAEAADQFLKNCASPDRFYSVQNSRKLHDAFLRIGKE 361 Query: 365 IVTKRIWYDK 374 +V +RI Y+K Sbjct: 362 MVKQRILYNK 371 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 287 bits (734), Expect = 2e-75, Method: Composition-based stats. Identities = 114/366 (31%), Positives = 211/366 (57%), Gaps = 19/366 (5%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + IRNFF N +G +TI +AI P+I +++ ++ E+S+I+ K L ++IDR+L+ T I Sbjct: 4 IKIRNFFQNKRGIITITSAIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALLDTVTMI 63 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + K + + W + + EL + F +D+ +++ TS+ + + Sbjct: 64 KLKNIEDVVKN------VGPVNTIWTKNLKYELEHSDFSSDVQNVIDDTSMKLESDSNFK 117 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 SI+AIS+YK+P K C ++++ +P+ SS+K+ + +D+MIVLDVS SM+ Sbjct: 118 TLSITAISQYKMPFKICNIHLLCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVLDVSSSMD 177 Query: 186 SFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 F ++ +++++A KSI MLE+ + +P+ NV ++G V F++ ++ L+ G+ Sbjct: 178 DNFMKPEEAPCSRLEVAKKSIRKMLEDFRKVPNYANVFRTGSVGFNDMVQFPMPLKRGLK 237 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + IK FG STNS G+KYA+ Q++ + + KKI++F+TDGEN+ Sbjct: 238 RIYNDIKKYRAFG-STNSYVGMKYAWEQLYGNPQDTK-------DRKKIVIFLTDGENMI 289 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 ++++ CN+ KK+ A++Y+I + V + E L+ C+S + Y ++ S+ A+S I Sbjct: 290 INATRKTIELCNDMKKKKAVIYSIALAV-DNKEVLQGCSSSGNVYAADDAQSLVQAYSLI 348 Query: 362 GKDIVT 367 GKD++ Sbjct: 349 GKDVMK 354 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 286 bits (732), Expect = 4e-75, Method: Composition-based stats. Identities = 72/359 (20%), Positives = 144/359 (40%), Gaps = 19/359 (5%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 IR N G ++ A +P++F+ + ++ ++ MK L + +D + + A ++ Sbjct: 9 KIRKLLRNENGNFALIAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALATAARLS 68 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF-VNDIDDIVRSTSLDIVVVPQNE 125 E N + + F N F + + + ++D V + Sbjct: 69 EEENLT-----------AAQAQAFALKFVNGQVKEDFGAFNGFSVTPTVNIDPV---ETG 114 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 G ++ ++ + T + + + + + + M +VLD S SM+ Sbjct: 115 GRTVWKVAVSMEGSQSLTPMARIMGKDKLTVSVVGKSESAGEAQGAFSMALVLDRSGSMD 174 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + KI++ ++ ++E+ + V+ G ++++K+ L W + Sbjct: 175 WNLNGQ-KKINVLKTAVGGLIEQFEEADPERKYVRLGASSYNSKLTGSTKLRWNPGKTKE 233 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + L G T+ST +AY + + H KK IVFMTDG+N + D Sbjct: 234 FVDALPASG-GTDSTDAFDWAYTAVTHKRENNTHDAKSGQVPKKFIVFMTDGDNNYSSAD 292 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACAS-PNSFYLVENPHSMYDAFSHIG 362 + + C++AK G VY + R + L CAS F+ +N + +AF +IG Sbjct: 293 SSTKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSYCASTEEHFFDAQNSAQLIEAFKNIG 351 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 283 bits (723), Expect = 4e-74, Method: Composition-based stats. Identities = 83/412 (20%), Positives = 139/412 (33%), Gaps = 66/412 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ GM I+V + K L + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLISKQQLQEAA------------D 57 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV--VPQNE 125 L G I + +F N ++ DI +T + + + Sbjct: 58 SAALATATALANGTIQTSQAEAFARNFVAGQMAN-YLQSGVDIKSATGVTVQTNTSGNST 116 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM- 184 Y ++ Y + + +T ++H+ T+ + +SQT + M + LD S SM Sbjct: 117 SYQVTVSPSYDLTVNPLMQAVGFT-TQHLSTSGTT-IGGHSQTQGSISMYLALDKSGSMG 174 Query: 185 ---------------------------ESFFDSSI-------TKIDMAIKSINAMLEEVK 210 + +D+ TKI+ + + ++ Sbjct: 175 EDTATVNEEDPTESYTYDCNGHYNKKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLS 234 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 V++G V++ L WG S + + L G TNS+ + AY+ + Sbjct: 235 SADPNAQYVRTGAVSYDIVQYTPSALAWGTSGVSTYVNALQA-GGGTNSSGAMSTAYSSL 293 Query: 271 FDMQ------GMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRG 319 H KK IVFMTDG+N D + C+ AK +G Sbjct: 294 TAKNAAGNDAEDAAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKG 353 Query: 320 AIVYAIGIRVIR-SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTKR 369 +Y I L CAS +S Y E + AF IG ++ Sbjct: 354 IEIYTIAFMAPEGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQL 405 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 282 bits (720), Expect = 7e-74, Method: Composition-based stats. Identities = 81/405 (20%), Positives = 158/405 (39%), Gaps = 50/405 (12%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 L+ + G ++TA+ P++ V G+ ++V+++ K L D + + AA+ Sbjct: 8 GLSFMRMLRDRGGNFGMMTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASA 67 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV-------NDIDDI------- 110 ++++ + + +K + ++L D G N D++ Sbjct: 68 LVSDARPD--IEEAKDLARKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWE 125 Query: 111 -VRSTSLDIVVVPQN-EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 V +T +DI P +G S K L+F + I + S+ + +++ Sbjct: 126 DVNATEIDITATPNGAKGKSFQVTVANKHLLQF-NAMTRLLGPESIEIETRSTAESATES 184 Query: 169 DARLDMMIVLDVSRSMESFFD----------------------------SSITKIDMAIK 200 L M +VLD S SM + +TKID Sbjct: 185 KNALSMYLVLDRSGSMAWKTNTINTGKAKCPNYTEANWSKYPDLKATGPCYVTKIDALKT 244 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 ++ +L ++ + V++G +++++ + L WG + L G T S Sbjct: 245 AVGDLLAQLVTADPESAYVRTGAISYNSAQDAASSLSWGTRGAAGYVDALVAIG-GTASG 303 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 K A+ ++ + +H K IVFMTDGEN +D + +C+ AK Sbjct: 304 NAFKTAFQKVTNAAEDSEHGAKNGQVPTKYIVFMTDGENNHANDDTVTRQWCDTAKASKV 363 Query: 321 IVYAIGIRVI-RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 +Y++ R + L++CA S + ++ E + AF IG+ Sbjct: 364 QIYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGE 408 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 282 bits (720), Expect = 8e-74, Method: Composition-based stats. Identities = 68/365 (18%), Positives = 137/365 (37%), Gaps = 13/365 (3%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 I N G + L I + ++ V+ + I++S F +T L + D + V ++ Sbjct: 70 GIPNAHKERDGSILPLFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALL 129 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 E + D RI + +S + + + + + V Sbjct: 130 AEEITIAQANAYAKDAFNRIAS--GLSASGDGSSGTSIFGTMTVKPAVQITETVDGNTTN 187 Query: 127 YSISAISRYKIPLKFCTFI--PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y + KIP +F+ T I + S ++ A L M +VLD S SM Sbjct: 188 YVVKVNGTAKIPASPLSFMFFDGETGKNTISLGFESETTAKAEAGASLSMALVLDRSGSM 247 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 S +++ K++ ++++E++ + + + G + L W + ++ Sbjct: 248 GWERPSRMSE---LKKAVRSLIKELQTVDPDDQFTRLGAYAYHWYYAGKKELTWNKNSVR 304 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + L G T + P ++ A N + + H N + I++MTDG + Sbjct: 305 SWVNSLPASG-GTRAAPAIQKAKNDLLTNSELNAHINKNEQEPDLFILYMTDGIDGDPNW 363 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPN-SFYLVENPHSMYDAFSHIG 362 + C AK G +Y + + L+ACA+ + +Y +N + + F I Sbjct: 364 ---AKRECTSAKNAGITIYTVAFKAPASGRNLLKACATSDAHYYDAKNANELNKVFKDIA 420 Query: 363 KDIVT 367 ++ Sbjct: 421 RETTK 425 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 280 bits (715), Expect = 3e-73, Method: Composition-based stats. Identities = 87/400 (21%), Positives = 155/400 (38%), Gaps = 55/400 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G I+TAI LP++F GM I+V + K L + Sbjct: 11 LRRMLGDRGGNFGIMTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAA------------D 58 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 L G I + F N ++ DI +T +D+ ++ Sbjct: 59 SAALATATALANGTIQTSQAEAFARDFVAGQMAN-YLQSGIDIKSTTGVDVRTTTSGKST 117 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM- 184 Y ++ Y I + +T + +++ NSQT + M +VLD S SM Sbjct: 118 SYQVTVSPDYNIAVNPLMQTIGFTTQN--ISTSSTTTSGNSQTQGSVSMFLVLDRSGSMG 175 Query: 185 -------------ESFFDSS---------------------ITKIDMAIKSINAMLEEVK 210 E +D S TKI+ ++ + E+ Sbjct: 176 EDTATVNASDPTEEYNYDCSEKDRYGNVTKKKTCTDTRPHYYTKIEALKLAVGTLTGELD 235 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + V++G V+++ ++++ L+WG +H+ + + L+ T+S K AYN++ Sbjct: 236 AVDPEKEYVRTGAVSYNIEMQKAKALDWGTAHVTKYVNKLTAT-DGTDSGEAFKTAYNKL 294 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 D + H + K IVFMTDG+N T D ++ +C++A+ VY I Sbjct: 295 ADAAEDKAHVDKTGQVPTKYIVFMTDGDNNYTSADTETKTWCDKARDAKMQVYTIAFMAP 354 Query: 331 -RSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTK 368 R L CA+ P +++ + ++ AF IG + Sbjct: 355 ARGQALLSYCATAPGNYFPAGDMTALLKAFKEIGMKASNQ 394 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 278 bits (711), Expect = 9e-73, Method: Composition-based stats. Identities = 84/405 (20%), Positives = 168/405 (41%), Gaps = 45/405 (11%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + + K +I+ A+ + L++G +I V + K + S + +++ A+++++ Sbjct: 11 FKKGIASEKANFSIIFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVS 70 Query: 68 ---------EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDIVRSTSLD 117 E N+ K+ D IKN S F N +I +IV S+ + Sbjct: 71 NLSRLGDRFESISNHAKRALIDDAKRFIKNHIKESLSG--YSAVFYNTEIQNIVNSSRIS 128 Query: 118 IVVVPQNE--------GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + + N Y++ ++ Y L+F + ++ IV I + +++ Sbjct: 129 MTHMANNRLDSSNNTIFYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGER 188 Query: 170 ARLDMMIVLDVSRSMESFFDSSI--------------TKIDMAIKSINAMLEEVKLIPDV 215 + +V+D+S SM +S TK+ ++ L+ + L+ V Sbjct: 189 PIFLIELVVDLSGSMHCAMNSDPEDVNSAPICQDKKRTKMAALKNALLLFLDSIDLLSHV 248 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFD- 272 V GL+ ++ ++E+ WG +++ + S T+STP +K AY + Sbjct: 249 KEDVYMGLIGYTTRVEKNIEPSWGTEKVRQYVTRDMDSLILKPTDSTPAMKQAYQILTSD 308 Query: 273 ------MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 RQ ++K I+F+TDGEN + K + ++ C++AK+ + I Sbjct: 309 KKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFKSNVNTIKICDKAKENFIKIVTIS 368 Query: 327 IRV-IRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 I L+ C +SP Y V N S+ F +I + +V ++ Sbjct: 369 INASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHRK 413 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 278 bits (710), Expect = 1e-72, Method: Composition-based stats. Identities = 83/413 (20%), Positives = 143/413 (34%), Gaps = 67/413 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ GM I+V + K L + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAA------------D 57 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 L G I + +F N ++ DI +TS+++ ++ Sbjct: 58 SAALATATALANGTIQTTEAEAFARNFVAGQMAN-YLQSGTDIKSTTSVNVQTTTSGKST 116 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y ++ Y + + +T ++H+ T+ + +SQT + M + LD S SM Sbjct: 117 SYQVTVSPAYVLTVNPLMQAVGFT-TQHLSTSGTT-IGGHSQTQGSISMFLALDKSGSMG 174 Query: 186 S--------------FFDSSI----------------------TKIDMAIKSINAMLEEV 209 +D ++ TKI+ + + ++ Sbjct: 175 EDTATVNEESPTESYTYDCNLHYNTKNNKWVYDKCTGSRTNYYTKIEALKMAAGNLFSQL 234 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V++G V++ L WG++ + + L G TNS+ + AY Sbjct: 235 NSADPNAQYVRTGAVSYDINQYAPSSLAWGITGVSSYVNALQANG-GTNSSGAMNTAYTS 293 Query: 270 IFDMQ------GMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKR 318 + H KK IVFMTDG+N + D + C++AK + Sbjct: 294 LTAKNAAGNDVENSAHQQKTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSK 353 Query: 319 GAIVYAIGIRVIR-SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTKR 369 G +Y I L CAS +S Y E + AF IG + Sbjct: 354 GIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFQAIGAKASAQL 406 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 278 bits (710), Expect = 1e-72, Method: Composition-based stats. Identities = 80/416 (19%), Positives = 140/416 (33%), Gaps = 69/416 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ G+ I+V + K L + Sbjct: 10 LRRMFSDRGGNFGIMTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAA------------D 57 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV--PQNE 125 L G I + +F N N + DI +T++++ ++ Sbjct: 58 SAALATATALGNGTIQTSQAEAFARNFVAGQMANYLQNGV-DIKNATAVNVQTSNSGKSA 116 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS----SVKVNSQTDARLDMMIVLDVS 181 Y ++ Y + + + +++H+ T+ S S + + M + LD S Sbjct: 117 SYQVTVTPSYDLTVNPLMQAVGF-STQHLSTSSTTVSGPSQTPGSNSQGSVSMFLALDKS 175 Query: 182 RSM----------------------------ESFFDSSI-------TKIDMAIKSINAML 206 SM + +D+ TKI+ + + Sbjct: 176 GSMGDPTETVNKDQPTETFTYDCNPHLNKKGKWVYDTCTGSRTNYYTKIEALKMAAGNLF 235 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 ++ V++G V++ L WG S + + L G TNS+ + A Sbjct: 236 GQLTSADPDAQYVRTGAVSYDIDQYTPSTLAWGTSGVSSYVNALQA-GGGTNSSGAMGTA 294 Query: 267 YNQIFDMQ------GMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEA 315 Y+ + H KK IVFMTDG+N + D + C+ A Sbjct: 295 YSSLTAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTA 354 Query: 316 KKRGAIVYAIGIRV-IRSHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTKR 369 K +G +Y I L+ CAS + Y E + AF IG + Sbjct: 355 KSKGIEIYTIAFMAPPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIGAKASAQL 410 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 274 bits (700), Expect = 2e-71, Method: Composition-based stats. Identities = 83/412 (20%), Positives = 139/412 (33%), Gaps = 67/412 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G I+TAI P++ V G+ I+V + K L + Sbjct: 10 LRRMFSDRGGNFGIMTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAA------------D 57 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 L G I + +F N +V D TS+++ ++ Sbjct: 58 SAALATATALANGTIQTSQAEAFAQNFVAGQMAN-YVQSGVDFKSGTSVNVQTSTSGKST 116 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y ++ Y + + + ++H+ T+ V +SQT + M + LD S SM Sbjct: 117 SYQVTVSPSYDLTVNPLMQAVGF-KTQHLSTSGTT-VGGHSQTQGSISMFLALDKSGSMG 174 Query: 186 S--------------FFDSSI----------------------TKIDMAIKSINAMLEEV 209 +D ++ TKI+ + + ++ Sbjct: 175 EATATVNADDPTESYTYDCNLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFGQL 234 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V++G V++ L WG + + + L G TNS+ + AY+ Sbjct: 235 NSADPNAEYVRTGAVSYDINQYTPSNLAWGTAGVTSYVNALQANG-GTNSSGAMSTAYSS 293 Query: 270 IFDMQ------GMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKR 318 + H KK IVFMTDG+N D + C+ AK + Sbjct: 294 LTAKNAAGNDAEDSAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSK 353 Query: 319 GAIVYAIGIRVIR-SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 G +Y I L CAS +S Y E + AF IG + Sbjct: 354 GIEIYTIAFMAPAGGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASAQ 405 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 256 bits (654), Expect = 3e-66, Method: Composition-based stats. Identities = 84/416 (20%), Positives = 165/416 (39%), Gaps = 57/416 (13%) Query: 8 IRNFFYNYK-------GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 +R F +N+K +IL ++ L I L +G++I V + K + + +++ Sbjct: 4 LRTFLFNFKKIILSPKANFSILFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILS 63 Query: 61 AATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD---IVRSTS 115 A++I++ + G+N I+ + + L + V DI + I +++ Sbjct: 64 GASKIISRISYFGDNMSSHTHRAIVDDVTRFIKSYIKESLLMDSSVFDISEKNIISQNSK 123 Query: 116 LDIVVVP----------------QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + I P + Y IS + Y +KF + + I+ + Sbjct: 124 VSITREPHPNVFHEFNNQSILQNKKTFYHISVETFYDYHIKFFDNLLNKKINSKIISFVP 183 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---------------TKIDMAIKSINA 204 + VK+++ + +V+D+S SM +S +K+D K++ Sbjct: 184 ALVKIDTGEHPFFFVQLVVDLSASMSCLMNSDPEHATEFSVCGKSKKNSKMDALKKAVLL 243 Query: 205 MLEEVKLI-PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNST 260 L+ V + GL ++ ++E+ WG +++ I ++ G T+ST Sbjct: 244 FLDSVDRGSKTQKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLGQ-TDST 302 Query: 261 PGLKYAYNQIFDMQGMR-------QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 P +K AY + + + ++K ++F+TDGEN K D +++ C Sbjct: 303 PAMKKAYQILTSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSDVKTIKICE 362 Query: 314 EAKKRGAIVYAIGIRV-IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 +AKK + I I L+ C S P +Y V + S+ F I I Sbjct: 363 KAKKNSIKILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLITH 418 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 249 bits (635), Expect = 5e-64, Method: Composition-based stats. Identities = 76/401 (18%), Positives = 161/401 (40%), Gaps = 60/401 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N G I++A + + + + +I+++H+ MK + S +D +++ + ++++ Sbjct: 16 FSKNKSGVFHIMSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPK 75 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N+ N R+ ++ N +VN + + + I+ +N S S Sbjct: 76 INDL-------------NPQEERIRDVIKKNAYVNMVQNFPAEHAAYII---ENANISFS 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVM------------PITSSVKVNSQTDARLDMMIVL 178 K K + ++ ++ I++ + + +VL Sbjct: 120 KDLTNKYEYKITMEAKHQLSGKNFILGFLMPNVITHISSISTGIIQKPSDKKAFSVEMVL 179 Query: 179 DVSRSM--------------------ESFFDSSITKIDMAIKS----INAMLEEVKLIPD 214 D S SM + +KI + +N + E V+ P Sbjct: 180 DCSGSMLDSMQESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQ 239 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM- 273 ++ + GL+TF++ I + L + +++ I + G T++ + AY + ++ Sbjct: 240 IS--ARIGLITFNHYIMQDSKLSNNFNVIKKTISRMKPKG-GTDTFLPMNAAYEYLNNIP 296 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-- 331 + H +++ K+ I+ MTDGEN D +++ C+ A+K G I+Y+I + Sbjct: 297 NETKAHNISDNVPLKRYIILMTDGENNHPSYDLKTINVCDNARKNGIIIYSIFLNYYEYT 356 Query: 332 -SHEFLRACASPN-SFYLVENPHSMYDAFSHIGKDIVTKRI 370 +E R CAS F+ N ++ D+F I I K + Sbjct: 357 DGYELARKCASSEKHFFYANNTKALLDSFKSIAHAIQDKAV 397 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 248 bits (632), Expect = 1e-63, Method: Composition-based stats. Identities = 73/405 (18%), Positives = 152/405 (37%), Gaps = 62/405 (15%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI-MN 67 R F + G ++TAI LP++ V G +E++++ +K + + D + + AAT+ + Sbjct: 11 RRFLADTSGNFGMMTAILLPVLLGVAGAGMELANVMQVKADMQNTADSAALAAATEARLR 70 Query: 68 EGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 EG ++ K++ I +++ + EL N T + + + Sbjct: 71 EGKLSDEQIKEIAKNFIAAQMEKNLTAEEKIELEKNSP----------TRVTTTENARGK 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y++ +++I L + + ++ + + +T + K A + M + LD S SM Sbjct: 121 TYAVETTIKHQIQLNP---MLGFIGAKTLDLSVTGTAKSTINKGAPISMYLALDRSGSMS 177 Query: 186 ----------------------------SFFDSSITKIDMAIKSINAMLEEVKLIPDV-- 215 + K ++ ++ + Sbjct: 178 FKTDTVDTTKTSCQNYTSDNWSKYPNLAKTSPCYVNKAASLKTAVGFLVATLNKADPTYT 237 Query: 216 ----NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQ 269 + +V++G ++++ + WG S + + T++ L AYN Sbjct: 238 VNGGSELVRTGASVYTHETYVAQSIGWGTSGVTSYVDKQIPEFPSGGTDARSSLNAAYNA 297 Query: 270 I--FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKKRGAI 321 + + R H ++++ IV MTDGE DQ C AKK G Sbjct: 298 LKKANPDEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIK 357 Query: 322 VYAIGIRVI-RSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKD 364 ++++ + L+ CAS +++Y EN + AF I + Sbjct: 358 IFSVAFMAPDKGKSLLQYCASSADNYYAPENMEQIVTAFGEIARK 402 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats. Identities = 76/400 (19%), Positives = 152/400 (38%), Gaps = 48/400 (12%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE-- 68 + G ++TA+ P++ V G+ ++V+++ K L D + + AA+ ++++ Sbjct: 3 MLRDRGGNFGMMTALIAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDAR 62 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNE----LRDNGFVNDIDDIVRSTSLDIVVVPQ- 123 + K + + ++ T + E + G D V ++ + IV P Sbjct: 63 PDIEEAKAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETPNG 122 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 +G S K L+F + I + S+ +++ + M +VLD S S Sbjct: 123 TKGKSFQVSVANKHLLQF-NAMTRLLGKESIELETRSTADSATESKNAISMYLVLDRSGS 181 Query: 184 MESFFD---------------------------SSITKIDMAIKSINAMLEEVKLIPDVN 216 M D + KI +++ + + + N Sbjct: 182 MAWKTDTVDTSRPRCINWTASNWGESNVRATSPCYVDKITTLKSAVDKLFTPLAKMDPGN 241 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +++G +++++ + L WG + ++ L G T+S+ A ++ Sbjct: 242 EYLRAGAASYNDRQDRASKLTWGTKNASAHVQGLDATG-GTDSSSAFAAAVEELLLDGEN 300 Query: 277 RQHCNTEDANYKKIIVFMTDGENLS----------TKEDQQSLYYCNEAKKRGAIVYAIG 326 H +K IVFMTDGEN S K D + C AK G ++ + Sbjct: 301 EAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAKNNGIAIFTVA 360 Query: 327 IRVI-RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKD 364 R + L+ACA SP+ + ++ ++ F IG+ Sbjct: 361 FMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQK 400 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 246 bits (629), Expect = 3e-63, Method: Composition-based stats. Identities = 75/405 (18%), Positives = 152/405 (37%), Gaps = 62/405 (15%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI-MN 67 R F + G ++TAI LP++ G +E++++ +K L + D + + AAT+ + Sbjct: 11 RRFLADTGGNFGMMTAILLPVLLGFAGAGMELANVMQVKADLQNTADSAALAAATEARLK 70 Query: 68 EGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 EG + K++ I +++ T + L N VN + + + Sbjct: 71 EGALTDEQIKEIAKAFIASQMEKTLTEEEKKALEKNSPVN----------IGTTDDARGK 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y+I Y++ L + ++ + + T + A + M +VLD S SM Sbjct: 121 TYTIQTTINYQMQLNPL---LGFFGAKTLDLAATGTAVSTVNKGAPISMYLVLDRSGSMS 177 Query: 186 SFFD----------------------------SSITKIDMAIKSINAMLEEVKLIPDV-- 215 D + K ++ ++ + Sbjct: 178 FKTDTLNTKKTSCQNYTVDNWGSYPNLKNTSPCYVNKATSLKTAVGYLVATLNKADPTYT 237 Query: 216 ----NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQ 269 + +V++G ++++ + WG S + + T++ L AYN Sbjct: 238 ANGGSELVRTGASVYTHETYAAQPITWGTSSVATYVDKQIPEFPSGGTDARSSLNAAYNA 297 Query: 270 IFDMQ--GMRQHCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKKRGAI 321 + ++H + + ++++ IV MTDGE DQ C+ AKK G Sbjct: 298 LKKANTVEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIK 357 Query: 322 VYAIGIRVI-RSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKD 364 ++++ + L+ CAS +++Y EN + AF I + Sbjct: 358 IFSVAFMAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIARK 402 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 243 bits (620), Expect = 4e-62, Method: Composition-based stats. Identities = 71/398 (17%), Positives = 146/398 (36%), Gaps = 40/398 (10%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 + + + G ++TA+ P++ G+ I+++++ K L D + + AA+ Sbjct: 7 IGRSFITMLKDRGGNFGMMTAVAAPLLLAAGGVSIDMANMLMTKNQLQDATDAAALAAAS 66 Query: 64 QIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRSTSL 116 ++++ K++ + + T + + V + + Sbjct: 67 ALVSDEQPDIAAAKEIARKFLKTQAGGTTTPDAPADSGEGASSGAASSTPDWDDVNTLEV 126 Query: 117 DIVVVPQ-NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 +I P +G K +F + + I + +S+ + +++ L M Sbjct: 127 NITETPNGTKGKIFQVTVINKRVTEF-NAMTRLLGTDSIELEASSTAESATESKNALSMY 185 Query: 176 IVLDVSRSMESFFD----------------------------SSITKIDMAIKSINAMLE 207 +VLD S SM + +TKID ++ +L Sbjct: 186 LVLDRSGSMAWKTNTINAAKKSCPNYTESNWSRYPNLWASSPCYVTKIDALKTAVTDLLA 245 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ + V++ +++++ + L WG S + L G T S K AY Sbjct: 246 QLLVADPDQIYVRTAAISYNSVQDTAGTLAWGTSGAAAYVNALVATG-GTASAGAFKTAY 304 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ H K +VFMTDGEN +D + +C+ AK +Y++ Sbjct: 305 QKVIAATENTAHAAKNGQVPSKYMVFMTDGENNYANDDTVTKQWCDTAKANKVEIYSVAF 364 Query: 328 RVI-RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 R L+ CA S + ++ E + AF IG+ Sbjct: 365 MAPERGQALLKYCASSSSHYFEAEEVTDLVAAFKAIGE 402 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 242 bits (618), Expect = 6e-62, Method: Composition-based stats. Identities = 76/398 (19%), Positives = 129/398 (32%), Gaps = 67/398 (16%) Query: 23 TAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDI 82 TAI P++ GM + V + K L + L G I Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAA------------DSAALATATALANGKI 48 Query: 83 LCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV--VPQNEGYSISAISRYKIPLK 140 + +F N ++ DI +T + + + Y ++ Y + + Sbjct: 49 QTSEAEAYARNFVAGQMAN-YLQSGVDIKSATGVSVQTNTSGNSTSYQVTVSPSYDLTVN 107 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDAR--LDMMIVLDVSRSMES------------ 186 +T ++H+ T+ +SQT + + M + LD S SM Sbjct: 108 PLMQAVGFT-TQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMGEDTATVNEEDPTE 166 Query: 187 --FFDSSI---------------------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + TKI+ + + ++ V++G Sbjct: 167 SYTYPCNPHYNRKGKEVWDTCTGSRANYYTKIEALKMAAGNLFAQLSGADPNAQYVRTGA 226 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ------GMR 277 V++ L WG + + L G TNS+ + AY + Sbjct: 227 VSYDIVQYAPSSLAWGAIGVSSYVNALQA-GGGTNSSGAMSTAYLSLTAKNAAGNDAEDS 285 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGIRV-IR 331 H +K IVFMTDG+N + D + C+ AK +G +Y I Sbjct: 286 AHKLKSGQIPQKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPPG 345 Query: 332 SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 L+ CAS S Y E ++ AF IG T+ Sbjct: 346 GQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQ 383 >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 237 bits (604), Expect = 3e-60, Method: Composition-based stats. Identities = 75/400 (18%), Positives = 168/400 (42%), Gaps = 59/400 (14%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 +TAI + + FL + I+++HI +++ + S +D +++ I+++ + K Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD-- 58 Query: 82 ILCRIKNTWNMSFRNELRDNGFVND-IDDIVRSTSLDIVVVPQNE-GYSISAISRYKIPL 139 + + + L+ ++ + DI + ++I N Y + ++Y+IP Sbjct: 59 ---QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 Query: 140 KFCTFIPWYTNS-RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME------------- 185 + ++ ++ + T ++ +S+ A + + +VLDVSRSME Sbjct: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLA-ISICMVLDVSRSMEDLYLQKHNDNNNM 174 Query: 186 -------------SFFDSSITK-------------IDMAIKSINAMLEEVKLI--PDVNN 217 SF+ + TK ID+ I+S ++ ++ N Sbjct: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 Query: 218 VVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ G + ++ I + L ++ ++ ++ L+ + +TN+ P + +AY ++++ + Sbjct: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKE 293 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ---QSLYYCNEAKKRGAIVYAIGIRV-IR 331 H KK ++F+TDGEN Q +L C + G +Y++ + Sbjct: 294 -SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 Query: 332 SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + LR C S F+ V + + ++F I I + + Sbjct: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 219 bits (557), Expect = 7e-55, Method: Composition-based stats. Identities = 66/328 (20%), Positives = 113/328 (34%), Gaps = 54/328 (16%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVV--VPQNEGYSISAISRYKIPLKFCTFIPWYT 149 +F N ++ DI +T + + + Y ++ Y + + +T Sbjct: 1 RNFVAGQMAN-YLQSGVDIKSATGVTVQTNTSGNSTSYQVTVSPSYDLTVNPLMQAVGFT 59 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM------------------------- 184 ++H+ T+ + +SQT + M + LD S SM Sbjct: 60 -TQHLSTSGTT-IGGHSQTQGSISMYLALDKSGSMGEDTATVNEEDPTESYTYDCNGHYN 117 Query: 185 ---ESFFDSSI-------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + +D+ TKI+ + + ++ V++G V++ Sbjct: 118 KKGKWIYDTCTGSRANYYTKIEALKMAAGNLFGQLSSADPNAQYVRTGAVSYDIVQYTPS 177 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ------GMRQHCNTEDANYK 288 L WG S + + L G TNS+ + AY+ + H K Sbjct: 178 ALAWGTSGVSTYVNALQA-GGGTNSSGAMSTAYSSLTAKNAAGNDAEDAAHKLKTGQTPK 236 Query: 289 KIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASP 342 K IVFMTDG+N D + C+ AK +G +Y I L CAS Sbjct: 237 KYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPEGGQALLHYCASD 296 Query: 343 NSFYL-VENPHSMYDAFSHIGKDIVTKR 369 +S Y E + AF IG ++ Sbjct: 297 DSHYFQAEKMEDLLAAFKAIGAKASSQL 324 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 205 bits (520), Expect = 1e-50, Method: Composition-based stats. Identities = 76/393 (19%), Positives = 142/393 (36%), Gaps = 50/393 (12%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G I+TAI +P++ G+ I+ S++ K L + Sbjct: 10 LRRMLGDRGGNFGIMTAIMMPVLLGAAGLAIDYSNMALSKRELQEAT-----------DS 58 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF-VNDIDDIVRSTSLDI--VVVPQN 124 G F + N + I I TS+DI + Sbjct: 59 AALAAATALASGAASTTADAEAIAKDFVSGQMANYVDTDAISSIKAGTSVDIDVSATATS 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + Y ++ + Y I + Y + TS+ S T L M +VLD S SM Sbjct: 119 KSYKVTVATSYGIAATPFMSVLGYKTLN--IGASTSTSSGTSDTKTALSMELVLDQSGSM 176 Query: 185 ESFFDSS-----------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 + +TKID K+ +A+ + + +++V++G +++N Sbjct: 177 GEKTTTCATYNGKNCKTYVTKIDALKKAADALFDALDTADPDHSLVRTGAYSYNNGLIYN 236 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-----GMRQH 279 +I+ + WG + + ++ G T++T ++ A I + H Sbjct: 237 SQKTQIKSMSGMAWGTATTATYVSGITASG-GTDATEPMRQATLSIAKASDGSDVETQAH 295 Query: 280 CNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKKRGAIVYAIGIRVI-RS 332 + + I+ MTDGE DQ C+ K G ++ + + Sbjct: 296 AVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFTVAFMAPDKG 355 Query: 333 HEFLRACASP-NSFYLVENPHSMYDAFSHIGKD 364 + L+ CASP ++Y E + +F+ I K+ Sbjct: 356 KQLLQYCASPGGNYYEAETMEKLVASFTSIAKE 388 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 202 bits (514), Expect = 6e-50, Method: Composition-based stats. Identities = 72/404 (17%), Positives = 144/404 (35%), Gaps = 48/404 (11%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ ++ G ILTAI +P++ G+ ++V+++ + L Q Sbjct: 10 SLIQLIHDRTGNFGILTAIAIPVVAATAGVAVDVTNMTVSNSQLQ------------QAT 57 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF---VNDIDDIVRSTSLDIVVVPQ 123 + L G+ F N N D + T+ ++ Sbjct: 58 DAAALATATALANGNATTSNAQQLATQFVTGQMSNYLSGDTNTADALKAGTTANVTSATN 117 Query: 124 ---NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y+++ + Y + + + + ++S + A L M I LD Sbjct: 118 SSGGTSYTVAVNASYDMSVNGMSQLLGIKTMHVSAASTSTSGSAAAAKQAALSMEIALDK 177 Query: 181 SRSM----------------------------ESFFDSSITKIDMAIKSINAMLEEVKLI 212 S SM ++ I KI ++ +L+++ Sbjct: 178 SGSMLLNTDVIDTSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAALKTAVGTLLDQLDSA 237 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + V++ + +S++++ L WG + + + T S+ + AY + Sbjct: 238 DPKSQYVRTAAIAWSSEVDSSSALAWGTTTTRSNVISGLNANGGTESSAPMALAYKNVSA 297 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-R 331 + ++KIIV MTDGEN +T D ++L C AK G ++Y++ R Sbjct: 298 SSEATAQAAKGNTTFQKIIVLMTDGENNATSSDTKTLATCKAAKDAGVLIYSVAFMAPDR 357 Query: 332 SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L+ CA SP++++ + + AF IG + K Sbjct: 358 GQTLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQASKQITLLTK 401 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 72/396 (18%), Positives = 142/396 (35%), Gaps = 50/396 (12%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G ++TA+ LP+ V G+ ++ + + ++ L S +D + + AA+ + N Sbjct: 41 RLLRHSGGNFGMMTAVLLPVSIGVAGLAMDATEMVQSRSALQSSVDAAALAAASAMSNGM 100 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV-VVPQNEGYS 128 + + L + ++ NT + V+ I S+ V + Y Sbjct: 101 SEADAIALAKSFLSSQLANTMARD-----ENTSSVDQITQAEPDISVKTTQVNSSSTSYD 155 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK-VNSQTDARLDMMIVLDVSRSMESF 187 + Y I + + + + + + +++ L M +VLD S SM Sbjct: 156 VELTGSYTITMNPLS---RVLGWETVTLKAYGKAQAATTASESPLSMYLVLDRSGSMNDE 212 Query: 188 FD-----------------------------SSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + TKI+ ++ + ++K + Sbjct: 213 TATTYTGTCTKTTTSGYGWNKKTTTTSYSCTKNYTKIESLKLAVADLAAQLKKADPNSEY 272 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GM 276 V++G +++ + + WG +++ + LS G T++ L AY+ + + Sbjct: 273 VRTGADSYNASADTAQAMSWGTANVVTYVNALSATG-GTDARGALSAAYSALQTSNKTEI 331 Query: 277 RQHCNTEDANYKKIIVFMTDGENLST------KEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 H + + + IVFMTDGE D C K G +Y + Sbjct: 332 TAHNVSSVSKIGRYIVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMAP 391 Query: 331 -RSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKD 364 L ACAS + +Y + S+ AF IGK Sbjct: 392 ANGKSLLSACASDASHYYEATDAASLVAAFGEIGKK 427 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 179 bits (454), Expect = 6e-43, Method: Composition-based stats. Identities = 86/449 (19%), Positives = 160/449 (35%), Gaps = 85/449 (18%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM-- 66 + + G I+TA+ +P++ V GM+++V + + L +++ A+ ++ Sbjct: 12 KKLIKSCTGHFFIITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLIQS 71 Query: 67 -------NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 + + K+ ++ +N +F + + + ++ S V Sbjct: 72 LEEVSSRAKNSFTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQV 131 Query: 120 VVP------------------------QNEGYSISAISRY-------------------K 136 V+ Q + + + Y Sbjct: 132 VLSSRYDLLLNPLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLD 191 Query: 137 IPLKFCTFIPWYTN--SRHIVMPITSSVKVNSQTDARLDMMIV--------------LDV 180 + V +S D +L +V LD Sbjct: 192 YQRDSEGQPLNCFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDP 251 Query: 181 SRSMESFFDSS-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 S S E F DSS I K + ++ +++ +K I +VN+ V+ G F++++ Sbjct: 252 SLSEEHFVDSSSLRHVIKKKHLVRDALASVIRSIKKIDNVNDTVRMGATFFNDRVISDPS 311 Query: 236 LEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 WGV L R I ST ++ AY+ I H + KK I Sbjct: 312 FSWGVHKLIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYI 371 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-----FLRACASPNSFY 346 V +TDGEN +++++ + CN+AK +G + I V ++ + FL CASPNSF+ Sbjct: 372 VLLTDGENT--QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCASPNSFF 429 Query: 347 LVENPHSMYDAF-SHIGKDIVTKRIWYDK 374 + H + F IG +I + I K Sbjct: 430 EANSTHELNKIFRDRIGNEIFERVIRITK 458 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. Identities = 54/257 (21%), Positives = 85/257 (33%), Gaps = 50/257 (19%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMES--------------FFDSSI-------------- 192 +V +SQ+ + M + LD S SM +D + Sbjct: 8 TVSGHSQSQGSISMFLALDKSGSMGDPTATVNADDPTEPFTYDCNPHLNKKGTKIIYDTC 67 Query: 193 --------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 TKI+ + + ++ V++G V++ L WG++ + Sbjct: 68 TGSRAHYYTKIEALKIAAGNLFSQLNSADPNAEYVRTGAVSYDLVEYTPSKLAWGITAVT 127 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ------GMRQHCNTEDANYKKIIVFMTDGE 298 + L G TNS+ + AY + H KK IVFMTDG+ Sbjct: 128 SYVNALES-GGGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLPKKYIVFMTDGD 186 Query: 299 NLST-----KEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYL-VENP 351 N D + C+ AK +G Y I L CAS ++ Y E Sbjct: 187 NNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCASDDAHYFQAEKM 246 Query: 352 HSMYDAFSHIGKDIVTK 368 + AF IG + Sbjct: 247 EDLLAAFKAIGAKASAQ 263 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 156 bits (395), Expect = 4e-36, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 87/189 (46%), Gaps = 10/189 (5%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--- 248 I K + ++ ++ V+ I ++ + ++ + F+ +I+ +F + WG+ ++++ Sbjct: 271 IRKRYLVRDALATFIKRVRKIDNLKDKLRMSFMYFNERIDHYFPMTWGIKEFKQEVSSHY 330 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQ 307 +T+ P L+ AYN++ +H KK IV +TDG +N Sbjct: 331 KRKHENTATDIHPILQEAYNKLHSKNEDDEHKKKNSVEVKKFIVLLTDGAQNEGVHSVDS 390 Query: 308 SLYYCNEAKKRGAIVYAIGIRV-----IRSHEFLRACASPNSFYLVENPHSMYDAFS-HI 361 L C+ AK+ G ++ I V ++++FL CASP+ F+ + + F HI Sbjct: 391 VLKICDAAKEEGIKIFTISYSVDSSERKKANDFLSRCASPDKFFEAYDADKLNMIFKEHI 450 Query: 362 GKDIVTKRI 370 G I + + Sbjct: 451 GDAIFERLV 459 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 85/193 (44%), Gaps = 8/193 (4%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 SL N + Y G +++A+ LP+IF+V+G++I++ + L ++ + + A+ Sbjct: 6 SLFFNFKRLKKCYNGSFFVISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSAS 65 Query: 63 TQIMNE-GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 Q++N + + K L IK + + L +N D I++ T ++I Sbjct: 66 VQLLNSVEDKSKEKALSSVLGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIYN- 124 Query: 122 PQNEGYSISAISRYKIPLKFCT-FIPWYTNSRHIVMPITSSVKVNSQ----TDARLDMMI 176 + + I+ S Y +PL + F N + + +V S+ + + + Sbjct: 125 -RKGTHIINVYSHYNLPLNPFSLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQW 183 Query: 177 VLDVSRSMESFFD 189 ++D S SM S D Sbjct: 184 LIDDSGSMGSIID 196 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 50/233 (21%), Positives = 97/233 (41%), Gaps = 36/233 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW T + + P SS LD++ V+D SRS+ F + + + + + Sbjct: 18 PWETQLQ-LAGPRCSS--------GPLDLVFVIDSSRSVRPF------EFETMRQFLVGL 62 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL 263 L + + P+ + G++ +S++++ F L ++R I+ + T + + Sbjct: 63 LRSLDVGPNA---TRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAI 119 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 +YA N F + +A+ ++ V +TDG + +A+ RG +Y Sbjct: 120 QYAMNVAFSVAE---GARPPEAHVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIY 171 Query: 324 AIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A+G++ LRA ASP +LVE+ +D G+ + D Sbjct: 172 AVGVQRADVGS-LRAMASPPLNEHVFLVES----FDLIQEFGRQFQGRLCTLD 219 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 339 SRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 389 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 390 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 446 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 447 RALNVPRVGLVFTDGRSQDNIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 501 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 142 bits (357), Expect = 1e-31, Method: Composition-based stats. Identities = 50/233 (21%), Positives = 97/233 (41%), Gaps = 36/233 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW T + + P SS LD++ V+D SRS+ F + + + + + Sbjct: 18 PWETQLQ-LAGPRCSS--------GPLDLVFVIDSSRSVRPF------EFETMRQFLVGL 62 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL 263 L + + P+ + G++ +S++++ F L ++R I+ + T + + Sbjct: 63 LRSLDVGPNA---TRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAI 119 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 +YA N F + +A+ ++ V +TDG + +A+ RG +Y Sbjct: 120 QYAMNVAFSVAE---GARPPEAHVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIY 171 Query: 324 AIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A+G++ LRA ASP +LVE+ +D G+ + D Sbjct: 172 AVGVQRADVGS-LRAMASPPLNEHVFLVES----FDLIQEFGRQFQGRLCTLD 219 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 339 SRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 389 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 390 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 446 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 447 RALNVPRVGLVFTDGRSQDNIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 501 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 134 bits (338), Expect = 2e-29, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 29 GCRAGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGPNA---TRVGVIQ 79 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 80 YSSQVQSVFPLSAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 136 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 +A ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 137 EARVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 190 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LV++ +D G + D Sbjct: 191 LDEHVFLVDS----FDLIQEFGLQFQGQLCAID 219 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 76/180 (42%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 298 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 348 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 349 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 405 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + + P Sbjct: 406 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELRKIASEP 460 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 167 RCRTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRSLDVGPNA---TRVGVIQ 217 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++ I+ L T + ++YA N F + Sbjct: 218 YSSQVQSVFPLGAFSRREDMEGAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 274 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 +A ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 275 EARVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 328 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G+ + + Sbjct: 329 LDEHVFLVES----FDLIQEFGQQFQGRLCPIN 357 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 74/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 477 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 527 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 528 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 584 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + AK+ G ++YA+G+ E + P Sbjct: 585 RALNVPRVGLVFTDGR-----SQDDVSVWAARAKEEGIVMYAVGVGKAVEEELREIASEP 639 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 132 bits (332), Expect = 9e-29, Method: Composition-based stats. Identities = 50/242 (20%), Positives = 96/242 (39%), Gaps = 36/242 (14%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +P+ PW T + + P + LD++ V+D SRS+ F + + Sbjct: 9 LPMLLFLLQPWETQLQ-LAGPRCRT--------GPLDLVFVIDSSRSVRPF------EFE 53 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFG 254 + + +L + + P+ + G++ +S++++ F L ++ I+ L Sbjct: 54 TMRQFLVGLLRGLDVGPNA---TRVGVIQYSSQVQSVFPLGAFARREDMEHAIRALVPLA 110 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + ++YA N F + +A +I V +TDG + + Sbjct: 111 QGTMTGLAIQYAMNVAFSVAE---GARPPEARVPRIAVIVTDGRPQD-----RVAEVAAQ 162 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ RG +YA+G++ LRA ASP +LVE+ +D G + Sbjct: 163 ARARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES----FDLIQEFGLQFQGRLCA 217 Query: 372 YD 373 D Sbjct: 218 ID 219 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 76/180 (42%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 339 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 389 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 390 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 446 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + + P Sbjct: 447 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELRQIASEP 501 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 131 bits (330), Expect = 1e-28, Method: Composition-based stats. Identities = 50/242 (20%), Positives = 96/242 (39%), Gaps = 36/242 (14%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +P+ PW T P + LD++ V+D SRS+ F + + Sbjct: 9 LPVLLFLLQPWET-LSQFAGPRCRT--------GPLDLVFVIDSSRSVRPF------EFE 53 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFG 254 + + +L + + P+ + G++ +S++++ F L ++R I+ L Sbjct: 54 TMRQFLVGLLRGLDVGPNA---TRVGVIQYSSQVQSVFPLGAFSRREDMERAIRALVPLA 110 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + ++YA N F + +A +I V +TDG + + Sbjct: 111 QGTMTGLAIQYAMNVAFSVAE---GARPPEARVPRIAVIVTDGRPQD-----RVAEVAAQ 162 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ RG +YA+G++ LRA ASP +LVE+ ++ G+ + Sbjct: 163 ARARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES----FNLIQEFGQQFRGRLCG 217 Query: 372 YD 373 D Sbjct: 218 KD 219 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 76/180 (42%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 380 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 430 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 431 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 487 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + + P Sbjct: 488 RALNVPRVGLVFTDGRSQDDIS-----VWARRAKEEGIVMYAVGVGKAVEEELRQIASEP 542 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 131 bits (328), Expect = 2e-28, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +K LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 1 LKGPRCRTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLDVGPNA---TRV 51 Query: 222 GLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++ I+ L T + ++YA N F + Sbjct: 52 GVIQYSSQVQSVFPLGAFARREDMEHAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---G 108 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +A +I V +TDG + +A+ RG +YA+G++ LRA Sbjct: 109 ARPPEARVPRIAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 162 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G + D Sbjct: 163 ASPPLDEHVFLVES----FDLIQEFGLQFQGRLCGKD 195 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 76/180 (42%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 356 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 406 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 407 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 463 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + + P Sbjct: 464 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELRQIASEP 518 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats. Identities = 53/208 (25%), Positives = 94/208 (45%), Gaps = 23/208 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q D +L +MIVLD S SM + IT++ ++ K LI +N+ Q G+V Sbjct: 318 TQDDFQLQVMIVLDYSASMYEK-NGDITRMALSAK---------DLIESLNDTHQVGVVE 367 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM--QGMRQHCNTE 283 F E +L+ ++ I+ +S+F S+ + ++ +D +G++Q Sbjct: 368 FHRPDEPPAILQDFTTYKNAAIEAVSQF-----SSGKIYRDFSSCWDAVLKGLKQFPEKP 422 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 D + K +VF++DG +N S + AK+R +Y +GI E L+ A Sbjct: 423 DPDIFKTLVFLSDGFDNSSFSTPGNVISL---AKERDVHIYILGIGRGSEEEVLKNIALE 479 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + ++ EN + F KDI + Sbjct: 480 TGGTYVHAENIAVFRERFKQTIKDIKGQ 507 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 74/351 (21%), Positives = 126/351 (35%), Gaps = 59/351 (16%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKN---TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 ++N+ N + G + T ++ + F N T +IV Sbjct: 491 VVNDPNITDVSCGTGTLASNQQTTCSGTLTLTEGMVGSEGHFTNTATATGTDTDGNIVNS 550 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDV 180 P+ +I AI P K N+ + + +T + ++ T +D +VLDV Sbjct: 551 PE-ASVTIKAIKPLGAPEKHKRIKKNADNTYTVNVDVTGAASSSTITTTQPVDFTLVLDV 609 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNNVVQSGLVTFSNKI-----E 231 S SM S+TK+ ++N L+E I + V+ GLV F+ Sbjct: 610 SGSMRENM-GSVTKLQALQSAVNNFLDEAAKINKGAQSGSEPVRVGLVKFAGNATKKIGN 668 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR-----QHCNT---- 282 + + +W + + +K L T T GLK N++ R QH T Sbjct: 669 KTYQDKWNTYNYSQIVKKL------TADTDGLKNEVNKLTAGGATRADYGFQHAFTVMSE 722 Query: 283 EDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI--------RV 329 KK+++F TDG+ S K ++ Y + K GAIVY+IG+ Sbjct: 723 ARTEAKKVVIFFTDGKPTSEKTFDGKVANDAVEYAKQLKDSGAIVYSIGVFDGANPASTA 782 Query: 330 IRSHEFLRACAS----------------PNSFYLVENPHSMYDAFSHIGKD 364 ++F+ A +S + + + F I K Sbjct: 783 TSENKFMHAVSSNYPNAANYEDLSEGSNAGYYKTATDASGLNSIFEEIRKS 833 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNA---TRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 78 YSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G ++ D Sbjct: 189 LDEHVFLVES----FDVIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGPNA---TRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 78 YSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G ++ D Sbjct: 189 LDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 74/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIAMYAVGVGKAVEAELREIASEP 499 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 129 bits (325), Expect = 5e-28, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLNVGPNA---TRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 78 YSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G ++ D Sbjct: 189 LDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 129 bits (325), Expect = 6e-28, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 296 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 346 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 347 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 403 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 404 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 458 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 129 bits (324), Expect = 6e-28, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCVRD 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 255 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 305 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 306 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 362 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 363 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 417 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 78 YSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G ++ D Sbjct: 189 LDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 87/213 (40%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L + + P+ + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 78 YSSQVQSVFPLRAFSRSEDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G ++ D Sbjct: 189 LDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASKP 499 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 87/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ G +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARASGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 296 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 346 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 347 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 403 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 404 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 458 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 87/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ G +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARASGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 89/218 (40%), Gaps = 27/218 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 ASP +LVE+ +D G ++ D+ Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCGKDQ 218 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 376 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 426 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 427 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 483 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 484 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 538 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 129 bits (324), Expect = 7e-28, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 89/218 (40%), Gaps = 27/218 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 ASP +LVE+ +D G ++ D+ Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCGKDQ 218 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 378 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 428 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 429 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 485 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 486 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 540 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 129 bits (323), Expect = 9e-28, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETVRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 499 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 129 bits (323), Expect = 1e-27, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLRGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGLQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 296 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 346 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 347 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 403 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 404 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 458 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 87/212 (41%), Gaps = 27/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ + Sbjct: 28 CRTGPLDIVFVIDSSRSVRPF------EFETMRRFMIDIIGNLDVGPNA---TRVGVIQY 78 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + ++R I + T + ++YA N F +Q Sbjct: 79 SSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQE---GARPPH 135 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 +I + +TDG + A+ G +YA+GI+ + LRA ASP Sbjct: 136 KKIPRIAIIVTDGRPQD-----RVSEVAAHARNAGIEIYAVGIQRADMNS-LRAMASPPL 189 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ ++ GK K D Sbjct: 190 EEHVFLVES----FELIQQFGKQFQDKLCAID 217 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 55/138 (39%), Gaps = 14/138 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+++V+D S+S+ ++ + +N +++ +++ PD + GLV + Sbjct: 339 CGAGHVDLVMVIDGSKSVRP------QNFELVKQFVNRIVDLLEVSPDG---TRVGLVQY 389 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+++ F L ++ + + T + LK+ F + Sbjct: 390 SSRVRTEFPLNKYHSAEEIKEAVMKMEYMEKGTMTGLALKHMVEHSFS---ELEGARPLS 446 Query: 285 ANYKKIIVFMTDGENLST 302 N +I + TDG + Sbjct: 447 HNVPRIGLVFTDGRSQDD 464 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 87/212 (41%), Gaps = 27/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ + Sbjct: 28 CRTGPLDIVFVIDSSRSVRPF------EFETMRRFMIDIIGNLDVGPNA---TRVGVIQY 78 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + ++R I + T + ++YA N F +Q Sbjct: 79 SSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMNVAFTVQE---GARPPH 135 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 +I + +TDG + A+ G +YA+GI+ + LRA ASP Sbjct: 136 KKIPRIAIIVTDGRPQD-----RVSEVAAHARNAGIEIYAVGIQRADMNS-LRAMASPPL 189 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ ++ GK K D Sbjct: 190 EEHVFLVES----FELIQQFGKQFQDKLCAID 217 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 79/203 (38%), Gaps = 26/203 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+++V+D S+S+ ++ + +N +++ +++ PD +V GLV Sbjct: 337 SKCGAGHVDLVMVIDGSKSVRP------QNFELVKQFVNRIVDLLEVSPDGTHV---GLV 387 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L ++ + + T + LK+ F + Sbjct: 388 QYSSRVRTEFPLNKYHSAEEIKAAVMKMEYMEKGTMTGLALKHMVEHSFS---ELEGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +I + TDG + + AK+ G +++A+G+ E LRA AS Sbjct: 445 LSHNVPRIGLVFTDGRSQDDIS-----EWAQRAKESGIVMFAVGVGKAVEEE-LRAIASE 498 Query: 343 --NSFY----LVENPHSMYDAFS 359 + + + F Sbjct: 499 PVEQHFSYSADFTTMTHLVENFK 521 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 337 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 387 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 388 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +VYA+G+ E + P Sbjct: 445 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVVYAVGVGKAVEAELREIASEP 499 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAID 217 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 296 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 346 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 347 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 403 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +VYA+G+ E + P Sbjct: 404 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVVYAVGVGKAVEAELREIASEP 458 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 88/217 (40%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 23 LTGPRCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLMGLLPGLNVGPNA---TRV 73 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 74 GVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---G 130 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 131 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 184 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ASP +LVE+ +D G ++ D Sbjct: 185 ASPPLDEHVFLVES----FDLIQEFGLQFQSRLCVRD 217 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 255 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 305 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 306 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 362 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +VYA+G+ E + P Sbjct: 363 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVVYAVGVGKAVEAELREIASEP 417 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 93/241 (38%), Gaps = 31/241 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P FI + + ++ + LD++ V+D SRS+ F + + Sbjct: 5 PYAPLLFILLV----SLNAELQATPAGSKCRTGPLDLVFVIDSSRSVRPF------EFET 54 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 + + ++ + + P+ + G++ +S++++ F L ++R I + Sbjct: 55 MRRFLVNIIRGLDIGPNA---TRVGVIQYSSQVQSVFPLGAFSRREDMERAIHAIVPLAQ 111 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + ++YA N F + A ++ V +TDG + +A Sbjct: 112 GTMTGLAIQYAMNVAFSVAE---GARPSQARVPRVAVIVTDGRPQD-----RVTEVAAQA 163 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + RG +YA+G++ LRA ASP +LVE+ +D G + Sbjct: 164 RNRGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES----FDLIQQFGFHFQGRLCGV 218 Query: 373 D 373 D Sbjct: 219 D 219 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 77/187 (41%), Gaps = 22/187 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+++V+D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 380 DRCRTGHVDLVLVIDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 430 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +G + +++ + + T + L++ F Q Sbjct: 431 QYSSRVRTEFPLGRYGTADEVKQAVLAVEYMEKGTMTGLALRHLVEHSFSEA---QGARP 487 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 N ++ + TDG + + AK+ G I+YA+G+ E LR AS Sbjct: 488 RAQNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIIMYAVGVGKAVEEE-LREIASD 541 Query: 342 -PNSFYL 347 P Sbjct: 542 PPEQHVS 548 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 85/213 (39%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ Sbjct: 26 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNA---TRVGVIQ 76 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L + R I+ L T + ++YA N F + Sbjct: 77 YSSQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 133 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 134 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 187 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 188 LEEHVFLVES----FDLIQEFGLQFQGRLCAID 216 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 72/166 (43%), Gaps = 19/166 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 336 SRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 386 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 387 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 443 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N ++ + TDG + + AK+ G ++YA+G+ Sbjct: 444 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVG 484 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 127 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 86/217 (39%), Gaps = 27/217 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V LD++ V+D SRS+ F + + + + +L + + P+ + Sbjct: 25 VAGPRCHTGPLDLVFVIDSSRSVRPF------EFETVRQFLVGLLHGLNVGPNA---TRV 75 Query: 222 GLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G++ +S++++ F L ++R I+ L T + ++YA N F + Sbjct: 76 GVIQYSSQVQSVFPLGAFSRPEDMERAIRALVPLAQGTMTGLAIQYAMNVAFSVAE---G 132 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + ++ V +TDG + +A+ RG +YA+G++ LRA Sbjct: 133 ARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAM 186 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 AS +LVE+ +D G + D Sbjct: 187 ASHPLDEHVFLVES----FDLIQEFGLQFQGRLCAID 219 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + PD + GLV Sbjct: 339 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPDG---TRVGLV 389 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 390 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 446 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 447 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 501 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 85/209 (40%), Gaps = 27/209 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++ V+D SRS+ + + + + ++ + + P+ + G++ + Sbjct: 30 CHTGPLDLVFVIDSSRSVRPL------EFETMRRFLVGLVRSLDVGPNA---TRVGVIQY 80 Query: 227 SNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L ++R ++ L T + ++YA N F + + Sbjct: 81 SSQVQSVFPLGAFSRREDMERALRTLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPPE 137 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 138 ERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPPL 191 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 +LVE+ +D G+ + Sbjct: 192 DEHVFLVES----FDLIQEFGRQFRGRLC 216 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 127 bits (318), Expect = 3e-27, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 85/213 (39%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ Sbjct: 26 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLVRGLNVGPNA---TRVGVIQ 76 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L + R I+ L T + ++YA N F + Sbjct: 77 YSSQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVAFSVAE---GARPP 133 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 134 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 187 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 188 LEEHVFLVES----FDLIQEFGLQFQGRLCGKD 216 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 377 SRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 427 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 428 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 484 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 485 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 539 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 126 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 43/212 (20%), Positives = 86/212 (40%), Gaps = 27/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++ V+D SRS+ F + + + + ++ + + P+ + G++ + Sbjct: 28 CRTGPLDIVFVIDSSRSVRPF------EFETMRRFMMDIIGNLDVGPNA---TRVGVIQY 78 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + ++R I + T + ++YA N F Q Sbjct: 79 SSQVQNIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMNVAFTTQE---GARPLH 135 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 +I + +TDG + +A+ G +YA+GI+ + LRA ASP Sbjct: 136 KRIPRIAIVVTDGRPQD-----RVTEVATQARNAGIEIYAVGIQRADMNS-LRAMASPPL 189 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ ++ K K D Sbjct: 190 EEHVFLVES----FELIQQFAKQFQDKLCAID 217 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 79/203 (38%), Gaps = 26/203 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+++V+D S+S+ ++ + +N +++ +++ P + GLV Sbjct: 337 SKCGAGHVDLVMVIDGSKSVRP------QNFELVKQFVNRIVDLLEVSPHG---TRVGLV 387 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 388 QYSSRVRTEFPLNKYHSADEIKKAVMDVEYMEKGTMTGLALKHMVEHSFS---ELEGARP 444 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +I + TDG + + AK+ G +++A+G+ E LRA AS Sbjct: 445 LSYNIPRIGLVFTDGRSQDDIS-----EWARRAKESGIVMFAVGVGKAVEEE-LRAIASE 498 Query: 343 --NSFY----LVENPHSMYDAFS 359 + + + FS Sbjct: 499 PVEQHFSYSADFTTMTHLVENFS 521 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 124 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 85/210 (40%), Gaps = 27/210 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D SRS+ F + + + + +L +K+ N + G++ Sbjct: 27 RCHTGPLDLVFVIDSSRSVRPF------EFETMRQFLVGLLRGLKV---GANATRVGVIQ 77 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ L T + ++Y N F + Sbjct: 78 YSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYVMNVAFSVAE---GARPP 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + ++ V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 135 EERVPRVAVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 188 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 +LVE+ +D G ++ Sbjct: 189 LDEHVFLVES----FDLIQEFGLQFQSRLC 214 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 90/222 (40%), Gaps = 27/222 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + + + LD++ V+D SRS+ F + + + + ++ + + P+ Sbjct: 30 PYGTGSQQSKCKTGPLDIVFVIDSSRSVRPF------EFETMRRFMIDIIHNLDIGPNA- 82 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + G++ +S++++ F L+ + +++ I + T + ++Y N F Q Sbjct: 83 --TRVGVIQYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTMTGLAIQYVMNVAFTTQ 140 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ V +TDG + +A+ G +YA+G++ + Sbjct: 141 E---GARPLHKKIPRVAVIVTDGRPQD-----RVTEVSAQARAAGIEIYAVGVQRADMNS 192 Query: 335 FLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 LRA ASP +LVE+ +D GK K D Sbjct: 193 -LRAMASPALEEHVFLVES----FDLIQQFGKQFQDKLCVID 229 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 72/184 (39%), Gaps = 20/184 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+++V+D S+S+ ++ + +N +++ + + P + GLV + Sbjct: 351 CKAGHIDLVLVIDGSKSVRP------QNFELVKQFVNQIVDFLDVSPHG---TRVGLVQY 401 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+++ F L + L++ ++ + T + LK+ F Sbjct: 402 SSRVRTEFPLNKFTTAADLKKAVQRVQYMEKGTMTGLALKHMLEHSFTEAE---GARPLS 458 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-N 343 N +I + TDG + + +K+ G I++A+G+ E + P + Sbjct: 459 QNVPRIGLVFTDGRSQDDIS-----EWARRSKEAGIIMFAVGVGKAVESELREIASEPVD 513 Query: 344 SFYL 347 + Sbjct: 514 KHFS 517 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 85/213 (39%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ ++D SRS+ F + + + + +L + + + + G++ Sbjct: 42 RCYTGPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLHSLDVGLNA---TRVGVIQ 92 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ + T + ++YA N F Sbjct: 93 YSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPS 149 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + +++V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 150 EERVPRVLVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 203 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 204 LDQHVFLVES----FDLIQEFGLQFQGRLCGKD 232 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 80/194 (41%), Gaps = 20/194 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 393 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 443 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 444 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFS---EVQGARP 500 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D N ++ + TDG + + AK+ G ++YA+G+ E LR AS Sbjct: 501 RDLNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEE-LREIASE 554 Query: 343 NSFYLVENPHSMYD 356 S V Sbjct: 555 PSELHVSYSPDFNT 568 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 85/213 (39%), Gaps = 27/213 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ ++D SRS+ F + + + + +L + + + + G++ Sbjct: 29 RCYTGPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLHSLDVGLNA---TRVGVIQ 79 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++++ F L ++R I+ + T + ++YA N F Sbjct: 80 YSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPS 136 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 + +++V +TDG + +A+ RG +YA+G++ LRA ASP Sbjct: 137 EERVPRVLVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRAMASPP 190 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 191 LDQHVFLVES----FDLIQEFGLQFQGRLCGKD 219 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 80/194 (41%), Gaps = 20/194 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 380 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 430 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 431 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFS---EVQGARP 487 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D N ++ + TDG + + AK+ G ++YA+G+ E LR AS Sbjct: 488 RDLNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEE-LREIASE 541 Query: 343 NSFYLVENPHSMYD 356 S V Sbjct: 542 PSELHVSYSPDFNT 555 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 84/209 (40%), Gaps = 27/209 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ ++D SRS+ F + + + + +L + + + + G++ +S++ Sbjct: 33 GPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLRSLDVGLNA---TRVGVIQYSSQ 83 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ F L ++R I+ + T + ++YA N F + Sbjct: 84 VQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEERV 140 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 +++V +TDG + +A+ RG +YA+G++ LR ASP Sbjct: 141 PRVLVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLDQH 194 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 195 VFLVES----FDLIQEFGLQFQGRLCGKD 219 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 77/181 (42%), Gaps = 19/181 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 380 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 430 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 431 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 487 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 488 RDLNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 542 Query: 343 N 343 + Sbjct: 543 S 543 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 123 bits (308), Expect = 5e-26, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 84/209 (40%), Gaps = 27/209 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ ++D SRS+ F + + + + +L + + + + G++ +S++ Sbjct: 33 GPLDLVFMIDSSRSVRPF------EFETMRQFLVGLLRSLDVGLNA---TRVGVIQYSSQ 83 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ F L ++R I+ + T + ++YA N F + Sbjct: 84 VQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVAFSEAE---GARPSEERV 140 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 +++V +TDG + +A+ RG +YA+G++ LR ASP Sbjct: 141 PRVLVIVTDGRPQD-----RVAEVAAQARARGIEIYAVGVQRADVGS-LRTMASPPLDQH 194 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +LVE+ +D G + D Sbjct: 195 VFLVES----FDLIQEFGLQFQGRLCGKD 219 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 77/181 (42%), Gaps = 19/181 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 380 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 430 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 431 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 487 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 488 RDLNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 542 Query: 343 N 343 + Sbjct: 543 S 543 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 120 bits (300), Expect = 5e-25, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 87/215 (40%), Gaps = 27/215 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V +D++ ++D SRS+ F + + K + ++ +++ + G+ Sbjct: 23 VQKCKSGPMDLVFIIDSSRSVRPF------EFETMRKFMIDIINSLEVGLS---TTRVGV 73 Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V +S++++ F L+ S +++ I + T + ++YA N F + Sbjct: 74 VQYSSQVQTVFSLKTFSNKSDMEKAINEIIPLAQGTMTGLAIQYAMNVAFTEEE---GAR 130 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N ++ + +TDG + +A++ G +YA+G++ LRA AS Sbjct: 131 PLSKNIPRVAIIVTDGRPQD-----RVTEVAVQAREAGIEIYAVGVQRADVSS-LRAMAS 184 Query: 342 ---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + VE+ +D H+ K D Sbjct: 185 HPLDDHVFHVES----FDLIQHLSIQFQDKLSCVD 215 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 84/244 (34%), Gaps = 38/244 (15%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT-------------SSVKVNSQTDA 170 NEGY++ A + + C + + + P + N + Sbjct: 404 NEGYTLQADGKTCVADDMCNIVDHGCEFKCVSTPGSYYCICPEGQELQADGKTCNKCSTG 463 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ V+D S+S+ ++ + + + V GLV +S+++ Sbjct: 464 YIDLVFVIDGSKSVRP------QNFELVKEFV---INIVDSSAISAQGTHIGLVQYSSRV 514 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L ++ +K + T + LK+ Q F N Sbjct: 515 RTEFPLSQYTNGQDIKTAVKNIQYMEKGTMTGLALKHMVEQSFSEAE------GARKNVP 568 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 KI + TDG + + +AK+ G +YA+G+ E + P +SF Sbjct: 569 KIGLVFTDGRSQDDIS-----EWAKKAKEAGITMYAVGVGKAVEDELNEIASDPVNKHSF 623 Query: 346 YLVE 349 Y + Sbjct: 624 YTAD 627 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 119 bits (297), Expect = 8e-25, Method: Composition-based stats. Identities = 52/208 (25%), Positives = 92/208 (44%), Gaps = 29/208 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + +LD++ ++D S S+ S S+ T +K I V N V+ G+ Sbjct: 1396 ACSHEQLDLVFLIDGSASITS---SNFTSAKTFMKEI------VDSFTISENRVRIGVAQ 1446 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S ++ F L + S ++++I +S+ +T + GL++ Q FD N Sbjct: 1447 YSANPKKEFFLNEYYSSSDMKKQIDSISQLKATTYTGKGLRF-VKQFFD------PANGG 1499 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N + ++ MTDG N S ED +L + G +++IGI + S E + SP Sbjct: 1500 RKNVPQYLIVMTDGMSNDSVNEDAAAL------RSSGVKIFSIGIGLRNSFELVMIAGSP 1553 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + Y VE + A I + IV + Sbjct: 1554 KNVYEVET----FQALDSIKRQIVAQVC 1577 Score = 92.9 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 47/275 (17%), Positives = 92/275 (33%), Gaps = 31/275 (11%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMPIT 159 F +I T LDI+ + + S ++ + + + +P Sbjct: 766 FAVGFQNI-NDTELDIIASHPPRKHVTNVESFLQLSNLEFRIQKRLCNEIVVKSFAVPAI 824 Query: 160 SSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + D+ ++D S S+ + K M+E + + N Sbjct: 825 ARAVKEGCVDTEEADIYFLIDGSGSIYPEDFEDMKKF---------MIELISMFQVGANR 875 Query: 219 VQSGLVTFSN-KIEEFFLLEWGVSHL-QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V+ G+V +S+ + EFF+ E + + I + + G T + L + Sbjct: 876 VRFGVVQYSDVRRTEFFISEHNTQKMLKDAISQIEQLGGGTLTGEALTSMKQLFVNAAKD 935 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R H + +V +TDGE + E + G ++AIG++ E Sbjct: 936 RPHK------VPQSLVVITDGE-----SQDRVTEAAAEIRNDGITIFAIGVKNAVEEEIR 984 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S + V N +D+ I D+ + Sbjct: 985 DIAGSNEKMFFVNN----FDSLKVIKNDLARELCT 1015 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 82/205 (40%), Gaps = 23/205 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + D++ ++D S S+ S + + + +M+++ ++ PD VQ G Sbjct: 1015 TPEACKNMKADIVFLVDSSASINS------DDYETMKEFMESMVKQAEIGPD---RVQIG 1065 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FS++ +E F L +Q I+ + + T LKY +G Sbjct: 1066 LIQFSSETKEEFPLNRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASKG----- 1120 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N K+ ++ +TDGE + G I+YAIG++ + + L Sbjct: 1121 --GRVNTKQYLIVITDGE-----AQDAVGNPAKAIRDHGVIIYAIGVQQANNTQLLEIAG 1173 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDI 365 Y ++ S+ +I +I Sbjct: 1174 KQEQVYYEDSFDSLAFLNKNIMFEI 1198 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 97/216 (44%), Gaps = 22/216 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 +++I+ I + + +T+ D++ +LD S S+ + + + + A++ + Sbjct: 1191 NKNIMFEICNPQESCKKTEVA-DIIFLLDASASIT------RGEFRLMQRFVEAVVNDSL 1243 Query: 211 LIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + D VQ G V + +N E+F L + + + + L + T + L+Y Sbjct: 1244 VGKDN---VQFGAVVYGTNPAEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRI 1300 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGI 327 + G R + I++ +TDG +T+ D+ +L ++A K G IV+A+G+ Sbjct: 1301 RFGTSYGGRPGISH-------ILILVTDG--ATTEADRPNLPIVSKALKDDGIIVFAVGV 1351 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 E + P+ ++LV+N + + +I + Sbjct: 1352 GKAVPQELQQIAGYPDRWFLVQNYKGLDNIHDNITQ 1387 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 73/203 (35%), Gaps = 26/203 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-E 231 D++ ++D S S+ + ++ + D +Q GL+ +S+ Sbjct: 33 DIVFLMDGSWSIG------TENFITMKNFLYTLVNGFDVGLDK---IQIGLIQYSDNART 83 Query: 232 EFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EFFL + + + I+ L G T + L++ + + +I Sbjct: 84 EFFLNSYSNKEDVLKYIQNLKYKGGGTKTGLSLEF----MLTQHFSEAAGSRAAEGVPQI 139 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLV 348 V +TDG+ K G I+YAIGI+ E + P+ Y V Sbjct: 140 AVVITDGQ-----AQDSIREPAIAVKNAGIILYAIGIKDAVLSELNEIASDPDDKHVYSV 194 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + ++A I ++++ Sbjct: 195 AD----FNALQSISQNMIQVLCT 213 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 90/231 (38%), Gaps = 30/231 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL----------DMMIVL 178 I++ K F + S++++ + ++V+ ++ ++ D+++++ Sbjct: 182 IASDPDDKHVYSVADFNALQSISQNMIQVLCTTVEEAARQTGQIAQVCRTANQADIVLLV 241 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 + + M K+ N + + V + N ++ GLVT++++ FLL Sbjct: 242 ESTTRMGD---------ATFEKAKNFLYDLVSNLDVGINKIRIGLVTYNDETNPEFLLNS 292 Query: 239 GVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 S + I+ + T + L+Y F + + + +I++ +T+ Sbjct: 293 YSSKTEILESIQNMKYVEGYTYTGRALEYVNTTYFTQAAGSRF----EESVAQILIIVTE 348 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 G++ T E K RG VY +G + + A + P+ + Sbjct: 349 GDSSDT-----LTEPAKELKSRGISVYVVGTNIKYDRQLQEASSKPDEKFF 394 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 77/203 (37%), Gaps = 22/203 (10%) Query: 157 PITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 SV + A + D++ ++D S S+ + +T++ + + V + Sbjct: 615 FSPVSVVPAVCSSASVADIVFLIDESSSIGP-INFQLTRV--------FLHKVVSALDIS 665 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + V+ GLV +S++ L + I L G ++ L + ++F Sbjct: 666 LSNVRVGLVLYSDEPRLELKLNTFNEKYEILDFITKLPYRGGKAHTGAALDFLRKKMFT- 724 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +Q+ ++I V MT+G+++ + ++ G V+A+G + I Sbjct: 725 ---KQNGGRPHQGVQQIAVVMTNGQSMDNFT-----KPAAKLRRSGVEVFAVGFQNINDT 776 Query: 334 EFLRACASPNSFYLVENPHSMYD 356 E L AS V N S Sbjct: 777 E-LDIIASHPPRKHVTNVESFLQ 798 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 83/237 (35%), Gaps = 28/237 (11%) Query: 116 LDIVVVPQNEGY-----SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ--T 168 + + VV N Y S+ K + F + ++ + S+ +N Q + Sbjct: 367 ISVYVVGTNIKYDRQLQEASSKPDEKFFYQLDDFDDSENVTEQLLKNLCFSIDLNIQVYS 426 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D++ ++D S SM + F I ++ ++ + + + GL +S Sbjct: 427 KRYADVVFLVDSSTSMGTIF------FQKMKDFIIHIINQLNVGINK---HRIGLAQYSG 477 Query: 229 KIEEFFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + FLL + + IK + G + L++ F ++ N + Sbjct: 478 LPQTEFLLNHYETKEEILKHIKETFTYRGGPLKTGHALEF-VRSTFFIEEAGSRINYGN- 535 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + +V +T S+K + + E K G ++GI E + P Sbjct: 536 --PQFLVVIT-----SSKSEDAVRRHAEELKSVGVTTISVGIGNSDRKELEKIATDP 585 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 116 bits (290), Expect = 6e-24, Method: Composition-based stats. Identities = 45/203 (22%), Positives = 87/203 (42%), Gaps = 24/203 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S RLD++ V+D SRS+ + + I +L+ + + PD V + GL+ Sbjct: 3 SSCDSKRLDLIFVIDSSRSVRPH------DFEKVKEFIVTILQFLDVAPD---VTRVGLI 53 Query: 225 TFSNKIEEFFLL-EWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ F L +G S ++R +K + + G T + ++YA N F Sbjct: 54 QYGSTVKNEFSLKTYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIAFSESE---GARP 110 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +II+ +TDG +A+ G +++AIG+ + + + P Sbjct: 111 LRENVPRIIMIVTDGRPQDP-----VAEVAAKARNSGILIFAIGVGQVDYNTLKSIGSKP 165 Query: 343 --NSFYLVEN---PHSMYDAFSH 360 + +LV N S+ F + Sbjct: 166 HQDHVFLVANFSQIESLTSVFQN 188 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 62/245 (25%), Positives = 98/245 (40%), Gaps = 38/245 (15%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 Y P + +P + I S + V LD+ VLD S ++ Sbjct: 499 VTYSNPQQLFNQLP------QLQKRICS-IDVQGCQAQPLDLAFVLDASTAVG------- 544 Query: 193 TKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKY 249 + + V L D+N +V Q GLVT+S++ E F L+ S L + I Sbjct: 545 ---QEKFNRLKNFVTMVSLQFDINRDVTQIGLVTYSSRPETVFGLDTHDSGSSLLQGIGR 601 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 S G S ++ L YN + +Q G R N K +V +TDG + + Sbjct: 602 ASYMGGSASTGSALLRVYNDVMTVQKGARPGVN-------KAVVVITDGRGA-----EDA 649 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + G +VY IGI I+ + LR S +L+ P Y++ H +D V + Sbjct: 650 AVPAQKLRDNGIMVYVIGIGNIQRNSLLRLAGSEK--FLISVPS--YESLGHY-EDSVVQ 704 Query: 369 RIWYD 373 R+ D Sbjct: 705 RVCED 709 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 79/214 (36%), Gaps = 25/214 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + LD++I+LD S S+ S + + + ++++ Sbjct: 37 SAAGRRMRCSSPLDILILLDGSNSIG---RGSFERSKHFASKL------CDALDIGSDLI 87 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + +S + F L+ S ++ KIK + G T + LKY + Sbjct: 88 RVGAMQYSGAPQVEFRLDSSFSKAAIKEKIKSIVFKGGPTETGLALKYIV--------WK 139 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 A+ KI++ ++DG K + K V+ +G++ R E L Sbjct: 140 GFPGGRPASVPKILIIVSDG-----KSQGNIKLPAAQIKGEDIEVFTVGVKFPRWEE-LH 193 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A +S V + DA + + + + Sbjct: 194 ALSSEPQEAHVLFAEHVDDAVNGLATSLTNSSLC 227 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 71/197 (36%), Gaps = 22/197 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S T ++ ++ + + ++ + + G+ +SN+++ Sbjct: 341 IDLLFLVDSS---------DTTSLEAFMQHKSFLKRFIQASLSDESPLNVGVAQYSNEVQ 391 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + ++ L + I + G + L+Y F + + + Sbjct: 392 MVVKIGEYQSMAELLKHIDNMRFMGGGLFTGKALRYVTQYGFKSTPVFSDV---RDDLPR 448 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLV 348 ++V +T +K A+ + V+ IG+ + E +P + Sbjct: 449 LVVLLT-----GSKSQDSVTGPATYARDQ--EVFLIGVTSDSNKGEMAEIVGNPLNLVTY 501 Query: 349 ENPHSMYDAFSHIGKDI 365 NP +++ + K I Sbjct: 502 SNPQQLFNQLPQLQKRI 518 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 88/212 (41%), Gaps = 24/212 (11%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + +S + +D RLD++ ++D SRS+ + + + I +L+ + + PD Sbjct: 2 LFYSSFLPEEFCSDKRLDLVFIIDSSRSVRPY------DFEKVKEFILNILQFLDIRPD- 54 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V + GL+ + + ++ F L+ ++R ++ + T + +++A N F Sbjct: 55 --VTRVGLIQYGSTVKNEFSLKTFARKQDMERAVRRMMYLSTGTMTGLAIQFAVNIAFSE 112 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N ++I+ +TDG +A+ G +++AIG+ + + Sbjct: 113 TE---GARPLSQNVPRVIMIVTDGRPQDP-----VAEIAAKARNSGILIFAIGVGRVDMN 164 Query: 334 EFLRACASP--NSFYLVEN---PHSMYDAFSH 360 + P + +LV N ++ F + Sbjct: 165 TLKSIGSEPYEDHVFLVANFSQIETLTSVFQN 196 Score = 97.1 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 83/210 (39%), Gaps = 25/210 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + ++ + + +L+ + + P + GL+ Sbjct: 569 KGCTEGPVDLVFVIDGSKSLG------VNNFEIVKEFVLGILDSLTISPKAA---RIGLL 619 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L+ + +++ + + G + + LK + F +H Sbjct: 620 QYSTQVRTEFTLKQFSTATDMKKAVSQMKYMGKGSMTGLALKQMTERSFTEAEGARHL-- 677 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ V TDG + + +AK+RG +YAIGI E + P Sbjct: 678 -SAKVPRVCVVFTDGR-----AQDEVSEWAAKAKQRGITMYAIGIGKAIEEELREIASDP 731 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E+ + A I + + + Sbjct: 732 PEKHLFYAED----FSAMGEITEKLQKRMC 757 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 85/202 (42%), Gaps = 24/202 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 NS + LD++ ++D SRS+ + + + I +L+ + + PD +V GL+ Sbjct: 7 NSCNNKHLDLVFIIDSSRSVSHY------DFEKVKEFILTILQFLDISPDATHV---GLI 57 Query: 225 TFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + +++ F L+ ++R +K + G T + L+YA N F Sbjct: 58 QYGSTVKQEFSLKTFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIAFSETE---GARP 114 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +II+ +TDG +A+ G +++AIG+ + + + P Sbjct: 115 LRQNVPRIIMIVTDGRPQDP-----VAEIAAKARNSGILIFAIGVGRVDMNTLKSIGSEP 169 Query: 343 --NSFYLVEN---PHSMYDAFS 359 +LV N ++ AF Sbjct: 170 HEEHVFLVANFSQIETLTSAFQ 191 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 84/209 (40%), Gaps = 25/209 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 T+ +D++ V+D S+S+ ++ + ++ +L+ +++ P + GL+ Sbjct: 606 RCTEGPIDLVFVIDGSKSLGE------DNFEIVKQFVSGILDTLEISPKAA---RVGLLQ 656 Query: 226 FSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S ++ F L S +++ + + G + + L+ + F Sbjct: 657 YSTEVRTEFTLRQFSSAKDMKKAVSQMKYMGRGSMTGLALRQMSERSFTETE---GARPF 713 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 AN +I + TDG + + AK+RG I+YAIGI E L + P Sbjct: 714 SANVPRISIVFTDGR-----AQDEVSEWATRAKQRGIIIYAIGIGKAIEEELLEIASEPS 768 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E+ + A I +++ + Sbjct: 769 YKHLFYAED----FTALEDISEELRAQIC 793 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 85/206 (41%), Gaps = 24/206 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ N+ + RLD++ ++D SRS+ + + + I +L+ + + PD + Sbjct: 20 SLLENTCNNKRLDLVFIIDSSRSVRPY------DFEKVKEFILTILQFLDVSPDA---TR 70 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ + + ++ F L+ ++R ++ + T + ++YA N F Sbjct: 71 VGLIQYGSTVKHEFSLKTFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIAFSESE--- 127 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + N +II+ +TDG +A+ G +++AIG+ + + Sbjct: 128 GARPLNQNVPRIIMIVTDGRPQDPVG-----EIAAKARNSGILIFAIGVGRVDMNTLKSI 182 Query: 339 CASP--NSFYLVEN---PHSMYDAFS 359 + P +LV N ++ F Sbjct: 183 GSEPHEEHIFLVANFSQIETLTSVFQ 208 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 86/209 (41%), Gaps = 25/209 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 T+ +D++ V+D S+S+ ++ + ++ +L+ +++ P + GL+ Sbjct: 1482 RCTEGPVDLVFVIDGSKSLGE------DNFEIVKQFVSGILDTLEISPKAA---RVGLLQ 1532 Query: 226 FSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S+++ F L + +++ + + G + + LK + + F Sbjct: 1533 YSSEVRTEFTLRRFSTAKDMKKAVSQMKYMGRGSMTGLALKQMFERSFTETE---GARPL 1589 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 AN +I + TDG + + + AK+ G I+YAIGI E L + P Sbjct: 1590 SANIPRISIVFTDG-----RAQDEVSEWAARAKRSGIIIYAIGIGKAIEEELLEIASEPS 1644 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E+ + A I +++ + Sbjct: 1645 YKHLFYAED----FTAMEDISEELKVQIC 1669 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 107 bits (266), Expect = 4e-21, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 72/181 (39%), Gaps = 21/181 (11%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 + + +L + + P+ + G++ +S++++ F L ++R I+ L Sbjct: 1 MRQFLMGLLRGLNVGPNA---TRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQ 57 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + ++YA N F + + ++ V +TDG + +A Sbjct: 58 GTMTGLAIQYAMNVAFSVAE---GARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQA 109 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + G +YA+G++ LRA ASP +LVE+ +D G ++ Sbjct: 110 RASGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAI 164 Query: 373 D 373 D Sbjct: 165 D 165 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 244 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 294 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 295 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 351 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 352 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 406 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 105/297 (35%), Gaps = 35/297 (11%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS--TSLDIVVVPQNEGYSISA-------I 132 + ++ F + + + N + + L + P N ++ I Sbjct: 122 DFYQPHQSFKSHFSYKPKASCPKNLVPTFSSAPVAPLPTIPSPPNPTVQLTTPPAPLATI 181 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + +P++ ++ + + +S LD++ ++D SRS+ S Sbjct: 182 TPEVLPVETSAPATLSVSTFTTTVSAVETETDSSCLSRPLDLVFIIDSSRSVR---PSEF 238 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYL 250 K+ + + + V + + + +V +++ ++ FLL+ +L++ I + Sbjct: 239 EKVKIF------LADMVDTLDVGADATRVAVVNYASTVKTEFLLKDHFNKPNLKKAISRI 292 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 T + +K A ++ F Q N K+ + +TDG E+ + Sbjct: 293 EPLATGTMTGLAIKTAVSEAFTEQ---SGARPRPRNIAKVAIIVTDGRPQDQVEEVSAA- 348 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 A+ G +YA+G+ L+ AS + + VE + F Sbjct: 349 ----ARASGVEIYAVGVDRADMRS-LQLMASVPLEDHVFYVETYGVIEKLTSKFRET 400 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 106 bits (264), Expect = 7e-21, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 73/181 (40%), Gaps = 21/181 (11%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 + + +L + + P+ + G++ +S++++ F L ++R I+ L Sbjct: 1 MRQFLMGLLPGLNVGPNA---TRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQ 57 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + ++YA N F + + ++ V +TDG + +A Sbjct: 58 GTMTGLAIQYAMNVAFSVAE---GARPPEERVPRVAVIVTDGRPQD-----RVAEVAAQA 109 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + RG +YA+G++ LRA ASP +LVE+ +D G ++ Sbjct: 110 RARGIEIYAVGVQRADVGS-LRAMASPPLDEHVFLVES----FDLIQEFGLQFQSRLCAI 164 Query: 373 D 373 D Sbjct: 165 D 165 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 244 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 294 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 295 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 351 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +VYA+G+ E + P Sbjct: 352 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVVYAVGVGKAVEAELREIASEP 406 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 106 bits (263), Expect = 9e-21, Method: Composition-based stats. Identities = 31/111 (27%), Positives = 42/111 (37%), Gaps = 7/111 (6%) Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRG 319 A N + H KK IVFMTDG+N + D + C++AK +G Sbjct: 4 TAKNAAGNDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKG 63 Query: 320 AIVYAIGIRVIR-SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 +Y I L CAS +S Y E + AF IG + Sbjct: 64 IEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIGAKSAAQ 114 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 82/212 (38%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + + M++ + + N + G+V + Sbjct: 1 CRSGPVDLLFLIDSSRSVRPH------EFETMRRF---MIDILNTLDIGLNSTRVGVVQY 51 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+++ F L + + + I+ + T + ++Y N F + Sbjct: 52 SSQVRSEFSLRSHASLDSMVKAIQEMVPLAQGTMTGLAIRYTMNVAFTAAE------GDR 105 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 ++V +TDG + EA++RG +YA+G+ LRA ASP Sbjct: 106 PKVPNVVVIVTDGRPQD-----RVAEVAAEARERGMEIYAVGVARADMTS-LRAMASPPF 159 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 160 EDHVFLVES----FDLIHQFGLQFQDKLCGAD 187 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 60/185 (32%), Gaps = 44/185 (23%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA---------MLEE------- 208 + A +D+++++D S+S+ ++ K +N +LE+ Sbjct: 436 GTCKSANIDLVLLIDGSKSVRP------QNFELVKKFVNQGPEPDEGSWLLEDMEKRPER 489 Query: 209 ------------VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFG 254 V + + GLV +S+++ F L ++ + + Sbjct: 490 KTWRHLRFSCWVVDSLDVSPQGTRVGLVQYSSRVRTEFPLSMYHSAQEIKAAVMKVEYME 549 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + LK+ F + +I + TDG + + + Sbjct: 550 KGTMTGLALKHMLENSFSEAE---GARPARRHVPRIGLVFTDGRSQDDIS-----EHARK 601 Query: 315 AKKRG 319 AK+ G Sbjct: 602 AKEAG 606 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 95/218 (43%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D+ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLAFIIDSSRSVNTYHYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 81/215 (37%), Gaps = 26/215 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S +S + +D++ ++D SRS+ T+ + A + + +++ +++ D Sbjct: 36 PSSTVSPASGSSCRNGPIDLVFIVDSSRSVRP------TEFEKAKEFLQDLVDSLEVGLD 89 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV +++ + F L+ S L+ + + T + ++ A + F Sbjct: 90 S---TRVGLVNYASTVRMEFPLKAHFSKPALKGALARVEPLASGTMTGLAIRTAVEKAFA 146 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ V +TDG E + A++ G +YA+G+ Sbjct: 147 AE---AGARLNSTKVARVAVVVTDGRPQDEVERVSAA-----ARESGIEIYAVGVDRADR 198 Query: 333 HEFLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 LR AS + + VE + F Sbjct: 199 TS-LRLMASQPHEDHVFYVETYGVIEKLTSRFRET 232 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 90/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 206 AIGSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 80/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFGSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSARVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 IDKHLFYAEDFSTMGEINEKLKKGICE 836 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 90/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 206 AIGSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 80/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 606 KRCTEGPIDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAISPKAA---RVGLL 656 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 657 QYSTQVRTEFTLRNFGSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 713 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 714 LSARVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 769 IDKHLFYAEDFSTMGEINEKLKKGICE 795 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 95/218 (43%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ + + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 61 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 114 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 115 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 171 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 172 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 223 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 224 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 260 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 60/157 (38%), Gaps = 19/157 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 673 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 723 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 724 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 780 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 ++ + TDG + + ++AK G Sbjct: 781 PSTQVPRVAIVFTDG-----RAQDDVSEWASKAKANG 812 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 103 bits (257), Expect = 5e-20, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 96/218 (44%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + L+A S + +LV N S+ F + Sbjct: 198 QVDLNT-LKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 PSTQVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + + I Sbjct: 810 IDKHLFYAEDFSTMGEISEKLKEGICE 836 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 102 bits (255), Expect = 7e-20, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 71/235 (30%), Gaps = 53/235 (22%) Query: 26 FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCR 85 P++ GM + V + K L + L G I Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAA------------DSAALATATALANGKIQTS 48 Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCT 143 + +F N ++ DI TS+++ ++ Y ++ Y + + Sbjct: 49 EAEAYARNFVAGQMAN-YLQSGVDIKGGTSVNVQTSTSGKSTSYQVTVSPSYDLSVNPLM 107 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--------------FFD 189 + ++H+ T+ V +SQT + M + LD S SM +D Sbjct: 108 QAVGF-KTQHLSTSGTT-VGGHSQTQGSISMFLALDKSGSMGESTATVNEDDPTETFTYD 165 Query: 190 SSI----------------------TKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ TKI+ + + ++ V++G Sbjct: 166 CNLHYNSKNNKWVYDKCTGSRTNYYTKIEALKIAAGNLFSQLNSADPNAQYVRTG 220 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 78/205 (38%), Gaps = 26/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + LD++ ++D SRS+ + + ++ M++ + + + ++ Sbjct: 46 TACKNRPLDLVFIIDSSRSVRP------EEFEKVKIFLSKMIDTLDVGE---RTTRVAVM 96 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +++ ++ F L + ++ + + T + ++ A +++F + Sbjct: 97 NYASTVKVEFPLRTYFDKASMKEAVSRIQPLSAGTMTGLAIQAAMDEVFTEE---MGTRP 153 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + N K+++ +TDG E+ + A+ G +YA+G+ LR AS Sbjct: 154 ANFNIPKVVIIVTDGRPQDQVENVAA-----NARTAGIEIYAVGVGRADMQS-LRIMASE 207 Query: 342 --PNSFYLVEN---PHSMYDAFSHI 361 + VE + F Sbjct: 208 PLDEHVFYVETYGVIEKLTSKFRET 232 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 45/218 (20%), Positives = 95/218 (43%), Gaps = 26/218 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 RH+ M +++ +S + R D++ ++D SRS+ ++ + + + I +L+ + Sbjct: 35 GRHVRMYPQTALLESSCENKRADLVFIIDSSRSVNTYDYAKV------KEFILDILQFLD 88 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + PD V + GL+ + + ++ F L+ S ++R +K + T + ++YA N Sbjct: 89 IGPD---VTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALN 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F N +II+ +TDG +A+ G +++AIG+ Sbjct: 146 IAFSEAE---GARPLRENVPRIIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVG 197 Query: 329 VIRSHEFLRACASP---NSFYLVEN---PHSMYDAFSH 360 + + +A S + +LV N S+ F + Sbjct: 198 QVDLNTV-KAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 60/157 (38%), Gaps = 19/157 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ + + +++ + + P + GL+ Sbjct: 647 KRCTEGPIDLVFVIDGSKSLGE------ENFETVKHFVTGIIDSLAVSPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRGFSSAKEMKKAVTHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 ++ + TDG + + ++AK G Sbjct: 755 PSTQVPRVAIVFTDG-----RAQDDVSEWASKAKANG 786 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 141/419 (33%), Gaps = 92/419 (21%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + G + ++ A+ + +IF +G I+ S + + + L D +++ + M + Sbjct: 23 RDQSGNVIMIFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAVL----RAMALKSAT 78 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + + D +F G + + R + +I+ ++ Y++ A Sbjct: 79 DESRGVAAD----------KAFAENFGHPGVYDLNGALKREVNENII----SQTYTVHAT 124 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDS 190 ++ + + + S K + ++ VLD + SM + Sbjct: 125 VS--------SYFGAFFGKDSYPVTVVSQAKTSLD---VFEIAFVLDTTGSMAEANKMPN 173 Query: 191 SITKIDMAIKSI----NAMLEEVKLIPDVNNVVQSGLVTFSNK----------------- 229 + +D A+ + + + N V+ T + Sbjct: 174 LKSSVDSAMAGLLQNGKNLSGSKIAVVPFNTQVRLSDATVTTMSSQGLSSGWGNCVHDRD 233 Query: 230 ------------------------------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ L +S + IK L G TN Sbjct: 234 LATSHDVSASAAQKGKAQTLYPLETCDEASLKPVQGLSDNISSARNFIKTLQP-GGYTNV 292 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-------STKEDQQSLYYC 312 T G+++ + Q +K ++ +TDG+N ++ D+++ C Sbjct: 293 TMGVQWGMEVLSPNQPFSDATEFGSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALAC 352 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AK +G VY + I S+ LR CAS P FY + + + + A S I K I R+ Sbjct: 353 ENAKAKGITVYTVKIIQGNSN-MLRKCASAPEYFYDLTSANQLNAAMSGIFKSINKTRL 410 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 606 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 656 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 657 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 713 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 714 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 769 TNKHLFYAEDFSTMDEISEKLKKGICE 795 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 645 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 695 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 696 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 752 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 753 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 807 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 808 TTKHLFYAEDFSTMDEISEKLKKGICE 834 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 91/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 45 TALLGDSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 95 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 96 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 153 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + + L+ Sbjct: 154 -GARPLRENVLRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVDLNT-LK 206 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 207 AIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 79/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 648 KRCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAVSPKAA---RVGLL 698 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 699 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 755 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++A+ G +YA+G+ E + P Sbjct: 756 LSTRVPRVAIVFTDGR-----AQDDVSEWASKAQANGITMYAVGVGKAIEEELQEIASEP 810 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 811 TDKHLFYAEDFSTMGEISEKLKKGICE 837 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 79/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G V + Sbjct: 16 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGAVQY 66 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 67 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 121 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 174 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 175 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 202 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 77/205 (37%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+ ++D S+S+ ++ + +N +++++ + + GLV Sbjct: 486 GTCRTSNIDLAPLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 536 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 537 QYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 596 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 597 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 648 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 649 VEKHFFYSAD----FTAISQIAENL 669 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ ++ + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 46 TALLGSACENKRADVVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 96 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 97 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 154 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 155 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 207 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 208 AIGSEPHEDHVFLVANFSQIETLTSVFQK 236 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 78/212 (36%), Gaps = 26/212 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 649 KRCTEGPVDLVFVIDGSKSLGE------DNFEIVKQFVTGIIDSLAISPKAA---RVGLL 699 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S + F L S +++ + ++ G + + LK+ + + F Sbjct: 700 QYSTLVRTEFTLRNFSSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 756 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + TDG + ++A+ G +YA+G+ E + P Sbjct: 757 LSARVPRVAIVFTDGR-----AQDDVSEWASKAQASGITMYAVGVGKAIEEELQEIASEP 811 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + E+ + I + K I Sbjct: 812 TEKHLFYAED----FSTMGEISDKLQ-KGICE 838 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 98/234 (41%), Gaps = 28/234 (11%) Query: 137 IPLKFCTFIPWYTNSR--HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 +P + PW + SR H +++ +S + + D++ ++D SRS+ + + + + Sbjct: 55 LPAEAKERTPWRSISRGRHAWTHPQTALLESSCENKQADLVFIIDSSRSVNTHDYAKVKE 114 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSK 252 +L+ ++ + ++ + GL+ + + I+ F L+ S ++R +K + Sbjct: 115 F---------ILDILQFLDISPDLTRVGLLQYGSTIKNEFSLKTFKKKSEVERAVKRMRH 165 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + ++YA N F N ++I+ +TDG Sbjct: 166 LSTGTMTGLAIQYALNIAFSEAE---GARPLRENVLRVIMIVTDGRPQD-----SVAEVA 217 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 +A+ G +++AIG+ + + L+A S + +LV N S+ F Sbjct: 218 AKARDTGILIFAIGVGQVDLNT-LKAIGSEPHEDHVFLVANFSQMESLTSVFQK 270 Score = 90.2 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 80/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ LD++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 683 KRCTEGPLDLVFVIDGSKSLGE------ENFEIVKQFVAGIIDSLAVSPKAA---RVGLL 733 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F+ + + Sbjct: 734 QYSTQVRTEFTLGDFSSARDMKKAVAHMKYMGKGSMTGLALKHMFERSFNPVEGARPVSP 793 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + A+ G +YA+G+ E + P Sbjct: 794 G---VSRVAIVFTDGR-----AQDDVSAWARRARAGGITMYAVGVGKAIEEELQEIASEP 845 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 846 TDKHLFYAEDFSTMGEISEKLKKGICE 872 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 82/204 (40%), Gaps = 26/204 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + LD++ ++D SRS+ K+ + + ML+ + + PD + GL+ Sbjct: 44 SACHNKPLDLVFIIDSSRSVR---PPDFEKV---KEFLITMLKFLDIGPD---TTRVGLL 94 Query: 225 TFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ F L+ + +R +K + T + ++YA N F Sbjct: 95 QYGSTVKNEFSLKTYKKKMDIERAVKRMMHLATGTMTGLAIQYAMNIAFSESE---GARP 151 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + +I + +TDG +A+ G +++A+G+ + L+ S Sbjct: 152 LNQHVPRIAMIVTDGRPQDP-----VAEIAAKARNSGILIFAVGVGRVD-MSTLKTIGSE 205 Query: 343 ---NSFYLVEN---PHSMYDAFSH 360 +LV N ++ F + Sbjct: 206 PHTEHVFLVANFSQIETLTTVFQN 229 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 80/212 (37%), Gaps = 23/212 (10%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + S + +D++ V+D S+S+ ++ + +N +L+ Sbjct: 546 CFEGYTLGEDGKSCKSERKCGEGPVDLVFVIDGSKSLGEN------NFEIVKQFVNGILD 599 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKY 265 +++ +V GL+ +S I F + S +++ + + G + + LK Sbjct: 600 SLEISQKAAHV---GLIQYSTHIRTEFTMAQYSSAKDVKKAVSQIKYMGRGSMTGLALKL 656 Query: 266 AYNQIFD-MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + F +QG R ++ + TDG + Y +AK+ G +YA Sbjct: 657 MHEKSFSEVQGARPRAMG----VPRVAIVFTDGR-----AQDEVSEYAKKAKQSGITIYA 707 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSM 354 IG+ E ++P E+ +M Sbjct: 708 IGVGKAIDEELQEIASAPQEKHVIYAEDFSAM 739 >gi|157819693|ref|NP_001101483.1| matrilin-3 [Rattus norvegicus] gi|149050910|gb|EDM03083.1| matrilin 3 (predicted) [Rattus norvegicus] Length = 463 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 74/213 (34%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + LD++ ++D SRS+ TK+ + I + + Sbjct: 62 PYSGVRGSGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGA 112 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +++ ++ F L L++ + ++ T S ++ A + F ++ Sbjct: 113 TDTRVAVVNYASTVKIEFQLNTYSNKQALKQAVARITPLSTGTMSGLAIQTAMEEAFTVE 172 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N K+ + +TDG Q A+ G +YA+G+ Sbjct: 173 ---AGARGPTSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES 224 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + + VE + F Sbjct: 225 -LKMMASKPLEDHVFYVETYGVIEKLSARFQET 256 >gi|297299828|ref|XP_002805471.1| PREDICTED: matrilin-2-like [Macaca mulatta] Length = 897 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 84/209 (40%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + K+ + I L +V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTH---DYAKVKEFLMDILQFL------DIGPDVT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQR 234 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 61/157 (38%), Gaps = 19/157 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + + TDG + + ++AK G Sbjct: 755 LSTRVPRAAIVFTDG-----RAQDDVSEWASKAKANG 786 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 92/209 (44%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD+ Sbjct: 43 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---T 93 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 94 RVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 151 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG ++A+ G +++AIG+ + + L+ Sbjct: 152 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVASKARNTGILIFAIGVGQVDLNT-LK 204 Query: 338 ACASP---NSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 205 AIGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 21/197 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + +++ + + P + GL+ Sbjct: 605 KRCTEGPIDLVFVIDGSKSLGE------ENFEIVKHFVTGIIDSLAVSPKAA---RVGLL 655 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L+ S +++ + ++ G + + LK+ Y + F Sbjct: 656 QYSTQVRTEFTLKGFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVE---GARP 712 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + +AK G +YA+GI E + P Sbjct: 713 LSTRVPRAAIVFTDGR-----AQDDVSEWARKAKANGITMYAVGIGKAIEEELQEIASEP 767 Query: 343 --NSFYLVENPHSMYDA 357 + E+ +M + Sbjct: 768 IDKHLFYAEDFSTMGEI 784 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 92/209 (44%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD+ Sbjct: 43 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---T 93 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 94 RVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 151 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG ++A+ G +++AIG+ + + L+ Sbjct: 152 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVASKARNTGILIFAIGVGQVDLNT-LK 204 Query: 338 ACASP---NSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 205 AIGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 21/197 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + +++ + + P + GL+ Sbjct: 633 KRCTEGPIDLVFVIDGSKSLGE------ENFEIVKHFVTGIIDSLAVSPKAA---RVGLL 683 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L+ S +++ + ++ G + + LK+ Y + F Sbjct: 684 QYSTQVRTEFTLKGFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVE---GARP 740 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + +AK G +YA+GI E + P Sbjct: 741 LSTRVPRAAIVFTDGR-----AQDDVSEWARKAKANGITMYAVGIGKAIEEELQEIASEP 795 Query: 343 --NSFYLVENPHSMYDA 357 + E+ +M + Sbjct: 796 IDKHLFYAEDFSTMGEI 812 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 92/209 (44%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD+ Sbjct: 43 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFILDILQFLDIGPDI---T 93 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 94 RVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 151 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG ++A+ G +++AIG+ + + L+ Sbjct: 152 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVASKARNTGILIFAIGVGQVDLNT-LK 204 Query: 338 ACASP---NSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 205 AIGSEPHKDHVFLVANFSQIESLTSVFQN 233 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 21/197 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + +++ + + P + GL+ Sbjct: 605 KRCTEGPIDLVFVIDGSKSLGE------ENFEIVKHFVTGIIDSLAVSPKAA---RVGLL 655 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L+ S +++ + ++ G + + LK+ Y + F Sbjct: 656 QYSTQVRTEFTLKGFSSAKDMKKAVAHMKYMGKGSMTGLALKHMYERSFTQVE---GARP 712 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + +AK G +YA+GI E + P Sbjct: 713 LSTRVPRAAIVFTDGR-----AQDDVSEWARKAKANGITMYAVGIGKAIEEELQEIASEP 767 Query: 343 --NSFYLVENPHSMYDA 357 + E+ +M + Sbjct: 768 IDKHLFYAEDFSTMGEI 784 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 91/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 45 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 95 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 96 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 153 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + + L+ Sbjct: 154 -GARPLRENVLRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVDLNT-LK 206 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 207 AIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 79/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 648 KRCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAVSPKAA---RVGLL 698 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 699 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 755 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++A+ G +YA+G+ E + P Sbjct: 756 LSTRVPRVAIVFTDGR-----AQDDVSEWASKAQANGITMYAVGVGKAIEEELQEIASEP 810 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 811 TDKHLFYAEDFSTMGEISEKLKKGICE 837 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 91/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 45 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 95 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 96 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 153 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + + L+ Sbjct: 154 -GARPLRENVLRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVDLNT-LK 206 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 207 AIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 79/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 648 KRCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAVSPKAA---RVGLL 698 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 699 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 755 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++A+ G +YA+G+ E + P Sbjct: 756 LSTRVPRVAIVFTDGR-----AQDDVSEWASKAQANGITMYAVGVGKAIEEELQEIASEP 810 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 811 TDKHLFYAEDFSTMGEISEKLKKGICE 837 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 91/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 45 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 95 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 96 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 153 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + + L+ Sbjct: 154 -GARPLRENVLRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVDLNT-LK 206 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 207 AIGSEPHEDHVFLVANFSQMESLTSVFQN 235 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 79/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 648 KRCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLAVSPKAA---RVGLL 698 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 699 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQVE---GARP 755 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++A+ G +YA+G+ E + P Sbjct: 756 LSTRVPRVAIVFTDGR-----AQDDVSEWASKAQANGITMYAVGVGKAIEEELQEIASEP 810 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 811 TDKHLFYAEDFSTMGEISEKLKKGICE 837 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 100 bits (250), Expect = 2e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 16 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 66 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 67 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 121 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 174 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 175 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 202 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 404 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 454 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 455 QYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 514 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 515 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 566 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 567 VEKHFFYSAD----FTAISQIAENL 587 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 16 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 66 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 67 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 121 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 174 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 175 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 202 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 78/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 445 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 495 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + T + LK+ F + Sbjct: 496 QYSSRVRTEFPLSMYHSKDEIKKAEMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 555 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 556 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 607 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 608 VEKHFFYSAD----FTAISQIAENL 628 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 606 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 656 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 657 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 713 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 714 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 769 TNKHLFYAEDFSTMDEISEKLKKGICE 795 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|119612173|gb|EAW91767.1| matrilin 2, isoform CRA_c [Homo sapiens] Length = 451 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 606 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 656 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 657 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 713 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 714 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 768 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 769 TNKHLFYAEDFSTMDEISEKLKKGICE 795 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 FSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 104 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 154 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 155 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 212 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 213 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 265 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 266 SIGSEPHEDHVFLVANFSQIETLTSVFQK 294 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 707 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 757 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 758 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 814 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 815 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 869 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 870 TNKHLFYAEDFSTMDEISEKLKKGICE 896 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 29 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 79 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 80 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 134 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 187 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 188 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 215 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 581 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 631 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 632 QYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 691 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 692 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 743 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 744 VEKHFFYSAD----FTAISQIAENL 764 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 29 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 79 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 80 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 134 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 187 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 188 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 215 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 704 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 754 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 755 QYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 814 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 815 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 866 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 867 VEKHFFYSAD----FTAISQIAENL 887 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 29 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 79 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 80 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 134 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 187 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 188 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 215 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 79/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 704 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 754 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+++ F L +++ + + T + LK+ F + Sbjct: 755 QYSSRVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 814 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 815 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 866 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 867 VEKHFFYSAD----FTAISQIAENL 887 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 100 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 41/209 (19%), Positives = 84/209 (40%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + K+ I I L +V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTH---DYAKVKEFIMDILQFL------DIGPDVT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLKENVPRVIMIVTDGRPQD-----SVAEVAAKARNTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 97.1 bits (240), Expect = 4e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 79/207 (38%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 ERCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TDKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 16 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELDIGLAA---TRIGVVQY 66 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 67 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 121 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 174 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 175 EDHVFLVES----FDLIHQFGLQFQDKLCGMD 202 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 10/72 (13%), Positives = 36/72 (50%), Gaps = 9/72 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +++++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 486 GTCRTSNINLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 536 Query: 225 TFSNKIEEFFLL 236 +S+++ F L Sbjct: 537 QYSSRVRTEFPL 548 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 90/209 (43%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 197 TALLEGSCGNKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 247 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 248 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 305 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + + L+ Sbjct: 306 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVDLNT-LK 358 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N S+ F + Sbjct: 359 AIGSEPHEDHVFLVANFSQIESLTSVFQN 387 Score = 95.2 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 79/197 (40%), Gaps = 21/197 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ LD++ V+D S+S+ ++ + + A+++ + + P + GL+ Sbjct: 800 KRCTEGPLDLVFVIDGSKSLGE------DNFEIVKQFLTAIIDSLAVSPKAA---RVGLL 850 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L + +++ + ++ G + + LK+ + + F + Sbjct: 851 QYSTQVRAEFTLRSFSTAKDMKKAVAHMKYMGKGSMTGQALKHMFERSFTQ---LEGARP 907 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + TDG + N+AK G +YA+G+ E + P Sbjct: 908 LSARVPRVAIVFTDGR-----AQDDVSEWANKAKANGITMYAVGVGKAIEEELQEIASEP 962 Query: 343 --NSFYLVENPHSMYDA 357 + E+ +M + Sbjct: 963 TDKHLFYAEDFSTMNEI 979 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 75/205 (36%), Gaps = 26/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + LD++ ++D SRS+ + + ++ M++ + + + ++ Sbjct: 25 AACKNQPLDLVFIVDSSRSVRP------EEFEKVKIFLSEMIDTLDVGE---RTTRVAVM 75 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +++ ++ F L + ++ + + T + ++ A ++F + Sbjct: 76 NYASTVKVEFPLRTYFDKASMKEAVSRIEPLSAGTMTGLAIQTAMEEVFTEE---MGTRP 132 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 N ++++ +TDG +D + A+ G +Y +G+ LR AS Sbjct: 133 AAFNIPRVVIVVTDGRPQDQVQDVAA-----SARTAGIEIYTVGVGRADMQA-LRIMASE 186 Query: 342 --PNSFYLVEN---PHSMYDAFSHI 361 + VE + F Sbjct: 187 PLDEHVFYVETYGVIEKLTSRFRET 211 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 99.8 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 46/197 (23%), Positives = 86/197 (43%), Gaps = 19/197 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RLD++ V+D S S++ + D+ + M+ VK ++ VQ G V +S + Sbjct: 798 RLDIVFVIDGSGSIDP------KEYDIMKEF---MISLVKKSDVSHDRVQFGAVKYSAEP 848 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E FF L + + R I+ G +T + L+++ + G R+H Sbjct: 849 ETFFYLNRYTTKSAIIRAIQNDKSIGETTYTAKALRHSEGLFSEEHGSRKHRG-----VP 903 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +TDG++ E + + + G I+YAIGI R E L S + ++ V Sbjct: 904 QVLIVITDGDSHDAAELDEVSK---KLRANGIIIYAIGIERARPDELLTMAGSEDKYFYV 960 Query: 349 ENPHSMYDAFSHIGKDI 365 + + I + I Sbjct: 961 NTFEGLKHLYPRISEKI 977 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V + + + D++ ++D SRS+ + K+ + + V + + G Sbjct: 604 VEACKEKKSDILFLVDSSRSIS---PDNFLKMKDFMSEL------VNKSDIGLDRMHVGA 654 Query: 224 VTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + FS++ +E F L S + R I +S G ST + L++ + ++G R + Sbjct: 655 IQFSSRNKEEFRLSQYATKSDIIRAIGRMSLMGQSTLTGGALQFVLDYFRPIKGSRPY-- 712 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 KKI++ +TDGE ++ G IVY++G+ + + Sbjct: 713 -----VKKILILITDGE-----AQDDVKTPAEALRQEGIIVYSVGVFNANRTQLVEISGK 762 Query: 342 PNSFYLVENPHSMYDAFSHIGKDI 365 P + VE+ +D H+ +I Sbjct: 763 PEMVFYVED----FDILKHLENEI 782 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 83/208 (39%), Gaps = 23/208 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S S+ DS TK+ L++V D + VQ G+ Sbjct: 981 SKPECGIPADLVFLIDGSNSIS---DSDFTKMK-------NFLQDVVRPFDTGHNVQVGI 1030 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S++ + F L L+ +I + + G+ T L N G R Sbjct: 1031 AQYSDRYRKEFSLNMFSHKLELETQIGRIRQMEGLQTYIGAALDRVRNFFTPEGGSRV-- 1088 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + N ++I++ +TDG + + + +K+G +YAIG+ I + + Sbjct: 1089 ---NENIQQILLVITDGR-----SHDKVVKAAEDLRKKGVDIYAIGVGRIDHLQLSQIAG 1140 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 S + Y V+N + + DI + Sbjct: 1141 SSDRKYTVDNFSELKVIKKRLVDDICEE 1168 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 54/378 (14%), Positives = 116/378 (30%), Gaps = 36/378 (9%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + + ++T P +L M + +H+ + ++ + + E + Sbjct: 253 KDKCMRIGLVTYSNKPQATSLLRMATDKTHVLQSIQSISPKGGKANLGSVIHFTKEKVFS 312 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNG-----------FVNDIDDIVRSTSLDIVVV 121 + + +I + F ID + IV Sbjct: 313 KSAGSRKSQGVEQIAIVITHRSSEDDVSGAATALRRAGVTIFAIGIDAANTTQLAQIVAY 372 Query: 122 PQNEGYSISAI---SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 P + + K + + ++ +K D+ ++ Sbjct: 373 PPEKHLTRLKTFSHLPNKTQIFHKKIMNQIQQKLYLDAKRAELLKRGCVDTEEADIYFLI 432 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLE 237 S SM F + + +K + N V+ G+V ++ E EF E Sbjct: 433 GGSSSMNYF---DFADLKLFLKEVVKFFMV------GPNKVRFGVVQYAEINELEFGPEE 483 Query: 238 WG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 +G S + + I+ + + + ++ LK+ + + QG N +V +TD Sbjct: 484 YGKTSDILKAIENIRQLRGNPHTGKALKFIHPLLRKSQG------QHSRNVPCHLVVLTD 537 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 + + K +YAIGIR + S + Y V + S+ Sbjct: 538 -----QISEDPVKEPAKKLKNEMVSIYAIGIRHANESQIYEIAESKDRAYFVNDFASLKH 592 Query: 357 AFSHIGKDIVTKRIWYDK 374 + + +DI +K Sbjct: 593 IRNEVVRDICAVEACKEK 610 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 83/215 (38%), Gaps = 29/215 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + DA D+ ++DV S ++I KI + ++ + V + + + Sbjct: 209 GTALEACDADAVADVAFIVDVGSS-----QTNIEKIQIFLEKL------VSSLDVKDKCM 257 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLVT+SNK + LL +H+ + I+ +S G N + + ++F Sbjct: 258 RIGLVTYSNKPQATSLLRMATDKTHVLQSIQSISPKGGKANLGSVIHFTKEKVFSKSAGS 317 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + ++I + +T + ++ G ++AIGI + + + Sbjct: 318 R----KSQGVEQIAIVIT-----HRSSEDDVSGAATALRRAGVTIFAIGIDAANTTQLAQ 368 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGK--DIVTKRI 370 A Y E + FSH+ I K+I Sbjct: 369 IVA-----YPPEKHLTRLKTFSHLPNKTQIFHKKI 398 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 55/172 (31%), Gaps = 17/172 (9%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGV 255 I+ + + L P+ ++ L FS+++ F L+ G + + IK F Sbjct: 43 VKTFISKTINSLPLGPNE---YRAALAQFSDQVYNEFHLDTYKGKNQMLNHIKSKLVFKG 99 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T N + +Q + N KI++ MT + Sbjct: 100 GPLKTG------NALRKIQESVYWKSITGQNRSKILIVMT-----PQPSEDDVKEAAQML 148 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +K G A+G+ E P SF+ + +I IV Sbjct: 149 QKSGVKTVALGMENASHDELFLIATKPYSFFFPI-MEELSLFSQNISGIIVD 199 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 99.5 bits (246), Expect = 7e-19, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 89/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHPSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|301758388|ref|XP_002915048.1| PREDICTED: matrilin-3-like [Ailuropoda melanoleuca] Length = 466 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 83/234 (35%), Gaps = 28/234 (11%) Query: 139 LKFCTFIPWYTNSRHIVMPITS----SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 + +T S+H M ++ S++ LD++ ++D SRS+ TK Sbjct: 25 FEPACSKVEFTGSQHGTMQASALPFWSLQAGVCKSRPLDLVFIIDSSRSVRPL---EFTK 81 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSK 252 + + I + + + +V +++ ++ F L+ L++ + ++ Sbjct: 82 VKTFVSQI------IDTLDIGAADTRVAVVNYASTVKTEFHLQTYSDKQSLKQAVARITP 135 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T S ++ A ++ F ++ +N K+ + +TDG Q Sbjct: 136 LSTGTMSGLAIQTAMDEAFTVE---AGARGPTSNIPKVAIIVTDGRPQD-----QVNEVA 187 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVEN---PHSMYDAFSHI 361 A+ G +YA+G+ + P + VE + F Sbjct: 188 ARARASGIELYAVGVDRADMESLKMIASEPLDEHVFYVETYGVIEKLSSRFQET 241 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 80/204 (39%), Gaps = 26/204 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + +D++ ++D SRS+ + + + ML+ + + PD + GL+ Sbjct: 44 SACYNKPMDLVFIIDSSRSVRP------ADFEKVKEFLITMLKFLDIGPDN---TRVGLL 94 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ F L+ ++R +K + T + ++YA N F Sbjct: 95 QYGSTVKNEFSLKTYKRKPDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAE---GARP 151 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + +I + +TDG +A+ G +++AIG+ + L+ S Sbjct: 152 LNQYVPRIAMIVTDGRPQDP-----VAEIAAKARNSGILIFAIGVGRVD-MSTLKTIGSQ 205 Query: 342 --PNSFYLVEN---PHSMYDAFSH 360 +LV N ++ F + Sbjct: 206 PHSEHVFLVANFSQIETLTSVFQN 229 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 77/212 (36%), Gaps = 23/212 (10%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + S + +D++ V+D S+S+ ++ + + +L+ Sbjct: 546 CLEGYSLGEDGKSCKGEKKCGEGPVDLVFVIDGSKSLGE------DNFEIVKQFVKGILD 599 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKY 265 +++ + GL+ +S + F + S +++ + + G + + LK Sbjct: 600 SLEISQKAA---RVGLIQYSTHVRTEFTMAQYSSAKDVKKAVSQIKYMGRGSMTGLALKL 656 Query: 266 AYNQIFD-MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + F QG R ++ + TDG + Y +AK+ G +YA Sbjct: 657 MHEKSFSEAQGARARPM----RVPRVAIVFTDGR-----AQDEVSEYAEKAKQSGITIYA 707 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSM 354 IGI E ++P E+ +M Sbjct: 708 IGIGKAIDEELQEIASAPQEKHVIYAEDFSAM 739 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 99.5 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 13/153 (8%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + ++ L S L I ++ G +TN+T GL + +N + G AN Sbjct: 286 NCNLQTIMPLSTNWSALNSHIDAMASAG-NTNTTIGLAWGWN-MLTQGGPLSSAAAPAAN 343 Query: 287 YKKIIVFMTDGEN-------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RA 338 K+IVF+TDG+N S + ++ CN K G VY++ RVI + L R Sbjct: 344 LDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYSV--RVIEGNATLIRN 401 Query: 339 CAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 CA+ P +Y V + F+ I + + RI Sbjct: 402 CATEPGMYYSVTTASELTSVFASIAQSLSNLRI 434 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 70/221 (31%), Gaps = 37/221 (16%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + + +G + AI + + G +++S + +++ L + +D S + Sbjct: 11 LWQDRRGNFAAIFAIAIIPVVAAAGATVDISRAYIVESRLKAALDASALAVGGA------ 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + SF N + + S S ++V Sbjct: 65 ---------TGMTTSQMQAMAQSFFNANYPASKLGVPGTLSVSQSGNVV----------- 104 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + T + + + TS V + L++ +VLD + SM S Sbjct: 105 ---SLSVHAQLPTTLMGVVGINTLNVSATSQVTRMGKK---LEVALVLDNTGSMASGG-- 156 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++ + + ++ V V+ +V F+ + Sbjct: 157 ---RMTVLKTAAKNLITTVSAAATNPGDVKVAIVPFNVDVN 194 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 99.1 bits (245), Expect = 9e-19, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 74/209 (35%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + LD++ ++D SRS+ TK+ + I + + + Sbjct: 72 ASGAGVCKSRPLDLVFLIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTR 122 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ F L+ L++ + ++ T S ++ A +++F + Sbjct: 123 VAVVNYASTVKIEFQLQAYTDKQSLKQAVGQITPLSTGTMSGLAIQTAMDEVFTAE---A 179 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +N K+ + +TDG Q A+ G +YA+G+ L+ Sbjct: 180 GARGPSSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMQS-LKM 233 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + VE + F Sbjct: 234 MASEPLEEHVFYVETYGVIEKLSSRFQET 262 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 80/212 (37%), Gaps = 30/212 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D SRS+ + + K + ++ E+ + + G+V + Sbjct: 16 CKSGPVDLVFIIDGSRSVRPH------EFETMRKFMIDIIHELNIGLAA---TRIGVVQY 66 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++++ F L+ + + I + T + ++YA N F + Sbjct: 67 SSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEE------GAR 120 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N + V +TDG + + + G +YA+G+ LRA ASP Sbjct: 121 PNVPHVAVIVTDGRPQDRVAEVAAAAR-----ESGIEIYAVGVARADMTS-LRAMASPPF 174 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +LVE+ +D G K D Sbjct: 175 EDHVFLVES----FDLIHQFGLQSQDKLCGMD 202 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 11/73 (15%), Positives = 36/73 (49%), Gaps = 9/73 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 363 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 413 Query: 225 TFSNKIEEFFLLE 237 +S+++ F L Sbjct: 414 QYSSRVRTEFPLS 426 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 99.1 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 88/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D++ ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADLVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++A G+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDTGILIFAFGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 A S + +LV N ++ F Sbjct: 206 AIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 76/207 (36%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D+ V+D S+ + ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLAFVIDGSKGLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + L++ + + F Sbjct: 698 QYSTQVRTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALRHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TTKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 98.7 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 60/378 (15%), Positives = 125/378 (33%), Gaps = 38/378 (10%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 LN + +G + IL A+ LP+I + ++V+++ +T L D + A Sbjct: 8 LNRKLGRISR-RRGAVLILIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGAR 66 Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + E + + K ++ + G DD+ R T + Sbjct: 67 TLSLEQDASLAHKSAIEYAAKNNVAGNTLTLADSDVQIGLSVRTDDVGRFTFSSGGKLLN 126 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 +++ + P P+ + D+ +V+D S S Sbjct: 127 --SVNVTGRRTQQAPDGAVRLYLTPIFGHEFFQPVADATASQIDR----DIALVVDRSGS 180 Query: 184 MESFFD--------------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 M + S + + S++ L E+ P + V L T+++ Sbjct: 181 MTFRINRNSYESGWRNNDPVPSRARWWALVDSVDGFLTELGSTPQLELV---SLSTYNSS 237 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L S ++ + S+ STN T G+ + + + + R + + Sbjct: 238 AKIDEQLTDKYSRIEDALDDYSRRYPDGSTNITAGMDRGISTLQNKKYARPYAS------ 291 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSF 345 K +V MTDG + + Y +A +V+ I + +R A Sbjct: 292 -KTMVVMTDGNHNYGSSPTNAAY---DAASDDIVVHTITYSDGANQSLMREVARIGGGQH 347 Query: 346 YLVENPHSMYDAFSHIGK 363 + + + + F I + Sbjct: 348 WHAPDGDELEEIFREIAR 365 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 56/334 (16%), Positives = 106/334 (31%), Gaps = 45/334 (13%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N KG + I+ A+ L ++ + ++V + ++ L +D + A I N G Sbjct: 6 KNQKGAVLIIFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPYLGE 65 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + E G++ D RS + + I Sbjct: 66 DGHLR------------LAEEVARENFSAGYLMTPDSGERSATFTAYADEDHR---IRVE 110 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 P + + K +++M+VLD S SM Sbjct: 111 GTVSSP----GNLAGLFGVDWVATSAMGVAK-----KNEVEIMLVLDRSGSM------DG 155 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-VSHLQRKIKYLS 251 T ++ K+ + + + D + + GLV+F+ ++ L VS + KI + Sbjct: 156 TPMNDLKKAARSFVSFFEETQDQD---KMGLVSFATSVKVDVPLGNNYVSSMTSKINAMD 212 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-------E 304 G TN+ L A N + + ++ ++F +DG + + Sbjct: 213 AVGA-TNAEDSLSQAGNPAKGGLTDQSGV-PGNKRVQQFVIFFSDGNPTAFRGKFKYNGT 270 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 D C G VY + E+L Sbjct: 271 DNIDAVVCGTGNDCG-TVYT-KLGKPEREEWLSY 302 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 13/69 (18%), Positives = 26/69 (37%) Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 T ++ + E K +Y IG+ I + + P+ + + F+ I Sbjct: 378 TTARGMAVEHAQELKDNNVKIYTIGLGNIDRDFLSQIASGPSFEFYAPTSGELQAIFNKI 437 Query: 362 GKDIVTKRI 370 KDI + + Sbjct: 438 AKDIKLRLV 446 >gi|148234427|ref|NP_001080470.1| matrilin 2 [Xenopus laevis] gi|28175657|gb|AAH45220.1| Matn2-prov protein [Xenopus laevis] Length = 589 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 84/204 (41%), Gaps = 26/204 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + LD++ ++D SRS+ + + + ML+ + + PD + GL+ Sbjct: 44 SACHNKPLDLVFIIDSSRSVRP------ADFEKVKEFLITMLKFLDIGPD---TTRVGLL 94 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 QYGSTVKNEFSLKMYKRKSDIERAVKRMMHLATGTMTGLAIQYAMNIAFSEAE---GARP 151 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + +I + +TDG E+ + +A+ G +++AIG+ + L+ S Sbjct: 152 LNQYVPRIAMIVTDGRPQDPVEEISA-----KARMSGILIFAIGVGRVD-MSTLKTIGSE 205 Query: 342 --PNSFYLVEN---PHSMYDAFSH 360 +LV N ++ F + Sbjct: 206 PHSEHVFLVANFSQIETLTSVFQN 229 >gi|332253805|ref|XP_003276022.1| PREDICTED: matrilin-3 [Nomascus leucogenys] Length = 488 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 75 GAGVCKSRPLDLVFIIDSSRSVR---PPEFTKVKTFVSRI------IDTLDIGPADTRVA 125 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 126 VVNYASTVKIEFQLQAYTDKQSLRQAVGRITPLSTGTMSGLAIQTAMDEAFTVE---AGA 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 183 RDPSSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADLKS-LKMMA 236 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 237 SEPLEEHVFYVETYGVIEKLSSRFQET 263 >gi|296224421|ref|XP_002758053.1| PREDICTED: matrilin-3 isoform 2 [Callithrix jacchus] Length = 445 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 75/217 (34%), Gaps = 27/217 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 74 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 124 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 125 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTLE---AGA 181 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 182 RGPSSNVPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LKMMA 235 Query: 341 S---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S + VE Y + DK Sbjct: 236 SEPLEEHVFYVET----YGVIEKLSSRFQETFCALDK 268 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 78/213 (36%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T LD++ ++D SRS+ + K+ + + E V + + Sbjct: 188 PPTVPAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVDSLDIGS 238 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + + LV +++ + F L+ + + +++ + T + +K A +Q+F Sbjct: 239 DATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMDQVFTEN 298 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ + +TDG E+ + A+ G +YA+G+ Sbjct: 299 AGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEMRS 350 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + + VE + F Sbjct: 351 -LKQMASQPLDDHVFYVETYGVIEKLTSKFRET 382 >gi|148665999|gb|EDK98415.1| matrilin 3 [Mus musculus] Length = 482 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + LD++ ++D SRS+ TK+ + I + + Sbjct: 62 PYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGA 112 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +++ ++ F L L++ + ++ T S ++ A + F ++ Sbjct: 113 TDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAIQTAMEEAFTVE 172 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N K+ + +TDG Q A+ G +YA+G+ Sbjct: 173 ---AGARGPMSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES 224 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + VE + F Sbjct: 225 -LKMMASKPLEEHVFYVETYGVIEKLSARFQET 256 >gi|306518578|ref|NP_034900.4| matrilin-3 precursor [Mus musculus] gi|6918887|emb|CAB72265.1| matrilin-3 [Mus musculus] gi|26339344|dbj|BAC33343.1| unnamed protein product [Mus musculus] Length = 481 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + LD++ ++D SRS+ TK+ + I + + Sbjct: 62 PYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGA 112 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +++ ++ F L L++ + ++ T S ++ A + F ++ Sbjct: 113 TDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAIQTAMEEAFTVE 172 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N K+ + +TDG Q A+ G +YA+G+ Sbjct: 173 ---AGARGPMSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES 224 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + VE + F Sbjct: 225 -LKMMASKPLEEHVFYVETYGVIEKLSARFQET 256 >gi|14548114|sp|O35701|MATN3_MOUSE RecName: Full=Matrilin-3; Flags: Precursor gi|2342635|emb|CAA71532.1| matrilin-3 [Mus musculus] Length = 481 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + LD++ ++D SRS+ TK+ + I + + Sbjct: 62 PYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGA 112 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +++ ++ F L L++ + ++ T S ++ A + F ++ Sbjct: 113 TDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAIQTAMEEAFTVE 172 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N K+ + +TDG Q A+ G +YA+G+ Sbjct: 173 ---AGARGPMSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES 224 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + VE + F Sbjct: 225 -LKMMASKPLEEHVFYVETYGVIEKLSARFQET 256 >gi|296224419|ref|XP_002758052.1| PREDICTED: matrilin-3 isoform 1 [Callithrix jacchus] Length = 487 Score = 98.3 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 74 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 124 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 125 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTLE---AGA 181 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 182 RGPSSNVPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LKMMA 235 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 236 SEPLEEHVFYVETYGVIEKLSSRFQET 262 >gi|311253076|ref|XP_003125384.1| PREDICTED: matrilin-3-like [Sus scrofa] Length = 488 Score = 97.9 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 74/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 75 GAGICKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDNLDIGAEDTRVA 125 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L+R + ++ T S ++ A ++ F ++ Sbjct: 126 VVNYASTVKIEFHLQTHSDKQALKRAVARIAPLSTGTMSGLAIQTAMDEAFTVE---AGA 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++N K+ + +TDG Q A+ G +YA+G+ LR A Sbjct: 183 RGPNSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LRLMA 236 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 237 SEPLDEHVFYVETYGVIEKLSSRFQET 263 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 97.5 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 88/209 (42%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ +S + R D + ++D SRS+ + + + + I +L+ + + PD V Sbjct: 44 TALLESSCENKRADPVFIIDSSRSVNTHDYAKV------KEFIVDILQFLDIGPD---VT 94 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L+ S ++R +K + T + ++YA N F Sbjct: 95 RVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIAFSEAE-- 152 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N ++I+ +TDG +A+ G +++AIG+ + L+ Sbjct: 153 -GARPLRENVPRVIMIVTDGRPQD-----SVAEVAAKARDMGILIFAIGVGQVD-FNTLK 205 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSH 360 + S + +LV N ++ F Sbjct: 206 SIGSEPHEDHVFLVANFSQIETLTSVFQK 234 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 647 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 697 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 698 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 754 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 755 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 809 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 810 TNKHLFYAEDFSTMDEISEKLKKGICE 836 >gi|297668115|ref|XP_002812300.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-3-like [Pongo abelii] Length = 493 Score = 97.5 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 74/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 80 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 130 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++IF ++ Sbjct: 131 VVNYASTVKIEFQLQTYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEIFTVE---AGA 187 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 188 RGPSSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMES-LKMMA 241 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 242 SEPLEEHVFYVETYGVIEKLSSRFQET 268 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 97.5 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 26/209 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S N LD + V+D SRS+ + I +++ +++ P Sbjct: 41 SRAVGNPCKAVPLDFVFVIDSSRSIRP------RDYEKVKTFIVNLVQFLEVGP---EAT 91 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ + + ++ F L + +++ ++ + T + ++YA F Sbjct: 92 RVGLLQYGSVVQPEFSLSTFSTKAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEAD-- 149 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +I V +TDG + +A++ G ++AIG+ + L+ Sbjct: 150 -GARPAHLHIPRIAVVVTDGRPQD-----RVEEVAAQARQAGIQIFAIGVGRVDMKT-LK 202 Query: 338 ACAS---PNSFYLV---ENPHSMYDAFSH 360 S +LV ++ F Sbjct: 203 TIGSEPHSEHVHLVASFSQMETLVSVFQS 231 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 84/242 (34%), Gaps = 57/242 (23%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K ++ D +D++ ++D S+SM + + + +++E + + P +V G Sbjct: 397 KRSACADGAMDLVFMIDGSKSMGP------ANFERVKQFVISIVESLDVSPTGAHV---G 447 Query: 223 LVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ +S + F L S +++ + + G + + L+ + F + Sbjct: 448 LLQYSTNVRTEFTLSQHTSAQGIRQAVSRMQYMGRGSMTGSALRRMFQSSFSAEE----- 502 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-------------------- 320 N ++ V TDG + + + +AK G Sbjct: 503 -GARPNVPRVSVVFTDG-----RSQDDASEWAKKAKNSGIPGSFSYFGGTGRFLSCSFLL 556 Query: 321 ----IVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGK--DIVTKRIWY 372 +YA+G+ E + P Y + F +G+ + + R+ Sbjct: 557 VLGVTIYAVGVGKAIEQELREIASEPEEKHLYYAQE-------FKDVGEITEKLKSRMCT 609 Query: 373 DK 374 DK Sbjct: 610 DK 611 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 97.1 bits (240), Expect = 3e-18, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T LD++ ++D SRS+ + K+ + + E V + + Sbjct: 188 PPTVPAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVNSLDIGS 238 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + + LV +++ + F L+ + + +++ + T + +K A Q+F Sbjct: 239 DATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTEN 298 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ + +TDG E+ + A+ G +YA+G+ Sbjct: 299 AGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEMRS 350 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + + VE + F Sbjct: 351 -LKQMASQPLDDHVFYVETYGVIEKLTSKFRET 382 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 97.1 bits (240), Expect = 3e-18, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 77/213 (36%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T LD++ ++D SRS+ + K+ + + E V + + Sbjct: 188 PPTVPAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVNSLDIGS 238 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + + LV +++ + F L+ + + +++ + T + +K A Q+F Sbjct: 239 DATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTEN 298 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ + +TDG E+ + A+ G +YA+G+ Sbjct: 299 AGARPLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEMRS 350 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + + VE + F Sbjct: 351 -LKQMASQPLDDHVFYVETYGVIEKLTSKFRET 382 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 96.8 bits (239), Expect = 4e-18, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 82/209 (39%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + LD++ ++D SRS+ + + + M++ + + PD + Sbjct: 52 AATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVGPDA---TR 102 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ FLL+ ++ +++ I + T + +K A ++ F + Sbjct: 103 VAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFT---EKS 159 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + N K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 160 GARPKSKNISKVAIIVTDGRPQDQVEEVSAA-----ARASGIEIYAVGVDRADMRS-LKL 213 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 214 MASNPLEDHVFYVETYGVIEKLTSKFRET 242 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 96.8 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 29/215 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +PI + +Q +D++ VLD S SM + KI + + ++ + Sbjct: 515 EVPINKGEREVNQQIKYIDLVFVLDSSGSMSWNDPNGYRKI-----AAKSFVDALIQGD- 568 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ +V F + L ++ I + +G TN G++ A +Q+ Sbjct: 569 -----RAAVVDFDDYGYLLQPLTTDFQTVKNAIDRIDSWG-GTNIAEGIRIANHQLISQS 622 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+I+ +TDGE EAK G +Y IG+ Sbjct: 623 ---------SDDRIKVIILLTDGEGYYDN------NLTTEAKNNGITIYTIGLGTSVDEN 667 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 LR A + ++ V + + F I + + Sbjct: 668 LLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTE 702 >gi|194671641|ref|XP_591137.3| PREDICTED: matrilin 3 [Bos taurus] gi|297480578|ref|XP_002691564.1| PREDICTED: matrilin 3 [Bos taurus] gi|296482389|gb|DAA24504.1| matrilin 3 [Bos taurus] Length = 574 Score = 96.8 bits (239), Expect = 6e-18, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 74/210 (35%), Gaps = 26/210 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + LD++ ++D SRS+ TK+ + I + + Sbjct: 158 GARGAGVCKSRPLDLVFIIDSSRSVRPL---QFTKVKTFVSKI------IDTLDIGPMDT 208 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + +V +++ ++ F L+ L+R + ++ T S ++ A ++ F ++ Sbjct: 209 RVAVVNYASTVKIEFHLQTHSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAFTVE--- 265 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +N K+ + +TDG Q A+ G +YA+G+ L+ Sbjct: 266 AGARGPSSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LK 319 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + VE + F Sbjct: 320 MMASEPLDEHVFYVETYGVIEKLSSRFQET 349 >gi|281344227|gb|EFB19811.1| hypothetical protein PANDA_002988 [Ailuropoda melanoleuca] Length = 395 Score = 96.4 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 71/205 (34%), Gaps = 24/205 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 LD++ ++D SRS+ TK+ + I + + + + Sbjct: 1 AGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSQI------IDTLDIGAADTRVAV 51 Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 52 VNYASTVKTEFHLQTYSDKQSLKQAVARITPLSTGTMSGLAIQTAMDEAFTVE---AGAR 108 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +N K+ + +TDG Q A+ G +YA+G+ + Sbjct: 109 GPTSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMESLKMIASE 163 Query: 342 P--NSFYLVEN---PHSMYDAFSHI 361 P + VE + F Sbjct: 164 PLDEHVFYVETYGVIEKLSSRFQET 188 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 406 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 456 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 457 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 513 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 514 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 568 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 569 TNKHLFYAEDFSTMDEISEKLKKGICE 595 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 363 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 413 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 414 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 470 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 471 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 525 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 526 TNKHLFYAEDFSTMDEISEKLKKGICE 552 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 228 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 278 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 279 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 335 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 336 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 390 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 391 TNKHLFYAEDFSTMDEISEKLKKGICE 417 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 96.0 bits (237), Expect = 8e-18, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 432 KKCTEGPIDLVFVIDGSKSLGE------ENFEVVKQFVTGIIDSLTISPKAA---RVGLL 482 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + ++ G + + LK+ + + F Sbjct: 483 QYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGE---GARP 539 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + TDG + ++AK G +YA+G+ E + P Sbjct: 540 LSTRVPRAAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 594 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVT 367 + E+ +M + + K I Sbjct: 595 TNKHLFYAEDFSTMDEISEKLKKGICE 621 >gi|194220937|ref|XP_001501805.2| PREDICTED: matrilin 3 [Equus caballus] Length = 450 Score = 96.0 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 81/242 (33%), Gaps = 27/242 (11%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESF 187 + Y T IP ++ + + + LD++ ++D SRS+ Sbjct: 2 LRVSPAYWRYPSATTLIPLHSGAPTLGPGQPGVARGAGVCKSRPLDLVFIIDSSRSVRPL 61 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQR 245 TK+ + I + + + +V +++ ++ F L L++ Sbjct: 62 ---EFTKVKTFVSQI------IDTLDIGPADTRVAVVNYASTVKIEFHLNTHSDKQSLKQ 112 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + ++ T S ++ A ++ F ++ N K+ + +TDG Sbjct: 113 AVARITPLSTGTMSGLAIQTAMDEAFTVE---AGARVPSYNIPKVAIIVTDGRPQD---- 165 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN---PHSMYDAFS 359 Q A+ G +YA+G+ L+ AS + VE + F Sbjct: 166 -QVNEVAARARASGIELYAVGVDRADMES-LKVIASEPLDEHVFYVETYGVIEKLSSRFQ 223 Query: 360 HI 361 Sbjct: 224 ET 225 >gi|48734898|gb|AAH71224.1| Matrilin 3 [Mus musculus] Length = 481 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 26/213 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + LD++ ++D SRS+ TK+ + I + + Sbjct: 62 PYSGGRGAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLGIGA 112 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +++ ++ F L L++ + ++ T S ++ A + F ++ Sbjct: 113 TDTRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPLSTGTMSGLAIQTAMEEAFTVE 172 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N K+ + +TDG Q A+ G +YA+G+ Sbjct: 173 ---AGARGPMSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES 224 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 L+ AS + VE + F Sbjct: 225 -LKMMASKPLEEHVFYVETYGVIEKLSARFQET 256 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 95.6 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 59/349 (16%), Positives = 118/349 (33%), Gaps = 36/349 (10%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 S + L R+ AA + + + +K + + +I + Sbjct: 296 SEVLQEIQKLSPKAGRAYTGAAMTKVRKEVFSVQKGSRRMQGVPQIAILVAHRPSEDNVS 355 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF----IPWYTNSRHIVM 156 ++ + V ++ + + IS+ K K TF T + + Sbjct: 356 EAALDLRREGVTVFTVGLEGSDDTQLGQISSHPPEKYLSKLKTFSELAAHNQTFQKKLRG 415 Query: 157 PITSSVKVNSQTDARL----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 I V V S+ L D+ +++D S S+ + + ++ + + M Sbjct: 416 EIQHKVSVLSEQTEHLKAGCVETEEADIYLLIDGSGSI---YPTDFQEMKAFLSEVIEMF 472 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLK 264 I V+ G V +S+ E F + + L + I + + G +TN+ L Sbjct: 473 ----TIAPYK--VRVGAVQYSHIQELEFEINKYSNKNDLGKAIDNIWQLGGNTNTGAALD 526 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 F + +++ ++ +TDG D L + K VYA Sbjct: 527 ------FTLGLLQRAKTQRGNKVPCHLIVLTDG-----MSDDNVLEPAKKLKDENINVYA 575 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 IG++ + L + Y V N S+ D + + + I +K D Sbjct: 576 IGVKEANRTQLLEIAGTEKRVYYVYNFDSLKDIKNQVVQGICSKEACKD 624 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 81/196 (41%), Gaps = 19/196 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + + + M++ VK + V+ G + +S Sbjct: 814 LDIVFVIDSSGSIDYNEYNIMKEF---------MIKLVKKADVAKDRVRFGALKYSYDPT 864 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S + ++ S G T + L ++ + + +G R + + Sbjct: 865 ILFYLDEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHMFTEARGSRI-----NQKVPQ 919 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G ++ A+GI + E L S + ++ VE Sbjct: 920 VLIVITDGESHDANQLNAT---AKALRDKGILILAVGIAGANTEELLAMAGSTDKYFFVE 976 Query: 350 NPHSMYDAFSHIGKDI 365 + F ++ I Sbjct: 977 TFGGLKGIFQNVSDSI 992 Score = 89.8 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 84/211 (39%), Gaps = 27/211 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + D + D+M ++D SRS+ + K+ +K++ V + Sbjct: 613 VQGICSKEACKDVKADVMFLVDSSRSIGLE---NFGKMKTFMKNL------VNKSQIGED 663 Query: 218 VVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 VQ G+V FS+ +E F L W + I +S T + LK+ + +G Sbjct: 664 QVQVGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRETLTGSALKFVSDYFHPSKG 723 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K ++ +TDGE+ +D ++ G I+Y++G+ + Sbjct: 724 ARPG-------VRKFLILITDGESQDPVKD-----PAMALRQDGVIIYSVGVYGANETQL 771 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + P + VE +D HI D+V Sbjct: 772 VEISGKPEMIFYVET----FDILKHIEDDLV 798 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 83/216 (38%), Gaps = 22/216 (10%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +++ I KV + + D++ +LD S S+ + +++++ + Sbjct: 986 QNVSDSICGPSKVECKMEKA-DLVFLLDGSNSIYP------ENFKKMKDFLVSVVDDFDI 1038 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 P V GL FS+ F L S + +I+ + +T+ LK Sbjct: 1039 GPS---RVHIGLAQFSHVYRAEFFLGSFTSEGEVSTQIEMTQQVFGNTHIGAALKQVEQY 1095 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 G R + +++++ +TDG+ + + +++G +Y++GI Sbjct: 1096 FRPEMGSRI-----NVGIQQVLLVLTDGQ-----SQDEVAKAAEDLRRKGIDIYSLGIGD 1145 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + + ++ + + ++N + I ++I Sbjct: 1146 VDEQQLIQISGTSDKKLTIDNFDELKKIKKRIVRNI 1181 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 64/177 (36%), Gaps = 20/177 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD + + + ++ + E V N ++ GLVT++++ + Sbjct: 235 DVVFLLD------EYVNGTQENLEHLKGFLE---ESVSSFDVKENCMRIGLVTYTDETKV 285 Query: 233 FFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G S + ++I+ LS + + ++F +Q + +I Sbjct: 286 IHSLSTGTNKSEVLQEIQKLSPKAGRAYTGAAMTKVRKEVFSVQKGSRRM----QGVPQI 341 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + + + + ++ G V+ +G+ + + + P YL Sbjct: 342 AILV-----AHRPSEDNVSEAALDLRREGVTVFTVGLEGSDDTQLGQISSHPPEKYL 393 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 95.2 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 127/396 (32%), Gaps = 64/396 (16%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 F+LN+ + + G ++ I LP++ V I V ++ +T L D + A Sbjct: 19 FTLNVVDKARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAA 78 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL--DIV 119 + G+ + L N +++ I + + Sbjct: 79 GRVLAVTGDKAEAIEAAERL----------------LEANPYLDRTLSIGDADIIFGKSN 122 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDAR 171 +N Y + K + + + M + +K T Sbjct: 123 RTEENRRYEFTPDK------KVNSVGLRAFGADDVPMLFPTMGVPIEFRPIKQAVATQVE 176 Query: 172 LDMMIVLDVSRSM------------------ESFFDSSITKIDMAIKSINAMLEEVKLIP 213 LD+ IVLD S SM ++ K + ++ A+ + ++ Sbjct: 177 LDIAIVLDRSGSMAFSHDEVAKNGSPSSAPPGWKMGHAVPKNARWLDTVAAVNGFLDIME 236 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIF 271 D ++ + L T+S+K + L + ++ + S G +TN G+ + Sbjct: 237 DSSHDERVSLSTYSDKSKADVKLTGDYTEIRAAMNAHSTNFKGGATNIGSGILEGGATLG 296 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-I 330 D R + ++++ M+DG + + + +++ + Sbjct: 297 DKNLARSWAS-------RVLIVMSDG---IHNTGIEPIPAAQQVANEKIMIFTVTFSNEA 346 Query: 331 RSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDI 365 E + S + ++ + +AF I K + Sbjct: 347 NVQEMEKVAVSGGGQHFHAKDSQQLAEAFRKIAKSL 382 >gi|114576315|ref|XP_515709.2| PREDICTED: matrilin-3 [Pan troglodytes] Length = 486 Score = 95.2 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 74/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 73 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 123 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 124 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVE---AGA 180 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG + + A+ G +YA+G+ L+ A Sbjct: 181 REPSSNIPKVAIIVTDGRPQDQVNEVVAR-----AQASGIELYAVGVDRADMES-LKMMA 234 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 235 SEPLEEHVFYVETYGVIEKLSSRFQET 261 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 95.2 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 81/209 (38%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + LD++ ++D SRS+ + + + M++ + + PD + Sbjct: 52 AATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVGPDA---TR 102 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ LL+ ++ +++ I + T + +K A ++ F + Sbjct: 103 VAVVNYASTVKIESLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFT---EKS 159 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + N K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 160 GARPKSKNISKVAIIVTDGRPQDQVEEVSAA-----ARASGIEIYAVGVDRADMRS-LKL 213 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 214 MASNPLEDHVFYVETYGVIEKLTSKFRET 242 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 95.2 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 78/215 (36%), Gaps = 29/215 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +PI + +Q +D++ VLD S SM + KI + + ++ + Sbjct: 481 EVPINKGEREINQQINYIDLVFVLDSSGSMSWNDPNGYRKI-----AAKSFVDALIQGD- 534 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ +V F + L ++ I + +G TN G++ A Q+ + Sbjct: 535 -----RAAVVDFDDFGYLLQPLTTDFQAVKNAIDRIDSWG-GTNIAEGIRIANQQLISLS 588 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+I+ +TDGE EAK G +Y IG+ Sbjct: 589 ---------SEDRIKVIILLTDGEGYYDN------NLTTEAKNNGITIYTIGLGTSVDEN 633 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 LR A + ++ V + + F I + + Sbjct: 634 LLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 80/220 (36%), Gaps = 24/220 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T + LD++ ++D SRS+ + + ++ +++ Sbjct: 127 TGGSQSPGAGGRGPGADICRSRPLDLVFIVDSSRSVRP------REFEKVKTFLSQVIDT 180 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + + +V +++ ++ F L+ L++ + ++ T S ++ A Sbjct: 181 LDIGETA---TRVAVVNYASTVKVEFHLQTHSDKESLKQAVSRIAPLATGTMSGLAIRTA 237 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +++F ++ N K++V +TDG ++ + +A+ G +YA+G Sbjct: 238 MDEVFTVE---AGARAPAFNIPKVVVIVTDGRPQDQVQEAVA-----QAQASGIEIYAVG 289 Query: 327 IRVIRSHEFLRACASP--NSFYLVEN---PHSMYDAFSHI 361 + + + P + VE + F Sbjct: 290 VGRADMQSLRQLASEPVETHAFYVETYGVIEKLTSTFRKT 329 >gi|170672288|gb|ACB29772.1| matrilin-3 alternative transcript [Homo sapiens] Length = 444 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 73 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 123 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 124 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVE---AGA 180 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 181 REPSSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMAS-LKMMA 234 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 235 SEPLEEHVFYVETYGVIEKLSSRFQET 261 >gi|158255148|dbj|BAF83545.1| unnamed protein product [Homo sapiens] Length = 486 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 73 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 123 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 124 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVE---AGA 180 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 181 REPSSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMAS-LKMMA 234 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 235 SEPLEEHVFYVETYGVIEKLSSRFQET 261 >gi|11321565|ref|NP_002372.1| matrilin-3 precursor [Homo sapiens] gi|14548113|sp|O15232|MATN3_HUMAN RecName: Full=Matrilin-3; Flags: Precursor gi|3647275|emb|CAA12110.1| matrilin-3 [Homo sapiens] gi|62630192|gb|AAX88937.1| unknown [Homo sapiens] gi|119621242|gb|EAX00837.1| matrilin 3 [Homo sapiens] gi|146218451|gb|AAI39908.1| Matrilin 3 [Homo sapiens] Length = 486 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 73/207 (35%), Gaps = 26/207 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 LD++ ++D SRS+ TK+ + I + + + Sbjct: 73 GAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSRI------IDTLDIGPADTRVA 123 Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ ++ F L+ L++ + ++ T S ++ A ++ F ++ Sbjct: 124 VVNYASTVKIEFQLQAYTDKQSLKQAVGRITPLSTGTMSGLAIQTAMDEAFTVE---AGA 180 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG Q A+ G +YA+G+ L+ A Sbjct: 181 REPSSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMAS-LKMMA 234 Query: 341 S---PNSFYLVEN---PHSMYDAFSHI 361 S + VE + F Sbjct: 235 SEPLEEHVFYVETYGVIEKLSSRFQET 261 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 77/215 (35%), Gaps = 29/215 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +PI + +Q +D++ VLD S SM + KI + + ++ + Sbjct: 481 EVPINKGEREINQQVNYIDLVFVLDSSGSMSWNDPNGYRKI-----AAKSFVDALIQGD- 534 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ +V F N L ++ I + +G TN G++ A Q+ Sbjct: 535 -----RAAVVDFDNFGYLLQPLTTDFQAVKNAIDRIDSWG-GTNIAEGIRIANQQLISRS 588 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+I+ +TDGE EAK G +Y IG+ Sbjct: 589 ---------SEDRIKVIILLTDGEGYYDN------NLTTEAKNNGITIYTIGLGTSVDEN 633 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 LR A + ++ V + + F I + + Sbjct: 634 LLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTE 668 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 83/221 (37%), Gaps = 21/221 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + S + + D++ V+D SRS+ + + ++ +++ + + P+ Sbjct: 25 SLGLASQPRGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIKSLDVGPN 78 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV +++ ++ F L S + + + T + L++A + Sbjct: 79 A---TRVGLVNYASTVKPEFPLRAHTSKASLLQAVHRIQPLSTGTMTGLALQFAITKALS 135 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ K+++ +TDG + D A+ G ++AIG+ + Sbjct: 136 DAEGGRSR---SSDISKVVIVVTDGRPQDSVRDVS-----ERARASGIELFAIGVGRVD- 186 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 LR AS V+ S Y+ + K D Sbjct: 187 KATLRQIASEPQDEHVDYVES-YNVIEKLAKKFQEAFCVSD 226 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 85/221 (38%), Gaps = 30/221 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 262 TCNVCRGGGNGSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---RL 312 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ I + F L ++ ++ +S T + LKY N G Sbjct: 313 AQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 372 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 373 ARPGA-------QKVGIVFTDGRSQDYINDA-----ARKAKDLGFKMFAVGVGNAVEEEL 420 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + P + ++ + + + IGK + K ++ Sbjct: 421 REIASEPVADHYFYTAD----FKTINQIGKKLQKKICVEEE 457 >gi|73980586|ref|XP_540098.2| PREDICTED: similar to matrilin 3 precursor [Canis familiaris] Length = 481 Score = 94.8 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 73/210 (34%), Gaps = 26/210 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + LD++ ++D SRS+ TK+ + I + + Sbjct: 69 GARGAGVCKSRPLDLVFIIDSSRSVRPL---EFTKVKTFVSQI------IDTLDIGAADT 119 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + +V +++ ++ F L+ L++ + ++ T S ++ A ++ F + Sbjct: 120 RVAVVNYASTVKIEFHLQTYSDKQSLKQAVARITPLSTGTMSGLAIQTAMDEAFTEE--- 176 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +N K+ + +TDG Q A+ G +YA+G+ L+ Sbjct: 177 AGARGPTSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LK 230 Query: 338 ACAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + VE + F Sbjct: 231 IIASEPLDEHVFYVETYGVIEKLSSRFQET 260 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 82/218 (37%), Gaps = 21/218 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + +T + D++ V+D SRS+ + + ++ ++E + + P+ Sbjct: 22 ALGLTPQAGGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPN 75 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G+V +++ +++ F L S R ++ + T + +++A + F Sbjct: 76 A---TRVGVVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKAFS 132 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + K+++ +TDG + D + A+ G ++AIG+ + Sbjct: 133 NTE---GGRARSPDISKVVIVVTDGRPQDSVRDVSAR-----ARANGIELFAIGVGRVD- 183 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 LR AS V+ S Y + K Sbjct: 184 KATLRQIASEPQDEHVDYVES-YSVIEKLSKKFQEAFC 220 Score = 82.5 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 260 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 310 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 311 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 370 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + + +AK G ++A+G+ E Sbjct: 371 ARPGA-------QKVGIVFTDGRSQDYINNA-----AKKAKDLGFKMFAVGVGNAVEDEL 418 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 419 REIASEPVAEHYFYTAD----FKTINQIGKRLQKKICVEE 454 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 94.5 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 35/244 (14%), Positives = 91/244 (37%), Gaps = 28/244 (11%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 S+ RY + L + ++ + + +++ D++ ++D SRS+ Sbjct: 8 SRSLGLTPRYSMTLPGFVMLLCIMGAQATVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSV 67 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 ++ + + +++ + + PD + G+V ++++++ L+ + Sbjct: 68 RP------SEFEQVKVFLAKVIDGLSVGPDA---TRVGVVNYASRVKNEVSLKSHKTKAA 118 Query: 245 --RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + + + T + +++A N F + + + K+ + +TDG Sbjct: 119 LVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEGGR----KSPDISKVAIIVTDGRPQDN 174 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN---PHSMYD 356 D A++ G ++AIG+ + LR AS + VE+ + Sbjct: 175 IRD-----IAARAREAGIEIFAIGVGRVD-MTTLRQMASEPLEDHVDYVESYSLIEKLTK 228 Query: 357 AFSH 360 F Sbjct: 229 KFQE 232 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 82/213 (38%), Gaps = 28/213 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ ++A D++ ++D S+S+ ++ K IN +++++ + +V GLV Sbjct: 278 SACSNAATDVVFLIDGSKSVRP------ENFELVKKWINLIIDKLDVSETNTHV---GLV 328 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ +++ F L L+ +K ++ T + L + + F Sbjct: 329 QYSSTVKQEFPLGRHNSKRSLKEAVKRMNYMERGTMTGHALSFLVDNSFGPNQ------G 382 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+ + TDG + D +AK G +YA+G+ E + P Sbjct: 383 ARPGVPKVGIVFTDGRSQDYIGDA-----AKKAKALGFKMYAVGVGNAVEDELREIASEP 437 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + ++ + +M I K + + Sbjct: 438 IADHYFYTADFKTMNQ----IAKKLQINVCQEE 466 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 56/337 (16%), Positives = 122/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F + + + ID+ T + G+ + + F + D Sbjct: 675 LNTFMSQNDIANAIDQMAHIGETTL----TGSALTFVSQYFSPEKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVKDPAVALRKDG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI------DYQEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + E L S + +Y VE + FS + + Sbjct: 949 GANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 83/204 (40%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ + +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGLE---NFSKMKVFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 664 QFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKEFLASV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFR 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG + E + +G +Y++GI + Sbjct: 1091 PDTGSRINAGTP-----QVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1141 DQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 119/359 (33%), Gaps = 42/359 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 L M + + + L + ++ AA + + + ++ + + +I Sbjct: 282 LSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHR 341 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY---KIPLKFCTFIPWYTN 150 ++ VN + V T + V N + I+ Y + K F T+ Sbjct: 342 ASDDNVTKAAVNLRREGV--TVFTMGVEGANPE-QLEKIASYPAEQFTSKLSNFSELATH 398 Query: 151 SRH--------------IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 ++ + T ++K D+ +++D S ++ T Sbjct: 399 NQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGSG------NTQPTDFH 452 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFG 254 ++ +++ + P V+ G V +++ + F + L + I + + G Sbjct: 453 EMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMG 509 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 +TN+ L + + + R +V +T+G L ++ Sbjct: 510 GNTNTGAALNFTLTLLQRAKKQR------GNKVPCHLVVLTNG-----MSQDSVLGPAHK 558 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ V+AIG++ + Y V ++ D + + ++I + D Sbjct: 559 LREENIRVHAIGVKEANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAEEACRD 617 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 159 TSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T + V + + D++ +LD++ + S +D + E V + N Sbjct: 213 TDDIIVEACQGPSVADVVFLLDMA------INGSQENLDHLKAFLG---ESVSALDIKEN 263 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV +SN+ L GV+ + ++I+ LS + L+ ++F Sbjct: 264 CMRVGLVAYSNETRVISSLSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSA-- 321 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Q + ++ +I V +T D ++ G V+ +G+ + Sbjct: 322 --QRGSRKNQGVPQIAVLVT-----HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQL 374 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIG 362 + + Y E S FS + Sbjct: 375 EKIAS-----YPAEQFTSKLSNFSELA 396 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 63/186 (33%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I ++ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------MKSFPLVKTFIQKLVSSL---PVEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFHLGAFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ + L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNL 185 Query: 349 ENPHSM 354 + Sbjct: 186 RTARDL 191 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 79/217 (36%), Gaps = 26/217 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S T+ +D++ V+D S+S+ ++ + + +++ + + P Sbjct: 102 SWSGTRGCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLTISPKAA--- 152 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ +S ++ F L S +++ + + G + + LK+ + + F Sbjct: 153 RVGLLQYSTQVRTEFTLRNFGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIE-- 210 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 A ++ + TDG + ++AK G +YA+G+ E Sbjct: 211 -GARPLSARVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQE 264 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + P + E+ + I + K I Sbjct: 265 IASEPTDKHLFYAED----FSTMGEISDKLQ-KGICE 296 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 78/212 (36%), Gaps = 26/212 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T+ +D++ V+D S+S+ ++ + + +++ + + P + GL+ Sbjct: 391 KRCTEGPVDLVFVIDGSKSLGE------ENFEIVKQFVTGIIDSLTISPKAA---RVGLL 441 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S ++ F L S +++ + + G + + LK+ + + F Sbjct: 442 QYSTQVRTEFTLRNFGSAKDMKKAVASMKYMGKGSMTGLALKHMFERSFTQIE---GARP 498 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + TDG + ++AK G +YA+G+ E + P Sbjct: 499 LSARVPRVAIVFTDGR-----AQDDVSEWASKAKANGITMYAVGVGKAIEEELQEIASEP 553 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + E+ + I + K I Sbjct: 554 TDKHLFYAED----FSTMGEISDKLQ-KGICE 580 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 94.1 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 55/387 (14%), Positives = 129/387 (33%), Gaps = 46/387 (11%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 F+LN+ + + G ++ I LP++ V I V ++ +T L D + A Sbjct: 19 FTLNVVDKARLRSRSGTTVVMLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAA 78 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + ++ + N + + + + + + Sbjct: 79 GRVLA---VTGDKAEAIEAAERLLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPD 135 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT-SSVKVNSQTDARLDMMIVLDV 180 + S+ A +P+ F + +PI +K T LD+ IVLD Sbjct: 136 KKVNSVSLRAFGADDVPMLF----------PTMGVPIEFRPIKQAVATQVELDIAIVLDR 185 Query: 181 SRSM------------------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 S SM ++ + + ++ A+ + ++ D ++ + Sbjct: 186 SGSMAFSHDEVAKNGSPSSAPPGWKMGHAVPENARWLDTVAAVNGFLDIMEDSSHDERVS 245 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L T+S+K + L + ++ + S G +TN G+ + D + R Sbjct: 246 LSTYSDKSKADVKLTGDYTEIRAAMNAHSTKFKGGATNIGSGILEGGATLGDKKLARSWA 305 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRAC 339 + ++++ M+DG + + + +++ + E + Sbjct: 306 S-------RVLIVMSDG---IHNTGIEPIPAAQQVANEKIMIFTVTFSDEANVQEMEKVA 355 Query: 340 AS-PNSFYLVENPHSMYDAFSHIGKDI 365 S + ++ + +AF I K + Sbjct: 356 VSGGGQHFHAKDSQQLTEAFRKIAKSL 382 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 94.1 bits (232), Expect = 4e-17, Method: Composition-based stats. Identities = 56/337 (16%), Positives = 122/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F + + + ID+ T + G+ + + F + D Sbjct: 675 LNTFMSQNDIANAIDQMAHIGETTL----TGSALTFVSQYFSPEKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVKDPAVALRKDG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI------DYQEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + E L S + +Y VE + FS + + Sbjct: 949 GANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 83/204 (40%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ + +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGLE---NFSKMKVFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 664 QFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKEFLASV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFR 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG + E + +G +Y++GI + Sbjct: 1091 PDTGSRINAGTP-----QVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1141 DQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 119/359 (33%), Gaps = 42/359 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 L M + + + L + ++ AA + + + ++ + + +I Sbjct: 282 LSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHR 341 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY---KIPLKFCTFIPWYTN 150 ++ VN + V T + V N + I+ Y + K F T+ Sbjct: 342 ASDDNVTKAAVNLRREGV--TVFTMGVEGANPE-QLEKIASYPAEQFTSKLSNFSELATH 398 Query: 151 SRH--------------IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 ++ + T ++K D+ +++D S ++ T Sbjct: 399 NQTFLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGSG------NTQPTDFH 452 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFG 254 ++ +++ + P V+ G V +++ + F + L + I + + G Sbjct: 453 EMKIFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMG 509 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 +TN+ L + + + R +V +T+G L ++ Sbjct: 510 GNTNTGAALNFTLTLLQRAKKQR------GNKVPCHLVVLTNG-----MSQDSVLGPAHK 558 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ V+AIG++ + Y V ++ D + + ++I + D Sbjct: 559 LREENIRVHAIGVKEANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAEEACRD 617 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 159 TSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T + V + + D++ +LD++ + S +D + E V + N Sbjct: 213 TDDIIVEACQGPSVADVVFLLDMA------INGSQENLDHLKAFLG---ESVSALDIKEN 263 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV +SN+ L GV+ + ++I+ LS + L+ ++F Sbjct: 264 CMRVGLVAYSNETRVISSLSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSA-- 321 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Q + ++ +I V +T D ++ G V+ +G+ + Sbjct: 322 --QRGSRKNQGVPQIAVLVT-----HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQL 374 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIG 362 + + Y E S FS + Sbjct: 375 EKIAS-----YPAEQFTSKLSNFSELA 396 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 63/186 (33%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I ++ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------MKSFPLVKTFIQKLVSSL---PVEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFHLGAFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ + L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNL 185 Query: 349 ENPHSM 354 + Sbjct: 186 RTARDL 191 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 664 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVRDPAIALRKEGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 59/337 (17%), Positives = 123/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 675 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVRDPAIALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI------DYQEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNTTAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 949 GANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKV--KYY 1088 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 M N ++++ +TDG + E + +G +Y++GI + Sbjct: 1089 FQPDMGSRINAGT---PQVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1141 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 82/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 411 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 464 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 465 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 522 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 570 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V ++ + + + ++I + D Sbjct: 571 VKEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAEEACRD 617 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 216 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + ++I+ LS + L+ +IF Q Sbjct: 267 VGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 322 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 323 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 377 Query: 339 CASPNSFY 346 + P + Sbjct: 378 ASHPAEQF 385 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 29/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNL 185 Query: 349 ENPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 186 RTARDL-SVFAPNMTEIIKDVTQYR 209 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 77/203 (37%), Gaps = 21/203 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ V+D SRS+ + + ++ ++E + + P+ + GLV +++ + Sbjct: 1 PTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASSV 51 Query: 231 EEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ F L S + + ++ + T + + +A + + Sbjct: 52 KQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIHFAITKALSDAE---GGRPRSPDIS 108 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 K+++ +TDG + D + A+ G ++AIG+ + LR AS V Sbjct: 109 KVVIVVTDGRPQDSVRDVSAR-----ARASGIELFAIGVGRVD-KATLRQIASEPQDEHV 162 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + S Y + K Sbjct: 163 DYVES-YRVIEKLSKKFQEAFCL 184 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 93.7 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 70/196 (35%), Gaps = 29/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLD S SM + KI + + ++ + ++ +V F + Sbjct: 39 LVFVLDSSGSMSWNDPNGYRKI-----AAKSFVDALIQGD------RAAVVDFDDYGYLL 87 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L ++ I + +G TN G++ A +Q+ + K+I+ Sbjct: 88 QPLTTDFQTVKNAIDRIDSWG-GTNIAEGIRIANHQLISQS---------SDDRIKVIIL 137 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENP 351 +TDGE EAK G +Y IG+ LR A + ++ V + Sbjct: 138 LTDGEGYYDN------NLTTEAKNNGITIYTIGLGTSVDENLLRNIATQTGGMYFPVSSA 191 Query: 352 HSMYDAFSHIGKDIVT 367 + F I + + Sbjct: 192 SQLPQVFKRITEIVTE 207 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 93.3 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 46/218 (21%), Positives = 75/218 (34%), Gaps = 28/218 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 KV+ + +++ LD S SM+ + TK + + + Sbjct: 368 GDSKVDPNDASSRHVVLALDTSGSMDGE-PLNETKTATREFASTIFKSDADVC------- 419 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 LV++ + L+ ++ LS G TN L+ +Y ++ Sbjct: 420 ---LVSYDSSARNVIDSTDNEYALKAAVRDLSA-GGGTNIEDALRVSYERLEGSGSD--- 472 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-----RVIRSHE 334 K+IIV M+DGE + Y NE K G +Y +G Sbjct: 473 --------KRIIVLMSDGEANEGLVGDDLIAYANEIKDDGVTIYTLGFFQSVSDKAECQR 524 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ASP Y V++ + F IG DI R Y Sbjct: 525 VMEGIASPGCHYEVDDASQLRYFFGDIGDDINGTRFIY 562 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 92.9 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 82/209 (39%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + LD++ ++D SRS+ + + + M++ + + PD + Sbjct: 52 AATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVGPDA---TR 102 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ FLL+ ++ +++ I + T + +K A ++ F + Sbjct: 103 VAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFT---EKS 159 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + N K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 160 GARPKSKNISKVAIIVTDGRPQDQVEEVSAA-----ARASGIEIYAVGVDRADMRS-LKL 213 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 214 MASNPLEDHVFYVETYGVIEKLTSKFRET 242 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 92.9 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 LD++ ++D SRS+ + K+ + + E V + ++ + Sbjct: 57 GAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVDSLDIGSDATR 107 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV +++ + F L+ + + +++ + T + +K A Q+F + Sbjct: 108 VALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTENAGAR 167 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 168 PLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEVRS-LKQ 218 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 219 MASQPLDDHVFYVETYGVIEKLTSKFRET 247 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 92.5 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 50/271 (18%), Positives = 96/271 (35%), Gaps = 42/271 (15%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-- 160 + + D V L+I PQ Y F I+ I S Sbjct: 591 YAIGVKDAVEKELLEISGEPQRTFYV-------------NNFDALKPIKDDIITDICSTD 637 Query: 161 ----SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S+ D D++ ++D S S+ + K+ +KS+ V+ Sbjct: 638 GSDLSLLSTVCKDVPGDLIFLIDSSGSI---YPEDYQKMKDFMKSL------VQKSNIGK 688 Query: 217 NVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + V G++ +S + + F L + L + I + + G T++ + ++ FD Q Sbjct: 689 DQVHVGVLQYSTEQKLVFPLIQYYTKDQLSKAIDDMQQIGGGTHTGEAI-AVVSKYFDAQ 747 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + K+ +V +TDGE + +G IVY+IG+ + + Sbjct: 748 ------NGGRPDLKQRLVVVTDGE-----SQDDVKLPAEALRAKGVIVYSIGVVAANTSQ 796 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 L + Y + ++ D + +I Sbjct: 797 LLEISGDADRMYAERDFDALKDLEKQMALEI 827 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 86/230 (37%), Gaps = 34/230 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAM 205 + + + IT + D+ ++D S S+ F+D K I Sbjct: 438 ELGVQQVAVVITDGCVETEEA----DIFFLIDQSGSIHPPDFYD--------MKKFILEF 485 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL 263 L+ ++ P N V+ G+V +++ F L L++ I + + G T + L Sbjct: 486 LQTFRVGP---NHVRIGVVKYADSPTLEFDLHTYTDVKSLEKAITNIHQVGGGTETGKAL 542 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + Q R H K+ +V +TDG + ++ + +G +VY Sbjct: 543 DFMRPQFDRAVTTRGHK------VKEYLVVITDG-----NSTDKVKDPADKLRAQGVVVY 591 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 AIG++ E L P + V N +DA I DI+T D Sbjct: 592 AIGVKDAVEKELLEISGEPQRTFYVNN----FDALKPIKDDIITDICSTD 637 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 92/200 (46%), Gaps = 19/200 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + A+ D++ ++DVS S+ + + + + ++S+ + + + P + + G++TF Sbjct: 847 KKTAQADIIFLVDVSTSI--LKEKAFPSVTVFMESV---VNQSSVGP---ELTRFGVITF 898 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S ++ F L+ S + + + ++ G +TN+ L Y+ ++H Sbjct: 899 STGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYF-----GKEHGGRAA 953 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPN 343 +I++ +TDG + +E + +K+G V++IG++ + + A P+ Sbjct: 954 LKVPQILMVITDG---AAQEPSKLPGPSEALRKQGVSVFSIGVKNASREQLDIMAGNDPS 1010 Query: 344 SFYLVENPHSMYDAFSHIGK 363 + V+ ++ + +I K Sbjct: 1011 RVFFVDTFDALETLYKNISK 1030 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 58/352 (16%), Positives = 116/352 (32%), Gaps = 40/352 (11%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 F V+ V IF +K R ++ A + G N + K Sbjct: 893 FGVITFSTGVQSIFTLK---QYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYFGKEHG 949 Query: 91 NMSFRNE-----LRDNGFVNDIDDI--------VRSTSLDIVVVPQNEGYSISAISRYKI 137 + + +G + + + S+ + V + ++ Sbjct: 950 GRAALKVPQILMVITDGAAQEPSKLPGPSEALRKQGVSVFSIGVKNASREQLDIMAGNDP 1009 Query: 138 PLKFC--TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 F TF T ++I + + K + D++ +LD S S++S +++ K Sbjct: 1010 SRVFFVDTFDALETLYKNISKVLCNHTKP-VCEKQKADLVFLLDQSGSIQSDDYTTMKKF 1068 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKF 253 ++ + ++V GL FS+ ++ F L + IK + + Sbjct: 1069 ---------TIDLINKFQISRDLVHVGLAQFSSTFKDEFYLNKFFDEQAISAHIKDMQQE 1119 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 T L G R+ + +V +TDG++ E+ L Sbjct: 1120 EGGTLIGLALNSIRKYFEASHGSRK-----AEGISQNLVLITDGDSQDDVEEAARL---- 1170 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + G V+AIGI + E L+ +P + + V+N + + I Sbjct: 1171 -LRGLGVEVFAIGIGNVHDLELLQIAGTPENVFTVKNFDKLEGIHQKVVDTI 1221 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 58/160 (36%), Gaps = 14/160 (8%) Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + G V+ S++ + FLL++ + + + T + + + Q Sbjct: 41 QPRQHPDRLGSVS-SDEPHQEFLLKYHIEKMNLLAAFESFPYRNGGTETGKAINFLRKQY 99 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + + D +I V +TDG++ + E +K G IV+AIG+ Sbjct: 100 FTKKAGSR----ADQRVPQIAVVITDGDSTD-----DVVVPARELRKHGVIVFAIGVGNA 150 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E P+ + + + A + + ++ Sbjct: 151 NQGELKSIANRPSERF--KFTIDSFQALKRLTERLLETMC 188 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 62/137 (45%), Gaps = 15/137 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + D++ ++D S S+ S + + +++++ +++ P N V+ G Sbjct: 328 EAHCKAANIADIVFIIDESGSIGS------SDFQLVRTFLHSLVSGLEVSP---NRVRVG 378 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V + + + L S L I+ L G TN+ L + +Q+F +R+ Sbjct: 379 IVVYHGEPKAEVFLNTFTDKSELLDFIRILPYHGGGTNTGAALNFTQHQVF----VREKG 434 Query: 281 NTEDANYKKIIVFMTDG 297 + + +++ V +TDG Sbjct: 435 SRIELGVQQVAVVITDG 451 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 60/159 (37%), Gaps = 25/159 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ I+LD + + + +++ ++ P + GL + I+ Sbjct: 204 DVFILLDG--------GITQAEFRQIRTFLGSLVNQLNFSPS---TYRLGLAQYGQDIKV 252 Query: 233 FFLLEWGVSH------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 FL + ++ ++ + + N L+YAY F + + D + Sbjct: 253 DFLFKDHQTNKDLLTAVKNAQQRKLQPNEPRNLGKALQYAYKNFFTPEAGSR----NDQS 308 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 +++ +V +T G++ +++ +C A V+ I Sbjct: 309 FRQYLVVLT-GKDADDPVYEEA--HCKAANIADI-VFII 343 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 92.5 bits (228), Expect = 1e-16, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 LD++ ++D SRS+ + K+ + + E V + ++ + Sbjct: 57 GAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVNSLDIGSDATR 107 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV +++ + F L+ + + +++ + T + +K A Q+F + Sbjct: 108 VALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTENAGAR 167 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 168 PLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEMRS-LKQ 218 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 219 MASQPLDDHVFYVETYGVIEKLTSKFRET 247 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 77/206 (37%), Gaps = 21/206 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ ++D SRS+ + + + ++E + + P+ + G+V + Sbjct: 21 CRTRPTDLVFIVDSSRSVRPH------EFEKVKVFLTQVIESLDVGPNA---TRVGVVNY 71 Query: 227 SNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ ++ F L+ S + ++ L T + +++A ++ F Sbjct: 72 ASAVKHEFPLKAHRSKASLLQAVRRLEPLSTGTMTGLAIQFAISRAFSEVE---GARPLS 128 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 K+ + +TDG +D + A++ G ++AIG+ + LR AS Sbjct: 129 PALSKVAIVVTDGRPQDDVKDVSAR-----AREAGIELFAIGVGRVD-KTTLRRIASEPL 182 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 V+ S Y + K Sbjct: 183 AEHVDYVES-YSVIEKLAKKFQEAFC 207 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 80/211 (37%), Gaps = 21/211 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 32 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 82 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 LV +++ +++ F L S + + ++ + T + +++A + Sbjct: 83 LVNYASSVKQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIQFAITKALSDAE---GG 139 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + D + A+ G ++AIG+ + L+ A Sbjct: 140 RPRSPDISKVVIVVTDGRPQDSVRDVSAR-----ARAGGIELFAIGVGRVD-KATLQQIA 193 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S V+ S Y + K Sbjct: 194 SEPQDEHVDYVES-YSVIEKLSKKFQEAFCL 223 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 54/358 (15%), Positives = 118/358 (32%), Gaps = 66/358 (18%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG-------FVNDIDD 109 ++ A T+ +++ G + ++ + + ++ F + Sbjct: 124 AIQFAITKALSDAEGGRPRSPDISKVVIVVTDGRPQDSVRDVSARARAGGIELFAIGVGR 183 Query: 110 IVRSTSLDIVVVPQNE------GYSISAISRYKIPLKFCTFIPWYTNSRH----IVMPIT 159 + ++T I PQ+E YS+ K FC H + + Sbjct: 184 VDKATLQQIASEPQDEHVDYVESYSVIEKLSKKFQEAFCLVSDLCATGDHDCEQVCVSSP 243 Query: 160 SSVKVNSQTDARL-------------------DMMIVLDVSRSMESFFDSSITKIDMAIK 200 S + L D++ ++D S+S+ ++ K Sbjct: 244 GSYTCACREGFTLNSDGKTCNVCNGGGGSSATDLVFLIDGSKSVRP------ENFELVKK 297 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTN 258 IN +++ + + + Q GLV +S+ + + F L ++ ++ +S T Sbjct: 298 FINQIVDTLDVSD---KLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTM 354 Query: 259 STPGLKYAY-NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + LKY N G R +K+ + TDG + D +AK Sbjct: 355 TGAALKYLIDNSFTVSSGARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKD 402 Query: 318 RGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G ++A+G+ E + P ++ + + + IGK + + + Sbjct: 403 LGFKMFAVGVGNAVEDELREIASEPVAEHYFYTAD----FKTINQIGKKLQKRICVEE 456 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 79/210 (37%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 15 TGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 65 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L S R ++ + T + +++A + F Sbjct: 66 VVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKAFSNTE---GG 122 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + D + A+ G ++AIG+ + LR A Sbjct: 123 RARSPDISKVVIVVTDGRPQDSVRDVSAR-----ARANGIELFAIGVGRVD-KATLRQIA 176 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + K Sbjct: 177 SEPQDEHVDYVES-YSVIEKLSKKFQEAFC 205 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 245 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 295 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 296 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 355 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + + +AK G ++A+G+ E Sbjct: 356 ARPGA-------QKVGIVFTDGRSQDYINNA-----AKKAKDLGFKMFAVGVGNAVEDEL 403 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 404 REIASEPVAEHYFYTAD----FKTINQIGKRLQKKICVEE 439 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 91.8 bits (226), Expect = 1e-16, Method: Composition-based stats. Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V N VQ G+V FS +E Sbjct: 621 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGANRVQIGVVQFSGVNKE 671 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I + G +T + L + +G R N +K Sbjct: 672 EFQLNRFMSQSDISNAIDQMVHIGETTLTGSALSFVSQYFSPTKGARP-------NVRKF 724 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 725 LILITDGE-----AQDVVKEPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 779 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 780 ----FDILQHIEDDLV 791 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 77/192 (40%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++ + ++ M+ VK N V+ G + +++ E Sbjct: 807 LDVVFVIDSSGSIDH------DEYNIMKDF---MIGLVKKADVGKNRVRFGALKYADDPE 857 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + L+ ++ G +T ++ L ++ + N + Sbjct: 858 VLFYLGDFDTKLEVISVLQNDQPMGGNTYTSEAL--GFSDHMFTEAQGSRLNKG---VPQ 912 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 913 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFVE 969 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 970 TFGGLKGIFSDV 981 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 87/214 (40%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S+ S S TK+ + S+ V+ Sbjct: 981 VTASVCNSSKVDCEIDK-VDLVFLMDGSNSIHS---SDFTKMKEFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ + F L G + +I+ + + G +T+ L+ + Sbjct: 1031 VSLNRVRIGAAQFSHNYRQEFPLGTFIGEKEISFQIENIQQLGGNTHIGDALRQVGHYFR 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1091 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1141 DQQLIQITGAAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 82.5 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 54/356 (15%), Positives = 117/356 (32%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S I L + AA + + + + R + + +I Sbjct: 282 LSMGINKSEILRNIQNLSPQTGNAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 341 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + R + L+ + E Y + + T Sbjct: 342 ASEDNVTKAAVNLRREGVAIFTLGIEGARDSQLEKIASHPAEQYVSKLKTFADLAAHNQT 401 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 402 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 455 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 456 SFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 512 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + R +V +T+G L N ++ Sbjct: 513 NTGAALNFTLSLLQKAKQQR------GNKVPCHLVVLTNG-----MSKDSILEPANRLRE 561 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG++ + Y V + +DA I +V + + Sbjct: 562 EHIRVYAIGVKEANKTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 613 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 67/188 (35%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + D++ +LDVS + S D + + E V + N ++ Sbjct: 216 ILVEVCQGPSVADVVFLLDVS------VNGSEENFDFLKEFLE---ESVSALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +SN+ + L G S + R I+ LS + + +K ++F R Sbjct: 267 VGLVAYSNETKVINSLSMGINKSEILRNIQNLSPQTGNAYTGAAIKKLRKEVFS---ARN 323 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +I V +T + ++ G ++ +GI R + + Sbjct: 324 GSRKN-QGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVAIFTLGIEGARDSQLEKI 377 Query: 339 CASPNSFY 346 + P Y Sbjct: 378 ASHPAEQY 385 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S I M+ + P + + L +S+K+ Sbjct: 26 DVVFLLDSSDHLGS------KSFPFVKMFITKMINSL---PIEADKYRVALAQYSDKLHS 76 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G + + +K G S L+ A+ F + + + Sbjct: 77 EFHLSTFKGRNPMLNHLKKNFQFLGGSLQIGKALQEAHRTYFSAPTNGR----DRKQFPP 132 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + +K G + ++G++ L+A A+ + + + Sbjct: 133 ILVVL-----ASAESEDDVEEASKALQKDGVKIISVGVQDASEEN-LKAMATSHFHFNLR 186 Query: 350 NPHSM 354 + Sbjct: 187 TVRDL 191 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 85/210 (40%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + N D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGNLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F G + Sbjct: 82 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGG 138 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ + K+++ +TDG + +D + A+ G ++AIG + LR A Sbjct: 139 HSRSPDISKVVIVVTDGRPHDSVQDVSAR-----ARASGVELFAIGFGRVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 82/217 (37%), Gaps = 30/217 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P ++ + + + IGK + K Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKIC 452 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 44/204 (21%), Positives = 81/204 (39%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + K+ +KS+ V + VQ G+V Sbjct: 613 EACKDLKADIMFLVDSSGSIGLE---NFGKMKTFMKSL------VSKSQIGAHRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ +E F L+ +S + I ++ G +T + L + +G R Sbjct: 664 QFSHINKEEFQLDTFMSQSDISNAIDRMAHIGETTLTGGALTFVSQYFSPAKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE ++ G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDVVKEPAVALRQEGVIIYSVGVFGSNVTQLEEISGRP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D I D+V Sbjct: 772 EMVFYVEN----FDILQRIEDDLV 791 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 52/337 (15%), Positives = 118/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 F ++ + + IDR T + G + + F + D Sbjct: 675 LDTFMSQSDISNAIDRMAHIGETTL----TGGALTFVSQYFSPAKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + D+V+ ++ + YS+ L+ + P + + Sbjct: 731 G----EAQDVVKEPAVALRQEG-VIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQR 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVFGICSPREECKRIEVLDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYN 268 V+ G + +++ E F L + ++ ++ G +T + L ++ + Sbjct: 837 KADVGKAQVRFGALKYADDPEVLFYLGDLDTKMEVISMLQNDQPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ ++ + + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGESHDAEKLNGTTK---ALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 949 GANPVELLAMAGSSDKYFFVETFGGLQGIFSDVSASV 985 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S++ S K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSID---PSDFKKMKEFVASV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N VQ G FS+ + F L + I+ + + T+ L Sbjct: 1031 VSLNRVQIGAAQFSHTYQPEFPLGTFTDEKEVSFHIENIQQIFGYTHIGAALHQVGRYFQ 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + E + G +Y++GI + Sbjct: 1091 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEELRHNGVDIYSVGIGNVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + + V+N + I ++I Sbjct: 1141 HQQLIQITGTADKKLTVDNFDELKKIKKRIVRNI 1174 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 48/334 (14%), Positives = 108/334 (32%), Gaps = 32/334 (9%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRI-----KNTWNMSFRNELRDNGFVND 106 + + + + G +RK I+ + ++ + N R V Sbjct: 304 EAYTGAALRKIRKEVFGARGGSRKNQGVPQIVVLVTHRASEDNVTRAAVNLRRQGVTVFT 363 Query: 107 ID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI----TSS 161 + + +T L+ + E Y + + TF+ N + + T + Sbjct: 364 LGVEGASATQLEKIASHPAEQYVSKLPTFSDLAAHNQTFVKKLRNQITHTVSVLAERTET 423 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +K D+ +++D S S + T ++ + + P V+ Sbjct: 424 LKSGCVDTEEADIYLLIDGSGS------TQATDFQEMKTFLSEVASMFHIGPHK---VRV 474 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G V +++ + F + L R I+ + + G TN+ L + + + R Sbjct: 475 GAVQYASSWDLEFEIGKYSNKHDLGRAIENIRQLGGDTNTGAALNFTLRLLQKAKQQR-- 532 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +V +T+G L ++ VYAIG++ + Sbjct: 533 ----GNKVPCHLVVLTNG-----MSKDSILEPAKRLREENIRVYAIGVKEANQTQLREIA 583 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 Y V + ++ D + + ++I + D Sbjct: 584 GDEKRVYYVHDFDALKDIRNQVVQEICAEEACKD 617 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 90/249 (36%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + + A++ + T T S ++ I +V+ Sbjct: 155 RDGVRIISVGVQKAAEEDVKAMATSQFHFSLRTVRDLSTFSHNVTQAIRDAVQYRDAAVD 214 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + D++ +LDV+ + S+ D + E V + + + Sbjct: 215 DILVEVCQGPSVADVVFLLDVA------VNGSLEDFDHLKGF---LQESVSALDIKESCM 265 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ E L GV S + +I+ LS + L+ ++F +G Sbjct: 266 RVGLVAYSNETEVIGSLSEGVNRSAVLHRIQSLSPGAGEAYTGAALRKIRKEVFGARGGS 325 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + ++ +I+V +T + +++G V+ +G+ + + + Sbjct: 326 R----KNQGVPQIVVLVT-----HRASEDNVTRAAVNLRRQGVTVFTLGVEGASATQLEK 376 Query: 338 ACASPNSFY 346 + P Y Sbjct: 377 IASHPAEQY 385 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 91.8 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 114/340 (33%), Gaps = 49/340 (14%) Query: 43 IFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG 102 +F +K + + + A + +E +G + ++ + L+++ Sbjct: 1060 VFHLKPKVGKAHTATALKFAKERFSEMHGGRQSLAVTQILVLITNKPTESEEKKYLQESA 1119 Query: 103 ----------FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 F I ++ R I S Y + K I + Sbjct: 1120 QTLQEAGIDVFAIGIKNVKRPELQAITKHRDRSFMVQSYNELYNLHEKVTHIICNESK-- 1177 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 PI +D++ ++D S S+ + T + +K I V Sbjct: 1178 ----PICG--------HREIDLVFLIDGSSSIH---PRNFTAMKTFMKQI------VNSF 1216 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + V+ G+ +S ++ F L S + + I +++ T + GL++ + Sbjct: 1217 TIGKDRVRIGVAQYSTNPQKEFYLNTFYSGAEINQHIDKITQLRTQTYTGKGLRFVKSFF 1276 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G R+ + + + +V +TDG + + N+ + +++IGI VI Sbjct: 1277 EPANGSRK-----NLHVLQSLVVITDG-----MSNDSVVEAANDLRNEKIQIFSIGIGVI 1326 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++V + + I + +V + Sbjct: 1327 NLFELQLIAGNVKRVFVVGDFGQL----GSIERKVVRELC 1362 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 81/238 (34%), Gaps = 26/238 (10%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKI 195 I K + + +P+ + + D ++D S S+ ++ Sbjct: 586 ITEKIKKRVCTEIVQKTFSVPVMTRTLQEGCVSTEEADFYFLIDGSGSINH---DDFAEM 642 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKF 253 M+E + + V+ G+V +S+ F + S L+ I + + Sbjct: 643 KTF------MIELISTFRVGADHVRFGVVQYSDSPTVEFDIRQHSSVAQLKSAITKIWQT 696 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G T + L + ++F + ++ +TDG+ Q Sbjct: 697 GGGTRTGEALTF-MKRLFS--------EVARDKVLRFLIVITDGQ-----SQDQVAQAAE 742 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E ++ +YAIG++ + E L S N + V + S+ + +DI + Sbjct: 743 ELRQENITIYAIGVKSAVTKELLEISGSQNRMFFVNDFDSLKPIQQEVIQDICFLEVC 800 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 75/186 (40%), Gaps = 22/186 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + K+ + S+ LE + V+ GL F++ I + Sbjct: 17 DLVFLVDSSTSIGPE---NFQKVKSFLYSLVLGLEI------GRDQVRVGLAQFNDNIYK 67 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 FLL S + +I L T + L + + F + + N +I Sbjct: 68 AFLLNQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFTESAGSRAKD----NVPQI 123 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDGE + + ++ K +G +Y +GI V E + P + + Sbjct: 124 VILVTDGE-----SNDEVAEAASKLKGQGVSIYVVGINVQDVQELKTIASKPLEKFLFSI 178 Query: 349 ENPHSM 354 E+ + + Sbjct: 179 EDFNIL 184 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 64/199 (32%), Gaps = 23/199 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D++ ++D S E K+ M++ V VQ GL+ Sbjct: 798 EVCKGMKADILFLVDGS---ERINTRDFDKMKEF------MMQMVNKSDLGPEKVQIGLL 848 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ +E F L S + R I + + L ++ + Sbjct: 849 QFSSNPQEEFRLNTYYSKVDILRAITGMVQIRAGARVGSALSFSLPYF-------ERSRG 901 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N + ++ + G K + +G ++AIG+ + + L + Sbjct: 902 GRLNVPQYLIIIISG-----KTGDAVKMPAKALRDKGIKIFAIGVHKANNSQLLEITGAQ 956 Query: 343 NSFYLVENPHSMYDAFSHI 361 + Y EN S+ I Sbjct: 957 DKVYYEENFDSLLFLEKQI 975 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 82/229 (35%), Gaps = 23/229 (10%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + RH+ P + T A L D++ +++ S + ++ ++ + Sbjct: 389 GSSGQRHLS-PEMGEETLAVCTSASLADIVFLVEASSRIG---------LENFQLAVELL 438 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL 263 + + + N V+ GLV +S++ F L + S + + L G T + L Sbjct: 439 RKIIHTLIIGPNKVRVGLVLYSDEPRLEFGLNTFLSQSEILSHLNKLPFIGGKTKTGAAL 498 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + N +F Q ++ +++ V +T+G + + SL ++ G V+ Sbjct: 499 DFLRNTVFTQQKGSRY----RQGVQQLAVVITEGYSQDEVDRPASL-----LRRAGVTVF 549 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G L AS S S I + I + Sbjct: 550 AVGTLKASGSRDLNKIASHPPRKHAIYLESFLQ-LSVITEKIKKRVCTE 597 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 91/237 (38%), Gaps = 22/237 (9%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 +++ V + + + + ++ ++ + + + + ++ + T + D++ Sbjct: 155 INVQDVQELKTIASKPLEKFLFSIEDFNILEGLSGNILPTLCSAVENQIRAFTKSYADVV 214 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S +S + I +++ +++ D + Q GL + N+ FL Sbjct: 215 FLVDTS------EGTSSVSFNQMKDFICRVIDTLEVGRDKD---QIGLAQYGNQGHVEFL 265 Query: 236 LEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + I+ G + + GL+Y F + + + V Sbjct: 266 LNAYQNPVEMISHIQQNFLPRGGARKTGNGLQYIQETFFQEEAGSRFL----QGIPQYAV 321 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +T G+ + L + K+RG + +GI+ S E L+A A+P + +E Sbjct: 322 VITSGQ-----SEDLVLEKAQKLKERGVKIMVVGIQDFDSRE-LKAMATPPLVFEIE 372 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 70/156 (44%), Gaps = 8/156 (5%) Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + V+ G +++S+ E F L+ ++ ++ + +L ++ LK+A + +M Sbjct: 1027 KDNVRFGAISYSDNSEVLFSLDTYITKAQIRDAVFHLKPKVGKAHTATALKFAKERFSEM 1086 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G RQ +I+V +T+ + ++E + ++ G V+AIGI+ ++ Sbjct: 1087 HGGRQ-----SLAVTQILVLITN-KPTESEEKKYLQESAQTLQEAGIDVFAIGIKNVKRP 1140 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E + ++V++ + +Y+ + I + Sbjct: 1141 ELQAITKHRDRSFMVQSYNELYNLHEKVTHIICNES 1176 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 82/205 (40%), Gaps = 27/205 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + + D+M ++D S S+ + +K+ + +K++ V + VQ G+ Sbjct: 647 QEACKEMKADIMFLVDSSGSIGPE---NFSKMKIFMKNL------VSKSQIGADRVQIGV 697 Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V FS+ +E F L S + I ++ G +T + L + +G R Sbjct: 698 VQFSHVNKEEFQLNRYMSQSEISDAIDRMAHIGETTLTGHALTFVSQYFSPAKGAR---- 753 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N +K ++ +TDGE ++ G I+Y++G+ + Sbjct: 754 ---PNVRKFLILITDGE-----AQDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGR 805 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIV 366 P + VEN +D HI D+V Sbjct: 806 PEMVFYVEN----FDILQHIEDDLV 826 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 1016 VSASVCNSSKVD-CEIGQIDLVFLMDGSNSIH---PDDFKKMKAFLASV------VQDFD 1065 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ + F L + +I+ + + T+ L+ Sbjct: 1066 VTVNRVRIGAAQFSHNYQPEFPLGTFTGEEEISLQIEKIQQIFGYTHIGAALRRVGRYFR 1125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG+ + + +++G +Y++GI + Sbjct: 1126 PDMGSRINAGTP-----QVLLVLTDGQ-----SQDEVAQAAEDLRRKGINIYSVGIGDVD 1175 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + + V + + I + I Sbjct: 1176 DQQLVQITGTADKKLTVHDFDELRKVKKRIVRHI 1209 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 52/337 (15%), Positives = 119/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + ++ + IDR T + G+ + + F + D Sbjct: 710 LNRYMSQSEISDAIDRMAHIGETTL----TGHALTFVSQYFSPAKGARPNVRKFLILITD 765 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 766 G----EAQDIVKDPAVALRQEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQH 820 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ + + VK Sbjct: 821 IEDDLVFGICSPREECKRIEVLDVVFVIDSSGSI------DYDEYNIMKDFMTDL---VK 871 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 872 KADVGKNQVRFGALKYADDPEVLFYLDTLGTKWEVISVLQNDQPMGGNTYTAEALAFSDH 931 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R ++++ +TDGE+ + + + +G ++ A+GI Sbjct: 932 MFTEARGSRLQRG-----VPQVLIVITDGES---HDADKLNATAKALRDKGILILAVGIA 983 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 984 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 1020 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 49/351 (13%), Positives = 113/351 (32%), Gaps = 36/351 (10%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK--------- 87 I S + L + AA + + + + + + +I Sbjct: 320 GINKSEVLQYIQNLSPQAGNAYTGAAIRKIRKEVFGAQNGSRKNQGVPQIAVLVTHRPSE 379 Query: 88 NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP 146 + + N R + + + T L+ + E Y S + TF+ Sbjct: 380 DNVTKAAVNLRRQGVTIFTMGIEGASDTQLEKIASHPAEQYVSKLKSFSDLAAHNQTFLK 439 Query: 147 WYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 N + + T ++K D+ +++D S S + T + Sbjct: 440 KLRNQITHTLSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMKTFL 493 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNST 260 + ++ + P V+ G V +++ + F + L + I+ + + G +TN+ Sbjct: 494 SELVGMFNIAP---QKVRFGAVQYADSWDLEFEINKYSNKHDLGKAIENIRQMGGNTNTG 550 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L F + +++ +V +T+G + N ++ Sbjct: 551 AALN------FTLGLLQKAKKERGNKVPCHLVVLTNG-----VSKDSIVEPANRLREELI 599 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 VYAIG+R + Y V + ++ D + + ++I + Sbjct: 600 HVYAIGVREANQTQLREIAGEEKRMYYVHDFDALKDIRNQVVQEICAQEAC 650 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 88/249 (35%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + ++ A++ + T T S+++ I + + Sbjct: 190 RDGVRIISVGVQKASEENLKAMATAQFHFNLRTVRDLSTFSQNMTQIIKDAAQYKDGAVN 249 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + D++ +LDVS + S D + + E V + N + Sbjct: 250 DILVEVCQGPSVADVVFLLDVS------LNGSQENFDYLKEFLE---ESVSALDIKENCM 300 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV FSN+ + L G S + + I+ LS + + ++ ++F Q Sbjct: 301 RVGLVAFSNETKVINSLSRGINKSEVLQYIQNLSPQAGNAYTGAAIRKIRKEVFGAQNGS 360 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + ++ +I V +T + +++G ++ +GI + + Sbjct: 361 R----KNQGVPQIAVLVT-----HRPSEDNVTKAAVNLRRQGVTIFTMGIEGASDTQLEK 411 Query: 338 ACASPNSFY 346 + P Y Sbjct: 412 IASHPAEQY 420 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 664 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVRDPALALRKEGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 60/337 (17%), Positives = 123/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 675 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR +L + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVRDPALALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI------DYQEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 949 GANSWELLAMAGSGDKYYFVETFGGLKGIFSDVSASV 985 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQ 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG + E + +G +Y++GI + Sbjct: 1091 PDTGSRINAGTP-----QVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1141 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 83/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 411 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 464 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 465 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 522 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 570 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V + ++ + + + ++I + D Sbjct: 571 VKEANQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEICAEEACRD 617 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 216 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + ++I+ LS + L+ +IF Q Sbjct: 267 VGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 322 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 323 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 377 Query: 339 CASPNSFY 346 + P + Sbjct: 378 ASHPAEQF 385 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 29/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNL 185 Query: 349 ENPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 186 RTARDL-SVFAPNMTEIIKDVTQYR 209 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 80/218 (36%), Gaps = 21/218 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + + D++ V+D SRS+ + + ++ ++E + + P+ Sbjct: 27 SLSLVPQPRGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPN 80 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV +++ ++ F L S + ++ + T + L++A + Sbjct: 81 A---TRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALS 137 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + K+++ +TDG + D A+ G ++AIG+ + Sbjct: 138 DAE---GGRARSPDISKVVIVVTDGRPQDSVRDVS-----ERARASGIELFAIGVGRVD- 188 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 LR AS V+ S Y+ + K Sbjct: 189 KATLRQIASEPQDEHVDYVES-YNVIEKLAKKFQEAFC 225 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 85/220 (38%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 265 TCNVCRGGGSGSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---RL 315 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ I + F L ++ ++ +S T + LKY N G Sbjct: 316 AQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 375 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 376 ARPGA-------QKVGIVFTDGRSQDYINDA-----ARKAKDLGFKMFAVGVGNAVEEEL 423 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + ++ + + + IGK + + + Sbjct: 424 REIASEPVADHYFYTAD----FKTINQIGKKLQKQICVEE 459 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 83/210 (39%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRPI------EFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F + Sbjct: 82 VVNYASTVKQEFPLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR 141 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + D + A+ G ++AIG+ + L+ A Sbjct: 142 ---SPDISKVVIVVTDGRPQDSVRDVSAR-----ARASGVELFAIGVGRVD-KATLQQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y+ + + Sbjct: 193 SEPQEEHVDYVES-YNVIEKLSRKFQEAFC 221 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 56/337 (16%), Positives = 122/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F + + + ID+ T + G+ + + F + D Sbjct: 675 LNTFMSQNDIANAIDQMAHIGETTL----TGSALTFVSQYFSPEKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVKDPAVALRKDG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S+ + ++ M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSI------DYQEYNIMKDF---MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + E L S + +Y VE + FS + + Sbjct: 949 GANTWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 83/204 (40%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ + +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGLE---NFSKMKVFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ E F L +S + I ++ G +T + L + +G R Sbjct: 664 QFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVKDPAVALRKDGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKEFLASV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTHIGDALRKVKHYFR 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG + E + +G +Y++GI + Sbjct: 1091 PDTGSRINAGTP-----QVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1141 DQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 44/356 (12%), Positives = 118/356 (33%), Gaps = 36/356 (10%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 L M + + + L + ++ AA + + + ++ + + +I Sbjct: 282 LSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHR 341 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 ++ VN + V ++ + + I++ + K F T+++ Sbjct: 342 ASDDNVTKAAVNLRREGVTVFTMGVEGANPEQLEKIASYPAEQFTSKLSNFSELATHNQT 401 Query: 154 --------------IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + T ++K D+ +++D S ++ T Sbjct: 402 FLKKLRNQITHTVSVFSERTETLKSACVDTEEADIYLLIDGSG------NTQPTDFHEMK 455 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ +++ + P V+ G V +++ + F + L + I + + G +T Sbjct: 456 IFLSEVVDMFNIAPHK---VRVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNT 512 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + R +V +T+G L ++ ++ Sbjct: 513 NTGAALNFTLTLLQRAKKQR------GNKVPCHLVVLTNG-----MSQDSVLGPAHKLRE 561 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 V+AIG++ + Y V ++ D + + ++I + D Sbjct: 562 ENIRVHAIGVKEANQTQLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAEEACRD 617 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 159 TSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T + V + + D++ +LD++ + S +D + E V + N Sbjct: 213 TDDIIVEACQGPSVADVVFLLDMA------INGSQENLDHLKAFLG---ESVSALDIKEN 263 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV +SN+ L GV+ + ++I+ LS + L+ ++F Sbjct: 264 CMRVGLVAYSNETRVISSLSMGVNKTEVLQRIQDLSPHVGQAYTGAALRKTRKEVFSA-- 321 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Q + ++ +I V +T D ++ G V+ +G+ + Sbjct: 322 --QRGSRKNQGVPQIAVLVT-----HRASDDNVTKAAVNLRREGVTVFTMGVEGANPEQL 374 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIG 362 + + Y E S FS + Sbjct: 375 EKIAS-----YPAEQFTSKLSNFSELA 396 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 63/186 (33%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I ++ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------MKSFPLVKTFIQKLVSSL---PVEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFHLGAFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ + L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQKASEED-LKAMATSQFHFNL 185 Query: 349 ENPHSM 354 + Sbjct: 186 RTARDL 191 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 91.4 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 82/211 (38%), Gaps = 24/211 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + + D++ ++D SRS+ + + ++ ++E + + P Sbjct: 24 GVPPQARGHLCRTRPTDLVFIIDSSRSVRPH------EFEKVKVFLSQVIESLDVGP--- 74 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N + G++ +++ ++ F L+ S + ++ + T + +++A N+ F Sbjct: 75 NTTRVGVINYASAVKHEFPLKAHRSKASLLQAVRKIEPLSTGTMTGLAIQFAINKAFSEV 134 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ + +TDG +D + AK+ G ++AIG+ + H Sbjct: 135 EGSRLKFP---EISKVAIVVTDGRPQDGVKDVSAR-----AKQSGIELFAIGVGRVDKHT 186 Query: 335 FLRACASP--NSFYLVEN---PHSMYDAFSH 360 + + P + VE+ + F Sbjct: 187 LRQIASEPLDDHVDYVESYSVIEKLSKKFQE 217 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T S + + D++ ++D S+S+ ++ + IN +++ + + + Sbjct: 260 TCSACNGAAGGSATDLVFLIDGSKSVRP------ENFELVKRFINQIVDSLDVSD---KL 310 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ +K +S T + LKY N G Sbjct: 311 AQVGLVQYSSSVRQEFPLGRYKTKKDIKAAVKKMSYMEKGTMTGAALKYLIDNTFTISSG 370 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 371 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 418 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 419 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 454 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 44/277 (15%), Positives = 108/277 (38%), Gaps = 33/277 (11%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSRHIVMPIT 159 + + +I + LD + + + + I K + ++ V+P+ Sbjct: 576 YAVGVKNITEGSKLDKIATYPPRNHVTTLKYFLQLSNIRWKIKKQLCNEIVTKTFVVPLQ 635 Query: 160 S-SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S S+K D+ ++D S S+ + S + + + +++ N Sbjct: 636 SRSLKKGCMDTEEADIYFLIDGSGSI---YPSDFKDMKTFMNEV------IRIFQLGANN 686 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V+ G+V ++++ + ++ ++ I+ +++ G T + L + +F Sbjct: 687 VRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNAL-RSMKSLF----- 740 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 N +I++ +TDG K + + + +++G ++YAIGI+ E Sbjct: 741 ---QMAYRENVPQILIVITDG-----KSEDKVNQAARDLRQQGIVIYAIGIKDAVQQELE 792 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + N + V + +D+ HI +IV + + Sbjct: 793 EIAETKNRMFFVND----FDSLKHIKHEIVQEVCSTN 825 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 89/218 (40%), Gaps = 22/218 (10%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++ + S K D++ ++D S S+ S + + +K + V Sbjct: 1191 ENLTQTVCESSKPVCSNHVA-DLIFLIDGSESISEN---SFSTMKTFMKDV------VDS 1240 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + V+ G+V +S + + F L + + ++ +I + + ST + GL++ + Sbjct: 1241 FDISRDKVRLGVVQYSQEPQREFYLNEFYSDTIIKEQINRIEQLRSSTFTGKGLRFVQSL 1300 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 G R+ + + +V +TDG + + +D + G V+A+G+ + Sbjct: 1301 FESANGGRK-----NQGVSQNLVVITDGYSADSVDDA-----AMALRSNGIHVFAVGVGI 1350 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + S E LR + VEN +++ S I +I Sbjct: 1351 VNSFELLRIAGDARRVFTVENFNALKTIKSTIINEICE 1388 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 89/262 (33%), Gaps = 36/262 (13%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 +LR G V I + ++ + + + I Sbjct: 766 QAARDLRQQGIVIYAIGIKDAVQQELEEIAETKNRMFFVN-----DFDSLKHIKHEIVQE 820 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +V+ D++ ++D S S+ + + KI ++S V + Sbjct: 821 VCSTNVCKNVRA--------DIVFLVDSSNSIRA---AEFQKIKDFMQS------FVIKV 863 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + V+ GL+ FS++I E F L+ + +QR I+ + + + T + L +A + Sbjct: 864 DVGLDNVRIGLIQFSSEIREEFQLDRYSTIADVQRAIQEMQQIKLGTLTGKALTFAASYF 923 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 K+ ++ +TDGE + +G +YAI + Sbjct: 924 -------DRPKGGRPELKQYLIVITDGE-----AQDSVKSPARAIRDKGITIYAIDMLQA 971 Query: 331 RSHEFLRACASPNSFYLVENPH 352 + + + + + + + Sbjct: 972 NNSQLVEITGAQDKVFFESEMN 993 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 102/260 (39%), Gaps = 29/260 (11%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 V P ++ Y I + P++ + S ++ + SS + D++ + Sbjct: 389 TVREPSSKVYQIYEVESID-PVQQ-NIVNNIETSLQMLYDLDSSAPAVCSSATVADIVFL 446 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D S + S + + ++ + + P V+ GLV +SN+ F L+ Sbjct: 447 VDESSKIGS------KNFQLIRAFLLKIVNALDIAPSN---VRVGLVLYSNEPRLEFTLD 497 Query: 238 WGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 L+ +K L G + +++ N++F + + ++I V +T Sbjct: 498 TFKDKLEILNYLKNLPYRGGQAYTGIAIEFLRNKVFTQEAGSRKK----QGVQQIAVVIT 553 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSF-----YLV 348 DG++L + ++ ++ VYA+G++ I L A+ P + Y + Sbjct: 554 DGQSLD-----DYIEPASKLRRESVTVYAVGVKNITEGSKLDKIATYPPRNHVTTLKYFL 608 Query: 349 ENPHSMYDAFSHIGKDIVTK 368 + + + + +IVTK Sbjct: 609 QLSNIRWKIKKQLCNEIVTK 628 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 75/195 (38%), Gaps = 21/195 (10%) Query: 156 MPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 T + + + A + D++ ++D S S+ + K+ + S+ V + Sbjct: 28 TFGTIDAQQTACSKATVADVVFIVDTSTSIAQE---NFQKVKNFLSSL------VSSLDI 78 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 ++++ GL +S++ + FLL S + +I L G T + L + + F Sbjct: 79 GLDMIRVGLAQYSDEAYQVFLLNQYLLKSDVLDQIGNLPYRGGETYTGRALDFVSTRYFT 138 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ V +T GE + + + + RG +Y +GI + + Sbjct: 139 --ESAGSRAKG--YVPQLAVLITSGE-----SNDEVEQPAKKLRYRGISIYVVGIGIQNT 189 Query: 333 HEFLRACASPNSFYL 347 E + + P YL Sbjct: 190 TELQQIASKPFRRYL 204 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 60/184 (32%), Gaps = 16/184 (8%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M V+ S + D + ++++ V + VQ G++ +S+ E Sbjct: 1017 DIMFVVHGSSGVT---DLQFKNMLQLVEAVVN--NSVVGKDN----VQFGVLVYSSNPEV 1067 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S ++ + L + L +A G+ +I Sbjct: 1068 QFSLNSYASKSQIREAVFSLKPLSGQPFTARALSFARQTF----GVNYGGRASSLAVARI 1123 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +V +TD E + K+ ++ A+G+ E + ++ Sbjct: 1124 LVLITD-EPTVPSDRDNLPMAIRALKEDKIVLIAVGVSKASREELEEITEDQKRLFFAQS 1182 Query: 351 PHSM 354 ++ Sbjct: 1183 YDAL 1186 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 47/258 (18%), Positives = 92/258 (35%), Gaps = 30/258 (11%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 T L + + Y + + + + + S + D Sbjct: 795 ANQTQLREIAGEEKRVYYVH---NFDALKDIRNQVVQEICTEEADLNKHHSFSYAACKDM 851 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D+M ++D S S+ + +K+ +K++ V + VQ G+V FS+ Sbjct: 852 KADIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGVVQFSDIN 902 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +E F L +S + I ++ G +T + L + +G R N + Sbjct: 903 KEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGAR-------PNIR 955 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 K ++ +TDGE ++ G I+Y++G+ + P + V Sbjct: 956 KFLILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYV 1010 Query: 349 ENPHSMYDAFSHIGKDIV 366 EN +D I D+V Sbjct: 1011 EN----FDILQRIEDDLV 1024 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 56/356 (15%), Positives = 119/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 503 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 562 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 563 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 622 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 623 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 676 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 677 TFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 733 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 734 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 782 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIGI+ + Y V N +DA I +V + + Sbjct: 783 EHIRVYAIGIKEANQTQLREIAGEEKRVYYVHN----FDALKDIRNQVVQEICTEE 834 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 75/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 1040 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 1090 Query: 232 EFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ G + ++ G ST + L ++ + N + Sbjct: 1091 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 1145 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 1146 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIDGANPMELLAMAGSSDKYFFVE 1202 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 1203 TFGGLKGIFSDV 1214 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 1214 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFD 1263 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 1264 VSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFR 1323 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1324 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1373 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1374 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1407 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 81/249 (32%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + ++ A++ + T S+++ I K Sbjct: 376 KDGVKIISVGVQKASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTKYKEGAVD 435 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + + D++ +LD+S + S D + E V + N + Sbjct: 436 DIFVEACQGPSMADVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCM 486 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ + L G S + + I+ LS + +K ++F R Sbjct: 487 RVGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---AR 543 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +I V +T + ++ G ++ +GI + + Sbjct: 544 NGSRKN-QGVPQIAVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEK 597 Query: 338 ACASPNSFY 346 + P Y Sbjct: 598 IASHPAEQY 606 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 25/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S I M+ + P + + L +S+K+ Sbjct: 247 DVVFLVDSSDRLGS------KSFPFVKMFITKMISSL---PIEADKYRVALAQYSDKLHS 297 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + ++ G S L+ A+ F + + Sbjct: 298 EFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPTNGRDKK----QFPP 353 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL-- 347 I+V + S++ + +K G + ++G++ L+A A+ + Sbjct: 354 ILVVL-----ASSESEDDVEEASKALRKDGVKIISVGVQKASEEN-LKAMATSQFHFNLR 407 Query: 348 -VENPHSMYDAFSHIGKDIVT 367 V + +HI KD+ Sbjct: 408 TVRDLSMFSQNMTHIIKDVTK 428 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L +S + ++ + T + +++A + F + Sbjct: 82 MVNYASTVKQEFSLRAHISKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR 141 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG +D + A+ G ++AIG+ + LR A Sbjct: 142 ---SPDISKVVIVVTDGRPQDNVQDVSAR-----ARASGVELFAIGVGRVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLK 264 + V + + + Q GLV +S+ + + F L ++ ++ +S T + LK Sbjct: 299 QIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALK 358 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Y + F + + +K+ + TDG + D +AK G ++A Sbjct: 359 YLIDNSFTVS------SGARPRAQKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFA 407 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +G+ E + P ++ + + + IGK + K + Sbjct: 408 VGVGNAVEDELREIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 454 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 85/230 (36%), Gaps = 28/230 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + N + LD++ ++D S+S+ ++ K IN +++ Sbjct: 247 CRDGFSLNNDGKTCTACNGGLGSALDLVFLIDGSKSVRP------ENFELVKKFINQIVD 300 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKY 265 +++ V GLV +S+ + + F L S ++ +K +S T + LKY Sbjct: 301 SLEVSDKQAQV---GLVQYSSSVRQEFPLGQFKSKKDIKAAVKKMSYMEKGTMTGQALKY 357 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + F N K+ + TDG + D +AK G ++A+ Sbjct: 358 LVDSSFSA------INGARPGVPKVGIVFTDGRSQDYISDA-----AKKAKDSGFRMFAV 406 Query: 326 GIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G+ E + P ++ + + S IGK + K + Sbjct: 407 GVGNAVEDELREIASEPVAEHYFYTAD----FRTISKIGKKLQMKICVEE 452 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ ++D SRS+ + + ++ ++E + + P+ + G++ + Sbjct: 33 CRTKPTDLVFIIDSSRSVRPH------EFEKIKVFVSRVIEALDVGPNA---TRVGVINY 83 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L+ + L + ++ + T + +++A ++ F + Sbjct: 84 ASAVRNELSLQGPHSKAALLQAVRRIQPLSTGTMTGLAIQFAISRAFSAAEGGRGSAPN- 142 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--- 341 +KK+ + +TDG +D + A+ G ++AIG+ + LR AS Sbjct: 143 --FKKVAIVVTDGRPQDGVQDVSAR-----ARAAGIEIFAIGVGRVDMGT-LRQMASEPL 194 Query: 342 PNSFYLVEN---PHSMYDAFSH 360 VE+ + F Sbjct: 195 DEHVDYVESYSVIEKLTHKFQE 216 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 80/196 (40%), Gaps = 19/196 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S++S + + M++ VK N VQ G + +S+ E Sbjct: 806 LDIVFVIDSSGSIDSNEYNIMKAF---------MIDLVKKADVGKNQVQFGALKYSDFPE 856 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + I+ G ST + L ++ + + G R H + Sbjct: 857 VLFNLNEFSSKSEIISFIQNDHPRGGSTYTAKALAHSAHLFSESLGSRMHRG-----VPQ 911 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + +G +V A+GI E L S + ++ VE Sbjct: 912 VLIVITDGESHDAHLLNAT---ARALRDKGILVLAVGIEGANHEELLSMAGSTDRYFFVE 968 Query: 350 NPHSMYDAFSHIGKDI 365 N + F ++ + Sbjct: 969 NFEGLKGIFENVSASV 984 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 74/199 (37%), Gaps = 23/199 (11%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + S + + D+M ++D S S+ + K+ +K++ V N Sbjct: 605 VRSICSEEACKEMSADIMFLVDSSGSIG---GDNFEKMKTFMKNV------VNRTKIGAN 655 Query: 218 VVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 VQ GLV FS+ +E F L + + I LS G T L + + +G Sbjct: 656 QVQVGLVQFSDINKEGFQLNQYDTKTKISDAIDGLSLIGRGTLIGGALTFVSDYFSVSKG 715 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R N KK +V +TDG K ++ G I+Y++G+ + Sbjct: 716 AR-------PNVKKFLVLLTDG-----KSQDAVKEAAVALRQDGVIIYSVGVFGSEYSQL 763 Query: 336 LRACASPNSFYLVENPHSM 354 + + VEN + Sbjct: 764 EEISGRSDMVFYVENFDIL 782 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 56/357 (15%), Positives = 114/357 (31%), Gaps = 41/357 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 LG S I L R+L AA + + + + + +I Sbjct: 281 LGTGTNKSDILQQIDGLSPKAGRALTGAAINVTRKEIFSRGAGSRKSQGVLQITVLITHR 340 Query: 94 FRNE---------LRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPL---K 140 + R+ V + + T LD + E Y S + Sbjct: 341 SSEDNVSEAALSLRREGVTVFAVGIEGANETQLDQIASYPREQYVSMVKSYSDMGAYYRI 400 Query: 141 FCTFIPWYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F + ++ + T +K D+ +++D S S+ + Sbjct: 401 FQKKLRNEIQNKVSVASEQTERLKSGCADTEAADIYLLIDGSGSI------QVADFQEMK 454 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST 257 + + ++ + P V+ G V +S+ E F ++ + L + ++ + + G +T Sbjct: 455 RFLAEVIGMFNIGPHK---VRFGAVQYSHLWEWEFEMDRYSNKNDLVKAVENIRQLGGNT 511 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++ L + Q RQ + +V +TDG + Sbjct: 512 DTGAALD---KMLPLFQRARQQRAR---KVPQHLVVLTDG-----LSHDSVREPAGRLRG 560 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 VYAIG++ + S + Y V N +D+ I +V + I ++ Sbjct: 561 DNINVYAIGVKEANHTQLEEIAGSDSRVYYVHN----FDSLKDIKNRVV-RSICSEE 612 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 85/216 (39%), Gaps = 22/216 (10%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++ + ++ KV+ D++ ++D S S+ + K+ + ++ + + Sbjct: 978 ENVSASVCNTSKVD-CELGMADLVFLIDGSTSI---LEEDFKKM---KDFLVTIVNDFDI 1030 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 P V GL FS++ F L ++ +I + + +T L+ + Sbjct: 1031 RP---GKVHVGLAQFSHEYRPEFSLIPFRDKIEVKNQIGRIQQIFGNTLIGAALRNVGSY 1087 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + G R +A +++++ +TDG+ + + + +G +Y++G+ Sbjct: 1088 FWPDFGSRI-----NAGVQQVLLVLTDGQ-----SQDEVAQAAEDLRNKGIDIYSLGVGQ 1137 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + + ++ S V+N + + +D+ Sbjct: 1138 VNDQQLIQISGSAKKKLTVDNFSELDKIKKRVVRDV 1173 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +D++ ++D S + + + ++E + N ++ GLV +S Sbjct: 222 DASVVDIVFLVDES------VNGTDENFEHLKGF---LVETIDSFDVKENCMRIGLVMYS 272 Query: 228 NKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 N+ + L G S + ++I LS + + +IF + + Sbjct: 273 NETKLVSRLGTGTNKSDILQQIDGLSPKAGRALTGAAINVTRKEIFSRGAGSR----KSQ 328 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I V +T + ++ G V+A+GI + + + P Sbjct: 329 GVLQITVLIT-----HRSSEDNVSEAALSLRREGVTVFAVGIEGANETQLDQIASYPREQ 383 Query: 346 Y-----LVENPHSMYDAF-SHIGKDIVTK 368 Y + + Y F + +I K Sbjct: 384 YVSMVKSYSDMGAYYRIFQKKLRNEIQNK 412 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 91.0 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 85/210 (40%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F G + Sbjct: 82 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGG 138 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 139 RSRSPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGSVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K I+ +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFISQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 LD++ ++D SRS+ + K+ + + E V + ++ + Sbjct: 57 GAPAEPCKSRPLDLVFIIDSSRSVR---PAEFEKVKIF------LSEMVNSLDIGSDATR 107 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV +++ + F L+ + + +++ + T + +K A Q+F + Sbjct: 108 VALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQVFTENAGAR 167 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 168 PLKKG---IGKVAIIVTDGRPQDKVEEVSAA-----ARASGIEIYAVGVDRAEMRS-LKQ 218 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 219 MASQPLDDHVFYVETYGVIEKLTSKFRET 247 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 90.6 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 80/218 (36%), Gaps = 21/218 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + + D++ V+D SRS+ + + ++ ++E + + P+ Sbjct: 27 SLSLVPQPRGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPN 80 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV +++ ++ F L S + ++ + T + L++A + Sbjct: 81 A---TRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALS 137 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + K+++ +TDG + D A+ G ++AIG+ + Sbjct: 138 DAE---GGRARSPDISKVVIVVTDGRPQDSVRDVS-----ERARASGIELFAIGLGRVD- 188 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 LR AS V+ S Y+ + K Sbjct: 189 KATLRQIASEPQDEHVDYVES-YNVIEKLAKKFQEAFC 225 Score = 87.1 bits (214), Expect = 5e-15, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 86/220 (39%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 265 TCNVCRGGGSGSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---RL 315 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ I + F L ++ +++ +S T + LKY N G Sbjct: 316 AQVGLVQYSSSIRQEFPLGRFHSKKDIKARVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 375 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 376 ARPGA-------QKVGIVFTDGRSQDYINDA-----ARKAKDLGFKMFAVGVGNAVEEEL 423 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + ++ + + + IGK + + + Sbjct: 424 REIASEPVADHYFYTAD----FKTINQIGKKLQKQICVEE 459 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 82/209 (39%), Gaps = 26/209 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + LD++ ++D SRS+ + + + M++ + + PD + Sbjct: 52 AATDSQCRSRPLDLVFIIDSSRSVRP------GEFEKVKIFLADMVDTLDVGPDA---TR 102 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ FLL+ ++ +++ I + T + +K A ++ F + Sbjct: 103 VAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMTGMAIKKAMDEAFT---EKS 159 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + N K+ + +TDG E+ + A+ G +YA+G+ L+ Sbjct: 160 GARPKSKNISKVAIIVTDGRPQDQVEEVSAA-----ARASGIEIYAVGVDRADMRS-LKL 213 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + + VE + F Sbjct: 214 MASNPLEDHVFYVETYGVIEKLTSKFRET 242 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 90.6 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 54/337 (16%), Positives = 122/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + + + IDR T + G+ + + F + D Sbjct: 674 LNRYMSQNEISNAIDRMAHIGETTL----TGSALTFVSQYFSPAKGARPNVRRFLILITD 729 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 730 G----EAQDIVKDPAVALRQEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQH 784 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + ++ M++ VK Sbjct: 785 IEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDH------DEYNIMKDF---MVDLVK 835 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ + + ++ G +T + L ++ + Sbjct: 836 KADVGKNQVRFGALKYADDPEVLFYLDDLSTKWEVISVLQKDQPMGGNTYTAEALGFSDH 895 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + + + +G +V A+GI Sbjct: 896 MFTEARGSRLHKG-----VPQVLIVITDGES---HDADKLNDTAKALRDKGILVLAVGIA 947 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 948 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 984 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 77/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + K+ +K++ V + VQ G+V FS+ +E Sbjct: 620 DIMFLVDSSGSIGLE---NFIKMKTFMKNL------VSKSQIGADRVQIGVVQFSDINKE 670 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + I ++ G +T + L + +G R N ++ Sbjct: 671 EFQLNRYMSQNEISNAIDRMAHIGETTLTGSALTFVSQYFSPAKGAR-------PNVRRF 723 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 724 LILITDGE-----AQDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 778 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 779 ----FDILQHIEDDLV 790 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 42/294 (14%), Positives = 101/294 (34%), Gaps = 31/294 (10%) Query: 87 KNTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 ++ + N R+ + + + + L+ + E + + ++ TF+ Sbjct: 343 EDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKIASHPAEQHVSKLKTFSELAAHNQTFL 402 Query: 146 PWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 N + + + T ++K D+ +++D S S + T Sbjct: 403 KKLRNQITLTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMKTF 456 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNS 259 ++ ++ + P V+ G V +++ + F + L + I+ + + G +TN+ Sbjct: 457 LSEVVGMFNIAP---QKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNT 513 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L F + +++ +V +T+G L N ++ Sbjct: 514 GAALN------FTLGLLQKAKKQRGNRVPCHLVVLTNG-----MSKDSILEPANRLREEL 562 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG++ + Y V + +DA I +V + + Sbjct: 563 IRVYAIGVKEANQTQLREIAGEDKRVYYVHD----FDALKDIRNQVVQEICAEE 612 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 980 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFKKMKEFLASV------VQDFD 1029 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + T+ L+ + Sbjct: 1030 VSVNRVRIGAAQFSHTYRPEFPLGTFIGKKEISFQIENIQQIFGYTHIGAALREVGDYFR 1089 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG+ + E + +G +Y++GI + Sbjct: 1090 PDMGSRINAGTP-----QVLLVLTDGQ-----SQDEVAQAAEELRHKGIDIYSVGIGDVD 1139 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + + V N + I ++I Sbjct: 1140 DQQLIQITGTADKKLTVHNFDELTKVKKRIVRNI 1173 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 72/207 (34%), Gaps = 26/207 (12%) Query: 159 TSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T + V + D++ +LDVS + S D + + E V + N Sbjct: 212 TDDILVEVCQGPSVADLVFLLDVS------VNGSQENFDYLKEFLE---ESVSALDIKEN 262 Query: 218 VVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLVT+SN+ + L GV S + + I+ LS + ++ ++F Sbjct: 263 CMRVGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPRAGKAYTGAAIRKIRKEVFS--- 319 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +I V +T + ++ G ++ +GI + Sbjct: 320 ARNGSRKN-QGVPQIAVLVT-----HRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQL 373 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIG 362 + + P E S FS + Sbjct: 374 EKIASHP-----AEQHVSKLKTFSELA 395 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 72/210 (34%), Gaps = 30/210 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + IN M+ + P + + L +S+++ Sbjct: 25 DVVFLVDSSDHLG------AKSFPLVKTFINRMISSL---PIEAHKYRVALAQYSDQLHS 75 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L S + +L K G S L+ A+ F R + + Sbjct: 76 EFQLGTFKSR-NPMLNHLKKNFGFVGGSLRIGQALREAHRTYFS----RPDSGRDKKQFP 130 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + + G + ++G++ E L+A A+ + + Sbjct: 131 PILVVL-----ASAESEDDVEEPSKALRGDGVRIISVGLQSASEQE-LKAMATGQFHFNL 184 Query: 349 ENPHSMYDAF-----SHIGKDIVTKRIWYD 373 + + AF I + + + D Sbjct: 185 RSARDL-GAFAPNMTQIIKEATQYREVATD 213 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 49/289 (16%), Positives = 97/289 (33%), Gaps = 22/289 (7%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 A + +L + + ++ + + D + S Sbjct: 9 AAVGAAAGRSGKRLVAVVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ---TDARLDMMI 176 + + + P + T + + + +T + + T+ LD+++ Sbjct: 69 LSTEGTN-GVPDDPTLSAPAREKTVTANGDGTYKVALNVTGAKSAGTGAIVTNQPLDIVL 127 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAML----EEVKLIPDVNNVVQSGLVTF-----S 227 VLDVS SM KID ++N + +E I D + + LV F + Sbjct: 128 VLDVSGSMADNLSGGPKKIDALKTAVNGFIDATADENAKITDQSQRNRIALVKFAGTEKT 187 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TE 283 + +F+ W + + + L+ T++ GL + D R Sbjct: 188 SVGNDFYREGWSSYNYTQIVSNLTYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQP 247 Query: 284 DANYKKIIVFMTDGENLS-----TKEDQQSLYYCNEAKKRGAIVYAIGI 327 AN KK+++F TDGE ++ K G +Y+IG+ Sbjct: 248 RANAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAKSLKDAGTTIYSIGV 296 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 664 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVRDPAIALRKEGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 59/337 (17%), Positives = 123/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 675 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVRDPAIALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + + M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNTTAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 949 GANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKV--KYY 1088 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 M N ++++ +TDG + E + +G +Y++GI + Sbjct: 1089 FQPDMGSRINAGT---PQVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1141 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 82/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 411 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 464 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 465 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 522 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 570 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V ++ + + + ++I + D Sbjct: 571 VKEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAEEACRD 617 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 216 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + ++I+ LS + L+ +IF Q Sbjct: 267 VGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 322 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 323 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 377 Query: 339 CASPNSFY 346 + P + Sbjct: 378 ASHPAEQF 385 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 29/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNL 185 Query: 349 ENPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 186 RTARDL-SVFAPNMTEIIKDVTQYR 209 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 664 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVRDPALALRKEGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 60/337 (17%), Positives = 123/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 675 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR +L + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVRDPALALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + + M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 949 GANSWELLAMAGSGDKYYFVETFGGLKGIFSDVSASV 985 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKVKYYFQ 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG + E + +G +Y++GI + Sbjct: 1091 PDTGSRINAGTP-----QVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1141 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 83/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 411 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 464 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 465 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 522 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 570 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V + ++ + + + ++I + D Sbjct: 571 VKEANQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEICAEEACRD 617 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 69/188 (36%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 216 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + + I+ LS + L+ +IF Q Sbjct: 267 VGLVTYSNETRVISSLSTGNNKTEVLQLIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 322 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 323 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 377 Query: 339 CASPNSFY 346 + P + Sbjct: 378 ASHPAEQF 385 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFPP 132 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 133 ILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNLR 186 Query: 350 NPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 187 TARDL-SVFAPNMTEIIKDVTQYR 209 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 76/196 (38%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + K+ +K++ V + VQ G+V FS+ +E Sbjct: 618 DIMFLVDSSGSIGLE---NFIKMKTFMKNL------VSKSQIRADRVQIGVVQFSDVNKE 668 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S + I ++ G +T L + +G R N +K Sbjct: 669 EFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGAR-------PNVRKF 721 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 722 LILITDGE-----AQDIVKDPAVALREEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 776 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 777 ----FDILKHIEDDLV 788 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 55/337 (16%), Positives = 120/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + + IDR T +M G+ + + F + D Sbjct: 672 LNRYTSQEEISDAIDRMAHIGETTLM----GSALTFVSQYFSPAKGARPNVRKFLILITD 727 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 728 G----EAQDIVKDPAVALREEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILKH 782 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + ++ M+ VK Sbjct: 783 IEDDLVFGICSPHEECKRIEVLDVVFVIDSSGSIDH------DEYNIMKDF---MINLVK 833 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ + ++ G +T + L ++ + Sbjct: 834 KADVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDH 893 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + + + +G +V A+GI Sbjct: 894 MFTEARGSRLHKG-----VPQVLIVITDGES---HDADKLNATAKALRDKGILVLAVGIA 945 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 946 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 982 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 85/214 (39%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ ++ Sbjct: 978 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFRKMKEFLASV------IQDFD 1027 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 NN V+ G FS+ + F L G + +I+ + + T+ L+ + Sbjct: 1028 ISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAALRQVGHYFR 1087 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R H T ++++ +TDG+ + E + +G +Y++GI + Sbjct: 1088 PDMGSRIHAGTP-----QVLLVLTDGQ-----SQDEVAQAAEELRHKGIDIYSVGIGDVD 1137 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + N V N + I ++I Sbjct: 1138 DQQLVQITGTANKKLTVHNFDELKKVKKRIVRNI 1171 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 98/294 (33%), Gaps = 31/294 (10%) Query: 87 KNTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 ++ + N R V I + T L+ + E Y S + TF+ Sbjct: 341 QDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFL 400 Query: 146 P----WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T++ ++ T ++K D+ +++D S S + T Sbjct: 401 KKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGS------TQATDFQEMKTF 454 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNS 259 ++ + + P V+ G V ++++ + F + +++ I+ + + G + N+ Sbjct: 455 LSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNKHDVRKAIENIRQMGGNRNT 511 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L F + +++ +V +T+G N + Sbjct: 512 GAALN------FTLGLLQRAKQQRGGRVPCHLVVLTNG-----ASRDSVSGPANRLSEEL 560 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG+R + Y V + +DA I +V + + Sbjct: 561 IHVYAIGVREANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 610 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 73/188 (38%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + D++ +LD+S +SS D + + E + + + ++ Sbjct: 213 ILVEVCQGPSVADVVFLLDMST------NSSWEDFDYLKEFLE---ESISALDIKEHCMR 263 Query: 221 SGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +SN+ + L GV S + + I+ L+ + L+ ++F Q Sbjct: 264 VGLVAYSNETKVISTLSRGVNKSEVLQDIQSLAPQAGKAYTGAALRKIRKEVFSA----Q 319 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H + ++ +I V +T + +++G IV+ IG+ + + Sbjct: 320 HGSRKNQGVPQIAVLVT-----HSPSQDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKI 374 Query: 339 CASPNSFY 346 + P Y Sbjct: 375 ASHPAEQY 382 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 64/185 (34%), Gaps = 25/185 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + I IN M+ + P + GL +S+ + Sbjct: 26 DVVFLVDSSDHLG------IKSFPFVKAFINKMISSL---PIEADKYHVGLAQYSDGLHR 76 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 FLL S L K G S L+ + F R+ + Sbjct: 77 EFLLSTFKSRGPMLNHLKKNFGFLGGSLQVGKALREVHRAYFSSGRDRKQ-------FPP 129 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + GE+ E+ +K G + ++G++ + L+A A+ Y + Sbjct: 130 ILVVLASGESEDAVEEAA-----EALRKDGVRIVSVGMQGVSEKT-LKAMATGQFHYNLR 183 Query: 350 NPHSM 354 + Sbjct: 184 TVRDL 188 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 76/196 (38%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + K+ +K++ V + VQ G+V FS+ +E Sbjct: 618 DIMFLVDSSGSIGLE---NFIKMKTFMKNL------VSKSQIRADRVQIGVVQFSDVNKE 668 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S + I ++ G +T L + +G R N +K Sbjct: 669 EFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAKGAR-------PNVRKF 721 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 722 LILITDGE-----AQDIVKDPAVALREEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 776 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 777 ----FDILKHIEDDLV 788 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 55/337 (16%), Positives = 120/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + + IDR T +M G+ + + F + D Sbjct: 672 LNRYTSQEEISDAIDRMAHIGETTLM----GSALTFVSQYFSPAKGARPNVRKFLILITD 727 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 728 G----EAQDIVKDPAVALREEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILKH 782 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + ++ M+ VK Sbjct: 783 IEDDLVFGICSPHEECKRIEVLDVVFVIDSSGSIDH------DEYNIMKDF---MINLVK 833 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ + ++ G +T + L ++ + Sbjct: 834 KADVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDH 893 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + + + +G +V A+GI Sbjct: 894 MFTEARGSRLHKG-----VPQVLIVITDGES---HDADKLNATAKALRDKGILVLAVGIA 945 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 946 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 982 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 85/214 (39%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ ++ Sbjct: 978 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFRKMKEFLASV------IQDFD 1027 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 NN V+ G FS+ + F L G + +I+ + + T+ L+ + Sbjct: 1028 ISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAALRQVGHYFR 1087 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R H T ++++ +TDG+ + E + +G +Y++GI + Sbjct: 1088 PDMGSRIHAGTP-----QVLLVLTDGQ-----SQDEVAQAAEELRHKGIDIYSVGIGDVD 1137 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + N V N + I ++I Sbjct: 1138 DQQLVQITGTANKKLTVHNFDELKKVKKRIVRNI 1171 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 98/294 (33%), Gaps = 31/294 (10%) Query: 87 KNTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 ++ + N R V I + T L+ + E Y S + TF+ Sbjct: 341 QDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFL 400 Query: 146 P----WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T++ ++ T ++K D+ +++D S S + T Sbjct: 401 KKLRNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGS------TQATDFQEMKTF 454 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNS 259 ++ + + P V+ G V ++++ + F + +++ I+ + + G + N+ Sbjct: 455 LSEVAGMFNIAP---QKVRVGAVQYADRWDLEFEISKYTNKHDVRKAIENIRQMGGNRNT 511 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L F + +++ +V +T+G N + Sbjct: 512 GAALN------FTLGLLQRAKQQRGGRVPCHLVVLTNG-----ASRDSVSGPANRLSEEL 560 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG+R + Y V + +DA I +V + + Sbjct: 561 IHVYAIGVREANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 610 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 73/188 (38%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + D++ +LD+S +SS D + + E + + + ++ Sbjct: 213 ILVEVCQGPSVADVVFLLDMST------NSSWEDFDYLKEFLE---ESISALDIKEHCMR 263 Query: 221 SGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +SN+ + L GV S + + I+ L+ + L+ ++F Q Sbjct: 264 VGLVAYSNETKVISTLSRGVNKSEVLQDIQSLAPQAGKAYTGAALRKIRKEVFSA----Q 319 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H + ++ +I V +T + +++G IV+ IG+ + + Sbjct: 320 HGSRKNQGVPQIAVLVT-----HSPSQDNVTKAAVNLRRQGVIVFTIGVEGASDTQLEKI 374 Query: 339 CASPNSFY 346 + P Y Sbjct: 375 ASHPAEQY 382 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 64/185 (34%), Gaps = 25/185 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + I IN M+ + P + GL +S+ + Sbjct: 26 DVVFLVDSSDHLG------IKSFPFVKAFINKMISSL---PIEADKYHVGLAQYSDGLHR 76 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 FLL S L K G S L+ + F R+ + Sbjct: 77 EFLLSTFKSRGPMLNHLKKNFGFLGGSLQVGKALREVHRAYFSSGRDRKQ-------FPP 129 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + GE+ E+ +K G + ++G++ + L+A A+ Y + Sbjct: 130 ILVVLASGESEDAVEEAA-----EALRKDGVRIVSVGMQGVSEKT-LKAMATGQFHYNLR 183 Query: 350 NPHSM 354 + Sbjct: 184 TVRDL 188 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 89.8 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 613 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 663 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 664 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 718 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 719 --PNVRKFLILITDGE-----AQDIVRDPAIALRKEGVIIYSVGVFGSNVTQLEEISGKP 771 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 772 EMVFYVEN----FDILQHIEDDLV 791 Score = 88.7 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 59/337 (17%), Positives = 122/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 675 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVRDPAIALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + + M+ VK Sbjct: 786 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ ++ + +G +V A+GI Sbjct: 897 MFTEARGSRLHKG-----VPQVLIVITDGESNDA---EKLNTTAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 949 GANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 985 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + + Sbjct: 1031 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKV--KYY 1088 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 M N ++++ +TDG + E + +G +Y++GI + Sbjct: 1089 FQPDMGSRINAGT---PQVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1141 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1174 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 82/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 411 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 464 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 465 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 522 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 570 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V ++ + + + ++I + D Sbjct: 571 VKEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAEEACRD 617 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 216 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + ++I+ LS + L+ +IF Q Sbjct: 267 VGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 322 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 323 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 377 Query: 339 CASPNSFY 346 + P + Sbjct: 378 ASHPAEQF 385 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFPP 132 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 133 ILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNLR 186 Query: 350 NPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 187 TARDL-SVFAPNMTEIIKDVTQYR 209 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 89.4 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 46/204 (22%), Positives = 84/204 (41%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ M +K++ V + VQ G+V Sbjct: 593 EACRDMKADIMFLVDSSGSIGPE---NFSKMKMFMKNL------VSKSQIGADRVQIGVV 643 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ +E F L + S + I ++ G +T + L + +G R Sbjct: 644 QFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDKGAR----- 698 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE +K G I+Y++G+ + P Sbjct: 699 --PNVRKFLILITDGE-----AQDIVRDPAIALRKEGVIIYSVGVFGSNVTQLEEISGKP 751 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D HI D+V Sbjct: 752 EMVFYVEN----FDILQHIEDDLV 771 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 59/337 (17%), Positives = 123/337 (36%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F ++ + + IDR T + G+ + + F + D Sbjct: 655 LNTFMSQSDIANAIDRMTHIGETTL----TGSALTFVSQYFSPDKGARPNVRKFLILITD 710 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIVR ++ + YS+ L+ + P + +H Sbjct: 711 G----EAQDIVRDPAIALRKEG-VIIYSVGVFGSNVTQLEEISGKPEMVFYVENFDILQH 765 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ + + M+ VK Sbjct: 766 IEDDLVLGICSPREECKRIEVLDVVFVIDSSGSIDYQEYNIMKDF---------MIGLVK 816 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L+ G + ++ G +T + L ++ + Sbjct: 817 KADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDH 876 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R H ++++ +TDGE+ + ++ + +G +V A+GI Sbjct: 877 MFTEARGSRLHKG-----VPQVLIVITDGES---HDAEKLNTTAKALRDKGILVLAVGIA 928 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 S E L S + +Y VE + FS + + Sbjct: 929 GANSWELLAMAGSSDKYYFVETFGGLKGIFSDVSASV 965 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 81/214 (37%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 961 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFD 1010 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G+ FS+ FLL + +I+ + + T+ L+ + + Sbjct: 1011 VSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKV--KYY 1068 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 M N ++++ +TDG + E + +G +Y++GI + Sbjct: 1069 FQPDMGSRINAGT---PQVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVD 1120 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ + V N + I ++I Sbjct: 1121 DQELVQITGTAEKKLTVHNFDELKKVKKRIVRNI 1154 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 82/227 (36%), Gaps = 22/227 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ + T ++K D+ +++D S S + T ++ ++ Sbjct: 391 THTVSVFSERTETLKSACVDTEEADIYLLIDGSGS------TQPTDFHEMKTFLSEVVGM 444 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 445 FNIAPHK---VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFT 501 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + R + +V +T+G L ++ ++ V+AIG Sbjct: 502 LKLLQRAKKER------GSKVPCHLVVLTNG-----MSRDSVLGPAHKLREENIRVHAIG 550 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + Y V ++ + + + ++I + D Sbjct: 551 VKEANQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAEEACRD 597 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + + D++ +LD++ + S +D + E + + N ++ Sbjct: 196 IIVEACQGPSVADVVFLLDMA------INGSQEDLDHLKAFLG---ESISALDIKENCMR 246 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ L G + + ++I+ LS + L+ +IF Q Sbjct: 247 VGLVTYSNETRVISSLSTGNNKTEVLQRIQDLSPQVGQAYTGAALRKTRKEIFSA----Q 302 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +I V +T + ++ G ++ +GI E + Sbjct: 303 RGSRKNQGVPQIAVLVT-----HRASEDNVTKAAVNLRREGVTIFTMGIEGANPDELEKI 357 Query: 339 CASPNSFY 346 + P + Sbjct: 358 ASHPAEQF 365 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 6 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 56 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + L K G S L+ A+ F + + Sbjct: 57 EFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFPP 112 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 113 ILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNLR 166 Query: 350 NPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 167 TARDL-SVFAPNMTEIIKDVTQYR 189 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 89.4 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 54/337 (16%), Positives = 118/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + + + IDR T + G+ + + F + D Sbjct: 655 LNRYMSQNEISNAIDRMTHIGETTL----TGSALTFVSQYFSPAKGARPNVRRFLILITD 710 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 711 G----EAQDIVKDPAVALRQEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQH 765 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ S + M++ VK Sbjct: 766 IEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDHDEYSIMKDF---------MVDLVK 816 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L + ++ G +T + L ++ + Sbjct: 817 KADVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTYTAEALGFSDH 876 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R ++++ +TDGE+ + + + +G +V A+GI Sbjct: 877 MFTEARGSRLQKG-----VPQVLIVITDGES---HDADKLNATAKALRDKGILVLAVGIA 928 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 929 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 965 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 77/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + K+ +K++ V + VQ G+V FS+ +E Sbjct: 601 DIMFLVDSSGSIGLE---NFIKMKTFMKNL------VSKSQIGADRVQIGVVQFSDVNKE 651 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + I ++ G +T + L + +G R N ++ Sbjct: 652 EFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAKGAR-------PNVRRF 704 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 705 LILITDGE-----AQDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 759 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 760 ----FDILQHIEDDLV 771 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 77/223 (34%), Gaps = 26/223 (11%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + T ++K D+ +++D S S + T ++ ++ + Sbjct: 395 SVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIA 448 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 P V+ G V +++ + F + L + I+ + + G +TN+ L Sbjct: 449 P---QKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTGAALN------ 499 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + +++ +V +T+G L N ++ VYAIG++ Sbjct: 500 FTLGLLQKAKKQRGNRVPCHLVVLTNG-----MSKDSILEPANRLREELIRVYAIGVKEA 554 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + Y V + +DA I +V + + Sbjct: 555 NQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 593 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 961 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFRKMKEFLASV------VQDFD 1010 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + T+ L+ + Sbjct: 1011 VSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAALRQVGHYFR 1070 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG+ + + + +G +Y++GI + Sbjct: 1071 PDMGSRINAGTP-----QVLLVLTDGQ-----SQDEVARAAEDLRHKGVDIYSVGIGDVD 1120 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1121 DQQLIQITGTAGKKLTVHNFDELTKVKKRIVRNI 1154 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 26/204 (12%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + D++ +LDVS + S + + + E V + + ++ Sbjct: 196 ILVEVCQGPSVADVVFLLDVS------VNGSQENFEYLKEFLE---ESVSALDIKEHCMR 246 Query: 221 SGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ + L GV S + + I+ LS + + ++ ++F R Sbjct: 247 VGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGKAYTGAAIRKIRKEVFS---ARN 303 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +I V +T + ++ G ++ +GI + + Sbjct: 304 GSRKN-QGVPQIAVLVT-----HRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKI 357 Query: 339 CASPNSFYLVENPHSMYDAFSHIG 362 + P E S FS + Sbjct: 358 ASHP-----AEQHVSKLKTFSDLA 376 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 23/203 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S D TK +K+ + V +P + + + +S+++ Sbjct: 6 DVVFLVDSS-------DHLGTKSFPLVKAF--IHRMVSSLPIEAHKYRVAVAQYSDRLHS 56 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S L K L G S + L+ A+ F + + Sbjct: 57 EFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRTYFSAPAGGRDKK----QFPP 112 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ E L+A A+ + + Sbjct: 113 ILVVL-----ASAQSEDDVEEASKALREDGVRIVSVGLQSASEEE-LKAMATAQFHFNLR 166 Query: 350 NPHSMYDAFSHIGKDIVTKRIWY 372 + + AFS I+ + Y Sbjct: 167 SARDL-GAFSQNMTQIIKEATQY 188 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 89.4 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 54/337 (16%), Positives = 118/337 (35%), Gaps = 38/337 (11%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + + + IDR T + G+ + + F + D Sbjct: 675 LNRYMSQNEISNAIDRMTHIGETTL----TGSALTFVSQYFSPAKGARPNVRRFLILITD 730 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP-------WYTNSRH 153 + DIV+ ++ + YS+ L+ + P + +H Sbjct: 731 G----EAQDIVKDPAVALRQEG-IIIYSVGVFGSNVTQLEEISGRPEMVFYVENFDILQH 785 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I + + + R LD++ V+D S S++ S + M++ VK Sbjct: 786 IEDDLVFGICSPREECKRIEVLDVVFVIDSSGSIDHDEYSIMKDF---------MVDLVK 836 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N V+ G + +++ E F L + ++ G +T + L ++ + Sbjct: 837 KADVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTYTAEALGFSDH 896 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +G R ++++ +TDGE+ + + + +G +V A+GI Sbjct: 897 MFTEARGSRLQKG-----VPQVLIVITDGES---HDADKLNATAKALRDKGILVLAVGIA 948 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E L S + ++ VE + FS + + Sbjct: 949 GANPVELLAMAGSSDKYFFVETFGGLKGIFSDVSASV 985 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 77/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + K+ +K++ V + VQ G+V FS+ +E Sbjct: 621 DIMFLVDSSGSIGLE---NFIKMKTFMKNL------VSKSQIGADRVQIGVVQFSDVNKE 671 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + I ++ G +T + L + +G R N ++ Sbjct: 672 EFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAKGAR-------PNVRRF 724 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 725 LILITDGE-----AQDIVKDPAVALRQEGIIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 779 Query: 351 PHSMYDAFSHIGKDIV 366 +D HI D+V Sbjct: 780 ----FDILQHIEDDLV 791 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 77/223 (34%), Gaps = 26/223 (11%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + T ++K D+ +++D S S + T ++ ++ + Sbjct: 415 SVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMKTFLSEVVGMFNIA 468 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 P V+ G V +++ + F + L + I+ + + G +TN+ L Sbjct: 469 P---QKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTGAALN------ 519 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + +++ +V +T+G L N ++ VYAIG++ Sbjct: 520 FTLGLLQKAKKQRGNRVPCHLVVLTNG-----MSKDSILEPANRLREELIRVYAIGVKEA 574 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + Y V + +DA I +V + + Sbjct: 575 NQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 613 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 82/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ ++D++ ++D S S+ K+ + S+ V+ Sbjct: 981 VSASVCNSSKVD-CEIEKVDLVFLMDGSNSIH---PDDFRKMKEFLASV------VQDFD 1030 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + T+ L+ + Sbjct: 1031 VSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAALRQVGHYFR 1090 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + T ++++ +TDG+ + + + +G +Y++GI + Sbjct: 1091 PDMGSRINAGTP-----QVLLVLTDGQ-----SQDEVARAAEDLRHKGVDIYSVGIGDVD 1140 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1141 DQQLIQITGTAGKKLTVHNFDELTKVKKRIVRNI 1174 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 26/204 (12%) Query: 162 VKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + D++ +LDVS + S + + + E V + + ++ Sbjct: 216 ILVEVCQGPSVADVVFLLDVS------VNGSQENFEYLKEFLE---ESVSALDIKEHCMR 266 Query: 221 SGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLVT+SN+ + L GV S + + I+ LS + + ++ ++F R Sbjct: 267 VGLVTYSNETKVINSLSRGVNKSEVLQNIQNLSPWSGKAYTGAAIRKIRKEVFS---ARN 323 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +I V +T + ++ G ++ +GI + + Sbjct: 324 GSRKN-QGVPQIAVLVT-----HRPSEDNVTKAAVNLRREGVTIFTMGIEGASDSQLEKI 377 Query: 339 CASPNSFYLVENPHSMYDAFSHIG 362 + P E S FS + Sbjct: 378 ASHP-----AEQHVSKLKTFSDLA 396 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 23/203 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S D TK +K+ + V +P + + + +S+++ Sbjct: 26 DVVFLVDSS-------DHLGTKSFPLVKAF--IHRMVSSLPIEAHKYRVAVAQYSDRLHS 76 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S L K L G S + L+ A+ F + + Sbjct: 77 EFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRTYFSAPAGGRDKK----QFPP 132 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ E L+A A+ + + Sbjct: 133 ILVVL-----ASAQSEDDVEEASKALREDGVRIVSVGLQSASEEE-LKAMATAQFHFNLR 186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWY 372 + + AFS I+ + Y Sbjct: 187 SARDL-GAFSQNMTQIIKEATQY 208 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 89.4 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 84/203 (41%), Gaps = 20/203 (9%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + K + D++ ++D SRS+ ++ + A ++ ++E + + N + Sbjct: 26 AQKGPNCRTRPTDILFIIDSSRSVRP------SEFEQAKVFLSQVIESLDV---GANATR 76 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +++ ++ F L+ + L + +K + T + ++YA N F Sbjct: 77 VGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAMNNAFTESE--- 133 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + K+ + +TDG T +D + A++ G +YAIG+ + + + Sbjct: 134 GARIKSPGINKVAIVVTDGRPQDTVKDISAR-----ARESGLEIYAIGVGRVDKNTLRQI 188 Query: 339 CASP-NSFYLVENPHSMYDAFSH 360 + P + +S+ + S Sbjct: 189 ASEPLDEHVDYVESYSLIEKLSK 211 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 61/399 (15%), Positives = 132/399 (33%), Gaps = 58/399 (14%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV-HAATQ 64 + + N+ K ++ T P + + + +S + ++ + + Sbjct: 77 VGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAMNNAFTESEGAR 136 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 I + G + G +K+ + + L + + + ++T I P + Sbjct: 137 IKSPGINKVAIVVTDGRPQDTVKDISARARESGLE--IYAIGVGRVDKNTLRQIASEPLD 194 Query: 125 E------GYSISAISRYKIPLKFCTFIPWYTNSRH----IVMPITSSVKVNSQTDARL-- 172 E YS+ K FC + H I + S + L Sbjct: 195 EHVDYVESYSLIEKLSKKFQEAFCVTADLCSTGDHDCEQICVSTPGSYTCACRDGFTLNE 254 Query: 173 -------------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D++ ++D S+S+ ++ + IN +++ + + Sbjct: 255 DGKTCNACGASAVDLVFLIDGSKSVRP------ENFELVKQFINQIVDSMDVGE---RRA 305 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV +S+ + + F L ++ +K +S T + L+Y + F + Sbjct: 306 HVGLVQYSSSVRQEFPLGRYSSKKDIKSAVKKMSYMEKGTMTGQALQYLVDNSFAIS--- 362 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K+ + TDG + D + AK+ G ++A+G+ E Sbjct: 363 ---SGGRPAVPKVGIVFTDGRSQDYINDAAAR-----AKELGYKMFAVGVGNAVEEELRM 414 Query: 338 ACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P SFY + + A IGK + K + Sbjct: 415 IASEPQAEHSFYTAD-----FKAMKEIGKKLQMKICVEE 448 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 89.4 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 84/223 (37%), Gaps = 28/223 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S + A D++ +LD S S+ ++ + + + P Sbjct: 25 FISFLKMSKVADGCRTAPADLVFILDGSYSVGP------ENFEIIKSWLVNITRNFDIGP 78 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +Q G+V +S+ L +L ++++ + G +T + +++AY+ +F Sbjct: 79 ---KFIQVGVVQYSDYPVLEIPLGTHESTENLIKEMESIHYLGGNTKTGRAIQFAYDHLF 135 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 KI V +TDG K + EA+K ++AIG+ Sbjct: 136 ---------AKSSRFLTKIAVVLTDG-----KSQDEVKDVAAEARKNKITLFAIGVGSEI 181 Query: 332 SHEFLRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + L+A A S + VE+ ++ I + + + + Sbjct: 182 EEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEESVC 224 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 81/204 (39%), Gaps = 27/204 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D + D+M ++D S S+ + +K+ +K++ V + VQ G+V Sbjct: 614 EACKDMKADIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGVV 664 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ +E F L +S + I ++ G +T + L + +G R Sbjct: 665 QFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARP---- 720 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +K ++ +TDGE ++ G I+Y++G+ + P Sbjct: 721 ---NIRKFLILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVTQLEEISGRP 772 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + VEN +D I D+V Sbjct: 773 EMVFYVEN----FDILQRIEDDLV 792 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 56/356 (15%), Positives = 119/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 283 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 342 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 343 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 402 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 403 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 456 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 457 TFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 513 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 514 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 562 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIGI+ + Y V N +DA I +V + + Sbjct: 563 EHIRVYAIGIKEANQTQLREIAGEEKRVYYVHN----FDALKDIRNQVVQEICTEE 614 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 808 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 858 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + L+ ++ G ST + L ++ + N + Sbjct: 859 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 913 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 914 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIDGANPMELLAMAGSSDKYFFVE 970 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 971 TFGGLKGIFSDV 982 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 982 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFD 1031 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 1032 VSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFR 1091 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1092 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1141 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1142 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1175 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 81/249 (32%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + ++ A++ + T S+++ I K Sbjct: 156 KDGVKIISVGVQKASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTKYKEGAVD 215 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + + D++ +LD+S + S D + E V + N + Sbjct: 216 DIFVEACQGPSMADVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCM 266 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ + L G S + + I+ LS + +K ++F R Sbjct: 267 RVGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---AR 323 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +I V +T + ++ G ++ +GI + + Sbjct: 324 NGSRKN-QGVPQIAVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEK 377 Query: 338 ACASPNSFY 346 + P Y Sbjct: 378 IASHPAEQY 386 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 75/217 (34%), Gaps = 26/217 (11%) Query: 158 ITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I S + VN + D++ ++D S + S I M+ + P Sbjct: 11 ICSHISVNQDSGPEYADVVFLVDSSDRLGS------KSFPFVKMFITKMISSL---PIEA 61 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDM 273 + + L +S+K+ F L G S + ++ G S L+ A+ F Sbjct: 62 DKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSA 121 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + I+V + S++ + +K G + ++G++ Sbjct: 122 PTNGRDKK----QFPPILVVL-----ASSESEDDVEEASKALRKDGVKIISVGVQKASEE 172 Query: 334 EFLRACASPNSFYL---VENPHSMYDAFSHIGKDIVT 367 L+A A+ + V + +HI KD+ Sbjct: 173 N-LKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTK 208 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 89.1 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 52/290 (17%), Positives = 96/290 (33%), Gaps = 65/290 (22%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFF 188 S+ P K S + + + +V + T +D +VLDVS SM+ Sbjct: 148 ETGSQLGAPEKHKRIKKNDNGSYTVNVDVKGAVNSTTVTTTQPIDFTLVLDVSGSMDDPM 207 Query: 189 DS--SITKIDMAIKSINAMLEEV----------------------KLIPDVNNVVQSGLV 224 ++D +++ A L+E K +++ +SG Sbjct: 208 SKTDRTRRLDALKEAVKAFLDEAANTNTEAGSELVHVGLVKFAGDKTDKIGDDMYRSGGY 267 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 T++ + L ++ L+ K+ L G T + G A + + Sbjct: 268 TYN-YSQIVSNLTADMNGLKNKVSKLKAAGA-TRADNGFNRAVKVM--------GSASAR 317 Query: 285 ANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGIRV--------IR 331 + KK+++F DG S+ +++ E K GA VY+IGI Sbjct: 318 TDAKKVVIFFADGSPTSSSGFEGKVANKAVEAAKELKDGGAAVYSIGIFASANPSSLSSN 377 Query: 332 SHEFLRACAS-----------------PNSFYLVENPHSMYDAFSHIGKD 364 ++F+ A +S + N + F I K Sbjct: 378 ENQFMHAVSSNFPKATKYNQLGEGNIEAGYYKSATNASELNTIFDEIEKS 427 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 84/210 (40%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 15 SGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 65 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F G + Sbjct: 66 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGG 122 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 123 RSRSPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGSVD-KATLRQIA 176 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 177 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 205 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K I+ +++ + + + Sbjct: 245 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFISQIVDTLDVSD---KL 295 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 296 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 355 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 356 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 403 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 404 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 439 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ V+D SRS+ + + ++ +++ + + P+ + G Sbjct: 31 TGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIQSLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F + Sbjct: 82 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSR 141 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 142 ---SPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGRVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 97/276 (35%), Gaps = 34/276 (12%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 N+ D+ + + + + F + +KV Sbjct: 163 NEPDNTFSDYGYYVRFNENAKDGTWKVDTFSGGETNFNNVFLCEWGDYSVTG--NDGLKV 220 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+ D+++ LD+S SM+ +D K+ ++ + +N GLV Sbjct: 221 TSKKR---DIVLTLDISASMDGIP------LDETKKAAAKFVDSI-----LNKNSNIGLV 266 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++S++ + L+ I LS +TN GL AY+ + Sbjct: 267 SYSDEATSLSGICSNDVFLKNTITSLSS-AENTNIEDGLSRAYSML-----------QLG 314 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLRA 338 + KK+IV M+DG K+ ++ + Y + K +G ++Y +G + Sbjct: 315 QSKKKLIVLMSDGLPTLGKDGEELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYLMEK 374 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 AS Y V + + F + I ++ Y K Sbjct: 375 IASEGCHYEVSSSEDLVFFFEDVAGQIGGQKYIYVK 410 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 56/267 (20%), Positives = 109/267 (40%), Gaps = 33/267 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + T + T + M + V+ + D Sbjct: 384 PQTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQPKTQPMKV--QVECSRGVDI 441 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D++ ++D S S+ + ++ K+ ++ + K N VQ LV +S Sbjct: 442 KADIVFLVDGSYSIGT---ANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 492 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 493 HTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 546 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDGE+ D + + ++A+G++ E L A ASP Sbjct: 547 KVMILITDGESSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 600 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++T+ I Sbjct: 601 FTVED----FDAFQRIS-FVLTQSICL 622 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 97/279 (34%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLAASTTETLLSDLIPETQYVVTITSYNEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ ++ ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNN---------FKYILDFIVALVSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L +K + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGSRAG-------FPKVGIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 83/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S + ++ +A D+++++D S Sbjct: 1154 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA--DIVLLVDGS 1211 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1212 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRD 1262 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1263 KKSLLQAVANLPNKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1314 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1315 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLS 1369 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1370 KIVDDLTINLC 1380 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 80/210 (38%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 30 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 80 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 LV +++ +++ F L + L + ++ + T + +++A + F Sbjct: 81 LVNYASAVKQEFPLRAHGSKAALLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGE---GG 137 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + D + + G ++AIG+ + LR A Sbjct: 138 RARSPDISKVVIVVTDGRPQDSVRDVSARS-----RASGIELFAIGVGRVD-KATLREIA 191 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + K Sbjct: 192 SEPQDEHVDYVES-YSVIEKLSKKFQEAFC 220 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 260 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 310 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 311 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 370 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 371 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 418 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 419 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 454 >gi|327286711|ref|XP_003228073.1| PREDICTED: cartilage matrix protein-like [Anolis carolinensis] Length = 565 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ ++D SRS+ + + ++ ++E + + P+ + G+V + Sbjct: 106 CRTRPTDLVFLIDSSRSVRPN------EFEQVKVFLSQVIESLDVGPNA---TRVGVVNY 156 Query: 227 SNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ ++ F L+ + + ++ + T + +++A N+ F Sbjct: 157 ASAVKSEFSLKTHRTKASLLQAVRRIEPLSTGTMTGLAIQFAINRAFSEGE---GARVRV 213 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 K+ + +TDG +D + A+ G ++AIG+ + H LR ASP Sbjct: 214 PEINKVAIIVTDGRPQDAVKDVAAR-----ARALGIEIFAIGVGRVDMHT-LRLIASPPL 267 Query: 343 -NSFYLVEN---PHSMYDAFSH 360 VE+ + F Sbjct: 268 EEHVDYVESYSVIEKLTKKFQE 289 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 81/209 (38%), Gaps = 28/209 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A D++ ++D S+S+ ++ K IN +++ + + +V GLV +S+ Sbjct: 339 GAPTDLVFLIDGSKSVRP------ENFELVKKFINQIVDSLDVSDRNAHV---GLVQYSS 389 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F L ++ ++ +S T + L+Y + F + + Sbjct: 390 SVRQEFPLGQYKDKKDIKAAVRRMSYMEKGTMTGQALQYLVDSSFAIS------SGARPG 443 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NS 344 K+ + TDG + D +AK+ G ++A+G+ E + P Sbjct: 444 VPKVAIVFTDGRSQDYINDS-----AKKAKELGYKMFAVGVGNAVEDELKEIASEPVAEH 498 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ + ++ IGK + K + Sbjct: 499 YFYTADFRTINQ----IGKKLQNKICVEE 523 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 63/298 (21%), Positives = 102/298 (34%), Gaps = 63/298 (21%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFF 188 SA + P + + + + +T S TD +D+ +VLDVS SM F Sbjct: 201 SASAGLSTPEHTKSIEYQGNGAYTLKLDVTGKDASTSTTDTTPIDIALVLDVSGSMNDDF 260 Query: 189 D--SSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFF-------- 234 S +KI ++N+ L+E I D NN V+ LV ++N+I Sbjct: 261 GGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALVKYANQIGTATGADGCRIS 320 Query: 235 -LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI---------FDMQGMRQHCNTED 284 + + + ++ L T LK + N + M+ +Q Sbjct: 321 NSRQSDTGNCTQIVQEL------TTDAGLLKTSVNGLQAAGATYADAAMEVAQQALAGGR 374 Query: 285 ANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI------------ 327 A KK ++F TDGE ++ E K G VY+IGI Sbjct: 375 AGAKKYVIFFTDGEPNHWSGFDDDVANAAIKKSQELKNAGTTVYSIGIFDGANPSASVSS 434 Query: 328 ---------------RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L AS + +Y + + F+ I K I K + Sbjct: 435 ASNANKFMHGISSNYPNATGYRSLGDRASGDYYYSASSATQLAQIFNDIQKTITEKHV 492 >gi|109071570|ref|XP_001110086.1| PREDICTED: collagen alpha-1(XXI) chain-like isoform 2 [Macaca mulatta] Length = 959 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 77/212 (36%), Gaps = 28/212 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S + VE+ ++ I + + + + Sbjct: 186 PSSTYVFYVEDYIAISKIREVIKQKLCEESVC 217 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 88.7 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 82/216 (37%), Gaps = 27/216 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +++ D++ ++D SRS+ ++ + + +++ + + Sbjct: 18 TVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRP------SEFEQVKVFLAKVIDGLSVG 71 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI 270 PD + G+V ++++++ L+ + + + + T + +++A N Sbjct: 72 PDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + + + K+ + +TDG D A++ G ++AIG+ + Sbjct: 129 FSEAEGGR----KSPDISKVAIIVTDGRPQDNIRD-----IAARAREAGIEIFAIGVGRV 179 Query: 331 RSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 LR AS + VE+ + F Sbjct: 180 D-MTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 81/213 (38%), Gaps = 28/213 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ ++A D++ ++D S+S+ ++ K IN +++++ + +V GLV Sbjct: 260 SACSNAATDVVFLIDGSKSVRP------ENFELVKKWINLIIDKLDVSETNTHV---GLV 310 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ +++ F L L+ +K + T + L + + F Sbjct: 311 QYSSTVKQEFPLGRHNSKRSLKEAVKRMDYMERGTMTGHALSFLVDNSFGPNQ------G 364 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+ + TDG + D +AK G +YA+G+ E + P Sbjct: 365 ARPGVPKVGIVFTDGRSQDYIGDA-----AKKAKALGFKMYAVGVGNAVEDELREIASEP 419 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + ++ + +M I K + + Sbjct: 420 IADHYFYTADFKTMNQ----IAKKLQINVCQEE 448 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 85/210 (40%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F G + Sbjct: 82 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGG 138 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 139 RSRSPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGSVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K I+ +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFISQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 85/210 (40%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + F G + Sbjct: 82 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGG 138 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 139 RSRSPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGSVD-KATLRQIA 192 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 193 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 221 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K I+ +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFISQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 86/230 (37%), Gaps = 28/230 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + + LD++ ++D S+S+ ++ K IN ++E Sbjct: 247 CKEGFTLNNDGKTCSACSGGSGSALDLVFLIDGSKSVRP------ENFELVKKFINQIVE 300 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +++ Q GLV +S+ + + F L ++ +K ++ T + LKY Sbjct: 301 SLEV---SEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY 357 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + F + N K+ + TDG + D +AK G ++A+ Sbjct: 358 LVDSSFSI------ANGARPGVPKVGIVFTDGRSQDYITDA-----AKKAKDLGFRMFAV 406 Query: 326 GIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G+ E + P ++ + + S+IGK + K + Sbjct: 407 GVGNAVEDELREIASEPVAEHYFYTAD----FRTISNIGKKLQMKICVEE 452 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 82/205 (40%), Gaps = 24/205 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ ++D SRS+ + + ++ ++E + + P+ + G Sbjct: 29 RGTLCRTKPTDLVFIIDSSRSVRP------QEFEKVKVFLSRVIEGLDVGPNS---TRVG 79 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++ +++ ++ F L+ + + + ++ + T + +++A ++ F + Sbjct: 80 VINYASAVKNEFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFS---DTEGA 136 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N K+ + +TDG +D + A++ G ++AIG+ + H + + Sbjct: 137 RLRSPNINKVAIVVTDGRPQDGVQDVSAR-----ARQAGIEIFAIGVGRVDMHTLRQIAS 191 Query: 341 SP--NSFYLVEN---PHSMYDAFSH 360 P + VE+ + F Sbjct: 192 EPLDDHVDYVESYSVIEKLTHKFQE 216 >gi|146325834|sp|Q60847|COCA1_MOUSE RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor Length = 3120 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 97/279 (34%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLAASTTETLLSDLIPETQYVVTITSYNEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ ++ ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNN---------FKYILDFIVALVSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L +K + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGSRAG-------FPKVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 106/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + T + T + + + V + + Sbjct: 384 PQTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 83/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S + ++ +A D+++++D S Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA--DIVLLVDGS 1207 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1208 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRD 1258 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLS 1365 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1366 KIVDDLTINLC 1376 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 73/213 (34%), Gaps = 46/213 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S +S + +D++ ++D SRS+ T+ + A + + +++ +++ D Sbjct: 36 PSSTVSPASGSSCRNGPIDLVFIVDSSRSVRP------TEFEKAKEFLQDLVDSLEVGLD 89 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + GLV +++ + T + ++ A + F + Sbjct: 90 S---TRVGLVNYAS----------------------TPLASGTMTGLAIRTAVEKAFAAE 124 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ V +TDG E + A++ G +YA+G+ Sbjct: 125 ---AGARLNSTKVARVAVVVTDGRPQDEVERVSAA-----ARESGIEIYAVGVDRADRTS 176 Query: 335 FLRACAS---PNSFYLVEN---PHSMYDAFSHI 361 LR AS + + VE + F Sbjct: 177 -LRLMASQPHEDHVFYVETYGVIEKLTSRFRET 208 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 30 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 80 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +++ +++ F L VS + ++ + T + +++A + + Sbjct: 81 MVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRSR 140 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+++ +TDG + +D + A+ G ++AIG+ + LR A Sbjct: 141 ---SPDISKVVIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGRVD-KATLRQIA 191 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S V+ S Y + + Sbjct: 192 SEPQDEHVDYVES-YSVIEKLSRKFQEAFC 220 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 260 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 310 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 311 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 370 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 371 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 418 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 419 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 454 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 42/276 (15%), Positives = 98/276 (35%), Gaps = 24/276 (8%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 LR G + + + ++ + + + ++ L + Sbjct: 156 EALRRAGILVYAVGVKDAVWAELREIASSPQENFTSFVPSLSGLSSLAQKLRQELCDTLA 215 Query: 156 --MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + A D++ ++D S S+ + K+ + S+ V + Sbjct: 216 KAASRVDHISPACREGAVADVVFLVDSSTSIG---PQNFQKVKNFLYSV------VSGLD 266 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ V+ GL +++ I F L S + +I+ L TN+ L++ Sbjct: 267 VSSDRVRVGLAQYNDDIYPAFQLNQHPLKSMVLEQIQNLPYRTGGTNTGNALEFIRTSYL 326 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + +I++ +TDGE + + ++ K+ G +VY +GI V Sbjct: 327 TEGSGSRAKD----RVPQIVILVTDGE-----SNDEVQEAADQLKEDGIVVYVVGINVQD 377 Query: 332 SHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + + P + +EN + + D I + + Sbjct: 378 VQELQKIASEPFEKFLFNIENFNILQDFSGSILQTL 413 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 50/330 (15%), Positives = 110/330 (33%), Gaps = 42/330 (12%) Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG-----------FVNDIDD 109 AA + K + + +I +F + + D Sbjct: 716 AALDFLRNEVFIQEKGSRSSHGVQQIAVVITENFSQDSVSGPASRLRRAGVTIYAVGTQD 775 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + S L+ + +S+ S + I K + + + TS Sbjct: 776 VSESKDLEKMASYPPWKHSVPLESFLQLSIIGSKLTNQLCPEIVDSQVSVRGTSYPVQEG 835 Query: 167 QTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D+ ++D S S+ + + + + + + +K+ + VQ G++ Sbjct: 836 CVHIQKADIYFLIDGSGSI---YPEAFLDMKVFMNEV------IKMFQIGPDRVQFGVIQ 886 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++I+ F+L V+ L+ I + + G T + L + +T Sbjct: 887 YSDEIKSKFVLSQYPTVAELKVAIDNIQQGGGGTTTGEALN---------NMTQVFADTA 937 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N + ++ +TDG++ D + G I+YAIG+R E + Sbjct: 938 RINVARYLIVITDGKSSDPVADA-----AEGLRASGVIIYAIGVREANIDELKEIAK--D 990 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + V + D + +DI + + Sbjct: 991 KIFFVYEFDLLKDIQKEVVRDICSSEACKN 1020 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 80/210 (38%), Gaps = 27/210 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +++ D++ ++D S S+ + + + +M+ + + D +Q G Sbjct: 1014 SSEACKNSKADIVFLIDGSESISP------EDFERMKRFVESMVNQSNIGTDS---IQIG 1064 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FS+ +E F L S + I + + T + L + +G Sbjct: 1065 LLQFSSIPKEEFRLNRYSSKVDIYNAIFAVQQMRDGTRTGKALNFTLPFFESSKG----- 1119 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + ++ ++ +TDG + + + I++AIG+ + + L Sbjct: 1120 --GRPSVQQYLIVITDG-----VAQDNVILPAKALRDKNIIIFAIGVGEAKKSQLLEITN 1172 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + Y + ++ ++ K+I++K Sbjct: 1173 DEDRVYYDVD----FEVLQNLEKEILSKVC 1198 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 85/237 (35%), Gaps = 24/237 (10%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSMES 186 I++ K F S I+ + S+V+ + A D++++ D S Sbjct: 384 IASEPFEKFLFNIENFNILQDFSGSILQTLCSAVEGKIKEFIKAYADVVLLADTS----- 438 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQ 244 D+S T ++ ++ +++ + Q GL + ++ FLL + + + Sbjct: 439 -QDTSRTSFQWMQNFLSRVVGMLEV---GRDKYQIGLAQYGDQSHIEFLLNTYKTQNEMI 494 Query: 245 RKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I + G S + L+Y F + + + V +T G++ Sbjct: 495 AHIHERFVPLGGSRRTGKALQYLLQTFFQEEAGSRFL----QGIPQYAVVITSGKSKDEV 550 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 +D +++G V ++G++ E L SP+ Y ++ + Sbjct: 551 QDA-----AQRLREKGVKVMSVGVQDFDRRE-LEWMGSPDLVYDIQREDRVRHVVED 601 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 22/176 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S +S T+ + ++ + + ++ GL +S+ Sbjct: 34 DVVFLVDTS------INSQHTR--SVRNFLYILVNSFNV---SSKTIRVGLAQYSDVPHS 82 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 FLL + + R I+ L++ I D + +I Sbjct: 83 EFLLSTYHRKNDVLRHIRQFQFKPGGKKMGLALQF----ILDHHFQEAAGSRASQGVPQI 138 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 V M+ G + ++ G +VYA+G++ E +SP + Sbjct: 139 AVVMSSG-----PAEDHVHGPGEALRRAGILVYAVGVKDAVWAELREIASSPQENF 189 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 68/193 (35%), Gaps = 23/193 (11%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSH 242 E F I + ++ + + PD V+ GLV +S + F L+ + Sbjct: 639 EEFSRVRQPNFQQVISFLKTIVSLLSIRPDA---VRFGLVFYSEEPRLEFSLDTFQNPAE 695 Query: 243 LQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + L+ T + L + N++F + + ++I V +T EN Sbjct: 696 ILEHLDKLTYRERRGRTKTGAALDFLRNEVFIQEKGSRS----SHGVQQIAVVIT--ENF 749 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 S + ++ G +YA+G + + + L AS Y ++F Sbjct: 750 S---QDSVSGPASRLRRAGVTIYAVGTQDVSESKDLEKMAS----YPPWKHSVPLESFLQ 802 Query: 361 ---IGKDIVTKRI 370 IG + + Sbjct: 803 LSIIGSKLTNQLC 815 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 82/216 (37%), Gaps = 27/216 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +++ D++ ++D SRS+ ++ + + +++ + + Sbjct: 18 TVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRP------SEFEQVKVFLAKVIDGLSVG 71 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI 270 PD + G+V ++++++ L+ + + + + T + +++A N Sbjct: 72 PDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + + + K+ + +TDG D A++ G ++AIG+ + Sbjct: 129 FSEAEGGR----KSPDISKVAIIVTDGRPQDNIRD-----IAARAREAGIEIFAIGVGRV 179 Query: 331 RSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 LR AS + VE+ + F Sbjct: 180 D-MTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 81/214 (37%), Gaps = 28/214 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 S A D++ +LD S S+ ++ + + + P +Q G Sbjct: 27 TRASCRTAPADLVFILDGSYSVGP------ENFEIIKSWLVNITRNFDIGP---KFIQVG 77 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S+ L +L R+++ + G +T + +++A++ +F Sbjct: 78 VVQYSDYPVLEIPLGTHESTENLIREMESIHYLGGNTRTGRAIQFAFDHLF--------- 128 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 KI V +TDG K + EA+K ++AIG+ + L+A A Sbjct: 129 AKSSRFLTKIAVVLTDG-----KSQDEVKDVAAEARKNKITLFAIGVGSEIEEDELKAIA 183 Query: 341 ---SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S + VE+ ++ I + + + + Sbjct: 184 NKPSSTYVFYVEDYIAISRIKEVIKQKLCEESVC 217 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 86/230 (37%), Gaps = 28/230 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + + LD++ ++D S+S+ ++ K IN ++E Sbjct: 247 CKEGFTLNSDGKTCSACSGGSGSALDLVFLIDGSKSVRP------ENFELVKKFINQIVE 300 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +++ Q GLV +S+ + + F L ++ +K ++ T + LKY Sbjct: 301 SLEV---SEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY 357 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + F + N K+ + TDG + D +AK G ++A+ Sbjct: 358 LVDNSFSI------ANGARPGVPKVGIVFTDGRSQDYITDA-----AKKAKDLGFRMFAV 406 Query: 326 GIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G+ E + P ++ + + S+IGK + K + Sbjct: 407 GVGNAVEDELREIASEPVAEHYFYTAD----FRTISNIGKKLQMKICVEE 452 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 82/205 (40%), Gaps = 24/205 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ ++D SRS+ + + ++ ++E + + P+ + G Sbjct: 29 RGTLCRTKPTDLVFIIDSSRSVRP------QEFEKVKVFLSRVIEGLDVGPNS---TRVG 79 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++ +++ ++ F L+ + L + ++ + T + +++A ++ F Sbjct: 80 VINYASAVKNEFSLKTHQTKAGLLQAVRRIEPLSTGTMTGLAIQFAISRAFSDAE---GA 136 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N K+ + +TDG +D + A++ G ++AIG+ + H + + Sbjct: 137 RLRSSNINKVAIVVTDGRPQDGVQDVSAR-----ARQAGIEIFAIGVGRVDMHTLRQIAS 191 Query: 341 SP--NSFYLVEN---PHSMYDAFSH 360 P + VE+ + F Sbjct: 192 EPLDDHVDYVESYSVIEKLTHKFQE 216 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 45/352 (12%), Positives = 100/352 (28%), Gaps = 34/352 (9%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 G M +L AI + + +V+ I+++ + +T L S N Sbjct: 33 RHGAMLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSST--------DAAANAAATTLA 84 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 L + R + + N D ++ + + Sbjct: 85 DTLDRNLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVRV 144 Query: 135 YKIPL--KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 +P + + + T D+ +V+D S SM Sbjct: 145 NGQRTAGSLSGPVPLFFGNVTGTSIFEPEA-FATATYVERDITLVVDRSGSMAG------ 197 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 ++ + +I + + P Q GL +++++ E L + + + L Sbjct: 198 SRFNDLQAAIRIFTDLLATTPVDE---QIGLASYNDRASEDVQLTENFAEVNNAMDRLRT 254 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G T+ + G++ ++ ++ MTDG + E Sbjct: 255 -GGFTSISRGMQAGQEIAL--------RGRPPEFVERTMIVMTDGRHNRGPE---PRVVA 302 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 + G ++ I ++ A + N + D + I Sbjct: 303 TDLAADGVTIHTITFGAGADFGRMQDVARIGGGRHFHATNGDQLRDIYREIA 354 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 63/461 (13%), Positives = 136/461 (29%), Gaps = 109/461 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F N G + + A + ++ ++ G I++S + L ID + + AT + Sbjct: 6 ISKFNRNQDGSILPIFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVATDLST 65 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRN----ELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 L+ I RI+N++ + + + + D+D + ++ Sbjct: 66 -------TVLRDNQIKTRIENSFRANLSDAEFLDQAIDNLDFDVDSNAGTVTVSSSAGLN 118 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRH-------IVMPITSSVKVNSQTDAR----- 171 N +I + + F + +V+ +T S+ + Sbjct: 119 NYFLNIPGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAGDMGALRDAAEEV 178 Query: 172 LDMMIVLDVSRSMES------------FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +D++I D S S S + + S + E + + V Sbjct: 179 VDILIEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVNEREGQQKYTDAV 238 Query: 220 QS--GL-VTFSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVS 256 + G + + ++ +F+ ++G S I + Sbjct: 239 YNYDGTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKNTLISDIRNLSSG 298 Query: 257 TNSTPGLKYAY---------NQIF--DMQGMRQHCNTEDANYKKIIVFMTDGENL----- 300 + A+ ++ D T D + KK + MTDG+ Sbjct: 299 GGTGGQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKFALIMTDGDFNAQYGK 358 Query: 301 -------------------------------STKEDQQSLYYCNEAKKRGAIVYAIGI-- 327 + ++ C+ K ++ + Sbjct: 359 EERTTCTGRGRNRVCTTNEYWVERYHRYSDYNDPPATRARTLCDAMKAENIEIFTVFFDT 418 Query: 328 -RVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIV 366 + + CAS + +Y +N + AFS+I K I Sbjct: 419 GGSAFGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRIQ 459 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 78/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V + VQ G+V FS+ +E Sbjct: 622 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGVVQFSDINKE 672 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I ++ G +T + L + +G R N +K Sbjct: 673 EFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARP-------NIRKF 725 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 726 LILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 780 Query: 351 PHSMYDAFSHIGKDIV 366 +D I D+V Sbjct: 781 ----FDILQRIEDDLV 792 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 55/356 (15%), Positives = 119/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 283 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 342 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 343 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 402 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 403 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 456 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 457 TFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 513 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 514 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 562 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIGI+ + Y V + +DA I +V + + Sbjct: 563 EHIRVYAIGIKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICTEE 614 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 808 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 858 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + L+ ++ G ST + L ++ + N + Sbjct: 859 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 913 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 914 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFVE 970 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 971 TFGGLKGIFSDV 982 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 982 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFD 1031 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 1032 VSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFR 1091 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1092 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1141 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1142 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1175 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 93/296 (31%), Gaps = 31/296 (10%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + + E + G + + +E DN + V V + Sbjct: 109 KALQEAHRTYFSAPANGRDKKQFPPILVVLASSESEDNVEEASKALRKDGVKIISVGVQK 168 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARL- 172 ++ A++ + T S+++ I +K V + + Sbjct: 169 ASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVIKYKEGAVDDIFVEACQGPSMA 228 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD+S + S D + E V + N ++ GLV +SN+ + Sbjct: 229 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 279 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G S + + I+ LS + +K ++F R +I Sbjct: 280 INSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---ARNGSRKN-QGVPQI 335 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 V +T + ++ G ++ +GI + + + P Y Sbjct: 336 AVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQY 386 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 88.3 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 61/442 (13%), Positives = 126/442 (28%), Gaps = 84/442 (19%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F G + I+ AI L + +G I+ S +T + + +D + + + + + Sbjct: 16 RFVKTDGGNVAIIFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKDLASGV 75 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + K + N N N + + L E ++ Sbjct: 76 ITAGQVSAKAQSYFASLYN--NTEAPNITVTATYTAKDSTGSSTVLLKGTGDISTEFMNM 133 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS------------------QTDAR 171 I I + +T S+ Sbjct: 134 FGFPTLGIGSAATATWGGTRLRVAIALDVTGSMASAGKMPAMQSAAKTLVDNLRANAQTA 193 Query: 172 LDMMIVL-------------------------DVSRSMESFFDSSITKIDMAIKS----- 201 D+ I + D + S ++ ++ + + A ++ Sbjct: 194 DDLYISIIPFAQMVNVGKSNKNASWIKWDYWEDTTGSCNWWWLTTKSSCESAGRTWSSTN 253 Query: 202 ----INAMLEEVKLIPDVNNV-----VQSGLVTFSNKIEEFFLLEW-----GVSHLQRKI 247 + + + + + +S E+ + + ++ KI Sbjct: 254 QSQWGGCVTDRDQPADTTKDAPTTAATRFPAANYSACPEQILPMTSAYSSSNATTIKDKI 313 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-------- 299 LS G TN G+ +A+ + D + D Y I+ ++DG N Sbjct: 314 DALSPNG-GTNQPIGMHWAWMSLQDGAPLNTPAKDADYKYTDAIILLSDGMNTIDRWYGN 372 Query: 300 ---LSTKEDQQSLYYCNEAKKRGA------IVYAIGIRVIR--SHEFLRACASPNSFYLV 348 S D + C+ + A ++Y I + E L+ CA +F+ Sbjct: 373 GSSWSKDVDARQKLLCDNIRAASAASTTKTVIYTIQVNTDGDPESEVLKYCADSGNFFAT 432 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 + AF+ IG + RI Sbjct: 433 TTASGISTAFAQIGASLSKLRI 454 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 77/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V + VQ G V FS+ +E Sbjct: 650 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGAVQFSDINKE 700 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I ++ G +T + L + +G R N +K Sbjct: 701 EFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGAR-------PNVRKF 753 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 754 LILITDGE-----AQDIVKEPAIALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 808 Query: 351 PHSMYDAFSHIGKDIV 366 +D + D+V Sbjct: 809 ----FDILQRVEDDLV 820 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 54/356 (15%), Positives = 119/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 311 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 370 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 371 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 430 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 431 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 484 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 485 TFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 541 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 542 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 590 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG++ + Y V + +DA I +V + + Sbjct: 591 EHIRVYAIGVKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICTEE 642 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 836 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 886 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + L+ ++ G +T + L ++ + N + Sbjct: 887 VLFYLDDFGTKLEVISVLQNDQAMGGNTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 941 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 942 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIAGANPVELLAMAGSSDKYFFVE 998 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 999 TFGGLKGIFSDV 1010 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 1010 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQQN---DFKKMKEFLASV------VQDFD 1059 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 1060 VSLNRVRIGAAQFSDTYHPEFPLGAFIGEKEISFQIENIKQIFGNTHIGAALRKVEHYFR 1119 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1120 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1169 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1170 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1203 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 81/249 (32%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + ++ A++ + T S+++ I K Sbjct: 184 KDGVKIISVGVQKASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTKYKEGAVD 243 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + + D++ +LD+S + S D + E V + N + Sbjct: 244 DIFVEACQGPSMADVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCM 294 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ + L G S + + I+ LS + +K ++F R Sbjct: 295 RVGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---AR 351 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +I V +T + ++ G ++ +GI + + Sbjct: 352 NGSRKN-QGVPQIAVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEK 405 Query: 338 ACASPNSFY 346 + P Y Sbjct: 406 IASHPAEQY 414 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 25/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S I M+ + P + + L +S+K+ Sbjct: 55 DVVFLVDSSDRLGS------KSFPFVKMFITKMISSL---PIEADKYRVALAQYSDKLHS 105 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + ++ G S L+ A+ F + + Sbjct: 106 EFHLSTFKGRSPMLNHLRKNFGFIGGSLQIGKALQEAHRTYFSAPANGRDKK----QFPP 161 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL-- 347 I+V + S++ + +K G + ++G++ L+A A+ + Sbjct: 162 ILVVL-----ASSESEDDVEEASKALQKDGVKIISVGVQKASEEN-LKAMATSQFHFNLR 215 Query: 348 -VENPHSMYDAFSHIGKDIVT 367 V + +HI KD+ Sbjct: 216 TVRDLSMFSQNMTHIIKDVTK 236 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 76/200 (38%), Gaps = 21/200 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D SRS+ + + ++ ++E + + P+ + GLV +++ ++ Sbjct: 1 DLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASTVKP 51 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S + ++ + T + L++A + + K+ Sbjct: 52 EFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKALSDAE---GGRARSPDISKV 108 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + D A+ G ++AIG+ + LR AS V+ Sbjct: 109 VIVVTDGRPQDSVRDVS-----ERARASGIELFAIGVGRVD-KATLRQIASEPQDEHVDY 162 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 S Y+ + K Sbjct: 163 VES-YNVIEKLAKKFQEAFC 181 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 85/220 (38%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 221 TCNVCRGGGSGSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---RL 271 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ I + F L ++ ++ +S T + LKY N G Sbjct: 272 AQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 331 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 332 ARPGA-------QKVGIVFTDGRSQDYINDA-----ARKAKDLGFKMFAVGVGNAVEEEL 379 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + ++ + + + IGK + + + Sbjct: 380 REIASEPVADHYFYTAD----FKTINQIGKKLQKQICVEE 415 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 86/214 (40%), Gaps = 25/214 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S A D++ +LD S S++ ++ K + + + Q G+V Sbjct: 34 SCRTAPSDLVFILDGSWSVDDI------NFEIVKKWLVNITMSFNIGQ---KFTQVGVVQ 84 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S+ L + S L + ++ + G +TN+ +K+A +++F + Sbjct: 85 YSDDPFLHIPLGKHFSSSDLIKAMESIEYMGGNTNTGRAIKFANDKLFALSE------RG 138 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--- 340 KI V +TDG K + L A+K+G I++AIG+ LRA A Sbjct: 139 PNGIAKIAVVLTDG-----KSQDEVLAAAEAARKKGIILFAIGVGSETEEAQLRAIANKP 193 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S + V++ ++ I + + + + K Sbjct: 194 SSTYVFSVKDYKAIAKIREVIRQKLCEETVCPAK 227 >gi|111074529|ref|NP_031756.2| collagen alpha-1(XII) chain [Mus musculus] Length = 3061 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 97/279 (34%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLAASTTETLLSDLIPETQYVVTITSYNEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ ++ ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNN---------FKYILDFIVALVSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L +K + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGSRAG-------FPKVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 106/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + T + T + + + V + + Sbjct: 384 PQTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 83/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S + ++ +A D+++++D S Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA--DIVLLVDGS 1207 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1208 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRD 1258 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLS 1365 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1366 KIVDDLTINLC 1376 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 61/296 (20%), Positives = 103/296 (34%), Gaps = 45/296 (15%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKN---TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 +N+ N N + T ++ + F N + V Sbjct: 481 AVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVDSEGHFTNTATASGTDDEGNAVNS 540 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDV 180 PQ +I AI P K N+ + + +T + ++ T + +D +VLDV Sbjct: 541 PQ-ASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVTGAANSSTITTTQSVDFTLVLDV 599 Query: 181 SRSMESFFD---SSITKIDMAIKSINAMLEEVKLI--PDVNNVVQSGLVTFSNKIEE--- 232 S SM D SI K+ ++N L E I + +++ GLV F+ K Sbjct: 600 SSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVG 659 Query: 233 ----------------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L +S L+ K+ L G T + G K+A + Sbjct: 660 NETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHNGA-TRADLGFKHASTVMSGA--- 715 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI 327 + K++++F TDG + ++ Y K GA VY+IG+ Sbjct: 716 -------RTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYSIGV 764 >gi|332244061|ref|XP_003271190.1| PREDICTED: collagen alpha-1(XII) chain [Nomascus leucogenys] Length = 3100 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLAASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMP+ SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPVLSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTVNLC 1376 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 61/296 (20%), Positives = 103/296 (34%), Gaps = 45/296 (15%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKN---TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 +N+ N N + T ++ + F N + V Sbjct: 481 AVNDPNITNVSCGTDTLAANQQTTCSGTLTLTEDMVDSEGHFTNTATASGTDDEGNAVNS 540 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDV 180 PQ +I AI P K N+ + + +T + ++ T + +D +VLDV Sbjct: 541 PQ-ASVTIKAIKPLGAPEKHKRIKKNSDNTYTVNVDVTGAANSSTITTTQSVDFTLVLDV 599 Query: 181 SRSMESFFD---SSITKIDMAIKSINAMLEEVKLI--PDVNNVVQSGLVTFSNKIEE--- 232 S SM D SI K+ ++N L E I + +++ GLV F+ K Sbjct: 600 SSSMSDEMDSDQGSIKKMTALKSAVNNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVG 659 Query: 233 ----------------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L +S L+ K+ L G T + G K+A + Sbjct: 660 NETYTEGRFVYNYSQIVSPLTADMSDLKNKVSALRHNGA-TRADLGFKHASTVMSGA--- 715 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI 327 + K++++F TDG + ++ Y K GA VY+IG+ Sbjct: 716 -------RTDAKRVVIFFTDGTPTKVSDFDKDVANSAVTYAKSLKDSGATVYSIGV 764 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 75/209 (35%), Gaps = 26/209 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T ++ +++ V+D S SM + D + + ++ K I + + Sbjct: 13 FTFLIEDEKDKYDGINIAFVIDSSGSM-FYNDPNGLRREVTHKF----------IDRLTD 61 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + ++ F K L + + G TN + AY+ ++ R Sbjct: 62 NDMAAVIGFDYKATVLEQFTSNKEKLHDAVDKIRSDG-GTNIGRAVSIAYDLFNNLDNNR 120 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + Y K ++ +TDG+ ++E AKK G +Y IG+ S + L+ Sbjct: 121 K------EKYPKFLILLTDGDGDYSEEYTIL------AKKAGIKIYTIGLGNGVSEKLLK 168 Query: 338 ACA--SPNSFYLVENPHSMYDAFSHIGKD 364 A + ++ ++ + F I Sbjct: 169 DIAKGTDGEYFHAKDASKLNKIFEKIADK 197 >gi|149638912|ref|XP_001511941.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ornithorhynchus anatinus] Length = 3176 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 99/267 (37%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 R++SL++ + + Y IS + + I T + + + V + + Sbjct: 436 PRTSSLNVRDLSSDTEYQISVYAMKGLTSSEAVSIMEKTQPMKVQVECSRGVDIKA---- 491 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ + K+ ++ + K N VQ LV +S Sbjct: 492 --DIVFLVDGSYSIG---IGNFIKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 540 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 541 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVAS------KGSRSNVP 594 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG K + + ++A+G++ E L ASP Sbjct: 595 KVMILITDG-----KSSDAFKEPAIKLRNSDVEIFAVGVKDAVRSE-LETIASPPAETHV 648 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y VE+ +DAF I ++ T+ I Sbjct: 649 YTVED----FDAFQRISFEL-TQSICL 670 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 80/203 (39%), Gaps = 30/203 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + I I ++ V + G+V +S+ Sbjct: 192 DLIFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDVGEEKTRVGVVQYSSDTRT 242 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKK 289 F L + + L IK + G +T + + Y N D G R + K Sbjct: 243 EFNLNQYYQRNELLSAIKKIPYKGGNTMTGDAIDYLIKNTFTDAAGARIG-------FPK 295 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYL 347 + + +TDG K + E + RG V+++GI+ + E + ++P+ + Sbjct: 296 VAIVITDG-----KSQDEVEIPARELRNRGVEVFSLGIKAADAKELKQIASTPSLQHVFN 350 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 351 VAN----FDAIVDIQNEIISQVC 369 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 45/249 (18%), Positives = 82/249 (32%), Gaps = 24/249 (9%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y ++ + T +P SS+ + T A D+++++D S S Sbjct: 1202 GTTYKVNVFGMFDGGESLPLIGQEMTTLSDSTVPPISSMGLECLTRAEADIVLLVDGSWS 1261 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVS 241 + I+ ++E + P VQ L +S + L Sbjct: 1262 IG------RPNFKTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAYKDKK 1312 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 +L + L G +T + L F Q + +KI V +TDG Sbjct: 1313 NLLEAVANLPYKGGNTLTGMALN------FIRQNNFRPQAGMRPRARKIGVLITDG---- 1362 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 K + + G ++A+GI+ +E L+A A+ N + S I Sbjct: 1363 -KSQDDVEAPSKKLRDEGVELFAVGIKNADENE-LKAVATDPDDTHAYNVAD-FSFLSSI 1419 Query: 362 GKDIVTKRI 370 +D+ Sbjct: 1420 VEDLTANLC 1428 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 75/207 (36%), Gaps = 26/207 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D+ ++D S S+ S I K MLE ++ + V+ G V +S Sbjct: 435 DTKEVDIYFLIDGSGSIRSDHFEQIKKF---------MLEVIENFDIGPDKVRVGAVQYS 485 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + E+ F + + L++ I + + G T + L + I +G Q N Sbjct: 486 DTREKEFDITDYTTDETLRKAISNIRQLGGGTYTGEALDFILQII--KKGREQRINK--- 540 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 ++ +TDG ++ L + + V+AIGI + + Sbjct: 541 -VPCYLIVLTDGMSMD-----DVLEPAEKLRAENIAVHAIGIGEANRTQLQQIAGKEERV 594 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 +N +D+ +I +++ Sbjct: 595 SFGQN----FDSLKNIKNEVLHSICTE 617 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 48/228 (21%), Positives = 94/228 (41%), Gaps = 21/228 (9%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + +F ++ + + + LD++ VLD S S++ S+ Sbjct: 780 FQVESFSDLQEIENKLIFRVCALHDCKRIKE--LDVVFVLDHSSSIQPRDQESM------ 831 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 IN + VK N VQ G + +SN+ + F L S + ++ LS G T Sbjct: 832 ---INLTMHLVKKSDVGPNRVQFGALRYSNEPDIIFYLNSNRSAIMEYLRSLSAKGGDTY 888 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L+ A N +F +H + N K++++ +TDG++ + ++ + + Sbjct: 889 TAKALERA-NILFT----EEHGSRIKQNVKQMLIIITDGKS---HDHIHLSDKASKLRAK 940 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSF-YLVENPHSMYDAFSHIGKDI 365 G I+YA+G+ E L A + V N S+ D + + + + Sbjct: 941 GIIIYAVGVGEAN-QEELETMAGNKHYTIHVSNFDSLKDVYQPLQESM 987 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 84/234 (35%), Gaps = 28/234 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +F + + ++I + S+ D + D+M ++D S S+ Sbjct: 594 VSFGQNFDSLKNIKNEVLHSICTEKGCEDMKADIMFLVDSSGSIGP------ENFRTMKS 647 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTN 258 + +L ++++ D + G+V FS+ +E F L + + I+ +S +T Sbjct: 648 FMKNLLAKIQIGLDK---TRIGVVQFSDVTKEEFKLDTYFTQKEISDAIERMSPIEQNTL 704 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L + +G R +K ++ +TDGE + + Sbjct: 705 TGKALTSIEPYFTESKGARSM-------VRKFLILITDGE-----AQDDVRNPAKVLRDK 752 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 G +++A+G+ + + + VE+ + I ++ + Sbjct: 753 GVVIFAVGVFRANRTQLEEISGDSSLVFQVESFSDL----QEIENKLIFRVCAL 802 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 71/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P + GL +S+K+ Sbjct: 30 DVVFLVDSSDHLG------TKSFPYMKTFINKMINSL---PIETDKYHVGLAQYSDKLHT 80 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S L K G S L+ + F R+ + Sbjct: 81 EFQLNTFKSKGPMLNHLKKNFGFLGGSLRIGSALREVHRTYFSNGRDRKQ-------FPP 133 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + +K G + ++G++ + S E L+A A+ Y + Sbjct: 134 ILVVL-----ASADSEDAVEEAAEALRKDGVRIISVGMQRV-SEETLKAMATGQFRYNLR 187 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + AFS ++ + I Y + Sbjct: 188 TVRDL-SAFSQNMTQVLKEAIQYKE 211 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 71/199 (35%), Gaps = 26/199 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D+ D++ ++D S + + + + + + + ++ GL+++ Sbjct: 228 EKDSLADVIFLVDES-------VGTSQNLRNLQNFLKNITSYMDVKDNC---LRLGLMSY 277 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ E +L+ S Q++I+ LS +N A ++ + Sbjct: 278 SDRAETLSVLKSSTSQAEFQKQIQKLSLRAGKSNVG----AAVEKMRREGFSASSGSRRA 333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I V +T + D + + V+A+GI + + Sbjct: 334 LGVPQIAVLVT-----NRPSDDEVREATLNLRLDDVTVFAMGIHGANKTQLEEIVS---- 384 Query: 345 FYLVENPHSMYDAFSHIGK 363 Y SM ++++ +G Sbjct: 385 -YPPRQTISMLESYADLGN 402 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 78/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V + VQ G+V FS+ +E Sbjct: 623 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDQVQIGVVQFSDINKE 673 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I ++ G +T + L + +G R N +K Sbjct: 674 EFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARP-------NIRKF 726 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 727 LILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 781 Query: 351 PHSMYDAFSHIGKDIV 366 +D I D+V Sbjct: 782 ----FDILQRIEDDLV 793 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 809 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 859 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F LE + L+ ++ G +T + L ++ + N + Sbjct: 860 VLFYLEDFGTKLEVISVLQNDQAMGGNTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 914 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 915 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIAGANPAELLAMAGSSDKYFFVE 971 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 972 TFGGLKGIFSDV 983 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 55/356 (15%), Positives = 120/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 284 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 343 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 344 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 403 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 404 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 457 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++A++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 458 TFLSAVVGMFDIAPHK---VRVGAVQYADSWDLEFEINKYSNRQDLGKAIENIRQMGGNT 514 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 515 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 563 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIG++ + Y V + +DA I +V + + Sbjct: 564 EHIRVYAIGVKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 615 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 983 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFD 1032 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 1033 VSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALRKVEHYFR 1092 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 1093 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 1142 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 1143 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 1176 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 81/249 (32%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + + V + ++ A++ + T S+++ I K Sbjct: 157 KDGVKIISIGVQKASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTKYKEGAVD 216 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + + D++ +LD+S + S D + E V + N + Sbjct: 217 DIFVEACQGPSVADVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCM 267 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ + L G S + + I+ LS + +K ++F R Sbjct: 268 RVGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---AR 324 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +I V +T + ++ G ++ +GI + + Sbjct: 325 NGSRKN-QGVPQIAVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEK 378 Query: 338 ACASPNSFY 346 + P Y Sbjct: 379 IASHPAEQY 387 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 76/217 (35%), Gaps = 26/217 (11%) Query: 158 ITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I S + VN + D++ ++D S + S I M+ + P Sbjct: 12 ICSHISVNQDSGPEYADVVFLVDSSDRLGS------RSFPFVKMFITKMINSL---PIEA 62 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDM 273 + + L +S+K+ F L G S + ++ + G S L+ A+ F Sbjct: 63 DKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLRKNVGFIGGSLQIGKALQEAHRTYFSA 122 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + ++V S++ + + +K G + +IG++ Sbjct: 123 PAD--GRDKKQFPPILVVV-------ASSESEDDVVEASKALQKDGVKIISIGVQKASEE 173 Query: 334 EFLRACASPNSFYL---VENPHSMYDAFSHIGKDIVT 367 L+A A+ + V + +HI KD+ Sbjct: 174 N-LKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVTK 209 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 100/269 (37%), Gaps = 33/269 (12%) Query: 109 DIVRSTSLDIVVV-PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + + E Y+++ + + + +++ T + + R+ I + Sbjct: 327 SYPPDHYISEQNSYAELEKYTVTLLKKLQYEVQYRTSVQYGQTERYRTGCIETEKA---- 382 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+ ++D S S+ +F + +++ + P V+ G V +S Sbjct: 383 -----DIYFLIDGSGSVRPYF-------HEIKNFMIEVIQMFNVAP---EKVRFGAVQYS 427 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ F + V L + I + + G +TN+ L+ F ++ ++ + Sbjct: 428 STHQQEFGISTYSNVVDLSKAILNIRQLGHTTNTGAALR------FMLEEIKSSKKKQHD 481 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + ++ +TDG + + ++ I++AIG++ E S + Sbjct: 482 SVPCHLLVVTDG-----MSNDEVSEPAKRLREEDVIIHAIGVKGANIIELKDIAGSEDRV 536 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 V N S+ + I + I +R DK Sbjct: 537 KFVYNFESLKGIKNEIVQTICRERDCKDK 565 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 81/195 (41%), Gaps = 20/195 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ + + I ++E+ + + VQ G + +S+ E Sbjct: 753 LDIVFVLDDSSSISEH------QYESLINFTKHLVEKADV---GRDRVQFGALKYSDTPE 803 Query: 232 EFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + + I L + G +T + L ++ QH + K+ Sbjct: 804 ILFYLNDYQNK-KSIIDKLKFQRKGGNTYTAKALNRSHELFT-----EQHGSRIKRGVKQ 857 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG++ + + + + I+Y +G+ E L S +++++V+ Sbjct: 858 MLIVITDGKS---HDYLELETVGKALRAKNIIIYGVGVAEASDKELLDITGSKDNYFMVD 914 Query: 350 NPHSMYDAFSHIGKD 364 N + + + I K Sbjct: 915 NFEKLKEIYLPIEKK 929 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D R D+M ++D S+S+ + + KI + +K V + V G+ + Sbjct: 562 CKDKRADVMFLVDSSKSIGA---PAFNKIKIFMK------NMVNKSHVGLDQVHIGVGQY 612 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN +E F L ++ + I ++ G T + L++ + + Sbjct: 613 SNVFKEEFGLTEYMTQRDIFDAIDRMNFIGDKTYTGKALEFVSSYF-------EPLKGGR 665 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +TDG + ++ + I +S + PN Sbjct: 666 KGVKKFLILITDG-----FSQDSITEPALHLRNNEVVILLVAIFRSKSPQLEDITGDPNL 720 Query: 345 FYLVENPHS 353 Y VE Sbjct: 721 VYYVEKFDD 729 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 44/288 (15%), Positives = 98/288 (34%), Gaps = 42/288 (14%) Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRD---NGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 N K+ G + R+ ++ S ++ +I+ + + E + Sbjct: 1785 NAFKRTLPGPNVRRLTVFFSDSQSPDISSIITATMEFSGLNIIPAVFAFNDMFALEEAFQ 1844 Query: 129 ISAISRYK-IPLKFCTF------IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y+ IP++ F + T P + + Q + +D + +LD S Sbjct: 1845 FDDTGTYQVIPVRPSQFSHPIERLRRCTLCYDKCFPNVCAEEQIDQVHSYVDAIFLLDSS 1904 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVV--QSGLVTFS----------N 228 R + + + ++++L+ + V + L+++S Sbjct: 1905 RHVRG------DEFEQMKAFLSSVLDNFDISADPEQSDVGDRLALLSYSPRDFSIRRTST 1958 Query: 229 KIEEFFLLEWGVSH--LQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 +++ F S ++R I L + L++ N +F H Sbjct: 1959 PVKKEFDFTTYSSKALMRRHIHNSLKQLHGEAYVGHALQWTTNMVFSNTF---HSRKN-- 2013 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K+I +T GEN K+ + + AK +G +++ I + R Sbjct: 2014 ---KVIFVITAGENNENKDVLRKISL--RAKCQGYVIFVISLGSTREE 2056 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/295 (12%), Positives = 97/295 (32%), Gaps = 41/295 (13%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 ++ +E + + D + I + +L+ ++ + ++ Sbjct: 82 ILVVLASAKSEDDVEGPAEALRRDGVKIISLGIQDASEEDLKAMATSQYHYNLRTARNVG 141 Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I +N I ++K ++ + + D+ D++ + Sbjct: 142 I--FSENMTRIIKETIKHKEIIQ--------------DDDEVGEFSILCERDSLADIVFL 185 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD S + ++ K + + + + + G+++FS++ E LE Sbjct: 186 LDESLKVRRN-------LENLQKFLENITASMDI---KAGCTRIGILSFSDETEVVASLE 235 Query: 238 WGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 ++H + + I+ LS +K A + + H ++I V +T Sbjct: 236 RDMTHDEIIQAIRSLSLRDGKAYIGAAMKKA-------RELFSHSRK-AQGVRQIAVLVT 287 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + + +G ++ IGI+ +F + + P Y+ E Sbjct: 288 -----HRSSEDDMHEAAEDLLLQGVSIFGIGIKGYNITQFKQMVSYPPDHYISEQ 337 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 76/209 (36%), Gaps = 22/209 (10%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ V+D SRS+ + + ++ ++E + + P+ + Sbjct: 30 ARGQLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRV 80 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +++ + + F L + L + ++ + T + +++A + + Sbjct: 81 GVVNYASAVRQEFPLRAHGSKAALLQAVRRIRPLATGTMTGLAIQFAITKALSDAEGGRV 140 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + K+++ +TDG + D A+ G ++AIG+ LR Sbjct: 141 TSPG---ISKVVIVVTDGRPQDSVRDVS-----ERARASGVELFAIGVGGRVDKATLRQI 192 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDI 365 AS VE+ + + Sbjct: 193 ASEPQDEHVDYVESYSVIEKLSKKFREAF 221 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 262 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---RL 312 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 313 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 372 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 373 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 420 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + + + Sbjct: 421 REIASEPVAEHYFYTAD----FKTITQIGKKLQKRICVEE 456 >gi|148694466|gb|EDL26413.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] gi|148694467|gb|EDL26414.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] Length = 2886 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 97/279 (34%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLAASTTETLLSDLIPETQYVVTITSYNEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ ++ ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNN---------FKYILDFIVALVSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L +K + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGSRAG-------FPKVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 106/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + T + T + + + V + + Sbjct: 384 PQTTTLNVRDLTADTEYQISVFAMKGLTSSEPTSVMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 83/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S + ++ +A D+++++D S Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMDCLTRAEA--DIVLLVDGS 1207 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1208 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRD 1258 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLS 1365 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1366 KIVDDLTINLC 1376 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 127/349 (36%), Gaps = 53/349 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 682 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 728 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I ++ Sbjct: 729 KKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 788 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R ++ + + T LD++ VLD S S++ + + IN Sbjct: 789 HLKALERKLIFRVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 837 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 838 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKAL 897 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + N K++++ +TDGE+ + + E + +G ++ Sbjct: 898 KHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIF 949 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G+ E + N+ V+N + D F+ + + Sbjct: 950 AVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 994 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 778 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 779 SLVFHVENFDHL 790 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 +DA I ++V + I +K Sbjct: 602 ----FDALKSIKNEVV-REICAEK 620 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P N + L +S++ Sbjct: 30 DVVFLVDSSDHLGP------KSFPFVKTFINKMINSL---PIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALQKDGVKIISVGVQKASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|297678514|ref|XP_002817115.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 1 [Pongo abelii] Length = 3115 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLAASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 1191 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQVALAQYS 1241 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 1242 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 1295 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 1296 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 1350 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 1351 -HAYNVAD-FESLSRIVDDLTINLC 1373 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 41/196 (20%), Positives = 78/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V + VQ G+V FS+ +E Sbjct: 623 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGVVQFSDINKE 673 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I ++ G +T + L + +G R + +K Sbjct: 674 EFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGSRP-------SVRKF 726 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 727 LILITDGE-----AQDIVKEPAVALRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 781 Query: 351 PHSMYDAFSHIGKDIV 366 +D I D+V Sbjct: 782 ----FDILQRIEDDLV 793 Score = 82.5 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 91/227 (40%), Gaps = 24/227 (10%) Query: 144 FIPWYTNSRHIVMPITSSVKVNS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F+ + + I+ +T+SV +S ++D++ ++D S S++ + K+ + Sbjct: 969 FVETFGGLKGIISDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQ---QTDFKKMKEFMV 1025 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTN 258 S+ V+ N V+ G FS+ F L G + +I+ +++ +T+ Sbjct: 1026 SV------VQDFDVSNKRVRIGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNTH 1079 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 L+ + G R + ++++ +TDG+ + ++R Sbjct: 1080 IGAALRKVEHYFRPDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRRR 1129 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 G +Y++GI + + ++ + V N + I ++I Sbjct: 1130 GIDIYSVGIGDVDDQQLMQITGTAEKKLTVHNFDELKKVNKRIVRNI 1176 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 77/205 (37%), Gaps = 23/205 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M++ V+ N V+ G + +++ E Sbjct: 809 LDVVFVIDSSGSI------DYDEYNIMKDF---MIDIVRKADVGMNRVRFGALKYADDPE 859 Query: 232 EFFLLEW-GVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ G + ++ G +T + L ++ + N + Sbjct: 860 VLFYLDNFGTKPEVISVLQTDQAMGGNTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 914 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG++ + + + +G +V A+GI E L S + ++ VE Sbjct: 915 VLIVITDGDS---HDADKLNATAKALRDKGILVLAVGIADANPVELLAMAGSSDKYFFVE 971 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I D+ K Sbjct: 972 TFGGL----KGIISDVTASVCNSSK 992 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 55/356 (15%), Positives = 118/356 (33%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 284 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 343 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + T L+ + E Y + + T Sbjct: 344 DSEDNVTKAAVNLRREGVTIFTLGIKGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 403 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 404 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 457 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 458 IFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQLGGNT 514 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + +Q N +V +T+G L N ++ Sbjct: 515 NTGAALNFTLSLL--QKAKKQRGNK----VPCHLVVLTNG-----MSKDSILEPANRLRE 563 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIGI+ + Y V + +DA I +V + + Sbjct: 564 EHIRVYAIGIKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICAEE 615 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 83/249 (33%), Gaps = 31/249 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------- 163 + V V + ++ A++ + T S+++ I K Sbjct: 157 KDGVKIISVGVQKASEENLKAMATSQFHFNLRTVRDLSMFSQNMTQIIKDVAKYKEGAVD 216 Query: 164 ---VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + + D++ +LD+S + S +D + E V + N + Sbjct: 217 DIFVEACQGPSMADVVFLLDMS------INGSDENLDYLKGFLE---ESVSALDIKENCM 267 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV +SN+ + L G S + + I+ LS + +K ++F R Sbjct: 268 RIGLVAYSNETKVINSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFS---AR 324 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +I V +T + ++ G ++ +GI+ + + Sbjct: 325 NGSRKN-QGVPQIAVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIKGASDTQLEK 378 Query: 338 ACASPNSFY 346 + P Y Sbjct: 379 IASHPAEQY 387 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 70/206 (33%), Gaps = 24/206 (11%) Query: 154 IVMPITSSVKVNSQTDAR--LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + IT S SQ D++ ++D S + S I M+ + Sbjct: 7 LFLMITCSHISVSQDSGPKYADVVFLVDSSDRLGS------RSFPFVKMFITKMINSL-- 58 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYN 268 P + + L +S+K+ F L G S + +K G S L+ A+ Sbjct: 59 -PIEADKYRVALAQYSDKLHSEFHLSTFKGRSPMLNHLKKNFGFIGGSLQIGKALQEAHR 117 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + I+V + S++ + +K G + ++G++ Sbjct: 118 TYFSATTNGRDKK----QFPPILVVL-----ASSESEDDVEKASKALQKDGVKIISVGVQ 168 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 L+A A+ + + + Sbjct: 169 KASEEN-LKAMATSQFHFNLRTVRDL 193 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 127/349 (36%), Gaps = 53/349 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 682 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 728 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I ++ Sbjct: 729 KKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 788 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R ++ + + T LD++ VLD S S++ + + IN Sbjct: 789 HLKALERKLIFRVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 837 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 838 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKAL 897 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + N K++++ +TDGE+ + + E + +G ++ Sbjct: 898 KHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIF 949 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G+ E + N+ V+N + D F+ + + Sbjct: 950 AVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 994 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 778 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 779 SLVFHVENFDHL 790 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 +DA I ++V + I +K Sbjct: 602 ----FDALKSIKNEVV-REICAEK 620 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P N + L +S++ Sbjct: 30 DVVFLVDSSDHLGP------KSFPFVKTFINKMINSL---PIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALQKDGVKIISVGVQKASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|260782980|ref|XP_002586557.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] gi|229271674|gb|EEN42568.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] Length = 534 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 83/224 (37%), Gaps = 31/224 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDAR----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 ++R I S+ + S LD++ V+D S S+ S S+ K Sbjct: 120 SARTIERSDCKSLIIESGPGTPAKTELDLVFVIDGSGSISSVSFGSVMKFAA-------- 171 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 + + + G+V +S + F+L+ + +++ I + + G TN+ L Sbjct: 172 -DMSLRLDISATTTRVGMVQYSTNVTPEFMLKEHTTKKSVEKAIGDVKRLGGGTNTGKAL 230 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+ R + D K++ + +TDG K ++ G ++Y Sbjct: 231 KFV----------RTEMDWRDPPTKRVAIVVTDG-----KSQDDVGTPATALRQAGVVLY 275 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+G+ + + E P Y + + + D I +V Sbjct: 276 AVGVGLP-TDELKEITGDPTKVYALNSYDELQDIIQDISNSVVE 318 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 127/349 (36%), Gaps = 53/349 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 627 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 673 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I ++ Sbjct: 674 KKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 733 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R ++ + + T LD++ VLD S S++ + + IN Sbjct: 734 HLKALERKLIFRVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 782 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 783 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKAL 842 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + N K++++ +TDGE+ + + E + +G ++ Sbjct: 843 KHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIF 894 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G+ E + N+ V+N + D F+ + + Sbjct: 895 AVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 939 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 565 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 615 Query: 225 TFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 616 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 673 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 674 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 723 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 724 SLVFHVENFDHL 735 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 387 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 437 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 438 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 491 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 492 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQN 546 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 +DA I ++V + I +K Sbjct: 547 ----FDALKSIKNEVV-REICAEK 565 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 53/150 (35%), Gaps = 13/150 (8%) Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLK 264 + +P N + L +S++ F L G S + +K G S L+ Sbjct: 1 MINSLPIEANKYRVALAQYSDEFHSEFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQ 60 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A+ F + + + I+V + S + + + +K G + + Sbjct: 61 EAHRTYFSAPINGR----DRKQFPPILVVL-----ASAESEDEVEEASKALQKDGVKIIS 111 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSM 354 +G++ L+A A+ + + + + Sbjct: 112 VGVQKASEEN-LKAMATSHFHFNLRTIRDL 140 >gi|297291177|ref|XP_001109727.2| PREDICTED: collagen alpha-1(XII) chain-like [Macaca mulatta] Length = 3095 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLAASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLIFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 44/248 (17%), Positives = 72/248 (29%), Gaps = 45/248 (18%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 S+ I+ ++E + P VQ L +S L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEGQL------ 1253 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 S + G+ +KI V +TDG Sbjct: 1254 -NAHRDKKSXXXXXXXTQAGM--------------------RPRARKIGVLITDG----- 1287 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 K + K G ++AIGI+ E P+ N +++ S I Sbjct: 1288 KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSRIV 1345 Query: 363 KDIVTKRI 370 D+ Sbjct: 1346 DDLTINLC 1353 >gi|114608138|ref|XP_518589.2| PREDICTED: collagen, type XII, alpha 1 isoform 3 [Pan troglodytes] Length = 3119 Score = 87.1 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|126310411|ref|XP_001373784.1| PREDICTED: similar to collagen type XII alpha 1 [Monodelphis domestica] Length = 3116 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 55/277 (19%), Positives = 105/277 (37%), Gaps = 37/277 (13%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 + ++TSL++ + + Y IS S + I T + + + Sbjct: 366 GSKQQALSVGPQTTSLNVRDLSADTEYQISVYSMKGLTSSEPISIMEKTQPMKVQVECSR 425 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 V + + D++ ++D S S+ ++ K+ ++ + VK N VQ Sbjct: 426 GVDIKA------DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQ 470 Query: 221 SGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV +S F L+ + I G STN+ + Y +IF Sbjct: 471 ISLVQYSRDPHTEFTLKKFTRVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVAS---- 526 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +N K+++ +TDG++ D + + ++A+G++ E L A Sbjct: 527 --KGSRSNVPKVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEA 578 Query: 339 CASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 ASP + VE+ +DAF I ++ T+ I Sbjct: 579 IASPPPETHVFTVED----FDAFQRISFEL-TQSICL 610 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 47/244 (19%), Positives = 87/244 (35%), Gaps = 38/244 (15%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + P+ P T + + + D++ ++D S S+ Sbjct: 101 QLTIQTGGPVDPKEKKPGQTEIQKCSLSAWT------------DLVFLVDGSWSVGRN-- 146 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKI 247 + I I ++ V + G+V +S+ F L + L I Sbjct: 147 -NFKYILDFIAAL------VSAFDVGEGKTRVGVVQYSSDTRTEFNLNQYYQRKDLLAAI 199 Query: 248 KYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 K + G +T + + Y N + G R + K+ + +TDG K Sbjct: 200 KKIPYKGGNTMTGDAIDYLIKNTFTESAGARVG-------FPKVAIIITDG-----KSQD 247 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + E + G V+++GI+ + E LR ASP S V N + +DA I +I+ Sbjct: 248 EVEIPARELRNIGVEVFSLGIKAADAKE-LRQIASPPSLKHVFNVAN-FDAIVDIQNEII 305 Query: 367 TKRI 370 ++ Sbjct: 306 SQVC 309 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 49/251 (19%), Positives = 81/251 (32%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMP S + T A D+++++D S Sbjct: 1142 GTTYKVNVFGMFDGGESSPLIGQEMTTLSDTTVMPFLSK-GMECFTRAEADIVLLVDGSW 1200 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---EWG 239 S+ I+ ++E ++ P VQ L +S + L + Sbjct: 1201 SIG------RANFKTIRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHKDR 1251 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 S L + L G +T + L F Q + +KI V +TDG Sbjct: 1252 RSLLDS-VANLPYKGGNTLTGMALN------FIRQNNFRPQAGMRPRARKIGVLITDG-- 1302 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +D+ S Sbjct: 1303 ---KSQDDVEAPSKKLKDDGVELFAIGIKNADESELKMIATDPDDT-HAYNVAD-FDSLS 1357 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1358 RIVDDLTVNLC 1368 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 83/216 (38%), Gaps = 29/216 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVV 219 S++ LD++ +LD S ++ + TK+ + + L ++N +V Sbjct: 522 SLQRPGCQSQSLDLVFLLDASATVGQE---NFTKLQ-------SFVRGSSLQFNINRDVT 571 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q GLV + ++++ F L+ S L + I + ++ L + Y ++ +Q Sbjct: 572 QIGLVVYGSRVQTTFALDTHPTSSSLLQAISQAPYMDGAGSTGSALLHVYEEVMTVQ--- 628 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K +V +T+G + ++ + + G V I + + LR Sbjct: 629 ---KGARPGVSKAVVVITEGTGT-----EDAVVPAQKLRNNGISVLVIAVGPVLKETLLR 680 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 SP+ V + + +D+ +RI + Sbjct: 681 LAGSPDFLIHVASYEDL-----ENYQDLFIERICEE 711 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 43/189 (22%), Positives = 81/189 (42%), Gaps = 24/189 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 SS Q A LD++ LD S S+ + + M + + + PD V Sbjct: 38 SSASQLMQCSAALDVLFALDGSHSIGK------GSFERSKYFAIKMCDALAIYPD---RV 88 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G++ FS+ + F L+ + ++ KIK + G ST + LKY + Sbjct: 89 RVGVLQFSSVPQLEFPLDSFFTREEVKEKIKKIVFKGGSTETGLALKYLL--------HK 140 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ ++++ +TDG++ + N+ K+RG +V+A+G+R R E Sbjct: 141 GFPGGRNSSVPQLLIIVTDGKSQGNID-----LPANQLKERGVMVFAVGVRFPRWTELHT 195 Query: 338 ACASPNSFY 346 + P Y Sbjct: 196 LASEPKDQY 204 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 85/220 (38%), Gaps = 25/220 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + D++ +LD S S+ + + ++ ++E ++ PD Sbjct: 25 TEAQRAGCKNVHYDLVFILDASSSVGK------EDFEKVRQWVSNLVETFEIGPDK---T 75 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIK----YLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G+V +S++ F L G + +IK + +G +TN+ L+Y F + Sbjct: 76 RVGVVRYSDRPTTEFDL--GKYKTREEIKEAARKIRYYGGNTNTGDALRYINTYSFSKE- 132 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 D KK+ + +TDG L N A++ G ++A+G+ E Sbjct: 133 --AGGRLSDRTVKKVAILLTDGR-----SQDYVLDPANAARQAGIRIFAVGVGEALKEEL 185 Query: 336 LRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +++ + + + + + Sbjct: 186 DEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCENVLCPN 225 >gi|12052774|emb|CAB66559.1| hypothetical protein [Homo sapiens] Length = 957 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 43/214 (20%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +FD Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLFD---------K 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 49/279 (17%), Positives = 99/279 (35%), Gaps = 34/279 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 F + + DIV P + SIS S YK + ++ T + Sbjct: 395 FAMSVQGANNTQLEDIVSYPSRQ--SISTHSSYKHLESYSGNFLKKIHNEIWTQVSTHAE 452 Query: 163 KVNSQTDARLD-----MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ +D + ++D S S+ +I + + S+ V + P N Sbjct: 453 QMELDKTGCVDTKEADIYFLIDGSSSIR---RKEFEQIQIFMSSV------VDMFPIGPN 503 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ G+V +S++ E F + + ++ + + + T + L + I + Sbjct: 504 NVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKS 563 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R H ++ +TDG K + L N + ++A+GI + Sbjct: 564 ERIH------EVPCYLIVLTDG-----KSNDSVLEPANRLRAEHITIHAVGIGEANKTQL 612 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +N +D+ +I +IV + I DK Sbjct: 613 QQIAGKDERVSFGQN----FDSLKYIKNEIVHR-ICSDK 646 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 103/281 (36%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I V + + + I+ + F ++ Sbjct: 578 DGKSNDSVLEPANRLRAEHITIHAVGIGEANKTQLQQIAGKDERVSFGQNFDSLKYIKNE 637 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ ++ K+ +K++ I Sbjct: 638 IVHRICSD---KGCEDMKADIMFLVDSSGSIG---LTNFEKMKTFMKNLVG------KIE 685 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ +E F L ++ + I +S +T + L + N+ F Sbjct: 686 IGADRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSALTF-VNEYF 744 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDGE + + + ++++G+ Sbjct: 745 DIS------KGGRPQVRKFLILLTDGE-----AQDEVGGPAMALRSKSVTIFSVGVYGAN 793 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 794 RTQLEEISGEGSLVFHVEN----FDHLKTIESKLIFRVCAL 830 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ IN + V+ + VQ G + +SN E Sbjct: 839 LDIVFVLDHSGSIGPREQESM---------INLTIHLVEKADVGRDRVQFGALMYSNNPE 889 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + +K T + L++A N +F M +H + N ++ Sbjct: 890 ILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF----MEEHGSRLKQNVRQ 944 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + E + +G ++A+G+ E + V+ Sbjct: 945 LMIVITDGES---HDRDKLNDTARELRDKGITIFAVGVGRANQDELETMAGKKENTIHVD 1001 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 1002 NFDKLRDIY 1010 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 106/291 (36%), Gaps = 32/291 (10%) Query: 72 NNRKKLKGGDILCRIKNTWNMSFR-NELRDNGFVNDIDDIVR-STSLDIVVVPQNEGYSI 129 +N + IL + N V + + T++D V Q+ Sbjct: 1898 SNGQAASRASILTATMEFSALDISLAVFAHNERVFLDEAFGQCETAVDPVRSWQH----F 1953 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 +Y + + ++ P +++ P T + + ++ +D+ +LD SR++ S Sbjct: 1954 PPPIQYMLHEQLVSWTP--VTNQNKCFPNTCAEEPFFPENSYMDVAFLLDNSRNIAS--- 2008 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SNKIEEFFLLEWGVSH--- 242 + + S+ + + L+++ +++ + E+G + Sbjct: 2009 DDFQAVKGLVSSVIDGFHITSNPSTSESGDRVALLSYSPSENSRRKGSVKTEFGFTTFDS 2068 Query: 243 ---LQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 ++ I L + L++A +F T + K+I+ ++ GE Sbjct: 2069 ESIMKNYIHTSLQQLNGDAAIGLALEWAMEGLFLG--------TPNPRKHKVIIVISAGE 2120 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 N KE +++ AK +G I++ + + R E + P +L++ Sbjct: 2121 NHEEKEFVKTVAL--RAKCQGYIIFVVSLGSTRRDEMEELASYPLDHHLIQ 2169 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 70/206 (33%), Gaps = 25/206 (12%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + + D++ ++D S + I +N ++ + Sbjct: 10 LILWTETLSDQSPGPGPKYADVVFLVDSSDYLG------IKSFPFVRTFLNKIISSL--- 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYN 268 P N + L +S+ + F L + + +L K G S L+ A+ Sbjct: 61 PVEANKYRVALAQYSDALHNEFHLGAFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHR 119 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + I+V + S + + ++ G + ++G++ Sbjct: 120 TYFSAPTNGRDKK----QFPPILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQ 170 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 + L+A A+P + + + Sbjct: 171 KASEED-LKAMATPQFHFNLRTARDL 195 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 98/292 (33%), Gaps = 31/292 (10%) Query: 50 LHSMIDRSLVHAATQIMNEG----NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 L S V A++ + E +K D+ ++ + R + F Sbjct: 141 LASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAMATPQFHFNLRTARDLSLFAP 200 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 D+ I++ D+ + ++ P T T + ++ + + Sbjct: 201 DMAQIIQ----DVTQYREGTTVAVVTDVAPTTPEPLITPAALTTPANNVDETVPFLA--S 254 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q D+ D++ ++D S + + + + V + + ++ GL++ Sbjct: 255 CQKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDNC---MRLGLMS 304 Query: 226 FSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS+ + S Q++I+ LS ++N +++ + + + Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGAAIEH----MRTEGFSESSGSRK 360 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T D + + +G ++A+ ++ + + Sbjct: 361 AQGVPQIAVLVT-----HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQL 407 >gi|296198593|ref|XP_002806760.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XII) chain-like [Callithrix jacchus] Length = 3113 Score = 86.7 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTMLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKXFTKVEDIIEAINNFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S Sbjct: 64 DGPTKEFTLAASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLIFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 DKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIVITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P+ + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLTHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNTHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQSFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 58/436 (13%), Positives = 132/436 (30%), Gaps = 136/436 (31%) Query: 24 AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL 83 A+ LP++ V G+ +++ +++ T L + +D + + ++ + + + Sbjct: 2 ALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELPYDPDLS----------- 50 Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 K + + + N + +I T + S+ ++ ++ + Sbjct: 51 ---KGIVTQAVNDMVETNMEEAVVTEISAGTEI----------RSVKVTAQAEVRMLLME 97 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + + + +S L+++ V+D S SM T ID+ ++ Sbjct: 98 VLGMADKT------VEASAMAGFNK---LEVVFVIDNSGSM------KGTPIDLVKQASE 142 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIE-------------------------------- 231 + + + + GLV F KI Sbjct: 143 ELTDLLIP-DGTTPDTKVGLVPFRGKIRLGEAVDGYAEGCVNADGSLNTGINEEFMDEYN 201 Query: 232 -------------------EFFLLEWGVSHLQRKIKYLSKFGV--STNSTPGLKYAYNQI 270 L S + I + G T + G+K+ N + Sbjct: 202 ALPYYYKRYITLDTCSDIPTVLPLSKNKSTIIAAIGSQTATGAASGTVISEGIKWGRNIL 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGE-------------------------------- 298 Q + ED ++KI++ +TDG+ Sbjct: 262 TPDAPFTQAGSKED--FRKIMIVLTDGDTEDGECGGTYRATYRPNNYWTNAYYGMGVDTA 319 Query: 299 --NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACASP-----NSFYLVE 349 N + L AK G +++I + + ++ AS + ++ Sbjct: 320 HCNDGGVLNADMLSEAQLAKDAGIEIFSIRFGSSDTTDINLMKEIASSKAGTDDHYFDAP 379 Query: 350 NPHSMYDAFSHIGKDI 365 + + + D F IGK + Sbjct: 380 SVYDIPDIFKQIGKQL 395 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 127/349 (36%), Gaps = 53/349 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 682 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 728 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I ++ Sbjct: 729 KKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 788 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R ++ + + T LD++ VLD S S++ + + IN Sbjct: 789 HLKALERKLIFHVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 837 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 838 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKAL 897 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + N K++++ +TDGE+ + + E + +G ++ Sbjct: 898 KHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIF 949 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G+ E + N+ V+N + D F+ + + Sbjct: 950 AVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 994 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 778 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 779 SLVFHVENFDHL 790 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 71/203 (34%), Gaps = 22/203 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKTELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWYD 373 ++ + + ++I ++ D Sbjct: 602 FDALKSIKNEVIREICAEKGCED 624 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 65/185 (35%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S + I + + +P N + L +S++ Sbjct: 30 DVVFLVDSS---DHLGPKSFPFVKTFIS------KMINSLPIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFEGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPTNGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEGASKALQKDGVKIISVGVQEASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|332210152|ref|XP_003254169.1| PREDICTED: collagen alpha-1(XXI) chain [Nomascus leucogenys] Length = 957 Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|297678422|ref|XP_002817079.1| PREDICTED: collagen alpha-1(XXI) chain-like [Pongo abelii] Length = 612 Score = 86.7 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|1846005|gb|AAC51244.1| collagen type XII alpha-1 [Homo sapiens] Length = 3063 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ + +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHDYNVAD--FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|119569135|gb|EAW48750.1| collagen, type XII, alpha 1, isoform CRA_c [Homo sapiens] Length = 3063 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|114608140|ref|XP_001142912.1| PREDICTED: collagen alpha-1(XII) chain isoform 2 [Pan troglodytes] Length = 3063 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 82/216 (37%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 35 AQRAGCKTVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 85 Query: 222 GLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L ++ ++L+ G +TN+ L++ F Q Sbjct: 86 GVVRYSDRPTTAFELGLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSRQ---AG 142 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D +K++ + +TDG L A + G ++A+G+ E Sbjct: 143 GRPGDRAFKQVAILLTDGR-----SQDLVLDAAATAHRAGIRIFAVGVGAALREELEEIA 197 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +DA I + + +R+ + Sbjct: 198 SEPKSAHVFHVSD----FDAIDKI-RGKLRRRLCEN 228 >gi|93141047|ref|NP_004361.3| collagen alpha-1(XII) chain long isoform precursor [Homo sapiens] gi|146345397|sp|Q99715|COCA1_HUMAN RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor gi|55662663|emb|CAH71310.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203512|emb|CAI19898.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203521|emb|CAI19908.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 3063 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 77/217 (35%), Gaps = 27/217 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + S D + D+M ++D S S+ ++ K+ ++++ + I + Sbjct: 613 VHSICTEKGCEDMKADIMFLVDGSSSIGY---ANFEKMKNFMQTL------LAKIQIGAD 663 Query: 218 VVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 Q G+ FS+ +E F L ++ + I + +T + LK+ +G Sbjct: 664 KTQIGVAQFSDYNKEEFPLNKYFTQKEISDAIDRMLLITGNTLTGSALKFIDTYFTQSKG 723 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R KK ++ +TDGE + +G I+ ++G+ + Sbjct: 724 ARHG-------VKKFLILITDGE-----AQDDVREPAVALRDKGVIILSVGVYGANRTQL 771 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + VEN + I + ++ Sbjct: 772 EEISGDGSLVFHVENFEDL----KEIERKLIFLVCTL 804 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 87/256 (33%), Gaps = 27/256 (10%) Query: 125 EGYSISAISRY----KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLD 179 G S+S +S Y KF + ++ + + D+ ++D Sbjct: 389 PGQSVSVMSSYADLGNYTTKFMKKLQNEIWAQVSTVAEQMELDKTGCVDTKEADIYFLID 448 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLE 237 S S+ D +I + ++ M + V+ G+V +SN +E + Sbjct: 449 GSSSIR---DKQFLQIKEFMLAVTDMFNI------GPDKVRVGVVQYSNDRAVEFDIDVY 499 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S L++ I + + T + L F + M++ T + ++ +TDG Sbjct: 500 HDNSVLRKAIYNIKQLKGGTLTGKALD------FILPIMKKGRKTRASQVPCYLLVLTDG 553 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDA 357 K + + L + +AIGI E L+ +N ++ Sbjct: 554 -----KSEDEVLGPAERIRAEQISTHAIGIGKAHKKELLQIAGEEERVNFGQNTDALKSI 608 Query: 358 FSHIGKDIVTKRIWYD 373 I I T++ D Sbjct: 609 KKEIVHSICTEKGCED 624 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 53/326 (16%), Positives = 108/326 (33%), Gaps = 31/326 (9%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR L+ + G+ K + + F + D Sbjct: 682 LNKYFTQKEISDAIDRMLLITGNTL----TGSALKFIDTYFTQSKGARHGVKKFLILITD 737 Query: 101 NGFVNDIDDIVRSTS------LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 +D+ + + L + V N + F R + Sbjct: 738 GEAQDDVREPAVALRDKGVIILSVGVYGANRTQLEEISGDGSLVFHVENFEDLKEIERKL 797 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + LD++ VLD S S+ S S+ IN + VK Sbjct: 798 IFLVCTLHDCKRIQL--LDVVFVLDHSGSINSEQQESM---------INLTIHLVKKSDV 846 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + VQ G + +S+ + F L S + ++ G ST + L + Sbjct: 847 GRDRVQFGALRYSDDPDILFYLNTYSNRSAIIEHLRRRRDTGGSTFTAKALGRS-----A 901 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 QH + N K++++ +TDGE+ + + + +G + A+G+ Sbjct: 902 TLFEEQHGSRIKQNVKQMLIIITDGESHDRHLLNDTAL---KLRNKGITIIAVGVGKANQ 958 Query: 333 HEFLRACASPNSFYLVENPHSMYDAF 358 E + + V++ + D + Sbjct: 959 EELEAMAGNKENTIHVKDFDKLKDVY 984 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 80/238 (33%), Gaps = 32/238 (13%) Query: 128 SISAIS-RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + I + + T P + + ++ +D+ ++D SR + S Sbjct: 1918 TFQVIPVPPDGEYEPLDRLQRCTLCYDKCFPNACAKDIFLTEESYMDIAFLIDNSRHVAS 1977 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPD---VNNVVQSGLVTFSN-----------KIEE 232 + ++ ++ + D N+ + L+++S K E Sbjct: 1978 ------DEFKAVKDMVSLTIDNFNIASDPLISNSGDRIALLSYSASGSSRRNTSTVKTEF 2031 Query: 233 FFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F + + ++ I L + L + +F T + K+I Sbjct: 2032 DFTMYGNQALMKNYIQNSLQQLNGEATIGHALLWTMENLFS--------ETPNIRRHKVI 2083 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 + ++ GEN +E + + AK +G +++ I + + + P +L++ Sbjct: 2084 LVVSAGENHERREFVKKMAL--RAKCQGYVIFVISLGPAPRDDLEELASHPLDHHLIQ 2139 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 59/172 (34%), Gaps = 23/172 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVT 225 Q D+ D++ ++D S + + LE++ DV N + GL+ Sbjct: 230 QKDSLADLVFLVDESLGSRQNL-----------RHLQTFLEDISSSMDVKENCTRLGLMR 278 Query: 226 FSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S + S Q++++ LS ++ + Q+ + Sbjct: 279 YSTSTKTVSSLKSSRSQSEFQQQVQKLSLQAGRAHTGAAI----EQLRREGFSEAGGSRR 334 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T S D + + + G V+A+ I+ S + Sbjct: 335 AQGVPQIAVLVT-----SRPSDDEVRDAALDLRLEGVTVFAVNIQGANSTQL 381 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 83/200 (41%), Gaps = 21/200 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D SRS+ + + ++ ++E + + P+ + G+V +++ +++ Sbjct: 41 DLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVGMVNYASTVKQ 91 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L VS + ++ + T + +++A + F G + + + K+ Sbjct: 92 EFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAF---GDAEGGRSRSPDISKV 148 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + +D + A+ G ++AIG+ + LR AS V+ Sbjct: 149 VIVVTDGRPQDSVQDVSAR-----ARASGVELFAIGVGSVD-KATLRQIASEPQDEHVDY 202 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 S Y + + Sbjct: 203 VES-YSVIEKLSRKFQEAFC 221 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 84/220 (38%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K I+ +++ + + + Sbjct: 261 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFISQIVDTLDVSD---KL 311 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S ST + LKY N G Sbjct: 312 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKSTMTGAALKYLIDNSFTVSSG 371 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 372 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 419 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 420 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 455 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 86.4 bits (212), Expect = 6e-15, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 127/349 (36%), Gaps = 53/349 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 682 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 728 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I ++ Sbjct: 729 KKFLILITDGVAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFD 788 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R ++ + + T LD++ VLD S S++ + + IN Sbjct: 789 HLKALERKLIFHVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 837 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 838 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKAL 897 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + N K++++ +TDGE+ + + E + +G ++ Sbjct: 898 KHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIF 949 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+G+ E + N+ V+N + D F+ + + Sbjct: 950 AVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 994 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 778 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 779 SLVFHVENFDHL 790 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 71/203 (34%), Gaps = 22/203 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKTELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWYD 373 ++ + + ++I ++ D Sbjct: 602 FDALKSIKNEVIREICAEKGCED 624 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 65/185 (35%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S + I + + +P N + L +S++ Sbjct: 30 DVVFLVDSS---DHLGPKSFPFVKTFIS------KMINSLPIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFEGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPTNGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEGASKALQKDGVKIISVGVQEASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 83/203 (40%), Gaps = 20/203 (9%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + K + D++ ++D SRS+ ++ + ++ ++E + + N + Sbjct: 26 AQKGPNCRTRPTDILFIIDSSRSVRP------SEFEQVKVFLSQVVESLDV---GANATR 76 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +++ ++ F L+ + L + +K + T + ++YA N F + Sbjct: 77 VGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAINIAFTEPEGAR 136 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + K+ + +TDG +D + A++ G +YAIG+ + + + Sbjct: 137 LKSPG---INKVAIIVTDGRPQDAVKDISAR-----ARESGLEIYAIGVGRVDKNTLRQI 188 Query: 339 CASP-NSFYLVENPHSMYDAFSH 360 + P + +S+ + S Sbjct: 189 ASEPLDEHVDYVESYSLIEKLSK 211 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 64/399 (16%), Positives = 135/399 (33%), Gaps = 58/399 (14%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH-AATQ 64 + + N+ K ++ T P + + + +S + I+ + + Sbjct: 77 VGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAINIAFTEPEGAR 136 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + + G + G +K+ + + L + + + ++T I P + Sbjct: 137 LKSPGINKVAIIVTDGRPQDAVKDISARARESGLE--IYAIGVGRVDKNTLRQIASEPLD 194 Query: 125 E------GYSISAISRYKIPLKFCTFIPWYTNSRH----IVMPITSSVKVNSQTDARL-- 172 E YS+ K FC + H I + S + L Sbjct: 195 EHVDYVESYSLIEKLSKKFQEAFCVTADLCSTGDHDCEQICVSTPGSYTCACRDGFTLNE 254 Query: 173 -------------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D++ ++D S+S+ ++ + IN ++E + + +V Sbjct: 255 DGKTCNACGASAVDLVFLIDGSKSVRP------ENFELVKQFINQIVESMDVGDHRAHV- 307 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV +S+ + + F L S ++ +K +S T + L+Y + F + Sbjct: 308 --GLVQYSSSVRQEFPLGRYTSKKDIKSAVKKMSYMEKGTMTGQALQYLIDNSFAIS--- 362 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K+ + TDG + D AK+ G ++A+G+ E Sbjct: 363 ---SGGRPAVPKVGIVFTDGRSQDYINDAALR-----AKELGYKMFAVGVGNAVEEELRM 414 Query: 338 ACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P +SFY + + A IGK + K + Sbjct: 415 IASEPVVEHSFYTAD-----FKAMKEIGKKLQMKICVEE 448 >gi|211546|gb|AAA48695.1| cartilage matrix protein [Gallus gallus] Length = 416 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 86/230 (37%), Gaps = 28/230 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + + LD++ ++D S+S+ ++ K IN ++E Sbjct: 170 CKEGFTLNNDGKTCSACSGGSGSALDLVFLIDGSKSVRP------ENFELVKKFINQIVE 223 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +++ Q GLV +S+ + + F L ++ +K ++ T + LKY Sbjct: 224 SLEV---SEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKKMAYMEKGTMTGQALKY 280 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + F + N K+ + TDG + D +AK G ++A+ Sbjct: 281 LVDSSFSI------ANGARPGVPKVGIVFTDGRSQDYITDA-----AKKAKDLGFRMFAV 329 Query: 326 GIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G+ E + P ++ + + S+IGK + K + Sbjct: 330 GVGNAVEDELREIASEPVAEHYFYTAD----FRTISNIGKKLQMKICVEE 375 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 59/147 (40%), Gaps = 15/147 (10%) Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G++ +++ ++ F L+ + + + ++ + T + +++A ++ F + Sbjct: 1 VGVINYASAVKNEFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFS---DTE 57 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 N K+ + +TDG +D + A++ G ++AIG+ + H + Sbjct: 58 GARLRSPNINKVAIVVTDGRPQDGVQDVSAR-----ARQAGIEIFAIGVGRVDMHTLRQI 112 Query: 339 CASP--NSFYLVEN---PHSMYDAFSH 360 + P + VE+ + F Sbjct: 113 ASEPLDDHVDYVESYSVIEKLTHKFQE 139 >gi|296484270|gb|DAA26385.1| collagen, type XII, alpha 1 [Bos taurus] Length = 3115 Score = 86.4 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 98/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLSASTTETLLTELIPEIEYVVTITSYDEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKGGNTMTGEAIDYLIKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHKDK 1259 Query: 243 LQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +D+ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI-HAYNVAD-FDSLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTNNLC 1376 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 82/216 (37%), Gaps = 27/216 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +++ D++ ++D SRS+ ++ + + +++ + + Sbjct: 18 TVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRP------SEFEQVKVFLAKVIDGLSVG 71 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI 270 PD + G+V ++++++ L+ + + + + T + +++A N Sbjct: 72 PDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + + + K+ + +TDG D A++ G ++AIG+ + Sbjct: 129 FSEAEGGR----KSPDISKVAIIVTDGRPQDNIRD-----IAARAREAGIEIFAIGVGRV 179 Query: 331 RSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 LR AS + VE+ + F Sbjct: 180 D-MTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 49/279 (17%), Positives = 99/279 (35%), Gaps = 34/279 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 F + + DIV P + SIS S YK + ++ T + Sbjct: 395 FAMSVQGANNTQLEDIVSYPSRQ--SISTHSSYKHLESYSGNFLKKIHNEIWTQVSTHAE 452 Query: 163 KVNSQTDARLD-----MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ +D + ++D S S+ +I + + S+ V + P N Sbjct: 453 QMELDKTGCVDTKEADIYFLIDGSSSIR---RKEFEQIQIFMSSV------VDMFPIGPN 503 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ G+V +S++ E F + + ++ + + + T + L + I + Sbjct: 504 NVRVGVVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKS 563 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R H ++ +TDG K + L N + ++A+GI + Sbjct: 564 ERIH------EVPCYLIVLTDG-----KSNDSVLEPANRLRAEHITIHAVGIGEANKTQL 612 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +N +D+ +I +IV + I DK Sbjct: 613 QQIAGKDERVSFGQN----FDSLKYIKNEIVHR-ICSDK 646 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 46/281 (16%), Positives = 103/281 (36%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I V + + + I+ + F ++ Sbjct: 578 DGKSNDSVLEPANRLRAEHITIHAVGIGEANKTQLQQIAGKDERVSFGQNFDSLKYIKNE 637 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ ++ K+ +K++ I Sbjct: 638 IVHRICSD---KGCEDMKADIMFLVDSSGSIG---LTNFEKMKTFMKNLVG------KIE 685 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ +E F L ++ + I +S +T + L + N+ F Sbjct: 686 IGADRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSALTF-VNEYF 744 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDGE + + + ++++G+ Sbjct: 745 DIS------KGGRPQVRKFLILLTDGE-----AQDEVGGPAMALRSKSVTIFSVGVYGAN 793 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 794 RTQLEEISGEGSLVFHVEN----FDHLKTIESKLIFRVCAL 830 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 41/189 (21%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ IN + V+ + VQ G + +SN E Sbjct: 839 LDIVFVLDHSGSIGPREQESM---------INLTIHLVEKADVGRDRVQFGALMYSNNPE 889 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + +K T + L++A N +F M +H + N ++ Sbjct: 890 ILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHA-NILF----MEEHGSRLKQNVRQ 944 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + E + +G ++A+G+ E + V+ Sbjct: 945 LMIVITDGES---HDRDKLNDTARELRDKGITIFAVGVGRANQDELETMAGKKENTIHVD 1001 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 1002 NFDKLRDIY 1010 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 70/206 (33%), Gaps = 25/206 (12%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + + D++ ++D S + I +N ++ + Sbjct: 10 LILWTETLSDQSPGPGPKYADVVFLVDSSDYLG------IKSFPFVRTFLNKIISSL--- 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYN 268 P N + L +S+ + F L + + +L K G S L+ A+ Sbjct: 61 PVEANKYRVALAQYSDALHNEFHLGAFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHR 119 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + I+V + S + + ++ G + ++G++ Sbjct: 120 TYFSAPTNGRDKK----QFPPILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQ 170 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 + L+A A+P + + + Sbjct: 171 KASEED-LKAMATPQFHFNLRTARDL 195 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 98/292 (33%), Gaps = 31/292 (10%) Query: 50 LHSMIDRSLVHAATQIMNEG----NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 L S V A++ + E +K D+ ++ + R + F Sbjct: 141 LASAESEDDVEEASKALREDGVKIISVGVQKASEEDLKAMATPQFHFNLRTARDLSLFAP 200 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 D+ I++ D+ + ++ P T T + ++ + + Sbjct: 201 DMAQIIQ----DVTQYREGTTVAVVTDVAPTTPEPLITPAALTTPANNVDETVPFLA--S 254 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q D+ D++ ++D S + + + + V + + ++ GL++ Sbjct: 255 CQKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDNC---MRLGLMS 304 Query: 226 FSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS+ + S Q++I+ LS ++N +++ + + + Sbjct: 305 FSDKARTVSSLKSSTSQSDFQQQIQKLSLRTGASNVGAAIEH----MRTEGFSESSGSRK 360 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T D + + +G ++A+ ++ + + Sbjct: 361 AQGVPQIAVLVT-----HRASDDAVREAALDLRLQGVTMFAMSVQGANNTQL 407 >gi|189066649|dbj|BAG36196.1| unnamed protein product [Homo sapiens] Length = 957 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|119624855|gb|EAX04450.1| collagen, type XXI, alpha 1, isoform CRA_c [Homo sapiens] Length = 552 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|119624854|gb|EAX04449.1| collagen, type XXI, alpha 1, isoform CRA_b [Homo sapiens] Length = 567 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|114607957|ref|XP_001157649.1| PREDICTED: collagen alpha-1(XXI) chain isoform 3 [Pan troglodytes] Length = 957 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|114607959|ref|XP_001157475.1| PREDICTED: collagen, type XXI, alpha 1 isoform 1 [Pan troglodytes] gi|114607961|ref|XP_518554.2| PREDICTED: collagen, type XXI, alpha 1 isoform 4 [Pan troglodytes] gi|114607963|ref|XP_001157591.1| PREDICTED: collagen, type XXI, alpha 1 isoform 2 [Pan troglodytes] Length = 954 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|18780273|ref|NP_110447.2| collagen alpha-1(XXI) chain precursor [Homo sapiens] gi|74752071|sp|Q96P44|COLA1_HUMAN RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|15593270|gb|AAL02227.1|AF414088_1 collagen XXI [Homo sapiens] gi|19310967|gb|AAL86699.1|AF438327_1 alpha 1 type XXI collagen precursor [Homo sapiens] gi|55665071|emb|CAH73913.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202573|emb|CAI22496.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202937|emb|CAI22395.1| collagen type XXI alpha 1 [Homo sapiens] gi|116496597|gb|AAI26109.1| Collagen, type XXI, alpha 1 [Homo sapiens] gi|215434893|gb|ACJ66843.1| alpha 1 type XXI collagen precursor [Homo sapiens] Length = 957 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|17974510|gb|AAL50033.1|AF330693_1 alpha 1 chain-like collagen COLA1L precursor [Homo sapiens] gi|55665070|emb|CAH73912.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202574|emb|CAI22497.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202938|emb|CAI22396.1| collagen type XXI alpha 1 [Homo sapiens] Length = 954 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 86.0 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 85/218 (38%), Gaps = 21/218 (9%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + D++ +LD S S+ + + ++ ++E ++ PD Sbjct: 25 TEAQRAGCKNVHYDLVFILDASSSVGK------EDFEKVRQWVSNLVETFEIGPDK---T 75 Query: 220 QSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G+V +S++ F L ++ ++ + + +G +TN+ L+Y F + Sbjct: 76 RVGVVRYSDRPTTEFDLGKYKTCEEIKEAARKIRYYGGNTNTGDALRYINTYSFSKE--- 132 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 D KK+ + +TDG L N A++ G ++A+G+ E Sbjct: 133 AGGRLSDRTVKKVAILLTDGR-----SQDYVLDPANAARQAGIRIFAVGVGEALKEELDE 187 Query: 338 ACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +++ + + + + + Sbjct: 188 IASEPKSAHVFHVSDYNAIDKIRGKLRRRLCENVLCPN 225 >gi|219520386|gb|AAI43866.1| COL21A1 protein [Homo sapiens] Length = 957 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ L+A A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELKAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|224048537|ref|XP_002190467.1| PREDICTED: similar to collagen, type XII, alpha 1 [Taeniopygia guttata] Length = 3122 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 57/338 (16%), Positives = 112/338 (33%), Gaps = 41/338 (12%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCR--IKNTWNM--------SFR 95 MKT L S + + + AAT + + N I+ ++ +W Sbjct: 1 MKTALCSAV--AALCAATLLSSIEAEVNPPSDLNFTIIDEHNVQMSWKRPPDAIVGYRIT 58 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSR 152 ++G + +T I + + Y +S I+ Y L + T Sbjct: 59 VVPTNDGPTKEFTLSPSTTQTVISDLIPDVEYIVS-IASYDDREESLPVFGQLTIQTGGP 117 Query: 153 HIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 I ++ + + + D++ ++D S S+ I ++ M+ V Sbjct: 118 GIPEEKKVEAQLQRCSISAMTDLVFLVDGSWSVGRNNFGYI---------LDFMVALVSA 168 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + G+V +S+ F L + S L IK + G +T + + Y Sbjct: 169 FDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLIDAIKRIPYKGGNTMTGEAIDYLVQN 228 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F + K+ + +TDG K E + G V+++GI+ Sbjct: 229 TFTES------AGARKGFPKVAIVITDG-----KAQDDVEIPARELRNIGVEVFSLGIKA 277 Query: 330 IRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 + E + P + V N + D + I + Sbjct: 278 ADAKELKLIASQPSLKHVFNVANFDGIVDIQNEIILQV 315 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 51/268 (19%), Positives = 106/268 (39%), Gaps = 39/268 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 ++T+L++ + + Y I+ + + P + T + ++ + + + S V+ + Sbjct: 383 PQTTALNVKDLSPDTEYQINVYAMKGLTPSEPITIME---KTQQVKVQVECSRGVDVKA- 438 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ ++D S S+ ++ K+ ++ + VK VQ LV +S Sbjct: 439 ---DVVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPRKVQISLVQYSRD 486 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L V + + I G STN+ + Y ++F N Sbjct: 487 PHMEFSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTS------KGSRPNV 540 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 ++++ +TDG K + + ++A+G++ E L A ASP Sbjct: 541 PRVMILITDG-----KSSDAFKEPAIKLRDADVEIFAVGVKDAVRTE-LEAIASPPADTH 594 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y VE+ +DAF I ++ T+ + Sbjct: 595 VYTVED----FDAFQRISFEL-TQSVCL 617 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 46/272 (16%), Positives = 94/272 (34%), Gaps = 34/272 (12%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMP 157 D+G + ++ +++ + + Y ++ ++ T S P Sbjct: 1123 EDDGSLGELIVGPYDSTVVLEELRAGTTYKVNVFGMFEGGESNPLVGQEMTTLSDTTTEP 1182 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S + +T A D+++++D S S+ I ++E + PD Sbjct: 1183 FLSR-GLECRTRAEADIVLLVDGSWSIG------RPNFKTIRNFIARIVEVFDIGPD--- 1232 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQ------RKIKYLSKFGVSTNSTPGLKYAYNQIF 271 VQ GL +S EW ++ + + L G +T + L F Sbjct: 1233 RVQIGLAQYSGDPRT----EWNLNAYRTKQSLLEAVANLPYKGGNTLTGMALD------F 1282 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ + +KI V +TDG K + + + G +YAIGI+ Sbjct: 1283 ILRNNFKPVAGLRPRARKIGVLITDG-----KSQDDVVAPSRKLRDEGVELYAIGIKNAD 1337 Query: 332 SHEFLRACASPN--SFYLVENPHSMYDAFSHI 361 +E + P+ Y V + + ++ Sbjct: 1338 ENELKQIATDPDDIHAYNVADFSFLATIVDNV 1369 >gi|147899676|ref|NP_001087858.1| collagen alpha-1(XXI) chain precursor [Xenopus laevis] gi|82234134|sp|Q641F3|COLA1_XENLA RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|51950065|gb|AAH82384.1| MGC81791 protein [Xenopus laevis] Length = 957 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 84/214 (39%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + P Q G+V Sbjct: 29 SSCRTAPNDLVFILDGSWSVGP------ENFEILKKWVVNITSNFNIGP---KFTQVGVV 79 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L + L R+ + + G +T + +++A + +F Sbjct: 80 QYSDYPILEIPLGSYESIDDLSRRTQSIQYLGGNTQTGNAIQFAIDNLFARSL------- 132 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI + +TDG K + EA+K ++AIG+ LRA A Sbjct: 133 --RPLTKIAIVLTDG-----KSQDDVKHIAEEARKNKITLFAIGVGSEIEESELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++I+ +++ + Sbjct: 186 PSSTYVFYVED----YIAISRI-REIMKQKLCEE 214 >gi|291396486|ref|XP_002714579.1| PREDICTED: collagen, type XII, alpha 1 [Oryctolagus cuniculus] Length = 3117 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L I + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSIRDLSADTEYQISVSAMKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 78/203 (38%), Gaps = 30/203 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + I I ++ V + G+V +S+ Sbjct: 140 DLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKK 289 F L + L IK + G +T + + Y N + G R + K Sbjct: 191 EFNLNQYYQRDELLAAIKRIPYKGGNTMTGDAIDYLVKNTFLESAGARVG-------FPK 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYL 347 + + +TDG K + E + G V+++GI+ + E + ++P N + Sbjct: 244 VAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 299 VAN----FDAIVDIQNEIISQVC 317 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 51/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E + P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFKTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +D+ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FDSLSK 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|311260227|ref|XP_003128387.1| PREDICTED: collagen alpha-1(XXI) chain-like [Sus scrofa] Length = 957 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L S + + + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFHVED----YIAISKI-REVMKQKLCEE 214 >gi|293339632|gb|ADE44108.1| collagen type XXI alpha 1 [Sus scrofa] Length = 895 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L S + + + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFHVED----YIAISKI-REVMKQKLCEE 214 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 67/191 (35%), Gaps = 24/191 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ + + M + IN ++ + P + G+V +S + Sbjct: 1 DIGFVLDASGSVRAN------RFKMCLNFINKLVNSFHIGPHN---TRIGIVRYSTRPSG 51 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + ++ + G T + + YA ++ R +K+ Sbjct: 52 IFRFTSYRNKHSTKHRVNRIRYTGGWTRTGAAINYARRYLYQHNRRRG--------VRKV 103 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ MTDG K + K+ G V+AIGI L A+ + L Sbjct: 104 LIVMTDG-----KSQDSVVGASRSVKRMGIEVFAIGIGRGYRRSELNQMATDRNHVLTAR 158 Query: 351 PHSMYDAFSHI 361 ++ I Sbjct: 159 FRDLHKIIGKI 169 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 86.0 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 57/399 (14%), Positives = 127/399 (31%), Gaps = 72/399 (18%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++N G + +L I LP++ ++ +I V+++ + + D ++ A + Sbjct: 29 LKNQPKQRSGAVIVLLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQ 88 Query: 68 EGNGNNRKKLKGGDILCR-----------IKNTWNMSFRNELRDNG---FVNDIDDIVRS 113 G+ N + +S R L + ++D D+ + Sbjct: 89 TGDKNAALAAARDAAERNPVAGKVVPINMSDLEFGISLRESLDEGYSFQPLSDDDEFGNA 148 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 L + + + S PL + + + S++ D Sbjct: 149 VRLTTLSLSNSPQPVFS-------PLFPTMGTNLEIRPQRVAVSTQSTM----------D 191 Query: 174 MMIVLDVSRSM----ESFFDSSITKIDM------------------AIKSINAMLEEVKL 211 + +V+D S SM + D + + S+NA + Sbjct: 192 VALVIDRSGSMAYANDEAPDPYVNPAAAPPGWTYGDPVPPNSRWLDLVASVNAFNGFLAD 251 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQ 269 P + L T+S+ L + + ++ +S G T+ GL++ Sbjct: 252 SP---QYEKLCLATYSDNASRDCDLTHTYAEISNQLDAISYQFNGGGTSVGYGLEHGLAV 308 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + D R+ +++V MTDG N + + + + G ++ I Sbjct: 309 LTDATHARKFAV-------RVMVLMTDGHHNTGKSPESMTYHL----QNHGVTLFTITFS 357 Query: 329 VIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDI 365 + A+ + + + +AF I K + Sbjct: 358 DDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKL 396 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 66/449 (14%), Positives = 138/449 (30%), Gaps = 108/449 (24%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G + I+ A+ ++ +LG+ I+ + F+ + ++ D + + ++ N Sbjct: 23 ADRRGSVAIMVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLA-----VSRENF- 76 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + ++ N + + P +G++++ Sbjct: 77 ----------QVSTNDQLKALAQSYFDANFPPGTMGATTSLSVATSGTPPTVQGFTVTVT 126 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVS----RSMES 186 + + + + S +QT ++++IVLD S S E Sbjct: 127 ATLPLVFAPLVETLGGPTIGSVGISKASGAVFTTQTSNQGGMELVIVLDNSASMKGSQED 186 Query: 187 FFDSSITKIDMA-----------------IKSINAMLEEVKLIPDVNNVVQSGL------ 223 +DM ++N + +K D+ V G+ Sbjct: 187 LRGGVKALLDMLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCPMA 246 Query: 224 -----------------------------VTF--SNKIEEFFLLEWGVSHLQRKIKYLSK 252 + + ++ + L + IK Sbjct: 247 TVNGKLNGSRLSNAPPKTFKFDSTTDGVEIQYCGASTLGTSSALSPNRGDADKAIKSYVA 306 Query: 253 FGVSTNSTPGLKYAYNQIF----------DMQGMRQHCNTEDANYKKIIVFMTDGEN--- 299 G T GL + + + D G + + KK++V MTDG N Sbjct: 307 -GGDTLIGEGLVWGWRMLTPSWRGLWNTKDQPGASLPLDYDLPYMKKVLVLMTDGVNHIA 365 Query: 300 ----------------LSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACAS- 341 ++K D + CN AKK ++Y I + + CAS Sbjct: 366 GRNYTAYYSDPYQTVADASKADADLMTICNAAKKDHNVVLYTITYGSDTDEQQMSDCASD 425 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 P+ Y P + AF+ +G D+ T ++ Sbjct: 426 PSKHYHAALPQDLAKAFTQVGTDLTTMKL 454 >gi|194216197|ref|XP_001914777.1| PREDICTED: collagen, type XII, alpha 1 [Equus caballus] Length = 3120 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTMLSVRDLSADTEYQISVSAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 78/203 (38%), Gaps = 30/203 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + I I ++ V + G+V +S+ Sbjct: 140 DLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGEEKTRVGVVQYSSDPRT 190 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKK 289 F L + L IK + G +T + + Y N + G R + K Sbjct: 191 EFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLIKNTFMESAGARVG-------FPK 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYL 347 + + +TDG K + E + G V+++GI+ + E + ++P N + Sbjct: 244 VAIIITDG-----KSQDEVEIPARELRSIGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 299 VAN----FDAIVDIQNEIISQVC 317 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 49/250 (19%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GV 240 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHKDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 QSLLEAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++A+GI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAVGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 82/216 (37%), Gaps = 27/216 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +++ D++ ++D SRS+ ++ + + +++ + + Sbjct: 18 TVDLRQAAAMAAGLCNTKPTDVVFIVDSSRSVRP------SEFEQVKVFLAKVIDGLSVG 71 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI 270 PD + G+V ++++++ L+ + + + + T + +++A N Sbjct: 72 PDA---TRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + + + K+ + +TDG D A++ G ++AIG+ + Sbjct: 129 FSEAEGGR----KSPDISKVAIIVTDGRPQDNIRD-----IAARAREAGIEIFAIGVGRV 179 Query: 331 RSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 LR AS + VE+ + F Sbjct: 180 D-MTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 86.0 bits (211), Expect = 1e-14, Method: Composition-based stats. Identities = 57/336 (16%), Positives = 126/336 (37%), Gaps = 29/336 (8%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +M + + + + + + Sbjct: 682 LNTYFTQQEISDAIDRMSLIDKGTLMGKALNFVDQYFTHSKGARFGAKKFLILITDGVAQ 741 Query: 101 NGFVNDIDDIVRSTSLDIVVVP-----QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 + V D I+R + I V +++ IS ++ + + +V Sbjct: 742 DD-VRDPARILRGKDVTIFSVGVYGADRSQLEEISGDGSLVFYVENFDHL--QALEKKLV 798 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + +LD++ VLD S S+ + IN + VK Sbjct: 799 FRVCALHDCKRIK--QLDVVFVLDHSGSIAEQSQDHM---------INLTMHLVKKADVG 847 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + VQ G + +S+ E F L S + ++ G +T + LK+A N +F Sbjct: 848 RDRVQFGALKYSDNPEILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALFT- 905 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +H + + N K++++ +TDGE+ E + + + +G ++A+G+ Sbjct: 906 ---EEHGSRINQNVKQMLIVITDGESDDRVELNDTAA---KLRDKGITIFAVGVGKADQK 959 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E + N+ V+N + D ++ + + + T+ Sbjct: 960 ELEGMAGNKNNTIYVDNFDKLKDIYAPVQESMCTES 995 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 51/349 (14%), Positives = 117/349 (33%), Gaps = 40/349 (11%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 +++ S ++ + + + A + + G+ IL RIKN Sbjct: 483 VVQFSDKMRVEFSITDYSNDIDLRKAILNIQQLTGDTHTGEALDFILPRIKNGIKERMSQ 542 Query: 97 ---------ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFI 145 + + V + VR+ + + + + + + I+ + + F Sbjct: 543 VPCYLIVLTDGKSQYSVVEPAKRVRAEQITVHAIGIGEANKKELQEIAGKEERVSFGQNF 602 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + ++ V + D + D+M ++D S S+ + + K+ + +K++ Sbjct: 603 DALKSIKNEV--VHGICTEKGCEDMKADIMFLVDSSGSIG---NENFGKMKIFMKNL--- 654 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGL 263 + I + Q G+V FS+ EE F L + + I +S T L Sbjct: 655 ---LTKIQIGADKTQIGVVQFSDDPEEEFQLNTYFTQQEISDAIDRMSLIDKGTLMGKAL 711 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + +G R KK ++ +TDG + + ++ Sbjct: 712 NFVDQYFTHSKGARFGA-------KKFLILITDG-----VAQDDVRDPARILRGKDVTIF 759 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 ++G+ + + + VEN +D + K +V + Sbjct: 760 SVGVYGADRSQLEEISGDGSLVFYVEN----FDHLQALEKKLVFRVCAL 804 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 70/202 (34%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ I K MLE ++ + V+ G+V FS+K+ Sbjct: 442 DIHFLIDGSTSIREEQFEQIKKF---------MLEVTEMFSIGPDKVRVGVVQFSDKMRV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T++ L F + ++ + Sbjct: 493 EFSITDYSNDIDLRKAILNIQQLTGDTHTGEALD------FILPRIKNGIKERMSQVPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG++ + + V+AIGI E +N Sbjct: 547 LIVLTDGKSQY-----SVVEPAKRVRAEQITVHAIGIGEANKKELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +DA I ++V Sbjct: 602 ----FDALKSIKNEVVHGICTE 619 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P + + L +S+K+ Sbjct: 30 DVVFLVDSSDHLG------TKSFPFVKTFINKMINSL---PIEADKYRVALAQYSDKLHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G + + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFKGRNPMLNHLKKNFQFLGGSLQIGKALQEAHRTYFSAPTNGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + ++ G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDDVEEASKALQEDGVKIISVGVQEASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TVRDL 195 Score = 42.8 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 64/175 (36%), Gaps = 23/175 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSG 222 ++ Q D+ D++ ++D S + + A LE + + N ++ G Sbjct: 227 ISCQKDSLADLLFLVDESLGTRQNL-----------RHLQAFLENITSSMDVKENCMRLG 275 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+++SN + L+ S Q++I+ LS +N+ + Q+ H Sbjct: 276 LMSYSNSAKTISFLKSSTSQSEFQQQIQKLSIQVGKSNAGAAI----EQMRREGFSESHG 331 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + +I V +T D + + V+A+ I+ + + Sbjct: 332 SRRAQGVPQIAVLVT-----HRPSDDEVHDAALNLRLEDVTVFALSIQGANNTQL 381 >gi|293361231|ref|XP_243912.5| PREDICTED: collagen alpha-1(XII) chain [Rattus norvegicus] Length = 3119 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 102/309 (33%), Gaps = 45/309 (14%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + R + + T S L N + + +V TS D V ++ Sbjct: 49 ADPIVGYRITVDSTTDGPTKEFTLAASTTETLLSN-LIPETQYVVTITSYDEVE--ESVP 105 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 P K P T + + + D++ ++D S S+ Sbjct: 106 VIGQLTIETGDPAKPGEKKPGKTEIQKCSVSAWT------------DLVFLVDGSWSVGR 153 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQ 244 ++ ++ V + G+V +S+ F L + L Sbjct: 154 NN---------FKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLL 204 Query: 245 RKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 IK + G +T + + Y N + G R + K+ + +TDG K Sbjct: 205 AAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGSRAG-------FPKVAIIITDG-----K 252 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHI 361 + E + G V+++GI+ + E + ++P N + V N +DA I Sbjct: 253 SQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVAN----FDAIVDI 308 Query: 362 GKDIVTKRI 370 +I+++ Sbjct: 309 QNEIISQVC 317 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 105/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + + T + + + V + + Sbjct: 384 PQTTTLNVRDLSADTEYQISVFAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 84/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S ++ ++ +A D+++++D S Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMECLTRAEA--DIVLLVDGS 1207 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1208 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWHLNAHRD 1258 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI-HAYNVAD-FESLS 1365 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1366 KIVDDLTINLC 1376 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 77/210 (36%), Gaps = 25/210 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +DM+ VLD S S+ + + ++++ ++ + G+V Sbjct: 169 SECKIPGMDMVFVLDGSGSVG------ADNFETVKDFVVSVVDGFEIGQS---RTRIGVV 219 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S++++ F L + +Q I ++ T + L+Y + F + Sbjct: 220 QYSDEVQNEFNLTEYGNKADVQSAISNITYLQGRTYTGAALRYMTDVSFSEE---AGARP 276 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+ + +TDGE + A + G V+AIGI E + P Sbjct: 277 PYQAIPKVGIVVTDGE-----ATDNVQGPASSAHEAGVNVFAIGIGGYDVRELRQIATDP 331 Query: 343 N--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + V+N + A +I + + Sbjct: 332 DATHVFAVDN----FAATDYIKDALEGRTC 357 >gi|330417948|ref|NP_001193426.1| collagen alpha-1(XII) chain [Bos taurus] Length = 3065 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 98/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T Sbjct: 64 DGPTKEFTLSASTTETLLTELIPEIEYVVTITSYDEVEESVPVIGQLTIQTGGPTKPGEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKGGNTMTGEAIDYLIKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 82/250 (32%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHKDK 1259 Query: 243 LQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +D+ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI-HAYNVAD-FDSLSR 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTNNLC 1376 >gi|301613500|ref|XP_002936246.1| PREDICTED: collagen alpha-1(XII) chain-like [Xenopus (Silurana) tropicalis] Length = 3127 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 48/277 (17%), Positives = 108/277 (38%), Gaps = 31/277 (11%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + + ++ ++TS+++ + + Y I+ + + T + I + Sbjct: 371 MLAGSKQHSLNLGPQTTSINVKDLSPDTEYQINLYAMKGLTASEPISTLEKTQAVKIKVE 430 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + V + + D+++++D S S+ ++ K+ ++ + VK + Sbjct: 431 CSGGVDIKA------DVVLLVDGSYSIG---VANFAKVRAFLEVL------VKSFDISPS 475 Query: 218 VVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 VQ LV +S F L + + + + G STN+ + Y ++F Sbjct: 476 KVQISLVQYSRDPFTEFTLNRYDSIEDILKAVNTFPYRGGSTNTGKAMTYVREKVF---- 531 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 N ++++ +TDG++ +D + + ++A+G++ E Sbjct: 532 --VEIKGARPNVPRVMILITDGKSSDAFKD-----PAIKLRNSDVEIFAVGVKDAVRSE- 583 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L A ASP S V +DAF I ++ T+ I Sbjct: 584 LEAIASPPSDTHVYTVED-FDAFQRISFEL-TQSICL 618 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 47/274 (17%), Positives = 101/274 (36%), Gaps = 30/274 (10%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSR-HIVMP 157 +G ++D +T + + + Y ++AIS ++ + + T R + P Sbjct: 64 DGPEKELDLPASATQTILTELVPDVEYVVTAISYDELEESVPVFGQLTIQTGGRVTLEEP 123 Query: 158 ITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + K + + + D++ ++D S S+ ++ M+ V Sbjct: 124 KLETSKSPRCSISSVADVVFLVDGSWSVGRNN---------FKYILDFMVSLVSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + + G+V +S+ F L + L IK + G +T + L + N Sbjct: 175 DKTRVGVVQYSSDTRTEFNLNSYYKKEELVSAIKRIPYKGGNTMTGDALDFLIKNSFVKT 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R+ + +I + +TDG K + E + G V+++GI+ + Sbjct: 235 AGSRKG-------FPRIAIIITDG-----KSQDEVEIPARELRSLGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + + P + V N S+ + I + Sbjct: 283 ELKQIASLPSLKHVFNVANFDSIVEVQDEIITQV 316 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 42/250 (16%), Positives = 84/250 (33%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y ++ ++ T S + +T A D+++++D S S Sbjct: 1150 GTSYKVNVFGVFEKGESVPLIGQEMTTLSDAPEVRIDSSGLECKTKAAADIVLLVDGSWS 1209 Query: 184 M-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 + F + + I V++ ++ VQ L +S + L + Sbjct: 1210 IGRPNFRTVRSFIARL----------VEVFDIGSDRVQIALAQYSGDPRSEWQLNAHSTK 1259 Query: 243 LQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + L G +T + L + F + A KKI V +TDG Sbjct: 1260 KSLMDAVANLPYKGGNTLTGMALNFILQNNFKAE------AGMRAKSKKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + + + +G ++AIGI+ +E + + P+ Y+ + + Sbjct: 1311 --KSQDDIVAPSKKLRDQGIDLFAIGIKNADENELRQIGSDPDETYVFNVAD--FSLLVN 1366 Query: 361 IGKDIVTKRI 370 I D+ T Sbjct: 1367 IVDDLTTNLC 1376 >gi|296198464|ref|XP_002746714.1| PREDICTED: collagen alpha-1(XXI) chain isoform 2 [Callithrix jacchus] Length = 954 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLINITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|296198462|ref|XP_002746713.1| PREDICTED: collagen alpha-1(XXI) chain isoform 1 [Callithrix jacchus] Length = 957 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLINITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 58/156 (37%), Gaps = 18/156 (11%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-------NQIFDMQGMRQHCNT 282 L + + L I ++ +T G + + N ++ + Sbjct: 306 DNTPIPLTYNRNKLHDFIDDMTPR-RNTAGHIGQAWGWYLVSPEWNSVWPAGSKALPYDE 364 Query: 283 EDANYKKIIVFMTDGE-------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 DA K+++ M+DG+ N Q+ C++ K++ ++Y +G + Sbjct: 365 PDAT--KVVIMMSDGQYNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDAGYGQDV 422 Query: 336 LRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L CAS P Y N + +A+ I + I RI Sbjct: 423 LNYCASNPAFAYKPTNGQELTEAYKSIARSISDLRI 458 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 82/224 (36%), Gaps = 28/224 (12%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 RNFF N G + ++ A+ + I + G I+ K + +D +++ AT+ M + Sbjct: 6 RNFFRNESGNVAMIAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLA-ATKSMQD 64 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 G +++ N+ + + T++D+V + + E Sbjct: 65 GK----------------DRAYSLKEANDYFKGILNQSNNSGLNCTNIDLVYIDETEELE 108 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 L RH+ ++S+ +L++ V DVS SM + Sbjct: 109 GHVECSQNTTLSKV------AGIRHLDFNVSSAATYGI---GKLEIAFVFDVSGSMAN-- 157 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ + + +A + L V+ V+ +V++ + Sbjct: 158 DNRMGNLKVAAREAVNTLLPVEGYAGDPEDVRLAMVSYDTMVNA 201 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 71/449 (15%), Positives = 147/449 (32%), Gaps = 119/449 (26%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R+F + +G + ++ A+ L + + +G ++ S F + + S +D + AA + Sbjct: 24 LRHFTQDVRGNVAMMFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAA----S 79 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G + + + I + + ++ ++ + Sbjct: 80 LSQGEDPETIVRNYITAALSEH------------------NGVLERLNVQVSSDLAINSR 121 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++A + +P T + + + S + L++ +VLD+S SM Sbjct: 122 EVTADAVISVP----TLMLGIIGYDALTLNRVSEANERVRN---LEISLVLDISGSMS-- 172 Query: 188 FDSSITKI-DMAIKSINAM-------LEEVKLIP--------------------DVNNVV 219 S IT + D A + + M L + +IP + + + Sbjct: 173 -GSKITALRDAAEEFVGVMMDPDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCL 231 Query: 220 QSGL---VTFSNKIEEFFLLEWGVSHLQRK-----------------------IKYLSKF 253 + G+ VT + L+W + + + Sbjct: 232 ELGVSDPVTMDLAANGYDWLDWQDRDQRGWRSSAFCPEENEATVFLEQTPSVLVNLIRDL 291 Query: 254 GVSTNSTP---------GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------- 297 N+ L A+ + +D + K++V MTDG Sbjct: 292 DAGGNTGLDVATAWGARALDPAWRGRLGGDFASRPAAYDDPSTMKVLVVMTDGAATAQIR 351 Query: 298 --ENLSTKEDQQSLYYCNEAKKR-----------GAIVYAIGIRVIRS--HEFLRACAS- 341 +N +Y ++A+ G +Y I +V S +R CAS Sbjct: 352 RAQNWYGDWYSYEIYSASQARDNMADACDAAEAEGVHIYTIAFQVSGSTNRNLMRDCASR 411 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 P ++Y VEN + AF+ I D+ R+ Sbjct: 412 PENYYAVENL-DISAAFNSIAADLNNLRL 439 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 64/342 (18%), Positives = 129/342 (37%), Gaps = 49/342 (14%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + +NEG G L + + S L Sbjct: 682 LNRYFTQQEISDAIDRMSL------INEGTLT-------GKALNFVGQYFTHSKGARLGA 728 Query: 101 NGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-------- 151 F+ I D V + D + + + +I ++ Y I + Sbjct: 729 KKFLILITDGVARDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDGSLVFHVENFD 788 Query: 152 ------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 R +V + + T LD++ VLD S S++ + + IN Sbjct: 789 HLKALERKLVFRVCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLT 837 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 + VK + VQ G + +S++ F L S + ++ G +T + L Sbjct: 838 IHLVKKADVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKAL 897 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K+A N +F +H + + N K++++ +TDG++ + Q +E + +G ++ Sbjct: 898 KHA-NALFT----EEHGSRINQNVKQMLIVITDGKS---HDHDQLNDTASELRDKGITIF 949 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 A+G+ E + N+ V+N + D F+ + + + Sbjct: 950 AVGVGKANQKELEGMAGNKNNAIYVDNFDKLKDVFTLVQESM 991 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 76/210 (36%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VARDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDG 778 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + VEN +D + + +V + Sbjct: 779 SLVFHVEN----FDHLKALERKLVFRVCAL 804 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 68/202 (33%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYITDYSNDIDLRKAILNIKQITGGTYTGRALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + ++A+GI E +N Sbjct: 547 LIVLTDG-----MSADRVVEPAKRLRAEQITIHAVGIGAANKIELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +DA I ++V + Sbjct: 602 ----FDALKSIKNEVVREICTE 619 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 63/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P N + L +S++ Sbjct: 30 DVVFLVDSSDHLGP------KSFPFVKTFINKMINSL---PIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALREAHRTYFSAHTNGRDKK----QFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + KK G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALKKDGVKIISVGVQEASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 85.6 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 73/464 (15%), Positives = 143/464 (30%), Gaps = 136/464 (29%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + + +G + I+ A+ ++ +G ++ S + + L S +D + AA+ + Sbjct: 8 FSRWSDDRRGNVAIIMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALAAAS-LTQ 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + N + + L S LD+V Sbjct: 67 DRNPEDVVRAYVEAALADHPQLLA---------------------SLQLDVVADISLNSR 105 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++A + +P T + + + S + +++ +VLDVS SM Sbjct: 106 VVNATASVAMP----TTMLGLVGINTLTLEHASEAIEQVRD---VEISLVLDVSGSMGG- 157 Query: 188 FDSSITKIDMAIKSINAMLE-----------EVKLIP------------------DVNNV 218 +KI+ + +E + +IP + N+ Sbjct: 158 -----SKINALQDAAIEFVEIVLAADAAERTSISVIPYNGGVRTPREVNQDIVSGNNNHR 212 Query: 219 VQSGLV---------------------TFSNKIEEFFL-------------LEWGVSHLQ 244 QSG V + ++ L ++ Sbjct: 213 RQSGCVDMGTDYPVEMTLPYREMEFTEYYGSEQTGNSSSAFCPRSNMESEFLSQNEGRMR 272 Query: 245 RKIKYLSKFGVS-----TNSTP-GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 I L G + T L A+ + + +D + KI+V MTDGE Sbjct: 273 GLINSLRAEGNTGLDVATMWGARALDPAWRGNLGGSFSDRPASYDDRDTIKILVVMTDGE 332 Query: 299 NLST----------------------------KEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 + + + C+ A+ G +Y I ++ Sbjct: 333 ATAQIRSEEYTYYDWWGRERTGTRSYELYSARQARENMAEACDIAEGNGVQIYTIAFQLS 392 Query: 330 -IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + +R CA+ P ++Y VEN + +AFS I DI R+ Sbjct: 393 GQTNRDLMRNCANKPQNYYQVENL-DIAEAFSSIAADINRLRLT 435 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 52/284 (18%), Positives = 104/284 (36%), Gaps = 30/284 (10%) Query: 90 WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT 149 WN + L + F + +V V ++S + P Sbjct: 3 WNKKIKENLLRSIFFASVLGVVAIVLTGAVSAQAIAEPAVSKTAS---PALINIAGSGVN 59 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + + +T S + +D++ +D S SM+S S + K + + ++++ Sbjct: 60 EETTVTIEVTG-AGSTSTSAVPMDVVFAIDSSGSMQSNDPSGLRK-----TAAKSFVDKM 113 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 D G+V++ + I+ L ++ I + G STN GL+ A + Sbjct: 114 DSSRDTA-----GVVSWDDSIDFSLPLTNDFPLVKTNIDSVDSSG-STNLNVGLEEAIDI 167 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + N N ++I+F+TDG+ EA +G ++Y+IG+ Sbjct: 168 L--------DANPRTENSVEVIIFLTDGQGTYLHS------TAQEAADKGYVIYSIGLGG 213 Query: 330 IRSHEFL-RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + A + ++Y + S+ F I ++ T I Y Sbjct: 214 VNPTPLQDMATTTGGAYYSSPDATSLQAIFDDIFSEVTTSTIPY 257 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 83/220 (37%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ ++ K IN +++ + + + Sbjct: 216 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENFELVKKFINQIVDTLDVSD---KL 266 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 267 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 326 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 327 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 374 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 375 REIASEPVAEHYFYTAD----FKTINQIGKRLQKKICVEE 410 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 16/115 (13%), Positives = 49/115 (42%), Gaps = 11/115 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ V+D SRS+ + + ++ ++E + + P+ + G Sbjct: 31 RGHLCRTRPTDLVFVVDSSRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVG 81 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 +V +++ +++ F L + L + ++ + T + +++A + F Sbjct: 82 VVNYASAVKQEFPLRAHGSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAE 136 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 68/202 (33%), Gaps = 19/202 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D+++ LD S SME +K ++ K D G Sbjct: 68 SIKLDDRKGRDLVLALDASGSMEESLYDEKSKFEVVKSMAQNFFH--KRFDDN-----IG 120 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +V F + L + L I YL N+ G ++ Q Sbjct: 121 IVIFGSFAYIAAPLTYDTKALDFLINYLEPSIAGNNTAIG-----EGLWQGIKALQ---- 171 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-AS 341 D +K+++ +TDG + S +AKK G +Y IG+ H + S Sbjct: 172 ADTAKQKVLILITDGHHNSGS--ISPRQAVEKAKKLGIKIYTIGLGDADKHLLEQIAKES 229 Query: 342 PNSFYLVENPHSMYDAFSHIGK 363 F+ ++ + FS + K Sbjct: 230 GGKFFYAKSEEDLQSIFSELNK 251 >gi|311244457|ref|XP_001927071.2| PREDICTED: collagen alpha-1(XII) chain [Sus scrofa] Length = 1894 Score = 85.2 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T L + + + Y IS + + I T + + + V + + Sbjct: 364 PQTTMLSVRDLSADTEYQISVSAMKGLTASEPISIMEKTQPMKVQVECSRGVDIKA---- 419 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 420 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 468 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 469 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 522 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 523 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 576 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 577 FTVED----FDAFQRISFEL-TQSICL 598 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 99/264 (37%), Gaps = 34/264 (12%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNSQTDAR- 171 +D + ++++A + + + ++ T+ + + ++ + + Sbjct: 59 VDPTTDGPTKEFTLAASTTETLLSELIPEVEYVVTITSYDEVEESVPVIGQLTKCSVSAW 118 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D S S+ + I I ++ V + G+V +S+ Sbjct: 119 TDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGEEKTRVGVVQYSSDTR 169 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYK 288 F L + L IK + G +T + + + N + G R + Sbjct: 170 TEFNLNQFYQRDELLAAIKKIPYKGGNTMTGDAIDFLIKNTFTESAGARVG-------FP 222 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFY 346 K+ + +TDG K + E + G V+++GI+ + E + ++P N + Sbjct: 223 KVAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVF 277 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 278 NVAN----FDAIVDIQNEIISQVC 297 >gi|293349337|ref|XP_001060689.2| PREDICTED: collagen, type XII, alpha 1 [Rattus norvegicus] Length = 3064 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 105/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + + T + + + V + + Sbjct: 384 PQTTTLNVRDLSADTEYQISVFAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 102/309 (33%), Gaps = 45/309 (14%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + R + + T S L N + + +V TS D V ++ Sbjct: 49 ADPIVGYRITVDSTTDGPTKEFTLAASTTETLLSN-LIPETQYVVTITSYDEVE--ESVP 105 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 P K P T + + + D++ ++D S S+ Sbjct: 106 VIGQLTIETGDPAKPGEKKPGKTEIQKCSVSAWT------------DLVFLVDGSWSVGR 153 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQ 244 ++ ++ V + G+V +S+ F L + L Sbjct: 154 NN---------FKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLL 204 Query: 245 RKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 IK + G +T + + Y N + G R + K+ + +TDG K Sbjct: 205 AAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGSRAG-------FPKVAIIITDG-----K 252 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHI 361 + E + G V+++GI+ + E + ++P N + V N +DA I Sbjct: 253 SQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVAN----FDAIVDI 308 Query: 362 GKDIVTKRI 370 +I+++ Sbjct: 309 QNEIISQVC 317 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 84/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S ++ ++ +A D+++++D S Sbjct: 1150 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMECLTRAEA--DIVLLVDGS 1207 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1208 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWHLNAHRD 1258 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1259 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI-HAYNVAD-FESLS 1365 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1366 KIVDDLTINLC 1376 >gi|301610722|ref|XP_002934910.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 957 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 85/214 (39%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + P Q G+V Sbjct: 29 SSCRTAPNDLVFILDGSWSVGP------ENFEILKKWVVNITSNFNIGP---KFTQVGVV 79 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L L R+++ + G +T + +++A + +F Sbjct: 80 QYSDYPILEIPLGSYESSDDLSRRMQSIQYLGGNTQTGNAIRFAIDNLFARSL------- 132 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + + EA+K ++AIG+ LRA A Sbjct: 133 --RPLTKIAVVLTDG-----KSQDEVKHVAEEARKNKITLFAIGVGSEIEESELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++I+ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REIMKQKLCEE 214 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 70/415 (16%), Positives = 137/415 (33%), Gaps = 67/415 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID----------RS 57 ++NF+ G ILTA+ + + G+ ++++ +K L D + Sbjct: 1 MKNFWQEKSGNFGILTALLMVPLCGAAGVALDITRGMSVKADLQQAADSAALAAVADMSA 60 Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKN----TWNMSFRNELRDNGFVNDIDDIVRS 113 V AA ++ +G + + T + ++ V S Sbjct: 61 SVQAAKKMSGDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHGNVVESSVSFKAS 120 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLK-FCTFIPWYTNSRHIVMPIT----SSVKVNSQT 168 S + + + S++ + K + F F N+ + + T +++ N+ Sbjct: 121 VSTTLSGLLGKDFVSVAGTATAKYETETFSDFYLLLDNTPSMGVGATPTDVATLVANTGD 180 Query: 169 DARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 IV D S++ T+ID+ K+ ++++ K +N + + Sbjct: 181 KCAFACHIVKDGVADPNSYYFKAKKLGVTTRIDVVAKATASLMDTAKSTRKSSNQYRMAV 240 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF + E+ LLE V L + K N YN R Sbjct: 241 YTFGERAEDTKLLE--VVSLTSDLDAAKKKAGEINLMSIPYQGYNNDQQTDFDRALIQIG 298 Query: 284 D-----------ANYKKIIVFMTDGENLSTKEDQQSLYY-------------CNEAKKRG 319 D AN K+I F++DG S K + C + K++G Sbjct: 299 DKVGSSGTGASSANPDKVIFFVSDGVGDSYKPSSCTKKLTGGRCQEPIDIKDCTKLKEKG 358 Query: 320 AIV---YAIGIRVIRS--------------HEFLRACASPNSFYLVENPHSMYDA 357 + Y + + + +++CASP ++ V + DA Sbjct: 359 FRIAVLYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSCASPGLYFEVSPSQGISDA 413 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 85/221 (38%), Gaps = 30/221 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + + D++ +LD S S+ + + + +++ ++ PD Sbjct: 24 ETQAQRAGCKNVHYDLVFILDTSSSVGK------EDFEKVRQWVANLVDTFEIGPDK--- 74 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY----LSKFGVSTNSTPGLKYAYNQIFDMQ 274 + G+V +S++ F L G + +IK + +G +TN+ L+Y F + Sbjct: 75 TRVGVVRYSDRPTTEFDL--GRYQTREQIKEAAKNIKYYGGNTNTGDALRYINTYSFSEE 132 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 D+ KK+ + +TDG L A K G ++A+G+ E Sbjct: 133 ---AGGRPTDSAIKKVAILLTDGR-----SQDHVLDPATAAHKAGIRIFAVGVGEALKEE 184 Query: 335 FLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + Y+A I + + +R+ + Sbjct: 185 LDEIASEPKSAHVFHVSD----YNAIDKI-RGKLRRRLCEN 220 >gi|149732296|ref|XP_001503293.1| PREDICTED: collagen, type XXI, alpha 1 [Equus caballus] Length = 957 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 84/214 (39%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITRNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L +L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSHDSGENLMAAMESIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A++ I++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARESKIILFAIGVGSETEEAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|149019069|gb|EDL77710.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] gi|149019070|gb|EDL77711.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] Length = 2827 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 105/267 (39%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L++ + + Y IS + + + T + + + V + + Sbjct: 326 PQTTTLNVRDLSADTEYQISVFAMKGLTSSEPVSVMEKTQPMKVQVECSRGVDIKA---- 381 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + K N VQ LV +S Sbjct: 382 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------AKSFEISPNRVQISLVQYSRDP 430 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ V + + I G STN+ + Y +IF +N Sbjct: 431 HTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIFVPN------KGSRSNVP 484 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 485 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 538 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 539 FTVED----FDAFQRISFEL-TQSICL 560 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 82/269 (30%), Gaps = 39/269 (14%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + R + + T S L N + + +V TS D V ++ Sbjct: 49 ADPIVGYRITVDSTTDGPTKEFTLAASTTETLLSN-LIPETQYVVTITSYDEVE--ESVP 105 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 P K P T + + + D++ ++D S S+ Sbjct: 106 VIGQLTIETGDPAKPGEKKPGKTEIQKCSVSAWT------------DLVFLVDGSWSVGR 153 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQ 244 ++ ++ V + G+V +S+ F L + L Sbjct: 154 NN---------FKYILDFIVALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLL 204 Query: 245 RKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 IK + G +T + + Y N + G R + K+ + +TDG K Sbjct: 205 AAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGSRAG-------FPKVAIIITDG-----K 252 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + E + G V+++ I + Sbjct: 253 SQDEVEIPARELRNIGVEVFSLVIEPPSN 281 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 84/251 (33%), Gaps = 28/251 (11%) Query: 124 NEGYSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Y ++ + L + + ++S ++ ++ +A D+++++D S Sbjct: 1092 GTTYRVNVFGMFDGGESLPLVGQEMTTLSDTTVTPFLSSGMECLTRAEA--DIVLLVDGS 1149 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WG 239 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1150 WSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWHLNAHRD 1200 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1201 KKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG-- 1252 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1253 ---KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI-HAYNVAD-FESLS 1307 Query: 360 HIGKDIVTKRI 370 I D+ Sbjct: 1308 KIVDDLTINLC 1318 >gi|327282764|ref|XP_003226112.1| PREDICTED: collagen alpha-1(XXI) chain-like [Anolis carolinensis] Length = 956 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 81/212 (38%), Gaps = 28/212 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSWSVGP------ENFEIVKRWLVNITSNFNIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L +L R ++Y+ G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGFHDSNENLVRGMEYIQYLGGNTQTGKAIQFALDHLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K EA+K ++AIG+ + LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKEVAAEARKNRITLFAIGVGSETEEDELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S + VE+ ++ I + + + + Sbjct: 186 PSSTYVFYVEDYIAISRIREVIKQKLCEESVC 217 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 82/216 (37%), Gaps = 24/216 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F + + R+I + + D + D+M ++D S S+ + K+ +K Sbjct: 585 VNFGQNFDSLRNIKNEVVHRICTEKGCEDMKADIMFLVDSSGSIGH---DNFGKMKTFMK 641 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTN 258 +L ++++ PD Q G+V FS+ EEF L ++ + I +S T Sbjct: 642 ---NLLAKIQIGPDS---TQIGVVQFSDINQEEFQLNKYFTQNETSDAIDRMSLINRGTL 695 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L + +G R KK ++ +TDGE + + Sbjct: 696 TGSALTFVGQYFTPTKGAR-------TKVKKFLILITDGEAQDP-----VRDPAKALRDK 743 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 G +++++G+ + + + VEN + Sbjct: 744 GVVIFSVGVYGANRTQLEEISGDSSLVFQVENFDDL 779 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 52/322 (16%), Positives = 116/322 (36%), Gaps = 25/322 (7%) Query: 41 SHIFFMKTVLHSMIDR-SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 + +F + IDR SL++ T + + ++K + Sbjct: 671 LNKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTKGARTKVKKFLI--LITDGE 728 Query: 100 DNGFVNDIDDIVRSTSL---DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 V D +R + + V N + + F T +V Sbjct: 729 AQDPVRDPAKALRDKGVVIFSVGVYGANRTQLEEISGDSSLVFQVENFDDLKTVESKLVF 788 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + + LD++ VLD S S+ + S+ +N + VK + Sbjct: 789 RVCALHDCKNIK--VLDIVFVLDHSGSIGTQEQESM---------MNLTIHLVKKADVDS 837 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + V+ G + +S+ E F L S + ++ T + L++A N +F Sbjct: 838 DRVRVGALKYSDYPEVLFYLSGNKSAVIEHLRRRRYTSGHTYTARALEHA-NIMFT---- 892 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ + N K++++ +TDG + + ++ + +G +YA+G+ E Sbjct: 893 EEYGSRIQQNVKQMLIIITDG---VSHDRDNLSDTASKLRNKGINIYAVGVGQANQLELE 949 Query: 337 RACASPNSFYLVENPHSMYDAF 358 + ++ + V+N ++ D + Sbjct: 950 TMAGNKSNTFHVDNFSNLKDIY 971 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 78/202 (38%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + +I + ++ M ++ VQ+G V +S+KI Sbjct: 431 DIYFLIDGSTSINTE---GFEQIKQFMLAVTGMFSI------GSDKVQAGAVQYSDKIRV 481 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + +T++ L + + ++ R+H + Sbjct: 482 EFYINASSNDMDLRKAILNIEQLQGNTHTGKALDF---MLSIIKKDRKHRIS---EIPCH 535 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L + ++A+GI + + +N Sbjct: 536 LIVLTDG-----KSQDEVLKPAERLRDEQITIHAVGIGEADKIQLQQIAGEEERVNFGQN 590 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ +I ++V + Sbjct: 591 ----FDSLRNIKNEVVHRICTE 608 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 64/186 (34%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + I+ ++ + P + + L +S+++ Sbjct: 19 DVVFLVDSSNHLG------TKSFPFVKTFISKIINSL---PIEAHKYRVALAQYSDQLHS 69 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L S + +L K G S L+ A+ F R + + Sbjct: 70 EFQLGTFKSR-NPMLNHLKKNFGFVGGSLRIGQALREAHRTYFS----RPDSGRDKKQFP 124 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + + G + ++G++ E L+A A+ + + Sbjct: 125 PILVVL-----ASAESEDDVEEPSKALRGDGVRIISVGLQSASEQE-LKAMATGQFHFNL 178 Query: 349 ENPHSM 354 + + Sbjct: 179 RSARDL 184 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 89/222 (40%), Gaps = 25/222 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + ++V A D++ ++D S S+ D A + + + P Sbjct: 42 TIFMRGRLEVAGCRTAVNDLVFIVDGSWSVG------YKDFDTAKNWLLNITSSFDIGPS 95 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q G+V +S+ + L S+ Q +K + G +TN+ +K+A ++F Sbjct: 96 ---YTQVGVVQYSDFPQLEIPLGHNTSYQQLLSALKSIKYLGGNTNTGRAIKFATEEVFP 152 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + KI + +TDG K + + A+ +G I++A+G+ + Sbjct: 153 TS------KRLNVSKNKIAIVITDG-----KSQDNVVNISSSARAQGIILFAVGVGSEIT 201 Query: 333 HEFLRACAS-PNSFY--LVENPHSMYDAFSHIGKDIVTKRIW 371 L A A+ P++ Y E+ ++ + + I + + Sbjct: 202 KSELVAIANMPSTHYVLYAEDYTTIDRIKETMRQKICEESVC 243 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 49/321 (15%), Positives = 104/321 (32%), Gaps = 50/321 (15%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW------NMSFRNELRDNGFVNDIDDIV 111 + + N+ + K++ I+ K + R + + + D Sbjct: 94 ALKYVASLFNQAKASRPAKVQEILIVITDKTSQDDVGDPAEELRIQGVS-VYAIGVKDAS 152 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + L + + Y ++ Y + IV I S + + Sbjct: 153 QDELLKMTADETKDFY----VTNYDA----LNVLKRE-----IVTDICS---QEACKNKV 196 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+M ++D S S+ + T + I + V + V G+V FSN + Sbjct: 197 ADIMFLIDGSSSI---YGPDFTSMKTFITKV------VNGTIIGEDSVHVGVVQFSNNPQ 247 Query: 232 EFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F L + L+ I + + T + L + G + + Sbjct: 248 EQFPLNRYFDQNELEEAIDGIEQLTGDTYTGKALSFISKYFDASNG-------GRPDVPQ 300 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +V +TDGE + +G +++IG+ + + + + + Y+ Sbjct: 301 FLVVITDGE-----AHDAVAVPAKAIRDKGVTIFSIGVASVNTTQLWEISGTQDKVYVQR 355 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + +DA I K++ K Sbjct: 356 D----FDALHSIDKNLQFKLC 372 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 75/204 (36%), Gaps = 23/204 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +LD S S + F+ + K +L ++L N V+ G+V Sbjct: 13 DIYFLLDNSGSTRADFE-DVKKF---------ILGSLQLFNIGPNRVRVGVVKVDRNPTL 62 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + ++ +S+ + LKY + + R A ++I Sbjct: 63 QFSLTEHKNRASFEAAVRGISQPVGGSEKGKALKYVASLFNQAKASRP------AKVQEI 116 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TD E + +G VYAIG++ E L+ A + V N Sbjct: 117 LIVITD-----KTSQDDVGDPAEELRIQGVSVYAIGVKDASQDELLKMTADETKDFYVTN 171 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 ++ I DI ++ +K Sbjct: 172 YDALNVLKREIVTDICSQEACKNK 195 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 85/247 (34%), Gaps = 18/247 (7%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 T I V N F ++ +++ + SS + Sbjct: 324 KGVTIFSIGVASVNTTQLWEISGTQDKVYVQRDFDALHSIDKNLQFKLCSSHPGGCPSTQ 383 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++ +R + KI ++ V +N+++ G++ +S+ Sbjct: 384 LADVIFLVQCTRQIR---LQDFEKIKSF------LISVVNSTQIGDNLIRFGVIVYSDTP 434 Query: 231 EEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +F L ++ + I L VS + L Y+ + G RQ + Sbjct: 435 SQFSLNQYNTRRQIAEAITSLKSPAVSGYTARALAYSLTYFTEANGGRQKRG-----VPQ 489 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++ +TDG+ ++ + +E + VY IG+ + E S N + V Sbjct: 490 MLFMITDGD---ARDRENLRARADEFAAKQINVYGIGVARAQDSELEIITKSKNKIFHVN 546 Query: 350 NPHSMYD 356 N ++ D Sbjct: 547 NYDNLQD 553 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/257 (13%), Positives = 85/257 (33%), Gaps = 46/257 (17%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 R+N + + ++ + V + + + I++ K + + + V + Sbjct: 502 RENLRARADEFAAKQINVYGIGVARAQDSELEIITKSKNKIFHVNNYDNLQDLQMNVSGV 561 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + D++ ++D S S+ + I I +L V + + Sbjct: 562 LCNATKPVCQNEVADLVFLIDGSESISE---------ESWITVIAFLLNVVDKLRIGPEL 612 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+ FS+ ++ F + N+ D G + Sbjct: 613 FRVGIAQFSSVYQKEFYM-------------------------------NEYKDADGEDK 641 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + + +V +TDG + ++ + G V+AIGI + + Sbjct: 642 GSRKQSGVPQN-LVLITDG-----VSSDRVNEAADQLRNLGINVFAIGIGAVSLQQLSYI 695 Query: 339 CASPNSFYLVENPHSMY 355 SP+ + V+N + + Sbjct: 696 AGSPDRLFKVQNFNYLN 712 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 84.8 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 53/352 (15%), Positives = 118/352 (33%), Gaps = 34/352 (9%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 +I + ++ ++ + T ++ N +N I + +F Sbjct: 475 LIALDAWGHSLQNINVTLNNTAPSLGTLSLDGSNESNLINFTTDQYGRVITEFTSSNFVG 534 Query: 97 E---LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS----AISRYKIPLKFCTFIPWYT 149 + + +ND I I E Y ++ I ++ + Sbjct: 535 NCTIIGLSDAINDSLSIEIRNQPFISASIDAEPYVVTSGDIVNITTVITVEGELPVSRSA 594 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + +++ + S+ + LD+++VLD S SM+ ++ + A + +E + Sbjct: 595 ATSMLILDRSGSMDPDYYAGTALDIVLVLDRSGSMKFLGNAPEQPLTDAKSAAKIFMENL 654 Query: 210 KLIPDVNNVVQSGLVTFSN-----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + G+V+FS+ + + L I + G T + Sbjct: 655 LS------NTEVGVVSFSSTSTVDRQPVSLNISGNKDLLHNAIDSMVADG-GTAIGDAMA 707 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A N + + + KKI++ +TDG + + S + A +Y+ Sbjct: 708 DANNLLIN----------GRPDAKKIMIVLTDGVATAGSDRDGSDAI-STANLNNIRIYS 756 Query: 325 IGIRVIR--SHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 IG+ L+ AS S+Y + + ++ I K+I + Sbjct: 757 IGLGSSEYIDEPMLKRIASETGGSYYNAPSGSELQTVYNTISKEISDFDVTE 808 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 85/203 (41%), Gaps = 23/203 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S++ + + IN + VK N VQ G + +S++ Sbjct: 813 LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRNRVQFGALKYSDQPN 863 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G +T + LK+A N +F +H + N K+ Sbjct: 864 ILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHA-NALFT----EKHGSRIKQNVKQ 918 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG++ + Q +E + +G ++A+G+ E + N+ V+ Sbjct: 919 VLIVITDGKS---HDHDQLNDTASELRDKGITIFAVGVGKANQKELEGMAGNKNNTIYVD 975 Query: 350 NPHSMYDAFSHIGKDIVTKRIWY 372 N + D F+ + + Sbjct: 976 NFDKLKDVFT----LVQERMCTE 994 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 76/210 (36%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQVGADKTQIGVV 670 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 671 QFSDKTKEEFQLSRYFTQQEISDAIDRMSLINEGTLTGKALNFVSQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDG 778 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + VEN +D + + +V + Sbjct: 779 SLVFHVEN----FDHLKALERKLVFRVCAL 804 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 69/202 (34%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGTKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +DA I ++V + Sbjct: 602 ----FDALKSIKNEVVREICTE 619 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P N + L +S++ Sbjct: 30 DVVFLVDSSDHLGP------KSFPFVKTFINKMINSL---PVEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALQKDGVKIISVGVQEASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 78/205 (38%), Gaps = 25/205 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++++ + + GLV Sbjct: 21 GTCRTSNIDLVLLIDGSKSVRP------QNFELVKQFVNQVVDQLDV---SAKGTRVGLV 71 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ + F L +++ + + T + LK+ F + Sbjct: 72 QYSSCVRTEFPLSMYHSKDEIKKAVMNVEYMEKGTMTGLALKHMVENSFSEAEGARPAEK 131 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + TDG + +AK+ G +YA+G+ E + P Sbjct: 132 N---IPRVGLVFTDGR-----SQDDIQEWAKKAKEAGITMYAVGVGKAVEDELREIASDP 183 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 F+ + + A S I +++ Sbjct: 184 VEKHFFYSAD----FTAISQIAENL 204 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 84.4 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 44/348 (12%), Positives = 99/348 (28%), Gaps = 34/348 (9%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M +L AI + + +V+ I+++ + +T L S N L Sbjct: 1 MLVLIAIMMFLFLIVVAFSIDIAQMHLARTELRSST--------DAAANAAATTLADTLD 52 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 + R + + N D ++ + + Sbjct: 53 RNLAIQRGQQIAQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVRVNGQR 112 Query: 139 L--KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +P + + + T D+ +V+D S SM ++ + Sbjct: 113 TTGSLSGPVPLFFGNVTGTSIFEPEA-FATATYVERDITLVVDRSGSMAG------SRFN 165 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 +I + + P Q GL +++++ E L + + + L G Sbjct: 166 DLQAAIRIFTDLLATTPVDE---QIGLASYNDRASEDVQLTENFAEVNNAMDRLRT-GGF 221 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T+ + G++ ++ ++ MTDG + E + Sbjct: 222 TSISRGMQAGQEIAL--------RGRPPEFVERTMIVMTDGRHNRGPE---PRVVATDLA 270 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 G ++ I ++ A + N + D + I Sbjct: 271 ADGVTIHTITFGAGADFGRMQDVARIGGGRHFHATNGDQLRDIYREIA 318 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 35 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 93 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 94 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 153 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 154 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 213 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 214 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 273 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 274 PTVAATAVNKAKSLKDAGTTIYSIGV 299 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 35 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 93 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 94 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 153 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 154 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 213 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 214 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 273 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 274 PTVAATAVNKAKSLKDAGTTIYSIGV 299 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 91/237 (38%), Gaps = 33/237 (13%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDS 190 K+ K C + ++ I S + ++Q + D+ ++D S S++ Sbjct: 809 NKLKKKLCPEMLSRMPP--LMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN--- 863 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIK 248 ++ +K + +K+ + V+ G+V +S+KI F L ++ L I Sbjct: 864 DFIEMKDFMKEV------IKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAID 917 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + G T + L + + NT + + ++ +TDG++ Sbjct: 918 NIQQEGGGTTTGKAL---------SKMVPVFQNTARVDVARYLIVITDGQSTDP-----V 963 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + G +YAIG+R + E L AS F+ + S+ + +DI Sbjct: 964 AEAAQGLRDIGVNIYAIGVRDANTTE-LEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 56/318 (17%), Positives = 115/318 (36%), Gaps = 26/318 (8%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G I+ R + + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLAEDHIREPAEALRRAGIL--VYAI 168 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + D ++ +I P++ ++ + +P P ++ T Sbjct: 169 GVKDASQAELREISSSPKD-NFTFFVPNFPGLPGLAQKLRPELCSTLAKAAQYTEQESPA 227 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D++ ++D S S+ + K+ + S+ L ++ VQ GLV Sbjct: 228 CSEASPADIVFLVDSSTSIG---LQNFQKVKHFLHSVVLGL------DVRSDQVQVGLVQ 278 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNT 282 +S+ I F L+ S + +I+ L T++ L++ N + +M G R Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA---- 334 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I+V +TDGE+ +D ++ K+ G V+ +GI + E + + P Sbjct: 335 -KDGVPQIVVLVTDGESSDEVQD-----VADQLKRDGVFVFVVGINIQDVQELQKIASEP 388 Query: 343 -NSFYLVENPHSMYDAFS 359 F S+ A S Sbjct: 389 FEEFLFTTENFSILQALS 406 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 55/341 (16%), Positives = 110/341 (32%), Gaps = 39/341 (11%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAA-TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFR 95 +++ S + L + + AA I EG G K + R Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTARVDVAR 949 Query: 96 N-----ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIP 146 + + V + +R ++I V + I+ K+ F Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT--ELEEIASKKM-FFIYEFDS 1006 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + ++ I SS SQ D++ ++D S S+ K+ ++ Sbjct: 1007 LKSIHQEVIRDICSSENCKSQKA---DIIFLIDGSESI---APKDFEKMKDFME------ 1054 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLK 264 V + +Q GL+ FS+ E F L S + R I + + T++ L Sbjct: 1055 RMVNQSNIGADEIQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALN 1114 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + +G + ++ +TDG + R I++A Sbjct: 1115 FTLPFFDSSRG-------GRPRVHQYLIVITDG-----VSQDNVAPPAKALRDRNIIIFA 1162 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 IG+ ++ + L + + EN S+ I ++ Sbjct: 1163 IGVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/324 (12%), Positives = 107/324 (33%), Gaps = 42/324 (12%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 +G++ +I+ + S + +++ + G + + Sbjct: 274 VGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSR 333 Query: 94 FRNELR-------DNGFVNDIDDIVRS----------TSLDIVVVPQNEGYSISAISRYK 136 ++ + D +++ D+ ++I V + + I++ + Sbjct: 334 AKDGVPQIVVLVTDGESSDEVQDVADQLKRDGVFVFVVGINIQDVQELQK--IASEPFEE 391 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSMESFFDSSITK 194 F S ++ + S+V+ + D++ ++D S +S Sbjct: 392 FLFTTENFSILQALSGTLLQALCSTVERQMKKSNKTYADVVFLIDTS------QGTSPAS 445 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSK 252 I+ ++ +++ D + GL +S++ FL + + I L Sbjct: 446 FQWMQNFISRIIGILEVGQDK---YRIGLAQYSDQGHTEFLFNTHKTRNEMVAHIHELLV 502 Query: 253 F-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 F G S + GL++ + F + + +V +T G K + + Sbjct: 503 FQGGSRKTGQGLRFLHRTFFQEAAGSRLL----QGVPQYVVVITSG-----KSEDEVGEV 553 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEF 335 +KRG + ++G++ E Sbjct: 554 AQILRKRGVDIVSVGLQDFDRAEL 577 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 91/237 (38%), Gaps = 33/237 (13%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDS 190 K+ K C + ++ I S + ++Q + D+ ++D S S++ Sbjct: 809 NKLKKKLCPEMLSRMPP--LMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN--- 863 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIK 248 ++ +K + +K+ + V+ G+V +S+KI F L ++ L I Sbjct: 864 DFIEMKDFMKEV------IKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAID 917 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + G T + L + + NT + + ++ +TDG++ Sbjct: 918 NIQQEGGGTTTGKAL---------SKMVPVFQNTARVDVARYLIVITDGQSTDP-----V 963 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + G +YAIG+R + E L AS F+ + S+ + +DI Sbjct: 964 AEAAQGLRDIGVNIYAIGVRDANTTE-LEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 56/318 (17%), Positives = 115/318 (36%), Gaps = 26/318 (8%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G I+ R + + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGIL--VYAI 168 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + D ++ +I P++ ++ + +P P ++ T Sbjct: 169 GVKDASQAELREISSSPKD-NFTFFVPNFPGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D++ ++D S S+ + K+ + S+ L ++ VQ GLV Sbjct: 228 CSEASPADIVFLVDSSTSIG---LQNFQKVKHFLHSVVLGL------DVRSDQVQVGLVQ 278 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNT 282 +S+ I F L+ S + +I+ L T++ L++ N + +M G R Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA---- 334 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I+V +TDGE+ +D ++ K+ G V+ +GI + E + + P Sbjct: 335 -KDGVPQIVVLVTDGESSDEVQD-----VADQLKRDGVFVFVVGINIQDVQELQKIASEP 388 Query: 343 -NSFYLVENPHSMYDAFS 359 F S+ A S Sbjct: 389 FEEFLFTTENFSILQALS 406 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 55/341 (16%), Positives = 110/341 (32%), Gaps = 39/341 (11%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAA-TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFR 95 +++ S + L + + AA I EG G K + R Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTARVDVAR 949 Query: 96 N-----ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIP 146 + + V + +R ++I V + I+ K+ F Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT--ELEEIASKKM-FFIYEFDS 1006 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + ++ I SS SQ D++ ++D S S+ K+ ++ Sbjct: 1007 LKSIHQEVIRDICSSENCKSQKA---DIIFLIDGSESI---APKDFEKMKDFME------ 1054 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLK 264 V + +Q GL+ FS+ E F L S + R I + + T++ L Sbjct: 1055 RMVNQSNIGADEIQIGLLQFSSNPREEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALN 1114 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + +G + ++ +TDG + R I++A Sbjct: 1115 FTLPFFDSSRG-------GRPRVHQYLIVITDG-----VSQDNVAPPAKALRDRNIIIFA 1162 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 IG+ ++ + L + + EN S+ I ++ Sbjct: 1163 IGVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 42/324 (12%), Positives = 107/324 (33%), Gaps = 42/324 (12%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 +G++ +I+ + S + +++ + G + + Sbjct: 274 VGLVQYSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSR 333 Query: 94 FRNELR-------DNGFVNDIDDIVRS----------TSLDIVVVPQNEGYSISAISRYK 136 ++ + D +++ D+ ++I V + + I++ + Sbjct: 334 AKDGVPQIVVLVTDGESSDEVQDVADQLKRDGVFVFVVGINIQDVQELQK--IASEPFEE 391 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSMESFFDSSITK 194 F S ++ + S+V+ + D++ ++D S +S Sbjct: 392 FLFTTENFSILQALSGTLLQALCSTVERQMKKSNKTYADVVFLIDTS------QGTSQAS 445 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSK 252 I+ ++ +++ D Q GL +S++ FL + + I L Sbjct: 446 FQWMQNFISRIIGILEVGQDK---YQIGLAQYSDQGHTEFLFNTHKTRNEMVAHIHELLV 502 Query: 253 F-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 F G S + GL++ + F + + +V +T G K + + Sbjct: 503 FQGGSRKTGQGLRFLHRTFFQEAAGSRLL----QGVPQYVVVITSG-----KSEDEVGEV 553 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEF 335 +KRG + ++G++ E Sbjct: 554 AQILRKRGVDIVSVGLQDFDRAEL 577 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 49/149 (32%), Gaps = 12/149 (8%) Query: 201 SINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 S+ L + + + ++ GL + + FLL + + I+ L Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 L++ I + + ++ V ++ G + ++ Sbjct: 110 RMGQALQF----ILEHHFREGAGSRASQGVPQVAVVVSSG-----LTEDHIREPAEALRR 160 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFY 346 G +VYAIG++ E +SP + Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 84/213 (39%), Gaps = 27/213 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 M + S + + + D+M ++D S S+ + +K+ +K++ V Sbjct: 583 MSMVLSFSYAACKEMKADIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIG 633 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + VQ G+V FS+ +E F L +S + I ++ G +T + L + Sbjct: 634 PDRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPT 693 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +G R N +K ++ +TDGE ++ G I+Y++G+ Sbjct: 694 KGARP-------NIRKFLILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVT 741 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + P + VEN +D I D+V Sbjct: 742 QLEEISGRPEMVFYVEN----FDILQRIEDDLV 770 Score = 82.5 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 59/348 (16%), Positives = 118/348 (33%), Gaps = 32/348 (9%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 L M I S + L ++ AA + + + + R + + +I Sbjct: 236 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 295 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP----WYT 149 + VN + V +L I + I++ + K TF T Sbjct: 296 DSEDNVTKAAVNLRREGVTIFTLGIEGTSDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 355 Query: 150 NSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + IT +V V S+ L D+ +++D S S + T ++ ++ Sbjct: 356 FLKKLRNQITHTVSVFSERTETLKSDIYLLIDGSGS------TQATDFHEMKTFLSEVVG 409 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + P V+ G V +++ + F + L + I+ + + G +TN+ L + Sbjct: 410 MFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNF 466 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + + R +V +T+G L N ++ VYAI Sbjct: 467 TLSLLQKAKKQR------GNKVPCHLVVLTNG-----MSKDSILEPANRLREEHIRVYAI 515 Query: 326 GIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 GI+ + Y V + +DA I +V + + Sbjct: 516 GIKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICTEE 559 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 786 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 836 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + L+ ++ G ST + L ++ + N + Sbjct: 837 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 891 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 892 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFVE 948 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 949 TFGGLKGIFSDV 960 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 71/192 (36%), Gaps = 21/192 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S++ K+ + S+ V+ N V+ G FS+ Sbjct: 994 VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFDVSLNRVRIGAAQFSDTYH 1044 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G + +I+ + + +T+ L+ + G R + + Sbjct: 1045 PEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFRPDMGSRI-----NTGTPQ 1099 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+ + + RG +Y++GI + + ++ + V Sbjct: 1100 VLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVDDQQLIQITGTAEKKLTVH 1154 Query: 350 NPHSMYDAFSHI 361 N + I Sbjct: 1155 NFDELKKVNKRI 1166 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 96/296 (32%), Gaps = 31/296 (10%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + + E + G + + +E DN + V V + Sbjct: 62 KALQEAHRTYFSAPANGRDKKQFPPILVVLASSESEDNVEEASKALRKDGVKIISVGVQK 121 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARL- 172 ++ A++ + T S+++ I +K V + + Sbjct: 122 ASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVIKYKEGAVDDIFVEACQGPSMA 181 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD+S + S D + E V + N ++ GLV +SN+ + Sbjct: 182 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 232 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G S + + I+ LS + +K ++F + + ++ +I Sbjct: 233 INSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSR----KNQGVPQI 288 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 V +T + ++ G ++ +GI + + + P Y Sbjct: 289 AVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGTSDTQLEKIASHPAEQY 339 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 84.4 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 55/256 (21%), Positives = 89/256 (34%), Gaps = 62/256 (24%) Query: 171 RLDMMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLV 224 +D+ +VLDVS SM F S +KI ++N+ L+E I D N+ V+ LV Sbjct: 243 PIDIALVLDVSGSMNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNDKVKVALV 302 Query: 225 TFSNKIEEFF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI----- 270 ++N+I + + + ++ L T LK + N + Sbjct: 303 KYANQIGTATGADGCRISNSRQSDTGNCTQIVQEL------TTDAGLLKTSVNGLQAAGA 356 Query: 271 ----FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAI 321 M+ +Q A KK ++F TDGE ++ E K G Sbjct: 357 TYADAAMEVAQQALAGGRAGAKKYVIFFTDGEPNHWSGFDGDVANAAIKKSQELKNAGTT 416 Query: 322 VYAIGI---------------------------RVIRSHEFLRACASPNSFYLVENPHSM 354 VY+IGI + L AS + +Y + + Sbjct: 417 VYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYWNLGDRASGDYYYSASSATQL 476 Query: 355 YDAFSHIGKDIVTKRI 370 F+ I K I K + Sbjct: 477 AQIFNDIQKTITEKHV 492 >gi|73973418|ref|XP_532180.2| PREDICTED: similar to alpha 1 type XXI collagen precursor [Canis familiaris] Length = 961 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L +L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSHDSGKNLVAAMESIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|301780200|ref|XP_002925521.1| PREDICTED: collagen alpha-1(XII) chain-like [Ailuropoda melanoleuca] Length = 3172 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 53/280 (18%), Positives = 101/280 (36%), Gaps = 38/280 (13%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVV-PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 G + T++ V + Y IS + + I T + + Sbjct: 371 MTAGSRQHALSVGPQTTILSVRDLSADTEYQISVSAMKGLTSSEPISIMEKTQPMKVQVE 430 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + V + + D++ ++D S S+ ++ K+ ++ + +K N Sbjct: 431 CSRGVDIKA------DIVFLVDGSYSIG---IANFVKVRAFLEVL------IKSFEISPN 475 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 VQ LV +S F L+ + I G STN+ + Y +IF Sbjct: 476 RVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF---- 531 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 N K+++ +TDG++ D + + ++A+G++ E Sbjct: 532 --VPSKGSRGNVPKVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE- 583 Query: 336 LRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L A ASP + VE+ +DAF I ++ T+ I Sbjct: 584 LEAIASPPAETHVFTVED----FDAFQRISFEL-TQSICL 618 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 89/253 (35%), Gaps = 34/253 (13%) Query: 127 YSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI-TSSVKVNSQTDAR-LDMMIVLDVSR 182 Y ++ S I + + T ++ + + D++ ++D S Sbjct: 90 YVVTITSYDDIEESVPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSW 149 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ + I A+ V + G+V +S+ F L + Sbjct: 150 SVGRN------NFKYILDFIGAL---VSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR 200 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L IK + G +T + + Y N + G R + K+ + +TDG Sbjct: 201 DELLAAIKKIPYKGGNTMTGDAIDYLIKNTFTESAGARAG-------FPKVAIIITDG-- 251 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDA 357 K + E + G V+++GI+ + E + ++P N + V N +DA Sbjct: 252 ---KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVAN----FDA 304 Query: 358 FSHIGKDIVTKRI 370 I +I+++ Sbjct: 305 IVDIQNEIISQVC 317 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 83/250 (33%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLIGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEIFEIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEDELKMIATDPDDT-HAYNVAD-FESLSK 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|119624853|gb|EAX04448.1| collagen, type XXI, alpha 1, isoform CRA_a [Homo sapiens] Length = 429 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 82/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDDVKDAAQAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|301767086|ref|XP_002918971.1| PREDICTED: collagen alpha-1(XXI) chain-like [Ailuropoda melanoleuca] Length = 957 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L +L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSHDSGENLVAAMESIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITLFAIGVGSETEDAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|126310280|ref|XP_001371684.1| PREDICTED: similar to collagen XXI [Monodelphis domestica] Length = 957 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 78/214 (36%), Gaps = 32/214 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNISNNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S+ L G H L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPL--GSHHSGENLMEAMESIQYLGGNTRTGKAIQFALDHLF--------- 128 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 129 AKSSRFLTKIAVVLTDG-----KSQDEVKDAAEAARDNRITLFAIGVGSETEDAELRAIA 183 Query: 341 ---SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S + VE+ ++ I + + + + Sbjct: 184 NKPSSTYVFYVEDYIAISKIREVIKQKLCEESVC 217 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 84.1 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 57/443 (12%), Positives = 130/443 (29%), Gaps = 85/443 (19%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F G + ++ A+ L + +G+ ++ S +T L + +D + + + + Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVG 74 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 ++ K + N + + ++ Q + Sbjct: 75 TITPDQVSSKAQTYFNSLYT--NKETGAVTVTATYTAKDGSGSSTIAMSGQGAVQTQFMK 132 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ------------------TDA 170 I I T + + +T S+ + Sbjct: 133 ILGFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNLVDSLRASAQT 192 Query: 171 RLDMMIVL------------------------DVS-RSMESFFDSSITKIDMAIKS---- 201 D+ I + D S S S++ S+ + + A ++ Sbjct: 193 ADDVYISVVPFAQMVNVGSSNRNANWVRWDLWDESNGSCSSWWYSTKSSCEYAGRTWTAT 252 Query: 202 -----INAMLEEVKLIPDVNNV-----VQSGLVTFSNKIEE------FFLLEWGVSHLQR 245 + + + +V + V + ++ + L + ++ Sbjct: 253 SHNQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYSLSNATT-IKN 311 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN------ 299 KI LS G TN G+ +A+ + + + Y I+ ++DG N Sbjct: 312 KIDALSPNG-GTNQAIGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLNTVDRWY 370 Query: 300 -----LSTKEDQQSLYYCNEAKKRG-----AIVYAIGIRVIR--SHEFLRACASPNSFYL 347 S + D + C+ + ++Y I + L+ CA +F+ Sbjct: 371 GNGRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPESTVLKYCADSGNFFA 430 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + AF+ IG + R+ Sbjct: 431 TTTSSGIGTAFAQIGSSLSKLRV 453 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 84.1 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 77/181 (42%), Gaps = 19/181 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 190 DRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 240 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 241 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 297 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 298 RDLNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEEELREIASEP 352 Query: 343 N 343 + Sbjct: 353 S 353 >gi|73973308|ref|XP_539002.2| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 1 [Canis familiaris] Length = 3065 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISISAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRGNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 77/203 (37%), Gaps = 30/203 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + I A+ V + G+V +S+ Sbjct: 140 DLVFLVDGSWSVGRN------NFKYILDFIGAL---VSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKK 289 F L + L IK + G +T + + Y N + G R + K Sbjct: 191 EFNLNQYYRTDELLAAIKKIPYKGGNTMTGDAIDYLIKNTFTESAGARVG-------FPK 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYL 347 + + +TDG K + E + G V+++GI+ + E + ++P N + Sbjct: 244 VAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 299 VAN----FDAIVDIQNEIISQVC 317 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 83/250 (33%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEDELKMIATDPDDT-HAYNVAD-FESLSK 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|73973312|ref|XP_867438.1| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 3 [Canis familiaris] Length = 2989 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISISAMKGLTSSEPVSIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRGNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 77/203 (37%), Gaps = 30/203 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + I A+ V + G+V +S+ Sbjct: 140 DLVFLVDGSWSVGRN------NFKYILDFIGAL---VSAFDIGEEKTRVGVVQYSSDTRT 190 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKK 289 F L + L IK + G +T + + Y N + G R + K Sbjct: 191 EFNLNQYYRTDELLAAIKKIPYKGGNTMTGDAIDYLIKNTFTESAGARVG-------FPK 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYL 347 + + +TDG K + E + G V+++GI+ + E + ++P N + Sbjct: 244 VAIIITDG-----KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V N +DA I +I+++ Sbjct: 299 VAN----FDAIVDIQNEIISQVC 317 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 83/250 (33%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1150 GTTYKVNVFGMFDGGESSPLVGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1208 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1209 SIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1259 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1260 KSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG--- 1310 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1311 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEDELKMIATDPDDT-HAYNVAD-FESLSK 1366 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1367 IVDDLTINLC 1376 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 13 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 71 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 72 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 131 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 132 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 191 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 192 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 251 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 252 PTVAATAVNKAKSLKDAGTTIYSIGV 277 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 44/237 (18%), Positives = 91/237 (38%), Gaps = 33/237 (13%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDS 190 K+ K C + ++ I S + ++Q + D+ ++D S S++ Sbjct: 809 NKLKKKLCPEMLS--GMPPLMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN--- 863 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIK 248 ++ +K + +K+ + V+ G+V +S+KI F L ++ L I Sbjct: 864 DFIEMKDFMKEV------IKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAID 917 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + G T + L + + NT + + ++ +TDG++ Sbjct: 918 NIQQVGGGTTTGKAL---------SKMVPVFQNTARIDVARYLIVITDGQSTDP-----V 963 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + G +YAIG+R + E L AS F+ + S+ + +DI Sbjct: 964 AEAAQGLRDIGVNIYAIGVRDANTTE-LEEIASKKMFF-IYEFDSLKSIHQEVIRDI 1018 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 56/318 (17%), Positives = 115/318 (36%), Gaps = 26/318 (8%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G I+ R + + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGIL--VYAI 168 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + D ++ +I P++ ++ + +P P ++ T Sbjct: 169 GVKDASQAELREISSSPKD-NFTFFVPNFPGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D++ ++D S S+ + K+ + S+ V + ++ VQ GLV Sbjct: 228 CSEASPADIVFLVDSSTSIG---LQNFQKVKHFLHSV------VSGLDVRSDQVQVGLVQ 278 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNT 282 +S+ I F L+ S + +I+ L T++ L++ N + +M G R Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA---- 334 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I+V +TDGE+ +D ++ K+ G V+ +GI + E + P Sbjct: 335 -KDGVPQIVVLVTDGESSDEVQD-----VADQLKRDGVFVFVVGINIQDVQELQKIANEP 388 Query: 343 -NSFYLVENPHSMYDAFS 359 F S+ A S Sbjct: 389 FEEFLFTTENFSILQALS 406 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 52/341 (15%), Positives = 113/341 (33%), Gaps = 39/341 (11%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 +++ S + L + + AA + + G ++ +NT + Sbjct: 890 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARIDVAR 949 Query: 97 ------ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIP 146 + + V + +R ++I V + I+ K+ F Sbjct: 950 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT--ELEEIASKKM-FFIYEFDS 1006 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + ++ I SS SQ D++ ++D S S+ K+ ++ Sbjct: 1007 LKSIHQEVIRDICSSENCKSQKA---DIIFLIDGSESI---APKDFEKMKDFME------ 1054 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLK 264 V + +Q GL+ FS+ +E F L S + R I + + T++ L Sbjct: 1055 RMVNQSNIGADEIQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALN 1114 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + +G + ++ +TDG + R I++A Sbjct: 1115 FTLPFFDSSRG-------GRPRVHQYLIVITDG-----VSQDNVAPPAKALRDRNIIIFA 1162 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 IG+ ++ + L + + EN S+ I ++ Sbjct: 1163 IGVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 1203 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 46/355 (12%), Positives = 121/355 (34%), Gaps = 42/355 (11%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S ++ ++NF ++++ + + + +G++ +I+ + S + +++ Sbjct: 243 STSIGLQNFQKVKHFLHSVVSGLDVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRI 302 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR-------DNGFVNDIDDIVRS-- 113 + G + + ++ + D +++ D+ Sbjct: 303 RNLPYSMGGTSTGSALEFIRANSLTEMSGSRAKDGVPQIVVLVTDGESSDEVQDVADQLK 362 Query: 114 --------TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 ++I V + + I+ + F S ++ + S+V+ Sbjct: 363 RDGVFVFVVGINIQDVQELQK--IANEPFEEFLFTTENFSILQALSGTLLQALCSTVERQ 420 Query: 166 SQ--TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + T D++ ++D S +S I+ ++ +++ D Q GL Sbjct: 421 MKKSTKTYADVVFLIDTS------QGTSQASFQWMQNFISRIIGILEVGQDK---YQIGL 471 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S++ FL + + I L F G S + GL++ + F + Sbjct: 472 AQYSDQGHTEFLFNTHKTRNEMVAHIHELLVFQGGSRKTGQGLRFLHRTFFQEAAGSRLL 531 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + +V +T G K + + +KRG + ++G++ E Sbjct: 532 ----QGVPQYVVVITSG-----KSEDEVGEVAQILRKRGVDIVSVGLQDFDRAEL 577 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 50/149 (33%), Gaps = 12/149 (8%) Query: 201 SINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 S+ L + + + ++ GL +S+ FLL + + I+ L Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 L++ I + + ++ V ++ G + ++ Sbjct: 110 RMGQALQF----ILEHHFREGAGSRASQGVPQVAVVVSSG-----LTEDHIREPAEALRR 160 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFY 346 G +VYAIG++ E +SP + Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 13 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 71 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 72 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 131 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 132 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 191 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 192 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 251 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 252 PTVAATAVNKAKSLKDAGTTIYSIGV 277 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 13 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 71 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 72 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 131 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 132 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 191 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 192 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 251 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 252 PTVAATAVNKAKSLKDAGTTIYSIGV 277 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 61/435 (14%), Positives = 132/435 (30%), Gaps = 92/435 (21%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +RNF N +G + ++ + L +F+++G ++ + +++ L+ + DR+ + A Sbjct: 19 LRNFRANERGSIAMIFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAAVKAAAQ 78 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI--VRSTSLDIVVVPQNE 125 + + + G ++ + + + D + I + Sbjct: 79 KES-DCVANPAGNNVSNFQGCGQKDIIKAGVAAGVQYMNGDPLMRGADRKPTIELSSSEG 137 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM- 184 +S + IP + + I + + + T L+ ++LD S SM Sbjct: 138 SWSATVNYSADIPTNIARLM----GVQTIPVNGKVTSNIALGTHMYLNFHLLLDRSMSMG 193 Query: 185 --------------------------------ESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +ID + A++ + K++ Sbjct: 194 IGATSDDISRLQALTGCAFACHSEGYEAQYYDQPKAQGIRFRIDDLRDATGALVAQAKMV 253 Query: 213 PDVN--NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYN 268 N +Q G+ F++ + + ++++ +K L T + + + Sbjct: 254 ASANAREHIQMGVYAFNHHVSPLVEMTSDLTNVANAVKNLDLPTHDDGTQAADAVTWLVA 313 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFM-TDG-----------------------ENLSTKE 304 G IVF+ TDG + TK Sbjct: 314 NKIKGNGTGLTSAAPLE-----IVFLVTDGVEDGIYTGWNKMVGPTGLPLPWWPSWMTKA 368 Query: 305 DQQS--LYYCNEAKKRGAIV---YA--------------IGIRVIRSHEFLRACASPNSF 345 + + C+ K +GAIV Y IG L+ CAS F Sbjct: 369 PTSAFPVTACDALKSKGAIVAVVYTTYVPFPGTVQYDRLIGPFAPNISPNLQGCASQGYF 428 Query: 346 YLVENPHSMYDAFSH 360 + P + Sbjct: 429 FTASEPGDITRGMQS 443 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 40/265 (15%), Positives = 89/265 (33%), Gaps = 25/265 (9%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 +D TS+ + + SI++ + + + + + + + + Sbjct: 28 FTLDAYTSVTSITDLTPDLDYSVSINSFDGSEESIPIFGQLTIQSGNSSDRVRRPTD-TI 86 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A D++ ++D S S+ + I I S+ + + +V Sbjct: 87 KCSVSAITDLVFLVDGSWSVGRE---NFKHIRSFIASLAG------AFDIGEDKTRVAVV 137 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S F L L + I L G +T + + Y IF G Sbjct: 138 QYSTDTRTEFPLTRYTRRGDLLQAINSLPYKGGNTMTGDAIDYLLQNIFTEAGGS----- 192 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ K+ + +TDG++ E + + G ++ +GI+ E ++P Sbjct: 193 -RKSFPKVAMIITDGKSQDPVE-----EHARRLRNIGVEIFVLGIKGADEDELREIASTP 246 Query: 343 --NSFYLVENPHSMYDAFSHIGKDI 365 Y V N + + I +++ Sbjct: 247 HSKHMYNVPNFDKIQEVQKKIIREV 271 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 94/221 (42%), Gaps = 31/221 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+ + + + D + D+++++D S S+ + + ++ + P Sbjct: 379 PVKVTTECSLGVDVQADVVLLVDGSYSIG------LQNFAKVRAFLEVLVNSFDIGPSK- 431 Query: 217 NVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 VQ LV +S F L ++ + R ++ G STN+ +KY ++IF Sbjct: 432 --VQISLVQYSRDPHTEFALNTHHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKIFVAS 489 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N +++V +TDG++ + +D + + ++A+G++ E Sbjct: 490 ------RGARQNVPRVMVLITDGKSSDSFKDAAT-----NLRNIDVEIFAVGVKDAVRSE 538 Query: 335 FLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L A A+P N + VE+ +DAF I K++ T+ I Sbjct: 539 -LEAIANPPADNHVFEVED----FDAFQRISKEL-TQSICL 573 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 83.7 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 48/266 (18%), Positives = 93/266 (34%), Gaps = 25/266 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + ++ + + D + S + + + P + T Sbjct: 13 VAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTN-GVPDDPTLSAPAREKTVT 71 Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDS---SITKIDMAI 199 + + + +T + + + LD+++VLDVS SM S TKID Sbjct: 72 ANEDGTYTVALNVTGAKSAGTGEIVTNQPLDIVLVLDVSGSMAEKIASGWNQPTKIDSLK 131 Query: 200 KSINAMLE----EVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYL 250 ++N + E I D + + LV F ++ +F+ W + + + L Sbjct: 132 TAVNKFINATAAENAKITDQSQRNRIALVKFAGTEKTSVGNDFYREGWSSYNYTQIVSNL 191 Query: 251 SKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN---TEDANYKKIIVFMTDGENLS----- 301 + T++ GL + D R AN KK+++F TDGE Sbjct: 192 TYDVSGLTSTVNGLSASGATSADYAFNRAQAALTYQPRANAKKVVIFFTDGEPNHGSGFD 251 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ K G +Y+IG+ Sbjct: 252 PTVAATAVNKAKSLKDAGTTIYSIGV 277 >gi|281348096|gb|EFB23680.1| hypothetical protein PANDA_015036 [Ailuropoda melanoleuca] Length = 3047 Score = 83.7 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 101/267 (37%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T L + + + Y IS + + I T + + + V + + Sbjct: 368 PQTTILSVRDLSADTEYQISVSAMKGLTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 423 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + +K N VQ LV +S Sbjct: 424 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------IKSFEISPNRVQISLVQYSRDP 472 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF N Sbjct: 473 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRGNVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 527 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 580 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 581 FTVED----FDAFQRISFEL-TQSICL 602 Score = 82.5 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 89/253 (35%), Gaps = 34/253 (13%) Query: 127 YSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI-TSSVKVNSQTDAR-LDMMIVLDVSR 182 Y ++ S I + + T ++ + + D++ ++D S Sbjct: 74 YVVTITSYDDIEESVPVIGQLTIQTGGPTKPGEKKPGKPEIQKCSVSAWTDLVFLVDGSW 133 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ + I A+ V + G+V +S+ F L + Sbjct: 134 SVGRN------NFKYILDFIGAL---VSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQR 184 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L IK + G +T + + Y N + G R + K+ + +TDG Sbjct: 185 DELLAAIKKIPYKGGNTMTGDAIDYLIKNTFTESAGARAG-------FPKVAIIITDG-- 235 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDA 357 K + E + G V+++GI+ + E + ++P N + V N +DA Sbjct: 236 ---KSQDEVEIPARELRNIGVEVFSLGIKAADAKELKQIASTPSLNHVFNVAN----FDA 288 Query: 358 FSHIGKDIVTKRI 370 I +I+++ Sbjct: 289 IVDIQNEIISQVC 301 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 83/250 (33%), Gaps = 26/250 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Y ++ + T S VMPI SS + T A D+++++D S Sbjct: 1134 GTTYKVNVFGMFDGGESSPLIGQEMTTLSDTTVMPILSS-GMECLTRAEADIVLLVDGSW 1192 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGV 240 S+ I+ ++E ++ P VQ L +S + L Sbjct: 1193 SIG------RANFRTVRSFISRIVEIFEIGP---KRVQIALAQYSGDPRTEWQLNAHRDK 1243 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + + L G +T + L + Q F Q +KI V +TDG Sbjct: 1244 KSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRPRARKIGVLITDG--- 1294 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 K + K G ++AIGI+ E P+ N +++ S Sbjct: 1295 --KSQDDVEAPSKKLKDEGVELFAIGIKNADEDELKMIATDPDDT-HAYNVAD-FESLSK 1350 Query: 361 IGKDIVTKRI 370 I D+ Sbjct: 1351 IVDDLTINLC 1360 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 42/196 (21%), Positives = 78/196 (39%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M ++D S S+ + +K+ +K++ V + VQ G+V FS+ +E Sbjct: 621 DIMFLVDSSGSIGPE---NFSKMKTFMKNL------VSKSQIGPDRVQIGVVQFSDINKE 671 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L +S + I ++ G +T + L + +G R N +K Sbjct: 672 EFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTKGARP-------NIRKF 724 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE ++ G I+Y++G+ + P + VEN Sbjct: 725 LILITDGE-----AQDIVKEPAVVLRQEGVIIYSVGVFGSNVTQLEEISGRPEMVFYVEN 779 Query: 351 PHSMYDAFSHIGKDIV 366 +D I D+V Sbjct: 780 ----FDILQRIEDDLV 791 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 54/356 (15%), Positives = 117/356 (32%), Gaps = 40/356 (11%) Query: 34 LGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK------ 87 L M I S + L ++ AA + + + + R + + +I Sbjct: 282 LSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHR 341 Query: 88 ---NTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + N R+ + + + T L+ + E Y + + T Sbjct: 342 DSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQT 401 Query: 144 FIPWYTNSRHIVMPI----TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ N + + T ++K D+ +++D S S + T Sbjct: 402 FLKKLRNQITHTVSVFSERTETLKSGCVDTEEADIYLLIDGSGS------TQATDFHEMK 455 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 ++ ++ + P V+ G V +++ + F + L + I+ + + G +T Sbjct: 456 TFLSEVVGMFNIAPHK---VRVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNT 512 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N+ L + + + + R +V +T+G L N ++ Sbjct: 513 NTGAALNFTLSLLQKAKKQR------GNKVPCHLVVLTNG-----MSKDSILEPANRLRE 561 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 VYAIGI+ + Y V + +DA I +V + + Sbjct: 562 EHIRVYAIGIKEANQTQLREIAGEEKRVYYVHD----FDALKDIRNQVVQEICTEE 613 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 96/296 (32%), Gaps = 31/296 (10%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + + E + G + + +E DN + V V + Sbjct: 108 KALQEAHRTYFSAPANGRDKKQFPPILVVLASSESEDNVEEASKALRKDGVKIISVGVQK 167 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARL- 172 ++ A++ + T S+++ I +K V + + Sbjct: 168 ASEENLKAMATSQFHFNLRTVRDLSMFSQNMTHIIKDVIKYKEGAVDDIFVEACQGPSMA 227 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD+S + S D + E V + N ++ GLV +SN+ + Sbjct: 228 DVVFLLDMS------INGSEENFDYLKGFLE---ESVSALDIKENCMRVGLVAYSNETKV 278 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G S + + I+ LS + +K ++F + + ++ +I Sbjct: 279 INSLSMGINKSEVLQHIQNLSPRTGKAYTGAAIKKLRKEVFSARNGSR----KNQGVPQI 334 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 V +T + ++ G ++ +GI + + + P Y Sbjct: 335 AVLVT-----HRDSEDNVTKAAVNLRREGVTIFTLGIEGASDTQLEKIASHPAEQY 385 >gi|45384318|ref|NP_990352.1| collagen alpha-1(XII) chain precursor [Gallus gallus] gi|2506307|sp|P13944|COCA1_CHICK RecName: Full=Collagen alpha-1(XII) chain; AltName: Full=Fibrochimerin; Flags: Precursor gi|222811|dbj|BAA00701.1| alpha 1 chain of type XII collagen [Gallus gallus] gi|2326442|emb|CAA43358.1| collagen type XII alpha 1 chain [Gallus gallus] Length = 3124 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 51/268 (19%), Positives = 106/268 (39%), Gaps = 39/268 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 ++T+L++ + + Y I+ + + P + T + ++ + + + S V+ + Sbjct: 383 PQTTALNVKDLSPDTEYQINVYAMKGLTPSEPITIME---KTQQVKVQVECSRGVDVKA- 438 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ ++D S S+ ++ K+ ++ + VK VQ LV +S Sbjct: 439 ---DVVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPRKVQISLVQYSRD 486 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L V + + I G STN+ + Y ++F N Sbjct: 487 PHMEFSLNRYNRVKDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTS------KGSRPNV 540 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NS 344 ++++ +TDG K + + ++A+G++ E L A ASP Sbjct: 541 PRVMILITDG-----KSSDAFKEPAIKLRDADVEIFAVGVKDAVRTE-LEAIASPPAETH 594 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y VE+ +DAF I ++ T+ + Sbjct: 595 VYTVED----FDAFQRISFEL-TQSVCL 617 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 94/274 (34%), Gaps = 31/274 (11%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMP 157 +G + +T + + Y +S I+ Y L + T I Sbjct: 64 DGPTKEFTLSPSTTQTVLSDLIPEIEYVVS-IASYDEVEESLPVFGQLTIQTGGPGIPEE 122 Query: 158 ITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + + D++ ++D S S+ ++ M+ V Sbjct: 123 KKVEAQIQKCSISAMTDLVFLVDGSWSVGRNN---------FRYILDFMVALVSAFDIGE 173 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + S L IK + G +T + + Y N + Sbjct: 174 EKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYLVKNTFTES 233 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R+ + K+ + +TDG K + E + G V+++GI+ + Sbjct: 234 AGARKG-------FPKVAIVITDG-----KAQDEVEIPARELRNIGVEVFSLGIKAADAK 281 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + P + V N + D + I + Sbjct: 282 ELKLIASQPSLKHVFNVANFDGIVDIQNEIILQV 315 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 47/265 (17%), Positives = 90/265 (33%), Gaps = 34/265 (12%) Query: 113 STSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 +++ + + Y ++ + T S P S + +T A Sbjct: 1139 DSTVVLEELRAGTTYKVNVFGMFDGGESNPLVGQEMTTLSDTTTEPFLSR-GLECRTRAE 1197 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+++++D S S+ I+ ++E + PD VQ GL +S Sbjct: 1198 ADIVLLVDGSWSIG------RPNFKTVRNFISRIVEVFDIGPDK---VQIGLAQYSGDPR 1248 Query: 232 EFFLLEWGVSHLQR------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 EW ++ + + L G +T + L F ++ + Sbjct: 1249 T----EWNLNAYRTKEALLDAVTNLPYKGGNTLTGMALD------FILKNNFKQEAGLRP 1298 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + + G +YAIGI+ +E + P+ Sbjct: 1299 RARKIGVLITDG-----KSQDDVVTPSRRLRDEGVELYAIGIKNADENELKQIATDPDDI 1353 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N + + IG+D+ T Sbjct: 1354 -HAYNVAD-FSFLASIGEDVTTNLC 1376 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 61/332 (18%), Positives = 123/332 (37%), Gaps = 29/332 (8%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + + + + + + + Sbjct: 682 LNRYFTRQEISDAIDRMSLINKGTLTGKALNFVGQYFTHSKGARLGAKKF-LILITDGVA 740 Query: 101 NGFVNDIDDIVRSTSLDIVVVP-QNEGYS----ISAISRYKIPLKFCTFIPWYTNSRHIV 155 +V D I+R ++ I V N S IS ++ + R +V Sbjct: 741 RDYVRDPARILRGKNVTIFSVGVYNANRSQLEEISGDGSLVFHVENFDHLKAL--ERKLV 798 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + T LD++ VLD S S++ + + IN + VK Sbjct: 799 FRVCALHDCKRITG--LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVG 847 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 N VQ G + +S++ F L S + ++ G ST + LK+A N +F Sbjct: 848 RNGVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGSTYTAKALKHA-NALFT- 905 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +H + N K++++ +TDG++ + Q +E + +G + A+G+ Sbjct: 906 ---EEHGSRIKENVKQMLIVITDGKS---HDHDQLNDTASELRDKGITILAVGVGKANQK 959 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E + N+ V+N + D F+ + + + Sbjct: 960 ELEGMAGNKNNTIYVDNFDKLKDVFTLVQESM 991 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 75/210 (35%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 620 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 670 Query: 225 TFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ +E F L + + I +S T + L + +G R Sbjct: 671 QFSDTTKEEFQLNRYFTRQEISDAIDRMSLINKGTLTGKALNFVGQYFTHSKGARLGA-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 729 -----KKFLILITDG-----VARDYVRDPARILRGKNVTIFSVGVYNANRSQLEEISGDG 778 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + VEN +D + + +V + Sbjct: 779 SLVFHVEN----FDHLKALERKLVFRVCAL 804 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ + + + + + E + PD V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQE------KEFEQIKRFMLEVTEMFSIGPDK---VRVGVVQYSDNAEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y + R Sbjct: 493 EFYISDYSNDIGLRKAIFNIKQLTGRTYTGKALDYILQIXKNGSKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG ++ + + + ++A+GI E + +N Sbjct: 547 LIVLTDGMSMD-----RVVEPGKRLRAEQITIHAVGIGAANKIELQEIAGKEERVHFGQN 601 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +DA I ++V + Sbjct: 602 ----FDALKSIKNEVVREICTE 619 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 72/201 (35%), Gaps = 25/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S + I + + +P N + L +S++ Sbjct: 30 DVVFLVDSS---DHLGPKSFPFVKTFIS------KMINSLPIETNKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIK-YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + + Sbjct: 81 EFHLNTFKGRSPMLNHLKMNFQFIGGSLQIGKALQEAHRTYFSVPTNGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALQKDGVKIISVGVQKASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM---YDAFSHIGKDIVT 367 + + + I KD+ Sbjct: 191 TIRDLSTFSQSMTQIIKDVTK 211 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 63/174 (36%), Gaps = 21/174 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ Q D+ D++ ++D S + + + + + N ++ GL Sbjct: 227 ISCQKDSLADLVFLVDESLGTRGN-------LRHLQTFLENITSSMDV---KENCMRLGL 276 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++S+ + L+ S +++IK LS +N+ +++ + + + Sbjct: 277 ISYSSSAKTISFLKSSTTQSEFKQQIKNLSIQVGKSNTGAAIEH----MRRDGFSESYGS 332 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T D + + ++A+ I+ + + Sbjct: 333 RRAQGVPQIAVLVT-----HRPSDDKVHDAALNLRLEDVTMFALSIQGANNTQL 381 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 83.3 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 83/214 (38%), Gaps = 45/214 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+VLD S SM++ D T+ + A ++ A +E + GL+ F+ + Sbjct: 51 IDIMMVLDASTSMQAE-DFQPTRFEAAREAAGAFVEGRVSD-------RVGLIVFAAEAY 102 Query: 232 EFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S LQR ++ + T L A N++ D + K Sbjct: 103 TQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAES-----------K 151 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRSH----- 333 + + +TDG N + D ++ A+ G VYAIG+ + + Sbjct: 152 VAILLTDGRNNRGQIDPRTA--AEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESA 209 Query: 334 ----EFLR--ACASPNSFYLVENPHSMYDAFSHI 361 E LR + ++ ++ N ++ ++ I Sbjct: 210 GVDAEMLRSVSVSTGGQYFSATNRDALERIYAEI 243 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + D++ +LD S S+ + + + +++ ++ P + Sbjct: 36 AQRAGCKSVQYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGP---GHTRV 86 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L ++ + ++ G +TN+ L+Y ++ F Q Sbjct: 87 GVVRYSDRPTTAFELGHFNSREEVKAAARRITYHGGNTNTGDALRYITSRSFSAQ---AG 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + +K++ + +TDG + D + + G ++A+G+ E Sbjct: 144 GRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-----AAGIRIFAVGVGAALKEELDEIA 198 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 199 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 229 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 48/217 (22%), Positives = 79/217 (36%), Gaps = 27/217 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T +IV I S + + + D+M ++D S S+ D + K+ IK + Sbjct: 606 KTIKDNIVQDICSE---EACKEMKADIMFLVDSSGSIG---DDNFKKMKTFIKEVVNRSY 659 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLK 264 V V FS+ +E F L S I+ +S G +TN+ L+ Sbjct: 660 IGVDQVQVGVVH------FSDTSKEVFSLNKNTSKKSDIIQAVEDMSLIGSTTNTGGALR 713 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + R N KI+V +TDGE + E + G I+Y+ Sbjct: 714 FVSKYFKLAKQARPSVN-------KILVLITDGE-----ASDEVTAPATELRNDGIIIYS 761 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +G+ + P + VEN + D I Sbjct: 762 VGVFNANKTQLEEISGKPEKVFYVENFDILEDIKGDI 798 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 83/213 (38%), Gaps = 18/213 (8%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKS 201 F I+ I S K A L D++ V+D S S+ + + + Sbjct: 787 NFDILEDIKGDIIFGICSPYKPEDCKRANLLDVVFVIDSSGSIGPYNYAIMKDF------ 840 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 +++ V NN VQ G V +S + F L + + KIK + +T + Sbjct: 841 ---VIDIVNKSDVGNNRVQFGAVKYSAYPQILFNLNGNKADIIDKIKGDTLLNDTTYTAE 897 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L++A N + +G R+ ++++ +TDG ++ + + + G Sbjct: 898 ALRHAENLFTESKGSRK-----RRGVPQLLMVITDG---TSHDKDKLDAVSTRIRNDGIT 949 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 +YAIGI+ + E + + + V+ + Sbjct: 950 IYAIGIKDAKREELEIIAENKDHVFFVDTFDGL 982 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 54/345 (15%), Positives = 116/345 (33%), Gaps = 36/345 (10%) Query: 42 HIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDN 101 I ++ + AA I + R + + +I ++ Sbjct: 299 DILQRIQSFSPRAGKANLGAAINITRKQVFTERAGSRKNQGVKQIATIITHRPSDDSLTE 358 Query: 102 GFVNDIDDIVRSTSLDIV---------VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 I+ + ++ I VV +I ++ +P + T N Sbjct: 359 AAEQLINADITVFAIGIEGANISQLNQVVSYPPNRNIIQVTFSDLPRESDTLKKKLFNEI 418 Query: 153 HIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 ++ + KV+ +T R D+ ++D S S+ S DS + +K + Sbjct: 419 QNMLYVKRERKVHLKTGCVDTERADIYFLIDGSTSINS--DSFHKDMKTFLKEV------ 470 Query: 209 VKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +K+ + V+ G+V +S ++ E +S+L++ I + T + L Sbjct: 471 IKMFTVGPDHVRFGVVQYSNIHRTEFEIDKHSTISNLEKAINNIQYLTGDTYTGAAL--- 527 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + + ++ +TDG+ + + +K G V AIG Sbjct: 528 -ESMLGLFESARKQRKN--KVPTYLIVLTDGDPHD-----KVKEPADRLRKAGINVIAIG 579 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + I+ S + Y V S+ +I +DI ++ Sbjct: 580 VGDIKWKGLQEIGES--NVYYVHQYASLKTIKDNIVQDICSEEAC 622 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 76/195 (38%), Gaps = 24/195 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S S+ ++ + +++ + ++ Q G+ FS+ E Sbjct: 1003 IVFLIDGSGSIPKN------DFEIVKDFLTKLVDSISFHDNI----QFGMAQFSDIYSEE 1052 Query: 234 FLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S L+ KI +S + G+ T GLK F R N ++K+ Sbjct: 1053 FPLGHYQSKSELKNKIANVSMQAGLHTYIGKGLKE-VKAFFKSPRRRVARN----VHQKL 1107 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++F TDGE + ++ G ++A+G+ I + + SP Y N Sbjct: 1108 LIF-TDGE-----SKDSFTQPAEDLRREGVEIHAVGVGKIEHAKLQQITVSPERIYTTAN 1161 Query: 351 PHSMYDAFSHIGKDI 365 + I +++ Sbjct: 1162 YTGLPHITKGITEEM 1176 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/248 (16%), Positives = 73/248 (29%), Gaps = 37/248 (14%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 G + D + Q A Y F T S ++ I S Sbjct: 160 EGPAKMLQDAGIKIIALGIQDAQQHELQSMATKFY----HFRTPRDLQAFSPNMSSVIES 215 Query: 161 SVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +V+++ T A L D++ V+D ES ++++ I + + Sbjct: 216 AVEMDIGTVADLFYKVCRSDSVADVVFVVD-----ESVGNANVEYIKTF------LQNSI 264 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY 267 + ++ GLV +S + + L ++I+ S N + Sbjct: 265 NSLDVTEECIRIGLVKYSTETQVVSFLSKETEKKDILQRIQSFSPRAGKANLGAAINITR 324 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 Q+F R K+I +T D + V+AIGI Sbjct: 325 KQVFT---ERAGSRKN-QGVKQIATIIT-----HRPSDDSLTEAAEQLINADITVFAIGI 375 Query: 328 RVIRSHEF 335 + Sbjct: 376 EGANISQL 383 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 51/337 (15%), Positives = 122/337 (36%), Gaps = 28/337 (8%) Query: 42 HIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR-- 99 + K + ID + GN + + + + L Sbjct: 62 NKHATKNAIWDSIDNMNLMGNVA----KTGNALANVADYFTEAKGARPSSKVSKILLLIT 117 Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 DN +++ S + +++ G+S + +I K + + + I + Sbjct: 118 DNPSQDEVKVPADSLRSNGLIIYSIGGFSANKKELQEISGKITPYYQSFDKLQTIENDLL 177 Query: 160 SSVKVNSQTDARL---DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + R+ D++ V+D S S+ T+ + +++ + + PD Sbjct: 178 FRICNPEEECKRIEVADIVFVIDSSGSI------DYTEYKEMQNFMVSLVNKSAVGPDN- 230 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 VQ G + +S+ E F L + + I + G +T + ++++ + Sbjct: 231 --VQFGALKYSDYNTELFYLNRYTNKVDIINHINKDTTQGGNTYTAGAVRFSKEFFTEKH 288 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R+ +I++ +TDG++ + + + ++ G I+YAIGI ++E Sbjct: 289 GSRKARG-----VPQIVMVITDGDS---HDKDKLNETARQLEQEGIIIYAIGIDQANTNE 340 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + +++V N + D + + + K Sbjct: 341 LETLAGTEGKWFMVANFSGLQDILVQVSEAMCNKTDC 377 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 79/230 (34%), Gaps = 26/230 (11%) Query: 147 WYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 ++ + I++ ++ ++ + + D++ ++D S ++S + ++ Sbjct: 356 NFSGLQDILVQVSEAMCNKTDCKVEQADLLFLIDGSS------NTSEETFREIKNFVVSV 409 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 +++ + P V + +S ++ L+ +I + K + L Sbjct: 410 MDDFNVGPVN---VHIAVSQYSESCIREINFDYSTERGTLKNEIINIRKTKGRRHIGAAL 466 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + + ++ N + K+++V +T G Q + RG +Y Sbjct: 467 DFTKSTVYSPSSD----NRLNQGVKQLLVVITAG-----NASDQVARPAKALRDRGVDIY 517 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A+GI I + + SP Y I K +V D Sbjct: 518 AVGIGNICKTQLTQITGSPEKIY-----TDDVSGLKAIKKRLVRDTCSKD 562 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 71/188 (37%), Gaps = 20/188 (10%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S ++Q +D+ +LD S ++ + + A + + +M++ + + Sbjct: 1240 PEKSCGDKSTQQKPPMDVAFLLDSS------YNVKYDEFEAAREFLGSMIDHFDIEGSAS 1293 Query: 217 NVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIK-YLSKFGVSTNSTPGLKYAYNQIFDM 273 + + LVTF+ F L S +++ I+ + + L+Y I Sbjct: 1294 D--RVALVTFNESYHVDFNLLSYDNKSRIKKHIQESVQQVSGQPALGLSLQYTIENIM-- 1349 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 T + K+I+ + GE S + Q +AK G V+ + I ++ Sbjct: 1350 ------AKTPNLRKHKVIIIILFGE-TSQWDKQTLSDASQKAKCEGYAVFVLSIGRSHNN 1402 Query: 334 EFLRACAS 341 L AS Sbjct: 1403 TELTELAS 1410 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 83.3 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 85/216 (39%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + D++ +LD S S+ + + + +++ ++ P + Sbjct: 36 AQRAGCKSVQYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGP---GHTRV 86 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + ++ G +TN+ L+Y ++ F Q Sbjct: 87 GVVRYSDRPTTAFELGHFSSREEVKAAARRITYHGGNTNTGDALRYITSRSFSAQ---AG 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + +K++ + +TDG + D + + G ++A+G+ E Sbjct: 144 GRPGNRAFKQVAILLTDGRSQDLVLDAAAAAH-----AAGIRIFAVGVGAALKEELDEIA 198 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 199 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 229 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 97/273 (35%), Gaps = 39/273 (14%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 L + I + S V Q E A+ + I + + P Sbjct: 329 LEAAKYAVYDSIIFNNVSGSDVGGKQMELIE-QAVRSFGIGFMMAGGEDSFGMGGYFKTP 387 Query: 158 ITSSVKVNSQ-----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I ++ V+ + L +++V+D S SM TKI++A +S +E ++ Sbjct: 388 IEKALPVSMELEGKREIPSLGLILVIDRSGSM------DGTKIELAKESAMRTVELLRSK 441 Query: 213 PDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 G+V F ++ + L + I+ + G TN P + A + Sbjct: 442 DT------VGVVAFDDQPWWVVPPQKLG-NKEEVLSSIQSIPSAG-GTNIYPAVSSALEE 493 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + ++ R+H I+ MTDG++ Q + + + ++ + Sbjct: 494 MLKIKSQRRH-----------IILMTDGQSAMNSGYQD---LTDTMVENKITMSSVAVGT 539 Query: 330 IRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 L++ A+ +Y VE+ ++ FS Sbjct: 540 DADTHLLQSLAEAAKGRYYFVEDETTLPAVFSR 572 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 82.9 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 100/281 (35%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I + + + + I+ + F + ++ Sbjct: 584 DGKSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ T + + ++ I Sbjct: 644 IVHRICSE---KGCEDMKADIMFLVDSSGSIGP------TNFETMKTFMKNLVG---KIQ 691 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ E F L ++ + I +S +T + L + N+ F Sbjct: 692 IGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGGALTF-VNEYF 750 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDG K + + + ++++G+ Sbjct: 751 DLS------KGGRPQVRKFLILLTDG-----KAQDEVGGPATALRSKSVTIFSVGVYGAN 799 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 800 RAQLEEISGDGSLVFHVEN----FDHLKAIESKLIFRVCAL 836 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ +I + + S+ + + P N V+ G+V +S+K E Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSV------IDMFPIGPNKVRVGVVQYSHKNEV 524 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + ++ + + + T + L F + +++ Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALD------FILPLIKKGKTERTDRAPCY 578 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L N + ++AIGI + + +N Sbjct: 579 LIVLTDG-----KSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQN 633 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I +IV + Sbjct: 634 ----FDSLKSIKNEIVHRICSE 651 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ +N + VK + VQ G +T+SN E Sbjct: 845 LDIVFVLDHSGSIGPREQESM---------MNLTIHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G T + L+++ N +F +H + N ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLFT----EEHGSRLTQNVRQ 950 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + + E + +G ++A+G+ E + V+ Sbjct: 951 LMIVITDG---VSHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVD 1007 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 1008 NFDKLRDIY 1016 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 25/206 (12%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + D++ ++D S + I +N M+ + Sbjct: 10 LILWTETLADQSPGPGPEYADVVFLVDSSNYLG------IKSFPFVRTFLNRMISSL--- 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYN 268 P N + L +S+ + F L + + +L K G S L+ A+ Sbjct: 61 PIEANKYRVALAQYSDALHNEFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHR 119 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + I+V + S + + ++ G + ++G++ Sbjct: 120 TYFSAPTNGRDKK----QFPPILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQ 170 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 L+A A+ + + + Sbjct: 171 KASEEN-LKAMATSQFHFNLRTARDL 195 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 80/240 (33%), Gaps = 26/240 (10%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 I + + T P T + + ++ +D+ +LD Sbjct: 1946 DDTGTFQVIPVPPVGDY--EPLEKLRRCTLCYDKCFPNTCAEEPFFPENSYMDVAFLLDN 2003 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----------NKI 230 S+++ S + + S+ + + L+++S K Sbjct: 2004 SKNIAS---DDFQAVKALVSSVIDSFHITSNPSASESGDRVALLSYSPSESSRRKGRVKT 2060 Query: 231 EEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F S ++ I L + L++A +F T + K Sbjct: 2061 EFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAMEGLFLG--------TPNPRKHK 2112 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +I+ ++ GEN KE +++ AK +G +V+ I + + E + P +L++ Sbjct: 2113 VIIVISAGENHEEKEFVKTVAL--RAKCQGYVVFVISLGSTQRDEMEELASYPLDHHLIQ 2170 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 25/183 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P +S Q D+ D++ ++D S + + + + V + + Sbjct: 254 TVPFPTSC----QKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDN 302 Query: 215 VNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 ++ GL++FS++ + L S Q++I+ LS ++N + Q+ Sbjct: 303 C---MRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAI----EQMRK 355 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + +I V +T D + + G ++A+GI + Sbjct: 356 EGFSESSGSRKAQGVPQIAVLVT-----HRASDDMVREAALDLRLEGVTMFAMGIEGANN 410 Query: 333 HEF 335 + Sbjct: 411 TQL 413 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 100/281 (35%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I + + + + I+ + F + ++ Sbjct: 584 DGKSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ T + + ++ I Sbjct: 644 IVHRICSE---KGCEDMKADIMFLVDSSGSIGP------TNFETMKTFMKNLVG---KIQ 691 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ E F L ++ + I +S +T + L + N+ F Sbjct: 692 IGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGGALTF-VNEYF 750 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDG K + + + ++++G+ Sbjct: 751 DLS------KGGRPQVRKFLILLTDG-----KAQDEVGGPATALRSKSVTIFSVGVYGAN 799 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 800 RAQLEEISGDGSLVFHVEN----FDHLKAIESKLIFRVCAL 836 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ +I + + S+ + + P N V+ G+V +S+K E Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSV------IDMFPIGPNKVRVGVVQYSHKNEV 524 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + ++ + + + T + L F + +++ Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALD------FILPLIKKGKTERTDRAPCY 578 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L N + ++AIGI + + +N Sbjct: 579 LIVLTDG-----KSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQN 633 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I +IV + Sbjct: 634 ----FDSLKSIKNEIVHRICSE 651 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ +N + VK + VQ G +T+SN E Sbjct: 845 LDIVFVLDHSGSIGPREQESM---------MNLTIHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G T + L+++ N +F +H + N ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLFT----EEHGSRLTQNVRQ 950 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + + E + +G ++A+G+ E + V+ Sbjct: 951 LMIVITDG---VSHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVD 1007 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 1008 NFDKLRDIY 1016 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 25/206 (12%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + D++ ++D S + I +N M+ + Sbjct: 10 LILWTETLADQSPGPGPEYADVVFLVDSSNYLG------IKSFPFVRTFLNRMISSL--- 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYN 268 P N + L +S+ + F L + + +L K G S L+ A+ Sbjct: 61 PIEANKYRVALAQYSDALHNEFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHR 119 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + I+V + S + + ++ G + ++G++ Sbjct: 120 TYFSAPTNGRDKK----QFPPILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQ 170 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 L+A A+ + + + Sbjct: 171 KASEEN-LKAMATSQFHFNLRTARDL 195 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 80/240 (33%), Gaps = 26/240 (10%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 I + + T P T + + ++ +D+ +LD Sbjct: 1946 DDTGTFQVIPVPPVGDY--EPLEKLRRCTLCYDKCFPNTCAEEPFFPENSYMDVAFLLDN 2003 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----------NKI 230 S+++ S + + S+ + + L+++S K Sbjct: 2004 SKNIAS---DDFQAVKALVSSVIDSFHITSNPSASESGDRVALLSYSPSESSRRKGRVKT 2060 Query: 231 EEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F S ++ I L + L++A +F T + K Sbjct: 2061 EFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAMEGLFLG--------TPNPRKHK 2112 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +I+ ++ GEN KE +++ AK +G +V+ I + + E + P +L++ Sbjct: 2113 VIIVISAGENHEEKEFVKTVAL--RAKCQGYVVFVISLGSTQRDEMEELASYPLDHHLIQ 2170 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 25/183 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P +S Q D+ D++ ++D S + + + + V + + Sbjct: 254 TVPFPTSC----QKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDN 302 Query: 215 VNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 ++ GL++FS++ + L S Q++I+ LS ++N + Q+ Sbjct: 303 C---MRLGLMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAI----EQMRK 355 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + +I V +T D + + G ++A+GI + Sbjct: 356 EGFSESSGSRKAQGVPQIAVLVT-----HRASDDMVREAALDLRLEGVTMFAMGIEGANN 410 Query: 333 HEF 335 + Sbjct: 411 TQL 413 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 58/381 (15%), Positives = 116/381 (30%), Gaps = 39/381 (10%) Query: 23 TAIFLPIIFLVLGMIIEVSHIFFMKTV-LHSMIDRSLV--HAATQIMNEGNGNNRKKLKG 79 TA P + G+ + I+ +LV + + +G +R Sbjct: 85 TAEITPFFLGLAGIKNLTAQRATKTAREARGEIEVALVLDTTWSMSEKDSSGKSRLDSLK 144 Query: 80 GDILCRIKNTWNMSFR---NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 G + + + + +VN + LD+ YS + R Sbjct: 145 GAAAKLVDTIFTEDGKTRVAVVPYADYVNVGTQHRNQSWLDV-----PPSYSTTPSERRC 199 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T Y + + + + V V+ S D Sbjct: 200 ETRTTRTQCTSYAPTYQCTRTVDGVSESTTCGGGCTSSETV-QVAPYEYCTGGGSSRTYD 258 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF-LLEWGVSHLQRKIKYLSKFGV 255 + + + +L ++ G + S + L + ++ I LS G Sbjct: 259 WYGCVASRTVGDYRLTDARPDIRYPGFLGTSRECPGPLLSLSTREADVKTSISNLSYGGG 318 Query: 256 S----TNSTPGLKYAYNQIFDMQGMRQHC-NTEDANYKKIIVFMTDGENL---------- 300 T GL + N + + + + +K +V MTDG N Sbjct: 319 GYRPSTFIPAGLIWGLNVLSPPAPFEEQAYDPNNKLPRKALVLMTDGANTMVFNSSDGRH 378 Query: 301 --------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACASPN-SFYLVE 349 + D+ ++ CN K+ G ++ +G V S + L+ CA+ ++ Sbjct: 379 RNARSGTEVAQSDRDTISICNNIKRSGIEIFTVGFMVNSSSALDLLKECATDGEHYFDAT 438 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 +P ++ AF I + R+ Sbjct: 439 SPEELHSAFGRIADGLTQIRL 459 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 81/212 (38%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +D+M+VLD S SM D+ I K+++A + M+E ++ G++ F Sbjct: 401 EKEKNIDVMLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDG------VGVIAF 454 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I + + G T P L A + + Sbjct: 455 DHNYYWAYKFGKLVRKEDVIESISSI-EVGGGTAIIPPLSEAVKTL-----------KKS 502 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 K++V +TDG + + +EAK+ + IG+ + L A + Sbjct: 503 KAKNKLVVLLTDGMGEQSGYEI----PADEAKRNNIKITTIGVGKFVNASVLSWIAAYTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FYLV NP + D F K I K I K Sbjct: 559 GRFYLVSNPSELVDVFLKETKIIKGKYIKEKK 590 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 81/204 (39%), Gaps = 32/204 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 D+++++D S SM K+ AI S ++ + L + + G+V +S+ Sbjct: 69 DVILIIDRSGSMNDE-----NKMQSAINSAKGFIDLMDL-----SKHKVGIVDYSSANNI 118 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L ++ + L G T + +K A + + + + +I Sbjct: 119 SSFPLSTDKEAVKNYVNGLRANG-GTATGDAIKKARELLVN----------HRPDAQPVI 167 Query: 292 VFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRSHE-------FLRACA-S 341 V +TDG+ + +L NEAK+ G + Y I + ++ L+ A + Sbjct: 168 VLLTDGDATEPNGNAYNYALTNSNEAKQEGIVFYTIALLNTNANPDTSGPNLLLKQMATT 227 Query: 342 PNSFYLVENPHSMYDAFSHIGKDI 365 + + V + D ++ I ++I Sbjct: 228 SHHHHFVLGSVGLGDIYAAIVQEI 251 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 82.9 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 83/214 (38%), Gaps = 45/214 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+VLD S SM++ D T+ + A ++ A +E + GL+ F+ + Sbjct: 79 IDIMMVLDASTSMQAE-DFQPTRFEAAREAAGAFVEGRVSD-------RVGLIVFAAEAY 130 Query: 232 EFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S LQR ++ + T L A N++ D + K Sbjct: 131 TQAPLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRLKDSEAES-----------K 179 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRSH----- 333 + + +TDG N + D ++ A+ G VYAIG+ + + Sbjct: 180 VAILLTDGRNNRGQIDPRTA--AEVAQTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESA 237 Query: 334 ----EFLRACASP--NSFYLVENPHSMYDAFSHI 361 E LR+ ++ ++ N ++ ++ I Sbjct: 238 GVDAEMLRSVSTSTGGQYFSATNRDALERIYAEI 271 >gi|326676330|ref|XP_003200547.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 1164 Score = 82.9 bits (203), Expect = 9e-14, Method: Composition-based stats. Identities = 53/254 (20%), Positives = 93/254 (36%), Gaps = 34/254 (13%) Query: 124 NEGYSISAISRYKIPLKFC-----TFIPWYTNSRHIVM----PITSSVKVNSQTDARLDM 174 N YSIS ++ + +P T I P T A+ D+ Sbjct: 391 NTQYSISVHTQLQETEGPAVSTTERTLPVSTAPPTIASSSAPPPTLPATKEVCRAAKADL 450 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S S+ D + KI + S L+++ + Q + FS+ F Sbjct: 451 VFLVDGSWSIG---DDNFQKIIRFLHSTAGALDQIGP-----DGTQVAIAQFSDDARTEF 502 Query: 235 LLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD-MQGMRQHCNTEDANYKKII 291 L S L I+ +S G +T + +K+ + +F + G R+ K++ Sbjct: 503 SLSSHSSKEELLTAIQRVSYKGGNTKTGRAMKHVKDSVFAPVGGARRG-------VPKVL 555 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVE 349 V +TDG L E + G IV+AIG E L + P + V+ Sbjct: 556 VVLTDGR-----SQDDVLQVSQELQAEGYIVFAIGFADADYGELLSIASRPGDRHVFFVD 610 Query: 350 NPHSMYDAFSHIGK 363 + + +I + Sbjct: 611 DLDAFRTIVENILQ 624 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 83/223 (37%), Gaps = 21/223 (9%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +V ++ + + LD++ +LD SRS+ + + + Sbjct: 15 LVSANDAANENETCRPRPLDIVFMLDGSRSVRP------KNFQTVKDYVKNFTDIFEAFG 68 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N +Q G++ F + + E LL + L I + T + L+ + Sbjct: 69 P--NDMQVGVIQFGSGVREEILLNQFYVRHELMEAIDNIRYMETGTMTGLALRKLVTETL 126 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-I 330 ++ ++ ++V +TDG++ + + EAK RG ++AIGI Sbjct: 127 TVE---HGARVDNPIVHTVVVIITDGKSQDYSRGGVT-KWTKEAKARGFEIFAIGIGRKA 182 Query: 331 RSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E L + P + V+N ++A + ++ + + Sbjct: 183 NRKELLEMASEPKELHTFRVQN----FNAIKRVDVNLKDRILC 221 >gi|194215132|ref|XP_001499245.2| PREDICTED: similar to Collagen alpha-1(XXII) chain [Equus caballus] Length = 1632 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 78/216 (36%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + + V + + Sbjct: 36 AQRAGCKTVHYDLVFLLDTSSSVGK------EDFEKVQQWVANL---VDTFEVGADHTRV 86 Query: 222 GLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y F Q Sbjct: 87 GVVRYSDQPTTAFELGHFRSREAVKAAARRLAYHGGNTNTGDALRYITRHSFSPQ---AG 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D +K++ + +TDG L A + G ++A+G+ E Sbjct: 144 GRPGDRAFKQVAILLTDGR-----SQDLVLDAAATAHRAGIRIFAVGVGEALKEELEEIA 198 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 199 SEPKSAHIFHVSD----FNAIDKI-RGKLRRRLCEN 229 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 79/216 (36%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ VLD S S+ + + + +++ ++ P + Sbjct: 27 AQRAGCKTVHYDLVFVLDSSSSVGK------ENFEKVRQWVANLVDTFEVGP---ERTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S + L+ G T++ L++ F R Sbjct: 78 GVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFT---PRAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D +K++ + +TDG L A++ G ++A+G+ E Sbjct: 135 GRPGDRAFKQVAILLTDGR-----SQDLVLPAATAARRAGIRIFAVGVGEALREELEEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +DA I + + +R+ + Sbjct: 190 SEPTAAHVFHVSD----FDAIDKI-RGKLRRRLCEN 220 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 79/216 (36%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ VLD S S+ + + + +++ ++ P + Sbjct: 27 AQRAGCKTVHYDLVFVLDSSSSVGK------ENFEKVRQWVANLVDTFEVGP---ERTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S + L+ G T++ L++ F R Sbjct: 78 GVVRYSDRPATAFELGRFGSRAAVRAAARQLAYHGGHTHTGDALRFITRHSFT---PRAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D +K++ + +TDG L A++ G ++A+G+ E Sbjct: 135 GRPGDRAFKQVAILLTDGR-----SQDLVLPAATAARRAGIRIFAVGVGEALREELEEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +DA I + + +R+ + Sbjct: 190 SEPTAAHVFHVSD----FDAIDKI-RGKLRRRLCEN 220 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 38/181 (20%), Positives = 70/181 (38%), Gaps = 22/181 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 KV + +D++ VLD S S+ + K+ +K I + + G Sbjct: 211 KVTPPCNNPVDIVFVLDGSGSVG---RRNFEKVQAGVKKIVGDFNIAL------DSTRVG 261 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S+ + + F L+ + L+ I+ + T + ++YA F Sbjct: 262 VVQYSSIVRQEFALDTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFTS------A 315 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N + +IV +TDG + +AK+ G +V+A+GI L A Sbjct: 316 NGARPDVGHVIVLVTDGRSYDDVSQAS-----QKAKQAGIVVFAVGIGDGAVESQLNQIA 370 Query: 341 S 341 S Sbjct: 371 S 371 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 74/189 (39%), Gaps = 22/189 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ +LD S S+ + + + K++ L I N + ++ +S+ + Sbjct: 1 PIDIIFMLDGSGSVGPDNFNKMKEF--VKKTVGGYL-----IGPSN--TRVAVMQYSSSV 51 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F L+ + L I+ + T + L Q F N N Sbjct: 52 RQEFALDAFNTLEDLLVGIEEIRYMRGGTRTGKALTRLRRQGFLES------NGARKNVP 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + V +TDG S DQ +L ++ G ++YA+G+ + ++ + +V Sbjct: 106 HVAVIVTDGR-SSDSVDQAALET----RQSGIVLYAVGVGNYDLGQLTDIASTNETLGVV 160 Query: 349 ENPHSMYDA 357 +N + + D Sbjct: 161 DNFNLLDDV 169 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 83/220 (37%), Gaps = 25/220 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + D++ +LD S S+ + + ++ ++E ++ PD Sbjct: 27 TEAQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVSNLVETFEIGPDK---T 77 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIK----YLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G+V +S++ F L G + +IK + +G +TN+ L+Y F + Sbjct: 78 RVGVVRYSDRPSTEFDL--GKYKTREEIKEAARKIQYYGGNTNTGDALRYITTYSFSKE- 134 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 D KK+ + +TDG + L A + G ++A+G+ E Sbjct: 135 --AGGRLSDRTVKKVAILLTDGRSQDF-----VLDPATAAHQAGIRIFAVGVGEALKEEL 187 Query: 336 LRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + +++ + + + + + Sbjct: 188 DEIASEPKSAHVFHVSDYNAIDKIRGKLRRRLCENVLCPN 227 >gi|119569133|gb|EAW48748.1| collagen, type XII, alpha 1, isoform CRA_a [Homo sapiens] Length = 821 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 103/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 384 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 439 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 440 --DIVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 488 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 489 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 542 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ E L A ASP Sbjct: 543 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDAVRSE-LEAIASPPAETHV 596 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 597 FTVED----FDAFQRISFEL-TQSICL 618 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 46/279 (16%), Positives = 100/279 (35%), Gaps = 34/279 (12%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPI 158 +G + +T + + Y ++ S ++ + + T S + Sbjct: 64 DGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVPVIGQLTIQTGSSTKPVEK 123 Query: 159 -TSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + D++ ++D S S+ + I I ++ V Sbjct: 124 KPGKTEIQKCSVSAWTDLVFLVDGSWSVGRN---NFKYILDFIAAL------VSAFDIGE 174 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + L IK + G +T + + Y N + Sbjct: 175 EKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTES 234 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R + K+ + +TDG K + E + G V+++GI+ + Sbjct: 235 AGARVG-------FPKVAIIITDG-----KSQDEVEIPARELRNVGVEVFSLGIKAADAK 282 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + ++P N + V N +DA I +I+++ Sbjct: 283 ELKQIASTPSLNHVFNVAN----FDAIVDIQNEIISQVC 317 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 82.5 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 79/189 (41%), Gaps = 24/189 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + ++A D++ ++D S+S+ ++ K IN +++++ + + +V GLV Sbjct: 766 ACSNAATDVVFLIDGSKSVRP------ENFELVKKWINQIIDKLDVSDNKAHV---GLVQ 816 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S+ +++ F L L+ +K ++ T + L+Y + F G Q Sbjct: 817 YSSAVKQEFPLGRYNNKKDLKEAVKKMAYMERGTMTGQALRYLTDNSF---GPGQGARPG 873 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 K+ + TDG + D +AK +G +YA+G+ E + P Sbjct: 874 ---VTKVGIVFTDGRSQDYIGDA-----AKKAKDQGFKMYAVGVGNAVEDELKEIASEPT 925 Query: 343 -NSFYLVEN 350 ++ + Sbjct: 926 AEHYFYTAD 934 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 66/178 (37%), Gaps = 20/178 (11%) Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIK 248 ++ + + ++E + + P+ + G+V ++++++ L+ + L + + Sbjct: 556 DPSEFEQVKVFLAKVIEGLDVGPNA---TRVGVVNYASRVKNEVSLKTHRTKAGLIKAVT 612 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + T + +++A N F + K+ + +TDG +D Sbjct: 613 KIEPLSTGTMTGLAIQFAMNVAFSEAE---GARLRSPDISKVAIVVTDGRPQDNVKD--- 666 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN---PHSMYDAFSH 360 A+ G ++AIG+ + LR AS + VE+ + F Sbjct: 667 --VAQRARDAGIEIFAIGVGRVE-MSTLRQMASDPLDDHVDYVESYSVIEKLTKKFQE 721 >gi|317419404|emb|CBN81441.1| von Willebrand factor A domain-containing protein 2 [Dicentrarchus labrax] Length = 761 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 72/214 (33%), Gaps = 29/214 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 SV +D++ LD S S+ + + + + +V Q Sbjct: 489 SVDTQGCLGQAVDLVFALDASGSVSP------DNFATMRDFVRGLSVQFDI---NRDVAQ 539 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD-MQGMR 277 LV + + F L+ S + + + + G ++ L + ++ I +G R Sbjct: 540 MALVAYGRRATTVFNLDTHDTGSAILKAVGDANYMGGVASTGTALLHVHSDILTVAKGAR 599 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N K +V +TDG + + + G ++ IGI I+ L+ Sbjct: 600 PGVN-------KAVVVVTDG-----SGGDDAAVPAQKLRDNGVSLFVIGIGDIQKERLLQ 647 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S V + + +D++ + + Sbjct: 648 IAGSEEHMISVLSYEDL-----KYFEDVLVQMLC 676 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 83/198 (41%), Gaps = 27/198 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S Q A +D++ ++D S SM + + + + + + PD V+ Sbjct: 8 SAGEMMQCSAAMDILFLMDGSYSMGK------GSFERSKHYAIKLCQALDVSPDK---VR 58 Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F + F L+ + L++ +K +S G ST + LKY + F+ Sbjct: 59 VGLIQFGSVPRLEFALDSHTTKQDLKKHMKKVSYRGGSTQTGLALKYVLRKGFEGG---- 114 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + +I + ++DG + ++ + K+ G +++A+G+R R E L A Sbjct: 115 ---RNSSAAARIAILLSDGRSQGN-----AVQAAAQLKETGVVLFAVGLRYPRWEE-LHA 165 Query: 339 CAS---PNSFYLVENPHS 353 AS + + E+ H Sbjct: 166 LASAPMESHVFFAEHFHD 183 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 60/168 (35%), Gaps = 14/168 (8%) Query: 175 MIVLDVSRSMESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 + LD S + + S T ++ ++ + + ++ + N+ + GL + + Sbjct: 304 ALSLDCSVDLLFLVEGSATLTLEGFLRLKSFLKRFLQTVVGSNSPSKVGLAVYGGETRVE 363 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + L + ++ L G T + L+Y F + + ++ Sbjct: 364 AQVGKFKGNLKGLLKAVEALQPIGGETQTGQALRYVTRHGFVSAPVFADVT---DDLPRV 420 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +V +T +T + + A+ R ++ I + R L Sbjct: 421 VVLLT-----ATPSSDEVVEPSKYARDR--EIFLIAVGPDRLKGQLNN 461 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 84/220 (38%), Gaps = 30/220 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + D++ ++D S+S+ +++ K I+ +++ + + + Sbjct: 105 TCNVCSGGGGSSATDLVFLIDGSKSVRP------ENLELVKKFISQIVDTLDVSD---KL 155 Query: 219 VQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQG 275 Q GLV +S+ + + F L ++ ++ +S T + LKY N G Sbjct: 156 AQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSG 215 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R +K+ + TDG + D +AK G ++A+G+ E Sbjct: 216 ARPGA-------QKVGIVFTDGRSQDYINDA-----AKKAKDLGFKMFAVGVGNAVEDEL 263 Query: 336 LRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P ++ + + + IGK + K + Sbjct: 264 REIASEPVAEHYFYTAD----FKTINQIGKKLQKKICVEE 299 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 51/327 (15%), Positives = 119/327 (36%), Gaps = 31/327 (9%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G + L+ G + Sbjct: 111 MGQSLQFILEHHFPEGAGSRASQGVPQVALVMSTGVAEDHFREP-----AEALKREGILL 165 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT-SSVKV 164 + + ++ + + + + L + + + Sbjct: 166 YAIGVKDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTLAKVVQHTEQGS 225 Query: 165 NSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + T+A L D++ ++D S S+ + +++++ + + ++ VQ GL Sbjct: 226 PACTEAFLADIVFLVDSSTSIG------LQNFQKVKNFLHSIVSGLDV---RSDQVQVGL 276 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHC 280 V +S+ I F L+ S + +I+ L STN+ L++ N + +M G R Sbjct: 277 VQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIRANYLTEMSGSRA-- 334 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +I++ +TDGE+ +D ++ K+ G VY +GI + E + + Sbjct: 335 ---KDGVPQIVILVTDGESNDEVQDA-----ADQLKRDGVFVYVVGINIQDVQELQKIAS 386 Query: 341 SP--NSFYLVENPHSMYDAFSHIGKDI 365 P + EN + + + + + + Sbjct: 387 EPFEEFLFTTENFNILQELSGSLSQAV 413 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 52/272 (19%), Positives = 93/272 (34%), Gaps = 34/272 (12%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 + + + V D +R T ++I V F + + Sbjct: 976 DGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELE---EIANNRVFFTDDFHFLKSIHQ 1032 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +V I S SQ D++ ++D S S+ S KI +K V Sbjct: 1033 EVVRDICSFENCRSQKA---DIIFLIDGSESISSE---DFEKIKDFVK------RMVNQS 1080 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +Q GL+ FS+ E F L+ S + R I +++ T + L + Sbjct: 1081 NIGADKIQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPF 1140 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 QG R + ++ +TDG+ + + R I++AIG+ Sbjct: 1141 FDISQGGRPG-------VHQYLIVITDGD-----SHDDIVSPAKALRDRNIIIFAIGVGK 1188 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 I+ + L + Y EN S+ + I Sbjct: 1189 IQRAQLLAITNDQDKVYHEENFESLQNLEKEI 1220 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 72/193 (37%), Gaps = 23/193 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ T++ +K + +K+ + V+ G+V +S+KI Sbjct: 869 DIYFLIDGSGSIS---PKDFTEMKEFMKDV------IKMFHIGPDGVRFGVVQYSDKIIS 919 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 FLL S K+ A +++ NT N + ++ Sbjct: 920 QFLLTQYTSM--EKLGTAIGNIQQGGGGTTTGEALSKM-----ALVFRNTARTNVAQYLI 972 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG++ D + G +YAIG+R + E N + ++ H Sbjct: 973 VITDGQSSDPVADA-----AQGLRDTGINIYAIGVRDANTTELEEIA--NNRVFFTDDFH 1025 Query: 353 SMYDAFSHIGKDI 365 + + +DI Sbjct: 1026 FLKSIHQEVVRDI 1038 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 46/348 (13%), Positives = 119/348 (34%), Gaps = 42/348 (12%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S ++ ++NF +I++ + + + +G++ +I+ + S + ++ Sbjct: 243 STSIGLQNFQKVKNFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQSSLKSVVLEQI 302 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN----------ELRDNGFVNDIDD--- 109 + N + ++ + N V D D Sbjct: 303 RNLPYNMGSTNTGSALEFIRANYLTEMSGSRAKDGVPQIVILVTDGESNDEVQDAADQLK 362 Query: 110 ----IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 V ++I V + + I++ + F S + + S+V+ Sbjct: 363 RDGVFVYVVGINIQDVQELQK--IASEPFEEFLFTTENFNILQELSGSLSQAVCSTVERE 420 Query: 166 SQ--TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + T+ D++ ++D S +S I +++ +++ D + GL Sbjct: 421 MKKPTETYADVVFLIDSS------QGTSQASFRWMRHFIARIIDMLEVGKDN---YRIGL 471 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S++ FL + + I ++L G S + GL++ + F + Sbjct: 472 AQYSDQGHTEFLFNTHKTQNEMVAHIYEHLVLQGGSRKTGQGLRFLHRTFFQEAAGSRFL 531 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +V +T G K + + + RG ++ ++G++ Sbjct: 532 ----QGVPQYVVVVTSG-----KSEDEVREPAQILRNRGVVIMSVGLQ 570 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 55/160 (34%), Gaps = 14/160 (8%) Query: 200 KSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 +S+ L + + + ++ L + + FLL + + I+ L Sbjct: 49 QSVRNFLYILANSLQVGGDNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFKPGG 108 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 L++ I + + ++ + M+ G + K Sbjct: 109 NRMGQSLQF----ILEHHFPEGAGSRASQGVPQVALVMSTG-----VAEDHFREPAEALK 159 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASP-NSF-YLVENPHSM 354 + G ++YAIG++ E +SP ++F + V N + Sbjct: 160 REGILLYAIGVKDAAQAELREIASSPKDNFTFFVPNFSGL 199 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 51/327 (15%), Positives = 119/327 (36%), Gaps = 31/327 (9%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G + L+ G + Sbjct: 111 MGQSLQFILEHHFPEGAGSRASQGVPQVALVMSTGVAEDHFREP-----AEALKREGILL 165 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT-SSVKV 164 + + ++ + + + + L + + + Sbjct: 166 YAIGVKDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTLAKVVQHTEQGS 225 Query: 165 NSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + T+A L D++ ++D S S+ + +++++ + + ++ VQ GL Sbjct: 226 PACTEAFLADIVFLVDSSTSIG------LQNFQKVKNFLHSIVSGLDV---RSDQVQVGL 276 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHC 280 V +S+ I F L+ S + +I+ L STN+ L++ N + +M G R Sbjct: 277 VQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIRANYLTEMSGSRA-- 334 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +I++ +TDGE+ +D ++ K+ G VY +GI + E + + Sbjct: 335 ---KDGVPQIVILVTDGESNDEVQDA-----ADQLKRDGVFVYVVGINIQDVQELQKIAS 386 Query: 341 SP--NSFYLVENPHSMYDAFSHIGKDI 365 P + EN + + + + + + Sbjct: 387 EPFEEFLFTTENFNILQELSGSLSQAV 413 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 52/272 (19%), Positives = 93/272 (34%), Gaps = 34/272 (12%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 + + + V D +R T ++I V F + + Sbjct: 773 DGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELE---EIANNRVFFTDDFHFLKSIHQ 829 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +V I S SQ D++ ++D S S+ S KI +K V Sbjct: 830 EVVRDICSFENCRSQKA---DIIFLIDGSESISSE---DFEKIKDFVK------RMVNQS 877 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +Q GL+ FS+ E F L+ S + R I +++ T + L + Sbjct: 878 NIGADKIQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPF 937 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 QG R + ++ +TDG+ + + R I++AIG+ Sbjct: 938 FDISQGGRPG-------VHQYLIVITDGD-----SHDDIVSPAKALRDRNIIIFAIGVGK 985 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 I+ + L + Y EN S+ + I Sbjct: 986 IQRAQLLAITNDQDKVYHEENFESLQNLEKEI 1017 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 72/193 (37%), Gaps = 23/193 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ T++ +K + +K+ + V+ G+V +S+KI Sbjct: 666 DIYFLIDGSGSIS---PKDFTEMKEFMKDV------IKMFHIGPDGVRFGVVQYSDKIIS 716 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 FLL S K+ A +++ NT N + ++ Sbjct: 717 QFLLTQYTSM--EKLGTAIGNIQQGGGGTTTGEALSKM-----ALVFRNTARTNVAQYLI 769 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG++ D + G +YAIG+R + E N + ++ H Sbjct: 770 VITDGQSSDPVADA-----AQGLRDTGINIYAIGVRDANTTELEEIA--NNRVFFTDDFH 822 Query: 353 SMYDAFSHIGKDI 365 + + +DI Sbjct: 823 FLKSIHQEVVRDI 835 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 54/395 (13%), Positives = 128/395 (32%), Gaps = 50/395 (12%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S ++ ++NF +I++ + + + +G++ +I+ + S + ++ Sbjct: 243 STSIGLQNFQKVKNFLHSIVSGLDVRSDQVQVGLVQYSDNIYPAFQLKQSSLKSVVLEQI 302 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN----------ELRDNGFVNDIDD--- 109 + N + ++ + N V D D Sbjct: 303 RNLPYNMGSTNTGSALEFIRANYLTEMSGSRAKDGVPQIVILVTDGESNDEVQDAADQLK 362 Query: 110 ----IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 V ++I V + + I++ + F S + + S+V+ Sbjct: 363 RDGVFVYVVGINIQDVQELQK--IASEPFEEFLFTTENFNILQELSGSLSQAVCSTVERE 420 Query: 166 SQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D++ + +E F + IK + + + + PD+ V+ Sbjct: 421 MKKCLAPIPTDLVFL------VEEFTSAGQPNFQQVIKLLKTTVHSLNIHPDI---VRVS 471 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIK----YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV +S K + F L + Q T + L + ++F + + Sbjct: 472 LVFYSEKPQLKFSLNTFQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFLPEKGSR 531 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 ++I V + S D S + ++ G +YA+GI+ + L Sbjct: 532 SIWG----VQQIAVII----MESPSLDNVS-TPASHLRRTGVTIYAVGIQSASESKDLEK 582 Query: 339 CASPNSFYLVENPHSMYDAFSH--IGKDIVTKRIW 371 A+ Y ++F + + I+ K++ Sbjct: 583 IAT----YPPGKHTIHLESFLQLDVVRKILNKKLC 613 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 55/160 (34%), Gaps = 14/160 (8%) Query: 200 KSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 +S+ L + + + ++ L + + FLL + + I+ L Sbjct: 49 QSVRNFLYILANSLQVGGDNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFKPGG 108 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 L++ I + + ++ + M+ G + K Sbjct: 109 NRMGQSLQF----ILEHHFPEGAGSRASQGVPQVALVMSTG-----VAEDHFREPAEALK 159 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASP-NSF-YLVENPHSM 354 + G ++YAIG++ E +SP ++F + V N + Sbjct: 160 REGILLYAIGVKDAAQAELREIASSPKDNFTFFVPNFSGL 199 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 19/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S S+ + ++ M+ VK N V+ G + +++ E Sbjct: 51 LDVVFVIDSSGSI------DYDEYNIMKDF---MIGLVKKADVGKNQVRFGALKYADDPE 101 Query: 232 EFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + L+ ++ G ST + L ++ + N + Sbjct: 102 VLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEAL--GFSDHMFTEARGSRLNKG---VPQ 156 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE+ + + + +G +V A+GI E L S + ++ VE Sbjct: 157 VLIVITDGES---HDADKLNATAKALRDKGILVLAVGIDGANPVELLAMAGSSDKYFFVE 213 Query: 350 NPHSMYDAFSHI 361 + FS + Sbjct: 214 TFGGLKGIFSDV 225 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 83/214 (38%), Gaps = 22/214 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S KV+ + D +D++ ++D S S++ K+ + S+ V+ Sbjct: 225 VTASVCNSSKVDCEIDK-VDLVFLMDGSTSIQPN---DFKKMKEFLASV------VQDFD 274 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V+ G FS+ F L G + +I+ + + +T+ L+ + Sbjct: 275 VSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAALREVEHYFR 334 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G R + ++++ +TDG+ + + RG +Y++GI + Sbjct: 335 PDMGSRI-----NTGTPQVLLVLTDGQ-----SQDEVAQAAEALRHRGIDIYSVGIGDVD 384 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + V N + I ++I Sbjct: 385 DQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNI 418 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 54/351 (15%), Positives = 107/351 (30%), Gaps = 52/351 (14%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 S + S + EG+ N + + + + D+ + Sbjct: 65 SGTESSTDGTDEILNTEGSPENGSGNGKTADTGAVSDAEVTPEKEAATEGITEQDVPEAE 124 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + + V V + G + A ++ N + + + + T Sbjct: 125 NVGAGNPVAVHLSRGAASEAAPEHQ-----KYIKKNAENDYTLTLNVKGMYDSET-TKPM 178 Query: 172 LDMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTF 226 +D+++++D S SM D+ +++D+ + + I + Q +VT+ Sbjct: 179 IDVLLIVDKSGSMNWKMDTDKVGKPSRMDVLKQVVTGTGGLTDSIFGNTQIDAQMAVVTY 238 Query: 227 S--------NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 S + + EW + + + G TN GL+ + Sbjct: 239 SGSNDFLDQRYDDAEIIQEWTKQKDTVNNAVNNIQAKG-GTNCEAGLRTGATAL------ 291 Query: 277 RQHCNTEDANYKKIIVFMTDGE--------------NLSTKEDQQSLYYCNEAKKRGAI- 321 N KK ++F++DG+ + + K G Sbjct: 292 ----EGSRENAKKFVIFLSDGDATFYYGDDGYTKGPGSGSSPTAREKAIAQVQKITGLEG 347 Query: 322 VYAIGIRVIRSHEFLRAC-----ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 Y IG+ S EFL AS FY N ++ AF I + Sbjct: 348 FYTIGMTSSSSSEFLTNLANNSKASEKRFYPANNTEALEKAFQEIVGETTE 398 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 82.1 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 75/210 (35%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + K+ +KS+ + I + G++ Sbjct: 620 KGCEDMKADIMFLVDSSGSIGLE---NFGKMKTFMKSL------LAKIQIGPDRTHIGVI 670 Query: 225 TFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K E F L ++ + + I +S +T + L +G R Sbjct: 671 QFSDKTREEFQLNKYFTQNEISDAIDRMSLIDKNTLTGNALISVDQYFTPAKGARIGI-- 728 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDGE + +G +++++G+ + Sbjct: 729 -----KKFLILITDGE-----AQDAVRDPAKALRDKGVVIFSVGVYGANRTQLEEISGDG 778 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 N + VE+ + I ++ + Sbjct: 779 NLVFQVESFDDL----KAIESKLIFRVCAL 804 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 49/324 (15%), Positives = 112/324 (34%), Gaps = 27/324 (8%) Query: 41 SHIFFMKTVLHSMIDR-SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 + +F + + IDR SL+ T N ++ IK + Sbjct: 682 LNKYFTQNEISDAIDRMSLIDKNTLTGNALISVDQYFTPAKGARIGIKKFLI--LITDGE 739 Query: 100 DNGFVNDIDDIVRSTSL---DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 V D +R + + V N + + +F ++ Sbjct: 740 AQDAVRDPAKALRDKGVVIFSVGVYGANRTQLEEISGDGNLVFQVESFDDLKAIESKLIF 799 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + + LD++ VLD S S+ + ++ + + VK + Sbjct: 800 RVCALHDCKNIK--VLDIVFVLDHSGSINTQEQENM---------MALTIHLVKKADVGS 848 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + V+ G + +S+ E F L S + ++ G T + L + +M Sbjct: 849 DRVRIGALKYSDYPEILFHLGKYSNRSSVIEHLRRRRSTGGDTYTARALDH-----TNMM 903 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ + N K++++ +TDG + ++ + + +G +YA+G+ E Sbjct: 904 FTEEYGSRIQQNVKQMLIVITDGVSHDRNLLNETAL---KLRNKGIDIYAVGVGQADQLE 960 Query: 335 FLRACASPNSFYLVENPHSMYDAF 358 + + + V+N + + D + Sbjct: 961 LEAMAGNKSKTFHVDNFNKLKDIY 984 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/338 (11%), Positives = 108/338 (31%), Gaps = 48/338 (14%) Query: 46 MKTVLHSMIDRSLVHAA------TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 K L + + S A ++ + + + + + M+ + Sbjct: 319 EKMRLEAFSESSGSRKAQGVPQIAVLVTHRPSTDEVRDVAIQLRLQDVTVFGMNIQGA-- 376 Query: 100 DNGFVNDIDDIVRSTSLDIVV---VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 D + +I + ++ + + R +I + T + + + Sbjct: 377 DETQLEEIASYPPRQMVSMLKSYADLETYSKNFQKKLRNEIWSQI------STRAEQMDL 430 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 T + D+ ++D S S++ +I + ++ M Sbjct: 431 DRTGCIDTKEA-----DIYFLIDGSTSIQ---GKHFEQIKEFMLAVTGMFSI------GP 476 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + VQ G V +S+K+ F + +++ + I + + +T + L + + I + + Sbjct: 477 DKVQVGAVQYSDKMRVEFYINDNSNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDR 536 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +R + ++ +TDG + L + ++A+GI + Sbjct: 537 KLRT------SQVPCYLIVLTDGRSTD-----DVLEPAERLRAEQVTIHAVGIGEAIKVQ 585 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + +N +D+ I ++V + Sbjct: 586 LQQIAGGEERVSFGQN----FDSLRSIKNEVVHRICTE 619 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 80/223 (35%), Gaps = 24/223 (10%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + + D++ ++D S D TK +K+ + V + Sbjct: 10 LILWTETLADQSPEPGPEYADVVFLVDSS-------DHLGTKSFPLVKAF--IHRMVSSL 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 P + + + +S+++ F L S L K L G S + L+ A+ Sbjct: 61 PIEAHKYRVAVAQYSDRLHSEFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRT 120 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F + + I+V + S + + ++ G + ++G++ Sbjct: 121 YFSAPAGGRDKK----QFPPILVVL-----ASAQSEDDVEEASKALREDGVRIVSVGLQS 171 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 E L+A A+ + + + + AFS I+ + Y Sbjct: 172 ASEEE-LKAMATAQFHFNLRSARDL-GAFSQNMTQIIKEATQY 212 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 38/341 (11%), Positives = 98/341 (28%), Gaps = 51/341 (14%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRS---------LVHAATQIMNEGNGNNRKKLKGGD 81 LG + H + ++ +E + K D Sbjct: 101 LGFLGGSLRTGHALREAHRTYFSAPAGGRDKKQFPPILVVLASAQSEDDVEEASKALRED 160 Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + + + EL+ ++ + L QN I ++Y+ Sbjct: 161 GVRIVSVGLQSASEEELKAMATAQFHFNLRSARDLG--AFSQNMTQIIKEATQYRDGAAP 218 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + +P V Q D+ D++ ++D S + + Sbjct: 219 ----------HSVDVPFP----VACQKDSLADLIFLVDES-------VGTGQNLRNLQNF 257 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNS 259 + + + + + + GL+++S++ LL + Q +I+ LS +N+ Sbjct: 258 LRNIAASMDVRYNC---TRLGLMSYSDRTNIISLLNSSTTQYEFQEQIQKLSVQAGKSNA 314 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + ++ + + +I V +T + + + + Sbjct: 315 GAAI----EKMRLEAFSESSGSRKAQGVPQIAVLVT-----HRPSTDEVRDVAIQLRLQD 365 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 V+ + I+ + + Y SM +++ Sbjct: 366 VTVFGMNIQGADETQLEEIAS-----YPPRQMVSMLKSYAD 401 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 81/212 (38%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +D+++VLD S SM D+ I K+++A + M+E ++ G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMVEHLESSDG------VGVIAF 454 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I + + G T P L A + + Sbjct: 455 DHNYYWAYKFGKLVRKEDVIESISSI-EVGGGTAIIPPLSEAVKTL-----------KKS 502 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 K++V +TDG + + +EAK+ + IG+ + L A + Sbjct: 503 KAKNKLVVLLTDGMGEQSGYEI----PADEAKRNNIKITTIGVGKFVNASVLSWIADYTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FYLV NP + D F K I K I K Sbjct: 559 GRFYLVSNPSELVDVFLKETKIIKGKYIKEKK 590 >gi|149634622|ref|XP_001513644.1| PREDICTED: similar to AMACO [Ornithorhynchus anatinus] Length = 801 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 74/217 (34%), Gaps = 28/217 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVV 219 S+ + LD++ ++D S ++ + + L D+N +V Sbjct: 534 SLDRQACQVPSLDLVFLVDASAAVGRENFTHT----------RNFVRSSSLRFDINRDVT 583 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q GLV F +I F L+ S + + + G ++ L + Y+++ +Q Sbjct: 584 QIGLVVFGRQIRTVFALDTHPTGSGVLEAVSQMPFVGGVGSAGTALLHVYDEVMTVQ--- 640 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K +V +TDG + ++ + + G V+ I + + LR Sbjct: 641 ---KGARPGVSKAVVLITDG-----TGIEDAVVPAQKLRSNGVSVFVIRVGPFQKEALLR 692 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 SP+ + + + + K Sbjct: 693 IAGSPSYLVQASSYKDL----KSYEDSFIDRICEEAK 725 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 49/233 (21%), Positives = 91/233 (39%), Gaps = 25/233 (10%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F + ++ + S+ Q A +D++ +LD S S+ + A Sbjct: 33 FLSHCIQELHASKETIMKISAASQLMQCSAAVDILFLLDGSYSIGK------GSFETAKY 86 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTN 258 + + + + + PD V+ G V FS F L+ + KIK + G ST Sbjct: 87 FVVKLCDALDINPD---RVRVGAVQFSVASWLEFPLDSCLTRQEAKDKIKKIVFRGGSTE 143 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + LKY ++ + +A+ +I++ +TDG K + K R Sbjct: 144 TGLALKY---ILW-----KGFPGGRNASVPQILILITDG-----KSQGNVTVPAQQLKDR 190 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G V+A+G+R R E L AS + V + DA++ + + + + Sbjct: 191 GITVFAVGVRFPRWEE-LHLLASEPNEQHVLFAEDVDDAYNGLMSTLTSSTVC 242 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 64/200 (32%), Gaps = 27/200 (13%) Query: 171 RLDMMIVLDVS--RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ ++D S S+E+F + + ++ V + G+ +S Sbjct: 355 SVDLLFLVDSSHGTSLEAFLE--------LKSFLKYFVQAVVSRDS---PTRVGVSQYSQ 403 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++E + W + L + ++ L + +A + T+ N Sbjct: 404 EVEVLVPVGEHWDSADLVKGVEALRLGEGGALTG----WALRHVARPGFWEASNLTDVFN 459 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSF 345 +++V T A+ +G + I + E P Sbjct: 460 KPRVVVLF-----AGTGSQDPVSEAARTARDQGL--FLIAVGREALEMELQEIAGHPKRT 512 Query: 346 YLVENPHSMYDAFSHIGKDI 365 +P ++ D + + K I Sbjct: 513 ITYSDPQNLLDRITELQKTI 532 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 100/281 (35%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I + + + + I+ + F + ++ Sbjct: 331 DGKSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 390 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ T + + ++ I Sbjct: 391 IVHRICSE---KGCEDMKADIMFLVDSSGSIGP------TNFETMKTFMKNLVG---KIQ 438 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ E F L ++ + I +S +T + L + N+ F Sbjct: 439 IGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGGALTF-VNEYF 497 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDG K + + + ++++G+ Sbjct: 498 DLS------KGGRPQVRKFLILLTDG-----KAQDEVGGPAMALRSKSVTIFSVGVYGAN 546 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 547 RAQLEEISGDGSLVFHVEN----FDHLKAIESKLIFRVCAL 583 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ +I + + S+ + + P N V+ G+V +S+K E Sbjct: 221 DIYFLIDGSSSIRKK---EFEQIQVFMSSV------IDMFPIGPNKVRVGVVQYSHKNEV 271 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + ++ + + + T + L F + +++ Sbjct: 272 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALD------FILPLIKKGKTERTDRAPCY 325 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L N + ++AIGI + + +N Sbjct: 326 LIVLTDG-----KSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQN 380 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I +IV + Sbjct: 381 ----FDSLKSIKNEIVHRICSE 398 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ +N + VK + VQ G +T+SN E Sbjct: 592 LDIVFVLDHSGSIGPREQESM---------MNLTIHLVKKADIGRDRVQIGALTYSNHPE 642 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G T + L+++ N +F +H + N ++ Sbjct: 643 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NILFT----EEHGSRLTQNVRQ 697 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + + E + +G ++A+G+ E + V+ Sbjct: 698 LMIVITDG---VSHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVD 754 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 755 NFDKLRDIY 763 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 62/171 (36%), Gaps = 21/171 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q D+ D++ ++D S + + + + V + + ++ GL++F Sbjct: 9 QKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDNC---MRLGLMSF 58 Query: 227 SNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ + L S Q++I+ LS ++N + Q+ + + Sbjct: 59 SDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAI----EQMRKEGFSESSGSRKA 114 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T D + + G ++A+GI + + Sbjct: 115 QGVPQIAVLVT-----HRASDDVVREAALDLRLEGVTMFAMGIEGANNTQL 160 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 100/281 (35%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I + + + + I+ + F + ++ Sbjct: 543 DGKSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 602 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ T + + ++ I Sbjct: 603 IVHRICSE---KGCEDMKADIMFLVDSSGSIGP------TNFETMKTFMKNLVG---KIQ 650 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ E F L ++ + I +S +T + L + N+ F Sbjct: 651 IGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGGALTF-VNEYF 709 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDG K + + + ++++G+ Sbjct: 710 DLS------KGGRPQVRKFLILLTDG-----KAQDEVGGPAMALRSKSVTIFSVGVYGAN 758 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 759 RAQLEEISGDGSLVFHVEN----FDHLKAIESKLIFRVCAL 795 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ +I + + S+ + + P N V+ G+V +S+K E Sbjct: 433 DIYFLIDGSSSIRKK---EFEQIQVFMSSV------IDMFPIGPNKVRVGVVQYSHKNEV 483 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + ++ + + + T + L F + +++ Sbjct: 484 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALD------FILPLIKKGKTERTDRAPCY 537 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L N + ++AIGI + + +N Sbjct: 538 LIVLTDG-----KSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQN 592 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I +IV + Sbjct: 593 ----FDSLKSIKNEIVHRICSE 610 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ +N + VK + VQ G +T+SN E Sbjct: 804 LDIVFVLDHSGSIGPREQESM---------MNLTIHLVKKADIGRDRVQIGALTYSNHPE 854 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G T + L+++ N +F +H + N ++ Sbjct: 855 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NILFT----EEHGSRLTQNVRQ 909 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + + E + +G ++A+G+ E + V+ Sbjct: 910 LMIVITDG---VSHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVD 966 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 967 NFDKLRDIY 975 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 79/222 (35%), Gaps = 24/222 (10%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + + T P T + + ++ +D+ +LD S+++ S + Sbjct: 1921 YEPLEKLRRCTLCYDKCFPNTCAEEPFFPENSYMDVAFLLDNSKNIAS---DDFQAVKAL 1977 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFS----------NKIEEFFLLEWGVSHLQRKI- 247 + S+ + + L+++S K E F S ++ I Sbjct: 1978 VSSVIDSFHITSNPSASESGDRVALLSYSPSESSRRKGRVKTEFAFTTYDNQSIMKNYIY 2037 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L + L++A +F T + K+I+ ++ GEN KE + Sbjct: 2038 TSLQQLNGDATIGLALQWAMEGLFLG--------TPNPRKHKVIIVISAGENHEEKEFVK 2089 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++ AK +G +V+ I + + E + P +L++ Sbjct: 2090 TVAL--RAKCQGYVVFVISLGSTQRDEMEELASYPLDHHLIQ 2129 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 42/330 (12%), Positives = 102/330 (30%), Gaps = 48/330 (14%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRS---------LVHAATQIMNEGNGNNRKKLKGGD 81 F +G +++ + + RS ++ +E + K D Sbjct: 66 FGFIGGSLKIGNALQEAHRTYFSAPRSGRDKKQFPPILVVLASAESEDDVEEASKALRED 125 Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL------------DIVVVPQNEGYSI 129 + I + L+ ++ + L D++ + + Sbjct: 126 GVKIISVGVQKASEENLKAMATSQFHFNLRTARDLGMFAPNMTRIIKDVIQYREGTTVDL 185 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHI--VMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 P T + H+ +P +S Q D+ D++ ++D S Sbjct: 186 ITAVAPATPAAPTIPAALTTAANHVDKTVPFPTSC----QKDSLADLIFLVDES------ 235 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQR 245 + + + + V + + ++ GL++FS++ + L S Q+ Sbjct: 236 -VGTTQNLRDLQNFLENVTSSVDVKDNC---MRLGLMSFSDRAQTISSLRSSANQSEFQQ 291 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 +I+ LS ++N + Q+ + + +I V +T D Sbjct: 292 QIQKLSLQTGASNVGAAI----EQMRKEGFSESSGSRKAQGVPQIAVLVT-----HRASD 342 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + G ++A+GI + + Sbjct: 343 DVVREAALDLRLEGVTMFAMGIEGANNTQL 372 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 49/146 (33%), Gaps = 15/146 (10%) Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYN 268 P N + L +S+ + F L + + +L K G S L+ A+ Sbjct: 26 PIEANKYRVALAQYSDALHNEFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHR 84 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F R + + I+V + S + + ++ G + ++G++ Sbjct: 85 TYFSA--PRSGRDKKQFPP--ILVVL-----ASAESEDDVEEASKALREDGVKIISVGVQ 135 Query: 329 VIRSHEFLRACASPNSFYLVENPHSM 354 L+A A+ + + + Sbjct: 136 KASEEN-LKAMATSQFHFNLRTARDL 160 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 73/206 (35%), Gaps = 30/206 (14%) Query: 168 TDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 D+++ +D S SM+ D TK D+ + A + + + G+ Sbjct: 79 KKKGYDIVLAIDASGSMQEKGFDPTDPQKTKFDVVRSLVKAFISKRRNDN-------IGV 131 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F + L + +++ + YL G T L + + Sbjct: 132 VIFGSFAYIASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLL----------- 180 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRAC 339 E KI++ +TDG + ++K AKK G +Y IGI R FLR Sbjct: 181 KESQAKSKIVILLTDGIDTASKTPPDVAV--KMAKKYGVKIYTIGIGDKRGIDEAFLRWL 238 Query: 340 ASPNS--FYLVENPHSMYDAFSHIGK 363 A ++ ++ + + I + Sbjct: 239 AQQGHGYYFYAKDASMLRKIYDEINR 264 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 36/257 (14%), Positives = 92/257 (35%), Gaps = 34/257 (13%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + + +FF + +G ++ +F I+ + + ++ S++ ++ +D + + A Sbjct: 1 MGLPDFFRDRRGNTAVMFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVA--- 57 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + + + + EL DN I T + Sbjct: 58 -----------VLRPATVEQAQAAVKLRLDKELGDNPDKVVIGQFNYDT--------KTR 98 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y ++A YK F+ N + I + S + L++ +VLD + SM Sbjct: 99 TYYVTAKGTYK------PFLLGVVNIKEIPYEVISETIQAAN--GTLELALVLDNTDSMG 150 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGVSH 242 + S T++D+ + ++ V + + V+ +V +++ + W Sbjct: 151 QILNGSSTRLDVLKTAATNLVNTVMTSAN-KDYVKVAVVPYADYVNVGLANRSQSWVSVG 209 Query: 243 LQRKIKYLSKFGVSTNS 259 + +K + ++ Sbjct: 210 ADYTVPAAAKTCTTIST 226 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 61/160 (38%), Gaps = 29/160 (18%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC--NTEDANYKKIIVFMTDG 297 + ++ + + + T GL + N + ++ ++++ KK+IV MTDG Sbjct: 328 TNSIKGLVNSIGSYKPDTFIPGGLHWGVNTLSPPAPFKEGMAYDSKNKEPKKVIVLMTDG 387 Query: 298 ENL---------------STKEDQQSL---------YYCNEAKKRGAIVYAIGIRVIRSH 333 N S SL C AK + V+ IG+ V Sbjct: 388 ANTLYTNSSGQIVSAATGSPPTISSSLVAPTYTAQDNACKYAKGKNIEVFVIGLGVTDPT 447 Query: 334 EF--LRACASP-NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L++CA+ ++ +N + + +AF IG + R+ Sbjct: 448 ALSALKSCATDAQHYFDAQNANDLIEAFEIIGGKLSVVRL 487 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 81.7 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 66/426 (15%), Positives = 134/426 (31%), Gaps = 95/426 (22%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G I+TA+ + +F GM ++ +H ++T L++ D + V + + + Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAE-KSGAV 66 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G I + + + ++ D+ +++ +S Sbjct: 67 AAAMTMSGNGTI------SLGKDDARNIFMSQMSGELTDVHIDLGINVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----- 185 + +P F + + I+ + QT A +D I+LD + SM Sbjct: 121 FSAT--VPTTFMRILGRD------SITISGAATAEYQTAAFMDFYILLDNTPSMGVGATA 172 Query: 186 ----------------------------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + +ID+ ++ A+ + K ++ Sbjct: 173 NDVSKLQAKTGCAFACHQMDQSTNNYTIAKGLGVAMRIDVVRQATQALTDTAKTERVSSD 232 Query: 218 VVQSGLVTFSNKIEEF-----FLLEWGVSHLQRK--------IKYLSKFGVSTNSTPGLK 264 + G+ TF K E+ ++ ++ I Y + S Sbjct: 233 QFRMGVYTFGTKAEDAKLTTISSPTSDLTKVKNYTDTVDLMTIPYQNYNQDQLTSFDSAL 292 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY------------- 311 N I D G T + + +KI+ F++DG S K + Sbjct: 293 TQMNTIIDPAGDG----TSNISPEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSF 348 Query: 312 CNEAKKRGAIV---YAIGIRVIRSH--------------EFLRACASPNSFYLVENPHSM 354 C K RG + Y + + + ++ACASP ++ V + Sbjct: 349 CKPLKDRGVKIAVLYTTYLPLPSNDWYNKWISPFQSEIPTKMQACASPGFYFEVTPTEGI 408 Query: 355 YDAFSH 360 DA Sbjct: 409 TDAMKA 414 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 92/263 (34%), Gaps = 33/263 (12%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN-SQT 168 I + ++I ++ Q I+ +Y ++ + I V+ S Sbjct: 524 IGNESQMNIKMMNQIATEPITKHIKYANTVEGINKFKNALTGQ-----ICEDVRSKQSCD 578 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +DM IV D S S+ ++++ + V LV ++ Sbjct: 579 SASMDMAIVFDGSDSV------KADNFKKLKTWTGEFIDKLGVQEYGAQV---ALVKYAT 629 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 I + L V L+ K+ + TN+ L+ A + + Sbjct: 630 SIIKVSELSSDVDELKEKLMKVPFIQGKTNTGGALERAQQML----------AEGRPSVP 679 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYL 347 KII+ +TDG+ + + + KK ++Y IG+ +I +E R + Y Sbjct: 680 KIILLITDGDATDKERLDAQI---EKLKKSNILIYTIGVGDLIDRNELNRIATDEDFVYE 736 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + +D+ S I ++ + Sbjct: 737 TRD----FDSISKIKSSLLGRVC 755 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 48/330 (14%), Positives = 108/330 (32%), Gaps = 44/330 (13%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---FVNDIDD 109 + A Q++ EG + K + + +L+ + + + D Sbjct: 658 TNTGGALERAQQMLAEGRPSVPKIILLITDGDATDKERLDAQIEKLKKSNILIYTIGVGD 717 Query: 110 IVRSTSLDIVVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 ++ L+ + ++ Y + S KI + + Sbjct: 718 LIDRNELNRIATDEDFVYETRDFDSISKIKSSLLGRVCKKAKPKT----------SGVCG 767 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS 227 D +D+ ++D S S+ + V + D+ + VQ G++T+S Sbjct: 768 DISVDLQFIVDSSSSVTRKNFG----------FAKNFVANVSSVFDLRSGDVQVGVLTYS 817 Query: 228 NKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + G H Q K++ + G T++ L+Y N Sbjct: 818 TNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGDTHTGTALRYI-----------STNNRW 866 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 KI++F+TDG + ++ + +G ++AIG+ E + P Sbjct: 867 REEVPKILIFVTDG---TPQDRAIVPAAARSLRDKGVRIFAIGVGNAVESELKEIASEPY 923 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + Y A + + + + + D Sbjct: 924 ENHAIFIQGADYSAVQRV-RGHLERLVCND 952 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 76/215 (35%), Gaps = 34/215 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I++ K + F I+ + + D D++ V+D S S+ Sbjct: 154 IASKPADKYVFEINNFDQLILKRIGILGDVCQEAE---CPDIFADIVFVVDSSGSIGP-- 208 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKI 247 + D + ++ K+ N + +V ++ + + +S + +KI Sbjct: 209 ----KRFDYMKNWVKSIAASFKVGE---NFARFSVVQYTKTAKTVVDFQTLDLSSISQKI 261 Query: 248 KYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + F G T + L+ A+ + + + R K+I++ +TDG Sbjct: 262 DSMIYFQGRNGRGGKTFTGNALERAHTLLKESEPGR----------KRIVLLLTDG---- 307 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 D + + ++A+G+ R +E + Sbjct: 308 -SSDDEYGPVAKAIRDDKVDIFAVGVGRARKNELV 341 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 75/181 (41%), Gaps = 19/181 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 ++ ++ + S+ + + D LD++ +LD S S+ ++A + ++ + + Sbjct: 363 NQKLLAEVCSASE-DVCPDQELDLVFMLDSSGSIG------TANFEIARDWLISVTKSFQ 415 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + P Q ++ ++++ F L + ++ IK + ST + A + Sbjct: 416 IGPVA---TQVSVIQYTSEPIPEFDLNDFKTAAEVEDGIKRMRLQNGSTRTDKA--RALD 470 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + D+ + + K++V ++DG++ ++ L + + V+ IGI Sbjct: 471 FVSDVIFTKNFGA--RSQAPKVLVLISDGQDNHP---ERVLEAAKKLHGQDVSVFVIGIG 525 Query: 329 V 329 Sbjct: 526 N 526 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 42/133 (31%), Gaps = 23/133 (17%) Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 I + K T L Y Y +F Q + ++ ++ ++ MTDG+ Sbjct: 57 DFAGAIDGMKKLNGDTCIGEALDYFYRNMFTSQ------AGQRSDVEQRVIVMTDGKRNC 110 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---------AS---PNSFYLVE 349 + + + A +YAIGI + C AS + + Sbjct: 111 P---AEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFEIN 167 Query: 350 NPHSMYDAFSHIG 362 N + IG Sbjct: 168 NFDQLI--LKRIG 178 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 78/202 (38%), Gaps = 33/202 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++++D S SM++ +I+ A + ++ + + Q G+V +S+ E Sbjct: 70 DVILIIDKSGSMQTD-----NRINAAKNAAKGFIDLMDMTK-----HQVGIVGYSSVAET 119 Query: 233 FF-LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + ++ I + G T + + A + + +I Sbjct: 120 SSLPLTTDTAAAKQFIDPIVASG-GTETGYAIDQAITLLSS----------HRPEAQPVI 168 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACASPN- 343 V MTDGE S++ +L AK G + Y I + +E L+ A+ N Sbjct: 169 VIMTDGEANSSQA---ALERAQAAKDAGIVFYTIALLGPNDNPDTSAPNELLKQMATTNS 225 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + V + + ++ I +I Sbjct: 226 HHHFVLGSTGLAEIYAAIVAEI 247 >gi|296474583|gb|DAA16698.1| collagen, type XXI, alpha 1 [Bos taurus] Length = 507 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L S L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SARFLTKIAVVLTDG-----KSQDEVKDAAEAARDSRITLFAIGVGSETEEAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|151556217|gb|AAI49226.1| COL21A1 protein [Bos taurus] Length = 518 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 83/214 (38%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L S L ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTGKAIQFALDYLF---------AK 130 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 KI V +TDG K + A+ ++AIG+ LRA A Sbjct: 131 SARFLTKIAVVLTDG-----KSQDEVKDAAEAARDSRITLFAIGVGSETEEAELRAIANK 185 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S + VE+ Y A S I ++++ +++ + Sbjct: 186 PSSTYVFYVED----YIAISKI-REVMKQKLCEE 214 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 78/218 (35%), Gaps = 27/218 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + N IV +S A LD+ +LD S S+ + ++ K+ ++ + Sbjct: 644 FPNGDRIVYTACRGSAGSSSCAAPLDLFFLLDGSGSVNA---ANFVKVKQFAVNVVNTFD 700 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKY 265 + G+V +S++ F L V+ I + TN+ L+Y Sbjct: 701 VSLTA------TRVGVVQYSDRNTLVFNLGNKVNKPSTVSAINNIVYQSGGTNTGAALQY 754 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 RQ+ N K+I+ +TDG++ + G VYAI Sbjct: 755 V----------RQYAAWRGGNVPKVIIVLTDGKSSDSVSG-----PSQNLVAAGVEVYAI 799 Query: 326 GIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIG 362 G+ + L+ + N+ + N +++ I Sbjct: 800 GVGSFDHGQLLQIANNKQNNVIELNNFNALATKIDMIS 837 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 43/272 (15%), Positives = 100/272 (36%), Gaps = 54/272 (19%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV------KVNSQTDARLDMMIVL 178 + S+ + Y K I Y +++ + + K + + +D+ + L Sbjct: 29 KFSSVKLLKSYGYKNKIRHKIGRYLVVLSLILMVIAMARPQLPEKNSPISKQGIDIAVAL 88 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS +M+S D ++++A K+I +++ + L+ F+ L Sbjct: 89 DVSGTMQS-VDFEPNRLEVARKTIQDFVDQRPSD-------RIALIAFAGTAYTRVPLTL 140 Query: 239 GVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + ++ + I + S T + N++ + + KI++ +T Sbjct: 141 DHNVVRESLQDISFKSVNEEGTAIGMAISVGLNRL-----------KKSTSPSKIMILLT 189 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----------------------SH 333 DG+N + D + AK G +Y IG+ + + Sbjct: 190 DGDNNAGSIDPNTASTL--AKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYESGLLNE 247 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + +Y ++ +++ F++I K Sbjct: 248 DLLKKIAETTNGQYYRAKDSNALSQVFANINK 279 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 147 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 197 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 198 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 254 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 255 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 309 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 75/180 (41%), Gaps = 19/180 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV Sbjct: 188 NRCREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLV 238 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L +G + +++ + + T + L++ F Q Sbjct: 239 QFSSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARP 295 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 296 RALNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 350 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 74/178 (41%), Gaps = 19/178 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV F Sbjct: 1 CREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLVQF 51 Query: 227 SNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+++ F L +G + +++ + + T + L++ F Q Sbjct: 52 SSRVRTEFPLGRYGTAVEVKQAVLAMEYMERGTMTGLALRHMVEHSFSEA---QGARPRA 108 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G ++YA+G+ E + P Sbjct: 109 LNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIVMYAVGVGKAVEAELREIASEP 161 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 54/302 (17%), Positives = 101/302 (33%), Gaps = 76/302 (25%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 P + ++ + K W + + I + K +S D+++V+D Sbjct: 33 EPSSPNIEVTKTAALK---------DWESRTYQINL----GAKASSSAVGAADVVLVIDR 79 Query: 181 SRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW- 238 S SM +D S TK+++ + + ++ + V +V++++ + L Sbjct: 80 SGSMGERYDGSRQTKMEVLKDTAKDFITQLSAQSPASQV---SVVSYASDSKTNIGLTSL 136 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L R I L G T S GL+ AY+ + K+ ++F+ Sbjct: 137 DTQENIQSLNRAIDKLWASGA-TRSDLGLEDAYSVLGAADSGN----------KQFVIFL 185 Query: 295 TDGENLSTK-----------EDQQSLYYCNEAKKRG------------------------ 319 TDGE S + + K+ G Sbjct: 186 TDGEPNSYSGFDREIAARAESTASIIKGEDLIKRDGRIFGDYDGSLDDGSSHNPDWEFEG 245 Query: 320 ----AIVYAIGI----RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A +++IGI R H++L S +S L + ++ D F I I Sbjct: 246 DPLSAEIFSIGILKSWSSQRVHDYLNYIDSQHSAALADTAQALQDIFEAITHQIAVTATV 305 Query: 372 YD 373 D Sbjct: 306 SD 307 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 84/232 (36%), Gaps = 28/232 (12%) Query: 144 FIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 F + + + I + S+ D + D+M ++D S S+ + K+ +K Sbjct: 599 FGQNFDSLKSIKSEVVHSICTEKGCEDMKADIMFLVDSSGSIGPE---NFGKMKTFMK-- 653 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVSTNST 260 +L ++++ PD Q G+V FS +E F L ++ + I +S +T + Sbjct: 654 -NLLAKIQIGPDK---TQIGVVQFSTDAKEEFQLNKYFTQKEISDAIDRMSLINQNTYTG 709 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L++ H KK ++ +TDGE +++ Sbjct: 710 HALEFVDQYF-------SHLKGARPGVKKFLILITDGE-----ARDGVRDPARALREKEV 757 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +++++G+ + + + VE + I ++ + Sbjct: 758 VIFSVGVYGANRTQLEEISGDGSLVFQVEKFDDL----KAIESKLIFRVCAL 805 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 79/202 (39%), Gaps = 24/202 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+++ I + MLE ++ VQ G+V +S++I E Sbjct: 441 DLYFLIDGSTSIQNEQFEQIKRF---------MLEVTEVFSIGPGRVQVGVVQYSHEIRE 491 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ + + + T++ L F + +R+ + Sbjct: 492 EFSIGVYSNDVDLRKAVLNIKQLTGDTHTGAALA------FMLPKIREGRKQRPSKVPCY 545 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG++ + L N + G ++A+GI E + + + +N Sbjct: 546 LIVLTDGQSQDHH---RILETANRIRAEGVTIHAVGIGEADKTELQQIAGNEERVHFGQN 602 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I ++V Sbjct: 603 ----FDSLKSIKSEVVHSICTE 620 Score = 74.0 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 78/190 (41%), Gaps = 19/190 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S S+ +N + V+ VQ G + +S + E Sbjct: 814 LDIVFVLDHSGSINSQQQESM---------MNLTIHLVEKANVGRGQVQFGALKYSKEPE 864 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + F L + ++ T + L++A + Q +H + N K+ Sbjct: 865 DLFYLNTFSKGAAITENLRRRRDTYGETYTAKALEHA-----NSQFTEEHGSRIKQNVKQ 919 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + +Q + + +G I+YA+G+ +E N+ V+ Sbjct: 920 MLIVITDG---VSHDREQLSDTALKLRNKGIIIYAVGVGEADQYELEAIAGDKNNTRHVD 976 Query: 350 NPHSMYDAFS 359 N + D + Sbjct: 977 NFDKLKDIYQ 986 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 39/280 (13%), Positives = 90/280 (32%), Gaps = 37/280 (13%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 ++ +E + K D + I + L+ ++ T D Sbjct: 136 ILVVLASAKSEDDVEEASKALRRDGVRIISVGVQKASEENLKAMATAQFHFNLR--TVRD 193 Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + QN I ++YK I +P V Q D+ D++ + Sbjct: 194 LSTFSQNMTQIIKDAAQYKEGEDDTD----------IEVPFP----VACQKDSLADVVFM 239 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D S + + ++ + + + ++ GL+++SN+ + L+ Sbjct: 240 VDESLGTRENLRN-------LQNFLKSITTSLDVKDNC---MRIGLMSYSNRAKTISSLK 289 Query: 238 WGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S +I+ LS +N+ + + F + +I V +T Sbjct: 290 SSTTQSEFLEQIQKLSLQAGKSNAGAAIDMMRREAFSESSGSRRA----QGVPQIAVLVT 345 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + D + + + IV+A+ I+ + + Sbjct: 346 -----NRPSDDEVREAAQKLLREDVIVFAMSIQGANNTQL 380 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 81.4 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 45/268 (16%), Positives = 100/268 (37%), Gaps = 31/268 (11%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G L A L + + G +++S + ++ L D ++ A + Sbjct: 1 MRRLASDRSGNTLALIAAGLLPLLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGT 60 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E D R + +N +F D + ++ + ++V+ Y Sbjct: 61 EIATL---TDIPTDAGTRGQEFFNSNF----------QDGNYGTQNRTFNMVLEN---DY 104 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 S+S + +P T I + + +++ +D+M+VLDV+ SM+ Sbjct: 105 SVSGTATVDVPTSVMTVF----GFTKIPVKVECQARISFSD---VDVMMVLDVTGSMKHT 157 Query: 188 FDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVS--- 241 +KID ++ ++++ ++ G V +++ + LL+ W V+ Sbjct: 158 NSGDTLSKIDSLKATVRNFYDQMEGAKSAGTRIRYGFVPYASNVNVGHLLKDEWVVNSWA 217 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + R I + T + +AY Sbjct: 218 YQSRAISGTTTVEAGTKTRE--NWAYKS 243 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 66/187 (35%), Gaps = 31/187 (16%) Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +Q+G FS L+ S L + L+ G T G+ + + Sbjct: 437 DTKKEFIQTGNWWFSGCPAPAQKLKAMTSGELDSYLDSLTPHGA-TYHDGGMIWGGRLLS 495 Query: 272 DMQ-GMRQHCNTEDANYKKIIVFMTDGENL---------------------------STK 303 ++ + + ++F+TDG+ + Sbjct: 496 QYGLFAAENSSKPGRTTSRHLIFLTDGQTEPYDLAYGSYGIDPIDERRWTQTSSLTLAQT 555 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +++ L+ CNE KK GA V+ + +++ ++ CA ++ N + DAFS I K Sbjct: 556 VEERFLFACNEVKKLGATVWVVAFGTA-ANDKMKTCAGSGRYFEAANASQLNDAFSTIAK 614 Query: 364 DIVTKRI 370 RI Sbjct: 615 STGDLRI 621 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 74/214 (34%), Gaps = 22/214 (10%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V S D+M VLD S S+ D + + + + V + Sbjct: 18 GPSFVLAASGDAG--DIMFVLDGSGSI---------SADDFVSAKSFISRVVDAFDIAAD 66 Query: 218 VVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G+V FS+ E F L+ + L++ I + + G T + Y N F Sbjct: 67 FTRVGVVQFSSFFTEEFPLDRYSDKASLKQAIGNIPQRGGGTLLGQVINYLVNTSFTEAK 126 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V MTDG + + + G I ++IG+ + + Sbjct: 127 ---GARPLSDGIPRIAVLMTDGSAHDNPTTVLAPAI-DALRASGIIAFSIGVGPSVNRDQ 182 Query: 336 LRACASP-NSFYLVENPHSMYDAFSHIGKDIVTK 368 L A A + +LV Y I +V + Sbjct: 183 LEAVAGDTDRVFLVGA----YSVIDDIRDLLVER 212 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 45/233 (19%), Positives = 88/233 (37%), Gaps = 36/233 (15%) Query: 145 IPWYTNSRHIVMPIT-SSVKVNSQTD----ARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 I + ++ S + Q D R D+ ++D S S++ D++ ++ + + Sbjct: 810 IKNQLCPETVDTSVSISGMGQAPQEDCVHIQRADIYFLIDGSSSIK---DANFLEMKVFM 866 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVST 257 E +K + VQ G+V +S+ I F L S L+ I + + T Sbjct: 867 N------EAIKRFQIGPDRVQFGVVQYSDGINIQFALSQYSSMAELKAAIDDIQQRKGGT 920 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + L + +T +N ++ +TDG++ + Sbjct: 921 MTGEALSRM---------AQVFVDTARSNVPWYLIIITDGKSEDP-----VAEPAEALRG 966 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 G I+YAIG++ E L+ A +F+ E +D+ I +D+V Sbjct: 967 EGVIIYAIGVKNANVME-LKEIAKDKTFFTPE-----FDSLKVIQRDVVQDIC 1013 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 56/320 (17%), Positives = 110/320 (34%), Gaps = 37/320 (11%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 +L ++ + ++ ++ KGG + + F + R N I I S Sbjct: 896 ALSQYSSMAELKAAIDDIQQRKGGTMTGEALSRMAQVFVDTARSNVPWYLII-ITDGKSE 954 Query: 117 DIVVVPQNEG-------YSISAISRYKIPLKFCT-----FIPWYTNSRHIVMPITSSV-K 163 D V P Y+I + + LK F P + + + I + + Sbjct: 955 DPVAEPAEALRGEGVIIYAIGVKNANVMELKEIAKDKTFFTPEFDSLKVIQRDVVQDICS 1014 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + + D++ ++D S S+ K+ +K V + +Q GL Sbjct: 1015 SETCKNRQADIIFLIDGSESISPN---DFEKMKGFVK------RMVNQANIGADEIQIGL 1065 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + FS+ +E F L S + R I + + T + L + +G Sbjct: 1066 LQFSSSPQEEFRLNQYSSKADIHRAISKVVQMNDGTRTGKALTFTLPFFDSSRG------ 1119 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N + ++ +TDG + R +++AIG+ +S + L+ Sbjct: 1120 -GRPNVHQYLIVITDG-----VAQDDVAIPAKALRDRNIVIFAIGVGEAKSAQLLQITDD 1173 Query: 342 PNSFYLVENPHSMYDAFSHI 361 Y EN S+ + I Sbjct: 1174 VQKVYYEENFESLQNLEKKI 1193 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 48/314 (15%), Positives = 109/314 (34%), Gaps = 31/314 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L ++D A ++G + ++ L+ G + Sbjct: 115 LQFLLDHHFQATAGSRASQGVPQMAMVISNSPAEDPVQEAAKA-----LKRAGVLLYTVG 169 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI--TSSVKVNSQ 167 + + ++ + + ++ L + SV + Sbjct: 170 VKDAVLAELKEIASSPAEKFTSFVPNFPDLGSHAQKLRQQVCDTLAKAAQPVDSVFPACR 229 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A D++ ++D S S+ + K+ + S+ L ++ V+ GL ++ Sbjct: 230 EAALADIVFLVDSSTSIG---PQNFQKVKNFLYSVILGL------DISSDQVRVGLAQYN 280 Query: 228 NKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + I F L S + +I+ L TN+ L++ + + + G R Sbjct: 281 DNIYPAFQLNQYPLKSVVLEQIQNLPYRTGDTNTGSALEFIRMHYLTEAAGSRA-----K 335 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 + +I++ +TDGE + + N+ K+ G +VY +G+ V E + + P Sbjct: 336 DSVPQIVILVTDGE-----SNDEVQEAANKLKEDGVVVYVVGVNVQDVQELQKIASEPFE 390 Query: 343 NSFYLVENPHSMYD 356 + +EN + + D Sbjct: 391 KFLFNIENFNILQD 404 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/246 (16%), Positives = 83/246 (33%), Gaps = 28/246 (11%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSMES 186 I++ K F S I+ + S+V+ + A D++ + D S Sbjct: 384 IASEPFEKFLFNIENFNILQDFSGGILQTLCSAVEGKIKEFTQAYADVVFLADTS----- 438 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ-R 245 ++S I+ ++ + + + Q GL + + FLL H + Sbjct: 439 -QNTSQASFQWMQNFISRVVGLLDV---GRDKYQIGLAQYGGQGHTEFLL--NTYHTRDE 492 Query: 246 KIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 I ++ + G S + L+Y + F + + V MT G Sbjct: 493 MIAHIREHFLLRGGSRRTGKALRYLHQTFFQEAAGSRFL----QGIPQYAVVMTSG---- 544 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 K + + +++G V ++G++ E L A+P Y ++ + + Sbjct: 545 -KSEDEVWDAAQTLREKGVKVMSVGVQDFDRKE-LEGMATPPLIYEMQGEDGVRQLMQDV 602 Query: 362 GKDIVT 367 I Sbjct: 603 SVVIQG 608 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 84/248 (33%), Gaps = 21/248 (8%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITS----SVKVNSQTDARLDMMIVLDVSRSMES 186 A ++ + +V+ T + +T M I D+ +E Sbjct: 581 ATPPLIYEMQGEDGVRQLMQDVSVVIQGTGKPQFGIASEKETRVACPMAIPADLVFLVEE 640 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQ 244 F I + + + + PD V+ GLV +S + F L+ + + Sbjct: 641 FSRDKQWNFQQVIDFLKTTVSSLNVHPDG---VRIGLVFYSEEPRLEFSLDTFQTPAKML 697 Query: 245 RKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + L + T + L + N++F + + + ++ V +T+G + Sbjct: 698 EHLDKLTYRRRSGRTKTGAALDFLRNEVF----VEERGSRSKQGVLQMAVVITEGFSQDQ 753 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + SL ++ G +YA+G + L AS + V + S S +G Sbjct: 754 LSEPASL-----LRRAGVTIYAVGTHRASESKDLENIASYPPWKHVISLESFLQ-LSVVG 807 Query: 363 KDIVTKRI 370 I + Sbjct: 808 SKIKNQLC 815 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 85/216 (39%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F R Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PRAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPGDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 76/201 (37%), Gaps = 25/201 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +D LD+++++D S S+ FD I ++ ++ + + PD V+ L+T Sbjct: 185 RGSDKPLDLILIIDSSESVAHLFDEQI------RFAVERIVRNINVHPDA---VRLALIT 235 Query: 226 FSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S + F + + R + L +T++ L AY + D N Sbjct: 236 YSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDTDNE----N 291 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRA 338 KK+I+ TDG + + +D K +G ++AI + E L Sbjct: 292 GVREGVKKMIIIFTDGHSQRSPQDMALR-----LKDKGVEIFAITLTPAPYADEGELLSI 346 Query: 339 CASPNSFYLVENPHSMYDAFS 359 + + + N + + Sbjct: 347 TQNTDHIFTPVNLKVLITTYK 367 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 58/186 (31%), Gaps = 20/186 (10%) Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEFFL 235 LD S S++ K I I ++ V + + L+ FS E F Sbjct: 2 LDSSGSLK-------NKFQDEIDIIRRIVNHVTIGEPA---TRVMLIQFSGVQHLEFNFK 51 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + + L T +Y + + + GMR ++ KII ++ Sbjct: 52 KFKDREDILGALDVLRHVSGITRLGDAFEYTLSMLNEENGMRP------SDIPKIIYLLS 105 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSM 354 DG +D + + +YA G + +E L P +N + Sbjct: 106 DGRTHDYPKDTEMADLLRQQIDN-VDIYAYGTGEYVAMNELLAITKDPKKIVTNQNLDDL 164 Query: 355 YDAFSH 360 F Sbjct: 165 EPMFDR 170 >gi|327261831|ref|XP_003215731.1| PREDICTED: collagen alpha-1(XII) chain-like [Anolis carolinensis] Length = 3118 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 50/281 (17%), Positives = 109/281 (38%), Gaps = 39/281 (13%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVM 156 + + ++T+ ++ + + Y I+ + + P + T + T + + Sbjct: 372 MLAGVKQQSLSVGPQTTAFNVKDLSADTEYQINVYAMKGLTPSEPITIME-KTQPVKVQV 430 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + V V + D+++++D S S+ ++ K+ ++ + VK Sbjct: 431 ECSKGVDVKA------DIVLLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISP 475 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 VQ LV +S F L + + + I G STN+ + Y +IF Sbjct: 476 EKVQISLVQYSRDPHTEFTLNRYNRIDDIIQAINTFPYRGGSTNTGKAMTYVREKIFVTG 535 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++++ +TDG++ +D + + ++A+G++ E Sbjct: 536 ------RGARPNVPRVMILITDGKSSDAFKD-----PAIKLRNSDVEIFAVGVKDAVRTE 584 Query: 335 FLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L A A+P Y VE+ +DAF I ++ T+ + Sbjct: 585 -LEAIATPPAETHVYTVED----FDAFQRISFEL-TQSVCL 619 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 54/339 (15%), Positives = 109/339 (32%), Gaps = 35/339 (10%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL--CRIKNTWNM--------SFR 95 MK L S I + + A + + N I+ ++ TW Sbjct: 1 MKIRL-SAIAVAALCTALLLCSIEAEVNPPSDLKFKIIDERTVQMTWTRPSDAIEGYRIT 59 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 +G + ST + + + Y ++ S I + + Sbjct: 60 VTPTTDGPSREFTLAPSSTETLLRDLTADIEYLVTISSYDNIEESIS--VSGQLTIQTGG 117 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + T K+ R + V D+ ++ + + I A+ V Sbjct: 118 LITTGEKKIEEIQLQRCSLSAVTDLVFLVDGSWSVGRNNFKYILDFIVAL---VSAFDIG 174 Query: 216 NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + G+V +S+ F L + L IK + G +T + ++Y F Sbjct: 175 EDKTRVGIVQYSSDTRTEFNLNQYYRQRDLIEAIKNIPYKGGNTMTGEAIEYLMRNTF-- 232 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++ K+ + +TDG K + E + G V+++GI+ + Sbjct: 233 ----VESAGSRKDFPKVAIIITDG-----KSQDEVEIPARELRAAGVEVFSLGIKAADAK 283 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + P+ + V N + D I +IV++ Sbjct: 284 ELKLIASQPSLTHVFNVANFDGIVD----IQNEIVSQVC 318 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 83/253 (32%), Gaps = 32/253 (12%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y ++ ++ T + S + +T A D+++++D S S Sbjct: 1150 GTSYKVNVFGVFEGGESMPLIGEEMTTLSDATVVPILSTGLECKTRAEADIVLLVDGSWS 1209 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + I ++E + PD VQ GL +S + L Sbjct: 1210 IG------RPNFKTVRSFIARIVEVFDIGPDK---VQIGLAQYSGDPRTEWQL--NSHKT 1258 Query: 244 QR----KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 ++ + L G +T + L N I + A +KI V +TDG Sbjct: 1259 KQSLMDAVANLPYKGGNTLTGMAL----NFILRNNFKPEAGMRPGA--RKIGVLITDG-- 1310 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDA 357 K + K G +YA+GI+ +E + + P+ Y V + + Sbjct: 1311 ---KSQDDIVAPSQRLKDLGVELYAVGIKNADENELKQIASDPDETHAYNVGDFTLLV-- 1365 Query: 358 FSHIGKDIVTKRI 370 +I D+ Sbjct: 1366 --NIVDDLTVNLC 1376 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 67/423 (15%), Positives = 132/423 (31%), Gaps = 89/423 (21%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N G I+TA+ + + GM ++V+H ++T L++ D + V I + Sbjct: 8 FISNRSGNFGIMTALLMVPLMGAAGMAVDVAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G+ + T + ++ DI +D+ +S Sbjct: 65 AVAAAMTMNGNGTVSLGKTDARNIFMSQTSG----ELTDIHIDLGIDVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----- 185 + +P F + I+ + QT A +D I+LD + SM Sbjct: 121 FTAT--VPTTFMRIFGRD------SIIISGTATAEYQTAAFMDFYILLDNTPSMGVGATA 172 Query: 186 ----------------------------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + +ID+ ++ A+ + K ++ Sbjct: 173 SDVSKLQAKTGCAFACHQMDQSTNNYTIAKSLGVTMRIDVVRQATQALTDTAKAERVSSD 232 Query: 218 VVQSGLVTFSNKIEEF-----FLLEWGVSHLQRKIKYL-----SKFGVSTNSTPGLKYAY 267 + G+ TF K E+ L ++ ++ + +++ A Sbjct: 233 QFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTNAVDLMTIPYQNYNSDQLTSFDSAM 292 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-------------CNE 314 QI + T + + +KI+ F+ DG S K + C Sbjct: 293 TQINTIIDP-AGDGTSNISPEKILFFVADGVGDSYKPSTCTKKTTGGRCQEPIDTTFCKP 351 Query: 315 AKKRGAIV---YAIGIRVIRSH--------------EFLRACASPNSFYLVENPHSMYDA 357 K RG + Y + + + ++ACASP ++ V + DA Sbjct: 352 LKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQNEIPTKMQACASPGLYFEVTPTDGIADA 411 Query: 358 FSH 360 Sbjct: 412 MKA 414 >gi|281345873|gb|EFB21457.1| hypothetical protein PANDA_003362 [Ailuropoda melanoleuca] Length = 399 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 77/203 (37%), Gaps = 23/203 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + + + D++ ++D S S+ K+ +K V + +Q Sbjct: 210 SLLSPACKNRQADIIFLMDGSESISPE---DFEKMKGFVK------RMVNQADISTDEIQ 260 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ FS+ +E F L+ S + R I + + T + L + FD Sbjct: 261 IGLLQFSSTPQEEFRLDQYSSKVDIHRAITNVQQMNDGTRTGKALNFT-RPFFDSS---- 315 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 N ++ ++ +TDG + + R +++AIG+ ++ + L+ Sbjct: 316 --RGGRPNVQQYLIVITDG-----VAQDDVVMPAKALRDRNIVIFAIGVGEAKNAQLLQI 368 Query: 339 CASPNSFYLVENPHSMYDAFSHI 361 P Y EN S+ + I Sbjct: 369 TDDPQKVYYEENFESLQNLEKKI 391 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 71/191 (37%), Gaps = 17/191 (8%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVS 241 +E F + + + + + + + PD V+ GLV +S + F L+ + Sbjct: 8 IEEFSWDRQSNFQQVVNFLKSTVSSLNVHPDG---VRIGLVFYSEEPRLEFSLDAFQNPA 64 Query: 242 HLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + L + T + L + N++F R + +++ V +T+G + Sbjct: 65 SILEYLDRLTYRRRSGRTKTGAALDFLRNEVFI--EERGSRSKHG--VQQMAVVITEGFS 120 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 SL ++ G +YA+G + + L AS + V + S S Sbjct: 121 QDQLSKSASL-----LRRAGVTIYAVGTHLASESKDLENIASYPPWKHVISLESFLQ-LS 174 Query: 360 HIGKDIVTKRI 370 +G I + Sbjct: 175 VVGNKIKNQLC 185 >gi|324499923|gb|ADY39979.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 2123 Score = 81.0 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 30/239 (12%), Positives = 88/239 (36%), Gaps = 22/239 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + K + +++ ++ + ++ + D++ ++D S S+ S Sbjct: 406 VMLPDGYTCKCFSGYVDVSSNANLPLGRVCTL-QTTCPAQPTDLVFLIDGSGSIGS---- 460 Query: 191 SITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKI 247 D+ + L+E V+L ++ + +V +S++I F L S +++ I Sbjct: 461 -----DVFRGEVLRFLKEFVELFDISSDRTRVAVVQYSDRIRHEFDLNQYSSIQNVEDAI 515 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + T + +++ N+ F+ R+ ++ + +TDG + Sbjct: 516 DRIQYMTGLTRTGAAIEHVRNEAFN---ERRGARPLSDKISRVTIVITDG-----RSQDN 567 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASPNSFYLVENPHSMYDAFSHIGKDI 365 +A+++ ++A+G+ L + + ++ V + + + Sbjct: 568 VSLPAQQARQQHIQLFAVGVTNHVLDSELETIAGAKDRYFHVTGFKDLNARLRSAIQKV 626 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 73/178 (41%), Gaps = 19/178 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+++++D S+S+ ++ + +N +++ + + P+ + GLV F Sbjct: 1 CREGHVDLVLLVDGSKSVRP------QNFELVKRFVNQIVDFLDVSPEG---TRVGLVQF 51 Query: 227 SNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+++ F L +G + +++ + + T + L++ F Q Sbjct: 52 SSRVRTEFPLGRYGTAAEVKQAVLAVEYMERGTMTGLALRHMVEHSFSEA---QGARPRA 108 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N ++ + TDG + + AK+ G +YA+G+ E + P Sbjct: 109 LNVPRVGLVFTDGRSQDDIS-----VWAARAKEEGIAMYAVGVGKAVEAELREIASEP 161 >gi|47217757|emb|CAG05979.1| unnamed protein product [Tetraodon nigroviridis] Length = 1380 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 77/213 (36%), Gaps = 21/213 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D+ +LD S S+ + + + +++ + PD + +V Sbjct: 1 AGCKTVQYDLAFILDTSSSVGK------ENFEKIRQWVANLVDSFDVAPDK---TRVAVV 51 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S++ F L + ++R + + G +T + + Y + IF + + Sbjct: 52 RYSDRPTTEFNLARYRTLEDVKRAARNIRYLGGNTMTGDAISYTTSNIFTERNGARPIAR 111 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +++ + +TDG L A K G ++A+GI E A P Sbjct: 112 G---IQRVAILLTDGR-----SQDYVLEPSKAAAKAGIRMFAVGIGEALKVELDEIAAEP 163 Query: 343 NS--FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + V + +++ + K + + + Sbjct: 164 KNAHVFHVTDFNAIDKIRGRLRKRLCENVLCPN 196 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 80.6 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 76/203 (37%), Gaps = 32/203 (15%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SME T++ + ++ + + + GLV FS+ Sbjct: 92 DIMMAIDLSGSMEERDFAVGGRPATRLSIVKETADDFISRRDGD-------RLGLVLFSD 144 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + +++ + + G T + + ++ D Sbjct: 145 RAYLQAPLTFDREAVRKLLDQAQVGLTGQKTAIGDAIAVSVKRLKDRPEDG--------- 195 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----LR--ACA 340 +++V +TDG N + + A K G +Y IG+ RS + LR A A Sbjct: 196 --RVLVLLTDGANN--EGVMSPDKAADLAAKLGIRIYTIGVGSARSRDLDERTLRQIADA 251 Query: 341 SPNSFYLVENPHSMYDAFSHIGK 363 + +++ + + + I + Sbjct: 252 TGGAYFRATDVQGLAQIYRAIDR 274 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 75/197 (38%), Gaps = 26/197 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L +++++DVS SM +D A ++ + L++ + GL+++S+++ Sbjct: 510 PLAILLLIDVSSSMAG------PPLDEAREAARSFLDQCDFT-----TTRVGLISYSDQV 558 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L V ++ + L G +TN L+ ++ + + K Sbjct: 559 VLQTDLTDNVRKVEAGLARLEADG-TTNLAGALELGRRKLATVP----------TGHVKY 607 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +V +TDG +L AK G + AIG +L AS + + Sbjct: 608 LVVLTDGYPDDP---DNALLEAAHAKGSGIEIVAIGTGEAD-QAYLDRIASTQAGSIFAR 663 Query: 351 PHSMYDAFSHIGKDIVT 367 + AF HI + I Sbjct: 664 KGELVRAFGHIARVIAE 680 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 80/217 (36%), Gaps = 39/217 (17%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ + T++ K ++ +++ + GL+ F Sbjct: 95 DLLLAVDISGSMQQEDMQINNRPATRLAAVKKVVSDFIDQ-------RQGDRIGLILFGT 147 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + ++ L G T + + ++ + + Sbjct: 148 QAYLQTPLTFDTQSVNQFLQEAQLGFAGKDTAIGDAIGLSVKRLKNQSSASSAKPSNS-- 205 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 K+I+ +TDGEN + + + L A+K GA +Y +GI Sbjct: 206 --KVIILLTDGENTA--GEVEPLQAAKLAEKIGAKIYTVGIGADEMIVRGFFGNRRVNPS 261 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L A A + ++ N + + +S + K Sbjct: 262 ASLDEETLTAIANTTGGLYFRARNTQELNNIYSELDK 298 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+VLD+S SME ++++ + ++ +E+ ++ Sbjct: 89 EPKPVAREGYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVED-RVGD----- 142 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + + +I + G +T+ + A ++ D+ Sbjct: 143 -RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPSK 201 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------- 328 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 202 G-----------RVIILLTDGANN--SGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 248 Query: 329 ----------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ +P S+ + I + Sbjct: 249 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 295 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+VLD+S SME ++++ + ++ +E+ ++ Sbjct: 81 EPKPVAREGYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVED-RVGD----- 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + + +I + G +T+ + A ++ D+ Sbjct: 135 -RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPSK 193 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------- 328 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 194 G-----------RVIILLTDGANN--SGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 329 ----------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ +P S+ + I + Sbjct: 241 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+VLD+S SME ++++ + ++ +E+ ++ Sbjct: 89 EPKPVAREGYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVED-RVGD----- 142 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + + +I + G +T+ + A ++ D+ Sbjct: 143 -RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPSK 201 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------- 328 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 202 G-----------RVIILLTDGANN--SGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 248 Query: 329 ----------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ +P S+ + I + Sbjct: 249 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 295 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+VLD+S SME ++++ + ++ +E+ ++ Sbjct: 81 EPKPVAREGYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVED-RVGD----- 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + + +I + G +T+ + A ++ D+ Sbjct: 135 -RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPSK 193 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------- 328 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 194 G-----------RVIILLTDGANN--SGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 329 ----------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ +P S+ + I + Sbjct: 241 TGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 80.6 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 44/207 (21%), Positives = 76/207 (36%), Gaps = 31/207 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +D+MI LDVS SM D T++ ++ K + +++ K + GLV Sbjct: 80 PNEKKGVDVMIALDVSGSMSRSRDFLPETRLGVSKKLLRKFIDKRKSD-------RLGLV 132 Query: 225 TFSNKIEEFFLLEWGVSHLQR---KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F+ L L I+ + T + I +R Sbjct: 133 VFAGAAYLQAPLTGDRESLNEILGTIEEETVAEQGTAIGDAI------ILSTYRLRA--- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFL 336 K+IV +TDG + + K D + + A+ G +Y++GI E L Sbjct: 184 --SQARSKVIVLITDGVSNTGKIDPVTAT--DLAEHIGVKIYSVGIGKEDGSYEINFEIL 239 Query: 337 R--ACASPNSFYLVENPHSMYDAFSHI 361 R + ++ F+ E+P M + I Sbjct: 240 RELSASTGGKFFRAEDPEEMKAVLTSI 266 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 80.2 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 88/218 (40%), Gaps = 28/218 (12%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLD 179 + ++++I +KI L+ FI +++ P +SS ++T +D+++ +D Sbjct: 39 SSSQALTSIHTWKIRLRPILFILRLLALSCLIIALARPQSSSEITKTKTTEGIDIILAID 98 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 +S SM + D +I+ + + +EE K + G+V +S + Sbjct: 99 MSSSMLAK-DLKPNRIEALKRVASQFIEERKSD-------RIGIVVYSGESYTKVPATTD 150 Query: 240 VSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S + + +K + + T GL A N++ + K+I+ MTDG Sbjct: 151 KSIVLQSLKDIKQGEIEDGTAIGMGLGTAINRL-----------KDSKTKSKVIILMTDG 199 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 N + D L AK+ G VY IGI Sbjct: 200 VNNTGVIDP--LSAAELAKEYGIRVYTIGIGT-NGKAL 234 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 63/441 (14%), Positives = 138/441 (31%), Gaps = 81/441 (18%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDR------------ 56 R F G + ++ A+ L + +G+ ++ S +T L + +D Sbjct: 15 RRFPQANGGNIAVIFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVG 74 Query: 57 -------------------------SLVHAATQIMNEGNGNNRKKLKG-GDILCRIKNTW 90 ++ AT +G+G++ + G G + + Sbjct: 75 TITPDQVSSKAQTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSGQGAVQTQFMKIL 134 Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 + + +LD+ + G + + K + + Sbjct: 135 GFQTMAIGSSTTTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNLVDSLRASAQTVD 194 Query: 151 SRHIVM-PITSSVKVNSQTDARLDMMIVL-DVS-RSMESFFDSSITKIDMAIKS------ 201 +I + P V V S + L D S S S++ S+ + + A ++ Sbjct: 195 DVYISVVPFAQMVNVGSSNRNASWVRWDLWDESNGSCSSWWYSTKSSCEYAGRTWTATSH 254 Query: 202 ---INAMLEEVKLIPDVNNV-----VQSGLVTFSNKIEE------FFLLEWGVSHLQRKI 247 + + + +V + V + ++ + L + ++ KI Sbjct: 255 NQWAGCVTDRDQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYSLSNATT-IKNKI 313 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-------- 299 LS G TN G+ +A+ + + + Y I+ ++DG N Sbjct: 314 DALSPNG-GTNQAIGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLNTVDRWYGN 372 Query: 300 ---LSTKEDQQSLYYCNEAKKRG-----AIVYAIGIRVIR--SHEFLRACASPNSFYLVE 349 S + D + C+ + ++Y I + L+ CA +F+ Sbjct: 373 GRDWSPQVDARQRILCDNIRASATNTNPVVIYTIQVNTDGDPESAVLKYCADSGNFFATT 432 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + AF+ IG + R+ Sbjct: 433 TSSGIGTAFAQIGSSLSKLRV 453 >gi|332831234|ref|XP_528236.3| PREDICTED: collagen alpha-1(XXII) chain, partial [Pan troglodytes] Length = 695 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 37/223 (16%), Positives = 86/223 (38%), Gaps = 26/223 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + D++ +LD S S+ + + + +++ ++ PD Sbjct: 20 SGGGSCQAQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD 73 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 74 ---RTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS 130 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R D YK++ + +TDG + D + + + G ++A+G+ Sbjct: 131 ---PRAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALK 182 Query: 333 HEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E + P + V + ++A I + + +R+ + Sbjct: 183 EELEEIASEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 47/273 (17%), Positives = 97/273 (35%), Gaps = 39/273 (14%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 L + I + S V Q E A+ + I + + P Sbjct: 329 LEAAKYAVYDSIIFNNVSGSDVGGKQMELIE-QAVRSFGIGFMMAGGEDSFGMGGYFKTP 387 Query: 158 ITSSVKVNSQ-----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I ++ V+ + L +++V+D S SM+ KI++A +S +E ++ Sbjct: 388 IEKALPVSMELEGKREIPSLGLILVIDRSGSMDGN------KIELAKESAMRTVELMRAK 441 Query: 213 PDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 G+V F ++ + L + I+ + G TN P + A + Sbjct: 442 DT------VGVVAFDDQPWWVVPPQKLGD-KEEVLSSIQSIPSAG-GTNIYPAVSSALEE 493 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + R+H I+ MTDG++ Q + + + ++ + + Sbjct: 494 MLKIDAQRRH-----------IILMTDGQSAMNSGYQD---LTDTMVENKITMSSVAVGM 539 Query: 330 IRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 L++ A+ +Y VE+ ++ FS Sbjct: 540 DADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 72/218 (33%), Gaps = 27/218 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 PI S LD++ +LD S S+ S ++ + I Sbjct: 1572 GPICQFYAACSNRTLNLDVVFLLDGSGSVGS---ANFDLLKTFTTRIAT------NFDVS 1622 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 N+ + G+V +S++ F+L + + I +S T++ L Y +F Sbjct: 1623 TNLTRVGVVQYSDQTNSEFVLNTFSTEAEVLAAIAAISYQNGGTSTGAALDYVRQNVFIS 1682 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RS 332 + + + I++ +TDG + A+ G +Y++GI Sbjct: 1683 ------ASGDRPDAANILIVLTDG-----VSSDDVSFPAMAARNAGITIYSVGIGDGVDY 1731 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + PN + + IG + Sbjct: 1732 NTLQQIAGDPNKVLQATG----FSSLDDIGGQLEELVC 1765 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 70/191 (36%), Gaps = 23/191 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ +LD S S+ S ++ I L P +N + GLV FS+ Sbjct: 1839 GGWDLVFLLDGSGSVGSNNFLNVKNFTKLIT---------DLFPVGDNATKVGLVQFSDT 1889 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 I++ F L + + I +S G T + + Y F+ N ++ Sbjct: 1890 IQKEFDLRDYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFNT------INGNRGSH 1943 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFY 346 +++ +TDGE+ + A+ +G ++AIG+ + L P Sbjct: 1944 PDMLIVLTDGESFDP-----VTFASQSARDQGITIFAIGVGTGVDYATLEEIAGDPQKVQ 1998 Query: 347 LVENPHSMYDA 357 V + + Sbjct: 1999 QVTDFADLTSV 2009 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 24/238 (10%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 L+ F+ + S+ + I + D LD++ +LD S S+ S D+ Sbjct: 719 LQINNFVDLASASQTLPDYICNYASCQG-RDVPLDIVFLLDGSGSVGS------ANFDLV 771 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 + + N+ Q G+V +S+ + F L + I +S Sbjct: 772 KDFTRTL---ARNFDIAANMTQIGVVQYSDTVNREFGLGDFHNRQDVLNAISAVSYQQGG 828 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T + + + F + + + +++ +TDG + A+ Sbjct: 829 TLTGAAIDFVRQTSFTTG------DGDRPDVPNMLIVVTDG-----VSGDSVQGPADAAR 877 Query: 317 KRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + G + +GI I L ++ ++ + + + D Sbjct: 878 REGITTFGVGIGNGIDFGTLLEIAGDSARVLQADDFGALATVAQRLQEVVCDLEYCGD 935 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 71/197 (36%), Gaps = 23/197 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 V +D++ ++D S S+ S+ D+ + + + + +++ + Sbjct: 342 LTTVEPCERLEIDVIFLIDGSSSI------SLLNFDLLKTFLQNITMKFDV---SSDITR 392 Query: 221 SGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V +S + F L + + I +++ ST G+ + F + + Sbjct: 393 IGVVQYSTDVNTEFELKTYATEAEVIHAISNITRQRGSTFIGAGINFVRTNSFTVAAGDR 452 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLR 337 I+V +TDG A+ +G + Y+IGI I+ L Sbjct: 453 PLAPN------ILVTITDG-----ISADDVAGPAQAARDQGILTYSIGIGEEIQWPTLLS 501 Query: 338 ACASPNSFYLVENPHSM 354 + + + V + + Sbjct: 502 IAGARHRVFNVTSFSEL 518 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 71/231 (30%), Gaps = 27/231 (11%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + LD++ +LD S S+ ++ Sbjct: 1270 QPTGTTSRCLCPAGYSGDFCQFYTACLNHTIALDLIFLLDGSGSIT------APNFELVK 1323 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVST 257 ++ + P+ + G+ +S+ F L + + + +S G T Sbjct: 1324 SFTYSVSRNFDVSPNA---TRIGVAQYSDTNSLEFNLNRYSTKDEVLNAVNGISYQGGGT 1380 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + L + + + I+V TDGE+ +DQ++ + Sbjct: 1381 YTGAALDFVRQTMM------VESAGDRTMSPNILVVATDGES---SDDQRT--PAEVLRN 1429 Query: 318 RGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G +VYA+GI S L + + + + IG+++ Sbjct: 1430 AGTLVYAVGIGAGVSSTTLLDIAGYNSRVLQATD----FASLEVIGRELQE 1476 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 57/168 (33%), Gaps = 14/168 (8%) Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 V + G+V +S+ F L + + I ++ TN+ + Y Sbjct: 2440 VDNFEISQTDTKVGVVQYSSSSNVEFYLNAFSTKQAVLDAINAVTYQQGGTNTGAAITYT 2499 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +IF N ANY +++ +TDGE+ A+ G ++YA+G Sbjct: 2500 MQEIFAS------ANGARANYPDVLIVVTDGESSD-----DVAVPALSARNAGTLIYAVG 2548 Query: 327 IRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + L+ + + + + +++ D Sbjct: 2549 VGNGVNQATLLQIAGNAGQVLQAADFAGLTTVVQSLQQNLCDAAYCGD 2596 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 70/190 (36%), Gaps = 23/190 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ +LD S S+ T ++ + ++ + D NV G+V +S+ Sbjct: 1018 GGLDLVFLLDGSGSVG------TTNFELVKDFTSEVVLNFNISADTTNV---GVVQYSDT 1068 Query: 230 I-EEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + EFFL + L I +S T + + Y F + Sbjct: 1069 VRNEFFLSSYDTKLPLIDAINQISYLTGGTLTGFAIDYVRQSSFS------RPAGARNTF 1122 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFY 346 ++V +TDG+ + A+ +G ++A+GI L+ P+ Sbjct: 1123 PDVLVVLTDGQ-----SQDDVVSSAAAARSQGITIFAVGIGSEVDFTTLLQISGYPSRIL 1177 Query: 347 LVENPHSMYD 356 +++ ++ Sbjct: 1178 QIQDFATLVT 1187 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 66/204 (32%), Gaps = 27/204 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ +LD S S+ + D+ N + + P + G+V +S Sbjct: 2121 NAQTFDLVFLLDGSGSVG------ASSFDLMKSFTNRITTNFDVSPTS---TRVGVVQYS 2171 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ + + + +T + L Y F + N Sbjct: 2172 SQGSVATEFRLDSYSNKDDVIAAVNGIVYQNGNTYTGEALNYVRQNSFAV------ANGG 2225 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 A+ I+V +TDG+ + + G VYA+GI + L A A Sbjct: 2226 RADVANILVVITDGQ-----SVDDVTGPAQDLLREGVTVYALGIGDGIQYSTLEAIAQDQ 2280 Query: 344 SFYLVENPHSMYDAFSHIGKDIVT 367 S L N + S+ + + Sbjct: 2281 SRVLQANT---FTNLSNTAQALQE 2301 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 64/193 (33%), Gaps = 23/193 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD++ ++D S S+ S + I + +++I A E + G+V + Sbjct: 130 CSGFALDLVFLVDGSSSVGS---DNFETIKVFLEAITAGFEVSSSQ------TRVGVVQY 180 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S I F L + + I+ LS ST + G+ F + + Sbjct: 181 STGINTEFDLNSFATEAEVINAIRGLSHQRGSTFTGAGIT------FTRLESFTGASGDR 234 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPN 343 + +++ +TDG A+ Y+IGI I L Sbjct: 235 PDAPNVLIVITDG-----ISADSVDAPAEAARADNITTYSIGIGDEINYLTLLSIAGMRE 289 Query: 344 SFYLVENPHSMYD 356 V + D Sbjct: 290 RVLNVTTFGDLND 302 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 61/194 (31%), Gaps = 23/194 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV------VQSGLVTF 226 D++ +LD S S+ S + + + + D + G V + Sbjct: 551 DLVFLLDGSASITS---PNFELVKDFAERVARHFTISSSRNDNMSYRSFTAATNVGAVQY 607 Query: 227 SNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ + F L + + R + +S T + L + F Sbjct: 608 SDTVRSEFFLSSFDTDFEVVRALDGISYLAGGTFTGFALDFVQQSAFS------PVAGAR 661 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPN 343 Y I+V +TDG + A+K G V+A+GI + L+ Sbjct: 662 DGYPDILVVVTDG-----VSQDDVVAPAESARKEGIAVFAVGIGSAVDYATLLQIAGIDG 716 Query: 344 SFYLVENPHSMYDA 357 + N + A Sbjct: 717 RILQINNFVDLASA 730 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 60/193 (31%), Gaps = 27/193 (13%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWG 239 S S+ S ++ ++ + + + G+V +S+ F L + Sbjct: 2707 SGSVGS------DNFNLLKAFTQNIVGNFDIAVNN---TRVGVVQYSDFNNIEFNLNAYA 2757 Query: 240 VS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 + I +S T + + + +F G A+ I++ +TDGE Sbjct: 2758 TEAEVLAAIGAISYQRGGTFTGAAIDFVRQDVFTTAG------GNRADKPDILLVLTDGE 2811 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-ACASPNSFYLVENPHSMYDA 357 + G +YA+GI + + L+ P V + + Sbjct: 2812 SSD-----SVAGPAQNTLNAGITIYAVGIGSGVNADTLQEIAGDPGRVLQVAD----FQG 2862 Query: 358 FSHIGKDIVTKRI 370 + I + Sbjct: 2863 LAAITNQLQEALC 2875 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 80.2 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 47/274 (17%), Positives = 97/274 (35%), Gaps = 39/274 (14%) Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 L + I + S V Q E A+ + I + + Sbjct: 328 SLEAAKYAVYDSIIFNNVSGSDVGGKQMELIE-QAVRSFGIGFMMAGGEDSFGMGGYFKT 386 Query: 157 PITSSVKVNSQ-----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 PI ++ V+ + L +++V+D S SM+ KI++A +S +E ++ Sbjct: 387 PIEKALPVSMELEGKREIPSLGLILVIDRSGSMDGN------KIELAKESAMRTVELMRA 440 Query: 212 IPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 G+V F ++ + L + I+ + G TN P + A Sbjct: 441 KDT------VGVVAFDDQPWWVVPPQKLGD-KEEVLSSIQSIPSAG-GTNIYPAVSSALE 492 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ + R+H I+ MTDG++ Q + + + ++ + Sbjct: 493 EMLKIDAQRRH-----------IILMTDGQSAMNSGYQD---LTDTMVENKITMSSVAVG 538 Query: 329 VIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 + L++ A+ +Y VE+ ++ FS Sbjct: 539 MDADTNLLQSLADAAKGRYYFVEDETTLPAVFSR 572 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 90/263 (34%), Gaps = 48/263 (18%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL------DMMIVLDVSRSMES----FFD 189 + ++P +V+ I + + + + + +V+D S SM++ D Sbjct: 47 QRLMWLPGALTLTAMVLLIIALARPREGREQAIVENDGIAIEMVVDRSGSMQAMDFQLGD 106 Query: 190 SSITKIDMAIKSINAMLEEVKLIPD-VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 + ++ K + + ++++V GL+TF+ + + L ++ Sbjct: 107 EHVDRLTAIKKVAGDFVTGGDNLDGRLSDLV--GLITFAGYADGVTPPTLDHAFLVSQLN 164 Query: 249 ----YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 ++ T + A ++ + R+ KII+ +TDGEN + Sbjct: 165 HSQIVTNRSEDGTAIGDAISLAVEKLNALDARRKEKIQ-----SKIIILLTDGENNA--G 217 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRS----------------------HEFLRACAS- 341 D + + A+ G VY IG+ E L+ AS Sbjct: 218 DLEPIQAAELAQTMGIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVASI 277 Query: 342 -PNSFYLVENPHSMYDAFSHIGK 363 ++ + S+ + I + Sbjct: 278 TGGKYFRATDTDSLAKIYGEIDQ 300 >gi|170578661|ref|XP_001894496.1| Transmembrane cell adhesion receptor mua-3 precursor [Brugia malayi] gi|158598882|gb|EDP36664.1| Transmembrane cell adhesion receptor mua-3 precursor, putative [Brugia malayi] Length = 1742 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 76/212 (35%), Gaps = 21/212 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P + + D+M ++D S S+ S+ + + ++E V+L Sbjct: 489 PGRVCTVQTTCPKQKTDLMFLIDGSGSIGSY---------VFKNEVLRFIKEFVELFDIG 539 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL+ +S++I F L + I + T + +++ + + Sbjct: 540 LDNTRVGLIQYSDQIRHEFDLSQYTDKASVISAISQVQYLTGLTRTGAAIQH---MVMEG 596 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRS 332 R+ E + ++ + +TDG + A+K ++++G+ + Sbjct: 597 FSERRGARKEGDDVARVSIVITDG-----RSQDNVTEPAXNARKSHINMFSVGVTDHVLG 651 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 E SP ++ V+ + + + Sbjct: 652 PELEAIAGSPLRWFHVDKFKDLDTRLRSLIQK 683 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 78/205 (38%), Gaps = 28/205 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S+S+ ++ K IN +++ + + V GLV +S+ + + Sbjct: 17 DLVFLIDGSKSVRP------ENFELVKKFINQIVDSLDVSEQNAQV---GLVQYSSSVRQ 67 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S ++ +K ++ T + L Y + D A +K+ Sbjct: 68 EFPLGRFTSKRDIKAAVKKMTYMEKGTMTGTALNY----LIDNTFAISSGARPGA--QKV 121 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 + TDG + D +AK G ++A+G+ E + P ++ Sbjct: 122 GIVFTDGRSQDYISDA-----AKKAKDLGFKMFAVGVGNAVEDELREIASDPVAEHYFYT 176 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + IGK + K + Sbjct: 177 AD----FKTINQIGKKLQKKICVEE 197 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 79.8 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 79/213 (37%), Gaps = 25/213 (11%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L ++ + L+ G +TN+ L+Y F R Sbjct: 78 GVVRYSDRPTTAFELGLFGSREEVKAAARRLAYHGGNTNTGDALRYITALSFS---PRAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPGDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGKALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P + V + ++A I + + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKIRGKLRRRLC 218 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 82/205 (40%), Gaps = 31/205 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ LD S SM S + K + + ++++ D Q+G+V++ N I Sbjct: 70 PIDVVFALDSSGSMGWNDPSGLRK-----TAAKSFVDKLNSTTD-----QAGVVSWDNNI 119 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L S ++ KI + G T+ GL A + + + + Sbjct: 120 DFTQTLTNNFSLVKSKIDAVDSSG-GTDLNVGLNAAISLLDTGKQANSSW---------V 169 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACA--SPNSF 345 I+F+++G+ + A +G VY IG+ + L+ A + + Sbjct: 170 IIFLSNGQGTYSHSTAVV------AANKGYTVYTIGLAISPGSTAESNLKDIANTTGGKY 223 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 Y N ++ F+ I K++VT I Sbjct: 224 YSSPNATNLDAVFNDIYKEVVTSTI 248 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 80/230 (34%), Gaps = 30/230 (13%) Query: 154 IVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I + ++ + + +D+M+ +DVS SM S D T++ A + A + Sbjct: 94 IALARPQIIRELREEQSEGIDIMLAMDVSVSM-SESDILPTRLAAARRVAQAFVR----- 147 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQ 269 + GLV F+ + L + L + + L+ T L N+ Sbjct: 148 --GRRNDRIGLVIFAGEAFSLCPLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALARCINR 205 Query: 270 IFDMQGMRQHCNTEDAN-----YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + D K+I+ ++DG+N + D + + AK +Y Sbjct: 206 MRDRPAASSDTTQAKTEQWKSERSKVIILLSDGDNTAGNLDPITA--ASLAKAFNIKIYT 263 Query: 325 IGIRVIRSHE---------FLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 I + + L+ A+ SF+ + + F+ I + Sbjct: 264 IAVGQPVASASEASTVDEGILKKIATIGKGSFFRAVDSGRLKTVFAQISQ 313 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 50/274 (18%), Positives = 100/274 (36%), Gaps = 35/274 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMPIT 159 + ++ S L+ + + YS+ S ++ K + + Sbjct: 769 YAVGTQNVSESKELEKIASYPHWKYSVPLESFLQLSVVRSKLINQLCSEMVDSKVSFRGM 828 Query: 160 SSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S + D+ ++D S S+ ++ IK + +K+ N Sbjct: 829 SYPLQEGCVHVEKADIYFLIDGSGSIN---PQDFLEMKAFIKEV------IKMFQIGPNR 879 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 VQ G++ +S+KI+ F+L V+ L+ I + + G T + L Sbjct: 880 VQFGVIQYSDKIQSQFILSQYPSVAELKVAIDNIQQGGGGTATGEALN---------NMT 930 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +T N + ++ +TDG++ ++ G I+YAIG+R E L Sbjct: 931 QVFADTGRINVARYLIVITDGKSSDP-----VAEAAEGLRENGVIIYAIGVREANIDE-L 984 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + A F++ E +D I K++V Sbjct: 985 KEIAKDKIFFVYE-----FDLLKDIQKEVVQDIC 1013 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 87/230 (37%), Gaps = 30/230 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 F + +V I SS + ++ D++ ++D S S+ + + + Sbjct: 997 EFDLLKDIQKEVVQDICSS---EACKTSKADIIFLIDGSESISP------EDFEKMKRFV 1047 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNST 260 +M+ + + D +Q GL+ FS+ +E F L S + I + + T + Sbjct: 1048 ASMVNQSNIGTDG---IQIGLLQFSSIPQEEFRLNQYSSKVDIYSAIFDVQQMRDGTRTG 1104 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L + +G + ++ ++ +TDG + + + Sbjct: 1105 KALNFTLPFFDSSKG-------GRPSVQQYLIVITDG-----VAQDNVIIPAKALRDKNI 1152 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 I++AIG+ + + L + Y N ++A ++ K+I++K Sbjct: 1153 IIFAIGVGEAKKSQLLEITNDEDKVYHDVN----FEALQNLEKEILSKVC 1198 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 47/278 (16%), Positives = 100/278 (35%), Gaps = 26/278 (9%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 LR G + + + ++ + + + ++ Y L + Sbjct: 155 AEALRRAGILLYATGVRDAVWAELREIASSPQENFTSFVPYFSGLSNLAQKLRQELCDML 214 Query: 155 --VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P V + A D++ ++D S S+ + K+ + S+ L Sbjct: 215 AKAAPRVDHVSPACREAALADIVFLVDSSTSIG---PQNFQKVKNFLYSVVLGL------ 265 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQ 269 ++ V+ GL +++ I F L S + +I+ L TN+ L++ N Sbjct: 266 DISSDHVRVGLAQYNDNIYPAFQLNQHPLKSTVLEQIQNLPYRTGGTNTGSALEFIRTNY 325 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + G R +I++ +TDGE + + + K+ G +VY +G+ V Sbjct: 326 LTEESGSRA-----KDRVPQIVILVTDGE-----SNDEVQEVADRLKEDGVVVYVVGVNV 375 Query: 330 IRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + + P + EN + + D I + + Sbjct: 376 QDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTL 413 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 81/244 (33%), Gaps = 24/244 (9%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSMES 186 I++ K F S I+ + S+V+ + A D++ + D S+ Sbjct: 384 IASEPFEKFLFNTENFNILQDFSGSILQTLCSAVEGKIKEFIKAYADVVFLADTSQDTSW 443 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQ 244 +S ++ + + MLE + Q GL + ++ FLL + Sbjct: 444 ---ASFQRMQNFLSRVVGMLEV------GRDKYQIGLAQYGDQGHTEFLLNTYKNQKEMI 494 Query: 245 RKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I + G S + L+Y F + ++ + V + S K Sbjct: 495 AHIHERFVPLGGSRRTGKALQYLLQTFFQEEAGSRYL----QGIPQYAVVI-----NSGK 545 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 + +++G V ++G++ E L SP+ Y ++ + + Sbjct: 546 SKDEVQDAAQRLREKGVKVMSVGVQDFDRRE-LEGMGSPDLVYDMQREDEVRHIVEDVNV 604 Query: 364 DIVT 367 I Sbjct: 605 VIQG 608 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 63/170 (37%), Gaps = 17/170 (10%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVS 241 +E F + + ++ + + PD V+ GLV +S + F L+ + Sbjct: 638 IEEFSRVRQPNFQQVVNFLKTIVSSLSIHPD---TVRFGLVFYSEEPRLEFSLDTFQNPA 694 Query: 242 HLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + L+ T + L + N++F + + + ++I V + +G Sbjct: 695 KILEHLDKLTYRERRGRTKTGAALDFLRNEVFIQEKGSRSNHG----VQQIAVVIMEG-- 748 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLV 348 + ++ G +YA+G V S E + + P+ Y V Sbjct: 749 ---FSQDSVSRPASHLRRAGITIYAVGTQNVSESKELEKIASYPHWKYSV 795 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 49/150 (32%), Gaps = 12/150 (8%) Query: 200 KSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 +S+ L V + ++ GL +S+ FLL + + R I+ Sbjct: 49 RSVRNFLYILVNSFNVSSKTIRVGLAQYSDVPHSEFLLSTYHRKADVLRHIRQFKFKPGG 108 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 LK+ I D + +I + ++ G + + Sbjct: 109 KKMGLALKF----ILDHHFQEASGSRASQGVPQIAMVISSG-----PVEDHVHGPAEALR 159 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 + G ++YA G+R E +SP + Sbjct: 160 RAGILLYATGVRDAVWAELREIASSPQENF 189 >gi|2654431|gb|AAC01506.1| type XII collagen [Homo sapiens] Length = 517 Score = 79.8 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 54/267 (20%), Positives = 101/267 (37%), Gaps = 38/267 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++T+L + + + Y IS + + I T + + + V + + Sbjct: 86 PQTTTLSVRDLSADTEYQISVSAMKGMTSSEPISIMEKTQPMKVQVECSRGVDIKA---- 141 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 DM +V D S S+ ++ K+ ++ + VK N VQ LV +S Sbjct: 142 --DMFLV-DGSYSIG---IANFVKVRAFLEVL------VKSFEISPNRVQISLVQYSRDP 189 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + I G STN+ + Y +IF +N Sbjct: 190 HTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF------VPSKGSRSNVP 243 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 K+++ +TDG++ D + + ++A+G++ L A ASP Sbjct: 244 KVMILITDGKSSDAFRD-----PAIKLRNSDVEIFAVGVKDA-VDSELEAIASPPAETHV 297 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + VE+ +DAF I ++ T+ I Sbjct: 298 FTVED----FDAFQRISFEL-TQSICL 319 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 79.4 bits (194), Expect = 7e-13, Method: Composition-based stats. Identities = 68/427 (15%), Positives = 134/427 (31%), Gaps = 97/427 (22%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G I+TA+ + + GM ++ +H ++T L++ D + V I + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G+ + T + ++ ++ +D+ +S Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSG----ELAEVHVDLGIDVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFD 189 + +P F + I+ + QT A +D I+LD + SM Sbjct: 121 FTAT--VPTTFMQIFGRD------SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATP 172 Query: 190 SSITK--------------------------------IDMAIKSINAMLEEVKLIPDVNN 217 S ++K ID+ ++ A+ + K ++ Sbjct: 173 SDVSKLEAKVGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSD 232 Query: 218 VVQSGLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLS---------KFGVSTNSTPGL 263 + G+ TF K E+ L ++ ++ + TN + Sbjct: 233 QFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKSYTDAVDLMTIPYQNYNSDQITNFDSAM 292 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY------------ 311 N I D G T + + +KI+ F++DG S K + Sbjct: 293 -TQMNTIIDPAGDG----TSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTS 347 Query: 312 -CNEAKKRGAIV---YAIGIRVIRSH--------------EFLRACASPNSFYLVENPHS 353 C K RG + Y + + + ++ACASP ++ V Sbjct: 348 FCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDG 407 Query: 354 MYDAFSH 360 + DA Sbjct: 408 ITDAMKA 414 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 79.4 bits (194), Expect = 7e-13, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 66/171 (38%), Gaps = 26/171 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+MI LD SRSM + D T+++ A + I ++ ++ ++GLV Sbjct: 82 QTVEQKGVDIMICLDCSRSMLAQ-DIKPTRLERAKREIIDLMGMIQSD-------RAGLV 133 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F+ + L S + L T+ ++ A N Sbjct: 134 AFAGRAILQCPLTLDHSAFNLFLNALEPDYLPVGGTDLGGAIETALN-----------GF 182 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ +K I+ +TDGEN S+ +A +G ++ IG+ Sbjct: 183 EKEVESEKAIILITDGENT----TGDSIEMAKKAADQGVKIFCIGVGSPEG 229 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 85/233 (36%), Gaps = 49/233 (21%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P SV ++T+ +D+++ +DVS SM + D +++ K V P+ Sbjct: 79 PRNVSVSKRTKTNRGIDIVMAIDVSASMLA-RDLKPNRLEALKKVAVDF---VDRRPND- 133 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQ 274 + G+V ++ + + + ++R I L + T GL N++ D + Sbjct: 134 ---RIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGTAIGMGLGSRVNRLKDSK 190 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------ 328 K+I+ +TDG N + D + AK+ G VY IGI Sbjct: 191 AKS-----------KVIILLTDGVNNAGNIDPTTAT--ELAKELGIKVYTIGIGTNGMAD 237 Query: 329 ------------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 V + L+ A + ++ + S+ + + I Sbjct: 238 FPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKYFRATDNTSLKEIYDEI 290 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 79/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+VLD+S SME ++++ + ++ +E+ ++ Sbjct: 81 EPKPVAREGYNIMMVLDLSGSMEITDMLLHGRPVSRLLVVKRAAEQFVED-RVGD----- 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + + +I + G +T+ + A ++ D+ Sbjct: 135 -RIGLILFGTRAYLQTPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPSK 193 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------- 328 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 194 G-----------RVIILLTDGANN--SGVLAPLKAAELAKQDGIKIYTIGLGSEADPRAL 240 Query: 329 ----------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ +P S+ + I + Sbjct: 241 TGDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSIYQTINQ 287 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 79.4 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 78/495 (15%), Positives = 146/495 (29%), Gaps = 164/495 (33%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 +L R F + G MT + + ++ + G+ I++ + +T + + +DR+++ AA Sbjct: 22 ALATRARRFAGDESGSMTYMAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAA 81 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV- 121 NEL G V D D + I V Sbjct: 82 ------------------------------DLDNELEAQGVVEDYMDKMALADALISVDV 111 Query: 122 PQNEGYSISAISRYK-IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + Y YK +P F + ++ + +++ +VLD+ Sbjct: 112 DEGLNYRTVVAEGYKTMPSNFMQIL----GVDNLQAYGLAEATERINK---VEVSLVLDI 164 Query: 181 SRSMESF------FDSSITKIDMAIKSINAMLEEVKLIPDV------------------- 215 S SM+ D++ T ID + N L + L+P Sbjct: 165 SGSMDDNDKLANMQDAAGTFIDTLLAEGNEDLVSISLVPYSEQVNAGPEILSYLSANWKH 224 Query: 216 ---------NNVVQSGLVTFSNKIEEFFLLEWG--------------------------- 239 N+V S + FS E+ +W Sbjct: 225 GYSHCIEMPNSVFGSAALDFSRTYEQMQHYQWNYDGYNNTLSDTVCPRYGYERIQAWSHD 284 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI----------------FDMQGMRQHCNTE 283 S L+ ++ L T+ G+K+ + D + + Sbjct: 285 ASALKAQVNQLQPR-AGTSIFMGMKWGTALLDPSTRPIASGMIARGSVDQVFEGRPVAYD 343 Query: 284 DANYKKIIVFMTDGEN-------------------------------------------- 299 D + K +V MTDG++ Sbjct: 344 DTDVLKTVVLMTDGQHDRSYRIQDWAYNSESEYAHWNRYNLWYYLSRYVSSYERSSFYYQ 403 Query: 300 --LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYD 356 + D C AK +G I++++G V + + +CAS + + + + Sbjct: 404 KYNADLGDALLGSICAAAKAQGIIIWSVGFEVGDHGADVMESCASSPAHFFRVEGVEITE 463 Query: 357 AFSHIGKDIVTKRIW 371 AFS I + R+ Sbjct: 464 AFSTIAHTLNQLRLT 478 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 80/212 (37%), Gaps = 25/212 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 D++ +LD S S+ + + + +++ ++ PD + G Sbjct: 3 DRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRVG 53 Query: 223 LVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++ F L S ++ + L+ G +TN+ L+Y + F R Sbjct: 54 VVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PRTGG 110 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D YK++ + +TDG + D + + + G ++A+G+ E + Sbjct: 111 RPGDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIAS 165 Query: 341 SPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 P + V + ++A I + + Sbjct: 166 EPKSAHVFHVSD----FNAIDKIRGKLRRRLC 193 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 68/453 (15%), Positives = 136/453 (30%), Gaps = 113/453 (24%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F + +G + IL + ++F ++G+ ++ ++ + D +++ A + Sbjct: 29 IEKFSRDTRGDVAILFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARAL-- 86 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + NG ++ L ++ + + +N L ND + + + +V N Sbjct: 87 QTNGGDQ-----AAALRVAQSYYAQATKNRL---SLSNDTINFAIADNATAMVTTGNAVI 138 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + + S+ + + V + L++ ++LD++ SM Sbjct: 139 TTPFMGLAGTGSLPILRKDGSDYSKAV-------LAVGGNAELNLEIAMMLDITGSMR-- 189 Query: 188 FDSSITKIDMAIKSINAMLEEVK--------LIPDVNNVVQSGLVTF---SNKIEEFFLL 236 +T + A + ++ I V+ F + + Sbjct: 190 -GQKLTDMKAAASDLLNIVVWTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPC 248 Query: 237 EWGVSHLQRKIKYLSKFGV-----STNSTPGLKYAY------------------------ 267 + ++ G +T+ST K Y Sbjct: 249 VVERTGTEKYTDAAPATGKYVMVHNTSSTKKNKTTYSPTCDVASSAEVLPLTSDKSTLLA 308 Query: 268 --NQIFDMQGMRQHCNT-------------------------EDANYKKIIVFMTDGE-- 298 N + H T N +KI V MTDGE Sbjct: 309 KVNGLSTAGSTAGHIGTAWAWYMLAPNWSSLWTSASSTPAAYNADNLRKIAVLMTDGEYN 368 Query: 299 ----NLSTKEDQQSLYYCNEA-----------------KKRGAIVYAIGIRVIR--SHEF 335 +D SL C A K +G VY +G ++ + + Sbjct: 369 TQYTTNGVPDDSSSLTRCPNAANGVCSSAQAVSQCTAMKAKGIEVYTVGFQLDNQTAIDT 428 Query: 336 LRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 L CA+ + FY ++ AF I I T Sbjct: 429 LSQCATDSSHFYNSTTGDALKAAFRDIALKIST 461 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 60/410 (14%), Positives = 128/410 (31%), Gaps = 61/410 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVL----------HSMIDRSLVH 60 F + G I+TA+ + + GM ++ +H ++T L V Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVGSIAEKSGAVA 67 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNM-SFRNELRDNGFVNDIDDIVRSTSLDIV 119 AA + G + K L ++ + + N ++ V T++ Sbjct: 68 AAMTMTGNGTISLGKTDARSIFLSQVSGELADVNVDLGIDVTKTANKLNSQVSFTAVVPT 127 Query: 120 VVPQ---NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + + +IS + + F F N+ + + T+ + + Sbjct: 128 TFMRVLGKDSITISGTATAEYLTASFMDFYILLDNTPSMGVGATAKDVATMEKNTSDSCA 187 Query: 176 IVLDVSRSMESFFDSSIT-----KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + ++++ + T +ID+ ++ + K N + G+ TF K Sbjct: 188 FACHETENKNNYYNLAKTLGVSMRIDVVRQATKELTLTAKSTRVSTNQFRMGVYTFGTKA 247 Query: 231 EEF-----FLLEWGVSHLQRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 E+ + ++ + K G + + A Q+ D+ Sbjct: 248 EDANLTTISDPTDDLDKVRTYTDAVDLMTIPKQGYNNDQQTSFDNALTQMKDII-TTPGD 306 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYY-------------CNEAKKRGAIV----- 322 + +KI+ F++DG S K + C K +G + Sbjct: 307 GSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCKPLKDKGIRIAVLYT 366 Query: 323 ----------YAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSH 360 Y I +S ++ CASP ++ V + DA Sbjct: 367 TYLPLPKNSWYNTWISPFQSQIPTKMQECASPGLYFEVTPTEGIADAMKA 416 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 48/265 (18%), Positives = 98/265 (36%), Gaps = 35/265 (13%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 N+ ++V + +D V N YSI+ S + + + +S H+ T Sbjct: 505 NEKQNVVEANFVDKAVCRTNGYYSINVPSWFSLHKVVQPLVKRICDSDHLACSKTCL--- 561 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D+ V+D S S+ ++ + + +E ++ + G V Sbjct: 562 -----NSADIGFVIDGSSSVG------TGNFRTVLQFVANISKEFEISDTD---TRIGAV 607 Query: 225 TFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ ++E F + + IK ++ + T++ + YA+ Q+F Sbjct: 608 QYTYEQRLEFGFEKQSTKQDVLNAIKRINYWSGGTSTGAAINYAFEQLF---------IK 658 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 N +KI++ +TDG + A + G I Y++GI E P Sbjct: 659 SKPNKRKIMILITDG-----RSYDDVQGPATAAHQNGVITYSVGIAWAAQDELEAIATDP 713 Query: 343 N--SFYLVENPHSMYDAFSHIGKDI 365 + + V+ S+Y I ++I Sbjct: 714 DKEHSFFVDEFDSLYRFVPRIIQNI 738 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 79.4 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 73/197 (37%), Gaps = 26/197 (13%) Query: 137 IPLKFCTFIPWYTNSRHIVMPIT---SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 L+ F +++ + S + ++ +D+++ +D+S SM + D Sbjct: 50 YYLRHLPFALRCAAVALLIVALARPQSVDEGSTSNTEGIDIVLAIDISTSMLAQ-DLQPD 108 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 +I A + + + + + GLV F+ + L LQ + L Sbjct: 109 RIQAAKQVAGNFITD-RPGD------RIGLVAFAGEAFTQSPLTTDQGTLQTLLGRLRSG 161 Query: 254 --GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 T GL A N++ + K+I+ +TDGEN + + L Sbjct: 162 VVEDGTAIGNGLATAINRLRESNAKS-----------KVIILLTDGENN--RGEIAPLTA 208 Query: 312 CNEAKKRGAIVYAIGIR 328 A+ +G VY IG+ Sbjct: 209 AEIARDQGIRVYTIGVG 225 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 72/201 (35%), Gaps = 31/201 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEV--- 209 + SS + + LD+++VLD S SM+ + + +ID + N ++ + Sbjct: 95 TLSAISSTSDTTISGKPLDIVMVLDASGSMDDPMGTGDNTKRIDALKTAANTFIDAIAAQ 154 Query: 210 -KLIPDVNNVVQSGLVTFSNKIEEF-----------------FLLEWGVSHLQRKIKYLS 251 + I D + + +V F+ K + ++ S + L Sbjct: 155 NQSITDASKQHRVAIVKFAGKKKTDKVGNDTYRDGRYTYNYSQTMKNLTSCKGKDADSLK 214 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQ 306 + N + Y N+ A+ KKI+VF TDG S+ Sbjct: 215 DTVGNINPAGSTQADYGLELAENIT---INSGRADAKKIVVFFTDGSPTSSSGFQASVAD 271 Query: 307 QSLYYCNEAKKRGAIVYAIGI 327 ++ K GA +Y IGI Sbjct: 272 SAIASAKSLKANGADIYTIGI 292 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 43/335 (12%), Positives = 99/335 (29%), Gaps = 41/335 (12%) Query: 6 LNIRNFFYNYKGG--MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 + ++N +GG + + T + ++ ++G+ I+ + ++ L S +D + AA Sbjct: 1 MRVKNTSKRRQGGQAIVMFTLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAAR 60 Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS----LDIV 119 + + N + + + + F +D S+ + Sbjct: 61 LLGSGSNAAQQLSMAKATAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGS 120 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + +++ + +P F I MP + + + + +++V+D Sbjct: 121 GISTYKVRTVAVKATATMPTLFMRI---------IGMPTVTVSGSGTASRRDVRVILVMD 171 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM +++ + + + GLV + + Sbjct: 172 RSSSMGTYYSGINQTPPSINDMALKFVNSFSGAGEFGGRDEVGLVVYGGSGIVAYPPRDI 231 Query: 240 VSHLQRKIKYLSK------------------FGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 K+ G +T + L AY + R Sbjct: 232 TKDYTDYTKFTPPDNNFKASGNIPKYIADITSGSNTGTAEALYLAYMTL------RADAA 285 Query: 282 TEDANYKK--IIVFMTDGENLSTKEDQQSLYYCNE 314 T K +IV TDG N+ Sbjct: 286 TNPDLATKLNVIVLFTDGIPNGVTAMANDKTIANQ 320 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 65/459 (14%), Positives = 132/459 (28%), Gaps = 103/459 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ F + K + + + + +I ++ G+ I+VS + L ID + + A + Sbjct: 13 LKGFTGDRKASILPVFGLMVVLIVVIAGITIDVSRTVNAREKLSFAIDAAALSVAADLST 72 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDN-GFVNDIDDIVRSTSLDIVVVP---Q 123 + + K D N ++ F +E N FV D ++ S + Sbjct: 73 SVMSDEQIKAALADSFK--ANLADVEFLDEAIKNLSFVVDAENGTIKVSSFATLDNYFID 130 Query: 124 NEGYSISAIS--RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV---- 177 GY + A+ + + +V+ +T S++ + T +V Sbjct: 131 MGGYGMQALGPETFNFGTSSQVTYSRFDVELALVVDVTGSMRNDMDTLRDASKGLVNILI 190 Query: 178 ----------LDVSRSMESFFDSSITKIDMAIKSINAMLEE------------------- 208 + +S S + T + + Sbjct: 191 PETTEEADSKVRISLVPYSQGVNLGTYAAKVKGGVYGYADSSVCVTERQDYDDGEDIYKV 250 Query: 209 ----------VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 VK P +V G + + L L I L G + Sbjct: 251 RYTDMPYNYYVKTDPPPKDVFYGGGSNRCSGTSKMIPLTADRDTLLDAIADLDDNGGTAG 310 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYK-----KIIVFMTDGENL------------- 300 T G+ + +N I ++ Y K + MTDG+N Sbjct: 311 QT-GVVWGWNSISPNYSDVWPLASKPEPYDNDDVLKFAIIMTDGDNNRFYEFVKEREECD 369 Query: 301 ------------------------------STKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + C K G ++ + Sbjct: 370 WVYSRRYGWQWTCEMVSVNQWQERSESESYNNNSSKAQRALCQAMKDEGISIFGVYFGTN 429 Query: 331 R---SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + +++CAS ++Y + + +AF++I K I Sbjct: 430 DSSAGSKNMQSCASTGNYYKATSSDELINAFANIAKKIQ 468 >gi|149066378|gb|EDM16251.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1127 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 48/319 (15%), Positives = 112/319 (35%), Gaps = 60/319 (18%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS 115 + + +I +E + + + D++ + + + + + + Sbjct: 297 NADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQD------------- 343 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 ++ Y ++ Y T + + A+ D++ Sbjct: 344 ----KEIKDTEYKVTVTPIY-------------TVGEGVSVSAPGK---TLCKAAKADLV 383 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S S+ D + KI I + + + + I + Q +V F++ F Sbjct: 384 FMVDGSWSIG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFK 435 Query: 236 LEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ + L I+++S G +T + +K+ + +F + K+IV Sbjct: 436 LDAYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTADSGTRRG------IPKVIVV 489 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENP 351 +TDG + E + G ++AIG+ E +R + S + V++ Sbjct: 490 ITDG-----RSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVFFVDD- 543 Query: 352 HSMYDAFSHIGKDIVTKRI 370 +DAF I +++T Sbjct: 544 ---FDAFKKIEDELITFVC 559 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 47/263 (17%), Positives = 93/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K N P +K +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPRVKAVDKGNGSKSTSP--EEIKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|291388581|ref|XP_002710673.1| PREDICTED: collagen, type XXII, alpha 1 [Oryctolagus cuniculus] Length = 1571 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 83/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ P + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGP---EHTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + ++ G +TN+ L++ + F Q Sbjct: 78 GVVRYSDRPSTAFELGHFSSREEVKAAARRIAYHGGNTNTGDALRFITARSFSPQ---AG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D ++++ + +TDG + D + + G ++A+G+ E Sbjct: 135 GRPGDRAFQQVAILLTDGRSQDLVRDAAAAAH-----AAGIRIFAVGVGEALREELHEIA 189 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + V + +DA I + + +R+ + Sbjct: 190 SEPTSAHVFHVSD----FDAIDKI-RGKLRRRLCEN 220 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 68/427 (15%), Positives = 134/427 (31%), Gaps = 97/427 (22%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G I+TA+ + + GM ++ +H ++T L++ D + V I + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G+ + T ++ ++ +D+ +S Sbjct: 65 AVAAAMAMNGNGTISLGKTDARDIFMSQVSG----ELAEVHVDLGIDVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFD 189 + +P F + I+ + QT A +D I+LD + SM Sbjct: 121 FTAT--VPTTFMRIFGRD------SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATP 172 Query: 190 SSITK--------------------------------IDMAIKSINAMLEEVKLIPDVNN 217 S ++K ID+ ++ A+ + K ++ Sbjct: 173 SDVSKLEAKVGCAFACHQMDKSTNNYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSD 232 Query: 218 VVQSGLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLS---------KFGVSTNSTPGL 263 + G+ TF K E+ L ++ ++ + TN + Sbjct: 233 QFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDAVDLMTIPYQNYNSDQITNFDSAM 292 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY------------ 311 N I D+ G T + + +KI+ F++DG S K + Sbjct: 293 -TQMNTIIDLAGDG----TSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTS 347 Query: 312 -CNEAKKRGAIV---YAIGIRVIRSH--------------EFLRACASPNSFYLVENPHS 353 C K RG + Y + + + ++ACASP ++ V Sbjct: 348 FCKPLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTDG 407 Query: 354 MYDAFSH 360 + DA Sbjct: 408 ITDAMKA 414 >gi|326677363|ref|XP_691404.5| PREDICTED: collagen alpha-1(XII) chain-like [Danio rerio] Length = 3085 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 50/267 (18%), Positives = 102/267 (38%), Gaps = 37/267 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 TS+++ + + Y IS + + T + + + V V + Sbjct: 363 PTQTSVNVRDLSPDTEYEISLFALKGLTPSEAVMAMEKTQPLKVSLECSLGVDVQA---- 418 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++++D S S+ IT + ++ + P+ VQ LV +S Sbjct: 419 --DIVLLVDGSYSIG------ITNFAKVRAFLEVLVNTFDIGPNK---VQISLVQYSRDP 467 Query: 231 EEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L ++ + + ++ G STN+ + Y +IF N Sbjct: 468 YTEFYLNTHHDLNAVVKAVRTFPYRGGSTNTGKAMTYVRERIFIAT------RGARENVP 521 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 ++ + +TDG++ +D + + ++A+G++ E L A A+P Sbjct: 522 RVTILITDGKSSDAFKD-----PAAKLRNTDVEIFAVGVKDAVRSE-LEAIANPPAETHV 575 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y VE+ +DAF I ++ T+ I Sbjct: 576 YTVED----FDAFQRISNEL-TQSICL 597 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 49/321 (15%), Positives = 94/321 (29%), Gaps = 38/321 (11%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILC--RIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 + + + IL ++ TW D V D Sbjct: 8 AVALFTLAALTSVRAQVMPPSDLRFKILNENTVQMTWKQPLSRV--DGFRVQVTSDTEEP 65 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 + + + +++ +PI+ + S A D Sbjct: 66 VKEFTLSPTSTKTSIRDLTPDVDYVVTITSYLGSEE-----SIPISGQITKCSI-GAIAD 119 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S S+ I A+ L D + G+V +S Sbjct: 120 LVFLVDGSWSVG------RENFKFIRNFIAALAGAFDLGEDK---TRVGVVQYSTDTRTE 170 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L + R I L G +T + L Y +F + ++ Sbjct: 171 FNLNQHFRRVDLLRAINNLPYKGGNTMTGEALDYLLKNMFT------EAAGARKGFPRVA 224 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVE 349 V +TDG++ E Y + K G ++ +GI+ E + ++P Y V Sbjct: 225 VVITDGKSQDPVEG-----YAKKLKNAGVELFTLGIKEADEEELKQMSSTPYRTHVYTVP 279 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 N +D + K + + Sbjct: 280 N----FDMIKAVEKSFIAQVC 296 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 44/247 (17%), Positives = 82/247 (33%), Gaps = 25/247 (10%) Query: 124 NEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 YS++ + T S P +T A+ D+++++D S Sbjct: 1131 GTKYSVAVFGMFDGGQSMPLAGEEKTTLSDAPDSPPVKYSGNECKTSAKADIVLLVDGSW 1190 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 S+ I M+ + PD VQ GL +S + + L + Sbjct: 1191 SIGRL------NFKTIRAFIGRMVGVFDIGPDK---VQIGLAQYSGDPKTEWHLNAHPTR 1241 Query: 243 LQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + L G +T + L Y F + + +KI V +TDG Sbjct: 1242 ASLLDAVANLPYKGGNTMTGMALNYILQNNF------RPNVGMRPDSRKIGVLVTDG--- 1292 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAF 358 K + + + G +YAIG++ +E P+ Y V + + D Sbjct: 1293 --KSQDEIVVNSQRLRDSGIELYAIGVKNADENELRSIATDPDEIHMYNVNDFSFLLDIV 1350 Query: 359 SHIGKDI 365 + +++ Sbjct: 1351 DDLTENL 1357 >gi|148697326|gb|EDL29273.1| procollagen, type XIV, alpha 1, isoform CRA_b [Mus musculus] Length = 1093 Score = 79.0 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 47/319 (14%), Positives = 111/319 (34%), Gaps = 60/319 (18%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS 115 + + +I +E + + + D++ + + + + + + Sbjct: 297 NADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQD------------- 343 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 ++ Y ++ Y T + + A+ D++ Sbjct: 344 ----KEIKDTEYKVTVTPVY-------------TVGEGVSVSAPGK---TLCKAAKADLV 383 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S S+ D + KI I + + + + I + Q +V F++ F Sbjct: 384 FMVDGSWSIG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFK 435 Query: 236 L-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + L I+++S G +T + +K+ + +F + K+IV Sbjct: 436 LDSYKTKETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVV 489 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENP 351 +TDG + E + G ++AIG+ E ++ + S + V++ Sbjct: 490 ITDG-----RSQDDVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD- 543 Query: 352 HSMYDAFSHIGKDIVTKRI 370 +DAF I +++T Sbjct: 544 ---FDAFKKIEDELITFVC 559 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 94/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K + N P VK +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPKVKVVDKGNGSKPTSP--EEVKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 64/196 (32%), Gaps = 26/196 (13%) Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGVSHLQRKIKYLSKFGVS---- 256 + + K++ GLV S + L + + IK L Sbjct: 290 VKNQVASSKVVMPDPTTPYGGLVQTSQTCLNPILPLSNDATVVTNTIKGLVVNIGGYKPE 349 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHC--NTEDANYKKIIVFMTDGENL-------------- 300 T G+ + N + + + + +K IV MTDG N Sbjct: 350 TYIPGGMIWGVNALTPPAPFTEGKPYDANNKEPRKTIVLMTDGANTLYANTSGGIAVANA 409 Query: 301 --STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASP-NSFYLVENPHSMY 355 + C+ AK + +Y IG V S L+ACA+ ++ ++ + Sbjct: 410 TQVAVTYSDQIRVCDYAKSKKIEIYTIGFDVTDSKALSTLKACATDAQHYFDAKSSADLI 469 Query: 356 DAFSHIGKDIVTKRIW 371 AF IG + R+ Sbjct: 470 KAFETIGGKLSKVRLT 485 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 79/221 (35%), Gaps = 32/221 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G T++ + + I LG I+ + + K +D +++ AA I+N N Sbjct: 15 FLRSRGGNTTMIFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAA--IVNNSN 72 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + KKL F+ L + + + Sbjct: 73 EGDLKKLAADV------------FKENLGAADLDAKVTAFK--------YDAKARTVKAT 112 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A Y I +++ +TS D L++ +VLD + SM + + Sbjct: 113 AQGSYD------PVIMQLFGFKNLPYAVTSDAIKA--ADGTLEVALVLDNTWSMSATVNG 164 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + KID+ + ++ + L D + V+ +V +++ + Sbjct: 165 TP-KIDILKTAAQGLVSTI-LTKDNKDYVKIAVVPYADYVN 203 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 81/238 (34%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P V ++T +D++I +DVS SM + D +++ K + + Sbjct: 74 LARPRNVEVSTKTKTTKGIDIVIAIDVSASMLAK-DLRPNRLEALKKVAASFI------- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQI 270 + + GL+ ++ + + S + +K + T GL N++ Sbjct: 126 NGRPNDRIGLIEYAGESFTKTPITSDKSIVLSALKSIQYNNIIEGGTAIGMGLATGVNRL 185 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 + K+I+ MTDGEN + + D + A++ G VY IG+ Sbjct: 186 -----------KDSKALSKVIILMTDGENNAGQIDPRIA--AELAQEFGIKVYTIGMGTN 232 Query: 329 ---------VIRS------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 E L A + ++ N + + + I K Sbjct: 233 GMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQYFRATNNEKLQEIYDEIDK 290 >gi|260834334|ref|XP_002612166.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] gi|229297540|gb|EEN68175.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] Length = 815 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 77/227 (33%), Gaps = 28/227 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + + + ++ A +D+ +LD S S+ + Sbjct: 280 ITNSVRGAGAHGLSVQNPRGSGSSTCE-APVDLFFLLDGSGSV------KAANFAKVKQF 332 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNS 259 M+ + P + G++ +SN+ F L V+ I +S G T + Sbjct: 333 AVDMVNSFDVSPAA---TRVGVLQYSNRNTLVFNLGNKVNKPTTVSAINSISYQGGGTRT 389 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L+Y +R + N K+++ +TDG K + Sbjct: 390 GAALQY----------IRGNAAWRRGNVPKVLIVLTDG-----KSEDSVSGPSQNLVSDR 434 Query: 320 AIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDI 365 VYAIG+ E L+ + ++ + + +++ I +D+ Sbjct: 435 VEVYAIGVSNFDHEELLQIVNNKQSNVIELNDFNALATKIDEIAQDV 481 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 56/318 (17%), Positives = 115/318 (36%), Gaps = 26/318 (8%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 M L +++ A ++G + G I+ R + + Sbjct: 111 MGQALQFILEHHFREGAGSRASQGVPQVAVVVSSGLTEDHIREPAEALRRAGIL--VYAI 168 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + D ++ +I P++ ++ + +P P ++ T Sbjct: 169 GVKDASQAELREISSSPKD-NFTFFVPNFPGLPGLAQKLRPELCSTLGKAAQYTERESPA 227 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D++ ++D S S+ + K+ + S+ V + ++ VQ GLV Sbjct: 228 CSEASPADIVFLVDSSTSIG---LQNFQKVKHFLHSV------VSGLDVRSDQVQVGLVQ 278 Query: 226 FSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNT 282 +S+ I F L+ S + +I+ L T++ L++ N + +M G R Sbjct: 279 YSDNIYPAFPLKQSSLKSAVLDRIRNLPYSMGGTSTGSALEFIRANSLTEMSGSRA---- 334 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I+V +TDGE+ +D ++ K+ G V+ +GI + E + P Sbjct: 335 -KDGVPQIVVLVTDGESSDEVQD-----VADQLKRDGVFVFVVGINIQDVQELQKIANEP 388 Query: 343 -NSFYLVENPHSMYDAFS 359 F S+ A S Sbjct: 389 FEEFLFTTENFSILQALS 406 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 46/355 (12%), Positives = 121/355 (34%), Gaps = 42/355 (11%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S ++ ++NF ++++ + + + +G++ +I+ + S + +++ Sbjct: 243 STSIGLQNFQKVKHFLHSVVSGLDVRSDQVQVGLVQYSDNIYPAFPLKQSSLKSAVLDRI 302 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR-------DNGFVNDIDDIVRS-- 113 + G + + ++ + D +++ D+ Sbjct: 303 RNLPYSMGGTSTGSALEFIRANSLTEMSGSRAKDGVPQIVVLVTDGESSDEVQDVADQLK 362 Query: 114 --------TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 ++I V + + I+ + F S ++ + S+V+ Sbjct: 363 RDGVFVFVVGINIQDVQELQK--IANEPFEEFLFTTENFSILQALSGTLLQALCSTVERQ 420 Query: 166 SQ--TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + T D++ ++D S +S I+ ++ +++ D Q GL Sbjct: 421 MKKSTKTYADVVFLIDTS------QGTSQASFQWMQNFISRIIGILEVGQDK---YQIGL 471 Query: 224 VTFSNKIEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S++ FL + + I L F G S + GL++ + F + Sbjct: 472 AQYSDQGHTEFLFNTHKTRNEMVAHIHELLVFQGGSRKTGQGLRFLHRTFFQEAAGSRLL 531 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + +V +T G K + + +KRG + ++G++ E Sbjct: 532 ----QGVPQYVVVITSG-----KSEDEVGEVAQILRKRGVDIVSVGLQDFDRAEL 577 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 50/149 (33%), Gaps = 12/149 (8%) Query: 201 SINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST 257 S+ L + + + ++ GL +S+ FLL + + I+ L Sbjct: 50 SVRNFLYILANSLQVGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPGGN 109 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 L++ I + + ++ V ++ G + ++ Sbjct: 110 RMGQALQF----ILEHHFREGAGSRASQGVPQVAVVVSSG-----LTEDHIREPAEALRR 160 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFY 346 G +VYAIG++ E +SP + Sbjct: 161 AGILVYAIGVKDASQAELREISSSPKDNF 189 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 87/213 (40%), Gaps = 22/213 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ D D+ LD S S F++ + IKS+ + I ++ V+ G Sbjct: 1830 EIGEINDIPTDIAFALDASAS---FYEEGFQQEKDFIKSV------IDKIELSSSGVRVG 1880 Query: 223 LVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++T+S++ + ++ +++ I + + T GL+ A + G Sbjct: 1881 VLTYSDEAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLEAAKELFLEKSG----- 1935 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + KK+++ +TDG+ + + + Y E + G ++AIGI + L Sbjct: 1936 --GRGSSKKVLILLTDGQQTYIPDAKDPVDYAKELAEYGVDIFAIGIGDEINKVDLEDLI 1993 Query: 341 S-PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S P +L ++ +S+ + KDI T Sbjct: 1994 SKPQHIFLSDDINSLIT---DLSKDISTALSCE 2023 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/208 (19%), Positives = 80/208 (38%), Gaps = 25/208 (12%) Query: 169 DARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 D+ +LD SRS+ F+ + + +L E P + Q+G++ + Sbjct: 1250 GDPTDIAFILDSSRSVTRDHFNRQ-------KEFVKTILGEF---PLGEELTQAGIIKYG 1299 Query: 227 -SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTED 284 + IE F + L I + + LK A+ ++F QG R D Sbjct: 1300 RTADIEINFGEFLTQTDLFNAIDKIKHSQADESRLDLALKKAHEELFTSQGARS-----D 1354 Query: 285 ANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + +K IV + DG N S + + + ++ G +++ IG+ + L+ Sbjct: 1355 KDIEKAIVILGDGYISGGGNRSRDLIESAKKEAAKLRELGVLIFTIGVGAEPNSLLLQNF 1414 Query: 340 ASPNSFY-LVENPHSMYDAFSHIGKDIV 366 AS ++Y V++ + + I Sbjct: 1415 ASKKTYYITVKDYGQLIGKIGALKTSIS 1442 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 63/191 (32%), Gaps = 20/191 (10%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 +LD S S+ D + K+ +KSI N + + G++ F + E L Sbjct: 1576 LLDSSGSVG---DENFDKMKEFVKSIVL------NFDVDNQLTRIGIIRFDSDAEIIIQL 1626 Query: 237 EWGVS--HLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L I + G+ T L+ A + G A+ K +V Sbjct: 1627 SDHKTLKDLLNDIDSIRYNEGIQTRIDKALERAMEAFSEKNG-------GRADATKALVL 1679 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 + DG+N + Q Y IGI + L A+ N ++ Sbjct: 1680 LADGQNSFIEGSQDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIATNNIAIYADSFDE 1739 Query: 354 M-YDAFSHIGK 363 + + I + Sbjct: 1740 LKFKVDEQISR 1750 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 67/195 (34%), Gaps = 19/195 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNK 229 +D+ + D S S+ + K + V + + + + Sbjct: 3 VDVALAFDSSSSVGELAYEEMKKFAH---------QVVDSFSISQQNARFAALVYGSNAS 53 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +E F+ +++ I+ LS +T L+ A + +F +QG + Sbjct: 54 VEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVAKSDLFSLQG------KVRSRRPM 107 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+ DG + D +S+ K G + AIG+ + L+ + N+ + + Sbjct: 108 ILYVFFDGTVTRSMSDLESVV--QPLKDYGVKIIAIGVGPEVNRYQLKKISEDNAIFSGK 165 Query: 350 NPHSMYDAFSHIGKD 364 + + I + Sbjct: 166 SFKELAPLLYSIVEQ 180 >gi|149632103|ref|XP_001514474.1| PREDICTED: similar to collagen type VI alpha 4 [Ornithorhynchus anatinus] Length = 1844 Score = 78.7 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 69/192 (35%), Gaps = 23/192 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S++ S+ K+ +K + M I N V+ G++ FS+ E Sbjct: 622 DIIFLIDGSESIKE---SNFEKMKEFMKLMVNMSN----IGPEN--VRIGVLQFSSSPRE 672 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F+L + L R I + + T + L + + + Sbjct: 673 EFMLNKYTTKEDLSRAISDIKQIKAGTQTGQALTFTLPYFDTSRW-------GRPTEPQY 725 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDGE + +G ++AIG+ + L + + + + Sbjct: 726 LIVITDGE-----AQDSVKGPAKALRDKGISIFAIGVLEANKTQLLEITGTEDQVFYEND 780 Query: 351 PHSMYDAFSHIG 362 S+ I Sbjct: 781 FDSLIFLKKKIS 792 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 45/314 (14%), Positives = 104/314 (33%), Gaps = 47/314 (14%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST--SLDIVV 120 ++ EG +R + + + + L R LD+ Sbjct: 342 AVVITEGFSEDRVSKPASQLRRAGVTIYALGIQRGLERGNLEKMASYPPRKHVMKLDLFQ 401 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 G I + L+ + +++K + ++D+ ++D Sbjct: 402 DHSYTGRRIGKLLCTDDLLQPSVVVLN------------NALKRGCRDTEKVDIYFLIDG 449 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--W 238 S S++ + M + + ++ + NN V+ G+V +S+ F + Sbjct: 450 SSSIDH---GDFLDMKMFMSEVLSVFQM------GNNRVRFGVVQYSDSPHLEFEVGQYH 500 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 L+ I+ + + L Y + D + KI++ +T G Sbjct: 501 STVKLKEAIRGIKQLRGRDRIGEALNYMNQRFMDNDRV------------KILILITAG- 547 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAF 358 + E ++RG ++YAIG++ + + + + V + +D Sbjct: 548 ----NFQDEVAESAQELRQRGIVIYAIGVKTDNQLKLISIAGTEENVLCVND----FDTL 599 Query: 359 SHIGKDIVTKRIWY 372 HI ++V + I + Sbjct: 600 KHIKDEVV-QDICF 612 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 67/178 (37%), Gaps = 19/178 (10%) Query: 202 INAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTN 258 I +L E VK + + V+ GLV +S++ F LE + + L G Sbjct: 254 IGNILRETVKSLVIGPDNVRIGLVLYSDEPRLVFSLETFQTKESIMSHLAKLPYRGGKPK 313 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + LK+ IF G R++ + +++ V +T+G + + ++ ++ Sbjct: 314 TGAALKFLRENIFTQDGNRRY----EKRVQRMAVVITEG-----FSEDRVSKPASQLRRA 364 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH---IGKDIVTKRIWYD 373 G +YA+GI+ L AS Y D F G+ I D Sbjct: 365 GVTIYALGIQRGLERGNLEKMAS----YPPRKHVMKLDLFQDHSYTGRRIGKLLCTDD 418 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 64/181 (35%), Gaps = 24/181 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S +E I N + ++ + +N Q GL FS Sbjct: 32 DVVFLVDASEKLELNNFPLIK---------NFIFRIIRTLEVGSNRYQIGLAQFSGTGHV 82 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 FLL ++ + I Y+ + + L++ F + Sbjct: 83 EFLLNTYLTKAE-MIDYVQQGFTLRHGPRRTGNALQFLQKTFFKEAAGSRF----GQGVP 137 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + V +T G K + + + +G + ++GI+ E L ASP+ + + Sbjct: 138 QYAVVITSG-----KSEDAVAGAARKLRGKGVNILSVGIQNFDKKE-LETMASPSLVFKI 191 Query: 349 E 349 + Sbjct: 192 Q 192 >gi|194223903|ref|XP_001494710.2| PREDICTED: integrin, alpha 1 [Equus caballus] Length = 1208 Score = 78.7 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 108/296 (36%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V +G ++ Y + + Sbjct: 122 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 181 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + ++LD++IVLD S S+ + +N +LE + + P Sbjct: 182 PTFQVVNSIAPVRECSSQLDIVIVLDGSNSIYPW--------KSVTDFLNDLLERMDIGP 233 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 234 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEA 290 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F +H KK++V +TDGE+ + Q + C + + ++I I Sbjct: 291 FTEARGARHG------VKKVMVIVTDGESHDNHQLNQVIQDCEK---QNIQRFSIAILGH 341 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 342 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATVDQ 397 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 78/223 (34%), Gaps = 29/223 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + I + + +D+++ +D S SM + + K A + + + + Sbjct: 42 SPSEISTVTITLRGGEIPCASPVDVVLSIDSSGSMTTSDPGDLRK-SAAKEFVTGLDLSM 100 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + G+V+++ + L ++ I G + T GLK A + Sbjct: 101 D---------RVGVVSWNTSAIS-WPLTNNTKDIESAIDSTGADGNTCLDT-GLKSAIDL 149 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG---ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + G K+IV +TDG + +EA+ +G +V+ IG Sbjct: 150 LSECSGS------------KVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTIG 197 Query: 327 IRV-IRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ++ FY + +++ + I I Sbjct: 198 LGPDADARNLTEIAHSTGGEFYSAPDANALAGIYKRIRSSITG 240 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 56/144 (38%), Gaps = 15/144 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D++ V+D S SM S + ++ ++ ML P++ N + +V Sbjct: 392 APEKSCSADILFVIDKSGSMRDLDKSGLKNYEIMQAALYEMLNMASNTPELRN-ARIAIV 450 Query: 225 TFSN------KIEEFFLLEW---GVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIF 271 ++ + +W G ++ I+ + K T GL+ A QI Sbjct: 451 SWDDLDGTDEDRITTLDPQWLAVGDPRIKATIQQYNEETCKETDLTFYETGLQKAM-QIM 509 Query: 272 DMQGMRQHCNTEDANYKKIIVFMT 295 + + Q + + ++ I+F+T Sbjct: 510 HSRILSQANDPLSCDTRRFIIFIT 533 >gi|326916308|ref|XP_003204450.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 902 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 49/265 (18%), Positives = 104/265 (39%), Gaps = 33/265 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 ++T+L++ + + Y I+ + + P + T + ++ + + + S V+ + Sbjct: 383 PQTTALNVKDLSPDTEYQINVYAMKGLTPSEPITIME---KTQQVKVQVECSRGVDVKA- 438 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ ++D S S+ ++ K+ ++ + VK VQ LV +S Sbjct: 439 ---DVVFLVDGSYSIG---IANFVKVRAFLEVL------VKSFEISPRKVQISLVQYSRD 486 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L V + + I G STN+ + Y ++F N Sbjct: 487 PHMEFSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTS------KGSRPNV 540 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 ++++ +TDG K + + ++A+G++ E L A ASP + Sbjct: 541 PRVMILITDG-----KSSDAFKEPAIKLRDADVEIFAVGVKDAVRTE-LEAIASPPAETH 594 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V +DAF I ++ T+ + Sbjct: 595 VYTVED-FDAFQRISFEL-TQSVCL 617 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 95/274 (34%), Gaps = 31/274 (11%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMP 157 +G + +T + + + Y +S I+ Y L + T I Sbjct: 64 DGPTKEFTLSPSTTQTVLSDLIPDIEYVVS-IASYDEVEESLPVFGQLTIQTGGPGIPEE 122 Query: 158 ITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 ++ + + + D++ ++D S S+ ++ M+ V Sbjct: 123 KKVEAQIQKCSISAMTDLVFLVDGSWSVGRNN---------FRYILDFMVALVSAFDIGE 173 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDM 273 + G+V +S+ F L + S L IK + G +T + + Y N + Sbjct: 174 EKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYLVKNTFTES 233 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 G R+ + K+ + +TDG K + E + G V+++GI+ + Sbjct: 234 AGARKG-------FPKVAIVITDG-----KAQDEVEIPARELRNIGVEVFSLGIKAADAK 281 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + P + V N + D + I + Sbjct: 282 ELKLIASQPSLKHVFNVANFDGIVDIQNEIILQV 315 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 57/440 (12%), Positives = 130/440 (29%), Gaps = 133/440 (30%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M +L A+ LP++ + G+ I+ ++ L +D + + + ++ + + Sbjct: 1 MAMLMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDPQLD------ 54 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 G + + ++ + + + ++ A + Sbjct: 55 KGLVKGAVNQYMAANYPAAVLKG----------------VTPGTEERSVTVKAEATVD-- 96 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 T + ++ N+ L+++ V+D + SM T I A Sbjct: 97 ----TIFMGALGIGSSTVRAQATAGYNN-----LEVVFVIDNTGSM------KGTAIQQA 141 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI----------------EEFFLLEW---- 238 + L E+ + + V+ GLV F K+ + W Sbjct: 142 -NAAATQLAELIMPDGMETSVKVGLVPFRGKVHIPAGVDGLADGCRNADGTLAPSWILEE 200 Query: 239 ------------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQG 275 G +++ L+ + S + A I + Sbjct: 201 YKQTKYRYPTGSSLNVPKGTCDSIPRVQALTSNRTTIVSAIAKQDALGDASGTVISEGIK 260 Query: 276 MRQHCNTEDANYK---------KIIVFMTDGEN--------------------------- 299 +H T +A + K+++ +TDG+ Sbjct: 261 WGRHVLTPEAPFTQGSSNKDMRKVMIVLTDGDTEDGKCGGNYALNYTPNAYWTNAYYGMF 320 Query: 300 -------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACASP-----NSF 345 K + L AK +G ++AI S + ++A AS + + Sbjct: 321 DMNTHCENGGKLNAAMLSEAQIAKDKGIEIFAIRYGDSDSTDISLMKAIASSKAGTDDHY 380 Query: 346 YLVENPHSMYDAFSHIGKDI 365 Y + + + + F IG+ + Sbjct: 381 YNAPSAYDLEEIFKKIGRQL 400 >gi|198412648|ref|XP_002125850.1| PREDICTED: similar to Vwa1 protein, partial [Ciona intestinalis] Length = 516 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 74/249 (29%), Gaps = 33/249 (13%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y + F T N +P + A +D + VLD S S+ S Sbjct: 284 VDPGYDLYPSFITQNT-CLNDTTWSLPQPCCSRPCP-PFAVMDAVFVLDSSSSVGEENWS 341 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQ---R 245 + ++E L D + + ++ +I+ L S + Sbjct: 342 ------TMKTFVRKIIESFILADDAA---RISVFRYNREIDTTSQILLSDFSSDIDALLS 392 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + G T + L +A + I N + K +++ +TDG Sbjct: 393 AFDAIPYDGSGTWTGQALTHAKDVILTN------ANGNRPDVKDLVLVITDGR-----SQ 441 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACASPNSFYLVENPHSMYDAFSHI 361 + + G + YA+GI R + L S ++ + D Sbjct: 442 DSVTEVSQQLRAGGVLTYAVGIVPPRGRLDESQLLDMAGSSDNLIIARGGFEGLD--DEF 499 Query: 362 GKDIVTKRI 370 I T+ Sbjct: 500 SAQISTQIC 508 >gi|47219688|emb|CAG12610.1| unnamed protein product [Tetraodon nigroviridis] Length = 717 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 78/214 (36%), Gaps = 29/214 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVV 219 SV LD++ LD S + D+ +T D + + + D+N +V Sbjct: 445 SVDTQGCLGQALDLVFALDASDGVG--RDNFLTLCD--------FVRSLSVQFDINRDVA 494 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q LV +S + F L+ S + I S G ++ L + ++ + + Sbjct: 495 QLALVAYSRRATTVFNLDTHDSGSAVLTAIGEASYMGGVASTGTALLHVHSDVLTVD--- 551 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K +V +TDG ++ + + G V+ +GI ++ + L+ Sbjct: 552 ---KGARLGVNKAVVVVTDG-----SGGTDAVVPAQKLRDNGVSVFVVGIGDMQREKLLQ 603 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 S LV + + +D++ + + Sbjct: 604 IAGSEEHLILVPSYEDL-----KYFEDVLVQMLC 632 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 83/209 (39%), Gaps = 27/209 (12%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + H + ++ Q A +D++ ++D S S+ + + + Sbjct: 1 LQEIQTSHENIKKITAAGQLMQCSAAMDILFLMDGSYSVGK------GSFERSKHYALKL 54 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + + + PD V+ GL+ F + ++E L L R +K +S G ST + L Sbjct: 55 CQALDIRPDK---VRVGLIQFGSAPRLEFALDLHATKQELMRHMKKISYRGGSTQTGLAL 111 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 KY + ++ +I++ ++DG K + + K+ G +++ Sbjct: 112 KYVLRKGLPGG-------RNSSDAAQIVIVLSDG-----KSQGNVMQAAAQLKETGVVLF 159 Query: 324 AIGIRVIRSHEFLRACAS---PNSFYLVE 349 A+G+R R E L A AS N + E Sbjct: 160 AVGLRYPRWEE-LHALASEPVENHVFFAE 187 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 72/217 (33%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+V+D+S SM + ++ + ++ +E+ K + GLV F + Sbjct: 84 DMMLVVDLSGSMSEEDMKTDSGFVDRLTAVKRVVSDFIEKRKGD-------RLGLVLFGD 136 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +Q ++ L G T GL A + Sbjct: 137 HAYLQTPLTFDRNTVQEQLNRTVLGLVGQRTAIGEGLGLATKTFIESNA----------- 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------- 333 ++ I+ ++DG N + + AK A +Y +GI Sbjct: 186 PQRTIILLSDGANTA--GVLDPIEAAQLAKDNNAKIYTVGIGAGEMQVRGFFGNQTVNTA 243 Query: 334 -----EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ N + + + I K Sbjct: 244 RDLDEDTLTKIATMTGGQYFRARNADELAEIYQTIDK 280 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 63/182 (34%), Gaps = 23/182 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +D+++VLDVSRSM + ++++ + + G Sbjct: 94 RQTVAETRGIDLLLVLDVSRSMHQQDFNGQSRLEAVKGVGKQFVLSRSAD-------RIG 146 Query: 223 LVTFSNKIEEFFLLEWG---VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 LV FS K L + + I T + A N++ Sbjct: 147 LVVFSGKGYTPCPLTLDHLTLGTVLDNISSEVIQEEGTAIGTAILIAVNRL--------- 197 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + +K I+ +TDG+N + D L A + G +Y I + F+R+ Sbjct: 198 --RASESRQKAIILLTDGQNNA--GDIDPLTAAGFALQDGIKIYTIAATAQDARPFVRSA 253 Query: 340 AS 341 S Sbjct: 254 ES 255 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 38/184 (20%), Positives = 69/184 (37%), Gaps = 38/184 (20%) Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F ++E L S ++ +K L G STN G+ + + + D + Q D Sbjct: 358 GFGCEMEPLVPLTTDFSKIRTTVKALEANG-STNMLEGVMWGWRVLSDREPFAQGAPKSD 416 Query: 285 ANYKKIIVFMTDGEN----------------------------------LSTKEDQQSLY 310 A+ +KI++F+TDG+N + D+++ Sbjct: 417 ASVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGMTAANIGQTNNALDKKTKA 476 Query: 311 YCNEAKKRGAIVYAIGIRVIR--SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVT 367 C AK+ G +Y I + + + L CA S ++ + + F I K +V Sbjct: 477 ACENAKEDGVTIYTIRLEEADVGTGKMLEECATSSAHYFDAPSRQQLTPIFDAIKKGVVK 536 Query: 368 KRIW 371 R+ Sbjct: 537 LRLT 540 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/238 (10%), Positives = 79/238 (33%), Gaps = 45/238 (18%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G ++ I + + G+ ++ + K+ L + +D + + A Sbjct: 7 FLKDTSGNFALVFGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARA------ 60 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G+ + I + ++ V ++ +V + Sbjct: 61 GDMSETQAMKLAKTTITTNYGIN-----------------VAKVAVSMVDGD------AT 97 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + L F F+ ++ + ++ + + ++ +VLD + SM Sbjct: 98 VKASMDQALVFGGFM----GRKNAAVSAEATA---TYAYTKYEIALVLDTTGSM---LGG 147 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGVSHLQR 245 +T ++ +++ ++ + ++ +V ++ + E+ G +++ Sbjct: 148 KLTS---LQNAVIGLVDGMEALGLNKEQLKFAVVPYAGFVNVGPEYGPTINGAGKVKK 202 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 78.3 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 49/341 (14%), Positives = 112/341 (32%), Gaps = 37/341 (10%) Query: 44 FFMKTVLHSMIDRSLVHAATQ----IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 F M + + + + +++ + Sbjct: 270 FDMINQVQKALITEVCSGVEEQLNSLVSGEEVVEPASNLRVTDISSKSMRIAWDSSPGDI 329 Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSI-SAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + I + ++ P ++ + + + +V Sbjct: 330 TGYKLQVFPTIASAKKQELYTGPTMTLANVRDLTPEMEYEISLFALKGLTPSVPQVVTEK 389 Query: 159 TSSVKV----NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 T VKV + + D + D+++++D S S+ + K+ ++ + V Sbjct: 390 TQPVKVFTECSLEVDVQADIVLLVDGSYSIG---LPNFAKVRAFLEVL------VNSFDI 440 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N VQ LV +S F L + + + ++ G STN+ + Y +IF Sbjct: 441 GPNKVQISLVQYSRDPHTEFALNKFDDNAAMVKAVRTFPYRGGSTNTGKAMTYVREKIFV 500 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N +++V +TDG++ + +D N+ + ++A+G++ Sbjct: 501 SG------RGARDNVPRVMVLITDGKSSDSFKD-----PANKLRDTDVEIFAVGVKDAVR 549 Query: 333 HEF--LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E + + N + VE+ +DAF I K++ Sbjct: 550 SELEAIANVPADNHVFEVED----FDAFERISKELTASICL 586 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 82/260 (31%), Gaps = 42/260 (16%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 STS I + + YS+S S +PI + S + Sbjct: 62 ASSTSTSITDLTPDVDYSVSINSYDGAEE---------------SIPILGQITKCSVSAV 106 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D S S+ I AM + + +V +S+ Sbjct: 107 A-DLVFLVDGSWSVG------RENFRFIRSFIGAM---AGAFDIEEDKTRVAVVQYSSDT 156 Query: 231 EEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + R IK L G +T + + Y F + Sbjct: 157 RTEFSLNTHFRRPDVLRAIKNLPYKGGNTMTGDAMDYLVKNTFT------QAAGARKGFP 210 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSF 345 K+ + +TDG++ E Y + G ++ +GI+ E L+ AS Sbjct: 211 KVAMIITDGKSQDPVE-----EYAERLRNIGVEIFVLGIKGADEDE-LKEIASRPHSKHV 264 Query: 346 YLVENPHSMYDAFSHIGKDI 365 Y V N + + ++ Sbjct: 265 YNVPNFDMINQVQKALITEV 284 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 75/210 (35%), Gaps = 24/210 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + +T A+ D+++++D S S+ I M+ + PD V Sbjct: 1156 TPSDAQCKTTAQADIVLLVDGSWSIGRL------NFKTIRNFIARMVGVFDIGPD---RV 1206 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q GL +S + + L + Q + L G +T + L Y F Sbjct: 1207 QIGLAQYSGDPKTEWHLNAHRTRTQLLDAVANLPYKGGNTLTGLALNYILQNNFKPN--- 1263 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +KI V +TDG K + + +G +YAIG++ +E Sbjct: 1264 ---VGMRPNSRKIGVLVTDG-----KSQDDIVANSQNLRDQGIELYAIGVKNADENELRT 1315 Query: 338 ACASPN--SFYLVENPHSMYDAFSHIGKDI 365 + P+ Y V + + D + ++ Sbjct: 1316 IASDPDDIHMYNVADFSFLLDIVDDLTNNL 1345 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 69/184 (37%), Gaps = 19/184 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ I++D S S+E + ++ +K + V + GL+ +S++ Sbjct: 1 DLGILIDSSNSIEKHGRGNFRRVLEFVKRL------VSTFHVSPRRARIGLIVYSSRSYL 54 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC-----NTEDA 285 + + + IK + T + +KYA ++F + H + Sbjct: 55 VGGFRRYRNLRSVLQAIKRIRYIRGGTYTGKAMKYALRKLFSRRAGYHHARVRLFRSSRK 114 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNS 344 KI+V +TDG + KK G +++++G+ R + ++ + P Sbjct: 115 GAAKILVMITDG-----ISQDRVTTPALRLKKMGVVIFSVGVGKRYRLKQLMQIASRPRL 169 Query: 345 FYLV 348 + Sbjct: 170 VFTA 173 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 83/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDQPTTAFELGLFGSREEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPGDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 95/270 (35%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +++ +D V +N YS++ +S + + + ++ + T Sbjct: 380 EGAVENEKQNVIEPNFVDKAVCRRNGFYSLNVLSWFSLHKIVQPLVKRVCDTNRLACSKT 439 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 440 CL--------NSADIGFVIDGSSSVG------TGNFRTLLQFVANLSKEFEISDTD---T 482 Query: 220 QSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F ++ + IK ++ + T++ + YA +F Sbjct: 483 RIGAVQYTYEQRLEFGFDKYSTKQDILNAIKRVNYWSGGTSTGAAINYALEHLF------ 536 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A + G I Y+IGI E Sbjct: 537 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHQNGVITYSIGIAWAAQDELEV 588 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +Y + I ++I Sbjct: 589 IATHPTKDHSFFVDEFDDLYKSVPKIIQNI 618 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 77/238 (32%), Gaps = 43/238 (18%) Query: 154 IVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + V ++ +AR +D+M+ LD+S SM S +++D A + + Sbjct: 80 FALAGPRMVVRQTEAEARGIDVMLALDISESMLQKDGSGKSRLDAAREVARKFVLRRSSD 139 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQ 269 + GLV F K L L I ++S T + A N+ Sbjct: 140 -------RIGLVVFRGKGYTQCPLTIDHDVLAMLIDHISPQVIQDEGTAIGSAILIATNR 192 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + +K+I+ +TDGEN D A + G +Y + Sbjct: 193 F-----------KGSTSLQKVIILITDGENN--TGDVGPATAATLAAQNGIRIYVVNAGF 239 Query: 330 IRSHEF-----------------LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 LR A + ++ E+P + + IG+ + Sbjct: 240 KSGGSAGNLSAESSAHAAMDEASLRGIARTTGGGYFRAEDPSVLDNTIKTIGRLETAR 297 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 88/223 (39%), Gaps = 24/223 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I I V + T + D++ V+D S S+ + + + ++ L Sbjct: 412 TYDITKAICKQVSNINCTQSYGDIIFVMDSSSSIT--YPNYVKQLSFVANVTRNFL---- 465 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + V+ G + F + +E+ F L+ +++ I + ST++ L+Y + Sbjct: 466 ---IGKDDVRYGALIFGSNVEKLFDLKKYDSPVDVEQHIMEATYLASSTDTAAALQYILD 522 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 Q + KII+ +TDGE+ +++ + + G + +IGI Sbjct: 523 QRMFADE-----QGGRPDAVKIIIVLTDGESTYP---EKTRAEATKLQSLGYHMMSIGIG 574 Query: 329 VIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L A AS ++ + + Y + K++VT+ Sbjct: 575 NEINELELNALASNTSNIFKAAS----YQVLDQLHKEVVTRAC 613 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 80/215 (37%), Gaps = 29/215 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P +SS ++T +D+++ +D+S SM + D +I+ + +++ Sbjct: 73 LARPRSSSEITKTKTTEGIDIILSIDMSSSMLAK-DLKPNRIEALKRVAAQFIQQRASD- 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIF 271 + G+V +S + S + + +K + + T GL A N++ Sbjct: 131 ------RIGIVVYSGESYTKVPATTDKSIVLQALKEIRQGEIEDGTAIGMGLGTAINRL- 183 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K+I+ MTDG N + D L AK+ G VY IGI Sbjct: 184 ----------KDSKTKSKVIILMTDGVNNTGVIDP--LSAAELAKEYGIRVYTIGIGT-N 230 Query: 332 SHEFLRACASPNSFY-----LVENPHSMYDAFSHI 361 +P+ + VE + S I Sbjct: 231 GKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISKI 265 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 77.9 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 56/410 (13%), Positives = 127/410 (30%), Gaps = 61/410 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS----------MIDRSLVH 60 F + G I+TA+ + +F GM ++ +H ++T L++ V Sbjct: 8 FISDRSGNFGIMTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN----ELRDNGFVNDIDDIVRSTSL 116 AA + G + K + +I + + +N + Sbjct: 68 AAMTMSGNGTISLGKDDARSIFMSQISGELTDVQVDLGIDVTKTANKLNSQVSFSATVPT 127 Query: 117 DIVVVPQNEGYSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + V + +IS + + F F N+ + + T++ + + Sbjct: 128 TFMRVLGRDSITISGTATAEYQTASFMDFYILLDNTPSMGVGATATDVSTMEKNTSDTCA 187 Query: 176 IVLDVSRSMESFFD-----SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++++ +ID+ ++ + K +N + G+ TF K Sbjct: 188 FACHETQNNNNYYNLAKKLGVSMRIDVVRQATKELTVTAKSTRVSSNQFRMGVYTFGTKA 247 Query: 231 EEF-----FLLEWGVSHLQRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 E+ + ++ + G + + A Q+ + Sbjct: 248 EDAKLTTISDPTDDLDKVRSYTDAVDLMTIPFQGYNNDQQTSFDSALTQMKTII-TTPGD 306 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYY-------------CNEAKKRGAIV----- 322 + +KI+ F++DG S K + C K + + Sbjct: 307 GSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCQPLKDKSIRIAVLYT 366 Query: 323 ----------YAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSH 360 Y I+ + ++ACASP ++ V + DA Sbjct: 367 TYLPLPKNSWYNTWIKPFQGEIPTKMQACASPGLYFEVTPTEGIADAMKA 416 >gi|47219514|emb|CAG09868.1| unnamed protein product [Tetraodon nigroviridis] Length = 1450 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 84/208 (40%), Gaps = 27/208 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 A+ D++ ++D S S+ + I+ + +M+ ++I N +Q L Sbjct: 686 ATVCKGAKADLVFLIDGSWSIGD------ESFNKVIQFVTSMIGAFEVISP--NGMQVSL 737 Query: 224 VTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V +S+ + F L + + +K + G +T + LK+ Y ++F + Sbjct: 738 VQYSDDAKTEFKLNTYYNKGIVISALKSVRYRGGNTKTGIALKHVYEKVFTSD------S 791 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++V +TDG + + + G V+ +G+ + E LR S Sbjct: 792 GMRRNVPKVLVVLTDG-----RSQDDVKKSAEKLQHSGYSVFVVGVADVDMTE-LRIIGS 845 Query: 342 --PNSFYLVENPHSMYDAFSHIGKDIVT 367 V + YDAF+ I +++T Sbjct: 846 KPSERHVFVVDD---YDAFAKIQDNLIT 870 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 49/335 (14%), Positives = 100/335 (29%), Gaps = 36/335 (10%) Query: 33 VLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM 92 + + +++ S I +E + + + + + Sbjct: 322 FAARLRDCDDTLLDNNEAQAIVYASGKPRVLLIDSEPDETDSLRWALDEQSIDVDVRPPE 381 Query: 93 SFRNEL-RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 ++L G+ I V +TS+ + + Y + F Sbjct: 382 GIPSDLSEMQGYECLILSNVPATSMSMRQMDLIRIYVQDLGGGLIMLGGDQAFGLGGYYR 441 Query: 152 RHI--VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 I ++P+ S+ + + L MM+V+D S SM KI++A + A +E + Sbjct: 442 TQIEEILPVRSNFEKERE-KPSLAMMLVIDKSGSMGG------QKIELAKDAAQAAVELL 494 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 G++ F L + I + G TN P + AY Sbjct: 495 GPKDA------IGVIAFDGDSYTVSELRSTSDRGAISDAISTIEASG-GTNMYPAMADAY 547 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + +H ++ MTDG + + + + Sbjct: 548 EALLGATAKLKH-----------VILMTDG----VSSPGDFQGVAGDMSASRITLSTVAL 592 Query: 328 RVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 S + L A +Y ++P S+ F+ Sbjct: 593 GQGSSEDLLEELAQIGGGRYYFCDDPQSVPQVFAK 627 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ + +E+ + GLV F++ Sbjct: 87 DLMLVVDLSGSMQQEDILQDGDYIDRLSAVKNVVTQFIEQ-------RQGDRLGLVLFAD 139 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ + G T GL A D + Sbjct: 140 HAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEA----------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + D + N AKK G +Y IGI Sbjct: 189 PQRVVILLSDGSNTAGTLDP--IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTS 246 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 247 ADLDEKTLTKIATMTGGQYFRARDAQELQAIYQAINQ 283 >gi|268573054|ref|XP_002641504.1| C. briggsae CBR-MUA-3 protein [Caenorhabditis briggsae] gi|187031289|emb|CAP29350.1| CBR-MUA-3 protein [Caenorhabditis briggsae AF16] Length = 3770 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 87/238 (36%), Gaps = 22/238 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + +++ ++ P + + D++ ++D S S+ S+ Sbjct: 1196 VDTPDGYTCQCYGGFVDVSSNANLP-PGRVCTVQTTCPKQKTDLVFLIDGSGSIGSY--- 1251 Query: 191 SITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWG-VSHLQRKI 247 + + + E V+L + GL+ +S++I F L ++G S L + I Sbjct: 1252 ------VFKNEVLRFVREFVELFEIGRGKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGI 1305 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 T + +++ + F R+ + + ++ + +TDG + Sbjct: 1306 SETQYLTGLTRTGAAIQHMVQEGFS---ERRGARPQQNDIARVAIILTDG-----RSQDN 1357 Query: 308 SLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + A+K +AIG+ + + E SPN ++ V+ + + + Sbjct: 1358 VTGPADAARKLSINTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQK 1415 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 86/265 (32%), Gaps = 36/265 (13%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + I+ + + P +P+ ++ + +++ ++ Q Sbjct: 263 APAVQARIMRDTLRRPVNTTVPLTRDFPDALLLELPFPRSASTLDAVVSTYLQDTRQPAN 322 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP----DVNNVVQSGLVTF 226 + VLDVS SM +ID ++ + + N + L+ F Sbjct: 323 TI---FVLDVSGSMRG------ARIDALKTALRGLSGADTTLTGRYATFANRERVTLIPF 373 Query: 227 SN-----KIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 S+ + E G + L+ ++ L+ G TN L+ AY Q R Sbjct: 374 SSAPGAPRTTELTPATRGAALKQLRAQVDALTPDG-GTNIYGALQAAYEQARAAPAGRYT 432 Query: 280 CNTEDANYKKIIVFMTDGENL-STKEDQ-QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 IV MTDGE DQ ++ Y + R + + + E R Sbjct: 433 S----------IVLMTDGERTEGPSADQFRATYAALPERARQVKTFTVLFGDSDATEMNR 482 Query: 338 ACA-SPNSFYLVENPHSMYDAFSHI 361 + + +N + AF I Sbjct: 483 IATLTGGRTFDGQN--DLRAAFKDI 505 >gi|316975135|gb|EFV58594.1| putative calcium binding EGF domain protein [Trichinella spiralis] Length = 2537 Score = 77.9 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 83/204 (40%), Gaps = 22/204 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVT 225 D++ ++D S S+ + ++ K + + E + D+N + + G++ Sbjct: 476 CPTQPTDLVFLIDGSGSIGT---------EIFYKYVMRFVHEFVTLFDINEDRTRVGIIQ 526 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ +++ F L + LQ+ I+ + G T + L++ F Q T Sbjct: 527 YTGQVKPEFYLNQHKNIDQLQQAIRNIRYVGGLTKTGAALQFMTKNTFTAQ---MGARTR 583 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASP 342 D N KI V +TDG + +EA++ +YA+G+ + E + S Sbjct: 584 DPNVYKIGVVITDG-----RAQDNVKIPADEARRHNISLYAVGVTNHVLESELEQIAGSK 638 Query: 343 NSFYLVENPHSMYDAFS-HIGKDI 365 + +++V + I K++ Sbjct: 639 DRYFIVGTFAELNTRLRAKIQKEM 662 >gi|260834336|ref|XP_002612167.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] gi|229297541|gb|EEN68176.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] Length = 954 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 48/274 (17%), Positives = 84/274 (30%), Gaps = 34/274 (12%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 +E+ N + DI + + +S S I T Sbjct: 107 SDEVAGNSDGYNEGDIKAQANDGDTEDKEG----VSEDSSEDIEDMKMKNTVRGTGVHGA 162 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 I S V+S + +D+ +LD S S+ + + + ++ + P Sbjct: 163 TFQIRSQSNVSSTCEVPVDLFWLLDGSNSVG------TANFEKVKQFVVDVVNSFDVSPT 216 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G+V +SNK F L V I + G T + LKY Sbjct: 217 A---TRVGVVQYSNKNTLMFNLGDKVDKPSTVNAINSIQYQGGGTYTGYALKYV------ 267 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 RQ N K+ V +TDGE G V+A+G+ Sbjct: 268 ----RQKAAWRGGNVPKVAVVLTDGE-----SYDSVSVAAQNLLSDGVEVFAVGVA---G 315 Query: 333 HEFLRACASPN-SFYLVENPHSMYDAFSHIGKDI 365 + + S + + + + + I K + Sbjct: 316 FKLIAIANSNETNVIELNDFNDLTTKIGEIAKKV 349 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 83/220 (37%), Gaps = 32/220 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEV 209 + P T K S R D+++ +DVS SM S +S ++ ++A + ++ Sbjct: 64 LASPFTYEAKELSTKKGR-DLILTIDVSGSMAQKGFSKEESEKSRYEVAKEIAKRFIK-- 120 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYA 266 D G+V F + L + + L +S G +T + A Sbjct: 121 NRFSDN-----IGIVIFGSFSFSASPLTYDLKALLEMFDLMSDVGIAGNNTAIGDAIFEA 175 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + K+I+ +TDG+ N K ++ + EAKKRG +Y + Sbjct: 176 IKNLESGEAKS-----------KVIILLTDGKHNFGKKSPKEGVV---EAKKRGIKIYTV 221 Query: 326 GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 GI + L A + + +N + + F I + Sbjct: 222 GIGTDYDKKLLEKMAKETNAKSFFAKNSKELEEVFKEIEE 261 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 79/202 (39%), Gaps = 31/202 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L +++V+D S SM SS ++D+ ++ A + + G++ F Sbjct: 524 EQPTLALILVIDKSGSM-----SSGDRLDLVKEAARATARTLDPSDE------IGVIAFD 572 Query: 228 NKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 N + L + I+ LS G TN+ P L+ AY Q+ + + +H Sbjct: 573 NSPQVLVRLQPAANRLRISSSIRRLSA-GGGTNAMPALREAYLQLAGSKALVKH------ 625 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PN 343 ++ ++DGE+ + + ++ V ++G+ +FL A Sbjct: 626 -----VILLSDGESPENGIN----ALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRG 676 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 ++ E+ + FS +++ Sbjct: 677 RYFYSEDGTDVPRIFSREAREV 698 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 81/229 (35%), Gaps = 33/229 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVK 210 +V+P T+ + +++ LD+S SME D A+K + A + Sbjct: 86 LVLPATALPASARE------IVLALDLSGSMERKDFSLDGETVSRLAAVKRVGAEFIRRR 139 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 + GLV F+++ + + + R ++ + G ST GL A Sbjct: 140 AGD------RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALK 193 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ Q + K++V ++DG N + AK G VY I + Sbjct: 194 RLAPAQVAAADGAGPPPSRDKVVVLLSDGANNA--GQTAPKDVAALAKDLGVRVYTIALG 251 Query: 329 VIRSH------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 I E LRA A S + V+ + + + I + Sbjct: 252 PIDMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 63/395 (15%), Positives = 126/395 (31%), Gaps = 68/395 (17%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN-EGNGNN 73 +G M +L A L ++ +++ +V+++ ++T LH D + + E G Sbjct: 16 RRGAMLVLIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQA 75 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 R K I + +D F + T + + IS Sbjct: 76 RVVAKDIFSKNLIG---GRELKLHNKDIEFGRTDAN-PDGTWEFLPNERPFQAIRISVNL 131 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSM------- 184 +P V+ +S +S A L ++++ LD S SM Sbjct: 132 DDNRQKGRNGSVPLLFGK---VLGQSSFATNHSSVAANLVHEIVLCLDRSHSMCFDETGV 188 Query: 185 --------ESFFDSSIT-------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 S+ IT + +I ++ + + V +V G+VT+ + Sbjct: 189 DYAYPPGTPSYPAGYITPPNPVGSRWAKLQGAIQVFVDTLDDLQIVPDV---GVVTWGSD 245 Query: 230 I-----------------EEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQI 270 I L ++ + I TN + G+ + + + Sbjct: 246 ITLSWSWYPFQGRSFPAVMVDVPLGQNLNLVSPAIAAKLGDIMMGGTNMSSGIDRSVSLL 305 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 N + +K I+ M+DG+ + L N+A + ++ I Sbjct: 306 T--------ANGTHSLAQKTIILMSDGQ---WNAGRNPLDAANDAADKNITIHTIAFLNG 354 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +R A + F+ + S+ D F + K Sbjct: 355 D-QSVMRQIAERTGGKFFNAPDGESLEDTFKELAK 388 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 82/238 (34%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P V ++T +D++I +DVS SM + D +++ K ++ + Sbjct: 74 LARPRNVEVSTKTKTTKGIDIVIAIDVSASMLAK-DLRPNRLEALKKVASSFI------- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 + + GLV ++ + + S + +K + T GL N+I Sbjct: 126 NGRPNDRIGLVEYAGESFTKTPITSDKSIVLSALKGIQYNSIIEGGTAIGMGLATGVNRI 185 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 + K+I+ MTDGEN + + D + A++ G VY IG+ Sbjct: 186 -----------KDSKALSKVIILMTDGENNAGQIDPRIA--AELAQEFGIKVYTIGMGTN 232 Query: 329 ---------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L A + ++ N + + + I K Sbjct: 233 GTALSPYARNPNGTFVYENIQVTIDEELLEEIAETTGGQYFRATNNKKLQEIYDEIDK 290 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 75/204 (36%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIE 231 D+++++D S SM +++ K+ A ++ ++ + L + +V F S+ + Sbjct: 71 DVVLIIDKSGSMAPSYNNGEDKMLNAKEAAKGFVDLMDLTK-----HRVAIVDFSSSNMI 125 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + I ++ G ST + + A + + + +I Sbjct: 126 GNLPFTTNPTEAKNYIDTINANG-STATGDAIDSAIALLAN----------HRPEAQPVI 174 Query: 292 VFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA-S 341 V MTDG+ D + AK G I Y I + + L+ A + Sbjct: 175 VIMTDGDATQPSTDPYGYAKQKALLAKDNGIIFYTIALLKSTDDPVTSGPNILLKEMATT 234 Query: 342 PNSFYLVENPHSMYDAFSHIGKDI 365 + + V + ++ I K+I Sbjct: 235 SDHHHFVLGSTGLSQIYAAIVKEI 258 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 61/432 (14%), Positives = 137/432 (31%), Gaps = 138/432 (31%) Query: 29 IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKN 88 ++ V G+ +++ +++ T L + +D + + ++ + + + K Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSLELPYDPDLS--------------KG 46 Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 + + + N ++ + T + S+ ++ K+ L F+ Sbjct: 47 IVQQAVSDMIHTNMPDAVVESVSPGTEV----------RSVVVTAKAKVNLLVMGFLNLA 96 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + L+++ V+D S SM T I++ ++ + + Sbjct: 97 DQ------WVEAGAAAGFNK---LEIVFVIDNSGSM------KGTPINLVKEASIGLTDL 141 Query: 209 VKLIPDVNNV-VQSGLVTFSNKI-------------------------EEFFLLEWGVSH 242 + IPD + GLV F K+ E+F + W +S Sbjct: 142 L--IPDGQQPDTKVGLVAFRGKVRLGGDVDGLEAGCRNADGSVNTGIHEDFMSMYWALSS 199 Query: 243 LQR--------------------------KIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 R I + G T + G+K+A + + Sbjct: 200 YYRNQIDLDTCSSIPESRPLSQDKGDIVEGINSQTALGSASGTVISEGIKWARHMLT--P 257 Query: 275 GMRQHCNTEDANYKKIIVFMTDGE-----------------NLSTKE------------- 304 + +++KI++ +TDG+ N T Sbjct: 258 EAPYTQAGDKKDFRKIMIVLTDGDTEDGECGGSYRASFRPNNYWTNAYYGMGVDTAHCQD 317 Query: 305 ----DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACASP-----NSFYLVENPHS 353 +Q L AK G ++AI V + + ++ AS + ++ + + Sbjct: 318 GGVLNQDMLAEAQLAKDEGIEIFAIRFGVSDNTDISLMKQIASSKAGTNDHYFDAPSVYD 377 Query: 354 MYDAFSHIGKDI 365 + D F IGK + Sbjct: 378 IPDVFKKIGKQL 389 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 50/212 (23%), Positives = 82/212 (38%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +D+++VLD S SM D+ I K+++A + M+E ++ G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGIPKLEIAKSASAKMIEHLESSDG------VGVIAF 454 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + +S + I+ +S G T P L A + + + Sbjct: 455 DHNYYWAYKFG-KISKKEDVIESISSIEVGGGTAIIPPLSEAVKTLKKSKAKSKL----- 508 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--P 342 IV +TDG + NEAK+ + IG+ + L AS Sbjct: 509 ------IVLLTDGMGEQGGYEI----PANEAKRNNIKITTIGVGKYVNATVLSWIASFTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FYLV NP + D F K I K I K Sbjct: 559 GRFYLVSNPSELVDVFLKETKIIKGKYIKEKK 590 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/245 (15%), Positives = 76/245 (31%), Gaps = 33/245 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + +V+ T+ S L+ +M+V+DVS SM S D ++ A Sbjct: 69 RHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMASN-DVPPNRLAAAK 127 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +++ + GLV F+ ++ I L T + Sbjct: 128 EAGKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRGAVKSGIDSLQP-APKTAT 180 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYCNEAK 316 G+ A I + + IV +DG Q + AK Sbjct: 181 GEGIFTALQAIATVGSVMGGGEGPPPAR---IVLESDGAENVPLDPNAPQGAFTAARAAK 237 Query: 317 KRGAIVYAIGIRVIRS---------------HEFLRACA-SPNSFYLVENPHSMYDAFSH 360 +G + I + C + + ++ S+ + ++ Sbjct: 238 GQGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGEAFHADSLDSLKNVYTT 297 Query: 361 IGKDI 365 + + I Sbjct: 298 LQRQI 302 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 73/209 (34%), Gaps = 27/209 (12%) Query: 174 MMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ LD+S SME D A+K + A + + GLV F+++ Sbjct: 100 IVLALDLSGSMERKDFSLDGETVSRLAAVKRVGAEFIRRRAGD------RIGLVEFADQA 153 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + R ++ + G ST GL A ++ Q + Sbjct: 154 YVAAAPTFDTAAVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQLADAEGGGPPPSRD 213 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------EFL 336 K++V ++DG N + AK G VY I + I E L Sbjct: 214 KVVVLLSDGANNA--GQTAPKDVAALAKDLGVRVYTIALGPIDMADNPNNEQDVVDVETL 271 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGK 363 RA A S + V+ + + I + Sbjct: 272 RAMAETSGGRAFRVKTTDDLESVAAAIDE 300 >gi|310115525|ref|XP_003120135.1| PREDICTED: collagen alpha-1(XXII) chain-like [Homo sapiens] Length = 788 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 76/216 (35%), Gaps = 47/216 (21%) Query: 173 DMMIVLDVSRSM-ESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DMM+V+D+S SM E +S + ++ + ++ +++ K + GLV F Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQRKGD-------RLGLVLFG 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + ++ ++ L+ G T GL A + Sbjct: 137 DHAYLQTPLTFDRNTVREQLDRTVLNLVGQRTAIGEGLGLATKTFIESNA---------- 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------ 333 ++ I+ ++DG N + + L AK A +Y +GI Sbjct: 187 -PQRTIILLSDGANTA--GVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNT 243 Query: 334 ------EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + L A + ++ N + + + I Sbjct: 244 ARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|119913152|ref|XP_616068.3| PREDICTED: integrin, alpha 1 [Bos taurus] gi|297487671|ref|XP_002696385.1| PREDICTED: integrin, alpha 1 [Bos taurus] gi|296475808|gb|DAA17923.1| integrin, alpha 1 [Bos taurus] Length = 1195 Score = 77.5 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 47/296 (15%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V +G ++ Y + + Sbjct: 109 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 168 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 169 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 220 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 221 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 277 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ + + C ++I I Sbjct: 278 FT------EARGARRGVKKVMVIVTDGESHDNHRLNKVIQDCE---DESIQRFSIAILGS 328 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 329 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATVDQ 384 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 78/214 (36%), Gaps = 43/214 (20%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SME + ++ + +++ + GLV F + Sbjct: 104 DLMLAVDLSGSMEKTDMPLRGVEVDRLTAVKSVVKNFIQK-------RQGDRMGLVVFGS 156 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ ++ + + G +T + A + + Sbjct: 157 QAFLQSPLTYDLNTVETLLNETEIGMAGNNTAIGDAIGIALKHLHQNSEKKA-------- 208 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 +++ +TDG N + Q L +A++ G +Y IGI ++ Sbjct: 209 ---VLILLTDGSNTA--GAVQPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNM 263 Query: 336 ----LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + F++ ++ + + + + I + Sbjct: 264 DTTTLQKIAELTQGRFFMAKDTNQLNEIYQLIDQ 297 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 80/229 (34%), Gaps = 33/229 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVK 210 +V+P T+ + +++ LD+S SME D A+K + A + Sbjct: 86 LVLPATALPASARE------IVLALDLSGSMERKDFSLDGETVSRLAAVKRVGAEFIRRR 139 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 + GLV F+++ + + + R ++ + G ST GL A Sbjct: 140 AGD------RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALK 193 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ Q K++V ++DG N + AK G VY I + Sbjct: 194 RLAPAQVAAADGEGPPPARDKVVVLLSDGANNA--GQTAPKDVAALAKDLGVRVYTIALG 251 Query: 329 VIRSH------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 I E LRA A S + V+ + + + I + Sbjct: 252 PIDMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 80/229 (34%), Gaps = 33/229 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVK 210 +V+P T+ + +++ LD+S SME D A+K + A + Sbjct: 86 LVLPATALPASARE------IVLALDLSGSMERKDFSLDGETVSRLAAVKRVGAEFIRRR 139 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 + GLV F+++ + + + R ++ + G ST GL A Sbjct: 140 AGD------RIGLVEFADQAYVAAAPTFDTATVARTLEEATIGLVGRSTGIGDGLGLALK 193 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ Q K++V ++DG N + AK G VY I + Sbjct: 194 RLAPAQVAAADGEGPPPARDKVVVLLSDGANNA--GQTAPKDVAALAKDLGVRVYTIALG 251 Query: 329 VIRSH------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 I E LRA A S + V+ + + + I + Sbjct: 252 PIDMADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|308081588|ref|NP_001183957.1| vitrin isoform 2 [Mus musculus] gi|26342052|dbj|BAC34688.1| unnamed protein product [Mus musculus] Length = 628 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 93/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + DI +V V N YS + S + + ++ + T Sbjct: 380 EGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDTDRLACSKT 439 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S SM + ++ + + +E ++ Sbjct: 440 CL--------NSADIGFVIDGSSSMG------TSNFRTVLQFVANLSKEFEISDTD---T 482 Query: 220 QSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ ++E F + + I+ + + T++ ++YA Q+F Sbjct: 483 RVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF------ 536 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A ++G I YAIGI E Sbjct: 537 ---KKSKPNKRKVMIIITDGR-----SYDDVRIPAMAAYQKGVITYAIGIAWAAQDELEV 588 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V++ ++Y I ++I Sbjct: 589 MATHPAKDHSFFVDDFDNLYKIAPRIIQNI 618 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 79/212 (37%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +++V+D S SM +I K+++A + M++ ++ G++ F Sbjct: 401 EKERNVAVVLVIDHSGSMGESNLGNINKLEIAKSAAAKMIDHLESSDS------VGVIAF 454 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I + + G T P L A N + + Sbjct: 455 DHNFYWASKFGKLKSKNEVIENISGI-QIGGGTAIIPPLTEAVNTLRKSKAK-------- 505 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 K+IV +TD E+ Y + AK+ + IG+ + L A + Sbjct: 506 ---DKVIVLLTD----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FY V++ ++ D F K I K I K Sbjct: 559 GRFYYVKDASNLIDVFLKEAKIIKGKYIKEKK 590 >gi|40805823|ref|NP_690848.1| collagen, type XXII, alpha 1 [Homo sapiens] gi|296434458|sp|Q8NFW1|COMA1_HUMAN RecName: Full=Collagen alpha-1(XXII) chain; Flags: Precursor gi|225000822|gb|AAI72420.1| Collagen, type XXII, alpha 1 [synthetic construct] Length = 1626 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|119612600|gb|EAW92194.1| collagen, type XXII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1626 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|22652113|gb|AAN03620.1|AF406780_1 alpha 1 type XXII collagen [Homo sapiens] Length = 1626 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 94/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + DI +V V N YS + S + + ++ + T Sbjct: 402 EGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDTDRLACSKT 461 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S SM + ++ + + +E ++ Sbjct: 462 CL--------NSADIGFVIDGSSSMG------TSNFRTVLQFVANLSKEFEISDTD---T 504 Query: 220 QSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ ++E F + + I+ + + T++ ++YA Q+F Sbjct: 505 RVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF------ 558 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A ++G I YAIGI E Sbjct: 559 ---KKSKPNKRKVMIIITDG-----RSYDDVRIPAMAAYQKGVITYAIGIAWAAQDELEV 610 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V++ ++Y I ++I Sbjct: 611 MATHPAKDHSFFVDDFDNLYKIAPRIIQNI 640 >gi|308501643|ref|XP_003113006.1| CRE-MUA-3 protein [Caenorhabditis remanei] gi|308265307|gb|EFP09260.1| CRE-MUA-3 protein [Caenorhabditis remanei] Length = 3860 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 82/212 (38%), Gaps = 21/212 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P + + D++ ++D S S+ S+ + + + E V+L Sbjct: 1215 PGRVCTVQTTCPKQKTDLVFLIDGSGSIGSY---------VFKNEVLRFVSEFVELFEIG 1265 Query: 216 NNVVQSGLVTFSNKIEEFFLL-EWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL+ +S++I F L ++G S L + I T + +++ + F Sbjct: 1266 RSKTRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQYLTGLTRTGAAIQHMVQEGFS- 1324 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRS 332 R+ + ++ ++ + +TDG + + A+K +AIG+ + + Sbjct: 1325 --ERRGARPQQSDIARVAIILTDG-----RSQDNVTGPADSARKLSINTFAIGVTDHVLA 1377 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 E SPN ++ V+ + + + Sbjct: 1378 SELESIAGSPNRWFYVDKFKDLDTRLRSMIQK 1409 >gi|219841908|gb|AAI44536.1| COL22A1 protein [Homo sapiens] Length = 1319 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 84/216 (38%), Gaps = 26/216 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D++ +LD S S+ + + + +++ ++ PD + Sbjct: 27 AQRAGCKSVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGPD---RTRV 77 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L S ++ + L+ G +TN+ L+Y + F Sbjct: 78 GVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDALRYITARSFS---PHAG 134 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D YK++ + +TDG + D + + + G ++A+G+ E Sbjct: 135 GRPRDRAYKQVAILLTDGRSQDLVLDAAAAAH-----RAGIRIFAVGVGEALKEELEEIA 189 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P + V + ++A I + + +R+ + Sbjct: 190 SEPKSAHVFHVSD----FNAIDKI-RGKLRRRLCEN 220 >gi|332254886|ref|XP_003276564.1| PREDICTED: integrin alpha-1 [Nomascus leucogenys] Length = 1179 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTKEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|150251390|gb|ABR68007.1| matrilin-like 40 kDa protein [Lehmannia valentiana] Length = 390 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 79/196 (40%), Gaps = 20/196 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++VLD S SME ++ +++ A + ++ L N+ + G + FS+ +E Sbjct: 209 DIVLVLDSSGSMEDK--NNELQLNFASRFVSHFLV-------GNSKARFGALLFSDFVEN 259 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + + I +T + +G+ +N I Sbjct: 260 LFYLNKYTSTADVSKAILRAPYHRGTTLTNEAF-----DFIRTEGVFSTPKGGRSNAPDI 314 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-NSFYLVE 349 +V TDG++ +L + K++ + A+GI S E LR AS + + Sbjct: 315 VVVFTDGQSTKP---ALTLAAADNLKRQNVRIVAVGIGNEVSKEELRQVASSRDDVFEAS 371 Query: 350 NPHSMYDAFSHIGKDI 365 + ++ + K++ Sbjct: 372 SFENLDYIEQKLAKNV 387 >gi|316956996|gb|EFV46978.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 328 Score = 77.1 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 74/213 (34%), Gaps = 23/213 (10%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P D+M VLD S S+ S + I L E + L Sbjct: 21 PGRVCTIQTYCPTQPTDLMFVLDGSGSIGS---------AVFKNEILRFLREFINLFTIG 71 Query: 216 NNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +N + ++ +S++I E F + + + + T + L +F + Sbjct: 72 SNHTRLAIIQYSDQIRHELDFKEANSKAEVDEALNRVEYLTGLTKTGDAL----TDMFKI 127 Query: 274 QGMRQHCNTE-DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIR 331 + + ++ + +TDG + + NEAKK +++A+G+ + Sbjct: 128 GFSKSRGARPIETGVHRVAIVITDG-----RSQDIVSFSANEAKKSNVLMFAVGVTDHVS 182 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 E + S + +LV+ + + + Sbjct: 183 EAELVEIAGSKDRVFLVKEFTDLNVRLRSLIQK 215 >gi|109090611|ref|XP_001091779.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 3 [Macaca mulatta] Length = 781 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 47/209 (22%), Positives = 79/209 (37%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + V S+ A +D++ +LD S S+ S + ++ Sbjct: 28 HVSRETIGKISVASKMMWCSAAVDILFLLDGSNSVGK---GSFERSKHFAITV------C 78 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ G FS+ F L+ + ++ KIK + G T++ LKY Sbjct: 79 DALDISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N R +A+ +I+V +TDG K + K++G V+A+G+ Sbjct: 139 N--------RGFPGGRNASVPQILVIVTDG-----KSQGHVALPAKQLKEKGVTVFAVGV 185 Query: 328 RVIRSHEFLRACAS---PNSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPREQHVLLAEQVED 213 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 67/193 (34%), Gaps = 24/193 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDAKPTQAALLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K +V +T G + + + + G V +G+ + S R Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGISVLVMGVGPVLSEGLRRLAGPR 682 Query: 343 NSFYLVENPHSMY 355 +S V + Sbjct: 683 DSLIHVAAYTDLR 695 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 73/187 (39%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TALDGFLRAKAFVKQFVQAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L R + + G T L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVRSLDGIPFRGGPTLMGSALRQA-----ADRGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ ++ + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SRSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P + Sbjct: 501 SDPQDLL 507 >gi|109090613|ref|XP_001091550.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 1 [Macaca mulatta] Length = 725 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 47/209 (22%), Positives = 79/209 (37%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + V S+ A +D++ +LD S S+ S + ++ Sbjct: 28 HVSRETIGKISVASKMMWCSAAVDILFLLDGSNSVGK---GSFERSKHFAITV------C 78 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ G FS+ F L+ + ++ KIK + G T++ LKY Sbjct: 79 DALDISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N R +A+ +I+V +TDG K + K++G V+A+G+ Sbjct: 139 N--------RGFPGGRNASVPQILVIVTDG-----KSQGHVALPAKQLKEKGVTVFAVGV 185 Query: 328 RVIRSHEFLRACAS---PNSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPREQHVLLAEQVED 213 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 67/193 (34%), Gaps = 24/193 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDAKPTQAALLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K +V +T G + + + + G V +G+ + S R Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGISVLVMGVGPVLSEGLRRLAGPR 682 Query: 343 NSFYLVENPHSMY 355 +S V + Sbjct: 683 DSLIHVAAYTDLR 695 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 73/187 (39%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TALDGFLRAKAFVKQFVQAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L R + + G T L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVRSLDGIPFRGGPTLMGSALRQA-----ADRGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ ++ + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SRSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P + Sbjct: 501 SDPQDLL 507 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 77.1 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 69/217 (31%), Gaps = 38/217 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +V DVS SM + D ++ A ++ ++EV V G+++F+N Sbjct: 87 SIALVTDVSGSMLA-TDVQPNRMIAAKRAARRFVDEVP------RTVNLGVISFNNTATV 139 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 S + I L+ G T + + A + R IV Sbjct: 140 LQSPTRNRSDVLTAIDRLAVSG-GTATGEAIATATEML------RNQPGENGRRPPSAIV 192 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------------VIRS 332 ++DG + + ++ + EA++ +Y + Sbjct: 193 LISDGTSTNGRD---PIEAAAEARRLRIPIYTVAFGTDQGTITVPGRDGVERTERVPPDP 249 Query: 333 HEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + + ++ + F +G + T+ Sbjct: 250 TALAQIAEMTGGETFTADSADRLDTVFERLGSQLGTR 286 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 68/210 (32%), Gaps = 28/210 (13%) Query: 174 MMIVLDVSRSME---SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +MI +D+S SME D A+K + + + GLV F+++ Sbjct: 105 IMIAMDLSGSMERRDFALDGETVNRLTAVKRVGTDFIRRRAGD------RIGLVIFADQA 158 Query: 231 EEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQI--FDMQGMRQHCNTEDAN 286 + + + R + + G ST GL A ++ D G + Sbjct: 159 YVAAAPSFDTAAVARALDEATIGISGRSTGIGDGLGLALRRLDPRDAGGEAASGSKPGEK 218 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-----------SHEF 335 K ++ ++DG N + A++ G VY I + E Sbjct: 219 PAKAVILLSDGANNA--GQTAPKDVAELARELGIKVYTIALGPRDMADADGEQDVVDTET 276 Query: 336 LRAC--ASPNSFYLVENPHSMYDAFSHIGK 363 LR AS + V + I + Sbjct: 277 LRDMARASGGEAFRVRTTEDLVRVADAIDR 306 >gi|198436415|ref|XP_002121394.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 904 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 83/220 (37%), Gaps = 41/220 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++D+++VLD S S+ + I + L ++ P+ +V + ++ Sbjct: 709 ASPKIDIVMVLDSSSSVTEPGWRKM------INFVKTALGFYEMGPNSTSV---SVFRYN 759 Query: 228 NKIEEFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +I+E + +G L R+I L G T + L YA + + + Sbjct: 760 AEIDEANKISFQYTQTYGKEQLLRRIGRLPYNGQGTRTGQALSYALHILTN--------E 811 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----HEFLR 337 + +++ +TDG K ++ G + YAI I+ R ++ Sbjct: 812 INRPDAVDVVLVLTDG-----KSQDAVKAPAEALRRNGVLTYAIAIQPERGVLNMNQLND 866 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDI---VTKRIWYDK 374 +P++ +L + FS +D + + +D+ Sbjct: 867 IAGTPHNLFL------LRSGFSSFTEDFAREIRNSVCHDQ 900 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 74/204 (36%), Gaps = 31/204 (15%) Query: 173 DMMIVLDVSRSM---------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 D+++ +D S SM E+ +++ ++A + +++ +L + G+ Sbjct: 83 DIVLAIDASGSMNSTGFDFEGEAALPQKLSRFEIAKIVASEFIQK-RLSDN------VGI 135 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V + + + + + + + YL++ N+ G A + Sbjct: 136 VLYGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIAMS---------LRAFKH 186 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRACA- 340 KI+V +TDGE D AK+ +Y IG+ L+ A Sbjct: 187 SKAKSKIVVLLTDGE--HNSGDISPKDALVLAKEENIKIYTIGMGNRGEADEALLKKIAD 244 Query: 341 -SPNSFYLVENPHSMYDAFSHIGK 363 S F+ N + + + HI + Sbjct: 245 ESGGEFFYATNAKELKEIYEHIDE 268 >gi|311268857|ref|XP_003132237.1| PREDICTED: collagen alpha-1(VII) chain-like [Sus scrofa] Length = 2945 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 78/203 (38%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSRAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + +Q+F Q R K+ Sbjct: 91 EFSLDTLGSGGDVIRAIRELSYKGGNTRTGAAILHVADQVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K++G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDAAAQRLKEQGVKLFAVGIKNADPEELKRIASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 100/281 (35%), Gaps = 33/281 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH- 153 + + N V + + +R+ + I + + + + I+ + F + ++ Sbjct: 584 DGKSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 IV I S D + D+M ++D S S+ T + + ++ I Sbjct: 644 IVHRICSE---KGCEDMKADIMFLVDSSGSIGP------TNFETMKTFMKNLVG---KIQ 691 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + Q G+V FS+ E F L ++ + I +S +T + L + N+ F Sbjct: 692 IGADRSQVGVVQFSDYNREEFQLNKYSTHEEIYAAIDRMSPINRNTLTGGALTF-VNEYF 750 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D+ +K ++ +TDG K + + + ++++G+ Sbjct: 751 DLS------KGGRPQVRKFLILLTDG-----KAQDEVGGPATALRSKSVTIFSVGVYGAN 799 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + VEN +D I ++ + Sbjct: 800 RAQLEEISGDGSLVFHVEN----FDHLKAIESKLIFRVCAL 836 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 73/202 (36%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ +I + + S+ + + P N V+ G+V +S+K E Sbjct: 474 DIYFLIDGSSSIRKK---EFEQIQIFMSSV------IDMFPIGPNKVRVGVVQYSHKNEV 524 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + ++ + + + T + L F + +++ Sbjct: 525 EFPVSRYTDGIDLKKAVFNIKQLKGLTFTGKALD------FILPLIKKGKTERTDRAPCY 578 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG K + L N + ++AIGI + + +N Sbjct: 579 LIVLTDG-----KSNDSVLEPANRLRAEQITIHAIGIGEANKTQLRQIAGKDERVNFGQN 633 Query: 351 PHSMYDAFSHIGKDIVTKRIWY 372 +D+ I +IV + Sbjct: 634 ----FDSLKSIKNEIVHRICSE 651 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 77/189 (40%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLD S S+ S+ +N + VK + VQ G +T+SN E Sbjct: 845 LDIVFVLDHSGSIGPREQESM---------MNLTIHLVKKADVGRDRVQIGALTYSNHPE 895 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + ++ G T + L+++ N +F +H + N ++ Sbjct: 896 ILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVLFT----EEHGSRLTQNVRQ 950 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG + + + E + +G ++A+G+ E + V+ Sbjct: 951 LMIVITDG---VSHDRDKLDEAARELRDKGITIFAVGVGNANQDELETMAGKKENTVHVD 1007 Query: 350 NPHSMYDAF 358 N + D + Sbjct: 1008 NFDKLRDIY 1016 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 65/205 (31%), Gaps = 23/205 (11%) Query: 154 IVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ + + D++ ++D S + I +N M+ + Sbjct: 10 LILWTETLADQSPGPGPEYADVVFLVDSSNYLG------IKSFPFVRTFLNRMISSL--- 60 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 P N + L +S+ + F L + L K G S L+ A+ Sbjct: 61 PIEANKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRT 120 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F + + I+V + S + + ++ G + ++G++ Sbjct: 121 YFSAPTNGRDKK----QFPPILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQK 171 Query: 330 IRSHEFLRACASPNSFYLVENPHSM 354 L+A A+ + + + Sbjct: 172 ASEEN-LKAMATSQFHFNLRTARDL 195 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 62/171 (36%), Gaps = 21/171 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q D+ D++ ++D S + + + + V + + ++ GL++F Sbjct: 262 QKDSLADLIFLVDES-------VGTTQNLRDLQNFLENVTSSVDVKDNC---MRLGLMSF 311 Query: 227 SNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S++ + L S Q++I+ LS ++N + Q+ + + Sbjct: 312 SDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAI----EQMRKEGFSESSGSRKA 367 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +T D + + G ++A+GI + + Sbjct: 368 QGVPQIAVLVT-----HRASDDMVREAALDLRLEGVTMFAMGIEGANNTQL 413 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 93/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + DI +V V N YS + S + + ++ + T Sbjct: 402 EGAAERDIQHVVEPGFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDTDRLACSKT 461 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S SM + ++ + + +E ++ Sbjct: 462 CL--------NSADIGFVIDGSSSMG------TSNFRTVLQFVANLSKEFEISDTD---T 504 Query: 220 QSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ ++E F + + I+ + + T++ ++YA Q+F Sbjct: 505 RVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF------ 558 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A ++G I YAIGI E Sbjct: 559 ---KKSKPNKRKVMIIITDGR-----SYDDVRIPAMAACQKGVITYAIGIAWAAQDELEV 610 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V++ ++Y I ++I Sbjct: 611 MATHPAKDHSFFVDDFDNLYKIAPRIIQNI 640 >gi|149018695|gb|EDL77336.1| similar to procollagen, type VI, alpha 3 isoform 4 (predicted) [Rattus norvegicus] Length = 719 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 52/272 (19%), Positives = 93/272 (34%), Gaps = 34/272 (12%) Query: 97 ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 + + + V D +R T ++I V F + + Sbjct: 89 DGQSSDPVADAAQGLRDTGINIYAIGVRDANTTELE---EIANNRVFFTDDFHFLKSIHQ 145 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +V I S SQ D++ ++D S S+ S KI +K V Sbjct: 146 EVVRDICSFENCRSQKA---DIIFLIDGSESISSE---DFEKIKDFVK------RMVNQS 193 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +Q GL+ FS+ E F L+ S + R I +++ T + L + Sbjct: 194 NIGADKIQIGLLQFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPF 253 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 QG R + ++ +TDG+ + + R I++AIG+ Sbjct: 254 FDISQGGRPG-------VHQYLIVITDGD-----SHDDIVSPAKALRDRNIIIFAIGVGK 301 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 I+ + L + Y EN S+ + I Sbjct: 302 IQRAQLLAITNDQDKVYHEENFESLQNLEKEI 333 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 62/168 (36%), Gaps = 17/168 (10%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 + + +++ + PD V+ G+V +S+KI FLL S K+ Sbjct: 1 MKEFMKDVIKMFHIGPDG---VRFGVVQYSDKIISQFLLTQYTSM--EKLGTAIGNIQQG 55 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 A +++ NT N + ++ +TDG++ D + Sbjct: 56 GGGTTTGEALSKM-----ALVFRNTARTNVAQYLIVITDGQSSDPVADA-----AQGLRD 105 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 G +YAIG+R + E N + ++ H + + +DI Sbjct: 106 TGINIYAIGVRDANTTELEEIA--NNRVFFTDDFHFLKSIHQEVVRDI 151 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 51/351 (14%), Positives = 108/351 (30%), Gaps = 77/351 (21%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 ++ K D + + + F + ++R+ V P + Sbjct: 196 AGATADDGKTGDADDADNDGNTAEDADDAEHIMRDRFTFNRKTVMRAARNVAVPQPDH-- 253 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSME 185 + Y + + + S +T ++++++VLD S SM Sbjct: 254 ---TKSITY-----------NNGGKYTLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMN 299 Query: 186 SFFD--------SSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEF 233 D S+ ++ ++ + ++ I D N+ V+ + F Sbjct: 300 YCMDGSQRGCNKSNPKRLTALKEAATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVV 359 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + L+ + LS G T + G+ A + KK+++F Sbjct: 360 SSLTSDTAALKSSVSRLSANGA-TPADKGMAAAQTALLRA----------RPGAKKVVIF 408 Query: 294 MTDGENL-----STKEDQQSLYYCNEAKKRGAIVYAIGI--------------RVIRSHE 334 DG ST+ ++ K G ++Y+IGI ++++ Sbjct: 409 FADGVPTTQNTFSTRVANDAVTTALAMKSAGTLIYSIGIFEGANPEQQSFGNRENDQANQ 468 Query: 335 FLRACASP------------------NSFYLVENPHSMYDAFSHIGKDIVT 367 F+ A +S + + + F I K+I T Sbjct: 469 FMHAVSSNYPNATAYNKTNWGTGSNLGYYKATNSADDLTKIFDDIQKEITT 519 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 76.7 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 91/229 (39%), Gaps = 24/229 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + +G ++T + + + L + ++ + + K ++ + +D + A ++ E Sbjct: 23 RQFRRDRRGNYALMTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLT-E 81 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG-Y 127 G +++ K D N +NDID + ++ + G Sbjct: 82 GATDDQLKAYALDFF-----------------NANLNDIDPASATLNVTLPSNTSGGGLL 124 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +++A YK ++ I SV + L++ +VLD S SM + Sbjct: 125 TMTAQLAYKPYFYPAFAQLVGKSATDANQKINFSVTSQVRLKNTLEVALVLDNSGSMTTL 184 Query: 188 -FDSSITKIDMAIKS----INAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 S +ID+ + ++ + ++ +I V+ VQ GLV F+ + Sbjct: 185 GTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDKPVQFGLVPFAASVN 233 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/270 (12%), Positives = 78/270 (28%), Gaps = 83/270 (30%) Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEW-- 238 ++ S+++ T + M + + P + +G +S L Sbjct: 409 GAVNSWWNDDPTS-GTGQSRLRNMAKYFQPRPIDAPALPAGNGPNYSCTTNPITPLTDVS 467 Query: 239 ---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 G + ++ I + G TN G+ + + + + Q + K+++ +T Sbjct: 468 VADGATSIKAAIDLMQPNG-GTNVPEGMAWGWRVVSSGEPFTQGRRETEKGNDKVVIVLT 526 Query: 296 DGENL---------STKEDQQS-------------------------------------- 308 DG N S + +S Sbjct: 527 DGANTYYTPSSLGYSDPANSKSTYASYGYLNPGYNGTSVGRMFMGTSSAIGQLDYSNGNY 586 Query: 309 --------LYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACASPNSF-------- 345 CN AK +V + + + + + L++C+S + F Sbjct: 587 TNALNEQMATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSCSSDSRFRKDPTDPS 646 Query: 346 -----YLVENPHSMYDAFSHIGKDIVTKRI 370 + S+ + F IG ++ R+ Sbjct: 647 KPAKLFWNATGASLSNDFKEIGNELSNLRV 676 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 70/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + ++ A ++ +++ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVAPNRLVAAQEAAKQFADQLTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L + T + G+ A + + + D IV Sbjct: 152 VSPTTNREATKAAIDKL-QLADRTATGEGIFTALQAVATVGAV---IGGGDEPPPARIVL 207 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFL 336 M+DG+ + AK +G + + + E L Sbjct: 208 MSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEML 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + + + F+++ + I Sbjct: 268 KKIADLSGGDAFTASSLEQLKQVFTNLQEQI 298 >gi|326672458|ref|XP_695815.4| PREDICTED: von Willebrand factor A domain-containing protein 2 [Danio rerio] Length = 795 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 76/217 (35%), Gaps = 31/217 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVV 219 SV LD++ VLD S + + + D+N +V Sbjct: 520 SVDNQGCLGQALDLVFVLDASSGVGKENFIHFQ----------DFVRSTSVQFDINRDVA 569 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGM 276 Q GLV + + F L+ S + R + + G + L + +Q + +G Sbjct: 570 QVGLVVYGRRPVTVFDLDKYNSGSAVLRAVGDAAFLGGKASVGSALLHVLSQSLTVGKGA 629 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R N K +V +TDG + + + + G ++ IG+ I+ L Sbjct: 630 RPGVN-------KAVVVLTDG-----TGVEDAAVPAQKIRDSGVSIFLIGVGDIQQELLL 677 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 R S + V + + +D++ + + D Sbjct: 678 RISGSEDHMITVPSYDDL-----KYSEDVLVQMVCSD 709 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/192 (20%), Positives = 80/192 (41%), Gaps = 26/192 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q A +D++ V+D S S+ +S + ML+ + + + V+ G+V Sbjct: 44 MQCSAAMDILFVMDSSYSVGK---------GGFERSRHYMLKLCEALDVRQDKVRVGVVQ 94 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F + + L+ L++K+K + G ST + LK+ + F Sbjct: 95 FGSTPKLEVSLDSYKTKEELKKKLKKIHYRGGSTQTGLALKFVLRKGFSGGR-------- 146 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACAS 341 +++ +++V ++DG K +E K G +++A+GIR R E L + S Sbjct: 147 NSSVPRVVVLLSDG-----KSQGAVQLPASELKLSGVLLFAVGIRYPRWDELRELASGPS 201 Query: 342 PNSFYLVENPHS 353 + + E+ Sbjct: 202 DSHVFFAEHFSD 213 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/183 (21%), Positives = 75/183 (40%), Gaps = 24/183 (13%) Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSH 242 S +T +A S + +E Q G+V+F L+ + Sbjct: 899 LGSKLTTDSVAKTSAVSFVESRGDGD------QVGVVSFYTSASLNSALKQMNSGTNKTT 952 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 ++ I LS G T+ + G+K A + + H + K+ I+ +TDG + Sbjct: 953 VKNAINSLSASG-GTDISSGIKKA------IAELDAHK---RSTAKQYIIVLTDGYSQYP 1002 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + D L ++AK +G ++ IG+ + + + P +Y V +P + A+ IG Sbjct: 1003 EFD---LIEADKAKAKGYTIFTIGMGMADEDTLKKIASKPEYYYRVLSPEQLEAAYYDIG 1059 Query: 363 KDI 365 ++I Sbjct: 1060 QEI 1062 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 87/233 (37%), Gaps = 49/233 (21%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P +V ++T++ +D+++ +DVS SM + D +++ K ++ V P+ Sbjct: 77 PRNVAVSKKTKTNSGIDIIMAIDVSASMLA-RDLKPNRLEALKKVA---IDFVDRRPND- 131 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQ 274 + G+V ++ + + + ++R I L + T GL N++ Sbjct: 132 ---RIGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGTAIGMGLGSGVNRL---- 184 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--- 331 E K+I+ +TDG N + D ++ A++ VY IGI Sbjct: 185 -------KESTAKSKVIILLTDGVNNAGNIDPRTAT--ELARELEIKVYTIGIGTNGMAD 235 Query: 332 ---------------------SHEFLR--ACASPNSFYLVENPHSMYDAFSHI 361 + L+ A A+ ++ + S+ + + I Sbjct: 236 FPWSKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKYFRATDNQSLKEIYDEI 288 >gi|156409373|ref|XP_001642144.1| predicted protein [Nematostella vectensis] gi|156229285|gb|EDO50081.1| predicted protein [Nematostella vectensis] Length = 203 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 76/202 (37%), Gaps = 27/202 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RLD+ ++D S S+E + ++ IK+I ++L P + G V FS++ Sbjct: 4 RLDLGFLIDGSGSIERQGRGNFGRVINFIKTIVSLL------PVSPRQTRIGAVLFSSRP 57 Query: 231 E--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F V + ++ + T + L+Y Y+++F + K Sbjct: 58 YLMFNFQKYRTVRQVLAALQRIRYPRGGTKTGRALRYTYSRLFRSR---------SRVRK 108 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++ +TDG K K +G ++AIG+ L AS + + Sbjct: 109 QALIVLTDG-----KSQDSVGQPAAFIKNQGVELFAIGVGRNYRRRDLNQIASRGNVFTA 163 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 + ++ I I +K Sbjct: 164 K-----FENLGRIIGAIKSKVC 180 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 87/267 (32%), Gaps = 38/267 (14%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + G + +L I + +G ++ S +T + S +D + + A Sbjct: 15 RFIADRSGNIAVLFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTALMVA------- 67 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 K I + S + L N N ++ + ++ +G +I Sbjct: 68 -----KDYSLNKISASEIDGKAKSIFSALYTNKSANSVEVVA------VLTPNTGKGSTI 116 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 K+P F + N I + +S+ S RL + +VLD + SM Sbjct: 117 KVDGTGKVPTDFMKLV----NISQIDIGASSTTTWGS---TRLRVALVLDTTGSMNDNG- 168 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL------LEWGVSHL 243 KI + +L ++K V ++ FS + ++W + Sbjct: 169 ----KIGALKTATQNLLTQLKDAAGKPEDVYVSIIPFSKDVNVGASNYTANWIDW--TDW 222 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + L + S A+ + Sbjct: 223 KSQPPVLDYAKSGSKSGLVDNVAWKDV 249 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 56/165 (33%), Gaps = 25/165 (15%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-NYKKI 290 + + + + + L G TN GL + + + ++ Y+ I Sbjct: 417 KVTEMNNNWATMNTTVDGLFPVG-GTNQPIGLVWGWQSLVGGGPFPTPPVKDEQYTYQDI 475 Query: 291 IVFMTDGENLSTK----------------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 IV M+DG N + + C K G VY + + S E Sbjct: 476 IVLMSDGLNTVDRWYGNGWDTNTSVDNRMYASATTGTCVNVKAAGIKVYTVHVNTNGSPE 535 Query: 335 --FLRACASP-----NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L+ CASP F +V + + AF+ I + R+ + Sbjct: 536 STLLKNCASPADDGGKEFQMVTSASGLNAAFNSIATKLTDLRVAF 580 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 87/238 (36%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T + ++T +D+++ +DVS SM + D +++ + +EE Sbjct: 74 LARPRTVDISNQTKTTKGIDIVMAIDVSGSMLAK-DLKPNRMEALKRVAADFVEERPND- 131 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQI 270 + GLV ++++ + + IK + T GL A N++ Sbjct: 132 ------RIGLVLYASEAYTKTPVTSDKPIILEAIKGIRYDTVLQDGTGIGMGLATAVNRL 185 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 D + ++I+ +TDG N + + ++ + AK+ G VY IG+ Sbjct: 186 KDSKAKS-----------RVIILLTDGVNNAGFIEPETA--ADIAKQYGIKVYTIGLGTN 232 Query: 329 ---------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +++ A + +++ + + + ++ I K Sbjct: 233 GMAESPYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGTYFRATSNDKLAEIYNSINK 290 >gi|71988405|ref|NP_001022675.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|51587416|emb|CAH19087.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] gi|51591699|emb|CAH19103.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3183 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 88/238 (36%), Gaps = 22/238 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + +++ ++ P + + D++ ++D S S+ S+ Sbjct: 605 VDTPDGYTCQCYGGFVDVSSNANLP-PGRVCTVQTTCPKQKTDLVFLIDGSGSIGSY--- 660 Query: 191 SITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWG-VSHLQRKI 247 + + + E V+L + + GL+ +S++I F L ++G L + I Sbjct: 661 ------VFKNEVLRFVREFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGI 714 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 T + +++ + F R+ + ++ ++ + +TDG + Sbjct: 715 SETQYLTGLTRTGAAIQHMVQEGFS---ERRGARPQQSDIARVAIILTDG-----RSQDN 766 Query: 308 SLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + A+K +AIG+ + + E SPN ++ V+ + + + Sbjct: 767 VTGPADSARKLSINTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQK 824 >gi|47214619|emb|CAG01460.1| unnamed protein product [Tetraodon nigroviridis] Length = 1723 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 86/221 (38%), Gaps = 31/221 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+ SV+ + D + D+++++D S S+ ++ K+ ++ + V Sbjct: 422 PVKVSVECSLGVDVQADVVLLVDGSYSIG---LANFAKVRAFLEVL------VNTFDIGP 472 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DM 273 + VQ LV +S F L+ + + ++ G STN+ + Y +F Sbjct: 473 DKVQISLVQYSRDPHTEFYLDSHHNLEAVVTALRTFPYRGGSTNTGRAMTYVRETVFQAS 532 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +G R H ++ + +TDG K + ++A+G++ Sbjct: 533 RGARAH-------VPRVTILITDG-----KSSDAFQEPAANLRNSDVEIFAVGVKDAVRS 580 Query: 334 EF--LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 E + + Y VE+ +DAF I ++ T+ I Sbjct: 581 ELEAIANAPAETHVYTVED----FDAFQRISTEL-TQSICL 616 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 72/226 (31%), Gaps = 40/226 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A D++ ++D S S+ + + + G+V + Sbjct: 108 SAGAAADLLFLVDGSWSVGK---------ANFKHIRAFLSAAASAFQIGQDRTRVGVVQY 158 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTP---------GLKYAYNQI----- 270 + F L+ S L R I L G T + G Y + Sbjct: 159 GSDARVEFRLDAHPSRPALLRAIGTLPYMGGDTRTGSRRSPGRCWLGSNYVVHPFSRSPG 218 Query: 271 ----FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 F ++ ++ K++V +TD +++ E+ + G V+ +G Sbjct: 219 HALKFLLEKSLTEEAGARKDFPKVLVVVTDSKSVDPVENSAGR-----LRSAGVEVFVLG 273 Query: 327 IRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E + ++P N Y V + + ++++++ Sbjct: 274 VGQADEAEMKQIASTPYRNHVYSVAT----FQTIKSVQRELISQLC 315 >gi|71988401|ref|NP_001022674.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|22096361|sp|P34576|MUA3_CAEEL RecName: Full=Transmembrane cell adhesion receptor mua-3; AltName: Full=Muscle attachment abnormal protein 3; Flags: Precursor gi|4761646|gb|AAD29428.1|AF139060_1 transmembrane cell adhesion receptor MUA-3 precursor [Caenorhabditis elegans] gi|14530487|emb|CAA83226.2| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] gi|14530561|emb|CAC42345.1| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3767 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 88/238 (36%), Gaps = 22/238 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + +++ ++ P + + D++ ++D S S+ S+ Sbjct: 1189 VDTPDGYTCQCYGGFVDVSSNANLP-PGRVCTVQTTCPKQKTDLVFLIDGSGSIGSY--- 1244 Query: 191 SITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWG-VSHLQRKI 247 + + + E V+L + + GL+ +S++I F L ++G L + I Sbjct: 1245 ------VFKNEVLRFVREFVELFEIGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLLKGI 1298 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 T + +++ + F R+ + ++ ++ + +TDG + Sbjct: 1299 SETQYLTGLTRTGAAIQHMVQEGFS---ERRGARPQQSDIARVAIILTDG-----RSQDN 1350 Query: 308 SLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + A+K +AIG+ + + E SPN ++ V+ + + + Sbjct: 1351 VTGPADSARKLSINTFAIGVTDHVLASELESIAGSPNRWFYVDKFKDLDTRLRSMIQK 1408 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 82/208 (39%), Gaps = 27/208 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +LD++ ++D S S+ ++ I + ++ + P Q G Sbjct: 363 RTQVCNSGKLDLIFLIDESTSV---LENDFDGIKVWLR------NTISSFPIGEEYTQIG 413 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L T+S+ F L + +++ + + T + + Y N +F H Sbjct: 414 LATYSDNPRIIFHLNKYHKLDDIRKAVLEVEHTSGGTATGKAILYLTNNMFT------HE 467 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N N K+++V +TDG K + AK+ G +++AIG+ + E + Sbjct: 468 NGVRPNAKRLVVVLTDG-----KSQDDVIVPSRIAKESGIVMFAIGVGKVVMGELRAIAS 522 Query: 341 SPNSF-YLVENPHSMYDAFSHIGKDIVT 367 P+ + Y + + + A I +++ Sbjct: 523 DPDRYVYKIND----FSALESIRRELSH 546 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 75/216 (34%), Gaps = 47/216 (21%) Query: 173 DMMIVLDVSRSM-ESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DMM+V+D+S SM E +S + ++ + ++ +++ K + GLV F Sbjct: 84 DMMLVVDLSGSMAEEDMKTSNGDFVDRLTAVKQVVSDFIDQRKGD-------RLGLVLFG 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + ++ ++ L G T GL A + Sbjct: 137 DHAYLQTPLTFDRNTVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESNA---------- 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------ 333 ++ I+ ++DG N + + L AK A +Y +GI Sbjct: 187 -PQRTIILLSDGANTA--GVLEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNT 243 Query: 334 ------EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + L A + ++ N + + + I Sbjct: 244 ARDLDEDTLTKIATMTGGQYFRARNADELAEIYQTI 279 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 93/231 (40%), Gaps = 28/231 (12%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + +G ++TA+ + + L + I+ + + K ++ + +D + A ++ E Sbjct: 6 RQFRRDRRGNYALMTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLT-E 64 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG-- 126 G +++ + D N +ND++ + +L++ + G Sbjct: 65 GATDDQLRAYALDFF-----------------NANLNDLN--PANATLNLTLPSNTAGGG 105 Query: 127 -YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++A YK ++ I+ V + L++ +VLD S SM Sbjct: 106 LLKMTARLNYKPYFYPAFAQLVGKSATDANQSISFDVTSQVRLKNTLEVALVLDNSGSMT 165 Query: 186 SF-FDSSITKIDMAIKS----INAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + S +ID+ + ++ + ++ +I V+ VQ GLV F+ + Sbjct: 166 TLGTGSGQKRIDLLKTASKQLVDTLAQQAVMIKQVDRPVQFGLVPFAASVN 216 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 35/270 (12%), Positives = 78/270 (28%), Gaps = 83/270 (30%) Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEW-- 238 ++ S+++ + + M + + P + +G +S L Sbjct: 392 GAVNSWWNDDPSSTSG-QSRLRNMAKYFQPRPINAPALPTGNGPNYSCTTNAITPLTDVS 450 Query: 239 ---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 G + ++ I + G TN G+ + + + + Q + K+++ +T Sbjct: 451 VTDGATAIKAAIDLMQPNG-GTNVPEGMAWGWRVVSSGEPFTQGRLETEKGNDKVVIVLT 509 Query: 296 DGENL---------STKEDQQS-------------------------------------- 308 DG N S D +S Sbjct: 510 DGANTYYTPSSLSYSDPADSKSTYASYGYLNPGYNGTSVGRMFMGTSTAIGQFDYSNGNY 569 Query: 309 --------LYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACASPNSF-------- 345 CN AK +V + + + + + L++C+S + F Sbjct: 570 TNALNEQMATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSCSSESRFRKDPTDPS 629 Query: 346 -----YLVENPHSMYDAFSHIGKDIVTKRI 370 + S+ + F IG ++ R+ Sbjct: 630 KPAKLFWNATGASLSNDFKEIGNELSNLRV 659 >gi|281350861|gb|EFB26445.1| hypothetical protein PANDA_015099 [Ailuropoda melanoleuca] Length = 1741 Score = 76.3 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P ++ P T A+ D++ ++D S S Sbjct: 955 YKISVYTKLQEIEGPSVSIMEKTQSPPTQPPTFPPTIPPAKEVCRAAKADLVFMVDGSWS 1014 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1015 IG---DENFNKI---INFLYSTVGALNKI--GADGTQVAMVQFTDDPRTEFKLNAYNTKE 1066 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK +S G +T + +K+ + +F + + K+IV +TDG Sbjct: 1067 TLLDAIKRISYKGGNTKTGKAIKHVRDSLFTAESGTRRG------IPKVIVVITDG---- 1116 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++A+G+ E + + S + V++ +DAF Sbjct: 1117 -RSQDDVNKISGEMQSNGYNIFAVGVADADYSELVSIGSKPSSRHVFFVDD----FDAFK 1171 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1172 KIEDELITFVC 1182 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 85/231 (36%), Gaps = 26/231 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + P VK + A D++I++D S S+ F + + LE Sbjct: 106 KGNGNKPTPPEEEVKFFCEIPAIADIVILVDGSWSIGRFN----------FRLVRLFLEN 155 Query: 209 VKLIPDV-NNVVQSGLVTFS-NKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKY 265 + +V + + GL +S + E+ L + + ++ L G +T + L Y Sbjct: 156 LVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNY 215 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + F + A KI + +TDG K + ++ G ++AI Sbjct: 216 IFENSFKPE------AGARAGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAI 264 Query: 326 GIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 G++ +E + P+ Y V M+ + + + ++ D+ Sbjct: 265 GVKNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDR 315 >gi|301780322|ref|XP_002925578.1| PREDICTED: collagen alpha-1(XIV) chain-like [Ailuropoda melanoleuca] Length = 1796 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P ++ P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSPPTQPPTFPPTIPPAKEVCRAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1043 IG---DENFNKI---INFLYSTVGALNKI--GADGTQVAMVQFTDDPRTEFKLNAYNTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK +S G +T + +K+ + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKRISYKGGNTKTGKAIKHVRDSLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++A+G+ E + + S + V++ +DAF Sbjct: 1145 -RSQDDVNKISGEMQSNGYNIFAVGVADADYSELVSIGSKPSSRHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 78/207 (37%), Gaps = 26/207 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKI 230 D++I++D S S+ F + + LE + +V + + GL +S + Sbjct: 158 DIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFNVGSEKTRIGLAQYSGDPR 207 Query: 231 EEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E+ L + + ++ L G +T + L Y + F + A K Sbjct: 208 IEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKPE------AGARAGVSK 261 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYL 347 I + +TDG K + ++ G ++AIG++ +E + P+ Y Sbjct: 262 IGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADENELREIASEPDSTHVYN 316 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWYDK 374 V M+ + + + ++ D+ Sbjct: 317 VAEFDLMHTVVESLTRTVCSRVEEQDR 343 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 77/213 (36%), Gaps = 27/213 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 M +A +D+ VLD S S+ D++ K+ K++ V Sbjct: 1 MFTPGPTTPAPICNALMDLFFVLDGSGSVT---DANFDKMKQFAKNV------VNAFDIS 51 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + G+V +S+ F L I + G T + L++A Sbjct: 52 ASSTRVGVVQYSDSNTLEFNLGDHADKPSTLAAIDSIVYQGGGTRTGSALEFA------- 104 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 R + + K+++ +TDG K N +G VYAIG+ RS Sbjct: 105 ---RVNAAWRGESVPKVMIVVTDG-----KSADSVTSSANNLASQGVDVYAIGVGNYRST 156 Query: 334 EFLRACA-SPNSFYLVENPHSMYDAFSHIGKDI 365 + L A + N+ + + +++ I + + Sbjct: 157 QLLEIAAGNQNNVIELTDFNALSAEIEQIAQAV 189 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 39/183 (21%), Positives = 73/183 (39%), Gaps = 16/183 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS---GLVTFS 227 +D+ VLD S S+ ID ++E+K I + + + G + +S Sbjct: 3 PVDLFFVLDTSESVALRQKPPGYYIDQIKTFTTNFIDELKNIRHQCDRILTWNSGALHYS 62 Query: 228 NKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +++ L + S L+ I + G T + +K +++ + G H N Sbjct: 63 DEVILVGELMDMQTQRSTLKTSISGIEYIGKGTYTDCAIKRGLSELL-IGGSHYHEN--- 118 Query: 285 ANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K IV +TDG L + NEA++ G V+A+ I + L A+ Sbjct: 119 ----KYIVVVTDGHPLTGYKEPCGGVQEAANEARQHGVKVFAVAISPDQEDTRLSLIATD 174 Query: 343 NSF 345 +++ Sbjct: 175 HNY 177 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 54/173 (31%), Gaps = 16/173 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-DVNNVVQSGLVTFSNKIE 231 D+++++D S S+ + + K + + E V+ GL +S Sbjct: 879 DILVMMDSSASVG------QKNFETSKKFVRLLAERFLSAERQGGASVRVGLAQYSRIAR 932 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L ++HL + + TN ++YA + + Sbjct: 933 LEAELTNNLTHLVHHTEQAAFQNDGTNVLGAMEYAIRSLEGRGDTSGGRKK--------L 984 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 V +DG + + E A G V+ I + + LR S Sbjct: 985 VLFSDGRSQAITEALLEKRAREVA-DAGVEVFVIAVGNQVNEANLRTLVSRGR 1036 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 85/220 (38%), Gaps = 31/220 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 A LD+ ++D S S+ S T +A I A+++ + V N + + Sbjct: 663 CKCATLDLAFIVDSSESIGS------TNFALAKDFIIAVIDRLAKDQQVKFAGNDSRVSV 716 Query: 224 VTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V +S ++ + + ++ ++ +K L +T + L++A ++ MRQ Sbjct: 717 VQYSGARAQEVVQLGVNVNSITDFKQAVKDLRWLAEATYTGEALEFALKN--TIRLMRQE 774 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--- 335 ++++ +TDG + + ++ C N + G V R + Sbjct: 775 N--------RVVLVLTDGRSDTDRDQVPLNVLCGNNVRVGGLGVKDYSGRRPNQEQLEDV 826 Query: 336 -LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + P ++EN + D +++ ++ I DK Sbjct: 827 VCKNDPKPGFSLVLENFAELLD--DTFLQNLTSQ-ICQDK 863 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 76.3 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 68/469 (14%), Positives = 141/469 (30%), Gaps = 106/469 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV-------- 59 + F G MTI + + ++ LV G+ +++ M+T + + +DR+++ Sbjct: 20 LMGFRREEDGAMTIFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAASDLDQP 79 Query: 60 -------------HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF--- 103 T+ +++ L + + + G Sbjct: 80 LPADEVVDDYFAKAGMTEFLDDVQITPGAHLPTTNFRVVQAEARTRTPSIYMAMTGVRSL 139 Query: 104 ----VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK-FCTFIPWYTNSRHIVMPI 158 ++ + +T + +V+ + I + K F + + + I Sbjct: 140 PVYVAGTAEETIENTEISLVLDISGSMRNNGKIGNLRTAAKDFIGAVLEGNAANTTSLNI 199 Query: 159 TSSVKVNS-------QTDARLDMMIVLDVSRSM----ESFFDSSITKIDMAIKSINAMLE 207 + + R + D + ++F D ID+ ++++ L+ Sbjct: 200 VPYAGQTNPGPIVFQRAGGRPFATFIEDSDGNEILYGQTFVDDEGNSIDVPYNTMSSCLD 259 Query: 208 EVKLIPDVNNVVQSG---LVTFSNKIEEFFLLEWG---------------VSHLQRKIKY 249 D ++ G F N + ++WG LQ I Sbjct: 260 LTNGDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNKSSIRYAQNNAGQLQDFIDD 319 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGM----------------RQHCNTEDANYKKIIVF 293 + + T + G+KY + + + + +K IV Sbjct: 320 M-RLHDGTGTQYGMKYGVALLNPSSRDTFVALNAAGLVPDGFKDRPADFGTTDTRKFIVL 378 Query: 294 MTDG---------ENLSTKEDQQSLY---------------------YCNEAKKRGAIVY 323 MTDG + K D+ +L CN+AK G VY Sbjct: 379 MTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTNVANFYSICNKAKAAGITVY 438 Query: 324 AIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 I + +R CA+ +F+ + AF I + I R+ Sbjct: 439 TIAFEAPANAITQMRTCATSPAFFYKVEGVEIKTAFKSIARQINELRLT 487 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 76.3 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ + +E+ + GLV F++ Sbjct: 87 DLMLVVDLSGSMQQEDILQDGDYIDRLSSVKNVVTQFIEQ-------RQGDRLGLVLFAD 139 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ + G T GL A D + Sbjct: 140 HAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEA----------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + D + N AKK G +Y IGI Sbjct: 189 PQRVVILLSDGSNTAGTLDP--IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTS 246 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 247 ADLDEKTLTKIATMTGGQYFRARDAQELQTIYQAINQ 283 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 76.3 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 69/462 (14%), Positives = 131/462 (28%), Gaps = 123/462 (26%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + G + I+ A+ L ++G+ I+ KT L + D +++ N Sbjct: 20 SDRGGSINIMFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIASN 79 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++ L +N +F F V + LD+ Q +++A Sbjct: 80 AKQGDATAAGLTAGRNQATKAFAINTGKVPFA-----TVSVSRLDVTRSGQ----TLTAT 130 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-------- 184 Y ++ +S IT+S + S LD +++DVS SM Sbjct: 131 VIYTATIQNTFGKILGLSSTTFTNTITASADLAS----YLDFYLMVDVSGSMGLPTAAAD 186 Query: 185 -----------------------ESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNNV 218 ++++ KI + ++N + E+ P V N Sbjct: 187 AEKLASITKEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVNNAVCELLKRAATPVVPNQ 246 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMR 277 + G F N++ L + + T L Q+F Sbjct: 247 YRIGFYPFINRLATLSPLSDTTTSMTALRTAAQCDKTWPLAFTNLLDTGSTQLFTGNNPT 306 Query: 278 QHCNTEDANYKK-----------------------IIVFMTDG----ENLSTKEDQQSL- 309 + +++K + +TDG ++ ST D ++ Sbjct: 307 TGTGSGGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLITDGMQNSQSYSTNNDARTFP 366 Query: 310 ------------------------YYCNEAKKRGAIVYAIGIRVIRSHEF---------- 335 C E K GAI+ + I + + Sbjct: 367 GSPSLFKGYGNAGWDGSQPAQIDPSKCKELKDAGAIISILYIPYNQVKNYTNDSYIVWEN 426 Query: 336 -------------LRACASPNSFYLVENPHSMYDAFSHIGKD 364 LR CAS FY + + + + Sbjct: 427 NRVNGFSPTLADPLRKCASQGFFYTANSADDITASLGAMFDQ 468 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 76.0 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 43/257 (16%), Positives = 88/257 (34%), Gaps = 53/257 (20%) Query: 139 LKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 LK F +++ P T V T +D+++ +D+S SM + D + Sbjct: 54 LKPVLFAMRLLALSALIVAMARPRTVDVSNKRNTTNGIDIVMAIDLSSSMLAK-DFKPNR 112 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 ++ + + +E + + G+V ++ + + + + I + Sbjct: 113 MEALKEVAASFVEARQSD-------RIGVVVYTAEAYTKTPVTSDKAVVLDAINTIKYDN 165 Query: 255 ---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 T GL A N++ D + K+I+ MTDG N + + + Sbjct: 166 VLQDGTGIGMGLATAVNRLKDSKAKS-----------KVIILMTDGVNNAGFIEPVTA-- 212 Query: 312 CNEAKKRGAIVYAIGIR-----------VIRS------------HEFLRACA--SPNSFY 346 AK+ G VY IGI + ++ A + ++ Sbjct: 213 AEFAKEFGIKVYTIGIGTNGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYF 272 Query: 347 LVENPHSMYDAFSHIGK 363 ++ S+ +S I K Sbjct: 273 RAQSNSSLESIYSEINK 289 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ + +E+ + GLV F++ Sbjct: 87 DLMLVVDLSGSMQQADILQDGDYIDRLSAVKNVVTQFIEQ-------RQGDRLGLVLFAD 139 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ + G T GL A D + Sbjct: 140 HAYLQTPLTADRQTVANQLNQTIIGLIGQKTAIGDGLALATKTFVDSEA----------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + D + N AKK G +Y IGI Sbjct: 189 PQRVVILLSDGSNTAGTLDP--IEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTS 246 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 247 ADLDEKTLTKVATMTGGQYFRARDAQELQTIYQAINQ 283 >gi|326681146|ref|XP_002665520.2| PREDICTED: collagen alpha-1(XXII) chain [Danio rerio] Length = 1623 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 77/215 (35%), Gaps = 26/215 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ +LD S S+ + + + ++E + D + Sbjct: 28 QRAGCKNVHYDLVFILDTSSSVGK------ENFEKIRQWVANLVESFDVGVDK---TRVA 78 Query: 223 LVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++ F L + ++R + G +T + + + IF R Sbjct: 79 VVRYSDRPTVEFNLARYKTLEEVKRAAGNIRYLGGNTKTGDAISFTTTNIFT---ERAGA 135 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +K+ + +TDG++ L A G ++A+GI E A Sbjct: 136 RPAAKGIQKVAILLTDGQSQDF-----VLEPSVAAAAAGIRLFAVGIGEALKEELEEIAA 190 Query: 341 SPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P + V + +DA I + + +R+ + Sbjct: 191 EPKSAHVFHVTD----FDAIDKI-RGRLRRRLCEN 220 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 71/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D S +++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVSPSRLAAAQEASKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L + T + G+ A I + + D IV Sbjct: 152 VSPTTNREATKTAIDKL-QLADRTATGEGIFTALQAIATVGAV---IGGGDEPPPARIVL 207 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFL 336 +DG+ + AK +G + I + + L Sbjct: 208 FSDGKETVPSNPDNPKGAFTAARTAKDQGVPISTISFGTPYGYVEINEQRQPVPVDDQML 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + + + + ++++ + I Sbjct: 268 KKIADLSEGEAFTASSLEQLREVYANLQQQI 298 >gi|255522879|ref|NP_001157342.1| collagen alpha-1(XIV) chain [Equus caballus] Length = 1796 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 54/335 (16%), Positives = 115/335 (34%), Gaps = 42/335 (12%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + + + G N + LC + + + + D Sbjct: 903 QASGFSDALTGVVKTLFLGVTNLQADQIQMTSLCARWQVQRHATAYRVV----IESVQDT 958 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKI-----------PLKFCTFIPWYTNSRHIVMPIT 159 + ++ ++ Y + S YKI P ++ P T Sbjct: 959 KKQETIVGGGTTRHCFYGLQPDSEYKISVYTKLQEIEGPSVSIMEKTQSPPTQPPTFPPT 1018 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 A+ D++ ++D S S+ D + KI I + + + + I + Sbjct: 1019 VPPAKEVCKAAKADLVFMVDGSWSIG---DENFNKI---INFLYSTVGALNKI--GADGT 1070 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q +V F++ F L+ L IK++S G +T + +K+ + +F + Sbjct: 1071 QVAIVQFTDDPRTEFKLDAYKNKETLLDAIKHISYKGGNTKTGKAIKHVRDTLFTEESGT 1130 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K+IV +TDG + E + G ++A+G+ E + Sbjct: 1131 RRG------IPKVIVVITDG-----RSQDDVNKISKEMQSDGYSIFAVGVADADYSELVS 1179 Query: 338 ACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P+ + V++ +DAF I +++T Sbjct: 1180 IGSKPSARHVFFVDD----FDAFKKIEDELITFVC 1210 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 83/223 (37%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK +T A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFFCETPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFS-NKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + E+ L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGARSGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADEN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ DK Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDK 343 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 70/203 (34%), Gaps = 22/203 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + + + + ++ +T+S Sbjct: 70 QGSFDLYFILDKSGSVNNNWID-------LYMWVEETVARFQSSD-----IRMCFITYST 117 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + L K ++ ++ + + N Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTF--MQAGFRKAIQQIETFNSGNK----VP 171 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +I+ MTDGE L Q +L +A+K GA VY +G+ + + SP + V Sbjct: 172 SMIIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTVGVADYKLDQITAIADSPEHVFAV 230 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 EN + A + +K Sbjct: 231 ENG---FKAMRDTVDALTSKVCL 250 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 70/203 (34%), Gaps = 22/203 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + + + + ++ +T+S Sbjct: 70 QGSFDLYFILDKSGSVNNNWID-------LYMWVEETVARFQSSD-----IRMCFITYST 117 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + L K ++ ++ + + N Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTF--MQAGFRKAIQQIETFNSGNK----VP 171 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +I+ MTDGE L Q +L +A+K GA VY +G+ + + SP + V Sbjct: 172 SMIIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTVGVADYKLDQITAIADSPEHVFAV 230 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 EN + A + +K Sbjct: 231 ENG---FKAMRDTVDALTSKVCL 250 >gi|296227292|ref|XP_002759309.1| PREDICTED: collagen alpha-1(XIV) chain [Callithrix jacchus] Length = 1796 Score = 76.0 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 57/331 (17%), Positives = 114/331 (34%), Gaps = 34/331 (10%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILC---RIKNTWNM-SFRNELRDNGFVND 106 S + + + G N + K LC ++ E + + Sbjct: 903 QSSGFSDALTGMVKTLFLGVTNLQTKHVEMTSLCAHWQVHRHATAYRVVIESLQDTQKQE 962 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVK 163 +T + + Y IS ++ + P +R P T Sbjct: 963 STVGGGTTRHCFYGLQPDSEYKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPA 1022 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 A+ D++ ++D S S+ D + KI I + + + + I + Q + Sbjct: 1023 KEVCKAAKADLVFMVDGSWSIG---DDNFNKI---INFLYSTVGALNKI--GTDGTQVAM 1074 Query: 224 VTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F++ F L + L IK +S G +T + +KY + +F + + Sbjct: 1075 VQFTDDPRTEFKLNAYKTKETLLDAIKRISYKGGNTKTGKAIKYVRDTLFTAESGTRRG- 1133 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K+IV +TDG + E + G ++A+G+ E + + Sbjct: 1134 -----IPKVIVVITDG-----RSQDDVNKISKEMQSDGYSIFAVGVADADYSELVSIGSK 1183 Query: 342 PN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 P+ + V++ +DAF I +++T Sbjct: 1184 PSARHVFFVDD----FDAFKKIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDR 343 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 78/227 (34%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++M+ LD+S SME D +++ + + + + +L Sbjct: 81 EPKPIEREGYNIMMALDLSGSMEIPDMILHDRPASRLTVVKNAAEQFVRD-RLGD----- 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F ++ L + + +I+ + G +T+ + A ++ + Sbjct: 135 -KIGLILFGSRAYLQTPLTYDRQTVLLRIEDATVGLAGKTTSIGDAVGLAVKRLDAVPQK 193 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----- 331 ++I+ +TDG N + L AK G +Y IG+ Sbjct: 194 G-----------RVIILLTDGANN--SGILEPLKAAELAKDEGIKIYTIGLGAATDPRAL 240 Query: 332 SHEFLRACA---------------SPNSFYLVENPHSMYDAFSHIGK 363 ++ FL A + ++ + ++ + I + Sbjct: 241 TNGFLMQAAAADLDEETLKEMSAMTGGRYFRATDTATLNSIYKTINQ 287 >gi|260797332|ref|XP_002593657.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] gi|229278884|gb|EEN49668.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] Length = 949 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 70/199 (35%), Gaps = 27/199 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A+LD+M+VLD S S+ K++ V ++ + G+V +S+ Sbjct: 25 AKLDLMLVLDGSGSVGD---------ADFAKTLEFAENVVNAFDIGTDLTRVGVVQYSDT 75 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L + + T + L++A R + N A Sbjct: 76 PTMEFNLGVHADKGSTIAAVNNIQYQNGGTATGAALEFA----------RANANWRGAPV 125 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFY 346 K+++ +TDG K G VYAIG+ E + + N+ Sbjct: 126 PKVMIVVTDG-----KSGDDVTAAAQALAGEGVAVYAIGVGNYDLPELQQIANGNNNNVI 180 Query: 347 LVENPHSMYDAFSHIGKDI 365 +++ +++ A I + Sbjct: 181 ELQDYNALTAAIDQIAGQV 199 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/250 (12%), Positives = 70/250 (28%), Gaps = 14/250 (5%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F KT + + ID + + + + + +I ++ Sbjct: 382 LNAHFTKTAVLNAIDNIVYMGGGTLTGTAITYMKDNSQWRPNVAKIAIVVTDGKSSDDVA 441 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT- 159 + ++ + S S + + + + Sbjct: 442 APSSAAQQAGITMHAIGVGANVDQTELS-QIASTSQYVTNVADYDALDAQMAQLTASVCN 500 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +A LD+ VLD S S+ ++ K+ K++ V Sbjct: 501 GPTTPAPTCNAPLDLFFVLDGSGSVT---GANFDKVKQFTKNV------VNAFDISATAT 551 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGM 276 + G+V +S+ F L I + G T + L++A N + + Sbjct: 552 RVGVVQYSDSNTLEFNLGDHADKPSTLAAIDSIVYQGGGTTTGSALEFARVNAAWRGGSV 611 Query: 277 RQHCNTEDAN 286 +H + Sbjct: 612 PKHLVRTRSP 621 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 48/127 (37%), Gaps = 18/127 (14%) Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G++ +S+ ++E F L + + I + G T + + Y + D R Sbjct: 721 VGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGAAITY----MKDNSQWRP 776 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 KI + +TDG++ S A++ G ++AIG+ L Sbjct: 777 GVA-------KIAIVVTDGKSSDDVGPPSSA-----AQQTGITMHAIGVGANVDQTELSQ 824 Query: 339 CASPNSF 345 AS + + Sbjct: 825 IASTSQY 831 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 47/128 (36%), Gaps = 18/128 (14%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q G++ +S+ ++E F L + + I + G T + + Y + Sbjct: 366 QIGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGTAITYMKD--------- 416 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + N KI + +TDG++ S + G ++AIG+ L Sbjct: 417 --NSQWRPNVAKIAIVVTDGKSSDDVAAPSSAAQ-----QAGITMHAIGVGANVDQTELS 469 Query: 338 ACASPNSF 345 AS + + Sbjct: 470 QIASTSQY 477 >gi|297471458|ref|XP_002685231.1| PREDICTED: collagen type VI alpha 4-like [Bos taurus] gi|296490811|gb|DAA32924.1| collagen type VI alpha 4-like [Bos taurus] Length = 780 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 57/350 (16%), Positives = 116/350 (33%), Gaps = 39/350 (11%) Query: 29 IIFLVLGMIIEVSHIFFMKTV-----LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL 83 + F V+ EVS F + L +D M E G ++ Sbjct: 206 VQFGVVQYSDEVSPQFTLSQHSSVAGLEVAVDSIQQKGGGTKMGEALG---SMIQVFADS 262 Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP-QNEGYSISAISRYKIPLKFC 142 R W + + + V D + +R + I V ++ + Sbjct: 263 ARSNVPWYLIVVTDGQSMDPVADAAEALRGHGVTIYAVGVRDANIAELQEIAEDRMFFVH 322 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 F T + +V I S + + + D++ ++D S S+ K+ +K Sbjct: 323 DFESLKTIQQEVVQDICSL---ETCKNRKADIIFLMDGSESIS---PKDFEKMKEFMK-- 374 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNST 260 V + +Q GL+ FS+ +E F L S + R I + + T + Sbjct: 375 ----RMVNQSNIGADEIQIGLLQFSSDPQEEFRLNRYSSKVDVHRAISDVKQINGGTYTG 430 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L + +G + + ++ +TDG + R Sbjct: 431 KALNFTLPFFGSSRG-------GRPSVHQYLIVVTDG-----VSRDNVALPAKALRDRNI 478 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 I++AIG+ ++ + L + Y E +++ ++ K+I+ + Sbjct: 479 IIFAIGVGEVKFSQLLEITNDQSKVYYEEK----FESLQNLEKEILYQVC 524 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 76/204 (37%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S + +N +++ + PD VQ G+V +S+++ Sbjct: 169 DIYFLIDGSGS------THQDDFLAMKVFMNEVIKMFHVGPD---RVQFGVVQYSDEVSP 219 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S L+ + + + G T L ++ ++ +N Sbjct: 220 QFTLSQHSSVAGLEVAVDSIQQKGGGTKMGEAL---------GSMIQVFADSARSNVPWY 270 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG+++ D + G +YA+G+R E + + V + Sbjct: 271 LIVVTDGQSMDPVADA-----AEALRGHGVTIYAVGVRDANIAELQEIA--EDRMFFVHD 323 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 S+ + +DI + ++ Sbjct: 324 FESLKTIQQEVVQDICSLETCKNR 347 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 19/155 (12%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-NYKKI 290 +W L+ +I ++ G +TN + GL + + + G E Y+ Sbjct: 389 TAMSSQWST--LKNQIDAMTPSG-NTNQSIGLAWGWQSLSTTNGPIAAPGKESGYVYQDY 445 Query: 291 IVFMTDGENLS------------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---EF 335 IV ++DG N D + C + K G ++ I + V + Sbjct: 446 IVLLSDGLNTQNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTIQVNVGSKDPLSQV 505 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L+ CAS +F ++ + DAF +I I R+ Sbjct: 506 LQNCASDGNFQMITSATETADAFQNILTQISQLRL 540 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 61/205 (29%), Gaps = 28/205 (13%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 +G ++ + ++ + + +D +++ + ++ +I N Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLMVSKDAAANPTMTSQ----------QITNAV 50 Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 F ND + S + I A + I F I + Sbjct: 51 QRYFT------SLYNDKSAFGVTVSATYTPSSSSAAAKILASGQGAIQTDF-MKIAGFPQ 103 Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 TSS + R + +VLD + SM S K+ ++ M++ + Sbjct: 104 LS----FGTSSTSTWGNSRMR--VALVLDNTGSMSSNG-----KMSALQRAAKDMIDSLS 152 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFL 235 V ++ FS + Sbjct: 153 AFAKKTGDVYISIIPFSKDVNVDTS 177 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 49/208 (23%), Positives = 81/208 (38%), Gaps = 30/208 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + H + K +++D++ VLD S SM S D + +I+ K I Sbjct: 746 KTEHFSTFLLGD-KNMPVDLSKVDIVFVLDNSGSMSSN-DPNYYRIEATKKFIQN----- 798 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 I ++NN + GLV F + + L + L + + + G STN GLK A Sbjct: 799 --IDELNN--RVGLVDFDSSVYVRSNLTSDKNKLLQALNAMRWTGGSTNIGGGLKAALE- 853 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +FD + KKIIV ++DG N + K+ +V I + Sbjct: 854 LFDQEQS-----------KKIIVLLSDGYHNTGIHPNDVLPELI----KQEIVVNTIALG 898 Query: 329 VIRSHEFLRACA--SPNSFYLVENPHSM 354 E L A + ++ V+N + Sbjct: 899 KDCDRELLHDIADKTKGDYFYVDNTGGL 926 >gi|260823577|ref|XP_002606157.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] gi|229291496|gb|EEN62167.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] Length = 550 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 83/212 (39%), Gaps = 19/212 (8%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +T+ +V LD++ LD S S+ ++ IN +++ + PD Sbjct: 196 VTAVEEVAGCQATSLDLVFALDGSASVGDN------NFQLSKDFINTIIQSFTIGPD--- 246 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQ 274 + + G+V +S + L+ +S L + ++ + G+ T++ L+Y F ++ Sbjct: 247 LTRVGVVQYSFSVNLELELKDHLSAAPLIQAVEEIEYDEGLFTHTGEALQYITTHSFAVE 306 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ + +TDGE+ + + A++ G V+ +G+ + Sbjct: 307 NGAREVRKG---VPKVSILLTDGESGDYHD---VTVWSQRAQEAGITVFPVGVTNNVDQQ 360 Query: 335 FLRACAS-PNSFYLVENPHSMYDAFSHIGKDI 365 L A Y V + + + + + Sbjct: 361 ELETMAGHAGKVYRVGWFRDLEKILTDMEESL 392 >gi|119575261|gb|EAW54866.1| hCG2002731, isoform CRA_c [Homo sapiens] Length = 766 Score = 76.0 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 55/182 (30%), Gaps = 49/182 (26%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E L +S + I ++ T + GL +A+ + + D K Sbjct: 459 PEPLTRLTNNLSTVTAAIDSMNYWLNGGTVISEGLMWAWRTLSPQKPYADGAAYTDKKTK 518 Query: 289 KIIVFMTDGEN------LSTKE--------------------------------DQQSLY 310 K+IV MTDG N + D + Sbjct: 519 KVIVLMTDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQTFLDDRLKK 578 Query: 311 YCNEAKKRGAIVYAIGIR---------VIRSHEFLRACAS-PNSFYLVENPHSMYDAFSH 360 C+ AK +G +Y + RS L CAS P +L + ++ AF Sbjct: 579 ACDNAKAKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSAALNSAFGQ 638 Query: 361 IG 362 I Sbjct: 639 IA 640 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 66/207 (31%), Gaps = 38/207 (18%) Query: 26 FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCR 85 +P+ F+ G I+ +++ L D G + Sbjct: 8 MIPVTFM-AGAAIDYGRATLLRSSLQKAAD-----------------------AGALAAG 43 Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + R ++ N + ++ R+ SL I + G + + +I T I Sbjct: 44 ARTSLTQLAREQIAKNAVLANLGAKARNLSLTITETEPSAG-----VFQVQIQASIATSI 98 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 I + +TS +V + +++ + LD + SM ++ + Sbjct: 99 MKVARFDTIPVVVTSEARVVGGSTNPIEIALALDNTGSMRDDMP-------ALKQAAKTL 151 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + V N V+ +V + + Sbjct: 152 AQNVMSGSGGN--VKVSVVPYVAAVNP 176 >gi|119575262|gb|EAW54867.1| hCG2002731, isoform CRA_d [Homo sapiens] Length = 768 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 81/231 (35%), Gaps = 25/231 (10%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M S+ ++ F G+ + A+FL +I ++G I+ + K L + D +++ Sbjct: 1 MRSIQKTLKQFLNATNAGVAPMFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLA 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 AA + K + D + + + N F N+ +V +IV Sbjct: 61 AARAYLTSSVQVKETKRQE-DSQKIASDYLTANLLSS--SNNFENNQIQLVFKEDGEIVG 117 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 K +V+P ++ V +L++++VLD Sbjct: 118 NAST---------------KIKLIFGGLFGKSDVVLPALAAATVG--DSRKLEIVLVLDT 160 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 S SM SS ++ + + V VQ G+V ++ + Sbjct: 161 SGSM-----SSQNRMKQLRTASINFVNSVFDNAVYERTVQVGVVPWNATVN 206 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 30/230 (13%), Positives = 65/230 (28%), Gaps = 60/230 (26%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 TK + I S + + P + + + + + IK L++ Sbjct: 277 TKDERKISSSGNVTPLTDVPPSKMKWPARKVAGWDPNSDCPSPMLAMSQSRPQIIKKLNQ 336 Query: 253 FGVSTNSTPGL-----------KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 S N+ + + +N F + + +KI++ +TDGEN + Sbjct: 337 LNPSGNTHADIGLMWGYRMFSQQANWNNFFGYNSDTKPDSFHSTKSRKIMIMLTDGENTA 396 Query: 302 TKEDQQS--------------------------------------------LYYCNEAKK 317 T + S L C + Sbjct: 397 TNSEGYSYYGWCTYTNHYNKWGRYTGSTKDCEVPKGINKDEISNNDLNSLMLDACEVIRS 456 Query: 318 RGAIVYAIGIRVIRSHE-----FLRACASPNSFYLVENPHSMYDAFSHIG 362 + ++ I + + ++ LR CA +S + + + F + Sbjct: 457 KDVELFTIALDLHSYYDSTAIALLRECAGSDSHAYNIKGNELDETFQELA 506 >gi|118093056|ref|XP_421769.2| PREDICTED: similar to AMACO [Gallus gallus] Length = 800 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 80/219 (36%), Gaps = 29/219 (13%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN- 216 I S LD++ +D S + + ++ + + +N Sbjct: 517 ICSVNSPEGCQVQSLDLVFAVDASAGVG---------LANFLQ-LRDFVRSSSWQFTINR 566 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +V Q LV + ++ F L+ ++ L + I + G +++ L + ++ + +Q Sbjct: 567 DVTQIALVAYGSRARTVFALDTYTNNSALLQAINQMPFLGDVASASSALLHIHSDVMTVQ 626 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 K++V +T+G + + + + G +V+ + I Sbjct: 627 ------KGARPGVSKVVVLLTNGGGM-----EDAAAPARHLRDNGVMVFVVVIGDAERDT 675 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 LR SP+ + + + +D++T+RI + Sbjct: 676 LLRVAGSPSYLVHISSYEDLQRY-----QDLITERICEE 709 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 73/189 (38%), Gaps = 28/189 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + + + + P V+ G+V FS+ Sbjct: 47 ASVDILFLLDGSYSIG------RGSFERSKHFAGKLCDALDIHP---GRVRVGMVQFSSA 97 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ ++ +IK + G ST + LKY + + Sbjct: 98 PHLEFSLDSYLTKQEVKERIKRTAFRGGSTETGRALKYIL--------HKGFPGGRNLTV 149 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNS 344 KI++ ++DG K + + K+RG V+A+GI+ R E L A AS Sbjct: 150 PKILIIISDG-----KSQGSTAVPAMQVKERGTTVFAVGIKFPRWEE-LHAVASEPTEQH 203 Query: 345 FYLVENPHS 353 + Sbjct: 204 VLFAGDASD 212 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 52/341 (15%), Positives = 113/341 (33%), Gaps = 39/341 (11%) Query: 37 IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 +++ S + L + + AA + + G ++ +NT + Sbjct: 32 VVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTARIDVAR 91 Query: 97 ------ELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSISAISRYKIPLKFCTFIP 146 + + V + +R ++I V + I+ K+ F Sbjct: 92 YLIVITDGQSTDPVAEAAQGLRDIGVNIYAIGVRDANTT--ELEEIASKKM-FFIYEFDS 148 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + ++ I SS SQ D++ ++D S S+ K+ ++ Sbjct: 149 LKSIHQEVIRDICSSENCKSQKA---DIIFLIDGSESI---APKDFEKMKDFME------ 196 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLK 264 V + +Q GL+ FS+ +E F L S + R I + + T++ L Sbjct: 197 RMVNQSNIGADEIQIGLLQFSSNPQEEFRLNRYSSKVDMCRAILSVQQMSDGTHTGKALN 256 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + +G + ++ +TDG + R I++A Sbjct: 257 FTLPFFDSSRG-------GRPRVHQYLIVITDG-----VSQDNVAPPAKALRDRNIIIFA 304 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 IG+ ++ + L + + EN S+ I ++ Sbjct: 305 IGVGNVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEV 345 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 69/179 (38%), Gaps = 24/179 (13%) Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 + ++ +K + +K+ + V+ G+V +S+KI F L ++ L Sbjct: 4 PNDFIEMKDFMKEV------IKMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAA 57 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I + + G T + L + + NT + + ++ +TDG++ Sbjct: 58 IDNIQQVGGGTTTGKAL---------SKMVPVFQNTARIDVARYLIVITDGQSTDP---- 104 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + G +YAIG+R + E L AS F+ + S+ + +DI Sbjct: 105 -VAEAAQGLRDIGVNIYAIGVRDANTTE-LEEIASKKMFF-IYEFDSLKSIHQEVIRDI 160 >gi|311273682|ref|XP_003133985.1| PREDICTED: integrin alpha-1-like, partial [Sus scrofa] Length = 506 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 105/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L + ++ ++ + + +V +G ++ Y + + Sbjct: 115 IKLDLPVTTSIPNVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 174 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 175 PTFQVLNSIAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 226 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 227 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANEIVQRGGRQTMTALGIDTARKEA 283 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ + + C ++I I Sbjct: 284 FT------EARGARRGVKKVMVIVTDGESHDNHRLNKVIQDCE---DESIQRFSIAILGS 334 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 335 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATADQ 390 >gi|149773093|emb|CAO01896.1| collagen type VI alpha 6 [Mus musculus] Length = 1120 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 74/201 (36%), Gaps = 21/201 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++D++ ++D S S+ K+ + S+ V+ N V+ G+ F Sbjct: 5 CEIEKVDLVFLMDGSNSIH---PDDFQKMKGFLVSV------VQDFDVSLNRVRIGVAQF 55 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ FLL + +I+ + + T+ L+ + + M N Sbjct: 56 SDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDALRKV--KYYFQPDMGSRINAGT 113 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +TDG + E + +G +Y++GI + E ++ + Sbjct: 114 ---PQVLLVLTDGR-----SQDEVAQAAEELRHKGVDIYSVGIGDVDDQELVQITGTAEK 165 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 V N + I ++I Sbjct: 166 KLTVHNFDELKKVKKRIVRNI 186 >gi|149520989|ref|XP_001515959.1| PREDICTED: similar to Matn2 protein, partial [Ornithorhynchus anatinus] Length = 161 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 63/154 (40%), Gaps = 19/154 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T+ +D++ V+D S+S+ ++ + + +L+ +++ P + GL+ + Sbjct: 1 CTEGPVDLVFVIDGSKSLGE------ENFEVVKRFVAGILDSLEVSPHAA---RVGLLQY 51 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S + F L S +++ + ++ G + + LK + + F Sbjct: 52 STHVRTEFTLRDFGSARDMKKAVAHMRYMGKGSMTGMALKQMFERSFTEAE---GARPPS 108 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 A ++ + +TDG + + ++AK+ Sbjct: 109 AGVPRVAIAVTDG-----RAQDDVSEWADKAKQN 137 >gi|301772356|ref|XP_002921600.1| PREDICTED: integrin alpha-1-like, partial [Ailuropoda melanoleuca] Length = 1160 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 74 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 133 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + + +N +LE + + P Sbjct: 134 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLERMDIGP 185 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 186 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEA 242 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ + + + C ++I I Sbjct: 243 FT------EARGARRGVKKVMVIVTDGESHDNHQLNKVIQDCE---DENIQRFSIAILGS 293 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 294 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATADQ 349 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 80/228 (35%), Gaps = 29/228 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +M+V+DVS SM++ D T+++ A + + E + Sbjct: 71 LTVALAGPTAEQKVPRNRATVMLVIDVSLSMKA-TDVQPTRLEAAQVAAKSFAEGLTPGI 129 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GL++F+ S + + I L ST + + A + I Sbjct: 130 N------LGLISFAGSATVLVAPTTDRSAVSQGIDGLK-LAQSTATGDAIVAALSAIDSF 182 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 + + +V MTDG+ T +++ +AK+ G + I R Sbjct: 183 GKVVGGADGPPPAR---VVLMTDGK--ETVGTRKATDAAGDAKEAGIPISTISFGTERGS 237 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ A S F+ + + + +G+ I Sbjct: 238 VDINGKAQEVPVDDESMKEIAKISGGEFFKAASAEELRRVYDTLGEQI 285 >gi|194474004|ref|NP_001124020.1| collagen alpha-1(XIV) chain [Rattus norvegicus] gi|149066377|gb|EDM16250.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 1794 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 48/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P + P T A+ D++ ++D S S Sbjct: 981 YKISVYTKLQEIEGPSVSIMQKTQSLPTEPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1040 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH- 242 + D + KI I + + + + I + Q +V F++ F L+ + Sbjct: 1041 IG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDAYKTKE 1092 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L I+++S G +T + +K+ + +F + K+IV +TDG Sbjct: 1093 TLLDAIRHISYKGGNTKTGKAIKHVRDTLFTADSGTRRG------IPKVIVVITDG---- 1142 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++AIG+ E +R + S + V++ +DAF Sbjct: 1143 -RSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVFFVDD----FDAFK 1197 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1198 KIEDELITFVC 1208 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 47/263 (17%), Positives = 93/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K N P +K +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPRVKAVDKGNGSKSTSP--EEIKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|109077202|ref|XP_001094788.1| PREDICTED: integrin alpha-1 [Macaca mulatta] Length = 1179 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKMDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|281339017|gb|EFB14601.1| hypothetical protein PANDA_010505 [Ailuropoda melanoleuca] Length = 1153 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 75 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 134 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + + +N +LE + + P Sbjct: 135 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLERMDIGP 186 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 187 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEA 243 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ + + + C ++I I Sbjct: 244 FT------EARGARRGVKKVMVIVTDGESHDNHQLNKVIQDCE---DENIQRFSIAILGS 294 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 295 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATADQ 350 >gi|316973220|gb|EFV56840.1| putative transmembrane cell adhesion receptor mua-3 [Trichinella spiralis] Length = 3249 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 75/221 (33%), Gaps = 27/221 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P D+M VLD S S+ S + I L E + L Sbjct: 873 PGRVCTIQTYCPTQPTDLMFVLDGSGSIGS---------AVFKNEILRFLREFINLFTIG 923 Query: 216 NNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN----- 268 +N + ++ +S++I E F + + + + T + L + Sbjct: 924 SNHTRLAIIQYSDQIRHELDFKEANSKAEVDEALNRVEYLTGLTKTGDALTDMFKIGFSS 983 Query: 269 ----QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + F + + + ++ + +TDG + + NEAKK +++A Sbjct: 984 TFATKFFTILSESRGARPIETGVHRVAIVITDG-----RSQDIVSFSANEAKKSNVLMFA 1038 Query: 325 IGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 +G+ + E + S + +LV+ + + + Sbjct: 1039 VGVTDHVSEAELVEIAGSKDRVFLVKEFTDLNVRLRSLIQK 1079 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ + Y + + ++ + T Sbjct: 386 EGAVENEKQYVLEPNFANKAVCRTNGFYSLNVQNWYGLHKSVQPLVKRVCDTDRLACSKT 445 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E + Sbjct: 446 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVANLSKEFDISDTD---T 488 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + + + IK + + T++ + YA Q+F Sbjct: 489 RVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF------ 542 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 543 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMVAHHKGVITYAIGVAWAAQEELEI 594 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + I ++I Sbjct: 595 IATHPARDHAFFVDEFDNLYKSVPKIIQNI 624 >gi|187470892|sp|A6NF34|ANTRL_HUMAN RecName: Full=Anthrax toxin receptor-like; Flags: Precursor Length = 565 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 74/203 (36%), Gaps = 22/203 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + ID+ + + E V N ++ +T+S Sbjct: 72 QGSFDLYFILDKSGSVNNNW------IDLYM----WVEETVARFQSPN--IRMCFITYST 119 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + L K ++ ++ + + N Sbjct: 120 DGQTVLPLTSDKNRIKNGLDQLQKIVPDGHTF--MQAGFRKAIQQIESFNSGNK----VP 173 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +I+ MTDGE L Q +L +A+K GA VY +G+ + SP + V Sbjct: 174 SMIIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTLGVADYNLDQITAIADSPGHVFAV 232 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 EN + A + +K Sbjct: 233 ENG---FKALRSTIDALTSKVCL 252 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 70/186 (37%), Gaps = 25/186 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ IV+D S S++ + K+ S+ + ++ V++ LV ++ ++++ Sbjct: 1527 DVAIVVDSSSSVKYN---NFQKMKSFTSSL------FQSFKLGDDRVRAALVRYNRRVDQ 1577 Query: 233 FFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDANYKK 289 F +Q + + G T + L++ N + G R + +K Sbjct: 1578 RFGFADTNTKEEVQDGVDAMPYRGGGTLTGQALQHVIDNTLKTENGAR-------DDAQK 1630 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLV 348 I++ +TDG + L + G I + IGI + P+ + V Sbjct: 1631 IVITITDG-----RSQDDVLTPSRALRAAGVITFGIGIGARVLETQINEIGGDPSRTFSV 1685 Query: 349 ENPHSM 354 E + Sbjct: 1686 ETFDQL 1691 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 67/181 (37%), Gaps = 21/181 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S N + LD+ +LD S S+ + ++ K+ + V + + Sbjct: 587 CSCPDSNPCENQLLDLFFILDSSSSVRA---ANFEKMKNFVD------RMVDPLNVGQDR 637 Query: 219 VQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V+ G++T++ K + F + K+ + G T + + +A + Sbjct: 638 VRVGVMTYNRKTFKRIDFNEAANNTDFSEKLAAIQYSGRGTKTAQAINFAASNCLHESRG 697 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF 335 R N ++ MTDG + ++ + ++ K + ++IGI + E Sbjct: 698 R------RPNVPLSVILMTDGRSQDWRQLPSAAATMHQ-KAN--MFFSIGITNRVNEREL 748 Query: 336 L 336 L Sbjct: 749 L 749 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 69/182 (37%), Gaps = 20/182 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D +D+ ++D S S+ + + ++ + + + L + Sbjct: 43 ECDNPIDLAFIVDSSSSVG------ADNFQLIKDFVYEIINGLNISETE---TRVTLARY 93 Query: 227 SNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN-QIFDMQGMRQHCNTE 283 +N ++ FF + S + + + +G T + L + + G R+ + Sbjct: 94 NNWVDSRFFFNTHFDKSAMLDAVSNMPYYGRGTKTAQALHVVTSHHLQKYNGWREGVDA- 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASP 342 Y +++ +TDG TK+ Q + K + V AIG+ + E + +P Sbjct: 153 ---YPTVVIVLTDGR---TKDFQDLGRFSRALKAKATRVIAIGVGDGAYADELVTIATAP 206 Query: 343 NS 344 +S Sbjct: 207 SS 208 >gi|66793453|ref|NP_001019751.1| vitrin [Gallus gallus] gi|56744182|dbj|BAD81032.1| Akhirin [Gallus gallus] Length = 748 Score = 75.6 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 46/278 (16%), Positives = 98/278 (35%), Gaps = 35/278 (12%) Query: 83 LCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC 142 R+ ++ + N+ +++ +D V N YSI+ S + + Sbjct: 483 ASRLARESGINIFFVTIEAAAQNEKQNVIEPNFVDKAVCRTNGFYSITVPSWFSL---HK 539 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P I + S +NS D+ V+D S S+ + ++ + Sbjct: 540 VVQPLVKRVCDIDRLVCSKTCLNSA-----DIGFVIDGSSSVG------TSNFRTVLQFV 588 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVS-HLQRKIKYLSKFGVSTNST 260 + +E ++ + G + ++ + EF ++ + IK ++ + T++ Sbjct: 589 ANISKEFEISDTD---TRIGAIQYTYEQRLEFSFDKYSTKQDVLSAIKRINYWSGGTSTG 645 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 + YA Q+F N +KI++ +TDG A + G Sbjct: 646 AAISYASEQLFT---------KSKPNKRKIMILITDGR-----SYDDVRMPALTAHQNGV 691 Query: 321 IVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYD 356 I Y+IG+ E P+ + V+ ++Y Sbjct: 692 IAYSIGVAWAAQDELEAIATDPDKEHSFFVDEFDNLYQ 729 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 58/316 (18%), Positives = 109/316 (34%), Gaps = 47/316 (14%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGN---GNNRKKLKGGDILCRIKNTWNMSFRN 96 + I+ + T + + + SL + I+NE G L D I++ + Sbjct: 82 LGGIWEITTSIQATLRGSLSSSVQTIVNEAGCILGQENTNLTNTDFSTPIRSAFAARI-- 139 Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 I V S+++ I + Q + Y+ FI + ++ + Sbjct: 140 ---------FIQQSVGSSAIPITLTQQAT----WWSTVYRPGANATKFIELVSQVENVTV 186 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 +LD+ V+D S S+ + +++ + ++ + + PD Sbjct: 187 ---------GCATHKLDLWFVIDGSGSVG------FSNFQDSLRFLASLTKRFTIGPDD- 230 Query: 217 NVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V+ G +S + I F S L+ +I S G T++ + N F Sbjct: 231 --VRVGFSVYSSTSTIHSHFNQHMNNSALEAEILGTSYTGGGTSTGRAINDVLNNGFV-- 286 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSH 333 R +I+V MTDG+ + K G V+ +GI I Sbjct: 287 -ERNGARPASEGVPRILVVMTDGQ-----SGDSVKTPSDNVKAAGITVFGVGIGSGIDIA 340 Query: 334 EFLRACASPNSFYLVE 349 E ++P+S Y E Sbjct: 341 EVNEIASNPDSRYAYE 356 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 77/216 (35%), Gaps = 46/216 (21%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+VLD+S SM + D S ++ ++A ++ ++ + + GLV F+ + Sbjct: 89 DLMLVLDLSSSMLAQ-DFSPSRFEVARRTAIQFVQGRRAD-------RIGLVVFAGQAFT 140 Query: 233 FFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L ++ L + T + A N++ K+ Sbjct: 141 QVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRL-----------KNSEARSKV 189 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------------ 332 I+ +TDG+N + + L A++ G +Y IG+ Sbjct: 190 IILLTDGQNN--RGEIDPLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQPVPV 247 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +R A + ++ + ++ ++ I + Sbjct: 248 EIDEAMMREVAEKTGGRYFRATDARTLEAIYAEIDR 283 >gi|291395333|ref|XP_002714013.1| PREDICTED: integrin, alpha 1 [Oryctolagus cuniculus] Length = 1200 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 106/295 (35%), Gaps = 35/295 (11%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 + +L N + ++ ++ + + +V G ++ Y + ++ Sbjct: 116 KLDLPVNTSIANVTEVKENMTFGSTLVSNPRGGFLACGPLYAYRCGHVHYTTGICSNVSP 175 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + +S + +LD++IVLD S S+ + + +N +LE + + P Sbjct: 176 TFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTHFLNDLLERMDIGP- 226 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIF 271 Q G+V + + F L S + + + + G T + G+ A + F Sbjct: 227 --KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIIQRGGRQTMTALGIDTARKEAF 284 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RV 329 KK++V +TDGE+ ++ + C ++I I Sbjct: 285 T------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSY 335 Query: 330 IRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 336 NRGNLSAEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 390 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 43/239 (17%), Positives = 83/239 (34%), Gaps = 46/239 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMES---FFDSSIT-KIDMAIKSINAMLEEV 209 + P + + +S++ D+++ +D+SRSME+ F D + ++ ++A + Sbjct: 69 LARPQAVTTERHSKSRG-YDIVLAVDLSRSMEAEDYFVDRKRSNRLQAVKPVLSAFINR- 126 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAY 267 + GL+ F+ + L + L R+ + L T L A Sbjct: 127 ------RENDRIGLIAFAGRAYTVAPLTFDHKWLARQTERLQIGLIEDGTAIGDSLAVAT 180 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG- 326 +++ + R IV +TDGEN + D + AK G VY I Sbjct: 181 SRLLEGAKERAGEREGA-----FIVLLTDGENTAGMMDP--MEGATLAKDAGIRVYTIAA 233 Query: 327 -----------------------IRVIRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHI 361 + + ++ + F+ EN ++ AF I Sbjct: 234 GKNGYVPFPRRNERGERIGTTQEFLRVDTETLMKIANETNGEFFRAENSDTIDQAFEKI 292 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 81/210 (38%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + AR D+ ++D S S+ D + KI + S L+ + Q +V Sbjct: 1260 DVCRAARADLAFLVDGSWSIG---DDNFLKITRFLYSAVGALDRIGP-----EGTQVAIV 1311 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ L+ L I +S G +T + +++ + +F +G Sbjct: 1312 QFSDEPRTEVQLKSYRKKERLLEAISSISYKGGNTKTGRAIQHMKDSVFTEEG------G 1365 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K++V +TDG + E +K+G +V+AIG E + + Sbjct: 1366 ARTAVPKVLVLLTDG-----RSQDDVSKVSKELQKQGFVVFAIGFADADYGELVNVASKP 1420 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + V++ DAF + +++V Sbjct: 1421 SSTHVFFVDD----LDAFKEMEEELVASVC 1446 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 78/214 (36%), Gaps = 28/214 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVT 225 QT+A DM+ ++D S S+ + + LE + DV+ + + GL Sbjct: 2 QTEAVADMVFLVDGSWSIG----------RTNFRLVRVFLESLVKAFDVDLDRTRIGLAQ 51 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS + + L + + + L G +T + L + F + + Sbjct: 52 FSGEPRIEWHLNTHTTKEAVMEAARNLPYKGGNTLTGLALTFILENSFSPE------SGS 105 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-- 341 KI V +TDG K + + G V+AIG++ E LRA AS Sbjct: 106 RPGIPKIGVLLTDG-----KSQDDVIPPAQRLRDAGVEVFAIGVKNADEGE-LRAIASVS 159 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 Y V + H M D + + I + K Sbjct: 160 EDTHVYNVADFHLMADIVDVLTRTICERMETLHK 193 >gi|293356227|ref|XP_574569.3| PREDICTED: rCG40584-like [Rattus norvegicus] Length = 1182 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 79/217 (36%), Gaps = 34/217 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ + K+ +K++ + L++ + Sbjct: 140 AAQQGCPRQDQDIVFLIDGSGSITY---TDFEKMLAFVKAVMSQLQQSST--------RF 188 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FS+ F +S R + ++++ ST + +K+ ++F Q Sbjct: 189 SLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTTQ----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIR-----VIRSH 333 + KI++ +TDG + D + A G I YAIG+ Sbjct: 244 -KGARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAIGVGQAFYQAQSRQ 299 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 E +SP + VEN +DA I + K Sbjct: 300 ELKDIASSPSREYVFSVEN----FDALKDIQNQLKEK 332 >gi|149559056|ref|XP_001512734.1| PREDICTED: similar to collagen, type XXI, alpha 1, partial [Ornithorhynchus anatinus] Length = 225 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 79/210 (37%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 41 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 91 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L + +L + ++ + G +T + +++A + +F Sbjct: 92 QYSDYPVLEIPLGSHDSLENLIQAMESIQYLGGNTRTGKAIQFALDHLF---------AK 142 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KI V +TDG K + A+ ++AIG+ LRA A+ Sbjct: 143 SQRFLTKIAVVLTDG-----KSQDEVKDAAEAARDSKITMFAIGVGSETEDAELRAIANK 197 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S+ V ++++ +++ Sbjct: 198 PSYTYVFYVEDYIAISK--IREVIKQKLCE 225 >gi|293344414|ref|XP_001080404.2| PREDICTED: rCG40584-like [Rattus norvegicus] gi|149067645|gb|EDM17197.1| rCG40584, isoform CRA_b [Rattus norvegicus] Length = 1182 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 79/217 (36%), Gaps = 34/217 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ + K+ +K++ + L++ + Sbjct: 140 AAQQGCPRQDQDIVFLIDGSGSITY---TDFEKMLAFVKAVMSQLQQSST--------RF 188 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FS+ F +S R + ++++ ST + +K+ ++F Q Sbjct: 189 SLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTTQ----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIR-----VIRSH 333 + KI++ +TDG + D + A G I YAIG+ Sbjct: 244 -KGARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAIGVGQAFYQAQSRQ 299 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 E +SP + VEN +DA I + K Sbjct: 300 ELKDIASSPSREYVFSVEN----FDALKDIQNQLKEK 332 >gi|149067644|gb|EDM17196.1| rCG40584, isoform CRA_a [Rattus norvegicus] Length = 1084 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 79/217 (36%), Gaps = 34/217 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ + K+ +K++ + L++ + Sbjct: 140 AAQQGCPRQDQDIVFLIDGSGSITY---TDFEKMLAFVKAVMSQLQQSST--------RF 188 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FS+ F +S R + ++++ ST + +K+ ++F Q Sbjct: 189 SLMQFSHTFRTHFTFNDFISTSSPLRLLDFVNQLRGSTRTASAIKHVITELFTTQ----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIR-----VIRSH 333 + KI++ +TDG + D + A G I YAIG+ Sbjct: 244 -KGARKDATKILIVITDGRKEGDRLDYGDVIPMAEAA---GIIRYAIGVGQAFYQAQSRQ 299 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 E +SP + VEN +DA I + K Sbjct: 300 ELKDIASSPSREYVFSVEN----FDALKDIQNQLKEK 332 >gi|73974565|ref|XP_532319.2| PREDICTED: similar to collagen, type XIV, alpha 1 [Canis familiaris] Length = 1796 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 84/210 (40%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A+ D++ ++D S S+ D + KI I + + + + I + Q +V Sbjct: 1024 EVCRAAKADLVFMVDGSWSIG---DENFNKI---INFLYSTVGALNKI--GADGTQVAMV 1075 Query: 225 TFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F++ F L + L IK++S G +T + +K+ + +F + + Sbjct: 1076 QFTDDPRTEFKLNAYNTKETLLDAIKHISYKGGNTKTGKAIKHVRDSLFTAESGTRRG-- 1133 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K+IV +TDG + E + G ++A+G+ E + + Sbjct: 1134 ----IPKVIVVITDG-----RSQDDVNKISGEMQSNGYSIFAVGVADADYSELVSIGSKP 1184 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + V++ +DAF I +++T Sbjct: 1185 SSRHVFFVDD----FDAFKKIEDELITFVC 1210 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 26/207 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKI 230 D++I++D S S+ F + + LE + +V + + GL +S + Sbjct: 158 DIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFNVGSEKTRIGLAQYSGDPR 207 Query: 231 EEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E+ L + + ++ L G +T + L Y + F + K Sbjct: 208 IEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKPE------AGARTGVSK 261 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYL 347 I + +TDG K + ++ G ++AIG++ +E + P+ Y Sbjct: 262 IGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADENELREIASEPDSTHVYN 316 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWYDK 374 V M+ + + + ++ D+ Sbjct: 317 VAEFDLMHTVVESLTRTVCSRVEEQDR 343 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 80/213 (37%), Gaps = 37/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM++ +D+S SM +TK +I + A+ + + + GLV F++ Sbjct: 84 DMLLAVDLSGSMS--IPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGDRLGLVLFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + +++++ L G ST GL A + + +++ Sbjct: 142 QTPLTFDRNTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 I+ ++DG N S D L AK+ G +Y +G+ + S + Sbjct: 191 IILLSDGANTSGVIDP--LEAAKLAKESGVKIYTVGVGADQMVQKGFFGDRLVNPSQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ NP + + I K Sbjct: 249 EKTLTEIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|198435715|ref|XP_002125840.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 3908 Score = 75.2 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 89/214 (41%), Gaps = 28/214 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + A D++ ++D S S+ A + A++E ++ D + + +V Sbjct: 3418 ACKTAVTDVIFIVDGSWSVGEI------NFRKAKDFLKALVEPFEVGWDNS---RFAVVQ 3468 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNT 282 +S+ FL+ + V+ + I + G +TN+ L ++ Y + G R + N Sbjct: 3469 YSDDPRTEFLMNEHFTVTDVLNAIDAIPYKGGNTNTGKALAFSLYTALSPANGARPYVN- 3527 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+ + +TDG + + E ++ G V +G+ +E L++ ASP Sbjct: 3528 ------KVALVLTDGRSQDEVGN-----PARELRQAGVKVLTVGVGDADKNE-LKSIASP 3575 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +S Y V + S+ + +H+ + + D Sbjct: 3576 PYDSSVYHVSDYDSISEIKAHLAAKLCEGEVLRD 3609 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 78/226 (34%), Gaps = 32/226 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARL---------DMMIVLDVSRSMESFFDSSI 192 P S + TS + ++QT + D++++ D S S+ + Sbjct: 360 FDVSPSVGGSSAMGAGRTSDIDSDTQTGRFVQDCKKAQKTDLVVLTDGSWSVG---PQNF 416 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYL 250 KI + S+ V N V G +S+ F L V+ L R I + Sbjct: 417 KKIQAFLVSL------VDAFSIGFNNVLMGYAQYSDDARTEFNLNEHVTKDDLIRAINQV 470 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G +T + L Y +F +G + K + +TDGE Sbjct: 471 QYKGGNTATGGALDYIRTNLFTSEGGTRRG------VLKTAIVITDGE----SILDDVTE 520 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSM 354 K+ G V++IG+ E +SP + + V+N + Sbjct: 521 PARMLKEIGVEVFSIGVAAALRSELEDIASSPASDHVFSVDNFDDI 566 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 54/313 (17%), Positives = 110/313 (35%), Gaps = 32/313 (10%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM------SFRNELRDNGFVNDI 107 ID++L A ++ NG K +L T + NELR G + Sbjct: 89 IDKALRLAQKEVFELENGARPGVAKILFLLTDGSQTQERGSENPVAIANELRSAGVTIIV 148 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 I + ++ + + A ++ LK F+ I + + KV Sbjct: 149 IGITNAVNVSELSDIAGGEENAYATESFE-KLKDVNFL------DVIKTKMCETAKVEPT 201 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +A +D++ +LD S S+ ++ + + I + ++ ++TFS Sbjct: 202 CEAIVDIVFLLDSSGSLRKYYQNEKDFLKSVISAFGVSFNG----------TRAAVITFS 251 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L ++ + + + G +T L+ A ++F+++ N Sbjct: 252 YHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELE------NGARP 305 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNS 344 KI+ +TDG + + + NE + G + IGI + E + Sbjct: 306 GVAKILFLLTDGSQTQERGSENPVAIANELRSAGVTIIVIGITNAVNVSELSDIAGGEEN 365 Query: 345 FYLVENPHSMYDA 357 Y E+ + D Sbjct: 366 AYAAESFEKLKDV 378 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 55/313 (17%), Positives = 113/313 (36%), Gaps = 32/313 (10%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM------SFRNELRDNGFVNDI 107 ID++L A ++ NG K +L T + NELR G + Sbjct: 285 IDKALRLAQKEVFELENGARPGVAKILFLLTDGSQTQERGSENPVAIANELRSAGVTIIV 344 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 I + ++ + + A ++ LK F+ I + + Sbjct: 345 IGITNAVNVSELSDIAGGEENAYAAESFE-KLKDVNFL------DVIKTKMCETATFQPT 397 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +A +D++ +LD S S+ ++ + + D +I+A V N ++ ++TFS Sbjct: 398 CEAVVDIVFLLDSSGSLRKYYQN---EKDFLKSAISAFGVSV-------NGTRAAVITFS 447 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L ++ + + + G +T L+ A ++F+++ N Sbjct: 448 YHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELE------NGARP 501 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNS 344 KI+ +TDG + + + NE + G + IGI + E + Sbjct: 502 GVAKILFLLTDGSQTQERGSENPVAIANELRSAGVTIIVIGITNAVDVSELFDIAGGEEN 561 Query: 345 FYLVENPHSMYDA 357 Y ++ + D Sbjct: 562 AYFADSFEKLKDV 574 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 73/198 (36%), Gaps = 19/198 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 KV +A +D++ +LD S S+ ++ + + I + ++ Sbjct: 1 KVEPTCEAIVDIVFLLDSSGSLRKYYQNEKDFLKSVISAFGVSFNG----------TRAA 50 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++TFS + L ++ + + + G +T L+ A ++F+++ Sbjct: 51 VITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELE------ 104 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRAC 339 N KI+ +TDG + + + NE + G + IGI + E Sbjct: 105 NGARPGVAKILFLLTDGSQTQERGSENPVAIANELRSAGVTIIVIGITNAVNVSELSDIA 164 Query: 340 ASPNSFYLVENPHSMYDA 357 + Y E+ + D Sbjct: 165 GGEENAYATESFEKLKDV 182 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/254 (14%), Positives = 84/254 (33%), Gaps = 36/254 (14%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 LD ++ +N SI+ K W+ + + + L Sbjct: 194 VELDTILDEENSTVSINTPHFSKY--MLVDKKEWFDAWKRASLYFQDEYE-------PLA 244 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I D S SM K+ +I+ ++ + L ++ L++F ++ + Sbjct: 245 AAICYDCSGSMSGNDPKGYRKL-----AIDNFIDSMTLTD------KTALISFEDEAKLV 293 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ + FG TN ++ A Q+ +Q Y + I+ Sbjct: 294 SEFSDNKEELKGLVN--PYFGGGTNVRASVEMAIEQLNTVQHW----------YTRHIIL 341 Query: 294 MTDGENLSTK--EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVE 349 ++DG+ + +A ++ IG+ ++ L+ CA + ++ E Sbjct: 342 LSDGDVNININLANNTVDDLIKKAVDNNIKIHTIGLGSGADNQKLKDCAEYTGGQYFTAE 401 Query: 350 NPHSMYDAFSHIGK 363 + + + K Sbjct: 402 TAEKLDAIYKDLSK 415 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 70/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + ++ A ++ +++ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVAPNRLTAAQEAAKQFADQLTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L + T + G+ A I + + D +V Sbjct: 152 VSPTTNRESTKTAIDKL-QLADRTATGEGIFTALQAIATVGAV---IGGGDEPPPARVVL 207 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFL 336 M+DG+ + AK +G + + + E L Sbjct: 208 MSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEML 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + + + F+++ + I Sbjct: 268 KKIADLSGGEAFTASSLEQLKQVFTNLQEQI 298 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 83/217 (38%), Gaps = 28/217 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 151 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 199 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F S L Q + + + T + G++ ++F N Sbjct: 200 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVKELF------HSKNGAR 253 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHEFLR 337 + KKI++ +TDG+ + + EA+K G I YAIG+ ++ + Sbjct: 254 KSAKKILIVITDGQKFRDP--LEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIG 311 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + S + + V N ++ I + I + Y + Sbjct: 312 SAPSQDHVFKVGNFVALRSIQRQIQEKIFAIEVLYKQ 348 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/217 (18%), Positives = 83/217 (38%), Gaps = 28/217 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 146 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 194 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F S L Q + + + T + G++ ++F N Sbjct: 195 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVKELF------HSKNGAR 248 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHEFLR 337 + KKI++ +TDG+ + + EA+K G I YAIG+ ++ + Sbjct: 249 KSAKKILIVITDGQKFRDP--LEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIG 306 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + S + + V N ++ I + I + Y + Sbjct: 307 SAPSQDHVFKVGNFVALRSIQRQIQEKIFAIEVLYKQ 343 >gi|226423922|ref|NP_851794.3| collagen alpha-1(XIV) chain [Mus musculus] gi|148697325|gb|EDL29272.1| procollagen, type XIV, alpha 1, isoform CRA_a [Mus musculus] gi|219519338|gb|AAI45245.1| Collagen, type XIV, alpha 1 [Mus musculus] gi|223459902|gb|AAI38346.1| Collagen, type XIV, alpha 1 [Mus musculus] Length = 1794 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P + P T A+ D++ ++D S S Sbjct: 981 YKISVYTKLQELEGPSVSIMQKTQSLPTEPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1040 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1041 IG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKE 1092 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L I+++S G +T + +K+ + +F + K+IV +TDG Sbjct: 1093 TLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVVITDG---- 1142 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++AIG+ E ++ + S + V++ +DAF Sbjct: 1143 -RSQDDVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFK 1197 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1198 KIEDELITFVC 1208 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 94/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K + N P VK +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPKVKVVDKGNGSKPTSP--EEVKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|30420885|gb|AAO64442.1| collagen type XIV [Mus musculus] Length = 1797 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P + P T A+ D++ ++D S S Sbjct: 984 YKISVYTKLQELEGPSVSIMQKTQSLPTEPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1043 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1044 IG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKE 1095 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L I+++S G +T + +K+ + +F + K+IV +TDG Sbjct: 1096 TLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVVITDG---- 1145 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++AIG+ E ++ + S + V++ +DAF Sbjct: 1146 -RSQDDVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFK 1200 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1201 KIEDELITFVC 1211 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 94/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K + N P VK +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPKVKVVDKGNGSKPTSP--EEVKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|146345398|sp|Q80X19|COEA1_MOUSE RecName: Full=Collagen alpha-1(XIV) chain; Flags: Precursor Length = 1797 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P + P T A+ D++ ++D S S Sbjct: 984 YKISVYTKLQELEGPSVSIMQKTQSLPTEPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1043 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1044 IG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKE 1095 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L I+++S G +T + +K+ + +F + K+IV +TDG Sbjct: 1096 TLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVVITDG---- 1145 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++AIG+ E ++ + S + V++ +DAF Sbjct: 1146 -RSQDDVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFK 1200 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1201 KIEDELITFVC 1211 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 94/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K + N P VK +T A D++I Sbjct: 105 DKESKPAQGQFRIKDLEKRKDPTKPKVKVVDKGNGSKPTSP--EEVKFFCETPAIADIVI 162 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 163 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 212 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 213 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 266 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 267 ITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 321 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 322 DLMHTVVESLTRTVCSRVEEQDK 344 >gi|126322632|ref|XP_001381040.1| PREDICTED: similar to collagen, type XIV, alpha 1 [Monodelphis domestica] Length = 1892 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 45/210 (21%), Positives = 85/210 (40%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A+ D++ ++D S S+ D + KI I + + + I + Q +V Sbjct: 1171 EVCKAAKADLVFMVDGSWSIG---DDNFNKI---INFLYNTVGALDKI--GVDGTQVSIV 1222 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F++ F+L + L IK LS G +T + LK+ + +F +G + Sbjct: 1223 QFTDDPRTEFMLNTYKTKDTLLEGIKNLSYKGGNTKTGKALKHVRDALFTAEGGTRRG-- 1280 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+IV +TDG + E + G ++AIG+ E + + P Sbjct: 1281 ----IPKVIVVITDG-----RSQDDVNKISRELQLEGISIFAIGVADADYAELVSIGSQP 1331 Query: 343 N--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + V++ +DAF I +++T Sbjct: 1332 SARHVFFVDD----FDAFKKIEDELITFVC 1357 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 71/201 (35%), Gaps = 26/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFSNKIE 231 D++I++D S S+ F + + LE + +V N + GL +S Sbjct: 305 DIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFNVGANKTRIGLAQYSGDPR 354 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L + + ++ L G +T + L Y + F + K Sbjct: 355 IEWHLNTFSTKDAVIDAVRNLPYKGGNTLTGLALNYIFENSFKPE------AGARTGVSK 408 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYL 347 I + +TDG K + + G ++AIG++ E + P+ Y Sbjct: 409 IGILITDG-----KSQDDIIPASKTLRDSGVELFAIGVKNADVAELQEIASEPDSTHVYN 463 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 V M+ + K + T+ Sbjct: 464 VAEFDLMHTVVEGLTKTVCTR 484 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 43/205 (20%), Positives = 80/205 (39%), Gaps = 35/205 (17%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIP 213 ++S+ K+ QT LD+++VLDVS SM+ S + +ID ++N+ ++ + Sbjct: 106 SALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDALKAAVNSFIDGSAKVN 165 Query: 214 ----DVNNVVQSGLVTFSNKI------EEFFLLEW-----------------GVSHLQRK 246 DVN + +V F+ +++ + S + Sbjct: 166 DQRADVNKQNRIAVVKFAGNKTDKIGNDQYSQNRYWYNYTQVVSGYKAYTSGNKSEWETT 225 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK--E 304 + L G + + + D + N + N K++++F TDGE + Sbjct: 226 VNALKPAGCTAADY-AMDLT-KTLVDQSKTDANNNADRKNVKRVVIFFTDGEPNHQSGFD 283 Query: 305 DQQSLYYCNEAK--KRGAIVYAIGI 327 D + AK K A +Y IGI Sbjct: 284 DDVANDAITSAKTIKTDADIYTIGI 308 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/268 (18%), Positives = 83/268 (30%), Gaps = 39/268 (14%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P P + +P + + ++ + A D M L S + + + Sbjct: 253 PYDVQDTAPSQATPSTLFVPFFAPDESDNDSRAVNDYMADLPSGGSAGGASNRQLQGMTA 312 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 + N + +I+ L + L I ++ G T Sbjct: 313 KYDKNAFKVSTTARQDGTNYLFGPNA---GCEIQPLTRLTTSQTQLTNAIAAMTVIG-DT 368 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE------------- 304 N GL + ++ + + + KK IV MTDG+N S Sbjct: 369 NIPIGLAWGWHLLSPNGPFKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGF 428 Query: 305 --------------------DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACA-SP 342 D + C+ +K V+A+ + V L+ACA SP Sbjct: 429 IWQNRIGTTSNDNAVRTKAIDTRLTLLCDNIRKARIQVFAVRVEVNDGDSAVLKACATSP 488 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 N F+ V+N + F I I RI Sbjct: 489 NMFFDVKNSSGLPAVFRAIADQISELRI 516 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 84/236 (35%), Gaps = 38/236 (16%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 SL + F + G + ++ A+ L + + GM I+ + ++ +D + + A Sbjct: 7 SLTARLARFRHTESGSVLVIFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATALALA 66 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP 122 ++ + + +L +K + +R NG + ++ +T + + Sbjct: 67 -KLPLDTTDKDLAAKAEAQVLTALKGLPIDALTVTMRHNGDLIEV-AAKGATPTSLTRLA 124 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 +S + +++++ L++ +VLD + Sbjct: 125 GFMSMPLSVSA------------------------------ISNRSMTNLEIALVLDNTG 154 Query: 183 SMESFFDSSITKIDMAIKSINA-MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 SM+ + +T + A + + + ++ N ++ G+V FS + Sbjct: 155 SMK---GTKLTNLKAAARDLVTSLFQQADPAKP--NALKIGVVPFSMTVNVGSGFA 205 >gi|297675239|ref|XP_002815595.1| PREDICTED: integrin alpha-1-like [Pongo abelii] Length = 1179 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 107/296 (36%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C + ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---EENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|148697327|gb|EDL29274.1| procollagen, type XIV, alpha 1, isoform CRA_c [Mus musculus] Length = 1802 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 47/251 (18%), Positives = 94/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P + P T A+ D++ ++D S S Sbjct: 989 YKISVYTKLQELEGPSVSIMQKTQSLPTEPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1048 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI I + + + + I + Q +V F++ F L + Sbjct: 1049 IG---DDNFNKI---INFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKE 1100 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L I+++S G +T + +K+ + +F + K+IV +TDG Sbjct: 1101 TLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVVITDG---- 1150 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + E + G ++AIG+ E ++ + S + V++ +DAF Sbjct: 1151 -RSQDDVNKISREMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFK 1205 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1206 KIEDELITFVC 1216 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 94/263 (35%), Gaps = 28/263 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D P + I + + K P K + N P VK +T A D++I Sbjct: 110 DKESKPAQGQFRIKDLEKRKDPTKPKVKVVDKGNGSKPTSP--EEVKFFCETPAIADIVI 167 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKIEEFF 234 ++D S S+ F + + LE + +V + + GL +S + E+ Sbjct: 168 LVDGSWSIGRFN----------FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWH 217 Query: 235 LLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ L G +T + L + + F + + KI + Sbjct: 218 LNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIFENSFKPE------AGSRSGVSKIGIL 271 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENP 351 +TDG K + ++ G ++AIG++ E + P+ Y V Sbjct: 272 ITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADLSELQEIASEPDSTHVYNVAEF 326 Query: 352 HSMYDAFSHIGKDIVTKRIWYDK 374 M+ + + + ++ DK Sbjct: 327 DLMHTVVESLTRTVCSRVEEQDK 349 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/257 (14%), Positives = 83/257 (32%), Gaps = 32/257 (12%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 S++ ++ + S I M + S+ A +M+V+DVS+SM + Sbjct: 55 SVAPKQPSRMRHLPAVLMILSLVSFTIAMAGPTHDVRISRNRAV--VMLVIDVSQSMRA- 111 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 D + ++ A ++ +++ + GL+ ++ + I Sbjct: 112 TDVAPNRLTAAQEAAKQFADQLTPGIN------LGLIAYAGTATVLVSPTTNRESTKTAI 165 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L + T + G+ A I + + D +V M+DG+ Sbjct: 166 DKL-QLADRTATGEGIFTALQAIATVGAV---IGGGDEPPPARVVLMSDGKETVPSNPDN 221 Query: 308 ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFLRACA--SPNSFYLV 348 + AK +G + + + E L+ A S + Sbjct: 222 PKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPVDDEMLKKIADLSGGEAFTA 281 Query: 349 ENPHSMYDAFSHIGKDI 365 + + F+++ + I Sbjct: 282 SSLEQLKQVFTNLQEQI 298 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 68/454 (14%), Positives = 132/454 (29%), Gaps = 98/454 (21%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 F + +G + L A I+ +V G ++ + + + +D + + A Q+ Sbjct: 10 FHKFGSDERGSLLPLVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAALTVAKQLST 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 N+ + + N ++ DN +D + + V Q Sbjct: 70 TVLTENQIRTGLKNAFR--ANLNAAGINSQGIDN-LDFKVDPGEGTLDVWSSVDIQTNFI 126 Query: 128 SISAISRYKIPLKFCTFI--PWYTNSRHIVMPITSSVKVNSQTDARLDMMIV----LDVS 181 + I K+ + + + + +V+ +T S++ + IV D S Sbjct: 127 KLGGIGPEKLEVGAASQVNYSRFDVELALVLDVTGSMRPDMNALKEASKSIVNILLPDDS 186 Query: 182 RSMES-------------FFDSSITKI----DMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S ES S T++ +N K N S Sbjct: 187 NSRESKVRISLVPYSQGVNLGSYATRVTNGGSTWRNCVNERSGPQKFTDAPYNYAGSRSD 246 Query: 225 TFSNKIEEFF----------------------LLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 F K ++F L + L R I L G + T Sbjct: 247 FFHGKPKQFVWDYGWTEQWQTRPEACPKTAVEPLTADRTKLLRAISGLKDGGGTGGQTGI 306 Query: 263 ------LKYAYNQIF--DMQGMRQHCNTEDANYKKIIVFMTDGENL-------------- 300 L + ++ D + + KK + MTDG+ Sbjct: 307 AWGWYTLSPKWKNLWPRDSAPATYGTGSHTDDTKKFALIMTDGDFNAAYGWDCGCRKIRD 366 Query: 301 ------------------------STKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSH 333 S Q++ C+E K + ++ + Sbjct: 367 KPLYCRKKSNKKSWIERYFSPSKISHAPAQRAKKLCDEMKSKNIEIFTVYFDTGGATFGD 426 Query: 334 EFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIV 366 + + CAS ++Y +N + + AFS+I +I Sbjct: 427 DLMSYCASGSRNYYRADNSNELIQAFSNIANEIQ 460 >gi|148689378|gb|EDL21325.1| procollagen, type VII, alpha 1 [Mus musculus] Length = 2944 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 76/197 (38%), Gaps = 23/197 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ + Sbjct: 39 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S R I+ LS G +T + L + +++F + R K+ Sbjct: 92 EFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFLPRLTRPG-------VPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K + K +G ++A+GI+ E R + + + F+ V Sbjct: 145 CILITDG-----KSQDLVDTAAQKLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 199 Query: 349 ENPHSMYDAFSHIGKDI 365 + + I + + Sbjct: 200 NDFSILRTLLPLISRRV 216 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 75/224 (33%), Gaps = 33/224 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + + +D++ +L +R ++ ++ + ++ + + Sbjct: 1042 SVTQTPACSHGPVDVVFLLHATRDNAHNAEAVRRVLERLVSALGPLGPQA---------A 1092 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q GL+T+S++ F L SH R I Y+ G N + A+ + Sbjct: 1093 QVGLLTYSHRPSPLFPL--NSSHDLGIILRKIRDIPYVDPSGN--NLGTAVTTAHRYLLA 1148 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++V + D L EA+ G V A+ + Sbjct: 1149 SNAPGR-----RQQVPGVMVLLVD-----EPLRGDILSPIREAQTSGLKVMALSLVGADP 1198 Query: 333 HEFLRACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + R + +F+ V+N + A S + + + + Sbjct: 1199 EQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQAAVTIE 1242 >gi|115647999|ref|NP_031764.2| collagen alpha-1(VII) chain precursor [Mus musculus] gi|143955303|sp|Q63870|CO7A1_MOUSE RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor Length = 2944 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 76/197 (38%), Gaps = 23/197 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ + Sbjct: 39 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S R I+ LS G +T + L + +++F + R K+ Sbjct: 92 EFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFLPRLTRPG-------VPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K + K +G ++A+GI+ E R + + + F+ V Sbjct: 145 CILITDG-----KSQDLVDTAAQKLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 199 Query: 349 ENPHSMYDAFSHIGKDI 365 + + I + + Sbjct: 200 NDFSILRTLLPLISRRV 216 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 75/224 (33%), Gaps = 33/224 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + + +D++ +L +R ++ ++ + ++ + + Sbjct: 1042 SVTQTPACSHGPVDVVFLLHATRDNAHNAEAVRRVLERLVSALGPLGPQA---------A 1092 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q GL+T+S++ F L SH R I Y+ G N + A+ + Sbjct: 1093 QVGLLTYSHRPSPLFPL--NSSHDLGIILRKIRDIPYVDPSGN--NLGTAVTTAHRYLLA 1148 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++V + D L EA+ G V A+ + Sbjct: 1149 SNAPGR-----RQQVPGVMVLLVD-----EPLRGDILSPIREAQTSGLKVMALSLVGADP 1198 Query: 333 HEFLRACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + R + +F+ V+N + A S + + + + Sbjct: 1199 EQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQAAVTIE 1242 >gi|2326168|gb|AAB66593.1| type VII collagen [Mus musculus] Length = 2944 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 76/197 (38%), Gaps = 23/197 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ + Sbjct: 39 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S R I+ LS G +T + L + +++F + R K+ Sbjct: 92 EFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFLPRLTRPG-------VPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K + K +G ++A+GI+ E R + + + F+ V Sbjct: 145 CILITDG-----KSQDLVDTAAQKLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 199 Query: 349 ENPHSMYDAFSHIGKDI 365 + + I + + Sbjct: 200 NDFSILRTLLPLISRRV 216 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 75/224 (33%), Gaps = 33/224 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + + +D++ +L +R ++ ++ + ++ + + Sbjct: 1042 SVTQTPACSHGPVDVVFLLHATRDNAHNAEAVRRVLERLVSALGPLGPQA---------A 1092 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q GL+T+S++ F L SH R I Y+ G N + A+ + Sbjct: 1093 QVGLLTYSHRPSPLFPL--NSSHDLGIILRKIRDIPYVDPSGN--NLGTAVTTAHRYLLA 1148 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++V + D L EA+ G V A+ + Sbjct: 1149 SNAPGR-----RQQVPGVMVLLVD-----EPLRGDILSPIREAQTSGLKVMALSLVGADP 1198 Query: 333 HEFLRACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + R + +F+ V+N + A S + + + + Sbjct: 1199 EQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQAAVTIE 1242 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 73/210 (34%), Gaps = 31/210 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++++D S SM KID + NA + + ++ + TF + + + Sbjct: 650 QVILLIDNSGSMNDEVAPGAAKIDRVQSAANAAIGLLAPKDELA------VWTFGSSVHK 703 Query: 233 F--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +S ++ +I + G +T ++ A++ + N D K Sbjct: 704 TALAPMGNRISQVRAEIGAIEAGGTTTQLPAAVQAAHDAL-------AQTNDPDNPKTKA 756 Query: 291 IVFMTDGENLSTK--EDQQSLYYCNEA-------KKRGAIVYAIGIRVIRSHEFLR--AC 339 +V +TDG T D++ N+A + +Y I L A Sbjct: 757 VVLLTDGATNLTPDGADEEENKAANDALVADIRGSESHVRIYTIPYGNSADKCLLEKVAA 816 Query: 340 ASPNSFYLVENPHSM-----YDAFSHIGKD 364 AS +Y S+ F + G Sbjct: 817 ASGARYYGAGARESLINDVMLAVFGNFGTQ 846 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 81/235 (34%), Gaps = 49/235 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T V + + +D+++ +DVS SM + D +++ ++ Sbjct: 77 PRTVDVSTRTNSTQGIDIVMAIDVSASMLA-RDLQPNRLEATKAVGEEFIKGRPSD---- 131 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDM 273 + GLV +S + + S + R ++ + + T GL + N++ D Sbjct: 132 ---RIGLVLYSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAIGSGLATSVNRLKDS 188 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----- 328 + K+I+ +TDG N S D + AK+ G VY IG+ Sbjct: 189 KAES-----------KVIILLTDGVNNSGFIDPKVAS--ELAKEFGIKVYTIGVGTNGMA 235 Query: 329 ------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + ++ N + D + I + Sbjct: 236 LTPVGIAANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDEIDQ 290 >gi|194390782|dbj|BAG62150.1| unnamed protein product [Homo sapiens] Length = 1173 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 79/213 (37%), Gaps = 37/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM++ +D+S SM +TK ++ + A+ + + + GLV F++ Sbjct: 84 DMLLAVDLSGSMS--IPDMVTKNGQSVDRLTAVKHVLSDFIEKRKGDRLGLVLFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + +++++ L G ST GL A + + +++ Sbjct: 142 QTPLTFDRKTVEKQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 I+ ++DG N S D L AK+ G +Y +G+ + S + Sbjct: 191 IILLSDGANTSGVIDP--LEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ NP + + I K Sbjct: 249 EKTLTDIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 74/211 (35%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+MI D+S SME + I+ + + + + + V PD + G+V F+ K Sbjct: 90 IDIMIAFDLSYSMETPDMVLNRMPINRLVAAKHVITQFVDSRPDD----RIGIVGFAGKT 145 Query: 231 EEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + F L + + I+ T + A ++ D + + Sbjct: 146 KSFCPLTLDHALVNSIIRDFHPRMIQADGTAIGSAIAAAATRLDDRKETKS--------- 196 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------------HE 334 KII+ +TDG S L A K G +Y I + Sbjct: 197 -KIIILVTDG--ASNSGQISPLVAAENAAKLGIKIYTIAVGTEEGTLANGMVVQSEFDEP 253 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + + N S AF+ IGK Sbjct: 254 TLRKIAQLTGGEHFRATNMASFNKAFTSIGK 284 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDSLKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 63/170 (37%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 +D++ VLD S S+ + ++A + +++ + V +G+ Sbjct: 591 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSSAGV 644 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + L+ IK L T + L+Y +Q+ Sbjct: 645 VQYSHSQMQEHVSLRSPGIRNMQELKEAIKSLQWMAGGTFTGEALQYTRDQLL------- 697 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + +I + +TDG + + ++ C G V ++GI+ Sbjct: 698 ----PPSPHSRIALVITDGRSDTQRDTTPLNVLCGP----GIQVVSVGIK 739 >gi|119575263|gb|EAW54868.1| hCG2002731, isoform CRA_e [Homo sapiens] Length = 1177 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|114600333|ref|XP_517769.2| PREDICTED: integrin alpha-1 [Pan troglodytes] Length = 1179 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 79/213 (37%), Gaps = 37/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM++ +D+S SM +TK +I + A+ + + + GLV F++ Sbjct: 84 DMLLAVDLSGSMS--IPDMVTKNGQSIDRLTAVKHVLSDFIEKRKGDRLGLVLFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + +++++ L G ST GL A + + +++ Sbjct: 142 QTPLTFDRKTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFINSKA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 I+ ++DG N S D L AK+ G +Y +G+ + S + Sbjct: 191 IILLSDGANTSGVIDP--LEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ NP + + I K Sbjct: 249 EKTLTEIAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|31657142|ref|NP_852478.1| integrin alpha-1 precursor [Homo sapiens] gi|124056463|sp|P56199|ITA1_HUMAN RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|187951605|gb|AAI37122.1| Integrin, alpha 1 [Homo sapiens] gi|187957526|gb|AAI37123.1| Integrin, alpha 1 [Homo sapiens] Length = 1179 Score = 74.8 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 106/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 153 PTFQVVNSIAPVQECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ ++ + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGS 312 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 368 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 74.4 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 70/213 (32%), Gaps = 42/213 (19%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM S+ + + ++ + + + + K + GL+ FS Sbjct: 92 DLMLAVDLSGSMSRQDLSYDNIPVDRLTIIKGVADDFIAKRKGD-------RIGLILFST 144 Query: 229 KIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + ++ + + G T + A + Sbjct: 145 RAYVQAPLTFDRNVVRDLLRTSSIGMTGQETAIGDAIALAVKTLRT-----------RPQ 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEF-------- 335 ++++V +TDG N + AK G +Y IG+ Sbjct: 194 EQRVLVLLTDGANN--SGMLSPIPAAEIAKANGVKIYTIGVGADAFAVGQRMVNPSFDLD 251 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ + + ++ I + Sbjct: 252 EGALEQIAQMTGGRYFRARDAAGLAAIYNDIDR 284 >gi|73954248|ref|XP_546328.2| PREDICTED: similar to integrin, alpha 1 precursor [Canis familiaris] Length = 1182 Score = 74.4 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 105/296 (35%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ ++ + + +V G ++ Y + + Sbjct: 96 IKLDLPVNTSIPNVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 155 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + + +N +LE + + P Sbjct: 156 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLERMDIGP 207 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L S + + + + G T + G+ A + Sbjct: 208 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANQIIQRGGRQTMTALGIDTARKEA 264 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--R 328 F KK++V +TDGE+ + + C ++I I Sbjct: 265 FT------EARGARRGVKKVMVIVTDGESHDNHLLNKVIQDCE---DENIQRFSIAILGS 315 Query: 329 VIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 316 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVEALGERIFALEATADQ 371 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D + D+M ++D S S+ + + K+ + +K++ + I + Q G+V Sbjct: 273 KGCEDMKADIMFLVDSSWSIG---NENFRKMKIFMKNL------LTKIQIGADKTQIGVV 323 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+K +E F L + + I +S T + L + +G R Sbjct: 324 QFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGA-- 381 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KK ++ +TDG + + ++++G+ + Sbjct: 382 -----KKFLILITDG-----VAQDDVRDPARILRGKDVTIFSVGVYNANRSQLEEISGDS 431 Query: 343 NSFYLVENPHSM 354 + + VEN + Sbjct: 432 SLVFHVENFDHL 443 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 50/293 (17%), Positives = 105/293 (35%), Gaps = 25/293 (8%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + +F + + IDR + + + + + + + + Sbjct: 335 LNRYFTQQEISDAIDRMSLINEGTLTGKALNFVGQYFTHSKGARLGAKKFLILITDGVAQ 394 Query: 101 NGFVNDIDDIVRSTSLDIVVVP---QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + V D I+R + I V N + F R ++ Sbjct: 395 DD-VRDPARILRGKDVTIFSVGVYNANRSQLEEISGDSSLVFHVENFDHLKALERKLIFR 453 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + T LD++ VLD S S++ + + IN + VK + Sbjct: 454 VCALHDCKRITL--LDVVFVLDHSGSIKKQYQDHM---------INLTIHLVKKADVGRD 502 Query: 218 VVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 VQ G + +S++ F L S + ++ G +T + LK+A N +F Sbjct: 503 RVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHA-NALFT--- 558 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +H + N K++++ +TDGE+ + + E + +G ++A+G+ Sbjct: 559 -EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKGITIFAVGVG 607 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 71/203 (34%), Gaps = 22/203 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 95 DIHFLIDGSSSIQEKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 145 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 146 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 199 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + + + V+A+GI E +N Sbjct: 200 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIELQEIAGKEERVSFGQN 254 Query: 351 PHSMYDAFSHIGKDIVTKRIWYD 373 ++ + + ++I ++ D Sbjct: 255 FDALKSIKNEVVREICAEKGCED 277 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 79/212 (37%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +++V+D S SM +I K+++A + M++ ++ G++ F Sbjct: 401 EKERNVAVVLVIDHSGSMGGSNLRNINKLEIAKSAAAKMIDHLESSDS------VGVIAF 454 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I + + G T P L A N + + Sbjct: 455 DHNFYWASKFGKLKSKNEVIENISTI-QVGGGTAIIPPLTEAVNLL-----------KKS 502 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 K+IV +TD E+ Y + AK+ + IG+ + L A + Sbjct: 503 KAKDKVIVLLTD----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FY V++ ++ D F K I K I K Sbjct: 559 GRFYYVKDASNLIDVFLKEAKIIKGKYIKEKK 590 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 82/210 (39%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + ++ ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLSEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ L++ G ST+ + A ++F+ Sbjct: 160 ---DAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDKSSGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMET 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|73998866|ref|XP_535021.2| PREDICTED: similar to A-domain containing protein similar to matrilin and collagen [Canis familiaris] Length = 788 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 75/198 (37%), Gaps = 26/198 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ Q A +D++ ++D S S+ + + + + + + P V Sbjct: 56 SAASKMMQCSAAVDILFLIDGSHSVGK------GSFERSKHFAIMVCDALDINP---ERV 106 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + FS+ F L+ S ++ KIK + G T + LKY R Sbjct: 107 RVGAIQFSSAPHLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLR--------R 158 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +A+ +I++ +TDG + K+RG ++A+G+R R E R Sbjct: 159 GFPGGRNASVPQILIVITDGR-----SQGHVAVPTKQLKERGVTIFAVGVRFPRWEELHR 213 Query: 338 ACASP--NSFYLVENPHS 353 + P + E Sbjct: 214 LASEPREQHVLMAEQVED 231 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 69/192 (35%), Gaps = 22/192 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ + + + + ++ PD V Q GLV Sbjct: 542 GCRTQSLDLLFMLDASASVGP------ENFAQMQSFVRSCMLQFEVNPD---VTQVGLVV 592 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ ++ + R + G + ++ L + Y+++ +Q Sbjct: 593 YGSQVQTAFGLDAHLTRAAVLRAMSQAPYLGGAGSAGTALLHVYDKVMTVQ------RGA 646 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V +G+ + R + Sbjct: 647 RPGVPKAVVVLTGGMGT-----EDAAVPVQKLRNNGISVLVVGVGPVLREALRRLAGPRD 701 Query: 344 SFYLVENPHSMY 355 + V + Sbjct: 702 ALIHVATYADLR 713 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 74/211 (35%), Gaps = 21/211 (9%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +D+ +LD S S+ ++ + ++ + + Sbjct: 806 PGVPGGAVCVGREIDVFFLLDGSGSI------DGRDFELQRSFVRDLVSNLMSGDND--- 856 Query: 219 VQSGLVTFSN-KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V+ G+ +S+ + F S + + + + +T + L A + I Sbjct: 857 VRVGVAEYSSTYTQIVFPFSSSQSAIDSSLSSMIQTAGATATGTSLGEAADDI------- 909 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFL 336 +T ++ ++++ MTDGE S ++Q + + G + AIG+ S E L Sbjct: 910 --GSTARSSAARVLILMTDGE-TSDGDEQNIDPSVDALRALGVSITAIGVGNSASESELL 966 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + S + + + + I I+ Sbjct: 967 QIAGSSDHVFNNIAFVDLSSFINQIIGQILD 997 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 69/191 (36%), Gaps = 26/191 (13%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + D++++ D S S+E +I+ + A++ + + Sbjct: 1228 PGPRGASGSGTCTGSTDLILIQDNSGSIEE------RDFQTSIQFLRALVNGADIESSGS 1281 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFD 272 + +TF + E LL VS + L + T + L + I Sbjct: 1282 ---RIAAITFCS---EPTLLTDYVSTTSEALDALNTASNTLTCGTATGAALDFVRENILT 1335 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ +++++ +TDGE+ Q+ +A+ VYAIG+ Sbjct: 1336 --------DRSNSGARRVVIVITDGESQEDFSVVQNAGARLQAEVDD--VYAIGVGSGTD 1385 Query: 333 HEFLRACASPN 343 LR AS + Sbjct: 1386 LAELRVIASSD 1396 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ + Y + + ++ + T Sbjct: 408 EGAVENEKQYVLEPNFANKAVCRTNGFYSLNVQNWYGLHKSVQPLVKRVCDTDRLACSKT 467 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E + Sbjct: 468 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVANLSKEFDISDTD---T 510 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + + + IK + + T++ + YA Q+F Sbjct: 511 RVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF------ 564 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 565 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMVAHHKGVITYAIGVAWAAQEELEI 616 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + I ++I Sbjct: 617 IATHPARDHAFFVDEFDNLYKSVPKIIQNI 646 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 75/235 (31%), Gaps = 47/235 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P T S+ + +D+++ +DVS SM + D ++ + +++ Sbjct: 70 IARPRTFSISEDRDETKGMDIVLSIDVSLSMLAK-DLEPDRLTALKEIARTFIKQRTTD- 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIF 271 + GLV +S + L ++ ++ + TN GL A + + Sbjct: 128 ------RIGLVEYSGEALMRVPLTSDHRVVEEELMSFNPMDLEGGTNIGDGLAVAVSHLR 181 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-- 329 + KII+ MTDG N L A+ VY IGI Sbjct: 182 KSKAKS-----------KIIILMTDGVNTIDNA-MSPLTAAELARNNDIKVYTIGIGSNG 229 Query: 330 ----IRSHE-----------------FLRACA--SPNSFYLVENPHSMYDAFSHI 361 + LR A + ++ + S+ + I Sbjct: 230 LALMPTQQDIFGNLVFTEEQVKIDEYLLRDVAQITGGKYFRATSNESLKQIYEEI 284 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 82/235 (34%), Gaps = 25/235 (10%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N G + + LPI ++ + E+SH K L I+++ + A + N+ + Sbjct: 17 FIKNENGAILLSFIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATL--ALTVDNDES 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 ++ K + I + L + F + ++I Y + Sbjct: 75 PDDDNIKKEKNSKFIIN-----YAKAYLPNEKFSKPV--------INITSHSDYINYQVD 121 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y P K I + + ++K + D++ V D S SM +FD Sbjct: 122 MTIYY--PTKILNKIFQTVSPEVSISDNARALKYTTTDSKPTDVVFVADYSGSMNEYFDE 179 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 S + I ++ + ++++ NNV + I F WG + Sbjct: 180 SDESDEKKIVALRRIFKDIQNEIKYNNV--------NIDIIGFVPFSWGTKNFYS 226 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 11/82 (13%) Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLY----------YCNEAKKRGAIVYAIGIR-VIRS 332 D +Y K+++ ++DG++ + ++ Y C + K G + I I V R Sbjct: 419 DNSYNKLMIIISDGDDSRQTDKEKRYYNISKNLIKDGMCEKIKDNGIKMAFIAIGYVPRE 478 Query: 333 HEFLRACASPNSFYLVENPHSM 354 R C +FY +N H + Sbjct: 479 DIDWRRCVGEENFYFAKNAHEL 500 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 82/210 (39%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + + ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ L++ G ST+ + A ++F+ + Sbjct: 160 ---DAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDKSRGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 77/220 (35%), Gaps = 49/220 (22%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM++ D I +I + +N +E + + GL+ F + Sbjct: 92 DLLLAVDISGSMKTPDMVVQDKQIARILVVKYVVNEFIERRESD-------RLGLILFGS 144 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + L G T + A ++ E Sbjct: 145 QAYLQAPLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRL-----------RERPA 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ +TDG N + + + AK+ G +Y +G+ + Sbjct: 194 SQRVLILLTDGANTA--GEVAPRQAADLAKQAGIKIYTVGVGADQMEQRMGLFGGFSRTV 251 Query: 332 ------SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + LR A + ++ NP + + + K Sbjct: 252 NPSSDLDEDTLRYMAETTGGLYFRARNPQELQAIYEELDK 291 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 74.4 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 75/223 (33%), Gaps = 53/223 (23%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++I LDVS SM + +++ A ++I + + + GLV F+ + Sbjct: 88 IDIVICLDVSDSMLIEDMKPLNRLEAAKETIAKFI-SARTSD------RIGLVVFAGESF 140 Query: 232 EFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +++ +S K T + A + Sbjct: 141 TMVPPTLDYQMILQRVNEISSASSAKIKDGTALGVAMANA-----------AGRLKDSQA 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++FMTDGEN S D ++ AK G VY+IGI Sbjct: 190 RSRVMIFMTDGENNSGTIDPETGL--EIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQK 247 Query: 332 -----------SHEFLRACASP--NSFYLVENPHSMYDAFSHI 361 + + L AS +Y ++ FS I Sbjct: 248 VKTYQPFESTVNEDLLGRMASDTGGKYYRATTEGALQKVFSDI 290 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 75/203 (36%), Gaps = 27/203 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + N + + +VLD S SM ++DM S +L++ + ++ Sbjct: 106 TAEYNVKEAPPFHICLVLDRSTSMHG------ARMDMVKSSALNLLKQFRKQDLIS---- 155 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIK-YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V FS++ E + + + + G T GL+ Q+ + Sbjct: 156 --VVAFSDRAEVVIPPTRVPDLAKDDHRISMLQVGGGTEIYQGLQLGIEQLRSI------ 207 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D + + ++ +TDG +D+ + EA + G + +GI + E L Sbjct: 208 ----DPRFMRQLILLTDGHTYG--DDEACIELAEEAAQDGIQINTMGIGHEWNDELLDKI 261 Query: 340 A--SPNSFYLVENPHSMYDAFSH 360 A S + V +P + F Sbjct: 262 ATISGANSIFVTSPKDLNKFFEQ 284 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 82/210 (39%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + + ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ L++ G ST+ + A ++F+ + Sbjct: 160 ---DAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDKSRGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 82/210 (39%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + + ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ L++ G ST+ + A ++F+ + Sbjct: 160 ---DAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDKSRGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|149050644|gb|EDM02817.1| similar to vitrin (predicted) [Rattus norvegicus] Length = 427 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 98/291 (33%), Gaps = 35/291 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 + + R+ ++ + + +V V N YS + S + Sbjct: 158 KIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFASKAVCRTNGFYSFNVQSWLSLH 217 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + ++ + T D+ V+D S S+ + Sbjct: 218 KTVQPLVKRVCDTDRLACSKTCL--------NSADIGFVIDGSSSVG------TSNFRTV 263 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++ + + +E ++ + G V ++ ++E F + + I+ + + Sbjct: 264 LQFVANLSKEFEISDTD---TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRVGYWSGG 320 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T++ ++YA Q+F N +K+++ +TDG + A Sbjct: 321 TSTGAAIQYALEQLF---------KKSKPNKRKVMILITDG-----RSYDDVRIPAMAAY 366 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 ++G I YAIGI E P + + V+ ++Y I ++I Sbjct: 367 QKGVITYAIGIAWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNI 417 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 48/265 (18%), Positives = 84/265 (31%), Gaps = 50/265 (18%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 SA+++ + L ++ + P V ++D +D+M+ +D S SM + D Sbjct: 49 SALTQLRPLLPILRYLALALCIVALARPRIVDVSTKIKSDKGVDIMLTVDTSLSMLA-RD 107 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 ++ +E + + GLV++S + L L R+I Sbjct: 108 LEPDRLTALKAVAVKFSKERQAD-------RLGLVSYSGEALTRVPLTTDREVLIREINA 160 Query: 250 LSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ- 306 L T GL A N I D + K+I+ MTDG + Sbjct: 161 LESGELEDGTAIGIGLATAINHIKDSKAKS-----------KVIILMTDGVESINPTNDL 209 Query: 307 ---QSLYYCNEAKKRGAIVYAIGIRVIR-----------------------SHEFLRACA 340 A RG VY IGI + L+ A Sbjct: 210 MYISPQTAAEMATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDMMPVDIDEKLLQNIA 269 Query: 341 --SPNSFYLVENPHSMYDAFSHIGK 363 + ++ + S+ + I + Sbjct: 270 DLTGGLYFRATDNQSLQKIYQEIDR 294 >gi|327274818|ref|XP_003222173.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1097 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 83/236 (35%), Gaps = 39/236 (16%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++M I + L+++ V+D S S+ + +I +K++ + + Sbjct: 696 KLIMEICGC--GVKCKENPLELVFVIDSSESVGPE---NFDRIKRFVKTV------IDAV 744 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G++ FS+K++ L+ L+ I + G T + + A N Sbjct: 745 TVNQATARVGIINFSHKVDVVSTLQQYPNKESLKGAIDVMQYLGEGTYTATAISNATNIF 804 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGI-- 327 +K+ + +TDG+ + + D +L +A ++ IG+ Sbjct: 805 KSA----------RPGVRKVAIVITDGQ--ADRRDPNTLEVVVKDAHASNIEIFVIGVVQ 852 Query: 328 -RVIRSHEFLRAC---ASP---NSFYLVENPHSM----YDAFSHIGKDIVTKRIWY 372 F + AS Y +++ ++ F I ++ + I Sbjct: 853 KSDPNFDSFRKEMDLIASDPDIEHVYQIDDFITLQALENKVFKQICENDFSVYITE 908 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 80/203 (39%), Gaps = 18/203 (8%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ VLD S ES D K + ++ L +K + ++ ++ FS+ + Sbjct: 48 IDIVYVLDSS---ESAKDVLFDKQKEFVTLLSDKLFLMKPTRVLRYDIKLAIMQFSSSVR 104 Query: 232 EFFLLE-WGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + W + ++K ++ G T S + A Q+F +G + K Sbjct: 105 IDYPFDEWRSLPDFKLRVKEMTYIGHGTYSYYAISNA-TQLFKTEGRKSSV--------K 155 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACA-SPNSFYL 347 ++V M DG + D Q++ A+ G IG+ V + L SP + Sbjct: 156 VVVLMADGIDHPKSPDVQAIS--EAARTFGISFITIGLSNVADKVKLLSISGNSPGTPVF 213 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + N ++ D + + ++ Sbjct: 214 ILNDPNLLDKIRNQLAALFDEQC 236 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +I A + + + + + GLV F+ + Sbjct: 88 IDIMLAIDVSGSMLA-RDFKPDRITAAKEVAGSFIAD-------RYGDRIGLVAFAGEAF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S LQ + + T GL A N++ + K Sbjct: 140 TQSPLTTDQSTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESDAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------ 331 +I+ +TDG N + + AK +G VY IG+ Sbjct: 189 VIILLTDGVNN--QGQIAPMTAAEIAKAQGIRVYTIGVGTEGMAPYPAIDMFGNLTFVNQ 246 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+A + + ++ + + + I + Sbjct: 247 KVEIDEKVLKAISDMTGGRYFRATDKEKLKAVYDEINQ 284 >gi|156523281|ref|NP_001096038.1| anthrax toxin receptor 2b [Danio rerio] gi|151335854|gb|ABS00409.1| capillary morphogenesis protein 2B [Danio rerio] Length = 487 Score = 74.0 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 81/228 (35%), Gaps = 28/228 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 W + + + D+ VLD S S+ + I + Sbjct: 13 WVLQLFCLSLLTPFGTGEDGSCHGAFDLYFVLDRSGSVSDNW-----------LEIYGFV 61 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLK 264 E++ V+ ++ + FS+ E L + ++ LSK T GLK Sbjct: 62 EQLTNRF-VSPKMRVSFIVFSSSAEIILPLTGDRVDIDSGLQQLSKIRPAGDTYMHEGLK 120 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A Q+ II+ +TDG+ L ++ ++ + A++ GA VY Sbjct: 121 KAIEQMTSQGARASS----------IIIALTDGK-LEVFMNELAIKEADLARQYGARVYC 169 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +G++ +++ + + + V + + A +I I+ K Sbjct: 170 VGVKDFDANQLTEIADNKDQVFPVVDG---FQALKNIVNSILQKSCVE 214 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 74.0 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 63/178 (35%), Gaps = 27/178 (15%) Query: 165 NSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 Q +D+++ LD+S SM S ++ +T++D A + + + + + Sbjct: 82 EGQEAEGIDIVLALDLSGSMRALDLSTRENIVTRLDAAKEVVQEFIGK-------RPHDR 134 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ L L++ ++ L T L + N++ D Sbjct: 135 IGLVAFAADAFVVSPLTLNHDWLKKNVQRLELGDINLSGTAIGTALGASVNRLRD----- 189 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + +I++ +TDGEN L AK VY I E Sbjct: 190 ------HESRSRIVILLTDGENN--SGTLSPLSAAEAAKSLNVKVYTIATGRKGRVEV 239 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 74.0 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 73/216 (33%), Gaps = 46/216 (21%) Query: 174 MMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +M+ LD+S SME +++++ + + E + + GL+ F + Sbjct: 92 IMMALDLSGSMEIPDMILHGRPTSRLNIVKSAAEQFVRE-RSGD------KIGLILFGTR 144 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +++ + G +T+ + A ++ Sbjct: 145 AYLQTPLTYDRHSILLRLEDATAGLAGKTTSIGDAVGLAVKRLDSAPKKG---------- 194 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------------- 331 ++I+ +TDG N L AK+ G +Y IG+ Sbjct: 195 -RVIILLTDGANN--SGVLAPLKAAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAA 251 Query: 332 --SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ + + ++ + S++ + I + Sbjct: 252 DLDEETLKKMSDMTGGRYFRATDTESLHLIYKTINQ 287 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 90/287 (31%), Gaps = 47/287 (16%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 F + D S L + + GY S + ++ F+P + + S Sbjct: 50 AFFSTTADPGASAGLTPISAVRQIGYRFSPNFKSDGKVEIL-FVPLAPGGAVLDGKVRVS 108 Query: 162 VKVNSQTDARLD------------------MMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 V S + ++I +D S SM D ++ A + Sbjct: 109 FIVTSNLRFEISGSAAISAVRPAPGGRPTQVVIDIDGSSSMR-RSDPKRERVRAAKR--- 164 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 +E + + N G++ F+ +EE + G+ I+ + G + Sbjct: 165 -FVETLSRVDKRNQF---GVIEFNTTVEERAPMGSGMKATSDAIQAVDAVGSTAL----- 215 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Y + + Y++ I+ +TDG++ ++ ++ N AK +Y Sbjct: 216 ---YTSLIRSIDALEGSGKTG--YRRAILVLTDGKDTASSHGVATVI--NRAKAAKVRIY 268 Query: 324 AIGIRVIRSHEFLRACA--------SPNSFYLVENPHSMYDAFSHIG 362 + + + L + F V+ + F I Sbjct: 269 VVSLGGAGDQKGLGYVGPMQRLTTETGGVFTHVDRADDLVARFDAIA 315 >gi|18042139|gb|AAL57848.1|AF454755_1 vitrin [Mus musculus] Length = 650 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + + +V V N YS + S + + ++ + T Sbjct: 402 EGAAEREKQHVVEPVFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDTDRLACSKT 461 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ + ++ + + +E ++ Sbjct: 462 CL--------NSADIGFVIDGSSSVG------TSNFRTVLQFVANLSKEFEISDTD---T 504 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + + + I+ + + T++ ++YA Q+F Sbjct: 505 RVGAVQYTYEQRLQFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF------ 558 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A ++G I YAIGI E Sbjct: 559 ---KKSKPNKRKVMIIITDGR-----SYDDVRIPAMAAYQKGVITYAIGIAWAAQDELEV 610 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V++ ++Y I ++I Sbjct: 611 MATHPAKDHSFFVDDFDNLYKIAPRIIQNI 640 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ + Y + + ++ + T Sbjct: 347 EGAVENEKQYVLEPNFANKAVCRTNGFYSLNVQNWYGLHKSVQPLVKRVCDTDRLACSKT 406 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E + Sbjct: 407 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVANLSKEFDISDTD---T 449 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + + + IK + + T++ + YA Q+F Sbjct: 450 RVGAVQYTYEQRLEFGFDQYTTKPDVLNAIKRVGYWSGGTSTGAAINYALEQLF------ 503 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 504 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMVAHHKGVITYAIGVAWAAQEELEI 555 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + I ++I Sbjct: 556 IATHPARDHAFFVDEFDNLYKSVPKIIQNI 585 >gi|272938026|gb|ACZ96962.1| vwa2 protein [Danio rerio] Length = 260 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 74/209 (35%), Gaps = 31/209 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVTFS 227 LD++ VLD S + + + D+N +V Q GLV + Sbjct: 1 GQALDLVFVLDASSGVGKENFIHFQ----------DFVRSTSVQFDINRDVAQVGLVVYG 50 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQHCNTED 284 + F L+ S + R + + G + L + +Q + +G R N Sbjct: 51 RRPVTVFDLDKYNSGSAVLRAVGDAAFLGGKASVGSALLHVLSQSLTVGKGARPGVN--- 107 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 K +V +TDG + + + + G ++ IG+ I+ LR S + Sbjct: 108 ----KAVVVLTDG-----TGVEDAAVPAQKIRDSGVSIFLIGVGDIQQELLLRISGSEDH 158 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 V + + +D++ + + D Sbjct: 159 MITVPSYDDL-----KYSEDVLVQMVCSD 182 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 35/152 (23%), Positives = 63/152 (41%), Gaps = 16/152 (10%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-NYKKIIV 292 + S L+ +I +S G STN GL + + + G + A Y+ +V Sbjct: 453 TPVSSQSSTLKNQIDSMSPSG-STNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYLV 511 Query: 293 FMTDGENL-----------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---EFLRA 338 ++DG N S + D + C + K G +++ + + V + L+ Sbjct: 512 LLSDGLNTRNRWSGNGSDHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGNRDPLSQVLQD 571 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 CAS +F ++ + + DAF +I I RI Sbjct: 572 CASNGNFQMITSANQTADAFQNILTQISQLRI 603 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 73/225 (32%), Gaps = 28/225 (12%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +I F + + + + AI L + +G ++ + ++ + + +D +++ + Sbjct: 13 SISRFDRDIRANIAPIFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVLMVSRDAA 72 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 ++ +I + F N ND S S + Sbjct: 73 ANPAMTSQ----------QITDAVQRYF------NSLYNDKSAFNVSVSAAYTPSTSSAA 116 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 I A + I F I + TSS + R + +VLD + SM Sbjct: 117 AKILASGQGAIETDF-MKIAGFPQLS----FGTSSTSTWGNSRMR--VALVLDNTGSMRD 169 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 K+ ++ M++ + + V ++ F+ + Sbjct: 170 NG-----KMAALQRAAKDMIDSLSAFAKTADDVYISIIPFAKDVN 209 >gi|74196449|dbj|BAE34363.1| unnamed protein product [Mus musculus] Length = 650 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 76/197 (38%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ + ++ + + +E ++ + G V ++ ++ Sbjct: 467 DIGFVIDGSSSVG------TSNFRTVLQFVANLSKEFEISDTD---TRVGAVQYTYEQRL 517 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + + I+ + + T++ ++YA Q+F N +K+ Sbjct: 518 EFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPNKRKV 568 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDG A ++G I YAIGI E P + + V Sbjct: 569 MIIITDGR-----SYDDVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKDHSFFV 623 Query: 349 ENPHSMYDAFSHIGKDI 365 ++ ++Y I ++I Sbjct: 624 DDFDNLYKIAPRIIQNI 640 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 68/154 (44%), Gaps = 14/154 (9%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +E L + + ++ + L+ G +TN T G+++ + + + Sbjct: 394 TGSLEPITGLTFDLQSVETAVNKLTPSG-NTNVTIGVQWGMEALTAAAPLT--GVRTGSE 450 Query: 287 YKKIIVFMTDGENLST---------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +K+++ +TDG N K D ++L CN AK G +Y + + V + + L+ Sbjct: 451 VRKVMIVLTDGLNTQNRWWGSRDRNKIDARTLAACNNAKAMGIELYTVRL-VEGNEDLLK 509 Query: 338 ACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 CA + + ++ V + + F+ + + + R+ Sbjct: 510 TCAETEDKYHYVTSASQLKTTFADLARQVKGVRL 543 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 75/229 (32%), Gaps = 44/229 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ + + +G ++T + L + G +++ + ++ + + +D S + A + + Sbjct: 28 LKQYRDDRRGQFAVITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSALAAVS--YS 85 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 R+K L D +V Sbjct: 86 GEEQTEREKRADTLFLNNTAGIAIEDTDLSEEDGAWV----------------------- 122 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 YK K T M + + ++ +D+++VLD S SM Sbjct: 123 -------YKAEYKIKTNFLRVVGIDEFEMESQGAAAL---ANSPMDVVLVLDSSGSM--- 169 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + ++ S+ LEE K N++ Q LV F +++ L Sbjct: 170 --AQDNRMVELKASVKLFLEEFKS----NDLTQVALVPFDTQVKATSSL 212 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 83/201 (41%), Gaps = 22/201 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + +++++E ++ PD V+ GLV +S+ Sbjct: 32 DIVFLVDGSASIG------LDNFQQIRQFLSSLVENFEVAPDK---VRIGLVQYSDTPRT 82 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + I+ L T++ GL++ I + + + N +I Sbjct: 83 EFSLNTYQNKEEILDYIRNLRYKTGGTHTGQGLEF----ILKQHFIEEAGSRAQQNVPQI 138 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG++ + Q E ++RG ++AIGI+ LR A+ V + Sbjct: 139 AIVITDGDSQDEVDLQ-----AQELRQRGIKIFAIGIKDADV-RLLRQIANEPYDQYVYS 192 Query: 351 PHSMYDAFSHIGKDIVTKRIW 371 + A I + +V + Sbjct: 193 VSD-FAALQGISQSVVRELCT 212 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 43/329 (13%), Positives = 107/329 (32%), Gaps = 31/329 (9%) Query: 40 VSHIFFMKTVLHSMIDRSLVH-AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS---FR 95 + + + + + I + + + + R D R + Sbjct: 472 LLNTHRTRNEMSTSIRNLQLTPTGERRIGHAIEHARSNFFNRDAGSRAAEGYKQFLLVIA 531 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEG--YSISAISRYKIPLKFCTFIPWYTNSRH 153 +G + I + ++ + N Y + I+ K IP Sbjct: 532 AGESADGVIQASRKIKKD-AVTVFAAGLNRADAYEMKDIASQSHNYKLVGNIPLVQQKIK 590 Query: 154 IVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + V V+ + + D+ ++D S S++S + + + + + Sbjct: 591 VAVDTWEDVAVSKDCEFDVVADIAFIVDQSSSIKS------RNFQLVRDFLENTIGRLDV 644 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +Q ++ +S+ L + + R I L T + L++A Sbjct: 645 ---GKDKIQIAVILYSDFPRADVYLNTFSNKNDILRYINTLPYGRGKTYTGAALRFAKEH 701 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +F + + D +++ V +TDG++ E ++ G ++A+GI+ Sbjct: 702 VFT----KARGSRRDKYVQQVAVVITDGKSTDDAASAA-----AELRRSGVSIFALGIKD 752 Query: 330 IRSHEFLRACASPNSFY--LVENPHSMYD 356 + + + P + VEN + Sbjct: 753 TKEDDLREIASYPPKKFVLNVENFDQLNS 781 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 79/238 (33%), Gaps = 33/238 (13%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK------VNSQTDARLDMMIVLDV 180 I+ + F S+ +V + +SVK A+ D+++++D Sbjct: 179 RQIANEPYDQYVYSVSDFAALQGISQSVVRELCTSVKDVIETTRGCIDTAQADIVLLVDS 238 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-- 238 S S+ + K ++A ++ L PD +V+ GL FS++ + FLL Sbjct: 239 SGSIGDN------DFEEVKKFLHAFVDRFNLRPD---LVRLGLAQFSDRPYQEFLLGDYA 289 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 L +K+ L T + L + F + N I + +TDGE Sbjct: 290 DKKDLHQKLNNLIYRKGGTQTGQALTFIRENYFSLA---------RPNVPGIAIVITDGE 340 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSM 354 + G ++ I + + P + + N + Sbjct: 341 -----SRDDVEEPAQRLRNTGVSLFVIRVGKGNMEKLRAIANIPHEEFLFSINNYQEL 393 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 80/229 (34%), Gaps = 29/229 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + I+ ++ D++ ++D S S++ D+ +++ ++ Sbjct: 1141 GLRGLQKIITQELCNLTKPICEMEVADLVFLIDGSESIKP------PSWDILKQTMIGIV 1194 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLK 264 +E+ + + + G+ FS+ + F L S ++ I + + TN+ LK Sbjct: 1195 KELDIA---KDKWRVGVAQFSDILLHQFYLNTYTSFAEVEEAINNIKQRKQGTNTWDALK 1251 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 G R + ++ +TDGE K+ L + K + + Sbjct: 1252 LIKYYFTKENGSRIEGG-----VAQNLLLITDGEANDEKD----LNALADLKNKKIAITV 1302 Query: 325 IGIRVIRSHEFLR-ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 IGI LR SP+ + + F + +++ + Sbjct: 1303 IGIGNEIKKSELREIAGSPDRV--------LIETFESLELKTTIRKVLH 1343 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 79/260 (30%), Gaps = 61/260 (23%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I +V + A+ D+ +LD S S+ + K ++E + + Sbjct: 799 IPKFSLGAVLEGCKQTAKADIYFLLDESGSISYPDFEDMKKF---------IMECLDVFQ 849 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGV-------------STN 258 + V+ G+V F++K F L ++ S +++ +K L +G ++ Sbjct: 850 IGKDHVRIGVVKFASKATTVFRLHDYSTKSDVEKAVKDLEMYGGCEGLLADVVFLIDGSD 909 Query: 259 STPG---------LKYAYNQIFDMQGMR-----QHCNTEDANYK--------------KI 290 S ++Y + Q+ + + + Sbjct: 910 SVSAEDFEKMKDIMEYVIEKFAIGSEKERVAVVQYGTNPNEEFSLNAFDNKDRLLQEIRN 969 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I +TDGE + YAIG+R + L S + Sbjct: 970 IRQVTDGE-----SRDDVALPAKALRDNSINTYAIGLRHANRSQILAIAGSHGEVFY--- 1021 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 + + +++ K Sbjct: 1022 -EDAVASLKELSSEVLLKIC 1040 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 47/212 (22%), Positives = 81/212 (38%), Gaps = 26/212 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +D+++VLD S SM D+ I+K+++A + M+E ++ G++ F Sbjct: 401 EKEKNIDVVLVLDHSGSMADTEDAGISKLEIAKSASAKMIEHLESSDG------VGVIAF 454 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I + + G T P L A + + + Sbjct: 455 DHNYYWAYEFSKLVRKKDVIESISSI-EVGGGTAIIPPLSEAVKTLKKSKAKSKL----- 508 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--P 342 IV +TDG + NEAK+ + IG+ + L AS Sbjct: 509 ------IVLLTDGMGEQGGYEI----PANEAKRNNIKITTIGVGKFVNLPVLSWIASFTS 558 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 FYLV NP+ + D F K I K + K Sbjct: 559 GRFYLVSNPYELVDVFLKETKIIKGKYMKEKK 590 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 73.7 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 83/210 (39%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + ++ ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLSEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ L++ G ST+ + A ++F+ + Sbjct: 160 ---DAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDQSRGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|47220812|emb|CAG00019.1| unnamed protein product [Tetraodon nigroviridis] Length = 1557 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 78/210 (37%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +A+ D++ ++D S S+ IT+ +++ + + LI + Q + Sbjct: 883 EVCKEAKADLVFLVDGSWSIGDENFMKITR------FLHSTVGSLDLI--GTDGTQVAIA 934 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ F L L I+ +S G +T + +K+ IF ++ Sbjct: 935 QFSDDARTEFQLSSHSNKEALLEAIQKISYKGGNTKTGRAIKHVKESIFSLE------AG 988 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K++V +TDG E + G I++AIG E + + Sbjct: 989 ARRGVPKVLVVLTDGR-----SQDDVNKVSKEMQMDGYIIFAIGFADADYGELVNIASKP 1043 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + V++ DA I + ++T Sbjct: 1044 SDRHVFFVDD----LDAVKKIEEQLITFVC 1069 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 56/160 (35%), Gaps = 15/160 (9%) Query: 219 VQSGLVTFS-NKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V GL +S + E+ L + + ++ L G +T + L + F + Sbjct: 83 VFLGLAQYSGDPRIEWHLNAYSTKDAVIDAVRNLPYKGGNTLTGLALNFILENCFKPE-- 140 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + KI + +TDG K + + G ++AIG++ +E Sbjct: 141 ----SGSREGLPKIGILITDG-----KSQDDVVPPAESLRNAGIELFAIGVKNADENELQ 191 Query: 337 RACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + P Y V + M + + + + + DK Sbjct: 192 SIASPPEDTHVYNVADFSVMNSIVEALTRTVCEQVVQQDK 231 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 72/215 (33%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+V+D+S SM ++ D I ++ ++ ++ K + GLV F++ Sbjct: 86 DMMLVIDLSYSMSQQDMAYQDDYIDRLTAVKHVVSDFVDRRKGD-------RVGLVYFAD 138 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ ++ L G T G+ A D Sbjct: 139 HAYLQTPLTFDRETVKTQLNQTVLKLIGTQTAIGDGIGLATKTFVDSNA----------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + AKK G +Y IG+ Sbjct: 188 PQRVMILLSDGSNNA--GVLDPVQAAEIAKKYGTTIYTIGVGAGEMQVKDFFMTRTVNTA 245 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHI 361 ++ + ++ N + + I Sbjct: 246 EDLDEKTLIKIANITGGQYFRARNADELATIYDTI 280 >gi|291388471|ref|XP_002710799.1| PREDICTED: collagen, type XIV, alpha 1 [Oryctolagus cuniculus] Length = 1796 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 55/335 (16%), Positives = 111/335 (33%), Gaps = 42/335 (12%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + + + G N + LC + + + D Sbjct: 903 QASGFSDALTGMVKTLFLGVTNLQANQIEMTSLCAQWQMHRHATAYRVVIES-LQDTQKQ 961 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKI-----------PLKFCTFIPWYTNSRHIVMPIT 159 + S ++ Y + S YKI P + P T Sbjct: 962 ESTVSGGTT---RHCFYGLQPDSEYKISVYTKLQELEGPSVSIMEKTESFPTEPPTFPPT 1018 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 A+ D++ ++D S S+ D + KI I + + + + I + Sbjct: 1019 IPPAREVCKAAKADLVFMVDGSWSIG---DDNFNKI---INFLYSTVGALDKI--GTDGT 1070 Query: 220 QSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q +V F++ F L + L IK +S G +T + +K+ + +F + Sbjct: 1071 QVAMVQFTDDPRTEFKLNAYETKETLLDAIKRISYKGGNTKTGKAIKHVRDTLFTAESGT 1130 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K+IV +TDG + E + G ++A+G+ E + Sbjct: 1131 RRG------IPKVIVVITDG-----RSQDDVNKISREMQSDGYNIFAVGVADADYSELVN 1179 Query: 338 ACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P+ + V++ +DAF I +++T Sbjct: 1180 IGSKPSARHVFFVDD----FDAFKKIEDELITFVC 1210 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 81/223 (36%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 + VK QT A D++I++D S S+ F + + LE + +V Sbjct: 142 SVPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFNVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRVGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVPKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDR 343 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 76/232 (32%), Gaps = 41/232 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSM---ESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNV 218 + T + + +V+D S SM + D + ++ +E + + N Sbjct: 76 EQQVTTSEGIAIEMVVDRSGSMQAMDFKIDGEHVDRLTAIKNVAGKFVEGKEELEGRFND 135 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDMQ 274 + GL+TF+ + +L ++ + ++ T + A ++ + Sbjct: 136 L-VGLMTFAGYADGITPPTLDHPYLVSQLNNIQIVTNRSEDGTAIGDAISLAVEKLNALD 194 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------ 328 R K+I+ +TDGEN + + + + A+ G VY IG+ Sbjct: 195 ARRDEKVK-----SKVIILLTDGENNA--GEVEPIQAAELAETLGIKVYTIGVGTKGEAP 247 Query: 329 -----VIRSHEFLRACA--------------SPNSFYLVENPHSMYDAFSHI 361 + ++ + ++ + S+ + I Sbjct: 248 VPVTDPFSGKQVVQWMPVNIDEATLQKVADLTHGKYFRATDTDSLEKIYHEI 299 >gi|327277464|ref|XP_003223484.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like, partial [Anolis carolinensis] Length = 750 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 73/204 (35%), Gaps = 28/204 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ A +D+M +LD S S+ +S + ++ + V Sbjct: 38 SAAGQLMNCSAPVDVMFLLDGSYSIGK---------GSFERSKHLAIKLCDALDISPEKV 88 Query: 220 QSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + FSN F L + H ++ K+K + G T + LKY + F Sbjct: 89 KVGAIEFSNTAYLEFSLDAYFTKHQIKDKLKRIVFKGGRTETGLALKYILRKGF------ 142 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 H + KI++ +TDG K + K+ G V+ +G+ R E L Sbjct: 143 -HGSRNST-VPKILIILTDG-----KSQGNIAAPAKQLKEMGITVFVVGVSFPRWEE-LH 194 Query: 338 ACASPN---SFYLVENPHSMYDAF 358 AS E+ + F Sbjct: 195 ILASDPTEWHLLFAEDTDDAVNGF 218 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 72/211 (34%), Gaps = 22/211 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S++ +D++ LD S ++ I+++ +V Q Sbjct: 521 SIESQDCPSQPVDLVFALDSSATVG------RDSFAHLKTFISSL---SSKFDINRDVTQ 571 Query: 221 SGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV F + + F L+ S+L I G S + L Y Y+ + +Q Sbjct: 572 IGLVVFGKRPKTIFGLDTHVASSNLHEAISQAPFMGGSASVGSSLLYIYDHVMTVQ---- 627 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 KKI+V +T G ++ + + + +GI + + L+ Sbjct: 628 --KGARPGVKKIVVVITSG-----TGVDDAIIPAQQLRNNDISLLVVGIGPVLTDSLLKI 680 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 S N+ + + + I + I + Sbjct: 681 TGSHNNLLRIFSYKDLKYNEGLIIEKICDES 711 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 73/203 (35%), Gaps = 32/203 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ ++D S S T ++ ++ + ++ + G+ +S+ + Sbjct: 341 SVDLLFLVDSSSS---------TTLEGFLRYKAFLKRFLQAVWSSETSGNVGVAQYSDDV 391 Query: 231 EEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANY 287 E + L + I + G ST + L+Y YN + D + Sbjct: 392 EMTVQVGDYKDVLSLVKVIDSMQFNGGSTLTGKALRYVTYNGFQSAP---IFADVPD-DL 447 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-----CASP 342 ++++ +TDG + RG V+ IG+ EFLRA S Sbjct: 448 PRVVILLTDG-----NAQDSVVEAAKY--SRGQDVFLIGV----GSEFLRAELEEITGSS 496 Query: 343 NSFYLVENPHSMYDAFSHIGKDI 365 + P +++ + + + I Sbjct: 497 KRTVIYSTPQDLFNKITELQRKI 519 >gi|293359740|ref|XP_233802.5| PREDICTED: vitrin [Rattus norvegicus] Length = 648 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/297 (14%), Positives = 99/297 (33%), Gaps = 35/297 (11%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + + + R+ ++ + + +V V N YS + Sbjct: 373 DGGPTDKIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFASKAVCRTNGFYSFNVQ 432 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 S + + ++ + T D+ V+D S S+ Sbjct: 433 SWLSLHKTVQPLVKRVCDTDRLACSKTCL--------NSADIGFVIDGSSSVG------T 478 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYL 250 + ++ + + +E ++ + G V ++ ++E F + + I+ + Sbjct: 479 SNFRTVLQFVANLSKEFEISDTD---TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRV 535 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T++ ++YA Q+F N +K+++ +TDG + Sbjct: 536 GYWSGGTSTGAAIQYALEQLF---------KKSKPNKRKVMILITDG-----RSYDDVRI 581 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 A ++G I YAIGI E P + + V+ ++Y I ++I Sbjct: 582 PAMAAYQKGVITYAIGIAWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNI 638 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 609 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 660 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 661 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 715 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 716 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 757 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 826 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 881 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 882 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 933 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 991 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 992 AEYDVAYGERHLFRVPSYQALLRGVFHQTV 1021 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 78/201 (38%), Gaps = 19/201 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V D S S+ + ++ + +K I K PD +VTF+++ + Sbjct: 253 DIAFVFDASSSINANNPNNYQLMKNFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 307 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S ++ I +S + T GL+ A ++F +K Sbjct: 308 QFGLKDYSSKADIKGAIDKVSPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQK 362 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+N K + +K G ++ AIG+ L AS + Sbjct: 363 VVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGVGTGFLKSELINIASSEEYVF-- 417 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 S +D S I +D+V Sbjct: 418 -TTSSFDKLSKIMEDVVKLAC 437 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 84/210 (40%), Gaps = 13/210 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D + D++++ D S S++ + + +K + V + + N Q G+V F Sbjct: 48 ECDVQADIIVLFDDSSSIQYDNKENYQMMKDFVKELVDSFTTVGV--NGRNGSQFGVVQF 105 Query: 227 SNKIEEFFLLEWGVSH--LQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S ++ F L + +++ I + + G T GLK+ F Sbjct: 106 SQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGAEGG-----G 160 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + + +KI++ MTDG++ + Q + ++ K G V AIGI L A+ Sbjct: 161 NPDKQKIVILMTDGKSNAGAPPQ---HEAHKLKAEGVTVIAIGIGQGFVKTELEQIATMK 217 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ L N S + D+ + D Sbjct: 218 NYVLTTNSFSELSTLLKLVIDLACEVCVVD 247 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 609 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 660 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 661 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 715 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 716 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 757 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 826 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 881 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 882 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 933 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 991 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 992 AEYDVAYGESHLFRVPSYQALLRGVFHQTV 1021 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEAISQTI 232 >gi|293347920|ref|XP_001064219.2| PREDICTED: vitrin-like [Rattus norvegicus] Length = 648 Score = 73.7 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 43/291 (14%), Positives = 98/291 (33%), Gaps = 35/291 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 + + R+ ++ + + +V V N YS + S + Sbjct: 379 KIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFASKAVCRTNGFYSFNVQSWLSLH 438 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + ++ + T D+ V+D S S+ + Sbjct: 439 KTVQPLVKRVCDTDRLACSKTCL--------NSADIGFVIDGSSSVG------TSNFRTV 484 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++ + + +E ++ + G V ++ ++E F + + I+ + + Sbjct: 485 LQFVANLSKEFEISDTD---TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRVGYWSGG 541 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T++ ++YA Q+F N +K+++ +TDG + A Sbjct: 542 TSTGAAIQYALEQLF---------KKSKPNKRKVMILITDG-----RSYDDVRIPAMAAY 587 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 ++G I YAIGI E P + + V+ ++Y I ++I Sbjct: 588 QKGVITYAIGIAWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNI 638 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 69/198 (34%), Gaps = 24/198 (12%) Query: 174 MMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++LD S SM+ + + ++ D+ + + + + + GLV F + Sbjct: 86 IALILDASGSMKERGFDPVNPAASRFDVVKSIVKDFISQ-RTNDN------MGLVVFGSY 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L R + L G+ Y +++ + K Sbjct: 139 SFIASPLTYDKHILSRIVSQLE---------VGMAGKYTALYEALAQGVNLLKMSKAKSK 189 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE---FLRAC-ASPNSF 345 + + +TDG + + + + AKK G VY IGI + L+ + Sbjct: 190 VAILLTDGYSTAGADKIPLDVVLDMAKKEGVKVYPIGIGGPDEYNRAVLLKIAKETGGVA 249 Query: 346 YLVENPHSMYDAFSHIGK 363 + N + + + I + Sbjct: 250 FGASNASQLKEVYKKIDE 267 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 77/197 (39%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ + ++ + + +E ++ + G V ++ ++ Sbjct: 467 DIGFVIDGSSSVG------TSNFRTVLQFVANLSKEFEISDTD---TRVGAVQYTYEQRL 517 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + + I+ + + T++ ++YA Q+F N +K+ Sbjct: 518 EFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPNKRKV 568 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDG + A ++G I YAIGI E P + + V Sbjct: 569 MIIITDG-----RSYDDVRIPAMAAYQKGVITYAIGIAWAAQDELEVMATHPAKDHSFFV 623 Query: 349 ENPHSMYDAFSHIGKDI 365 ++ ++Y I ++I Sbjct: 624 DDFDNLYKIAPRIIQNI 640 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 77/211 (36%), Gaps = 19/211 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ +K D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLKDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 92 HYSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K +V +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDIVT 367 A+ +++ + + DA I + I T Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTIET 234 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 60/170 (35%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 609 CKCGPIDILFVLDSSESIG------LQNFEIAKDFIVKVIDRLSKDELVKFEPGQSHAGV 662 Query: 224 VTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + L+ IK L G T + L+Y +++ Sbjct: 663 VQYSHNQMQEHVDLRDPNIRSAQDLKEAIKKLQWMGGGTFTGEALQYTRSRLL------- 715 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 716 ----PPTPNNRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIK 757 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + ITK K + + V+ +V +S Sbjct: 825 SPADITILLDGSASVGSH-NFDITK--RFAKRLAERFLTASRTDPGQD-VRVAVVQYSGT 880 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + F +T+ L Y R + Sbjct: 881 GQQRPERAALQFLQNYTVLANTVDSMDFFNDATDVIDALGYV---------TRFYREASS 931 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + EA++ G ++A+ + Sbjct: 932 TAAKKRLLLFSDGNSQGATPATIEKAV-QEAQRAGLEIFAVVVGR 975 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 82/266 (30%), Gaps = 42/266 (15%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 N KG + + A + + ++G +++ + + + D + + + Sbjct: 14 RLARNQKGNVMAMVAAAIIPLAALIGGGLDMGRAYMARARMQQACDAAALAGRRAMTTSS 73 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 K F +T ++ E ++ Sbjct: 74 MTQANKDEAKKFF-------------------DFNFPQGTFQAATFTPVIRSKPGETTTV 114 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM----E 185 + +P + + + +T + + D+M+VLD + SM Sbjct: 115 QVTASTTMPTTVMKIFRYE----TLPLSVTCEARFDIGNT---DVMLVLDTTGSMAYAIS 167 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 S T++ +++ + + + ++ G + +S+ + + L ++L Sbjct: 168 DGKGGSTTRLAALKQAVKDFYDTLGAGSNATGRIRYGFMPYSSTVNVGYQLP--TNYLVG 225 Query: 246 KIKY----------LSKFGVSTNSTP 261 I L+ +G +TN T Sbjct: 226 GISGETWDYQTRRLLTTYGTATNETG 251 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/182 (18%), Positives = 60/182 (32%), Gaps = 38/182 (20%) Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GMRQHC 280 L +++N+ S I L G T G+ + + + Sbjct: 499 LTSYTNRTSTPTGQS---SSFNSYIDNLIAVG-GTYHDIGMLWGARFLSPKGIFASDNNS 554 Query: 281 NTEDANYKKIIVFMTDGE-----------------------NLSTKE-----DQQSLYYC 312 N + IVFMTDG+ N S + + + C Sbjct: 555 APNGFNISRHIVFMTDGDMSAYQQVYGAYGYQQLDARVAPGNTSDTDLTAIHNTRLQMLC 614 Query: 313 NEAKKRGAIVYAIGIRVIRSHEF---LRACASPNSFY-LVENPHSMYDAFSHIGKDIVTK 368 N K +G ++ IG R L+ CA+ ++ + + + S+ F I K+I Sbjct: 615 NAIKAKGITIWVIGFRNQSEGNIQTPLQNCATSSNHWTMAYDATSLSQKFKDIAKNIGGL 674 Query: 369 RI 370 R+ Sbjct: 675 RV 676 >gi|51095063|gb|EAL24307.1| similar to matrilin 2 precursor [Homo sapiens] Length = 448 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 93/275 (33%), Gaps = 40/275 (14%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + + + E +SA S PL+ +P + +S Sbjct: 63 QEFPADRDGSLALHSTYESLRLSASSWTVNPLRGINMMPS-------SLAPSSQGCGPKC 115 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L+++ V+D S S+ + + M + V L + G++ +S Sbjct: 116 KETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGIINYS 166 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 +K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 167 HKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------RP 216 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRA 338 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 217 GVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNLI 275 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 276 ATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 306 >gi|75076662|sp|Q4R7B7|ANTRL_MACFA RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|67969305|dbj|BAE01005.1| unnamed protein product [Macaca fascicularis] Length = 557 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 69/203 (33%), Gaps = 22/203 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + + + + ++ +T+S Sbjct: 70 QGSFDLYFILDKSGSVNNNWID-------LYMWVEETVARFQSSD-----IRMCFITYST 117 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + L K ++ ++ + + N Sbjct: 118 DGQTVLPLTSDKNRIKNGLDQLRKIVPDGHTF--MQAGFRKAIQQIETFNSGNK----VP 171 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +I+ MTDGE L Q +L +A+K GA VY + + + + SP + V Sbjct: 172 SMIIAMTDGE-LVAHAFQDTLREAQKARKLGANVYTVDVADYKLDQITAIADSPEHVFAV 230 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 EN + A + +K Sbjct: 231 ENG---FKAMRDTVDALTSKVCL 250 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 73.3 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 79/264 (29%), Gaps = 39/264 (14%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 PL+ + + V R +++V+DVS SME+ D Sbjct: 47 KVAPSRPSPLRHAPIALMLVGLVFLTIAAAGPTSVQKVPRNRATVVLVMDVSLSMEA-TD 105 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 +++++A ++ + + + G VTF+ ++ I Sbjct: 106 VPPSRLEVAQQAGKEF------VDGLTQGINLGFVTFAGTASVMQSPTTNREAVKAAIDN 159 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ-- 307 + T + G+ A I + + T IV M+DG+ + Sbjct: 160 IK-LAERTATGEGILTALQSIETLATVLGGAETPPPAR---IVLMSDGKQTVPDDKDVDN 215 Query: 308 ---SLYYCNEAKKRGAIVYAIGIR---------------------VIRSHEFLRACA--S 341 + AK +G V I V +E LR A S Sbjct: 216 PRHAFTAARLAKSKGIPVSTISFGTEWGSVEIPDQDGQGGSQRVKVPVDNESLREIAKLS 275 Query: 342 PNSFYLVENPHSMYDAFSHIGKDI 365 FY + + + + + I Sbjct: 276 GGEFYTASSLEELTAVYDTLEEQI 299 >gi|291393629|ref|XP_002713393.1| PREDICTED: alpha 1 type VII collagen [Oryctolagus cuniculus] Length = 2937 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 79/203 (38%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S + R I+ LS G +T + +++ + IF Q R K+ Sbjct: 91 EFGLDTLASGGEVIRAIRELSYKGGNTRTGAAIRHVADHIFLPQLTRPGI-------PKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 + +TDG K Q K +G ++A+GI+ E R + P + F+ V Sbjct: 144 CILITDG-----KSQDQVDAAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTGDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 73/214 (34%), Gaps = 33/214 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P TS + D++ +L +R + + A + + ++ + + Sbjct: 1035 VRGPETSVTQNPGCPRGLADVVFLLHATR-------DNAHRAGAAKRVLERLVSALGPLG 1087 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYA 266 VQ GL+++S++ F L SH R I Y+ G + + + Sbjct: 1088 P--QAVQVGLLSYSHRPSPLFPL--NSSHDLGIILQKIRDIPYMDPSGNNLGTAVVTAHR 1143 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 Y D G R H ++V + D EA+ G V +G Sbjct: 1144 YLLAPDAPGRRPH-------VPGVMVLLVD-----EPLRSDIFSPIREAQAAGLKVMMLG 1191 Query: 327 IRVIRSHEFLRACASPN---SFYLVENPHSMYDA 357 + + R + SF+ V++ S+ A Sbjct: 1192 LAGADPEQLRRLVPGMDPVQSFFAVDDGPSLDRA 1225 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 65/463 (14%), Positives = 143/463 (30%), Gaps = 110/463 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F G +TIL + ++ V G+ ++ ++ L ++ DR+++ AA Sbjct: 28 LTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA----- 82 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + K K+ N + D+G + S +D + + Sbjct: 83 --DLDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKDMDTQFIGRFGFP 140 Query: 128 SISAIS-------RYKIPLKFCTFIPWYTNSRH--------IVMPITSSVKVNSQTDARL 172 ++ + K+ + I + + + + + +K ++ + Sbjct: 141 TLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQDAADIFLDTVLKDENEDLISV 200 Query: 173 DMM----------IVLDVSRSME-----------SFFDSSITKIDMAIKSINAMLEEVKL 211 ++ +++D M F + I M + ++ + Sbjct: 201 SLVPYSEQVNAGPLIMDR---MNVNRKHDYSHCIDFDNGDFDSIAMNSSTRYNQMQHFQW 257 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA----- 266 D N + V E L+ +I L T+ G+K+A Sbjct: 258 NYDGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPR-AGTSIFLGMKWAAAMLD 316 Query: 267 --YNQIFDMQGMRQHCNTE---------DANYKKIIVFMTDG--------ENLSTKEDQQ 307 + I + H + E D+ K ++ MTDG N ED + Sbjct: 317 PAFRDINNSLVNAGHVDREFYNRPASYTDSETLKTVILMTDGANDNSFRISNTYYNEDSE 376 Query: 308 SLYY--------------------------------------CNEAKKRGAIVYAIGIRV 329 +++ C+ AK + ++++IG V Sbjct: 377 YVHWNRYNLWWYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIVIWSIGFEV 436 Query: 330 IRSH-EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++ CAS S + + +AF I + I R+ Sbjct: 437 DDEDVPAMQDCASSPSHFFRVEGVELSEAFRAIARQINQLRLT 479 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 81/210 (38%), Gaps = 32/210 (15%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + + ++ K + GLV F Sbjct: 107 DVMVVVDLSGSMAEPDFTSRTGEKISRLDAAKEVLTEFVQSRKGD-------RLGLVLFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + ++ L++ G ST+ + A ++F+ + Sbjct: 160 ---DAAFVQTPFTVDQKVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSDKSRGALE 215 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +D N +K+ + +TDG T + + AK +G V+ I + L Sbjct: 216 QDQNREKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 77/219 (35%), Gaps = 27/219 (12%) Query: 154 IVMPITSSVKVNSQTDARL----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 I + + + + D L D++ +LD S S+ S D+ +++ Sbjct: 1075 IAITTPAPTQAPACNDFPLFNGTDLVFLLDGSGSVGSN------NFDLVKTFTKNVVQNF 1128 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAY 267 + + +V +S++ F L + + I +S T + + + Sbjct: 1129 DISETA---TRVAVVQYSDQFSTEFSLNAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVM 1185 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +F + E Y ++V +TDG + + A+ +G +YA+G+ Sbjct: 1186 QSVFTS------ISGERDGYPDLLVVVTDGLSTDDVSG-----PADTARAQGVTIYAVGV 1234 Query: 328 RV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 I + + + V + S+ + +DI Sbjct: 1235 GSDIDFNTLEQIAGLTSRVSQVSDFSSLVTLSQTLSQDI 1273 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 73/208 (35%), Gaps = 24/208 (11%) Query: 154 IVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I P + V + + LD++ +LD S S+ D+ + ++ E ++ Sbjct: 1616 ITTPSPTRAPVCTDLSFGGLDLVFLLDGSGSVT------AVNFDLVKDFASGVVSEFQIS 1669 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G+V FS+ + F + + + + I + +T + + +A Sbjct: 1670 TTE---TRVGVVQFSDTLRTEFFMSSFSTKQQVLQAISDIDYIQGNTLTGAAITFATASS 1726 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F AN+ ++ +TDG + A+ +G ++A+G+ Sbjct: 1727 FST------PAGNRANFPDFMIVVTDG-----LSQDSVVQPAQSARDQGITIFAVGVGNE 1775 Query: 331 -RSHEFLRACASPNSFYLVENPHSMYDA 357 L+ P V + + A Sbjct: 1776 VDFATLLQITGVPEYILQVTDFSDLLAA 1803 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 82/235 (34%), Gaps = 29/235 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + F T + + LD++ +LD S S+ + ++ + Sbjct: 1347 QAVAFTSSCTCPPMYEGDVCQRYSPCYNRNIELDLVFLLDGSGSVTT---ANFDIVKEFT 1403 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSH-LQRKIKYLSKFGVST 257 + + + + L + G+V +S + EF L + + + I+ + T Sbjct: 1404 RRLANNFD-ISLAD-----TRVGVVQYSDSPTLEFNLNSFNTNELVDLAIRNIQYQQGGT 1457 Query: 258 NSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 N+ + + N G R + +++ +TDG++ + A+ Sbjct: 1458 NTGQAIDFVRVNSFSANNGDRS-------DVPNVMIVVTDGQSSD-----DVVGPAQTAR 1505 Query: 317 KRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 G +YA+GI ++E L+ + + +++ G+++ Sbjct: 1506 NAGISMYAVGIGNGVDTNELLQIAGQVDRVVQSADFSTLFSV----GENLQELVC 1556 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 60/208 (28%), Gaps = 24/208 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ +LD S S+ + M + P+ + G+V +SN Sbjct: 295 PEFDLIFLLDESGSIG------TDNFKLVKSFTERMANNFDISPNS---TRVGVVQYSNF 345 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L + I + G ST + + + N F N + + Sbjct: 346 PGTEFSLNAFTDKAAVLDAISKIDYNGGSTFTGAAIDFVRNNEFTS------VNGDRDDV 399 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASPNSF 345 I++ +TDG A G YA+GI L P Sbjct: 400 PNILIVITDGNPNDDVSG-----PAISANNAGITTYAVGIGSNVDQANLVQMTAGRPGRV 454 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + +++ D Sbjct: 455 LQAADFTDLTTVVGTLQENVCDAVYCGD 482 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 67/207 (32%), Gaps = 23/207 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D++ +LD S S+ S + ++ + NV GLV +S+ Sbjct: 2247 TPVDLVFLLDGSSSITS------PNFQIVKDFTADVVRTFNVSSAATNV---GLVQYSDT 2297 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 I F L S + I + +T + + + F + N Sbjct: 2298 IRTEFFLNSFDTKSGVLNAIGNIGYLQGNTRTGAAIDFVRISSFSV------PAGNRGNQ 2351 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFY 346 ++ +TDG + A+ G ++A+GI I L SPN Sbjct: 2352 PDYLIVVTDG-----LSQDDVVVPAQTARNDGISIFAVGIGSEIDFATLLNIAGSPNRIL 2406 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + +A + + D Sbjct: 2407 QINDFAGLANAQEQLTDIVCNLTCCGD 2433 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 65/197 (32%), Gaps = 23/197 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D++ +LD S S+ S + ++ + NV GLV +S+ Sbjct: 2506 TPVDLVFLLDGSSSITS------PNFQIVKDFTADVVRTFNVSSAATNV---GLVQYSDT 2556 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 I F L S + I + +T + + + F + N Sbjct: 2557 IRTEFFLNSFDTKSEVLNAIGNIGYLQGNTRTGAAIDFVRISSFSV------PAGNRGNQ 2610 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFY 346 ++ +TDG + L A+ G ++A+GI I L SPN Sbjct: 2611 PDYLIVVTDG-----LSQDEVLGPAQTARFEGINIFAVGIGNEIDFTTLLHIAGSPNRVL 2665 Query: 347 LVENPHSMYDAFSHIGK 363 + + + A + Sbjct: 2666 QINDFAGLASATGQLTD 2682 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 63/205 (30%), Gaps = 34/205 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM+ +LD S S+ ++ + ++ + + + GLV +S+ Sbjct: 1891 TPVDMVFLLDGSGSVT------QPNFELVKQFTQNVVVNFNI---SSATTRVGLVQYSDT 1941 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I F L S +T + + + F + Sbjct: 1942 IRTEFFLNSHPSR-------------NTLTGAAIDFVRTSSFSI------PAGNRLTLPD 1982 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLV 348 ++V +TDG A+ G +YA+GI L + + Sbjct: 1983 VLVVVTDG-----LSQDDVAGPAQIARDNGIAIYAVGIGSEVDFATLLDIAGLQSRVLQI 2037 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYD 373 + S+ DA + + + D Sbjct: 2038 NDFSSLLDAEEQLTEIVCNISYCGD 2062 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 69/210 (32%), Gaps = 28/210 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D++ +LD S S+ + ++ + + + + +S+ Sbjct: 12 GGVDLVFLLDGSASVG------ASNFELVKDF---TQQTTAKFDISDGSTRVAVAQYSST 62 Query: 230 IEEFFLLEWGV-----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S+ +I Y++ G ST + +++ F N Sbjct: 63 PQVEFNLNTNSDVDTLSNAIEQITYMN--GDSTFTGFAIEFVRQSAFSS------FNGAR 114 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-ACASPN 343 + I+V +TDG+ A+++G ++A+G+ L+ + Sbjct: 115 DDKPDIMVVVTDGQ-----SADSVTSSAATAREQGVTMFAVGVGTGVGLSELQDIAGYTD 169 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + + I + D Sbjct: 170 RVLQLNDFVQLAQSADTIQSTLCGLAYCGD 199 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 61/158 (38%), Gaps = 22/158 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 D++ +LD S S+ + ++ + P + GL+ +S+ I+ Sbjct: 555 DLVFLLDGSGSIG------TDNFQLVKAFTKEVIRNFAISPTA---TRVGLLQYSDTIDN 605 Query: 232 EFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EFF+ E+ L + + T + ++ F Q + NY I Sbjct: 606 EFFMNEFNTRDELYTAVDNVVYKTGGTFTGFAVE------FTRQIAFRTSAGTRDNYPDI 659 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ +TDG ED + A +G ++YA+G+ Sbjct: 660 LIVVTDG----NSEDVVTSAVA-SAIDQGILIYAVGVG 692 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 64/203 (31%), Gaps = 16/203 (7%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D S S+ ++ + + + + G V F++ Sbjct: 837 FDLVFLVDKSSSVGP------ANFELVKEFMYDFTNTFSVGLSD---TRIGAVQFADAQT 887 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + F ++ + ++ + + + N G+ F Q N + + ++ Sbjct: 888 KDFDMDTFATK-EQTLAGIQNIVYTDNQVGGVATGAAIDFVRQNSYTRGNGDRTSVPDLL 946 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVEN 350 V +T S+ A+K G +Y +G+ S E S + + Sbjct: 947 VVVT-----SSASTDDVASAQETAEKEGITIYTVGVTNSVSFAELTSTAGSFSRVLRAND 1001 Query: 351 PHSMYDAFSHIGKDIVTKRIWYD 373 + + + I D Sbjct: 1002 FSDLSAIRQPLHETICQAAFCGD 1024 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 14/107 (13%), Positives = 33/107 (30%), Gaps = 12/107 (11%) Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 +T + + + F A ++ +TDG Sbjct: 2121 PAERNTLTGAAIDFVRTSTFST------PAGNRAGQPDFLIVVTDG-----LSQDNVAVP 2169 Query: 312 CNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDA 357 A+ G ++A+GI + L+ +P+ + + + +A Sbjct: 2170 AQTARNNGISIFAVGIGSEVDADTLLQIAGTPSRTLQINDFAGLVNA 2216 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 90/244 (36%), Gaps = 40/244 (16%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +++ + A +++V+D+S SM D T++D A ++ ++E++ Sbjct: 90 TVALSGPTTLAKVPKNRA--TVVLVIDISLSMMCD-DVRPTRVDAARQAAIKFVDEMEP- 145 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQ 269 +Q GLVTF+ + ++R + K T + G+ A Q Sbjct: 146 -----TLQLGLVTFAGTAQTLIAPSSDHEVVKRALDDAIRPDKLAARTATGEGIYTALQQ 200 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 I ++G+ + IV +DG+ + + EAK + +Y+I Sbjct: 201 IETLKGILGGASKAPPAR---IVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSIS 257 Query: 327 IR----------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFS----HIGKD 364 V L+ A S F+ + + D +S IG D Sbjct: 258 FGTASPIPYVNIQGSRVPVPADDASLQKVAELSGGKFFTASSLDQLTDVYSSLNAEIGYD 317 Query: 365 IVTK 368 +V + Sbjct: 318 LVKQ 321 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 50/208 (24%), Positives = 81/208 (38%), Gaps = 30/208 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + H + K +++D++ VLD S SM S D + +I+ K I Sbjct: 746 KTEHFSTFLLGD-KNMPVDLSKVDIVFVLDNSGSMSSN-DPNYYRIEATKKFIQN----- 798 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 I ++NN + GLV F + + L S L + + + G STN GLK A Sbjct: 799 --IDELNN--RVGLVDFDSSVSVRSNLTSDKSKLLQALNAMRWTGGSTNIGGGLKAAL-G 853 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +FD + KKIIV ++DG N + K+ +V I + Sbjct: 854 LFDQEQS-----------KKIIVLLSDGYHNTGIHPNDVLPELI----KQEIVVNTIALG 898 Query: 329 VIRSHEFLRACA--SPNSFYLVENPHSM 354 E L A + ++ V+N + Sbjct: 899 KDCDRELLHDIADKTKGGYFYVDNTGGL 926 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 71/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVQPNRMAAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +R + L +F T + G+ A I + + DA IV Sbjct: 152 VSPTTNRDSTKRALDKL-QFADRTATGEGIFTALQAIATVGAV---IGGGDAPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E L Sbjct: 208 FSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETL 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + ++ + + I Sbjct: 268 KKVAQLSGGNAYNAATLQELKSVYATLQQQI 298 >gi|327262912|ref|XP_003216267.1| PREDICTED: integrin alpha-1-like [Anolis carolinensis] Length = 1166 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 107/297 (36%), Gaps = 36/297 (12%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + L N + + ++ + +L +V +G ++ Y + ++ Sbjct: 139 VKLNLPANTSIPQVAEVKENMTLGSTLVTNPKGGFLACGPLYAYKCGRLHYTTGICSNVS 198 Query: 154 IVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +S+ + Q + +LD++IVLD S S+ + + + +N++L+ + + Sbjct: 199 SNFEVVNSIAPSVQECNTQLDIVIVLDGSNSIYPW--------ESVTEFLNSLLQNMNIG 250 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQ 269 P Q G+V + + F L + + + + G TN+ G+ A + Sbjct: 251 P---QQTQVGIVQYGENVTHEFNLNTYTTVEEVLVAAKKIGQRGGTRTNTALGIDTARKE 307 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-R 328 F +K++V +TDGE+ + + C ++I I Sbjct: 308 AFT------EARGARRGVQKVMVVVTDGESHDNYRLGEVIQDCE---DENIQRFSIAILG 358 Query: 329 VIRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 359 HYNRGNLSTEKLVEEIKSIASEPTEKHFFNVSDELALLTIVEALGERIFALEATKDQ 415 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 82/218 (37%), Gaps = 19/218 (8%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DV 215 T++V+ + D +D+ VLD S S+ +D ++ ++ D Sbjct: 23 TAAVRTVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKRFIDNLRDRYYRCDR 82 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N V +G + +S+++E L G L+ + + FG T + +K ++ Sbjct: 83 NLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLV 142 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI 330 H K ++ +TDG KE L NEAK G V+++ I Sbjct: 143 GG---SHLKEN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPD 194 Query: 331 RSHEFLRACASPNSF---YLVENPHSMYDAFSHIGKDI 365 L A+ +++ + + DA I + I Sbjct: 195 HLEPRLSIIATDHTYRRNFTAADWGQSRDAEEIISQTI 232 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 78/228 (34%), Gaps = 44/228 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNNVVQSGL 223 +D++ VLD S S+ + ++A I +++ + +L+ +G+ Sbjct: 609 CKCGPIDILFVLDSSESIG------LQNFEIAKDFIVKVIDRLSRDELVKFEAGQSHAGV 662 Query: 224 VTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + L+ IK L T + L+Y +++ Sbjct: 663 VQYSHNQMQEHVGLSDPNIRNTQELKEAIKKLQWMAGGTFTGEALQYTRSRLL------- 715 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHE 334 +I + +TDG + + ++ C V ++G I + Sbjct: 716 ----PPTQNTRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIKDVFGSIAGSD 767 Query: 335 FLR--AC------ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L +C P + EN + + K+I T+ I DK Sbjct: 768 QLNVISCQGLAPQGRPGISLVKENYAELLE--DAFLKNITTQ-ICIDK 812 Score = 42.8 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 48/154 (31%), Gaps = 18/154 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + ITK + + K + + V+ ++ +S Sbjct: 825 SPADITILLDGSASVGSH-NFDITKRFAKRLAERFLTARRK---NPAHEVRVSVMQYSGT 880 Query: 230 IEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L I + +T+ T L Y R + Sbjct: 881 GQQQPERASLQFLQNYTVLASTIDSMDFINDATDVTDALSYV---------TRFYREASS 931 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + ++ +DG + + Sbjct: 932 EEVDRRLLLFSDGNSQGATAAAIKKAVQEAQRAN 965 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 47/214 (21%), Positives = 86/214 (40%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 Q L M++ +D S S+ + F T+ D A + + ML E + L+ Sbjct: 70 ERQATTPLSMVMAIDTSESVITQF---QTERDAAKRFVKQMLREQDEMD---------LI 117 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +FS+ ++E + I L K G +T+ + A ++ + D Sbjct: 118 SFSDTVDEIVPFTNDAGRMNAGIGNLHK-GDATSLYDAIYLASQRLTE--------AKRD 168 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------SHEFL 336 A +KI+V +TDG N + + A++ GA +Y I + I L Sbjct: 169 ATRRKILVIVTDGGNTTK--GMRYQQAVEAAERAGAAIYPIIMVPIEADAGRNTGGEHAL 226 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + ++ V + H + AF+H+ D+ T+ Sbjct: 227 IQMAQDTGGKYFYVLDKHDLDKAFAHLSDDLRTQ 260 >gi|156371145|ref|XP_001628626.1| predicted protein [Nematostella vectensis] gi|156215607|gb|EDO36563.1| predicted protein [Nematostella vectensis] Length = 484 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 72/205 (35%), Gaps = 19/205 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +D S SM A + + A++ K+ +V G++ FS + + Sbjct: 271 DLAFAIDASGSMGDQGFLR------AKRFVKALIGSFKVSQKGTHV---GIIRFSTRAKV 321 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F +H + I + T + L+ A ++F +Q + K+ Sbjct: 322 MFTFTEHFTHEDVNYAIDDIEYTEGGTKTELALRLARTELFS----KQGGSRTSPLIFKL 377 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-FYLVE 349 V MTDG + + K+ G V A+GI + L AS S V Sbjct: 378 FVLMTDGRSEYFHAVARQAKM---LKRSGVHVMAVGIGKYTNQRELEVIASSKSDVIGVV 434 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + + + I + + +K Sbjct: 435 SFRDLMIRMNEIKDKLCEIALLNEK 459 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 82/234 (35%), Gaps = 48/234 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T + ++T +D+++ +DVS SM + D ++ + + + P+ Sbjct: 77 PQTQDISTRTKTTKGIDIVMAIDVSSSMLA-RDLRPNRLSALKEVAAEFI---RKRPND- 131 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQ 274 + GLV ++ + + S + ++ ++ + T GL + N++ Sbjct: 132 ---RIGLVAYAGESYTKTPITSDKSIVLGALREITYGQLNDGTAIGMGLATSVNRL---- 184 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------ 328 E K+I+ +TDG N + + Q+ + A + G Y IG+ Sbjct: 185 -------KESTAISKVIILLTDGVNNAGFIEPQTA--ADLALEYGIKTYTIGLGTNGNAL 235 Query: 329 -----VIRS------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 E L A A+ ++ + + + I K Sbjct: 236 SPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEYFRATDNEKLEAIYEEINK 289 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 73.3 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 93/284 (32%), Gaps = 51/284 (17%) Query: 110 IVRSTSLDIVVVPQNEGYSISAI--SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + + +G+ +++ ++K L I I P T V ++ Sbjct: 28 FKNKQQTAELKISSLKGFKVTSSWLPKFKHFLFALRLIALALIITAIARPRTVDVSTKTK 87 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +D+++ +DVS SM + D +++ + ++ + GLV ++ Sbjct: 88 TTRGIDIVMAIDVSASMLAK-DLKPNRLEALKNVASEFIK-------GRPNDRIGLVEYA 139 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + S + R ++ + T GL A N++ D + Sbjct: 140 GESYTKTPITSDKSIVLRSLQEIRYNNIIEGGTAIGMGLATAVNRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-----------IRSH 333 K+I+ +TDG N S + + A + G Y IG+ Sbjct: 193 ----KVIILLTDGVNNSGSINPKIAS--ELAVEFGIKTYTIGLGTNGMALSPIAIKQNGQ 246 Query: 334 ------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I K Sbjct: 247 FQYGRVKVEIDETLLKEIAQVTGGKYFRATNNKKLAEIYDEINK 290 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 55/391 (14%), Positives = 126/391 (32%), Gaps = 63/391 (16%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +G M +L A L ++ + I+VS++ +++ L + D + + Sbjct: 16 RRGAMLVLIAFLLVVVVCMAAFAIDVSYMQLVRSELRAATDAAAKAGTLALAK------- 68 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 GD + RN++ V D + + N +S +A Sbjct: 69 ---TDGDAASARTAAIQAAARNKVAGRALVLTTDQV----QVGRSAAQANGTWSFTANQT 121 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITS--------SVKVNSQTDARLDMMIVLDVSRSMES 186 +K + + T + + + + + + + + ++ +V+D S SM Sbjct: 122 PYTSVKILSSMSDSTAAGSVPLFLGTFMGRGSFQPAQSATASQMEQEICLVIDRSHSMCF 181 Query: 187 FFDS----------------------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++++ S+N + I + NN + L+ Sbjct: 182 NMSGVEWSYPPGTKTTPHTICYPPHATLSRWAALQSSVNLFM---DTILETNNTPRVALI 238 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIK---------YLSKFGVSTNSTPGLKYAYNQIFDMQG 275 T+ + I + + K ++ +T + N + Sbjct: 239 TWGSTIGTNTAEYSYTKKTEVAVANELGLSTDYAAVKSKIAARTTKVMLGGTNMSAGIDA 298 Query: 276 MRQHCNTE--DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 R N A KK ++ MTDG+ ++ + +A G ++ I + Sbjct: 299 GRTLLNGNTVRALAKKTMILMTDGQWNQGRDPIDA---AEDAADEGIQIHTITFLSGSAQ 355 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 +R A + +Y+ N + +AF + Sbjct: 356 NTMRQVAEITGGKYYVSSNQAELEEAFRDLA 386 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 24/165 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + M+ VLD S SM S +T++D+A ++++A + G+V Sbjct: 743 PHEGPGIAMVFVLDRSGSM-SQTVGDVTRLDVAKQAVSAAANLLDPQTGS-----LGVVM 796 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ E L + + + +L G TN PGL+ A+ + +H Sbjct: 797 FGSEAEVALPLGPLPDAAGIAAALGHLQP-GGGTNIYPGLQLAFQALRASDADARH---- 851 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 IV MTDG +S + D L + G V ++ I Sbjct: 852 -------IVVMTDG--MSDEADFPGLLAA--IRAEGITVSSVAIG 885 >gi|260797293|ref|XP_002593638.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] gi|229278864|gb|EEN49649.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] Length = 388 Score = 73.3 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 73/188 (38%), Gaps = 22/188 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 4 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KAISGFDISPSGTQVGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + V+ L I + T + ++Y F + Sbjct: 55 RTRQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K+++ +TDG EA+++G VYAIG+ + + + ++ N+ Sbjct: 109 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIASNNNTLA 163 Query: 347 LVENPHSM 354 V+N + + Sbjct: 164 FVDNFNLL 171 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 235 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KAISGFDISPSGTQVGVIQYST 285 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + V+ L I + T + ++Y F + Sbjct: 286 RTRQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 339 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG EA+++G VYAIG+ + + Sbjct: 340 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQL 383 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 71/201 (35%), Gaps = 30/201 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ +D+S SM+ ++ I+++ A+ + + + GL+ F + Sbjct: 92 DLMLAIDLSGSMQIEDMQVGARLVSRIEAVKAIASDFTSQRVGD---RVGLILFGTRAYV 148 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + + R I+ L G T L A ++ E ++ Sbjct: 149 QAPLTFDTATVTRFIREAQLGFAGEDTAIGDALGLAIKRL-----------RERPAESRV 197 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------HEFLRAC- 339 ++ +TDG+ T + A + G VY IGI L A Sbjct: 198 LILLTDGQ--DTASTVDPMEATALAAESGIKVYTIGISRRIGARAGGSGEVDEALLNAIA 255 Query: 340 -ASPNSFYLVENPHSMYDAFS 359 A+ ++ NP + + Sbjct: 256 EATGGEYFRARNPAELQSIYG 276 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 81/226 (35%), Gaps = 42/226 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSI-----------TKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 LD++++LD+S SM+ S T++D ++ + + Sbjct: 85 LDIVLLLDLSSSMQEEMGSGQSLKTGTTAAGRTRMDAVKDAVRTFVR-------GRRDDR 137 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGM 276 GLV FS+ L + +L + ++ T GL A + Sbjct: 138 IGLVVFSDNAYVISPLTFDHQYLLDYLGFVDGEILLGEGQTAIGDGLALA-------SAV 190 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG------IRVI 330 DA +++V TDGE+ ++ + EAK G V+ IG ++ Sbjct: 191 LARQAGRDARGHQVVVLFTDGESNRGRD---PIEVVGEAKSAGIRVHVIGVDLDAEVKTR 247 Query: 331 RSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + LR A+ ++ ++ + A I D + K + + Sbjct: 248 PGVQLLRRGVVAAGGRYFAADSERDLLTASRTI--DAMEKGVLVSR 291 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 73/176 (41%), Gaps = 20/176 (11%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 T S +DA +D+ +LD S S+ + K++ L+ + D+ Sbjct: 7 TETTGKPSKESCSDAIVDVGFILDSSGSLRRDY-----------KNLKEFLKTIASFFDI 55 Query: 216 N-NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N Q+G++TFS++ E L ++ + + G +T L+++ N +F Sbjct: 56 KINGSQAGVITFSHRSEHSIKLNDFSDGDSFEKAVDKIPLMGSTTRIDKALRHSKNVMFT 115 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 Q N K+++ +TDG + + + +E + G ++ AIGI Sbjct: 116 NQ------NGGRLEATKLLILLTDGSQTFSAKQEDPSIIADEIRNDGVLIIAIGIG 165 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 79/196 (40%), Gaps = 21/196 (10%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-N 217 + S +DA +D+ +LD S S+ + K++ L+ + D+ N Sbjct: 175 SGKPSKESCSDAIVDVGFILDSSGSLRRDY-----------KNVKEFLKTIASFFDIKIN 223 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 Q+G++TFS++ E L ++ + + G +T L+++ N +F Q Sbjct: 224 GSQAGVITFSHRSEHSIKLNDFSDVDSFEKAVDKIPLMGSTTRIDKALRHSKNVMFTNQ- 282 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-E 334 N K+++ +TDG + + + +E + G ++ AIGI + E Sbjct: 283 -----NGGRLEATKLLILLTDGSQTFSAKQEDPSIIADEIRNDGVLIIAIGIGEGINKTE 337 Query: 335 FLRACASPNSFYLVEN 350 R + Y + Sbjct: 338 LNRIAGKDENTYNADT 353 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 72/201 (35%), Gaps = 29/201 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q +D+++ LD S SM A + ++ + + GL+TF Sbjct: 92 QNPDPIDVVLALDTSASMND------DAFTAAQDAAYGLINGLSPED------KVGLITF 139 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + +Q I+ LS+ V T GL A ++ Q Sbjct: 140 DKTARVIEPLAQDHARVQESIQKLSRS-VGTALYQGLSLAAQEVAKGQNT---------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 K IV MTDG N S + +A++ GA V+ +G + L+ A + Sbjct: 189 --KAIVLMTDGFNTSRNTTLE--EAVAKAQEVGASVFTVGFGKKVDTQGLQKIANETGGE 244 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 ++ + F+ I + + Sbjct: 245 YFSAPTNAQLRRVFADISQKL 265 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 73/220 (33%), Gaps = 49/220 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +I A + + + + + GLV F+ + Sbjct: 88 IDIMLAIDVSGSMLA-RDFRPDRITAAKEVAGSFIAD-------RYGDRIGLVAFAGEAF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L LQ + + T GL A N++ + + K Sbjct: 140 TQSPLTTDQGTLQTLLARIRSGLIEDGTAIGNGLATAINRLRESEAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--------------RVIRSHEF 335 +I+ +TDG N + + AK +G VY IG+ Sbjct: 189 VIILLTDGVNN--RGEIAPQTAAEIAKAQGIRVYTIGVGTEGMAPYPAVDIYGTPTGGTV 246 Query: 336 ----------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR+ A + ++ + + + I + Sbjct: 247 MAKVEIDEKTLRSIAEQTGGQYFRATDKAKLKAIYDQINQ 286 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKSNGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEF--- 335 K ++ +TDGEN Q L AK+ +Y IG+ + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLINT 252 Query: 336 --------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIAEMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 72.9 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D +++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPSRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E + Sbjct: 208 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEIDDQRQPVPVDDETM 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + +S + + I Sbjct: 268 KKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + +++ A ++ +++ + GL+ ++ Sbjct: 91 VMLVIDVSQSMRA-TDVAPSRLVAAQEAAKQFADQLTPGIN------LGLIAYAGTATVL 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L + T + G+ A I + + D IV Sbjct: 144 VQPTTNREATKNGLDKL-QLADRTATGEGIFTALQAIATVGAV---IGGGDEPPPARIVL 199 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFL 336 M+DG+ + AK +G + + + E L Sbjct: 200 MSDGKETVPSNPDNPKGAFTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEML 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A S + + + F+ + + I Sbjct: 260 EKIAQLSGGDAFTASSLEQLKAVFTSLQQQI 290 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 71/197 (36%), Gaps = 16/197 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ T + ++++ D N V +G + Sbjct: 61 QDCPVDVFFVLDTSESVALRVKPFKTLVTQVKDFTKKFIDKLTSRYYRCDRNLVWNAGAL 120 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +S+++ L + L+ ++ + G T++ +K ++ ++ Sbjct: 121 HYSDEVIMISSLTRDMKTLRDDVETVEYIGKGTHTDCAIKRGIEEVLIGGSHQKEN---- 176 Query: 285 ANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K ++ +TDG L + NEAK G V+++ I L AS Sbjct: 177 ----KYLIVVTDGHPLEGYKEPCGGLEDAANEAKHLGIKVFSVAISPNHLEPRLSVIASD 232 Query: 343 NSF---YLVENPHSMYD 356 S + + + D Sbjct: 233 ASHRRNFTATSAAGLTD 249 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 61/179 (34%), Gaps = 24/179 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T LD++ VLD S S+ + ++ I +++ + V G+ Sbjct: 612 CTCGPLDLLFVLDSSESIGH------SNFQISKDFIIKVIDRLSRDEHVKFDAENSHVGV 665 Query: 224 VTFSN-KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ K +E + + L+ +K L T + L + + + + Sbjct: 666 VQYSHGKTQEVVAMGDASIQSIGQLKEAVKNLKWIAGGTWTGEALAFTKDNLLKRFKLE- 724 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 KKI + +TDG + ++ C V I S + ++ Sbjct: 725 ---------KKIALVLTDGHSDILRDTTPLNTLCEVTPVVSVGVGDIFQNAPNSDQLVQ 774 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 78/216 (36%), Gaps = 34/216 (15%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS- 227 D+ +++D S + F +S + + + + +K + + +V +S Sbjct: 826 GPADITLLVDSSTRVGNQHFQTSKSFVKLLAQ------RFLKAKAPPSGSARVSVVQYSG 879 Query: 228 -NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 N+ + + L+ + + +TN L A +++ ++ Sbjct: 880 LNQQKVEAQFVSNYTVLEVPVDNMQFINGATNVVSAL-RAVTELY-------REDSLAGV 931 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACASP 342 KK++VF +DG +T++++ L +A+ G +Y + + L A+ Sbjct: 932 SKKLLVF-SDG---NTQDEKDMLKAVQDARAAGIEIYVLAVGSRLNYPNLQVMLTGSAAD 987 Query: 343 -------NSFYLVENPHSMYD--AFSHIGKDIVTKR 369 + V + S+ + I + I K Sbjct: 988 ITAPFPEERLFRVPDYPSLLQGVRYQSISRRISLKS 1023 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGVINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 843 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 893 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNL 951 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 952 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 983 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 83/209 (39%), Gaps = 22/209 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSSESSKIVL---FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSFY 346 + + MTDG + D QS+ +A+ G IG+ + + LR +S Sbjct: 155 VALLMTDGIDHPRNPDVQSIS--EDARISGISFITIGLSTVVNEAKLRLISGDSSSEPTL 212 Query: 347 LVENP---HSMYDAFSHIGKDIVTKRIWY 372 L+ +P + D ++ + ++I Sbjct: 213 LLSDPTLVDKIQDRLDNLFEKKCERKICE 241 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 59/206 (28%), Gaps = 61/206 (29%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQG--MRQ 278 F + L + L+ I ++ G STN G + + N +F Sbjct: 363 FGCTTQPLQRLTNNTNALKTLINNMAPSG-STNIHEGFMWGWRTLSPNSVFADGQPYASS 421 Query: 279 HCNTEDANYKKIIVFMTDGEN-LSTKE--------------------------------- 304 ++ N KII+ MTDG N T Sbjct: 422 ANSSNATNINKIIILMTDGTNSWGTNSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNT 481 Query: 305 ------------DQQSLYYCNEAKKRGAIVYAIGIRVI------RSHEFLRACAS-PNSF 345 D + C K +Y IG V LR CAS P+ F Sbjct: 482 NIADGNTARKALDALTAEACANTKAVNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQF 541 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIW 371 YL + + AF I I R+ Sbjct: 542 YLANSSDDLIKAFKSIQASIGALRLT 567 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/213 (11%), Positives = 65/213 (30%), Gaps = 32/213 (15%) Query: 18 GMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKL 77 + +L A + ++ G I+ ++T L + D + + ++ + + Sbjct: 26 NVALLFAFLSVPMVMIGGAAIDYGFATRLETKLQTATDATAL-----LLCQTPLTTSEAE 80 Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI 137 + N V+ + + + Q+ Sbjct: 81 LNTLAQTTMTGAMGA-------ANLVVDRLAITSSPRKITLTAHKQST------------ 121 Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 TF T ++ I S + + ++ +VLD + SM + +K+ Sbjct: 122 -----TFFGGLTGTQRINPGAVS--QCATPLPKTFEIALVLDNTGSM-AASSGGQSKLRA 173 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + V P ++ + +V F+ + Sbjct: 174 VQTAATDFVNYVYTSPAFSSATKVAIVPFAAAV 206 >gi|297300028|ref|XP_002805526.1| PREDICTED: collagen alpha-1(XIV) chain-like [Macaca mulatta] Length = 1717 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 49/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P ++ P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTQPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DDNFNKIISFLYSTVGALNKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDSLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDR 343 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 46/338 (13%), Positives = 111/338 (32%), Gaps = 53/338 (15%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G++ K++ ++ + NG + S + + V V + +++ Sbjct: 509 GSDAKRITESSKY-KVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVGEVPRITVT 567 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFD 189 P + + + +T + +SQT + D+++V D S SM + Sbjct: 568 NTVV-TAPRYRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG 626 Query: 190 SSITKIDMAIKSINAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + +++++A ++N+M + + + ++ ++ LV FS + + + Sbjct: 627 HN-SRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAV 685 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L G + N ++ + KK IVFM+DG+ + Sbjct: 686 NGLRADGGT-----------NWEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVR 734 Query: 308 S------------------------------------LYYCNEAKKRGAIVYAIGIRVIR 331 + ++ A ++++G+ Sbjct: 735 TGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDP 794 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + A + S+Y + + AF+ I I K Sbjct: 795 TKMRGFADQTKGSYYSATSTDELNKAFADIIGQINRKS 832 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 67/195 (34%), Gaps = 28/195 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +++DVS SM + + A ++ + + L GL+ F +K + Sbjct: 90 SVFLLIDVSYSM------DGSALAEAKQAAQEFVRKSDLAHTA-----IGLIEFGSKAKI 138 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L HL + I L G STN T GL AY ++ ++ + I+ Sbjct: 139 ISGLTQNAKHLYKAINRLKTNG-STNMTEGLTTAYLKLKNVDD------------PRFII 185 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG K + E G + IG + + + Sbjct: 186 LLTDGLPNHPK---NTQQIAQEICADGIELITIGTGDADKTYLQSLACYDQNSFFAK-AG 241 Query: 353 SMYDAFSHIGKDIVT 367 +M FS I + + Sbjct: 242 TMVSTFSRIAQVLTE 256 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 62/419 (14%), Positives = 136/419 (32%), Gaps = 60/419 (14%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT-QIMN 67 R F + G + ++ AI L I +G I+ + ++T L S D +++ A + +N Sbjct: 15 RRFGRDRSGNIAVIFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVSNSEIN 74 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEG 126 + + + + E+ +N D + + + + + Sbjct: 75 RTTAQAKADAEQFFNATIGAYGLTATIKIEVTENDGKRSATADFTSTVTTNFLNLIGYPT 134 Query: 127 YSISAISRYKIPLKFC-TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL-DVSRSM 184 +I S + F NS + + T++ + D+S S Sbjct: 135 LAIGNRSTSTVSRPIYQDFYLLLDNSPSMGVAATTADIATMVGNTSDKCAFACHDLSDSN 194 Query: 185 ESFFDSSI----TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 + + +ID+ +++ + L+ VNN + + ++ L + Sbjct: 195 NYYNLAKKLGVKMRIDVVRQAVQQLTSTATLMTAVNNQFRMAV--YTLGGSCASLGLTTI 252 Query: 241 SHLQRKIKYLSKFGVSTN--STPGLKYAYNQIFDMQGMRQHCNTE--------DANYKKI 290 + L + + + + S P Y +Q D NT A +K Sbjct: 253 ASLSSAMSSVQTAAGAIDLMSIPKQNYNNDQCTDFNSALAAMNTTIPSSGTGTAAQPQKW 312 Query: 291 IVFMTDG--ENLSTKEDQQ------------SLYYCNEAKKRGAIV---YAIGIRVIRSH 333 + F++DG + + Q ++ C K RG + Y + + + Sbjct: 313 LFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTVTQCKAMKDRGIQIAVLYTTYLALPTNQ 372 Query: 334 -----------------------EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 +++CASP+ ++ V + +A + K V K Sbjct: 373 WYNDHIAPFNAGPYGPSVNSQIAAKMKSCASPDFYFEVSPTQGISEAMDALFKKAVAKA 431 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 72/187 (38%), Gaps = 31/187 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 SRH V P TS+ ++ + T++ +++N +++ Sbjct: 205 GSRHYVTPKTSAADSDAAHTQFYS-----------RRRLTTQDTRMYALKQAVNGFIDQT 253 Query: 210 ----KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + D N + GLVT+++ + L +S L+ + L G T + G++ Sbjct: 254 IAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGA-TRADLGMQT 312 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGA 320 A + + A+ KI++F TDG+ + ++ K GA Sbjct: 313 ANTVLGNA----------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGA 362 Query: 321 IVYAIGI 327 VY++GI Sbjct: 363 SVYSVGI 369 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D +++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPSRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E + Sbjct: 208 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETM 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + +S + + I Sbjct: 268 KKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 79/204 (38%), Gaps = 28/204 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D++ +++ SRS+ + + + +N ++ + + P Q G+V +S I Sbjct: 2 QADILFLVEGSRSVSAL------EFEKMKTFLNNIVGQFDIGPTA---TQVGVVQYSWFI 52 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L S LQ+ I ++ G+ T++ L +A N N Sbjct: 53 RQECALNAHSSLASLQQAISNITVLGLGTHTGAALTFARNTALTA------ANGARPGVP 106 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRACASPNSFY 346 KI+V MTDG ED +L + G I +AI + + SP+ + Sbjct: 107 KIVVVMTDG----ASEDDVTL-PSQNLRNDGVITFAISVSWSLPNDRLLQDIAGSPDRIF 161 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 + +DA I + ++ Sbjct: 162 AATD----FDALDGIKVTLSSQLC 181 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 72/187 (38%), Gaps = 31/187 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 SRH V P TS+ ++ + T++ +++N +++ Sbjct: 205 GSRHYVTPKTSAADSDAAHTQFYS-----------RRRLTTQDTRMYALKQAVNGFIDQT 253 Query: 210 ----KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + D N + GLVT+++ + L +S L+ + L G T + G++ Sbjct: 254 IAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGA-TRADLGMQT 312 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGA 320 A + + A+ KI++F TDG+ + ++ K GA Sbjct: 313 ANTVLGNA----------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGA 362 Query: 321 IVYAIGI 327 VY++GI Sbjct: 363 SVYSVGI 369 >gi|157819015|ref|NP_001100328.1| procollagen, type VII, alpha 1 [Rattus norvegicus] gi|149018489|gb|EDL77130.1| procollagen, type VII, alpha 1 (predicted) [Rattus norvegicus] Length = 2588 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 73/197 (37%), Gaps = 23/197 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + ++ + V+ V +S+ + Sbjct: 39 DIVFLIDGSSSIG------RSNFREVRGFLEGLVLPFSGAANAQG-VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ R I+ LS G +T + L + +++F H K+ Sbjct: 92 EFGLDTLGSGGDTIRAIRELSYKGGNTRTGAALLHVSDRVF-----LPHLTRPG--IPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K + K++G ++A+GI+ E R + + + F+ V Sbjct: 145 CILITDG-----KSQDLVDTAAQKLKRQGVKLFAVGIKNADPEELKRIASQPTSDFFFFV 199 Query: 349 ENPHSMYDAFSHIGKDI 365 + + I + + Sbjct: 200 NDFSILRTLLPLISRRV 216 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 72.9 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 72/187 (38%), Gaps = 31/187 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 SRH V P TS+ ++ + T++ +++N +++ Sbjct: 191 GSRHYVTPKTSAADSDAAHTQFYS-----------RRRLTTQDTRMYALKQAVNGFIDQT 239 Query: 210 ----KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + D N + GLVT+++ + L +S L+ + L G T + G++ Sbjct: 240 IAANAKVSDPNKKNRIGLVTYASDVNTRSGLTDSLSGLKSTVDDLKASGA-TRADLGMQT 298 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGA 320 A + + A+ KI++F TDG+ + ++ K GA Sbjct: 299 ANTVLGNA----------RADASKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGA 348 Query: 321 IVYAIGI 327 VY++GI Sbjct: 349 SVYSVGI 355 >gi|326668779|ref|XP_002662551.2| PREDICTED: collagen alpha-1(VII) chain-like [Danio rerio] Length = 2698 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 76/191 (39%), Gaps = 23/191 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +V+P S + D++ ++D S S+ + + +++ Sbjct: 15 LVLPSLSKAQGQCNDVVAADVVFLVDGSSSIG------RANFMLVKSFMAGIVKPFAKAV 68 Query: 214 DVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N + + G V +S+ ++E F + L ++ ++ G +T + GLKY + F Sbjct: 69 GPNGI-RFGAVQYSDTARVEFTFTAYLNGTELITAVENINYKGGNTRTGAGLKYIADNFF 127 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +R + KI + +TDG K L + + G ++A+GI+ Sbjct: 128 SPASIR--------DVPKISILITDG-----KSQDNVLEPSQKLRGLGVKIFAVGIKSAD 174 Query: 332 SHEFLRACASP 342 E L+ ASP Sbjct: 175 PAE-LKLIASP 184 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 75/209 (35%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ + D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLNDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 92 HYSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAIKKGLEELLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K +V +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLVVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTI 232 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 60/170 (35%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 609 CKCGPIDILFVLDSSESIG------LQNFEIAKDFIVKVIDRLSKDELVKFEPGQSHAGV 662 Query: 224 VTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + L+ IK L G T + L+Y +++ Sbjct: 663 VQYSHNQMQEHVDLRDPNIRNAQDLKEAIKKLQWMGGGTFTGEALQYTRSRLL------- 715 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 716 ----PPTPNNRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIK 757 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + ITK K + + V+ +V +S Sbjct: 825 SPADITILLDGSASVGSH-NFDITK--RFAKRLAERFLTASRTDPGQD-VRVAVVQYSGT 880 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + F +T+ L Y R + Sbjct: 881 GQQRPERAALQFLQNYTVLANTVDSMDFFNDATDVMDALGYV---------TRFYREASS 931 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + EA++ G ++A+ + Sbjct: 932 NAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGVEIFAVVVGR 975 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 72.9 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 73/193 (37%), Gaps = 16/193 (8%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNN 217 S V + D +D+ VLD S S+ +D + ++++ D N Sbjct: 38 SSNVVTFQDCPVDLFFVLDTSESVALREKPFGALVDNIKQFTTQFIDKLNERYYRCDRNL 97 Query: 218 VVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +G + +S++++ L G S L+ ++ + G T + +K ++ Sbjct: 98 MWNAGALHYSDEVQLISGLTSMRTGRSGLKDQVSKVVSIGKGTYTDCAIKRGIEELLIGG 157 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + K ++ +TDG L + NEAK +G V+++ I Sbjct: 158 SHHKEN--------KYMIVVTDGHPLEGYKEPCGGLEDAANEAKHQGIKVFSVAISPNHL 209 Query: 333 HEFLRACASPNSF 345 L A+ ++ Sbjct: 210 ESRLSVIATDQAY 222 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 63/177 (35%), Gaps = 17/177 (9%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +V+D S S+ S + + TK +K + + + V+ +V +S + Sbjct: 828 GPTDITLVVDSSTSVGS-RNFNTTK--KFVKRLAE--RFLSAAKPTEDAVRVSVVQYSGR 882 Query: 230 IEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ E + + + + +T+ L Y + + Sbjct: 883 TQQKLEVPFEQNYTVIADSVDKMQFINDATDVNAALNYV---------TSLFRRSSRSGA 933 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + + EA++ G +Y + + + +R + + Sbjct: 934 KKRMLIFSDGNSQGITQSAIERAV-QEARQAGIEIYVLVVGTQANEPNVRVLVTGKT 989 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 54/151 (35%), Gaps = 18/151 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ VLD S S+ + + D IK I+ +L + + G+V + Sbjct: 619 CKCGPVDVLFVLDSSESIG--LQNFMIAKDFIIKVIDRLLRDEHVTFTPGESH-VGVVQY 675 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S+ + + L+ +K L T + L++ R + Sbjct: 676 SHDKTQELVALGDANIDNIGALKEAVKNLRWIAGGTYTGEALQF----------TRDNLA 725 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 + K++ + +TDG + ++ C Sbjct: 726 QRFTSEKRVAIVITDGRSDILRDQTPLSALC 756 >gi|260837103|ref|XP_002613545.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] gi|229298930|gb|EEN69554.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] Length = 184 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 59/173 (34%), Gaps = 22/173 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + ++ +D++ +LD S S+ + + + + ++ PD Sbjct: 32 VNGTICTEVICRMPVDLVFLLDGSGSIGD------SNFQVTKNFVATTTSDFQIGPDNAQ 85 Query: 218 VVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V +V + + E F L+ + L I + G T + + Y + Sbjct: 86 V---SVVQYESSPTEEFPLDRYATLEDLLSAINLIPYRGGGTRTGRAIDYVVTTTLTVS- 141 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K+I+ +TDG+ AK+ G I+ AIG+ Sbjct: 142 -----RGARQGVPKVIIVVTDGQ-----SGDDVREPARRAKQSGIIMVAIGVG 184 >gi|233142080|gb|ACQ91095.1| complement factor B-like protein [Ruditapes decussatus] Length = 697 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 50/228 (21%), Positives = 83/228 (36%), Gaps = 14/228 (6%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 T + S + LD+++++DVS S+ D S+ +K + + Sbjct: 178 TGSKTGAQSRLSPGKSGLDVVLLVDVSSSIG---DRSMESAKKFMKLLVDIFGVSNETSG 234 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 N + L+TFSN+ + F L G ++R+I + G TN L I Sbjct: 235 GKNGTRFALLTFSNEADIVFNLNDGTARSKEEVKRRIDEIQNTGGGTNFRAALLKVVGGI 294 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ---QSLYYCNEAKKRG-AIVYAIG 326 F ++ A + + +TD E ST E + N+ K G ++ IG Sbjct: 295 FFNVIKKESQRLNHAT--RAVFLLTDAEETSTLEKDRLPRIRQAANDLKNEGHFEIFCIG 352 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + L AS V +D +G I K I Y + Sbjct: 353 VGQNIDETTLAEIASTPHIEHVFTLSK-FDDLEKVGDIIAEKNIDYGR 399 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 49/215 (22%), Positives = 89/215 (41%), Gaps = 40/215 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV----QSGLVTFS 227 +D++IVLD S S+ +++ + +L++ + P + Q G++ + Sbjct: 110 MDIVIVLDGSNSIYPWYE--------VQAFLINILQKFYIGPGQIQFLWFTGQVGVLQYG 161 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 K+ F L S ++ +K G TN+ G+ A +Q F G R Sbjct: 162 EKVVHEFKLSDYKS-VEEVVKRARSINQRGGEETNTALGINVACSQAFKHGGRRG----- 215 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL- 336 KK+++ +TDGE+ + + QQ + C + G YAI + R I FL Sbjct: 216 ---AKKVMIVITDGESHDSADLQQVIKDCEK---DGITRYAIAVLGYYNRRGINPEAFLN 269 Query: 337 --RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G+ I Sbjct: 270 EIKYIASDPDDKHFFNVTDEAALKDIVDALGERIF 304 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 72.5 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 64/461 (13%), Positives = 139/461 (30%), Gaps = 106/461 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F G +TIL + ++ V G+ ++ ++ L ++ DR+++ AA Sbjct: 28 LTRFAREDDGLVTILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA----- 82 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + K K+ N + D+G + S ++D + + Sbjct: 83 --DLDQMRDPKTVVEDYFAKSGMTEFLSNVVVDDGLNFRTVTVDASKNMDTQFIGRFGFP 140 Query: 128 SISAIS-------RYKIPLKFCTFIPWYTNSRHIVMPITSSVKV------NSQTDARLDM 174 ++ + K+ + I + + + + ++ + + + + Sbjct: 141 TLEVPAHSQAEERVAKVEISLVLDISGSMATNNRLGEVQNAADIFLDTVLKDENQDLISV 200 Query: 175 MIVL---------------------DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +V D S ++ F + I M + ++ + Sbjct: 201 SLVPYSEQVNAGPLIMDRMNVNRKHDYSHCID-FDNGDFDSIAMNSSTRYNQMQHFQWNY 259 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY------ 267 D N + V E L+ +I L T+ G+K+A Sbjct: 260 DGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPR-AGTSIFLGMKWAAAMLDPA 318 Query: 268 -----NQIFDMQG-MRQHCNTE----DANYKKIIVFMTDGEN------------------ 299 N + + R+ N D+ K ++ MTDG N Sbjct: 319 FRDINNSLVNAGYVDREFYNRPASYTDSETLKTVILMTDGANDNSYRIRSNYYDSDSEYV 378 Query: 300 ----------------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 T + C+ AK + ++++IG V Sbjct: 379 HWNKYNLWWYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVIWSIGFEVDD 438 Query: 332 SH-EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++ CAS S + + +AF I + I R+ Sbjct: 439 EDVPAMQDCASSPSHFFRVEGVELSEAFRAIARQINQLRLT 479 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 53/159 (33%), Gaps = 22/159 (13%) Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK---- 288 L + L+ + L G T G+ + + + +E Y+ Sbjct: 353 PVPLTEDKTVLKDHVNALVAEG-GTAGHLGIAWGWYLVSPEWAAIWPEASEPLPYRQPQT 411 Query: 289 -KIIVFMTDGENLSTKEDQQ------SLYYCNEAKKRG--AIVYAIGIRVI-------RS 332 K ++ MTDG+ S+ C+ K +Y +G +V Sbjct: 412 SKAVILMTDGDFNIEHPTASRDSFRQSMDLCDGMKASSRRIQIYTVGFQVPSSVQRTGDG 471 Query: 333 HEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L CA SP+ + ++ + + + I + I R+ Sbjct: 472 RTILEYCATSPSHAFSADSGEELIEVYRSIARSISDLRL 510 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 84/228 (36%), Gaps = 37/228 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + G + ++TA +P I + G+ I++ + K+ + + +D +++ A + Sbjct: 11 LPALRSAEGGNVAMITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGA---LG 67 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNE 125 GN + T+ ++ + + + T+LDI V + Sbjct: 68 RQAGNTAAETTLDV------QTYALALFTDQGGGLDCDPVAVTFDETNLDILGTVRCRQP 121 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y S I ++ + TS+ V +LD+ + DVS SM Sbjct: 122 TYLSSLIGHDELEFN---------------VASTSTYGV-----GKLDVAFIFDVSGSMN 161 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNN-VVQSGLVTFSNKIEE 232 S+ ++ + A ++E+ + V+ + ++++ + Sbjct: 162 SY-----NRLAQLKTAAVAAVDELLPDSRERDGTVRLAIASYNHSLNA 204 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 57/429 (13%), Positives = 129/429 (30%), Gaps = 83/429 (19%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 + KG IL A+ +P +F + + + + K + + + + + N+ Sbjct: 1 MKLRRHQKGHAAILFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAH--ND 58 Query: 69 GN----GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 N G I+ N + + + + ++I + V P+ Sbjct: 59 PNQDYGGGGSPSSANQQIVTDYINAYISDVDSINEIKVYKRNCEEIPECKAGLAVGEPRY 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLD 179 + + + K W+ + IV S +D+M D Sbjct: 119 FEHEVGVTTSQK---------SWFPGNDAIVGMGDSFSTSGHSLARKYQSEAVDVMFAAD 169 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNKIEEFFLL 236 S SM + K + I I+++ +E++ D+ +N G+ ++ + Sbjct: 170 FSGSMGDRWTGGNKKYEDLIDIIDSISKELQKFNDLEHNDNDNTMGITAYNEYTYSQYSG 229 Query: 237 E-------------------WGVSHLQRKIKYL----------SKFGVS-------TNST 260 WG + + I L + + T++ Sbjct: 230 SSGGWWGDDCYLSQAESDGFWGGVSISKTIDGLWNEKSKDHCNNSYNSGRFNDIPLTSNF 289 Query: 261 PGLKYAYNQIFDMQGMRQHCN--------TEDANYKKIIVFMTDG---ENLSTKEDQQS- 308 + ++ + G + T N +++++ ++DG +N T + Sbjct: 290 DVVNQDVSRFWPEGGTSSYQALIRGAQLLTYGTNSRRLLIVLSDGMDTDNNLTSSLVNAG 349 Query: 309 --------LYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRACASPNSFYLVENPHS-MYD 356 L R + IG ++ L+ C + Y EN + Sbjct: 350 MCRDIQQGLESDKTLDNRPIRAQMAVIGFDYEPSENQALKDCVGAENVYKAENSDDILNT 409 Query: 357 AFSHIGKDI 365 I ++I Sbjct: 410 ILELISEEI 418 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 75/195 (38%), Gaps = 26/195 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 D+++V+DVS SME K++ A + L + + GLVTFS + Sbjct: 373 DILLVVDVSGSMEG------DKLEAAKAGLGTFLSRILPED------RVGLVTFSTESRL 420 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L L I + G + Y+ + D + + + + Sbjct: 421 VVPPAPLSDTRIRLDDAIAVMRAQGRTAL--------YDALIDGKEALDSLPSTGDDRIR 472 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 IV ++DG + S++ + + E + G ++ + + L+ A+ + LV+ Sbjct: 473 AIVLLSDGLDNSSRATLEQVRLAFE--ESGISIFPVAYGADADTDALQQIATFSRTILVQ 530 Query: 350 -NPHSMYDAFSHIGK 363 + + F ++ + Sbjct: 531 GDAGDIGQIFENLSR 545 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEF--- 335 K ++ +TDGEN Q L AK+ +Y IG+ + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 252 Query: 336 --------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 46/338 (13%), Positives = 111/338 (32%), Gaps = 53/338 (15%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G++ K++ ++ + NG + S + + V V + +++ Sbjct: 191 GSDAKRITESSKY-KVTEINVDQDTYAVSANGGQVKVTQEKDSATTEPVSVGEVPRITVT 249 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFD 189 P + + + +T + +SQT + D+++V D S SM + Sbjct: 250 NTVV-TAPRYRKYIKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMG 308 Query: 190 SSITKIDMAIKSINAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + +++++A ++N+M + + + ++ ++ LV FS + + + Sbjct: 309 HN-SRLEVAKTAVNSMAQHLLTSENQGKDSNIRMALVPFSTTVGNVSNFTDNAMDIVSAV 367 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L G + N ++ + KK IVFM+DG+ + Sbjct: 368 NGLRADGGT-----------NWEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVR 416 Query: 308 S------------------------------------LYYCNEAKKRGAIVYAIGIRVIR 331 + ++ A ++++G+ Sbjct: 417 TGTDWWGRPTYDDDDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDP 476 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + A + S+Y + + AF+ I I K Sbjct: 477 TKMRGFADQTKGSYYSATSTDELNKAFADIIGQINRKS 514 >gi|55743096|ref|NP_066933.1| collagen alpha-1(XIV) chain precursor [Homo sapiens] gi|125987815|sp|Q05707|COEA1_HUMAN RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|187954653|gb|AAI40894.1| Collagen, type XIV, alpha 1 [Homo sapiens] Length = 1796 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DENFNKIISFLYSTVGALNKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 78/222 (35%), Gaps = 24/222 (10%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 VK QT A D++I++D S S+ F + + ++ + + Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFLENLVTAFDV---GS 192 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + GL +S + L + + ++ L G +T + L Y + F + Sbjct: 193 EKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKPE 252 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KI + +TDG K + ++ G ++AIG++ +E Sbjct: 253 ------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVNE 301 Query: 335 FLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + P+ Y V M+ + + + ++ D+ Sbjct: 302 LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 81/211 (38%), Gaps = 32/211 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 151 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 199 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F S L Q + + + T + G++ ++F N Sbjct: 200 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVKELF------HSKNGAR 253 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHEFLR 337 + KKI++ +TDG+ + + EA+K G I YAIG+ ++ + Sbjct: 254 KSAKKILIVITDGQKFRDP--LEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIG 311 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + S + + V N + A I + I K Sbjct: 312 SAPSQDHVFKVGN----FVALRSIQRQIQEK 338 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 81/211 (38%), Gaps = 32/211 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 151 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 199 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F S L Q + + + T + G++ ++F N Sbjct: 200 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVKELF------HSKNGAR 253 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHEFLR 337 + KKI++ +TDG+ + + EA+K G I YAIG+ ++ + Sbjct: 254 KSAKKILIVITDGQKFRDP--LEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIG 311 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + S + + V N + A I + I K Sbjct: 312 SAPSQDHVFKVGN----FVALRSIQRQIQEK 338 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 81/211 (38%), Gaps = 32/211 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 153 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 201 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F S L Q + + + T + G++ ++F N Sbjct: 202 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVKELF------HSKNGAR 255 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHEFLR 337 + KKI++ +TDG+ + + EA+K G I YAIG+ ++ + Sbjct: 256 KSAKKILIVITDGQKFRDP--LEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIG 313 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + S + + V N + A I + I K Sbjct: 314 SAPSQDHVFKVGN----FVALRSIQRQIQEK 340 >gi|332214177|ref|XP_003256207.1| PREDICTED: collagen alpha-1(XIV) chain [Nomascus leucogenys] Length = 1796 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DDNFNKIISFLYSTVGALNKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 78/218 (35%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DMM+ +D+S SM ++ SI ++ ++ +E+ K + GLV F Sbjct: 84 DMMLAVDLSGSMAIKDMQTQSGQSIDRLTAIKHVLSNFIEKRKGD-------RLGLVLFG 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + +++++ L G ST GL A + Sbjct: 137 DHAYLQTPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGLGIATKTFIKSKA---------- 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 +++I+ ++DG N + D L AK+ G +Y +GI Sbjct: 187 -PQRVIILLSDGANTAGVIDP--LEAAKLAKESGVTIYTVGIGADEMLQRSIFGVQKVNP 243 Query: 332 SHEF----LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 S + L A + ++ NP + + I + Sbjct: 244 SQDLDEKTLTKIAQMTGGKYFRARNPQELDKIYQIINQ 281 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 75/217 (34%), Gaps = 39/217 (17%) Query: 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+++VLD+S SM ++ ++ ++ +LE VK D + GLV F+ + Sbjct: 88 DIILVLDISGSMLAEDYEIDQKRV----SRLDIVLEVVKTFLDKRTNDRIGLVAFAGRAY 143 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + L+RKI L T L A +++ + + Sbjct: 144 TVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSRLEGKKESGERKKIGS----- 198 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------------- 332 ++ +TDG N + + A V+ IG + Sbjct: 199 FLILLTDGANNC--GNLTPIEAARLAAHAAVPVFTIGAGINGEVTMPVMDEERRKIGSQT 256 Query: 333 ------HEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 LR A + ++ + +++ AF I Sbjct: 257 VVSEVDEGLLRNIAQLTGGEYFRATDSNAIVSAFQAI 293 >gi|307591433|ref|YP_003900232.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986287|gb|ADN18166.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 491 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 74/207 (35%), Gaps = 18/207 (8%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + ++++++D S SM+ K+ + + ++ LI + + ++ Sbjct: 46 TLEKTPQEIVLLIDCSGSMDGN------KLSEVKTAATSFVQRQDLITN-----RIAVMG 94 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + ++ L V+ LQ I L G T L A +Q+ + + Sbjct: 95 FGSGVQLGTPLTSDVNVLQTAIANL-YDGGGTMMDQALTAATDQLHNASASLESAIPSGE 153 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 N + I+ TDG +L A+ + A+ ++ + PN Sbjct: 154 N--QHILLFTDGVAADP---YNTLVAGQTAQNAQINIVAVATGDADTNFLSQLTGDPNLV 208 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWY 372 + N + AF K I +K++ Sbjct: 209 FYA-NTGNFDAAFQAAEKAIYSKQLVE 234 >gi|260813586|ref|XP_002601498.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] gi|229286795|gb|EEN57510.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] Length = 384 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 22/188 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 4 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KTISGFDISPSGTQVGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + ++ L I + T + ++Y F + Sbjct: 55 RTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K+++ +TDG EA+++G VYAIG+ + + + ++ N+ Sbjct: 109 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIASNNNTLA 163 Query: 347 LVENPHSM 354 V+N + + Sbjct: 164 FVDNFNLL 171 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 231 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KTISGFDISPSGTQVGVIQYST 281 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + ++ L I + T + ++Y F + Sbjct: 282 RTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 335 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG EA+++G VYAIG+ + + Sbjct: 336 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQL 379 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 56/194 (28%), Gaps = 56/194 (28%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + Q K+ + G TN G+ + + + + + + + KKI+ Sbjct: 498 SVSDLTDNKNTTQAKLTSMQASGA-TNVQMGVAWGWRTLSPGEPFTEGRPYDAEDNKKIM 556 Query: 292 VFMTDGENLSTKE----------------------------------------------- 304 + MTDG N Sbjct: 557 IIMTDGNNTYYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYDGEANPGHNSQTFTKAM 616 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACASPNS-----FYLVENPHSMYD 356 D+ C AK G +Y+I V L CAS + ++ N ++ D Sbjct: 617 DEHLTETCTNAKNAGITIYSIAFDVPNGSSVKATLEDCASSDVGGGKLYFDANNNAALID 676 Query: 357 AFSHIGKDIVTKRI 370 F I + + RI Sbjct: 677 TFEKIAERLADLRI 690 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 96/229 (41%), Gaps = 13/229 (5%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 L+ ++ F + +G + + TA+ LP++ + +G + + ++ + S +D + AA Sbjct: 19 LSNTVKRFRDDERGNVFVFTALSLPVMLMAIGAGADYAELYRARVNFQSAVDAGAIAAAK 78 Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + G K + G ++ + E + N + D D V+ + +P Sbjct: 79 NLAATGQVQTSKDI-GEEVFRSNLSHLGEKAVREGQINFDMGDGDCAVQGV-ITTATLPH 136 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + +S+S + + + K ++ +S+V+ + T +++ +VLD S S Sbjct: 137 DRFFSLSFVDQSQ--QKGFGANKIVKGQEEFILSASSTVECGNDT---IEIALVLDNSGS 191 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLI-PDVNNVVQSGLVTFSNKIE 231 M KI ++ N+++E + N +Q +V F+ + Sbjct: 192 MRWNG-----KIGTLRQASNSLVETLHTTMGSANKAIQFSVVPFAATVN 235 >gi|311253435|ref|XP_001924360.2| PREDICTED: collagen alpha-1(XIV) chain [Sus scrofa] Length = 1795 Score = 72.5 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 49/333 (14%), Positives = 110/333 (33%), Gaps = 38/333 (11%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCR----IKNTWNMSFRNELRDNGFVND 106 + + + + G + + LC ++ E + + Sbjct: 902 QASGFSDALTGVVKTLFLGVTDLQADQIQMTSLCARWQVQRHATAYRIVIESLQDTQKQE 961 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVK 163 +T + + Y IS ++ + P + P T Sbjct: 962 STLGGGTTRHCFYGLQPDSQYKISVYTKLQEMEGPSVSIMEKTKSLPTPPPTFPPTIPPA 1021 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 A+ D++ ++D S S+ + IT + + + + I + Q + Sbjct: 1022 KEVCKAAKADLVFMVDGSWSIGDENFNKIT------NFLYSTVGALDKI--GADGTQVAM 1073 Query: 224 VTFSNKIEEFFLLEWGVSHLQR----KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V F++ F L + IK++S G +T + +K+ + +F + + Sbjct: 1074 VQFTDDPRTEFKL--NTYKTKETLLDAIKHISYKGGNTKTGKAIKHVRDNLFTAESGIRR 1131 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K+IV +TDG + E + G ++A+G+ E + Sbjct: 1132 G------IPKVIVVITDG-----RSQDDVNKISKEMQLDGYSIFAVGVADADYSELVSIG 1180 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P+ + V++ +DAF I +++T Sbjct: 1181 SKPSARHVFFVDD----FDAFKKIEDELITFVC 1209 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 82/223 (36%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + + LE + +V Sbjct: 141 STPEEVKFFCQTPAIADIVILVDGSWSIGRFN----------FRLVRSFLENLVTAFNVG 190 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 191 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 250 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 251 E------AGARTGVSKIGILITDG-----KSQDDVIPPSRNLRESGVELFAIGVKNADEN 299 Query: 334 EFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P++ Y V M+ + + + ++ D+ Sbjct: 300 ELREIASEPDNTHVYNVAEFDLMHTVVESLTRTVCSRVEEQDR 342 >gi|113682008|ref|NP_001038479.1| hypothetical protein LOC563353 [Danio rerio] gi|94732542|emb|CAK03688.1| novel collagen protein [Danio rerio] Length = 873 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 78/212 (36%), Gaps = 24/212 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D+ ++D S S+ + + ++ K + Q +V Sbjct: 6 AGCSTTPNDLAFIIDGSSSLG---------VPNFETAKRWLINITKGFDVSSRHTQVAVV 56 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L L + +S G +T + +K+A + +F M Sbjct: 57 QYSDTPRLEIPLGKHQNSQELVEAVGSVSYLGGNTRTGRAIKFATDHVFGMPNHTSQ--- 113 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 + +I V +TDG + ED EA+ + +++A+G+ ++ L + A Sbjct: 114 --SPRNRIAVVLTDGRSQDDVEDA-----AMEARAQNIVLFAVGVGNEITNSELVSMANK 166 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + VE+ +S+ + + + + + + Sbjct: 167 PASTYVLHVEDYNSIASIWDLMEQKLCEESVC 198 >gi|119612406|gb|EAW92000.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_c [Homo sapiens] Length = 849 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 52 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 111 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 112 IG---DENFNKIISFLYSTVGALNKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 163 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 164 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 213 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 214 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 268 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 269 KIEDELITFVC 279 >gi|45384200|ref|NP_990400.1| integrin alpha-1 [Gallus gallus] gi|2582830|dbj|BAA23160.1| alpha1 integrin [Gallus gallus] Length = 1171 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 52/297 (17%), Positives = 111/297 (37%), Gaps = 36/297 (12%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + L D V ++ ++ + +L +V +G ++ Y + ++ Sbjct: 82 IKLNLPDATSVPNVMEVKENMTLGTTLVTNPKGGFLACGPLYAYKCGRLHYTTGVCSNVS 141 Query: 154 IVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +V + Q +LD++IVLD S S+ + + +N++L + + Sbjct: 142 STFETVKAVAPSVQECKTQLDIVIVLDGSNSIYPW--------ESVTAFLNSLLRNMDIG 193 Query: 213 PDVNNVVQSGLVTFS-NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 P Q G+V + + EF+L + + + ++ + G T + G+ A + Sbjct: 194 P---QQTQVGIVQYGQTVVHEFYLNTYSTTEEVMDAALRIRQRGGTQTMTALGIDTAREE 250 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-- 327 F H +K++V +TDGE+ Q+ + C +AI I Sbjct: 251 AFT----EAHGARRG--VQKVMVIVTDGESHDNYRLQEVIDKCE---DENIQRFAIAILG 301 Query: 328 RVIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 302 SYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTIVEALGERIFALEATTDQ 358 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/176 (23%), Positives = 75/176 (42%), Gaps = 25/176 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + V S+T +D+++ +DVS SM + D +++ K ++ +N Sbjct: 76 PRSVDVTSKSKTTRGIDIVMAIDVSSSMLAN-DLKPNRLEALKKVAATFVQ-----DRIN 129 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFD 272 + + GLV ++ + + + + + +K + S T GL A N+I D Sbjct: 130 D--RIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFDDSIIADGTGIGVGLATAINRIKD 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +II+ +TDG N S D ++ + AK+ G VY IGI Sbjct: 188 SKAKS-----------RIIILLTDGVNNSGTIDPRTA--ASIAKEYGIKVYTIGIG 230 >gi|119612404|gb|EAW91998.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_a [Homo sapiens] Length = 865 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 52 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 111 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 112 IG---DENFNKIISFLYSTVGALNKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 163 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 164 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 213 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 214 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 268 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 269 KIEDELITFVC 279 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D +++ A ++ +E+ + GL+ ++ Sbjct: 91 VMLVIDVSQSMRA-TDVEPSRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 144 VSPTTNREATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVL 199 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E + Sbjct: 200 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETM 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + +S + + I Sbjct: 260 KKVAQLSGGNSYNAATLAELRAVYSSLQQQI 290 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K Q+ H Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLK 148 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 149 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 203 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 204 ATDHTYRRNFTAADWGQSRDAEEVISQTI 232 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 702 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 753 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 754 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 808 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 809 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 850 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 51/165 (30%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 919 SPADITILLDGSASVGSH---NFDTTRRFTKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 974 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L I + +T+ L Y R + + Sbjct: 975 GQQRPERASLQFLQNYTALASAIDAMDFINDATDVNDALGYV---------TRFYRDASS 1025 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + EA++ G ++ + + Sbjct: 1026 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGR 1069 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 45/269 (16%), Positives = 87/269 (32%), Gaps = 56/269 (20%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S Y + F+ + I P +D+++ +D S SM + D Sbjct: 68 ITSFYAFSMACLVFVFAEPYNNSIDPP-------KIDYKNGIDIILSIDASGSMLAQ-DF 119 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 ++++A + ++ K + GLV + + L+ +I + Sbjct: 120 DPNRLEVAKRVAKKFVDSRKGD-------RVGLVVYEGEAYTACPATLDYKLLKEQISAI 172 Query: 251 SK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T GL A ++ D+ K+I+ +TDG + + E Sbjct: 173 EPGHLEPGTAIGSGLGVAVTRLRS-----------DSLISKVIILLTDGSSNTGPE---P 218 Query: 309 LYYCNEAKKRGAIVYAIGIRVIR---------------------SHEFLRACASP--NSF 345 L AK + VY IG+ L+ AS + Sbjct: 219 LEVAELAKAKKCRVYTIGVGADGMAPTPVNTPFGVVYQNLPVEIDEGVLKEIASATNGKY 278 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + ++ S+ ++ I D + KR D+ Sbjct: 279 FRAQDEKSLEKIYAEI--DKLEKRKMEDQ 305 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 78/214 (36%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + L ++ V+D S SM ++ K+D+A +++ + + + G+V Sbjct: 403 DREQRPDLALVFVIDRSGSMAE-PAGNVQKLDIAKEALVQAIRMLYGED------RVGIV 455 Query: 225 TFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 TF ++ + GV + + I ++ G TN GL + ++ +H Sbjct: 456 TFDSQAYTTMPITQGVGEEEVLQAIASVTADG-GTNIGAGLSAGQRMLTGVEAKIKH--- 511 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 ++ +TDG L + +G + + + E L+ A+ Sbjct: 512 --------MILLTDGWGEGN----DQLAVVEAMRAQGITLSVVAAGSDTAEE-LKTLATA 558 Query: 342 -PNSFYLVENPHSMYDAF--SHI---GKDIVTKR 369 +Y ++ I GK IV +R Sbjct: 559 GGGRYYAAAIMQAVPQILVDETITVAGKLIVERR 592 >gi|313225346|emb|CBY06820.1| unnamed protein product [Oikopleura dioica] Length = 369 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/189 (19%), Positives = 74/189 (39%), Gaps = 26/189 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KI 230 D++ ++D S S+ + + K +KS+ V + + ++ +S+ +I Sbjct: 90 DLVFIIDGSWSVG---NVNFRKAKDFMKSL------VNPFEIGWDYTRVSVLQYSDDPRI 140 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E + + L I ++ G +T + ++Y QIF ++ KK Sbjct: 141 EFYLKDYQDKTTLLNAIDAITYKGGNTRTGEAIRYMMGQIFSVE------AGSRPYVKKH 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYL 347 +V +TDG+ AK +AIG+ E L+ A+P ++ Y Sbjct: 195 MVLLTDGQ-----SQDDVGAPARAAKNFNIRTFAIGVGDAIEDE-LKLVATPPFSDTLYH 248 Query: 348 VENPHSMYD 356 VE+ + Sbjct: 249 VEDYDGIRH 257 >gi|194220813|ref|XP_001500011.2| PREDICTED: vitrin [Equus caballus] Length = 662 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 94/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ S + + + ++ + T Sbjct: 414 EGAVENEKQYVIEPNFANKAVCRTNGFYSLNVQSWFSLHKTVQPLVKRVCDTERLACSKT 473 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ + ++ + + +E ++ Sbjct: 474 CF--------NSADIGFVIDGSSSVG------TSNFRTVLQFVANLSKEFEISDTD---T 516 Query: 220 QSGLVTFSNKIEEFFLLE-WGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + + + IK + + T++ + YA Q+F Sbjct: 517 RIGAVQYTYEQRLEFGFDDYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF------ 570 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 571 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMVAHHKGVITYAIGVAWAAQDELEV 622 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + I ++I Sbjct: 623 IATHPAKDHSFFVDEFDNLYKSVPKIIQNI 652 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/197 (20%), Positives = 69/197 (35%), Gaps = 34/197 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A + ++ ++D S SM + F I+K DMA ++ L ++ + G++ F Sbjct: 391 QRAPVSILFIIDRSASMSATFG--ISKFDMAKEAAILSLTTLQPGD------RVGVLAFD 442 Query: 228 NKIEEFFLLEW---GVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + GVS LQ +I +S G TN L + + +H Sbjct: 443 TETIWTVPFRTVGEGVSLVELQDQIATMS-LGGGTNIERALSVGLPALANEPYSTRHA-- 499 Query: 283 EDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 V +TDG N + Q A+ + I I E L Sbjct: 500 ---------VLLTDGRSYSNNYPRYQQLV----ETARAAQITLSTIAIGSDSDTELLNQL 546 Query: 340 ASPNS--FYLVENPHSM 354 AS + +Y V + + Sbjct: 547 ASWGNGRYYFVADATDL 563 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 77/214 (35%), Gaps = 23/214 (10%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++ + +D+ +VLD S S+ K+ KS+ V Sbjct: 69 CTAPVAATSCPIPIDLAMVLDSSGSIGKK---DWVKLLEFTKSV------VDAYSVSEEA 119 Query: 219 VQSGLVTFSNKI------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 G++T+S + +++ +E +L++ I + + T L+ A + +F Sbjct: 120 THVGVITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKALELANSVLFT 179 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VI 330 N K++ +F+TDG + + + K RG VY +G+ + Sbjct: 180 ------EARGMRPNKKQVCLFLTDGIQTFDQGPYTKPSIVSQKLKDRGIDVYTVGVGDDV 233 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 E L + Y +N + I ++ Sbjct: 234 DLFELLSISSGDKYTYSAKNFDELQAKVQEILQE 267 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 82/253 (32%), Gaps = 32/253 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 +S S K F + I + + LD++ + S+S Sbjct: 239 LSISSGDKYTYSAKNFDELQAKVQEI------LQEQCTGCKNTLDLVFGIPNSQS----L 288 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRK 246 S IT + + + + + N+ GLV + L + V +Q+ Sbjct: 289 GSEITNVKQVMSRLAGLFDVA------NSKAHIGLVAYDQGAHMALPLDKAYSVGAVQQA 342 Query: 247 IKYLSKFGV-STNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 I L+ ++ +K A + F M G+RQ K ++F G ST Sbjct: 343 INNLATSKKHGFDTAAAIKVAADHAFSMFGGVRQ------TQPKVFVLFAPRG---STST 393 Query: 305 DQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVE--NPHSMYDAFSHI 361 + + KK G + +GI + + A P +L+ + + A I Sbjct: 394 AAEIKEAAEKLKKNGIRLMVVGIDNSADTATYSSAVTQPAKRFLMNTKDYDDLNAAVWEI 453 Query: 362 GKDIVTKRIWYDK 374 + + K Sbjct: 454 ADTVCKSAVTPGK 466 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 45/271 (16%), Positives = 99/271 (36%), Gaps = 37/271 (13%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + ++ ++V +D V +N YS++ S + + ++ + T Sbjct: 443 EGAAESEKQNVVEPNFVDKAVCRRNGFYSLNVPSWFGLQKVARPLAKRVCDTHRLACSKT 502 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 503 CL--------NSADVGFVIDGSSSVG------TGNFRTVLQFVANISKEFEVSDTD---T 545 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S L IK ++ + T++ ++YA ++F Sbjct: 546 RVGAVQYTYEQRLEFGFDQHRTKSDLLSAIKRVNYWSGGTSTGAAIRYALERLF------ 599 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A ++G I YAIGI + + L Sbjct: 600 ---EKSKPNKRKLMIVITDGR-----SYDDVRIPALAAHRKGVITYAIGITWA-AQDELE 650 Query: 338 ACASP---NSFYLVENPHSMYDAFSHIGKDI 365 AS + + V+ ++Y I ++I Sbjct: 651 VMASDPDKDHAFFVDEFDNLYTFVPQIIQNI 681 >gi|34534804|dbj|BAC87116.1| unnamed protein product [Homo sapiens] Length = 725 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 76/209 (36%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + S+ A +D+M +LD S S+ S + ++ Sbjct: 28 HVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSVGK---GSFERSKHFAIAV------C 78 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ G FS+ F L+ + ++ +IK + G T + LKY Sbjct: 79 DGLDISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETGLALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R +A+ +I++ +TDG K + K+RG V+A+G+ Sbjct: 139 --------HRGLPGGRNASVPQILIIVTDG-----KSQGDVALPSKQLKERGVTVFAVGV 185 Query: 328 RVIRSHEFLRACASP---NSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPRGQHVLLAEQVED 213 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 57/159 (35%), Gaps = 24/159 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +T G + + + + G Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 661 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 72/187 (38%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRQA-----AERGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P ++ Sbjct: 501 SDPQDLF 507 >gi|156409371|ref|XP_001642143.1| predicted protein [Nematostella vectensis] gi|156229284|gb|EDO50080.1| predicted protein [Nematostella vectensis] Length = 332 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 72/198 (36%), Gaps = 22/198 (11%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 SA YK LK + R I +P + K + +D+ +++D S S+ Sbjct: 155 SASLVYKTSLKSVRSLVKRLQ-RSICLPTRPTKKPTRLCNRPIDLGLLVDGSGSIVLHGK 213 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKI 247 + ++ ++S+ + + G++ +S + F L S + KI Sbjct: 214 DNFGRLIEFVQSLVSFFRI------SRRHTRVGMILYSTRSYPIFRLNQYTSKRAIMGKI 267 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + T + L+YA F + + K++++ +TDG Sbjct: 268 RNVRYPAGGTRTGQALRYARRYFFSGRKPKGR--------KRVLILLTDG-----ISQDS 314 Query: 308 SLYYCNEAKKRGAIVYAI 325 + + GA V+ I Sbjct: 315 VKGPALQLRNAGAEVFTI 332 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 825 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGIINY 875 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 876 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------R 925 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 926 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNL 984 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 985 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 1016 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 69/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSSESSKIVL---FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D QS+ +A+ G IG+ + + LR Sbjct: 155 VALLMTDGIDHPKNPDVQSIS--EDARISGISFITIGLSTVVNEAKLR 200 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 843 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 893 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNL 951 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 952 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 983 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G I + + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIALSTVVNEAKLR 200 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 843 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 893 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNL 951 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 952 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 983 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSSESSKIVL---FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G IG+ + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIGLSTVVNEAKLR 200 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 76/206 (36%), Gaps = 28/206 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ VLD S S+ + + + + E V+ ++ + FS+ Sbjct: 32 HGAYDLYFVLDRSGSVSTDWS-------EIYDFVKNLTERF-----VSPNLRVSFIVFSS 79 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E L S + + +K LS+ T G+K A Q+ E Sbjct: 80 RAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQM----------KKEPKK 129 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 IIV +TDG+ L T Q ++ + A+K GA VY +G++ + S + Sbjct: 130 SSSIIVALTDGK-LETYIHQLTIDEADSARKYGARVYCVGVKDFDEEQLADVADSKEQVF 188 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V+ + A I I+ + Sbjct: 189 PVKGG---FQALKGIVNSILKQSCTE 211 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKTMADRVAL---DLATARIGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 843 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ A ++ IG+ E Sbjct: 893 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNFEIFHKEMNL 951 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 952 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 983 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 71/168 (42%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G IG+ + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIGLSTVVNEAKLR 200 >gi|326434435|gb|EGD80005.1| hypothetical protein PTSG_10281 [Salpingoeca sp. ATCC 50818] Length = 736 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 71/217 (32%), Gaps = 23/217 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 SV + +A D++ +LD S S+ S S+ + +++ + + Sbjct: 170 SVPKMTCGNAAADLLFILDGSGSVGS---SNFQTMLSFTRTVATF------FDVSADTTR 220 Query: 221 SGLVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ +++ + ++L L I ++ T + L YA +F Sbjct: 221 IAVMVYASYNYLIFDFNYILTHTKDELLDAISAINYPYGGTRTGGALDYARTVMFTAD-- 278 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + ++ + +TDG + + G +YAIGI +E Sbjct: 279 -RGVRPSSEGIPRVAMVITDG-----ASADDVAAPAQQLRDEGVTLYAIGIAGANENELN 332 Query: 337 RACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + P ++ + + I + + Sbjct: 333 EIASPPITSNVVFISTFSEFGTLAAAISRANCDRAAT 369 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 63/163 (38%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + + ++V L + + G++ + Sbjct: 792 CKETPLELLFVIDSSESVGP------ENFQIIKNFVKTLTDQVAL---DLDTARIGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E L S + + + G T + L A N +F+ Sbjct: 843 SHKVEMVAHLTQFSSKDDFKLAVDNMQYLGEGTYTATALHAA-NHMFEAA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ A ++ IG+ Sbjct: 893 PGVKKVALVITDGQ-TDTRDEKNLTEVVKNASDASVEIFVIGV 934 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + K + S++ + ++ + + ++ + FS+ ++ Sbjct: 47 IDVVFIVDSSESSKIVL---FDKQKDFVDSLSDKVFQLTPVRSLKYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++++K ++ G T S + A ++ +G + K Sbjct: 104 IDSPFSSWKDLQTFKQRVKSMNLIGQGTFSYYAISNA-TRLLQREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D QS+ +A+ G + IG+ + + L Sbjct: 155 VALLMTDGIDHPKNPDVQSIS--EDARTAGILFITIGLSTVVNEAKLH 200 >gi|114586712|ref|XP_001158576.1| PREDICTED: alpha 1 type VII collagen isoform 1 [Pan troglodytes] Length = 2944 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 77/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|114586714|ref|XP_516439.2| PREDICTED: alpha 1 type VII collagen isoform 2 [Pan troglodytes] Length = 2912 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 77/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ S + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|260813588|ref|XP_002601499.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] gi|229286796|gb|EEN57511.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] Length = 375 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 73/188 (38%), Gaps = 22/188 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 4 RNPLDIIFLLDGSGSVGA---SNFDKVKQFTK------KAISGFDISPSGTQVGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F L ++ L I + T + ++Y F + Sbjct: 55 RTRQEFSLNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K+++ +TDG EA+++G VYAIG+ + + + ++ N+ Sbjct: 109 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIASNNNTLA 163 Query: 347 LVENPHSM 354 V+N + + Sbjct: 164 FVDNFNLL 171 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 62/169 (36%), Gaps = 22/169 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 222 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KTISGFDISPSGTQVGVIQYST 272 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + ++ L I + T + ++Y F + Sbjct: 273 RTRQEFSMNSFLTKETLSAAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 326 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG EA+++G VYAIG+ + Sbjct: 327 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDVDQL 370 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 80/221 (36%), Gaps = 33/221 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEV 209 + P+ ++ + R D+M+V+D S SM D + +K D+ + + +++ Sbjct: 66 LASPVLTNEYKEIKKKGR-DIMLVIDSSDSMNQWGFDPGDPNKSKFDVVKEVVGDFIDKR 124 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAY 267 K + GL+ F++ L + L++ ++ G T L Y Sbjct: 125 KND-------RIGLINFASVAFVASPLTFEKDFLRKILQMQEPGIAGKRTAINDALLQTY 177 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 N + KI + +TDG +N S + +Y IG Sbjct: 178 NILSKSDAKS-----------KIAILLTDGIDNASRISFDEIRRL---ISDSDIKLYTIG 223 Query: 327 IRVIRSHE--FLRACASPNS--FYLVENPHSMYDAFSHIGK 363 I R + +L+A A F+ + S+ + I + Sbjct: 224 IGSYRDFDAPYLKALAQAGHGRFFAASDRRSLQKIYEAIDR 264 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 76/206 (36%), Gaps = 28/206 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ VLD S S+ + + + + E V+ ++ + FS+ Sbjct: 32 HGAYDLYFVLDRSGSVSTDWS-------EIYDFVKNLTERF-----VSPNLRVSFIVFSS 79 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E L S + + +K LS+ T G+K A Q+ E Sbjct: 80 RAEIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQM----------KKEPKK 129 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 IIV +TDG+ L T Q ++ + A+K GA VY +G++ + S + Sbjct: 130 SSSIIVALTDGK-LETYIHQLTIDEADSARKYGARVYCVGVKDFDEEQLADVADSKEQVF 188 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V+ + A I I+ + Sbjct: 189 PVKGG---FQALKGIVNSILKQSCTE 211 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEF--- 335 K ++ +TDGEN Q L AK+ +Y IG+ + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 252 Query: 336 --------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 75/197 (38%), Gaps = 27/197 (13%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFF--DSSITKIDMAIK 200 I W + P T + R D++ LD+S SM E F + ++ID + Sbjct: 69 IAWILLVVALAGPRTIAASPAQPASGR-DIVFALDLSGSMAAEDFVLDGHAASRIDALKR 127 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTN 258 A+++ + + GLV F+ + L + V + R + + G ST Sbjct: 128 VGAALIKR-RTGD------RIGLVIFAERAYAAAPLSFDVDAVSRTLAEIPLGLVGHSTA 180 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 GL A ++ + + ++IV ++DG N + D A Sbjct: 181 IGEGLGLALKRLTESKA-----------PSRVIVLLSDGANDAGTTDPTG--VAELANNL 227 Query: 319 GAIVYAIGIRVIRSHEF 335 G +Y IG+ V+ + F Sbjct: 228 GVKIYTIGLGVVDTQTF 244 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 76/205 (37%), Gaps = 29/205 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++D+++V+D S S+ + K+ + + ++ L +V G+ Sbjct: 408 SQECIRGKIDIVLVVDQSGSVN---QCNFQKV---KRWLRDIVRSFNLGVTEQDV---GV 458 Query: 224 VTFSNKIEEFFLLEWGVS-----------HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V +S K +++ G S + + +K L+ G +T + K A + Sbjct: 459 VVYSKKATTSTVVDLGFSDYDSDGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLTG 518 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N + KK+I+ +TDG + Q + ++ ++ A+G+ Sbjct: 519 --------NKSRPDAKKMIILLTDGATTAANT-LQLKEELDVSRAANVMILAVGVGKFNQ 569 Query: 333 HEFLRACASPNSFYLVENPHSMYDA 357 E ++ +F+ V + Sbjct: 570 TELIQIAGDRKNFFAVTKFSELEKV 594 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 99 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 152 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKFPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEF--- 335 K ++ +TDGEN Q L AK+ +Y IG+ + Sbjct: 204 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 258 Query: 336 --------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ +N + + I K Sbjct: 259 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 78/234 (33%), Gaps = 54/234 (23%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+ +D+S SM + ++ D+ ++ N L D + GL+ F Sbjct: 93 DLMMAIDLSGSMAIQDMQKSNGKMESRFDLVMRVANEFL-------DTRQGDRVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + +++ + ++ G T + A ++ G Sbjct: 146 TWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 K +V +TDGEN Q L AK+ +Y IG+ + Sbjct: 198 ---KALVLLTDGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNT 252 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHI-------GKDIVTKRIWY 372 E L+ A + F+ +N + + I V +++ Y Sbjct: 253 SEDLDTEVLQKIATMTGGKFFRAQNSADLKQVYESIDQLEPIESDKTVVRQVTY 306 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 75/179 (41%), Gaps = 25/179 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P + V S+T +D+++ +DVS SM + D ++D + + +E Sbjct: 11 LARPRSVDVTAKSRTTKGIDIVMAIDVSGSMLAK-DFKPNRLDALKRVASTFIE-----D 64 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQ 269 +N+ + GLV ++ + + + + + +K + T GL A N+ Sbjct: 65 RIND--RIGLVVYAGESYTRTPITSDKTVILQSLKTVEYDDSIIADGTGIGVGLATAINR 122 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I D + ++I+ +TDG N + D + + AK+ G VY IGI Sbjct: 123 IKDSKAKS-----------RVIILLTDGVNNAGTIDPRMA--ADIAKQYGIKVYTIGIG 168 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 71/210 (33%), Gaps = 32/210 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++T+ D+M+ +D+S SM+ +++ D + ++ + + Sbjct: 88 TKTETARDVMLAIDLSASMDYRDFPGPDGKPVSRFDAVQRVVDQFVA-------NREGDR 140 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F K ++ + + + + G D G+ Sbjct: 141 VGLIVFGAKAYLQLPFTRDLNTARALVDLMQVGMAGPQTALG---------DSIGLAIRA 191 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------H 333 ++++ +TDG T + A+ G +Y IGI + Sbjct: 192 FESSEVDDRVLILLTDG--NDTASKMTPINAAEIAQLNGIEIYTIGIGDAEATGEDRIDF 249 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 E L + A S F+ ++ ++ + I Sbjct: 250 ETLASIAERSGGQFFDAQDETALRQVYDRI 279 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 99 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 152 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 204 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 258 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 259 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|301755498|ref|XP_002913610.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Ailuropoda melanoleuca] Length = 765 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 82/219 (37%), Gaps = 28/219 (12%) Query: 144 FIPWYTNSRHIVMPITSSVKVNS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + H+ + S Q A +D++ ++D S S+ + + Sbjct: 19 HLSHPLQEVHVSRETVGKISAASKMIQCFAAVDVLFLIDGSHSVGK------GSFERSKH 72 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTN 258 + + + + P V+ G + FS+ F L+ S ++ KIK + G T Sbjct: 73 FAIMVCDALDINP---ERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTE 129 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + LKY R +A+ +I+V +TDG + E + K+R Sbjct: 130 TGLALKYLLR--------RGFPGGRNASVPQILVVITDGRSQGPVE-----LPAKQLKER 176 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDA 357 G V+A+G+R R E L AS V + DA Sbjct: 177 GVTVFAVGVRFPRWEE-LHTLASEPREQHVLMAEQVDDA 214 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 55/135 (40%), Gaps = 19/135 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ S + + + + L DVN +V Q GLV Sbjct: 524 GCQAQSLDLLFMLDASASVGSENFAQMQS----------FVRSLTLQFDVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ G + + R + G ++ L + Y+++ +Q Sbjct: 574 VYGSRVQTAFGLDTHLGRAAVLRAMSQAPYLGGVGSAGTALLHVYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDG 297 K +V +T G Sbjct: 628 ARPGVPKAVVLLTGG 642 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 72.1 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 83/234 (35%), Gaps = 39/234 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F+ + G T++ + LP+I + +VS + K+ L + +D + + ++ Sbjct: 3 LNKFWRSKSGNFTLVLGLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASSHLGDL 62 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + N+ + + + + + V Sbjct: 63 DITRNDAFNRY-----------FQANIVGHGE-----------LDNAQATLTVDKGVNFV 100 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 A++ + L F + +S+HIV+ +++ +L++++VLD + SM Sbjct: 101 KTKAVASADVHLNF---AFLFGDSKHIVVDA-----SAVESNNQLEVVLVLDNTGSMAG- 151 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---FLLEW 238 ++ + ++L+ ++ +++ V F + F W Sbjct: 152 -----ARMTALRTATKSLLDTLEAAKSPTRKIRASPVPFVTAVNVNGDGFDPSW 200 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 58/177 (32%), Gaps = 37/177 (20%) Query: 232 EFFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L++ +++ G TN + GL + + + KI Sbjct: 376 PVVPLTDDFDKLRKAASQMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKI 435 Query: 291 IVFMTDGEN---------------------------------LSTKEDQQSLYYCNEAKK 317 ++ +TDGEN + D + C + K Sbjct: 436 VLLLTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAARNVDGWTKNVCTQLKN 495 Query: 318 RGAIVYAIGIRV--IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G +Y + ++ + ACAS P+++Y V +P + + F I + ++ Sbjct: 496 EGVQIYTMVLQSDTAANRALYSACASDPSNYYAVNDPTKLPNVFLQIANNFTKLQLT 552 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 72/209 (34%), Gaps = 36/209 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D SRSM + D ++++ A + + + V+ GLV FS+ Sbjct: 95 VVLAIDTSRSMRAE-DLEPSRLEAAKAAAREFIRAMPPG------VEVGLVAFSSYATLL 147 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L++ + L T GL A + + +V Sbjct: 148 QPPTTDRERLEQAVDLLD-LAHRTAIGDGLVAALRVLPLEDSDAPGGMS--------VVL 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---------------HEFLRA 338 ++DG N L +A+ +G VY +G+ + + E L+ Sbjct: 199 LSDGRNNY---GIDPLEAARQAEAQGVRVYTVGVGLSENTYVFANGYYIRAGLDEETLQE 255 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + ++Y + + + + + + Sbjct: 256 IAALTGGAYYRASSADELRAVYQTLARAV 284 >gi|116283392|gb|AAH14765.1| Itga1 protein [Mus musculus] Length = 685 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 102/296 (34%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V +G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F KK++V +TDGE+ +Q + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGH 312 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 368 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 76/201 (37%), Gaps = 26/201 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++V+DVS SME + KI++A ++ ++E + V GL+ FS++I Sbjct: 103 VLVVDVSGSMEDSIPGGV-KIEVARRAATLLVERMPGG------VDVGLLAFSDRIVLSL 155 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + I+ L G + + P L+ A + + + ++VF+ Sbjct: 156 PPTGDRRRVLDAIESLKPGGGTMYTYP-LQAALSWLKPYKLFNAST---------LVVFV 205 Query: 295 TDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVI--RSHEFLRACA--SPNSFYLVE 349 +DG D + +E + G VY + I L+ A + Y Sbjct: 206 SDG----LPADAATYRTLLSEFRSLGIPVYTVYIGPGGDEGERELKLIAGSTGGEEYTAG 261 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + + AF + + + + Sbjct: 262 SAEELLKAFKTLAEKASSILV 282 >gi|224090449|ref|XP_002195035.1| PREDICTED: integrin, alpha 1 [Taeniopygia guttata] Length = 1184 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 107/297 (36%), Gaps = 36/297 (12%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + L + V ++ ++ + +L +V +G ++ Y + ++ Sbjct: 95 IKLNLPASTSVPNVVEVKENMTLGTTLVTNPKGGFLACGPLYAYKCGRLHYTTGVCSNVS 154 Query: 154 IVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++ + Q +LD++IVLD S S+ + + +N++L + + Sbjct: 155 STFETVEAIAPSVQECKTQLDIVIVLDGSNSIYPW--------ESVTDFLNSLLRNMDIG 206 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQ 269 P Q G+V + + F L + + + G + T + G+ A + Sbjct: 207 P---QQTQVGIVQYGQTVVHEFYLNTYSTTEDVMAAASRIRQRGGTQTMTALGIDTAREE 263 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-- 327 F H +K++V +TDGE+ Q+ + C +AI I Sbjct: 264 AFT----EAHGARRG--VQKVMVIVTDGESHDNYRLQEVIDDCE---DENIQRFAIAILG 314 Query: 328 RVIRSH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 R + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 315 SYSRGNLSTEKFVEEIKSIASKPTEKHFFNVSDELALLTIVEALGERIFALEATTDQ 371 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 83/218 (38%), Gaps = 33/218 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D R +++ LDVSRSM + D ++ + A +++ + E+ + GLVTFS Sbjct: 82 ADPRTTIVLALDVSRSMRA-TDVLPSRFEAAREALKVFIRELP------QGARIGLVTFS 134 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E L ++ L T G+ + + + Q + +D + Sbjct: 135 RAATEVVAPTTNRQRLLDSVE-LIGLEFGTAIGEGILT---SLQALPPLEQRKDAKDPSE 190 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------- 332 I+ +TDG ++S + ++ A ++ ++ IG+ + Sbjct: 191 LATIILLTDGRSISGIDPLEAARI---AAEQKVRIHTIGVGRVTEGPVPGLESVYQWAAY 247 Query: 333 --HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIV 366 + L+ A + ++ V + + + + + + V Sbjct: 248 FDEDVLKQIAAITGGKYFFVNSAGKLRETYQQLSQSFV 285 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/264 (15%), Positives = 93/264 (35%), Gaps = 42/264 (15%) Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-----TNSRHIVMPITSSVKVNS 166 ++ + + + ++ + + + SV+ Sbjct: 70 PTSHISMRDMLSFRSIAVDSSGSQEGLSMSNEIFKNENSVIKAKVDRLSGSFKKSVEKIK 129 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 R+D++ ++D S S+ IK + +L + + N + ++TF Sbjct: 130 IKHKRVDIVFLIDASSSVGK------ANFYSEIKFVKKLLSDFNVSY---NYTRVAVITF 180 Query: 227 SNKIEEFFLLEWGVSHLQR---------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++++ F ++ + ++ +I + G T + LK A + Sbjct: 181 SSQMKIFRHIDQISTSVEDNDKCLLLNYQIPKIEFSGGGTYTYGALKEAEEIFQNA---- 236 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 A+ KKII +TDG + + + K++ ++Y+IGI+ E Sbjct: 237 ------RADSKKIIFLITDGFSNG----RDPIPLAESLKRKNVVIYSIGIQSGNYAELYN 286 Query: 338 ACASPNSFYLVENPHSMYDAFSHI 361 +SP ++ + D+F H Sbjct: 287 MSSSPG-----DSHSFLLDSFDHF 305 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+ +D+S SM + ++ D+ ++ N L D + GL+ F Sbjct: 93 DLMMAIDLSGSMAIQDMQKSNGKMESRFDLVMRVANEFL-------DTRQGDRVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + +++ + ++ G T + A ++ G Sbjct: 146 TWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 K +V +TDGEN Q L AK+ +Y IG+ + Sbjct: 198 ---KALVLLTDGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNT 252 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + F+ +N + + I + Sbjct: 253 SEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESIDQ 290 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 77/215 (35%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ + LD+S SM+ + + ++ ++ ++ +E+ K + G++ F Sbjct: 94 DLFVALDLSGSMQISDMYYQSRPVNRLVISKHVLSDFIEKRKGD-------RIGVIVFGT 146 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + +++ I+ + G T + Q+ ++ Sbjct: 147 KAYLQAPLSFDTKTVRQLIQETQIGFAGEKTAIGDAIGLGIKQLSELPSD---------- 196 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 KK+++ MTDG N + + L N A ++G ++ IGI Sbjct: 197 -KKVLILMTDGANTAGR--VSPLQAANFAAEQGVTIHTIGIGADEMEVQGFFGPQTVNPS 253 Query: 332 ---SHEFLRACAS--PNSFYLVENPHSMYDAFSHI 361 L AS +Y ++ + + + I Sbjct: 254 EDLDEALLENVASLTGGKYYRAKSTSDLEEIYGDI 288 >gi|291386938|ref|XP_002709809.1| PREDICTED: vitrin [Oryctolagus cuniculus] Length = 869 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS + S + + + ++ + T Sbjct: 621 EGAVENEKQYVVEPNFANKAVCRTNGFYSFNVQSWFGLHKTVQPLVKRVCDTDRLACSKT 680 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 681 CF--------NSADLGFVIDGSSSVG------TGNFRTVLQFVANLSKEFEISETD---T 723 Query: 220 QSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F ++ + IK + + T++ + YA Q+F Sbjct: 724 RIGAVQYTYEQRLEFGFDKYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF------ 777 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 778 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHHKGVITYAIGVAWAAQDELEV 829 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 830 IATYPAKDHSFFVDEFDNLYKFVPRIIQNI 859 >gi|153791389|ref|NP_001028400.2| integrin alpha-1 precursor [Mus musculus] gi|189442109|gb|AAI67237.1| Integrin alpha 1 [synthetic construct] Length = 1179 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 102/296 (34%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V +G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F KK++V +TDGE+ +Q + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGH 312 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 368 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 82/208 (39%), Gaps = 28/208 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +D S SM+ D + + A + LI +++ + + F + E+ Sbjct: 115 DIVFAIDTSGSMK-NTDPNNERFSAA----------LNLIDNMDKNNRFSMYKFDDTAEK 163 Query: 233 FFLLEWGVS----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + K+K + +TN L+ AY +I + ++ Sbjct: 164 IIPMSQVTKQSREEVSGKLKDMQNPKGNTNMRDALEKAYEEIKSSETKDKNAM------- 216 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFY 346 ++ ++DG + ++ K++ +Y IG+ + L+ A S ++Y Sbjct: 217 --VIMLSDGGDTYDLS-KKFDETLKPFKEKNISIYTIGMSNGNNFSMLKEIAKESGGNYY 273 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWYDK 374 V+ + + F+ I +D +R+ DK Sbjct: 274 NVKEIKDLKNVFNKIYRDRQ-QRLLVDK 300 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 66/170 (38%), Gaps = 10/170 (5%) Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 I + + +N + L ++ ++ + L+ G +TN T G+++ Sbjct: 402 SPIASNTDTLYPAAKCATNNLLPVMGLTTDIAAVRAHAQKLTPAG-NTNITIGVQWGMEL 460 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-------EDQQSLYYCNEAKKRGAIV 322 + D K ++ +TDGEN + + ++L C AK G V Sbjct: 461 LSPELPFNTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTINARTLLACQAAKDLGITV 520 Query: 323 YAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 Y I + S + L++CAS P FY V + + + I + R+ Sbjct: 521 YTIRVMEGNS-DMLKSCASRPEYFYDVTASSQLTSTLAKVFYSIQSTRLT 569 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 35/251 (13%), Positives = 77/251 (30%), Gaps = 35/251 (13%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F+ + G I + + G +E S L D + + A +E Sbjct: 21 FWRDVSGNTMIAFGLIAATLVAAAGAGVEFSQAQEQTNRLQDAADAAALRGALMAKDETA 80 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + ++ + +I S Sbjct: 81 AKAAAD-----------TVFTLNLTDSGIAPTSKGLKFEISGSH---------------- 113 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + Y + T + + TS+ + + ++ +VLD + SM DS Sbjct: 114 KTAVYTATAQIKTTFLKLAGIETLTVGATSTAEAEMRKS---EIALVLDSTGSMS--RDS 168 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 +T + A+ S+ A L + +V + + G+V F ++ + G++ + Y Sbjct: 169 RMTNLKAAVDSVLASL--LVSGENVWD-ARVGIVPFDTQVSIRGVPAAGIAGEFGETPYT 225 Query: 251 SKFGVSTNSTP 261 S+ ++ Sbjct: 226 YSCTGSSMTSD 236 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 80/215 (37%), Gaps = 36/215 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L+++ V+D S S+ + +K++ + + + + G++ FS+K Sbjct: 772 TPLELIFVIDSSESVG---PDNFNSTKTFMKTV------IDEVSANHATTRIGIINFSHK 822 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E LE + L+ + + G T + +K A + Sbjct: 823 VELVSSLETYTTKESLKSAVDKMLYLGEGTYTASAIKKAISLFQAA----------RPAV 872 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRAC---AS 341 +K+ + +TDG+ ++ EA ++ IGI H FL+ A+ Sbjct: 873 RKVALVVTDGQ-ADNRDKVHLDLVVKEAHAANIEIFVIGIVQKTDPHYHNFLKEMHLIAT 931 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 FY +E+ + S + ++TK I + Sbjct: 932 DPDEEHFYQIED----FKTLSALADKLITK-ICEN 961 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 68/177 (38%), Gaps = 16/177 (9%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S V+ + +D++ ++D S ES + + S+ + ++K + N V Sbjct: 34 SVVQSDEDDLCLIDIVFIVDSS---ESAKNQLFDLQKNFVLSLTDNIFQMKPVKSQNYNV 90 Query: 220 QSGLVTFSNKIEEFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + + FS+ + W V + + KI+ L G T S + A Q+F +G Sbjct: 91 KLAGMQFSSTVSIDHPFTAWKNVQNFKEKIRALVYIGQGTYSYYAISNA-TQLFKTEGRE 149 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K+ MTDG + + Q + A+ G + IG+ E Sbjct: 150 RSI--------KVAFLMTDGVDHPNSPNVQGIATA--ARSLGIHFFTIGLSKKNVKE 196 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 72/222 (32%), Gaps = 45/222 (20%) Query: 170 ARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +D+M+ LD S SM++ S+T+++ K + + + + GLV Sbjct: 85 PGVDIMLCLDTSGSMQALDFKVEGKSVTRLEAVKKVVADFIGK-------RETDRIGLVV 137 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + L L + + + G + G R + Sbjct: 138 FGEEAFTQSPLTIDKGLLLELVNRMKIGMAGDRTAIG------SAIAIGGKRL---KDLK 188 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------------- 328 + KI++ +TDG N + + ++ G +Y IG+ Sbjct: 189 SKSKILILLTDGRNNA--GEISPQAAARAVREFGIKLYTIGVGGKGPAPFRMKTLFGTRL 246 Query: 329 VIRSHEF----LRACAS--PNSFYLVENPHSMYDAFSHIGKD 364 V + + LR A ++ N + + + I + Sbjct: 247 VPQHVDLDEVTLRNVAKTGGGKYFRAANSQELQEIYDIIDRA 288 >gi|123781093|sp|Q3V3R4|ITA1_MOUSE RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|74186862|dbj|BAE20498.1| unnamed protein product [Mus musculus] Length = 1179 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 102/296 (34%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V +G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F KK++V +TDGE+ +Q + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGH 312 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 368 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 31 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 90 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 91 HYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 147 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 148 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSII 202 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 203 ATDHTYRRNFTAADWGQSRDAEETISQTI 231 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 608 CTCGPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAGV 661 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + +K L T + L+Y +++ Sbjct: 662 VQYSHNQMQEHVDMRSPNIRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLL------- 714 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 715 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIK 756 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 53/165 (32%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + + V+ +V +S Sbjct: 824 SPTDITILLDGSASVGSH---NFETTKVFAKRLAERFLSADRTDPSQD-VRVAVVQYSGL 879 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + +T+ L Y R + Sbjct: 880 GQQQPGRTALQFLQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYRENSL 930 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + + EA++ G ++ + + Sbjct: 931 GATKKRVLLFSDGNSQGATAEAIEKAV-QEAQRAGIEIFVVVVGP 974 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 85/243 (34%), Gaps = 33/243 (13%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S + R + K I W + P + +T+ + D+M+ LD+S+SM++ Sbjct: 53 SVVPRANLLQKIIAPICWLLVLTALARPQFVEPPIE-KTEPQRDLMLALDLSQSMDTRDF 111 Query: 190 SSI-----TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 S ++D + ++ + GLV F + + ++ Sbjct: 112 SDPQGNLQARVDAVKTVVADFVDR-------RPYDRLGLVAFGDAPYPLVPFTMDHATVR 164 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + T + PG+ + D G+ + K++V +TDG T Sbjct: 165 SML---------TGALPGMAGPKTALGDALGLSIKLFQQSQAPDKVLVVLTDG--NDTAS 213 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLR--ACASPNSFYLVENPHSMY 355 A + ++ +GI + E L+ A A+ ++ ++ ++ Sbjct: 214 KMPPDKAAEIASQNHIRIHTVGIGNPDAQGEEKLDTETLQKIATATGGRYFFGQDQQALA 273 Query: 356 DAF 358 + + Sbjct: 274 EIY 276 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 69/186 (37%), Gaps = 22/186 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + T A +D+++++D S S+ + + ++ ++ + D Sbjct: 247 PAGSQFQCDTPAMIDLVLLVDGSWSIGRN------NFKLIKEFLSNLISPFSIAEDK--- 297 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ GL +S+ + L + + ++ L G +T + L + Q Sbjct: 298 IRVGLSQYSSDPRTEWELSAYSTREQVLEAVRNLRYKGGNTFTGLALTHVLEQ------N 351 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +K+++ +TDG K + K G ++AIG++ E Sbjct: 352 LKPDAGARLEAEKLVILLTDG-----KSQDDANLAAQTLKNLGIEIFAIGVKNADEAELR 406 Query: 337 RACASP 342 + + P Sbjct: 407 QVASEP 412 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 99 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 152 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 204 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 258 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 259 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|194679013|ref|XP_608702.4| PREDICTED: matrilin 4-like [Bos taurus] gi|297490994|ref|XP_002698558.1| PREDICTED: matrilin 4-like [Bos taurus] gi|296472628|gb|DAA14743.1| matrilin 4-like [Bos taurus] Length = 788 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 77/208 (37%), Gaps = 29/208 (13%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ I + V S+ A +D++ ++D S S+ S + ++ Sbjct: 28 HVSKEIIGKISVASKMMWCSAAVDVLFLMDGSHSVGK---GSFERSKHFAITV------C 78 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ G V FS+ F L+ + ++ KIK + G T ++ LKY Sbjct: 79 DALDINPRKVRVGAVQFSSVPRLEFPLDAFSTQQGVKGKIKRMVFKGGHTETSLALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + F +A+ ++++ +TDG + K+RG V+++GI Sbjct: 139 RKGFPGGR--------NASVPQVLIIITDGR-----SQGHVALPAKQLKQRGITVFSVGI 185 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHS 353 R E + P + E Sbjct: 186 HFPRWEELHSLASEPREQHVLMAEEVDD 213 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 46/112 (41%), Gaps = 11/112 (9%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ + + + + PD V Q GLV Sbjct: 524 GCQAQSLDLVFMLDASVSVGP------ENFAQMQSFVRSCVLRFDVNPD---VTQIGLVV 574 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + ++++ F L+ +S + R + G ++ L + Y+++ +QG Sbjct: 575 YGSRVQTAFGLDTHLSRAAVLRALSQAPYLGGVGSAGTALLHIYDKVMTVQG 626 >gi|74193348|dbj|BAE20643.1| unnamed protein product [Mus musculus] Length = 682 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/296 (15%), Positives = 102/296 (34%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V +G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F KK++V +TDGE+ +Q + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGH 312 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 368 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 80/224 (35%), Gaps = 27/224 (12%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G IL ++ L G +++S ++ K+ L ++D ++ A + Sbjct: 14 FARHSGGNFAILFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGV- 72 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 +K D ++N + + + +V T+ + + Sbjct: 73 ------IKEADASKAVQNFLVANSMAGILQPDQIVLDRLVVDRTANTVQADA-----HVD 121 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + N++ + TS +D +++ ++LDV+ SM + + + Sbjct: 122 VALFF--------PVFGMGNTQRVTASTTSLY-----SDKTIEVAMMLDVTGSMAANWWA 168 Query: 191 SITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEE 232 KI + + +E + I N V+ +V ++ + Sbjct: 169 KTDKIGDLQAAASTAVENLLDNNIDPNNPRVRVAIVPYAEAVNT 212 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 62/194 (31%), Gaps = 43/194 (22%) Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY------- 267 VN + G + E L L I GV T +++ Y Sbjct: 287 VNRDDRMG----TCPKPELIPLTADKQKLLDTIADFKAAGV-TAGGIAVQWGYYMLSPSW 341 Query: 268 -NQIFDMQGMRQHCNTEDANYKKIIVFMTDGE---------------NLSTKEDQQSLYY 311 + I + + N ++ K+ + MTDG+ N + Sbjct: 342 RSTIVNARLGSGPANFDNRKVGKVAILMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESI 401 Query: 312 CNEAKKRGAIVYAIGI----------RVIRSHEFLRACASPN-----SFYLVENPHSMYD 356 C+ K+ G ++ IG ++ L+ C++ + +Y + + Sbjct: 402 CDNMKRDGIEIFTIGFDLDDPSMTSTERDQAKSVLQDCSTADTSTLKHYYEAATGPELDE 461 Query: 357 AFSHIGKDIVTKRI 370 AF+ I ++I I Sbjct: 462 AFNAIVQNIERLTI 475 >gi|17554334|ref|NP_498645.1| MUscle Positioning family member (mup-4) [Caenorhabditis elegans] gi|15789306|gb|AAA28092.5| Muscle positioning protein 4 [Caenorhabditis elegans] Length = 2104 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 22/206 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSG 222 + D++ ++D S S+ S+ + + L E +L + Sbjct: 428 STACPAQPTDLVFLIDGSGSIGSY---------VFQTEVLRFLAEFTELFDIAPQKTRVS 478 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++I F L+ LQ I+ + T + +++ N+ F + Sbjct: 479 VVQYSDQIRHEFGLDNYSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFS----ERRG 534 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRAC 339 ++ + +TDG + + A+++ ++A+G+ + E Sbjct: 535 ARPVGQVSRVAIVITDG-----RSQDNVTRPSDNARRQDIQLFAVGVTNHVLDAELEEIS 589 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDI 365 S + + V + + + Sbjct: 590 GSKDRTFHVSGFEDLNTRLRSAIQRV 615 >gi|14579227|gb|AAK69172.1|AF289202_1 transmembrane matrix receptor MUP-4 [Caenorhabditis elegans] Length = 2104 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 22/206 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSG 222 + D++ ++D S S+ S+ + + L E +L + Sbjct: 428 STACPAQPTDLVFLIDGSGSIGSY---------VFQTEVLRFLAEFTELFDIAPQKTRVS 478 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++I F L+ LQ I+ + T + +++ N+ F + Sbjct: 479 VVQYSDQIRHEFGLDNYSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFS----ERRG 534 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRAC 339 ++ + +TDG + + A+++ ++A+G+ + E Sbjct: 535 ARPVGQVSRVAIVITDG-----RSQDNVTRPSDNARRQDIQLFAVGVTNHVLDAELEEIS 589 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDI 365 S + + V + + + Sbjct: 590 GSKDRTFHVSGFEDLNTRLRSAIQRV 615 >gi|13591884|ref|NP_112256.1| integrin alpha-1 precursor [Rattus norvegicus] gi|124941|sp|P18614|ITA1_RAT RecName: Full=Integrin alpha-1; AltName: Full=CD49 antigen-like family member A; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1; AltName: CD_antigen=CD49a; Flags: Precursor gi|56494|emb|CAA36384.1| unnamed protein product [Rattus norvegicus] gi|149059385|gb|EDM10392.1| integrin alpha 1, isoform CRA_b [Rattus norvegicus] Length = 1180 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 47/296 (15%), Positives = 102/296 (34%), Gaps = 35/296 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + I +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVIAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGENVTHEFNLNKYSSTEEVLVAANKIGRQGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RV 329 F KK++V +TDGE+ +Q + C ++I I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGH 312 Query: 330 IRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 313 YNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 368 >gi|324499530|gb|ADY39800.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 3675 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 80/212 (37%), Gaps = 21/212 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P + + D+M ++D S S+ S+ + + ++E V+L Sbjct: 1176 PGRVCTVQTTCPKQKTDLMFLVDGSGSIGSY---------VFKHEVLRFVKEFVELFDIG 1226 Query: 216 NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL+ +S++I F L + + + I + T + +++ + + Sbjct: 1227 LDNTRVGLIQYSDQIRHEFDLSQYRDKASVIQAISQVHYLTGLTRTGAAIQH---MVMEG 1283 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRS 332 R+ E + ++ + +TDG + A++ ++AIG+ + + Sbjct: 1284 FSERRGARAESDDVARVAIVITDG-----RSQDNVTEPAIAARRLHVNMFAIGVTDHVLA 1338 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 E SP+ ++ V+ + + + Sbjct: 1339 SELESIAGSPSRWFYVDRFKDLDTRLRSLIQK 1370 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 81/216 (37%), Gaps = 22/216 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 SS D++ VLD S S+ + K ++ ++ + PD Sbjct: 22 SSDNRKGCRTVYYDLVFVLDASSSVGDQDFGRVRKW------VSDLVATFDIGPD---YT 72 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G+V ++ + E L L + + ++ +T + +++ + F + Sbjct: 73 RVGVVVYAEEPEMAIALNQYTDRDSLIQAVGNITYLNGNTRTGKAIRFMNEESFSIANGA 132 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + Y ++ + +TDG EA+ G +YA+G+ E Sbjct: 133 RDIEFG---YNRLAIVLTDGR-----AQDNVFNPSLEAQNNGIQLYAVGVSTAVVEELNE 184 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + P+S ++++ + A I ++++ + I D Sbjct: 185 IASDPDSRHVMQV--DDFQAIERI-RELLRQIICVD 217 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 45/278 (16%), Positives = 86/278 (30%), Gaps = 57/278 (20%) Query: 120 VVPQNEGYSISAISRYKI----PLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDAR 171 QN +S+ +K+ L+ FI + + P T V + Sbjct: 32 RDQQNASIKMSSTQGFKMSTLAKLRPLLFILKMLALVLLTIAMARPRTVDVTTKVKKTEG 91 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ +D+S SM + D +++ K +E + GLV ++ + Sbjct: 92 IDIIMAVDISASMLA-RDLEPNRLEALKKVAINFIE-------GRPNDRIGLVIYAGESY 143 Query: 232 EFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S + I L T GL + N++ D + Sbjct: 144 TKTPLTTDKSIIFNAINDLEYSQNIEGGTAIGMGLATSVNKLKDSKAES----------- 192 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-----------IRSH---- 333 K+I+ +TDGEN + D ++ A + Y IG+ Sbjct: 193 KVIILLTDGENNAGFIDPKTAT--QLATEYDIKTYTIGVGSNGMALSPVGIKANGQFEYR 250 Query: 334 --------EFLRACASP--NSFYLVENPHSMYDAFSHI 361 L+ A ++ + + I Sbjct: 251 NIEVKIDEALLKTIAESNGGKYFRATDNQKFEAIYEEI 288 >gi|2065167|emb|CAA72402.1| collagen type XIV [Homo sapiens] Length = 755 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 43/208 (20%), Positives = 84/208 (40%), Gaps = 27/208 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A+ D++ ++D S S+ D + KI + S L ++ + Q +V F Sbjct: 1 CKAAKADLVFMVDGSWSIG---DENFNKIISFLYSTVGALNKI-----GTDGTQVAMVQF 52 Query: 227 SNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ F L + L IK++S G +T + +KY + +F + + Sbjct: 53 TDDPRTEFKLNAYKTKETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG---- 108 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN- 343 K+IV +TDG + E + G ++AIG+ E + + P+ Sbjct: 109 --IPKVIVVITDG-----RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSA 161 Query: 344 -SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + V++ +DAF I +++T Sbjct: 162 RHVFFVDD----FDAFKKIEDELITFVC 185 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/256 (18%), Positives = 92/256 (35%), Gaps = 25/256 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPI---TSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 P + + + + SV + +M V+D S SM+ F Sbjct: 1062 PATNHPEYKKYIKDNGNGTYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFG 1121 Query: 190 SSIT--KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 S + + ++ ++ ++ D + +Q G FS+ E +WG + Sbjct: 1122 SGNSDARREVVNSALELFFNQLS---DGDYNIQFGGYKFSDSGERVNFNDWGWQDKYWET 1178 Query: 248 ---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN--LST 302 LS +++ T G Y + ++ + N K+ ++F+TDGE S Sbjct: 1179 DTSNALSHLKLTSWETDGSTYPSQTLRSAISALENVELGE-NGKRYLIFLTDGEPGQNSY 1237 Query: 303 KEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNSF--------YLVENP 351 ++ C A K G YAI + SH F+ + S +F ++ + Sbjct: 1238 SFSKEEAENCYSAIKNLDSGTTFYAIQVANSDSHGFMESMVSNANFVDGVTAQKFVGNSA 1297 Query: 352 HSMYDAFSHIGKDIVT 367 + AFS + +I Sbjct: 1298 DELNAAFSQMAAEISG 1313 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 89/241 (36%), Gaps = 23/241 (9%) Query: 142 CTFIPWYTNSRHIVMPITSSVK--VNSQTDARLDMMIVLDVSRSMESFFDSSITK----- 194 P + + + +T T +D++ V+D S SM+ D K Sbjct: 1492 KRIEPNDDGTYSLTLDVTGIEGNPATVTTKYPVDLVFVIDKSLSMDYDIDGDEIKWWEDE 1551 Query: 195 IDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 + +N L+E+ +Q FS +L+W Q+ + L Sbjct: 1552 TESRKDIVNDALDEIIPDLCSQQYDIQIAGYQFSGS--STRVLDWSREE-QQVLNNLKIS 1608 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 S N+ P A DM + +N KK ++FMTDGE + ++ S Y + Sbjct: 1609 NTSYNTEP--SQALADALDMLKTGSQAHQNQSNVKKYLIFMTDGEPT--ESEELSYYAIS 1664 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPN--------SFYLVENPHSMYDAFSHIGKDI 365 + GA +Y IG+ S + + S + + + + DAF+ I +I Sbjct: 1665 KNPVPGASIYTIGVSSDASTDLMEGIRSTAEGNGMTAPATFKGTSAQLIKDAFTQIKDEI 1724 Query: 366 V 366 + Sbjct: 1725 I 1725 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 62/172 (36%), Gaps = 38/172 (22%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFF--DSSITKIDMAIKSINAMLEEVK------LIP 213 + + LD+++V+D+S SM+ DSS +++ + ++ E I Sbjct: 631 LGITGGDQQGLDIVLVIDLSNSMDDGISEDSSDSRLKVLKDTLGYYKESYNSRPGKPTID 690 Query: 214 DVNNVV----------QSGLVTFSNKIEEFFLLEW--------GVSHLQRKIKYLSKFGV 255 + + + + +VT+S L+W G ++ I L G Sbjct: 691 EKSGFIDDLFEQSPNSRFSIVTYST--YASTELDWTEYGMNGSGQQTIKEAIGELQANG- 747 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 TN GL A + + N +++F++DG+ D Sbjct: 748 GTNYEAGLYQAMEVL---KERGNSSNIP------VVIFLSDGKPTYYYSDVD 790 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 70/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVQPNRMAAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + G+ A I + + D IV Sbjct: 152 VSPTTNRDSTKAALDKL-QFADRTATGEGIFTALQAIATVGAV---IGGGDKPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E L Sbjct: 208 FSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETL 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + + ++ + + I Sbjct: 268 KKVAQLSGGNAYNAASLQELKAVYATLQQQI 298 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 68/189 (35%), Gaps = 12/189 (6%) Query: 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D S S+ D +N++L +++ + V L S I+ Sbjct: 3 DLIFLVDTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIRVSY-KSTYVSVVLFGTSATID 61 Query: 232 EFFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + +R L TN + AY+ IF G Sbjct: 62 INYIFNPHPNNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIF---GKYSGHKRPTHQV 118 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K + +TDG+ + K RG ++ IG+ + LR+ ASPN+++ Sbjct: 119 KTAVFLLTDGQWNW---NGDPWPIAKRLKDRGIEIFTIGVTNGVNVNTLRSLASPNNYFH 175 Query: 348 VENPHSMYD 356 + + Sbjct: 176 YNDFTQFRE 184 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 74/225 (32%), Gaps = 49/225 (21%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++ +D+++ +DVS SM + D ++ + + + + GLV + Sbjct: 87 KSTKGIDIVMAIDVSASMLA-RDLKPDRLQALKQVAARFI-------NGRPNDRIGLVEY 138 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + L + + + + T GL A N++ E Sbjct: 139 AGESYTKTPLTSDKTVVLSSLNSIEYNSIIEGGTAIGMGLATAVNRL-----------KE 187 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRS 332 K+I+ +TDGEN S D + A + G VY IG+ + Sbjct: 188 STAKSKVIILLTDGENNSGFIDPKIAS--ELAVEFGIKVYTIGLGTNGMASSPIGILPNG 245 Query: 333 ------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + + + + I K Sbjct: 246 RFQYGNQPVKIDETLLKEIAKTTGGQYFRATSNTKLNEIYEEINK 290 >gi|220672885|emb|CAX13868.1| novel protein similar to vertebrate vitrin (VIT) [Danio rerio] Length = 761 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 74/197 (37%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ ++ + + E ++ + G+V ++ ++ Sbjct: 578 DIAFVIDGSSSVG------TGNFRTVLQFVANVTREFEISDTD---TRVGIVQYTYEQRL 628 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + L IK ++ + T++ + YA +Q+F N +KI Sbjct: 629 EFAFGQHNNKADLLNAIKRINYWSGGTSTGAAITYAADQLFS---------KSKPNKRKI 679 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLV 348 ++ +TDG + + G I Y+IGI E P+ + V Sbjct: 680 MIVITDG-----RSYDDVRAPALAVHRTGVIAYSIGIAWAAQDELEYIATDPDREHSFFV 734 Query: 349 ENPHSMYDAFSHIGKDI 365 + ++Y S I +I Sbjct: 735 DEFDNLYKYVSKIIHNI 751 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 65/207 (31%), Gaps = 22/207 (10%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + P + F +T+ V + ++DM ++D S S+ Sbjct: 333 ETSHTRETPAESNPFESGFTSRVMDTTARAPEVPSQGDPNCKVDMAFLMDGSWSIGK--- 389 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 + + + + + + + G++ + + F L+ L+ I Sbjct: 390 ---RRFKIQKDFLVEVSQALNVGVAGA---MMGIIQYGDDPVTEFSLKQFSNSKDLKPAI 443 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + G ++ L Y Q F + + V + DG E+ Sbjct: 444 SKIVQKGGPSHVGKALSYINKQFFSDANGNRGGAPN------VAVVLVDGWPTDRVEEAS 497 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHE 334 L A++ G ++ + I +E Sbjct: 498 RL-----ARESGINIFFVTIEGPDDNE 519 >gi|198426251|ref|XP_002120495.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 1059 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 77/224 (34%), Gaps = 26/224 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P+ K T +D+++VLD S S F + ++ M I+ I + Sbjct: 849 DNGLGVWTAPTPVC---KARCSTFGFMDLVVVLDSSSS---FGKKNWAEMKMFIRQILSS 902 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + L V + + +++ S L + G T + L+Y Sbjct: 903 FD-ISLTATRAAVFRYNSMVYNDSQILLSDFPTNKSGLLEAYDAIVHNGRGTMTGQALQY 961 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + I +++ +TDG + + ++RGA+ YAI Sbjct: 962 VNDTILAD------VAGNRPEAVDVVLTVTDG-----RSVDDVTAISDSLRQRGALTYAI 1010 Query: 326 GIRVIRSH-----EFLRACASPNSFYLV-ENPHSMYDAFSHIGK 363 G+ + L N+ +L N ++D F IGK Sbjct: 1011 GVIPPNKKGADRDQLLSIAGEANNIFLAINNFDGVFDKF--IGK 1052 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 71/207 (34%), Gaps = 30/207 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +DM+ +LD S S+ S D+ + ++ + + + + ++ Sbjct: 606 NHSVIDMIFILDSSSSVGS------DNWDVMKNFVRDVINLLSITETG---TRVSIFRYN 656 Query: 228 NKIEEFFLLEWG-----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + + L + + G T L +A + I Q N Sbjct: 657 RRPDRQSQILLNDHIGDKQGLLSALDQMPYNGRGTWIGNALNHAKDFIL------QLRNG 710 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----EFLR 337 + + +++ +TDG + E +K+GA+ YA+G+ E L+ Sbjct: 711 DRPDVIDVVLTITDGRSKDDVSVVS-----EELRKQGALTYAVGVIPPNGKGPREEELLK 765 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKD 364 S + + + D F + D Sbjct: 766 IAGSTDRVKITTSFEGTKDHFEKLMSD 792 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 32/211 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ ++ + ++ ++ + + ++ +I+ Sbjct: 90 MDLVIVLDSSSSVG------QQNWEIMKSFVREIINTFEVSSTA---TRIAVFRYNKEID 140 Query: 232 EFFL--LEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L Q +K + G T + L + N + N + Sbjct: 141 TTSQILLSDFPDDKQALLKKYNAIPYDGSGTKTGQALSHVLNVVLS------PANGNRPD 194 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACAS 341 K +I+ +TDG + + + +GA+ Y +GI + S Sbjct: 195 VKDVILTITDGRSFDDVS-----QPSLQLRNQGALTYVLGILPSNGLGLDDEQLDVIAGS 249 Query: 342 PNSFYLVENP-HSMYDAF-SHIGKDIVTKRI 370 + + F + IG+ I K Sbjct: 250 SERLLVANAGFSGLNQDFAARIGQQICGKPC 280 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/203 (19%), Positives = 76/203 (37%), Gaps = 27/203 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D++ V+D S S+ + + +N+++ + P Q G+V +S+ Sbjct: 2 QADILFVVDGSSSI------PADEFEKVKTFLNSIVGHFDIGPTA---TQVGVVQYSSSP 52 Query: 231 EEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ F L S + Q+ I + G TN+ L +A + N Sbjct: 53 QQEFALNAHSSLVSLQQAITNIIIIGRGTNTGSALTFARDVALTA------ANGARPGLP 106 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYL 347 KI+V MTDG + L + G I +AIG+ + S + + Sbjct: 107 KIVVTMTDG-----ASSEDVLTPSQNLRNDGVITFAIGVTSRASDWQVEEIAGSLDRVFT 161 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + +DA +I + ++ Sbjct: 162 ASD----FDALDNIKVTLSSQLC 180 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N L+ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFLDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKFPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 252 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|326670350|ref|XP_001332841.4| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 609 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 77/220 (35%), Gaps = 25/220 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S D+ VLD S S++ ++ I +E + Sbjct: 23 TGLTPEKGTAASSCYGGFDLYFVLDKSGSVQHHWN-----------EIYNFVEHL-AQKF 70 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFD 272 ++ ++ + FS++ + L +++ +K L + G T G++ A QI+ Sbjct: 71 ISPQLRMSFIVFSDQGKILMQLTEDREQIRKGLKELQDVRPGGDTFMHEGIQRASEQIYY 130 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + +I+ +TDGE N ++ GA VY +G++ Sbjct: 131 GNTEGYRTAS-------VIIALTDGELHENHFYYAERE-ANRSRSLGASVYCVGVKDFNE 182 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + S + + V + ++A + I+ K Sbjct: 183 TQLAKIADSKDHVFPVNDG---FEALQGVIDSILKKSCIE 219 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 58/462 (12%), Positives = 134/462 (29%), Gaps = 104/462 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ F + G I + ++ L GM +++ + L +DR+++ AA Sbjct: 24 VKEFARDEDGAFIIFSLFMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAADL--- 80 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST-----------SL 116 + + + + + + ++ + ++ ++ T ++ Sbjct: 81 DQTLSPAAVVTDYFAKAGLSSFLTSTTVDQGLNYRIISAQGNMTMPTTFMRLSGQTELAI 140 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM- 175 + ++ I T I + ++ ++ L+++ Sbjct: 141 RGDATAEERVSNVEISLVVDISGSMGRNNKLSTLRTASHTFIDTVIRPETEDLISLNIIP 200 Query: 176 --------------IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + +D + D + A + + + + + Sbjct: 201 YTAQVNAGPDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPV--STRTYKQMQHFQYG 258 Query: 222 GLVTFSNKI-------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI---- 270 ++ N E + L+ + L +T G+K+ + + Sbjct: 259 WSSSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRAR-ANTAIHLGMKWGVSMLDPTF 317 Query: 271 ------------FDMQGMRQHCNTEDANYKKIIVFMTDGEN------------------- 299 D + + D K IV MTDG+N Sbjct: 318 RPIVTAMIANNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRISDEFYSTPSQIAH 377 Query: 300 ---------------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR- 331 +T+ D C+ AK G +V+ IG V Sbjct: 378 WDRYQLFFFTNNYIDRDIDQNYYYKKFTATQADTMLQSICDAAKAEGILVWTIGFEVSNH 437 Query: 332 -SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + E L +SP+ F+ VE + +AF+ I + I R+ Sbjct: 438 AAGEMLDCASSPSHFFRVEGV-ELSEAFASIARQINQLRLVL 478 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 52/241 (21%), Positives = 92/241 (38%), Gaps = 35/241 (14%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + F +P+ R++ T K++ Q ++IVLD S SM KI A Sbjct: 72 IMFSLALPYTIIPRYVKTTQTLEAKISLQRKPP--VVIVLDTSGSM------KGDKIITA 123 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I ++ +++ + V GL+TF++ + L +K+ + FG + Sbjct: 124 INAVKKFIDQTI------DYVLIGLITFNDHVRIAIPPTSDQELLYKKLGEIKAFGGTIY 177 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 S P L+ AY+ + N I+F+TDG S A+ Sbjct: 178 SKP-LEIAYDWLVPFAEF---------NLSPTIIFVTDGLPYSQDAPLYREVVYKCARYN 227 Query: 319 GAIVYAIGIRVIR--------SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 +Y I I + + LR A + FY V+ +S+ + F + + V+K Sbjct: 228 -ITIYPIFIETPGMSIYETMMAQQRLREIANITKGQFYNVKQTNSLINLFEKLAEKTVSK 286 Query: 369 R 369 Sbjct: 287 A 287 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D +++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPSRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTXPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E + Sbjct: 208 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETM 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + +S + + I Sbjct: 268 KKVAQLSGGNSYNAATLAELRAVYSSLQQQI 298 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 33/255 (12%), Positives = 76/255 (29%), Gaps = 30/255 (11%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S + + + + + + R +M+V+DVS+SM + D Sbjct: 55 SVAPQRPSRFRHIPAMLLALSLVLFTVAMAGPTHDVRIPRNRAVVMLVIDVSQSMRA-TD 113 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 ++ A ++ +E+ + GL+ ++ + + Sbjct: 114 VEPNRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVLVSPTTNREATKAALDK 167 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQ 306 L +F T + + A I + + D IV +DG+ + + Sbjct: 168 L-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVLFSDGKETMPTNPDNPK 223 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRACA--SPNSFYLVEN 350 + AK +G + I E ++ A S + Y Sbjct: 224 GAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAAT 283 Query: 351 PHSMYDAFSHIGKDI 365 + ++ + + I Sbjct: 284 LAELNSVYASLQQQI 298 >gi|114621486|ref|XP_001143977.1| PREDICTED: collagen, type XIV, alpha 1 isoform 2 [Pan troglodytes] Length = 1685 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 888 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 947 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 948 IG---DENFNKIISFLYSTVGALHKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 999 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1000 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1049 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1050 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1104 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1105 KIEDELITFVC 1115 >gi|114621482|ref|XP_001144037.1| PREDICTED: collagen, type XIV, alpha 1 isoform 3 [Pan troglodytes] Length = 1780 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DENFNKIISFLYSTVGALHKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|114621484|ref|XP_001143906.1| PREDICTED: collagen, type XIV, alpha 1 isoform 1 [Pan troglodytes] Length = 1800 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DENFNKIISFLYSTVGALHKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|114621480|ref|XP_519927.2| PREDICTED: collagen alpha-1(XIV) chain isoform 4 [Pan troglodytes] Length = 1796 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/251 (19%), Positives = 95/251 (37%), Gaps = 30/251 (11%) Query: 127 YSISAISRYKI---PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y IS ++ + P +R P T A+ D++ ++D S S Sbjct: 983 YKISVYTKLQEIEGPSVSIMEKTQSLPTRPPTFPPTIPPAKEVCKAAKADLVFMVDGSWS 1042 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + D + KI + S L ++ + Q +V F++ F L + Sbjct: 1043 IG---DENFNKIISFLYSTVGALHKI-----GTDGTQVAMVQFTDDPRTEFKLNAYKTKE 1094 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L IK++S G +T + +KY + +F + + K+IV +TDG Sbjct: 1095 TLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG---- 1144 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFS 359 + E + G ++AIG+ E + + P+ + V++ +DAF Sbjct: 1145 -RSQDDVNKISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFK 1199 Query: 360 HIGKDIVTKRI 370 I +++T Sbjct: 1200 KIEDELITFVC 1210 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 77/226 (34%), Gaps = 49/226 (21%) Query: 167 QTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 A D+M+ +D SRSM + ++++ + + ++ ++ + G Sbjct: 91 NRTAGYDLMLAVDTSRSMTAEDFTVHGREVSRLSVLKGIMGKFVDG-RVGD------RIG 143 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ F + L + + + + + + G T G+ ++ Sbjct: 144 LIIFGDTSYVLSPLTFDRNAIHQLLDGIVPTLAGGGTAIGDGIGLGIKKL---------- 193 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------- 332 E ++++ +TDG+N L AK+ G +Y IG+ ++ Sbjct: 194 -RERPEGSRVLILVTDGKN--ETGTIPPLKAAQLAKQEGIRIYTIGVGSTKNRVRLLSPD 250 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + +++ + + + I + Sbjct: 251 LRTYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRIDE 296 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 110/309 (35%), Gaps = 48/309 (15%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 N + K + I K+ + + ++ ++ V+D+ + S + N Sbjct: 221 NLLTVDAEVKSENVPITGLNKDNFTIEIGSKKVNDVTVSDVGEGKYKLSFNPPKQDSNGN 280 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK-VNSQTDARLDMMIVLDVSRSME 185 Y ++ +YK + ++V+ +A ++M+V+D S SM Sbjct: 281 YDLNVYVKYK--------------KVTLSDSELNAVRYGEDNANANANVMLVIDRSGSM- 325 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----S 241 S + I A S N ++ ++ G+V+FS+ + L + Sbjct: 326 -----SGSPISSAKNSANLFIDYMEAEDMA------GVVSFSSSARYDYHLATLTPEVKN 374 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENL 300 +++KI + GV T G++Y N + N D N IV ++DG +N Sbjct: 375 SIKQKINSIYASGV-TAIGSGMRYGLNDLL---------NYGDPNNPWAIVLLSDGYQNS 424 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 + K VY +G+ + L A + +Y + + + Sbjct: 425 GENPNNVIPSI----KASNIQVYTVGLGPAVDQKLLGNIADQTGGKYYYSPTDSQLQEIY 480 Query: 359 SHIGKDIVT 367 + I I+ Sbjct: 481 NDIVGKIIG 489 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 87/247 (35%), Gaps = 29/247 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G IL + ++ L +G + VS ++ ++ L ++D ++ A + Sbjct: 14 FARDRGGNFAILFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGA- 72 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 +K D ++ + + + + + IV T+ ++ Sbjct: 73 ------IKEADANKSVQAFLDANSQAGILQADQIVLDRLIVNRTA-----------KTVQ 115 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A + + L F F M ++ +D +++ ++LD++ SM Sbjct: 116 ADAHVDVGLYFPIFGTG-------DMKRVAASTTALYSDKTVEVAMMLDITGSMAKR--G 166 Query: 191 SITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 + KI + ++ ++ N ++ +V +++ + L V ++ Sbjct: 167 KVDKIGDLKTAAKNAVQTMLQKQDPQNPRIRVAIVPYASGVN-AGKLAENVYAEKQASTE 225 Query: 250 LSKFGVS 256 L S Sbjct: 226 LPPVAGS 232 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 53/172 (30%), Gaps = 38/172 (22%) Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-----FDMQGMRQHCNTE 283 + L L I+ G T +++ Y + ++ Sbjct: 313 PDAKVIPLTADSDALLESIEDFRANGF-TAGAIAIQWTYYMLSPQWRTAIRNAGLGKGAS 371 Query: 284 DANYKKII---VFMTDGENL-------------STKEDQQSLYYCNEAKKRGAIVYAIGI 327 DA+ KKI + MTDG+ T + C+ K G ++ IG Sbjct: 372 DADPKKIAKVAILMTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIEIFTIGF 431 Query: 328 ----------RVIRSHEFLRACASPN------SFYLVENPHSMYDAFSHIGK 363 ++ L+ C+S + F+ V + DAF I + Sbjct: 432 DLDDKDMSTTERDQAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEIIR 483 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+ +D+S SM + ++ D+ ++ N L D + GL+ F Sbjct: 99 DLMMAIDLSGSMAIQDMQKSNGKMESRFDLVMRVANEFL-------DTRQGDRVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + +++ + ++ G T + A ++ G Sbjct: 152 TWAYLQTPLTFDIPTVKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 K +V +TDGEN Q L AK+ +Y IG+ + Sbjct: 204 ---KALVLLTDGENN--SGALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNT 258 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + F+ +N + + I + Sbjct: 259 SEDLDTEVLQKIATMTGGKFFRAQNSTDLKQVYESIDQ 296 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 2/61 (3%) Query: 311 YCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 C+ AK +G +Y I L CAS +S Y E + AF IG ++ Sbjct: 1 TCDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQ 60 Query: 369 R 369 Sbjct: 61 L 61 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 70/196 (35%), Gaps = 31/196 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM + + IT++ K + + + + GLV F Sbjct: 90 DLLLAVDISGSMREPDMVYNNRRITRLMAVKKVVGDFVARRQSD-------RLGLVLFGT 142 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + V +Q + G +T + + ++ E N Sbjct: 143 QAFLQAPLTFDVKTVQEMLIEAESGYAGEATAIGDAIALSIKRL-----------REQPN 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA-S 341 K++I+ +TDGEN + + + + A K +Y I SH + + Sbjct: 192 AKRVIILLTDGENTAGELGIATAT--DLAVKANTKIYTIAFSPYDREVDSHSMQQIAEQT 249 Query: 342 PNSFYLVENPHSMYDA 357 F+ N + + Sbjct: 250 GGEFFRARNTRDLEEI 265 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 44/209 (21%) Query: 173 DMMIVLDVSRSM---ESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM + +D I+++++ + + +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMAYEDMHWDEQPISRLELVKRLLGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRTWLDEAMIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 +++V +TDG N + D A + G +YAIGI Sbjct: 193 QSRVLVLITDGANNGGEIDPMVA--AQLAAEEGVRIYAIGIGADPRQSGVLGAFGFSALD 250 Query: 335 ----FLRAC--ASPNSFYLVENPHSMYDA 357 LRA A+ ++ N + Sbjct: 251 LDETSLRAIAEATGGEYFRARNQAELTQI 279 >gi|74203017|dbj|BAE26210.1| unnamed protein product [Mus musculus] Length = 791 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 72/188 (38%), Gaps = 27/188 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + A + + + P V+ G + F + Sbjct: 48 AAVDILFLLDGSHSIGK------GSFERSKRFAIAACDALDISP---GRVRVGALQFGST 98 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ IK + G T + LK R + + Sbjct: 99 PHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRL---------SRGFPGGRNGSV 149 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SF 345 +I++ +TDG++ + ++RG +V+A+G+R R E L + P Sbjct: 150 PQILIIITDGKSQGP-----VALPAKQLRERGIVVFAVGVRFPRWDELLTLASEPKDQHV 204 Query: 346 YLVENPHS 353 L E Sbjct: 205 LLAEQVED 212 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 523 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 573 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 574 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 627 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 628 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 682 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 683 SLIHVAAYTDL 693 >gi|221123691|ref|XP_002160228.1| PREDICTED: similar to CnPolydom [Hydra magnipapillata] Length = 954 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 88/235 (37%), Gaps = 29/235 (12%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQ------TDARLDMMIVLDVSRSME---SFFD 189 + F N R + PI + D++I+LD S SM SF Sbjct: 14 INFIGGQKRCINKRPLTGPIPDTQLEFMNCWSNEVEHGASDLVILLDSSGSMYSTGSFQG 73 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS----HLQR 245 ++T D+ INA+L +V + + + G ++KI+ F+L S ++ Sbjct: 74 VTMTGFDIGKTFINALLSKVHISFNATRIA-IGTFGTNHKIDINFILRPDYSMHKCKFKK 132 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 + + +G TN L+ + N ++ H N ++++ ++DGE N+ Sbjct: 133 DFEKIRIYGGMTNLRGALQDSLNIFRELDSNPDTHKKRHKTN--RVVILLSDGEGNVMDN 190 Query: 304 EDQQSLY-----------YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + + + + + VY IG+ L A+ + +L Sbjct: 191 PNGRGVTHNDGLARNPHDIAHNLRLGLVEVYTIGVTSAPDRAVLEGLATEKNLFL 245 >gi|301781320|ref|XP_002926070.1| PREDICTED: collagen alpha-1(VII) chain-like [Ailuropoda melanoleuca] Length = 2994 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 75/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RGNFREVRGFLEGLVLPFSGAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 74/218 (33%), Gaps = 25/218 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P S ++ +D++ +L + + + +++ ++ + + Sbjct: 1035 VRGPEASVTQIPVCPHGLMDVVFLL-------HTTQDNAHRAEAVKRALERLVSALGPLG 1087 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQI 270 VQ GL+++S++ F L + + +KI+ + N + A+ + Sbjct: 1088 P--QAVQVGLLSYSHRPSPLFPLNSSYDPGVILQKIRNIPYVDPSGNNLGTAVVTAHRHL 1145 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + I+V + D EA+ G V +G Sbjct: 1146 LAPDAPGR-----RQHVPGIMVLLVD-----EPLRGDIFNPIREAQAAGLKVMILGQAGA 1195 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFSHIGKDI 365 + R + +F+ V++ S+ A S + + Sbjct: 1196 DPEQLRRLVPGTDPVQTFFAVDDGPSLERAVSSLATSL 1233 >gi|281343115|gb|EFB18699.1| hypothetical protein PANDA_015680 [Ailuropoda melanoleuca] Length = 2904 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 75/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + ++ ++ + V+ V +S+ Sbjct: 10 DIVFLLDGSSSIG---RGNFREVRGFLEGLVLPFSGAASAQG----VRFAAVQYSDDPRT 62 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 63 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 115 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 116 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 170 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 171 NDFSILRTLL-----PLVSRRVC 188 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 74/218 (33%), Gaps = 25/218 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P S ++ +D++ +L + + + +++ ++ + + Sbjct: 1007 VRGPEASVTQIPVCPHGLMDVVFLL-------HTTQDNAHRAEAVKRALERLVSALGPLG 1059 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQI 270 VQ GL+++S++ F L + + +KI+ + N + A+ + Sbjct: 1060 P--QAVQVGLLSYSHRPSPLFPLNSSYDPGVILQKIRNIPYVDPSGNNLGTAVVTAHRHL 1117 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + I+V + D EA+ G V +G Sbjct: 1118 LAPDAPGR-----RQHVPGIMVLLVD-----EPLRGDIFNPIREAQAAGLKVMILGQAGA 1167 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFSHIGKDI 365 + R + +F+ V++ S+ A S + + Sbjct: 1168 DPEQLRRLVPGTDPVQTFFAVDDGPSLERAVSSLATSL 1205 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 79/218 (36%), Gaps = 36/218 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +D+++ LD+S+SM D +++ A K + +++ +N+ + Sbjct: 73 ENKIIKKEGIDIVVALDLSQSMLQ-RDFKPNRLETAKKLLEEFIDK-----RIND--RIS 124 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 LV F L + + ++ L+ +++N+ + G+ + Sbjct: 125 LVVFGGDAYTKVPLTFDHNVVKDITSKLTTDDITSNNRTAIGMGL-------GVSLNRLK 177 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--------- 333 + K+I+ MTDGEN + + AK+ G +Y IGI Sbjct: 178 DSEAKSKVIILMTDGENN--SGEMSPMGASEIAKELGIKIYTIGIGAREIQIRVPFGHTT 235 Query: 334 --------EFLRACAS--PNSFYLVENPHSMYDAFSHI 361 L+ AS ++ + + F+ I Sbjct: 236 VKNTELDENLLKNIASTTGGEYFRAGSEKEFQEIFNRI 273 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 62/168 (36%), Gaps = 26/168 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ + + +E + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYADMQWDDEPISRLELVKRLLGDFIE-------GRRGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRTWLDEALIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +++V +TDG N + D A G +Y IGI Sbjct: 193 QSRVLVLITDGANNGGEIDPMVA--AQLAADEGVRIYTIGIGADPQQS 238 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 79/231 (34%), Gaps = 33/231 (14%) Query: 149 TNSRHIVMPITS------SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + + S++ S + L +++ +DVS SM + A K+ Sbjct: 58 LTPENFSLSVEGQPIKDFSLEPISSAKSPLSVVLAIDVSGSM------KGEPMAEARKAA 111 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 L+E++ V L+TF + ++ ++++L T Sbjct: 112 AIFLDELEKDDHVA------LITFGQGVYHLSDFAAKKHEVREQLQHLEAKEQWTWLYQA 165 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 A + R ++ +TDG++ + ++++ + K + Sbjct: 166 TLEAMERAVQAPTTRA-----------AVILLTDGKDEGSPVSEEAVL--DRIKGAQVPI 212 Query: 323 YAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 YA+G ++L+ AS +F + + + + + + I Sbjct: 213 YAMGFGSKAQVDYLQKVASASQGAFLSTPQAADLTNLYQTVLDYLKNQYIL 263 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 66/163 (40%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + + + + V L + G++ + Sbjct: 791 CKETPLELLFVIDSSESVG------LKNFQIIKNFVKTLTDRVAL---DLATTRVGIINY 841 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+EE L S L+R + + G T + L A N++F+ Sbjct: 842 SHKVEEVAHLTQFSSKDDLKRAVDNMQYLGEGTYTATALHAA-NRMFEAAKPG------- 893 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ A ++ IG+ Sbjct: 894 --VKKVALVITDGQ-TDTRDEKNLTEVVKNASDINVEIFVIGV 933 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 69/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + F K + S++ + ++ + + ++ + FS+ ++ Sbjct: 46 IDLVFIVDSSESSKIFL---FDKQKDFVGSLSDKIFQLTPVGSLKYDIKLAALQFSSSVQ 102 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++++K ++ G T S + A R + K Sbjct: 103 IDPPFSSWKDLQTFKQRVKSMNFIGQGTFSYYAIANA---------TRLLKREGRKDGMK 153 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + V MTDG + D QS+ +A+ G + IG+ + + LR Sbjct: 154 VAVLMTDGIDHPKNPDVQSIS--EDARTAGILFITIGLSAVVNETKLR 199 >gi|326672739|ref|XP_002664110.2| PREDICTED: vitrin [Danio rerio] Length = 868 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 73/197 (37%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ ++ + + E ++ + G+V ++ ++ Sbjct: 685 DIAFVIDGSSSVG------TGNFRTVLQFVANVTREFEISDTD---TRVGIVQYTYEQRL 735 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + L IK ++ + T++ + YA +Q+F N +KI Sbjct: 736 EFAFGQHNNKADLLNAIKRINYWSGGTSTGAAITYAADQLFS---------KSKPNKRKI 786 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLV 348 ++ +TDG + G I Y+IGI E P+ + V Sbjct: 787 MIVITDGR-----SYDDVRAPALAVHRTGVIAYSIGIAWAAQDELEYIATDPDREHSFFV 841 Query: 349 ENPHSMYDAFSHIGKDI 365 + ++Y S I +I Sbjct: 842 DEFDNLYKYVSKIIHNI 858 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 65/207 (31%), Gaps = 22/207 (10%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + P + F +T+ V + ++DM ++D S S+ Sbjct: 440 ETSHTRETPAESNPFESGFTSRVMDTTARAPEVPSQGDPNCKVDMAFLMDGSWSIGK--- 496 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 + + + + + + + G++ + + F L+ L+ I Sbjct: 497 ---RRFKIQKDFLVEVSQALNVGVAGA---MMGIIQYGDDPVTEFSLKQFSNSKDLKPAI 550 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + G ++ L Y Q F + + V + DG E+ Sbjct: 551 SKIVQKGGPSHVGKALSYINKQFFSDANGNRGGAPN------VAVVLVDGWPTDRVEEAS 604 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHE 334 L A++ G ++ + I +E Sbjct: 605 RL-----ARESGINIFFVTIEGPDDNE 626 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 76/211 (36%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+DVSRSM + D + ++I A + D+ + GL++F+ Sbjct: 90 VILVMDVSRSMNA-TDVAPSRIRAAQSAAKKF------ADDLTEGINLGLISFAGTPSTL 142 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + ++ + L T + G+ A +QI + + +A IV Sbjct: 143 VSPTPDHTATKKAVDKLV-LADKTATGEGIFAALDQIRTLNAVLGG---PEAAPPAHIVL 198 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI------RSHEF--------L 336 ++DG+ + + + +AK+ G V I L Sbjct: 199 LSDGKQTVPDEPTDPRGAFTAARKAKEEGIPVSTISFGTAYGTVELDGDRVPVPVDDPSL 258 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S +F+ + + + + + +I Sbjct: 259 KQIANLSGGNFFTASSLDELNEVYEKLQSEI 289 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 12/154 (7%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-------YNQIFDMQGMRQHCNT 282 + H + ++ L G++TN+ G+ +A + I+D Sbjct: 313 VNPIVPFITERRHFESTVQRLVP-GMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 371 Query: 283 EDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 D K +V +DG +L D++ C + K+ RG V + S +++ Sbjct: 372 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 431 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 CAS +Y V + S+ F I + +++ + Sbjct: 432 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLIE 465 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 70/206 (33%), Gaps = 40/206 (19%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 GG+++ + L + + + +V I+ + + + D +L A+ I++E Sbjct: 1 MRKQTGGISVFMLVLLMSMLVFAAWVTDVMRIYSVHNQMANATDAAL---ASAIISEVPE 57 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 + +L ++ + + R L + Q E ++ Sbjct: 58 STAVELLHANLTSGAASPYVEEVR--------------------LTHLRDEQEESLQVA- 96 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 L F + + + ++S +++ +LDVS SM S Sbjct: 97 -------LDFVPNSLNIAAQESVPIRTNAKAGISSNKA---EIVFMLDVSNSM------S 140 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ +++ A +++ + N Sbjct: 141 GEPMNKTKEALLAFADKLYARGNRNQ 166 >gi|254802534|sp|Q0PMD2|ANTR1_RAT RecName: Full=Anthrax toxin receptor 1; Flags: Precursor Length = 562 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + D+ +LD S S+ ++ I +E++ Sbjct: 20 VLVCAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 68 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 69 HRFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 127 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 128 QIYYENSQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 179 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 180 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 220 >gi|113205504|ref|NP_001037714.1| anthrax toxin receptor 1 [Rattus norvegicus] gi|111052876|gb|ABH03702.1| anthrax toxin receptor [Rattus norvegicus] gi|124297147|gb|AAI31854.1| Anthrax toxin receptor 1 [Rattus norvegicus] Length = 576 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + D+ +LD S S+ ++ I +E++ Sbjct: 34 VLVCAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 82 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 83 HRFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 141 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 142 QIYYENSQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 193 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 194 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 234 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 80/227 (35%), Gaps = 17/227 (7%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P+K +P V P + D +D+ VLD S S+ T + Sbjct: 31 PIKDINGLPAQ-GPTVSVRPGPDPSDKVTFQDCPVDIFFVLDTSESVALRVKPFKTLVTQ 89 Query: 198 AIKSINAMLEEVKLIP---DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++++ D N V +G + +S+++ L + L+ ++ + G Sbjct: 90 VKEFTKKFIDKLTSRYYRCDRNLVWNAGALHYSDEVILINSLTRDMKTLRDNVETVEYIG 149 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYC 312 T++ +K ++ ++ K ++ +TDG L + Sbjct: 150 KGTHTDCAIKRGIEEVLIGGSHQKEN--------KYLIVVTDGHPLEGYKEPCGGLEDAA 201 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF---YLVENPHSMYD 356 NEAK G V+++ I L AS S + + + D Sbjct: 202 NEAKHLGIKVFSVAISPNHLEPRLSVIASDASHRRNFTATSAVGLTD 248 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 62/179 (34%), Gaps = 24/179 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T LD++ VLD S S+ ++ ++ I +++ + V + G+ Sbjct: 632 CTCGPLDLLFVLDSSESIG------LSNFQISKDFILKVIDRLSRDEHVKFDADNSHVGV 685 Query: 224 VTFSN-KIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + +E + L+ +K L T + L + + + + Sbjct: 686 VQYSHGQTQEVVAMGDSSIQSIGQLKEAVKNLKWIAGGTWTGEALAFTKDNLLKRFTLE- 744 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 KKI + +TDG + ++ C V I S + ++ Sbjct: 745 ---------KKIALVLTDGHSDILRDKTPLNTLCEVTPVVSVGVGDIFQNAPNSDQLVQ 794 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 49/343 (14%), Positives = 113/343 (32%), Gaps = 51/343 (14%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL-- 116 + T LK + +I + LRD +N + ++ S+ Sbjct: 720 IAGGTWTGEALAFTKDNLLKRFTLEKKIALVLTDGHSDILRDKTPLNTLCEVTPVVSVGV 779 Query: 117 -DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD------ 169 DI N + K P + + + + +V + +D Sbjct: 780 GDIFQNAPNSDQLVQISCGGK-PYSKGLSLQRTSFAELLDDGFLHNVTSHMCSDRKCPDY 838 Query: 170 -------ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQS 221 D+ +++D S + + + + + E +K P + + Sbjct: 839 TCPITYEGPADITMLVDSSTRVGNQH------FQTSKSFVKLLAERFLKAKPPPSGSARV 892 Query: 222 GLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V +S + ++ ++ ++ L+ + + +TN L A +++ + Sbjct: 893 SVVQYSGQNQQIVEAQFLTNYTVLEVPVDNMQFINGATNVVSAL-RAVTELY-REDSLAG 950 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEF 335 N KK++VF +DG +T+E++ L +A+ G +Y + + Sbjct: 951 VN------KKLLVF-SDG---NTQEEKGLLKVVQDAQSAGIEIYVLAVGSRLNYPNLQVM 1000 Query: 336 LRACASP-------NSFYLVENPHSMYD--AFSHIGKDIVTKR 369 L A+ + V + S+ + I + I K Sbjct: 1001 LTGSAADIAGPFPEERLFRVPDYTSLLQGVRYQSISRRIALKS 1043 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 47/268 (17%), Positives = 83/268 (30%), Gaps = 54/268 (20%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLDVSRSM 184 A LK F+ ++ P T V +T+ +D+++ +DVS SM Sbjct: 45 FKATPSILPKLKPILFVLRMLAIMALITALARPQTKEVSTRIKTNKGIDIVMAIDVSASM 104 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 S D ++ K +E + GLV ++ + + S +Q Sbjct: 105 LSK-DLRPNRLTALKKVAAEFIEGRPSD-------RIGLVVYAGESFTKTPITTDKSIIQ 156 Query: 245 RKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +K + T GL A N++ D + K+I+ +TDG N Sbjct: 157 NALKDIKYKHGELIGGTAIGMGLATAVNRLKDSKAKS-----------KVIILLTDGVNN 205 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRSHEFLRAC---------- 339 + + Q A + G Y IGI R Sbjct: 206 AGFIEPQIAS--ELAVEYGIKTYTIGIGTNGMASTPVALNPDGTILFRNMQVEIDEKLLQ 263 Query: 340 ----ASPNSFYLVENPHSMYDAFSHIGK 363 + ++ N + + + I K Sbjct: 264 QIAKTTGGKYFRATNTKKLAEIYDEINK 291 >gi|45384490|ref|NP_990665.1| collagen alpha-1(XIV) chain precursor [Gallus gallus] gi|1705533|sp|P32018|COEA1_CHICK RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|288873|emb|CAA50064.1| collagen XIV [Gallus gallus] Length = 1888 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 50/288 (17%), Positives = 95/288 (32%), Gaps = 33/288 (11%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + D + I P + I I R K K Sbjct: 78 QNTATKAIIQGLIPDQNYALQIIAFSDDKE----SKPAQGQFRIKDIERRKETSKSKVKD 133 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P TN+ P +T A D++I++D S S+ F + + Sbjct: 134 PEKTNASK---PTPEGNLFTCKTPAIADIVILVDGSWSIGRFN----------FRLVRLF 180 Query: 206 LEEVKLIPDV-NNVVQSGLVTFS-NKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPG 262 LE + +V + + GL +S + E+ L +G + ++ L G +T + Sbjct: 181 LENLVSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLA 240 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L Y F + KI + +TDG K + + G + Sbjct: 241 LTYILENSFKPE------AGARPGVSKIGILITDG-----KSQDDVIPPAKNLRDAGIEL 289 Query: 323 YAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTK 368 +AIG++ +E + P+ Y V + + M + + + ++ Sbjct: 290 FAIGVKNADINELKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSR 337 >gi|288875|emb|CAA50063.1| collagen XIV [Gallus gallus] Length = 1857 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 50/288 (17%), Positives = 95/288 (32%), Gaps = 33/288 (11%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 + + D + I P + I I R K K Sbjct: 78 QNTATKAIIQGLIPDQNYALQIIAFSDDKE----SKPAQGQFRIKDIERRKETSKSKVKD 133 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P TN+ P +T A D++I++D S S+ F + + Sbjct: 134 PEKTNASK---PTPEGNLFTCKTPAIADIVILVDGSWSIGRFN----------FRLVRLF 180 Query: 206 LEEVKLIPDV-NNVVQSGLVTFS-NKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPG 262 LE + +V + + GL +S + E+ L +G + ++ L G +T + Sbjct: 181 LENLVSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLA 240 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L Y F + KI + +TDG K + + G + Sbjct: 241 LTYILENSFKPE------AGARPGVSKIGILITDG-----KSQDDVIPPAKNLRDAGIEL 289 Query: 323 YAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTK 368 +AIG++ +E + P+ Y V + + M + + + ++ Sbjct: 290 FAIGVKNADINELKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSR 337 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 45/234 (19%), Positives = 85/234 (36%), Gaps = 40/234 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSME---SFFDSSITKIDMAIKSINAML----EEVKLI 212 + V Q + + +++DVS SM+ + + +T++D+A + + +E+ Sbjct: 74 TEVTKKRQDRQGIAIQVLVDVSSSMDINMKYGEERLTRMDVAKIVVEKFIGGDGDELVGR 133 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 PD GL+TF+ + L L ++ ++ AY Sbjct: 134 PDD----LIGLITFARYADTIAPLSLAHEALISIVQDVTINTRPNEDG----TAYGDATA 185 Query: 273 MQGMRQHCNTEDANYK-KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--- 328 + + D + K KII+ +TDGEN + L + AK+ G +Y I I+ Sbjct: 186 LAAAQLDLLQGDQDIKSKIIILLTDGENNC--GNHLPLQAASLAKEWGIKIYTISIQNKP 243 Query: 329 -----------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + F L + S+ + I K Sbjct: 244 TPERKKTDQGTFFVPPTPSAGDQVLKKMAESTGGVFRLAHDYDSLKSVYKEINK 297 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 44/216 (20%), Positives = 79/216 (36%), Gaps = 32/216 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D +++ A KS +E + + GLV FS+ Sbjct: 88 IMVAVDVSLSMAAN-DIDPNRLEAAKKSAQGFVETLP------DRFNVGLVAFSSTATVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I+ L + G T G+ + I + + IV Sbjct: 141 SSPTHDHQAVIGSIENL-QLGPGTAIGEGVFASLESISSFDE-----DADVDPPPSAIVL 194 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DGEN S ++ Q++ A ++ V I + E LR Sbjct: 195 LSDGENTSGRDISQAVAM---AAEQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGL 251 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 AS FY E+ + + + IG + T+ + + Sbjct: 252 ASDTGGHFYEAESETELDEVYEDIGSSLGTELVHEE 287 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 62/451 (13%), Positives = 138/451 (30%), Gaps = 99/451 (21%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M SL+L + F N G + + AI + +G ++ S M+ L + +D + V Sbjct: 1 MRSLSLLLSRFRRNAGGNVAVTFAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASVG 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 A ++ + G I N N +D + Sbjct: 61 AVSR-TSPAFIAAGAMTTDGVIAA--GNDDARKIFNGNMSGTTGYTLDSLTPEVK----- 112 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + + +P F + + + + + +S+ K + +D ++LD Sbjct: 113 -KTGSVLTATVSFSATVPTLFMSIV----GYKTMSLQGSSTAKASMPK--YIDFYLLLDN 165 Query: 181 SRSM------------------------ESFFDSS-----------ITKIDMAIKSINAM 205 S SM + D++ T+ID+ + + Sbjct: 166 SPSMGVAATPADVTKMVSATSDKCAFACHDYNDANNYYNLAKTLGVTTRIDVLRSATQQL 225 Query: 206 LEEVKLIPDVNNVVQS-----GLVTFSNKIEEFFLLEWGVSHLQRK---IKYLSKFGVST 257 ++ + +N + G + + + F L ++ + I + +G + Sbjct: 226 MDTAQQTQTYSNQFRMAIYDFGASSKTIGLRALFALSSSLTSAKSAAGNIDLMGVYGNND 285 Query: 258 NSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKIIVFMTDG---ENLSTKEDQQSL--- 309 + T Y + + + K + F++DG E+ + ++ Sbjct: 286 SFTADKDTPYTTALPAINNEIATPGDGTSGSPLKYLFFVSDGVADESNAACLKPKASGNR 345 Query: 310 -------YYCNEAKKRGAIV---YAIGIRVIRSH-----------------------EFL 336 C K RG + Y +++ + + + Sbjct: 346 CQSPINPALCTALKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNM 405 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +ACAS ++ V + DA + + K V Sbjct: 406 QACASDGFYFEVSPTQGIADAMNALFKKAVA 436 >gi|332226724|ref|XP_003262542.1| PREDICTED: anthrax toxin receptor 1 isoform 2 [Nomascus leucogenys] Length = 333 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 12/154 (7%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-------YNQIFDMQGMRQHCNT 282 + H + ++ L G++TN+ G+ +A + I+D Sbjct: 313 VNPIVPFITERRHFESTVQRLVP-GMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 371 Query: 283 EDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 D K +V +DG +L D++ C + K+ RG V + S +++ Sbjct: 372 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 431 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 CAS +Y V + S+ F I + +++ + Sbjct: 432 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLIE 465 >gi|297266186|ref|XP_002799329.1| PREDICTED: anthrax toxin receptor 1-like [Macaca mulatta] Length = 484 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|158257658|dbj|BAF84802.1| unnamed protein product [Homo sapiens] Length = 333 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|114577872|ref|XP_001136716.1| PREDICTED: anthrax toxin receptor 1 isoform 5 [Pan troglodytes] Length = 321 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|114577870|ref|XP_001136388.1| PREDICTED: tumor endothelial marker 8 isoform 1 [Pan troglodytes] Length = 326 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|114577874|ref|XP_001136473.1| PREDICTED: tumor endothelial marker 8 isoform 2 [Pan troglodytes] Length = 297 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|114577864|ref|XP_525774.2| PREDICTED: anthrax toxin receptor 1 isoform 6 [Pan troglodytes] Length = 564 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62898107|dbj|BAD96993.1| tumor endothelial marker 8 isoform 3 precursor variant [Homo sapiens] Length = 317 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|16933551|ref|NP_444262.1| anthrax toxin receptor 1 isoform 2 precursor [Homo sapiens] gi|114577866|ref|XP_001136564.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Pan troglodytes] gi|332226726|ref|XP_003262543.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Nomascus leucogenys] gi|16566413|gb|AAL26496.1|AF421380_1 anthrax toxin receptor [Homo sapiens] Length = 368 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|16933553|ref|NP_060623.2| anthrax toxin receptor 1 isoform 3 precursor [Homo sapiens] gi|114577868|ref|XP_001136638.1| PREDICTED: anthrax toxin receptor 1 isoform 4 [Pan troglodytes] gi|15082333|gb|AAH12074.1| Anthrax toxin receptor 1 [Homo sapiens] gi|190690489|gb|ACE87019.1| anthrax toxin receptor 1 protein [synthetic construct] gi|190691863|gb|ACE87706.1| anthrax toxin receptor 1 protein [synthetic construct] gi|312150350|gb|ADQ31687.1| anthrax toxin receptor 1 [synthetic construct] Length = 333 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62988680|gb|AAY24067.1| unknown [Homo sapiens] Length = 234 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62870691|gb|AAY18344.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 4 [Homo sapiens] Length = 411 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62870685|gb|AAY18341.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 1 [Homo sapiens] Length = 387 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62870689|gb|AAY18343.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 3 [Homo sapiens] Length = 410 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|62870687|gb|AAY18342.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 2 [Homo sapiens] Length = 419 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|14149904|ref|NP_115584.1| anthrax toxin receptor 1 isoform 1 precursor [Homo sapiens] gi|332226722|ref|XP_003262541.1| PREDICTED: anthrax toxin receptor 1 isoform 1 [Nomascus leucogenys] gi|17366074|sp|Q9H6X2|ANTR1_HUMAN RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|14017381|gb|AAK52094.1| tumor endothelial marker 8 precursor [Homo sapiens] Length = 564 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + + D+ +LD S S+ ++ I +E++ Sbjct: 22 VLICAGQGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 130 QIYYENRQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 222 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 77/210 (36%), Gaps = 35/210 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ +K + L ++IVLD S SM S +K+++A ++ +E ++ Sbjct: 352 LPVEMEIKGK-EQLPSLGLVIVLDRSGSM------SGSKLELAKEAAARSVEMLRDEDT- 403 Query: 216 NNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 G + F ++ I E L I ++ G T L AY + D Sbjct: 404 -----LGFIAFDDRPWEIIETGPL-NNKEEAVDTILSVTP-GGGTEIYGSLAKAYENLAD 456 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 M+ R+H I+ +TDG++ D + K G + + I Sbjct: 457 MKLQRKH-----------IILLTDGQSQPGNYDD----LIEQGKDNGITLSTVAIGQDAD 501 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L A + FY V + ++ S Sbjct: 502 ANLLEALSEMGSGRFYNVIDEQTIPSILSR 531 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 77/218 (35%), Gaps = 35/218 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +D++I LD+S SM + D + ++ +A + I+ + + L Sbjct: 117 EHEGIDIVIALDLSDSMSNPMDGRRGLGLDRLTVAKQVIDEFIRR-------RPHDRIAL 169 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V F L + L+ I + G T GL + N++ Sbjct: 170 VGFGAHASTIAPLTLDHAVLRNLIVQVRLGVVDGQETAIGAGLGVSLNRL---------- 219 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------H 333 E KIIV +TDG + + D + A +RG ++Y + + Sbjct: 220 -KESQAATKIIVLLTDGVHNADGMDPDT--VAQTAAERGVVIYTVLMGQQTGDRSSVDAG 276 Query: 334 EFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + R A+ YL E+ ++ +F + + I Sbjct: 277 QLERLAGATDGYAYLAEDTQTLETSFQDLLDKLEKSSI 314 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N ++ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 252 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 68/197 (34%), Gaps = 26/197 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S S+ S + + ++L + + P + +V++S Sbjct: 31 QGKSDLVFLLDRSASVGS------ANFEAEKGFVESLLGQFSISPAS---TRVDVVSYSE 81 Query: 229 KIEE---FFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + H + I++++ TN+ L+ A N Sbjct: 82 DVVRHIDYIREPKNKCHFSQDIRHVTYRNSGKTNTNGALQEARNIFVGS----------R 131 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K++V ++DG++ E ++ G ++ I I + E S + Sbjct: 132 QDVHKVVVLLSDGQSN---TGGDPTTTAEELRQDGVEIFTIAIGLFNKDELNSIATSDHH 188 Query: 345 FYLVENPHSMYDAFSHI 361 + + S I Sbjct: 189 TFEYSSFREFKKLASRI 205 >gi|149727851|ref|XP_001493364.1| PREDICTED: anthrax toxin receptor 1 [Equus caballus] Length = 603 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLAHKFISPQ-LRMSFIVFSTR 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASEQIYYENSQGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GAIVY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 201 VNDG---FQALQGIIHSILKKSCIE 222 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 66/185 (35%), Gaps = 28/185 (15%) Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + N VQ + S K+ + L + + L+ + + G TN G+ +A + Sbjct: 417 QHMENNVQYCSSSASTKLGQIVPLSYNWTSLKSAVNAMEPTG-GTNQAIGMAWAVQSLIP 475 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENL---------------STKEDQQSLYYCNEAKK 317 + + Y ++I+ ++DG N D + C+ K Sbjct: 476 NGVLGAPAEDANTTYNRVIILLSDGLNTEDRWPDYGNGSTQASGNPIDARQALLCSNLKN 535 Query: 318 RG-------AIVYAIGIR----VIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDI 365 +Y I + + L+ CA SP+ FY++ + + F+ IG + Sbjct: 536 TKDSKGNAMYTIYTIQVNTSSPADPTSTVLQNCASSPDKFYMLTSSSQIVTTFNSIGTAL 595 Query: 366 VTKRI 370 R+ Sbjct: 596 SKLRV 600 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 72/223 (32%), Gaps = 26/223 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G + L AI L I +G I+ S ++ + +D + + + + Sbjct: 26 RRFSGDISGNIATLFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSRDL--- 82 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 ++ + D+ + + + D + + S Sbjct: 83 ----SQGTITAADVAAKASTYFKALYT--------STDAQSVAVTASYTASTSSSASNIQ 130 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 ++A + P T + + D ++ + + LD + SM + Sbjct: 131 LNASGQIVTQFMKLVGFPTMTFNTKATT---------TWGDVKMRVALALDNTGSM--AY 179 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +T + A+ +++++ + V L+ F+ + Sbjct: 180 SGKMTALQNAVAGSGGLIDQLSALAKSPGDVYISLIPFAKVVN 222 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 87/254 (34%), Gaps = 42/254 (16%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFD 189 + K + + W + P+ V +Q R D+M+VLD+S SM E D Sbjct: 46 TAPRNYFTKTLSILLWCALITALARPVWYGEPVTTQPKHR-DLMLVLDLSYSMSQEDMQD 104 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 SS ID ++ ++ + + + GLV F++ L + + ++ Sbjct: 105 SSGNYIDRL-TAVKNVVSQFAQQRKGD---RLGLVLFADHAYLQTPLTLDRNTISEQVNS 160 Query: 250 L--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L G T G+ A D +++++ ++DG N Sbjct: 161 LVLQLIGQKTAIGEGIGLATKTFIDSDA-----------PQRVMILLSDGSNT--SGVLD 207 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF----LRACA--SPNSFYL 347 + N AKK A +Y IG+ + + L + A + ++ Sbjct: 208 PIEAANIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLDEKTLMSIAKITGGQYFR 267 Query: 348 VENPHSMYDAFSHI 361 N + + I Sbjct: 268 ARNAQELATIYDTI 281 >gi|148669822|gb|EDL01769.1| von Willebrand factor A domain containing 2 [Mus musculus] Length = 748 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 72/188 (38%), Gaps = 27/188 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + A + + + P V+ G + F + Sbjct: 5 AAVDILFLLDGSHSIGK------GSFERSKRFAIAACDALDISP---GRVRVGALQFGST 55 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ IK + G T + LK R + + Sbjct: 56 PHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRL---------SRGFPGGRNGSV 106 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SF 345 +I++ +TDG++ + ++RG +V+A+G+R R E L + P Sbjct: 107 PQILIIVTDGKSQGP-----VALPAKQLRERGIVVFAVGVRFPRWDELLTLASEPKDRHV 161 Query: 346 YLVENPHS 353 L E Sbjct: 162 LLAEQVED 169 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 444 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 494 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 495 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 548 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 549 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 603 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 604 SLIHVAAYTDL 614 >gi|109733269|gb|AAI16637.1| Von Willebrand factor A domain containing 2 [Mus musculus] Length = 791 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 72/188 (38%), Gaps = 27/188 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + A + + + P V+ G + F + Sbjct: 48 AAVDILFLLDGSHSIGK------GSFERSKRFAIAACDALDISP---GRVRVGALQFGST 98 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ IK + G T + LK R + + Sbjct: 99 PHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRL---------SRGFPGGRNGSV 149 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SF 345 +I++ +TDG++ + ++RG +V+A+G+R R E L + P Sbjct: 150 PQILIIVTDGKSQGP-----VALPAKQLRERGIVVFAVGVRFPRWDELLTLASEPKDRHV 204 Query: 346 YLVENPHS 353 L E Sbjct: 205 LLAEQVED 212 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 523 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 573 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 574 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 627 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 628 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 682 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 683 SLIHVAAYTDL 693 >gi|26325252|dbj|BAC26380.1| unnamed protein product [Mus musculus] Length = 721 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 72/188 (38%), Gaps = 27/188 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + A + + + P V+ G + F + Sbjct: 48 AAVDILFLLDGSHSIGK------GSFERSKRFAIAACDALDISP---GRVRVGALQFGST 98 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ IK + G T + LK R + + Sbjct: 99 PHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRL---------SRGFPGGRNGSV 149 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SF 345 +I++ +TDG++ + ++RG +V+A+G+R R E L + P Sbjct: 150 PQILIIVTDGKSQGP-----VALPAKQLRERGIVVFAVGVRFPRWDELLTLASEPKDRHV 204 Query: 346 YLVENPHS 353 L E Sbjct: 205 LLAEQVED 212 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 523 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 573 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 574 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 627 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 628 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 682 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 683 SLIHVAAYTDL 693 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 76/212 (35%), Gaps = 24/212 (11%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M I A +P + V+G ++ S ++ K+ L D + + A ++ N Sbjct: 2 MAIGAASIIP-LVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPA- 59 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 +I + N ++ +F + + V +T +D + Sbjct: 60 --NIQDKADNFFDTNFPSGMYGTTNVGYTLSAGTATQMDGAATASVPTTLMKV------- 110 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 N I + + S ++ +D+++VLD+S SM S + +I Sbjct: 111 ----------FNVPQIDIAVNCSAEL---DLPNIDVVLVLDMSGSMNSNGTTGSKRITAL 157 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ + + V V+ G+V ++ + Sbjct: 158 KNAVFSFYDVVMAAKPAGTRVRIGIVPYNGAV 189 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 46/161 (28%), Gaps = 37/161 (22%) Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GMRQHCNTEDANYKKIIVFMTDGENLST 302 +I LS G T G+ +A I + ++FMTDGE ++ Sbjct: 466 SRINALSPKG-GTMHDIGMIWAGRLISPDGIFAADNASAPNGDPISRHVIFMTDGEMGAS 524 Query: 303 KEDQQSL--------------------------------YYCNEAKKRGAIVYAIGIRVI 330 + + C + + +++I + Sbjct: 525 PSNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAICKAIRNKNVTIWSIAFGLP 584 Query: 331 RSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + + CA+ + N + F I I R+ Sbjct: 585 HS-AYTQGCATGTSRALTAANSSELDSRFRDIAGSIAELRL 624 >gi|42741661|ref|NP_766428.2| von Willebrand factor A domain-containing protein 2 precursor [Mus musculus] gi|81893914|sp|Q70UZ7|VWA2_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|27657433|emb|CAD60277.1| AMACO [Mus musculus] gi|74225702|dbj|BAE21683.1| unnamed protein product [Mus musculus] Length = 791 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/188 (19%), Positives = 72/188 (38%), Gaps = 27/188 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ +LD S S+ + + + A + + + P V+ G + F + Sbjct: 48 AAVDILFLLDGSHSIGK------GSFERSKRFAIAACDALDISP---GRVRVGALQFGST 98 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ IK + G T + LK R + + Sbjct: 99 PHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALKRL---------SRGFPGGRNGSV 149 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SF 345 +I++ +TDG++ + ++RG +V+A+G+R R E L + P Sbjct: 150 PQILIIVTDGKSQGP-----VALPAKQLRERGIVVFAVGVRFPRWDELLTLASEPKDRHV 204 Query: 346 YLVENPHS 353 L E Sbjct: 205 LLAEQVED 212 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 523 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 573 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 574 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 627 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 628 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 682 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 683 SLIHVAAYTDL 693 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 49/241 (20%), Positives = 85/241 (35%), Gaps = 23/241 (9%) Query: 142 CTFIPWYTNSRHIVMPITSSVK--VNSQTDARLDMMIVLDVSRSMESFFDSSITK----- 194 P + + + +T T +D++ V+D S SM+ D + K Sbjct: 1443 KRIEPNDDGTYSLTLDVTGIEGNPATVTTKYPVDLVFVIDKSLSMDYDIDGNEIKWWDDE 1502 Query: 195 IDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 + +N LEE+ +Q FS +L+W Q+ + L Sbjct: 1503 TESRKDIVNDALEEIIPDLCSQQYDIQIAGYQFSGS--STRVLDWSREE-QQVLSGLKIA 1559 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 S+++ P A DM + +N KK ++FMTDGE + S Sbjct: 1560 RTSSSTEP--SQALADALDMLKTGSPAHRNQSNVKKYLIFMTDGEPTEP--EDWSYNAVR 1615 Query: 314 EAKKRGAIVYAIGIRVIRSHEFL----RACASPNSF----YLVENPHSMYDAFSHIGKDI 365 GA +Y IG+ S + S + + + + DAF+ I +I Sbjct: 1616 NHAVPGASIYTIGVSSDASTNLMEGIRSTALSNGMYAPATFKGTSAQLIRDAFTQIKDEI 1675 Query: 366 V 366 + Sbjct: 1676 I 1676 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 46/256 (17%), Positives = 91/256 (35%), Gaps = 25/256 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPI---TSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 P + + + + SV + +M V+D S SM+ F Sbjct: 1013 PATNHPEYRKYIKDNGNGTYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFG 1072 Query: 190 SSIT--KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQR 245 S + + ++ ++ ++ D + +Q G FS+ E + W + + Sbjct: 1073 SGNSDARREVVNSALELFFNQLS---DGDYNIQFGGYKFSDSGERVNFNDQGWETEYWET 1129 Query: 246 KI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN--LST 302 LS +++ T G Y + ++ + N K+ ++F+TDGE S Sbjct: 1130 DTSNALSHLKLTSRETDGSTYPSQTLRSAISALENVELGE-NGKRYLIFLTDGEPGQNSY 1188 Query: 303 KEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNSF--------YLVENP 351 ++ C A K G YAI + SH F+ + S + ++ + Sbjct: 1189 SFSEKEAENCYSAIKNLDSGTTFYAIQVANSDSHGFMESMVSNANSVDGVTAQKFVGNSA 1248 Query: 352 HSMYDAFSHIGKDIVT 367 + AFS + +I Sbjct: 1249 DELNAAFSQMAAEISG 1264 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 63/174 (36%), Gaps = 43/174 (24%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFF--DSSITKIDMAIKSIN---------------- 203 + + LD+++V+D+S SM+ DSS +++ + ++ Sbjct: 584 LGITGGDQQGLDIVLVIDLSNSMDDGISEDSSDSRLKVLKDTLGYYRTSYGSHGRPSTEG 643 Query: 204 --AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--------GVSHLQRKIKYLSKF 253 ++E+ + + +VT+S + L W G +++ I L Sbjct: 644 KEGFIDELFEQSPNS---RFSIVTYST--DASTELGWTEYGRNGSGQQTIKKAIGDLQAN 698 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 G TN GL A + + N +++F++DG+ D Sbjct: 699 G-GTNYEAGLYQAVEVL---KERGNSSNIP------VVIFLSDGKPTYYYSDVD 742 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 77/206 (37%), Gaps = 25/206 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ VLD S S+++ + T+I ++S+ ++ +++ + FS+ Sbjct: 23 HGAFDLYFVLDKSGSVKNHW----TEIYSFVESLAEKF--------ISPMLRMSFIVFSS 70 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++R + L + G T G K A QI+ H Sbjct: 71 RGTTIMKLTENREAIRRGLDTLKEELPGGDTFMHEGFKRANEQIY-------HETYGGVR 123 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +I+ +TDGE + + N A+ GAIVY +G++ + S + + Sbjct: 124 TASVIIALTDGELQDAQFYY-AEQEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 182 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V+ + A I+ K Sbjct: 183 PVKGG---FYALRGTIDSILKKSCIE 205 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 68/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPNRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 152 VSPTTNRDATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E L Sbjct: 208 FSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINGQRQPVPVDDETL 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + ++ + + I Sbjct: 268 KKVAQLSGGNAYNAATLAELKSVYASLQQQI 298 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 50/334 (14%), Positives = 102/334 (30%), Gaps = 39/334 (11%) Query: 45 FMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV 104 K L+S RS V +M +G ++ + + + Sbjct: 95 LAKDDLYSAPVRSGVPRILIVMTDGISSDDVAGPAKALRDMGVEIFALGIGKNYDQGQLD 154 Query: 105 NDIDDIVRSTSLD---------IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 + D + I + I +++ Sbjct: 155 SMGSDPKPDHVVTADFDKLDPVIQTIKDKACNGIDKEQTESHSASPVD--DGALKTKNQN 212 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +SSV V A++D+ +LD S S+E + + + IN ++ + D Sbjct: 213 RFRSSSVTVC---KAKVDVGFLLDGSGSVEFYAKGNFQRC---KNFINKFVKSFMVSKDD 266 Query: 216 NNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ GLV FS+ +E F + + + + + G+ T + GL A + ++ Sbjct: 267 SHF---GLVLFSSDSNVEFKFDDHYDAASITAAVNATNYPGMGTYAGKGLTLAKDDLYS- 322 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +I++ MTDG + G ++A+GI Sbjct: 323 -------APVRSGVPRILIVMTDG-----ISSDDVAGPAKALRDMGVEIFALGIGKNYDQ 370 Query: 334 EFLRACAS---PNSFYLVENPHSMYDAFSHIGKD 364 L + S P+ + + I Sbjct: 371 GQLDSMGSDPKPDHVVTA-DFDKLDPVIQTIKDK 403 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 71/201 (35%), Gaps = 25/201 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A++D+ +LD S S+E + + + IN ++ + D ++ GLV FS+ Sbjct: 3 KAKVDVGFLLDGSGSVEFYAKGNFQRC---KNFINKFVKSFMVSKDDSHF---GLVLFSS 56 Query: 229 K--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +E F + + + + G+ T + GL A + ++ + Sbjct: 57 DSNVEFKFDDHYDAASITAAVNATKYPGMGTYAGKGLTLAKDDLYS--------APVRSG 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 +I++ MTDG + G ++A+GI L + S P+ Sbjct: 109 VPRILIVMTDG-----ISSDDVAGPAKALRDMGVEIFALGIGKNYDQGQLDSMGSDPKPD 163 Query: 344 SFYLVENPHSMYDAFSHIGKD 364 + + I Sbjct: 164 HVVTA-DFDKLDPVIQTIKDK 183 >gi|327280282|ref|XP_003224881.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XIV) chain-like [Anolis carolinensis] Length = 1885 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 80/210 (38%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A+ D++ ++D S S+ D + KI I + + + + I + Q + Sbjct: 1031 EVCKAAKADLVFLVDGSWSIG---DDNFNKI---IGFLYSTVGALDKIGP--DGTQVAIA 1082 Query: 225 TFSNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ F L + L I+ ++ G +T + +K+ +F + + Sbjct: 1083 QFSDDPRTEFKLNAYRTKETLLEAIQQIAYKGGNTKTGKAIKHVQQVLFTID------SG 1136 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K++V +TDG E + G ++AIG+ E L + Sbjct: 1137 TRKGIPKVLVVITDGR-----SQDDVNKVSREMQLDGFSIFAIGVADADYSELLNIGSKP 1191 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + V++ +DAF I ++VT Sbjct: 1192 SERHVFFVDD----FDAFKKIEDELVTFVC 1217 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 73/201 (36%), Gaps = 26/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKI 230 D++I++D S S+ F + + LE + +V + + GL +S + Sbjct: 159 DIVILVDGSWSIGRFN----------FRLVRLFLENLVAAFNVGSEKTRIGLAQYSGDPR 208 Query: 231 EEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E+ L + + ++ L G +T + L + F + K Sbjct: 209 IEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTFILENNFKSE------AGARPGVPK 262 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYL 347 I + +TDG K + K G ++AIG++ E + P++ Y Sbjct: 263 IGILITDG-----KSQDDVIPPAKNLKDAGIELFAIGVKNADETELKEIASEPDNTHVYN 317 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 V + M + K + ++ Sbjct: 318 VADFSFMNSIVEGLTKTVCSR 338 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 75/207 (36%), Gaps = 31/207 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L+++ V+D S S+ + +K++ + + + + G++ FS+K Sbjct: 686 TPLELIFVIDSSESVG---PDNFISTKTFMKTV------IDEVLANHAKTRIGVINFSHK 736 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E LE + L+ + + G T + +K A N Sbjct: 737 VELVSSLEKYTTKESLKSAVDKMLYLGEGTYTASAIKKAINLFQAA----------RPAV 786 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRAC---AS 341 +K+ V +TDG+ +++ EA ++ IGI FL+ A+ Sbjct: 787 RKVAVVVTDGQ-ADARDEVHLDMVVREAHAANIEIFVIGIVQETDPHYDNFLKEMHLIAT 845 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDI 365 FY +E+ ++ + I Sbjct: 846 DPDEEHFYRIEDFKTLSALTDKLITKI 872 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 36/180 (20%), Positives = 69/180 (38%), Gaps = 17/180 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S ++ + S+ + ++K + V+ + FS+ + Sbjct: 9 IDIVFIVDSSESAKNQLFGLQ---KNFVLSLTDNIFQMKPVKSQKYDVKLAGMQFSSTVS 65 Query: 232 EFFL-LEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + W V + + KI+ L G T S + A Q+F +G K Sbjct: 66 VDHPFIAWKNVQNFKEKIRALVYIGQGTYSYYAISNA-TQLFKTEGREGSI--------K 116 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLV 348 + MTDG + + + A+ G + IG+ E LR + +SF V Sbjct: 117 VAFLMTDGVDHPNSPSVEGIATA--ARSLGIHFFTIGLSKKNVKEEKLRLISGDSSFKHV 174 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 53/422 (12%), Positives = 127/422 (30%), Gaps = 72/422 (17%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 G IL A+ +P +F + + + + K + + +++ A N+ Sbjct: 1 MKALRKQSGHAAILFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAH--ND 58 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRD----NGFVNDIDDIVRSTSLDIVVVPQN 124 N N++ G + +I + ++ +++ + D I + P+ Sbjct: 59 DNKNSQGSGSGSAVNRKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECVAGLARGEPRF 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y + A SR+ + IP + + + +D++ V D S SM Sbjct: 119 FQYEVEATSRHVSWFPGDSSIPGFGKT----FDAKGAATARKYQSEAVDILFVADYSGSM 174 Query: 185 --ESFFDSSITKIDMAIKSINAMLEEVKLI--PDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S+ ID I + +E++ + + G+ F+ + Sbjct: 175 AGGWNGGSNRKYID-LRNIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYTKTKPTNRSNS 233 Query: 241 SHLQRKIKYLSKFGVSTNS------------------------------TPGLKYAYNQI 270 + + + + T + N Sbjct: 234 CFMTQLVYNNNYNINYTKTVNNIFNEKNNKYCVSHSDSSRFRDIDLTDNYSSFNTTVNGF 293 Query: 271 FDMQGM--------RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-----CNEAKK 317 + G + N +++++ ++DG++ T + C + K+ Sbjct: 294 YPNHGTASYQGIMRGAQMLKKGTNPRRLLIVLSDGDDSGTSQKNIHKQLVNAGMCTKIKQ 353 Query: 318 R------------GAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHS-MYDAFSHIGK 363 A + +G ++ LR CA + + +N + I + Sbjct: 354 ELSTGISSSGQSIKARLAVVGFDYNVNNNTALRDCAGAENVFKAQNTDDILNKILELITE 413 Query: 364 DI 365 +I Sbjct: 414 EI 415 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N ++ K + GL+ F Sbjct: 93 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGD-------RVGLILFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 146 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 197 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 198 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 252 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 253 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|281347736|gb|EFB23320.1| hypothetical protein PANDA_001404 [Ailuropoda melanoleuca] Length = 708 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 44/190 (23%), Positives = 76/190 (40%), Gaps = 25/190 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ ++D S S+ + + + + + + P V+ G + FS+ Sbjct: 6 AAVDVLFLIDGSHSVGK------GSFERSKHFAIMVCDALDINP---ERVRVGALQFSSA 56 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ S ++ KIK + G T + LKY R +A+ Sbjct: 57 PRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLR--------RGFPGGRNASV 108 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+V +TDG + E + K+RG V+A+G+R R E L AS Sbjct: 109 PQILVVITDGRSQGPVE-----LPAKQLKERGVTVFAVGVRFPRWEE-LHTLASEPREQH 162 Query: 348 VENPHSMYDA 357 V + DA Sbjct: 163 VLMAEQVDDA 172 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 55/135 (40%), Gaps = 19/135 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ S + + + + L DVN +V Q GLV Sbjct: 482 GCQAQSLDLLFMLDASASVGSENFAQMQS----------FVRSLTLQFDVNPDVTQVGLV 531 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ G + + R + G ++ L + Y+++ +Q Sbjct: 532 VYGSRVQTAFGLDTHLGRAAVLRAMSQAPYLGGVGSAGTALLHVYDKVMTVQ------RG 585 Query: 283 EDANYKKIIVFMTDG 297 K +V +T G Sbjct: 586 ARPGVPKAVVLLTGG 600 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 81/238 (34%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P V + T +D++I +DVS SM + D +++ + + + + Sbjct: 74 LARPRNVDVSTRTNTTRGIDIVIAIDVSASMLA-RDLKPNRLEALKEVASQFIADRPSD- 131 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 + GLV ++ + + S + + + T GL + N++ Sbjct: 132 ------RIGLVEYAGESYTRTPITSDKSIVLSSLNDIQYNSIIEGGTAIGMGLATSVNRL 185 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 D + K+I+ MTDG N + + + A++ G VY IG+ Sbjct: 186 KDSRAKS-----------KVIILMTDGVNNAGFIEPSTAS--ELAQEFGIKVYTIGLGTN 232 Query: 329 ---------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + S+ + ++ I K Sbjct: 233 GTALSPVALRPDGSFQYGSIPVEIDEALLQEIADKTGGLYFRATDNESLEEIYAEINK 290 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 71/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVAPNRMAAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + G+ A I + + D IV Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEGIFTALQAIATVGAV---IGGGDKPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E L Sbjct: 208 FSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETL 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + + ++ + + I Sbjct: 268 KKVAQLSGGNAYNAASLQELKSVYATLQQQI 298 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 63 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 122 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 123 HYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 179 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 180 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 234 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 235 ATDHTYRRNFTAADWGHSRDAEEVISQTI 263 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 31 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 90 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 91 HYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 147 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 148 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 202 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 203 ATDHTYRRNFTAADWGHSRDAEEVISQTI 231 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 43/303 (14%), Positives = 114/303 (37%), Gaps = 16/303 (5%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R+ + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 107 LRSLERDRGGNVGIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 166 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNM-SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + K+ ++ N++ + + ++ + V +T + I E Sbjct: 167 TEDTDALKEKVSDWFHAQVDNSYTLGDIDIDTVNHNITATANGTVPTTFMKI---ANIET 223 Query: 127 YSISAISRYKIP-LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI------VLD 179 +S S K P + S +++ T+S + + + Sbjct: 224 VPVSVASAVKGPATSYLNVYVVIDTSPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVG 283 Query: 180 VSRSMESFFDSSITKI----DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++ ++ S+ KI D+A ++ +L+ + + + ++ GL + + + E Sbjct: 284 KTKYANNYAYSTAKKIKLRADVAGDAVREVLDMIDESDENHERIKVGLYSLGDTLSEVLA 343 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-NYKKIIVFM 294 + ++ S S S + + Q + + + + K+++ + Sbjct: 344 PTLSTDTARTRLADASYGLTSATSKAATYFDVSLATLKQKVGAGGDGTSSGSPLKLVLLL 403 Query: 295 TDG 297 TDG Sbjct: 404 TDG 406 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 31 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 90 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 91 HYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 147 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 148 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 202 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 203 ATDHTYRRNFTAADWGHSRDAEEVISQTI 231 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 608 CTCGPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAGV 661 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + +K L T + L+Y +++ Sbjct: 662 VQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLL------- 714 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 715 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCG----ADIQVVSVGIK 756 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + + V+ +V +S + Sbjct: 823 SPADITILLDSSASVGSH---NFETTKVFAKRLAERFLSAGRADPSQD-VRVAVVQYSGQ 878 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + +T+ L Y R + Sbjct: 879 GQQQPGRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYREASS 929 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + + EA++ G ++ + + Sbjct: 930 GATKKRVLLFSDGNSQGATAEAIEKAV-QEAQRAGIEIFVVVVGP 973 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 79/218 (36%), Gaps = 26/218 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + D+M+V+D+S SM + F KI + + + K + Sbjct: 92 EPQVREQLGRDVMVVVDLSGSMAEQDFTSKQGKKISRLQATKEVLADFAKTRKGD----R 147 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQI-FDMQ 274 GL+ F + F+ + ++ L++ G ST+ + A + Sbjct: 148 LGLILFG---DAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSGK 204 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-- 332 M D +K+++ +TDG T + + AK +G ++ I + ++ Sbjct: 205 QMSAEQAQNDIEREKVVIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQTVG 262 Query: 333 -----HEFLRACASP--NSFYLVENPHSMYDAFSHIGK 363 E ++ AS + N + A+ IG+ Sbjct: 263 EVALDMETIKRIASESGGEAFEALNRDELSTAYEQIGQ 300 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 60/154 (38%), Gaps = 12/154 (7%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-------YNQIFDMQGMRQHCNT 282 + H + ++ L G++TN+ G+ +A + I+D Sbjct: 329 VNPIVPFITERRHFESTVQRLVP-GMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRRY 387 Query: 283 EDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 D K +V +DG +L D++ C + K+ RG V + S +++ Sbjct: 388 SDETSNKYLVMFSDGNHLIDPAFRDKKMKLICTQLKQPGRGVKVMTVNFGGAASERLMQS 447 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 CAS +Y V + S+ F I + +++ + Sbjct: 448 CASGPEYYHVASLFSVEKVFEQIAEQVISSSLIE 481 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 75/216 (34%), Gaps = 44/216 (20%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 F+ N+R GG+++ + L + + +++V I+ + + + D +L Sbjct: 11 FTKGDNMRK----QTGGISVFMLVLLMSMLVFAAWVMDVMRIYSVHNQMANATDAAL--- 63 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 A+ I++E + +L ++ + + R L + Sbjct: 64 ASAIISEVPESTAVELLHANLTSGAASPYVEEVR--------------------LTHLRD 103 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 Q E + L F + + + ++S +++ +LDVS Sbjct: 104 EQEESLQVV--------LDFVPNSLNIAAQESVPIRTNAKAGISSNKA---EIVFMLDVS 152 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 SM S ++ +++ A +++ + N Sbjct: 153 NSM------SGEPMNKTKEALLAFADKLYARGNRNQ 182 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 76/218 (34%), Gaps = 35/218 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S +++ +D+M+ LD+S+SM +++D A +E + Sbjct: 86 SERLSPGEAKGIDIMLALDISKSMLQEDFDGKSRLDAAKTVALQFIE-------NRRRDR 138 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 GLV F K L L ++ S T + + A N++ Sbjct: 139 IGLVLFKGKSFTQCPLTLDHDVLSMLVRAASVDAVPESGTATGSAILIAVNRL------- 191 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHE 334 + +++++ +TDGE+ + + D + A G +Y + V + Sbjct: 192 ----RASESPERVLILLTDGEHNAGEVDPVTA--AGIAAGEGVRIYMATVSVPGSRSGED 245 Query: 335 FLRACA---------SPNSFYLVENPHSMYDAFSHIGK 363 L + + + + +S+ FS I + Sbjct: 246 MLASARDLSGEVSRITGGRSFRANDANSLNRTFSEIDQ 283 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 83/203 (40%), Gaps = 31/203 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L+++ V+D S S+ ++ +N +++ + P V + G+V +SN Sbjct: 757 PLELVFVIDSSESVGP------ENFEVIKDFVNTLIDRTSVSP---EVTRVGIVLYSNIN 807 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ ++ ++ ++ + G T + G++ A N++F + Sbjct: 808 LLVTNIQDRLTRDEVKAAVRRMPYIGEGTYTGSGIRKA-NEMFAFA---------RPGVR 857 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------HEFLRACASP 342 K+ + +TDG+ ++ + EA ++AIG+ + L++ ASP Sbjct: 858 KVAMVITDGQ-TDHRDTVKLEDAVREAHLDNITMFAIGVVNQSDPIYDDFKQELKSIASP 916 Query: 343 ---NSFYLVENPHSMYDAFSHIG 362 + VE+ ++ AF G Sbjct: 917 PTEEHMFSVEDFRMLHGAFEESG 939 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 84/216 (38%), Gaps = 22/216 (10%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +R +++ L++ +LD S S + ++ ++S + L E+ Sbjct: 22 QNRRQSGETPNNLTTKQDNACSLEVAFILDSSESAKGLL---FSRQKEFVRSFSRRLMEM 78 Query: 210 KLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + + + L+ +S+ I + F + +++ S G T ST + A Sbjct: 79 Q-VSGWHLRTRLALIYYSSSVHINQHFNDWQDLDVFLDQLEDASYIGQGTYSTYAISNA- 136 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG- 326 Q+F + Q ++ + MTDG + D ++ EAK ++AIG Sbjct: 137 TQLFIRETSGQSV--------RVSLLMTDGSDHPRNPDIMTVV--AEAKSHNIKIFAIGL 186 Query: 327 -IRVIRSHEF-LRACASP--NSFYLVENPHSMYDAF 358 +R + S+ LRA AS ++ + + Sbjct: 187 SMRAMDSNSAKLRAVASSPAQQYFHSLTDRGLEERL 222 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 77/216 (35%), Gaps = 32/216 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + +D+++ LD S SM S + +++ ++ K + + +K I D Sbjct: 82 SDPRNRNGIDIVLSLDGSGSMNASGFSKEEPRLSRFEVVQKIASDFV--MKRIEDN---- 135 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMR 277 G+V F + + + + I YLS G +T G+ + D + Sbjct: 136 -VGVVLFGDFAFIATPVTYEKEIVSEMIGYLSHGMAGQNTAIGEGIAMGVRALRDSKAKS 194 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEF 335 K+I+ +TDGE+ S K +Y IGI + Sbjct: 195 -----------KVIILLTDGEHNSGS--ISPKEAVAMVGKEHIRLYTIGIGQKGEFDNAL 241 Query: 336 LRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKR 369 L+ A F+ N + + I D++ + Sbjct: 242 LKQLAHDGHGKFFAAANEKELQSVYDEI--DMLERS 275 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 84/207 (40%), Gaps = 27/207 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++++D S S+ S + K +L+ + L+P + V +VTF++ + Sbjct: 67 MDLLLLVDDSASIGSRNFEQVRKF---------VLDFIDLVPISSEEVHLSVVTFADSPQ 117 Query: 232 EFF----LLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F + KYL + G ST + GL A + + AN Sbjct: 118 DVFTFKQPQATNKQLAKEAFKYLRYRRGGSTATDKGLIRARRYLT------RPVYGTRAN 171 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA----SP 342 K++V MTDGE+ ++ ++A+ G V+ +G+ + E C P Sbjct: 172 VPKVLVLMTDGESDR---HYDTIQAADQARAEGISVFVVGVGMANPVECRGVCGCGRYGP 228 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKR 369 +++ N + + I ++ K Sbjct: 229 CPQFIMSNWNELVQTVDSIMGEVCKKL 255 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 82/238 (34%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +V P T V ++T +D+++ +DVS SM + D +++ K +E Sbjct: 75 LVRPRTVDVSTKTKTTRGIDIVMSIDVSASMLAK-DLLPNRLEALKKVAADFIE------ 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + GLV ++ + + S + R IKY + T GL + N++ Sbjct: 128 -GRPNDRIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKYNTIIEGGTAIGMGLATSVNRL 186 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 D + K+I+ +TDG N D + A + G VY IG+ Sbjct: 187 KDSRAKS-----------KVIILLTDGVNNGGFIDPKIAS--ELAVEYGIKVYTIGLGTN 233 Query: 329 ---------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + ++ N + + I K Sbjct: 234 GTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVTGGKYFRATNNKKLAQIYDEINK 291 >gi|47219917|emb|CAF97187.1| unnamed protein product [Tetraodon nigroviridis] Length = 1251 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 93/267 (34%), Gaps = 25/267 (9%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 ++ K G C + N + EL N V ++ ++ + ++ +V Sbjct: 36 LIGSPLLGQPAKRTGDVYKCPVDKEHNTCVKLELPKNTTVPNLHEVKENMAMGTTLVTNP 95 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 G ++ +Y +I + + +S+ + LD++I+LD S S+ Sbjct: 96 SGGFLACGPQYGYMCGQQQYISGVCANVSSSFQVLNSIASVQECAKELDIVILLDGSNSI 155 Query: 185 ESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVS 241 + SI LE +K I + Q G+V++ + L + Sbjct: 156 YPWP------------SITDFLERFIKTIDIGPKLSQVGIVSYGETVTHNVNLSQFDNTA 203 Query: 242 HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L ++ L + G T + G+ KK++V +TDGE+ Sbjct: 204 ALMDFVRQLPQQTGFKTMTFMGINT------TRLEAFLPERGARQGVKKVMVIVTDGESH 257 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + C+ + G + I + Sbjct: 258 DFHALDEVIAACD---RDGIERFGIAV 281 >gi|297463635|ref|XP_002702824.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] Length = 2933 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + + ++ V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG------RSNFREVRGFLEGLVLPFSGAAGAQG-VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + +++F Q R K+ Sbjct: 91 EFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDMVDTAAQRLKGQGVKLFAVGIKNADPEELKRIASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|297488708|ref|XP_002697097.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] gi|296474920|gb|DAA17035.1| collagen, type VII, alpha 1 [Bos taurus] Length = 2932 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + + ++ V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG------RSNFREVRGFLEGLVLPFSGAAGAQG-VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + +++F Q R K+ Sbjct: 91 EFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDMVDTAAQRLKGQGVKLFAVGIKNADPEELKRIASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|119585300|gb|EAW64896.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_d [Homo sapiens] Length = 2978 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|119585298|gb|EAW64894.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_b [Homo sapiens] Length = 2609 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|119585297|gb|EAW64893.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_a [Homo sapiens] Length = 2944 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|119585301|gb|EAW64897.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_e [Homo sapiens] Length = 2597 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|495866|gb|AAA58965.1| collagen type VII [Homo sapiens] Length = 2912 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|4502961|ref|NP_000085.1| collagen alpha-1(VII) chain precursor [Homo sapiens] gi|1345650|sp|Q02388|CO7A1_HUMAN RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor gi|987125|gb|AAA75438.1| alpha-1 type VII collagen [Homo sapiens] Length = 2944 Score = 71.0 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 76/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|260813598|ref|XP_002601504.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] gi|229286801|gb|EEN57516.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] Length = 400 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/189 (20%), Positives = 83/189 (43%), Gaps = 24/189 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMA-IKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 LD++ ++D S S+ + + K+ + K+++ + I Q G++ +S Sbjct: 221 GNSLDIIYLVDGSGSVGAN---NFEKVKLFIKKAVSGFV-----IGPAA--TQVGVIQYS 270 Query: 228 NKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 +KI + F + VS L I + T + ++YA F + + Sbjct: 271 SKIRQEFSMNSFQTVSGLLGAIDAMEYMQGGTLTGRAIRYASKYGFSV------FDGARR 324 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K++V +TDG D+ ++ EA+++G VYAIG+ + + + ++ S Sbjct: 325 GVPKVLVVVTDG----VSSDEVAI-PALEAQRQGIFVYAIGVSNYDAEQLQKIASTNESS 379 Query: 346 YLVENPHSM 354 +V+N + + Sbjct: 380 AMVDNFNLL 388 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 78/215 (36%), Gaps = 25/215 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ D++ +LD S S+ + + + +++ ++ P Sbjct: 537 SAAAGGGCKTVHYDLVFLLDTSSSVGK------EDFEKVRQWVANLVDTFEVGP---ERT 587 Query: 220 QSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G+V +S++ F L ++ ++L+ G +TN+ L++ F Q Sbjct: 588 RVGVVRYSDQPTTAFELGLFGSREAVKAAARHLAYHGGNTNTGDALRFITRHSFSPQ--- 644 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 D +K++ + + G + D + + + G ++A+G+ E Sbjct: 645 AGGRPGDRAFKQVAILLPAGRSQDLVLDAAAAAH-----RAGIRIFAVGVGAALKEELEE 699 Query: 338 ACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P + V + ++A I + + Sbjct: 700 IASEPKSAHVFHVSD----FNAIDKIRGKLRRRLC 730 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 70.6 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N ++ K + GL+ F Sbjct: 99 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGD-------RVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 152 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 204 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNT 258 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 259 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|224046671|ref|XP_002199336.1| PREDICTED: collagen, type XIV, alpha 1 (undulin) [Taeniopygia guttata] Length = 1883 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 27/226 (11%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + P +T A D++I++D S S+ F + + LE Sbjct: 134 PEKTNASKPAPEGSLFTCKTPAIADIVILVDGSWSIGRFN----------FRLVRLFLEN 183 Query: 209 VKLIPDV-NNVVQSGLVTFS-NKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKY 265 + +V + + GL +S + E+ L +G + ++ L G +T + L Y Sbjct: 184 LVSAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTY 243 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 F + KI + +TDG K + + G ++AI Sbjct: 244 ILENSFKPE------AGARPGVSKIGILITDG-----KSQDDVIPPAKNLRDAGIELFAI 292 Query: 326 GIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G++ +E + P+S V N F H + +T+ + Sbjct: 293 GVKNADINELKEIASEPDST-HVYNVADFN--FMHTIVEGLTRTVC 335 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 80/212 (37%), Gaps = 25/212 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S K++ + +D+MI LD+S SM + +D ++ ++ + +++ Sbjct: 79 SKYKLSPDSTKGVDIMIALDISGSMVNSYDFLPRNRLSVSKDLLREFVKK-------RLY 131 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+V F+ L L I T+S ++ + D + Sbjct: 132 DRIGIVVFAGAAYLQSPLSSDRFALDELI-------AGTSSED-IEEQGTAVGDALVLSS 183 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----- 333 + K+I+ +TDG + + K D + Y K G VY IGI Sbjct: 184 YRLKNSEAKSKVIILLTDGVSNTGKLDPDTAAY--TTKTMGIKVYCIGIGKEEGQYEINY 241 Query: 334 EFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 E L+ +S F+ E+P + + I + Sbjct: 242 ESLQKISSNTNGKFFRAESPEVLESVLNEIDQ 273 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 83/210 (39%), Gaps = 35/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S +I++++ A + ++ ++ K + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSKAGENISRLNAAKEVLSDFVKTRKGD-------RLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + ++ L++ G ST+ + A ++F+ R Sbjct: 160 ---DAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAMGLAI-KVFEQSKSRIGV-- 213 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 + N +K+ + +TDG T + + AK +G ++ I + + L Sbjct: 214 -EENKEKVAIVLTDG--NDTGSFVEPIEAAKVAKAKGVRIHVIAMGDPQTLGEAALDMKT 270 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IG+ Sbjct: 271 IRRIAKESGGKAFEAMNRDELAKAYDDIGR 300 >gi|332215868|ref|XP_003257064.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VII) chain-like [Nomascus leucogenys] Length = 2944 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 77/203 (37%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRSFLEGLVLPFSGAVSAQG----VRFATVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + R I+ LS G +T + + + +++F Q R K+ Sbjct: 91 EFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVFLPQLARPGI-------PKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 47/300 (15%), Positives = 95/300 (31%), Gaps = 43/300 (14%) Query: 81 DILCRIKNTWNMSFRN--ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 +K T++ S + + + DI + D V P + + Sbjct: 555 FDTATMKYTFDESLLDTPDFEPAVYYCDIVNQKMVLLDDQVYDPVTCTVTAKTTHFSRY- 613 Query: 139 LKFCTFIPWYTNSRHI-VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + T I + NS +D+ +V+D S SM ++ K Sbjct: 614 -----ILLNKTAFEKIWDKDFAGTSVDNSGKTVAMDIALVIDSSGSMTWNDPKNLRK--- 665 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 + ++++ I + ++ F + + L + L I + G T Sbjct: 666 --DAAKEFVDKLSSIDEAA------IIDFDSSSKINRNLTSNRTLLYSAIDDIDSSG-GT 716 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + T G+ + KKI++ +TDG+ K +A Sbjct: 717 SLTAGVSKGLEALSKSND------------KKIMILLTDGKGPYDKS------LTTQAIN 758 Query: 318 RGAIVYAIGIRVIR--SHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G +Y IG+ L + A+ +Y + + +F ++ D+ K D Sbjct: 759 AGVTIYTIGLGTNNDIDQPLLNSIATETGGKYYHAKKDIDIQGSFDNVSGDLGNKDSDED 818 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 67/211 (31%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPNRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + D IV Sbjct: 152 VSPTTNREATKAALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E + Sbjct: 208 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETM 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + + + + I Sbjct: 268 KKVAQLSGGNSYNAATLAELNSVYVSLQQQI 298 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 70/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS SME+ D T++ A + + E + V GL++F+ Sbjct: 91 VMLVIDVSLSMEA-TDVKPTRLRAAQDAARSFAEGLTPG------VNLGLISFAGTATVL 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I+ L ST + G+ A I + IV Sbjct: 144 AAPTTEREGVVHAIENLK-LAQSTATGEGIFAALQAIESFSAVIGGAEGPPPAR---IVL 199 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 MTDG+ +++ + AK++G + I + Sbjct: 200 MTDGKQTVPQDEYAPRGAFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASM 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 R A S FY + + +G+ I Sbjct: 260 REIARLSGGDFYKAATAEELKQVYDSLGEQI 290 >gi|296221263|ref|XP_002756657.1| PREDICTED: von Willebrand factor A domain-containing protein 2 [Callithrix jacchus] Length = 725 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 84/218 (38%), Gaps = 31/218 (14%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTD-----ARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 +P + + + + + K+++ + A +D++ +LD S S+ + + Sbjct: 18 VPPSLSLQEVHVSRETIGKISAASKMMWCSAAVDILFLLDGSHSIGK------GSFERSK 71 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVST 257 + + + + P V+ G FS+ F L+ + ++ KIK + G T Sbjct: 72 HFAITVCDALDISP---ERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGRT 128 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + LK N + R +A+ +I++ +TDG K + K+ Sbjct: 129 ETGLALK---NLLH-----RGFPGGRNASVPQILIIITDG-----KSQGHVALPAKQLKE 175 Query: 318 RGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHS 353 +G V+A+G+R R E + P L E Sbjct: 176 KGVTVFAVGVRFPRWEELHMLASEPREQHVLLAEQVED 213 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 66/196 (33%), Gaps = 29/196 (14%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 S+ + Y P IP + P LD++ +LD S S+ Sbjct: 493 SLKHVMVYSDPQDLLNQIP-ELQGKLCSQPRPG------CKTRALDLVFMLDTSASVGPE 545 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQR 245 + ++ ++S E +V Q GLV + ++++ F L+ + + R Sbjct: 546 ---NFAQMQSFVRSCALEFEV------NPDVTQVGLVVYGSQVQTAFGLDTKPTRAAVLR 596 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 I G ++ L + Y+++ +Q K +V +T G Sbjct: 597 AISQAPYLGGVGSAGTALLHIYDKVMTVQ------RGARPGVPKAVVVLTGGRGA----- 645 Query: 306 QQSLYYCNEAKKRGAI 321 + + + + G Sbjct: 646 EDAAVPAQKLRNNGIS 661 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 74/187 (39%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + + V+ + ++ + G+ T+S ++ Sbjct: 341 RIDLLFLLDSSAG---------TTLDGFLRTKAFVKQFVQAVLSDDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L R + + G T + L+ A +G T + Sbjct: 392 LVAVPVGEYQDMPDLVRSLDGIPFRGGPTLTGSALRQA-----AERGFGSVTRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ ++ + + + + RG ++ +G +R+ E S + Sbjct: 447 RVVVLLTE-----SRAEDEVAGPAHHVRARGLLLLGVGSEAMRA-ELEEITGSLKHVMVY 500 Query: 349 ENPHSMY 355 +P + Sbjct: 501 SDPQDLL 507 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 74/209 (35%), Gaps = 21/209 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 K++ D+++ +D S SM S F D ++ I ++ + +++ Sbjct: 45 KLSPNNRHGKDIVLAIDASGSMNSSGFDFEDEVSDGKRLSRFEITKIIASEFIQKRISDN 104 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V G+V + + + + + + + YL++ N+ G A Sbjct: 105 V--GVVLYGDFAFIASPITYEKEIVTQMLGYLTQGMAGQNTAIGEAIAM---------GV 153 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFL 336 K+IV ++DGE+ S AK++G +Y I + L Sbjct: 154 RSFKHSKAKTKVIVLLSDGEHNSGS--VSPKEATELAKEQGIKIYTIAMGNKGEADEALL 211 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGK 363 A S F+ + + + + I K Sbjct: 212 ETIAKDSNGEFFSASSAKELKNIYDEIDK 240 >gi|149067646|gb|EDM17198.1| integrin alpha M [Rattus norvegicus] Length = 1151 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 79/211 (37%), Gaps = 28/211 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ S K+ + ++ ++ K + L+ Sbjct: 143 GCPQQESDIAFLIDGSGSINSI---DFQKMKEFVSTVMDQFQKSKTLFS--------LMQ 191 Query: 226 FSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S++ F + ++ + + T + G++ ++F Q N Sbjct: 192 YSDEFRTHFTFNDFKRNPDPKSHVRPIRQLNGRTKTASGIRKVVRELF------QKINGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE + + + EA++ G I Y IG+ +S L Sbjct: 246 RDNAAKILVVITDGEKFGDPLNYEDVIP--EAEEAGIIRYVIGVGNAFHKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + + V+N ++ + + + I Sbjct: 304 ASKPAGDHVFQVDNFEALNTIRNQLQEKIFA 334 >gi|260797295|ref|XP_002593639.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] gi|229278865|gb|EEN49650.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] Length = 373 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 73/188 (38%), Gaps = 22/188 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 4 RNPLDIIFLLDGSGSVGA---SNFVKVKQFTK------KTISGFDISPSGTQVGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + V+ L I + T + ++Y F + Sbjct: 55 RTRQEFSMNSFVTKETLSAAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K+++ +TDG EA+++G VYAIG+ + + + ++ N+ Sbjct: 109 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIASNNNTLA 163 Query: 347 LVENPHSM 354 V+N + + Sbjct: 164 FVDNFNLL 171 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ + S+ K+ K + + + Q G++ +S Sbjct: 220 RNPLDIIFLLDGSGSVGA---SNFEKVKQFTK------KTISGFDISPSGTQVGVIQYST 270 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + ++ L I + T + ++Y F + Sbjct: 271 RTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKS------DGARPG 324 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG EA+++G VYAIG+ + + Sbjct: 325 VPKVVIVVTDG-----VSYDAVAAPALEAQQKGITVYAIGVSGYDADQL 368 >gi|3182939|sp|Q91145|COCA1_NOTVI RecName: Full=Collagen alpha-1(XII) chain gi|632648|gb|AAA80217.1| type XII collagen alpha-1 chain [Notophthalmus viridescens] Length = 929 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 96/277 (34%), Gaps = 32/277 (11%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 D+G++ ++ +++ + + Y ++ + T R Sbjct: 560 DDGYLGEMMVGPYDSTVVLEELRARTSYKVNVFGVFDDGQSPPLIGHETTTLRDAPRSPI 619 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNN 217 S ++ T A+ D+++++D S S+ F K + + V ++ ++ Sbjct: 620 PSSGLDCTTKAQADIVLLVDGSWSIGRPNF-----------KIVRNFISRVVEVFDIGSD 668 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQR----KIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 VQ + +S + L ++ + L G +TN+ LK F + Sbjct: 669 RVQIAVSQYSGDPRTEWQL--NTHKTKKSLMDAVANLPYKGGNTNTGSALK------FIL 720 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + +KI + +TDG K + G +YA+GI+ + Sbjct: 721 ENNFRPGVGMREKARKIAILLTDG-----KSQDDIVAPSKRYADEGIELYAVGIKNADEN 775 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + P+ Y+ + ++I D+ Sbjct: 776 ELKEIASDPDELYMYNVAD--FSLLTNIVNDLTENVC 810 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 72/212 (33%), Gaps = 41/212 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M++LD+S SM D S +++ A + ++ +L + GL+ F+ + Sbjct: 103 IDIMLLLDISDSMIEK-DLSPNRLEAAKRMARQFIKG-RLQD------RIGLIVFAGEAV 154 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L + ++ T L A N++ D G Sbjct: 155 SLCPLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRMRDTAGES----------- 203 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP------ 342 K+ + ++DG+N S + A G VY I + +S AS Sbjct: 204 KVAILISDGDNTSGNLGPTT--SAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDE 261 Query: 343 -----------NSFYLVENPHSMYDAFSHIGK 363 ++ + ++ F I + Sbjct: 262 GELQNIAGIGNGKYFRATDNTALESVFKQIDQ 293 >gi|316983245|pdb|3N2N|F Chain F, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983246|pdb|3N2N|A Chain A, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983247|pdb|3N2N|B Chain B, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983248|pdb|3N2N|C Chain C, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983249|pdb|3N2N|D Chain D, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983250|pdb|3N2N|E Chain E, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain Length = 185 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/203 (19%), Positives = 74/203 (36%), Gaps = 25/203 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 6 GGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLAHKFISPQ-LRMSFIVFSTR 53 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 54 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 108 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GAIVYA+G++ + R S + + Sbjct: 109 --VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYAVGVKDFNETQLARIADSKDHVFP 165 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 V + + A I I+ K Sbjct: 166 VNDG---FQALQGIIHSILKKSC 185 >gi|293344916|ref|XP_001079629.2| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|293356747|ref|XP_215375.5| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|149043683|gb|EDL97134.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1034 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 76/209 (36%), Gaps = 19/209 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 40 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 99 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ I + FG T + +K ++ H Sbjct: 100 HYSDEVEIIRGLMRMPSGRDELKASIDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 156 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V+++ I L Sbjct: 157 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITPDHLEPRLSII 211 Query: 340 ASPNSF---YLVENPHSMYDAFSHIGKDI 365 A+ +++ + + DA I + I Sbjct: 212 ATDHTYRRNFTAADWGHSRDAEETISQTI 240 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 617 CTCGPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAGV 670 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + +K L T + L+Y +++ Sbjct: 671 VQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLL------- 723 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 724 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCG----SDIQVVSVGIK 765 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + VV+ +V +S + Sbjct: 832 SPTDITILLDSSASVGSH---NFETTKVFAKRLAERFLSAGREDP-TQVVRVAVVQYSGQ 887 Query: 230 IEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + L + + +T+ L Y R + Sbjct: 888 GQQQPGRASLQFQQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYRENSS 938 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + + EA++ G ++ + + Sbjct: 939 GATKKRVLLFSDGNSQGATAEAIEKAV-QEAQRGGIEIFVMVVGP 982 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 61/174 (35%), Gaps = 27/174 (15%) Query: 174 MMIVLDVSRS-MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + LD S S + +I + ++ K D V F++ ++ Sbjct: 32 ICFSLDQSGSIVSPGLYPNIREF---------TVDAAKEFDDRTKDSYFSAVGFASGVKL 82 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +S + +S TN GL+ Y Q+ ++++ Sbjct: 83 IQAPTQSLSTFNTAVNTVSPLNGGTNIFRGLRGCYQQL-----------KTKPMTDRVLI 131 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +TDG + YCN K +G ++ +GI + FL+ CA+ FY Sbjct: 132 LVTDGFGGQP------INYCNFIKSKGILLVTVGIGTSINQNFLKNCATSEEFY 179 >gi|325116955|emb|CBZ52508.1| unnamed protein product [Neospora caninum Liverpool] Length = 765 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 86/217 (39%), Gaps = 30/217 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ ++LD+ ++D S S+ + + ++A L ++ P N+ V Sbjct: 64 GTIGAAEGCTSQLDICFLVDSSGSIGEAH------YEEVKQFLHAFLSKL---PIGNDEV 114 Query: 220 QSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + LV FS + + L + + + G +TN+ GL+ +FD Sbjct: 115 NTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQTCNQMLFD--- 171 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSH 333 + E K+++ MTDGE+ S D ++ ++RG V ++G + + Sbjct: 172 ---YPREERQTVPKLVIAMTDGESDS---DFHTVNEAKVIRERGGIITVLSVG-MYVNHN 224 Query: 334 EFLRACA-----SPNSFYLVENPHSMYDAFSHIGKDI 365 E C SP YL + + S I K++ Sbjct: 225 ECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILKEV 261 >gi|6017001|gb|AAF01565.1|AF061273_1 thrombospondin-related adhesive protein homolog [Neospora caninum] Length = 765 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 44/217 (20%), Positives = 86/217 (39%), Gaps = 30/217 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ ++LD+ ++D S S+ + + ++A L ++ P N+ V Sbjct: 64 GTIGAAEGCTSQLDICFLVDSSGSIGEAH------YEEVKQFLHAFLSKL---PIGNDEV 114 Query: 220 QSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + LV FS + + L + + + G +TN+ GL+ +FD Sbjct: 115 NTSLVIFSTTVHPHWSLRANNASDKETAMQDVLTIPYHGGTTNTAAGLQTCNQMLFD--- 171 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSH 333 + E K+++ MTDGE+ S D ++ ++RG V ++G + + Sbjct: 172 ---YPREERQTVPKLVIAMTDGESDS---DFHTVNEAKVIRERGGIITVLSVG-MYVNHN 224 Query: 334 EFLRACA-----SPNSFYLVENPHSMYDAFSHIGKDI 365 E C SP YL + + S I K++ Sbjct: 225 ECRSMCGCRNDSSPCPLYLQTEWSQLLPSISPILKEV 261 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 81/234 (34%), Gaps = 39/234 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F+ + G +L + LP I + ++VS + K+ L + +D + + + Sbjct: 3 LNKFWRSKSGNFALLMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLA------S 56 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G+ + + + + + V Sbjct: 57 SHLGDLDISRTDAFD-----RYFQANIAGHGE-----------LANAQATLTVDRGVNFI 100 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 A++ + L F F+ + +RHI + ++D +L++++VLD + SM Sbjct: 101 KTKAVASADVNLNF-GFLFGH--NRHIAVDA-----SAVESDNQLEVVLVLDNTGSMAG- 151 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---FLLEW 238 ++ + ++L+ ++ +++ LV F + F W Sbjct: 152 -----ARMTALRTATKSLLDTLEATKSPTRQIRASLVPFVTAVNVNGDEFDPSW 200 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 58/176 (32%), Gaps = 36/176 (20%) Query: 232 EFFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L++ +++ G TN + GL + + + KI Sbjct: 372 PVVSLTDDFDKLRKAASEMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKI 431 Query: 291 IVFMTDGEN--------------------------------LSTKEDQQSLYYCNEAKKR 318 ++ +TDGEN + D + C + K + Sbjct: 432 VLLLTDGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGWTKSVCTQLKNQ 491 Query: 319 GAIVYAIGIRV--IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G +Y + ++ + ACAS P+ +Y V +P + D F HI ++ Sbjct: 492 GVQIYTMVLQSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQHIANKFSRLQLT 547 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 71/211 (33%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVAPNRMAAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + G+ A I + + D IV Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEGIFTALQAIATVGAV---IGGGDKPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 +DG+ + + + AK +G + I E L Sbjct: 208 FSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETL 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + + ++ + + I Sbjct: 268 KKVAQLSGGNAYNAASLQELKSVYATLQQQI 298 >gi|38348304|ref|NP_940898.1| von Willebrand factor A domain-containing protein 2 [Homo sapiens] gi|34527908|dbj|BAC85505.1| unnamed protein product [Homo sapiens] Length = 725 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 78/209 (37%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + S+ A +D+M +LD S S+ + + + + + Sbjct: 28 HVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSVGK------GSFERSKHFAITVCDGL 81 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + P V+ G FS+ F L+ + ++ +IK + G T + LKY Sbjct: 82 DISP---ERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R +A+ +I++ +TDG K + K+RG V+A+G+ Sbjct: 139 --------HRGLPGGRNASVPQILIIVTDG-----KSQGDVALPSKQLKERGVTVFAVGV 185 Query: 328 RVIRSHEFLRACASP---NSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPRGQHVLLAEQVED 213 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 57/159 (35%), Gaps = 24/159 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +T G + + + + G Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 661 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 72/187 (38%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRQA-----AERGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P ++ Sbjct: 501 SDPQDLF 507 >gi|74722595|sp|Q5GFL6|VWA2_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; AltName: Full=Colon cancer secreted protein 2; Short=CCSP-2; Flags: Precursor gi|50429312|gb|AAT77225.1| colon cancer secreted protein-2 [Homo sapiens] gi|50429314|gb|AAT77226.1| colon cancer secreted protein-2 [Homo sapiens] Length = 755 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 78/209 (37%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + S+ A +D+M +LD S S+ + + + + + Sbjct: 28 HVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSVGK------GSFERSKHFAITVCDGL 81 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + P V+ G FS+ F L+ + ++ +IK + G T + LKY Sbjct: 82 DISP---ERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R +A+ +I++ +TDG K + K+RG V+A+G+ Sbjct: 139 --------HRGLPGGRNASVPQILIIVTDG-----KSQGDVALPSKQLKERGVTVFAVGV 185 Query: 328 RVIRSHEFLRACASP---NSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPRGQHVLLAEQVED 213 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 57/159 (35%), Gaps = 24/159 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +T G + + + + G Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 661 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 72/187 (38%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRQA-----AERGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P ++ Sbjct: 501 SDPQDLF 507 >gi|27657363|emb|CAD60276.1| AMACO [Homo sapiens] Length = 755 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 78/209 (37%), Gaps = 31/209 (14%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + S+ A +D+M +LD S S+ + + + + + Sbjct: 28 HVSKETIGKISAASKMMWCSAAVDIMFLLDGSNSVGK------GSFERSKHFAITVCDGL 81 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + P V+ G FS+ F L+ + ++ +IK + G T + LKY Sbjct: 82 DISP---ERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R +A+ +I++ +TDG K + K+RG V+A+G+ Sbjct: 139 --------HRGLPGGRNASVPQILIIVTDG-----KSQGDVALPSKQLKERGVTVFAVGV 185 Query: 328 RVIRSHEFLRACASP---NSFYLVENPHS 353 R R E L A AS L E Sbjct: 186 RFPRWEE-LHALASEPRGQHVLLAEQVED 213 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 57/159 (35%), Gaps = 24/159 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 524 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R I G ++ L + Y+++ +Q Sbjct: 574 VYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +T G + + + + G Sbjct: 628 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 661 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 72/187 (38%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 341 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 391 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 392 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRRA-----AERGFGSATRTGQDRPR 446 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 447 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 500 Query: 349 ENPHSMY 355 +P ++ Sbjct: 501 SDPQDLF 507 >gi|32189436|ref|NP_473382.1| anthrax toxin receptor 1 precursor [Mus musculus] gi|17366052|sp|Q9CZ52|ANTR1_MOUSE RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|15987505|gb|AAL11999.1|AF378762_1 tumor endothelial marker 8 precursor [Mus musculus] gi|63100398|gb|AAH94544.1| Anthrax toxin receptor 1 [Mus musculus] Length = 562 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + D+ +LD S S+ ++ I +E++ Sbjct: 20 VLVCAGHGGRREDGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLA 68 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 69 HRFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 127 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 128 QIYYENSQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 179 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 180 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 220 >gi|260825786|ref|XP_002607847.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] gi|229293196|gb|EEN63857.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] Length = 187 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 75/199 (37%), Gaps = 27/199 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A LD+ VLD S S+ ++ K+ +++ V + + G+V +S Sbjct: 10 APLDLFFVLDGSGSVTY---ANFDKVKEFTENV------VNAFDISASSTRVGVVQYSTS 60 Query: 230 IEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L I +S G T + L++A R + + Sbjct: 61 NTLEFNLGDHADKPSTLAAIDSISYQGGGTRTGSALEFA----------RLNAAWRGGSV 110 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 K+++ +TDG K N+ +G VYAIG+ + + L A + N+ Sbjct: 111 PKVMIVVTDG-----KSGDSVASSANDLASQGVDVYAIGVGNYDATQLLEIAAGNQNNVI 165 Query: 347 LVENPHSMYDAFSHIGKDI 365 + + +++ + I + + Sbjct: 166 ELTDFNALSAEINQIAQTV 184 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 82/222 (36%), Gaps = 39/222 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT ++++ LD S SME+ D +I A ++I ++ ++ L +G++TF Sbjct: 84 QTKEGVNIVFALDTSGSMEAA-DYQPDRITAAKEAIGTLINQLDLKD------YAGIITF 136 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + K+ ++ ST GL A + + N Sbjct: 137 DSGASTAAYLSPDKQRVIEKLGMIAASDDSTAIGDGLALAVD-----------MSKSIPN 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------------- 332 K +++ ++DGE+ + AK+ G V+ + + Sbjct: 186 RKSVVILLSDGESNA--GYVSPETAAEFAKESGVQVFTVAMGSSEKVLVGYDWANNPQYA 243 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E L A + FY + ++ + +S + IV ++ Sbjct: 244 TVDEETLEYIADSTGGGFYSSVDEKTLGNIYSQLDDAIVHEK 285 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 47/327 (14%), Positives = 100/327 (30%), Gaps = 40/327 (12%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD--------NG 102 + + A + + G I+ + + ELRD G Sbjct: 273 QDTVAANNQATAYTQVAGAPVVLVAEGHPGAASNLIQALEAGNIKVELRDLALLPKELEG 332 Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 + ++ + E +A+ I L + PI ++ Sbjct: 333 YKQFASIVLADVPATSMTDADMERMR-TAVRDLGIGLIMTGGKDSFGMGGWFQTPIEEAL 391 Query: 163 KVNS-----QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 V+ + L + +V+D S SM S K+ +A ++ + Sbjct: 392 PVHMDLKGKEQLPSLGLQLVIDKSGSMSSDARG-ADKMALAREAAIRATTMMNAQD---- 446 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 G++ F + + + +Q++I + G T+ P L+ Y ++ M Sbjct: 447 --YIGVIAFDDTPWDVVAPQSVTKLDEIQQQISRIQADG-GTDIFPALQLGYERVKAMNT 503 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R+H ++ +TDG++ + + L + V + + Sbjct: 504 QRKH-----------VILLTDGQSALDDDYEGLLQ---QMTAENITVSTVALGDDSDRGL 549 Query: 336 LRACAS--PNSFYLVENPHSMYDAFSH 360 L A +Y + S+ FS Sbjct: 550 LEMIAELGKGRYYFANDAESIPKIFSK 576 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 63/197 (31%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM+ + + L E + + +++ Sbjct: 68 IVFVVDRSASMKDDP------------RVLSFLREAVGQKQAADKYAVIAIGAEAAVDQP 115 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + GV N N +G+R N + +V Sbjct: 116 MTIR----------QEVQPLGVDVNR--------NATNLAEGIRLASAMIPTNARGKVVL 157 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYLVENPH 352 +TDG S +Q+ A++RG V A+ ++ E L + P Y E Sbjct: 158 LTDGLETSGDAARQTRL----ARERGIAVEAVSLQQPNGDEVVLTSVQVPQRLYAGEEYG 213 Query: 353 SMYDAFSHIGKDIVTKR 369 D S I + + + Sbjct: 214 ITVDVESTITTEAILRL 230 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 51/156 (32%), Gaps = 30/156 (19%) Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GMRQHCNTEDANYKKIIVFMTDG--- 297 ++ L G T G+ + + D + IVFMTDG Sbjct: 511 FNTYVQSLQPLG-GTYHDAGMVWGARLLSPTGLFADENATAPNDRPISRHIVFMTDGAMA 569 Query: 298 ENLST-----------------------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N+ + + + C A++RG ++ + V S++ Sbjct: 570 PNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVG-SND 628 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L CAS + +N + + F I + I R+ Sbjct: 629 SLNNCASSGQAFEADNAAELNEQFQAIARQISKLRL 664 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/231 (11%), Positives = 75/231 (32%), Gaps = 29/231 (12%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ + +G +LTA + + +G +++ + + L D ++ + Sbjct: 12 GTKSLISDQRGNAFMLTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLAGRRAM- 70 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 +N +F + + ++ ++ Sbjct: 71 -------GGASYDEAAQAEANKMFNFNFPEAKYGATGILFSSRALNASDVEG-------- 115 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-E 185 SA+ ++ F + + K+ +D+M+VLDV+ SM + Sbjct: 116 -QASAVLPTELMFMF--------GKEEFRLSADCTAKLEISN---VDVMLVLDVTGSMAQ 163 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + S+ +I + + + + ++ G+V +S+ +L Sbjct: 164 TNAGDSVNRITALKDATMDFFDTLTNADVGDGRLRFGVVPYSSTANVGQIL 214 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 77/215 (35%), Gaps = 48/215 (22%) Query: 172 LDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+ I LD+S+SM E F + + K E + D + L+ F Sbjct: 82 IDIAISLDLSQSMLQEDFTPNRLEKAK----------EVLDEFIDKRGNDRLSLIVFGGD 131 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNS----TPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + ++ + L+ +++N+ G+ A N++ D + Sbjct: 132 AYTKVPLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNRLKDSEAKS-------- 183 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 K+I+ +TDGEN + + AK+ G +Y IGI Sbjct: 184 ---KVIILLTDGENN--SGEMSPSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYKTVKN 238 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L++ A + ++ + + F+ I Sbjct: 239 TELDENMLKSIAETTGGEYFRASDSKEFKEIFNKI 273 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 70.6 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 31/198 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L M+IVLD S SM + KI +A ++ ++ + G + F Sbjct: 403 KELPSLGMVIVLDRSGSMAGY------KIQLAKEAA------IRSAELLREKDTLGFIAF 450 Query: 227 SNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + KI L+ G TN P L+ AY Q+ ++ R+H Sbjct: 451 DDRPWQIIDTEPIKDKEKVIEKINGLTS-GGGTNIFPSLELAYEQLTPLELQRKH----- 504 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 I+ +TDG++ ++ + ++ E K+ + + I L + Sbjct: 505 ------IILLTDGQSATSPDYLTTIQ---EGKENNITLSTVAIGEGSDSVLLEELSDEGG 555 Query: 343 NSFYLVENPHSMYDAFSH 360 FY V + ++ S Sbjct: 556 GRFYDVNDSSTIPSILSR 573 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 47/337 (13%), Positives = 107/337 (31%), Gaps = 43/337 (12%) Query: 43 IFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG 102 +F K L +DR L ++ G + + + N + F +L DN Sbjct: 404 LFLRKANLAFELDRVL-----RLPPLEAGRPDRPAALTPVSFQQSNANDEPFDFDLEDNA 458 Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 +ND + + + + + I V Sbjct: 459 ALNDPNGVRVIIRSADETNFPKITLDFEVKGPDGKAITSARKEDFKVTEYDREVEILDFV 518 Query: 163 KVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + +++VLD S SM +I +++ L + P + V Sbjct: 519 SPKGEVQLRPTTVVLVLDTSGSM-----LQDNRIGALKEAVGVFLGTL---PPGSKV--- 567 Query: 222 GLVTFSNKIEE------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ F++ + + ++ ++ G T+ + A I + G Sbjct: 568 AVIEFNSFVNPLVFGPANEIFTTRFDDVKSQVNRFRANG-GTSYYDAVDRALELIANQTG 626 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 R ++ +TDGE+ S++ S+ +A+ G V+ +G+ E Sbjct: 627 RRA------------VLALTDGEDTSSRLAGLDSVIL--KARNLGLPVHTLGVGREDEIE 672 Query: 335 F--LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 L+ A + ++ + + F+ + + + Sbjct: 673 VGELQRLARETRGRYFPARDATKLRVIFAELAQSLRE 709 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 70.2 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 74/205 (36%), Gaps = 38/205 (18%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ +S +++I A+K+I A + + GL+ F Sbjct: 91 DLLLAIDLSGSMQIEDMQIGNSLVSRI-TAVKAIAADFASRRTGD------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + V +++ I+ L G T L A ++ + Sbjct: 144 RAYVQAPLTFDVKTVKQFIEEAQLGFAGEDTAIGDALGLAVKRLRERPADS--------- 194 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------HEFL 336 ++++ +TDG+ T + A + +Y IGI L Sbjct: 195 --RVLILLTDGQ--DTASTVDPMEAAALASEMNVKIYTIGISRRLGTSSNSSGEVDEALL 250 Query: 337 RAC--ASPNSFYLVENPHSMYDAFS 359 A A+ ++ P + D + Sbjct: 251 TAIAQATGGRYFRARTPKELQDIYQ 275 >gi|256261598|gb|ACU65921.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 77/204 (37%), Gaps = 29/204 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ S K+ +K++ + + LV FS++ +E Sbjct: 151 DIAFLIDGSGSISS---RDFNKMLSFVKAVMSQFQRPSSQFS--------LVQFSDRFQE 199 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S + + + G T + +++ +++ + KI Sbjct: 200 HFTFKDFATSSDPLNLLNSVWQLGGWTFTASAIRFVTDRLLSA------AYGARKDASKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHEFLRACASP--N 343 ++ +TDGE + ++ + A G I YAIG+ E + ++P Sbjct: 254 LIVITDGEKTEKVDYKEVIPRAEAA---GIIRYAIGVGSAFQYRNSLQELIDIASTPSKE 310 Query: 344 SFYLVENPHSMYDAFSHIGKDIVT 367 + VEN ++ D + + I Sbjct: 311 HVFQVENFDALRDIQKQLKEKIFA 334 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 81/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + ++++ K Q L+ +S+ + Sbjct: 156 DIVFLIDGSGSIRPL------QFVQMKNFVMTVMDQFKGTD-----TQFSLMQYSDDFKT 204 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + + + T++ G++ ++F Q N + KKI Sbjct: 205 HFTFNNFKNDPTSKNLVGPIEQLNGKTHTASGIRKVVRELF------QEWNGARKDAKKI 258 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRACAS---PN 343 ++ +TDG+ + + + EA+K G I YAIG+ + + LR AS Sbjct: 259 LIVITDGQIQGDSLNYRDVIP--EAEKEGVIRYAIGVGYAFNTPSARQELRTIASQPAQE 316 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V N +DA +I + K Sbjct: 317 HVFQVNN----FDALKNIQNQLQEK 337 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 69/222 (31%), Gaps = 52/222 (23%) Query: 174 MMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SME ++I + +++ + LV F Sbjct: 88 LVVALDISGSMEEQDMDDNGQRRSRIAVTKDVAMDFVKQ-------REGDRIALVLFGTH 140 Query: 230 IEEFFLLEWGVSHLQRKI--------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 L + + + I L + +T + A ++ D+ Sbjct: 141 PYLQTPLTFDHPTVMQHIYEAQLTMADDLQRGIHATAIGDAIGLAVKRLRDIDA------ 194 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---- 337 K ++ +TDG + + L A + G +Y IG+ + L Sbjct: 195 -----PDKTLILLTDGSDNA--SQVAPLKAAQIAAREGLKIYTIGLGAEQRQASLLGFDF 247 Query: 338 ----------------ACASPNSFYLVENPHSMYDAFSHIGK 363 A A+ ++ NP + + + HI + Sbjct: 248 GFGKNREIDEKTLKDIAKATDGRYFRARNPEELREIYQHIDR 289 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 52/334 (15%), Positives = 115/334 (34%), Gaps = 78/334 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R+ + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 29 FRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 88 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + K+ +++N++ + + + Sbjct: 89 SEDTDALKQKVYDWFHAQVENSYALGE-------------------------IEIDTTNH 123 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 +I+A + +P F N + + + S+VK + L++ IV+D S SM Sbjct: 124 NITATASGTVPTTFMKIA----NIDTVPVSVGSAVKGP--ATSYLNVYIVIDRSPSMLLA 177 Query: 185 --------------------------------ESFFDSSITK-----IDMAIKSINAMLE 207 + +D S K D+A ++ +L+ Sbjct: 178 ATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLD 237 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ GL + + +E S+ ++ LS S + Y Y Sbjct: 238 MIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARK---RLSDDSYGLTSATSMNYTY 294 Query: 268 NQIFDMQGMRQHCNTED----ANYKKIIVFMTDG 297 + + D AN K+++ +TDG Sbjct: 295 FDVALAALQKIVGTGGDGTSSANPLKLVLLLTDG 328 >gi|194221347|ref|XP_001494879.2| PREDICTED: similar to alpha 1 type VII collagen [Equus caballus] Length = 3065 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 75/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLDALGSGGDVIHAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADHEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 75/218 (34%), Gaps = 25/218 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P S + +D++ +L +R + + + +++ ++ + + Sbjct: 1035 IQGPEASITQSPVCPHGLMDVVFLLHATR-------DNAHRAEAVKRALERLVSALGPLG 1087 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQI 270 VQ GL+++S++ F L + +KI+ + N + A+ + Sbjct: 1088 P--QAVQIGLLSYSHRPSPLFPLNSSHDPGVILQKIRSIPYMDPSGNNLGIAVVTAHRHL 1145 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + ++V + D EA+ G V +G+ Sbjct: 1146 MAPDAPGR-----RQHVPGVMVLLVD-----EPLRGDIFNPVREAQAAGLKVMMLGLVGA 1195 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFSHIGKDI 365 + R S + +F+ V + S+ A S + + Sbjct: 1196 DPEQLRRLVPSTDPVQNFFAVNDGSSLDQAVSSLATAL 1233 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 63/400 (15%), Positives = 147/400 (36%), Gaps = 58/400 (14%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT---QIMNEGN 70 N KG + I A+ +I + + I++S + + + D + A T + + +G Sbjct: 15 NIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGM 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N+ + + + S + +N +D + ++ + ++E Y Sbjct: 75 SKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHESYMT 134 Query: 130 SAISRYKIPLKFCTFIPWYTNSRH----IVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 A+ I + ++ ++ S+ + + R M + + Sbjct: 135 HAMGFDNIDYTADSESTISFGQGKYEFIFLVDVSPSMGIGASNRDRQIMQRAIGCQFACH 194 Query: 186 SFFDSSIT---------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + SS++ +ID+ ++ +++ +++ +V+ +++GL +FSN + L Sbjct: 195 EPWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEVD--LRTGLYSFSNYLHIQTGL 252 Query: 237 EWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 G+S +R+ ++ G TN +N + A+ K+ I Sbjct: 253 NKGISKFKREANKIAIHREYLRGGGTNFHGVFS-DFNGVL-------RSLKPKADVKQHI 304 Query: 292 VFMTDG------------------ENLSTKEDQQSLYYCNEAKKRGAI-VYAIGIRVIRS 332 + ++DG N + +C+E KK V+ + + R+ Sbjct: 305 IIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDRA 364 Query: 333 H------EFLRACA-SPNSFYLVENPHSMYDAFSHIGKDI 365 H +RACA S + FY + + AF + + + Sbjct: 365 HYVRASTSSMRACATSADFFYSANSAAEIDKAFKDLFEAL 404 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 35/261 (13%), Positives = 84/261 (32%), Gaps = 47/261 (18%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES- 186 ++P ++ ++ W+ + P + +AR MM+ +D+S SM Sbjct: 57 QAKTTPSLRMP-RWLAWLGWFLLCAALARPQQLGEVIQPPREAR-QMMLAVDLSGSMNEP 114 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + ++ A ++ L+ + GL+ F + L ++ + Sbjct: 115 DMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQRAYALTPLTADLTSV 167 Query: 244 QRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + ++ + G T + + ++ E ++++V +TDG N + Sbjct: 168 RDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQGQRVVVLLTDGVNTA 216 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----------------LRACA--SP 342 L AK G +Y I + LR A + Sbjct: 217 --GVLDPLKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTG 274 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 F+ + + ++ + + Sbjct: 275 GRFFRARDTEELAGIYAELDR 295 >gi|301609920|ref|XP_002934508.1| PREDICTED: vitrin-like [Xenopus (Silurana) tropicalis] Length = 779 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V+D S S+ ++ I + E ++ + G V ++ + Sbjct: 596 DIGFVIDGSSSVG------TGNFRTVLQFIANITNEFEISDTD---TRIGAVQYTYEQRL 646 Query: 233 FFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F ++ + I + + T++ + YA Q+F N +KI Sbjct: 647 EFGFDKYSTKQDVMNAIMRIGYWSGGTSTGAAITYASEQLFS---------KSKPNKRKI 697 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDG + G I YA+GI E P + + V Sbjct: 698 LIVITDGR-----SYDDVRAPAAAVHRNGVIAYAVGIAWAAQDELESIATDPDKDHSFFV 752 Query: 349 ENPHSMYDAFSHIGKDI 365 E+ S+Y I ++I Sbjct: 753 EDFDSLYKFVGKIFQNI 769 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 89/253 (35%), Gaps = 28/253 (11%) Query: 90 WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI------SAISRYKIPLKFCT 143 W + R+ N + + + +E ++ +++ ++ L Sbjct: 305 WPVETRDTQVHNSKTSVSETDRWKSGFSQTNSKNSESDTVLKAVQETSVPKFNEDLWRYG 364 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F N+R V+ V T+ ++D++ ++D S S+ + + +N Sbjct: 365 FSETAINAREEVITKPPETAVLGGTNCKIDLVFLIDGSWSIGK------RRFKIQKFFLN 418 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTP 261 M E + + ++ + G+V + + F L + V L+ I+ + + G +N Sbjct: 419 EMAEALDVG--LSGPL-MGVVQYGDDPSTEFGLGAHFNVGDLKNAIEKIPQKGGHSNVGK 475 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L Y F + + V + DG E+ L A++ G Sbjct: 476 ALSYVNKNYFSDSSGNRGSAPN------VAVVLVDGWPTDKVEEASRL-----ARESGIN 524 Query: 322 VYAIGIRVIRSHE 334 ++ + I +E Sbjct: 525 IFFVTIEGADDNE 537 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 76/208 (36%), Gaps = 30/208 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D +++ A KS + ++ +V LV F+ Sbjct: 88 IVVAIDVSLSMMA-TDVDPNRLEAAKKSAKNFVNQLPSKFNVA------LVNFAGTASII 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + +QR I L + ST + G+ + + + +H N IV Sbjct: 141 VPPTTDRATVQRSIDGL-ELAESTATGEGIFTSLQALTQVPPDPEHPNDPAPAR---IVL 196 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFLRAC 339 ++DG+ + Q+ AK++ +Y I V LR+ Sbjct: 197 LSDGKRTVGRTAQEG---AQAAKEKNTPIYTITFGTDSGFIEMDGIRQRVPPDRAELRSV 253 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + Y E+ + D + IG + Sbjct: 254 AEITGGEAYTAESAGELEDVYKDIGSSV 281 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 78/216 (36%), Gaps = 27/216 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSINAMLEEVKLIPD 214 SV+ + + ++VLD S SM D + T +I A ++I L P Sbjct: 78 SVRRGVNREEGISFLLVLDASGSMWDALDGTPTEDPDRMRITHAKRAIREFL------PL 131 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ + GL F+ + S + K+ + + P + AY +++ Sbjct: 132 LSGRDRVGLAVFNRTYRVIQPIVGDPSLVLEKLDAIER--------PSREQAYTELYRSM 183 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYC-NEAKKRGAIVYAIGIRVIR 331 + + ++++V ++DGEN E + + A + G Y I + Sbjct: 184 E-EALTDFGEEGRRRVLVVLSDGENFPVDPSESPSTPGTAIDLAHRYGITCYVIHFGTEK 242 Query: 332 SHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 + AS + N + ++ I + + Sbjct: 243 -DRLIGDLASETGGRVFDARNALELASVYTAIQEQV 277 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 70/204 (34%), Gaps = 24/204 (11%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D+++V+D S+S+ + ++ + + ++ + + P + GLV Sbjct: 126 GRCGSGPTDLVLVIDGSKSVG------AQQFELVKRWVGELVGSLDVSPAG---TRVGLV 176 Query: 225 TFSNKIEEFFLLEW-GVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+++ F L G + T + L++ + F Sbjct: 177 QFSSRVRTEFPLGRHGTKAEAEAAVRAVTPMDKGTMTGLALRHLVERGFSEAE------G 230 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +++ + +TDG + AK RG +++A+G+ E + P Sbjct: 231 ARPGSRRVGLLVTDGR-----SQDDVSPWAARAKDRGIVMFAVGVGKAVEEELREIASDP 285 Query: 343 --NSFYLVENPHSMYDAFSHIGKD 364 + ++ H+ D Sbjct: 286 VERHVSYAADFGALTRLLDHLKLD 309 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 89/227 (39%), Gaps = 32/227 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S H+ + T + T+ +D+ ++D S S++ + ++ ++++ + Sbjct: 133 SSHLQIIRTVPAALPECTNQEIDIAFLIDGSGSID---QTDFKRMKNFVRAVMDRSKGTN 189 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 L+ +SN ++ F W Q + + + T + G++ Sbjct: 190 TQFS--------LMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVR 241 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-- 326 ++F N + +KII+ +TDGE K+ + EA+K I YAIG Sbjct: 242 ELF------HSKNGARKSARKIIIVITDGE--KYKDPLEYKDVIPEAEKANIIRYAIGVG 293 Query: 327 --IRVIRSHEFLRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + E L+ S + + V++ + A S I K + K Sbjct: 294 DAFQAHAAREELKIIGSVPSEDHVFKVDS----FAALSSIQKQLQEK 336 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 76/245 (31%), Gaps = 33/245 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + +V+ T+ S L+ +M+V+DVS SM S D ++ A Sbjct: 65 RHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMAS-TDVPPNRLAAAK 123 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +++ + GLV F+ S ++ I L T + Sbjct: 124 EAGKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRSAVKAGIDSLQP-APKTAT 176 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYCNEAK 316 G+ A I + + IV +DG Q + AK Sbjct: 177 GEGIFTALQAIATVGSVMGGGEGPPPAR---IVLESDGAENVPLDPNAPQGAFTAARAAK 233 Query: 317 KRGAIVYAIGIRVIRS---------------HEFLRACA-SPNSFYLVENPHSMYDAFSH 360 G + I + C + + ++ S+ + +S Sbjct: 234 AEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYST 293 Query: 361 IGKDI 365 + + I Sbjct: 294 LQRQI 298 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 89/227 (39%), Gaps = 32/227 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S H+ + T + T+ +D+ ++D S S++ + ++ ++++ + Sbjct: 133 SSHLQIIRTVPAALPECTNQEIDIAFLIDGSGSID---QTDFKRMKNFVRAVMDRSKGTN 189 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 L+ +SN ++ F W Q + + + T + G++ Sbjct: 190 TQFS--------LMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLNGLTFTATGIRTVVR 241 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-- 326 ++F N + +KII+ +TDGE K+ + EA+K I YAIG Sbjct: 242 ELF------HSKNGARKSARKIIIVITDGE--KYKDPLEYKDVIPEAEKANIIRYAIGVG 293 Query: 327 --IRVIRSHEFLRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + E L+ S + + V++ + A S I K + K Sbjct: 294 DAFQAHAAREELKIIGSVPSEDHVFKVDS----FAALSSIQKQLQEK 336 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 66/159 (41%), Gaps = 24/159 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V+DVSRSM + D ++++A I+ + ++ + GL+ FS + Sbjct: 86 DIFFVIDVSRSMLAD-DFRPNRLEVAKDKISDFVA-LRPTD------RIGLIMFSERAFT 137 Query: 233 FFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + +++ + ++ G TN L A + + K Sbjct: 138 LLPLSTDLKLIKQMVGEINVGGMLGSGTNIGDALGLAVAR-----------GAQSLAKNK 186 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I+ +TDG +S + EAKK+G VY IGI Sbjct: 187 VIILLTDG--VSNVGFLTPIQAAEEAKKQGIKVYTIGIG 223 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 77/215 (35%), Gaps = 45/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + + ++ M + + + + GL+ F + Sbjct: 88 DIMLAVDLSGSMVEQDMAYQGRYVDRLSMVKAVLKNFIAQ-------RQGDRLGLILFGD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L ++ + + ++ G +T L A + ++ + Sbjct: 141 TAFLQTPLTRDLNTVSKMLEEAQIGLVGRATAIGDALGLAVKRF-----------SQKQD 189 Query: 287 YKKIIVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------ 333 +I+V +TDGEN + +++L A++ G VY +G+ + Sbjct: 190 SNRILVLLTDGENTAGNLAPEEALLL---AREEGIKVYTVGVGSQGGNRFNLFSMSGSSS 246 Query: 334 ---EFLRACASP--NSFYLVENPHSMYDAFSHIGK 363 L+ A+ ++ + S+ + + K Sbjct: 247 LDESLLQKIATETGGLYFRATDVASLQQIYQELDK 281 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 84/239 (35%), Gaps = 37/239 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + V+ + +++VLD+S+SM + D +++D A + ++ + Sbjct: 79 LTVALAGPTAVSQVAKNQATVILVLDISKSM-AATDVKPSRVDAARAAAIKFVDGMAP-- 135 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 VQ G+VTF+ + ++ I + K T + G+ A QI Sbjct: 136 ----TVQLGVVTFAGSAQPLVRPSTDHETAKKVIDQMIRADKLEKQTATGEGIYTALQQI 191 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + G + IV ++DG+ + + AK++ V + Sbjct: 192 ETIAGALGG--KNHTPPAR-IVLVSDGKETVPDDLNAPRGAYAAARTAKEKHIPVCTVAF 248 Query: 328 RVIRSH--------------EFLRACA----SPNS---FYLVENPHSMYDAFSHIGKDI 365 + L+ + SP + F+ E+ + + + +DI Sbjct: 249 GTKSGKITIDNQVDEVPVDLDSLKKISDLSNSPGNSCRFFPAESQAELAQIYQSLNEDI 307 >gi|326435586|gb|EGD81156.1| hypothetical protein PTSG_11196 [Salpingoeca sp. ATCC 50818] Length = 1445 Score = 70.2 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 69/201 (34%), Gaps = 26/201 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D++ +LD S S+ S + + + + + PD + L+ + Sbjct: 177 CGRSAVDLLFILDGSGSIGS------SNFETMRQFTATVTSFFDVSPD---TTRVALMVY 227 Query: 227 SNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S+ + E F + +S+ + I+ + G T + L YA +F + Sbjct: 228 SSSVTEIFDFSYVLSNTRDEIITTIRNTNYPGGGTRTGSALDYARTNMFLTS---RGARP 284 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 A ++ + + DG+ + ++AIGI +E + P Sbjct: 285 SSAGVPRVAIVIIDGQ-----SGDSVAQPAENLRNENVNIFAIGISGADVNELNMIASPP 339 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 + N D F + Sbjct: 340 -----ITNNVKFIDTFQAFSQ 355 >gi|297683547|ref|XP_002819437.1| PREDICTED: collagen alpha-1(XIV) chain-like [Pongo abelii] Length = 1761 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 80/223 (35%), Gaps = 26/223 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV- 215 VK QT A D++I++D S S+ F + + LE + DV Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRFN----------FRLVRLFLENLVTAFDVG 191 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL +S + L + + ++ L G +T + L Y + F Sbjct: 192 SEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKP 251 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KI + +TDG K + ++ G ++AIG++ + Sbjct: 252 E------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVN 300 Query: 334 EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 E + P+ Y V M+ + + + ++ D+ Sbjct: 301 ELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|198426249|ref|XP_002120426.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1937 Score = 70.2 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 72/199 (36%), Gaps = 30/199 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+++VLD SRS+ + + + +++ + P+ ++ ++ + Sbjct: 1742 GVIDLVVVLDSSRSIGLKSWTDM------KAFVRSLIGSFVIGPNA---MRVSAFRYNRR 1792 Query: 230 IEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + + + + G T + L++ + I Sbjct: 1793 VDTDTQILLNSYSDNSFFAAYDAIPYNGAGTRTALALQHVRDVILTRANGEWPAARN--- 1849 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----EFLRACAS 341 +++ +TDG + + + + G + YA+G+ I + L S Sbjct: 1850 ---VVLTVTDG-----RSVSSIVPISEQLRANGVLTYALGVPPIIGNGMNTVNLLALAGS 1901 Query: 342 PNSFYLVENPH--SMYDAF 358 P+ +L N + +Y+ F Sbjct: 1902 PSRRFLANNGNYGGLYEQF 1920 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 62/184 (33%), Gaps = 29/184 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LDM+I++D S S+ S ++ + ++ + Q ++ ++ ++ Sbjct: 90 LDMVILIDSSSSVGS------ENWNIMKNFVRTIINSFQRSATS---TQISVLRYNRVVD 140 Query: 232 EFFLLEWG-----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + S + G T + L+Y + I N + Sbjct: 141 TSTQILLNEYLTDQSGFLAAYDRIPYNGGGTLTGNALRYVNDVILT------GANGDRPG 194 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACASP 342 + +++ +TDG + L + +G Y +GI +R + L S Sbjct: 195 VRDVLITLTDG-----RAHDNVLAPSRMLRAKGVETYVVGIQSRLGALRESQLLEISGSR 249 Query: 343 NSFY 346 + + Sbjct: 250 DRMF 253 >gi|296223654|ref|XP_002757711.1| PREDICTED: anthrax toxin receptor 1 [Callithrix jacchus] Length = 568 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 43 GGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLAHKFISPQ-LRMSFIVFSTR 90 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 91 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 145 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GAIVY +G++ + R S + + Sbjct: 146 --VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFP 202 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 203 VNDG---FQALQGIIHSILKKSCIE 224 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 54/301 (17%), Positives = 107/301 (35%), Gaps = 42/301 (13%) Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 RK +K +K + F + F D + L ++V + + Sbjct: 308 RKFIKNVAKKTTLKKGQKLDF------DIFATDRSKFPKEVKLYVLVTDTSGNFISHLAP 361 Query: 134 ---RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y+ KF I R + +++ D+ +V+D S SM Sbjct: 362 PYVSYRESRKFFRKIVESVKGRRYAINNFKVREIHEMISKPYDISLVMDYSGSM----AG 417 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKY 249 +I K++ A + + L N+ + +V F ++E L G K Sbjct: 418 NIKKLEEATR-------KFILTKHPND--KISVVKFDERLETELRLTAQGSKTDCVKFDG 468 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST-----KE 304 L+++G ST G + + Q K+++ TDGE S+ K Sbjct: 469 LTRYGGSTALYAGADEGLESLKNAQNN------------KVMLLFTDGEENSSLQYFGKR 516 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 ++ +A+++G V+ I +++ L A + + Y +ENP + + + Sbjct: 517 AFRASEVVKKAREKGIRVFTIAYGTGVNNKTLNALSMLTDGKTYFIENPDEIKQVYEELP 576 Query: 363 K 363 + Sbjct: 577 R 577 >gi|291386646|ref|XP_002709870.1| PREDICTED: anthrax toxin receptor 1 [Oryctolagus cuniculus] Length = 564 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYFFVEQLAHKFISPQ-LRMSFIVFSTR 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GAIVY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 201 VNDG---FQALQGIIHSILKKSCIE 222 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 69/168 (41%), Gaps = 26/168 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D+MI +D SRSM + D ++++A + I +L+ ++ + GL+ F+ Sbjct: 85 EKKGIDIMIAVDTSRSMLAD-DVKPNRLEVAKREIEDLLKILEGD-------RVGLIAFA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L S + + L+ T + + E+ Sbjct: 137 GRAFTYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGID-----------AFGEN 185 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N K ++ +TDGEN + L ++AK++G ++Y +G+ Sbjct: 186 ENNHKAMIIITDGENH----ETDPLKAASKAKEKGIVIYTVGVGKKEG 229 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 74/204 (36%), Gaps = 30/204 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D +++D A + + +E + V+ GLV+F+ Sbjct: 88 VVLAIDVSGSMMAD-DLKPSRLDAAKAAARSFVERMPAG------VKVGLVSFAAGAVLE 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + +I L + +T GL + ++ Sbjct: 141 SGLTADHQGVIERIDLLERR-ANTAIGEGLLESLKAFPTGAN-------HQVAVPATVIL 192 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------LRACA--S 341 ++DG N + EAK+RG VY IG+ + LR A + Sbjct: 193 LSDGRN---RIGIAPQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEAELRGIAEVT 249 Query: 342 PNSFYLVENPHSMYDAFSHIGKDI 365 ++ ++ + + + +G I Sbjct: 250 GGRYFAADSADRLQEIYRELGSQI 273 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 73/201 (36%), Gaps = 27/201 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ VLD S S+ + + +I + Q G++ +SN+ + Sbjct: 1 DIIFVLDGSGSIGT---DNFERIKTFVSKAVTRFNIGPTQ------TQIGVIQYSNQPQS 51 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 LL + LQ+ I ++ TN+ L+Y N F N A K+ Sbjct: 52 EILLNDHQDAASLQQAISSINYLQGGTNTGKALRYLANNAFS------GKNGARAGVSKV 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVE 349 + +TDG + + A K G ++YA+GI + E +S + V Sbjct: 106 AIVVTDGRSSD-----DVVRPALNAGKEGIVLYAVGIGGSVDYQELRDIASSDDKVVNVT 160 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + + +G + + Sbjct: 161 DFSGL----QSVGNTLPDQVC 177 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 72/206 (34%), Gaps = 23/206 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+ I+LD S + S + L E + D + + + + + Sbjct: 230 TPLDIAILLDGSDGVSS------DDFEAEKSFAKLFLNEFDIGQDNS---RVTVFQYGTE 280 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + F L+ + +Q I G N ++Y F N + Sbjct: 281 PRQEFALDTYETDQDVQDAIADTEYMGGDRNLGQAIRYMATYGFS------GRNGARRSI 334 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF-Y 346 + + +T GE+L S +A++ G I+YAIG+ L A A+ + Y Sbjct: 335 PSVAIIITGGESLDEVASAAS-----KARRSGIILYAIGVGNATVPAELAAIATTANTSY 389 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ D + +I T Y Sbjct: 390 AAASFAALKDLRGALADEICTSEFCY 415 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 82/218 (37%), Gaps = 22/218 (10%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ ++ ++++ D N V +G + Sbjct: 180 QDCPVDLFFVLDTSESVALRVKPFGYLVEQVKIFTKKFIDKLTDRYYRCDRNLVWNAGAL 239 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+ +E L G +L+ +++ + G T++ +K ++ ++ Sbjct: 240 HYSDSVELIQGLTRMPSGQKNLKDRVEAVQYIGKGTHTDCAIKRGIEELLIGGSHQKEN- 298 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG KE L NEAK G V++I I L Sbjct: 299 -------KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSIAISPGHLEPRLSII 351 Query: 340 ASPNSF---YLV---ENPHSMYDAFSHIGKDIVTKRIW 371 A+ +++ + V EN D ++ K V + Sbjct: 352 ATDHTYRRNFTVNSEENVDESIDTMINMIKKNVEQVCC 389 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 81/228 (35%), Gaps = 43/228 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGL 223 +D+ VLD S S+ + ++A I +++ + V + G+ Sbjct: 749 CKCGPIDLFFVLDSSESIG------LQNFEIAKDFIIKVIDRLSRDEQVMFEPGQSKVGV 802 Query: 224 VTFSN-KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ K +E + + L+ +K L+ T + L +A N + G Sbjct: 803 VQYSHSKTQELVSMSDVNIRNIRDLKEAVKKLNWIAGGTFTGEALDFASNTLGRPDGN-- 860 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHE 334 ++I + +TDG + + ++ C GA V A+G + E Sbjct: 861 ---------QRIAIVITDGRSDTKRDPSPLNALCQT----GAQVVAVGINDIFSKPSNQE 907 Query: 335 FLRA--CASP------NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L A C +P EN + D K+I ++ K Sbjct: 908 GLSAVTCEAPAPGQKKGLSISKENYAELLD--DAFLKNITSQICIEKK 953 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 61/174 (35%), Gaps = 17/174 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ I++D S S+ S + +K + + V+ +V +S + ++ Sbjct: 968 DITILVDSSTSVGSH---NFKTSKQFVKRLAERFLSADKTDTAD--VRVSVVQYSGRNQQ 1022 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ + + I + +T+ ++Y ++ R KK Sbjct: 1023 KLEAQFLQNYTEIASIIDDMEFINDATDVNAAIRY-VTTLYQKSSPRG--------VKKR 1073 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ +DG + EA++ G +Y + + + +R + + Sbjct: 1074 LLLFSDGNSQGITGKAIEAAV-QEAQRAGIEIYVLVVGRHANEPNIRVLVTGKT 1126 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 78/216 (36%), Gaps = 38/216 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +M+V D SRSM + D +I A ++ + L+++ ++ G+ TFS+ + Sbjct: 87 SVMLVTDHSRSMLAE-DVEPDRITAAKRAASRFLDQLPPG------IRVGVTTFSDVPDG 139 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + ++R I+ G T + L+ A + + + +V Sbjct: 140 TQTPTYDHDLIRRTIEAQIADG-GTATGDALQVALDTL-------ERLEQNGERTPAAMV 191 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------------------E 334 ++DG + ++ EA+ +Y + + + E Sbjct: 192 LLSDGATTTGRDPVMVARAAGEAR---IPIYTVALGTRDATVPNPGPTGPPLLPVAPDPE 248 Query: 335 FLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L+A AS + ++ + + +G + T+ Sbjct: 249 TLQAIADASGGRAFQAQDDQELSSIYETLGSRLGTR 284 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 63/163 (38%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ ++ +N++++ V + + G+V + Sbjct: 796 CRTSPLELVFVIDSSESVGP------DNYEVVKDFVNSLIDHVSV---SREATRVGVVLY 846 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ L+ + + ++ ++ + G T + ++ A Q+F Sbjct: 847 SHVEVVVASLQQLYDQAAVKTAVRRMPYLGEGTFTGSAIRRA-TQLFQAA---------R 896 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +K+ V +TDG ++ A G ++ +GI Sbjct: 897 PGVRKVAVVLTDGL-ADNRDAVSLKDAAEGAHSAGIEIFVVGI 938 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 53/381 (13%), Positives = 111/381 (29%), Gaps = 86/381 (22%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ FF + +G + ++ + LP++FL +G ++ S + +K L D + V + Sbjct: 4 GLKRFFRDTRGNVIMIIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASVGSVAVNS 63 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 N + + +N + + ND+++I + Q+ Sbjct: 64 YAYKANTKGHSSFKTGENQALAIFNSNVKKH-------NDLNNIKVKAKIK----KQSTN 112 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + T MPIT +S +D ++LD S SM Sbjct: 113 LVSEIGVTADYRPYLLGLMGMNT------MPITIKSTSSSTFPPYIDFYLLLDNSPSMGV 166 Query: 187 FFDSS------------------------------------ITKIDMAIKSINAMLEEVK 210 + T+ID+ ++ ++ K Sbjct: 167 GATTKDIDTMVANTSDKCAFACHQMDKAGNDYYALAKKLKVTTRIDVVRQATQNLMTTAK 226 Query: 211 LIPDVNNVVQSGLVTF----------SNKIEEFFLLEWGVSHLQR--------KIKYLSK 252 + + + + F + E L +S I Y + Sbjct: 227 NTQTLTDQYRMAIYHFGMAADQIDSKNPAPYEVSALTTNLSTSASNAAKIDLMTIPYQNY 286 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS---TKEDQQS- 308 + P N++ G + ++++ F++DG N + S Sbjct: 287 NSDRQTNFPSYLLGMNKVIPSSGDGSSSSKP----QQVLFFVSDGANDGYDCAYSNGASC 342 Query: 309 -------LYYCNEAKKRGAIV 322 C K RG + Sbjct: 343 RRISPLDTPQCKAMKARGVKI 363 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 69.8 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 63/427 (14%), Positives = 139/427 (32%), Gaps = 72/427 (16%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT--- 63 ++ F + G + ++ AI L I +G ++ ++ ++ L S D +++ A + Sbjct: 13 SLGRFASDRSGNIAVIFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVSNSA 72 Query: 64 --QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + + + + + T ++S V+ V++ LD++ Sbjct: 73 INKTVADAQADAVQFFNATLDGYGLSATIDLSVSENDGKRSAVSSFSSSVKTHFLDMIGY 132 Query: 122 PQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 P N S ++ Y F NS + + T+S + Sbjct: 133 PTLAIGNRSTSTVSLPVY------VDFYLLLDNSPSMGVAATTSDIATMVANTSDQCAFA 186 Query: 178 L-DVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+S S + + +ID+ +++ + + V N + G+ TF + Sbjct: 187 CHDLSTSNNYYNLAKKLGVTMRIDVVRQAVQRLTTTATAMSAVTNQFRMGVYTFGSSCTA 246 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQGMRQHCNTE------- 283 L V++L + + + + YN Q D G N+ Sbjct: 247 IGLTT--VANLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTDFDGSLTAINSAIPSPGSG 304 Query: 284 -DANYKKIIVFMTDG--ENLSTKEDQQ------------SLYYCNEAKKRGAIV---YAI 325 +K + F++DG + + ++ C K RG + Y Sbjct: 305 ISTQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQEPLTVAQCTAIKSRGIQIAVLYTT 364 Query: 326 GIRVIRSH-----------------------EFLRACASPNSFYLVENPHSMYDAFSHIG 362 + + + +++CASP ++ V + +A + Sbjct: 365 YLALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSCASPGFYFEVSPTQGIAEAMDALF 424 Query: 363 KDIVTKR 369 K V K Sbjct: 425 KKAVAKA 431 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 74/218 (33%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S SM + ++ D+ ++ N ++ K + GL+ F Sbjct: 99 DLIMAIDLSGSMAIQDMKKANGQMESRFDLVMRVANQFIDTRKGD-------RVGLILFG 151 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ +++ + ++ G T + A ++ G Sbjct: 152 TRAYLQTPLTFDIATVKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS-------- 203 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRA 338 K ++ +TDGEN Q L AK+ +Y IG+ + + Sbjct: 204 ---KALILLTDGENN--SGTLQPLQAAEIAKQYHIKIYTIGLGGDQMIVETTFGQRLVNT 258 Query: 339 CA-------------SPNSFYLVENPHSMYDAFSHIGK 363 + ++ +N + + I K Sbjct: 259 SEDLDTTVLEKIATMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 71/207 (34%), Gaps = 32/207 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++ ++D S S+ T+ D + + + V F+ Sbjct: 145 MNLVFLIDNSGSIND------TEFDNFKEFAKKL---AESFTISATYTHVAAVYFNTLAN 195 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L++ ++ ++ I L G T+ L Y + +F + N K ++ Sbjct: 196 FGFNLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVFKV--------APRQNVKNVL 247 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSH-EFLRACASP---NSF 345 V +TDG K + G V+A+G+ S L AS + Sbjct: 248 VVLTDG-----KSHDSVTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHV 302 Query: 346 YLVENPHSMY----DAFSHIGKDIVTK 368 + VE+ + I KD V + Sbjct: 303 FHVEHFSQIESTTGAVEDEICKDTVHR 329 >gi|297473612|ref|XP_002686715.1| PREDICTED: collagen, type XXVIII-like, partial [Bos taurus] gi|296488704|gb|DAA30817.1| collagen, type XXVIII-like [Bos taurus] Length = 327 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 66/163 (40%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + + + + V L + G++ + Sbjct: 5 CKETPLELLFVIDSSESVG------LKNFQIIKNFVKTLTDRVAL---DLATTRVGIINY 55 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+EE L S L+R + + G T + L A N++F+ Sbjct: 56 SHKVEEVAHLTQFSSKDDLKRAVDNMQYLGEGTYTATALHAA-NRMFEAAKPG------- 107 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ A ++ IG+ Sbjct: 108 --VKKVALVITDGQ-TDTRDEKNLTEVVKNASDINVEIFVIGV 147 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 76/210 (36%), Gaps = 35/210 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ +K + L + IVLD S SM S +K+++A ++ +E ++ Sbjct: 392 LPVEMEIKGK-EQLPSLGLAIVLDRSGSM------SGSKLELAKEAAARSVEMLRDEDT- 443 Query: 216 NNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 G + F ++ I E L I ++ G T L AY + D Sbjct: 444 -----LGFIAFDDRPWEIIETGPL-NNKEEAVDTILSVTP-GGGTEIYGSLAKAYENLAD 496 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ R+H I+ +TDG++ + + K G + + I Sbjct: 497 IKLQRKH-----------IILLTDGQSQPGNYED----LIEQGKDNGITLSTVAIGQDAD 541 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L A + FY V + ++ S Sbjct: 542 ANLLEALSEMGSGRFYNVIDEQTIPSILSR 571 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 82/245 (33%), Gaps = 31/245 (12%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + + + + ++ + A + ++ +DVS SM + D S +++ A + Sbjct: 58 PAALLLLSLTGMVLALARPARTELVPRERATI--VLAIDVSNSM-AATDISPSRLAAAKQ 114 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 A ++++ + GLV+FS ++ I L + G +T Sbjct: 115 GAQAFVDQLPP------KINLGLVSFSGTAAVLVPPTTDRDAVKAGINGL-QLGPATAIG 167 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 G+ + I + Q N+ A IV ++DGE + + Q+ AK Sbjct: 168 EGIYAGLSAINTVS--SQFVNSGQAVPPAAIVLLSDGETTRGRPNNQA---AQAAKDAHI 222 Query: 321 IVYAIGIRVIRS--------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKD 364 V I L A + S + + + + +G Sbjct: 223 PVSTIAYGTPNGTLDVGGQLIPVPVNEPALSQIAEQTGGSHHRATSGDELTSIYKGLGSS 282 Query: 365 IVTKR 369 I ++ Sbjct: 283 IGYRK 287 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 67/164 (40%), Gaps = 17/164 (10%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + +D+++ LD+S SM + + + A + I A + +K D + GLV F Sbjct: 84 SASGVDIVVALDMSGSMLAEDEGFVLNGQQATRFIIAR-DVLKKFVDKRQSDRIGLVVFG 142 Query: 228 NKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + ++ L S G T L + N++ +++ Sbjct: 143 TQAYVAVPPTLDHEFLLKNLERLGIGSINGNQTAIGSALSTSMNRLRELKSKS------- 195 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 KII+ MTDG+N + K L A+ G +Y IG+ Sbjct: 196 ----KIIILMTDGQNNAGK--VPPLTAAEAARALGIKIYTIGVG 233 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 87/236 (36%), Gaps = 25/236 (10%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 I F R ++P SV+ + +IV+D S SM KI Sbjct: 4 IKASFRMGKDSVIEGREDIIPFVLSVEGVYSAHPPIAFLIVIDTSYSM------DGEKIF 57 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG 254 A ++ +L+ ++ + V G+ F+ K + + +++ I L G Sbjct: 58 RAKQAALRLLDILR----DKDYV--GVYGFAGKFYKVLEPVPATNRNEVEKAIIGLK-LG 110 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCN 313 TN LK ++ ++ + + + I+F+TDGE K+ ++ L Sbjct: 111 SGTNIYDTLKK------LVEETKKVLESGAISLVR-IIFITDGEPTTGQKKPEKILEMAK 163 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVT 367 + ++ GA IG+ + + L A F V +P S+ S K Sbjct: 164 KLREAGASALIIGVGTEYNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYAKSTQE 219 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 78/226 (34%), Gaps = 52/226 (23%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D +D+++ LD S SM S S+ +++ A K I ++E + GLV FS Sbjct: 86 SDQGIDIIVALDTSTSMRSLDFRSLNRMEAAKKVIRDFMKE-------RKYDRIGLVIFS 138 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L L I ++ T + + N++ D + Sbjct: 139 GLAFTQCPLTTDKDSLAEFINNINIGDTGLDGTAIGSAIMTSVNRLKDSRAKS------- 191 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------ 332 +II+ +TDG N + D + A+ +YA+G+ + Sbjct: 192 ----RIIILVTDGNNNMGEIDPLTAS--KIARSYDIKIYAVGVGSLDGAIYEVDDPFLGK 245 Query: 333 -----------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+ A + ++ ++ +F +I K I Sbjct: 246 REIKYRKDAINESVLKEVAYNTSGGYFRAQD----VKSFENIMKQI 287 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 86/227 (37%), Gaps = 46/227 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + S +MM+ +D+SRSM++ + + ++ + + +++ K Sbjct: 76 EPQSIPQQGREMMLAVDLSRSMQAEDMQINNRMVDRLSLVKTVVADFIQQRKGD------ 129 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F++ L + + + ++ L G T G+ A + Sbjct: 130 -RVGLIFFADNAYLQAPLTFDLKTVSGYMQQAVLGLVGEQTAIGEGIGLALKRFDAAD-- 186 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----- 331 N +K+++ +TDG+N + + + L A+++G +Y IG+ Sbjct: 187 ---------NPQKVLILLTDGQNSA--GEVKPLDAAKFAQEQGVKIYTIGVGADAYYKRT 235 Query: 332 --SHE-----------FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++ L+ A + ++ + S+ ++ + K Sbjct: 236 LFGNQKVDPSRDLDEVTLKTIAAQTGGQYFRARDASSLAAIYAELDK 282 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 58/169 (34%), Gaps = 21/169 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ DVS SM + D + T+++ A + ++ V+ G+V F Sbjct: 89 VVLAFDVSGSM-AATDIAPTRLEAAKAAARGFVQRQPAA------VRIGIVAFGATGLVT 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 + + I LS G T GL+ A I + + Sbjct: 142 QQPTSDRASVVAAIDRLSPQG-GTALGGGLQTALGAIVGKPVVVPGSDPGGGPEPSGPDL 200 Query: 290 ------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +V +TDGEN + + L + A G VY IG+ Sbjct: 201 GYHGSAAVVLLTDGENTAQPD---PLQVADIASTAGVKVYPIGLGSPAG 246 >gi|221111394|ref|XP_002160866.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 419 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 48/282 (17%), Positives = 97/282 (34%), Gaps = 34/282 (12%) Query: 92 MSFRNELRDNGFV---NDIDDIVRSTSLDIVVVPQNEGYS---ISAISRYKIPLKFCTFI 145 +ELR++G V I V T L + + Y+ +++ + K Sbjct: 140 ERIADELRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTDNEFIHKIKNGS 199 Query: 146 PWYTNSRHIVMPITSSVKVNSQTD-------ARLDMMIVLDVSRSMESFFDSSITKIDMA 198 +S P ++ + +S A +D+ +LD S S+E+ + Sbjct: 200 CEIVSSVLQSTPTIATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKE------- 252 Query: 199 IKSINAMLEEVKLIPDVN-NVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 L+++ + ++ N + G++TFS + E L S + + Sbjct: 253 ----KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNF 308 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T L+ A +F N K+I+ +TDG + + +E Sbjct: 309 TTRIDRALRLAQKDMFTS------ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIADEL 362 Query: 316 KKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYD 356 + G ++ +GI + E + Y S+ D Sbjct: 363 RNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 404 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSG 222 V + +A +D+ +LD S S+E+ + L+++ + ++ N + G Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKE-----------KNFLKKLAAVFGISSNGSRVG 57 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++TFS + E L S + + +T L+ A +F Sbjct: 58 VITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTS------A 111 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRAC 339 N K+I+ +TDG + + +E + G ++ +GI + E Sbjct: 112 NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 340 ASPNSFYLVENPHSMYD 356 + Y S+ D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 69.8 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 72/197 (36%), Gaps = 26/197 (13%) Query: 174 MMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++LD S SM+ + +++ D+ + ++ + + K GLV F Sbjct: 86 IALILDASESMKAQGFDVQNQHLSRFDVVKEIVSDFISQRKNDN-------MGLVVFGAY 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + V+ L + + L G+ Y + + + + K Sbjct: 139 SFIASPLTYDVNILNKILSQLQ---------IGMAGKYTALNTSLAQGANLLKQSKSKTK 189 Query: 290 IIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI---RSHEFLRAC-ASPNS 344 I + +TDG + + D +L + KK G VY IGI + + L+ S Sbjct: 190 IAILLTDGYST-PQVDTITLDIALDMIKKEGIKVYPIGIGMPHEYNTEALLKIANESGGV 248 Query: 345 FYLVENPHSMYDAFSHI 361 + + + + + I Sbjct: 249 AFGASSAAELQEVYKKI 265 >gi|50950211|ref|NP_001002980.1| collagen alpha-1(VII) chain [Canis lupus familiaris] gi|33149359|gb|AAO64414.1| type VII collagen [Canis lupus familiaris] Length = 2936 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RGNFREVRGFLEGLVWPFSGAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 75/228 (32%), Gaps = 32/228 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P S + +D++ +L + + + A +++ ++ + + Sbjct: 1035 VRGPEASLTQKPVCPRGLMDVVFLL-------HTTQDNAHRAEAAKRALERLVSALGPLG 1087 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQI 270 VQ GL+++S++ F L + + +KI + N + A+ + Sbjct: 1088 P--QAVQVGLLSYSHRPSPLFPLNSSYNPDVILQKIHSIPYVDPSGNNLGTAVVTAHRHL 1145 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + I+V + D EA+ G V +G Sbjct: 1146 LAPDAPGR-----RQHIPGIMVLLVD-----EPLRGDIFNPIREAQAAGLKVMILGQAGA 1195 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFS-------HIGKDIVTK 368 + R + +F+ V++ S+ A S IG I + Sbjct: 1196 DPEQLRRLVPGMDPVQTFFAVDDGSSLDRAVSGLATSLCQIGSTIQPQ 1243 >gi|3850207|gb|AAC72024.1| alpha-1 type VII collagen non-collagenous domain [Canis lupus familiaris] Length = 1253 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 74/203 (36%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + ++ ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RGNFREVRGFLEGLVWPFSGAASAQG----VRFAAVQYSDDPRT 90 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + R I+ LS G +T + + + + +F Q R K+ Sbjct: 91 EFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFLPQLARPG-------VPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 + +TDG K K +G ++A+GI+ E R + + + F+ V Sbjct: 144 CILITDG-----KSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFFFFV 198 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 + + +V++R+ Sbjct: 199 NDFSILRTLL-----PLVSRRVC 216 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 74/228 (32%), Gaps = 32/228 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P S + +D++ +L + + + +++ ++ + + Sbjct: 1035 VRGPEASLTQKPVCPHGLMDVVFLL-------HTTQDNAHRAEAVKRALERLVSALGPLG 1087 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQI 270 VQ GL+++S++ F L + + +KI + N + A+ + Sbjct: 1088 P--QAVQVGLLSYSHRPSPLFPLNSSYNPDVILQKIHSIPYVDPSGNNLGTAVVTAHRHL 1145 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + I+V + D EA+ G V +G Sbjct: 1146 LAPDAPGR-----RQHIPGIMVLLVD-----ESLRGDIFNPIREAQAAGLKVMILGQAGA 1195 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFS-------HIGKDIVTK 368 + R + +F+ V++ S+ A S IG I + Sbjct: 1196 DPEQLRRLVPGMDPVQTFFAVDDGSSLDRAVSGLATSLCQIGSTIQPQ 1243 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 82/241 (34%), Gaps = 30/241 (12%) Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 ST L I ++ + +I + L +I W P ++ + R Sbjct: 29 ASTRLRIPSFAKH-NLTSQSIEPHARRLNPLEWIIWLLLVTAAANPTWLDEPISLPNEGR 87 Query: 172 LDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+ +D+S SM + + ++ M ++ +E+ + GL+ F Sbjct: 88 -DIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFG 139 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L V + + + G +T L + + + Sbjct: 140 DTAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFANKDESN-------- 191 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPN 343 +I+V +TDG+N + + + A++ G VY IG+ R S Sbjct: 192 ---RIVVLLTDGQNTAGNLNPEDALLL--AREEGIKVYTIGVGSDNPRGFSLFNVGGSSG 246 Query: 344 S 344 S Sbjct: 247 S 247 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 69/193 (35%), Gaps = 22/193 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ +LD S S+ S+ + K+ +K++ V ++T+S Sbjct: 175 QTSVDLVFILDTSGSVGSY---NFEKMKTFVKNVVDFFNIGPKGTHVA------VITYST 225 Query: 229 KIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L+ + L+ + + T + L A IF + D Sbjct: 226 WAQVEFNLKAHHSSKAALKNAVNAIYYRSGWTYTADALDLAGRNIFQVAN----GMRPDK 281 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--N 343 KI V +TDG + L N+ + G V+ +GI E P + Sbjct: 282 GIPKIAVLLTDGYSNGNNP----LGPANDLRAAGVNVFCVGIGNYYERELNDIATDPDKD 337 Query: 344 SFYLVENPHSMYD 356 + +EN + + Sbjct: 338 HVFKLENFNDLNS 350 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 81/213 (38%), Gaps = 44/213 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ I LD+S+SM D + +++ A + ++ ++ K D + L+ F Sbjct: 82 IDIAISLDLSQSMLQE-DFTPNRLEKAKEVLSEFID--KRTDD-----RLALIVFGGDAY 133 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNS----TPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + ++ L+ +++N+ G+ A N++ D + Sbjct: 134 TKVPLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNRLKDSEAKS---------- 183 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------------- 331 K+I+ +TDGEN + + AK+ G +Y IGI Sbjct: 184 -KVIILLTDGENN--SGEMSPSAAADIAKELGIKIYTIGIGAKEIKVPSFFGYTTVKNTE 240 Query: 332 -SHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L++ A + ++ + + F+ I Sbjct: 241 LDENMLKSIAETTGGEYFRASDSKEFKEIFNKI 273 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 74/217 (34%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + S+ ++ A +++ + GL+ F Sbjct: 91 DLMLAVDLSGSMRTDDFQLSGRSVDRLTALKAVAGAFIDQ-------RQGDRIGLILFGE 143 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + R + + G T + A ++ +D Sbjct: 144 QPYIQAPLTFDHNTVTRLLHEAVVGLAGNKTAIGDAIGLAVKRL-----------RKDPQ 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------------- 332 K +++ +TDG + S D L A +RG VY IGI Sbjct: 193 AKNVLILLTDGASNSGSLDP--LKAAKLAAQRGLKVYTIGIGAEAVEVGSFFFKRTVNPS 250 Query: 333 ----HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+A A + ++ + + + + + Sbjct: 251 LDLDEKTLKAIAETTGGRYFRARDTEELAQIYQQLDQ 287 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 65/160 (40%), Gaps = 22/160 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L+++ V+D S S+ + + + + + V L + G++ +S+K Sbjct: 649 TPLELLFVIDSSESVG------LENFQIIKNFVKTLTDGVAL---DLATARIGIINYSHK 699 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +EE L S L+R + + G T + L A N++F+ Sbjct: 700 VEEVAHLTQFSSKDALKRAVDNMQYLGEGTYTATALHAA-NRMFEAS---------RPEV 749 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ A ++ IG+ Sbjct: 750 KKVALVITDGQ-TDTRDEKNLTDVVKNASDINVEIFVIGV 788 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 81/203 (39%), Gaps = 18/203 (8%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + F K + S++ L ++ + + ++ + FS+ ++ Sbjct: 48 IDLIFIVDSSESSKIFL---FDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQ 104 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++++K ++ G T S + A R + K Sbjct: 105 IDPPFSSWKDLHTFKQRVKSMNFIGQGTFSYYAIANA---------TRLLKREGRKDSVK 155 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYL 347 + + MTDG + D QS+ +A+ G I IG+ + + L + SP L Sbjct: 156 VALLMTDGIDHPKNPDVQSIS--EDARNAGIIFITIGLSTVVNETKLHLISGNSPGEPIL 213 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + N ++ D + + K+ Sbjct: 214 LLNDSTLVDKIQNRLDTLFEKKC 236 >gi|126334034|ref|XP_001370526.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 1247 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 81/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D S S++S ++ ++++ +E + L+ +SN+++ Sbjct: 203 DIVFVIDGSTSIDSH---DFQQMKSFVRAVINQFKETNTLFS--------LMQYSNRLKI 251 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + T + ++ ++F Q N N KI Sbjct: 252 HFTFADFQRSTNWGNLVNPILQLRGLTYTATAIRKVVTELF------QSRNGARKNATKI 305 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACASP---N 343 ++ +TDGE K + + EA++ G I YAIG+ + E L AS Sbjct: 306 LIVITDGEKYQDKLQYKDVIP--EAEQAGIIRYAIGVGDAFEYASAQEELNIIASQPAKE 363 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V+N + A I +D+ K Sbjct: 364 HVFQVDN----FSALKTIQEDLQEK 384 >gi|47207527|emb|CAF87062.1| unnamed protein product [Tetraodon nigroviridis] Length = 409 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 87/219 (39%), Gaps = 32/219 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + A D++ VLD S S+ ++ D A + + + + + ++ Q G+V Sbjct: 3 AGCSTAANDLVYVLDGSWSVG------VSDFDTAKQWLINITSQFDI---SSHYTQVGVV 53 Query: 225 TFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS-------TPGLKYAYNQIFDMQG 275 +S+ ++E G L R I+ +S G +T + P +K+A + +F Sbjct: 54 QYSDAPRLEIPLGKHQGQDELIRAIQSISYLGGNTQARRRSERHLPAIKFAVDHVFSSSQ 113 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +I V +TDG++ D EA+ +G V+A+G+ + Sbjct: 114 RASQVK------NRIAVVVTDGKSQDDVVDASM-----EARTQGVTVFAVGVGSEITTSE 162 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A A S E+ +++ + + + + + Sbjct: 163 LIAIANKPSSTYVLYAEDYTTIHHIRDAMEQKLCEESVC 201 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 75/192 (39%), Gaps = 16/192 (8%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNV 218 +V+S D +D+ VLD S S+ + N ++++ D N V Sbjct: 30 TQVSSAEDCPVDLFFVLDTSESVALRVKPFGDLVTQVKDFTNQFIDKLTQRYYRCDRNLV 89 Query: 219 VQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 +G + +S++ I+ + G S L+ ++ ++ G T + +K ++ Sbjct: 90 WNAGALHYSDEVVLIKSLTPMPSGQSELKNRVSAINYIGKGTYTDCAIKRGIEELLIGGS 149 Query: 276 MRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + K ++ +TDG L + NEAK G V+++ I Sbjct: 150 HHKEN--------KYLIVVTDGHPLEGYKEPCGGLDDAANEAKLLGIKVFSVAISPNHLD 201 Query: 334 EFLRACASPNSF 345 + L A+ +++ Sbjct: 202 QRLNIIATDHAY 213 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/313 (12%), Positives = 99/313 (31%), Gaps = 36/313 (11%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 ++ + + T L+ R+ + LRD +N + D+ Sbjct: 691 QAVKNLQWIAGGTYTGEALQFTKDNLLRRFTSDKRVAIVITDGRSDTLRDPTPLNSLCDV 750 Query: 111 VRSTSL---DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 SL DI P N ++ I+ P + I + + ++ Sbjct: 751 TPVVSLGIGDIFRNPPNPD-HLNDIACLSRPTRPGLSIQRDNYAELLDDTFLQNITSYIC 809 Query: 168 T-------------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM-LEEVKLIP 213 D+ +++D S S+ S + K + + ++ Sbjct: 810 QEKKCPDYTCPITFTGPADITLLVDSSTSVGS------KNFETTKKFVKQLSGRFLEASK 863 Query: 214 DVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + V+ +V +S + ++ ++ + + + I + +T+ L++ Sbjct: 864 PTDESVRISVVQYSGRNQQKVEAQFQYNYTVIAKAIDNMEFMNDATDVNSALQFI----- 918 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + A KK ++ +DG + +A+K G +Y + + Sbjct: 919 ----TELYRRSARAAAKKRVLVFSDGHSQGITARAIERAV-QDAQKAGIEIYVLAVGSQA 973 Query: 332 SHEFLRACASPNS 344 + +R + S Sbjct: 974 NEPNIRVLVTGKS 986 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 56/155 (36%), Gaps = 24/155 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + +A I +++ + V + G+ Sbjct: 611 CTCGPVDLLFVLDSSESIG------LQNFQIAKDFIIKVIDRLSKDERVKFDAGESRVGV 664 Query: 224 VTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + L++ +K L T + L++ + + Sbjct: 665 VQYSHDNTQELVAMGDANIDNIGALKQAVKNLQWIAGGTYTGEALQFTKDNLLRRFTSD- 723 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 K++ + +TDG + + ++ C+ Sbjct: 724 ---------KRVAIVITDGRSDTLRDPTPLNSLCD 749 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 49/334 (14%), Positives = 107/334 (32%), Gaps = 76/334 (22%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G + + AI + + L +G ++ + + +++ + S +D +LV A +I Sbjct: 18 TMVSDRGGNVALTVAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAIKEI---- 73 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 I F + D+ +I S + + Sbjct: 74 ---------DEYDEDEIAEKIKDWFDAQSEKQSATYDLTEITVDKSGHTITASASGTVPT 124 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM----- 184 + ++ I + + + + S+++ + L++ IV+D S SM Sbjct: 125 TLMTLADI--------------KTVPVGVISAIE--GPATSYLEVYIVIDKSPSMLLAAT 168 Query: 185 -------------ESFFDSSITK--------------------------IDMAIKSINAM 205 F TK D+A+ ++ + Sbjct: 169 SEDQAMLRADANITCEFACHDTKDPVKKNGTVIASTYYNYIKSLGVKLRTDVALDAVEEV 228 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG--L 263 L+ V + + ++ GL + I E + S ++K+ S S S Sbjct: 229 LDMVDAADEDHARIKVGLYSLGETISEVLEPTYSTSTARKKLSDDSSGLTSATSMSATYF 288 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + A + G T A+ K+++ +TDG Sbjct: 289 QTALKALKKKVGT-AGDGTSAASPLKLVLLLTDG 321 >gi|149635474|ref|XP_001506111.1| PREDICTED: similar to alpha-1 type VII collagen [Ornithorhynchus anatinus] Length = 2993 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 67/188 (35%), Gaps = 23/188 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + D + + V+ + V+ G V +S+ Sbjct: 38 DVVFLVDGSSSIGRNNFRMVR--DFLEGLVLPFVNVVR-----DTGVRFGAVQYSDDPRT 90 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S + R ++ L+ +T + GL++ F H K+ Sbjct: 91 EFALGTHASGQEVMRAVRELNYKRGNTRTGAGLRHVAEHFF-------HPQLARPGVPKV 143 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 + +TDG + K + V+A+GI+ E + +SP + + Sbjct: 144 CILITDGR-----SQDDVEQGALKLKNQNVKVFAVGIKNAHEEELRKVASSPVEEYHFFI 198 Query: 349 ENPHSMYD 356 + + Sbjct: 199 PDFAILRT 206 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 72/226 (31%), Gaps = 35/226 (15%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + I F + +G + I F ++ G+ I++S + K+ L S Sbjct: 1 MKIFRFLSDRRGNIAIAFGSFAFLLTAGSGVGIDMSRVVTEKSRLQSAA----------- 49 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 K G ++ + L + Sbjct: 50 ---DATALAANYKSGTYTAEQIRQHAEAYFDGLY-----------TAPERGSVSRNVTVG 95 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 +IS + +P TF I + + KV + A D+++VLD S SM Sbjct: 96 DGTISVEAGVTMP----TFFAPLLGVEEISFAVMAESKVGT---ASFDVVLVLDNSGSM- 147 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 S +T + A + L + I ++ V GLV F+ + Sbjct: 148 --AGSRMTTLKQAASDLIRTLMSINEISTEDDRVMVGLVPFTAFVN 191 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 62/182 (34%), Gaps = 43/182 (23%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGM---RQHCNTEDAN 286 + + + Q + ++ TN G+ + + + + R + + Sbjct: 369 KTQPITDLTNDKQALLDAVAAMRADGYTNIHQGVVWGWRVLTPQEPFSRGRSPDQKREKD 428 Query: 287 YKKIIVFMTDGENLS----------------------------------TKEDQQSLYYC 312 +++I++ MTDG N K D+++ C Sbjct: 429 HRRIMIVMTDGANTYQDKSSSHNRTEYNAYGYGTEQRLGSGIDTAGEIAAKMDERTALAC 488 Query: 313 -NEAKKRGAIVYAIGIRVIR--SHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTK 368 N A VY I +V + + LR CAS P + + + AF IGK+I Sbjct: 489 RNAATYEATQVYTIAFQVGDYATRKLLRDCASSPEMAFDAGSNSELVTAFERIGKEISRL 548 Query: 369 RI 370 R+ Sbjct: 549 RL 550 >gi|291228410|ref|XP_002734180.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 945 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 109/321 (33%), Gaps = 45/321 (14%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG-FVND 106 + I + +V I N R+ ++ I T N+L N F Sbjct: 49 EQMELAIRQKIVSTVKAI-----ENIRESVEKSFEQTEIAFTPCCEIPNDLEFNTAFGKS 103 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH---IVMPITSSVK 163 +D + T + P S I K + + W + S + Sbjct: 104 LDPRIGCTRIADNADPNPRFLSNEVIDVMKENYNSFSNLKWQYFGSEEGIFTIYPASLID 163 Query: 164 VNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + D R +++IV+D S SM + S + I++AI + +L+ + Sbjct: 164 DCANYDHRFRPWYVEAATPEPKNVVIVIDTSGSM-ANLHSGKSLINIAIDAAITVLDTMN 222 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW--------------GVSHLQRKIKYLSKFGVS 256 + G++ FS++++ + + +L++ + L G Sbjct: 223 PND------KVGVIAFSDELKLPPKIGDASCYANELALATTINIQNLKQFVLSLVARG-G 275 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ-QSLYYCNEA 315 T+ A+N + + + ++I+F+TDGE L K + + NE Sbjct: 276 THYGKAFDAAFNLLKESYTLDADNERGKIERDQVIIFLTDGEPLDDKTSIMRKIRSNNEE 335 Query: 316 KKRGAIVYAIGIRVIRSHEFL 336 + + G+ + FL Sbjct: 336 MENKVTILTFGLGLDSGINFL 356 >gi|114586161|ref|XP_001172548.1| PREDICTED: hypothetical protein [Pan troglodytes] Length = 543 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 83/210 (39%), Gaps = 27/210 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +++ D++ ++D S S+ + + + +M+ + + D +Q G Sbjct: 86 SSEACKNSKADIIFLIDGSESISP------KDFEKMKRFVESMVNQSNIGTDG---IQIG 136 Query: 223 LVTFSN-KIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FS+ +EEF L ++ + R + + T + L + +G Sbjct: 137 LLQFSSIPLEEFRLNQYSSKVDIYRATFDVQQMRDGTRTGKALNFTLPFFDSSKG----- 191 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + ++ ++ +TDG + + + I++AIG+ + + L Sbjct: 192 --GRPSVQQYLIVITDG-----VAQDNVIIPAKALRDKNIIIFAIGVGEAKKSQLLEITN 244 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + Y N ++A ++ K+I++K Sbjct: 245 DEDKVYHDVN----FEALQNLEKEILSKVC 270 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 32/95 (33%), Gaps = 7/95 (7%) Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +T N + ++ +TDG K + G +YAIGIR E Sbjct: 5 FADTSRINVARYLIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGIREANIDELKEI 59 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + V + D + +DI + + Sbjct: 60 A--KDKIFFVYEFDLLKDIQKEVVQDICSSEACKN 92 >gi|221109964|ref|XP_002168937.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 221 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 70/189 (37%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +LD S S+ ++ + + I + + ++ ++TFS + Sbjct: 31 VDIVFLLDSSASLRKYYQNEKDFLKSLISAFGVSINS----------TRAAVITFSYHAQ 80 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L ++ + + + G +T L+ A ++F+++ N K Sbjct: 81 LSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELE------NGARPGVAK 134 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLV 348 I+ +TDG + + + NE + G + IGI + E + Y Sbjct: 135 ILFLLTDGSQTQERGSENPVVIANELRSAGVTIIVIGITNAVDVSELFDIAGGEENAYFA 194 Query: 349 ENPHSMYDA 357 ++ + D Sbjct: 195 DSFEKLKDV 203 >gi|158259621|dbj|BAF85769.1| unnamed protein product [Homo sapiens] Length = 1105 Score = 69.4 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 48/296 (16%), Positives = 98/296 (33%), Gaps = 41/296 (13%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL--------------KFCTFI 145 G N ++ +T I + ++ Y++ I+ K K Sbjct: 68 SGGKTNQLNLQNTATKATIQGLMPDQNYTVQIIAYNKDKESKPAQGQFRIKDLEKRKDPK 127 Query: 146 PWYTNSRHIVMPITSS---VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P SS VK QT A D++I++D S S+ F + + Sbjct: 128 PRVKVVDRGNGSRPSSPEEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFL 181 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNST 260 ++ + + + GL +S + L + + ++ L G +T + Sbjct: 182 ENLVTAFDV---GSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTG 238 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L Y + F + KI + +TDG K + ++ G Sbjct: 239 LALNYIFENSFKPE------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGV 287 Query: 321 IVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 ++AIG++ +E + P+ Y V M+ + + + ++ D+ Sbjct: 288 ELFAIGVKNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|292618048|ref|XP_699485.3| PREDICTED: integrin alpha-1-like [Danio rerio] Length = 1201 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 123/332 (37%), Gaps = 45/332 (13%) Query: 65 IMNEGNGNNRKKLKGGDILCRI-KNTWNMSFRNELRDNGFVNDIDDIVRSTSL--DIVVV 121 ++ K G CR+ N+ + + L +N + +I+++ + ++ +VV Sbjct: 57 LIGSPLTGQPAKRTGDVYRCRVGSNSDSKCEKFLLSENTTIPNINEVKENMTMGTTLVVN 116 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDV 180 P G+ ++ +Y +I ++ + +S+ + ++ +D++IVLD Sbjct: 117 PDGNGF-LACGPQYGYMCGKQQYITGICSNVSSSFKVLNSIAPTVRECSQDMDIVIVLDG 175 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S S+ + D + L+ +++ P + G+V++ + + F L Sbjct: 176 SNSIYPW--------DHITDFLVKFLQNIEIGP-----ARVGIVSYGDDVGHVFNLSQFS 222 Query: 241 SH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + ++ + G T + G+ A + F ++ KK++V +TDG Sbjct: 223 NTKELVKNAADIRQRTGHKTMTALGIDTARKEAFTVE------RGARPGVKKVMVIVTDG 276 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSH-----------EFLRACAS---P 342 E+ + + C + G +A+ + E + AS Sbjct: 277 ESHDHHNLKSVIDQCQ---EDGIERFAVAVLGDYNRQNKSIDEIKKFIEEIEYIASETKS 333 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + F+ V + ++ +G I K Sbjct: 334 DHFFNVSDERALVTIVDTLGSKIFALEATSGK 365 >gi|73956560|ref|XP_848795.1| PREDICTED: similar to von Willebrand factor A domain-related protein isoform 1 [Canis familiaris] Length = 415 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 75/212 (35%), Gaps = 25/212 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + + D++ +LD S S+ + S + + + ++ + L P + Sbjct: 21 GAERGAPASAPQGDLLFLLDSSASVSHYEFSRV------REFLGQLVALLPLGPGA---L 71 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ LV ++ F S +Q I+ ++ TN+ L YA Q+F Sbjct: 72 RASLVHVGSRPYTEFPFGQHSSGEAVQDAIRAAAQRMGDTNTGLALAYAKEQLF------ 125 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K++V++TDG + E K G V+ + E L Sbjct: 126 AEVAGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNLLE-LS 179 Query: 338 ACAS--PNSFYLVENPHSMYDAFSHIGKDIVT 367 A AS P + ++ + I+ Sbjct: 180 AAASAPPEKHLQFVDVDDLHIIAQELTGSILE 211 >gi|221111396|ref|XP_002160892.1| PREDICTED: similar to proximal thread matrix protein 1 [Hydra magnipapillata] Length = 315 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 48/282 (17%), Positives = 97/282 (34%), Gaps = 34/282 (12%) Query: 92 MSFRNELRDNGFV---NDIDDIVRSTSLDIVVVPQNEGYS---ISAISRYKIPLKFCTFI 145 +ELR++G V I V T L + + Y+ +++ + K Sbjct: 34 ERIADELRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTDNEFIHKIKNGS 93 Query: 146 PWYTNSRHIVMPITSSVKVNSQTD-------ARLDMMIVLDVSRSMESFFDSSITKIDMA 198 +S P ++ + +S A +D+ +LD S S+E+ + Sbjct: 94 CEIVSSVLQSTPTIATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKE------- 146 Query: 199 IKSINAMLEEVKLIPDVN-NVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 L+++ + ++ N + G++TFS + E L S + + Sbjct: 147 ----KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNF 202 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T L+ A +F N K+I+ +TDG + + +E Sbjct: 203 TTRIDRALRLAQKDMFTS------ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIADEL 256 Query: 316 KKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYD 356 + G ++ +GI + E + Y S+ D Sbjct: 257 RNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 298 Score = 40.1 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 27/78 (34%), Gaps = 1/78 (1%) Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRA 338 N K+I+ +TDG + + +E + G ++ +GI + E Sbjct: 5 ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHI 64 Query: 339 CASPNSFYLVENPHSMYD 356 + Y S+ D Sbjct: 65 TGGKKNAYTAATFDSLTD 82 >gi|268575272|ref|XP_002642615.1| C. briggsae CBR-MUP-4 protein [Caenorhabditis briggsae] gi|187032080|emb|CAP28849.1| CBR-MUP-4 protein [Caenorhabditis briggsae AF16] Length = 2101 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 70/206 (33%), Gaps = 22/206 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSG 222 + D++ ++D S S+ S+ + + L E +L + Sbjct: 428 STACPAQPTDLVFLIDGSGSIGSY---------VFQTEVLRFLAEFTELFDIAPQKTRVS 478 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++I F L+ LQ I+ + T + +++ N+ F + Sbjct: 479 VVQYSDQIRHEFGLDNYSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFS----ERRG 534 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRAC 339 ++ + +TDG + N A K+ ++A+G+ + E Sbjct: 535 ARPVGQVSRVAIVITDG-----RSQDNVTRPSNNAHKQDIQLFAVGVTNHVLDAELEEIS 589 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDI 365 S + + V + + + Sbjct: 590 GSKDRTFHVSGFEDLNTRLRSAIQRV 615 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 88/265 (33%), Gaps = 51/265 (19%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + + +T + +SQT + D+++V D S SM + + +++++A ++ Sbjct: 583 IKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHN-SRLEVAKTAV 641 Query: 203 NAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 N+M + + + ++ ++ LV FS + + L G + Sbjct: 642 NSMAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGADGGT---- 697 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS------------ 308 N ++ + KK IVFM+DG+ ++ Sbjct: 698 -------NWEAALKAANAKLTSGRKGVKKYIVFMSDGDPTYRTSSVRTGTDWLGRPIYDA 750 Query: 309 ------------------------LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ A ++++G+ + A + S Sbjct: 751 DDGWGLPAGVHGSGLSDRYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGS 810 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKR 369 +Y + + AF+ I I K Sbjct: 811 YYSATSTDELNKAFADIIGQINRKS 835 >gi|297667858|ref|XP_002812180.1| PREDICTED: vitrin-like isoform 1 [Pongo abelii] Length = 693 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 445 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 504 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 505 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 547 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 548 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 601 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 602 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 653 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 654 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 683 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 62/165 (37%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + A+ + + + P G+V + + Sbjct: 307 IDLSFLIDGSTSIGK------RRFRIQKQLLAAVAQALDIGPAGP---LMGVVQYGDSPA 357 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 358 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 411 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 412 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 451 >gi|225310541|emb|CAQ19231.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 806 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 79/199 (39%), Gaps = 30/199 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + A+++ + N + GLV + Sbjct: 436 CKERPMELVFVVDSSESIGP------ENFEIIKDFVAALVDRL---TIGRNATRVGLVLY 486 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S +++ F L + ++ I+ + G T + + A ++ F Sbjct: 487 SLEVQLEFNLARYTTKQDIKEAIRRVHYIGEGTYTGSAIHNATHEAFYSA---------R 537 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRAC-- 339 KK+ + +TDG+ +E + EA +YA+GI +EFL+ Sbjct: 538 TGVKKVAIVITDGQ-TDKREPVKLEIAVREAHAANIEMYALGIMNISNPTQNEFLQELSL 596 Query: 340 -AS---PNSFYLVENPHSM 354 AS Y +++ +++ Sbjct: 597 IASHPDSEHMYYIKDFNTL 615 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 39/305 (12%), Positives = 96/305 (31%), Gaps = 36/305 (11%) Query: 47 KTVLHSMIDR--SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN-----MSFRNELR 99 K + +R S + + G+ + L + + + + Sbjct: 115 KNASQTRNNRRNSGAAPGKNLTSVGSSTAEQSLGMETAPPPLSTVLTHPAKDRAEKKRHK 174 Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + + S+ + P S + P + R Sbjct: 175 GTQPKGSGETMRSQPSVGVTQTPPPTTKSSQRATANTEREPPGKEKPTRDSLRR------ 228 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ T A D+++++D S S+ + + + ++A++ + D + Sbjct: 229 -GSQLQCDTSAMTDIVLLVDGSWSIG------RSNFKLIKEFLSALISPFNIAQDK---I 278 Query: 220 QSGLVTFSNKIEEFFLLE-WGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL +S+ + L + + ++ L G +T + L + Q Sbjct: 279 RVGLSQYSSDPRTEWDLSAYATRDQVLEAVRNLRYKGGNTFTGLALTHVLEQ------NL 332 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +K+++ +TDG K + K G ++AIG++ E + Sbjct: 333 KPDAGARLEAEKLVILLTDG-----KSQDDANLAAQTLKNMGIEIFAIGVKNADEAELKQ 387 Query: 338 ACASP 342 + P Sbjct: 388 VASEP 392 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 79/215 (36%), Gaps = 31/215 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + + + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIQSFVKTLADRVAL---DLGTARIGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F + R Sbjct: 843 SHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAA-NDMF--KEARPG----- 894 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF------LRA 338 KK+ + +TDG+ +++ ++ +A ++ IG+ F + Sbjct: 895 --VKKVALVITDGQ-TDSRDKKKLADVVKDANDSNVEIFVIGVVKKDDPNFEIFHKEMNL 951 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A+ V + K ++K+I D Sbjct: 952 IATDAEH--VYQFDDFFT-LQDTLKQKLSKKICED 983 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +LD S S + + + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDVVFILDSSESSKIVLFDNQ---KDFVDSLSEKIFQLTPGRSLKYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L W + ++++K L+ G T S + A ++ +G + K Sbjct: 104 IDPPLSSWKDLRTFKQRVKSLNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D QS+ +A+ G +G+ + + LR Sbjct: 155 VALLMTDGIDHPKSPDVQSIS--EDARILGISFITVGLSTVVNEAKLR 200 >gi|198426626|ref|XP_002122822.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1823 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 92/279 (32%), Gaps = 34/279 (12%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW-YTNSRHIVMPITSSVK 163 N I + N + S L N P + Sbjct: 1561 NAISSLKNGHINCTDDNFVNSVCTFQCDSSNGYFLFPANETTSTCLNDTTWNKPTPCCAR 1620 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 A +D ++V+D S S+ + + +L L D + + Sbjct: 1621 SCP-PYASMDFVMVVDSSSSIG------RKNWRIMKEFGRNVLNTFVLAEDAA---RMAV 1670 Query: 224 VTFSNKIEEFFL--LEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ K++ L + + ++ G T + L++A + + + Sbjct: 1671 FRYNRKVDNVTQVLLNDHIGDKDAFLAAYDRINYNGRGTWTGHALQHAKDVMLAAE---- 1726 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-----RVIRSH 333 N A+ K +++ +TDG + + + + G + YA+GI + I + Sbjct: 1727 --NGNRADIKDVVLTITDG-----RAQDNVVDISKQLRDMGVLTYAVGIVPGNKKAIDRN 1779 Query: 334 EFLRACASPNSFYLVENP-HSMYDAF-SHIGKDIVTKRI 370 + + S + ++VE+ + + F S + + I T R Sbjct: 1780 QLMAIAGSEENLFMVESGFQGLDEQFASKLARSICTNRC 1818 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 74/213 (34%), Gaps = 35/213 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D +++LD S S++ + I + ++ + + ++ + Sbjct: 650 AAMDFVLILDSSSSVKRRNWNIIKRFGK---------NILQAFDLGEETGRMAVFRYNKR 700 Query: 230 IEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ L+ S + + G T + L++A N I + N Sbjct: 701 VDTNTQILFKDHLKNRTSFFKDY-DRIPYNGGGTLTGQALRHAKNVILAEE------NGN 753 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-----SHEFLRA 338 + +++ +TDG + E + GA+ Y IG++ +E L Sbjct: 754 RPSVVDVVLTITDG-----RAQDDVGNISRELRANGALTYVIGVQPGNKKKLDENELLEI 808 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + +LV + + A + +++ I Sbjct: 809 AGRKENMFLVTSG---FAALDQQFAEKLSQSIC 838 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 47/133 (35%), Gaps = 19/133 (14%) Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G + E L G +++ SK + + T + L++A + +F + Sbjct: 956 GYSVYPETNNEITCLSNGT--WDKQVPCCSKEYHLGTLTGQALRHAKDVMFSEE------ 1007 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEF 335 N +++ +TDG + G + Y IGI+ + +E Sbjct: 1008 NGNRPGVVDVVLTITDGR-----SQDHVRNVSRALRANGVLTYVIGIQPGNNKKLDKNEL 1062 Query: 336 LRACASPNSFYLV 348 L S + +LV Sbjct: 1063 LDIAGSRENMFLV 1075 >gi|118763608|gb|AAI28261.1| VIT protein [Homo sapiens] Length = 657 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 409 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 468 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 469 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 511 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 512 RIGAVQYTYEQRLEFGFDKYSSRPDIINAIKRVGYWSGGTSTGAAINFALEQLF------ 565 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 566 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 617 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 618 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 647 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 64/175 (36%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V + D ++D+ ++D S S+ + + + + + + + + P Sbjct: 261 VGADVSLDCKIDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LM 311 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V + + F L+ L+ I+ +++ G +N + + F Sbjct: 312 GVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ 365 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + ++V M DG E+ L A++ G ++ I I +E Sbjct: 366 ANGNRSGAPNVVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 415 >gi|256261600|gb|ACU65922.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 77/204 (37%), Gaps = 29/204 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ S K+ +K++ + + LV FS++ +E Sbjct: 151 DIAFLIDGSGSISS---RDFNKMLSFVKAVMSQFQRPSSQFS--------LVQFSDRFQE 199 Query: 233 FFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S + + + G T + +++ +++ + KI Sbjct: 200 HFTFKDFATSSDPLNLLNSVWQLGGWTFTASAIRFVTDRLLSA------AYGARKDASKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHEFLRACASP--N 343 ++ +TDGE + ++ + A G I YAIG+ E + ++P Sbjct: 254 LIVITDGEKTEKVDYKEVIPRAEAA---GVIRYAIGVGSAFQYRNSLQELIDIASTPSKE 310 Query: 344 SFYLVENPHSMYDAFSHIGKDIVT 367 + VEN ++ D + + I Sbjct: 311 HVFQVENFDALRDIQKQLKEKIFA 334 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 67/175 (38%), Gaps = 24/175 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++MM+ +DVS SM + D +++ +I + E ++ + Sbjct: 79 GSKLREEKAQGIEMMLTVDVSNSMLAE-DFEPNRLERTKYAIGKLFEGLQQD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + + + + + + T L+ A + G Sbjct: 131 VGLVVFAGEPKVQLPITSDYRMARAFARRIDPSLVSVQGTAIGKALEQA---LLAFSGDT 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ ++I+ +TDGEN D ++ A + G ++ IGI Sbjct: 188 E------QSHGRVIILITDGENH----DDDAIAVAERAAQMGVKIFTIGIGTPEG 232 >gi|119620823|gb|EAX00418.1| vitrin, isoform CRA_c [Homo sapiens] Length = 700 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 452 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 511 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 512 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 554 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 555 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 608 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 609 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 660 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +++ I ++I Sbjct: 661 IATHPARDHSFFVDEFDNLHQYVPRIIQNI 690 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 62/177 (35%), Gaps = 22/177 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D ++D+ ++D S S+ + + + + + + + + P Sbjct: 302 GERGTRGCPDCKIDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP--- 352 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 G+V + + F L+ L+ I+ +++ G +N + + F Sbjct: 353 LMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK---- 408 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + ++V M DG E+ L A++ G ++ I I +E Sbjct: 409 --ANGNRSGAPNVVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 458 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 46/275 (16%), Positives = 95/275 (34%), Gaps = 20/275 (7%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + N + + + I+ + +S S + + + V Sbjct: 173 IPQNLSCFNTESRGCNYDFTIIAIGITRAGIMSLKSMHVAQMHKIENAGRLLVIDNDVGK 232 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ A D++ +LD S S+ ++ M + + + Sbjct: 233 SLCDAAIHGCRTAYHDLLFLLDGSSSISPND---------FSTTLTWMRDIAEQFTVGSQ 283 Query: 218 VVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G++ + ++ F L S + I +++ G + + + Q Sbjct: 284 FTRVGMMQYGDEPHTEFDLNTFQNGSQVFEAISNVTQIGGESGPYAAILQVLRRSLTAQ- 342 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 N +II+F+TDG + E+ Q++ NE + GA+VY IG+ + S Sbjct: 343 -----YGSRENVSQIIIFVTDGGVVDDSEESQTIL--NELRFSGALVYTIGVGRMVSRPQ 395 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 LR AS + + V S Y S I+ + Sbjct: 396 LRMIASRPASHHVTTIAS-YSELSATKSQIIDRIC 429 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 81/245 (33%), Gaps = 41/245 (16%) Query: 115 SLDIVVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 S+ + + + Y + +P + + I +++ Sbjct: 924 SICVGIKVNRTQIQKIVTAPYSSFFYSVSHFRELP--KIEKSLSRGICAAIAPE-CRMRA 980 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S+ ++ T + + + + + + P+ G F N+ Sbjct: 981 VDVIFLIDGSSSVT---NADFTAVKLLM---VNLTDSFDIGPE-------GARDFDNR-- 1025 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ I L +T + +K+ N F + N K+ Sbjct: 1026 ---------EDLRESIMSLEHCTGNTFTAKAIKFVTNYAFSTE------RGGRPNVLKVA 1070 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE 349 + +TDG ++ + +A G ++ IG+ + E + + + V+ Sbjct: 1071 LLITDGR---AQDYRNIRSEAEQAHTEGIKIFGIGVGEAQLSELEDMSSLPTSQHTMFVQ 1127 Query: 350 NPHSM 354 N + Sbjct: 1128 NYQDI 1132 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 49/315 (15%), Positives = 102/315 (32%), Gaps = 55/315 (17%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ + + D + RI N + L+ F I + L Sbjct: 551 RVDAQPVVVHLTDSGSTDDVTRIGNEFRA-----LKIPVFSIGIGSGIDQVELS------ 599 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 SI+ S+ + + + + +V I+ + K + LD+ +LD S S Sbjct: 600 ----SIATSSQNVMIVNS-DITLLPSQASELVTRISRAAKTE-CENIELDVQFLLDGSTS 653 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + S ++ K+ +KS+ + G+ +S+ F + + Sbjct: 654 INS---ANFAKVKDWVKSLT------STFEIGYYTTRVGINQYSDHPNVEFTMADYFTKT 704 Query: 244 Q--RKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN- 299 I + T+ + + N G R KK+++ MTDG + Sbjct: 705 SILGAIDDIVYKKGRTSIGEAINFTRVNSFTPAAGARSF-------VKKVLLVMTDGVSQ 757 Query: 300 ---------LSTKEDQQSLYY-CNEAKKRGAIVYAIGIRV-----IRSHEFLRACASPNS 344 + D ++ C EA+ ++ I + +FL+ AS Sbjct: 758 DSIQEASALMHNMSDVKTFAVLCPEAEMDLVVLLDSSIGARDNAFGNTKKFLKELASS-- 815 Query: 345 FYLVENPHSMYDAFS 359 + V +S++ Sbjct: 816 -FAVGPYNSLFSVIQ 829 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 65/187 (34%), Gaps = 19/187 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSI--NAMLEEV-KLIPDVNNVVQSGLVTFSN 228 +D+ ++ S D T+ + ++ A +++V + + ++ +S Sbjct: 441 IDLQFLIHTSIPEHEGGDIDTTRFLATMTNVFTAAFVKQVVDSFFIGIHYTRVSIIVYSG 500 Query: 229 K--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC-NTEDA 285 +E F L I ++ F P N F + +H N+ Sbjct: 501 TCSLEFSFEEYLRKGELLEYINNMTSFVQDE---PA-----NAGFALNFTLKHILNSLRV 552 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + + ++V +TD + NE + V++IGI L + A+ + Sbjct: 553 DAQPVVVHLTD-----SGSTDDVTRIGNEFRALKIPVFSIGIGSGIDQVELSSIATSSQN 607 Query: 346 YLVENPH 352 ++ N Sbjct: 608 VMIVNSD 614 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 60/190 (31%), Gaps = 22/190 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++++ ++D SRS+E+ + ++ + I + + G+ +S Sbjct: 35 GQVELGFIVDGSRSIEASACGNFKRMLDFTQRIASGFGIASSQ------TRVGVGLYSTF 88 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 S IK L G T + LK +F + H Sbjct: 89 ASVPIPFGKYTSLQETVEGIKKLRYPGEGTRTGRALKLMKTHLFSQSRPKAH-------- 140 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K+++ +TDG ++ G V+A+GI L+ A+ L Sbjct: 141 -KVLIVLTDG-----TSVDDVKAPAKALRESGVEVFAVGIGEHYRPRELKDIATDTGHVL 194 Query: 348 VENPHSMYDA 357 + Sbjct: 195 TAGFRDLMSV 204 >gi|297287371|ref|XP_001099130.2| PREDICTED: collagen alpha-2(VI) chain-like isoform 3 [Macaca mulatta] Length = 1029 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 42/233 (18%), Positives = 87/233 (37%), Gaps = 18/233 (7%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQ----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 I T ++ + +++ N+ TD + + VLD S S+ + I + Sbjct: 14 ILGATQAQQQEVISPDTMERNNNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQ 72 Query: 201 SINAMLEEVKLIPDVNNVV---QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 + + +++ ++ V + G + FS+++E F + + ++ +S F T Sbjct: 73 FVPQFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGT 132 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + L QI RQH + V +TDG + L A++ Sbjct: 133 FTDCALANMTEQI------RQHGTKGTVH---FAVVITDGHVTGSPCGGIKLQ-AERARE 182 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 G ++A+ + LR AS + +M + I +D + + I Sbjct: 183 EGIRLFAVAPNRNLKEQGLRDIASTPHELYRNDYATMLPDSTEIDQDTINRII 235 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 622 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 678 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 679 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 730 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 731 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 770 >gi|260800505|ref|XP_002595170.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] gi|229280413|gb|EEN51181.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] Length = 183 Score = 69.4 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 73/190 (38%), Gaps = 23/190 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A++D++ VLD S S+ + ++ +K + + P Q L +SN Sbjct: 4 AKMDVVFVLDGSGSVGH---DNFQRMKSFVKRVAG---SFAIGPSSA---QISLFQYSNF 54 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + F L+ + + + + + G T + L F N Sbjct: 55 VVQEFALDTYGSIGEINQAVDAVMYQGGGTATGLALYEMRQYGFS------FANGGRPGT 108 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFY 346 +++ + +TDG + T + + + G +Y +GI + +E L +P++ + Sbjct: 109 RRVAILLTDGMSSDTVDKHAMAAW-----QAGISLYVVGIGSNVDMNELLAIGGTPDNVF 163 Query: 347 LVENPHSMYD 356 + N + D Sbjct: 164 SLGNFGQLQD 173 >gi|125829720|ref|XP_698253.2| PREDICTED: collagen alpha-1(VI) chain [Danio rerio] Length = 1000 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 76/222 (34%), Gaps = 23/222 (10%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVV- 219 V +S + +D+ VLD S S+ I+ ++E+ L + V Sbjct: 23 VSRSSFRECPVDLFFVLDTSESVALRAKPPEFYINQIKTFTKLFIDELKDLRQPCDRDVT 82 Query: 220 -QSGLVTFSNKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 SG + +S+ E L + L+ I + G T + +K ++ G Sbjct: 83 WNSGALHYSDDTELVMGLVDLNTKRADLKAAIDRIKYIGKGTYTDCAIKEGIAELLRA-G 141 Query: 276 MRQHCNTEDANYKKIIVFMTDGEN--LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 H N K IV +TDG + NEA++ V+A+ I + Sbjct: 142 SHYHEN-------KYIVVVTDGHPITGYKEPCGGIQEAANEARQHAIKVFAVAISPDQED 194 Query: 334 EFLRACASP----NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A+ +F +N S + + I+ Sbjct: 195 TRLSVIATDINYRQNFTAADNSRSTQMS---TIRSIIHMICC 233 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 59/187 (31%), Gaps = 20/187 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++I++D S S+ +M + + N V+ G+ +SN Sbjct: 808 DVLIMMDSSASVG------AKNFEMTKDFSRMLAKRFLSAERGNFQVRIGVGQYSNNANL 861 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +I T T L +A + KK++V Sbjct: 862 EAEFSTNATQVVAQIADAKFQNAGTQVTNALNFAIERF----------RGGRTRKKKLLV 911 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 F +DG + Q + G +Y + + + LR S Y +N + Sbjct: 912 F-SDGRSQGVNSIQIEKAV-EQVSNAGIELYVLAVGNQVNEAHLRTLVSRGRPY--DNTY 967 Query: 353 SMYDAFS 359 + F Sbjct: 968 AYRHLFK 974 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 61/151 (40%), Gaps = 21/151 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T LD+ ++D S S+ + ++ + I +++ +KL N + G+V + Sbjct: 594 CTCGPLDIAFIVDSSESIG------ASNFAISKEFIVTIMDRLKLRQFGANESRIGVVQY 647 Query: 227 SNKI-EEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S + +E L ++ L++ +K L +T + L+Y+ + + Sbjct: 648 SGEKSQEVVQLGDPNIKTLTDLKQAVKDLRWLAEATYTGEALQYSLRNMLSKLITERS-- 705 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 +++ +TDG + + ++ C Sbjct: 706 --------VVLVLTDGRSDTIRDKVPLNVLC 728 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSME-----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DM++V+D+S SM SI ++ + +E+ K + GLV F+ Sbjct: 84 DMLLVVDLSGSMSIEDMIIKNGESIDRLAAVKDVLAEFIEQRKGD-------RLGLVLFA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + ++++++ L G ST GL A + + Sbjct: 137 QHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFINSEA---------- 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 +++I+ ++DG N + + L A + +Y +G+ Sbjct: 187 -PQRVIILLSDGANTA--GVIEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNP 243 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ NP + + I + Sbjct: 244 SQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 89/214 (41%), Gaps = 33/214 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++ L++++ +D S S+ F+ + + A K + ++L + I L+ Sbjct: 83 DRESSTPLEIVLAIDASESV---FNDEHLEREAAKKFMASLLRKQDQID---------LM 130 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F++ ++E V + + + G +T + A ++ Sbjct: 131 DFADDVDELVSFTSDVQKIDSGLGRI-HHGDATALYDAVYLASQRL--------GETPTS 181 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---------RVIRSHEF 335 A ++++V +TDGEN + + +A++ GA++YA+ I H Sbjct: 182 AGQRRVLVLITDGENTTHHGSYDAAL--EQAQRAGAMIYALIIVPVSADAGRNTGGEHAL 239 Query: 336 LRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 ++ + +Y VE+ H + AF H+ D+ T+ Sbjct: 240 IQLARDTGGKYYYVEDKHDLAPAFQHVSDDLRTQ 273 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 67/168 (39%), Gaps = 22/168 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE---VKLIPDVNNVVQSGLVTFSN 228 LD+M ++D S S + + ++ +KL + ++ FS+ Sbjct: 55 LDVMFIVDGSESTKGHL------FKQQKDFVLNFTDQISHLKLAKPWKTKTKMAIIQFSS 108 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +IE+ F GV + +R + ++ G T + + A N + H + + Sbjct: 109 SVRIEQSFNEWTGVENFKRIVNSMTYIGQGTYTYYAIMNATNIF------KAHKSAGNV- 161 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 K+ + MTDG + D + + A+ G +IG+ ++++ Sbjct: 162 --KVAILMTDGIDHPKSPD--ARQASDFARAAGINFISIGLSTQKANK 205 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 75/216 (34%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + M + V L + G++ + Sbjct: 792 CKETPLELVFVIDSSESVGP------ENFQIIKNFVKIMADRVAL---DLATARIGVINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F Sbjct: 843 SHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMF---------KDAR 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR 337 KK+ + +TDG+ +++ ++ ++ IG+ E Sbjct: 893 PGVKKVALVITDGQ-TDSRDKEKLTEVVKNVSDTNVEIFVIGVVKKNDPNFEIFHKEMNL 951 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P Y ++ ++ D + ++I D Sbjct: 952 IATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 983 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 84/206 (40%), Gaps = 24/206 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD---VNNVVQSGLVTFSN 228 +D++ ++D S S S I D ++ + +++ + V ++ + FS+ Sbjct: 47 IDIVFIVDSSES------SKIVLFDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSS 100 Query: 229 KIEEFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ W + ++K+K ++ G T S + A ++ +G + Sbjct: 101 SVQIDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TKLLKREGRKGSV------ 153 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 K+ + MTDG + D QS+ ++A+ G IG+ + + LR + S N Sbjct: 154 --KVALLMTDGIDHPKNPDVQSIS--DDARISGISFITIGLSTVVNEAKLRLISGDSSNE 209 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 L+ + ++ D + + K+ Sbjct: 210 PILLLSDPTLVDKIQDHLETLFEKKC 235 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 78/204 (38%), Gaps = 19/204 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + K + +D+ ++D S S+ + + K + L + I Sbjct: 51 SVLADKKNEVCNEGIVDIGFIMDSSGSLGKNYKNE--------KDLLKTLASLFSIKP-- 100 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N Q+G++TFS E L + + G +T GL+ A ++F ++ Sbjct: 101 NGSQAGVITFSFYTEHSIKLNQFSDQDSFNDAVDRIPLMGHTTRIDKGLRLAQKEMFKVE 160 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSH 333 N K++V +TDG K +E +K+G + AIGI I + Sbjct: 161 ------NGGRPGVSKLLVLLTDGSQTQGKGVIDPAIIADEIRKQGVPIIAIGIGKEINKN 214 Query: 334 EFLRACASPNSFYLVENPHSMYDA 357 E ++ + Y ++ + ++ Sbjct: 215 ELIKIGGGEANTYSADDFEKLKES 238 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 788 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 840 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 841 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 893 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 894 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 949 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 53/327 (16%), Positives = 112/327 (34%), Gaps = 43/327 (13%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 + F+ G + ++ + LPI+ +G +++S I+ K L +D + + A + N Sbjct: 13 KRFWTAKSGNVAVVFGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYSNN 72 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 ++ K+ G F N D +T + +G + Sbjct: 73 QQDSHLSKVAEGYF--------------------FENAGADARANTDFSYDGIFNEDGST 112 Query: 129 I-SAISRYKIPLKFCTFI----PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + + + P F + + R + S + V +Q +++++VLD S S Sbjct: 113 VLQVSAARRHPTIFGDLLSFVTAGELDWRAFPLAARSQIVVQNQ---SIELVMVLDNSGS 169 Query: 184 M--ESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNNVVQSGLVTFSNKIE---EFFL 235 M KID ++ + + + VQ G+V F+ + + Sbjct: 170 MTGRPKSGGGKRKIDTIKEAAIGLTGQFLKGAASSTLKLPVQFGVVPFAAAVNIGPDNHD 229 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 +W + + I + ++ G+K A G + + V+ Sbjct: 230 AKWMDTKGRSSIHNEYLDWANWKTSRGVKLAER---SPTGDYWQEISSKTPLTRFFVY-- 284 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIV 322 EN K + C E++ G + Sbjct: 285 --ENAHHKNELGPWLGCVESRPNGLAI 309 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 57/206 (27%), Gaps = 68/206 (33%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + I + G TN G+ + + + + Q + + K++ Sbjct: 387 PVTPLTDNQATINAAINAMDADGE-TNIPEGIAWGWRLLSAREPFTQGRANDAKDNLKVL 445 Query: 292 VFMTDGENLSTKE----------------------------------------------- 304 V MTDG+N + Sbjct: 446 VLMTDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTSKRANRSSFVS 505 Query: 305 --DQQSLYYCNEAKKRG--------AIVYAIGIRVIRS---HEFLRACAS-------PNS 344 +++ C K G +++ I + + + CAS Sbjct: 506 AMNEKVAAICQNIKDDGRKATGEDGIVIFTIAFDLNDGSSVKKLMEQCASYGITDPTKKL 565 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 +Y ++ + AF I + + + RI Sbjct: 566 YYDAKSSSDLMAAFDSITEQVSSLRI 591 >gi|198426622|ref|XP_002122673.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 721 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 32/190 (16%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++++LD S S+ ++ + + +++ + + + ++ Sbjct: 518 APMDLVLILDSSSSVKRPNWN-------TMKQFVRSIITTFNFGENEA---RMAVFRYNR 567 Query: 229 KIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +++ + + L G T + L++A N I N Sbjct: 568 QVDTRNQILLSDHINNKTTFLEAYDKLPYNGFGTFTGRALRHAKNVILAN------RNGN 621 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRA 338 N K +I+ +TDG + + E ++ G + IGI+ + L Sbjct: 622 RPNVKDVILTITDGRSQDNVATIST-----ELREMGVTTFVIGIQPGNGAGLDQDQLLAM 676 Query: 339 CASPNSFYLV 348 +P + L Sbjct: 677 GGTPENTILA 686 >gi|308474504|ref|XP_003099473.1| CRE-MUP-4 protein [Caenorhabditis remanei] gi|308266662|gb|EFP10615.1| CRE-MUP-4 protein [Caenorhabditis remanei] Length = 2161 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 22/206 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSG 222 + D++ ++D S S+ S+ + + L E +L + Sbjct: 428 STACPAQPTDLVFLIDGSGSIGSY---------VFQTEVLRFLAEFTELFDIAPQKTRVS 478 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V +S++I F L+ LQ I+ + T + +++ N+ F + Sbjct: 479 VVQYSDQIRHEFGLDNYSDRKSLQNAIRNIEYLTGLTRTGAAIEHVANEAFS----ERRG 534 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRAC 339 ++ + +TDG + + A+K+ ++A+G+ + E Sbjct: 535 ARPVGQVSRVAIVITDG-----RSQDNVTRPSDNARKQEIQLFAVGVTNHVLDAELEEIS 589 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDI 365 + + + V + + + Sbjct: 590 GAKDRTFHVSGFEDLNTRLRSAIQRV 615 >gi|297667860|ref|XP_002812181.1| PREDICTED: vitrin-like isoform 2 [Pongo abelii] Length = 678 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 430 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 489 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 490 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 532 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 533 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 586 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 587 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 638 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 639 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 668 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 62/165 (37%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + A+ + + + P G+V + + Sbjct: 292 IDLSFLIDGSTSIGK------RRFRIQKQLLAAVAQALDIGPAGP---LMGVVQYGDSPA 342 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 343 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 396 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 397 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 436 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/259 (15%), Positives = 94/259 (36%), Gaps = 47/259 (18%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 S+ S K ++ +I W I P+ + S T + D++I LD+S SM+ Sbjct: 46 NLSVKKQSFSKHIIRLFLWISWLLLVVAIARPVW-LGEPKSVTPSGRDLLIALDLSGSMQ 104 Query: 186 SF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 + + ++D A + +N + E + G++ F +K L + + Sbjct: 105 TADMKINQQAANRLDAAKQVLNRFITE-------RQGDRIGIIVFGSKAYLQAPLSYDLD 157 Query: 242 HLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + + + G +T + ++ ++ K++++ MTDG N Sbjct: 158 TIAQLVNETQIGFAGENTAIGDAIGLGIKRLANIDAD-----------KRVMILMTDGAN 206 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------SHEFLRACA- 340 + + + A K+G ++ IGI + E L+ A Sbjct: 207 TAGR--VKPDQAAQFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVAD 264 Query: 341 -SPNSFYLVENPHSMYDAF 358 + ++ ++ + + Sbjct: 265 LTQGQYFRAKSTQELASIY 283 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 78/201 (38%), Gaps = 19/201 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V D S S+ + ++ + +K I K PD +VTF+++ + Sbjct: 244 DIAFVFDASSSINANNPNNYGLMKDFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 298 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S ++ I ++ + T GL+ A ++F +K Sbjct: 299 QFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQK 353 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+N K + +K G ++ AIG+ L AS + Sbjct: 354 VVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGVGTGFLKSELINIASSEEYVF-- 408 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 S ++ S I +++V Sbjct: 409 -TTSSFNKLSKIMENVVKLAC 428 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 84/210 (40%), Gaps = 13/210 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D + D++++ D S S++ + + +K + V + + N Q G+V F Sbjct: 39 ECDVQADIIVLFDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGV--NGRNGSQFGVVQF 96 Query: 227 SNKIEEFFLLEWGVSH--LQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S ++ F L + +++ I + + G T GL++ F Sbjct: 97 SQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG-----G 151 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + + +KI++ MTDG++ + Q + ++ K G V AIGI L A+ Sbjct: 152 NPDKQKIVILMTDGKSNAGAPPQ---HEAHKLKAEGVTVIAIGIGQGFVKTELEQIATMK 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ L N S + D+ + D Sbjct: 209 NYVLTTNSFSELSTLLKLVIDLACEVCVVD 238 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 80/234 (34%), Gaps = 50/234 (21%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + V + + +D+++ +DVS SM + D ++D +++ + Sbjct: 77 PRSVDVSTQTSSTQGIDIVMAIDVSASMLA-RDLQPNRLDATKNVAEEFIQD-RPGD--- 131 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV ++ + + + I+Y + T GL A N+I D Sbjct: 132 ---RIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENGTAIGSGLATAVNRIKDS 188 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----- 328 K+I+ +TDG N + D + A + G VY IG+ Sbjct: 189 DAES-----------KVIILLTDGVNNAGFIDPSTAS--ELAVEFGIKVYTIGVGSNGMA 235 Query: 329 -------VIRS------------HEFLR--ACASPNSFYLVENPHSMYDAFSHI 361 + L+ A A+ ++ N + + ++ I Sbjct: 236 LSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKYFRATNNEKLEEIYAEI 289 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D T++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPTRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|297667864|ref|XP_002812183.1| PREDICTED: vitrin-like isoform 4 [Pongo abelii] Length = 656 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 408 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 467 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 468 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 510 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 511 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 564 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 565 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 616 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 617 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 646 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 62/165 (37%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + A+ + + + P G+V + + Sbjct: 270 IDLSFLIDGSTSIGK------RRFRIQKQLLAAVAQALDIGPAGP---LMGVVQYGDSPA 320 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 321 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 374 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 375 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 414 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 78/201 (38%), Gaps = 19/201 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V D S S+ + ++ + +K I K PD +VTF+++ + Sbjct: 241 DIAFVFDASSSINANNPNNYGLMKDFMKDIVDRFN--KTGPDGTQF---AVVTFADRATK 295 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S ++ I ++ + T GL+ A ++F +K Sbjct: 296 QFGLKDYSSKAEIKGAIDKVTPSIIGQTAIGDGLENARLEVFP-----NRNGGGREEVQK 350 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDG+N K + +K G ++ AIG+ L AS + Sbjct: 351 VVILLTDGQNNGHKSPEHESSL---LRKEGVVIVAIGVGTGFLKSELINIASSEEYVF-- 405 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 S ++ S I +++V Sbjct: 406 -TTSSFNKLSKIMENVVKLAC 425 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 84/210 (40%), Gaps = 13/210 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D + D++++ D S S++ + + +K + V + + N Q G+V F Sbjct: 36 ECDVQADIIVLFDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGV--NGRNGSQFGVVQF 93 Query: 227 SNKIEEFFLLEWGVSH--LQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S ++ F L + +++ I + + G T GL++ F Sbjct: 94 SQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAEGG-----G 148 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + + +KI++ MTDG++ + Q + ++ K G V AIGI L A+ Sbjct: 149 NPDKQKIVILMTDGKSNAGAPPQ---HEAHKLKAEGVTVIAIGIGQGFVKTELEQIATMK 205 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++ L N S + D+ + D Sbjct: 206 NYVLTTNSFSELSTLLKLVIDLACEVCVVD 235 >gi|329664416|ref|NP_001193157.1| anthrax toxin receptor 1 [Bos taurus] gi|297480483|ref|XP_002691486.1| PREDICTED: anthrax toxin receptor 1 [Bos taurus] gi|296482440|gb|DAA24555.1| anthrax toxin receptor 1 [Bos taurus] Length = 564 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLAHKFISPQ-LRMSFIVFSTQ 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GAIVY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 201 VNDG---FQALQGIIHSILKKSCIE 222 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 44/191 (23%), Positives = 68/191 (35%), Gaps = 29/191 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM S IT++ + ++ L ++ + GLVTF + Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S+L K T+ GLK + I Sbjct: 362 TNLIRIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE--------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K GAI++ I + + E L FY Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRFYA 465 Query: 348 VENPHSMYDAF 358 E+ + + DAF Sbjct: 466 KEDVNGLIDAF 476 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 81/233 (34%), Gaps = 24/233 (10%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 LR G + + + ++ + + + ++ L + Sbjct: 154 AKALRKAGILLYAIGVRDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTL 213 Query: 155 V--MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P V + A D++ ++D S S+ + K+ + S+ L Sbjct: 214 AKAAPHVDHVSPACREAALADIVFLVDSSTSIG---PQNFQKVKNFLYSVILGL------ 264 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQ 269 ++ V+ GL +++ I F L S + +I+ L TN+ L++ N Sbjct: 265 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNY 324 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + + G R +I++ +TDGE + + + K+ G +V Sbjct: 325 LTEESGSRA-----KDRVPQIVILVTDGE-----SNDEVQEVADRLKEDGVVV 367 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 55/173 (31%), Gaps = 17/173 (9%) Query: 200 KSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 +S+ L V + ++ GL +S+ FLL + R I+ Sbjct: 48 RSVRNFLYILVNSFNVSSETIRVGLAKYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPGG 107 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 LK+ I D + +I V ++ G + + Sbjct: 108 KKMGLALKF----ILDHHFQEASGSRASQEVPQIAVVISSG-----PVEDHVHGPAKALR 158 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 K G ++YAIG+R E +SP EN S FS + R Sbjct: 159 KAGILLYAIGVRDAVWAELREIASSPQ-----ENFTSFVPNFSGLSNLAQKLR 206 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 81/233 (34%), Gaps = 24/233 (10%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 LR G + + + ++ + + + ++ L + Sbjct: 45 AKALRKAGILLYAIGVRDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTL 104 Query: 155 V--MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P V + A D++ ++D S S+ + K+ + S+ L Sbjct: 105 AKAAPHVDHVSPACREAALADIVFLVDSSTSIG---PQNFQKVKNFLYSVILGL------ 155 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQ 269 ++ V+ GL +++ I F L S + +I+ L TN+ L++ N Sbjct: 156 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNY 215 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + + G R +I++ +TDGE + + + K+ G +V Sbjct: 216 LTEESGSRA-----KDRVPQIVILVTDGE-----SNDEVQEVADRLKEDGVVV 258 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 76/245 (31%), Gaps = 33/245 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + +V+ T+ S L+ +M+V+DVS SM S D ++ A Sbjct: 65 RHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMAS-TDVPPNRLAAAK 123 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +++ + GLV F+ + ++ I L T + Sbjct: 124 EAGKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRAAVKAGIDSLQP-APKTAT 176 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYCNEAK 316 G+ A I + + IV +DG Q + AK Sbjct: 177 GEGIFTALQAIATVGSVMGGGEGPPPAR---IVLESDGAENVPLDPNAPQGAFTAARAAK 233 Query: 317 KRGAIVYAIGIRVIRS---------------HEFLRACA-SPNSFYLVENPHSMYDAFSH 360 G + I + C + + ++ S+ + +S Sbjct: 234 AEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYST 293 Query: 361 IGKDI 365 + + I Sbjct: 294 LQRQI 298 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 84/224 (37%), Gaps = 29/224 (12%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ +PI SS + +AR ++I++D S SM DS ++ Sbjct: 76 LAGISIVEYVRIPIESSSISSLSFNARTPVVIIVDTSGSMADNMDS-------VKYALRT 128 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 M+ NN + GLV FS+ I+ S++ I + G T + L Sbjct: 129 MVSLF------NNTIDIGLVEFSHSIKSAIPPTPNRSYIDMVIDRMEA-GGGTMYSFALS 181 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + + + N VF+TDG ++ + L N+ G +Y Sbjct: 182 TALSWLRPYRE---------LNVSAFTVFITDGLPGDPQDYRPLLDEYNKL---GIPIYT 229 Query: 325 IGIRV-IRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDI 365 + I R + + AS + VE+ + D + I I Sbjct: 230 VFIGEDPRGIDETKLIASKTGGEQFTVESIDRLSDTLNTIASKI 273 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 43/201 (21%), Positives = 76/201 (37%), Gaps = 11/201 (5%) Query: 166 SQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +Q +++V+DVS SM S S +K+D+A K+ + L ++ + Q GL Sbjct: 420 TQVRKPARVLVVMDVSGSMASESGYGSESKLDLAKKAATSALGQL------TDTDQMGLW 473 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F+ + + + + + + +S L + + +D Sbjct: 474 AFTTDLPTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATREAAKAMNAQKD 533 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--A 340 N +V +TDG N T D L A++ G V+ I L+ A Sbjct: 534 PNSINAVVVLTDGRNEYTDNDLDGLLRELNASAEEDGVRVFTIAYGPDADLATLQEISEA 593 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 S + Y NP S+ FS + Sbjct: 594 SRAAAYDARNPTSIDKVFSDV 614 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 76/245 (31%), Gaps = 33/245 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + +V+ T+ S L+ +M+V+DVS SM S D ++ A Sbjct: 65 RHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESMAS-TDVPPNRLAAAK 123 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +++ + GLV F+ + ++ I L T + Sbjct: 124 EAGKQFADQLTPAIN------LGLVEFAANATLLVPPTTNRAAVKAGIDSLQP-APKTAT 176 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYCNEAK 316 G+ A I + + IV +DG Q + AK Sbjct: 177 GEGIFTALQAIATVGSVMGGGEGPPPAR---IVLESDGAENVPLDPNAPQGAFTAARAAK 233 Query: 317 KRGAIVYAIGIRVIRS---------------HEFLRACA-SPNSFYLVENPHSMYDAFSH 360 G + I + C + + ++ S+ + +S Sbjct: 234 AEGVQISTISFGTPYGTVDYEGATIPVPVDDQTLQKICEITDGQAFHADSLDSLKNVYST 293 Query: 361 IGKDI 365 + + I Sbjct: 294 LQRQI 298 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 36/258 (13%), Positives = 86/258 (33%), Gaps = 44/258 (17%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS---MESFFDS-SITKI 195 K T I + + P + SQT+ + + +V+D S S M+ D + ++ Sbjct: 43 KALTLIAFALLVVALARPREGREQTVSQTEG-IAIEMVIDRSGSMQAMDFNIDGEPVDRL 101 Query: 196 DMAIKSINAMLEEVKLIPD-VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---- 250 + + + + +++V GL+TF+ + S + ++ Sbjct: 102 TAVKNVASKFITGGEDLEGRFSDLV--GLITFAAYADAETPPTLDHSFVVSRLNQTEIVS 159 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T + + ++ + ++ KI++ +TDGEN + + + Sbjct: 160 RRDEDGTAIGDAIALSVEKLNALDARQE-----RKVQSKILILLTDGENTA--GELDPIQ 212 Query: 311 YCNEAKKRGAIVYAIGIR-----------VIRSHEFLRACA--------------SPNSF 345 A+ G +YAIG+ + L + + Sbjct: 213 AAELAETLGIKIYAIGVGTKGKAPVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKY 272 Query: 346 YLVENPHSMYDAFSHIGK 363 + + S+ + I + Sbjct: 273 FRATDTDSLDAIYREIDQ 290 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/242 (17%), Positives = 83/242 (34%), Gaps = 48/242 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T S +++ +D+++ +DVS SM S D + + K + ++ Sbjct: 74 LARPQTHSENAHTKITDGIDIVMAIDVSASMLSK-DLKPNRFEALKKVASQFVK-----D 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIF 271 N+ + GLV ++ + + + + L+ + T GL A N++ Sbjct: 128 RPND--RIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRL- 184 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--- 328 E ++I+ +TDG N + D Q+ A + G VY IGI Sbjct: 185 ----------KESKAKSRVIILLTDGVNNTGFIDPQTA--AELAAEYGIRVYTIGIGSNG 232 Query: 329 --------VIRS------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIV 366 ++ A + ++ + + + I K Sbjct: 233 TALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYDEINKMET 292 Query: 367 TK 368 TK Sbjct: 293 TK 294 >gi|21359974|ref|NP_444506.2| vitrin isoform 1 [Homo sapiens] gi|62702118|gb|AAF19243.2|AC007363_1 unknown [Homo sapiens] gi|16552271|dbj|BAB71279.1| unnamed protein product [Homo sapiens] gi|119620821|gb|EAX00416.1| vitrin, isoform CRA_a [Homo sapiens] Length = 693 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 445 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 504 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 505 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 547 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 548 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 601 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 602 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 653 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +++ I ++I Sbjct: 654 IATHPARDHSFFVDEFDNLHQYVPRIIQNI 683 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 307 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 357 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 358 THFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 411 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 412 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 451 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 77/235 (32%), Gaps = 50/235 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ A ++ A +E + V+ G+V+F+ Sbjct: 88 IILAMDVSGSMRAE-DVKPNRLVAAQEAARAFVESLP------REVRVGVVSFAGTAAVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII-- 291 + I+ + T G+ + IF G+ T +++ Sbjct: 141 QAPTTSRDDVFAAIERF-QLQRGTAIGSGIVLSLATIFPDAGIDIQQITGQRTMPRMLGD 199 Query: 292 --------------------VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +TDG+ + + + A RG VY +G+ + Sbjct: 200 PEKKAEFTPVPPGSYASAAMILLTDGQRTTGPDPIDAAKM---AADRGIRVYTVGVGTTQ 256 Query: 332 S---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + LR A + ++ + + +G +V +R Sbjct: 257 GEIIGFEGWSMRVRLDEDTLRQIAQMTTGEYFYAGTAEDLKKVYQRLGSRMVVER 311 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 77/216 (35%), Gaps = 45/216 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+VLD+S SM+ D + +++ A + N + D + GL FS + Sbjct: 105 IDIMLVLDISESMK-IQDFTPNRLEAAKQVANDFI-------DGRFQDRIGLTIFSGEAY 156 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L+ +I + T L N++ + Sbjct: 157 SLSPLTTDYKMLKNQITDIDFKMMEASGTAIGSALAVGTNRMRESDSKS----------- 205 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEF--------- 335 K+++ ++DG+N + D ++ A G +Y I I V +F Sbjct: 206 KVLILLSDGDNNAGNIDPET--SAKLANAYGIKIYTIAIGKEGKVPYGKDFFGRTRYIEN 263 Query: 336 ------LRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ A FY + ++ + FS I + Sbjct: 264 SMDVTGLKNIAKIGEGQFYRATDNQALEEVFSIIDQ 299 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 72/207 (34%), Gaps = 45/207 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D ++++ + + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMHWRDEDVSRLSLVKHLLGDFLQQ-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRL-----------RLRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 ++++ +TDG N + + D + A + G +Y IGI Sbjct: 193 QSRVLILITDGANNAGQIDPLTA--ARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSL 250 Query: 335 -----FLR--ACASPNSFYLVENPHSM 354 L+ A A+ ++ + + Sbjct: 251 DLDEPALKAIAAATGGQYFRARDGEEL 277 >gi|311252475|ref|XP_003125114.1| PREDICTED: anthrax toxin receptor 1-like [Sus scrofa] Length = 564 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 74/205 (36%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E++ ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYYFVEQLAHKFISPQ-LRMSFIVFSTQ 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L + S N ++ GAIVY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEELFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 201 VNDG---FQALQGIIHSILKKSCIE 222 >gi|326674791|ref|XP_001922046.3| PREDICTED: collagen alpha-1(XIV) chain [Danio rerio] Length = 1852 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 76/210 (36%), Gaps = 27/210 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A+ D+ ++D S S+ D + KI I+ + + + +I Q + Sbjct: 1012 EVCKAAKADLTFLVDGSWSIG---DDNFQKI---IRFLYSTTGALDVIGP--EGTQVAIA 1063 Query: 225 TFSNKIEEFFLL-EWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ F L + L ++ +S G +T + +K+ IF Sbjct: 1064 QFSDDARTEFKLNSYSDKEALLDAVQRISYKGGNTKTGRAIKHVKEAIFSED------AG 1117 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K++V +TDG E + G I++AIG E + + Sbjct: 1118 VRRGIPKVLVVLTDGR-----SQDDVNKISKEMQMEGYIIFAIGFADADYGELVNIASKP 1172 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + V++ DAF I + ++T Sbjct: 1173 SERHVFFVDD----LDAFKKIEEQLITFVC 1198 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 70/199 (35%), Gaps = 26/199 (13%) Query: 160 SSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S+V T + D++I++D S S+ + + ++ + D Sbjct: 144 SAVDQFVCTTPAIADIVILVDGSWSIG------RINFRLVRMFLENLVNAFDVGIDK--- 194 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GL +S + L G S + I L G +T + L Y F + Sbjct: 195 TRIGLAQYSGDPRIEWHL-NGFSTKEAVIDAVKNLPYKGGNTLTGLALTYVLENSFKPE- 252 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + N KI + +TDG K + + G ++AIG++ +E Sbjct: 253 -----SGARDNIPKIGILITDG-----KSQDDVISPAQTLRSSGVELFAIGVKNADENE- 301 Query: 336 LRACASPNSFYLVENPHSM 354 L+A AS V N Sbjct: 302 LKAIASEPEDTHVYNVADF 320 >gi|332227196|ref|XP_003262777.1| PREDICTED: vitrin isoform 1 [Nomascus leucogenys] Length = 694 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 89/270 (32%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 446 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 505 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 506 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 548 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 549 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 602 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G YAIG+ E Sbjct: 603 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVFTYAIGVAWAAQEELEV 654 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 655 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 684 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 308 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 358 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 359 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 412 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 413 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 452 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 56/161 (34%), Gaps = 21/161 (13%) Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM-RQHCNTEDAN 286 + +W L K+ ++ G +TN GL + + + + + Sbjct: 441 SATITPMSNQWAT--LNSKVNAMNPSG-NTNQAIGLFWGWQTLNTANDPFKAPSKDPNWV 497 Query: 287 YKKIIVFMTDGENLS------------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--- 331 Y+ IV ++DG N D + C+ K ++ I + + Sbjct: 498 YQDYIVILSDGLNTQNRWYTCPNAGPCPTIDGREKTLCDNIKADKITIFTIQVNINSKDP 557 Query: 332 SHEFLRACASPNSFYL--VENPHSMYDAFSHIGKDIVTKRI 370 + L+ CAS S Y + + + AF ++ I RI Sbjct: 558 ESQVLKDCASSGSGYFQLITSANDTATAFDNVLNKIAKLRI 598 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 75/223 (33%), Gaps = 30/223 (13%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV--HAATQIMNE 68 F + +G + I+ + + ++G ++ + + +T + S +D + + M++ Sbjct: 16 FRKDARGNVAIIFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAALMISKDAATMSD 75 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 R + + + +NG ++ + Y Sbjct: 76 SEITTRARQYVNSLYTNTETPIQTFSAVYTPNNG---------SGATILLNAGGNMPTYF 126 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + TN + + S+ K S +R+ + +VLD + SM+ Sbjct: 127 MKIVG---------------TNFSTLPINTASTTKWGS---SRMRVALVLDNTGSMDQNG 168 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + K A + +++++ V +V F+ + Sbjct: 169 KMTALK-KAAANATTGLIKKLSAFNTNEGDVYISVVPFAKDVN 210 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 69.0 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 55/459 (11%), Positives = 142/459 (30%), Gaps = 93/459 (20%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 +L ++ F G MTI+ ++ + G+ I++ + +T + + +DR+++ AA Sbjct: 21 ALVAQLQGFRRAESGSMTIMAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAA 80 Query: 63 ---TQIMNEGNGNNRKKLKG--------------------GDILCRIKNTWNMSFRNELR 99 ++ +G + D + + + E Sbjct: 81 DLDNELDAQGVVEDYMSKMSLADALVSVNVDEGLNYRTVTADGYRTMPSNFMQLIGIENM 140 Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 G + + + + +V+ ++ + S +V Sbjct: 141 QAGGHSQAMERINKVEVSMVLDISGSMDDGDKMAELQTAASDFVDTLLDDGSEDLVSISL 200 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP----DV 215 + + + ++ + + + A ++ ++++ Sbjct: 201 VPYSEHVNAGPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTYDQMQHFQWNYSGS 260 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI----- 270 N++ + ++ + + + G L+ +I L T+ G+K+A + Sbjct: 261 NSLTDTVCPRYAYEQIRPWSQDAG--ALKTQISQLQPR-AGTSIFMGMKWASALLDPSTR 317 Query: 271 -----------FDMQGMRQHCNTEDANYKKIIVFMTDGE--------NLSTKEDQQSLYY 311 D + D + K IV MTDG+ N + ++ + ++ Sbjct: 318 PIASGMIADGTVDAVFEGRPVAYSDTDVLKTIVLMTDGQHDRSFRIQNWAYNDENEVEHW 377 Query: 312 --------------------------------------CNEAKKRGAIVYAIGIRVIR-S 332 C AK++G ++++IG V Sbjct: 378 SQYNLWHYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIWSIGFEVSDHG 437 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + +CAS + + + +AFS I + + R+ Sbjct: 438 ANVMESCASSPAHFFRVEGVEISEAFSTIAQTLNQLRLT 476 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 59/380 (15%), Positives = 116/380 (30%), Gaps = 35/380 (9%) Query: 10 NFFYNYKGGMTILT-AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 +G + I A+ +P+I I+ + + +D + + + + Sbjct: 5 RLLSKQRGSVAISYLAMLIPMIIAAASTIVIGYQVLLSNRAMQ-AVDTASLACEFRGEYD 63 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV--VVPQNEG 126 + + ++ T S ++ + I + + V V + Sbjct: 64 RSIAQGYLDYYKPKIDKVTATLGASSGCKVELGYSYSSIFTSLTFSDASYVAGVTASQKV 123 Query: 127 YSISAISRYKIPLKFCTFIPWYTN----------SRHIVMPITSSVKVNSQTDARLDMM- 175 Y I L I +R + + V Q ++ ++ Sbjct: 124 YVTEVTDSDPIELVLVLDISGSMMGALDELKSILNRGLTTLRSQQANVAGQDHIKVSIVP 183 Query: 176 -----IVLDVSRSMESFFDSSITKIDMAIKSINAM-----LEEVKLIPDVNNVVQSGLVT 225 V D +D + S + + + + D V S + Sbjct: 184 FSNGVSVTDA----PWLKSGGTLCVDATVNSGGSFSPANTVANLDVTHDQAPVTTSSSSS 239 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD--MQGMRQHCNTE 283 + L ++ + + L G ST S GL + Q+ R N Sbjct: 240 DCSLTSVILPLTSNLNDVVDAVNRLQTIG-STASYQGLLWGLRQLTPNWQSAWRVGPNRN 298 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRACA 340 N ++ +V MTDG ++ S ++ + C AK G + IG V R +F R Sbjct: 299 QDNVQRKLVLMTDGMDDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSWRLEQFTRCAG 358 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S + + N + D FS Sbjct: 359 SAGAVFSANNTQDLDDYFSQ 378 >gi|297667862|ref|XP_002812182.1| PREDICTED: vitrin-like isoform 3 [Pongo abelii] Length = 657 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 409 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 468 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 469 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 511 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 512 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 565 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 566 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 617 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 618 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 647 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 66/175 (37%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V + D ++D+ ++D S S+ + + + + A+ + + + P Sbjct: 261 VGADVSLDCKIDLSFLIDGSTSIGK------RRFRIQKQLLAAVAQALDIGPAGP---LM 311 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V + + F L+ L+ I+ +++ G +N + + F Sbjct: 312 GVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ 365 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ ++V M DG E+ L A++ G ++ I I +E Sbjct: 366 ANGNRSSAPNVVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 415 >gi|317505805|ref|ZP_07963650.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316255887|gb|EFV15112.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 343 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/239 (15%), Positives = 84/239 (35%), Gaps = 37/239 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + V+ + ++++LD+S SM + D ++++ A + ++ + Sbjct: 79 LTVALAGPTAVSQVAKNQATVILLLDISESMRA-TDVKPSRVEAARAAAIKFVDGMAP-- 135 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 VQ G+VTF+ + ++ I + K T + G+ A QI Sbjct: 136 ----TVQLGVVTFAGNAQPLVRPSTDHETAKKVIDQMIRPDKLEKQTATGEGIYTALQQI 191 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + G A + IV ++DG+ + + AK++ V + Sbjct: 192 ETIAGALGG--KNHAPPAR-IVLVSDGKETVPDDLNAPRGAYAAARTAKEKHVPVCTVAF 248 Query: 328 RVIRSH--------------EFLRACA----SPNS---FYLVENPHSMYDAFSHIGKDI 365 E L+ + SP + F+ E+ + + + +DI Sbjct: 249 GTKAGKITLDNQVDEVPVDLESLKKISDLSNSPGNSCRFFPAESQGELAQIYQSLNEDI 307 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 79/208 (37%), Gaps = 24/208 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +S LD+ ++D S S+ + + + + + NN +G Sbjct: 82 DQDSDCAGVLDVGFIIDSSGSLRNQYRQEVEFLKSL----------ARTFKISNNGAHAG 131 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VTFS+ E L + +R + + G T L+ + ++ G R+ Sbjct: 132 VVTFSSIAELSIKLNQYYDQEQFERAVDDIPYMGYVTRIDLALRKSLEMFDEINGARK-- 189 Query: 281 NTEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR- 337 + +I+ +TDGE + K D+ + + +G +++AIGI L Sbjct: 190 -----SIPQILFLLTDGEQYAGKGVVDENPVSIAKLLRDKGIVIFAIGIGSAVRQSQLND 244 Query: 338 ACASPNSFYLVENPHSMYDA--FSHIGK 363 S +L +N + + ++ I + Sbjct: 245 IAGSSEKAFLAKNFNELVNSDFLKKIKE 272 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K S +D+ ++D S S+E+ + + + ++ +G Sbjct: 709 KSESDCAVIVDVGFIIDSSGSLENHY----------QQEVEFLINLASTFNISKYGAHAG 758 Query: 223 LVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VTFS L + +K +S T L+ + ++ G R+ Sbjct: 759 VVTFSYDAFLSIKLNDYFNQAQFNNAVKDISYLNGGTRIDLALEKSLEMFDELNGARK-- 816 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-C 339 N +I+ +TDGE D+ + + RG I++AIGI + L Sbjct: 817 -----NTPQILFLLTDGE---QSGDKNPVDIAKRLRDRGIIIFAIGIGSYVNKTELNNIV 868 Query: 340 ASPNSFYLVENPHSMY--DAFSHIGK 363 S + +L EN + + D I + Sbjct: 869 GSNDKAFLAENFNELVNNDFLKKIKE 894 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 1/73 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 + +D ++L C+ AK++G +V+ IG R + L+ CAS S Y + + DAF+ Sbjct: 511 DSTKDSRTLAVCDAAKEKGIVVFTIGFEAPWRGQQVLQQCASSASHYYDVDGLEISDAFA 570 Query: 360 HIGKDIVTKRIWY 372 I I R+ Sbjct: 571 SIASAIRQLRLTE 583 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 109/305 (35%), Gaps = 43/305 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + + L ++F + G+ +++ + +T L +DR+++ AA Sbjct: 24 LRAFRSDESGVLAKPMIMILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAA----- 78 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + + + + +G + +V S V + Sbjct: 79 -----DLDQPLDPEAVVI----------DYMSKSGLSDYTTVVVPEVSPTAKRVKASVDT 123 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +A I + P I +P+ +S ++ +++ +VLDVS SM S Sbjct: 124 EFTAGWMNSIFYEDYMRNPDTYELEPITLPLLASST-AVESIGNVEISLVLDVSGSMRSN 182 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQR 245 ++ ++ ++ + + + +V +S ++ FL E VS Sbjct: 183 -----NRLVNLKRAAKEFVQTMD-DNTEDGKMSISIVPYSTQVSMPAAFLDEMRVSDEHS 236 Query: 246 KIKYLSKFGVSTNSTP---------GLKYAYNQIFDMQGMRQHCNTE-----DANYKKII 291 ++ G N+T + ++ +D + +H N ++ Sbjct: 237 YSNCINFDGSDFNTTGLNLSREYERTMHFSVWNYYDYRDDDEHVRQPTCASDADNPERTA 296 Query: 292 VFMTD 296 + M+D Sbjct: 297 LLMSD 301 >gi|198436264|ref|XP_002122997.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 1993 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 82/232 (35%), Gaps = 31/232 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVK----VNSQTDARL----DMMIVLDVSRSMESFFDSSI 192 F T + + +V I+S +K + + T L D+++++D S S+ Sbjct: 1768 FATTVAGFDGLNRVVTTISSQIKLCQAITNLTTCPLITVMDIVLIVDSSSSIGD------ 1821 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYL 250 ++ I ++ + + + G V ++N ++E F L +Q I+ + Sbjct: 1822 DNFELMRNFILELVRSFNV---SRDTTRIGYVRYNNAVDERFQLNTFNTSEEVQEAIRAV 1878 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G T + L YA R+ + + +TDG D + Sbjct: 1879 PYRGSGTLTGQALSYASRTSVRAPAGRRPGVPG------VAIVITDGRAQDAV-DAPARE 1931 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 +R V AIGIR + + + +E+ + + + I Sbjct: 1932 L-----QRLMQVVAIGIRGAVPEQLNAIASQQGYVFNIEDFNRLDEVLGSIS 1978 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 65/199 (32%), Gaps = 24/199 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++D+ V+D S S+ + A+++ + P+ + + ++ I Sbjct: 1611 QVDVAFVIDSSSSIGP------ANFRTIRNFLIALVQRFSIGPEGA---RFAAIRYNRDI 1661 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + L+ + L I + GV T + + Y IF + R Sbjct: 1662 EHLWNLDQYTTRAALIEGINNIPYNGVGTLTGAAINYTAENIFLPELGR------RKGVP 1715 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 KI+V +TDG K I+ A+GI + + P+ + Sbjct: 1716 KIVVVLTDG-----VSYDDVSIPSQRLKSDNTIIVAVGIGRYDQDQINEIASDPDQDFAT 1770 Query: 349 ENP--HSMYDAFSHIGKDI 365 + + I I Sbjct: 1771 TVAGFDGLNRVVTTISSQI 1789 >gi|16416477|gb|AAL18263.1| vitrin [Homo sapiens] Length = 656 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/291 (14%), Positives = 95/291 (32%), Gaps = 35/291 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 + R+ ++ + N+ +V + V N YS+ S + + Sbjct: 387 KVEEASRLARVSGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLH 446 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + ++ + T D+ V+D S S+ Sbjct: 447 KTLQPLVKRVCDTDRLACSKTCL--------NSADIGFVIDGSSSVG------TGNFRTV 492 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVS 256 ++ + + +E ++ + G V ++ + F + S + IK + + Sbjct: 493 LQFVTNLTKEFEISDTD---TRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGG 549 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T++ + +A Q+F N +K+++ +TDG A Sbjct: 550 TSTGAAINFALEQLF---------KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAH 595 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 +G I YAIG+ E P + + V+ +++ I ++I Sbjct: 596 LKGVITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNI 646 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 270 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 320 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 321 THFNLKTHTNSRDLKTAIEKITQRGGLSNVGRTISFVTKNFFSK------ANGNRSGAPN 374 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A+ G ++ I I +E Sbjct: 375 VVVVMVDGWPTDKVEEASRL-----ARVSGINIFFITIEGAAENE 414 >gi|71051532|gb|AAH36192.1| COL14A1 protein [Homo sapiens] Length = 534 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 78/222 (35%), Gaps = 24/222 (10%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 VK QT A D++I++D S S+ F + + ++ + + Sbjct: 142 SSPEEVKFVCQTPAIADIVILVDGSWSIGRF------NFRLVRHFLENLVTAFDV---GS 192 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + GL +S + L + + ++ L G +T + L Y + F + Sbjct: 193 EKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENSFKPE 252 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KI + +TDG K + ++ G ++AIG++ +E Sbjct: 253 ------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKNADVNE 301 Query: 335 FLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + P+ Y V M+ + + + ++ D+ Sbjct: 302 LQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 343 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 73/211 (34%), Gaps = 31/211 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + A +D+ I +D+S SM + D +++ A +E + Sbjct: 91 SGESTDSLTAGVDIAIAIDISDSMLAE-DLKPNRLEAAKGMALRFIE-------GRTTDR 142 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 LV F+ + L + L+ + ++ T L N++ D+ G Sbjct: 143 IALVAFAGETATLSPLTTDYTALKEYLASINTNIIRTSGTALGMALSSCVNKLRDVAGKS 202 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--- 334 +I + ++DG+N + ++ AK G VY I I S E Sbjct: 203 -----------RIAIIISDGDNTAGTIPPETAV--ELAKSFGVRVYTIAIGKPGSEEGVD 249 Query: 335 --FLRACASP--NSFYLVENPHSMYDAFSHI 361 LR A F+ + S+ F I Sbjct: 250 EKTLRMLAGGPNGRFFKAADNSSLSKIFDEI 280 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 82/220 (37%), Gaps = 32/220 (14%) Query: 154 IVMPITSS-VKVNSQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKL 211 + + + +D++I LDVS SM D T++ ++ K + +E+ Sbjct: 67 LSLSGPGKKITFLPDEKEGVDILIALDVSGSMSRSRDFLPETRLGVSKKLLKRFIEK--- 123 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N + GLV F+ L +S + I+ + T + Sbjct: 124 ----RNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEEETVTEQGTAIGDAI----- 174 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I +R K+IV +TDG + + K D + + A++ GA +Y++GI Sbjct: 175 -ILSTYRLR-----NSKARSKVIVLITDGVSNTGKIDPVTAT--DLAEQIGAKIYSVGIG 226 Query: 329 VIRS-----HEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 E L+ + + F+ E+P M S I Sbjct: 227 KEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSI 266 >gi|295789104|ref|NP_001171440.1| vitrin isoform 2 [Homo sapiens] gi|74739159|sp|Q6UXI7|VITRN_HUMAN RecName: Full=Vitrin; Flags: Precursor gi|37181801|gb|AAQ88704.1| VIT [Homo sapiens] gi|119620824|gb|EAX00419.1| vitrin, isoform CRA_d [Homo sapiens] Length = 678 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 430 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 489 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 490 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 532 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 533 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 586 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 587 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 638 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +++ I ++I Sbjct: 639 IATHPARDHSFFVDEFDNLHQYVPRIIQNI 668 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 292 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 342 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 343 THFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 396 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 397 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 436 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 71/208 (34%), Gaps = 45/208 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D ++++ + + LE + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWKDEEVSRLALVQHMLGDFLE-------GREGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRRTVRHWLDEARIGIAGKNTAIGDAIGLALKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 + ++++ +TDG N + D + A G +Y IGI Sbjct: 193 HSRVLILVTDGANNGGEIDPLTA--ARLAADEGVKIYPIGIGAAPEQSGTTGSLGVNPSL 250 Query: 335 -----FLRACA--SPNSFYLVENPHSMY 355 L+ A + ++ ++ + Sbjct: 251 DLDEPTLKEIAEVTGGQYFRAQDGEQLL 278 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 61/201 (30%), Gaps = 30/201 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S L++ VLD S SM +I ++ L+ + V + + Sbjct: 304 ADSNNVRPLELAFVLDSSGSMRWSDPDNI-----------RIVGSKHLVDRLKEVDRGAV 352 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + + L + ++ + + G T+ G+ A + + + Sbjct: 353 IDFDSTAQLLQSLTDNKAVIKSALDLIDASG-GTDIGDGVSKALEEFANARSASDWA--- 408 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP- 342 +V +TDG L ++ V I + + +R + Sbjct: 409 -------VVLLTDGSGSYNHALTTELV------QKNIRVLGITMGSGANQSLIRGISDST 455 Query: 343 -NSFYLVENPHSMYDAFSHIG 362 + V + + F Sbjct: 456 YGIYQHVNTADELIEVFERFS 476 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 52/334 (15%), Positives = 115/334 (34%), Gaps = 78/334 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R+ + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 16 FRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 75 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + K+ +++N++ + + + Sbjct: 76 SEDTDALKQKVSDWFHAQVENSYALGE-------------------------IEIDTTNH 110 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 +I+A + +P F N + + + S+VK + L++ IV+D S SM Sbjct: 111 NITATASGTVPTTFMKIA----NIDTVPVSVGSAVK--GPATSYLNVYIVIDRSPSMLLA 164 Query: 185 --------------------------------ESFFDSSITK-----IDMAIKSINAMLE 207 + +D S K D+A ++ +L+ Sbjct: 165 ATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLD 224 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ GL + + +E S+ ++ LS S + Y Y Sbjct: 225 MIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARK---RLSDDSYGLTSATSMNYTY 281 Query: 268 NQIFDMQGMRQHCNTED----ANYKKIIVFMTDG 297 + + D AN K+++ +TDG Sbjct: 282 FDVALAALQKIVGTGGDGTSSANPLKLVLLLTDG 315 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 84/206 (40%), Gaps = 35/206 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--------------- 190 + R+ + + S++ ++ RLD++IVLD+S SM S FD Sbjct: 163 LGESTGRYFTVGLNSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEG 222 Query: 191 -SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQR 245 S +K+ +A ++ A+ E++ + G+V F+N+ L + ++ Sbjct: 223 DSRSKMAVAKDALVALTEQLHPDD------RVGVVLFNNEPTVAKPLRDVETTDMDAIRG 276 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 I+ + G TN G+ A + ++ +++ + + +TD + + D Sbjct: 277 HIREDIEAGGGTNIADGMAEA------ADMLGEYADSDPTEAETRQIVITDAMPNTGQTD 330 Query: 306 QQSLY--YCNEAKKRGAIVYAIGIRV 329 Q+L A + G +G+ V Sbjct: 331 DQALQDRLAGYA-EDGIHTSFVGVGV 355 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 87/240 (36%), Gaps = 52/240 (21%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +K +D+M+ +D+S SM + D +++++ K +E VK P + Sbjct: 77 LKEEKIKANGIDIMLSMDLSSSMLAQ-DFEPNRLEVSKKMA---IEFVKGRPHD----RI 128 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GLV F+ + L L+ ++ L T GL A N++ Sbjct: 129 GLVVFAGEAFTQCPLTTDHKILETFLEQLECGNLEDGTAIGMGLAGAVNRL--------- 179 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------- 332 + K+I+ +TDG N + L AK+ G VY+IG+ I Sbjct: 180 --KKSPAKSKVIILLTDGVNN--VGYFKPLTAGELAKELGIKVYSIGVGTIGEALTPVSR 235 Query: 333 ----------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK----DIVTKRI 370 E LR A + ++ +N + ++ I + +I RI Sbjct: 236 LSDGSFFLDYAQVEIDEELLREIARMTGGQYFRAKNNQDLRQIYNTIDRLEKTEIQVTRI 295 >gi|295789109|ref|NP_001171442.1| vitrin isoform 4 [Homo sapiens] gi|119620822|gb|EAX00417.1| vitrin, isoform CRA_b [Homo sapiens] Length = 656 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 408 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 467 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 468 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 510 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 511 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 564 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 565 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 616 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +++ I ++I Sbjct: 617 IATHPARDHSFFVDEFDNLHQYVPRIIQNI 646 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 270 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 320 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 321 THFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 374 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 375 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 414 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 45/220 (20%), Positives = 82/220 (37%), Gaps = 32/220 (14%) Query: 154 IVMPITSS-VKVNSQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKL 211 + + + +D++I LDVS SM D T++ ++ K + +E+ Sbjct: 75 LSLSGPGKKITFLPDEKEGVDILIALDVSGSMSRSRDFLPETRLGVSKKLLKRFIEK--- 131 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N + GLV F+ L +S + I+ + T + Sbjct: 132 ----RNNDRLGLVVFAGAAYLQAPLTGDRESLSEILETIEEETVTEQGTAIGDAI----- 182 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I +R K+IV +TDG + + K D + + A++ GA +Y++GI Sbjct: 183 -ILSTYRLR-----NSKARSKVIVLITDGVSNTGKIDPVTAT--DLAEQIGAKIYSVGIG 234 Query: 329 VIRS-----HEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 E L+ + + F+ E+P M S I Sbjct: 235 KEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSI 274 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/225 (18%), Positives = 80/225 (35%), Gaps = 46/225 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM++ D +++ A + + P+ N G+VTF+ + Sbjct: 88 IDIMLAMDVSGSMQA-MDFKPNRLEAAKDVAISFINN---RPNDN----IGMVTFAGESF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L ++ L T GL A N++ D + + Sbjct: 140 TQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAKS-----------R 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-------------- 335 +++ +TDG N D + A+ G VY +G+ F Sbjct: 189 VVILLTDGSNNM--GDITPRMAADIARTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVP 246 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A S ++ + ++ + + I K T+ + Sbjct: 247 VDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDKLEKTRLMT 291 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 88/265 (33%), Gaps = 51/265 (19%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + + +T + +SQT + D+++V D S SM + + +++++A ++ Sbjct: 603 IKANNDGTYDLSLNVTGTQSGSSQTTVSPADIVVVFDTSGSMSNPMGHN-SRLEVAKTAV 661 Query: 203 NAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 N+M + + + ++ ++ LV FS + + L G + Sbjct: 662 NSMAQHLLTSENQGKDSNIRMALVPFSTTAGNVSNFTDNAMDIVSAVNGLGADGGT---- 717 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS------------ 308 N ++ + KK IVFM+DG+ ++ Sbjct: 718 -------NWEAALKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDD 770 Query: 309 ------------------------LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ A ++++G+ + A + S Sbjct: 771 DDRRGLPAGVHGSGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPTKMRGFADQTKGS 830 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKR 369 +Y + + AF+ I I K Sbjct: 831 YYSATSTDELNKAFADIIGQINRKS 855 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 65/212 (30%), Gaps = 46/212 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SME S+ ++ A + ++ + GL+ F Sbjct: 90 DLMLAVDLSGSMEEQDFVINKRSVDRLTAAKMVAADFINR-RVGD------RVGLILFGT 142 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + + G +T + A ++ + Sbjct: 143 QAYLQTPLTFDRKTVMTLLNEAVIGLAGDNTAIGDAIGLAVKRL-----------KSEQV 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++V MTDG N + + L A +Y IGI Sbjct: 192 NSRVLVLMTDGANTA--GEVSPLKAAELAAANHLKIYTIGIGADEMIVRSFFGNRKINPS 249 Query: 332 ----SHEFLRACAS-PNSFYLVENPHSMYDAF 358 ++ S +Y N + + + Sbjct: 250 VDLDEKTLIKIAESTGGQYYRARNTDELNNIY 281 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 81/230 (35%), Gaps = 32/230 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I +F + GGM +L I I + G+ +++++ +T + +D +++ AA+ Sbjct: 7 IAHFRKDEGGGMLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNAVLAAAS---- 62 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + E ++ + + G Sbjct: 63 ----------------------LSQDLDAEEVVRSYLTSAGLDPSEVEI-ETREEKIGGI 99 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES- 186 + +P T+ + + + M I+S + +++ +VLDVS SM Sbjct: 100 LVGRTVEASLPAGLNTYFFRFFDIDTLGMTISSEATERVED---IEISLVLDVSGSMGDI 156 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 D S K+D+ ++ +E + L V +V +S K+ L Sbjct: 157 TSDRSGIKMDLLKRAAGDFVETI-LSDAEEGRVSISIVPYSTKVNPGSAL 205 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 33/74 (44%), Gaps = 2/74 (2%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVENPHSMYDA 357 S ++D++ C A G ++Y+IG+ V ++ L+ CAS S Y + A Sbjct: 480 DSVEKDRRLRQICGVANAAGVVIYSIGMDVDNTNSLNLLKDCASSESHYFDVEGLEIQTA 539 Query: 358 FSHIGKDIVTKRIW 371 F I I R+ Sbjct: 540 FDMIAASISMLRLT 553 >gi|198426022|ref|XP_002124623.1| PREDICTED: similar to FiBrilliN homolog family member (fbn-1) [Ciona intestinalis] Length = 990 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 68/207 (32%), Gaps = 33/207 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 AR+D++ ++D S S+ + + + E L + +VTF + Sbjct: 809 KARMDLIFIMDSSGSIGE------ENFKTMKQFVKNVYERFTLSD---EFTRIAVVTFHS 859 Query: 229 KIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ EW + L I L G T + L + R+H + Sbjct: 860 VVQLANDTEWFYSKTELDNAIDSLQFAGKGTLTGQALTF----------TREHLIGKREG 909 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NS 344 +++ +TDG S E + V A+GI + + P + Sbjct: 910 STNVVIAVTDG-----NSKDNSKAAAAELRNMNVHVMAVGITGSHLRDLSMIASKPASEN 964 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + D D+V +R+ Sbjct: 965 VLSLSQVEDINDVI-----DVVGRRVC 986 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 75/217 (34%), Gaps = 23/217 (10%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + PI K +S+ R D++ LD S SM + + ++ +L Sbjct: 63 SLASPIIYDQKTSSKRKGR-DLVFALDTSGSMAESGFNPENVQNRKFDALKELLRSFITK 121 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQI 270 +NV G+ F L + + + + + G ST GL A + Sbjct: 122 RYNDNV---GVSIFGTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAMALKIL 178 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + +K+I+ +TDG +AKK+ +Y IGI Sbjct: 179 -----------KKGEAKEKVIILITDGYQN--SGAVSVKEAVQKAKKQHVKIYTIGIGDR 225 Query: 331 RSHE--FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + + L+ A + + +N + D + I K Sbjct: 226 SAFDANLLQLIAKNTDAKMFEAKNVKMLQDIYKEIDK 262 >gi|221115612|ref|XP_002166292.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 484 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 75/210 (35%), Gaps = 23/210 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I I +++K S +D+ ++D S S+ + + I ++ K Sbjct: 124 PYKINSDINTNLKNPSCA-GIVDVGFIIDSSWSLRDQYH----------QEIEFLINLAK 172 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N +G++TFS++ L + + I + G T L+ A Sbjct: 173 TFNISKNGAHAGVITFSSRAVLNIKLNQYYDQEQFEFAIDEIPYMGYVTRIDLALRKALE 232 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRGAIVYAIG 326 ++ G R+ + +++ +TDGE + D+ + RG + A+G Sbjct: 233 MFDEINGARK-------DIPQLLFLLTDGEQYAGDGVVDENPANVAQLVRDRGIEIIAVG 285 Query: 327 IRVIRSHEFL-RACASPNSFYLVENPHSMY 355 I + L S +L EN + Sbjct: 286 IGSGVNQSELNSIAGSSEKVFLAENFDELI 315 >gi|194205647|ref|XP_001498059.2| PREDICTED: similar to von Willebrand factor A domain-containing protein 2 precursor (A domain-containing protein similar to matrilin and collagen) (AMACO) (Colon cancer secreted protein 2) (CCSP-2) [Equus caballus] Length = 784 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 43/210 (20%), Positives = 77/210 (36%), Gaps = 28/210 (13%) Query: 153 HIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + S+ A +D++ ++D S S+ S + ++ Sbjct: 28 HVSRDTIGKISAASKMMWCSAAVDILFLMDGSHSVGK---GSFERSKHFAITV------C 78 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ G FS+ F L+ + ++ KIK + G T + LKY Sbjct: 79 DALDINLKRVRVGAFQFSSAPHLEFPLDSFSTQQEVKAKIKRMVFKGGRTETGLALKYLL 138 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + F +A+ +I++ +TDG + K+RG V+A+G+ Sbjct: 139 RKGFPGGR--------NASVPQILIIITDGR-----SQGHVALPAKQLKERGITVFAVGV 185 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDA 357 R R E L AS V + DA Sbjct: 186 RFPRWEE-LHTLASEPREQHVLMAEQVEDA 214 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 58/156 (37%), Gaps = 24/156 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + ++ + + L DVN +V Q GLV Sbjct: 509 GCQAQSLDLVFMLDASASVG---PGNFAQMQ-------SFVRSCTLQFDVNPDVTQVGLV 558 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + R + G + ++ L + Y+++ +Q Sbjct: 559 VYGSQVQTAFGLDTHPTRASVLRAMSQAPYLGGAGSAGMALLHIYDKVMTVQ------KG 612 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 K +V +T G + + + + Sbjct: 613 ARPGVPKAVVMLTGGRGA-----EDAAVPAQKLRDN 643 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 51/314 (16%), Positives = 110/314 (35%), Gaps = 40/314 (12%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG---NG 71 +G + +L A+ + ++ + + ++V ++F M++ L +D + + A ++ G Sbjct: 19 RRGVIVVLAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATG 78 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRD-----NGFVNDIDDIVRSTSLDIVVVPQNEG 126 + L + + K + + + + TS +V +N Sbjct: 79 TVHEYLTHNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKNPT 138 Query: 127 YSISAISRYK-IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++S Y+ +P F + S S+ D+M+VLD+S SM Sbjct: 139 -TVSVRMTYENMPF----FFGHLLGRDSFDITAESIATYQSR-----DIMLVLDLSGSMN 188 Query: 186 SFFDSSITKIDMA---------IKSINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFF 234 DS I + + + L D V +G S+ + Sbjct: 189 D--DSEFNSIGKLGFDHIYSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSS 246 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + G S + + K + + V T++ Y + + N + V++ Sbjct: 247 VTYRGNSVVVKSDKTIKQISVKTSNGSTYNY-----YPGSSLNYTANPNKEI--RY-VWV 298 Query: 295 TDGENLSTKEDQQS 308 T G+N + + QS Sbjct: 299 TSGKNSNNSDQVQS 312 >gi|301778042|ref|XP_002924421.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 366 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 72/199 (36%), Gaps = 23/199 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ + + D++ +LD S S+ + S + + + ++ + L P + Sbjct: 9 AAMAGPPASAPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVGLLPLGPGA---L 59 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ LV ++ F S +Q I+ ++ TN+ L YA Q+F Sbjct: 60 RTSLVHVGSRPYTEFPFGQHSSGEAVQDAIRAAAQRMGDTNTGLALAYAKEQLF------ 113 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K++V++TDG + E K G V+ + E Sbjct: 114 AEVAGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNLLELSD 168 Query: 338 AC-ASPNSFYLVENPHSMY 355 A A P L + ++ Sbjct: 169 AASAPPEKHLLFVDVDDLH 187 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 81/226 (35%), Gaps = 40/226 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + S DA L + +V D S SM K+D + ++++ KL + Sbjct: 83 PQEISQFSSEDAPLSVGVVFDCSGSMGQ-------KLDKSRQAVSQF---FKLANPEDEF 132 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV F++ + +Q + ++ T + A Sbjct: 133 F---LVQFNDSASLIQPFTRNLEEIQNHL-AFTQSKGRTALLDAVYLAL----------- 177 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSH-- 333 H + N +K ++ ++DG + S++ + + N K+ +YAIGI R Sbjct: 178 HEMKKAKNPRKALLLISDGGDNSSRYTEPEIK--NLVKEADVQIYAIGIYESAAGRGRTP 235 Query: 334 ------EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A + Y V+N + + D + IG ++ + I Sbjct: 236 EESSGPALLTEIAEQTGGRQYQVDNLNELPDVAAKIGVELRNQYIL 281 >gi|332227198|ref|XP_003262778.1| PREDICTED: vitrin isoform 2 [Nomascus leucogenys] Length = 679 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 89/270 (32%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 431 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 490 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 491 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 533 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 534 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 587 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G YAIG+ E Sbjct: 588 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVFTYAIGVAWAAQEELEV 639 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 640 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 669 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 293 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 343 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 344 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 397 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 398 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 437 >gi|118092778|ref|XP_421647.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 555 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + T+I ++S+ ++ +++ + FS+ Sbjct: 43 HGAFDLYFILDKSGSVRNHW----TEIYSFVESLAEKF--------ISPMLRMSFIVFSS 90 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++R ++ L G T G K A QI+ H Sbjct: 91 RGTTIMKLTENREAIRRGLEILQYEVPGGDTFMHEGFKRANEQIY-------HETYGGVR 143 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +I+ +TDGE + + N A+ GAIVY +G++ + S + + Sbjct: 144 TASVIIALTDGELQDVQFYY-AEQEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 202 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V + A I+ K Sbjct: 203 PVTGG---FYALRGTIDSILKKSCIE 225 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 82/217 (37%), Gaps = 37/217 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++ L +++ +D S S+ + + + K + A+L E L+ F Sbjct: 72 ESATPLSIVMAIDGSESV---VTNDRLEKEAGKKFVRALLREQDEFD---------LMDF 119 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S+ + E ++ + L K G +T + A ++ + Sbjct: 120 SDTVREVVSFTNDKKRIENGLNELRK-GDATAVYDAVYLASQRLGETNAGGGRR------ 172 Query: 287 YKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------SHEFL 336 +++V +TDG+N DQ +A++ G +VYA+ + I L Sbjct: 173 --RVLVLITDGDNTVHGVGYDQAV----EQAQRAGVMVYALIVVPIEADAGRNTGGEHAL 226 Query: 337 RACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ ++Y V +P + ++ + D+ T+ + Sbjct: 227 IQMATDTGGNYYYVNDPRDLAKVYAKVSDDLRTQYVL 263 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 81/211 (38%), Gaps = 21/211 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+R D++ +LD S S+ + + ++++A + +L ++ + P + + +V+F Sbjct: 41 QDSRADIVFLLDNSGSVGRY---NFEEVEIA--FVENLLSQLTISPQAS---RVAVVSFD 92 Query: 228 NKIEE---FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + R++K + G TN+ + A + + Sbjct: 93 DVARTHIDYIKYPKNKCSFLRELKTVKYIGEWTNTEDAFRLAQELL-----RPPSAFKNE 147 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASPN 343 K++++ +TDG + N K A +++IGI + + L CA+ Sbjct: 148 RPVKQVVILLTDGRPTR---GGDPVKRANNLKSVYNAEIFSIGIGGNLNKQQLEDCATDA 204 Query: 344 SF-YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 YL N D I D KR D Sbjct: 205 QHLYLSPNFVDFKDLAKRIRGDPYAKRYVTD 235 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 76/224 (33%), Gaps = 46/224 (20%) Query: 166 SQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + DMM+V+D+S SM D I ++ + ++ + + + Sbjct: 79 THSQPHRDMMLVVDLSYSMNQKDMQIGDDYIDRLTAVKQVLSDFIAQ-------RQGDRL 131 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GL+ F++ L + + +++ L G T G+ A + Sbjct: 132 GLIFFADHAYLQTPLTLDRTTVAQQLNQAVLRLIGTQTAIGDGIGLATKTFIESDA---- 187 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------- 331 +++++ ++DG N + + + AK+ +Y +G+ Sbjct: 188 -------PQRVMILLSDGSNNA--GVLDPIEAAHIAKQYHTTIYTVGVGAGEMMVRDFFM 238 Query: 332 ----------SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + ++ + H + + I K Sbjct: 239 TRRINTAEDLDEETLQKIADLTGGQYFRARDKHDLQTIYQTIDK 282 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 80/221 (36%), Gaps = 29/221 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P T + + + Q D+ +LD S S+ S + ++ + Sbjct: 25 ISPPYTGAEEPSCQ--GAFDLYFILDKSGSVASNWVEIYEFVEKLTERF----------- 71 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIF 271 V+ ++ + FS + + L + + +K LS T G K A QI Sbjct: 72 -VSPRMRLSFIVFSTQAKIILPLTGDRYEITKGLKDLSSVIPAGETYMHEGFKLANEQI- 129 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +I+ +TDG+ L+ + + N A+ RGA VY +G+ Sbjct: 130 --------VKAGGKSTASVIIALTDGK-LADQIPVLTEKEANIARGRGARVYCVGVLDFN 180 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R A+P + + VE + I I+ K Sbjct: 181 FDQLKRIAAAPENVFRVEGG---FKDLGLIINSILEKSCTE 218 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 54/417 (12%), Positives = 116/417 (27%), Gaps = 84/417 (20%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 G +L A+ +P F + + + + K L + Sbjct: 5 RKQSGHAAMLFAMMIPAFFGIFTLASDGARALQSKARLEDAA---------EAAVLAIAA 55 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + G I + + + I + DI + S Sbjct: 56 HNADNSGSSSGSAINKKIASDWIGQYMQDMQAISDIKITKLNCNDIAECKEGLENGESRY 115 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMP-----ITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +Y+I K + W+ + + S +D+M V D S SM + Sbjct: 116 FQYEILAK-TNHLSWFPGNNSTAGFGESFDVVGSATARKFQSESVDVMFVSDFSGSMNNK 174 Query: 188 FD--SSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEE----------- 232 + S+ + IK I +++E+ + + + G F+ + Sbjct: 175 WSGGSNSRRYKDLIKIIGDVIKELDKFNNAHTTTTNRVGFTGFNTYTRKTADNSCYQDQY 234 Query: 233 ----------------------------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + + IKY G T S G+ Sbjct: 235 DRSAGRTVNKIFEVKGCKSRSSGGAKFHDIAMTDNYNEFKNTIKYFKP-GGGTASYQGI- 292 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY--YCNEA------- 315 ++G + + ++I++ ++DG + + + C++ Sbjct: 293 --------IRGAQMMDAAPEPRPRRIMIILSDGIDSKRSRANKLVEEGMCSKILLKLGNA 344 Query: 316 -----KKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHS-MYDAFSHIGKDI 365 K + +G S+ L C ++ Y NP + I ++I Sbjct: 345 NTSDGKAIKTKMAVVGFDYNPASNPSLAKCVGEHNVYGANNPEDVLNKILELISEEI 401 >gi|332227202|ref|XP_003262780.1| PREDICTED: vitrin isoform 4 [Nomascus leucogenys] Length = 657 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 89/270 (32%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 409 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 468 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 469 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 511 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 512 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 565 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G YAIG+ E Sbjct: 566 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVFTYAIGVAWAAQEELEV 617 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 618 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 647 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 271 IDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPA 321 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N + Sbjct: 322 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPN 375 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I I +E Sbjct: 376 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 415 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 43/158 (27%), Positives = 69/158 (43%), Gaps = 21/158 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + I + + S +V Q + LD+++V+D S SME ++ A K+ N Sbjct: 64 ADGTYDIELKVKGSTEV-VQNNKILDIVLVMDTSGSMEG------KSLENAKKAANNF-- 114 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 V + NN V G+V+F+ K E L V+ L+ IK L G T + GL+ A Sbjct: 115 -VDKLLPQNNNVNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLKADG-GTYTQQGLEKAA 172 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + N A +KK++V + DGE + Sbjct: 173 TVL----------NGAPAEHKKVMVVIGDGEPTYANGE 200 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/191 (23%), Positives = 68/191 (35%), Gaps = 29/191 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM S IT++ + ++ L ++ + GLVTF + Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S+L K T+ GLK + I Sbjct: 362 TNLIRIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE--------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K GAI++ I + + E L FY Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRFYA 465 Query: 348 VENPHSMYDAF 358 E+ + + DAF Sbjct: 466 KEDVNGLIDAF 476 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 74/206 (35%), Gaps = 19/206 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLVTFS 227 +D+ VLD S S+ +D ++ ++ D N V +G + +S Sbjct: 1102 PVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGALHYS 1161 Query: 228 NKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +E L L+ ++ + FG T + +K Q+ H Sbjct: 1162 DDVEIIQGLTRMPGDRDTLKSRVDAIKYFGKGTYTDCAIKKGLEQLLVGG---SHLKEN- 1217 Query: 285 ANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K ++ +TDG KE L NEAK G V+++ I L A+ Sbjct: 1218 ----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATD 1273 Query: 343 NSF---YLVENPHSMYDAFSHIGKDI 365 +++ + + DA I + I Sbjct: 1274 HTYRRNFTAADWGQSRDAEEVISQTI 1299 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 1658 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 1709 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 1710 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKNLQWMAGGTFTGEALQYTRDQLL----- 1764 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 1765 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGVQVVSVGIK 1806 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 1875 SPADITILLDSSASVGSH---NFDTTKRFAKRLAERFLMAGRTDPAHD-VRVAVVQYSGT 1930 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 1931 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 1981 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 1982 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 2040 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 2041 AEYDVAYGERHLFRVPSYQALLRGVFHQTV 2070 >gi|295789107|ref|NP_001171441.1| vitrin isoform 3 [Homo sapiens] Length = 657 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + + ++ + T Sbjct: 409 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 468 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 469 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 511 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 512 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 565 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 566 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 617 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +++ I ++I Sbjct: 618 IATHPARDHSFFVDEFDNLHQYVPRIIQNI 647 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 64/175 (36%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V + D ++D+ ++D S S+ + + + + + + + + P Sbjct: 261 VGADVSLDCKIDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LM 311 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V + + F L+ L+ I+ +++ G +N + + F Sbjct: 312 GVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ 365 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + ++V M DG E+ L A++ G ++ I I +E Sbjct: 366 ANGNRSGAPNVVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 415 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/235 (12%), Positives = 76/235 (32%), Gaps = 43/235 (18%) Query: 163 KVNSQTDARLDMMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPD-VNN 217 + + + +V+D S SM + D + ++ + + + + ++ Sbjct: 77 EQTVSQTEGIAIEMVIDRSGSMQALDFNIDGEPVDRLTAVKNVASKFITGGEDLEGRFSD 136 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDM 273 +V GL+TF+ + S + ++ + T + + ++ + Sbjct: 137 LV--GLITFAAYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNAL 194 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----- 328 ++ KI++ +TDGEN + + + A+ G +YAIG+ Sbjct: 195 DARQE-----RKVQSKILILLTDGENTA--GELDPVQAAELAETLGIKIYAIGVGTTGKA 247 Query: 329 ------VIRSHEFLRACA--------------SPNSFYLVENPHSMYDAFSHIGK 363 + L + ++ + S+ + I + Sbjct: 248 PVPVRDPFTGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYREIDQ 302 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 153 RAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 202 GQRVVVLLTDGVNTA--GALNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDR 295 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 89/236 (37%), Gaps = 39/236 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + N +A +D + V+D S SM A + ++ ++ Sbjct: 27 LFLSAPLTAGANGTAEAGVDAVFVVDTSNSMNKTDPG-----KTAAEVMSMFIDM----- 76 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + G V ++++I + L+R I+ L G S + GL+ Sbjct: 77 SEATRTRIGFVAYNDRIVQAQSPASMAEARNREQLKRTIQGLRYSGYS-DLGLGLRR--- 132 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTK----EDQQSLYYCNEAKKRG 319 G +D K ++ ++DG +N + ++ ++AK +G Sbjct: 133 ------GAEMIEKAKDPARKPFLILLSDGGTDLRQNAGGRSVAASNKDVETVISKAKAQG 186 Query: 320 AIVYAIGIRVIRS--HEFLRAC--ASPNSFYLVENPHSMYDAFSHI-GKDIVTKRI 370 +Y IG+ S E L+ A+ + ++ ++ + + F+ I K I ++ + Sbjct: 187 YPIYTIGLNNDGSVQKEQLKKIAEATGGTSFVTQSTDDLPEIFNQIFAKHIQSQLV 242 >gi|291404848|ref|XP_002718766.1| PREDICTED: von Willebrand factor A domain containing 2 [Oryctolagus cuniculus] Length = 787 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 69/199 (34%), Gaps = 28/199 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + A +D++ +LD S S+ + + + V Sbjct: 38 SAASKMMRCSAAMDILFLLDGSHSVGKASFERSKRFA---------ITVCDALDISQERV 88 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + F + F L+ + ++ ++K + G T + LK Sbjct: 89 RVGAMQFGSAPHLEFPLDAFPTRQEVKAQVKRMIFKGGRTETGLALKRLL--------HG 140 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + +++V +TDG K + K+RG V+A+G+R R E L Sbjct: 141 GFPGGRNGSVPQLLVIVTDG-----KSQGHVASPAKQLKERGVTVFAVGVRFPRWEE-LH 194 Query: 338 ACAS---PNSFYLVENPHS 353 A AS L E Sbjct: 195 ALASEPWEQHVLLAEQVED 213 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 67/192 (34%), Gaps = 24/192 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + L L DVN +V Q GLV Sbjct: 524 GCQAQSLDLVFLLDASSSVGPENFAHMQS----------FLRSCALRFDVNPDVTQVGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R + G ++ L + ++Q+ +Q Sbjct: 574 VYGSRVQVAFGLDAHPTRAAVLRAMSQAPYLGGVGSAGTALLHVHDQVMTVQ------RG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K +V +T G + + + + G V +G+ + R Sbjct: 628 ARPGVPKALVMLTGGR-----GVEDATVPAQKLRDNGVSVLVVGMGPVLRDALRRLAGPR 682 Query: 343 NSFYLVENPHSM 354 +S V + Sbjct: 683 DSLIHVAAYADL 694 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 69/227 (30%), Gaps = 30/227 (13%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + +G + I+ AI + ++G ++ + +T L S +D + + Sbjct: 14 RRFQTDARGNVAIIFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAAL--------- 64 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + T + + L + I T+ Sbjct: 65 -----MISKDAATMSASQITTRARQYVDSL----YTATDAPIQNFTATYTPNSGSGASIL 115 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 +SA +P F + N+ + T+ R+ + +VLD + SM Sbjct: 116 LSANGT--MPTYFMRVLGSNFNTLPVATSSTTKWGST-----RMRVALVLDNTGSMAQNG 168 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 K+ + M+ ++ V +V F+ + Sbjct: 169 -----KMAALQSAATDMITKLSAFNTTTGDVYISIVPFAKDVNVSTS 210 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 59/158 (37%), Gaps = 22/158 (13%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM-RQHCNTEDANYKKI 290 +W L+ +I ++ G +TN GL + + + + + YK Sbjct: 445 TPMSNQWST--LKSQINAMTPSG-NTNQAVGLFWGWQTLNTTNDPFKAPAKDPNWVYKDY 501 Query: 291 IVFMTDGENLSTK-------EDQQSLYYCNEAKK------RGAIVYAIGIRVIR---SHE 334 IV ++DG N + D + C K V++I + + + + Sbjct: 502 IVLLSDGLNTQNRWTQTVSDIDARQELLCKNIKDPAQNGGNQITVFSIQVNISSKDPTSK 561 Query: 335 FLRACASPNSFYL--VENPHSMYDAFSHIGKDIVTKRI 370 L+ CA+P + Y + DAF+++ I RI Sbjct: 562 VLQDCATPGAGYFQMITQSSQTADAFNNVLATIAKLRI 599 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 47/321 (14%), Positives = 109/321 (33%), Gaps = 38/321 (11%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN---- 105 + S + + + + E N + F ++ + N Sbjct: 74 MPSSVADAALPGYRDVPREQYQNYPDNPVFAVSETPVST-----FSIDVDTGSYANVRRF 128 Query: 106 -DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSS 161 + + ++ + + Y+ + + P T + PW +R + + I +S Sbjct: 129 LNGGQLPPKDAVRLEELVNYFPYAYP-LPQGDAPFGVSTELAVTPWNPQTRLLRIAIKAS 187 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + +++ ++DVS SM + M ++ ++++++ V+ V + Sbjct: 188 -DRSVEELPPANLVFLVDVSGSMH-----RREGLPMVQGTLKLLVDQLRPQDRVSLVTYA 241 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G + + ++ I L+ G ST G++ AY Q +H Sbjct: 242 GATQ----VVLDSTPGSDKAKIRAAIDQLTA-GGSTAGESGIQLAYQQ------ASKHLI 290 Query: 282 TEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--- 337 N I+ TDG+ N+ + + +K G + +G V +E L Sbjct: 291 EGGINR---ILLATDGDFNVGISDFDSLKQLAADKRKTGVSLTTLGFGVDNYNERLMEQL 347 Query: 338 ACASPNSFYLVENPHSMYDAF 358 A A ++ ++N Sbjct: 348 ADAGNGNYAYIDNLREARKVL 368 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 90/211 (42%), Gaps = 39/211 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + + + +L + + P +Q G++ + K+ Sbjct: 167 MDIVIVLDGSNSIYPW--------NEVQDFLINILRKFYVGP---GQIQVGVLQYGEKVV 215 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGV----STNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L S ++ +K K G TN+ G+ A ++ F G R Sbjct: 216 SEFQLNDFRS-VEDVVKAARKIGQRGGEETNTALGINVARSEAFKQGGRRG--------A 266 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL---RA 338 KK+++ +TDGE+ + + QQ + E++K G YAI + R I FL + Sbjct: 267 KKVMIVITDGESHDSADLQQVI---EESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKY 323 Query: 339 CAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ D +G+ I Sbjct: 324 IASDPDDKHFFNVTDEAALKDIVDALGERIF 354 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 50/211 (23%), Positives = 90/211 (42%), Gaps = 39/211 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + + + +L + + P +Q G++ + K+ Sbjct: 145 MDIVIVLDGSNSIYPW--------NEVQDFLINILRKFYVGP---GQIQVGVLQYGEKVV 193 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGV----STNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L S ++ +K K G TN+ G+ A ++ F G R Sbjct: 194 SEFQLNDFRS-VEDVVKAARKIGQRGGEETNTALGINVARSEAFKQGGRRG--------A 244 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL---RA 338 KK+++ +TDGE+ + + QQ + E++K G YAI + R I FL + Sbjct: 245 KKVMIVITDGESHDSADLQQVI---EESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKY 301 Query: 339 CAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ D +G+ I Sbjct: 302 IASDPDDKHFFNVTDEAALKDIVDALGERIF 332 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 79/228 (34%), Gaps = 31/228 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I M + + + A + ++ +DVS SM + D + T+++ A + A ++++ Sbjct: 71 ISMARPARAERVPRERATI--ILAIDVSNSM-AATDITPTRLEAAKQGAQAFVDQLPP-- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ ++ I+ L + G +T G+ + I Sbjct: 126 ----RINLGLVSFAGSAAVLVPASTDRESVRSGIRGL-QLGPATAVGEGIYASLQAIATA 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 + IV ++DGE + + Q+ A+ V I Sbjct: 181 GQRLS--DEGQPPPPAAIVLLSDGETTRGRPNTQA---ATAARDAEIPVDTIAYGTSDGT 235 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 E LR A + S++ + + + +G I Sbjct: 236 LDVGGQQIPVPVNEEALRELADQTGGSYHRATSGDELQSVYRGLGSSI 283 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 75/216 (34%), Gaps = 49/216 (22%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM++ + I +I + + +E ++ + GLV F Sbjct: 92 DLLVAVDISGSMDTKDMVVQNQQIPRIAVVKHIVGDFIER-RVGD------RLGLVLFGT 144 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +++ + + G +T + + ++ D Sbjct: 145 SAYLQSPLTFDRTTVKQLLVESQIGFAGPNTAIGDAIGLSIKRLRDRPAEN--------- 195 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++ +TDG+N + + + AK+ G VY IG+ Sbjct: 196 --RVVILLTDGQNTA--GEVSPRQAADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKI 251 Query: 332 ------SHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + L A + ++ +P + + Sbjct: 252 NPSRDLDEDTLTYIAETTGGRYFRAHSPQELNQIYQ 287 >gi|332227200|ref|XP_003262779.1| PREDICTED: vitrin isoform 3 [Nomascus leucogenys] Length = 658 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 89/270 (32%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 410 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 469 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 470 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 512 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 513 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 566 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G YAIG+ E Sbjct: 567 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVFTYAIGVAWAAQEELEV 618 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 619 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 648 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 65/175 (37%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V V+ D ++D+ ++D S S+ + + + + + + + + P Sbjct: 262 VGVDVSLDCKIDLSFLIDGSTSIGK------RRFRIQKQLLADVAQALDIGPAGP---LM 312 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V + + F L+ L+ I+ +++ G +N + + F Sbjct: 313 GVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ 366 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + ++V M DG E+ L A++ G ++ I I +E Sbjct: 367 ANGNRSGAPNVVVVMVDGWPTDKVEEASRL-----ARESGINIFFITIEGAAENE 416 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/225 (18%), Positives = 80/225 (35%), Gaps = 46/225 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM++ D +++ A + + P+ N G+VTF+ + Sbjct: 88 IDIMLAMDVSGSMQA-MDFKPNRLEAAKDVAISFINN---RPNDN----IGMVTFAGESF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L ++ L T GL A N++ D + + Sbjct: 140 TQCPLTTDHTVLLNMVQDLQMGVLDDGTAIGMGLATAVNRLKDSKAKS-----------R 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-------------- 335 +++ +TDG N D + A+ G VY +G+ F Sbjct: 189 VVILLTDGSNNM--GDITPRMAADIARTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVP 246 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A S ++ + ++ + + I K T+ + Sbjct: 247 VDIDEPTLDGIAEVSGGKYFRAVDNETLNEIYKEIDKLEKTRLMT 291 >gi|297265788|ref|XP_001107747.2| PREDICTED: vitrin-like isoform 3 [Macaca mulatta] Length = 693 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 445 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 504 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ ++D S S+ ++ + + +E ++ Sbjct: 505 CL--------NSADIGFIIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 547 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 548 RVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 601 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 602 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 653 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 654 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 683 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 61/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 307 IDLSFLIDGSTSIGK------RRFRIQKQLLAGVAQALDIGPAGP---LMGVVQYGDNPA 357 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 358 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 411 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I + +E Sbjct: 412 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITVEGAAENE 451 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 83/217 (38%), Gaps = 28/217 (12%) Query: 172 LDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + +++DVS SM+ +F S T++++A + + + GL+TF+ Sbjct: 82 IAIQMLVDVSSSMDMSVKNFDGKSTTRMEVAKEMVERFIAGDGEDLQGRPHDLIGLITFA 141 Query: 228 NKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + L +G L + +++L T L A ++ + Q +R H Sbjct: 142 RYADTRSPLTFGHDALLQIVRHLTIQERPNEDGTAYGDALALAAARLKNPQELR-HGKRP 200 Query: 284 DANYK----KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------- 328 DA + K+I+ +TDGEN S + AK +YAI + Sbjct: 201 DAQAEAIESKVIILLTDGENNSGS--HLPIEAAGLAKAWDCKIYAISLGESLDAENPLDA 258 Query: 329 VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + + L + + F + S+ + I + Sbjct: 259 LSPAERVLEHISIETGGVFRQAHDFESLLSVYEEIDR 295 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 71/195 (36%), Gaps = 29/195 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---KI 230 + +VLD S SM S IT++ + ++ L ++ + GLVTF + Sbjct: 309 VCLVLDKSGSMSS--GDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDSIAIVQ 361 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S+L+ K + T+ GL+ + I Sbjct: 362 NNLIRMINDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE--------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K GA+++ I + + E L FY Sbjct: 413 IVLLTDGEDDQISS-------CFEEVKHSGAVIHTIALGPDAARELETLSDMTGGRRFYA 465 Query: 348 VENPHSMYDAFSHIG 362 E + + DAFS I Sbjct: 466 SEGINGLIDAFSGIS 480 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 71/195 (36%), Gaps = 29/195 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---KI 230 + +VLD S SM S IT++ + ++ L ++ + GLVTF + Sbjct: 309 VCLVLDKSGSMSS--GDPITRLTLMNQAAELYLIQILEKESL-----VGLVTFDSIAIVQ 361 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S+L+ K + T+ GL+ + I Sbjct: 362 NNLIRMINDSSYLEISAKLPQEAAGGTSICNGLRKGFETITSSDQSTCGSE--------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K GA+++ I + + E L FY Sbjct: 413 IVLLTDGEDDQISS-------CFEEVKHSGAVIHTIALGPDAARELETLSDMTGGRRFYA 465 Query: 348 VENPHSMYDAFSHIG 362 E + + DAFS I Sbjct: 466 SEGINGLIDAFSGIS 480 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 71/215 (33%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+VLD+S SM D I ++ ++ +++ + GLV F++ Sbjct: 86 DLMLVLDLSYSMSQEDMQEGDQYIDRLSAVKNVVSDFVKQ-------REGDRLGLVLFAD 138 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ L G T G+ A D + Sbjct: 139 HAYLQTPLTLDRETISDQVNSLVLRLIGDKTAIGEGIGLATKTFVDSEA----------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++V ++DG N + L AKK A +Y IGI Sbjct: 188 PQRVMVLLSDGSNT--SGVLEPLEAARIAKKYNATIYTIGIGAGEMMVKEFFMTRKVNTA 245 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + L+ A + ++ N + + I Sbjct: 246 KDLDEKTLKQIADLTGGQYFRARNADELATIYDTI 280 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 65/462 (14%), Positives = 146/462 (31%), Gaps = 103/462 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA----- 62 +R F G +T+ + + ++ L+ G+ +++ + + ++ DR+++ AA Sbjct: 19 LRRFAREEDGLVTLFAILMILLMILLGGVGVDLMRHERERARVQAVADRAVLAAADLDQT 78 Query: 63 --------------------TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG 102 + + E N R+ D +K + F E Sbjct: 79 LSPEAVARDYFDKSGLADYISSVTVEEGLNYRR--VTVDASRDLKTMFIDKFGQEKLHVP 136 Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV----MPI 158 ++ V + +V+ ++ T++ ++ +P Sbjct: 137 AKATAEEKVAKVEISMVLDISGSMRENDKMNNLHDASNVFIDTVIQTDTEDLISISVVPY 196 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP--DVN 216 T+ V V L++ + S ++ F DS ++ ++ + + N Sbjct: 197 TAQVNVGKDIMDELNVTQLHSYSHCVD-FEDSDFNLTTISQTRSYEHMQHFEAGYYWNGN 255 Query: 217 NVVQSGLVTFSN------KIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKY--- 265 + ++G + N + + +E + ++ F +T GLK+ Sbjct: 256 DRDRTG--HYDNISNPGCPKQSYEEIETFSQNAAALKSRIANFQPRANTAIHLGLKWGVA 313 Query: 266 -------AYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGEN----------------- 299 A N+ + + D + K ++ MTDG N Sbjct: 314 LLDPSFRAINEAIGGDAVFRGRPAEYNDIDTLKTVILMTDGVNVTTRRIAPEAYSNRDHY 373 Query: 300 -----------------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + D C+ AK +G ++++IG V Sbjct: 374 RHWSDYPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAKAKGIVIWSIGFEVT 433 Query: 331 R-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++ CAS +S + + DAF I + I R+ Sbjct: 434 DHGAAVMKNCASSDSHFFRVEGVEIVDAFEAIARQINQLRLT 475 >gi|47847422|dbj|BAD21383.1| mFLJ00114 protein [Mus musculus] Length = 1188 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 34/206 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 171 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 226 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 227 ISTSSPLS--------LLDSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 272 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA---SP 342 +++ +TDG S+ A+ I YAIG+ S + L+A A S Sbjct: 273 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGKAFYNEHSKQELKAIASMPSH 330 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTK 368 + VEN +DA I + K Sbjct: 331 EYVFSVEN----FDALKDIENQLKEK 352 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 78/228 (34%), Gaps = 47/228 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LD+S SME+ ++ +A + + + VN+ + GLV F+ Sbjct: 58 IDIVVALDLSTSMEAGDFRPQNRLHVAKEVLAEFISN-----RVND--RIGLVVFAGAAY 110 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L ++ +K + T L + N++ D + + Sbjct: 111 TQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDSEAKS-----------R 159 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV------IRSH---------- 333 ++V +TDG+N + K L + A+ +Y I + + Sbjct: 160 VVVLITDGDNNAGK--ISPLDAASMAESLKIPIYTILVGKGGKVPFPQGQDLFGNTVWRD 217 Query: 334 -------EFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 E L+ AS +Y +P + + + ++ Sbjct: 218 TEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKLME 265 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 85/224 (37%), Gaps = 28/224 (12%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 H+ T + +D++ ++D S S+ ++SS ++ ++++ E Sbjct: 139 SHLQTIWTVPAALPECPSQEMDIVFLIDGSGSI---YESSFKQMKDFVRALMGHFEGTNT 195 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + + + F+ F W L + + + T + G++ ++F Sbjct: 196 LFSLIQYSHLLKIHFTFTQ---FQNSWNPLSL---VDPIVQLKGLTYTATGIRKVVEELF 249 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--- 328 N + KKI++ +TDG+ + + +A++ G I YAIG+ Sbjct: 250 ------HSKNGARKSAKKILIVITDGQKYKDPLEYSDVIP--QAERAGIIRYAIGVGDAF 301 Query: 329 -VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + L S + + V+N + A S I + + K Sbjct: 302 WKPSAKQELDNIGSEPAQDHVFRVDN----FAALSSIQEQLQEK 341 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 76/200 (38%), Gaps = 24/200 (12%) Query: 170 ARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+M+ LD+S+SM++ F + ++ + ++ ++ + + + GL+ F + Sbjct: 92 PQRDLMLGLDLSQSMDTKDFRAPDGNLEARVDAVRKVVGDFVARRPGD---RIGLIAFGD 148 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ I + PG+ + D G+ + + Sbjct: 149 APYPLAPFTMDHELVREIISG---------TLPGIAGPRTSLGDAVGLAIKMFEKTTVPE 199 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACA------ 340 K+++ +TDG T L AK+ G +V+ +GI + + L A A Sbjct: 200 KVLIVLTDG--NDTASKMPPLKAAEIAKRNGVVVHTVGIGDPQATGEDRLDATALEKIAE 257 Query: 341 -SPNSFYLVENPHSMYDAFS 359 + ++ + + A+ Sbjct: 258 TTGGRYFFGGDQAQLAAAYD 277 >gi|198420236|ref|XP_002121660.1| PREDICTED: similar to collagen type VI alpha 6 [Ciona intestinalis] Length = 1638 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 48/348 (13%), Positives = 106/348 (30%), Gaps = 57/348 (16%) Query: 44 FFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCR--------------IKNT 89 F K + ID+ + + + R+ + R + Sbjct: 661 FSTKREIDYAIDQMEFAGGATLTAQALVHIRENGFTEESGARPGAPKVLVVISDGRSADN 720 Query: 90 WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT 149 R +V I S ++ + + + Y F + Sbjct: 721 IETPARKLHESGVYVFAIGVGNTWRSALEIIGSEPVVTHVQEGASYDAINNFRRDLVR-- 778 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 I K + +D++ ++D S S+ + + + I + + E Sbjct: 779 -------NICRETKPE-CAETEMDLIFLIDGSNSIGP------REFETTKEWIGSFVREF 824 Query: 210 KLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ + G+V +S+++ E + L I + +TN+ L+Y Sbjct: 825 EIGE---YNTKIGVVQYSSRVRSEIDIGDYDSKADLLAAISSIEFAAGNTNTGSALEY-- 879 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + +H K+++ +TDG L ++ + G VYAIG+ Sbjct: 880 --VRTVGFSGRHGARNG--VPKVLIVLTDG-----NAQDGVLDAASKLHRDGVAVYAIGV 930 Query: 328 RVIRSHEFLRACASPN-----SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + Y V N +DA I ++ + Sbjct: 931 GRPNMGQL--NAVTSEPITSPNIYHVRN----FDAIQTIQSGLLRRVC 972 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 85/236 (36%), Gaps = 36/236 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 + ++ ++ Y I A F R ++ I + + + A ++ Sbjct: 372 EIKVIASDPDKNYVIEA----------LNFDIIELKRRGLIKSICTDAE-QTCPAATAEL 420 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S S+ + + + ++++ ++ P + +V F+N+ F Sbjct: 421 VFLIDGSTSIG------FDNFEKLKRWLKSIVDAFQVGPH---YTRVAVVQFTNRPVLEF 471 Query: 235 LLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + I+ + ST++ +++ N++F N KI++ Sbjct: 472 GLNDHSTTQATLQAIQRIRYRRGSTSTGRAIEFVMNEVFTHS---------RENVPKILI 522 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +TDG+ +D + + A + G GI R + + + + + Sbjct: 523 ALTDGQ----SQDDVTQATASAA-EAGVHTLVFGIGNTRPGQLRQLVSKEDHVFQA 573 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/248 (16%), Positives = 85/248 (34%), Gaps = 27/248 (10%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 +S+ + F +V I + + +D+ ++D S S+ + Sbjct: 562 QLVSKEDHVFQAAGFDVIQKMQSKLVSLICVKAEPE-CSSQEIDLHFMVDGSGSIG---N 617 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKI 247 + K+ IK++ V+ + L +++ I F + + + I Sbjct: 618 ENFRKVKAWIKNV------VRSFDVGKYTTRVALTQYTSTINTEFDFKKFSTKREIDYAI 671 Query: 248 KYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + G +T + L + N + G R K++V ++DG + E Sbjct: 672 DQMEFAGGATLTAQALVHIRENGFTEESGARPGA-------PKVLVVISDGRSADNIE-- 722 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + G V+AIG+ L S V+ S YDA ++ +D+V Sbjct: 723 ---TPARKLHESGVYVFAIGVGN-TWRSALEIIGSEPVVTHVQEGAS-YDAINNFRRDLV 777 Query: 367 TKRIWYDK 374 K Sbjct: 778 RNICRETK 785 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 77/220 (35%), Gaps = 23/220 (10%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + V ++ D+ V+D S ++ K+ IK I VK Sbjct: 210 VSAVCEQTVQECPTSKHDLAFVIDASSTIGYN---DFMKVKAWIKKI------VKAFDVG 260 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + +V +S ++E F + I + T++ L Y ++I+ Sbjct: 261 PEETRVAVVQYSTSVQEEFNFGHLLSKQQVLDAIDNMDYIMGDTHTGAALTYMLDEIYSS 320 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + + + + MTDG+ +E + N + G VY +G+ Sbjct: 321 ------ANGDRPDVPDLAIVMTDGK---AQEPDLVVEAANRVHEAGVTVYTVGVADYSLE 371 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E + P+ Y++E ++ + + + K I D Sbjct: 372 EIKVIASDPDKNYVIE---ALNFDIIELKRRGLIKSICTD 408 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 76/211 (36%), Gaps = 27/211 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D+M ++D S S+ I M + + V+ G++ + Sbjct: 979 CARTATDLMFLVDGSTSVGHESWGIIKSF---------MQNMTQKFQIGPDAVRVGMIQY 1029 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S + + + LQ + T + L+Y ++ + R+ + Sbjct: 1030 STRPKTNIAIGQYNDKESLQEAFGQVEWQLGDTYTARALRY-VSKSYARATTRE-----N 1083 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 + K+++ +TDG+ E +Q++ G ++AIG+ E ++P Sbjct: 1084 LHATKLLIIITDGQPQDRNEVKQAVR---NLHSEGWRIFAIGVGQSDISELGILASNPDS 1140 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + N +S + I + +++ I D Sbjct: 1141 DHVFYANNFNS-----TRIFQGRLSRLICGD 1166 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 68.3 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 70/219 (31%), Gaps = 47/219 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ +D+S SM + +I+ A + + + GLV FS K Sbjct: 98 IDIVLAIDLSGSMLAEDFEPKNRIEAAKSVATDFIHQRLSD-------RIGLVVFSGKSF 150 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L I L T + A N++ E Sbjct: 151 TQCPLTLDYRLLTNFISELKAGTIEEDGTAIGTAIATATNRL-----------RESTAKS 199 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG---------------------- 326 K+I+ +TDG+N + + + + A G +Y +G Sbjct: 200 KVIILLTDGQNNAGEIEPVTA--AELAAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQ 257 Query: 327 IRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++V L A S ++ + S+ + I + Sbjct: 258 MKVDVDDSTLTRIARISGGRYFRATDLESLKKTYHEIDE 296 >gi|221108025|ref|XP_002157834.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 299 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 45/263 (17%), Positives = 92/263 (34%), Gaps = 33/263 (12%) Query: 92 MSFRNELRDNGFV---NDIDDIVRSTSLDIVVVPQNEGYS---ISAISRYKIPLKFCTFI 145 +ELR++G V I V T L + + Y+ +++ + K Sbjct: 51 ERIADELRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTDNEFIHKIKNGS 110 Query: 146 PWYTNSRHIVMPITSSVKVNSQTD-------ARLDMMIVLDVSRSMESFFDSSITKIDMA 198 +S P ++ + +S A +D+ +LD S S+E+ + Sbjct: 111 CEIVSSVLQSTPTIATTQASSTDGHVQPRCEAVVDVAFILDSSHSLEASYQKE------- 163 Query: 199 IKSINAMLEEVKLIPDVN-NVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGV 255 L+++ + ++ N + G++TFS + E L S + + Sbjct: 164 ----KNFLKKLAAVFGISSNGSRVGVITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNF 219 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T L+ A +F N K+I+ +TDG + + +E Sbjct: 220 TTRIDRALRLAQKDMFTS------ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIADEL 273 Query: 316 KKRGAIVYAIGIRVIRSHEFLRA 338 + G ++ +GI + L Sbjct: 274 RNDGVVILGVGIGSAVNETELSH 296 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 34/105 (32%), Gaps = 7/105 (6%) Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 +T L+ A +F N K+I+ +TDG + + Sbjct: 1 MNFTTRIDRALRLAQKDMFTS------ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIA 54 Query: 313 NEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYD 356 +E + G ++ +GI + E + Y S+ D Sbjct: 55 DELRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 99 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 213 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 265 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 266 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 314 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 315 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDD 372 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 373 DIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDR 408 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 213 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 265 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 266 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 314 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 315 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDD 372 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 373 DIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDR 408 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 75 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 127 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 128 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 176 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 177 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 234 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 235 DIDEEGLRKIAQQTGGRFFRARDTEELAGIYAELDR 270 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 153 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 202 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEEGLRKIAQQTGGRFFRARDTEELAGIYAELDR 295 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 56/182 (30%), Gaps = 43/182 (23%) Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L + L+ + + G +TN G + ++ + KKI Sbjct: 378 QPIIRLTDNYTALRTAVNNMIASG-NTNVPLGTMWGWHTLSPNAPFGDGRPYGTERLKKI 436 Query: 291 IVFMTDG----------------------------------------ENLSTKEDQQSLY 310 I+ MTDG +N + Sbjct: 437 IIIMTDGANVMSDTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDNRFDHATAATED 496 Query: 311 YCNEAKKRGAIVYAIGIRVIR-SHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDIVTK 368 C K + VY + ++V + LR CA+ + ++ V++ + AF I I Sbjct: 497 MCGNMKDKDIEVYTVAVQVDSTAQTLLRRCATDTDHYFPVDSAAGIGAAFDRIAGAIENL 556 Query: 369 RI 370 RI Sbjct: 557 RI 558 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 78/266 (29%), Gaps = 29/266 (10%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + +G + ++ A+ LP + L+ ++++ + ++ + + AAT Sbjct: 17 RFSDDRRGNVAMMFALALPPMMLMTLGGVDIARVSTVRMNVQDAL-----DAATLAAARS 71 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + ++ + N D + +T + + Sbjct: 72 QYTDNPRINAVGLAALQAN--------------LAPYGDVTLDTTQTNFRLNTATGAVEA 117 Query: 130 SAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 A + + + S V ++ R+++ +V+D + SM+ Sbjct: 118 DAKVNVRALVANIFLPPYGQFFDDQLPANAHSEVLRSNN---RIEVALVIDNTGSMD--- 171 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 + +T A + LE + ++ LV FS + Sbjct: 172 GAKLTNTKTAAIDLINRLEAADGRSIEQDAIKISLVPFSMTVRVAQGGTNTPPSFMSNAD 231 Query: 249 YLSKFGV---STNSTPGLKYAYNQIF 271 + G +N A + Sbjct: 232 THTGGGAWNSGSNPYSAFDTAVGRFT 257 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 75/218 (34%), Gaps = 47/218 (21%) Query: 173 DMMIVLDVSRSME-----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DM++ +D+S SM + SI ++ + +E+ K + GLV F+ Sbjct: 84 DMLLAVDLSGSMSIEDMITQSGESIDRLAAVKDVLAEFIEQRKGD-------RLGLVLFA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + ++++++ L G ST GL A + + Sbjct: 137 QHAYLQTPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFINSEA---------- 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 +++I+ ++DG N + + L A + +Y +G+ Sbjct: 187 -PQRVIILLSDGANTA--GVIEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNP 243 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ NP + + I + Sbjct: 244 SQDLDERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 213 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 265 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 266 RAYALTPLTADLTSVRDQLSDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 314 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 315 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 372 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 373 DIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDR 408 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 89 VVVAIDVSRSMVA-TDVDPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASSQVV 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 142 VPPTTDRAVVSTAIANLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 197 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 198 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAV 254 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 255 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 287 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVDPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASSQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAVVSTAIANLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAV 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 70/216 (32%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM S + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGSVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 153 RAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 202 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEAGLRKIAEQTGGRFFRARDTEELAGIYAELDR 295 >gi|326916310|ref|XP_003204451.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 2040 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 89/265 (33%), Gaps = 34/265 (12%) Query: 113 STSLDIVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 +++ + + Y ++ + T S P S + +T A Sbjct: 55 DSTVVLEELRAGTTYKVNVFGMFDGGESNPLVGQEMTTLSDTTTEPFLSR-GLECRTRAE 113 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+++++D S S+ I+ ++E + PD VQ GL +S Sbjct: 114 ADIVLLVDGSWSIG------RPNFKTVRNFISRIVEVFDIGPDK---VQIGLAQYSGDPR 164 Query: 232 EFFLLEWGVSHLQ------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 EW ++ + + L G +T + L F ++ + Sbjct: 165 T----EWNLNAYRTKEALLEAVTNLPYKGGNTLTGMALD------FILKNNFKQDAGLRP 214 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + + G +YAIGI+ +E + P+ Sbjct: 215 RARKIGVLITDG-----KSQDDVVTPSRRLRDEGVELYAIGIKNADENELKQIATDPDDI 269 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N + + I +D+ T Sbjct: 270 -HAYNVAD-FSFLASIVEDVTTNLC 292 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 77/211 (36%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS SME+ D T++ A ++ + + P +N GL++F+ Sbjct: 91 VMLVIDVSLSMEA-TDVLPTRLQAAQEAATSFAR--NMTPGIN----LGLISFAGTATVL 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + I+ L ST + G+ A + ++ D IV Sbjct: 144 VNPTTDRNGVIKAIENLK-LAQSTATGEGIFAA---LQSVESFSSLVGGADGPPPARIVL 199 Query: 294 MTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFL 336 M+DG+ + + AK+ G + +I E L Sbjct: 200 MSDGKQTVPEDLYAARGGYTAAQAAKQAGVPISSISFGTTHGSVTIDDKPQPVSVDDESL 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 R A S FY + + ++ +G+ I Sbjct: 260 REIARLSGGDFYKAASAEELKKVYADLGEQI 290 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 44/260 (16%), Positives = 85/260 (32%), Gaps = 48/260 (18%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME---- 185 + ++P K + I W + P+ V Q R D+M+V+D+S SM Sbjct: 42 TKTPSSRLP-KALSVIIWLLLVTAMARPVWYGEPVEFQPKHR-DLMLVVDLSGSMSQEDM 99 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 F I ++ ++ +E K + GLV F++ L L + Sbjct: 100 QFNGEYIDRLSAVKHVLSDFIERRKGD-------RVGLVLFADHAYLQTPLTLDRDTLSQ 152 Query: 246 KIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G T G+ A D +++++ ++DG N + Sbjct: 153 QLNQAVLKLIGTQTAIGDGIGLATKTFVDSDA-----------PQRVMILLSDGSNTA-- 199 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------SHEFLRACA--SPN 343 L + AKK A +Y +G+ L A + Sbjct: 200 GVLDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGG 259 Query: 344 SFYLVENPHSMYDAFSHIGK 363 ++ + + + I + Sbjct: 260 QYFRARDSKELATIYDTINQ 279 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 67/176 (38%), Gaps = 13/176 (7%) Query: 208 EVKLIPDVNNVV---QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +V + P +VV + + + L + + + LS G +TN T G++ Sbjct: 346 DVSVAPYATDVVDSNYIAASCSTTALAKVLDLTSDFTSVNTYLSSLSP-GGNTNITLGVQ 404 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-------EDQQSLYYCNEAKK 317 + + + + D + KK ++ +TDG N + + ++ C AK Sbjct: 405 FGMEMLSPAEPYTKATAFGDTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKA 464 Query: 318 RGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +G ++ + + S L ACAS + +Y + + I I R+ Sbjct: 465 QGITLFVVRVEDGDS-SLLEACASQSSYYYDLSQASDLTKTMQDIFATINKLRLIE 519 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/245 (12%), Positives = 84/245 (34%), Gaps = 32/245 (13%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M ++ ++ F + +G I+ + I+ G +++ F + L D +++ Sbjct: 1 MMWISQSLTKFIGDRQGNTLIIFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLK 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 A +I ++ + M+ + S + I Sbjct: 61 ATQKI----------EVSEAAAKTAAAMAFEMNLSDHPELQT---------ASHTFAIET 101 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + ++ +I + + I + ++ + + +++ VLD Sbjct: 102 SDNAKVVHYTS----EITQRPYFLQLLGLGEQTIRVASSAQSESD-----PFELLFVLDT 152 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 + SM S ++ +T + ++ S+ + + D N V+ G+V F+ ++ + Sbjct: 153 TGSMAS--NNKMTYLKTSVSSVLS--SLISTYGDGNEDVKVGVVAFNTQVRLPASTSYSF 208 Query: 241 SHLQR 245 + Sbjct: 209 VDYTQ 213 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 78/228 (34%), Gaps = 47/228 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LD+S SME+ ++ +A + + + VN+ + GLV F+ Sbjct: 40 IDIVVALDLSTSMEAGDFRPQNRLHVAKEVLAEFISN-----RVND--RIGLVVFAGAAY 92 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L ++ +K + T L + N++ D + + Sbjct: 93 TQAPLTLDYGVVREVLKQIRTRVLEDGTAIGDALATSLNRLRDSEAKS-----------R 141 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV------IRSH---------- 333 ++V +TDG+N + K L + A+ +Y I + + Sbjct: 142 VVVLITDGDNNAGK--ISPLDAASMAESLKIPIYTILVGKGGKVPFPQGQDLFGNTVWRD 199 Query: 334 -------EFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 E L+ AS +Y +P + + + ++ Sbjct: 200 TEIPINPELLQDIASRTGGEYYRATDPEGLKQGLQKVLDSLERSKLME 247 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 89 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 142 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 197 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 198 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 254 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 255 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 287 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 89 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 142 VPPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 197 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 198 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 254 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 255 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 287 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 70/216 (32%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGNVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 153 RAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 202 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGND 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEAGLRKIAEQTGGRFFRARDTEELAGIYAELDR 295 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 66/163 (40%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + ++ + + ++V L + G++ + Sbjct: 792 CKETPLELLFVIDSSESVG------LENFEIIKSLVKTLSDQVALDLAAA---RVGIINY 842 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L + + + + G T + L A N +F+ Sbjct: 843 SHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEA-NHMFEAA---------R 892 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ +A ++ IG+ Sbjct: 893 PGVKKVALVITDGQ-TDTRDEKNLTEVVKKASDINVEIFVIGV 934 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S + K + S++ + ++ + + ++ + FS+ ++ Sbjct: 47 IDVVFIVDSSESSKIIL---FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++++K ++ G T S + A + +G + K Sbjct: 104 IDPSFSSWKDLQTFKQRVKSMNFIGQGTFSYYAISNA-TGLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D QS+ +A+ G + IG+ + + LR Sbjct: 155 VALLMTDGIDHPKNPDVQSIS--EDARTAGILFITIGLSTVVNEAKLR 200 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 66/163 (40%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + ++ + + ++V L + G++ + Sbjct: 621 CKETPLELLFVIDSSESVG------LENFEIIKSLVKTLSDQVALDLAAA---RVGIINY 671 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L + + + + G T + L A N +F+ Sbjct: 672 SHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEA-NHMFEAA---------R 721 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ T++++ +A ++ IG+ Sbjct: 722 PGVKKVALVITDGQ-TDTRDEKNLTEVVKKASDINVEIFVIGV 763 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 80/207 (38%), Gaps = 19/207 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 +R++ P S + + ++ +D++ ++D S S + Sbjct: 7 TRHRYDSGEVDRTPLCLKSSSAKRKLPALTVDSTC---FIDVVFIVDSSESSKIIL---F 60 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-W-GVSHLQRKIKYL 250 K + S++ + ++ + + ++ + FS+ ++ W + ++++K + Sbjct: 61 DKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDLQTFKQRVKSM 120 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + G T S + A + +G + K+ + MTDG + D QS+ Sbjct: 121 NFIGQGTFSYYAISNA-TGLLKREGRKDGV--------KVALLMTDGIDHPKNPDVQSIS 171 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +A+ G + IG+ + + LR Sbjct: 172 --EDARTAGILFITIGLSTVVNEAKLR 196 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 73/192 (38%), Gaps = 24/192 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T S +++ +D+++ +DVS SM S D + + K + ++ Sbjct: 81 LARPQTHSENAHTKITDGIDIVMAIDVSASMLSK-DLKPNRFEALKKVASQFVK-----D 134 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIF 271 N+ + GLV ++ + + + + L+ + T GL A N++ Sbjct: 135 RPND--RIGLVIYAGESYTKTPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNRL- 191 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 E ++I+ +TDG N + D Q+ A + G VY IGI Sbjct: 192 ----------KESKAKSRVIILLTDGVNNTGFIDPQTA--AELAAEYGIRVYTIGIGS-N 238 Query: 332 SHEFLRACASPN 343 +P+ Sbjct: 239 GTALSPYALNPD 250 >gi|297265790|ref|XP_001107629.2| PREDICTED: vitrin-like isoform 1 [Macaca mulatta] Length = 678 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 430 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 489 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ ++D S S+ ++ + + +E ++ Sbjct: 490 CL--------NSADIGFIIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 532 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 533 RVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 586 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 587 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 638 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 639 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 668 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 61/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 292 IDLSFLIDGSTSIGK------RRFRIQKQLLAGVAQALDIGPAGP---LMGVVQYGDNPA 342 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 343 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 396 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I + +E Sbjct: 397 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITVEGAAENE 436 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 82/239 (34%), Gaps = 34/239 (14%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPI---TSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 ++ +I L+ + + + I + V L++ V+D S SM Sbjct: 1 MTTPRIELRPLRAGLTAGQTTTLTLLIRVHPAPVTTQVSQRPPLNLAFVIDRSGSMSGLP 60 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRK 246 + MA ++ A + + + + +V F ++++ L + + Sbjct: 61 ------LQMAKQAAIAAVRQARPDD------RVSVVAFDDRVDVIVPSQLATSREAVIQA 108 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I + G + L+ A QH N ++ ++DG+ D+ Sbjct: 109 IGTIDDRGSTNLHGGWLEGATQVA-------QHLTPGALNR---VILLSDGQANVGVTDR 158 Query: 307 QSLYYCNEAK---KRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + + + +RG IG+ E L A A+ +F VE+P + F Sbjct: 159 R--EIARQVRGLTERGISTTTIGLGSHYDEELLLAIANAGDGNFEHVEDPSRLPTFFEE 215 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 86/243 (35%), Gaps = 15/243 (6%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S Y+ L+ + + I + + D +++ VLD S S+ + Sbjct: 5 KDSTYQPLLQLGMPMSYQWKCGDIQGGRPEEANSSEKADCPVNVYFVLDTSESVTMQSPT 64 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + + + + + +++ ++ V + G + FS+++E F + + + Sbjct: 65 D-SLLYHMQQFVPQFISQLQDEFYLDQVALSWRYGGLHFSDQVEVFSPPNSDRASFIKSL 123 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + +S F T + L Q +R+H V +TDG + Sbjct: 124 QSISSFRRGTFTDCALAN------MTQEIRRHVKKGVN----FAVVITDGHVTGSPCGGI 173 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A++ G ++A+ + LR AS N +M + I +D + Sbjct: 174 KRQ-AERAREEGIRLFAVAPNRNLHEQGLRDIASTPLELYRSNYATMRPDSTDIDQDTIN 232 Query: 368 KRI 370 + I Sbjct: 233 RII 235 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 62/164 (37%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AYNQ+ + Sbjct: 669 GTFEAIQLDDERINSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D CN V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCNH----DVTVTAIGIG 760 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 64/217 (29%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SME ++ ++ + + E + GL+ F Sbjct: 91 DILLAVDISGSMEREDMQLSGQTVNRLMAVKAVVGNFVTE-------REGDRLGLILFGE 143 Query: 229 KIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + +Q L G T + + ++ + Sbjct: 144 KAYLQTPLTFDRKTMQTLLYEAQLGFAGNGTAIGDAIGLSVKRL-----------QQRPE 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N + + L A +Y IG+ Sbjct: 193 NHRVVILLTDGANNA--GELDPLKAAELASSAKVKIYTIGVGAETQEAWGLFGKRVTNPS 250 Query: 331 ---RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGK 363 A+ ++ NP + + + + Sbjct: 251 ADLDEQTLTAIAEATGGQYFRARNPEELMAIYQELNR 287 >gi|260814492|ref|XP_002601949.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] gi|229287252|gb|EEN57961.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] Length = 1774 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 75/192 (39%), Gaps = 27/192 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +VLD S S+ S++ D + + A+ V + G++ +S++ Sbjct: 1140 DLFLVLDGSGSV------SVSDFDTVKQFVVAV---VSAFTIGLADTRVGVLQYSDRNTL 1190 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L I +++ G T++ +++A RQ+ A KI Sbjct: 1191 GCNLGDHPDEASFVSSINTMTRQGGGTSTGAAMEFA----------RQNAAWRPAPVPKI 1240 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVE 349 ++ +TDG++ + ++ V+AIG+ E L + P+ + + Sbjct: 1241 MIVLTDGKSSDSVVAAAHALAADQ-----VTVFAIGVGSFDHSELLEITNNKPSQVFELA 1295 Query: 350 NPHSMYDAFSHI 361 + + + + + I Sbjct: 1296 DFNVLAQSINRI 1307 >gi|156324784|ref|XP_001618484.1| hypothetical protein NEMVEDRAFT_v1g2784 [Nematostella vectensis] gi|156199073|gb|EDO26384.1| predicted protein [Nematostella vectensis] Length = 410 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 69/206 (33%), Gaps = 27/206 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + +D+ VLD S S+ + + + N M+ VK + Sbjct: 222 AGLTENSRLAVDLAFVLDGSTSINNADPGNFQLLK------NFMINIVKSFKISSERTHV 275 Query: 222 GLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GLV +S ++ F + + + I T + LK A +Q+F Sbjct: 276 GLVLYSFFTQLMFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFGASM---- 331 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + K+++ +TDG + + K G +++A+G+ L Sbjct: 332 -----RSVPKVLIVLTDGRSSD-----KVEAPSKALKDEGVVIFAVGVGDQIDPSELNVM 381 Query: 340 AS---PNSFYLV--ENPHSMYDAFSH 360 AS + + + + D Sbjct: 382 ASDSKSDHVFKAKFKELDRLVDLIKR 407 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 60/207 (28%), Gaps = 42/207 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-------NNVVQSGLVT 225 D++ V+D S + TK +K + + V + V L+ Sbjct: 1 DLVFVIDGS---NDLTADAFTKAKDFMKKVYNNFPVSESATHVGLALSGCTSKVMFNLIE 57 Query: 226 F------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + ++ + + + + G ST + GL+ + Sbjct: 58 YFTKPALDSAVDSARKPD--PYEVTGAVDKIKYPGGSTFTGAGLRMVKS----------- 104 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 N I++ MTDGE ++ I+YA+G + L Sbjct: 105 ------NVPHILIVMTDGE-----SHDDVAEPSRVLREDNTIIYAVGEGYYFKLKQLNEM 153 Query: 340 ASPN--SFYLVENPHSMYDAFSHIGKD 364 AS + + M I Sbjct: 154 ASEPKINHVFTADFGQMNTIVEDIKDK 180 >gi|2811055|sp|O07395|Y335_MYCAV RecName: Full=UPF0353 protein MAV335 gi|2183263|gb|AAC46199.1| MAV335 [Mycobacterium avium] Length = 335 Score = 67.9 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 70/213 (32%), Gaps = 33/213 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D + ++ A ++ +E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVAPNRMAAAQEAAKQFADELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA--YNQIFDMQGMRQHCNTEDANYKKII 291 + + L +F T + G+ A I + D I Sbjct: 152 VSPTTNREATKNALDKL-QFADRTATGEGIFTALQVQAIATVGA----VIAGDKPPPARI 206 Query: 292 VFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HE 334 V +DG+ + + + AK +G + I E Sbjct: 207 VLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDE 266 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+ A S + Y + + ++ + + I Sbjct: 267 TLKKVAQLSGGNAYNARSLQELKSVYATLQQQI 299 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 54/329 (16%), Positives = 114/329 (34%), Gaps = 34/329 (10%) Query: 46 MKTVLHSMID-----RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 K L ++ + ++ +G + K G + + NTW + + Sbjct: 112 TKKALEVALELMAQRNVSIPTLVMVVTDGRSADDPKGPGKLLQAQ-PNTWVFAAATGDPE 170 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 ++ DI +++ +V+ + + R + T + T+ Sbjct: 171 KVDTRELMDITG--NINHIVMHRGRDLATDITRRLLREAQDKCRTTTTTTTTTTTTTTTT 228 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + N T LD+++VLD S + + ++S +++++K+ P V Sbjct: 229 TTTTNPITGCELDLVLVLDFSTTTDPVYNSY-------KDLSKRLVQQLKIGPHYTQVAA 281 Query: 221 SGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 T F L ++ + R I L G +T G++ A QI + +G R Sbjct: 282 VTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPG 341 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------S 332 T K+++ TDG N ++++ +A G +Y + + Sbjct: 342 IAT------KVMIVFTDGWSNKGPDPEKRAR----DAVNAGFEMYTVAYTARAPGSVTLN 391 Query: 333 HEFLRA-CASPNSFYLVENPHSMYDAFSH 360 +E L A S + ++ D Sbjct: 392 NETLSAISGSSGHAFTDVTFQTLVDKIKQ 420 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 55/136 (40%), Gaps = 24/136 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++IVLD S S++ F+ S K + + EE+++ P + ++ +S Sbjct: 31 LNVLIVLDRSDSVKGGFNKS-------RKFVTDVSEELQIGP---EKHRVAMIVYSGLSY 80 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +L W + +K L G +TN+ L+ A + + + Sbjct: 81 RREVLPWNFAKSNEEFVQKTNALRAIGGTTNTKKALEVALELM----------AQRNVSI 130 Query: 288 KKIIVFMTDGENLSTK 303 +++ +TDG + Sbjct: 131 PTLVMVVTDGRSADDP 146 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 77/210 (36%), Gaps = 34/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S ++I+++D + + + + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSASGANISRLDATKEVLAEFAKT-------RQGDRLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + + L++ G ST+ + A ++F+ Q Sbjct: 160 ---DAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSEPSQ--AA 213 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +K+ + +TDG T + + AK +G ++ I + L Sbjct: 214 SSKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQT 271 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 272 IERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 69/216 (31%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 88 QMMLAVDLSGSMSEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 140 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 141 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I + Sbjct: 190 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGDD 247 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 248 DIDEDGLRKIAQQTGGRFFRARDTDELAGIYAELDR 283 >gi|297265794|ref|XP_001107688.2| PREDICTED: vitrin-like isoform 2 [Macaca mulatta] Length = 656 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 408 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 467 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ ++D S S+ ++ + + +E ++ Sbjct: 468 CL--------NSADIGFIIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 510 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 511 RVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 564 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 565 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 616 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 617 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 646 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 61/165 (36%), Gaps = 22/165 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+ + + + + + + + + P G+V + + Sbjct: 270 IDLSFLIDGSTSIGK------RRFRIQKQLLAGVAQALDIGPAGP---LMGVVQYGDNPA 320 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ L+ I+ +++ G +N + + F N ++ Sbjct: 321 THFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPN 374 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++V M DG E+ L A++ G ++ I + +E Sbjct: 375 VVVVMVDGWPTDKVEEASRL-----ARESGINIFFITVEGAAENE 414 >gi|281345102|gb|EFB20686.1| hypothetical protein PANDA_013779 [Ailuropoda melanoleuca] Length = 285 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 69/191 (36%), Gaps = 23/191 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + D++ +LD S S+ + S + + + ++ + L P +++ LV Sbjct: 5 SAPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVGLLPLGPGA---LRTSLVHVG 55 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ F S +Q I+ ++ TN+ L YA Q+F Sbjct: 56 SRPYTEFPFGQHSSGEAVQDAIRAAAQRMGDTNTGLALAYAKEQLF------AEVAGARP 109 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNS 344 K++V++TDG + E K G V+ + E A A P Sbjct: 110 GVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNLLELSDAASAPPEK 164 Query: 345 FYLVENPHSMY 355 L + ++ Sbjct: 165 HLLFVDVDDLH 175 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 73/217 (33%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ +DVS SM + D ++ A + ++ + + GLV FS + Sbjct: 92 IDIVMAMDVSGSMLA-RDLKPDRLTAAKNVASDFVKG-RPGD------RMGLVIFSGETF 143 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + T GL A +++ + K Sbjct: 144 TQVPLTTDHGVMLNMLAEMKNGLIDDGTAIGDGLATAISRL-----------KDSEAISK 192 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------------- 332 +++ +TDG N + D + AK G VY IG+ + Sbjct: 193 VVILLTDGMNNAGSVDPYTA--AEIAKLYGIRVYTIGVGSYGTAPYPVQTPFGTQIQQMK 250 Query: 333 ----HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 + L + AS ++ + + + + I K Sbjct: 251 VEIDEKLLASVASMTGGKYFRATSNQKLDEIYEEIDK 287 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 77/228 (33%), Gaps = 31/228 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + + + A + ++ +DVS SM + D T+++ A + A ++++ Sbjct: 71 ISLARPARAERVPRERATI--ILAIDVSNSM-AATDIQPTRLEAAKQGAQAFVDQLPP-- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ ++ I+ L + G +T G+ + I Sbjct: 126 ----RINLGLVSFAGSAAVLVPASTDRESVRSGIRGL-QLGPATAVGEGIFASLQAITTA 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 + IV ++DGE + + Q+ A+ V I Sbjct: 181 GERMS--DEGQPPPPAAIVLLSDGETTRGRPNTQA---ATAARDAEVPVDTIAYGTSDGT 235 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + LR A + S++ + + +G I Sbjct: 236 LDVGGQQIPVPVNEDALRELAEQTGGSYHRATTGDELQSVYRGLGSSI 283 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 84/228 (36%), Gaps = 31/228 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + A + ++ +DVS SM + D + T++ A + +A ++++ Sbjct: 71 FSLARPARAERVPRERATI--ILAIDVSNSM-AATDIAPTRLAAAKQGASAFVDQLPP-- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ ++ I+ L + G +T G+ + I Sbjct: 126 ----RINLGLVSFAGSATVLVPASADRESVRAGIRGL-QLGPATAVGEGIFASLQAITTA 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 ++ +T + IV ++DGE + + Q++ +A+ V I Sbjct: 181 G--KRFSDTGQSAPPAAIVLLSDGETTRGRPNNQAIEAARQAR---IPVDTIAYGTADGT 235 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + LR A + S++ + + + +G I Sbjct: 236 LDVGGQEVPVPVNEQALRDIAEQTGGSYHRATSGDELRSVYRGLGSSI 283 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 1/73 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 ST +D ++L C AK +G +V+ IG R E L+ACAS S Y + + DAF+ Sbjct: 510 STTKDARTLDICEAAKAKGVVVFTIGFEAPSRGQEVLQACASSASHYYDVDGLEISDAFA 569 Query: 360 HIGKDIVTKRIWY 372 I I R+ Sbjct: 570 SIASAIRQLRLTE 582 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 109/305 (35%), Gaps = 43/305 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + + + ++F++ G+ +++ + +T L +DR+++ AA Sbjct: 24 MRQFRRDESGVLAKPMIMIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAA----- 78 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 ++ +L + +G + +V S V + Sbjct: 79 ---DLDQPLDPEAVVLDYMS------------KSGLGDYTTVVVPEVSPTAKRVKASVDT 123 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + +A + P I +P+ +S ++ +++ +VLDVS SM S Sbjct: 124 NFTASWMNNVFYDDYIRNPDTYQLEPITLPLLASST-AVESIGNVEISLVLDVSGSMRSN 182 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQR 245 ++ ++ ++ + + + +V +S ++ E FL E VS Sbjct: 183 -----DRLVNLKRAAKEFVQTMD-DNTEDGKMSISIVPYSTQVSMPEAFLDELNVSSEHD 236 Query: 246 KIKYLSKFGVS-TNSTPGLKYAYNQ-----IFDMQGMRQH--------CNTEDANYKKII 291 ++ G N+ AY + +++ R C N ++ Sbjct: 237 YSHCINFSGSDFNNAGISTTQAYERTMHFTVWNSGDYRSRTRLVRQPTCAAHSDNPERTA 296 Query: 292 VFMTD 296 + ++D Sbjct: 297 LLLSD 301 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 81/234 (34%), Gaps = 28/234 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + I I P ++ + E+SH K L I+++ + A I N Sbjct: 17 FKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALA--IENNEI 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + +++K ++ N + L F+ I +I +T Y+ + Sbjct: 75 PDEPQQIKNNALVLSYVNAY-------LPSKKFLVPIININDNT--------HYLEYNAA 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y + + +I + + D++ V D S SM F+ Sbjct: 120 VTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNE 179 Query: 191 SITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + + I ++ + ++ +I D +N+ G + WG + Sbjct: 180 NKPRDHERIDALRSAFRKLHDIIMDNSNINAIG----------YIPFSWGTKRI 223 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 63/193 (32%), Gaps = 39/193 (20%) Query: 200 KSINAMLEEVKLIPDVNNVVQSGLV------TFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 K+I+++ + I + V+ G V +S + E++ I + Sbjct: 275 KTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSLEQEQYINNIDN-------IIEMEPH 327 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-- 311 G + S+ L N IF + H KK+++ ++DG + + + Sbjct: 328 GWTLISSGILSA--NNIFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISK 377 Query: 312 -------CNEAKKRGAIVYAIGIRVIRSHEF-------LRACASPNSFYLVENPHSMYDA 357 C E K+ + I I + + C +++Y N H + Sbjct: 378 MLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHK 437 Query: 358 FSHIGKDIVTKRI 370 T+ + Sbjct: 438 LQQAVSGSTTREV 450 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 77/228 (33%), Gaps = 47/228 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LD+S SME+ ++ +A + ++ + VN+ + GLV F+ Sbjct: 92 IDIVVALDLSTSMEAGDFRPQNRMHVAKEVLSEFI-----ANRVND--RIGLVVFAGAAY 144 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L L+ +K L T L + N++ D + + Sbjct: 145 TQAPLTLDYGVLKEVVKQLRTRVLEDGTAIGDALATSLNRLRDSEAKS-----------R 193 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV------IRSHEFLRACA--- 340 ++V +TDG+N S K + N A+ +Y I + + + Sbjct: 194 VVVLITDGDNNSGK--ISPMDSANMAQALKVPIYTILVGKGGKVPFPQGTDLFGNTVWRD 251 Query: 341 ----------------SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + +Y +P + + + + ++ Sbjct: 252 TEIPINPELMQDIADRTGGEYYRATDPEQLREGLQKVLDSLERSKLME 299 >gi|198426624|ref|XP_002127463.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1326 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 70/200 (35%), Gaps = 31/200 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D +++LD S S+ + +++E +L D + ++ ++ Sbjct: 1105 MDFVMILDSSSSVR------RRNWRIMKNFGRSIIETFRLAEDAA---RMAAFRYNRVVD 1155 Query: 232 EFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + ++ + G T + L++A + I N N Sbjct: 1156 TRSQILFRQHVNDMAGFLASYNAIPYNGGGTLTGQALRHAKDVIL------ARRNGNRPN 1209 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACAS 341 K +++ +TDG + + E ++ G + + +G+ ++ L S Sbjct: 1210 VKDVVLTITDGRSHDNVANVS-----RELREMGVLTFVVGVVPGNGKQLDENQLLSIAGS 1264 Query: 342 PNSFYLVENP-HSMYDAFSH 360 ++ +L + + AFS Sbjct: 1265 RSNMFLATSGFSGLNSAFSA 1284 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+M V+D S S+ ++ T + ++ + L + P + G+V +S++ Sbjct: 612 GALDIMFVIDSSESIGY---TNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + L++AYN++ + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG +D+ C R +V IGI Sbjct: 721 EKAQVFAVVITDGRYDPRDDDKNLGALCG----RDVLVNTIGIG 760 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 81/217 (37%), Gaps = 17/217 (7%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+TS + +TD + + V+D S S+ + +D + I +E+++ N Sbjct: 33 PVTSCTE---KTDCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQN 89 Query: 217 NVVQS---GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V + G + +S+ +E + L K++ ++ G T + + Sbjct: 90 QVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAINYLGRGTFTDCAI---------- 139 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 M Q ++ A K V +TDG + + A+ G ++A+ Sbjct: 140 SNMTQQFQSQTARDVKFAVVITDGHVTGSPCGGMKMQ-AERARDMGIKLFAVAPSEDVYE 198 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + LR ASP N HI ++ + + I Sbjct: 199 QGLREIASPPHDLYRSNYTITPKDALHIDENTIERII 235 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 82/218 (37%), Gaps = 28/218 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 ++ + I S+ L++ +VLD S SM S KI A ++ ++ Sbjct: 56 NSNGDCYLYVNIKGGEGEASKPRVPLNISLVLDRSGSM------SGDKIKYARQAAKFLI 109 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLK 264 +++ ++ +V + +++E + L+ I + G STN + G+ Sbjct: 110 DQLNSTDHLS------IVNYDDRVEVTSPSQSVKNKEALKAAIDKIHDRG-STNLSGGML 162 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYC-NEAKKRGAIV 322 Y Q+ + ++ +TDG N + + N+ K+ G + Sbjct: 163 EGYTQVKSTRKEGYVNR---------VLLLTDGLANQGITDPLELKRLAENKYKEDGIAL 213 Query: 323 YAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAF 358 G+ + + L A ++Y +++P + F Sbjct: 214 STFGVGADYNEDLLTMLAENGRANYYFIDSPDKIPQIF 251 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+M V+D S S+ ++ T + ++ + L + P + G+V +S++ Sbjct: 536 GALDIMFVIDSSESIGY---TNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHE 592 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + L++AYN++ + Sbjct: 593 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLI--------KESRR 644 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG +D+ C R +V IGI Sbjct: 645 EKAQVFAVVITDGRYDPRDDDKNLGALCG----RDVLVNTIGIG 684 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 51/149 (34%), Gaps = 11/149 (7%) Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G + +S+ +E + L K++ + G T + + M Q Sbjct: 22 GGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAI----------SNMTQQFQ 71 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 ++ A K V +TDG + + A+ G ++A+ + LR AS Sbjct: 72 SQTARDVKFAVVITDGHVTGSPCGGMKMQ-AERARDMGIKLFAVAPSEDVYEQGLREIAS 130 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 P N HI ++ + + I Sbjct: 131 PPHDLYRSNYTITPKDALHIDENTIERII 159 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+M V+D S S+ ++ T + ++ + L + P + G+V +S++ Sbjct: 612 GALDIMFVIDSSESIGY---TNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + L++AYN++ + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG +D+ C R +V IGI Sbjct: 721 EKAQVFAVVITDGRYDPRDDDKNLGALCG----RDVLVNTIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 80/217 (36%), Gaps = 17/217 (7%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+TS + +TD + + V+D S S+ + +D + I +E+++ N Sbjct: 33 PVTSCTE---KTDCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQN 89 Query: 217 NVVQS---GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V + G + +S+ +E + L K++ + G T + + Sbjct: 90 QVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAI---------- 139 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 M Q ++ A K V +TDG + + A+ G ++A+ Sbjct: 140 SNMTQQFQSQTARDVKFAVVITDGHVTGSPCGGMKMQ-AERARDMGIKLFAVAPSEDVYE 198 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + LR ASP N HI ++ + + I Sbjct: 199 QGLREIASPPHDLYRSNYTITPKDALHIDENTIERII 235 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+M V+D S S+ ++ T + ++ + L + P + G+V +S++ Sbjct: 612 GALDIMFVIDSSESIGY---TNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + L++AYN++ + Sbjct: 669 GTFEAIKLDDERINSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG +D+ C R +V IGI Sbjct: 721 EKAQVFAVVITDGRYDPRDDDKNLGALCG----RDVLVNTIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 80/217 (36%), Gaps = 17/217 (7%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+TS + +TD + + V+D S S+ + +D + I +E+++ N Sbjct: 33 PVTSCTE---KTDCPISVYFVIDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQN 89 Query: 217 NVVQS---GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V + G + +S+ +E + L K++ + G T + + Sbjct: 90 QVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAI---------- 139 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 M Q ++ A K V +TDG + + A+ G ++A+ Sbjct: 140 SNMTQQFQSQTARDVKFAVVITDGHVTGSPCGGMKMQ-AERARDMGIKLFAVAPSEDVYE 198 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + LR ASP N HI ++ + + I Sbjct: 199 QGLREIASPPHDLYRSNYTITPKDALHIDENTIERII 235 >gi|326923641|ref|XP_003208043.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 597 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ +LD S S+ + + T+I ++S+ ++ +++ + FS+ Sbjct: 84 HGAFDLYFILDKSGSVRNHW----TEIYSFVESLAEKF--------ISPMLRMSFIVFSS 131 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++R ++ L G T G K A QI+ H Sbjct: 132 RGTTIMKLTENREAIRRGLEILQHEVPGGDTFMHEGFKRANEQIY-------HETYGGVR 184 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 +I+ +TDGE + + N A+ GAIVY +G++ + S + + Sbjct: 185 TASVIIALTDGELQDVQFYY-AEQEANRARSFGAIVYCVGVKDFNETQLSTIADSIDHVF 243 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V + A I+ K Sbjct: 244 PVTGG---FYALRGTIDSILKKSCIE 266 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 67/214 (31%), Gaps = 40/214 (18%) Query: 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 DMM+V+D+S SM + + + ID + + V+ + G+V F + Sbjct: 117 DMMLVVDLSGSMQKEDMNDNGEYIDRLTTVKRVLSDFVEKRQGD----RLGVVLFGDHAY 172 Query: 232 EFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + ++I + G T G+ D ++ Sbjct: 173 LQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA-----------PQR 221 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------ 331 +++ ++DG N + L AKK A +Y +G+ Sbjct: 222 VMILLSDGSNTA--GVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDL 279 Query: 332 -SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 280 DEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 313 >gi|73976417|ref|XP_539432.2| PREDICTED: similar to Matrilin-2 precursor [Canis familiaris] Length = 524 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 75/212 (35%), Gaps = 28/212 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L++M V+D S S+ + ++ + + ++V L + G++ + Sbjct: 173 CKETPLELMFVIDSSESVG------LENFEIIKSLVKTLSDQVAL---DLATARIGIINY 223 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L + + + + G T + L A N +F+ Sbjct: 224 SHKVEKVAHLTQFSNKDDFKLAVDNMQYLGEGTYTATALHEA-NHMFEAA---------R 273 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF------LRA 338 KK+ + +TDG+ +++++ A ++ IG+ F + Sbjct: 274 PGVKKVALVITDGQ-TDSRDEKNLTEVVKRASDINVEIFVIGVAKKNDPNFEMFHKEMNL 332 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A+ V + + + K+ Sbjct: 333 IATDPDSEHVYQFDDFITLQDTLKQKLFKKKC 364 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 67.9 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 71/216 (32%), Gaps = 47/216 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + + +D+S SM + +++ + I+ +E+ + GLV F + Sbjct: 85 SLYLAVDLSESMLEQDMIWNQRPVSRYEAMQAVISEFVEDRRGDF-------IGLVVFGS 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ +Q + L T GL A Q+ E Sbjct: 138 FADVQAPLTPDLNAIQSLLADLRPGMADSRTAIGDGLALAVRQL-----------RESTT 186 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA---- 340 +++V ++DGEN S + D+ + A VY IG L++ Sbjct: 187 EDRVVVLLSDGENNSGEIRPDEATAV----AAAENIRVYTIGFGSAGRDSLLQSFGLRSS 242 Query: 341 -------------SPNSFYLVENPHSMYDAFSHIGK 363 + +Y + + + F I + Sbjct: 243 SLDEQTLREIAEQTQGRYYRATSSAELAEVFRDIER 278 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/191 (23%), Positives = 68/191 (35%), Gaps = 29/191 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM S IT++ + ++ L ++ + GLVTF + Sbjct: 309 VCLVLDKSGSMR--LGSPITRLTLMNQAAELYLIQIIEKESL-----VGLVTFDSTATIQ 361 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S+L K T+ GLK + I Sbjct: 362 TNLIRIINDSSYLAISTKLPQYPNGGTSICNGLKKGFEAITSSDQSTSGSE--------- 412 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K GAI++ I + + E L FY Sbjct: 413 IVLLTDGEDNRISS-------CFQEVKHSGAIIHTIALGPSAARELETLSDMTGGLRFYA 465 Query: 348 VENPHSMYDAF 358 E+ + + DAF Sbjct: 466 KEDVNGLIDAF 476 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 71/207 (34%), Gaps = 31/207 (14%) Query: 173 DMMIVLDVSRSMESFFDSS--ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++++D S SM + + K+ A ++ ++ + + + +V + Sbjct: 72 DVVLIIDKSGSMAPTYGPNNGEDKMTNAKEAAKGFVDLMDMTK-----HRVAVVD-FSSS 125 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + I ++ G T + + A + D + + Sbjct: 126 ASSFPFTVDKDAAKSYINTINS-GGGTATGNAIDAAVALLAD----------HRTEAQPV 174 Query: 291 IVFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRAC 339 IV MTDG + ++ +L AK G I Y I + + ++ Sbjct: 175 IVLMTDGAATESPKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPITSAPNVLMKNM 234 Query: 340 ASPN-SFYLVENPHSMYDAFSHIGKDI 365 A+ + V + ++ I K+I Sbjct: 235 ATTATHHHFVLGSKGLNQIYAAIVKEI 261 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 45/300 (15%), Positives = 110/300 (36%), Gaps = 37/300 (12%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGY 127 + + + + + T +F ++ + N + + ++ + + Y Sbjct: 92 EQYQAYADNPVFAVAETPVSTFSIDVDTGSYANVRRFLNDGQLPPKDAVRLEELVNYFPY 151 Query: 128 SISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + + + +P T + PW +R + + I +S + + +++ ++DVS SM Sbjct: 152 AYP-LPQGDVPFGVSTELAVTPWNPQTRLLRIAIKAS-DRSVEELPPANLVFLVDVSGSM 209 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSH 242 + M ++ ++++++ + LVT+ +++ + Sbjct: 210 H-----RREGLPMVQGTLKLLVDQLRPQD------RVSLVTYAGDSQVLLDSAPGSDKAK 258 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLS 301 ++ I L+ G ST G++ AY Q +H N I+ TDG+ N+ Sbjct: 259 IRAAIDQLTA-GGSTAGESGIQLAYQQ------ASKHLIDGGINR---ILLATDGDFNVG 308 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 + + +K G + +G V +E L A A ++ ++N Sbjct: 309 ISDFDSLKQLAADKRKSGVSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDNLREARKVL 368 >gi|301780608|ref|XP_002925750.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Ailuropoda melanoleuca] Length = 1307 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 66/175 (37%), Gaps = 23/175 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM+ ++D S S+ + + +++E ++ P+ VQ GL +S Sbjct: 171 TPVDMIFLVDGSWSIGH------SHFQQVKDFLASVIEPFEIGPNK---VQVGLTQYSGD 221 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L + + + L G +T + L + Q + Sbjct: 222 PQTEWDLNAFRTKEVVLAAVYSLRYKGGNTFTGLALTHVREQ------NLKPGAGPRPEA 275 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +TDG K + K G V+A+G++ E L+ ASP Sbjct: 276 AKVVILVTDG-----KSQDDARAAGRILKDLGVAVFAVGVKNADEAE-LQLLASP 324 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 78/207 (37%), Gaps = 28/207 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q R+D+M VLDVS SME +++ A ++ +E+++ GL F Sbjct: 376 QNKKRVDVMAVLDVSGSMEDEG-----RLEQAKAALRIFVEQLQDDDGF------GLTIF 424 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++ + + + + +I L+ G T + AY ++ Sbjct: 425 SDQATVLTPISPIGSRRTEVLNRIAGLTPRG-GTRLLDTVVEAYQELTAT--------PP 475 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACAS 341 + +V +TDG +N S + + L + ++ V+ I L+ AS Sbjct: 476 GQRI-RAVVVLTDGLDNRSQRSAEDVLDLLRQDREGYSIKVFTIAFGGDADVHLLKEIAS 534 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVT 367 + V P A I +DI T Sbjct: 535 ATGAKSYVGKPGE-RGAIERIYQDITT 560 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 90/234 (38%), Gaps = 40/234 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + I L ++ ++ G+ ++V F + L +DR+++ AA+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASL--- 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD-IVRSTSLDIVVVPQNEG 126 + R+ + G + +D+ +V + +L++ Sbjct: 70 -----------------TQSRSPAEVVRDYVTKAGLADYLDEPVVNANTLNV-------- 104 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S++A + Y +P F + + P S+ + +++ +VLD+S SM + Sbjct: 105 RSVTATAAYSMPTVFMKLLD----IDRLEAPAVSTAEERVSN---VEISLVLDMSNSMVT 157 Query: 187 FFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + ++D + ++ V D V+ +V ++ ++ L Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADL 211 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 ++++ C+ A+ +G VY++ L+ CAS Y + F I Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVFHSIAS 557 Query: 364 DIVTKRIW 371 I R+ Sbjct: 558 HITQLRLT 565 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 77/210 (36%), Gaps = 34/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S ++I+++D + + + + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSASGANISRLDATKEVLAEFAKT-------RQGDRLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + + L++ G ST+ + A ++F+ Q Sbjct: 160 ---DAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSESNQ--AA 213 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +K+ + +TDG T + + AK +G ++ I + L Sbjct: 214 SSKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQT 271 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 272 IERIASESGGKAFQALNRDELARAYDDIGK 301 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 69/212 (32%), Gaps = 42/212 (19%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ +D+S SM ++ I ++ + + + + GL+ F ++ Sbjct: 93 DLMLAVDISGSMRVEDMQVGNRMARRIDAVKQLGSDFMSRRSGD---RLGLILFGSRAYL 149 Query: 233 FFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + +QR + + G T + A ++ E ++ Sbjct: 150 QSPLSFDIQTVQRFLLESQIGFAGQETAIGDAIGLAVKRL-----------QERPATSRV 198 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-------------------- 330 ++ +TDG+ T L N A G +Y IGI Sbjct: 199 LILLTDGQ--DTASTVDPLEAANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPS 256 Query: 331 ---RSHEFLRACAS-PNSFYLVENPHSMYDAF 358 + + +S ++ +P + + Sbjct: 257 ADLDENSLIAIASSTGGQYFRARDPEELATVY 288 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 56/314 (17%), Positives = 111/314 (35%), Gaps = 47/314 (14%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 L +F + G MTILT + I+F+ G ++V + L +DR+++ AA Sbjct: 19 GLQDDSFATSESGAMTILTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAA-- 76 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + + K I K + + + D+ ST+ + Sbjct: 77 -----DLDQELCPKDVVIDYLKKEGLDKYLTGDPKVEP------DVCGSTAAVL------ 119 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 +GY + + ++ F+ W I TS + ++ +++ +VLDVS SM Sbjct: 120 KGYR-RVEANADMDIEM-HFMKW-RGIETIASAATSVAE---ESIGNVEISLVLDVSGSM 173 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWG 239 +K++ K+ N ++++ + V +V +S ++ L G Sbjct: 174 RG------SKLENLKKAANLFIDDM-FAKTEDGKVSISIVPYSEQVSIPDYLMNKLNTQG 226 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF-MTDGE 298 + + + + S +T T FD+ T + I + DG Sbjct: 227 TNSIANCVDFASADFATTRFT---------AFDVTDPVTGIVTPGTTLARTIHHDIGDGS 277 Query: 299 NLSTKEDQQSLYYC 312 + S C Sbjct: 278 DRRPYNGFVSSTIC 291 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 17/75 (22%), Positives = 31/75 (41%), Gaps = 3/75 (4%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASPN-SFYLV-ENPHSMYD 356 ++D + C +AK I+++I L+ C S + ++Y +N + Sbjct: 525 NQVQKDPRLTSICQKAKDEKIIIFSIAFDAPDGVKPLLKGCVSDDGAYYEAKDNDKDIIS 584 Query: 357 AFSHIGKDIVTKRIW 371 FS IG I R+ Sbjct: 585 VFSSIGSTIQNLRLT 599 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 77/217 (35%), Gaps = 46/217 (21%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+++ +D+S SM++ D + + A + M+ P+ N GLV F+ + Sbjct: 85 QGIDLVLAMDLSGSMQA-LDLKPNRFEAARDVASEMIA---ARPNDN----IGLVVFAGE 136 Query: 230 IEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + +Q+ ++ + + T GL A N + N Sbjct: 137 SFTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL-----------RGSDNK 185 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF------------ 335 K+I+ +TDG N + D A++ G +Y + +F Sbjct: 186 SKVIILLTDGSNNA--GDITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVE 243 Query: 336 ---------LRACA--SPNSFYLVENPHSMYDAFSHI 361 LR A + +Y + +++ + I Sbjct: 244 ADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|297265792|ref|XP_002799252.1| PREDICTED: vitrin-like [Macaca mulatta] Length = 657 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 89/270 (32%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS S + + + ++ + T Sbjct: 409 EGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDTDRLACSKT 468 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ ++D S S+ ++ + + +E ++ Sbjct: 469 CL--------NSADIGFIIDGSSSVG------TGNFRTVLQFVTNLTKEFEISDTD---T 511 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 512 RVGAVQYTYEQRLEFGFDQYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 565 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 566 ---KKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 617 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y I ++I Sbjct: 618 IATHPARDHSFFVDEFDNLYQYVPRIIQNI 647 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 71/208 (34%), Gaps = 22/208 (10%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + + Y P V+ + D ++D+ ++D S S+ Sbjct: 228 LWSTTTYTSSQNRPGADPGIQRQDPSGAAFEKPVRADVSLDCKIDLSFLIDGSTSIGK-- 285 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRK 246 + + + + + + + + P G+V + + F L+ L+ Sbjct: 286 ----RRFRIQKQLLAGVAQALDIGPAGP---LMGVVQYGDNPATHFNLKTHMNSRDLKTA 338 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I+ +++ G +N + + F N ++ ++V M DG E+ Sbjct: 339 IEKITQRGGLSNVGRAISFVTKNFFSK------ANGNRSSAPNVVVVMVDGWPTDKVEEA 392 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 L A++ G ++ I + +E Sbjct: 393 SRL-----ARESGINIFFITVEGAAENE 415 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 171 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQMGLIQYANNPR 219 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L S ++ + G TN+ ++YA + + Sbjct: 220 VVFNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYST------AAGGRPGAT 273 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + + CN+ + + I + + + ++A Sbjct: 274 KVMVVVTDGESHDGSKLKAVIDQCNK---DNILRFGIAVLGYLNRNALDTKNLIKEIKAI 330 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + + + IG+ I Sbjct: 331 ASIPTERHFFNVSDEADLLEKAGTIGEQIF 360 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 65/197 (32%), Gaps = 34/197 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A + +++++D S SM + F ++K D+A ++ L ++ + G++ F Sbjct: 391 QRAPISLLLIIDRSASMSASFG--VSKFDLAKEAAILALTALQAGD------RIGVLAFD 442 Query: 228 NKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + V+ LQ +I + G TN L + +H Sbjct: 443 TDTIWVIPFQAVGEGAAVAELQTRIATM-AIGGGTNIERALAVGLPALAAEPHSVRHA-- 499 Query: 283 EDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 V +TDG N + Q A+ + I I + L Sbjct: 500 ---------VLLTDGRSYSNNYPRYQQLV----ETARAAQITLSTIAIGTDADTDLLEQL 546 Query: 340 A--SPNSFYLVENPHSM 354 A +Y V + + Sbjct: 547 ARWGNGRYYFVPDAADL 563 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 39/228 (17%), Positives = 77/228 (33%), Gaps = 31/228 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + A + ++ +DVS SM + D + ++ A +A ++++ Sbjct: 71 FSLARPAQARRVPRERATI--ILAIDVSNSM-AATDVAPNRLAAAKDGADAFIDQLPP-- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+FS ++ I L + G ST G+ I Sbjct: 126 ----RINLGLVSFSGSAALLVPPTTDRQSVRSGIHGL-QLGPSTAIGEGIFAGLQAITTA 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 Q IV ++DGE + + Q+ A+ G V I Sbjct: 181 GE--QLAADGGTPPPAAIVLLSDGETQRGRPNAQAAQA---ARDAGVPVDTIAYGTADGS 235 Query: 333 -------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDI 365 + LR A A+ S++ + + + ++G I Sbjct: 236 LDVGGQEIPVPVNEDALREIARATDGSYHRAASGDELRSVYENLGSSI 283 >gi|326922791|ref|XP_003207628.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1224 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 67/163 (41%), Gaps = 21/163 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + A+++ V + N + GLV + Sbjct: 767 CKEIPMELVFVIDSSESVGP------ENFEIIKDFVTALVDRVTV---GRNATRIGLVLY 817 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S +++ F L + ++R I+ + G T + ++ A ++ Sbjct: 818 SLEVQLEFGLNKHTTQQDVKRAIRKMQYMGEGTYTGTAIRKA---------TQEGFLGAR 868 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +K+ V +TDG+ +E + EA +YAIGI Sbjct: 869 TGVRKVAVVLTDGQ-ADKREAVKLDVVVREAHAANIEMYAIGI 910 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D L++ +LD S S + + K + + ++ ++L + L+ +S Sbjct: 19 EDCILEIAFLLDSSESAKHYNHEQQKK---FVLEVVDQMKGLQLSSGRTLSWRMALLQYS 75 Query: 228 NKIEEFFLL-EW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + +W G + I ++ G T +T + Q++ +G Sbjct: 76 STVSTEQTFHDWKGPEAFKSHIAPITYIGHGTYTTYAI-TNLTQLYMTEGT--------P 126 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACAS 341 + K+ V TDG + D + +AK +G +++ +G+ + LR AS Sbjct: 127 DSLKLAVLFTDGVDHPRNPDIFAAT--ADAKNQGIVLFTMGMTRVAEEVSNAAKLRLLAS 184 Query: 342 PNSFYLVENPHS 353 + V N Sbjct: 185 VPASRFVSNLQE 196 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 83/209 (39%), Gaps = 24/209 (11%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + +R I F I + ++ P + + Q + D+ +V+D+S+SM++ Sbjct: 34 RQLQTTARTLIIKLFLRSITFSLLIISLLGPSFGEAERDIQAKGK-DIFMVVDLSKSMDA 92 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 D + ++++ +N +E + G++ FSN L + + L+ Sbjct: 93 A-DVTPSRLEKVKFELNRFIE-------NERANRIGIIIFSNDAYIHVPLTYDAAALELF 144 Query: 247 IKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I+ L TN ++ AYN++ + K++V TDGEN S+ Sbjct: 145 IQSLQTDLLPTNGTNVCGAIEMAYNKLMNSADPTSRA--------KMMVLFTDGENSSSC 196 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + N ++ G VY++ + Sbjct: 197 TN----ALFNNLRRFGIGVYSVAVGTKVG 221 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 54/329 (16%), Positives = 114/329 (34%), Gaps = 34/329 (10%) Query: 46 MKTVLHSMID-----RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 K L ++ + ++ +G + K G + + NTW + + Sbjct: 115 TKKALEVALELMAQRNVSIPTLVMVVTDGRSADDPKGPGKLLQAQ-PNTWVFAAATGDPE 173 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 ++ DI +++ +V+ + + R + T + T+ Sbjct: 174 KVDTRELMDITG--NINHIVMHRGRDLATDITRRLLREAQDKCRTTTTTTTTTTTTTTTT 231 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + N T LD+++VLD S + + ++S +++++K+ P V Sbjct: 232 TTTTNPITGCELDLVLVLDFSTTTDPVYNSY-------KDLSKRLVQQLKIGPHYTQVAA 284 Query: 221 SGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 T F L ++ + R I L G +T G++ A QI + +G R Sbjct: 285 VTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPG 344 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------S 332 T K+++ TDG N ++++ +A G +Y + + Sbjct: 345 IAT------KVMIVFTDGWSNKGPDPEKRAR----DAVNAGFEMYTVAYTARAPGSVTLN 394 Query: 333 HEFLRA-CASPNSFYLVENPHSMYDAFSH 360 +E L A S + ++ D Sbjct: 395 NETLSAISGSSGHAFTDVTFQTLVDKIKQ 423 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 55/136 (40%), Gaps = 24/136 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++IVLD S S++ F+ S K + + EE+++ P + ++ +S Sbjct: 34 LNVLIVLDRSDSVKGGFNKS-------RKFVTDVSEELQIGP---EKHRVAMIVYSGLSY 83 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +L W + +K L G +TN+ L+ A + + + Sbjct: 84 RREVLPWNFAKSNEEFVQKTNALRAIGGTTNTKKALEVALELM----------AQRNVSI 133 Query: 288 KKIIVFMTDGENLSTK 303 +++ +TDG + Sbjct: 134 PTLVMVVTDGRSADDP 149 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 52/341 (15%), Positives = 110/341 (32%), Gaps = 47/341 (13%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 + ++ + + + R + + + + +F ++ Sbjct: 52 QAANKMMVMRHAEYAAETGPFRQEALSIRPAADTERYQKQPDQPVKAVAQEPVSTFSIDV 111 Query: 99 RDNGFVNDIDDIVRSTSL---DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 + N + + + L D V + + Y Y PL P+ +++ I Sbjct: 112 DTGSYAN-VRRFLNNGRLPPKDTVRIEEIVNY-----FPYNYPL-PTGGHPFAIHTQTID 164 Query: 156 MP----------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + + + +++ ++D+S SM S K+ + K++ + Sbjct: 165 SPWQHEAKLIKIGIQAQDLAKKELPPANLVFLVDISGSMNS-----PEKLPLVKKTLRIL 219 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 E+++ V L+T+++ E G + + R I L G ST L Sbjct: 220 TEQLRPQDKVT------LITYASGEELVLPPTSGRNKDEILRAINKLQA-GGSTAGESAL 272 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 K AY Q Q I+ TDG+ N+ E +K G + Sbjct: 273 KMAYEQ---AQKAYVKNGINR------ILLATDGDFNVGVSSTDALKSMVAEKRKSGISL 323 Query: 323 YAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSH 360 +G +E + A A ++ ++N H Sbjct: 324 TTLGFGTGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQH 364 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 87/217 (40%), Gaps = 23/217 (10%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 +T+ H+V +V + + ++ ++D S SM+S KI++A KS+ + + Sbjct: 275 FTSGHHLVRVAVQGKRVPVKERTPVHLVYLVDTSGSMQS-----PDKIELAKKSLKMLTD 329 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 +K V +G V ++ +E + + L+ G + S+ G+ AY Sbjct: 330 TLKPGDTVALCTYAGSV---REVLAPTGIE-SKGKILAALADLTAGGSTAMSS-GIDLAY 384 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + + ++ H N ++ ++DG+ N+ + L A+ +G + +G Sbjct: 385 S-LAERTLVKGHVNR--------VIVLSDGDANVGPTSHDEILKTIKRARDKGITLSTVG 435 Query: 327 IRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSH 360 + + A ++ +++ FS Sbjct: 436 FGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSE 472 >gi|323320820|gb|ADX36428.1| complement factor B [Apostichopus japonicus] Length = 913 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 73/211 (34%), Gaps = 24/211 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D LD+ + D S S+ +MA K +++E+ + N+ ++ G + F++ Sbjct: 455 DTGLDLYFMFDGSSSVGE------DNFNMAKKFAKELVKEIGVTDRPNS-LRVGALVFNS 507 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E F + + I + TN ++ + Q Sbjct: 508 EAEIGFHTVAFDSTADVLDAIDSMEYKEGGTNIAKAF-----EVLSNVMIPQTAKLNREK 562 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 K + +TDG+ E + + + ++ IGI + L AS Sbjct: 563 SFKTVFLITDGD---ATEGGDAQEDARAVRDQDVTIHCIGISENATRRTLSGMASEPLSE 619 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +++ ++ + + + I Y + Sbjct: 620 HLFFLKDYSTL----EEFIQIVTNQTIDYSE 646 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 79/216 (36%), Gaps = 27/216 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSINAMLEEVKLIPD 214 S++ + + ++VLD S SM D + T +I A ++I L P Sbjct: 78 SLRRGVNREEGISFLLVLDASGSMWDALDGTPTEDPDRMRITHAKRAIREFL------PL 131 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ + GL F+ + + + K+ + + P + AY +++ Sbjct: 132 LSERDRVGLAVFNRTYRMIQPIVDDPALVLEKLDAIER--------PSREQAYTELYRSM 183 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY---CNEAKKRGAIVYAIGIRVIR 331 + E+ ++++V ++DGEN ++ + A + G Y I + Sbjct: 184 E-EALTSFEEEGRRRVLVVLSDGENFPVDPEKSPATPGTAVDLAHRYGITCYVIHFGTEK 242 Query: 332 SHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 + AS + N + ++ I + + Sbjct: 243 -DRLIGDLASETGGRVFDARNALELASVYTAIQEQV 277 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 38/220 (17%), Positives = 86/220 (39%), Gaps = 32/220 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + D+M+V+D+S SM S I+++D + + + K Sbjct: 92 EPQVREQLGRDVMVVVDLSGSMAEQDFTSKQGDKISRLDATKEVLADFAKTRKGD----- 146 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + ++ L++ G ST+ + A ++F+ Sbjct: 147 --RLGLILFG---DAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAI-KVFE 200 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q + + +K+++ +TDG T + + AK +G ++ I + ++ Sbjct: 201 QSAAVQDSSVDANVKEKVVIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPQT 258 Query: 333 -------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E ++ A S + N + A++ IG+ Sbjct: 259 VGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 298 >gi|47223676|emb|CAF99285.1| unnamed protein product [Tetraodon nigroviridis] Length = 628 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 71/205 (34%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ VLD S S++ +++ + ++ ++ + FS Sbjct: 42 GGFDLYFVLDKSGSVQHYWNEIFYFVHHLAHKF------------ISPQMRMSFIVFSTD 89 Query: 230 IEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L ++ ++ + + G T GL A QI+ G + Sbjct: 90 GRTLMALTEDRDKIRAGLEELRMVQPGGDTYMDRGLHRASEQIYYAAGDGYRAAS----- 144 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE + D A++ GA VY +G++ + S + + Sbjct: 145 --VIIALTDGELREDQFDTAQRE-AGRARQLGASVYCVGLKDFNETQLSTIADSKDHVFP 201 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V H ++A + I+ + Sbjct: 202 V---HDGFEALQSVIDSILKRSCIE 223 >gi|198436258|ref|XP_002122777.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 4258 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 17/167 (10%) Query: 202 INAMLEEVKLIPDVN-NVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTN 258 + + L V +V N + + F+ + F L +I ++ G T+ Sbjct: 2103 VRSFLRRVMAQLNVGLNSTRIAALRFNRDVIPLWSFAQSTSSEDLIERIDRVTYDGSGTH 2162 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L YA N++F + + + V +TDG D K + Sbjct: 2163 TGKALTYAANRLFTEAE------GDRPDVPDLAVVLTDGRAQDNPGDTV-----QALKNK 2211 Query: 319 GAIVYAIGI-RVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIG 362 G V AI + + +E + P+ + + V + ++ +I Sbjct: 2212 GVKVIAIAVTNRVDINEIYAIASDPDEQNAFFVNDFEGLFSVVENIA 2258 >gi|156409365|ref|XP_001642140.1| predicted protein [Nematostella vectensis] gi|156229281|gb|EDO50077.1| predicted protein [Nematostella vectensis] Length = 156 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 63/175 (36%), Gaps = 27/175 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 ARLD+ ++D S S+ + + I ++ + P + GLV +SN Sbjct: 7 KARLDLGFLVDGSGSI------KAARFKGVKRFIQNVISRFHISP---KHTRVGLVLYSN 57 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F + + + T + L Y +F R Sbjct: 58 NPYKIFGFNKYTNKNAAMKATGRIPYPRRGTKTGRALAYTGRYLFRSSKRR--------- 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 ++++ +TDG + + ++ G +YA+G+ + + LR+ AS Sbjct: 109 --RVLILLTDGR-----SYDRVSAPARKLRQAGIHIYAVGVGRNYNIKQLRSIAS 156 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 73/204 (35%), Gaps = 27/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D S SM + + + L+ ++N + + F + + Sbjct: 115 DIVFVIDTSGSMANTDPQN-----------ERFSSVLNLMDNMNTQNRVSIYKFDDTSKR 163 Query: 233 FFLLEWGVSHLQR----KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L++ ++K +TN + AYN+I + K Sbjct: 164 IIPMTEVSESLKKNAEEELKQYEIPAGNTNMGEAIDSAYNEINSTK---------RPGRK 214 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFY 346 ++ ++DGE+ +++ K +Y IG+ + L+ A + +Y Sbjct: 215 AAVILLSDGEDN-FGLNKKFDETLKPFKDSNISIYTIGMSNENNFTTLKKIAKDTHGEYY 273 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 V+N + FS I + + Sbjct: 274 NVKNASDLKGTFSKIYYATQQRLL 297 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 74/220 (33%), Gaps = 30/220 (13%) Query: 155 VMPITSSVKVNSQ-TDARLDMMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEV 209 + +V + + +D+++ LD S SM + D + ++ K ++ ++ Sbjct: 68 AIARPQTVDASREIKTPGVDIILCLDASESMAQPDFAIDGQRVNRLTAVKKVVHDFVKRR 127 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAY 267 + GLV F + L L I+ L T L A Sbjct: 128 DTD-------RIGLVVFGDYAFTQAPLTLDKGLLLNLIENLRIGMAGRKTAIGDALGVAG 180 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +I D+ M K+++ ++DGEN + D G +Y IG+ Sbjct: 181 KRIKDIPAMS-----------KVVILLSDGENTA--GDMTPQGAAEALAALGIKIYTIGM 227 Query: 328 RVIR--SHEFLRACASP-NSFYLVENPHSMYDAFSHIGKD 364 + S E + A +Y N + + I K Sbjct: 228 GTEQAGSKELAQIAAIGQGKYYHASNTEQLDSIYKEIDKA 267 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 74/209 (35%), Gaps = 26/209 (12%) Query: 166 SQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +A D+++ +D+S SM F +I ++ + E V+ + L Sbjct: 88 TIENAARDVVMAIDISGSMDARDFATPEGERIQ----RLSGVREVVRAFVSGREGDRMAL 143 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L + + + + G+ + + D G+ Sbjct: 144 IVFGTSAYLQAPLTDDLETIIALLDR---------TEVGMAGPHTALGDAIGLSIRTFET 194 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFL 336 +++++ ++DG + +++ + A RG +Y IG+ + L Sbjct: 195 SEIDQRLLILLSDGSDTASR--MSPVNAAEIAADRGVEIYTIGVGDPDATGENRVDLTTL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGK 363 + A + ++ E+ S+ + I + Sbjct: 253 KEVAQRTGGQYFFAEDAASLEAVYDRIDE 281 >gi|73992734|ref|XP_543096.2| PREDICTED: similar to Protein KIAA1510 precursor [Canis familiaris] Length = 1405 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 77/204 (37%), Gaps = 26/204 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +DM+ ++D S S+ + + +++E ++ P VQ GL +S Sbjct: 203 SPVDMIFLVDGSWSIGH------SHFQQVKDFLASVIEPFEIGPSK---VQVGLTQYSGD 253 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L + ++ L G +T + L + ++ + Sbjct: 254 PQTEWDLNALRTKEDVLAAVRRLRYKGGNTFTGLALTHV------LEHNLRPAAGPRPEA 307 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNS 344 K+++ +TDG K + K G ++A+G++ E L+ AS + Sbjct: 308 TKVLILVTDG-----KSQDDARAAGRILKDLGVAIFAVGVKNADEAE-LQLLASQPLDIT 361 Query: 345 FYLVENPHSMYDAFSHIGKDIVTK 368 + V++ + +G+ I K Sbjct: 362 VHNVQDFPQLGTLAGLLGRLICQK 385 >gi|312108072|ref|XP_003151046.1| hypothetical protein LOAG_15508 [Loa loa] gi|307753789|gb|EFO13023.1| hypothetical protein LOAG_15508 [Loa loa] Length = 203 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDV 215 P + + D+M ++D S S+ S+ + + ++E V+L Sbjct: 44 PGRVCTVQTTCPKQKTDLMFLIDGSGSIGSY---------VFKNEVLRFIKEFVELFDIG 94 Query: 216 NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + GL+ +S++I F L + I + T + +++ + + Sbjct: 95 LDNTRVGLIQYSDQIRHEFDLSQYTDKESVINAISQVQYLTGLTRTGAAIQH---MVMEG 151 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 R+ + + ++ + +TDG + A+K ++++G+ Sbjct: 152 FSERRGARKQGDDVARVSIVITDG-----RSQDNVTEAAISARKSHINMFSVGV 200 >gi|257785062|ref|YP_003180279.1| Cna B domain-containing protein [Atopobium parvulum DSM 20469] gi|257473569|gb|ACV51688.1| Cna B domain protein [Atopobium parvulum DSM 20469] Length = 863 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 51/258 (19%), Positives = 94/258 (36%), Gaps = 37/258 (14%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P+ T P + + + I S+ ++++V D S SM + ++D Sbjct: 54 PVHKKTISPNTDGTYDLTLTIKGETSAASEEQKA-NVLVVFDNSSSMTAQTGGGEMRLDA 112 Query: 198 AIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 A + +N + + I +VV+ L++F+ K ++ QR + F Sbjct: 113 AKRVVNQLSSTILGINRNAQKDVVEMALLSFNEKPNLECGWTADLNEFQRATNNM-GFHT 171 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLY 310 TN L+ A ++ Q ++F+TDG N + +Q Sbjct: 172 GTNWESALERA--KVLADQKAANGN-------PTYVIFVTDGLPTQDRNGWVRNNQIGYE 222 Query: 311 YC-NEAK---KRGAIVYAIGIRVIRSHEFLRAC----------ASPN--SFYLVENPHSM 354 + +EA+ G Y++ + H +LR +P +Y N M Sbjct: 223 HALDEARAIGSAGYHFYSVYM--YGGHAYLRQLTNYAYTGNPFGNPGGTYYYEANNTAQM 280 Query: 355 YDAFSHIGKDIVTKRIWY 372 AF I ++TK I Y Sbjct: 281 EQAFKEIA-SVITKSITY 297 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 619 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 675 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AYNQ+ + Sbjct: 676 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 727 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 728 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 767 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 39/230 (16%), Positives = 87/230 (37%), Gaps = 18/230 (7%) Query: 148 YTNSRHIVMPITSSVKVNSQ----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + + ++ P S+ N+ D +++ VLD S S+ + + + + + Sbjct: 24 HAQQQEVISPDISTTDRNNNCPEKADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVP 82 Query: 204 AMLEEVKLIPDVNNVV---QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 + +++ ++ V + G + FS+++E F + + ++ + F T + Sbjct: 83 QFISQLQNEFYLDQVALSWRYGGLHFSDQVEVFSPPGSDRASFTKSLQSIRSFRRGTFTD 142 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L QI RQH N V +TDG + + A++ G Sbjct: 143 CALANMTQQI------RQHVGRGVVN---FAVVITDGHVTGSPCGGIKMQ-AERAREEGI 192 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 ++A+ + + LR A+ N +M + I +D + + I Sbjct: 193 RLFAVAPNRNLNEQGLRDIANTPHELYRNNYATMRPDSTEIDQDTINRII 242 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/282 (15%), Positives = 88/282 (31%), Gaps = 49/282 (17%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + ++ + + S Y F F + P T S ++ Sbjct: 30 NKKSKPNVTLSSTIAFKKMSSWKDYLYHSLFAFRLFAVTLIIIA-LARPQTHSENAQTKI 88 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+++ +DVS SM S D + + K + ++ N+ + GLV ++ Sbjct: 89 TDGIDIVMAIDVSSSMLSQ-DLKPNRFEALKKVASQFVK-----DRPND--RIGLVVYAG 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + + L+ + T GL A N++ E Sbjct: 141 ESYTKTPVTTDKGIILSSLAELTYGQVEDGTAIGMGLATAVNRL-----------KESKA 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRS--- 332 ++I+ +TDG N + D L A + G VY +GI Sbjct: 190 KSRVIILLTDGVNNTGVIDP--LIAAELAAEYGIKVYTVGIGTNGMALSPYALNPDGSIM 247 Query: 333 ---------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++ A + ++ N + + I K Sbjct: 248 YRMLQVEIDESLMKKIAQVTHGRYFRATNNQKLQQIYDEINK 289 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 620 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 676 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AYNQ+ + Sbjct: 677 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 728 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 729 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 768 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 + D +++ VLD S S+ + + + + + + +++ + V + G Sbjct: 48 KADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVPQFISQLQNEFYLEQVALSWRYGG 106 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ + F T + L QI RQH Sbjct: 107 LHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGRG 160 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N V +TDG + A++ G ++A+ + + LR A+ Sbjct: 161 VVN---FAVVITDGHVTGNPCGGIKMQ-AERAREEGIRLFAVAPNRNLNEQGLRDIANTP 216 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 N +M + I +D + + I Sbjct: 217 HELYRNNYATMRPDSTEIDQDTINRII 243 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 627 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 683 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AYNQ+ + Sbjct: 684 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 735 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 736 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 775 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 + D +++ VLD S S+ + + + + + + +++ ++ V + G Sbjct: 55 KADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVPQFISQLQNEFYLDQVALSWRYGG 113 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ + F T + L QI RQH Sbjct: 114 LHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGKG 167 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N V +TDG + + A++ G ++A+ + + LR A+ Sbjct: 168 VVN---FAVVITDGHVTGSPCGGIKMQ-AERAREEGIRLFAVAPNRNLNEQGLRDIANSP 223 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 N +M + I +D + + I Sbjct: 224 HELYRNNYATMRPDSTEIDQDTINRII 250 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 622 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 678 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AYNQ+ + Sbjct: 679 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 730 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 731 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 770 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 + D +++ VLD S S+ + + + + + + +++ ++ V + G Sbjct: 50 KADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVPQFISQLQNEFYLDQVALSWRYGG 108 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ + F T + L QI RQH Sbjct: 109 LHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGKG 162 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N V +TDG + + A++ G ++A+ + + LR A+ Sbjct: 163 VVN---FAVVITDGHVTGSPCGGIKMQ-AERAREEGIRLFALAPNRNLNEQGLRDIANSP 218 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 N +M + I +D + + I Sbjct: 219 HELYRNNYATMRPDSTEIDQDTINRII 245 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 67.5 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 357 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 413 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AYNQ+ + Sbjct: 414 GTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 465 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 466 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 505 >gi|313225343|emb|CBY06817.1| unnamed protein product [Oikopleura dioica] Length = 321 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 81/200 (40%), Gaps = 26/200 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D++I++D S S+ ++ ++ + + ++ +K ++ G+ +S+ Sbjct: 133 QTDLVILIDGSWSVT---PTNFERVKIFLSAL------LKHFSIGHDASMIGIAQYSDNP 183 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + L + + G +T + L +A + +F N + Sbjct: 184 RLEFGLNEHYDFPSLNAAVNRMKYKGGNTATGKALTFALDHVFGRSS--------RPNAQ 235 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSF 345 K+++ +TDGE ++ G +++IG+ + E L+ A+ N Sbjct: 236 KVVLIITDGE----SLQDTVTEPARRLRENGVEIFSIGVGDEINLEELKDMATDPDSNHV 291 Query: 346 YLVENPHSMYDAFSHIGKDI 365 + V +++ + + KDI Sbjct: 292 FQVGGYNAITGITTQVLKDI 311 >gi|296482552|gb|DAA24667.1| vitrin precursor [Bos taurus] Length = 652 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 93/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ + + + + ++ + T Sbjct: 404 EGASENEKQYMLEPNFANKAVCRTNGFYSLTVQNWFSLHKTVQPLVKRVCDTDRLACSKT 463 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ + ++ + + E ++ Sbjct: 464 CL--------NSADIGFVIDGSSSVG------TSNFRTVLQFVANLSREFEISDTD---T 506 Query: 220 QSGLVTFSNKIEEFFLLE-WGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + ++ + F + + S + IK + + T++ + YA Q+F Sbjct: 507 RIGAMQYTYEQRLEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF------ 560 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 561 ---KKSKPNKRKLMILITDG-----RSYDDIRIPAMLAHHKGVITYAIGVAWAAQDELDI 612 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + ++I Sbjct: 613 IATHPARDHAFFVDEFDNLYKVVPKVIQNI 642 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 75/242 (30%), Gaps = 35/242 (14%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + + I + + + D ++I +DVS SME+ D + ++ A Sbjct: 54 RHVPAALFLVMMSLLIVGAARPAGEVRVPRDRATIIIAVDVSLSMEA-RDVAPNRLIAAK 112 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ + D+ G+V F+ + + I L+ T+ Sbjct: 113 EAAQQFVR------DLPERFNVGVVAFARTAAVVISPTTDHAAVTNAIAGLTTR-PGTSI 165 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + A + I IV ++DG+N S + +++ AK Sbjct: 166 GEAVFNALDSIRSFD-----REAATDPPPAAIVLLSDGDNTSGRPVSEAIEAAANAK--- 217 Query: 320 AIVYAIGIRVIRS--------------HEFLRAC--ASPNSFYLVENPHSMYDAFSHIGK 363 V I L+ + Y E+ + + + IG Sbjct: 218 VPVSTIAYGTPDGYVMIDNRPVQVPVNKAALQELSEGTGGRAYTAESASELREVYQQIGT 277 Query: 364 DI 365 + Sbjct: 278 SL 279 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 162 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQMGLIQYANNPR 210 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L S ++ + G TN+ ++YA + + Sbjct: 211 VVFNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYST------AAGGRPGAT 264 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + + CN+ + + I + + + ++A Sbjct: 265 KVMVVVTDGESHDGSKLKAVIDQCNK---DNILRFGIAVLGYLNRNALDTKNLIKEIKAI 321 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + + + IG+ I Sbjct: 322 ASIPTERHFFNVSDEADLLEKAGTIGEQIF 351 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 67.5 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 51/226 (22%), Positives = 88/226 (38%), Gaps = 30/226 (13%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + IT + T LD+M+V+DVS SM S +K+ + A++ Sbjct: 45 LGASQPDGAITGQAGASGMTGGTGLDLMLVIDVSGSM------SGSKLTAVKAAAVALVN 98 Query: 208 EVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + NN Q G+V +S+ +E L S L I LS G ST + ++ Sbjct: 99 SL-----PNNTTQVGIVKYSSSANMVEMLQDLTSNKSDLIATINGLSASG-STATGTAIQ 152 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A ++ + + H K+ V ++DGE +A +G V+ Sbjct: 153 VATAELLSSRAIAGHA--------KMEVVLSDGE---YNVGIDPKIAAAQAHAQGITVHT 201 Query: 325 IGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +G+++ + A A F V N + + FS G ++V Sbjct: 202 VGVQLYGTGYTSMQQTAVAGGGIFTNVNNLNDLVALFSGTGGNLVG 247 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 83/224 (37%), Gaps = 28/224 (12%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 H+ T + +D++ ++D S S+E S ++ ++++ E Sbjct: 137 SHLQTIWTVPAALPECPSQEMDIVFLIDGSGSIE---QSDFKQMKDFVRAVMGQFEGTNT 193 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + + + F+ F W L + + + T + G++ ++F Sbjct: 194 LFSLIQYSHLLKIHFTFTQ---FQSSWNPLSL---VDPIVQLDGLTYTATGIRKVVEELF 247 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--- 328 N + KKI++ +TDG+ + + +A++ G I YAIG+ Sbjct: 248 ------HSKNGARKSAKKILIVITDGQKYKDPLEYSDVIP--QAERAGIIRYAIGVGDAF 299 Query: 329 -VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + L S + + V+N + A S I + + K Sbjct: 300 WKPSAKQELDNIGSEPAQDHVFRVDN----FAALSSIQEQLQEK 339 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 84/213 (39%), Gaps = 27/213 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++++ + + + L++ V+D S SM + +++ +++++ Sbjct: 24 YLLVTVKAPRMPAPEGRPPLNLAAVVDRSGSMAG------AALYFTKQALRFLVDQMAEE 77 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +VT+ +++ F + V ++ + ++ G +TN + GL QI Sbjct: 78 D------RLAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGITA-GGTTNLSGGLATGMQQI 130 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 R H + ++ MTDG N+ + + +++G V +G+ Sbjct: 131 ------RPHAGPGRVSR---VLLMTDGLANVGVTDPDVLAGWARAWREKGLAVSTMGVGP 181 Query: 330 IRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 S + L A A +F+ + NP + F Sbjct: 182 HFSEDLLVALAEAGGGNFHYIANPDQIPRIFQE 214 >gi|56675030|gb|AAW19657.1| matrilin-3 [Cervus elaphus] Length = 146 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 17/149 (11%) Query: 221 SGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V +++ ++ F L+ L+R + ++ T S ++ A ++ F ++ Sbjct: 1 VAVVNYASTVKIEFHLQTHSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAFTVE---A 57 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +N K+ + +TDG Q A+ G +YA+G+ L+ Sbjct: 58 GARGPSSNIPKVAIIVTDGRPQD-----QVNEVAARARASGIELYAVGVDRADMES-LKM 111 Query: 339 CAS---PNSFYLVEN---PHSMYDAFSHI 361 AS + VE + F Sbjct: 112 MASEPLDEHVFYVETYGVIEKLSSRFQET 140 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 43/111 (38%), Gaps = 10/111 (9%) Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-DQQSLYYCNEAKKRGAI 321 L+Y Y ++F M Y ++ T + ++ C+ AK RG + Sbjct: 420 LRYVYQRLFA-DWMGNSAAKNSWYYGVY-------DSWGTSTKNARTKAICDAAKARGIV 471 Query: 322 VYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 VY IG L+ CAS ++ Y + DAF+ I I R+ Sbjct: 472 VYTIGFEAPSGGVSVLKDCASSDAHYFDVQGLEISDAFASIATSIRQLRLT 522 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 53/334 (15%), Positives = 105/334 (31%), Gaps = 61/334 (18%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA------------TQIMNE 68 + A L + V G+ I++ + +T+L +DR+++ AA +N+ Sbjct: 5 PMIAFLL-SMVAVGGIGIDLMRMERDRTILQYTLDRAVLAAADLDQPLPPDVVVQDYLNK 63 Query: 69 GNGNN-------RKKLKGGDILCRIKNTWNMS---FRNELRDNGFVN-------DIDDIV 111 N + + + I T+ F + N D +I Sbjct: 64 ANLSEYYQPPIAETGIGYKRVESTIDTTFETQWLDFSGGQDMPLYANSRAEESIDGLEIS 123 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + + + Y++ +R F + T + + I S Sbjct: 124 LVLDVSGSMNSNSRLYNLKNAAR-----DFIDTMVANTADNKMSVSIVPYATQVSLPKDM 178 Query: 172 LDMMIVLD---VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD V D S + F + T ++ + + +GL+ + Sbjct: 179 LDQYNVTDEHEYSNCVN-FTGTHFTSTGLSTTASLNRTMHFTPWWSGDARPSNGLIQYPV 237 Query: 229 KIE----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY-----------AYNQIFDM 273 E E + + L+ I+ L +G +T+ G+K+ + + Sbjct: 238 CDERAHREVMPFQKDANRLKDFIQNLQAWG-NTSIDVGMKWGTVLLDPSAQPVISALTSS 296 Query: 274 Q------GMRQHCNTEDANYKKIIVFMTDGENLS 301 + D K+IV MTDG+N S Sbjct: 297 SVNVPGVFADRPAAYNDTETVKVIVLMTDGQNTS 330 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 92/256 (35%), Gaps = 30/256 (11%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESF- 187 S ++R P C + W + + V+ + + + D+++ LD+S+SM++ Sbjct: 53 SVVTRRTWPQLVCETLAWCLVV--LALARPQFVEPPIEKVEPQRDILLALDLSQSMDARD 110 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 F + K ++++ ++ + + + GLV F + ++ I Sbjct: 111 FPGADGKPLARVEAVRQVVADFVGKRPGD---RIGLVAFGDAPYPLAPFTMDHELVRTMI 167 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 ++ PG+ + D G+ + +K+++ +TDG T Sbjct: 168 ---------ADTVPGMAGPRTSLGDALGLAIKMFGKTTAPEKVLIVLTDG--NDTASRMP 216 Query: 308 SLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR-------ACASPNSFYLVENPHSM---Y 355 L AK +G I + +GI + L A ++ ++ + + Y Sbjct: 217 PLKAAEIAKSKGVIFHTVGIGDPAATGEDKLDTATLQKIAASTGGRYFFGGDQSQLAAIY 276 Query: 356 DAFSHIGKDIVTKRIW 371 + I + W Sbjct: 277 EVLDQITPEDQKNLSW 292 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 88/238 (36%), Gaps = 49/238 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P T + ++T+ +D+++ +DVS SM + D ++ + +A ++ Sbjct: 74 IARPQTVDISTRTKTNKGIDIVMAIDVSSSMLAQ-DLKPDRLSALKRVASAFVD-----D 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQI 270 +++ + GLV ++ + + ++ ++ +S G T GL + N++ Sbjct: 128 RLSD--RIGLVVYAGESYTLTPITSDKGIVKGSLREISYQGLIEDGTAIGMGLATSVNRL 185 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 D + K+I+ +TDG N S D + A + G Y IG+ Sbjct: 186 KDSRAKS-----------KVIILLTDGVNNSGFIDPKIAT--ELAVEFGIKTYTIGLGSN 232 Query: 329 ---------VIRS------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 + L+ A A+ ++ + + + + I K Sbjct: 233 GTARAPVGILPNGSFQYAMTKVEIDEALLQEIATATGGIYFRATDNKKLEEIYEEINK 290 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 89/233 (38%), Gaps = 31/233 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR F + +G ++T + + + L + I+ + + + + +D + + A QI+ Sbjct: 2 IRKFANDRRGNYALMTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIVA 61 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG- 126 + + KN + + + N T+L + + N G Sbjct: 62 G--------ATDAEAIAYAKNFFEANLAHIDPAN------------TTLAVTLPNNNTGG 101 Query: 127 --YSISAISRYK-IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + YK L + T+ M + +V + L++ +VLD S S Sbjct: 102 GTLKLCGTLTYKPYFLPTAKILAGGTSGNATTMAFNTCSEVRLKNT--LEVSLVLDNSGS 159 Query: 184 MESFF-DSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIE 231 M+ S+ + D+ + +++++ +L+ V+ VQ LV F+ + Sbjct: 160 MKELGKGSNKVRFDLLKDAAKQLVDQLAGQAQLMKQVSKPVQFSLVPFAASVN 212 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 27/75 (36%), Gaps = 1/75 (1%) Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 G S ++ I ++ G TN G+ + + + + + K+++ +TD Sbjct: 423 TAGASAVKTAIDAMAADGA-TNVPEGMAWGWRTLSSTAPFTEGRPETERGNDKVLIVLTD 481 Query: 297 GENLSTKEDQQSLYY 311 G N D Sbjct: 482 GANTYYTPDSVIAQT 496 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 49/255 (19%), Positives = 98/255 (38%), Gaps = 31/255 (12%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIP-LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL 172 + + + ++I+ + + P + + I+ P + + + A + Sbjct: 98 SDVSPDQSLSSRPHAITVKTFTEYPAVSASESFSNFGVLVRILAPPLDNTLPHHRARAPI 157 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ VLDVS SM +K+ + +++N +++ + + +VTFS+ Sbjct: 158 DVVTVLDVSGSM-------ASKLILLKRAVNFIIQNLGPSD------RLSIVTFSSSARR 204 Query: 233 FFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L G + +S G TN GLK + + RQH + Sbjct: 205 MLPLRRMSGSGREDATSVVDSISAIG-GTNIVAGLKKGVQVL---EERRQHNSVA----- 255 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSF 345 I+ ++DG + S + Y E+K+ VY G + A AS +F Sbjct: 256 -TIILLSDGCDTQSHNAQNRLDYLKEESKQPTFPVYTFGFGSDHDSAAMHAISDASRGTF 314 Query: 346 YLVENPHSMYDAFSH 360 +E+ + + DAF+ Sbjct: 315 SFIESINILQDAFAR 329 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 47/267 (17%), Positives = 89/267 (33%), Gaps = 53/267 (19%) Query: 129 ISAISRYKIPLKFCTFIPW----YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + + L+ FI + P T+ V ++ +D+++ +DVS SM Sbjct: 48 FKVETSWLANLRPLLFILRLIVLALIITALARPQTTDVTTKTKKTEGIDIVLAVDVSASM 107 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + D +++ K + K P+ + G+V ++ + + Sbjct: 108 LAE-DLKPNRLEATKKVAADFI---KGRPND----RIGVVVYAGESYTKTPITTDEMISL 159 Query: 245 RKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 R I ++ GV T GL A N++ + K+I+ MTDG N S Sbjct: 160 RAINEIAFDGVLENGTAIGMGLATAVNRL-----------KDSEALSKVIILMTDGVNNS 208 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----------------------HEFLRA 338 D + A + VY IGI + E ++ Sbjct: 209 GFIDPKIAS--ELALEYDIKVYTIGIGTNGNAPSPVAQIGRNKFRMAMMPVEIDEELMKQ 266 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGK 363 A + ++ N + + + I K Sbjct: 267 IAVDTGGKYFRATNNKKLEEIYGEIDK 293 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 81/247 (32%), Gaps = 51/247 (20%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVN 216 +V N+ + +M+V+D S SM + T++++ A + + + Sbjct: 80 AVGENTIRREGIAIMMVVDTSGSMRALDLADGGLDQTRLEVVKDVFRAFVAGEDGLDGRS 139 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFD 272 N GLV+F+ + L L + L + T GL A ++ + Sbjct: 140 NDT-IGLVSFAGFADTRCPLTLNHGSLLTILDDLEIVRERAEDGTAIGDGLGLAVERLRE 198 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---- 328 + ++I+ +TDG N + + L A + G VY IG Sbjct: 199 SEASS-----------RVIILLTDGVNNA--GIETPLEAAELASRLGIKVYTIGAGTDGV 245 Query: 329 -------VIRSHEFLRACA--------------SPNSFYLVENPHSMYDAFSHIGK---- 363 + E LR + ++ + + + I + Sbjct: 246 APVRVTNPLTGAEELRPMPVEIDEATLEAIAEHTGGRYFRATDGDGLRQVYEQIDRLERT 305 Query: 364 DIVTKRI 370 +I +R+ Sbjct: 306 EISERRL 312 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 51/217 (23%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ + +I +N ++ K + GL+ F + Sbjct: 91 DLLLAVDLSGSMKIEDMEVNGDRVPRIVAVKTVLNEFIQRRKGD-------RLGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + +QR ++ +T + + ++ D G R Sbjct: 144 QAYVQAPLTFDQTTVQRFMREAQIGFAGEENTAIGDAIGLSVKRLRDRPGDRH------- 196 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 +++ +TDG+N K + + A G I+Y IG+ Sbjct: 197 ----VMILLTDGQNNGGKINP--IPASKIAANNGIIIYTIGVGADEMVMPGVLGSSFGSR 250 Query: 332 --------SHEFLR--ACASPNSFYLVENPHSMYDAF 358 + L+ A A+ ++ NP + + Sbjct: 251 RVNPSADLDEKTLQQVATATGGQYFRARNPQELEKIY 287 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 80/215 (37%), Gaps = 32/215 (14%) Query: 165 NSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +++ L +++V+D S SM E K+D+A +++ + I Q GL Sbjct: 402 DTKQQPDLALVMVIDRSGSMAEPVAGGRRNKLDLAKEAVYQASLGLTPID------QVGL 455 Query: 224 VTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F + L+ S ++R + G TN PG++ A + +H Sbjct: 456 VVFDDTANWVLQLQPLPSMVEIERALGSF-GIGGGTNIRPGIEQAALALASTDAKIKH-- 512 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-A 340 ++ +TDG S D + + + G + + + + + + A Sbjct: 513 ---------VLLLTDGIAESNYSDLIA-----QMRASGITISTVAVGLDANPNLVDVANA 558 Query: 341 SPNSFYLVENPHSMYDAF--SHI---GKDIVTKRI 370 Y V + + F I G+DI+ + I Sbjct: 559 GGGRSYRVTSIDEVPRIFLQETIIAAGRDIIEQPI 593 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 44/281 (15%), Positives = 89/281 (31%), Gaps = 33/281 (11%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 +GF I V + + + Y + + +F I Sbjct: 338 QELSGFDGVILSDVPAHQVGETAMVAIRDYVDKLGGGFIMLGGPNSFGVGGYYRTPIEEV 397 Query: 158 ITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + +K + + + + +V+D S SM S K++MA + A E + Sbjct: 398 LPVRLKAPDEEEKQSSALALVIDRSGSM------SGEKLEMAKSAAIATAEVLTRNDS-- 449 Query: 217 NVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 G+ F ++ + S + +I L+ G TN P A N + + Sbjct: 450 ----IGVYAFDSEAHVVVPMTRLTSSSAVAGQIAGLTS-GGGTNLHPAFTEARNALQRTK 504 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +H ++ +TDG+ + ++ + G + + I Sbjct: 505 AKIKH-----------MIILTDGQTSGQGYE----ALASQCRAEGVTISTVAIGDGAHVG 549 Query: 335 FLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 L+A AS Y + ++ F+ + I D Sbjct: 550 LLQAIASLGGGKSYTTLDAANIVRIFTQDTLVHTGRMIRED 590 >gi|148685682|gb|EDL17629.1| mCG133494, isoform CRA_a [Mus musculus] gi|148685684|gb|EDL17631.1| mCG133494, isoform CRA_a [Mus musculus] Length = 828 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 34/206 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 152 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 207 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 208 ISTSSPLS--------LLGSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA---SP 342 +++ +TDG S+ A+ I YAIG+ S + L+A A S Sbjct: 254 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGKAFYNEHSKQELKAIASMPSH 311 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTK 368 + VEN +DA I + K Sbjct: 312 EYVFSVEN----FDALKDIENQLKEK 333 >gi|10946646|ref|NP_067309.1| integrin alpha-X precursor [Mus musculus] gi|48428495|sp|Q9QXH4|ITAX_MOUSE RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor gi|6684131|gb|AAF23492.1|AF211864_1 leukocyte adhesion glycoprotein p150,95 alpha integrin subunit [Mus musculus] gi|189442099|gb|AAI67225.1| Integrin alpha X [synthetic construct] Length = 1169 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 34/206 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 152 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 207 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 208 ISTSSPLS--------LLGSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA---SP 342 +++ +TDG S+ A+ I YAIG+ S + L+A A S Sbjct: 254 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGKAFYNEHSKQELKAIASMPSH 311 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTK 368 + VEN +DA I + K Sbjct: 312 EYVFSVEN----FDALKDIENQLKEK 333 >gi|74215118|dbj|BAE41792.1| unnamed protein product [Mus musculus] Length = 1169 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 79/206 (38%), Gaps = 34/206 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 152 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 207 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 208 ISTSSPLS--------LLGSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA---SP 342 +++ +TDG S+ A+ I YAIG+ S + L+A A S Sbjct: 254 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGKAFYNEHSKQELKAIASMPSH 311 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTK 368 + VEN +DA I + K Sbjct: 312 EYVFSVEN----FDALKDIENQLKEK 333 >gi|126314401|ref|XP_001377042.1| PREDICTED: similar to Procollagen, type VI, alpha 2, partial [Monodelphis domestica] Length = 762 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 65/164 (39%), Gaps = 18/164 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ + + ++ L + P+ + + G+V +S+ Sbjct: 353 GALDIVFVIDSSESIGY---TNFSLEKNFVINVVNRLGSITKDPNSDTGTRIGVVQYSHD 409 Query: 230 -IEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ G ++ + ++K L T + L Y YN++ Sbjct: 410 GTFEAIKLDDGRIGSLAQFKEEVKKLEWIAGGTWTPSALNYTYNELI----KGSRRKKTR 465 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 V +TDG + +Q C++ + +V AIGI Sbjct: 466 V----FAVVITDGRHDPRDNEQSLKALCDKVEN--VVVTAIGIG 503 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ + T++D+ + ++ V + GL+ F Sbjct: 95 DLLLAVDISPSMQETDLQLKGNQATRLDVVKSVVTDFIQ-------VRQGDRLGLILFGA 147 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + + L G +T + ++ + Sbjct: 148 QPYIQAPLTYDLVTVGELLNEATLGIAGNATAIGDAIGLGIKRLRERPADS--------- 198 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++V +TDG N + A G +Y +G+ Sbjct: 199 --RVLVLLTDGANTG--GEVSPEQAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPS 254 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L++ A + ++ N + + I + Sbjct: 255 ADLDETLLQSIADETDGQYFRARNTGELELIYESINQ 291 >gi|292627386|ref|XP_001332035.3| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 1056 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 81/212 (38%), Gaps = 26/212 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + A D++ ++D S S+ + A + + + + ++ Q G+V Sbjct: 94 AGCSTAVNDLVYIMDGSWSVGD------VDFETAKRWLINVTSGFDV---SSHYSQVGVV 144 Query: 225 TFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ L L I+ +S G +T + +K+A + +F N Sbjct: 145 QYSDTPRLEIPLGQHKTTQQLIEAIEKISYLGGNTQTGRAIKFAVDHVFPSSRRNDVKN- 203 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS 341 +I V +TDG++ D +A+ +G V+A+G+ I + E + + Sbjct: 204 ------RIAVVVTDGKSQDDVTDASL-----DARTQGITVFAVGVGSEITNSELVTIANT 252 Query: 342 P--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 P + E+ ++ + + + + + Sbjct: 253 PAGDYVLFAEDYTNIERIRDAMEQKLCEESVC 284 >gi|189524674|ref|XP_684145.3| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 607 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 75/212 (35%), Gaps = 25/212 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + S D+ VLD S S++ ++ I ++ + ++ Sbjct: 29 EAGSSCYGGFDLYFVLDKSGSVQHHWN-----------EIYYFVDHLAHKFISPQ-LRMS 76 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + FS + L ++ ++ L + G T G + A QI+ G Sbjct: 77 FIVFSTEGRILMELTEDRDQIRAGLEELQRVLPGGDTFMHKGFQKASQQIYYGTGDGYRT 136 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + +I+ +TDGE + D + +++ GA VY +G++ + Sbjct: 137 AS-------VIIALTDGELRENEFDLAARE-AGRSRQLGASVYCVGVKDFNETQLATIAD 188 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + + V + ++A + I+ + Sbjct: 189 SKDHVFPVNDG---FEALQGVIDSILKRSCIE 217 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 83/241 (34%), Gaps = 52/241 (21%) Query: 154 IVMPITSSVKVNSQTDAR------LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 I++ +T+ + S + +D+ + LDVS SM + D + +++ A + + Sbjct: 67 IILLVTALARPQSSNSWQTYTSEGIDIALALDVSTSMLA-RDFTPDRLEAAKEVATKFIL 125 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKY 265 E + + GLV F+ + L + L ++ + T GL Sbjct: 126 E-RPQD------KIGLVVFAGESFTQCPLTTDQAVLVNLLREVKSGMIQDGTAIGLGLAN 178 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A N++ D G K+++ +TDG N + + AK G VY I Sbjct: 179 AVNRLKDSPGKS-----------KVVILLTDGINN--QGAIAPVTAAELAKAFGIRVYTI 225 Query: 326 GIRVIRS---------------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 G+ L+ A + ++ + + +S I Sbjct: 226 GVGTYGEAPYPVPTPFGVQLQNMPVEIDEGVLQQIANVTGGKYFRATDNDKLQQIYSEID 285 Query: 363 K 363 + Sbjct: 286 Q 286 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 77/217 (35%), Gaps = 46/217 (21%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+++ +D+S SM++ D + + A + M+ P+ N GLV F+ + Sbjct: 85 QGIDLVLAMDLSGSMQA-LDLKPNRFEAARDVASEMIA---ARPNDN----IGLVVFAGE 136 Query: 230 IEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + +Q+ ++ + + T GL A N + N Sbjct: 137 SFTLCPLTVDHNVIQQMLETTEIGQLEDGTAIGLGLATAINTL-----------RGSDNK 185 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF------------ 335 K+I+ +TDG N + D A++ G +Y + +F Sbjct: 186 SKVIILLTDGSNNA--GDITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGTEYVE 243 Query: 336 ---------LRACA--SPNSFYLVENPHSMYDAFSHI 361 LR A + +Y + +++ + I Sbjct: 244 ADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI 280 >gi|297708173|ref|XP_002830853.1| PREDICTED: hypothetical protein LOC100461231, partial [Pongo abelii] Length = 885 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 48/270 (17%), Positives = 97/270 (35%), Gaps = 23/270 (8%) Query: 113 STSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW-------YTNSRHIVMPITSSVKVN 165 ST+ + P E A K+ C+ + + ++ P T+ N Sbjct: 599 STAGNHTDRPSVEPRDFRATGAAKMLQGPCSVLLLWGILGAIQAQQQEVISPDTTERNNN 658 Query: 166 --SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---Q 220 +TD + + VLD S S+ + I + + + +++ ++ V + Sbjct: 659 CPEKTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWR 717 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G + FS+++E F + + ++ +S F T + L QI RQ Sbjct: 718 YGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDR 771 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + V +TDG + L A++ G ++A+ + LR A Sbjct: 772 SKGTVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIA 827 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + +M + I +D + + I Sbjct: 828 STPHELYRNDYATMLPDSTEIDQDTINRII 857 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 75/223 (33%), Gaps = 49/223 (21%) Query: 167 QTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 QT +D+++ LD+S SM E F + +++ A + + GLV Sbjct: 106 QTSEGIDILLTLDISESMLIEDFTPN---RLEAAKLVAKNFV-------HGRKYDRIGLV 155 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCN 281 FS + L L+R I+ + + + T L GM Sbjct: 156 IFSGEAYSVSPLTTDYKLLKRYIEDIREDMIQENGTAIGSAL-----------GMGTIRM 204 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 E A+ K+++ ++DG+N + D + A +Y I + Sbjct: 205 QESASRSKVVILISDGDNTAGNLDPITASRLATAHN--IKIYTILVGRSGKVPYGRDMFG 262 Query: 333 ----------HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 LR A FY + ++ + F+ I + Sbjct: 263 QPQYVNNTVDESVLREIAKIGEGKFYRASDNQALKNVFAEINR 305 >gi|156409361|ref|XP_001642138.1| predicted protein [Nematostella vectensis] gi|156229279|gb|EDO50075.1| predicted protein [Nematostella vectensis] Length = 989 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 51/313 (16%), Positives = 98/313 (31%), Gaps = 32/313 (10%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVR--STSLDIVV 120 + E K+ K ++ + ++ D V + + + Sbjct: 229 AEKAKEDKAGPAKEDKPSKTSPKVGKPHPKGEILGGESSDPLDIAGDTVDTDAPAEEETT 288 Query: 121 VPQNEGYSIS---AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + G S Y P + + + T+ V + A +D++I+ Sbjct: 289 DEEGSGQDYWGPEEDSAYDYP--PRSSCVPCPPGPELPISNTA-VPAKTAASANVDLVIL 345 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL- 236 +D SRS+E + + AI + + P + GL+ + + + F Sbjct: 346 IDGSRSVERSGVGNFRR---AIDFARDLTSSFVVSP---RHTRVGLMVYGKRAYKVFGFN 399 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 K + + L+ AY F +K++V + Sbjct: 400 DYRDNNRLFTGFNKPIRYPRERAQTATALRSAYRTFFGRNKRSA---------QKVLVLV 450 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPH 352 TDG+ K +QS K+RG +Y +G + + L A AS P Sbjct: 451 TDGKIRDAKAKRQS----QSIKRRGVKIYVVGAGKYFNIKQLEAMASRRPKDHVFTSGWR 506 Query: 353 SMYDAFSHIGKDI 365 + S + K I Sbjct: 507 HLGHIVSDVKKHI 519 >gi|149725809|ref|XP_001495640.1| PREDICTED: similar to leukocyte adhesion glycoprotein [Equus caballus] Length = 1140 Score = 67.1 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 84/212 (39%), Gaps = 34/212 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D+ ++D S S+ +++ K+ + + ++ K + L+ + Sbjct: 144 CPQQESDIAFLIDGSGSI---YENDFQKMKEFVTIVMNQFKKSKTLFS--------LMQY 192 Query: 227 SNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ + F + ++ ++ +++ G T++ G++ ++F N Sbjct: 193 SDTFQTHFTFKEFANNPNPGSLVRPINQLGGRTHTATGIRKVVRELF------HSRNGAR 246 Query: 285 ANYKKIIVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRAC 339 N KI+V +TDGE + E + + ++ G I Y IG+ + +S E L Sbjct: 247 KNALKILVVITDGEKFGDRLEYEDVIPEADQ---EGIIRYVIGVGIAFSIEKSREELNTI 303 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 AS + + V N ++A I + K Sbjct: 304 ASKPARDHVFRVNN----FEALKTIQNQLQEK 331 >gi|148699894|gb|EDL31841.1| procollagen, type VI, alpha 1, isoform CRA_c [Mus musculus] Length = 227 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 63/170 (37%), Gaps = 16/170 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 31 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 90 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +S+++E L G L+ + + FG T + +K ++ H Sbjct: 91 HYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGLEELLIGG---SHLK 147 Query: 282 TEDANYKKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRV 329 K ++ +TDG KE L NEAK G V+++ I Sbjct: 148 EN-----KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGIKVFSVAITP 192 >gi|301781662|ref|XP_002926252.1| PREDICTED: collagen alpha-3(VI) chain-like [Ailuropoda melanoleuca] Length = 3167 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 75/218 (34%), Gaps = 24/218 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + A D++ ++D S S+ + + + ++E Sbjct: 14 CLFLSGFSLTRAQQQQADVKNGAAADVIFLVDSSWSIGK------EHFQLVREFLYDVIE 67 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV F+ FLL + + I +S G S + GL+Y Sbjct: 68 SLAVGDSD---FRFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNETGKGLEY 124 Query: 266 AY-NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 N + + G R ++IV +TDG + D K V+A Sbjct: 125 VMQNHLTEAAGSRA-----GDGVPQVIVVLTDG-----RSDDGLALPSAGLKSADVNVFA 174 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 IG+ + P + +EN S++D + Sbjct: 175 IGVEDADEGALKEIASEPLNMHVFNLENFTSLHDIVGN 212 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/311 (14%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG I F + +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDISR-----FSQVISSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + + P + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPSIEDRVMHAFGPSGVTPAPPGVDIPSPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ + D + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADVVFLLD--GSINF-------RRDTFQEVLRFVSEIVDTLYEGGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L+ + + I + G N+ G+++ F + + D Sbjct: 1686 DPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGIEHLRQNHFVPEAGSR----LDQ 1741 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G K + + ++G V+A+G++ I S E + ++ + Sbjct: 1742 RVPQIAFVITGG-----KSVEDAQEASLALTQKGVKVFAVGVKNIDSEEVGKIASNSATA 1796 Query: 346 YLVENPHSMYD 356 + V N + + Sbjct: 1797 FRVGNVQELSE 1807 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 76/200 (38%), Gaps = 26/200 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+ D++ +LD S ++ T + ++ + + D Sbjct: 628 SGTSEVHVNKR---DIIFLLDGSSNVGK------TNFPYVRDFVTNVVNSLDVGSDN--- 675 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYL-SKFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ F L+ + + ++ L + G N+ L Y + N + Sbjct: 676 IRVGLVQFSDTPVTEFSLDTYQTKAELLAHLRRLQPQGGSGLNTGSALSYVHANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G + L N + G + +G E Sbjct: 736 GSRS-----REHVPQLLLLLTAG-----PAEDAYLPAANALARAGVLTLCVGASRANKAE 785 Query: 335 FLRACASPNSFYLVENPHSM 354 + +P+ YL+++ S+ Sbjct: 786 LEQIAFNPSLVYLMDDFSSL 805 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/274 (13%), Positives = 84/274 (30%), Gaps = 45/274 (16%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHVFNLENFTSLHDIVGNLVSCVQSSVAPEGAGGTETLK 233 Query: 167 --QTDARLDMMIVLDVS---RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ + + +LE + + ++ Sbjct: 234 DITAQDSADIIFLVDGSNNTGSVH---------FAVIRDFLVNLLERLSV---GAQQIRV 281 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQ 278 G+V +S++ F L+ + Q +K L G N L + F R Sbjct: 282 GVVQYSDEPRTVFSLDTYSTKAQVLDAVKALVFTGGELANVGLALDFVVENHFT----RA 337 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + + +++V ++ G + D K+ + +G + E Sbjct: 338 GGSRVEEGVPQVLVLISAGPSSDEIRDGVVA-----LKQASVFSFGLGAQAASRAELQHI 392 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + N + V F G D+ + + Y Sbjct: 393 ATNDNLVFTVPE-------FRSFG-DLQEQLLPY 418 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 43/338 (12%), Positives = 107/338 (31%), Gaps = 43/338 (12%) Query: 40 VSHIFFMKTVLHSMIDR--------SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN 91 + + K + + + R + A + ++ + + + + + Sbjct: 1285 LLNAHSSKDEVQNAVRRLRPKGGRQVNIGGALEYVSRNIFKRPLGSRIEEGVPQFLVLIS 1344 Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ-------NEGYSISAISRYKIPL---KF 141 ++ D+ V ++ P+ + Y S + ++P K Sbjct: 1345 SGKSDDEIDDSAAELKQSGVAPFTVARNADPEELVKISLSPEYVFSVSTFRELPSLEQKL 1404 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T I T+ + + ++ + D++ ++D S S++ + Sbjct: 1405 LTPITTLTSEQIQKILASTRYPSPAIESDAADIVFLIDSSDSIKP------DDVAHIRDF 1458 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKF----GVS 256 + ++ + + P+ V+ G+V FSN++ F L+ Q + L + G Sbjct: 1459 VIKIVRRLNIGPNK---VRVGVVQFSNEVFPEFYLK--THKSQAAVLDALRRLRFRGGSP 1513 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 N+ L++ +F ED + +++F+ K + Sbjct: 1514 LNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQDDISRFSQVIS 1564 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 G + +G R I E P + V + Sbjct: 1565 SSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFREL 1602 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 70/206 (33%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + PD V+ +V +S++ Sbjct: 1029 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L Q + + + G + N+ L + I + Sbjct: 1079 EFYL-NSYMDQQSVVGAVRRLTLLGGPAPNTGAALDFVLRSILTSSAGSRIAEG----VP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 1134 QLLIVLT-----ADRSGDDVRGPSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAI 1188 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + + I + I ++ Sbjct: 1189 PTFRQL----GTVQQAISDRVIQLNR 1210 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 32/202 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +DA +DM VLD S S F + + K I ++ ++ L PD + Sbjct: 2609 RRAAGSDADIDMAFVLDSSESTTLF------QFNEMRKYIGYLVRQLDLSPDPKASQHFT 2662 Query: 220 QSGLVTFS----------NKIEEFFLLE-WGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++ F L +G + +++ + + + Y Sbjct: 2663 RVAVVQHAPYESVGNVSVSPVKVEFSLTDYGSKEKLVDFLHSRMTQLQGTRDLGRAIGYT 2722 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +F+ + KI+ M GE + ++ +AK +G +G Sbjct: 2723 IENVFES--------APNPRDLKIMFLMLTGEVEKGQLEEAQRVIL-QAKCKGYFFVILG 2773 Query: 327 IRVI-RSHEFLRACASPNSFYL 347 I E + PN + Sbjct: 2774 IGRKVNVKEVYGFASEPNDVFF 2795 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 77/209 (36%), Gaps = 28/209 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D S S++ ++ ++++ E + + + F Sbjct: 160 CPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLFALMQYSNLLKIHF 216 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + Q + + + T + G+ Q+F H N + Sbjct: 217 TFTQFRTSPSQ------QSLVDPIVQLKGLTFTATGILTVVTQLF------HHKNGARKS 264 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP 342 KKI++ +TDG+ + + +A+K G I YAIG + + + L +S Sbjct: 265 AKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVGHAFQGPTARQELNTISSA 322 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V+N + A I K + K Sbjct: 323 PPQDHVFKVDN----FAALGSIQKQLQEK 347 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 77/209 (36%), Gaps = 28/209 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D S S++ ++ ++++ E + + + F Sbjct: 144 CPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLFALMQYSNLLKIHF 200 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + Q + + + T + G+ Q+F H N + Sbjct: 201 TFTQFRTSPSQ------QSLVDPIVQLKGLTFTATGILTVVTQLF------HHKNGARKS 248 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP 342 KKI++ +TDG+ + + +A+K G I YAIG + + + L +S Sbjct: 249 AKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVGHAFQGPTARQELNTISSA 306 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V+N + A I K + K Sbjct: 307 PPQDHVFKVDN----FAALGSIQKQLQEK 331 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 77/209 (36%), Gaps = 28/209 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D S S++ ++ ++++ E + + + F Sbjct: 144 CPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLFALMQYSNLLKIHF 200 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + Q + + + T + G+ Q+F H N + Sbjct: 201 TFTQFRTSPSQ------QSLVDPIVQLKGLTFTATGILTVVTQLF------HHKNGARKS 248 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP 342 KKI++ +TDG+ + + +A+K G I YAIG + + + L +S Sbjct: 249 AKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVGHAFQGPTARQELNTISSA 306 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V+N + A I K + K Sbjct: 307 PPQDHVFKVDN----FAALGSIQKQLQEK 331 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 44/226 (19%), Positives = 86/226 (38%), Gaps = 37/226 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +S + + + + S +D++ VLDVS SM TK+ + +++ +++ + Sbjct: 242 SSNDVTGSLVNESSMRSSRRVPIDLVTVLDVSGSMAG------TKLALLKQAMGFVIQHL 295 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + ++ FS+ F L+ G + I L G TN LK Sbjct: 296 RPSD------RLSVIAFSSTARRLFPLQRMSHHGRQQALQAISSLGA-GGGTNIADALKK 348 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--------STKEDQQSLYYCNEAKK 317 A I D C+ I+ ++DG++ T ++SL + + Sbjct: 349 AVKVIEDRNYKNSVCS---------IILLSDGQDTFNISSNFQGTSAGRRSLVPPSILNE 399 Query: 318 -RGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 ++ G + L + AS +F +E+ M DAF+ Sbjct: 400 LHMVPLHTFGFGADHDSDTLHSISEASGGTFSFIEDEGVMQDAFAQ 445 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 68/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS+SM + D ++ A ++ E+ + GL+ ++ Sbjct: 99 VMLVIDVSQSMRA-TDVEPNRMAAAQEAAKQFAGELTPGIN------LGLIAYAGTATVL 151 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L +F T + + A I + + + IV Sbjct: 152 VSPTTNRYATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGEMPPPARIVL 207 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFL 336 +DG+ + + + AK +G + I V E + Sbjct: 208 FSDGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTVYGFVEINGQRQPVPVDDETM 267 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + A S + Y + ++ + + I Sbjct: 268 KKVAQLSGGNSYNAATLAELKAVYASLQQQI 298 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 90/234 (38%), Gaps = 40/234 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + I L ++ ++ G+ ++V F + L +DR+++ AA+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASL--- 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD-IVRSTSLDIVVVPQNEG 126 + R+ + G + +D+ +V + +L++ Sbjct: 70 -----------------TQSRSPAEVVRDYVAKAGLEDYLDEPVVNANTLNV-------- 104 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S++A + Y +P F + + P S+ + +++ +VLD+S SM + Sbjct: 105 RSVTATAAYSMPTVFMKLLD----IDRLEAPAVSTAEERVSN---VEISLVLDMSNSMVT 157 Query: 187 FFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + ++D + ++ V D V+ +V ++ ++ L Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADL 211 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 ++++ C+ A+ +G VY++ L+ CAS Y + F I Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVFHSIAS 557 Query: 364 DIVTKRIW 371 I R+ Sbjct: 558 HITQLRLT 565 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 67/209 (32%), Gaps = 30/209 (14%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +V+DVS+SM + D +++ A ++ +E+ + GL+ ++ Sbjct: 12 VVIDVSQSMRA-TDVEPSRMVAAQEAAKQFADELTPGIN------LGLIAYAGTATVLVS 64 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + + L +F T + + A I + + D IV + Sbjct: 65 PTTNREATKNALDKL-QFADRTATGEAIFTALQAIATVGAV---IGGGDTPPPARIVLFS 120 Query: 296 DGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRA 338 DG+ + + + AK +G + I E ++ Sbjct: 121 DGKETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKK 180 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A S + Y + +S + + I Sbjct: 181 VAQLSGGNSYNAATLAELRAVYSSLQQQI 209 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 76/205 (37%), Gaps = 35/205 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM TK+ + +++ +++ + + +++FS+ Sbjct: 248 RAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD------RLSVISFSS 295 Query: 229 KIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L G + + L G TN GLK + D R+ N Sbjct: 296 TARRNFPLRLMTETGKQEALQAVNSLVSNG-GTNIAEGLKKGARVLID----RRFKNPVS 350 Query: 285 ANYKKIIVFMTDGE-------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + IV ++DG+ ++ E V+A G + Sbjct: 351 S-----IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMH 405 Query: 338 ACA--SPNSFYLVENPHSMYDAFSH 360 + A S +F +E+ + DAF+ Sbjct: 406 SIAENSGGTFSFIESETVIQDAFAQ 430 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ +K + L ++IVLD S SM+ +K+++A ++ +E ++ Sbjct: 392 LPVEMEIKGK-EQLPSLGLVIVLDRSGSMQG------SKLELAKEAAARSVEMLRDEDT- 443 Query: 216 NNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 G + F ++ I E L + + G T L AY + D Sbjct: 444 -----LGFIAFDDRPWEIIETGPLSSKEEAVDTILSVTP--GGGTEIYSSLAKAYENLAD 496 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ R+H I+ +TDG++ + + E K+ G + + I Sbjct: 497 LKLQRKH-----------IILLTDGQSQAGNYED----LITEGKEDGITLSTVAIGQDAD 541 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L A + FY V + ++ S Sbjct: 542 ANLLEALSDMGSGRFYDVIDEQTIPSILSR 571 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMES---FFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ D I ++ K ++ + + K + G+V F + Sbjct: 92 DLMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAKRKGD-------RLGVVLFGD 144 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++IK + G T G+ D Sbjct: 145 HAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + AKK A +Y +G+ Sbjct: 194 PQRVMILLSDGSNTA--GVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTA 251 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I K Sbjct: 252 ADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 288 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 67.1 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 66/178 (37%), Gaps = 22/178 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + A +D++ ++D S S+ + + + + +++ + + + GL + Sbjct: 230 NSSAPVDIIFLVDGSWSIG------RSNFRLVREFLASLISPFNIA---RDKISIGLSQY 280 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S + L S + ++ L G +T + L + Q ++ Sbjct: 281 SGDPRTEWDLNKFASKDKVLEAVRNLRYKGGNTFTGLALTHVLEQNLKLE------AGPR 334 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KI++ +TDG K ++ K G ++AIG++ E + + P Sbjct: 335 PEADKIVILLTDG-----KSQDEANAAAQALKDLGISIFAIGVKNADEAELRQVASHP 387 >gi|311258307|ref|XP_003127549.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Sus scrofa] Length = 441 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 75/206 (36%), Gaps = 23/206 (11%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + + + + + + D++ +LD S S+ + +++ + + A+L Sbjct: 12 SLRLARSGAERGPPTSAPQGDLLFLLDSSASVSHY---EFSRVREFLGRLAALL------ 62 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 P +++ LV ++ F S +Q I+ + TN+ L YA Q+ Sbjct: 63 PLGPGALRASLVHVGSRPHTEFPFGQHSSGSAVQDAIRAAVQRMGDTNTGLALAYAKEQL 122 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F + K++V++TDG + E K GA V+ + Sbjct: 123 FAKE------AGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGATVFIVSTGRG 171 Query: 331 RSHEFLRACASP-NSFYLVENPHSMY 355 E A ++P + ++ Sbjct: 172 NLLELSAAASAPAEKHLHFVDVDDLH 197 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 38/240 (15%), Positives = 81/240 (33%), Gaps = 55/240 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ A + A I D+ V+ G+V F+ + Sbjct: 88 IILAMDVSGSMRA-TDVQPDRLTAAQNAAKAF------IQDLPRHVRVGVVAFAGTAQLA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG-------------MRQHC 280 L L + I + T + G+ A +F G +R Sbjct: 141 QLPTQSHEDLLKAIDSF-QLQRGTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPID 199 Query: 281 NTEDANYKK--------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 A+ K I+ +TDG+ + + ++ + A RG VY +G Sbjct: 200 EVGRADPAKKPFTPVAPGSYRSAAIIMLTDGQRTTGVDPLEAAQW---AADRGVRVYTVG 256 Query: 327 IRVIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + ++ + L+A A + ++ + + + + ++ Sbjct: 257 VGTVQGELIGFEGWSMRVRLDEDTLKAVALRTNAEYFHAATAQDLRKVYETLSSRLTVEK 316 >gi|260836190|ref|XP_002613089.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] gi|229298473|gb|EEN69098.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] Length = 267 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 68/178 (38%), Gaps = 25/178 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ +LD S S+ ++ + + ++ Q G+V + + Sbjct: 2 PVDLVFLLDGSGSIT------APNFEITKSFVQNTTSDFQI---GTAHTQVGVVQYEDNP 52 Query: 231 EEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDANY 287 + F L ++ L I+ ++ G T + + + N + + G R Sbjct: 53 YDEFPLNQYATLDELLTAIRNITYRGGGTQTGKAIDHVVDNSLTESHGARPG-------V 105 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNS 344 K+++ +TDG++ + A G I+ AIG+ +E + +S ++ Sbjct: 106 PKVVIVVTDGQSWD-----SVVAPAQRANHSGIIMVAIGVGSGYDINELMEIASSNDT 158 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 72/213 (33%), Gaps = 46/213 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM + ++ +ID +E + GL+ F + Sbjct: 92 DLLLAVDISGSMRVEDMVIANQAVRRIDAVRDIGAEFIER-------REGDRVGLILFGS 144 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + +++ + G T L A ++ D + + Sbjct: 145 RAYMQSPLSFDRDTVKQFLSEAQIGFAGSETAIGDALGLAVKRLRDKE-----------D 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++ +TDG++ ++ D L A G VY IGI Sbjct: 194 GDRVVILLTDGQDTASSVDP--LDATALAANYGVKVYTIGIGADEMLVPSLFGNRRVNPS 251 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFS 359 E L A A + ++ +P + + Sbjct: 252 AELDEETLSAMAESTGGRYFRARSPDELAKIYD 284 >gi|317419330|emb|CBN81367.1| Integrin alpha-M [Dicentrarchus labrax] Length = 1058 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 85/223 (38%), Gaps = 29/223 (13%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S + PI SS+ D+ +LD S S+ S TK+ +K++ + Sbjct: 43 SNRVGSPIPSSLDECRSEA---DIAFLLDGSGSVAS---QDFTKMKDFVKNLVNSFQGKD 96 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + LV + + F + W +I + + T + +++ N + Sbjct: 97 TKFAIAQFSNAPLVHY--YFDTFDINNWRT-----QIDRIRQLTGGTYTAAAIEHVVNNV 149 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG---- 326 FD N KK+++ +TDGE+ ++ + A+ + + +AIG Sbjct: 150 FDPS------RGSRLNVKKVLIVITDGESHDRRDLPYAASL---AQGKNIVRFAIGVGGA 200 Query: 327 IRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIV 366 + + + L AS + + V+N ++ ++ I Sbjct: 201 FSNVAAKQELDTIASDPPASHVFRVDNFGALEQIRQNLQDKIF 243 >gi|224047663|ref|XP_002193801.1| PREDICTED: vitrin [Taeniopygia guttata] Length = 746 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 42/287 (14%), Positives = 100/287 (34%), Gaps = 35/287 (12%) Query: 83 LCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC 142 R+ ++ ++ +++ +D V + YSI+ S + + Sbjct: 481 ASRLARESGINIFFVTVAAAAQSEKQNVIEPNFVDKAVCRTSGFYSINVPSWFSLHKVVQ 540 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + ++ + T D+ V+D S S+ ++ + Sbjct: 541 PLVKRVCDTDRLACSKTCL--------NAADIGFVIDGSSSVG------TGNFRTVLQFV 586 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVS-HLQRKIKYLSKFGVSTNST 260 + +E ++ + G V ++ + EF ++ + IK +S + T++ Sbjct: 587 ANISKEFEISDTD---TRIGAVQYTYEQRLEFSFDKYSTKQDVLNAIKRISYWSGGTSTG 643 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 + YA Q+F N +KI++ +TDG + A + G Sbjct: 644 AAISYASEQLFS---------KSKPNKRKIMILITDG-----RSYDDVSVPAMAAHQNGV 689 Query: 321 IVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 I Y++G+ E + P + V+ ++Y + + ++I Sbjct: 690 IAYSVGVAWAAPDELEAIASDPAKEHSFFVDEFDNLYRYVNQLIQNI 736 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 83/222 (37%), Gaps = 29/222 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I + + Q+ L++ ++LD S SM ++ K+ ++E + Sbjct: 21 SQRQLSIAIRAITQSQDQSL-PLNLCLILDHSGSMHGRP------LETVKKAAMQLIERL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 K + ++ F ++ + + ++ ++ +I+ LS G T GLK Sbjct: 74 KEGD------RICVIAFDHRAKVLVPNQAIDNLNTIKSQIRQLSADG-GTAIDEGLKLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ A+ + +TDGEN ++++ L + A + + +G Sbjct: 127 EEV----------AKGKADAVSQVFLLTDGENEH-GDNERCLKLAHFAVEHKLTINTLGF 175 Query: 328 RVIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + + L A S +E P F + I Sbjct: 176 GASWNQDVLEKIADSGSGTLCYIEQPEQAVQEFGRLFNRIQA 217 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+V+D+S SM+ + I ++ + ++ +E+ + G+V F + Sbjct: 98 DMMLVVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFVEK-------RQGDRLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++I + G T G+ D Sbjct: 151 HAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + L AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 258 SELDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 294 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 76/205 (37%), Gaps = 35/205 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM TK+ + +++ +++ + + +++FS+ Sbjct: 248 RAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD------RLSVISFSS 295 Query: 229 KIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L G + + L G TN GLK + D R+ N Sbjct: 296 TARRNFPLRLMTETGKQEALQAVNSLVSNG-GTNIAEGLKKGARVLID----RRFKNPVS 350 Query: 285 ANYKKIIVFMTDGE-------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + IV ++DG+ ++ E V+A G + Sbjct: 351 S-----IVLLSDGQDTYTMTSPNGSRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMH 405 Query: 338 ACA--SPNSFYLVENPHSMYDAFSH 360 + A S +F +E+ + DAF+ Sbjct: 406 SIAENSGGTFSFIESETVIQDAFAQ 430 >gi|328865949|gb|EGG14335.1| Ubiquitin-conjugating enzyme E2 4 [Dictyostelium fasciculatum] Length = 563 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 74/209 (35%), Gaps = 20/209 (9%) Query: 171 RLDMMIVLDVSRSM--------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +LD+ IVLD+S SM + +T+I+ A + ++ V G Sbjct: 145 QLDV-IVLDLSGSMQQPAYIGSRVPGELEMTRIEAAQATFQTFIDRFVS---YRYPVAVG 200 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 LV F KIE F + ++ + T +K A I ++ + Sbjct: 201 LVCFGQKIEATFPISSNFDSFSNELGEVEAHQSQTRLWEAIKRAAEVI--VEFKKSPTLK 258 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--A 340 N + I +TDGE+ + + + G I+ +I I + L A A Sbjct: 259 LAPNVRSRIFCLTDGEDNGSTP---VFTVFDYLRTHGIILDSIPIG-QQGRATLSAFSKA 314 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + + ++ + F ++ R Sbjct: 315 TGGTCFVANSSIECVQLFEREALLMLEHR 343 >gi|301611663|ref|XP_002935353.1| PREDICTED: LOW QUALITY PROTEIN: anthrax toxin receptor 1-like [Xenopus (Silurana) tropicalis] Length = 565 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 75/219 (34%), Gaps = 28/219 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID-MAIKSINAMLEEVKLIPDV 215 +S D+ VLD S S+ + ++ +A + I L Sbjct: 27 STPGQGGSHSCYGG-FDLYFVLDKSGSVLHHWSEIFYFVEHLAQRFIGPQL--------- 76 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDM 273 + + FS + L +++ ++ L K G T G++ A QI+ Sbjct: 77 ----RMSFIVFSTRGSTLMRLTEDREQIRQGLEELRKVLPGGDTYMHEGIERASEQIY-- 130 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 H + + +I+ +TDGE N +++ GA VY +G++ Sbjct: 131 -----HESIKGYRTASVIIALTDGELHEDLFYYAERE-ANRSRELGAQVYCVGVKDFNET 184 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V ++A I I+ Sbjct: 185 QLARIADSKDHVFPVNGG---FEALQDIIGSILKMSCIE 220 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 84/261 (32%), Gaps = 47/261 (18%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES- 186 ++P ++ ++ W+ + P + +AR MM+ +D+S SM Sbjct: 57 QAKTTPSLRMP-RWLAWLGWFLLCAALARPQQLGEVIQPPREAR-QMMLAVDLSGSMNEP 114 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + ++ A ++ L+ + GL+ F + L ++ + Sbjct: 115 DMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQRAYALTPLTADLTSV 167 Query: 244 QRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + ++ + G T + + ++ E ++++V +TDG N + Sbjct: 168 RDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQGQRVVVLLTDGVNTA 216 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----------------LRACA--SP 342 L AK G ++ I + LR A + Sbjct: 217 --GVLDPLKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTG 274 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 F+ + + ++ + + Sbjct: 275 GRFFRARDTEELAGIYAELDR 295 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 80/227 (35%), Gaps = 30/227 (13%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + R + I + L++ +VLD S SM + ++ Sbjct: 16 ATQAQSQRQLSFSIGAIASDFQDATLPLNLCLVLDHSGSMAG------QPLRTVKEAAIQ 69 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPG 262 ++++++ + ++ F +K + + +H++ +I L G T G Sbjct: 70 LVDQLREGD------RLSVIAFDHKAKVIVPNQDVTDKAHIKAQIDRLEAAG-GTCIDDG 122 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAI 321 +K ++ G R VFM TDGEN ++ + L A + G Sbjct: 123 IKLGLQELASSPGKRAAQ-----------VFMLTDGENEH-GDNGRCLEIAAVAAEHGVT 170 Query: 322 VYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIV 366 + ++G + + L A+ S +E P+ F + + Sbjct: 171 LNSLGFGENWNQDVLEKIADAANGSLAYIETPNQALTEFERLLQRAQ 217 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 84/261 (32%), Gaps = 47/261 (18%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES- 186 ++P ++ ++ W+ + P + +AR MM+ +D+S SM Sbjct: 57 QAKTTPSLRMP-RWLAWLGWFLLCAALARPQQLGEVIQPPREAR-QMMLAVDLSGSMNEP 114 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + ++ A ++ L+ + GL+ F + L ++ + Sbjct: 115 DMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQRAYALTPLTADLTSV 167 Query: 244 QRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + ++ + G T + + ++ E ++++V +TDG N + Sbjct: 168 RDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQGQRVVVLLTDGVNTA 216 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----------------LRACA--SP 342 L AK G ++ I + LR A + Sbjct: 217 --GVLDPLKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIAQQTG 274 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 F+ + + ++ + + Sbjct: 275 GRFFRARDTEELAGIYAELDR 295 >gi|73980136|ref|XP_851163.1| PREDICTED: similar to vitrin isoform 2 [Canis familiaris] Length = 645 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ + + V N YS++ S + + + ++ + T Sbjct: 397 EGATENEKQYVTEPNFSNKAVCRTNGFYSLNVQSWFSLHKTVQPLVKRVCDTDRLACSKT 456 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 457 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVANLSKEFEISDTD---T 499 Query: 220 QSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ ++E F S + IK + + T++ + YA Q+F Sbjct: 500 RIGAVQYTYEQRLEFGFDDYNTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF------ 553 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 554 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMVAHHKGVITYAIGVAWAAQDELEV 605 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +Y + + I Sbjct: 606 IATHPANDHSFFVDEFDHLYKFVPKVIQSI 635 >gi|118093660|ref|XP_001235190.1| PREDICTED: similar to Bardet-Biedl syndrome 5 [Gallus gallus] Length = 1556 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 67/163 (41%), Gaps = 21/163 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + A+++ V + N + GLV + Sbjct: 739 CKEIPMELVFVIDSSESVGP------ENFEIIKDFVTALVDRVTV---GRNATRIGLVLY 789 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S +++ F L + ++R I+ + G T + ++ A ++ Sbjct: 790 SLEVQLEFGLNKHTTQQDVKRAIRKMQYMGEGTYTGTAIRKA---------TQEGFLGAR 840 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +K+ + +TDG+ +E + EA +YAIGI Sbjct: 841 TGVRKVAIVLTDGQ-ADKREAVKLDIVVREAHAANIEMYAIGI 882 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 77/191 (40%), Gaps = 21/191 (10%) Query: 146 PWYTNSRHIVMPITSS-VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 PW+ ++ + + I + + Q +++ ++DVS SM++ K+ +A S+ Sbjct: 188 PWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVDVSGSMDTE-----DKLQLAKSSLKM 242 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + ++++ + + +G + K+ + I L+ G STN +K Sbjct: 243 LTKQLRAQDSITLITYAG----NTKVVLPSTPGNQTQKILNAIDNLTASG-STNGEAAIK 297 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY Q +H + N I+ +TDG+ N+ + L + +G + Sbjct: 298 LAYQQ------ATEHFKKDGINR---ILMLTDGDFNVGVSSVKDMLQIIRSNRDKGISLS 348 Query: 324 AIGIRVIRSHE 334 +G ++ Sbjct: 349 TLGFGQGNYND 359 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 76/195 (38%), Gaps = 24/195 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T V ++T +D+++ +DVS SM + D ++ K + K P Sbjct: 73 LARPQTKDVSTRTKTTKGIDIVMAIDVSSSMLA-RDLKPDRLTALKKVAADFI---KKRP 128 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIF 271 + + GLV ++ + + S + ++ ++ + T GL A N++ Sbjct: 129 ND----RIGLVAYAAESYTKTPITSDKSIVLSSLRQITHGQLEDGTAIGMGLATAVNRL- 183 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + K+I+ +TDG N S + ++ + A + Y IG+ Sbjct: 184 ----------KDSKSKSKVIILLTDGVNNSGFIEPKTA--ADLAVEYKIKTYTIGLGT-N 230 Query: 332 SHEFLRACASPNSFY 346 + +P+ Y Sbjct: 231 GNALTPIAFNPDRTY 245 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 77/207 (37%), Gaps = 37/207 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++++D S SME KI++A + ++E ++ I G++ F N + Sbjct: 395 VVLIIDKSSSMEG------RKIELARLAAIGVVENLRPIDS------VGVLIFDNSFQWA 442 Query: 234 FLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + +++ I ++ G T P L AY +I K I Sbjct: 443 VPIRKAEDRATIKKLISGITPDG-GTQIAPALTEAYQRIL-----------PQTAMYKHI 490 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVE 349 V +TDG E+ S+ EA+ + +G+ + FL AS Y + Sbjct: 491 VLLTDG----ISEEGDSMTLTKEAQANHVTISTVGLGQDVNRAFLEKVASNADGKAYFLN 546 Query: 350 NPHSMYDAF-----SHIGKDIVTKRIW 371 +P + H G V K I Sbjct: 547 DPSGLEQLLLRDVEEHTGVTAVEKAIT 573 >gi|327284423|ref|XP_003226937.1| PREDICTED: anthrax toxin receptor 2-like [Anolis carolinensis] Length = 441 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 74/214 (34%), Gaps = 27/214 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + D+ VLD S S+ + I ++++ V+ ++ Sbjct: 31 AQDQQPSCHGAFDLYFVLDKSGSVTDNWF-----------EIVDFVKQL-TDRFVSPRMR 78 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + FS + + L + ++R ++ L K G T G+K A QI G R Sbjct: 79 LSFIVFSMQAKVILQLTENRAQIERGLEELRNVKPGGETYMHEGIKEANRQIETAGGQRT 138 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + II+ +TDG+ + +++ GA VY +G+ + Sbjct: 139 NS---------IIIALTDGKLEGLIPQYAEKQ-ADISRRLGARVYCVGVLNFNQEQLESI 188 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + + V + A I I+ + Sbjct: 189 ADSRDQVFPV---KEGFQALRGIINSILKQSCTE 219 >gi|296224100|ref|XP_002757907.1| PREDICTED: vitrin [Callithrix jacchus] Length = 656 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 91/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ +V + V N YS+ S + + T + ++ + T Sbjct: 408 EGAAENEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQTLVKRVCDTDRLACSKT 467 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 468 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVTNISKEFEISDTD---T 510 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ + F + S + IK + + T++ + +A Q+F Sbjct: 511 RIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF------ 564 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG A +G I YAIG+ E Sbjct: 565 ---MKSKPNKRKLMILITDGR-----SYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEV 616 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + ++I Sbjct: 617 IATHPARDHSFFVDEFDNLYQYVPRVIQNI 646 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 70/208 (33%), Gaps = 23/208 (11%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + + Y P + + V + + ++D+ ++D S S+ Sbjct: 228 LWSTTTYTSSQNRPGANPGFVPKEELSTQSLEPVSL-GDPNCKVDLSFLIDGSASIGK-- 284 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRK 246 + + + + + + + + P G+V + + F L+ L+ Sbjct: 285 ----RRFRIQKQLLADVAQALDIGPAGP---LMGVVQYGDNPATHFNLKTHMNSQDLKTA 337 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I+ + + G +N + + F N + ++V M DG E+ Sbjct: 338 IEKIPQRGGLSNVGRAISFVTKNFFSK------ANGNRSGAPNVVVVMVDGWPTDKVEEA 391 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 L A++ G ++ I I +E Sbjct: 392 SRL-----ARESGINIFFITIEGAAENE 414 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 76/210 (36%), Gaps = 34/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S + I+++D + + + + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSASGAKISRLDATKEVLADFAKT-------RQGDRLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + + L++ G ST+ + A ++F+ Q Sbjct: 160 ---DAAFVQTPFTADQKVWLALLNQTDVAMAGQSTHLGDAIGLAI-KVFEQSEPSQAAF- 214 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 +K+ + +TDG T + + AK +G ++ I + L Sbjct: 215 -SKPRQKVAIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGDPSTVGESALDLQT 271 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 272 IERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DMM+V+D+S SM + D I ++A+ + + + GL+ F++ Sbjct: 86 DMMLVVDLSYSMS---QQDMKSGDQFIDRLSAVKQVLSDFIAKRQGDRLGLIFFADHAYL 142 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + +++ L G T G+ A D +++ Sbjct: 143 QTPLTLDRQTIAQQLNQAVLRLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 191 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------S 332 ++ ++DG N + AKK +Y +G+ Sbjct: 192 MILLSDGSNT--SGVLDPMEAAKIAKKYHTTIYTVGVGAGEMMVKEFFMTRKINTAEDLD 249 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+A A + ++ N + + I + Sbjct: 250 EKTLQAIADETGGQYFRARNQQDLQHIYDTINQ 282 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 112/315 (35%), Gaps = 48/315 (15%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + ++ + ++ N R + ND + + R + D + ++ IS Sbjct: 150 TRSPQQDPHLALHQQVSNRRREVRRLRTSEPADYNDDEPLQRMEAFDDLNFGSSKTAEIS 209 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ----------TDARLDMMIVLDV 180 + ++ + T + + H+ P +SS +D++ VLDV Sbjct: 210 SYPEFQA-VPQSTCLDGFDILIHVKAPTSSSDDATGSLVNGSSLRLSRRVPIDIVTVLDV 268 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-- 238 S SM TK+ + +++ +++ ++ + ++ FS+ F L+ Sbjct: 269 SGSMAG------TKMALLKQAMGFVIQHLRPSD------RLSVIAFSSTARRLFPLQRMS 316 Query: 239 --GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 G + I L G TN LK A I D C+ I+ ++D Sbjct: 317 HHGRQQALQAINSLGA-GGGTNIADALKKAVKVIADRSYKNSVCS---------IILLSD 366 Query: 297 GENL--------STKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFLRAC--ASPNSF 345 G++ T ++SL + ++ G + L + AS +F Sbjct: 367 GQDTYNISSNFQGTSAGRRSLVPSANPNELHMVPLHTFGFGADHDSDTLHSISEASGGTF 426 Query: 346 YLVENPHSMYDAFSH 360 +E+ M DAF+ Sbjct: 427 SFIEDEGVMQDAFAQ 441 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 77/245 (31%), Gaps = 33/245 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P + +V+ T+ S L+ +M+V+DVS SM + D ++ A Sbjct: 65 RHVPTILLATSLVLLTTAMAGPTSDVRIPLNRAVVMLVIDVSESM-AATDVPPDRLTAAK 123 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +E+ + GLV F+ + ++ I L T + Sbjct: 124 EAGKQFADELTPAIN------LGLVEFAANASLLVSPTTNRAAVKAAIDSLKP-APKTAT 176 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYCNEAK 316 GL A I + + D IV +DG Q + AK Sbjct: 177 GEGLFTALQAIATVGSVMGG---GDGPPPARIVLESDGAENVPLDPNAPQGAFTAARAAK 233 Query: 317 KRGAIVYAIGIRVIRS---------------HEFLRACA-SPNSFYLVENPHSMYDAFSH 360 G + I + C + + ++ S+ + +S Sbjct: 234 AEGVQISTISFGTPYGTVEYEGATIPVPVDDQTLQKICEITDGQAFHADSLESLKNVYST 293 Query: 361 IGKDI 365 + + I Sbjct: 294 LQRQI 298 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 83/240 (34%), Gaps = 19/240 (7%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPIT--SSVKVNSQTDARLDMMIVLDVSRSMES 186 SR IPL+ + I P T + A + ++D S SM Sbjct: 239 FDVDSREYIPLESRSDNKRRYVQATITEPGTYVVGADASVNEAATTRVAFLIDNSGSMYP 298 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---NKIEEFFLLEWGVSHL 243 + + + + I ++ + G+ F+ ++ F +S + Sbjct: 299 KELCPTSSENDVDFKRLDFTQSL--IDKFDSDFRIGISKFTGTYTRMCGFTDDRTALSDV 356 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++I+ + T++ LK + T D Y IIV ++DGE+ T Sbjct: 357 IKRIRTEDEIFDGTHNQTALKRCIEEFT---------ATGDGKYVNIIVMLSDGESDETN 407 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + A ++ IV +G+ +L+ A + +Y +++ D + I Sbjct: 408 A-ESIKNLARLANEKSVIVLTVGLGREIDRAWLQEMAYSTGGKYYSASEANALDDVYKQI 466 >gi|301619004|ref|XP_002938896.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 496 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 44/205 (21%), Positives = 80/205 (39%), Gaps = 27/205 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIE 231 D++ +LD S S+ + + + I +L P VQ+ +V S N + Sbjct: 36 DLIFLLDSSGSVSYY------EFAKVREFIGNLLRPFTFGP---QDVQASIVHISTNPVL 86 Query: 232 EFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF ++G S +QR I+ + + TN+ L Y +FD + A K+ Sbjct: 87 EFPFNQYGSSQEIQRAIQNIKQRMGDTNTGKALSYIKENLFD------ERSGSRAEVPKV 140 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN----SFY 346 +V++TDG + K G V+ + E A ++P+ F Sbjct: 141 MVWVTDGLSTDDIS-----QPMQLLKDMGVTVFIVSTGRGNYLELSAAASTPSDTHLHFV 195 Query: 347 LVENPHSMYDAF-SHIGKDIVTKRI 370 V++ H + I + I +R+ Sbjct: 196 DVDDLHIITKELRDSIIELIRARRL 220 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 66.7 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 54/269 (20%), Positives = 94/269 (34%), Gaps = 43/269 (15%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS---RHIVMPITSSVKVNSQTDARLDMM 175 + + + YK + + NS +V P ++ ++ D + Sbjct: 253 THNSEAPNMQ-NKMCNYKSTWEIIMESDDFRNSSVVNSLVPPFETTFELLQTQDRAVS-- 309 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +VLDVS SM + +I + L ++ I + G+VTF + E Sbjct: 310 LVLDVSGSMNTN-----NRITNLRTAAEVFLIQIIEIGS-----RVGIVTFESSAYEKSP 359 Query: 236 LEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L S R+ ++ L G T G++ I + G IV Sbjct: 360 LLQITSVATRQRLVQNLPTTAGGGTKICAGIEKGLEIITNAIGTTYGSE---------IV 410 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY--L 347 +TDGE ++ C + K+ GAI++ I + + E Y Sbjct: 411 LLTDGE-------DSTMSLCREKVKESGAIIHTIALGPSAAKELEEFSNITGGLQLYAVD 463 Query: 348 VENPHSMYDAFSHIGK---DIVTKRIWYD 373 V+ P + +AFS I DI + I + Sbjct: 464 VDVPSKLVEAFSEITTGSGDISEQSIQLE 492 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 84/225 (37%), Gaps = 22/225 (9%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQTDARLD 173 + + + Y S S + P T + + H ++ +V+ + Sbjct: 185 VRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNEHKMLLRIGLKGYEVDKSQLGASN 244 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LDVS SM S K+ + S+ + +++ V+ VV +G + + Sbjct: 245 LVFLLDVSGSMNS-----RDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGA----SGVVLD 295 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + L G STN G++ AY ++H N ++ Sbjct: 296 GVKGNDTQAINQALNSLKA-GGSTNGGAGIQQAYRL------AQKHFIQGGVNR---VIL 345 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 TDG+ N+ T + Q + + +G + +G ++ L Sbjct: 346 ATDGDFNVGTTDHQALMDLIAAKRDQGIALTTLGFGQGNYNDHLM 390 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 81/234 (34%), Gaps = 28/234 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + I I P ++ + E+SH K L I+++ + A I N Sbjct: 17 FKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATL--ALTIENNEI 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + +++K ++ N + L F+ I +I +T Y+ + Sbjct: 75 PDEPQQIKNNALVLSYVNAY-------LPSKKFLVPIININDNT--------HYLEYNAA 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y + + +I + + D++ V D S SM F+ Sbjct: 120 VTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNE 179 Query: 191 SITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + + I ++ + ++ +I D +N+ G + WG + Sbjct: 180 NKPRDHERIDALRSAFRKLHDIIMDNSNINAIG----------YIPFSWGTKRI 223 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 63/193 (32%), Gaps = 39/193 (20%) Query: 200 KSINAMLEEVKLIPDVNNVVQSGLV------TFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 K+I+++ + I + V+ G V +S + E++ I + Sbjct: 275 KTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSLEQEQYINNIDN-------IIEMEPH 327 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-- 311 G + S+ L N +F + H KK+++ ++DG + + + Sbjct: 328 GWTLISSGILSA--NNLFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISK 377 Query: 312 -------CNEAKKRGAIVYAIGIRVIRSHEF-------LRACASPNSFYLVENPHSMYDA 357 C E K+ + I I + + C +++Y N H + Sbjct: 378 MLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHK 437 Query: 358 FSHIGKDIVTKRI 370 T+ + Sbjct: 438 LQQAVSGSTTREV 450 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 74/233 (31%), Gaps = 35/233 (15%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + ++ I L + +G ++ S +T + S +D + + A + + Sbjct: 22 GGNIAVIFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSG------- 74 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 K+ ++ + ++N D I + + +N ++ Sbjct: 75 KITAENVQSAANTYFTSLYKNT--------DAPSIDVTATYTPKTSSENAKLTVGGTGSI 126 Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 +I + + RL + + LDV+ SM+S K+ Sbjct: 127 NTEFMKVM---------NISQMSLGASSTTTWGGTRLRVALALDVTGSMDSAG-----KL 172 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE------EFFLLEWGVSH 242 + +++ +K V +V F+ + L+W S+ Sbjct: 173 SAMKTAAKQLIDTLKATSTTKEDVYISIVPFNVMVNVGPGNKNATWLDWDTSY 225 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 51/148 (34%), Gaps = 19/148 (12%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 + L+ +I L G TN G+ +A+ + + + Y IV ++DG Sbjct: 335 TTLKGRINTLDAQG-GTNQGIGMFWAWMTLQATAPLYTPAKDSEYKYTDAIVLLSDGMNT 393 Query: 298 --------ENLSTKEDQQSLYYCNEA--KKRGA---IVYAIGIRVIR--SHEFLRACASP 342 N S + D + C+ K G +Y I + L+ C S Sbjct: 394 KNRWYGNGSNWSPQVDDRQKILCDNITTKVNGVPETTIYTIQVNTSGDPESSVLKYCGST 453 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 F+ + AF +G + RI Sbjct: 454 GGFFSTTTASGIQSAFQEVGASLTKLRI 481 >gi|47226573|emb|CAG08589.1| unnamed protein product [Tetraodon nigroviridis] Length = 960 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 52/319 (16%), Positives = 113/319 (35%), Gaps = 36/319 (11%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 +++ + KG C + + + L+D+ + D+ ++ +TSL + + + Sbjct: 35 LVSAPWSGYSRNRKGDLYKCPVSESRTSCDKLNLQDSLSIPDVKNVNDNTSLGLTLAQND 94 Query: 125 EGYSISAISRYKIPLKFCTFIPW-YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 G + + F P + P + +D++IVLD S S Sbjct: 95 GGDLLMCGPLWGQQCSSQRFYPGICARLSPLFQPQPAFSPAVQVCGGPMDVVIVLDGSNS 154 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH 242 + + + I ++ + + P Q ++ + + F L ++ Sbjct: 155 IYPW--------EPMTAFIQKLIPSLDIGP---QATQLSIIQYGVDPKFEFRLNQYRTKE 203 Query: 243 --LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + +G STN+ ++YA F N K++V +TDGE+ Sbjct: 204 DALAAASRITQMYGHSTNTFQAIQYASQWGF------HQNNGGRPEAAKVMVVVTDGESH 257 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLV 348 + L C++ +G + I + I + +++ AS N F+ V Sbjct: 258 DVAFRESVLDECDK---KGITRFGIAVLGYYIRNSIDTENLIAEIKSIASKPTANYFFNV 314 Query: 349 ENPHSMYDAFSHIGKDIVT 367 ++ + +G I Sbjct: 315 SEEAALSNIAGTLGDRIFN 333 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 71/213 (33%), Gaps = 31/213 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 VK +D+++ LD+S SM+ +++D + N+ + Sbjct: 89 VKQTVAHSRGIDVILALDISESMQLKDAGGRSRLDAVKSVAREFVTR-----HSND--RI 141 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V F K L I +S T + A N++ Q Sbjct: 142 GVVVFKGKGYTLSPLTLDHRVTGMLIDNVSPDVIRDEGTAVGTAVLIAVNRLRASQSD-- 199 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-------R 331 +K+I+ +DG + + + D + + A +G +Y G Sbjct: 200 ---------QKVIILFSDGVSNAGEIDPVTA--ASFAAAQGIRIYTAGAGSASSASSALD 248 Query: 332 SHEFLR-ACASPNSFYLVENPHSMYDAFSHIGK 363 E R A + ++ S+ +AF I + Sbjct: 249 EGELRRVALTAGGRYFRAGTSASLAEAFESIDR 281 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 68/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS SMES D S ++++ A ++ + V GLV+++ Sbjct: 91 VMLVVDVSLSMES-TDVSPSRLEAAQQAATTFANNLTPG------VNLGLVSYAGTASML 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + LS T + + A I D IV Sbjct: 144 VAPTTDRGPVVRAVDRLS-LDERTATGEAIYTATQAITTFTESLGG---PDQAPPARIVL 199 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFL 336 ++DG+ E++ + A + G V I V L Sbjct: 200 LSDGKETVPADPTEERGAFTAAERAAEAGIPVSTISFGTLYGTVDIQGRPQPVPVDDASL 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 R A S F+ + + + + I Sbjct: 260 RTIAELSGGDFFTASTLEELDSVYRTLEEQI 290 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 80/232 (34%), Gaps = 50/232 (21%) Query: 168 TDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D+++ +DVS SME+ + ++D+ K + + E + + GL Sbjct: 83 DSSGIDIVLAVDVSGSMEALDFTINNEPANRVDVVKKVVFRFIGE-RPDD------RIGL 135 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F+ + L L R+++ + T + + N++ D + Sbjct: 136 VAFAGRPYMVSPLTLDHDWLGRRLQTIHPGMVEDGTAIGSAIGSSINRLRDQKAKS---- 191 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 K+++ +TDG N + K + A+ G +Y IG Sbjct: 192 -------KVVILLTDGMNNAGKILPVTA--AEAAETLGIKIYTIGAGSRGEVPVPITDKF 242 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 L A + +Y + S+ +S I K TKR Sbjct: 243 GNQKIVRAKVDIDEATLEKVAQMTGAKYYRATDTDSLKKIYSEINKLETTKR 294 >gi|260797475|ref|XP_002593728.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] gi|229278956|gb|EEN49739.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] Length = 186 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 67/196 (34%), Gaps = 19/196 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ + + + ++ V + G+V +S+ Sbjct: 2 DLFFVLDGSGSVGP------DNFETVKQFV---VDVVSAFTISLTDTRVGVVQYSDFNTL 52 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L I + G T + ++YA ++ + R +I Sbjct: 53 ACNLGDHPDEASFVTAINTMQYQGGGTATGDAMEYARVKLQAV--WRPAPTPRKFPLFQI 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVE 349 ++ +TDG K + G VYAIG+ + E L + + ++ Sbjct: 111 MIVLTDG-----KSGDDVVAAAQALAADGVTVYAIGVANFDTAELLEITNGNQDRVIELK 165 Query: 350 NPHSMYDAFSHIGKDI 365 + ++ + + I + + Sbjct: 166 DYTALTASINSIIRAL 181 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 78/227 (34%), Gaps = 29/227 (12%) Query: 149 TNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + H T V+ A LD+M VLD S ++ D+ + ML Sbjct: 37 LGNSHTGASETGQFGVDYCQPGAGLDLMFVLDNSGTVN-LDDTDSIRSSTVSDYAENML- 94 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + G+++F+ + + + L + L T+ + G++ A Sbjct: 95 ------PGD---RGGIISFNTEADMLQEMSDNRYDLLDALSALPDPSGGTDLSQGMRAAN 145 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 Q +G K+I+V +TDG + + EA+ G ++ +G+ Sbjct: 146 EQFVQTKGAN----------KQIMVLITDGADTINLA--EVYNQVREARMNGITIFTLGL 193 Query: 328 RVIRS---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + + L+ A + + V N + I + R Sbjct: 194 GSLATGLDEALLQDIADQTRGQYRQVPNATVIESVLQDIRSSLEGMR 240 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 74/213 (34%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +DVS SM + D ++ +A + + + P+ N GL F Sbjct: 83 KETEGIDIMLTMDVSASMLTD-DVYPNRMAVAKEVASEFI---SSRPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + L + + T GL A +++ D + Sbjct: 135 AGEAFTQCPMTLDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 KI++ +TDG N + AKK G VY IG E Sbjct: 193 ---------KIVILLTDGSNNVGSISPMTA--AAIAKKFGIRVYTIGFGRETGEEIGAID 241 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + FY ++ + + I K Sbjct: 242 YRALQNIAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|126304011|ref|XP_001381695.1| PREDICTED: similar to tumor endothelial marker 8 [Monodelphis domestica] Length = 564 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 77/224 (34%), Gaps = 30/224 (13%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + N + D+ +LD S S+ ++ I +E + Sbjct: 22 VLVCAGQGGNRENGGPACYGGFDLYFILDKSGSVLHHWN-----------EIYYFVEHLA 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 71 HKFISPQ-LRMSFIVFSTRGSILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L + N +++ GA VY +G++ Sbjct: 130 QIYYENMQGYRTAS-------VIIALTDGE-LHEDLFFYAEREANRSRELGATVYCVGVK 181 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 182 DFNETQLARIADSKDHVFPVNDG---FQALQGIIDSILKKSCIE 222 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMES---FFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ D I ++ K ++ + + K + G+V F + Sbjct: 98 DLMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAKRKGD-------RLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++IK + G T G+ D Sbjct: 151 HAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I K Sbjct: 258 ADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 294 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 83/223 (37%), Gaps = 34/223 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + D+M+V+D+S SM S I+++D + + K Sbjct: 92 EPQVREQLGRDVMVVVDLSGSMAEQDFTSKRGDKISRLDATKEVLADFATTRKGD----- 146 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYA---YNQ 269 + GL+ F + F+ + ++ L++ G ST+ + A + Q Sbjct: 147 --RLGLILFG---DAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQ 201 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 Q + + +K+++ +TDG T + + AK +G ++ I + Sbjct: 202 SEKQSAAVQDSSVDANEKEKVVIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGD 259 Query: 330 IRS-------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++ E ++ A S + N + A++ IG+ Sbjct: 260 PQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 302 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 40/211 (18%), Positives = 85/211 (40%), Gaps = 32/211 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D++ ++D S S+ S ++ ++++ E + L+ + Sbjct: 144 CPNQEIDIVFLIDGSGSI---APSEFKQMKDFVRAVMGQFEGTNTLFS--------LMQY 192 Query: 227 SNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ ++ F S L R + + + T + G+ N++F N Sbjct: 193 SSGLKTHFTFSKFRSSLSPLRLVDPIVQLQGLTFTATGILAVVNELF------HSKNGAR 246 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHEF--LR 337 + KKI++ +TDG+ + + + +A++ G I YAIG+ I E + Sbjct: 247 RSAKKILIVITDGQKYKDPWEYRDVIP--QAERAGIIRYAIGVGDAFQEPIARQELNTIG 304 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + S + + V+N + A S I K + K Sbjct: 305 SAPSQDHVFKVDN----FAALSSIQKQLQEK 331 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 81/234 (34%), Gaps = 28/234 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + I I P ++ + E+SH K L I+++ + A I N Sbjct: 17 FKKNEQGAILISFMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATL--ALTIENNEI 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + +++K ++ N + L F+ I +I +T Y+ + Sbjct: 75 PDEPQQIKNNALVLSYVNAY-------LPSKKFLVPIININDNT--------HYLEYNAA 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y + + +I + + D++ V D S SM F+ Sbjct: 120 VTMAYPAKFLSQSPFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNE 179 Query: 191 SITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + + I ++ + ++ +I D +N+ G + WG + Sbjct: 180 NKPRDHERIDALRSAFRKLHDIIMDNSNINAIG----------YIPFSWGTKRI 223 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 63/193 (32%), Gaps = 39/193 (20%) Query: 200 KSINAMLEEVKLIPDVNNVVQSGLV------TFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 K+I+++ + I + V+ G V +S + E++ I + Sbjct: 275 KTIDSITGNAQTIDIPMSDVRFGDVCLQGSNAYSLEQEQYINNIDN-------IIEMEPH 327 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-- 311 G + S+ L N IF + H KK+++ ++DG + + + Sbjct: 328 GWTLISSGILSA--NNIFKNKAKNGH--------KKLMIILSDGVDTDDFPSSKGIIISK 377 Query: 312 -------CNEAKKRGAIVYAIGIRVIRSHEF-------LRACASPNSFYLVENPHSMYDA 357 C E K+ + I I + + C +++Y N H + Sbjct: 378 MLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCVGEDNYYEAHNAHELEHK 437 Query: 358 FSHIGKDIVTKRI 370 T+ + Sbjct: 438 LQQAVSGSTTREV 450 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 71/213 (33%), Gaps = 46/213 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D++I LD+S SM+ + +++ A ++ ++E + + G++ F + Sbjct: 89 DLLIALDLSGSMQVTDMALNGQPANRLEAAKSVLSDFIQERRGD-------RIGIIVFGS 141 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + + + ++ G T + ++ D Sbjct: 142 KAYLQAPLSFDTKTINQLVQEAQIGFAGEQTAIGDAIGLGIKRLEDKPSD---------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 KK+++ MTDG N + + Q A + ++ IGI Sbjct: 192 -KKVLILMTDGANTAGR--VQPQQAATFAASQNVKIHTIGIGADSMIVQSFFGPKAINPS 248 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFS 359 L+ A + ++ ++ + + Sbjct: 249 SDLDETLLKNIAAQTGGEYFRAKSTEDLQAIYQ 281 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 90/234 (38%), Gaps = 40/234 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + I L ++ ++ G+ ++V F + L +DR+++ AA+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASL--- 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD-IVRSTSLDIVVVPQNEG 126 + R+ + G + +D+ +V + +L++ Sbjct: 70 -----------------TQSRSPAEVVRDYVAKAGLEDYLDEPVVNANTLNV-------- 104 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S++A + Y +P F + + P S+ + +++ +VLD+S SM + Sbjct: 105 RSVTATAAYSMPTVFMKLLD----IDRLEAPAVSTAEERVSN---VEISLVLDMSNSMVT 157 Query: 187 FFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + ++D + ++ V D V+ +V ++ ++ L Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISVSIVPYTGQVNAGADL 211 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 ++++ C+ A+ +G VY++ L+ CAS Y + F I Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVFHSIAS 557 Query: 364 DIVTKRIW 371 I R+ Sbjct: 558 HITQLRLT 565 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 48/231 (20%), Positives = 83/231 (35%), Gaps = 35/231 (15%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + +T++ + S D+++VLD S SM T + ++ Sbjct: 32 CDGTLRVTLALTAAPDIVSN---PTDIVLVLDRSGSMTG------TPLADMKLGAKTFID 82 Query: 208 EVKLIPDVNNV------VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + D + + G+V+FSN L V L+ + LS G STN Sbjct: 83 LIDEATDSSQDGQIGSGSRMGVVSFSNTAVADTQLITSVDALKAAVDNLSA-GGSTNHAD 141 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 A Q+FD K++V TDG +T A+ +G I Sbjct: 142 AFAKAI-QLFDPASANA----------KVMVMFTDG---NTTIGAPPAPVAAAARAQGII 187 Query: 322 VYAIGIRVIRSHEF--LRACAS--PNSFYLVE-NPHSMYDAFSHIGKDIVT 367 +Y IG+ + L A+ S V N + + F+ + +I Sbjct: 188 IYCIGLIGSDGLDITALNDWATDPDASHVAVTPNAADLEELFAELAANISK 238 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 46/278 (16%), Positives = 101/278 (36%), Gaps = 26/278 (9%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 LR G + + + + +++ + + + ++ L + Sbjct: 155 AEALRRAGILLYVVGVRDAVWAELMEIASSPQENFTSFLPNFSGLSNLAQKLRQELCDKL 214 Query: 155 --VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P V + A D++ ++D S S+ + K+ + S+ L Sbjct: 215 AKAAPRVDHVSPACREAALADIVFLVDSSTSIG---PQNFLKVKNFLYSVVLGL------ 265 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQ 269 ++ V+ GL +++ I F L S + +I+ L TN+ L++ N Sbjct: 266 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMVLEQIQNLPYRPGGTNTGSALEFIRTNY 325 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + G R +I++ +TDGE + + + K+ G +VY +G+ V Sbjct: 326 LTEESGSRA-----KDRVPQIVILVTDGE-----SNDEVQEVADRLKEDGVVVYVVGVNV 375 Query: 330 IRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 E + + P + EN + + D I + + Sbjct: 376 QDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTL 413 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 50/397 (12%), Positives = 125/397 (31%), Gaps = 44/397 (11%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S ++ +NF +++ + + + +G+ +I+ + + ++ Sbjct: 243 STSIGPQNFLKVKNFLYSVVLGLDISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMVLEQI 302 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR-------DNGFVNDIDDIVRSTS 115 + G N + ++ + D +++ ++ Sbjct: 303 QNLPYRPGGTNTGSALEFIRTNYLTEESGSRAKDRVPQIVILVTDGESNDEVQEVADRLK 362 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT------- 168 D VVV KI + + T + +I+ + S+ + Sbjct: 363 EDGVVVYVVGVNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAVEGKIK 422 Query: 169 ---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A D++ + D S D+S ++ ++ +++ + Q GL Sbjct: 423 EFIKAYADVVFLADTS------QDTSRASFQRMQNFLSRVVGMLEV---GRDKYQIGLAQ 473 Query: 226 FSNKIEEFFLL-EWGV-SHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ FLL + + + I + G S + L+Y F + + Sbjct: 474 YGDEGHTEFLLNTYKTQNEMIAHIHERFVPLGRSRRTGKALRYLLQTFFQEEAGSRFL-- 531 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + V + S K + + +++G V ++G++ E L SP Sbjct: 532 --QGIPQYAVVI-----NSGKSEDEVQDAAQRLREKGVKVMSVGVQDFDRKE-LEGMGSP 583 Query: 343 NSFYLVENPHSMYDAFSHI-----GKDIVTKRIWYDK 374 + Y ++ + + G RI +K Sbjct: 584 DLVYDMQREDEVRHIVEDMNVVIRGTGQQEHRITANK 620 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 56/173 (32%), Gaps = 17/173 (9%) Query: 200 KSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 +S+ L V + ++ GL +S+ FLL + R I+ Sbjct: 49 RSVRNFLHILVNSFNVSSETIRVGLAQYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPGG 108 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 N LK+ I D + +I V ++ G + + Sbjct: 109 KNMGLALKF----ILDHHFQEASGSRASQGVPQIAVVISSG-----PAEDHVHGPAEALR 159 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + G ++Y +G+R E + +SP EN S FS + R Sbjct: 160 RAGILLYVVGVRDAVWAELMEIASSPQ-----ENFTSFLPNFSGLSNLAQKLR 207 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 173 DMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++V+D S SM S D S++K+D + N ++ + L ++GLV F +++ Sbjct: 421 DVVLVMDRSGSMNLSSAPDPSVSKMDALKYAANVFMDFLDLDAG----HRAGLVQF-HEV 475 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ +++ + G+ + + +G+ Q D + ++I Sbjct: 476 VVPFSPAFNLQPVNAASLSAAQTAINSMTAGGMTNIIDGV--NEGIAQLTTAVDPSDRQI 533 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ +TDG + + +Y++G + L A +EN Sbjct: 534 MLLLTDGLHNRPVGTSVTDITAPLL-ASEVTLYSVGFGTSTNEAELTPLALSTGGVHLEN 592 Query: 351 PH----SMYDAFSHIGKDIVTKRIWYD 373 + F I D Sbjct: 593 KDVSDLQLRKHFLSIAASAADSTTLID 619 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 66.7 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 74/207 (35%), Gaps = 26/207 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + S + + L + +V+D S SM D T++D+A ++ + Sbjct: 395 SLFPVSLDPPSREERFDLALTLVIDRSGSMSELVDGLRTQLDLAREAA------FQASLG 448 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 ++ Q ++ F + + L+ ++ ++ + LS G TN G+ A I Sbjct: 449 LSRQDQLSIIAFDSVADVILPLQPLPDLATIEDALSRLSA-GGGTNIRSGMALAAETIAT 507 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +H ++ +TDG + + D + + +G V + I + Sbjct: 508 ADARIRH-----------VILLTDGVSETEYADLVA-----NLRAQGVTVSTVAIGLNTD 551 Query: 333 HEFLRACA-SPNSFYLVENPHSMYDAF 358 E R +Y+V ++ Sbjct: 552 PELERVAQIGGGKYYVVRQAEALPQIL 578 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 66.3 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 46/258 (17%), Positives = 89/258 (34%), Gaps = 35/258 (13%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD- 173 S+ Q S+SA K F+ W +++ + S VK T D Sbjct: 32 SIYFPHTGQFMNNSVSAS-------KLLFFLKW-LGIIMLILALMSPVKDEPYTLEPKDG 83 Query: 174 --MMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++LD S+SM+ + +T+ D+ ++ ++E GLV F Sbjct: 84 YEIALILDASQSMKAQGFDVTNPQLTRFDVVKDIVSNFIKE-------RQNDNIGLVVFG 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + L + + L G+ Y +F + + Sbjct: 137 AYSFIASPLTYDENILNKIVSQL---------YIGMAGKYTALFTSLAQGVNLLKMSESK 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACA--SPN 343 K+ + +TDG + + + A K +Y IGI + + E LR A + Sbjct: 188 SKVGILLTDGFSTPEVDKIPFDVALDMAIKEKIKIYPIGIGMPHEYNIEVLRKIAEKTGG 247 Query: 344 SFYLVENPHSMYDAFSHI 361 + + + + + I Sbjct: 248 KAFGAASATELKEVYKEI 265 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 66.3 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 72/210 (34%), Gaps = 27/210 (12%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD---ARLDMMIVLDVSRSMESF 187 ++I ++ +V+ +T S +D+ + LD+S SM++ Sbjct: 44 VKPSWRIRMRPVLISLRAAAVGLLVVVLTRPQYAQSSERVVREGIDIQLALDISLSMKAG 103 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQ 244 +I +A + I ++ + GLV FS L + +L Sbjct: 104 DFDPKDRITVAKEVIAEFVK-------GRKDDRIGLVVFSGHAFTQVPLTLDYDFLQNLL 156 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 +++ + + T L ++ N + K+++ +TDG N + Sbjct: 157 GQVQTVRR-PDGTAIGLALAHSVNGL-----------RNSTTKSKVVILLTDGSNN--RG 202 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 D + A+ VY I + + E Sbjct: 203 DIEPAQAAEIARALDVRVYTILVGKPGNGE 232 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 66.3 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 43/259 (16%), Positives = 83/259 (32%), Gaps = 35/259 (13%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + K + + + + R +M+V+DVS SM+S D Sbjct: 47 VTPKNQDRFKHVPVVVLLVGLLLLTIAMAGPQAERDVPRNRATVMLVVDVSNSMKS-TDV 105 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + +++ A + D+ + + GL++F+ S + + L Sbjct: 106 APSRLKAAQAAGKRF------ADDLTDGINLGLISFAGTASTLVSPTPDHSATKNALDRL 159 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQ 307 T + G+ A QI + + IV ++DG+ S + + Sbjct: 160 K-LADKTATGEGIFAALQQIDTLNAVLGG---PSGAPPARIVLLSDGKQTVPESPDDPRG 215 Query: 308 SLYYCNEAKKRGAIVYAIGIR-------------------VIRSHEFLRACA--SPNSFY 346 +AK++G V I V E LR A S F+ Sbjct: 216 GFTAARKAKEKGVPVSTISFGTLTGTVDLETPGGGVERVPVPVDDESLRKIANLSGGDFF 275 Query: 347 LVENPHSMYDAFSHIGKDI 365 + + +S + K I Sbjct: 276 TASSLDELNKVYSTLQKQI 294 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 76/219 (34%), Gaps = 50/219 (22%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ SI ++ + ++ +++ + GL+ F Sbjct: 90 DLMLVVDISPSMDEQDMVLQGRSINRLQAVKRVLDDFIDQ-------REGDRLGLILFGT 142 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + L G +T + + ++ E Sbjct: 143 EPYVQAPLTFDRETVRTLLFEAGLGMAGRATAIGDAIGLSVKRL-----------RERPQ 191 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRS-------- 332 +++++ +TDG N + D+ + A+ G +Y IGI R Sbjct: 192 EQRVVILLTDGANTAGQVSPDKAT----EIAQAAGVRLYTIGIGADTMIQRGLLGSRRVN 247 Query: 333 ------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L A + ++ + + + I + Sbjct: 248 PSRDLDEELLTRMAEQTGGRYFRARSLPELEMIYDSINQ 286 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+V+D+S SM+ + I ++ + ++ +E+ + G+V F + Sbjct: 98 DMMLVVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFVEK-------RQGDRLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++I + G T G+ D Sbjct: 151 HAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + L AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 258 SDLDEQTLTKIAEMTGGKYFRARDAKELETIYDTINQ 294 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/159 (18%), Positives = 59/159 (37%), Gaps = 18/159 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ +D+S SM ++ I ++ + + + + GL+ F ++ Sbjct: 85 DLMLAVDISGSMRVEDMQVGNRMARRIDAVKQLGSDFMSRRSGD---RLGLILFGSRAYL 141 Query: 233 FFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + +QR + + G T + A ++ E ++ Sbjct: 142 QSPLSFDIQTVQRFLLEAQIGFAGQETAIGDAIGLAVKRL-----------QERPASSRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +V +TDG+ T L N A G +Y IGI Sbjct: 191 LVLLTDGQ--DTASTVDPLEAANLAADLGVRIYTIGIGA 227 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 76/235 (32%), Gaps = 49/235 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T V ++T +D+++ +DVS SM + D +++ + ++ Sbjct: 77 PRTVDVSTKTKTTKGIDIVMAIDVSASMLA-RDLRPNRLEALKDVASEFIQ-------GR 128 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDM 273 + G+V ++ + + S + + + T GL + N++ Sbjct: 129 PNDRVGIVLYAGESYTKTPITSDKSIVLGALNDVKFSEVLENGTAIGMGLATSVNRL--- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--- 330 + K+I+ +TDG N S D + A + G Y IGI Sbjct: 186 --------KDSKALSKVIILLTDGVNNSGTIDPKLAS--ELAVEYGIKTYTIGIGSNGMA 235 Query: 331 --------RSH------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + ++ N + + I K Sbjct: 236 LSPIGIKSNGQFQYGNQKVEIDEDLLKQIATVTGGQYFRATNNQKLEAIYEEINK 290 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 70/488 (14%), Positives = 142/488 (29%), Gaps = 161/488 (32%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + +F + G + I + +I + G+ +++ +TVL +DR+++ AA Sbjct: 29 LASFKDDESGSLVIFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAA----- 83 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 ++ + + + SF + + + +N + S+ + G Sbjct: 84 ---DLDQTQTPQAVVDDYFETAGLESFLSNVTVDQGINYKTVGAEAQSITTTAFMKMAG- 139 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + ++ V + A +++ +VLD+S SM Sbjct: 140 --------------------------VDTLNATAAGVAEERIANVEISMVLDISGSMG-- 171 Query: 188 FDSSITKIDMAIKSINAM--------LEEVKLIP---------------------DVNNV 218 S +T++ A S L V L+P + ++ Sbjct: 172 IGSKMTQLRSAATSFVNTVLSPENEDLVSVSLVPYSQHVNAGPKIYNELNTNHRHNYSHC 231 Query: 219 VQSGLVTFSNKI-------EEFFLLEWGVSHLQR------------KIKYLSKFG----- 254 V+ +S ++ +W S + +I S+ Sbjct: 232 VEMADSAYSETELDLSVTYDQMQHFQWNYSGANQLTDTICPRYSYERITAFSQDASALNA 291 Query: 255 --------VSTNSTPGLKYA-----------YNQIFDMQG-----MRQHCNTEDANYKKI 290 T G+K+A N + + +D K Sbjct: 292 QIAQLQPRAGTQIFMGMKWAAAMLDPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKT 351 Query: 291 IVFMTDGENLS----------------------------------------------TKE 304 +V MTDG+N S + Sbjct: 352 VVLMTDGKNSSSMRIKSWAYDSSSDYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAAQG 411 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 D CN +K G ++++IG V + + CAS S + + +AF I + Sbjct: 412 DALLDDICNASKDAGIVIWSIGFEVDDHGADVMANCASSPSHFFRVEGIEISEAFDAIAR 471 Query: 364 DIVTKRIW 371 I R+ Sbjct: 472 QINQLRLT 479 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/166 (20%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ LDVS SM D ++ A +++ +EE + GLV FS E Sbjct: 88 VIVALDVSGSM-CSTDVVPNRLAAAQEAVRRFVEEQDSG------TRIGLVVFSGFAELA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN-----------T 282 + R + L+ G T + + + I ++ + Sbjct: 141 VAPTTDRDAITRALDGLTT-GRGTTVGSAILKSVDAISEIDPDVAPSDPAPGTVPPPPRA 199 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I+V +TDG N + + + A +RG VY IG Sbjct: 200 PGTYAPEIVVLLTDGANTTGVTPEDA---AKTAAERGVRVYPIGFG 242 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 76/201 (37%), Gaps = 23/201 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ +LD S SM KI +KSI V + + G++TFS Sbjct: 95 PIDIAFLLDASASMGRRTWG---KIKNYVKSI------VDMGDISDQGTHVGIITFSTDP 145 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +++ ++ +++R I L + T L A +F + Sbjct: 146 VIDIPFDKYKGVKMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFTQE------AGMR 199 Query: 285 ANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + +K+ V M+DG + ++ N K +G VY +GI L AS + Sbjct: 200 EDSQKVAVLMSDGIQTKDRGPFTPTIIAANRLKMKGVQVYTVGIGASIDVLELMWIASED 259 Query: 344 S-FYLVENPHSMYDAFSHIGK 363 + + ++ + + ++ + Sbjct: 260 TGLFDGQHLELLREDLANFTQ 280 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 89/227 (39%), Gaps = 33/227 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEV 209 + PI + +NS+ + R D+++V+D S SM D K D+ + + +++ Sbjct: 66 LASPILTKNYINSKKEGR-DIVLVIDSSDSMRQMGFDPKDPYKNKFDVVKEVVADFIKKR 124 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAY 267 K + G+VTF++ L + L +++ L G T L AY Sbjct: 125 KND-------RIGMVTFADVAFIASPLTFEKDFLTNITEMQKLGMAGKRTAINDALVQAY 177 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 N + + KII+ +TDG +N+S + + R +Y IG Sbjct: 178 N-LMSKSKAKS----------KIIILLTDGRDNMSKIPLSDVKHMIEK---RDVKLYTIG 223 Query: 327 IRVIRSH--EFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKR 369 I R + ++L+ A Y + + + I K VTK Sbjct: 224 IGGPRDYDAQYLKTLAKAGKGQAYAARSAAMLSKIYDEINKLEVTKL 270 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 71/198 (35%), Gaps = 28/198 (14%) Query: 174 MMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 MMI +D+S SME+ F +I + + A+L+ L D + + GL+ F Sbjct: 95 MMIAVDLSGSMEARDFVDPQGEILRRVDGVKALLQSFLLKRDSD---RIGLIAFGENAYL 151 Query: 233 FFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ + G T + A N + K+ Sbjct: 152 QAPFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAVNHF-----------EQSEVENKV 200 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACA-------S 341 ++ +TDG T + L + A ++G ++Y I I + + L + Sbjct: 201 LLLLTDG--NDTSSEFPPLDAAHYAGEQGVVIYPIAIGDPKNVGEDSLDIATLERIADLT 258 Query: 342 PNSFYLVENPHSMYDAFS 359 + ++ S+ + + Sbjct: 259 QGRVFEADDGQSLIEVYK 276 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 42/300 (14%), Positives = 109/300 (36%), Gaps = 30/300 (10%) Query: 44 FFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF 103 K L M + T + ++ + + + + + ++ + F ++ + Sbjct: 55 MLSKHSLGMMAMSAPAPMPTSLPSQLDRDKYEDVDTNPVKLVSEDPVST-FSIDVDTASY 113 Query: 104 VN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIV 155 N + ++ ++ I + Y + +P PW + + Sbjct: 114 SNVRSFLNDGNLPPKDAVRIEELINYFDYDYPIPASKDVPFATHVNVVPAPWAEGKQLMH 173 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + I ++ L++ +++DVS SM K+ +A K++ +++++ + Sbjct: 174 VGIKG-YDLDRTEQPPLNLTLLVDVSGSMNHE-----DKLPLAKKALKLLIDKMDEDDHI 227 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + VV +G + + S + + LS G ST GL+ AY+ + + Sbjct: 228 SVVVYAGAAGTVLEPTKGSE----KSKIFAALDNLSA-GGSTAGGEGLRLAYS-LAEQNY 281 Query: 276 MRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ +TDG+ N+ D++ + ++ G + +G ++ Sbjct: 282 DAASVNR--------VMLLTDGDFNVGVTSDERLEDFVARKRESGVYLSVLGFGRGNYND 333 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 68/164 (41%), Gaps = 27/164 (16%) Query: 169 DARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D+M +LD S S+ S ++ + I A+L++ + PD ++ G+ F+ Sbjct: 850 HALVDLMFLLDSSSSVGRSNWN-------LLINFTVALLDKFVISPDD---MRVGVARFN 899 Query: 228 NKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + G +S L++K++ + G T + N ++ M H Sbjct: 900 RHFDRDSEILIGNYSNISELRQKLRRMPYRGRGTLTG-------NALWHMNNHSLHAPGN 952 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +IV +TDG + L N K++ +Y +G+ Sbjct: 953 RPGVPDVIVVITDG-----LASDEVLRAANALKEQDVKMYVVGL 991 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 68/197 (34%), Gaps = 22/197 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++ +D S SM S + D L+ G+V F Sbjct: 81 DLVLAIDASGSM---AQSGFDEKDRFKTKYETTLDLSADFIKHRFDDNMGVVIFGTFAYT 137 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + ++ +K + G ST L A + + K Sbjct: 138 ASPLTYDLEAMESMLKMTTVGIAGESTAIGDALMQAMRTLSYGEAQS-----------KA 186 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACA--SPNSFY 346 I+ +TDG + + + ++ +AK++G +Y IG+ ++ L A S Y Sbjct: 187 IILLTDGYHNAGRSSPKAAVA--KAKEKGIKIYTIGVGKSSDYDAALLDTIAKESGGKSY 244 Query: 347 LVENPHSMYDAFSHIGK 363 + + + + I K Sbjct: 245 AAASAAQLKEVYKEIDK 261 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 71/198 (35%), Gaps = 33/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SM + ++ L V+ + + G+V FS+ Sbjct: 306 LVLVLDVSGSMAG---------EKLSMAVAGALALVESAAPED---RLGVVVFSSGHRVL 353 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + +++ + L + G T + A + + G R K Sbjct: 354 FPPRPMTAQAKKEAESLLLSLRAGGGTVLGGAFREAVRLLHGVPGER-----------KA 402 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 ++ +TDG KE + A+ G V A+ + FL+ A FY Sbjct: 403 VLVLTDGLIADAKE-----PILDLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQA 457 Query: 349 ENPHSMYDAFSHIGKDIV 366 +P + F G+++ Sbjct: 458 PSPRELPRLFLREGQEVF 475 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VAPTTDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|221127586|ref|XP_002157796.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 449 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 80/215 (37%), Gaps = 22/215 (10%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + I + + D +D++ LD S+S++ +KI K+I Sbjct: 6 MLLFLVVFTINCVGRTEAQTEDVCDVPIDLVFALDASQSIKK---DGFSKIKEFSKTIIN 62 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPG 262 I + + V G + FS E F L+ + + +I + G T + Sbjct: 63 NF----KIDEFHTHV--GALVFSEIAEVQFRLDELFEKQDIFDRIDKIPYPGYRTATDDA 116 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L+ A +F + G ++++ + DG E+ S K++G + Sbjct: 117 LRVANKYMFSLSG------GARQGVPQVLILVNDGRCTVCSENVSSAS--APLKEKGVSI 168 Query: 323 YAIGIR-VIRSHEFLRACASP--NSFYLVENPHSM 354 + +GI + E L + P + F+ VE + Sbjct: 169 FCVGISKTVDKKELLSIASEPAEDHFFYVETIDEL 203 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 67/197 (34%), Gaps = 19/197 (9%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--S 227 +D+ + D S S+ + K + V + + + + Sbjct: 267 TPVDVALAFDSSSSVGELAYEEMKKFAH---------QVVDSFSISQQNARFAALVYGSN 317 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E F+ +++ I+ LS +T L+ A + +F +QG + Sbjct: 318 ASVEFNFVRYDSALEIKQAIQSLSYLKSNTRIDKALEVAKSDLFSLQG------KVRSRR 371 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 I+ DG + D +S+ K G + AIG+ + L+ + N+ + Sbjct: 372 PMILYVFFDGTVTRSMSDLESVV--QPLKDYGVKIIAIGVGPEVNRYQLKKISEDNAIFS 429 Query: 348 VENPHSMYDAFSHIGKD 364 ++ + I + Sbjct: 430 GKSFKELAPLLYSIVEQ 446 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 70/198 (35%), Gaps = 21/198 (10%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 N + S K P + + + +V++ +D++ V Sbjct: 31 SNDSNNATLDFTITSSQLEYAKP----PNGDAQGRLDVTLIPKGRVDNIVRPPIDVVFVF 86 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLL 236 DVS SM K+ A ++ + ++ K + N+ + L+ FS+ ++ + Sbjct: 87 DVSGSMTPL------KLQSAKYALQSAVDYFKANANPND--RFALIPFSSDVQYNKVVPF 138 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 G +++ ++ ++ + G Y + D KK I+F+TD Sbjct: 139 PTGAYDVKQHLERIANVANDLRAYGGTNY------TQSLQQAQSFFNDPTRKKYIIFLTD 192 Query: 297 GENL-STKEDQQSLYYCN 313 G S ++ + C Sbjct: 193 GMPTVSIAKEPITYKVCE 210 >gi|198435757|ref|XP_002131969.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 594 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 63/168 (37%), Gaps = 25/168 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A LD++++ D S S+ ++ K + E V ++++ G ++ + Sbjct: 398 AILDLVLIFDSSSSVG---------LENWKKLMKFCAEIVGSFTIGRDLMRVGAFRYNQR 448 Query: 230 IEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ + G L+ K+ + G T + L +AYN + G Sbjct: 449 VDTATEVLLGEIDTFDELKTKVHKIPYNGSGTRTGNALLHAYNHSLNAPG-------NRP 501 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + I++ TDG + + RGA + +GI+ R Sbjct: 502 NVRDIVLVFTDG-----VSHDDVIEPARLLQSRGADINVVGIKNSRGR 544 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 68/219 (31%), Gaps = 37/219 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S+SM + D ++ A ++ + ++ + GLV F+ Sbjct: 92 VVLVIDMSQSMRA-TDVPPNRLKAAEEAASQFAAQLTPGIN------LGLVGFAGTPYLL 144 Query: 234 FLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 Q I L K F ST + + A + + D I Sbjct: 145 VPP---TPQHQATIDALKKLDFADSTATGEAIFTALHAVSATA-----ITGGDTPPPARI 196 Query: 292 VFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRSH-------------- 333 V ++DG A+ G + I Sbjct: 197 VLLSDGRENKPSNPSDPHDGVYTAARLARDEGVPISTISFGTKTGEIEMDGQRVAVPVST 256 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ A S Y N + +++ I KDI + + Sbjct: 257 DQMKTIAKLSGGQSYTAGNLAELNKSYNAIEKDIGYRTV 295 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 67/219 (30%), Gaps = 37/219 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+D+S+SM + D ++ A ++ + ++ + GLV F+ Sbjct: 92 IMLVIDMSQSMRA-TDVEPNRLKAAEQAASQFASQLTPGIN------LGLVGFAGTPYLL 144 Query: 234 FLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 Q I L K F ST + + A + I D I Sbjct: 145 VPP---TPQHQATIDALKKLDFADSTATGQAIFTALHAIGATA-----VTGGDNPPPARI 196 Query: 292 VFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRSH-------------- 333 V ++DG AK G + I Sbjct: 197 VLLSDGRENKPSNPSDPHDGVYTAARLAKDEGVPISTISFGTKGGEIEMDGQRVAVPVST 256 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ A S Y N + +++ I +I + + Sbjct: 257 DQMKTIARLSGGQPYTATNIGELNKSYNAIENEIGYRTV 295 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 66.3 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ K ++ + + K + G+V F + Sbjct: 98 DLMLVVDLSGSMQQEDMELNGEYIDRLTAVKKVLSDFVAKRKGD-------RLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++I + G T G+ D Sbjct: 151 HAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + L AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 258 ADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQ 294 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 89/241 (36%), Gaps = 24/241 (9%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + + SL + +P +G++ S +R K L + + P SS ++ Sbjct: 25 QLLKQKKSLVPLKMPSTKGFNNSIYNRLKPMLFILRLLSISALIIGLARPQNSSENTKTE 84 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +D+++ +DVS SM + D S ++ + + ++ N+ + GLV ++ Sbjct: 85 TTKGIDIVMAIDVSSSMLAK-DLSPNRLVALKEVASNFIK-----DRPND--RIGLVVYA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + + +K + T GL + N++ + Sbjct: 137 GESYTKTPITSDKRLVLEALKEIKYGALTDGTAIGMGLATSVNRL-----------KDSK 185 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K+I+ +TDG N + + + A + Y IG+ + +P+ Sbjct: 186 ALSKVIILLTDGVNNAGFIEPATA--AELAVEYDIKTYTIGLGT-NGNALSPIAYNPDGS 242 Query: 346 Y 346 + Sbjct: 243 F 243 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 74/203 (36%), Gaps = 18/203 (8%) Query: 173 DMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+MIVLD+S SM E+ F K + + +L++ + + + GL+ F + Sbjct: 99 DVMIVLDLSGSMAETDFSPDPGKSLSRLDAAKEVLKQFAATREGD---RLGLILFGDAAF 155 Query: 232 EFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + Q ++ ++ G ST+ + A ++F+ + A +K Sbjct: 156 LQAPFTADLETWQTLLQETDVAMAGQSTHLGDAIGLAI-KVFNNSDRHGQQDQNSAKREK 214 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---SHEF----LRACA-- 340 + + +TDG T A G ++ I + L+ A Sbjct: 215 VAIILTDG--NDTGSFVSPRDAARVAAVNGVRLHTIAMGDPATVGEQALDLDTLQQLATL 272 Query: 341 SPNSFYLVENPHSMYDAFSHIGK 363 + + + + A+ IG+ Sbjct: 273 TGGQLFQALDEAQLTRAYQVIGE 295 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 90/284 (31%), Gaps = 34/284 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G +L ++ L G + +S ++ K+ L ++D ++ A + Sbjct: 14 FARDRGGNFAVLFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGV- 72 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 +K D +K + + V IV T+ ++ Sbjct: 73 ------IKEADADNSVKAFLVANSAAGILQPDQVVLDKLIVDKTA-----------KTVQ 115 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A + L F F M ++ +D +++ ++LD++ SM Sbjct: 116 ANVHVDVALYFPLFGIG-------DMQRVAASTTALYSDKTVEVAMMLDITGSMAKR--G 166 Query: 191 SITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 ++ KI + ++ ++ ++ +V +++ + L V + Sbjct: 167 NVDKIGDLRAAARNAVQTMLQNQDPKRPRIRVAIVPYASGVN-AGKLAENVYAETQGSSE 225 Query: 250 LSKFGVS----TNSTPGLKYAYNQIFD-MQGMRQHCNTEDANYK 288 L S + L +++ + H + K Sbjct: 226 LPPVAGSSLLVAKTGKALLPSFSDYISIVGAAMPHPDNCTTERK 269 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 45/172 (26%), Gaps = 38/172 (22%) Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY--------NQIFDMQGMRQHC 280 E L L I G T +++ Y I ++ Sbjct: 313 PDAEVIPLTADSDALLDSIDDFRAAGY-TAGAIAIQWTYYMLSPQWRAAIKNVGLGNGAS 371 Query: 281 NTEDANYKKIIVFMTDGE-------------NLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + K+ + MTDG+ + C K G ++ IG Sbjct: 372 DANAKKIAKVAILMTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGIEIFTIGF 431 Query: 328 ----------RVIRSHEFLRACASPN------SFYLVENPHSMYDAFSHIGK 363 ++ L+ C+S + ++ + AF I + Sbjct: 432 DLNDKDMSATERDQAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEIIR 483 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 85/236 (36%), Gaps = 25/236 (10%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 I F R +P S+K + +IV+D S SM KI Sbjct: 4 IKASFRMGKDSIIEGREDTIPFVLSIKGVYSAHPPIAFLIVIDTSYSM------DGEKIF 57 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG 254 A ++ +L+ ++ + V G+ F+ K + ++R I L G Sbjct: 58 RAKQAALGLLDILR----DKDYV--GVYGFAGKFYKVLEPVPATKRGEVERAIISLK-LG 110 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCN 313 TN LK ++ ++ + + I+F+TDGE K++ ++ L Sbjct: 111 SGTNIYDTLKK------LVEETKKVLQNGALSLVR-IIFITDGEPTVGKKNPKKILEMAK 163 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + ++ GA IG+ + + L A F + +P S+ S K Sbjct: 164 KLREAGASALIIGVGTEYNEKLLSRMAMALNGEFEHISDPASLEKLISEYAKSTQE 219 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 39/276 (14%), Positives = 89/276 (32%), Gaps = 64/276 (23%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + +G +T+L + ++ G+ I++ + L + +DR+++ AT + Sbjct: 13 RFAKDEEGTVTVLAFAIFVMFLVMGGIGIDMMRQEMARASLQATLDRAVLAGATAV---- 68 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 I++ + S +++ DID + S+ + + Y Sbjct: 69 --------NNATARAVIEDYFAKSGQSDYLAAQEAGDIDIRLNSSKVTARATQTLDTY-- 118 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + S+ +V +L++ + LDVS SM Sbjct: 119 ---------------LMRLAGVDTLTSAGNSTAEVTI---PKLEIAMALDVSGSM----- 155 Query: 190 SSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 +ID + ++ + ++ + WGV+ + + Sbjct: 156 -IGARIDALKPAAIEFVDSILDSTEP------------NDAVISVVPFSWGVTPSKEIYE 202 Query: 249 YLSKFGVS-------------TNSTPGLKYAYNQIF 271 L+ T++T AYNQ+ Sbjct: 203 ALTVNETHKYSSCLELNDSHFTDTTIDPNTAYNQLI 238 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 4/57 (7%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRACASP-NSFYLVENPH 352 +K+D++ C EAK G ++Y I + ++ CAS N Y + Sbjct: 509 NRSKKDERLDDICREAKSEGIVIYTIAFEMGSQPTGADKIKKCASSVNHHYNATTVN 565 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 87/256 (33%), Gaps = 52/256 (20%) Query: 139 LKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 +K F+ I++ P T + ++T +D+++ +DVS SM + D + Sbjct: 54 IKPLLFVFRILALASIIVAMARPQTEDISTRTKTTKGIDIVMAIDVSSSMLA-RDLKPNR 112 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--K 252 + + + + + GLV ++ + + S + ++ ++ + Sbjct: 113 LSALKEVAADFIRQRPND-------RIGLVAYAGEAFTKTPITSDKSIVLNSLREITYGQ 165 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T GL + N++ E KII+ +TDG N S + Q+ Sbjct: 166 LNDGTAIGMGLATSVNRL-----------KESKAISKIIILLTDGVNNSGFIEPQTA--A 212 Query: 313 NEAKKRGAIVYAIGIRVIRS-----------------------HEFLRACA--SPNSFYL 347 + A + G Y IG+ + + L A + ++ Sbjct: 213 DLAVEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLEGIAETTGGKYFR 272 Query: 348 VENPHSMYDAFSHIGK 363 + + + I K Sbjct: 273 ATDNEKLEAIYDEINK 288 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSM + D +++ A + L ++ +V+ LV F+ + Sbjct: 91 VVVAIDVSRSMVA-TDVEPSRLSAAKTAAKDFLGDLPPRFNVS------LVKFAASAQVV 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I L ST G+ + N + + +H IV Sbjct: 144 VPPTPDRAAVSTAITNLQVL-PSTAIGEGIYSSLNALKLVPDDPKH---PGQKPPAAIVL 199 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------------EFLRA 338 ++DG +T + SL EA ++ VY I + E Sbjct: 200 LSDG---ATNVGRPSLEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPVNHYELAAI 256 Query: 339 C-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + E+ + D + I + + +++ Sbjct: 257 AKASGGEKFSAESLGQLSDVYKSIAQSVGYEKV 289 >gi|74205761|dbj|BAE23197.1| unnamed protein product [Mus musculus] Length = 751 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 70/180 (38%), Gaps = 21/180 (11%) Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSK 252 + I + + + + I + Q +V F++ F L + L I+++S Sbjct: 3 FNKIINFLYSTVGALDKI--GADGTQVAMVQFTDDPRTEFKLDSYKTKETLLDAIRHISY 60 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G +T + +K+ + +F + K+IV +TDG + Sbjct: 61 KGGNTKTGKAIKHVRDTLFTSDSGTRRG------IPKVIVVITDG-----RSQDDVNKIS 109 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E + G ++AIG+ E ++ + S + V++ +DAF I +++T Sbjct: 110 REMQADGFNIFAIGVADADYSELVQIGSKPSSRHVFFVDD----FDAFKKIEDELITFVC 165 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 77/214 (35%), Gaps = 36/214 (16%) Query: 166 SQTDARLDMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++T A D+++ +D+S SME+ ++T++D + ++ L E + Sbjct: 87 TKTIASRDLLLAVDLSGSMETKDFKNKSGENVTRLDSVKEVLSEFLAE-------REGDR 139 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV F + + Q + + G T + + + IFD + Sbjct: 140 VGLVFFGSAAFIQMPFTEDLEICQELMDEAQVRMAGPQTMLGDAIGLSIS-IFDQSELE- 197 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 K+++ +TDG T A+ +G +++ + + + Sbjct: 198 ---------DKVLILLTDG--NDTGSLVAPEKAAQIARDKGIVIHTVAVGDPAAAGEQAL 246 Query: 333 -HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 LR+ +S +Y N + + I K Sbjct: 247 DEATLRSISSLTKGKYYWAGNREELAGIYDEIDK 280 >gi|73980138|ref|XP_540147.2| PREDICTED: similar to vitrin isoform 1 [Canis familiaris] Length = 649 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 90/270 (33%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ + + V N YS++ S + + + ++ + T Sbjct: 401 EGATENEKQYVTEPNFSNKAVCRTNGFYSLNVQSWFSLHKTVQPLVKRVCDTDRLACSKT 460 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ ++ + + +E ++ Sbjct: 461 CL--------NSADIGFVIDGSSSVG------TGNFRTVLQFVANLSKEFEISDTD---T 503 Query: 220 QSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G V ++ ++E F S + IK + + T++ + YA Q+F Sbjct: 504 RIGAVQYTYEQRLEFGFDDYNTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF------ 557 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 558 ---KKSKPNKRKLMILITDG-----RSYDDVRIPAMVAHHKGVITYAIGVAWAAQDELEV 609 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ +Y + + I Sbjct: 610 IATHPANDHSFFVDEFDHLYKFVPKVIQSI 639 >gi|27806781|ref|NP_776396.1| vitrin precursor [Bos taurus] gi|75054534|sp|Q95LI2|VITRN_BOVIN RecName: Full=Vitrin; Flags: Precursor gi|17941422|gb|AAL18262.2| vitrin [Bos taurus] Length = 652 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 93/270 (34%), Gaps = 35/270 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + N+ ++ + V N YS++ + + + + ++ + T Sbjct: 404 EGASENEKQYMLEPNFANKAVCRTNGFYSLTVQNWFSLHKTVQPLVKRVCDTDRLACSKT 463 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 D+ V+D S S+ + ++ + + E ++ Sbjct: 464 CL--------NSADIGFVIDGSSSVG------TSNFRTVLQFVANLSREFEISDMD---T 506 Query: 220 QSGLVTFSNKIEEFFLLE-WGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G + ++ + F + + S + IK + + T++ + YA Q+F Sbjct: 507 RIGAMQYTYEQRLEFGFDEYSTKSDVLNAIKRVGYWSGGTSTGAAIHYALEQLF------ 560 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 N +K+++ +TDG + A +G I YAIG+ E Sbjct: 561 ---KKSKPNKRKLMILITDG-----RSYDDIRIPAMLAHHKGVITYAIGVAWAAQDELDI 612 Query: 338 ACASP--NSFYLVENPHSMYDAFSHIGKDI 365 P + + V+ ++Y + ++I Sbjct: 613 IATHPARDHAFFVDEFDNLYKVVPKVIQNI 642 >gi|326932352|ref|XP_003212283.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Meleagris gallopavo] Length = 450 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 71/201 (35%), Gaps = 29/201 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++++LD S S+ + S + + M + ++ N VQ+ ++ S Sbjct: 36 DILLLLDSSGSVSYYEFSKVKEF---------MWDLMQPFTFGPNDVQTSIIHISTTPTM 86 Query: 233 FFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L G +Q+ I+ + TN+ L +A ++F + Sbjct: 87 EFPFDRYLSRGT--VQQAIRNTRQLMGDTNTGKALSFAKEKLFSND------AGARPDVP 138 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFY 346 K++V++TDG D S K G V+ + E L A AS P Sbjct: 139 KVLVWVTDG----FSSDDIS-EPMQLLKDMGVTVFIVSTGRGNYLE-LSAAASQPPEKHL 192 Query: 347 LVENPHSMYDAFSHIGKDIVT 367 + + + ++ Sbjct: 193 HFVDVDDLPIITKELRDAMLD 213 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 72/183 (39%), Gaps = 27/183 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P K+N + + I +DVS SM++ D T+++ A + ++ Sbjct: 74 LAGPQWGVEKINVTAQSSHSV-IAVDVSDSMKA-RDLKPTRLENAKTMLKMLI------- 124 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQI 270 ++G+V F++K + V L+ + L T P ++ A + Sbjct: 125 SAKGEQRTGIVAFTSKAYTQCPITNDVEALKYFVNQLRPEMLNAKGTALAPAVQRAAEML 184 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G KK ++ +TDGE+ ++ ++++ A+K G + A+GI Sbjct: 185 SKYPG------------KKALILLTDGEDHEPEQIEEAIK---TAQKEGIKIIAVGIGTE 229 Query: 331 RSH 333 Sbjct: 230 EGE 232 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 75/219 (34%), Gaps = 31/219 (14%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 +H + + K D D+ I+ D S S+E F + Sbjct: 365 NLEAKKHTEDFVKTPEKRGPVKDCIYDIGIIFDSSGSLEKNFQKQLAFAKQL-------- 416 Query: 207 EEVKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 V+ +P +N + G+V F+ K+ S L+ I + +T + LK Sbjct: 417 --VEQMPISDNATRVGIVQFAGKTKVRVLANFSQNKSQLKTIIDRSPFYSGTTFTNQALK 474 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 M + + +A K ++ TDG + + + K +G +VY Sbjct: 475 K-------MAALYEESKRPNAKLK--LMLFTDGYSAEDTSEGE-----EALKSQGVVVYT 520 Query: 325 IGIRVIRSHEF----LRACA-SPNSFYLVENPHSMYDAF 358 +GI +S LR A S +Y + + F Sbjct: 521 VGISTDKSAGLNMKELRGMATSSEHYYDSSDFADLLKHF 559 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 84/223 (37%), Gaps = 34/223 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + D+M+V+D+S SM S I+++D + + + K Sbjct: 92 EPQVREQLGRDVMVVVDLSGSMAEQDFTSKQGDKISRLDATKEVLADFAKTRKGD----- 146 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYA---YNQ 269 + GL+ F + F+ + ++ L++ G ST+ + A + Q Sbjct: 147 --RLGLILFG---DAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLATKVFEQ 201 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 Q + + +K+++ +TDG T + + AK +G ++ I + Sbjct: 202 SEKQSAAVQDSSIDANVKEKVVIVLTDG--NDTGSFVEPIDAAKVAKAKGVRIHVIAMGD 259 Query: 330 IRS-------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++ E ++ A S + N + A++ IG+ Sbjct: 260 PQTVGEVALDMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 302 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 66.3 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 81/223 (36%), Gaps = 38/223 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S SM D T++D A ++ ++E++ +Q GLVTF+ + Sbjct: 102 VVLVVDISLSMVCD-DVRPTRVDAARQAAIKFVDEMEP------TLQLGLVTFAGTAQTL 154 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ + K T + G+ A QI + + + + Sbjct: 155 IAPSSDHEIVKHALDEAIRPDKLAARTATGEGIYTALQQIETLSSILGGKSKAPSAR--- 211 Query: 291 IVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------------VIR 331 IV +DG+ + + EAK + +Y+I V Sbjct: 212 IVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTTRPIPYVNIQGSRVPVPA 271 Query: 332 SHEFLRACA--SPNSFYLVENPHSMYDAFS----HIGKDIVTK 368 L+ A S F+ + + D +S IG D+V + Sbjct: 272 DDASLQKVAELSGGKFFTAGSLDQLSDVYSSLNAEIGYDLVKQ 314 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 65.9 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 71/200 (35%), Gaps = 27/200 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + +++ +VLD S SM KI A ++ + + + LV Sbjct: 60 TAQRPPVNLALVLDRSGSMSGN------KIAKAREAAIEAVRRLSDGDLFS------LVV 107 Query: 226 FSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + +E + ++ +I+ + G + L A +Q +R+H + Sbjct: 108 YDDSVETLVPAQPVSDIGDIEARIRRIRPGGST-----ALFGAVSQ--GAAEVRKHSDAP 160 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 N +V ++DG N+ K G V +G+ + + + A Sbjct: 161 YVNR---VVLLSDGLANVGPSRPADLARLGAALLKEGISVTTVGVGTDFNEDLMTQLAER 217 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S + Y VE+ + F+ Sbjct: 218 SDGNHYFVESSRDLPRIFAA 237 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 65.9 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 73/216 (33%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGNVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ + RQ Sbjct: 153 RAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL---REQRQG------- 202 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I S+ Sbjct: 203 -QRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGD 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEDGLRKIAEQTGGRFFRARDTEELAGIYAELDR 295 >gi|198426244|ref|XP_002124558.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 430 Score = 65.9 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 AR D +IV+D S S+ +D I I ML + + ++ G ++ Sbjct: 235 ARTDAIIVVDSSSSVKRPNWDK-------MIAFIVNMLRQFTIDQSS---LRIGAFRYNR 284 Query: 229 KIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ + L I+ L G TN+ + + N + + N Sbjct: 285 RVHTDTQILLNQFNNDKDGLLAAIQSLPYNGGGTNTGRAIAHVTNVMLKAE------NGN 338 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + ++V +TDG + + + GA+V+ + + + S Sbjct: 339 RPDVQDLVVLITDGR-----AQDRVDLVSADLRATGAVVFVVAVILPGS 382 >gi|156367148|ref|XP_001627281.1| predicted protein [Nematostella vectensis] gi|156214186|gb|EDO35181.1| predicted protein [Nematostella vectensis] Length = 166 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 62/177 (35%), Gaps = 24/177 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFS 227 A LD++ ++D SRS+E + + +M+ ++ ++ V+ GLV + Sbjct: 2 KAPLDVVFLVDGSRSVERQGVGNF-------RRELSMIRDMSAGFIISRTNVRFGLVVYG 54 Query: 228 NKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L + + + L+ AY ++ Sbjct: 55 TRPRVVFGLNGFRNNGGLFNALNRPIKNPQTGSRIGLALRAAYTRVLARS--------PR 106 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 KIIV + DG + N + RG ++AIGI + L AS Sbjct: 107 RGATKIIVVLADGR-----SEDDVRRPSNALQARGVKIFAIGIGRYINGRQLDQLAS 158 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 72/218 (33%), Gaps = 46/218 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D ++++++ + LE + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWQDEDVSRLNLVKHLLGDFLEH-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + ++ + + G +T + A ++ + Sbjct: 144 KAYLQAPLTFDRHTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 ++++ +TDG N + + A + G +Y IGI Sbjct: 193 QSRVLILVTDGANNG--GEIAPITAARLAAEEGVKIYPIGIGADPEQSATLGVLGINPSL 250 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+ A + ++ + + A + Sbjct: 251 DLDEPALKELAQVTGGRYFRARDGQEL-QAIKQTLDQL 287 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 71/194 (36%), Gaps = 22/194 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D LD++ VLD S S+ + ++N + + V+ N Q GL+T+S Sbjct: 181 DTNLDVVFVLDQSGSIGYYNH---------QLALNFLSKVVEFFKIGANKTQVGLITYST 231 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F L S + +I + G T + GL A + + Sbjct: 232 HAYVQFDLNDYHSKSTILNRISRIYYTGGWTATALGLFQA--GVILNPQQMRGARPISQG 289 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 ++++ +TDG + D G VY +G+ I E L+ AS Y Sbjct: 290 VPRVVILLTDGRSNRVPID----EVAPSLHDFGIQVYTVGVGNIYLPE-LKFIASDPDPY 344 Query: 347 LVENPHSMYDAFSH 360 + + D+FS Sbjct: 345 HI----FLLDSFSD 354 >gi|74214868|dbj|BAE33446.1| unnamed protein product [Mus musculus] Length = 1169 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 80/205 (39%), Gaps = 30/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 152 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 207 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 208 ISTSSPLS--------LLGSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACA---SP 342 +++ +TDG S+ A+ I YAIG+ S + L+A A S Sbjct: 254 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGKAFYNEHSKQELKAIASMPSH 311 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVT 367 + VEN ++ D + + + I Sbjct: 312 EYVFSVENFDTLKDIENQLKEKIFA 336 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 79/223 (35%), Gaps = 46/223 (20%) Query: 163 KVNSQTDARLDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +MM+ +D+S+SM + + ++ + +EE K Sbjct: 79 EPKALQQTDRNMMLAVDISKSMLEEDMQYQGRLVNRLQTVKAVVTDFVEERKGD------ 132 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F + L + +S ++R + + G T + ++ D+ Sbjct: 133 -RLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVGLAGNKTAIGDAIGLGVKRLQDLPES 191 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----- 331 ++++ +TDG+N + + + L A+K G +YAIGI Sbjct: 192 N-----------RVLILLTDGQNTA--GEIEPLKAAELAEKAGVKIYAIGIGADEMVIQG 238 Query: 332 -------------SHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + L A A + +Y N + + + Sbjct: 239 FFGPRRVNPSRDLDEDTLTAIAENTGGQYYRARNVNELEQIYD 281 >gi|74213162|dbj|BAE41718.1| unnamed protein product [Mus musculus] Length = 1153 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + I+++D+ + L++ + GL+ F + Sbjct: 44 DLLVAVDVSGSMDFPDMQWKNEDISRLDLVKALLGDFLQD-------REGDRVGLILFGS 96 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 97 QAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 145 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N L A + G +Y IGI Sbjct: 146 QSRVLVLITDGANNG--GQIHPLTAARLAAQEGVRIYTIGIGA 186 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 51/328 (15%), Positives = 107/328 (32%), Gaps = 37/328 (11%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL--CRIKNTWNMSFRNELRDNGFVN 105 LH D +L A N N ++ + + +F ++ + N Sbjct: 32 ENLHGAPDSALPATAVAEENLSLTENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYAN 91 Query: 106 -------DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + ++ ++ Y + R PW ++ I + I Sbjct: 92 VRRFLNSGKQPPKDAVRIEEIINYFPYNYPLPTDGRPFAVHTETIDSPWQPEAKLIKIGI 151 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +++ ++DVS SM+ K+ + K++ + ++++ V Sbjct: 152 QA-QDTAKKDLPPANLVFLVDVSGSMDEE-----NKLPLVQKTLRILTQQLRPQDKVT-- 203 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+T+S+ E G + + I L G ST+ L+ AY + Q Sbjct: 204 ----LITYSSGEELVLPPTSGSDKETILKAIDKLKAEG-STSGESALRMAYEE---AQKA 255 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEF 335 I+ TDG+ D ++L E +K G + +G +E Sbjct: 256 FVPNGINR------ILLATDGDFNVGVSDTETLKSMVAEKRKTGISLSTLGFGTDNYNED 309 Query: 336 LR---ACASPNSFYLVENPHSMYDAFSH 360 + A A ++ ++N Sbjct: 310 MMEQIADAGDGNYSYIDNEKEAKKVLQQ 337 >gi|148685680|gb|EDL17627.1| integrin alpha M, isoform CRA_b [Mus musculus] Length = 1168 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 145 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 201 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 202 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 246 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 247 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 304 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 305 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 335 >gi|74212839|dbj|BAE33379.1| unnamed protein product [Mus musculus] Length = 967 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|124956|sp|P05555|ITAM_MOUSE RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: CD_antigen=CD11b; Flags: Precursor gi|52983|emb|CAA30479.1| unnamed protein product [Mus musculus] Length = 1153 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|308493174|ref|XP_003108777.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] gi|308248517|gb|EFO92469.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] Length = 425 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 52/314 (16%), Positives = 110/314 (35%), Gaps = 40/314 (12%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + ++ +G + K G + + NTW + ++ ++ DI +++ Sbjct: 133 IPTLVMVVTDGRSADDPKGPGKLLQAQ-PNTWVFAAATGDPESVDTRELMDITG--NINH 189 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 +V+ + + R + + T++ N T LD+++VL Sbjct: 190 IVMHRGRDLATDITRRLLREAQ-----DKCRTTTTTTTTTTTTTTTNPITGCELDLVLVL 244 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----NKIEEFF 234 D S + + ++S ++ ++K+ P Q VTF+ ++ Sbjct: 245 DFSTTTDPVYNSY-------KDLSKRLVSQLKIGPH---YTQVAAVTFATVGRTRVRFNL 294 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + R I L G +T G++ A QI + +G R T K+++ Sbjct: 295 KKYTTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDESEGARPGIAT------KVMIVF 348 Query: 295 TDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------SHEFLRA-CASPNSFY 346 TDG N ++++ +A G +Y + ++E L A S + Sbjct: 349 TDGWSNKGPDPEKRAK----DAVNAGFEMYTVAYTARAPNSVTLNNETLSAISGSSGHAF 404 Query: 347 LVENPHSMYDAFSH 360 S+ D Sbjct: 405 TDVTFQSLVDKIKQ 418 Score = 42.8 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 53/136 (38%), Gaps = 24/136 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++IVLD S S++ F+ S K + + EE+++ P + ++ +S Sbjct: 34 LNVLIVLDRSDSVKGGFNKS-------RKFVTDVSEELQIGP---EKHRVAMIVYSGLSY 83 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + W + +K L G +TN+ L+ + + + Sbjct: 84 RREVFPWNFAKSNEEFVQKTNALRAIGGTTNTKKALEVGLELM----------AQRNVSI 133 Query: 288 KKIIVFMTDGENLSTK 303 +++ +TDG + Sbjct: 134 PTLVMVVTDGRSADDP 149 >gi|291225695|ref|XP_002732834.1| PREDICTED: MUscle Positioning family member (mup-4)-like [Saccoglossus kowalevskii] Length = 317 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 75/197 (38%), Gaps = 29/197 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ +D++ V+D S S+ S KI IK + + + + + Sbjct: 101 ASNGNPPPIDLLFVIDKSGSIG---QSDFNKIIEHIKELVDLF----TVEISVDKTRVSA 153 Query: 224 VTFSN--KIEEFFLL--------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +++S+ K++ F + ++ +I + G ST++T L A ++ F Sbjct: 154 ISYSSSNKVDLDFNFRRCLFESSSASKTCIKSEIDKIDFEGGSTHTTKALVKARDEAFKS 213 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + N K++ +TDG + ++ N K +YA+G+ Sbjct: 214 ------FHGSRTNSHKVLFLVTDGRSNGNGPLVET---ANSLKNDDVEIYALGVTSDVVE 264 Query: 334 EFLRACAS---PNSFYL 347 LR+ S P+ + Sbjct: 265 AELRSIVSDPIPDHLFY 281 >gi|119569860|gb|EAW49475.1| von Willebrand factor A domain containing 2 [Homo sapiens] Length = 576 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 41/184 (22%), Positives = 69/184 (37%), Gaps = 28/184 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M +LD S S+ + + + + + + P V+ G FS+ F Sbjct: 1 MFLLDGSNSVGK------GSFERSKHFAITVCDGLDISP---ERVRVGAFQFSSTPHLEF 51 Query: 235 LLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L+ + ++ +IK + G T + LKY R +A+ +I++ Sbjct: 52 PLDSFSTQQEVKARIKRMVFKGGRTETELALKYLL--------HRGLPGGRNASVPQILI 103 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVE 349 +TDG K + K+RG V+A+G+R R E L A AS L E Sbjct: 104 IVTDG-----KSQGDVALPSKQLKERGVTVFAVGVRFPRWEE-LHALASEPRGQHVLLAE 157 Query: 350 NPHS 353 Sbjct: 158 QVED 161 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 62/176 (35%), Gaps = 27/176 (15%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T + P+ + LD++ +LD S S+ + + + Sbjct: 385 TGTSTAEAPLLA---PAGCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRS 431 Query: 209 VKLIPDVN-NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKY 265 L +VN +V Q GLV + ++++ F L+ + + R I G ++ L + Sbjct: 432 CALQFEVNPDVTQVGLVVYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLH 491 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 Y+++ +Q K +V +T G + + + + G Sbjct: 492 IYDKVMTVQ------RGARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 536 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 71/186 (38%), Gaps = 22/186 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 198 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 248 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 249 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRQA-----AERGFGSATRTGQDRPR 303 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 304 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 357 Query: 349 ENPHSM 354 +P + Sbjct: 358 SDPQDL 363 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ K ++ + + K + G+V F + Sbjct: 98 DLMLVVDLSGSMQQEDMELNGEYIDRLTAVKKVLSDFVAKRKGD-------RLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++I + G T G+ D Sbjct: 151 HAYLQTPLTADRKTVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + L AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 258 ADLDEQTLTKVAEMTGGQYFRARDTDQLEKIYDTINQ 294 >gi|237689767|gb|ACQ59162.1| factor B/C2 [Larimichthys crocea] Length = 755 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 59/340 (17%), Positives = 115/340 (33%), Gaps = 39/340 (11%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 +T I + G++ R N + + + N + + R + Sbjct: 128 SGSTTICSHDAGDDCADPGIPAGASRAGNMFGIDDKVTYSCNSNLFLVGSSERVCQENGQ 187 Query: 120 VVPQNEG--YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Y + + +I F + I + K+ + L++ I Sbjct: 188 WTGMEPECYYKHTYDTPVEISQAFGSAIKNTLTKEEPLDGSPEGRKLRISKEGTLNIYIA 247 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--FFL 235 +DVS S+E + S+ T + I +++++ N +V FS++IEE L Sbjct: 248 VDVSASIEDKYISNTTSV------IKKLIQKISSFTVSPNYE---IVFFSSEIEEVVNIL 298 Query: 236 LEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA------NYK 288 W +++ I + L FG++ S+ G N +F R +++ Sbjct: 299 DFWEKKDVRKDILEKLDTFGINDRSSTG--TDLNLVFFTFWERMSFIKTRVGANAFKDHR 356 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEA-------------KKRGAIVYAIGIRVIRSHEF 335 +I+ TDG +L + ++ VY I + Sbjct: 357 HVIILFTDGAYNMGGSPAPTLARIKDMVYMNQTGEKEVNSREEYLDVYVFAIGAETFDDD 416 Query: 336 LRAC----ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L++ A F+ V+ MYDAF I + K + Sbjct: 417 LKSFTVGKAEEKHFFKVKGIDKMYDAFDDIIDENDVKGLC 456 >gi|3183041|sp|Q90615|ITA1_CHICK RecName: Full=Integrin alpha-1; AltName: Full=Laminin and collagen receptor; AltName: Full=VLA-1 gi|497990|gb|AAA59067.1| alpha 1 integrin [Gallus gallus] Length = 285 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 89/232 (38%), Gaps = 35/232 (15%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + + +LD++IVLD S S+ + + +N++L + + P Sbjct: 48 VKAVAPSVQECKTQLDIVIVLDGSNSIYPW--------ESVTAFLNSLLRNMDIGP---Q 96 Query: 218 VVQSGLVTFS-NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 Q G+V + + EF+L + + + ++ + G T + G+ A + F Sbjct: 97 QTQVGIVQYGQTVVHEFYLNTYSTTEEVMDAALRIRQRGGTQTMTALGIDTAREEAFT-- 154 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRS 332 H +K++V +TDGE+ Q+ + C +AI I R Sbjct: 155 --EAHGARRG--VQKVMVIVTDGESHDNYRLQEVIDKCE---DENIQRFAIAILGSYSRG 207 Query: 333 H-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 208 NLSTEKFVEEIKSIASKPTEKHFFNVSDELALVTIVEALGERIFALEATTDQ 259 >gi|156358451|ref|XP_001624532.1| predicted protein [Nematostella vectensis] gi|156211319|gb|EDO32432.1| predicted protein [Nematostella vectensis] Length = 180 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 66/187 (35%), Gaps = 27/187 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T +L++ V+D S S+ ++ + +K + + P Q V + Sbjct: 1 CTTKKLNLAFVVDGSGSIN---NARFGRFREFVK------KMAESFPVSATNTQVATVVY 51 Query: 227 SNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S + E F ++ ++ + + G +T++ LK+ ++F Sbjct: 52 SEEPELIFNFGKYNDINEIKTAVDNMPYHGKTTHTGKALKFTLEEVF---------KKAR 102 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N K +++ +TDG + G V+A+G+ E P Sbjct: 103 KNVKNVLIALTDG-----HASDLVKKPAQAVRDYGIEVFAVGVGSPDIAELEEIATDPDK 157 Query: 343 NSFYLVE 349 + + V+ Sbjct: 158 DHVFNVD 164 >gi|132626289|ref|NP_001076429.1| integrin alpha-M isoform 1 precursor [Mus musculus] Length = 1154 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|132626295|ref|NP_032427.2| integrin alpha-M isoform 2 precursor [Mus musculus] gi|148685679|gb|EDL17626.1| integrin alpha M, isoform CRA_a [Mus musculus] gi|162318464|gb|AAI56095.1| Integrin alpha M [synthetic construct] gi|162319590|gb|AAI56992.1| Integrin alpha M [synthetic construct] Length = 1153 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 82/225 (36%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P + +++ +T +S LD++ VLDVS SM KI+ ++ Sbjct: 35 NTAPLQESKFKVMLELTG--GDSSNDRPGLDLVAVLDVSGSMAG------DKIEKVKTAM 86 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTN 258 +++++ I + +VTFS L L++ I L+ G TN Sbjct: 87 LFVIKKLSPID------RLSVVTFSADANRLCPLRQITENSQKDLEKLINGLNADGA-TN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 T GL+ + D I+ M+DGE + + Q Sbjct: 140 ITAGLQTGLKVLSDRSLSGGRVVG--------IMLMSDGEQNAGGDAAQVPV-------G 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 VY G + L+A A +F V++ +++ AFS Sbjct: 185 NVPVYTFGFGINHEPRVLKAIAHNSIGGTFSDVQDTNNLSKAFSQ 229 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 71/215 (33%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM D I ++ K ++ ++ + GLV F++ Sbjct: 86 DLMLVVDLSYSMSQKDMLSQDDYIDRLTAVKKVVSDFAQQ-------REGDRLGLVLFAD 138 Query: 229 KIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + ++++ L G T G+ A D Sbjct: 139 HAYLQTPLTLDRNTIAKQVESLVLRLIGDKTAIGEGIGLATKTFIDSDA----------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + AKK A +Y IG+ Sbjct: 188 PQRVMILLSDGSNT--SGVLDPIEAAKIAKKYNATIYTIGVGAGEMMVKEFFMTRKVNTA 245 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L+ A + ++ N + + I Sbjct: 246 KDLDENTLQEIADLTGGQYFRARNADELATIYDTI 280 >gi|260837139|ref|XP_002613563.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] gi|229298948|gb|EEN69572.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] Length = 195 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 69/170 (40%), Gaps = 25/170 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ +LD S S+ + + + + ++ P+ V G+V ++N + Sbjct: 43 PVDLVFLLDGSGSIGD------SNFQVTKNFVATTTSDFQIGPNNAQV---GIVQYANWL 93 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQHCNTEDANY 287 E L + L I +S +G T + + Y N + + +G RQ + Sbjct: 94 YEEVSLNQYKTLDELLPAIYNISYWGGGTYTGWAIDYVVNATLTESRGARQ-------DV 146 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFL 336 K+++ +TDG+ AK+ G I+ AIG+ I E + Sbjct: 147 PKVVIVVTDGQ-----SADDVRQPALRAKQSGIIMVAIGVGSIYDGTELV 191 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 72/216 (33%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + + ++ M + +E+ + GL+ F + Sbjct: 88 DIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLTDFIEQ-------RQGDRLGLILFGD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L V + + + G +T L + + + Sbjct: 141 TAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFANKDKSN--------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHE---------- 334 +I+V +TDG+N + + + A+ G VY IG+ R Sbjct: 192 --RIVVLLTDGQNTA--GNLKPEEALLLARDAGIKVYTIGVGSDNPRGFSLFNMGGMSGD 247 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ ++ + ++ + K Sbjct: 248 TIDEGLLKRIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 70/214 (32%), Gaps = 48/214 (22%) Query: 189 DSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGV 240 +S +K+ ++N +E I D + +VTFS++ L V Sbjct: 218 PNSQSKMGALKIAVNQFAQETAKRNDSITDAAKQHRMSIVTFSSESYIRQSLKAYNSNTV 277 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 S +R I L+ G T + G++ A + +++ +K+++F TDG Sbjct: 278 SEFERTINGLNANGA-TYANLGMEKAKESLKNVREK----------AQKVVIFFTDGTPG 326 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-------------------------RVIR 331 ++ K +Y+IG+ + Sbjct: 327 RSGFDDDTANNTIQAAKSLKDDLTKIYSIGVFDQANPDNTSSSFNAYMHGVSSNYPNATK 386 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E + N + ++ + F I +++ Sbjct: 387 WTELGERAENSNYYKAAQDADELNKIFEEIFEEM 420 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 78/221 (35%), Gaps = 37/221 (16%) Query: 166 SQTDARLDMMIVLDVSRSM--ESFFD---SSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ D+++++D+S SM E F + ++++D + ++ L + K + Sbjct: 109 TKEIPTRDLLLLVDLSGSMAQEDFKNDAGKKVSRLDAVKEVLDGFLAKRKGD-------R 161 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV F + + Q + + G T + N +FD R Sbjct: 162 VGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVN-LFDEDTERA 220 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 K I+ +TDG TK + A +R +Y + I + Sbjct: 221 ----------KTIIALTDG--NDTKSKVPPVEAARVATQRDIKIYTVAIGDPTTVGEDKL 268 Query: 333 -HEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ AS ++ + + + + I T+ I Sbjct: 269 DEQSLKDVASETGGKYFFAADREHLAGIYDEL-DKIETQTI 308 >gi|326932668|ref|XP_003212436.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 499 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E + ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYHFVEHLARKFISPQ-LRMSFIVFSTR 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N +++ GA VY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRELGATVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + ++A I I+ K Sbjct: 201 VNDG---FEALQGIIDSILKKSCIE 222 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 31/215 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + D+ + +DVS SM +++ A ++N ++ + + Sbjct: 570 SGDAGTSQLEPYDIGLCVDVSGSMYG------DRLEKAKTALNTFIDAMLPQDNAC---- 619 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V+FS+ ++ + L TN+ GL + + D Sbjct: 620 --MVSFSDNAYLVAGYGASKEVMRSRTNQLRDL-YGTNTDVGLSKTISILADQ------- 669 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ K+I+ + DG+ + + AK G VY I + ++ L+ A Sbjct: 670 --GRSDASKMIIMICDGDVNYIQGTV------DAAKAAGIAVYTINVVSGD-NDLLQKIA 720 Query: 341 --SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + +Y + I + V+ D Sbjct: 721 DETGGEYYYAATTEEVVSQVEAIRGETVSAVDTTD 755 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ + ++ + + K + G+V F + Sbjct: 98 DLMLVVDLSGSMQQEDMELNGEYIDRLTAVKQVLSDFVAKRKGD-------RLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++I + G T G+ D Sbjct: 151 HAYLQTPLTADRKSVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + L AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 258 ADLDEQTLTKVAEVTGGQYFRARDTEELEKIYDTINQ 294 >gi|118101296|ref|XP_425758.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 552 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E + ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYHFVEHLARKFISPQ-LRMSFIVFSTR 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N +++ GA VY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRELGATVYCVGVKDFNETQLARIADSKDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + ++A I I+ K Sbjct: 201 VNDG---FEALQGIIDSILKKSCIE 222 >gi|326434685|gb|EGD80255.1| hypothetical protein PTSG_10931 [Salpingoeca sp. ATCC 50818] Length = 706 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 69/208 (33%), Gaps = 23/208 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 DA D++ +LD S S+ S + ++ + PD + + + Sbjct: 174 CGDAVADLLFILDGSGSVGS------GNFQTMLNFAQEVVSFFDVAPDK---TRVAAMVY 224 Query: 227 SNKIEEFFLLEWGVSHLQRK-IKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F ++ S +++ I Y F T + L +A + +F + Sbjct: 225 DSSNYRKFDFDYIQSVSKQQLINYFDTFAYPDGGTETGSALSFALSSMFVTS---RGARD 281 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ + +TDG K + G +YA+GI E + + P Sbjct: 282 LSEGVPRVAIVITDG-----KSGDDVSAPAQALRDAGVTLYAVGISGADVSELNQIASPP 336 Query: 343 --NSFYLVENPHSMYDAFSHIGKDIVTK 368 ++ ++ S I + + Sbjct: 337 VEDNVVFIDTFSEFSALASKISRANCDQ 364 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 62/164 (37%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 610 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 666 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AYNQ+ + Sbjct: 667 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLI--------KESRR 718 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D CN V AIGI Sbjct: 719 QKTRVFAVVITDGRHDPRDDDLNLRALCNH----DVTVTAIGIG 758 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 75/197 (38%), Gaps = 13/197 (6%) Query: 148 YTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRS--MESFFDSSITKIDMAIKSINA 204 + + ++ P S + D +++ VLD S S M+S DS + + ++ + Sbjct: 19 HAQQQEVIAPGNSERNSCPEKADCPINVYFVLDTSESVTMQSPIDSLLYHMKQFVRQFIS 78 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 L++ + V + G + FS+ + F + + + ++ + T + L Sbjct: 79 QLQDETYLEQVALSWRYGGLHFSDVVRVFSPPDSDRASFTKSLESIVSIRKGTFTDCALA 138 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Q +RQ + + V +TDG + L A++ G ++ Sbjct: 139 N------MTQEIRQLKSKGGVH---FAVVITDGYVTGSPCGGIKLQ-AERAREEGIRIFT 188 Query: 325 IGIRVIRSHEFLRACAS 341 + + + + LR AS Sbjct: 189 VAPDQVPNEQGLRDMAS 205 >gi|24431113|gb|AAN61407.1|AF486289_1 matrilin [Biomphalaria glabrata] Length = 394 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 83/210 (39%), Gaps = 27/210 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ IVLD S S+ + AIK + L + ++ D N V+ ++++ Sbjct: 48 CRQEPLELGIVLDSSSSLRN------RDFKTAIKFLQEFLSQYEISSDPNG-VRVSIISY 100 Query: 227 SNKIEEFFLLEWGVSHLQ----RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 I + I + K G+ T++ ++Y + +R Sbjct: 101 GKGIYPEIGFNLTTYDTKDEVIEAIGRIPHKAGLRTDTGRAIQYMHEAQLANGVVRPG-- 158 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 K+ + +TDG +++E + + EA+K +++AIG+ IR+ E L Sbjct: 159 -----VTKVSIVITDG---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELLNIAG 210 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + V+N + + S I + + + Sbjct: 211 DQSRVTKVDNYNQL----SSIKESLAHQTC 236 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 48/330 (14%), Positives = 108/330 (32%), Gaps = 44/330 (13%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---FVNDIDD 109 + A Q++ EG + K + + +L+ + + + D Sbjct: 26 TNTGGALERAQQMLAEGRPSVPKIILLITDGDATDKERLDAQIEKLKKSNILIYTIGVGD 85 Query: 110 IVRSTSLDIVVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 ++ L+ + ++ Y + S KI + + Sbjct: 86 LIDRNELNRIATDEDFVYETRDFDSISKIKSSLLGRVCKKAKPKT----------SGVCG 135 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS 227 D +D+ ++D S S+ + V + D+ + VQ G++T+S Sbjct: 136 DISVDLQFIVDSSSSVTRKNFG----------FAKNFVANVSSVFDLRSGDVQVGVLTYS 185 Query: 228 NKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + G H Q K++ + G T++ L+Y N Sbjct: 186 TNVHSDSAIGLGAIHSQDDFVEKVQSMKYTGGDTHTGTALRYI-----------STNNRW 234 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 KI++F+TDG + ++ + +G ++AIG+ E + P Sbjct: 235 REEVPKILIFVTDG---TPQDRAIVPAAARSLRDKGVRIFAIGVGNAVESELKEIASEPY 291 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + Y A + + + + + D Sbjct: 292 ENHAIFIQGADYSAVQRV-RGHLERLVCND 320 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 49/130 (37%), Gaps = 18/130 (13%) Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L+ K+ + TN+ L+ A + + KII+ +TDG+ Sbjct: 11 ELKEKLMKVPFIQGKTNTGGALERAQQML----------AEGRPSVPKIILLITDGDATD 60 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + + + KK ++Y IG+ +I +E R + Y + +D+ S Sbjct: 61 KERLDAQI---EKLKKSNILIYTIGVGDLIDRNELNRIATDEDFVYETRD----FDSISK 113 Query: 361 IGKDIVTKRI 370 I ++ + Sbjct: 114 IKSSLLGRVC 123 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 79/216 (36%), Gaps = 33/216 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q D + +++ +D+S SM++ D ++ + A ++ + E+ ++ LVTF Sbjct: 80 QADPKAAVVLAVDISLSMQA-TDVQPSRFEAARAALRTFIRELP------EGLRLALVTF 132 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L L + +L + + T + I + + + D Sbjct: 133 ARDAHLVVPLTTDRGRLLEAVDFL-QLNLGTAIGDAIL---ESIQALPPLSERAEDPDPR 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------------- 332 I+ +TDG +L + EA ++ V+ IGI S Sbjct: 189 RLATIILLTDGRSLG---GVDPVVAAQEAARQQIRVHTIGIGRTTSGPVPGLPEVYAQAA 245 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A ++ V++ + +A+ + + Sbjct: 246 LFDEETLKEVARVGDGQYFYVDSAEKLKEAYRDLTR 281 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 72/216 (33%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMSEPDMVLGGNVVDRLTAAKAVLSDFLDR-------REGDRVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E + Sbjct: 153 RAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL-----------REQRH 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----------- 335 ++++V +TDG N + L AK G V+ I S+ Sbjct: 202 GQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGD 259 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + ++ + + Sbjct: 260 DIDEDGLRKIAEQTGGRFFRARDTEELAGIYAELDR 295 >gi|74191759|dbj|BAE32836.1| unnamed protein product [Mus musculus] Length = 1166 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|74178358|dbj|BAE32446.1| unnamed protein product [Mus musculus] Length = 1153 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 63/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ D +++I + + +E+ + GL+ F + Sbjct: 92 DLMLAVDISGSMKIEDMEVSDELVSRIRAVKQVGSRFIEQ-------REGDRLGLILFGS 144 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + ++R + + G T + A ++ E Sbjct: 145 NAYVQSPLSFDTATVKRFLLEAQIGFAGQDTAIGDAIGLAVKRL-----------KERPA 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++++ ++DG++ ++ Q L A G +Y IGI Sbjct: 194 ENRVLILLSDGKDTASS--VQPLNAAKLAADLGIRIYTIGIGA 234 >gi|224049043|ref|XP_002191793.1| PREDICTED: similar to capillary morphogenesis protein 2 [Taeniopygia guttata] Length = 554 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 74/217 (34%), Gaps = 27/217 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ DM VLD S S+ + + + I + ++ V+ Sbjct: 104 GSAPAAGEPSCHGAFDMYFVLDKSGSVATNW-----------REIFDFVNQLTERF-VSP 151 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQG 275 ++ + FS + L +++ +K L + T GLK A QI Sbjct: 152 KMRLSFIVFSTQAHVIMPLTGDREKIKKGLKDLEEVKPAGETYIHEGLKQANEQIAKQGA 211 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R + II+ +TDG+ + + ++++ GA VY +G++ + Sbjct: 212 SR---------FSSIIIALTDGKLDG-QIPLYAEKEAKKSRELGARVYCVGVQDFEPEQL 261 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R + V + A I ++ + Sbjct: 262 ERIADVKEQVFPVTGG---FQALKGIINSVLKQSCTE 295 >gi|157073982|ref|NP_001096700.1| von Willebrand factor A domain-containing protein 1 precursor [Bos taurus] gi|160221291|sp|A6QLN9|VWA1_BOVIN RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|151553526|gb|AAI48034.1| VWA1 protein [Bos taurus] gi|296478966|gb|DAA21081.1| von Willebrand factor A domain-containing protein 1 precursor [Bos taurus] Length = 413 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 82/225 (36%), Gaps = 27/225 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + + + + + + D++ +LD S S+ + + + + + + + + Sbjct: 12 SLRLARSGAERGLPASALQGDLLFLLDSSASVSHY------EFNRVREFLGRLAALLPVG 65 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 P +++ LV ++ F S +Q I+ ++ TN+ L YA Q+ Sbjct: 66 PGA---LRASLVHVGSRPHTEFPFGQHSSGSAVQDAIRAAAQRMGDTNTGLALAYAKKQL 122 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F K++V++TDG + E K G V+ + Sbjct: 123 FAK------AAGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRG 171 Query: 331 RSHEFLRACASPN----SFYLVENPHSMYDAF-SHIGKDIVTKRI 370 E A ++P F V++ H + A I + +++ Sbjct: 172 NLLELSAAASAPAEKHLHFVDVDDLHIITQALRGSILDAMWPQQL 216 >gi|291008772|ref|ZP_06566745.1| hypothetical protein SeryN2_29978 [Saccharopolyspora erythraea NRRL 2338] Length = 324 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 75/233 (32%), Gaps = 34/233 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +M+ +DVS SM++ D +++ A + +++ Sbjct: 69 LTVALAGPTAEQRIPRNRATVMLTVDVSLSMKA-TDVEPNRLEAAKVAAKEFADQLTPGI 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ + + +++ I L +T + G+ A + I Sbjct: 128 N------LGLVSFAGTATVLVMPTTDRASVKQAIDNLK-LSEATATGDGINAAMSAIDSF 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVI 330 M IV M DG +E + EAKK + I Sbjct: 181 GKM---VGGPSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTK 237 Query: 331 RS------------------HEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E R S F+ + + + ++ +G+ I Sbjct: 238 HGSIEIEGEQEFVEVDDEAMQEIARL--SGGEFHKAASAEQLREVYATLGEQI 288 >gi|198423392|ref|XP_002124188.1| PREDICTED: similar to fibropellin Ia [Ciona intestinalis] Length = 1781 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 60/323 (18%), Positives = 115/323 (35%), Gaps = 48/323 (14%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 QI+ G NN ++ + ++ T + + E++ + ++ + Sbjct: 232 QIIANGELNNNWRIYQLNSFNQLPTTVQ-NLQREIKAVYDIPACTPLIPPIHGQLTCTNA 290 Query: 124 NE-----GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT---SSVKVNSQTDARLDMM 175 NE Y+ Y I +++ M + + A D++ Sbjct: 291 NEIGSTCSYNCDHAGGYGIHGS------ENRTCQYLTMGVPRWSGDISTCGVCTASFDLL 344 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +++D S S+ F + I +++ V++ PD VQ ++T++ +I Sbjct: 345 VIIDSSSSIGLF------NFQIMKDFIIRLVDTVEIGPDK---VQVSVITYNRRIY---- 391 Query: 236 LEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +W + L + I L+ G T + L YA + F R+ Sbjct: 392 DKWNYDAYSNKNQLLQAIIGLTYSGSGTRTGRVLGYAVDYKFLPSYGRRLNKPH------ 445 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACASPNSFYLV 348 I + +TDG+ D + N + G V AIG+ E LR ++P+S Y Sbjct: 446 ITIIITDGK----SSDAITSNTLN--RLHGMSKVIAIGLHGANQPEILRIASNPDSLYAH 499 Query: 349 ENPHSMYDAFSHIGKDIVTKRIW 371 S S I +DI + Sbjct: 500 SVQGSFVGLLSTI-RDIASTLCT 521 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 64/183 (34%), Gaps = 33/183 (18%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +++ +D++ ++D S S+ + ++ IK++ + N V G+V Sbjct: 69 TESSVLVDIIFLVDGSGSVNYNAPGNWRRVLTWIKAVASGFNI------SNENVNIGVVQ 122 Query: 226 FSN---------KIEEFFLLEWGVSHLQR----KIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +S+ +I + Q + + G +T + + + F Sbjct: 123 YSHWYRTLPMSSQIYLKTEIALNRCRTQACFQYLVDKIQIMGYTTYTGAAINKTILEDFS 182 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R + KI++ +TDG + + A+++ ++ IG+ Sbjct: 183 TTPARSN---------KILILLTDGISKDDVNYASAF-----ARRQNITIFCIGVGSYSL 228 Query: 333 HEF 335 + Sbjct: 229 SQL 231 >gi|134100328|ref|YP_001105989.1| hypothetical protein SACE_3793 [Saccharopolyspora erythraea NRRL 2338] gi|133912951|emb|CAM03064.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 327 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 75/233 (32%), Gaps = 34/233 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +M+ +DVS SM++ D +++ A + +++ Sbjct: 72 LTVALAGPTAEQRIPRNRATVMLTVDVSLSMKA-TDVEPNRLEAAKVAAKEFADQLTPGI 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ + + +++ I L +T + G+ A + I Sbjct: 131 N------LGLVSFAGTATVLVMPTTDRASVKQAIDNLK-LSEATATGDGINAAMSAIDSF 183 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVI 330 M IV M DG +E + EAKK + I Sbjct: 184 GKM---VGGPSGAPPARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTK 240 Query: 331 RS------------------HEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 E R S F+ + + + ++ +G+ I Sbjct: 241 HGSIEIEGEQEFVEVDDEAMQEIARL--SGGEFHKAASAEQLREVYATLGEQI 291 >gi|71061058|dbj|BAE16255.1| calcium activated chloride channel [Rattus norvegicus] Length = 902 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 73/194 (37%), Gaps = 30/194 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIE 231 + +VLD S SM++ ++ ++ L ++ + GLVTF + +I+ Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + ++ L + T+ GL+ + I Sbjct: 359 NYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 IV +TDGE+ + + K GA+++ I + + E L FY Sbjct: 410 IVLLTDGEDDLISSCFEVV------KHSGAVIHTIALGPKAARELETLSDMTGGLRFYAN 463 Query: 349 ENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 464 KDVNSLMDAFSGIS 477 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 50/326 (15%), Positives = 109/326 (33%), Gaps = 37/326 (11%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL--CRIKNTWNMSFRNELRDNGF---- 103 L S + +L AA N N ++ + + +F ++ + Sbjct: 42 LQSPPNAALSTAAVAEENLPLAENTERYQDQPDQPVKSVAQEPVSTFSIDVDTGSYANVR 101 Query: 104 ---VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 N + ++ +V Y + +R PW ++ I + I + Sbjct: 102 RFLTNGEQPPKDAVRIEEIVNYFPYNYPLPTDNRPFAVHTETIDSPWQPEAKLIKIGIQA 161 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +++ ++DVS SM+ K+ + K++ + ++++ V Sbjct: 162 -QDTAKKDLPPANLVFLVDVSGSMDEE-----NKLPLVQKTLRILTQQLRPQDKVT---- 211 Query: 221 SGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L+T+++ + G + I L G +T+ L+ AY Q Q Sbjct: 212 --LITYASGEDLVLPPTSGADKETILSAIDKLRA-GGATDGESALQMAYEQ---AQKAFV 265 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I+ TDG+ D ++L E +K G + +G + +E + Sbjct: 266 PNGINR------ILLATDGDFNVGVSDTETLKSMVAEKRKSGVSLSTLGFGMGNYNEDMM 319 Query: 338 ---ACASPNSFYLVENPHSMYDAFSH 360 A A ++ ++N Sbjct: 320 EQIADAGDGNYSYIDNEKEAKKVLQQ 345 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 82/241 (34%), Gaps = 38/241 (15%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 ++ + + + + + S T L +++V+D+S SM+ + + Sbjct: 3 HRFRSSLIKLVVFAMLLSTLAGAVVQARSGRSHTTEALSVILVVDISGSMDRNDPQYLRE 62 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---EWGVSHLQRKIKYLS 251 I ++ + G++ F ++IEE L + ++ Sbjct: 63 TATLI-----FMDLLGPKD------YLGVLAFDDRIEELVPLQQVADNKGTFKEAVEGNL 111 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---STKEDQQS 308 T+ L+ A+ Q+ ++ + ++++VF+TDGE + D + Sbjct: 112 VPRGFTDYVGALEEAFEQLHSVETG---------DARQVVVFLTDGEPNPHLDARNDDEF 162 Query: 309 LY---------YCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDA 357 + A G VY + E L A + F L+ +P + Sbjct: 163 MEGYLGELWDLTGEYA-AAGVPVYPVAFSDEVGPEVLEQIAGHTGADFVLMPDPGDLVVT 221 Query: 358 F 358 F Sbjct: 222 F 222 >gi|311271865|ref|XP_001927121.2| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A domain-containing protein 2 [Sus scrofa] Length = 769 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 75/190 (39%), Gaps = 25/190 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ ++D S S+ + + + E + + P V+ G V F + Sbjct: 48 AAVDILFLIDGSHSIGK------GSFERSKHFAITVCEALDVDP---ARVRVGAVQFGST 98 Query: 230 IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L+ + ++ +I+ ++ G T + LKY + F +A+ Sbjct: 99 PRLEFPLDAFSTQQEVKAEIRRMAFKGGRTETGLALKYLLRKGFPGGR--------NASV 150 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 ++++ +TDG + K+R V+A+G+R R E L AS + Sbjct: 151 PQVLLIVTDGR-----SQGHVAEPAEQLKQRDVTVFAVGVRFPRWEE-LHILASEPTEQH 204 Query: 348 VENPHSMYDA 357 V + DA Sbjct: 205 VLMAEQVEDA 214 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 73/192 (38%), Gaps = 24/192 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + ++ + L L DVN +V+Q GLV Sbjct: 524 GCRARPLDLVFMLDASASVGPE---NFARMQ-------SFLRSCTLQFDVNPDVMQMGLV 573 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + +++ F L+ + + R + G + ++ L + Y+++ +Q Sbjct: 574 VYGGQVQTAFGLDTHTTRATVLRALSQAPYLGGAGSAGTALLHIYDKVMTVQ------MG 627 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +T G+ + + + + G V +G+ + + S Sbjct: 628 ARPGVPKVVIVVTGGQ-----GVEDAAVPAEKLRDNGVSVLVVGVGPVLREALRKLAGSR 682 Query: 343 NSFYLVENPHSM 354 +S V + Sbjct: 683 DSLIXVAGYEDL 694 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 67/201 (33%), Gaps = 28/201 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + V+D+S SM ++D+A +++ M + ++ V LVTF Sbjct: 179 SERPPAALTFVIDISGSM-----GEPGRLDLAQEALGTMTDRLRDDDSVA------LVTF 227 Query: 227 SNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++ E + + I L STN G++ Y R+ T Sbjct: 228 SDEAETVLPMTRLGDHRGRVHDAIDGLEPT-DSTNLGAGMETGYE---TAVEGRREGATN 283 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +V ++D + D ++ E ++ G ++ +G+ + A Sbjct: 284 R------VVLVSDALANTGDTDADTILERIATERREHGITLFGVGVGSDYGDALMERLAD 337 Query: 342 --PNSFYLVENPHSMYDAFSH 360 V + FS Sbjct: 338 KGDGHTTYVSTTEDAREVFSE 358 >gi|126316412|ref|XP_001380737.1| PREDICTED: similar to integrin, alpha 1 [Monodelphis domestica] Length = 1183 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 97/284 (34%), Gaps = 47/284 (16%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT-NSRHIVMPITSSVKVNSQT------- 168 ++ V +N + + ++ K C + Y + I S V Q Sbjct: 109 NVTEVKENMTFGSTLVTNPKGGFLACGPLYAYRCGHTYYTTGICSDVDSKFQVVNSIAPS 168 Query: 169 ----DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +LD++IVLD S S+ + +N++L ++ + P Q G+V Sbjct: 169 VQGCNTQLDIVIVLDGSNSIYPWTS--------VTDFLNSLLGKMDIGP---KQTQVGIV 217 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + + F L + + I + + G T + G+ A + F Sbjct: 218 QYGENVTHEFNLNKYTTTEEVLIAANQIVQRQGRQTMTALGIDTARKEAFTK------AR 271 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF----- 335 + KK++V +TDGE+ + + C ++I I Sbjct: 272 GARSGVKKVMVIVTDGESHDNHRLNEVIQDCE---DEDIQRFSIAILGHYNRGNLSTEKF 328 Query: 336 ---LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +++ AS F+ V + ++ +G+ I D Sbjct: 329 VEEIKSIASEPTEKHFFNVSDELALVTIAEVLGERIFALEATTD 372 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 67/201 (33%), Gaps = 28/201 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + V+D+S SM ++D+A +++ M + ++ V LVTF Sbjct: 179 SERPPAALTFVIDISGSM-----GEPGRLDLAQEALGTMTDRLRDDDSVA------LVTF 227 Query: 227 SNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++ E + + I L STN G++ Y R+ T Sbjct: 228 SDEAETVLPMTRLGDHRGRVHDAIDGLEPT-DSTNLGAGMETGYE---TAVEGRREGATN 283 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +V ++D + D ++ E ++ G ++ +G+ + A Sbjct: 284 R------VVLVSDALANTGDTDADTILERIATERREHGITLFGVGVGSDYGDALMERLAD 337 Query: 342 --PNSFYLVENPHSMYDAFSH 360 V + FS Sbjct: 338 KGDGHTTYVSTTEDAREVFSE 358 >gi|74222716|dbj|BAE42227.1| unnamed protein product [Mus musculus] Length = 1064 Score = 65.9 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 54 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 110 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 111 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 155 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 156 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 213 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 214 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 244 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 71/197 (36%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V+D S S+ ++ + + +E ++ + G V ++ + Sbjct: 469 DIGFVIDGSSSVG------TGNFRTVLQFVANISKEFEISETD---TRVGAVQYTYEQRL 519 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S + IK + + T++ + YA Q+F N +K+ Sbjct: 520 EFGFDDYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 570 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDG + A +G YAIG+ E P + + V Sbjct: 571 MILITDG-----RSYDDVRIPAMVAHHKGVTTYAIGVAWAAQDELEVIATHPASDHSFFV 625 Query: 349 ENPHSMYDAFSHIGKDI 365 + ++Y + ++I Sbjct: 626 DEFDNLYKFVPKVIRNI 642 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 74/214 (34%), Gaps = 42/214 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LD+S SM +++D A + + + GLV F K Sbjct: 99 IDVILALDISESMLQKDVGGTSRLDAAREVSRNFVLRRSND-------RIGLVVFRGKGY 151 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L + LS T + A N++ + Sbjct: 152 TQCPLTLDHEVLAMLLDRLSPGVIQDDGTAIGTAILIAVNRL-----------KASESLH 200 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI--GIRV---------------IR 331 K+++ +TDGEN + + + + A + G +Y I G +V + Sbjct: 201 KVLILVTDGENNAGEVGPGTA--ASIAARSGVRIYVINAGFKVVEDRIDPPEESGRYIQK 258 Query: 332 SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + ++ VE+P + I + Sbjct: 259 DEESLQGIARTTGGGYFRVEDPAAFDQTIRSIDR 292 >gi|327283159|ref|XP_003226309.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1076 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 80/199 (40%), Gaps = 30/199 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + A+++ V + N + GLV + Sbjct: 798 CKEIPMELVFVIDSSESVGP------ENFEIIKDFVTALVDRVTV---GRNATRIGLVLY 848 Query: 227 SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S ++ F L + +++ I+ + G T + ++ A ++ Sbjct: 849 SLEVRLEFGLNRYTTQQDVKQAIRKMLYMGEGTYTGTAIRKA---------TQEGFFGAR 899 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRAC-- 339 +K+ + +TDG+ +E + EA +YAIGI EFLR Sbjct: 900 TGVRKVAIVLTDGQ-ADKREAVKLDIAVREAHAANIEMYAIGIVNTSDPTQVEFLRELNL 958 Query: 340 -ASP---NSFYLVENPHSM 354 AS YL+++ +++ Sbjct: 959 IASDPDREHMYLIDDFNTL 977 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 75/203 (36%), Gaps = 23/203 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+M +LD S S ++F K + + ++L + + + L+ + Sbjct: 49 NENCILEMAFLLDSSESAKNFNHEQQKK---FVLETVNRMNGLQLSSNRHLSWRIALLQY 105 Query: 227 SNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ IE+ F G + +I ++ G T +T + Q++ +G Sbjct: 106 SSTVLIEQTFKDWKGPDAFKSRIAPITYIGHGTYTTYAI-TNLTQLYMTEGTHGSV---- 160 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACA 340 K+ + +TDG + D S +AK +G + + +G+ + LR A Sbjct: 161 ----KVAILLTDGVDHPRNPDIFSAT--ADAKHQGIVFFTVGMTRVAEEVANAAKLRLLA 214 Query: 341 S---PNSFYLVENPHSMYDAFSH 360 S + ++ + Sbjct: 215 SVPASRFVFNLQETGIVEKILKE 237 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 79/243 (32%), Gaps = 55/243 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D +I A + A + E+ V+ G+V F+ + Sbjct: 18 IMLAMDVSGSMRA-TDVQPDRITAAQNAAKAFIAELP------RHVRVGIVAFAGSAQLA 70 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG------------------ 275 L L + I + T + G+ A +F G Sbjct: 71 QLPTQSHEDLAKAIDSF-QLQRGTATGNGIMLALATLFPDAGIDIAALGGRQAMHPRPLD 129 Query: 276 --MRQHCNTEDANY------KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 RQ I+ +TDG+ + + ++ + A RG VY +G+ Sbjct: 130 EVTRQDPAKPFTPVAPGSYTSAAIIMLTDGQRTTGVDPLEAAQW---AADRGVRVYTVGV 186 Query: 328 RVIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHI-GKDIVTKR 369 ++ E L+A A + ++ + + + + V KR Sbjct: 187 GTVQGETIGFEGWSMRVRLDEETLKAVAGRTHAEYFHAATAADLKKVYETLSSRLAVEKR 246 Query: 370 IWY 372 Sbjct: 247 ETE 249 >gi|326671055|ref|XP_003199351.1| PREDICTED: integrin alpha-2-like [Danio rerio] Length = 1492 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 57/321 (17%), Positives = 119/321 (37%), Gaps = 45/321 (14%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 ++ + KG C I + N + L+D+ ++ + I +TSL + ++P Sbjct: 56 LVGSPWSGFPRNRKGDLYKCDIGDPRNPCQKLNLQDSVIIDGVQSINTNTSLGLTLIPVK 115 Query: 125 EGYS----ISAI---SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + + + A S+Y P T + + ++++ +D+ IV Sbjct: 116 KRFMTCGPLWAQRCGSQYFYP----GVCAEVTQRFTLKSAFSPAIQIC---GGPMDVAIV 168 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---IEEFF 234 LD S S+ + D K + +LE + + PD + ++ +S + F Sbjct: 169 LDGSNSIYPWSD--------VKKFLLNLLENLDIGPDQ---TRVSIMQYSEDLSFLYHFS 217 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L + G TN+ L Q F + N K++V + Sbjct: 218 FDQNKQKVLLAASDIDQQTGQETNTFAALDKTREQAFLPE------NGGRPGATKVLVVV 271 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--------EFLRACASPNSFY 346 TDGE+ Q+ + + + G I + I I ++ E + + + N + Sbjct: 272 TDGESADGYRGQEVIQ---KLDRDGIIRFGIAILKESANIQKFVEEIELIASTPTENYMF 328 Query: 347 LVENPHSMYDAFSHIGKDIVT 367 V + ++ D + +G+ I Sbjct: 329 NVSSEGALVDITATLGERIFN 349 >gi|327290735|ref|XP_003230077.1| PREDICTED: collagen alpha-1(XIV) chain-like, partial [Anolis carolinensis] Length = 562 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 73/201 (36%), Gaps = 26/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFS-NKI 230 D++I++D S S+ F + + LE + +V + + GL +S + Sbjct: 52 DIVILVDGSWSIGRFN----------FRLVRLFLENLVAAFNVGSEKTRIGLAQYSGDPR 101 Query: 231 EEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E+ L + + ++ L G +T + L + F + K Sbjct: 102 IEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTFILENNFKSE------AGARPGVPK 155 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYL 347 I + +TDG K + K G ++AIG++ E + P++ Y Sbjct: 156 IGILITDG-----KSQDDVIPPAKNLKDAGIELFAIGVKNADETELKEIASEPDNTHVYN 210 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 V + M + K + ++ Sbjct: 211 VADFSFMNSIVEGLTKTVCSR 231 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/169 (21%), Positives = 69/169 (40%), Gaps = 31/169 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNK 229 LD++ VLD S+SM + D T+++ A +I ++E+++ + GLV F S Sbjct: 90 LDIVFVLDSSKSMLAS-DLRPTRLERAKLAILDLVEQLESD-------RIGLVAFAGSAF 141 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ L++G + + + ++ L+ A N Sbjct: 142 LQTPPTLDYG--AFRESLDATAPDMMSRGGSDLGVALREATKAF------------PVEN 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K +V +TDGE+L ++ +A K G V+AIG+ Sbjct: 188 NYKAVVLLTDGEDLGGHAIDEAK----KASKEGVKVFAIGLGTPEGDYL 232 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 73/202 (36%), Gaps = 30/202 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 L++ +VLD S SM ++ + +++ ++ + ++ F ++ Sbjct: 41 PLNLCLVLDHSGSM------DGQPLETVKSAALGLIDRLEEDD------RLSVIAFDHRA 88 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + E + G + + + I+ L G T GLK + + R Sbjct: 89 KIVIENQQVRNG-AAIAKAIERLKAEG-GTAIDEGLKLGIQEAAKGKEDRVS-------- 138 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSF 345 I +TDGEN ++ + L A V+ +G + + L A A+ S Sbjct: 139 --HIFLLTDGENEH-GDNDRCLKLGTVASDYKLTVHTLGFGDHWNQDVLEAIAASAQGSL 195 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 +ENP F + + + Sbjct: 196 SYIENPSEALHTFRQLFQRMSN 217 >gi|301767380|ref|XP_002919113.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VI) chain-like [Ailuropoda melanoleuca] Length = 1059 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP--- 213 ++ + + + D +D+ VLD S S+ +D ++ ++ Sbjct: 21 DVSGNARAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKRFIDNLRDRYYRC 80 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 D N V +G + +S+++E L S L+ + + FG T + +K + Sbjct: 81 DRNLVWNAGALHYSDEVEIISPLRPMPSDRDALKASVDAVKYFGKGTYTDCAIKKGLEGL 140 Query: 271 FDMQG----------------MRQHCNTEDANYK--KIIVFMTDGEN-LSTKEDQQSLYY 311 R ++ K K ++ +TDG KE L Sbjct: 141 XXXXXXXXXXXXXASAGGRRPARSRGCQGGSHLKENKYLIVVTDGHPLEGYKEPCGGLED 200 Query: 312 C-NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF---YLVENPHSMYDAFSHIGKDI 365 NEAK G V+++ I L A+ +++ + + DA I + I Sbjct: 201 AVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTYRRNFTAADWGQTRDAEEIITQTI 258 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 64/170 (37%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 +D++ VLD S S+ + ++A I +L+ + V +G+ Sbjct: 640 CKCGPIDILFVLDSSESIG------LQNFEIAKDFIVKVLDRLSRDELVKFEPGHSHAGV 693 Query: 224 VTFSN-KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ +++E L V L+ IK L T + L+Y NQ+ Sbjct: 694 VQYSHNQMQEHVGLRDANIRNVQELKEAIKNLRWMAGGTFTGEALQYTRNQLL------- 746 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 747 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIK 788 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 18/165 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + TK K + ++ V+ +V +S Sbjct: 856 SPADITILLDGSASVGSSHNFDTTK--RFAKRLAERFLTAGRTDPSHD-VRVAVVQYSGP 912 Query: 230 IEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + F +T+ L Y R + Sbjct: 913 GQQRPERGSLQFLQNYTVLAGTVDGMDFFNDATDVNDALSYV---------TRFYREASS 963 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + EA++ ++ + + Sbjct: 964 REAKKRLLLFSDGNSQGATAAAIEKAV-QEAQRADIEIFVVVVGR 1007 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 77/209 (36%), Gaps = 28/209 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D S S++ ++ ++++ E + + + F Sbjct: 144 CPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLFALMQYSNLLKIHF 200 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + Q + + + T + G+ Q+F H N + Sbjct: 201 TFTQFRTSPSQ------QSLVDPIVQLKGLTFTATGILTVVTQLF------HHKNGARKS 248 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP 342 KKI++ +TDG+ + + +A+K G I YAIG + + + L +S Sbjct: 249 AKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVGHAFQGPTARQELNTISSA 306 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V+N + A I K + K Sbjct: 307 PPQDHVFKVDN----FAALGSIQKQLQEK 331 >gi|74212905|dbj|BAE33399.1| unnamed protein product [Mus musculus] Length = 1232 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 87/242 (35%), Gaps = 47/242 (19%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S + I S V N + D++IVLD S S+ +++ Sbjct: 117 GSSYYSTGICSRVDSNFNFTHSIAPAYQRCETYMDIVIVLDGSNSIYPWYEVQ------- 169 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 + ++ ++ +Q G+V + ++ F L+ + + + G Sbjct: 170 ----DFLINVLQKFYIGPGQIQVGVVQYGERVVNEFRLDDFRTVDEVVAAAKNIDQRGGE 225 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + G+ A Q F G + KK+++ +TDGE+ + + + ++ Sbjct: 226 ETRTALGINVARTQAFKHGG--------RPDAKKVMIVITDGESHDSPDLKAAVE--ESE 275 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR-----ACASPNSFYLVENPHSMYDAFSHIGKD 364 K +Y I + R I FLR A F+ V + ++ D +G+ Sbjct: 276 KDN-ITLYGIAVLGYYNRRGINPEAFLREIKFIATDPDEHFFSVTDESALKDIVDALGEK 334 Query: 365 IV 366 I Sbjct: 335 IF 336 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 12/42 (28%), Positives = 19/42 (45%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTV 49 +R + G I+TAI P++ GM I+V + K Sbjct: 11 LRRMLSDRGGNFGIMTAILAPVLLGAAGMAIQVGDMLLSKIH 52 >gi|224080732|ref|XP_002192406.1| PREDICTED: anthrax toxin receptor 1 [Taeniopygia guttata] Length = 516 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 71/205 (34%), Gaps = 25/205 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ ++ I +E + ++ + FS + Sbjct: 41 GGFDLYFILDKSGSVLHHWN-----------EIYHFVEHLARKFISPQ-LRMSFIVFSTR 88 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++ ++ L K G T G + A QI+ + Sbjct: 89 GTILMRLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRTAS----- 143 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ +TDGE L S N ++ GA VY +G++ + R S + + Sbjct: 144 --VIIALTDGE-LHEDLFFYSEREANRSRDLGATVYCVGVKDFNETQLARIADSRDHVFP 200 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 V + + A I I+ K Sbjct: 201 VNDG---FQALQGIIDSILKKSCIE 222 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+M V+D S S+ ++ T + ++ + L + P + G+V +S++ Sbjct: 606 GALDIMFVIDSSESIGY---TNFTLEKNFVINVVSRLGSIAKDPKSLTGARVGVVQYSHE 662 Query: 230 -IEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + L++AYN++ + Sbjct: 663 GTFEAIKLDDERIDSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLI--------KESRR 714 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG +D+ C R +V IGI Sbjct: 715 EKAQVFAVVITDGRYDPRDDDKNLGALCG----RDVVVNTIGIG 754 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 59/159 (37%), Gaps = 14/159 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGLVTFS 227 +++ ++D S S+ + +D + I +++++ N V Q G + +S Sbjct: 38 PINVYFIIDTSESVALQTVPIQSLVDQIKRFIPMFIDKLESELYQNQVYITWQFGGLHYS 97 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + +E + L ++ ++ G T + + M Q T+ AN Sbjct: 98 DVVEIYSPLTSSKDIYLPRLSAINYLGRGTFTDCAI----------SNMTQQIQTQMANG 147 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 V +TDG + + A+ G ++A+ Sbjct: 148 VNFAVVITDGHVTGSPCGGMKMQ-AERARDMGIKLFAVA 185 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 34/253 (13%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 ++ + Y + + C+ + N + M + + LD++ Sbjct: 341 YMIKDVNSTAYQLKRNIDTMLEYT-CSGMNSTCNLTEVDMRARA---IELNEAGGLDVVF 396 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV--VQSGLVTFSNKIEEFF 234 V D S S+ K+D ++ +E VKL+ +T+ + F Sbjct: 397 VFDASSSI---------KMDDFRLGLDFSIELVKLLGTSWKPGGTHVAAITYGTESHLEF 447 Query: 235 LLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + KI + + G T S L Q+ + R+ +K Sbjct: 448 NLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQV--VPFTREGS-------QKA 498 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYL 347 + F+TDG + +++ K +G +YAIG+ L AS Sbjct: 499 LFFITDGHSNIGGSPRKAAKI---LKDKGFQIYAIGVGKKVRRRELMEIASEPEDEYVIS 555 Query: 348 VENPHSMYDAFSH 360 V + A Sbjct: 556 VRKYKQLLSAVKK 568 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 81/253 (32%), Gaps = 34/253 (13%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 ++ + Y + + C+ + N + M + + LD++ Sbjct: 334 YMIKDVNSTAYQLKRNIDTMLEYT-CSGMNSTCNLTEVDMRARA---IELNEAGGLDVVF 389 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV--VQSGLVTFSNKIEEFF 234 V D S S+ K+D ++ +E VKL+ +T+ + F Sbjct: 390 VFDASSSI---------KMDDFRLGLDFSIELVKLLGTSWKPGGTHVAAITYGTESHLEF 440 Query: 235 LLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + KI + + G T S L Q+ + R+ +K Sbjct: 441 NLGDAGALTAKSVIAKIGKIKRSGGGTASRLALDTTIRQV--VPFTREGS-------QKA 491 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYL 347 + F+TDG + +++ K +G +YAIG+ L AS Sbjct: 492 LFFITDGHSNIGGSPRKAAKI---LKDKGFQIYAIGVGKKVRRRELMEIASEPEDEYVIS 548 Query: 348 VENPHSMYDAFSH 360 V + A Sbjct: 549 VRKYKQLLSAVKK 561 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 93/276 (33%), Gaps = 41/276 (14%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP--ITSSVKVNS 166 I ++ + Q EG ++ ++ + P + + Sbjct: 140 SIEPGDNIGLTASVQPEGDIVNCSPTRELKCSSMKYNPGFCYKSTDYGGNWRKEGSRNTE 199 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D++ VLD S S+ FD + + ++ + +V+ G+V + Sbjct: 200 CPSSGVDVLFVLDGSGSVGKNFDK-------VKDWVKNITAKLDI---GKEIVRVGVVQY 249 Query: 227 SNKIEEFFLL--EWGVSHL-----------QRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 S+ +E + ++ + + + + + G +T + L+ D Sbjct: 250 SHYVEGKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQKVIRDFDDA 309 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K++++ +TDG+ K+++ L N + +G +A+G+ Sbjct: 310 YIGN----------KQVLLLLTDGQ---AKDNKLILPNANRLRNKGIATFAVGVGEYDIS 356 Query: 334 EF---LRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 E S + + V + + + +I Sbjct: 357 ELKLIASGTDSTDRVFTVTDFGELDSIVKSLQTEIQ 392 >gi|332262298|ref|XP_003280198.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Nomascus leucogenys] Length = 1284 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/174 (20%), Positives = 64/174 (36%), Gaps = 23/174 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 DM+ ++D S S+ + + + +++ ++ PD VQ GL +S Sbjct: 177 PADMVFLVDGSWSIGH------SHFQQVKEFLASVIAPFEIGPDK---VQVGLTQYSGDA 227 Query: 231 EEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L G + + L G +T + L + + Q Sbjct: 228 QTEWDLNSLGTKEQVLAAVHRLRYKGGNTFTGLALTHV------LGQNLQPAAGLRPEAA 281 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +TDG K K G V+A+G++ E LR ASP Sbjct: 282 KVVILVTDG-----KSQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 329 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 53/329 (16%), Positives = 105/329 (31%), Gaps = 29/329 (8%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK-NTWNMSFRNELRDNGFV 104 K + A + G +R+ + + F+ NG Sbjct: 234 KKKPSQWAWLAGAITTAAIMWLVIYGLSRQDKVTDTEFGQAQIEQGTGDFQPGTSSNGLA 293 Query: 105 NDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 D D V +D P+ + Y + + + N+ I S Sbjct: 294 MLGDIDGVEVNRIDDTRHPEIDMYLSIMRPTGVVTDVPRQNVKVFENNNQIEGF---SWV 350 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 S+ L++M+V+D S SM + +D A + ++ + P NV G Sbjct: 351 NLSRVQDPLNIMLVIDTSGSMGPSKEGLTDGGLDAAKIAALDFIDHL---PSNANV---G 404 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L+ F + L + +++ I L G T L +Y Q+ +G Sbjct: 405 LIHFGTLVTVDHSLTNDIGAVRQSISELKPEGQ-TAIYDALAISYTQLRRAKG------- 456 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACA- 340 + IV ++DG + ++K D K Y IG+ + L Sbjct: 457 -----QTFIVLISDGADTASKGDNYDSIVAKATKAN-IPTYIIGLTSPEFDGQLLEDLQR 510 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + Y + + ++ + +++ + Sbjct: 511 DTKAMIYQTPSKEQLGGFYTEVAQEVSGQ 539 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 45/304 (14%), Positives = 90/304 (29%), Gaps = 40/304 (13%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M L A +P++ L++G +++ ++ + L S D + + + G + K Sbjct: 1 MLPLAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQA 60 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 D + +T + + G + I+ + Sbjct: 61 AAFFNANFNE--------------------DDLGATETNFATSSADGGSLVEGIATTDVE 100 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK-IDM 197 + + + + S ++ D+ +VLD + SM + TK ID Sbjct: 101 ----MVLMNLFGVISVPINVECSATMDIGNT---DVTMVLDTTGSMSQTLSGTTTKRIDA 153 Query: 198 AIKSINAMLEEVKLIPDVNN-VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++ + V +N V+ V +S+ + L+ YL Sbjct: 154 LRTAMKNFYDTVSAATTGSNARVRYSFVPYSSSVNVGQLI------YDLDPDYLVDTWAI 207 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 + TP QI + + T + SL CN AK Sbjct: 208 QSRTPVFNTVTEQILTGYDTPVTTTASSYSNE-----TTGNDQSYNSTRYNSLSACNTAK 262 Query: 317 KRGA 320 Sbjct: 263 PADV 266 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 31/159 (19%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT---EDANYKKIIVFMTDG 297 S LS G ST G+ + + N + I+FMTDG Sbjct: 473 SAFYAYADALSANG-STYHDLGMLWGLRLSSPDGPWQAMVNETPENGGEVSRHIIFMTDG 531 Query: 298 ENLSTKEDQQS--------------------------LYYCNEAKKRGAIVYAIGIRVIR 331 + + + + C+ AK +G V+ I Sbjct: 532 QMDTNYKVMSTYGIEWHDRRITDDGVTDQDARHTLRFRALCDAAKAKGFRVWVIAF-ASD 590 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 ++ L CAS +S + N + AF I K++ R+ Sbjct: 591 LNDDLSYCASASSTFPATNATELNTAFQEIAKNVAELRV 629 >gi|326428615|gb|EGD74185.1| hypothetical protein PTSG_12412 [Salpingoeca sp. ATCC 50818] Length = 1720 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 68/215 (31%), Gaps = 24/215 (11%) Query: 168 TDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + VLD S S+ ++ + + ++ I G V F Sbjct: 213 QQFPFDFVYVLDASGSVGRDNWNR-------VLNFTADSISTLRTIDPQAQF---GAVVF 262 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S E L+ S +Q ++ L G ST + L +IF Sbjct: 263 STTAEIAVPLQAFDSQQAVQDTVRTLPYAGESTATGNALNLVRREIFSDDAAAISGFRGG 322 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLR---A 338 ++V +TDGE + + D + G V+ +G+ S + L+ Sbjct: 323 RA---VVVLVTDGE--TLELDGVLENAADRLHATGPLGVDVFVLGVGAASSPQLLQDVYT 377 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 AS V P ++ +R+ D Sbjct: 378 AASGPPETHVFTPDVFDQLLPDTRTPLIAQRLACD 412 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 50/273 (18%), Positives = 99/273 (36%), Gaps = 30/273 (10%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 ++ DD ++ + ++ + + P Sbjct: 13 GSIQSGAGDFADDDPIQIIRGQEESKGEPTVGAVDIAAYGVFAFNYLQLSPEKAQEIPCT 72 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + S + + + + +D++ V+DVS SM+ KI + ++N M+E + Sbjct: 73 INLESPAQTSEASRSGVDIVCVIDVSGSMQG------EKIQLVQTTLNFMVERLSPAD-- 124 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + L++FSN + L G L+ I L G TN GL+Y + Sbjct: 125 ----RICLISFSNDATKISRLVQMSPKGKKQLKSMIPRLVASG-GTNIVGGLEYGLQAL- 178 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRV 329 RQ + I+ ++DG +N T Q++ + + V+ G Sbjct: 179 -----RQRRTINQLSS---IILLSDGQDNNGTTVLQRAKATMDSIVIRDDYSVHTFGYGH 230 Query: 330 IRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 L A A P + FY V++ ++ AF++ Sbjct: 231 GHDSTLLNALAEPKNGAFYYVKDEETIATAFAN 263 >gi|61557272|ref|NP_001013220.1| chloride channel calcium activated 2 [Rattus norvegicus] gi|38175219|dbj|BAD01114.1| Ca(2+)-activated chloride channel [Rattus norvegicus] Length = 903 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 73/194 (37%), Gaps = 30/194 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIE 231 + +VLD S SM++ ++ ++ L ++ + GLVTF + +I+ Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + ++ L + T+ GL+ + I Sbjct: 359 NYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 IV +TDGE+ + + K GA+++ I + + E L FY Sbjct: 410 IVLLTDGEDDLISSCFEVV------KHSGAVIHTIALGPKAARELETLSDMTGGLRFYAN 463 Query: 349 ENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 464 KDVNSLMDAFSGIS 477 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 98/292 (33%), Gaps = 26/292 (8%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 +G +IL AI +P++F + + + + K + + + + A N+ Sbjct: 1 MKLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--ND 58 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG----FVNDIDDIVRSTSLDIVVVPQN 124 N N+ G + RI + ++ ++ + + +DI +S + Sbjct: 59 PNVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRF 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLD 179 Y + A++ T W+ + I + +D++ D Sbjct: 119 FEYEVEALT---------TQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAAD 169 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSGLVTFSNKIEEFF 234 S+SME + K ++ IN + E++ ++N N G+ +++ F Sbjct: 170 FSKSMEEPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKF 229 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +++ + +K N IF +G D Sbjct: 230 -DNYNSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDP 280 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 62/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + I+++D+ + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWKNEDISRLDLVKALMGDFLQD-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRRTVRTFLDEAKIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N L A + G +Y IGI Sbjct: 193 QSRVLVLITDGANNG--GQIHPLTAARLAAQEGVRIYTIGIGA 233 >gi|156383823|ref|XP_001633032.1| predicted protein [Nematostella vectensis] gi|156220096|gb|EDO40969.1| predicted protein [Nematostella vectensis] Length = 182 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 68/187 (36%), Gaps = 28/187 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 ++++ ++D S S+ D+ K M IK + ++ + V FS Sbjct: 1 MNLVFLVDSSGSVN---DTDFGKFQMFIKDLAEEFKDAIS----EGDTEVAAVLFSTIPK 53 Query: 229 -KIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 KIE ++ ++ + S + G T + L + ++F + Sbjct: 54 TKIEFDLDDYDHINDIKAAVDAFSHQHGGQTRTGEALTFTLEEVF--------KKAPRPS 105 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 K ++V +TDG K + + G V+AIG+ + L+ AS Sbjct: 106 VKNVLVVLTDG-----KAQGNVTGPAQDVRDHGVEVFAIGVGPHSNEAQLKDIASDPDDK 160 Query: 344 SFYLVEN 350 + V + Sbjct: 161 HVFHVTD 167 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 78/216 (36%), Gaps = 40/216 (18%) Query: 168 TDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ ++D S S+ F+S + + + +L + L + + +VTF Sbjct: 70 ENQTVELVFLVDSSASVGNENFNSEL-------RFVKKLLADFTLAENAA---RVAIVTF 119 Query: 227 SNK------IEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++ ++ + L+ ++ + G T + + A Q + Sbjct: 120 SSRNKVVNHVDHLSKPSYHKHKCSLLEEELPRIKYAGGGTYTKGAMIKA-------QEVL 172 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +H K + MTDG + L + K+ ++ GIR E L+ Sbjct: 173 RHARPNAT---KAVFLMTDGYSNG----GDPLPEARKLKQNDVQIFTFGIRSGNVKE-LQ 224 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A+ E D+F+ + + + + D Sbjct: 225 NMATD----PAEEHSYFLDSFAEF-EALARRALHED 255 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|21703186|gb|AAM76090.1| Vwa1 protein [Boltenia villosa] Length = 599 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 70/210 (33%), Gaps = 31/210 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 AR+D ++LD S S+ + I +L + D + +V ++ Sbjct: 406 ARMDAFVILDSSSSIGD------ENWLIMKAFIRNILGSFTISDDTTHF---AIVRYNGL 456 Query: 230 IEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L L G T + + + + + N Sbjct: 457 VDTSTQVLLNDFPNSKAGLLAAFDKLPYNGSGTKTGQAIAHVRDNMMSS------ANGNR 510 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----HEFLRACA 340 + +++ +TDG K L N+ +K GA+++AIGI R + L Sbjct: 511 EGIQDLVMVITDG-----KSQDDVLKPSNDLRKMGALIFAIGITPPRGALDEAQLLEIAG 565 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 SP + + E DA I K Sbjct: 566 SPLNLPIAEGGFEGLDA--GFALQITDKVC 593 >gi|326916561|ref|XP_003204575.1| PREDICTED: matrilin-3-like [Meleagris gallopavo] Length = 363 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 62/203 (30%), Gaps = 68/203 (33%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + LD++ ++D SRS+ + + ++ M++ + + + ++ Sbjct: 3 TACKNRPLDLVFIIDSSRSVRP------EEFEKVKIFLSEMIDTLDVGE---RTTRVAVM 53 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +++ ++ F L Sbjct: 54 NYASTVKVEFPL-----------------------------------------------R 66 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--- 341 + K+++ +TDG E+ + A+ G +YA+G+ LR AS Sbjct: 67 TYFDKVVIVVTDGRPQDQVENVAA-----NARTAGIEIYAVGVGRADMQS-LRIMASEPL 120 Query: 342 PNSFYLVEN---PHSMYDAFSHI 361 + VE + F Sbjct: 121 DEHVFYVETYGVIEKLTAKFRET 143 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 82/235 (34%), Gaps = 49/235 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T ++ N+ +D+M+ +DVS SM + D ++ +++ + Sbjct: 77 PRTFTISENNDDTKGIDIMMSVDVSLSMLA-RDLEPDRLTALKNIAKKFVDK-RPGD--- 131 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 + GLVT+S + + + L +++ L+ T GL A + + Sbjct: 132 ---RIGLVTYSGEAFTKVPVTSDHAVLLEELENLNPLELQPGTAIGEGLSVAVSHL---- 184 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +H + KII+ MTDG N + AK VY+IGI + Sbjct: 185 ---RHSKAKS----KIIILMTDGVNTIENA-MPAQVGAQLAKSNDIRVYSIGIGT-NGYA 235 Query: 335 F------------------------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + ++ + S+ + + I + Sbjct: 236 LMPTQTDIFGDLVFTEVEVKIDEPVLREIAQTTGGKYFRATSNQSLEEVYEEINQ 290 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 64/169 (37%), Gaps = 19/169 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +S S +D+++ +D+S SM + D +++ A + ++ Sbjct: 84 TSNTSESIDSEGIDIVLAMDISGSMLAQ-DLQPDRLEAAKRVAMNFVDSRISD------- 135 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + GLV FS + + L+ +I + + + G+ A ++ +R Sbjct: 136 RIGLVIFSGESFTQCPITTDHGVLKNQIAQVKSGMLQDGTAIGMGLA----TSVERLRTS 191 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K+I+ +TDG N + D + AK VY IG+ Sbjct: 192 KAK-----SKVIILLTDGVNNTGLIDPLTAL--EIAKAFKIRVYTIGVG 233 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 65/193 (33%), Gaps = 29/193 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + +V+D S SM KI+ A +S A E + + +V F Sbjct: 454 EQPHVAIALVVDRSGSMSGL------KIEAAKESARATAEVLSPSDLIT------VVAFD 501 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 N+ L+ + ++ + G TN P L+ AY + +H Sbjct: 502 NQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKVKH------- 554 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSF 345 ++ ++DG+ C E + V A+GI + L Sbjct: 555 ----VIVLSDGQ----APYDGIADLCQEMRSARITVSAVGIGDADRNLLNLITDNGDGRL 606 Query: 346 YLVENPHSMYDAF 358 Y+ ++ ++ F Sbjct: 607 YMTDDLAALPRIF 619 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 89/234 (38%), Gaps = 40/234 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + G + I L ++ ++ G+ ++V F + L +DR+++ AA+ Sbjct: 13 LRRFGRSEDGSILIFGIFMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAASL--- 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD-IVRSTSLDIVVVPQNEG 126 + + + G + +D+ +V + +L++ Sbjct: 70 -----------------TQSRSPAEVVEDYVTKAGLEDYLDEPVVNANTLNV-------- 104 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S++A + Y +P F + + P S+ + +++ +VLD+S SM + Sbjct: 105 RSVTATAAYSMPTVFMKLLD----IDRLEAPAVSTAEERVSN---VEISLVLDMSNSMVT 157 Query: 187 FFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + ++D + ++ V D V+ +V ++ ++ L Sbjct: 158 DGTNPRDRLDNLKVAARDFIDIVMAGANSGLDGAPVISISIVPYTGQVNAGADL 211 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 ++++ C+ A+ +G VY++ L+ CAS Y + F I Sbjct: 498 NERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYYATVGPQIRTVFHSIAS 557 Query: 364 DIVTKRIW 371 I R+ Sbjct: 558 HITQLRLT 565 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 85/247 (34%), Gaps = 29/247 (11%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--KVNSQTDARLDMMIVL 178 + + A R ++ L T + M + + L++ VL Sbjct: 1 MKNKKKAVSQANPRPELELLPLKPGVRATGPTRMPMLLRVKIPPVQAEVERPPLNLAFVL 60 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLL 236 D S SM K+ A K++ +E ++ + +V + +++E L Sbjct: 61 DRSGSMAG------DKLKFAKKAVAYAVENLRPHD------RVAVVIYDHQVEVVVPSTL 108 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + R+++ + G + L + + + H + + N ++ ++D Sbjct: 109 AENKEEILRRLRPVRPRGSTN-----LHAGW--LEGSTQVAAHLDAKRLNR---VIVLSD 158 Query: 297 G-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHS 353 G N +RG +G+ + + + + A A ++Y +E+P Sbjct: 159 GLANTGETNPNVIAEQVRGLSQRGVSTSTLGVGLDYNEDLMMAMAEAGQGNYYFIESPDD 218 Query: 354 MYDAFSH 360 + F+ Sbjct: 219 LPGIFAQ 225 >gi|156399648|ref|XP_001638613.1| predicted protein [Nematostella vectensis] gi|156225735|gb|EDO46550.1| predicted protein [Nematostella vectensis] Length = 1841 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 86/237 (36%), Gaps = 21/237 (8%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 D++VR I PQ++ +++ + + L+ + ++ S+ + Sbjct: 1586 GDNVVRKELETIASSPQSDHVIMTSYNMLEATLEQIKQKVCSSAAKKENTHRASNRMPHD 1645 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A D+ ++D SR++E + I +K++ K+ P + G V + Sbjct: 1646 CDQAA-DIGFLIDGSRTIEVMGKGNFGTIIEFVKNVT------KMFPLSRDAFSVGAVVY 1698 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ L L + + + G +T + L+YA +F + R+ Sbjct: 1699 GSEPSLEIPLGSHNTSKGLLKALSKIKYPGTATKTGKALEYARLNLFGSRNARR------ 1752 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 KI+V +T G + + G V ++G+ + L AS Sbjct: 1753 -KVPKILVVLTKG-----SSRDDIREASEDLHRDGVHVVSVGMGPVNDRMELANMAS 1803 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 76/214 (35%), Gaps = 26/214 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 PI+SS + D +++ +VLD + S D++ + A K + + Sbjct: 1433 PISSSGPTVKREDCSVNLGVVLDSKTTAYSSDDNARKMLQFAKKVTHLF-----PVSPEG 1487 Query: 217 NVVQSGLVTFS-NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 N + G++ +S + + + + + + I S + + L +A +F Sbjct: 1488 N--RVGMMVYSGSPKLKVVHFDHFLDQDTMDKAIDSASYLNMEIKTGKVLNFAQEHLFSR 1545 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +++ +TDG + + +G ++ +G+ Sbjct: 1546 VPAGKQN---------VLLLITDG-----VSYDDVTKPAVQLQHKGVEIFCVGVGDNVVR 1591 Query: 334 EFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 + L AS + ++ + + + I + + Sbjct: 1592 KELETIASSPQSDHVIMTSYNMLEATLEQIKQKV 1625 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 64/211 (30%), Gaps = 38/211 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM + I ++ +A K V L + G+V++++ Sbjct: 329 VVLVLDESGSMNAETPKRIERLKVAAK------NFVSLAENGTE---LGIVSYASDAAVA 379 Query: 234 F--------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + I L TN GL+ A + I G+ + Sbjct: 380 SGRTEVAIAPLGANRAAWNNAIDGLGPS-TRTNIGAGLQKARDLITAAGGVTANT----- 433 Query: 286 NYKKIIVFMTDGENLST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-- 339 IV M+DG N D G VY C Sbjct: 434 ----YIVLMSDGLNNEPAPQANADADLNGKIAMLLADGIPVYVTCTGSDLG--LASQCSE 487 Query: 340 ---ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + + + +AF+ + IV Sbjct: 488 IGTGTGGHYVDSADSARLPEAFADFHERIVA 518 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 48/331 (14%), Positives = 112/331 (33%), Gaps = 72/331 (21%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R+ + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 13 LRSLGRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINN 72 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + K+ +++N++ + + + Sbjct: 73 TEDTDALKQKVSDWFHAQVENSYALGE-------------------------IEIDTTNH 107 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 +I+A + +P F N + + + S+VK + L++ IV+D S SM Sbjct: 108 NITATASGTVPTTFMKIA----NIDTVPVSVASAVK--GPATSYLNVYIVVDTSPSMLLA 161 Query: 185 --------------------------------ESFFDSSITK-----IDMAIKSINAMLE 207 + +D S K D+A ++ +L+ Sbjct: 162 ATTAGQSTMYSGIKCQFACHTGDTHTIGKKTYANNYDYSTEKGIKLRADVAGDAVREVLD 221 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ GL + + E + ++ S S S + Sbjct: 222 MIDESDSNHERIKVGLYGLGDTLTEVLAPTLSTDIARTRLADSSYGLTSATSKAATYFDV 281 Query: 268 NQIFDMQGMRQHCN-TEDANYKKIIVFMTDG 297 + Q + + T K+++ +TDG Sbjct: 282 SLATLKQKVGAGGDGTTSGTPLKLVLLLTDG 312 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 88/249 (35%), Gaps = 33/249 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA----- 62 +R F + G MTILT + I+F V G +++ + L +DR+++ AA Sbjct: 23 LRRFAVSTDGSMTILTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAADLDQE 82 Query: 63 ---TQIMNEGNGNNRKKLK-GGDILCRIKNTWNMSFRNELRDNG--------FVNDIDDI 110 ++N+ +I + N + D + D Sbjct: 83 LCPRVVVNDYISKEGFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSDSDPSDTA 142 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLK---------FCTFIPWYTNSRHIVMPITSS 161 T + ++ + + L+ + + +++ Sbjct: 143 SSGTESGSDGTSSGGDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKWSGVDTINSTA 202 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V ++ +++ +VLDVS SME +++ K A + MLE+ ++ + Sbjct: 203 VSAAEESIGNVEISLVLDVSGSMEGAKLTNLQK--AAKDFVKEMLEK-----SADDSLSI 255 Query: 222 GLVTFSNKI 230 ++ +S ++ Sbjct: 256 SIIPYSEQV 264 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 33/72 (45%), Gaps = 3/72 (4%) Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACA-SPNSFYLVENPHSMYDAFS 359 +K+++Q + C +A+++ ++++I S + L+ CA P +Y + F Sbjct: 597 SKKNEQVVSLCGKAEEKEVLIFSIAFEAPSSVKQMLKDCAVKPARYYEA-TGTQIERVFD 655 Query: 360 HIGKDIVTKRIW 371 I I R+ Sbjct: 656 SISTSIQNLRLT 667 >gi|118101030|ref|XP_417574.2| PREDICTED: similar to Von Willebrand factor A domain containing 1 [Gallus gallus] Length = 491 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 70/201 (34%), Gaps = 29/201 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + S + + M + V+ N VQ+ ++ S Sbjct: 36 DILFLLDSSGSVSYYEFSKVKEF---------MWDLVRPFTFGPNDVQTSIIHISTTPTM 86 Query: 233 FFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L G +Q+ I+ + TN+ L +A ++F + Sbjct: 87 EFPFDRYLSRGT--VQQAIRNTQQLMGDTNTGKALSFAKEKLFSND------AGARPDVP 138 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFY 346 K++V++TDG D S K G V+ + E L A AS P Sbjct: 139 KVLVWVTDG----FSSDDIS-EPMQLLKDMGVTVFIVSTGRGNYLE-LSAAASQPPEKHL 192 Query: 347 LVENPHSMYDAFSHIGKDIVT 367 + + + ++ Sbjct: 193 HFVDVDDLPIITKELRDAMLD 213 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 85/218 (38%), Gaps = 39/218 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT +++++V+DVS SM++ D + ++++ A S ++ +K +G+VTF Sbjct: 84 QTKEGVNVVLVMDVSGSMQAQ-DYTPSRLEAAKSSAEILINSLKSKD------YAGIVTF 136 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + K++ ++ ST GL + + N Sbjct: 137 ESGATTAAYLSPYKEKVIEKLRNVAPKEGSTAIGDGLSLGIDMASSI-----------PN 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------------- 332 KK+I+ ++DG N + AK VY IG+ + Sbjct: 186 KKKVIILLSDGVNNA--GYISPDEAIQYAKANNIQVYTIGMGSNGNVLLGYDWFGNPQYA 243 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+A A + ++ + ++ + + +I ++I Sbjct: 244 ELDEATLQAIANDTGGKYFKSIDDKTLDEIYKNISENI 281 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 67/163 (41%), Gaps = 21/163 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+MI +D+S SM++ D + ++++ + +++ + GL+ FS++ Sbjct: 79 DIMISVDLSASMDAN-DVAPSRLEKIKYELKNIVDAFNSD-------RIGLIIFSSEAFV 130 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + L I+ L+ G ST+ L A+ ++ K Sbjct: 131 QCPLTYDQNALNLFIETLNTGLVPGSSTDFGSALNMAHEKLTS------EAAPSSQQKSK 184 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 II+ ++DGE+ + + G ++++G+ + Sbjct: 185 IIILISDGEDFGDDTEGAVSKIND----SGIRLFSLGVGTEQG 223 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 68/200 (34%), Gaps = 29/200 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--FSN 228 +D+ ++D S S+ ++ ++ + + + G + + Sbjct: 370 SVDLGFLIDGSSSVGD---------GNFRLVLDLLVSIARSFDISDIGSRIGAIQFTYDQ 420 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++E F + R ++ + T + + +A +F ++ + Sbjct: 421 RMEFNFNDHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLF---------KPRSSSNR 471 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSF 345 K ++ +TDG+ A++ G VYA+G+ + L+A AS + Sbjct: 472 KFLIIITDGQ-----SYDDVRVPAMAAQREGITVYAVGVAWAPMED-LKAMASEPKESHV 525 Query: 346 YLVENPHSMYDAFSHIGKDI 365 + + I + I Sbjct: 526 FFTREFTGLGQFQQPIVRGI 545 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 75/221 (33%), Gaps = 31/221 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D +DM ++LD S ++ + ++ ++ + +K+ +V G+ Sbjct: 161 ATAHKDCPVDMALLLDSS------YNIGQRRFNLQKNFVSKLATMLKVGTQGPHV---GV 211 Query: 224 VTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V S F L + + IK + G +TN+ + + F Sbjct: 212 VQTSETPRTEFYLTNYTTAKDVTFAIKEIPYIGGNTNTGKAILHTVRNFFSPD------F 265 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--------RSH 333 Y ++IV DG E+ L A++ G ++ + + Sbjct: 266 GVRRGYPRVIVVFVDGWPSDNVEEAAIL-----ARESGINIFFVSVAKPSPEEASLVSDQ 320 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +F+R ++ + S + + K + K D+ Sbjct: 321 DFMRKAVCKDNEFFTFTMPSWFST-NKFVKPLAQKLCSIDQ 360 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 68/200 (34%), Gaps = 29/200 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--FSN 228 +D+ ++D S S+ ++ ++ + + + G + + Sbjct: 370 SVDLGFLIDGSSSVGD---------GNFRLVLDLLVSIARSFDISDIGSRIGAIQFTYDQ 420 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++E F + R ++ + T + + +A +F ++ + Sbjct: 421 RMEFNFNDHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLF---------KPRSSSNR 471 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSF 345 K ++ +TDG+ A++ G VYA+G+ + L+A AS + Sbjct: 472 KFLIIITDGQ-----SYDDVRVPAMAAQREGITVYAVGVAWAPMED-LKAMASEPKESHV 525 Query: 346 YLVENPHSMYDAFSHIGKDI 365 + + I + I Sbjct: 526 FFTREFTGLGQFQQPIVRGI 545 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 75/221 (33%), Gaps = 31/221 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D +DM ++LD S ++ + ++ ++ + +K+ +V G+ Sbjct: 161 ATAHKDCPVDMALLLDSS------YNIGQRRFNLQKNFVSKLATMLKVGTQGPHV---GV 211 Query: 224 VTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V S F L + + IK + G +TN+ + + F Sbjct: 212 VQTSETPRTEFYLTNYTTAKDVTFAIKEIPYIGGNTNTGKAILHTVRNFFSPD------F 265 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--------RSH 333 Y ++IV DG E+ L A++ G ++ + + Sbjct: 266 GVRRGYPRVIVVFVDGWPSDNVEEAAIL-----ARESGINIFFVSVAKPSPEEASLVSDQ 320 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +F+R ++ + S + + K + K D+ Sbjct: 321 DFMRKAVCKDNEFFTFTMPSWFST-NKFVKPLAQKLCSIDQ 360 >gi|158256546|dbj|BAF84246.1| unnamed protein product [Homo sapiens] Length = 445 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 38/233 (16%), Positives = 82/233 (35%), Gaps = 27/233 (11%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 S + + + + + + R D+M +LD S S+ + S + + + Sbjct: 6 ALGQALSLRLALARSGAERGPPASAPRGDLMFLLDSSASVSHYEFSRV------REFVGQ 59 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPG 262 + V +P +++ LV ++ F S Q ++ ++ T++ Sbjct: 60 L---VAPLPLGTGALRASLVHVGSRPYTEFPFGQHSSGEAAQDAVRASAQRMGDTHTGLA 116 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L YA Q+F + K++V++TDG + E K G V Sbjct: 117 LVYAKEQLF------AEASGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTV 165 Query: 323 YAIGIRVIRSHEFLRACASPN----SFYLVENPHSMYDAF-SHIGKDIVTKRI 370 + + E A ++P F V++ H + I + +++ Sbjct: 166 FIVSTGRGNFLELSAAASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQL 218 >gi|148685681|gb|EDL17628.1| integrin alpha M, isoform CRA_c [Mus musculus] Length = 1037 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 145 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 201 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 202 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 246 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 247 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 304 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 305 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 335 >gi|26333265|dbj|BAC30350.1| unnamed protein product [Mus musculus] Length = 1036 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGL 223 D++ ++D S S+ + K+ + ++ ++ K + + + Sbjct: 144 CPQQESDIVFLIDGSGSIN---NIDFQKMKEFVSTVMEQFKKSKTLFSLMQYSDEFRIHF 200 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ + + + + T + G++ ++F N Sbjct: 201 -TFNDFKRNPSP--------RSHVSPIKQLNGRTKTASGIRKVVRELFHKT------NGA 245 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSHEFLRAC 339 N KI+V +TDGE D + + EA + G I Y IG+ +S L Sbjct: 246 RENAAKILVVITDGEKFGDPLDYKDVIP--EADRAGVIRYVIGVGNAFNKPQSRRELDTI 303 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 AS + V+N ++ + + + I Sbjct: 304 ASKPAGEHVFQVDNFEALNTIQNQLQEKIFA 334 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 65.6 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 69/200 (34%), Gaps = 27/200 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMA--------IKSINAMLEEVKLIPDVNNVVQSGL 223 D +I++D S SM + + I + L + N ++G+ Sbjct: 97 FDTVILIDCSGSMRTNDPDFEYSVKNTLYPGSSYQITTCYRKLASKNYVKAQGNDDRTGI 156 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V F+++ L L I + G TN +K + + + + + Sbjct: 157 VLFTSEANTVCELTNSEYVLMNAIDKIYSNG-GTNFNNAIKESIRILTNTRNDSE----- 210 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 K I+ ++DGE+ + + A + + + I ++E L+ A + Sbjct: 211 -----KRILLVSDGESELSSS------VIDLAIENNIKINTVYIGGQNNNELLKNVAERT 259 Query: 342 PNSFYLVENPHSMYDAFSHI 361 ++ + + +S I Sbjct: 260 GGKYFKAVTADELINIYSEI 279 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 53/290 (18%), Positives = 108/290 (37%), Gaps = 51/290 (17%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLK--FCTFIPWYTNSRHIVMPITSSVK-VNSQTDA 170 T + ++ S+S + + + +K I I + + V ++ Sbjct: 25 TRKNHSYSVKHPRVSMSKVLKSRYYVKDIPFALIILALLFSIIGLARPAKVDHLSDINGE 84 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + +V+DVS SM + D T+++ + K++ +++ N + LV F+ + Sbjct: 85 GIYISLVVDVSPSMMAE-DMMPTRLEASKKTMIDFIKK-------RNFDKISLVAFALRA 136 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + L+ +IK + ST+ G+ A + + ++G + K Sbjct: 137 SVLSPSTFDYTLLEEEIKNIKIDEEGSTSIGLGIATAVDMLRSVKGDNE----------K 186 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------------- 332 II+ +TDGEN S + D + A +Y IGI Sbjct: 187 IIILLTDGENNSGEIDPKLAS--EIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRI 244 Query: 333 -------HEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E L AS ++ +N ++ + ++ I + I K I D Sbjct: 245 RADFSLNEEALIDIASTTGGKYFNAQNASALDNVYNTIDR-IEKKPILDD 293 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 51/422 (12%), Positives = 121/422 (28%), Gaps = 76/422 (18%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 G +L A+ +P +F + + + + K + + +++ A N+ Sbjct: 1 MRKIKKQSGHAALLFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAH--ND 58 Query: 69 GNGNNRKKLKGGDILCRIKN-TWNMSFRNELRDNGF---VNDIDDIVRSTSLDIVVVPQN 124 N +++ G + +I N R+ + G + D I + P+ Sbjct: 59 DNQDSQGAGSGSRVNRQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAECRAGLARGEPRF 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI-----TSSVKVNSQTDARLDMMIVLD 179 Y I S W+ + I + +D++ V D Sbjct: 119 FQYEIEVSSVQD---------TWFPGNDSIEGFGDTFSAKGAAVARKYQSEAVDIIFVSD 169 Query: 180 VSRSMESFFDSSITKIDM-AIKSINAMLEEVKLI--PDVNNVVQSGLVTFSNKIEEFFLL 236 S SM + + + I + +E++ + + GL F+ + Sbjct: 170 YSGSMAWNWSGGRNRKYIDLRNIIQEVTDELQKFNDLNNTDNNTVGLTAFNYYTKTVPSN 229 Query: 237 EWGVSHLQRKIKYLSKFGVS-----------------------------TNSTPGLKYAY 267 + + + +F S T++ + Sbjct: 230 RSNHCFMTQLVNPNGRFSASQTVRNIFVEKNNRYCVNHGDSSRFQDLPLTDNYSSFNNSV 289 Query: 268 NQIFDMQGMRQHCNT--------EDANYKKIIVFMTDGENLSTK--------------ED 305 + G + N +++++ ++DGE+ + Sbjct: 290 RSFYPNHGTASFQGIIRGAQMLRKGRNPRRLLIVLSDGEDGDPSRHMQLVNAGMCSTIVN 349 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHS-MYDAFSHIGK 363 S + K A + +G + + L+ C + Y +N + I + Sbjct: 350 TLSGDLTPDGHKVKARLAVVGFDYDVNKNRALQKCVGAENVYKAQNRDDILNKILELITE 409 Query: 364 DI 365 +I Sbjct: 410 EI 411 >gi|291411003|ref|XP_002721794.1| PREDICTED: integrin, alpha X (complement component 3 receptor 4 subunit-like [Oryctolagus cuniculus] Length = 1100 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 82/205 (40%), Gaps = 30/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ S + + A++ + Q L+ FSN+ + Sbjct: 152 DIVFLIDGSGSI------SFSNFATMKNFVKAVMSQF-----PRPSTQFSLMQFSNEFQT 200 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE-DANYKK 289 F VS + + + + +T++ + +Q+F H +T + K Sbjct: 201 HFTFNDFVSSTNPLQLLDRVYQLMGTTHTATAILRVVDQLF-------HASTGARKDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP--- 342 I++ +TDG+ L + + +A+ G I YAIG +V+ S L AS Sbjct: 254 ILIVITDGQKLDDPLGYEDVIP--KAEAAGIIRYAIGVGLAFQVVSSLRELHDIASEPAH 311 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVT 367 + VEN ++ D + + I Sbjct: 312 EHVFRVENFDALRDIQGRLKEKIFA 336 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 74/217 (34%), Gaps = 46/217 (21%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+++ +D+S SM++ D + + A + M+ P+ N GLV F+ + Sbjct: 85 QGIDLVLAMDLSGSMQA-LDLKPNRFEAARDVASEMIA---ARPNDN----IGLVVFAGE 136 Query: 230 IEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + + + + T GL A N + N Sbjct: 137 SFTLCPLTVDHDVILQMLDATEIGQLEDGTAIGLGLATAINTL-----------RGSDNK 185 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF------------ 335 K+I+ +TDG N + D A++ G +Y + +F Sbjct: 186 SKVIILLTDGSNNA--GDITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEYVE 243 Query: 336 ---------LRACA--SPNSFYLVENPHSMYDAFSHI 361 LR A + +Y + +++ + I Sbjct: 244 ADVQIDEGTLRHIAQQTGGKYYRATDETKLHEIYKEI 280 >gi|332872319|ref|XP_003319171.1| PREDICTED: collagen alpha-2(VI) chain isoform 2 [Pan troglodytes] Length = 828 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|332872317|ref|XP_003319170.1| PREDICTED: collagen alpha-2(VI) chain isoform 1 [Pan troglodytes] Length = 918 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|332872315|ref|XP_531504.3| PREDICTED: collagen alpha-2(VI) chain isoform 3 [Pan troglodytes] Length = 1019 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 41/287 (14%), Positives = 90/287 (31%), Gaps = 49/287 (17%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + ++ + + + Y + + P T S ++ Sbjct: 30 NKKSQANVTFSTTIAFKKTKSWSDALYHLLFVLRMIAI-ALIVVALARPQTHSENAKTKI 88 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+++ +DVS SM S D + + K + ++ N+ + GLV ++ Sbjct: 89 TDGIDIVMAIDVSASMLSQ-DLKPNRFEALKKVASQFVK-----DRPND--RIGLVIYAG 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + + ++ + T GL A N++ E Sbjct: 141 ESYTKTPVTTDKLIILNALSEITYGQIEDGTAIGMGLATAVNRL-----------KESKA 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRS--- 332 ++I+ +TDG N + D Q+ A + G VY +GI Sbjct: 190 KSRVIILLTDGVNNTGFIDPQTA--AELAAEYGIKVYTVGIGTNGMALSPYALNADGSII 247 Query: 333 ---------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 ++ A + ++ N + + I + TK Sbjct: 248 YRMQQVDIDEPLMKKIAQVTKGRYFRATNNQKLQQIYDEINQMETTK 294 >gi|332256729|ref|XP_003277468.1| PREDICTED: collagen alpha-2(VI) chain [Nomascus leucogenys] Length = 1124 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 747 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 803 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 804 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 855 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 856 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 895 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 173 KTDCPIHVYFVLDTSESVAMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 231 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 232 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 285 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 286 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 341 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 342 HELYRNDYATMLPDSTEIDQDTINRII 368 >gi|297716667|ref|XP_002834627.1| PREDICTED: collagen alpha-2(VI) chain-like, partial [Pongo abelii] Length = 279 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 71 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 127 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 128 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 179 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 180 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 219 >gi|296232327|ref|XP_002807820.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Callithrix jacchus] Length = 1018 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD +++ VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPINVYFVLDTSESVAMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI R H + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGTFTDCALANMTEQI------RLHGSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRSDYATMLPDSTEIDQDTINRII 235 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 86/216 (39%), Gaps = 30/216 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + + + ++ D++ +D S ME + ++ A K I A +E Sbjct: 24 CGGTVKVTLSLAACPCIGKESH---DIVFAIDRSAKMEG------SALEAAKKGIKAFIE 74 Query: 208 EVK--LIPDV--NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ + GLV+FS+ +L V R + L+ G S N + Sbjct: 75 TLERESAQPEGYAGEKRVGLVSFSDTATVNSMLSPVVEQAARAAEGLTAGGKS-NQAEAI 133 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + A ++ DM+ +K++ +TDG+ T Q+ EA++ G VY Sbjct: 134 RAAV-KLLDMK----------TPGEKMLFLITDGQ---TPFRSQTDSAAAEARQAGVTVY 179 Query: 324 AIGIRVIRS--HEFLRACASPNSFYLVENPHSMYDA 357 IGI E LR+ AS S + + +A Sbjct: 180 CIGIAAPDGVNREALRSWASGPSDSHIIEIRELGEA 215 >gi|115527066|ref|NP_478054.2| collagen alpha-2(VI) chain isoform 2C2a precursor [Homo sapiens] Length = 918 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|115527070|ref|NP_478055.2| collagen alpha-2(VI) chain isoform 2C2a' precursor [Homo sapiens] Length = 828 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|115527062|ref|NP_001840.3| collagen alpha-2(VI) chain isoform 2C2 precursor [Homo sapiens] gi|125987812|sp|P12110|CO6A2_HUMAN RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor Length = 1019 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|41350923|gb|AAH65509.1| Collagen, type VI, alpha 2 [Homo sapiens] gi|190690005|gb|ACE86777.1| collagen, type VI, alpha 2 protein [synthetic construct] gi|190691377|gb|ACE87463.1| collagen, type VI, alpha 2 protein [synthetic construct] Length = 1019 Score = 65.2 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 63/164 (38%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|224079848|ref|XP_002194049.1| PREDICTED: similar to von Willebrand factor A domain containing 1 [Taeniopygia guttata] Length = 525 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 71/218 (32%), Gaps = 27/218 (12%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + W + P + S + D++ +LD S S+ + S + + Sbjct: 6 LLSLFLWLRFAAGQDTPEPGAQPPISSAEG--DLLFLLDSSASVSHYEFSKVKEF----- 58 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTN 258 M + + VQ+ ++ S F + +++ I+ + TN Sbjct: 59 ----MWDLLHPFTFGPRDVQTSIIHISTTPTMEFPFDQHLSSGTVRKAIRDTRQLMGDTN 114 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L +A ++F + + K++V++TDG + K Sbjct: 115 TGKALSFAKEKLFSGE------AGARPDVPKVLVWVTDGFSTDDIS-----EPMQLLKDM 163 Query: 319 GAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSM 354 G V+ + E L A AS + + + Sbjct: 164 GVTVFIVSTGRGNFLE-LSAAASQPSDKHLHFVDVDDL 200 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 63/175 (36%), Gaps = 21/175 (12%) Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIK 248 + + ++ + P + G+V +S++ F L V+ I Sbjct: 7 NAANFAKVKQFAVNVVNTFDISPTA---TRVGVVQYSDRNSLVFNLGNKVNKPSTVSAIN 63 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 +S TN+ LKY RQ+ + N K+I+ +TDG++ + Sbjct: 64 GISYQSGGTNTGAALKYV----------RQYAAWREGNVPKVIIVLTDGKSSDSVSG--- 110 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIG 362 + G VYAIG+ + L+ + N+ + N +++ I Sbjct: 111 --PSRDLVAAGVEVYAIGVGKFDHGQLLQIANNKQNNVIELNNFNALATKIDMIS 163 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 77/215 (35%), Gaps = 32/215 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +++ L +++V+D S SM S ++D+A +++ + I Q GL Sbjct: 402 DTKQQPDLALVMVIDRSGSMSELVGGSRRNRLDLAKEAVYQASLGLTPID------QVGL 455 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F + L+ S ++ + + L F G TN PG+ Sbjct: 456 VVFDDAANWVLPLQRLPSVVEIE-RALGSFGIGGGTNIRPGI-----------EQAAQAL 503 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-A 340 K ++ +TDG S D + + + G + + I + + A Sbjct: 504 ASADAKVKHVILLTDGIAESNYSDLIA-----QMRAAGVTISTVAIGEDANPNLVDVANA 558 Query: 341 SPNSFYLVENPHSMYDAF--SHI---GKDIVTKRI 370 Y V + F I G+DIV +RI Sbjct: 559 GGGRSYRVTRIEDVPRIFLQETIIAAGRDIVEERI 593 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 77/226 (34%), Gaps = 22/226 (9%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 ++ + + + D+ +VLD S S+ + + + Sbjct: 51 HQRQTQQMRQNVPQGQSGDDC-QGIFDLYLVLDKSGSVADNWIHIYSFAEGL-------- 101 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 VK + N ++ ++T+S + E L + + + L GL + Sbjct: 102 --VKKFTNPN--LRISIITYSTEAEVILPLTSDSKEINKSLLVLKSIVPQ-----GLTHM 152 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + +I+ +TDG L K ++ +A++ GAIVY +G Sbjct: 153 QKGLRKANEQIRKSTLGGRIVNSVIIALTDGL-LLLKPYLDTMEEAKKARRMGAIVYTVG 211 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + + P+ + V+ + A + + +K Sbjct: 212 VFMYSKQQLVNIAGDPDRCFGVD---EGFSALEGVVDPLTSKSCTE 254 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 60/434 (13%), Positives = 137/434 (31%), Gaps = 131/434 (30%) Query: 24 AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL 83 A+ LP+I ++G+ ++ ++ + L + +D + + + Q+ + Sbjct: 2 AVLLPVILGIMGLGLDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPA------------- 48 Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 + + +E F + V+P E S++ + + F Sbjct: 49 --MDKGLVRAAVDEYMHANFPQAVVQ---------SVLPGAEERSVTVNAEATVGTIFMG 97 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + +S+V+ + L+++ V+D S SM+ S I + + A + Sbjct: 98 AL----------GIGSSTVRAQASAGYNNLEVVFVIDNSGSMK---GSPINETNAAATRL 144 Query: 203 NAMLEEVKLIPDVN-------NVVQSGL-------------------------------V 224 ++ + V V+ Sbjct: 145 VDLIMPEGMATSVKIGLVPFRGKVRIPADVDGLPSGCRNADGSLNEDGLLDEYKKPEYRY 204 Query: 225 TFSNKIE------EFFLLEWGVSH----LQRKIKYLSKFG--VSTNSTPGLKYAYNQIFD 272 +++++ L G++ + + I G T + GLK+A + + Sbjct: 205 PYNDRLRVTPYSCSSIPLTQGLTADRATITQAIGRQDARGDSSGTVISEGLKWARHVLTP 264 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE-----------------NLSTKE----------- 304 + + +D +K+I+ +TDG+ N T Sbjct: 265 EAPFTEGSSAKD--MRKVIILLTDGDTEDGNCGGNYSVYYRPNNYWTNAYYGMMDMDSHC 322 Query: 305 ------DQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRACASP-----NSFYLVENP 351 + L AK G ++AI + +RA AS + ++ +P Sbjct: 323 EDGGVLNNAMLSEAALAKDAGIEIFAIRYGSSDAVDRNLMRAVASSKEGTDDHYFDAPSP 382 Query: 352 HSMYDAFSHIGKDI 365 + + D F IG+ + Sbjct: 383 YDIDDVFKLIGRQL 396 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 71/201 (35%), Gaps = 30/201 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L++ +VLD S SM ++ ++ ++E ++ + +V F ++ Sbjct: 41 PLNLCLVLDHSGSMHGQP------LETVKQAAVGLIERLQPDD------RLSIVAFDHRA 88 Query: 231 E---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + ++RKI L G T GLK ++ + Sbjct: 89 KVLVRNQPMG-NLDQIKRKINRLGADG-GTAIDEGLKLGVKELIKAKQDTVSQ------- 139 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSF 345 + +TDGEN ++ + A + + ++G + + L A + S Sbjct: 140 ---VFLLTDGENEHGN-NESCIKLAELAAENNLTINSLGFGANWNQDILEKIADIATGSL 195 Query: 346 YLVENPHSMYDAFSHIGKDIV 366 +E P F+ + + Sbjct: 196 SYIEEPEQALSEFARLFNRMQ 216 >gi|119899154|ref|YP_934367.1| hypothetical protein azo2864 [Azoarcus sp. BH72] gi|119671567|emb|CAL95480.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 339 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 69/236 (29%), Gaps = 50/236 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM S D ++ A + V+ P V+ G+V+F+ Sbjct: 88 IILAIDVSLSM-SAPDVLPDRLSAAQAAARDF---VRNQPPD---VRIGIVSFAGTATVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF------------DMQGMRQHCN 281 L I L + T G+ A + +F + N Sbjct: 141 QAPTDNREDLLGAIDRL-QLARHTAIGSGIIVALSALFPEESFDPDPTMMSSAEPGRAPN 199 Query: 282 TEDANY------KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ +TDG S E + A RG V+ +G Sbjct: 200 APREEVAPGSNGSAAVILLTDGRRTSGPEPVDAARM---AAVRGIRVFTVGFGTAEGATI 256 Query: 336 ---------------LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 LRA A + ++ + I D+ + + + Sbjct: 257 QNEGWSVFMRFDEGTLRAIADLTQAKYFHAGTAAEL----QQIYHDLNARYVLERR 308 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 72/211 (34%), Gaps = 38/211 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+VLD+S SM S + + ++ ++ + + + + + G+V F++ Sbjct: 86 DLMLVLDLSYSMSQEDMSDGSDYVDRLTAVKKVVSDFAIKREGD---RLGVVLFADHAYL 142 Query: 233 FFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + ++ L G T G+ A D +++ Sbjct: 143 QTPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIGLATKTFIDSDA-----------PQRV 191 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------- 331 ++ ++DG N S D + AKK A +Y IG+ Sbjct: 192 MILLSDGSNTSGVIDP--IEAAKIAKKYDATIYTIGVGAGEMMVKEFFMTRKVNTAQDLD 249 Query: 332 SHEFLRACA-SPNSFYLVENPHSMYDAFSHI 361 ++ + ++ + + + I Sbjct: 250 EKALMQIAQITGGQYFRARDAKELATIYDTI 280 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 43/228 (18%), Positives = 77/228 (33%), Gaps = 46/228 (20%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q +D+M+ LD+S +M + D T+++ A + P+ N GLV Sbjct: 82 TQNTEGIDIMMALDISSTMLA-GDIKPTRLEAAKSVATEFILS---RPNDN----IGLVI 133 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F+ + L + L ++ T GL A N+I + Sbjct: 134 FARESFTQCPLTTDHAVLVNLFNGVNNGMIEDGTAIGLGLANAVNRI-----------KD 182 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------- 328 + K+I+ +TDG N D + AK G +Y IG+ Sbjct: 183 GKSKSKVIILLTDGSNN--SGDIAPITAAEIAKTFGIRIYTIGVGTHGVINIPVSTPMGI 240 Query: 329 -VIRSHE-----FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 R L A + ++ + + + + I K T+ Sbjct: 241 QYQRVQSEFDAKSLENIANLTGGKYFGATDNSKLRNIYQEIDKLEKTR 288 >gi|119629723|gb|EAX09318.1| collagen, type VI, alpha 2, isoform CRA_c [Homo sapiens] Length = 1019 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|119629721|gb|EAX09316.1| collagen, type VI, alpha 2, isoform CRA_a [Homo sapiens] Length = 828 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|119629722|gb|EAX09317.1| collagen, type VI, alpha 2, isoform CRA_b [Homo sapiens] Length = 918 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEIDQDTINRII 235 >gi|12803331|gb|AAH02484.1| COL6A2 protein [Homo sapiens] gi|30582665|gb|AAP35559.1| collagen, type VI, alpha 2 [Homo sapiens] Length = 425 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 18 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 74 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 75 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 126 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 127 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 166 >gi|30584073|gb|AAP36285.1| Homo sapiens collagen, type VI, alpha 2 [synthetic construct] Length = 426 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 18 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 74 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 75 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 126 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 127 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 166 >gi|179711|gb|AAA35620.1| alpha-2 collagen type VI-a' [Homo sapiens] Length = 429 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 22 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 130 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 131 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|179710|gb|AAA35619.1| alpha-2 collagen type VI-a [Homo sapiens] Length = 328 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 22 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 130 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 131 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|13603394|gb|AAA52056.2| type VI collagen alpha 2 chain precursor [Homo sapiens] Length = 1019 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 612 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 668 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 669 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 720 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 721 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 760 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 78/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 +TD + + VLD S S+ + I + + + +++ ++ V + G Sbjct: 40 KTDCPIHVYFVLDTSESVTMQSPTDILLFH-MKQFVPQFISQLQNEFYLDQVALSWRYGG 98 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ +S F T + L QI RQ + Sbjct: 99 LHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQI------RQDRSKG 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + V +TDG + L A++ G ++A+ + LR AS Sbjct: 153 TVH---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFAVAPNQNLKEQGLRDIASTP 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +M + I +D + + I Sbjct: 209 HELYRNDYATMLPDSTEINQDTINRII 235 >gi|105706|pir||C35243 collagen alpha 2(VI) chain precursor, short splice form - human (fragment) gi|179709|gb|AAA35618.1| alpha-2 collagen type VI [Homo sapiens] Length = 238 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 64/164 (39%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 22 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 78 Query: 230 -IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E L+ +S + +K L T + LK+AY+++ + Sbjct: 79 GTFEAIQLDDEHIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYDRLI--------KESRR 130 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D C+ R V AIGI Sbjct: 131 QKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIGIG 170 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/260 (16%), Positives = 85/260 (32%), Gaps = 48/260 (18%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--- 186 + ++P K + I W + P+ V Q R D+M+V+D+S SM Sbjct: 53 TKTPSSRLP-KALSVIIWLLLVTAMARPVWYGEPVEFQPKHR-DLMLVVDLSYSMSQEDM 110 Query: 187 -FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 F I ++ ++ +E K + GLV F++ L L + Sbjct: 111 QFNGEYIDRLSAVKHVLSDFIERRKGD-------RVGLVLFADHAYLQTPLTLDRDTLSQ 163 Query: 246 KIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G T G+ A D +++++ ++DG N + Sbjct: 164 QLNQAVLKLIGTQTAIGDGIGLATKTFVDSDA-----------PQRVMILLSDGSNTA-- 210 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------SHEFLRACA--SPN 343 L + AKK A +Y +G+ L A + Sbjct: 211 GVLDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGG 270 Query: 344 SFYLVENPHSMYDAFSHIGK 363 ++ + + + I + Sbjct: 271 QYFRARDSKELATIYDTINQ 290 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 71/198 (35%), Gaps = 33/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SM + ++ L V+ + + G+V FS+ Sbjct: 306 LVLVLDVSGSMAG---------EKLSMAVAGALALVESAAPED---RLGVVVFSSGHRVL 353 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + +++ + L + G T + A + + G R K Sbjct: 354 FPPRPMTAQAKKEAESLLLSLRAGGGTVLGGAFREAVRLLQGVPGER-----------KA 402 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 ++ +TDG KE + A+ G V A+ + FL+ A FY Sbjct: 403 VLVLTDGLIADAKE-----PILDLAQTSGVEVSALALGPDADAPFLKELARRGGGRFYQA 457 Query: 349 ENPHSMYDAFSHIGKDIV 366 +P + F G+++ Sbjct: 458 PSPRELPRLFLREGQEVF 475 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 88/249 (35%), Gaps = 31/249 (12%) Query: 14 NYKGG-MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G + ++ A +P+I LG I+ ++ +K+ L + +D AA Sbjct: 29 DRRGSTLALMAAGLIPVI-AALGAGIDAGRLYLVKSQLQAGVDA----AALAGARAFAVT 83 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + + + +F ++ + D +++ + Sbjct: 84 DGSPAARDK---QASAYFYGNFASDYMGVSNLQLTPDFKTVGGINVT----------TIT 130 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSS 191 +R +P+ F + P T ++ L++M+VLD + SM++ Sbjct: 131 ARAIVPMTFMRIFGFQ--------PRTMQAVAKAELQPRPLEVMVVLDDTGSMKANLSGG 182 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 T++ ++ N ++ + + G + + + LL + +K++ + Sbjct: 183 RTRMVALKEAANDFVDILHQGASSRRDLAMGFIGYDVTVNVGHLL---PGNAVKKVEGFN 239 Query: 252 KFGVSTNST 260 F ++ Sbjct: 240 SFAGASAYG 248 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 62/396 (15%), Positives = 144/396 (36%), Gaps = 58/396 (14%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT---QIMNEGN 70 N KG + I A+ +I + + I++S + + + D + A T + + +G Sbjct: 15 NIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGM 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N+ + + + S + +N +D + ++ + ++E Y Sbjct: 75 SKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHESYMT 134 Query: 130 SAISRYKIPLKFCTFIPWYTNSRH----IVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 A+ I + ++ ++ S+ + + R M + + Sbjct: 135 HAMGFDNIDYTADSESTISFGQGKYEFIFLVDVSPSMGIGASNRDRQIMQRAIGCQFACH 194 Query: 186 SFFDSSIT---------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 + SS++ +ID+ ++ +++ +++ +V+ +++GL +FSN + L Sbjct: 195 EPWYSSVSRAKSAGARLRIDVVKDALKSLVTQLEEATEVD--LRTGLYSFSNYLHIQTGL 252 Query: 237 EWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 G+S +R+ ++ G TN +N + A+ K+ I Sbjct: 253 NKGISKFKREANKIAIHREYLRGGGTNFHGVFS-DFNGVL-------RSLKPKADVKQHI 304 Query: 292 VFMTDG------------------ENLSTKEDQQSLYYCNEAKKRGAI-VYAIGIRVIRS 332 + ++DG N + +C+E KK V+ + + R+ Sbjct: 305 IIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDRA 364 Query: 333 H------EFLRACA-SPNSFYLVENPHSMYDAFSHI 361 H +RACA S + FY + + A + Sbjct: 365 HYVRASTSSMRACATSADFFYSANSAAEIDKASKTV 400 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 22/68 (32%), Positives = 33/68 (48%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 DQ++ + C+ AK G IVY +G L+ CAS +S Y + + DAF+ I Sbjct: 490 DQRTDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSDSHYYDADGLEISDAFTSIAS 549 Query: 364 DIVTKRIW 371 I R+ Sbjct: 550 SIRKLRLT 557 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 80/223 (35%), Gaps = 43/223 (19%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F G MT++ + G+ +++ + L +DR+++ AA Sbjct: 27 FAREEDGLMTVMALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLAAA-------- 78 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY-SI 129 L + + R+ L G + + + V + GY + Sbjct: 79 -----------DLDQQVSP-ETVVRSYLEKAGLLEYLSSV---------TVQEGLGYRKV 117 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 SA + ++P F + + +P S+ + ++ +++ +VLDVS SM S Sbjct: 118 SATATAELPTHFMKL----SGYDSLTIPAASTAE---ESIGNVEISLVLDVSGSMNSN-- 168 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++ + ++ + + V +V ++ ++ Sbjct: 169 ---SRLYNLKNAAKEFVDHMLSATEP-GTVSISIVPYATQVNA 207 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 71/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMES---FFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ D I ++ K ++ + + K + G+V F + Sbjct: 98 DLMLVVDLSGSMQKEDMNLDGEYIDRLSAVKKVLSDFVAKRKGD-------RLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++IK + G T G+ D Sbjct: 151 HAYLQTPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVDSDA----------- 199 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + AKK A +Y +G+ Sbjct: 200 PQRVMILLSDGSNTA--GVLDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTA 257 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + ++ I K Sbjct: 258 ADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYNTINK 294 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 98/292 (33%), Gaps = 26/292 (8%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 +G +IL AI +P++F + + + + K + + + + A N+ Sbjct: 1 MKLKKAQQGHASILFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAH--ND 58 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG----FVNDIDDIVRSTSLDIVVVPQN 124 N N+ G + RI + ++ ++ + + +DI +S + Sbjct: 59 PNVNSDGLGSGSKVNRRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRF 118 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLD 179 Y + A++ T W+ + I + +D++ D Sbjct: 119 FEYEVEALT---------TQNSWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAAD 169 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSGLVTFSNKIEEFF 234 S+SME + K ++ IN + E++ ++N N G+ +++ F Sbjct: 170 FSKSMEEPWTGGRQKYKDLVRVINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKF 229 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +++ + +K N IF +G D Sbjct: 230 -DNYNSCFMKQDYFEKNSRDHRKKKYVDIKRTLNNIFIEKGNDSCGFKSDDP 280 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 26/173 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKNDEVSRLVLVQQLLGDFLESRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFLQAPLTYDRQTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +++V +TDG N + + D + A G +Y IGI L++ Sbjct: 193 NSRVLVLVTDGANNAGQIDPLTA--ARLAADEGVKIYTIGIGSDPEKNALQSA 243 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 79/206 (38%), Gaps = 32/206 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +D S S+ + ++ I+ + V+ + + +S++ Sbjct: 6 IDLVFAIDASSSVGKV---NFERVKGFIRRL------VESFHISRTSTRVAAIVYSSRPR 56 Query: 232 EFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F S + +K L T++ L+ A +++F G ++ K Sbjct: 57 VAFDFNRYTSARRAAHAVKRLRFLRGGTSTGRALRLASSRLFRRYGRKRR---------K 107 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLV 348 +++ +TDG K L K++G ++A+G+ + +E + + P+ Y Sbjct: 108 VLMLITDG-----KSSDDVLKPSKALKRKGVQIFAVGVGMSVSRNELILIASHPSQVYQA 162 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + S I K + ++ K Sbjct: 163 S-----FTSLSAIVKSL-ARKTCESK 182 >gi|219850571|ref|YP_002465004.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544830|gb|ACL26568.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 86/226 (38%), Gaps = 30/226 (13%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P + + ++ +++ L++ +V+D S SM ++ Sbjct: 10 PSRSVLPALNEPQLLYALIELSAQSGATKMPRLPLNLCLVIDRSSSMRG------ERLQQ 63 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGV 255 ++ +L+ + + + LVTF+++ E L + ++R+I + G Sbjct: 64 VKQAAMQILDLL--GDNES----FALVTFNDRAEVVVSSQLARARAEIKRQISAIEAAGG 117 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + +T GL ++ Q A ++ ++ +TDG ++ + + A Sbjct: 118 TEMAT-GLALGVQEL-------QRAMMPRAIHR--LLLLTDGRTYG--DESRCVEIARRA 165 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNS---FYLVENPHSMYDAF 358 + RG + A+GI + + L A+ + Y + + + F Sbjct: 166 QARGIGITALGIGSEWNEDLLETIAARENSRTHY-ITSAADITKIF 210 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 76/229 (33%), Gaps = 47/229 (20%) Query: 170 ARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINA---------MLEEVKLIPDVNNVV 219 +D+++ LD+S SME++ +I I ++ +E++ + Sbjct: 82 QGIDIVLALDMSGSMEAYDVPRNINDARTLIAAVKNKEVENRIEVAKKEIRRFIEQRPND 141 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMR 277 + GL+ F+++ F + L ++ L + + L N++ Sbjct: 142 RIGLIGFADQAYSFAPPTLDHAWLLAHLEQLEPGMIGQQTGIAAPLASGVNRL------- 194 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----- 332 + ++++V TDG N K+ +++ +GI + Sbjct: 195 ----KKSDAPRRVLVLFTDGRNNVDN-RLTPEQAAALGKEFDVVIHTVGIGSRNAFVLVT 249 Query: 333 ----------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + LR+ A + +++ + M I + Sbjct: 250 DPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADGMKQVMDEINQ 298 >gi|301753369|ref|XP_002912539.1| PREDICTED: anthrax toxin receptor 2-like [Ailuropoda melanoleuca] Length = 611 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 76/220 (34%), Gaps = 31/220 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S + D+ VLD S S+ + + I ++++ V+ Sbjct: 151 GGPVSAQEQPSCRGAFDLYFVLDKSGSVANNW-----------IEIYNFVQQLTERF-VS 198 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 ++ + FS++ L S + + ++ L + T GLK A QI Sbjct: 199 PQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAG 258 Query: 275 GMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G + II+ +TDG + L ++ ++ GA VY +G+ Sbjct: 259 GFKASS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSFGARVYCVGVLDFEQ 306 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V+ + A I I+ + Sbjct: 307 AQLERIADSKDQVFPVKGG---FQALKGIINSILDRSCTE 343 >gi|281346820|gb|EFB22404.1| hypothetical protein PANDA_000280 [Ailuropoda melanoleuca] Length = 482 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 76/220 (34%), Gaps = 31/220 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S + D+ VLD S S+ + + I ++++ V+ Sbjct: 21 GGPVSAQEQPSCRGAFDLYFVLDKSGSVANNW-----------IEIYNFVQQLTERF-VS 68 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 ++ + FS++ L S + + ++ L + T GLK A QI Sbjct: 69 PQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAG 128 Query: 275 GMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G + II+ +TDG + L ++ ++ GA VY +G+ Sbjct: 129 GFKASS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSFGARVYCVGVLDFEQ 176 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V+ + A I I+ + Sbjct: 177 AQLERIADSKDQVFPVKGG---FQALKGIINSILDRSCTE 213 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 61/452 (13%), Positives = 134/452 (29%), Gaps = 111/452 (24%) Query: 20 TILTAIFLPIIFL---VLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 ++ A+ + ++ V G+ ++V +T + +ID S + AA K Sbjct: 31 GVMLALVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDASTLAAA-----------HKD 79 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + + + + + + + + +++ + + Sbjct: 80 NALDPKQVVLDYFDKAALASYISADDILVGGGETSTAVEVNLTAQVKTPFIRHLGNESFN 139 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVK-------VNSQTDARLDMMIVLD----VSRSME 185 +P + + + +V+ I+ S+ ++ + +D ++ D VS S+ Sbjct: 140 VPARGRAEQAYGNSEVSLVLDISGSMDDNRRMSRLHRAANEFVDTVLTPDSVDRVSVSLI 199 Query: 186 SFFDS-----------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI---- 230 + ++ ++ + ++ FS+ Sbjct: 200 PYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFSTTAIDPEDAYIQGQHFSHVDARFN 259 Query: 231 ---------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-----------YNQI 270 E + L+ +I L+ T+ G+K+ N + Sbjct: 260 YISCPTQSYETVTPFSQNNAALEAQINRLTGRER-TSIHIGIKWGAAMLDEAFRPLVNDL 318 Query: 271 FDM----QGMRQHCNTEDANYKKIIVFMTDGENLSTK----------------------- 303 D + R +N K+IV MTDG N TK Sbjct: 319 VDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKRIKEFAYDTPDMRAHWARHAMDD 378 Query: 304 ----------------------EDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACA 340 + CN AK G I+Y+IG + + + CA Sbjct: 379 WDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANGIIIYSIGFEINNDAAQEMEDCA 438 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S S + + +AFS I + + R+ Sbjct: 439 SSPSHFYRVEGVQISEAFSSIAQQLKQLRLTL 470 >gi|301758046|ref|XP_002914871.1| PREDICTED: LOW QUALITY PROTEIN: vitrin-like [Ailuropoda melanoleuca] Length = 686 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 71/197 (36%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ V+D S S+ ++ + + +E ++ + G V ++ + Sbjct: 503 DIGFVIDGSSSVG------TGNFRTVLQFVANISKEFEISETD---TRVGAVQYTYEQRL 553 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S + IK + + T++ + YA Q+F N +K+ Sbjct: 554 EFGFDDYHTKSDILNAIKRVGYWSGGTSTGAAINYALEQLF---------KKSKPNKRKL 604 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 ++ +TDG + A +G YAIG+ E P + + V Sbjct: 605 MILITDG-----RSYDDVRIPAMVAHHKGVTTYAIGVAWAAQDELEVIATHPASDHSFFV 659 Query: 349 ENPHSMYDAFSHIGKDI 365 + ++Y + ++I Sbjct: 660 DEFDNLYKFVPKVIRNI 676 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 81/207 (39%), Gaps = 31/207 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D M V+D SRS+ + + + + +++V D+++ VQ G+V +S Sbjct: 205 IDFMFVVDGSRSVG----------NESFEVVKHWIQQVTSGFDISSSVQVGVVQYSTYQY 254 Query: 232 EF-------FLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + G + + ST + ++ N+ F R Sbjct: 255 RKVVQPFIKTEIRLGEYKDHILFDAAVDKIKYHDRSTFTAYAIRKTVNEDFKGNMSRY-- 312 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRAC 339 + ++++V +TDG++ ++ + EAK+ G +A+G+ I E + Sbjct: 313 ----PDSRRVMVLLTDGQSTDKEDLSSAAA---EAKQEGVETFAVGVGSKIILSELVLIA 365 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIV 366 SP+ V + + + + + DI Sbjct: 366 GSPDKVITVNDFNELLGIVNQLQGDIQ 392 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|47227632|emb|CAG09629.1| unnamed protein product [Tetraodon nigroviridis] Length = 457 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 77/219 (35%), Gaps = 28/219 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + +SS + D+ VLD S S+ + + + + + V Sbjct: 33 VFASSSWAEEASCHGAYDLYFVLDKSGSVAGDWGEIYS-------FVKNLTDRF-----V 80 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDM 273 + ++ + FS + + L ++ ++ L K T G+K Sbjct: 81 SPRMRVSFIVFSAQAKVLLPLTGDSYKIKEGLRKLYDVKPAGETFMHVGIK--------- 131 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + II+ +TDG+ D ++ NEA+K GA VY +GI+ Sbjct: 132 -EASVQIRAQPSPTSSIILALTDGKLEVYVHDL-TVKEANEARKYGARVYCVGIKDFDEQ 189 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + + V++ + A I I+ + Sbjct: 190 QLANIADTKDQVFPVKDG---FHALKGIVNSILKRSCTE 225 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 69/219 (31%), Gaps = 48/219 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D S+SME + ++ + ++ + K + GL+ F Sbjct: 94 DLMLAVDTSQSMEIQDMRLHGEPVDRLTVIKSVVDDFISHRKND-------RIGLILFGT 146 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G T + A ++ Sbjct: 147 QAYLQTPLTFDHKTVRTLLNESRIGIAGGQTAIGDAIGLALKRL-----------KNHKT 195 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 K+++ +TDG N + + A ++G +Y +G+ Sbjct: 196 GSKVLILLTDGANTAGS--VSPVQAAELAARQGMKIYTVGVGADEMRIPGVLGFGSQIVN 253 Query: 332 -----SHEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 ++ AS ++ N + + HI K Sbjct: 254 PSADLDEVTMKKIASLTGAQYFRARNTDELRRIYQHIDK 292 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/260 (16%), Positives = 85/260 (32%), Gaps = 48/260 (18%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--- 186 + ++P K + I W + P+ V Q R D+M+V+D+S SM Sbjct: 70 TKTPSSRLP-KALSVIIWLLLVTAMARPVWYGEPVEFQPKHR-DLMLVVDLSYSMSQEDM 127 Query: 187 -FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 F I ++ ++ +E K + GLV F++ L L + Sbjct: 128 QFNGEYIDRLSAVKHVLSDFIERRKGD-------RVGLVLFADHAYLQTPLTLDRDTLSQ 180 Query: 246 KIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G T G+ A D +++++ ++DG N + Sbjct: 181 QLNQAVLKLIGTQTAIGDGIGLATKTFVDSDA-----------PQRVMILLSDGSNTA-- 227 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------SHEFLRACA--SPN 343 L + AKK A +Y +G+ L A + Sbjct: 228 GVLDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAKRTGG 287 Query: 344 SFYLVENPHSMYDAFSHIGK 363 ++ + + + I + Sbjct: 288 QYFRARDSKELATIYDTINQ 307 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 36/198 (18%), Positives = 71/198 (35%), Gaps = 28/198 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q A L++ ++LD S SM + M ++ ++++ + + ++ F Sbjct: 34 QQRAPLNLCLILDHSGSMAWQP------LAMVKQAAASLVDRLLPSD------RLSVIAF 81 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +K + + W ++ +I L G T G+K +I + Sbjct: 82 DHKAKVLVPNQTVWDKEAIKAQIATLEP-GGGTAIDEGMKLGLKEIAAGKQGTISQ---- 136 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACASP 342 I +TDGEN ++Q+ L A + + A+G V + + L A A+ Sbjct: 137 ------IFLLTDGENEH-GDNQRCLELAKLAAEYNITLNALGFGVHWNQDVLEQIADAAG 189 Query: 343 NSFYLVENPHSMYDAFSH 360 +E F Sbjct: 190 GRLVFIEYAEQAIACFQS 207 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 314 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 363 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 364 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 414 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 415 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 467 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 468 NKDLNSLIDAFSRIS 482 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 65.2 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 88/260 (33%), Gaps = 38/260 (14%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFF 188 S I+R P C W + + V+ + + + D+++ LD+S+SM++ Sbjct: 53 SVIARRTWPQILCEGAAWCLVV--VALARPQFVEPPIEKVEPQRDILLALDLSQSMDTKD 110 Query: 189 -----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + ++D + + + + + GLV F + + + Sbjct: 111 FPGADGKPLARVDAVKQVVADFVGR-RPGD------RIGLVAFGDAPYPLAPFTMDHALV 163 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + I ++ PG+ + D G+ + +K+++ +TDG T Sbjct: 164 ETMI---------ADAVPGMAGPRTSLGDALGLAVKMFEKTTVPEKVLIVLTDG--NDTA 212 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRACA-------SPNSFYLVENPHSM 354 L AK +G +V+A+GI + L + ++ + + Sbjct: 213 SRMPPLKAAEIAKSKGVVVHAVGIGDPLATGEDKLDTATLQKIAEKTGGRYFFGGDQAQL 272 Query: 355 ---YDAFSHIGKDIVTKRIW 371 Y I K W Sbjct: 273 ASIYQVLDQITPGDQKKLSW 292 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 80/227 (35%), Gaps = 46/227 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+V+D+S SM+ D +++ A + + + + G+V F+ + Sbjct: 107 IDIMLVMDISESMD-LQDFKPNRLEAAKATAIDFI-------NGRFGDRIGMVVFAGEAY 158 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L I+ +S T + A N++ E + Sbjct: 159 SLAPLTNDYKLLTDLIQDISFNMMEAKGTAIGSAIASATNRM-----------KESESAS 207 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEF--------- 335 K+++ ++DGE+ + + L+ A +Y I + V +F Sbjct: 208 KVLILLSDGESNA--GNVDPLFAAQLASALDIKIYTIAVGKDGMVPYGTDFFGRPQMVES 265 Query: 336 ------LRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 LR A + F+ + ++ + F I + I ++ Sbjct: 266 YLDETNLREIAKIGNGEFFRASDGGTLNNIFDRI-DTMEKAEILENR 311 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 77/210 (36%), Gaps = 34/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + ++ + K + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSKTGEKISRLDAAKEVLSDFAKTRKGD-------RLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + ++ L++ G ST+ + A + R Sbjct: 160 ---DAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKSRTDV-- 214 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 + + +K+ + +TDG T + + AK + ++ I + + L Sbjct: 215 -EESKEKVAIVLTDG--NDTGSFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNT 271 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 272 IKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 80/255 (31%), Gaps = 52/255 (20%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS----ITK 194 L + + P + + Q+ +D+++ LD+S SM + + Sbjct: 55 LPYLRLAVLALGIVALARPQAVARESQVQSRG-MDLVLALDLSTSMLAEEQGREGRGENR 113 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL--SK 252 + A + ++ + + + GLV F+ + L LQ ++ L + Sbjct: 114 LAAAKRVLSEFIG-ARKQD------RIGLVAFAGRPYPAAPLTSDHQWLQGIVERLDTNS 166 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + N++ RQ A ++ +TDG N + Sbjct: 167 VEDGTALGDAILAGVNRL------RQRPAEGRA-----LILITDGRNN---AGAEPQLAA 212 Query: 313 NEAKKRGAIVYAIGIRVIRS----------------------HEFLRACA--SPNSFYLV 348 AK G V+AIGI S L+ A + ++ Sbjct: 213 QAAKALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEA 272 Query: 349 ENPHSMYDAFSHIGK 363 + + F+ I + Sbjct: 273 GDATVLSRVFAEIDR 287 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 70/216 (32%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + D +T I+ + + E + + GL+ F++ Sbjct: 86 DLMLSVDLSGSMQIEDMVIDGKVTDRFTLIQHVISQFIERRKGD------RIGLILFADH 139 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + + G T + + ++ Sbjct: 140 AYLQSPLTQDRRTVAQYLNEAEIGLVGRQTAIGEAIALGVKRFDQVENSN---------- 189 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEF---------- 335 ++++ +TDG N + + + A KRG +Y +G+ Sbjct: 190 -RVLILLTDGSNNAGSISPE--QATDIAAKRGITIYTVGVGAEVMERRTLFGKERVNPSM 246 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + S++ N + + I K Sbjct: 247 DLDETQLKQIAEKTGGSYFRARNTEELERIYQEIDK 282 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 59/162 (36%), Gaps = 21/162 (12%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 M+++LDVS SME IT++ +S+ A + + K + GLV F++ Sbjct: 98 MVLILDVSGSMEKNDAQDGITRLQAVQRSVRAFVAQRKTD-------RIGLVIFASSAWP 150 Query: 233 FFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + L +I L+ G T L A ++ D R + ++ Sbjct: 151 FAPISEDKQALLARINQLAPGMIGQQTAIGDALGVAV-KLLDSSLDR--------DASRL 201 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + +TDG T A V+ I I S Sbjct: 202 AILLTDG--NDTASQLSPALAAQLAASHHVQVHTIAFGDINS 241 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 44/260 (16%), Positives = 96/260 (36%), Gaps = 48/260 (18%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK-VNSQTDARLDMMIVLDVSRSMESFFDS 190 SRY + I + I + + V ++ + + +V+DVS SM + D Sbjct: 6 KSRYYVKDIPFMLIIFALAFSIIGLSRPAKVSHLSDINGEGVYISLVVDVSPSMMAE-DM 64 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 T+++ + K++ +++ N + LV+F+ + + + L+ +IK + Sbjct: 65 IPTRLEASKKTMIDFIKK-------RNFDKISLVSFALRASVLSPATFDYTSLEEEIKKI 117 Query: 251 SKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ST+ G+ A + + ++ + KII+ +TDGEN S + D + Sbjct: 118 EIDEEGSTSIGLGIATAVDMLRSVKEDNE----------KIIILLTDGENNSGEIDPKLA 167 Query: 310 YYCNEAKKRGAIVYAIGIRVIRS-------------------------HEFLRACA-SPN 343 A +Y IGI + A + Sbjct: 168 S--EIASNFNIKIYTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGG 225 Query: 344 SFYLVENPHSMYDAFSHIGK 363 ++ +N ++ + ++ I + Sbjct: 226 KYFNAKNASALDNVYNTIDR 245 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 39/207 (18%) Query: 174 MMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +M+ LD+S SME S ++ ++ + + + + GLV F+++ Sbjct: 97 IMLALDLSGSMERVDFSLDGRNVARLAAVKRVGADFIRR-RAGD------RIGLVIFADQ 149 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L + + + + G ST GL A ++ Sbjct: 150 ADVAASLSFDTASVAHALDEAQIGLVGRSTGIGDGLGLALKRLDATPA-----------R 198 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-----------SHEFL 336 +K++V ++DG N + A++ G V+ I + E L Sbjct: 199 EKVVVLLSDGANNA--GQTTPHDVAALARELGIRVHTIALGPRDLSDAEGDPDVVDTEAL 256 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHI 361 R A S F+ V + I Sbjct: 257 RDVATTSGGRFFRVRTTDDLAAVADSI 283 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 78/227 (34%), Gaps = 50/227 (22%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D+M+ LDVS SM E F +++D+ + +E P+ + G+ Sbjct: 90 QASGIDIMLALDVSGSMIAEDFTIGGERASRVDVVKQVTQKFIE---ARPND----RIGM 142 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F+ + L L + + + T + ++ + + + Sbjct: 143 IAFAARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGTAIGSAIASCTTRLIERKDSKS--- 199 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG--------------- 326 +I+V +TDG+N + K L A G VY IG Sbjct: 200 -------RIVVLLTDGDNNAGK--VSPLTAAEAASALGVKVYTIGAGTKGFAPMPVGRDV 250 Query: 327 --------IRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 ++V + L+ A + FY + S+ + I + Sbjct: 251 FGRKVYQNVKVDVDEDTLKKIADMTKAKFYRATDTKSLTQIYEEIDQ 297 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ + C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ +S+ DAFS I Sbjct: 463 NKDLNSLIDAFSRIS 477 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/228 (19%), Positives = 78/228 (34%), Gaps = 44/228 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ VLD S SM T+ D A +I +E + P GLV F ++ Sbjct: 91 VDIVFVLDQSPSMIVRDFGGDTRFDAAKHAIRTFVEGREHDP-------LGLVIFGDEAA 143 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + ++ + + S GL A H A +K++ Sbjct: 144 LVTPPTLDYTSFLSRMDAVRVMKLGRGSALGLGMAV--------ATVHLEKSSAE-RKVM 194 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--------------------RVIR 331 V ++DGEN + + +S A G +YA+G+ Sbjct: 195 VIVSDGENNAGEITPESA--ARVAASLGIRIYAVGVGGEGSVATEFTDPETGKSYRGTYE 252 Query: 332 SH---EFLRAC--ASPNSFYLVENPHSMYDAFSHI-GKDIVTKRIWYD 373 E L+A ++ +L +P ++ F I + V R D Sbjct: 253 GKIDMELLKAVTESTRGQAFLAGSPGALSQVFREIDALESVELRSTTD 300 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 80/234 (34%), Gaps = 47/234 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T ++ + +D+M+ +DVS SM + D + +I V+ P+ Sbjct: 73 PRTFTISQDRDDTKGVDIMLSIDVSLSMLAK-DLNPDRITALKDIAVKF---VQKRPND- 127 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 + G+V ++ + + + +IK L+ G T GL A N + + Sbjct: 128 ---RIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNSAGLEPGTAIGEGLSVAVNHLVKSK 184 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----- 329 K+++ MTDG + + AK VYAIGI Sbjct: 185 AKS-----------KVVILMTDGVSNIQNAIPPQVA-AELAKNNNIKVYAIGIGTNGYAL 232 Query: 330 -IRSHEF-----------------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 S + LR A + ++ + S+ + + I + Sbjct: 233 MPTSQDIFGDLVFTETEVTIDENTLREIAQTTGGKYFRATSNSSLEEVYDEINQ 286 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 69/200 (34%), Gaps = 28/200 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++ ++D S S+ T+ D + + + V F+ Sbjct: 22 MNLVFLIDNSGSIND------TEFDNFKEFAKKL---AESFTISATYTHVAAVYFNTLAN 72 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L++ ++ ++ I L G T+ L Y + +F + N K ++ Sbjct: 73 FGFNLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVFKV--------APRQNVKNVL 124 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSH-EFLRACASP---NSF 345 V +TDG K + G V+A+G+ S L AS + Sbjct: 125 VVLTDG-----KSHDSVTLPAAAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHV 179 Query: 346 YLVENPHSMYDAFSHIGKDI 365 + VE+ + + +I Sbjct: 180 FHVEHFSQIESTTGAVEDEI 199 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 51/299 (17%), Positives = 107/299 (35%), Gaps = 38/299 (12%) Query: 94 FRNELRDNGFVNDIDDIVRST---SLDIVVVPQNEGYSISAIS---RYKIPLKFCTFIPW 147 E+ + F D+D ST S++ V+P+ + + Y+ P P+ Sbjct: 158 VAGEIPVSTFSIDVDTGSYSTLRRSINHGVLPERGTVRVEELINYFAYQYPAPDAGEQPF 217 Query: 148 YTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 N+ P + ++ +LDVS SM SS K+ + Sbjct: 218 SVNTELAPSPYNPHKMLLRIGLKGFEKEKADLGASQLVFLLDVSGSM-----SSQDKLPL 272 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ + +++ ++ VV +G + + + + + + L G ST Sbjct: 273 LKNALKMLSQQLDEGDRISIVVYAGA----SGVVLDGVKGNDTLAISQALDKLKA-GGST 327 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAK 316 N G++ AY ++H N ++ TDG+ DQQ+L E + Sbjct: 328 NGGAGIELAYQL------AQKHFIAGGVNR---VILATDGDFNVGVSDQQALEDMIEEKR 378 Query: 317 KRGAIVYAIGIRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 K+G + +G ++ + A + + ++ +A + +I + K Sbjct: 379 KQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAY-IDTLNEARKVLVDEISATLLTIAK 436 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 93/264 (35%), Gaps = 33/264 (12%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + R + +F ++ + N D ++ + + Y + + Sbjct: 193 AVKRTADQPVSTFSIDVDTAAYANVRRFIDNGRAPPRDAVRVEEMINYFDYGYTRPTSAA 252 Query: 137 IPLKFC---TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 P T PW R + + + ++ L++ ++DVS SM S Sbjct: 253 RPFAVTATTTASPWSEGRRIVHVGLQG-YELPENQRRPLNLTFLVDVSGSMNS-----PD 306 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS 251 K+D+A +S+N +++ ++ + + ++ G L+ + L Sbjct: 307 KLDLAKQSMNLIIDRLRPQD------RVAVAYYAEGAGTTLAPTAGTQKLKLRCAVASLR 360 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLY 310 G T G+ AY+Q + +I++F TDG+ N+ +D++ Sbjct: 361 ASG-GTAGATGMTNAYDQ--------AQASFGRNKVNRILMF-TDGDFNVGVTDDKRLED 410 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHE 334 Y + ++ G + G + Sbjct: 411 YVADKRRTGIYLSVYGFGRGNYQD 434 >gi|112734845|ref|NP_065933.2| collagen alpha-1(XX) chain [Homo sapiens] gi|292495087|sp|Q9P218|COKA1_HUMAN RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|66347352|emb|CAI95065.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595695|gb|EAW75289.1| collagen, type XX, alpha 1, isoform CRA_e [Homo sapiens] Length = 1284 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 74/219 (33%), Gaps = 39/219 (17%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--------DMMIVLDVSRSMESFFD 189 P T P + + T S + + DM+ ++D S S+ Sbjct: 136 PTPSHTGSPDPEQASEPQVAFTPSQDPRTPAGPQFRCLPPVPADMVFLVDGSWSIGH--- 192 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----- 244 + + +++ ++ PD VQ GL +S + EW ++ L Sbjct: 193 ---SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDAQT----EWDLNSLSTKEQV 242 Query: 245 -RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G +T + L + + Q K+++ +TDG K Sbjct: 243 LAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLRPEAAKVVILVTDG-----K 291 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K G V+A+G++ E LR ASP Sbjct: 292 SQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 329 >gi|45946128|gb|AAH43183.1| Collagen, type XX, alpha 1 [Homo sapiens] Length = 1284 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 74/219 (33%), Gaps = 39/219 (17%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--------DMMIVLDVSRSMESFFD 189 P T P + + T S + + DM+ ++D S S+ Sbjct: 136 PTPSHTGSPDPEQASEPQVAFTPSQDPRTPAGPQFRCLPPVPADMVFLVDGSWSIGH--- 192 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----- 244 + + +++ ++ PD VQ GL +S + EW ++ L Sbjct: 193 ---SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDAQT----EWDLNSLSTKEQV 242 Query: 245 -RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G +T + L + + Q K+++ +TDG K Sbjct: 243 LAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLRPEAAKVVILVTDG-----K 291 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K G V+A+G++ E LR ASP Sbjct: 292 SQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 329 >gi|119595694|gb|EAW75288.1| collagen, type XX, alpha 1, isoform CRA_d [Homo sapiens] Length = 637 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 74/219 (33%), Gaps = 39/219 (17%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--------DMMIVLDVSRSMESFFD 189 P T P + + T S + + DM+ ++D S S+ Sbjct: 136 PTPSHTGSPDPEQASEPQVAFTPSQDPRTPAGPQFRCLPPVPADMVFLVDGSWSIGH--- 192 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----- 244 + + +++ ++ PD VQ GL +S + EW ++ L Sbjct: 193 ---SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDAQT----EWDLNSLSTKEQV 242 Query: 245 -RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G +T + L + + Q K+++ +TDG K Sbjct: 243 LAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLRPEAAKVVILVTDG-----K 291 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K G V+A+G++ E LR ASP Sbjct: 292 SQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 329 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 48/372 (12%), Positives = 117/372 (31%), Gaps = 62/372 (16%) Query: 36 MIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFR 95 ++V+++ ++T L + D S + + + S Sbjct: 33 FTVDVAYMQLVRTELRAATDASAKAGMEAL--RRTQDTEAAIDAAIATAAANKVGGRSLT 90 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 F ++ S S + +P SA++ F + Sbjct: 91 LTADQIEFGLAFRNVDNSVSFNAGQLPYTAVRVNSAMTESSAAGAVPLFFGSIFGTGQFE 150 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSM-------ESFFDSSI---------------T 193 P S+V +++ +++ +D S SM + + + Sbjct: 151 -PTRSAVSASTE----VEICFAIDRSHSMCFDLTGVDWSYPPGTPRNPDPVAFPPHPTLS 205 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-----------------FFLL 236 + +++ + + +VT+++KI + L Sbjct: 206 RWASLSRAMQTFVSITASQEPKP---RVAMVTWASKITQSNYEGKLTKTNSPEVFVDVPL 262 Query: 237 EWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 ++ L + IK S+ +TN G+ A + + R + + +II+ M Sbjct: 263 TTNLADLNQAIKGRSEKVMLGATNMAAGIDEARKILNATKSTRPYAH-------RIIILM 315 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF-YLVENPHS 353 TDG + + L +A G +++++ + + ++ Y N + Sbjct: 316 TDG---LWNQGRNPLLAAQDAANEGIVIHSVSLLPRSGDITPQVSSTTGGVNYPATNSAA 372 Query: 354 MYDAFSHIGKDI 365 + AF+ I + + Sbjct: 373 LEAAFADIARTL 384 >gi|221109526|ref|XP_002169886.1| PREDICTED: similar to coagulation factor C homolog, cochlin [Hydra magnipapillata] Length = 336 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSG 222 V + +A +D+ +LD S S+E+ + L+++ + ++ N + G Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKE-----------KNFLKKLAAVFGISSNGSRVG 57 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++TFS + E L S + + +T L+ A +F Sbjct: 58 VITFSYRAELSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTS------A 111 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRAC 339 N K+I+ +TDG + + +E + G ++ +GI + E Sbjct: 112 NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 340 ASPNSFYLVENPHSMYD 356 + Y S+ D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|115496702|ref|NP_001068594.1| collagen alpha-2(VI) chain [Bos taurus] gi|94574217|gb|AAI16098.1| Collagen, type VI, alpha 2 [Bos taurus] gi|296490819|gb|DAA32932.1| collagen, type VI, alpha 2 [Bos taurus] Length = 917 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 62/164 (37%), Gaps = 20/164 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + ++ L + P + G+V +S++ Sbjct: 611 GALDVVFVIDSSESIGY---TNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHE 667 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + LK+AYN++ + Sbjct: 668 GTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALKFAYNKLI--------KESRR 719 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V +TDG + +D CN V AIGI Sbjct: 720 QKTRVFAVVITDGRHDPRDDDLNLRALCNHE----VTVTAIGIG 759 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 86/226 (38%), Gaps = 13/226 (5%) Query: 148 YTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRS--MESFFDSSITKIDMAIKSINA 204 + + ++ P S + D + + VLD S S M+S DS + + + + Sbjct: 19 HAQQQEVISPGPSDRNSCPEKADCPVHVYFVLDTSESITMQSPTDSLLYHMQQFVLQFIS 78 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 L++ + V + G + FS+ +E F + + ++ +S F T T + Sbjct: 79 QLQDELYLDQVALSWRYGGLHFSDLVEVFSPPGSDRASFTKSLQSISSFRRGT-FTDCML 137 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Q +R+H N V +TDG + L A++ G ++A Sbjct: 138 ANMTQ-----EVRRHVGKGVVN---FAVVITDGHVTGSPCGGIKLQ-AERAREEGIRLFA 188 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + + LR A+ N +M + I +D + + I Sbjct: 189 VPPNLKLNEQGLRDIANTPHELYRNNYATMRPDSTEIDQDTINRII 234 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 27/199 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 L +++V+D S SM ITK++MA ++ E ++ G+ Sbjct: 397 SKKADIPSLGLVLVIDKSGSMTDGQYG-ITKLEMAKEAAIRSTEALRPTDS------VGV 449 Query: 224 VTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F + + ++ +Q I + G TN P L AY + + +H Sbjct: 450 ICFDDAASWVVGMRQADDLAEIQDSIGTIRP-GGGTNMYPALDLAYKALEEADTKLKH-- 506 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 I+ +TDG++ + D + + G + ++ + + L A Sbjct: 507 ---------IIVLTDGQSATGDFDG----IAHRMAEDGITLSSVAVGMDADKNLLSRLAE 553 Query: 342 PNS--FYLVENPHSMYDAF 358 + +Y + ++ Sbjct: 554 IGNGRYYYTDEFSNIPKIL 572 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 74/204 (36%), Gaps = 13/204 (6%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + + ++LD S SM + + T+I + M++ +K NV + ++TF Sbjct: 15 IPIFLLLDTSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAKRENVSKLAVITFGA--- 71 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ +S ++ S G N+ G+ + + Y + Sbjct: 72 GGVKLQTPLSKIESI--QFSPLGTGGNTPLGMALELTRDYIQNKDTF----PGKFYTPYV 125 Query: 292 VFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRS--HEFLRACASPNSFYLV 348 V ++DGE + + N+ + ++ Y++ I SPN Y Sbjct: 126 VMVSDGEPNDDWQGPLHDFIHNKENRSSKSVRYSVFIGNEGEEPQAVHDFSGSPNQVYYA 185 Query: 349 ENPHSMYDAFSHIGKDI-VTKRIW 371 + S+ + F I + ++I Sbjct: 186 NDVQSLINCFKAITASVTQGRKIT 209 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 74/215 (34%), Gaps = 28/215 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 NS + + S + L++ +VLD S SM ++ K+ +++++ Sbjct: 19 NSSQRQLSLAISAQGEPGRTLPLNLCLVLDHSGSMGGRP------LETVKKAAIELVKQL 72 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAY 267 + ++ F ++ + + G+ L I+ + T GLK Sbjct: 73 NPED------RVSVIAFDHRAKVIVPNQ-GIEDLNTVIEQIKALRAAGGTAIDEGLKLGI 125 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + R I +TDGEN ++++ L A + +G Sbjct: 126 KESALGKQERVSQ----------IFLLTDGENEH-GDNERCLKLAQVASDYNITLNTLGF 174 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + + L A S +ENP + FS Sbjct: 175 GNHWNQDVLEKIADSAGGSLSYIENPEKALEEFSR 209 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFITKREGDRVGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTMDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ N + + I + Sbjct: 249 EKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 43/339 (12%), Positives = 115/339 (33%), Gaps = 41/339 (12%) Query: 55 DRSLVH-AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 D +++H A+ ++ + + + + + SF + N + + Sbjct: 42 DSAVIHFGASVVIKTSPMKSIAYVGAPADGNKNGSVYKCSFSGKSYGNTSCSKDISFDNN 101 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---------PWYTNSRHIVMPITSSVKV 164 ++ + + + ++ T + S M T V Sbjct: 102 AAIGMSIGADSSFSNLYICGNQHTTSCPTTPLQKRMVGACYKKPMTSSTATMFKTPCVPG 161 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV--NNVVQSG 222 + D+M VLD S S++ ++N +++ V + ++ G Sbjct: 162 CPKIILIADIMFVLDDSSSVDDT---------AFRSALNWIIQVVSYFSSYIDSGDLRVG 212 Query: 223 LVTFSNKIEEFF----LLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQG 275 + FSN L +W + L+++I L G T + +K + Sbjct: 213 VYGFSNDDHRSGIRIGLRKWTSATLKKQIGELLNVKSTGAGTYISHAIK---------ET 263 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHE 334 ++ K I+ +TDG + + A+ +G ++ ++G+ + + + Sbjct: 264 VKVFEANGRKGISKEIILLTDG---GASDWWLLKGEADTARDKGIVLVSVGVGTSVNNDQ 320 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 L + + + + +++ + +++ I +I + Sbjct: 321 LLAIAGNKSRVFQATDYNTLDEVVNNVVSTIDAIKITVN 359 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 63/173 (36%), Gaps = 27/173 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + +T++++ + +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYPDMQWQGEELTRLELVKVLLGDFIEQ-------RHGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + ++ + + G +T + A ++ E Sbjct: 144 KAYLQAPLTFDRRTVRVWLDEARVGIAGSNTAIGDAIGLAVKRL-----------RERPT 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRA 338 +++V +TDG N + L A + ++ IGI L Sbjct: 193 NSRVLVLITDGANNG--GELDPLLAATLAAEESVRIHTIGIGAVPEEGGVLSR 243 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 74/194 (38%), Gaps = 28/194 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + +++++D S SM + F ++K DMA ++ E ++ + GL+ F Sbjct: 393 QRSDVALLLIMDRSASMLASFG--VSKFDMAKEAAQLATESLQPED------RIGLLAFD 444 Query: 228 NK---IEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + F L+ G+S +Q +I L G T L+ + + +H Sbjct: 445 TETLWVVPFQLISGGLSVAQIQEQIASLPS-GGGTRIERALEVGLPALAEQPTKVRHA-- 501 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 V +TDG + ++ A+ + + I I + L+ A+ Sbjct: 502 ---------VLLTDGRSF-MNDNALYQRLVETARSQQITLSTIAIGLDSDTALLKQLAAW 551 Query: 342 -PNSFYLVENPHSM 354 +Y + P + Sbjct: 552 GGGRYYYADQPADI 565 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFITKREGDRVGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ N + + I + Sbjct: 249 EKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 53/251 (21%), Positives = 92/251 (36%), Gaps = 37/251 (14%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 ++V PQ+ G + + I ++ LK + + + + +S LD+++V Sbjct: 8 MIVTPQDSGSTRTPIVPGRVQLKCMNSTTAPLEESKLKVMLELTGGDSSNDRPGLDLVVV 67 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD+S SME KI+ +I M++++ I + +VTFS L Sbjct: 68 LDLSGSMEG------EKIEKLKAAILFMIKKLSSID------RLSIVTFSRDARRLCPLR 115 Query: 238 ----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ I L +G N T GL+ + D + T I+ Sbjct: 116 QITENSQKDLENLINGLHAYGA-ANITAGLQTGLKVLNDRRFTGGRVAT--------IML 166 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVE 349 ++ E N DQ + V+ G L+A A +F V+ Sbjct: 167 VSSSEQNNGDDADQILVG--------NVPVHTFGFGAYHEPGVLKAIAHNSIGGTFSDVQ 218 Query: 350 NPHSMYDAFSH 360 N ++ AFS Sbjct: 219 NMDNLNKAFSQ 229 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 68/186 (36%), Gaps = 22/186 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ +LD S S+ + K+ L+ V Q G++ +S + Sbjct: 1 PLDVIFLLDGSGSVG---VVNFEKVKQF------TLKTVIGFDIGPTATQVGIIQYSTRP 51 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F + L I+ ++ T + ++Y F + Sbjct: 52 RQEFSMNSFQTKESLSTAIENVNYMAGGTLTGRAIRYVTKYGFGES------DGARPGIP 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 KI++ +TDG EA+++G +YAIG+ + R ++ + + Sbjct: 106 KIVILVTDG-----VSSDDIEQPALEAQQKGISLYAIGVSGYDMDQLERIASNNRTLAVA 160 Query: 349 ENPHSM 354 EN + + Sbjct: 161 ENFNLL 166 >gi|2707733|gb|AAD03350.1| microneme protein precursor Etmic-1 [Eimeria tenella] Length = 712 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 65/175 (37%), Gaps = 25/175 (14%) Query: 154 IVMPITSSVKVNSQTD---ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + Q + LD+M+V+D S S+ + S+ K+ I+ + V Sbjct: 28 FSSLQPGATTSSGQDQVCTSLLDVMLVVDESGSIGT---SNFRKVRQFIE------DFVN 78 Query: 211 LIPDVNNVVQSGLVTFSNKIEE----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +P V+ GL+TF+ + + S + LS T + GL+ A Sbjct: 79 SMPISPEDVRVGLITFATRSKVRWNLSDPKATNPSLAISAARSLSYSTGVTYTHYGLQDA 138 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 ++D N K+++ MTDG + Q+ + GAI Sbjct: 139 KKLLYDTN------AGARNNVPKLVLVMTDGASNLPS---QTRSSAAALRDAGAI 184 >gi|327260860|ref|XP_003215251.1| PREDICTED: collagen alpha-2(VI) chain-like [Anolis carolinensis] Length = 1019 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 24/184 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ ++D S S+ + + + ++ + L + P + G+V +S++ Sbjct: 609 GPLDIVFIIDSSESIGYN---NFSLEKNFVINVVSRLGSIAKDPKSETGARVGVVQYSHE 665 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + +K L T + L++AYN + ++ Sbjct: 666 GTFEAIQLNDKRIDSLSSFKEAVKKLEWIAGGTWTLSALQFAYNTLI--------KESQR 717 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACA 340 + V +TDG + D C R V AIG E LR+ A Sbjct: 718 EKARVFAVVVTDGRHDPRDNDSHLQALCG----RNVTVTAIGIGDMFNAKEEDETLRSIA 773 Query: 341 SPNS 344 ++ Sbjct: 774 CNDN 777 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 69/186 (37%), Gaps = 12/186 (6%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + V T +D++ +LD S E + K ++ + L + D Sbjct: 812 ELPCQTELAVAQCTQRPVDVVFLLDGS---ERIGGLNFHKAHHFVEDVARHLTLARSNSD 868 Query: 215 VNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N + L+ + ++ E F L + ++ + + + S+N + YA N + Sbjct: 869 NMN-ARIALLQYGSENEHVVAFPLTYNITEISDALAQIRYLDSSSNLGSAIIYAVNNLVI 927 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 RQ +A VF+TDG + D+ + KK+ + + + Sbjct: 928 NPRDRQRAARRNAELS--FVFITDGITGNKNLDEAI----DSMKKQNVMPTVVALGSDVD 981 Query: 333 HEFLRA 338 + L Sbjct: 982 MDVLHK 987 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 57/190 (30%), Gaps = 13/190 (6%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 I + +TD + + V+D S S+ + +D + + + ++ N Sbjct: 27 ILGASSCADKTDCPVRVYFVIDTSESIALQTVPIQSLVDHIKRFVPEFITRLENELYQNQ 86 Query: 218 VV---QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V Q + FS+ + + K+ + G T + Sbjct: 87 VSITWQFAGLHFSDVVIFYSDFTNSKEIYLDKLNNIQYIGRGT---------FTDCALSN 137 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 Q V +TDG + + A++ G ++A+ + Sbjct: 138 MTAQILANTSPGITNYAVVITDGHVTGSPCGGM-KHQAERAREAGIKLFAVAPSQNIYEQ 196 Query: 335 FLRACASPNS 344 LR AS Sbjct: 197 GLREIASSPH 206 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 72/204 (35%), Gaps = 13/204 (6%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + A + ++D S SM + + + + I ++ + Sbjct: 273 GADKSVNEPAVTRVAFLIDNSGSMYPKELCPTSSENDVDFKRLDFTQSL--IDKFDDDFR 330 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G+ F+ + + L++ + + YNQ + + + Sbjct: 331 IGISKFTGTYTKMCDFTDDRTELRKVLNRIRTEDE------IFDGTYNQTALKKCINEFS 384 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D Y IIV ++DGE S + D +++ N A ++ IV +G+ +L+ Sbjct: 385 AAGDGKYVNIIVMLSDGE--SDEVDAETIESLSNLANEKSVIVLTVGLGREIDRAWLQEV 442 Query: 340 A--SPNSFYLVENPHSMYDAFSHI 361 A + +Y + S+ D + I Sbjct: 443 AYSTGGKYYSASDATSLDDVYKQI 466 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 74/225 (32%), Gaps = 39/225 (17%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S+ +D+MI LDVS SM + ++ A + ++ + + GL Sbjct: 90 PSSRDTVGIDIMIALDVSDSMNTPDFGGKSRFAGARTAAMRFIDN-RPAD------RIGL 142 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V FS L L R + ++ T + A N++ Sbjct: 143 VVFSGGSFTRCPLTLDHEVLGRLAETVAPGFFDEPGTAIGTAILTATNRL---------- 192 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------------ 328 ++ +K +V +TDGEN + + ++ A G +Y + Sbjct: 193 -KASSSKEKALVLITDGENNAGEVTPETA--ARLAANYGIRIYTVFAGKEARAFENTSNT 249 Query: 329 --VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + L A S + + + +F I + T+ Sbjct: 250 ALNRKGRSELETVARISGGRMFSAGDVFGLMKSFRDIDRLEKTRL 294 >gi|326426687|gb|EGD72257.1| hypothetical protein PTSG_00277 [Salpingoeca sp. ATCC 50818] Length = 2847 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/190 (17%), Positives = 61/190 (32%), Gaps = 16/190 (8%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S+ S + + +++E+ + + + + FS+ Sbjct: 454 GASDIAFILDSSGSILPEDYS------LLKQFAENVVDELVISESES---RIAVEIFSDV 504 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E + + I+ L T + L+ A IF + Sbjct: 505 QEVSLQFSYDKPTIVDTIRNLRHIQRGTATGAALELARRDIFSSSDPNYR----NLAVPA 560 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLV 348 I V +TDG++ ++ G V A+GI E L S + V Sbjct: 561 IAVVITDGQSQEEPA--FVAEQALRLRELGVTVLALGIGDGTDPDELLAIAGSADRVEQV 618 Query: 349 ENPHSMYDAF 358 E + F Sbjct: 619 EQFGDLVQEF 628 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 68/210 (32%), Gaps = 20/210 (9%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +D+ ++D S S+ S + + ++ + + + P N V Sbjct: 813 GPSGGAECVGRAIDVFYLIDGSGSIVS------SDFERERTFLSDLTDALFAFPG--NDV 864 Query: 220 QSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + + +S + + Q I + + G +T + L A + I Sbjct: 865 RISIAEYSTTYTQVLVPYTADETTAQNTISNVVQSGGATATGTALGLAADDI-------- 916 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLR 337 + +++V +TDG S + Q ++ G + AIGI E L+ Sbjct: 917 -GANARPDAARVLVLLTDGA-TSDGDQQNIDPSVSDLNSIGVSITAIGIGDNADETELLQ 974 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 P + + D I ++ Sbjct: 975 IAGDPTRVFNNIAFVDLGDFIDEIIGSVLD 1004 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 70/204 (34%), Gaps = 28/204 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSN 228 D++++ D S S+ + +SI + + V+ D+ + +TF Sbjct: 1248 GSTDLVLIQDNSGSISE---------EDFQQSIAFLRDIVQTQIDLEGTGSRVAAMTFCA 1298 Query: 229 KIEEFFLLEWGVSHLQRKIK-YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ T + L +A I + + Sbjct: 1299 TQTLLTTFTGVTTDALNALDSAVNTVSCGTATGAALDFAIANILNDNS--------NPGA 1350 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-NSFY 346 +++++ +TDG + + + V+AIG+ + L+ A+ ++ + Sbjct: 1351 RRVVIVVTDGASQEDSSFVTAAS--DRLHAEVDEVFAIGVGPASNLAELQTIATSNDNAF 1408 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 + + ++ D +D+ + Sbjct: 1409 IETDFDALRD------RDLTQQLC 1426 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/264 (14%), Positives = 90/264 (34%), Gaps = 20/264 (7%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + +G + + + + +I ++G++++V HI+ + L + D + + A I Sbjct: 11 FLRDQRGQVLPIAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALAGAEIIPTATT 70 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + N +G+ T I V SI Sbjct: 71 AAAVYAKATAYSSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQG-ISCVGPLSYNSIQ 129 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FD 189 + + +PL F I + + TS+ + ++ +VLD + S S+ D Sbjct: 130 VMQQAVVPLYFARII----GRSSMTISATSTAAKGGASSRPYNVALVLDTTYSEISYDSD 185 Query: 190 SSITKIDMAIKSINAMLEEVK------------LIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +++ ++ + +L ++ N+VV+ G+ TF + + Sbjct: 186 CGNSQMLCTLQGVQILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFPQMVTSTVSSD 245 Query: 238 W--GVSHLQRKIKYLSKFGVSTNS 259 + + + G T + Sbjct: 246 YDCSSNTPANTVYTFPIPGAGTYA 269 >gi|15072454|gb|AAK38350.1| von Willebrand factor A-related protein [Mus musculus] Length = 415 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 69/180 (38%), Gaps = 22/180 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S + ++ +D + D++ +LD S S+ + S + + + ++ + P Sbjct: 15 LALARSSIERGSTASDPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVATMSFGP 68 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +++ LV ++ F + S ++ I+ + TN+ L YA Q+F Sbjct: 69 GA---LRASLVHVGSQPHTEFTFDQYSSGQAIRDAIRVAPQRMGDTNTGLALAYAKEQLF 125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K++V++TDG + E K G ++ + Sbjct: 126 AEE------AGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTIFIVSTGRGN 174 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 69/206 (33%), Gaps = 31/206 (15%) Query: 173 DMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+++ LD+S SME D A++ + A + + GLV F ++ Sbjct: 88 DIVLALDLSGSMEREDFVLDGRTVSRLAAVQDVAAQFVRGRTGD------RVGLVVFGDR 141 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 V + + I L G +T GL A ++ Sbjct: 142 AYVAAAPTHDVGAVAQVIGTLQIGVSGKATAIADGLGLAIRRL-----------RGREAE 190 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 ++I+ ++DG+ T + A++ G VY I + +SP++ Sbjct: 191 SRVIILLSDGQ--DTTGAVDPVAAAQAAQELGMRVYTIALGPAD------LASSPDAR-D 241 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWYD 373 + ++ G + R D Sbjct: 242 AVDSETLRRIAEVAGGETFRVRSTED 267 >gi|189526999|ref|XP_691588.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 554 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 75/212 (35%), Gaps = 28/212 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D+ VLD S S+++ + I + +E + + +++ Sbjct: 32 EGKMSCQGAFDLYFVLDKSGSIQNHW-----------IEIYSFVEHLAEKFT-SPMLRMS 79 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + FS + L + R + L + G T GL+ A QI+ H Sbjct: 80 FIVFSTRGTTIMRLTENRDDITRGLNTLKREIPGGDTYMNLGLEEANVQIY-------HG 132 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N A+ +I+ +TDGE + + A+ GAIVY +G++ + Sbjct: 133 NYGAAS---VIIALTDGELND-HQFVTAQQEAQRARSMGAIVYCVGVKDFNETQLATIAD 188 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + V + A + I+ K Sbjct: 189 TIEHVFPVIGG---FQALEGMIDSIIKKSCIE 217 >gi|167526012|ref|XP_001747340.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774175|gb|EDQ87807.1| predicted protein [Monosiga brevicollis MX1] Length = 1632 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 72/196 (36%), Gaps = 15/196 (7%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVTFS 227 + +D++ VLD S S++ I +++V D++ N + G++TFS Sbjct: 328 KSAIDLIFVLDGSGSIDRESYGGTP--GNFQYKILDFVKQVVSYFDISANATRVGVITFS 385 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + S + I ++ ST + L + E Sbjct: 386 SSAVINFNLNSFYDKSDMLDAIDNINYPASSTRISLALASVRQNML---KEYNGMRPESE 442 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K++V +TDG+ + Y K G +++IGI + L A+ Sbjct: 443 GVPKVVVVLTDGQ---ASSGYEPAYEAALLKDMGVNMFSIGIGSSIDTDQLEDMATAP-- 497 Query: 346 YLVENPHSMYDAFSHI 361 L + H + F I Sbjct: 498 -LASHMH-LLKNFDAI 511 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 70/222 (31%), Gaps = 27/222 (12%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + G + ++ A+ L I +G I+ S +T + + +D + + + + + Sbjct: 16 RFVGDDGGNIAVIFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTALMVSKDLGADK 75 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + K + G + + S VV +G Sbjct: 76 IKTSEVSEKAQTYFNSLYT--------GTEARGVTLTTNYTAKDDSGSSTVVVNGDG--- 124 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + S + RL + + LDV+ SM Sbjct: 125 -------------AVSTHFMKMFGFPSLAIGSAATATWGGTRLRVAMALDVTGSM---VL 168 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + TK+ K+ +A+++ ++ + + +V F+ + Sbjct: 169 NGSTKLAEMKKAASALVDTLRASAQSKDDLYISVVPFAQMVN 210 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 50/151 (33%), Gaps = 22/151 (14%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ I L G TN G+ +A+ + + + Y I+ ++DG N Sbjct: 313 QKIKDVIDDLVADG-GTNQPIGMAWAWMSLQQGNPLNTPAKDPNYKYTDAIILLSDGLNT 371 Query: 301 --------------STKEDQQSLYYCNEAK---KRGAI--VYAIGIRVIR--SHEFLRAC 339 K D + C+ K G VY I + L+ C Sbjct: 372 MDRWPDYGDGQRQFDGKIDARQKLLCDNIKLPDSNGKRPVVYTIQVNTTGDPESTILKYC 431 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A +F+ + AF+ IG + RI Sbjct: 432 ADGGNFFATTTASGIGTAFAQIGSSLSKLRI 462 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 70/215 (32%), Gaps = 44/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +DVS SM ++ ++ A + L+ + + GL+ F + Sbjct: 100 QMMLAMDVSGSMGEGDMVLGGQAVDRLTAAKAVLADFLDR-RAGD------RIGLLVFGD 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + A ++ Sbjct: 153 RAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG---------- 202 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------I 330 +++++ +TDG + + + L A+ G ++ + Sbjct: 203 -QRVLILLTDGVSNA--GVLEPLRAAEVARAEGVRIHTVAFGGDGSMRLFGIPISADQDP 259 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + F+ + + ++ + + Sbjct: 260 VDEATLKKIATMTGGQFFRARDTAQLAGIYAELDR 294 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 64.8 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 74/210 (35%), Gaps = 30/210 (14%) Query: 167 QTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 Q + ++M+ LD+S SM E F D K D + + ++ + GL+ Sbjct: 89 QKKSAREIMVALDLSGSMSEEDFADKKGNKHDRLTIA----KQVLREFAAQREHDRLGLI 144 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F++ ++ Q ++ L G T + G+ Sbjct: 145 LFADSAYVQAPFTEDINVWQSLLEDVELGYAGFKTAFGDAI-----------GLSIAVFE 193 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR----- 337 ++ + +++++ +TDG+ T + A K G +Y I I + + Sbjct: 194 QEQSRQRVMILLTDGD--DTSSKMPPVKAAEIAAKYGVKIYTIAIGDPSTKGRYKMDLPT 251 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 + A+ + + + A++ I + Sbjct: 252 LEKVSAATGGQMFHAMDRKQLDQAYATIDQ 281 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 36/204 (17%) Query: 174 MMIVLDVSRSME-----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SME +ID + L + + GL+ F + Sbjct: 95 MMVAVDLSGSMEARDFVDLKGDKTRRIDGVKSLLLDFLAQRASD-------RVGLIAFGD 147 Query: 229 KIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L +K + G T + A N H +T++ Sbjct: 148 AAYLQAPFTEDKGALSLLLKEMDVRMAGAGTALGDAIGVAVNHF-------SHSDTDN-- 198 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRAC 339 K+++ +TDG T + L A ++G ++Y I I + E L+ Sbjct: 199 --KVLLLLTDG--NDTSSEFPPLEAARYAAQQGIVIYPIAIGDPANVGEDSLDIEMLQQI 254 Query: 340 A--SPNSFYLVENPHSMYDAFSHI 361 A + + ++ + +S I Sbjct: 255 ADITYGQVFEAQDGEAFTQVYSII 278 >gi|260808371|ref|XP_002598981.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] gi|229284256|gb|EEN54993.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] Length = 193 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 61/203 (30%), Gaps = 23/203 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +LDM+ VLD S S+ K + + + + P + GL+ ++ Sbjct: 1 KHQQLDMVFVLDGSGSI------QAVNFAKVKKFAVDLSDGLNISPTA---TRVGLIEYT 51 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + L I +S T + L A Q+ RQ Sbjct: 52 DSPTVEFKLADHTNKASLATAINNVSYQSGGTQTGRALDAARTQM----DWRQPPVPNVC 107 Query: 286 -NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASP- 342 + + + +TDG + Y +GI I ++E Sbjct: 108 FSLLQAAIVVTDG-----MSGDNVQQPAKALRDNDISAYGVGIGPAINANELNEIAGGDA 162 Query: 343 NSFYLVENPHSMYDAFSHIGKDI 365 + + N + I + Sbjct: 163 GHVFYIPNYDKLEKEMEKISNSV 185 >gi|119595693|gb|EAW75287.1| collagen, type XX, alpha 1, isoform CRA_c [Homo sapiens] Length = 1329 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 31/178 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 DM+ ++D S S+ + + +++ ++ PD VQ GL +S Sbjct: 184 PADMVFLVDGSWSIGH------SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDA 234 Query: 231 EEFFLLEWGVSHLQ------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + EW ++ L ++ L G +T + L + + Q Sbjct: 235 QT----EWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLR 284 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +TDG K K G V+A+G++ E LR ASP Sbjct: 285 PEAAKVVILVTDG-----KSQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 336 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 74/216 (34%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + + ++ M ++ +E+ + + GL+ F + Sbjct: 88 DIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIEQ-RTGD------RLGLILFGD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + + G +T L + + + Sbjct: 141 TAFLQTPLTRDLKTVTKMLNEAQIGLVGRATAIGDALGLSVKRFASKD-----------D 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRS------------ 332 +I+V +TDG+N + + A++ G VY IG+ R Sbjct: 190 SNRIVVLLTDGQNTAGNLNPDDALLL--AREEGIKVYTIGVGSDNPRGFSLFNMGGSGGS 247 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ ++ + ++ + K Sbjct: 248 NLDERLLKNIADDTGGLYFRAKDVAGLKQIYAELDK 283 >gi|66347353|emb|CAI95066.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595691|gb|EAW75285.1| collagen, type XX, alpha 1, isoform CRA_a [Homo sapiens] Length = 1297 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 31/178 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 DM+ ++D S S+ + + +++ ++ PD VQ GL +S Sbjct: 184 PADMVFLVDGSWSIGH------SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDA 234 Query: 231 EEFFLLEWGVSHLQ------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + EW ++ L ++ L G +T + L + + Q Sbjct: 235 QT----EWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLR 284 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +TDG K K G V+A+G++ E LR ASP Sbjct: 285 PEAAKVVILVTDG-----KSQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 336 >gi|83312059|ref|YP_422323.1| hypothetical protein amb2960 [Magnetospirillum magneticum AMB-1] gi|82946900|dbj|BAE51764.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1171 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 78/223 (34%), Gaps = 32/223 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T + + + +T + + LD++++LD S SM + S + M +++ L Sbjct: 39 CTVGQRVKVSLTLTAPPSRHPPPPLDVVMLLDHSSSMGAAPGSP---LQMMLRAAGNFLR 95 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ + +V F+ L + + ++ +S G +T+ L A Sbjct: 96 QLSPDS------RVAVVGFNQVPSVHCTLAATPAQARSALQAISP-GGATSIAAALNQAV 148 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGA---IVY 323 + K++V +DG+ + A+ + V Sbjct: 149 ELL----------AHGRPGMDKVVVLCSDGQ-------DDIAEIADALARLKAIPSVRVL 191 Query: 324 AIGIRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 A+G H FL A ++ + M D F + K++ Sbjct: 192 AVGFGDEVIHATFLAMVADRQDYFHLTRARDMDDVFQRLAKEV 234 >gi|7959281|dbj|BAA96034.1| KIAA1510 protein [Homo sapiens] Length = 1140 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 31/178 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 DM+ ++D S S+ + + +++ ++ PD VQ GL +S Sbjct: 27 PADMVFLVDGSWSIGH------SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDA 77 Query: 231 EEFFLLEWGVSHLQ------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + EW ++ L ++ L G +T + L + + Q Sbjct: 78 QT----EWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHV------LGQNLQPAAGLR 127 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K+++ +TDG K K G V+A+G++ E LR ASP Sbjct: 128 PEAAKVVILVTDG-----KSQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 179 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 34/210 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ +K + L +MIV+D S SM +K+++A ++ +E ++ Sbjct: 388 LPVNMDIKGK-KEMPSLGLMIVMDRSGSMAG------SKLELAKEAAARSVELLREKDT- 439 Query: 216 NNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 G + F ++ I E LE + + I ++ G T L+ AY ++ + Sbjct: 440 -----LGFIAFDDRPWVIVETGPLEDKKDAVDK-IGSVTP-GGGTEIFTSLEKAYEELEN 492 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ R+H I+ +TDG++ + + + + K+ + + + Sbjct: 493 LKLQRKH-----------IILLTDGQSARSTDYESMI---ETGKENNITLSTVALGSDAD 538 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 L A FY V + + S Sbjct: 539 RNLLEELAGLGAGRFYDVTDSSVIPSILSR 568 >gi|301788514|ref|XP_002929673.1| PREDICTED: complement C2-like isoform 2 [Ailuropoda melanoleuca] Length = 749 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 84/238 (35%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K++ Q L++ ++LD S+S+ + +S Sbjct: 223 FSHLLGATNPTQKKTKQGVGRKIHIQRSGHLNLYLLLDASQSVAE------DDFQIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 M++ + V ++TF++K + L + I L+ + Sbjct: 277 AILMVDRIFSFEIN---VSVAIITFASKPQVVMSVLYDNSRDVTEVINSLNNINYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS------TKEDQQ 307 G TN+ L + + + + I+ +TDG++ ++ + Sbjct: 334 GTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNMGGSPKLAVDNIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ ++Y F H Sbjct: 394 EILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTEALYQVFEH 451 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 76/230 (33%), Gaps = 46/230 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + +D+ + +D+S SM + ++++A + + K Sbjct: 74 SGARAERVSSDGIDIALTVDISSSMLAEDFQPQNRMEVAKDKVKRFVMGRKSD------- 126 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLV FS + L + I L + T + A N++ + G Sbjct: 127 RVGLVAFSGEALTQVPLTTDYPVVLAAIDNLQVGQLEDGTAIGTAIATAANRLRNSPGRS 186 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------VIR 331 +++V +TDGEN D ++ A G +Y IG+ V Sbjct: 187 -----------RVMVLLTDGENNRGAIDPRTA--AQAAGTFGIRIYTIGVGTDGMAAVPV 233 Query: 332 SHEF--LRA----------------CASPNSFYLVENPHSMYDAFSHIGK 363 LR ++ ++ ++ ++ + I + Sbjct: 234 GRGLFGLRYENRPVKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQIDR 283 >gi|240255600|ref|NP_680085.3| von Willebrand factor A domain-containing protein 1 precursor [Mus musculus] gi|81901574|sp|Q8R2Z5|VWA1_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 1; AltName: Full=von Willebrand factor A domain-related protein; Flags: Precursor gi|20071274|gb|AAH26919.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|26346104|dbj|BAC36703.1| unnamed protein product [Mus musculus] gi|54611403|gb|AAH36166.1| Von Willebrand factor A domain containing 1 [Mus musculus] gi|123295149|emb|CAM18384.1| von Willebrand factor A domain containing 1 [Mus musculus] gi|148683083|gb|EDL15030.1| von Willebrand factor A domain containing 1 [Mus musculus] Length = 415 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 69/180 (38%), Gaps = 22/180 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S + ++ +D + D++ +LD S S+ + S + + + ++ + P Sbjct: 15 LALARSSIERGSTASDPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVATMSFGP 68 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +++ LV ++ F + S +Q I+ + TN+ L YA Q+F Sbjct: 69 GA---LRASLVHVGSQPHTEFTFDQYSSGQAIQDAIRVAPQRMGDTNTGLALAYAKEQLF 125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K++V++TDG + E K G ++ + Sbjct: 126 AEE------AGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTIFIVSTGRGN 174 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 47/215 (21%), Positives = 77/215 (35%), Gaps = 33/215 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+M+V+D S S+ + + I+ + L PD V+ GLVTF Sbjct: 46 LDVMLVVDESGSIGTSNYGKV------RSFISNFAGTMPLSPDD---VRVGLVTFGTSAV 96 Query: 232 EFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L L K L ST + GL A +F N Sbjct: 97 TRWDLSDSRAQNADLLAAAAKKLPYAAGSTYTHLGLAKAEEILFS------FQKGGRDNA 150 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA---CASPNS 344 K+I+ MTDG ++ Q+L + + RG I+ +G+ + R+ C + ++ Sbjct: 151 PKMILVMTDG---ASSRRSQTLSAAEKLRNRGVIIVVLGVGTGVNSAECRSIAGCDTSDT 207 Query: 345 F----YLVEN----PHSMYDAFSHIGKDIVTKRIW 371 YL N + KD+ + Sbjct: 208 VECPRYLQSNWGGVSSQINGIIKAACKDLAKDAVC 242 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 50/268 (18%), Positives = 100/268 (37%), Gaps = 40/268 (14%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 +I S +++I + +I S+ + P+ + I P+ ++ Sbjct: 196 NIGSSRTVEIKTYSEFS--AIQQSSQDDFAVLIHLKAPYANPEQVIGRPVNATSVGYPTA 253 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS+ Sbjct: 254 RAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSVIAFSS 301 Query: 229 KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L+ G + + L G TN LK A I R H N Sbjct: 302 TARRLFHLQRMSHSGRQQALQTVNSLVASG-GTNIADALKKAAKVI----EDRSHQNPVC 356 Query: 285 ANYKKIIVFMTDGENLST-KEDQQSLYYCNEA--------KKRG-AIVYAIGIRVIRSHE 334 + I+ ++DG++ + + + + G V+ G V + Sbjct: 357 S-----IILLSDGQDTYNIPSNIRGARPEYSSLVPSSILNRTFGLVPVHGFGFGVDHDSD 411 Query: 335 FLRAC--ASPNSFYLVENPHSMYDAFSH 360 L + AS +F +E+ + DAF+ Sbjct: 412 ALHSIAEASGGTFSFIEDEGVIQDAFAQ 439 >gi|114683021|ref|XP_001148451.1| PREDICTED: collagen alpha-1(XX) chain [Pan troglodytes] Length = 1284 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 73/219 (33%), Gaps = 39/219 (17%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--------DMMIVLDVSRSMESFFD 189 P P + + T S + + DM+ ++D S S+ Sbjct: 136 PTPSHMGSPDPEQASEPQVAFTPSQDPPTPAGPQFRCLPPVPADMVFLVDGSWSIGH--- 192 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----- 244 + + +++ ++ PD VQ GL +S + EW ++ L Sbjct: 193 ---SHFQQVKDFLASVIAPFEIGPDK---VQVGLTQYSGDAQT----EWDLNSLSTKEQV 242 Query: 245 -RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ L G +T + L + + Q K+++ +TDG K Sbjct: 243 LAAVRRLRYKGGNTFTGLALTHV------LGQNLQPVAGLRPEAAKVVILVTDG-----K 291 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K G V+A+G++ E LR ASP Sbjct: 292 SQDDVHTAARVLKDLGVNVFAVGVKNADEAE-LRLLASP 329 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 76/202 (37%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 EFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +L+ + + I ++S G T+ L+ A + H + D + Sbjct: 344 KDYLISVTPDSIRDGKVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSDIGDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 ++VF+TDG+ + +L N ++ ++ IGI L C Sbjct: 400 -LVVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E+ + + I Sbjct: 457 LTRRVHEEEDAGSQLIGFYDEI 478 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/179 (20%), Positives = 68/179 (37%), Gaps = 20/179 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P K N ++ MI LD+S SM + D S ++ A +I+ +L+ + Sbjct: 72 LARPQLQERKENPGEAKGIEAMIALDISNSMLAE-DLSPNRLQFAKLTIHRLLDYLAES- 129 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + G+V F+ + ++ ++ + + +S N + A + Sbjct: 130 ------KVGVVVFAGNAYMQLPITTDLAMAKKMVDDANPDMLS-NQGTAIASAIDLSLGS 182 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R + K I+ TDGEN + +L +AK +G VY I + Sbjct: 183 FSDRH-------DVGKAIILFTDGENH----EGDALEAAKKAKSQGVKVYTIAVGSEEG 230 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 74/215 (34%), Gaps = 44/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + + ++ M ++ +E+ + GL+ F + Sbjct: 88 DIMLAVDLSGSMTEQDMAYNGQYVDRLTMVKAVLSDFIEQ-------RQGDRLGLILFGD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L V + + + G +T L + + + Sbjct: 141 TAFLQTPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRFASKKESN--------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHE---------- 334 +I+V +TDG+N + + + A++ G VY IG+ R Sbjct: 192 --RIVVLLTDGQNTAGNLNPEDALLL--AREEGIKVYTIGVGSDNPRGFSLFNMGSGGSN 247 Query: 335 ----FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ ++ + ++ + K Sbjct: 248 LDEGLLKKIAEQTGGLYFRAKDVAGLQQIYAELDK 282 >gi|260827154|ref|XP_002608530.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] gi|229293881|gb|EEN64540.1| hypothetical protein BRAFLDRAFT_92388 [Branchiostoma floridae] Length = 306 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 75/225 (33%), Gaps = 23/225 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 ++ + + D++++LD+S S+ S T + +L + + Sbjct: 90 ILAARGTARTAVNKR---DLVLLLDMSGSIGS------TDFTSLKTYVAELLSYICPENE 140 Query: 215 VNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + + +VTFS+ + F + +Q I L G ST + + + Q+F Sbjct: 141 MGTFHRVAVVTFSSSVVLNFNFHEATSLGQIQASIHSLPYEGGSTRTADAINFVRTQVFQ 200 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---- 328 R ++ + ++ +TDG Q + EA A ++A+G Sbjct: 201 TGNYR-----DEPDVDLEVLLITDGHPNGAGNSPQDVELAAEALGERANIFALGYGSAYS 255 Query: 329 VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ + + +E+ + Sbjct: 256 SASDFDHLKNLVTAPWYKHIFNLEHVSDFETMLDTLANRPSHMVC 300 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 78/225 (34%), Gaps = 38/225 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R ++ +G + I+ A+ L + L +G+ ++ + + +K+ L +D + + + Sbjct: 3 LRRLMHDRRGTVAIIFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGS---- 58 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 ++ + ++ +F++ D + S + Q Sbjct: 59 -------STGTAAELQQIGQKFFDANFKDSGLDAAGSFSVSVTGDVVSANGSAQVQTT-- 109 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + I + ++ V + + L++ +VLD + SM + Sbjct: 110 -----------------LMQLVGIDTIAVSESAQVIRSIK---GLELALVLDNTGSMTTS 149 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + I + +++ + + ++ +V +S + Sbjct: 150 DN-----IGALRDAAQELVDILFGGRADHPTLRVAVVPYSASVNP 189 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 50/183 (27%), Gaps = 46/183 (25%) Query: 234 FLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + + I+ L T G+ + + + + K ++ Sbjct: 276 TPLTGVKATVDSAIQALRAWSRGGTMGDIGMAWGLRVLSPEPPFTEGLAWNTPKWAKAVI 335 Query: 293 FMTDGENLSTK-----------------------------------------EDQQSLYY 311 MTDG+N K + + Sbjct: 336 LMTDGDNQFYKLTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLTQV 395 Query: 312 CNEAKKRGAIVYAIGIR---VIRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDIVT 367 C K +G VY I + + +ACAS ++ + + +F I ++ Sbjct: 396 CQAMKDKGITVYTITFTSGINQATKDIYKACASSTAKWFDSPSQADLRASFRAIATELSQ 455 Query: 368 KRI 370 R+ Sbjct: 456 LRV 458 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 82/261 (31%), Gaps = 45/261 (17%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS----------Q 167 + ++ ++ Y L F + I S V V + Sbjct: 14 TSASANTQKITLETLNNYSEFLNFGVKKIKLPKNSLENSIIKSKVDVLGDIFKKNIETFK 73 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +L+++ ++D S S+ T +K + +L +V + N + + TFS Sbjct: 74 NTQKLELIFLIDGSSSVGE------TNFRSELKFVKKLLSDVTVDY---NHTRVAIATFS 124 Query: 228 NKIEEFF-LLEWGVSH------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + + + + L + + + G TN+ + A + + Sbjct: 125 SSVSKNIDQISDPRKENNKCFLLSKLLSKIEYTGGGTNTLKAFEVAKEIFTQSRNDSE-- 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 K++ +TDG + + E KK ++ IGI E + Sbjct: 183 --------KVLFLITDGFSNG----GDPIPLAAELKKDQVKIFTIGIANGNYKELYELAS 230 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 +P Y + D+F Sbjct: 231 TPGEIY-----SYLLDSFEEF 246 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 69/168 (41%), Gaps = 27/168 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +D++ LDVS+SM++ D +++ AI I + LE+VK + G++ F+ Sbjct: 85 QSSGMDIIFALDVSKSMDA-TDMMPSRLLRAILQIGSFLEQVKTD-------RIGIIAFA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L ++ + L+ V T+ L+ A N + Sbjct: 137 GTATLQCPLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLAENAF------------PE 184 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + K +V ++DGE+L +++ K +G VY +G+ Sbjct: 185 GSKSKTLVLISDGEDLQHSALREARIL----KTKGIRVYTMGVGSPEG 228 >gi|41409532|ref|NP_962368.1| hypothetical protein MAP3434 [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570937|sp|Q73UD4|Y3434_MYCPA RecName: Full=UPF0353 protein MAP_3434 gi|41398363|gb|AAS05984.1| hypothetical protein MAP_3434 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 330 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 67/219 (30%), Gaps = 37/219 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+D+S+SM + D ++ A ++ + ++ + GLV F+ Sbjct: 95 IMLVIDMSQSMRA-TDVEPNRLKAAEQAASQFASQLTPGIN------LGLVGFAGTPYLL 147 Query: 234 FLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 Q I L K F ST + + A + I D I Sbjct: 148 VPP---TPQHQATIDALKKLDFADSTATGEAIFTALHAISATA-----VAGGDTPPPARI 199 Query: 292 VFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRSH-------------- 333 V ++DG AK G + I Sbjct: 200 VLLSDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVST 259 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ A S Y N + +++ I +I + + Sbjct: 260 DQMKMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 298 >gi|308497300|ref|XP_003110837.1| CRE-CUT-6 protein [Caenorhabditis remanei] gi|308242717|gb|EFO86669.1| CRE-CUT-6 protein [Caenorhabditis remanei] Length = 576 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/235 (15%), Positives = 82/235 (34%), Gaps = 29/235 (12%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMP---ITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I RY I + TF+ + + + S + T ++++++LD S S+ Sbjct: 4 IPRYDIIISIITFLTLIQTNSANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGD-- 61 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 D K ++ + + L+ ++ + F L + L+ Sbjct: 62 -------DTFKKQLSFAMHLASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWA 114 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I+ + +TN+ L+ ++ Q K+ + +TDG+ Sbjct: 115 IQRIEYQSGATNTGQALR------LTLEKGLQGARPG---IPKVAIVITDGQ-----SQD 160 Query: 307 QSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + +VYAIG+ ++ H+ + +P + VE + A + Sbjct: 161 DVSEPSQLLRDADVMVYAIGVTNLVNVHQLHQMTGNPVRVFTVETFEQLDRALAD 215 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 46/191 (24%), Positives = 75/191 (39%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIE 231 + +VLD S SM+S D + +++ A E+ LI + G+VTF + +I+ Sbjct: 310 VCLVLDKSGSMDS--DDRLLRMNQAA--------ELYLIQIIEKESLVGMVTFDSSAEIQ 359 Query: 232 EFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + Q I L +F G T+ GLK + I Sbjct: 360 NNLTKITDDNAYQNIIAKLPQFAGGGTSICNGLKAGFQAIVYSNQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C EAK GAI+++I + + E L F Sbjct: 411 IILLTDGEDNQMSS-------CFEEAKASGAIIHSIALGPSAAKELETLSNMTGGLRFSA 463 Query: 348 VENPHSMYDAF 358 ++ + + DAF Sbjct: 464 NKDINGLIDAF 474 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 55/274 (20%), Positives = 103/274 (37%), Gaps = 41/274 (14%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V D ++ + + +I P+ + S+ + P + S Sbjct: 207 VFDNLNVRSTKTAEINTYPEFSTIP-QSSSKDDFAILIHLKAPSANPDQGTGKLANESSA 265 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S+ A +D++ VLDVS SM TK+ + +++ +++ + + + Sbjct: 266 GSSRNRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVVQHLGPSD------RLSV 313 Query: 224 VTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS+ F L+ G + I L G TN LK A I R + Sbjct: 314 IAFSSSARRLFHLQRISHHGRQQALQAINSLGASG-GTNIADALKKAMKVI----EDRSY 368 Query: 280 CNTEDANYKKIIVFMTDGENLST--------KEDQQSL---YYCNEAKKRGAIVYAIGIR 328 N+ + I+ ++DG++ D +SL N+A+ ++A G Sbjct: 369 KNSVCS-----IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAFGFG 422 Query: 329 VIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 + L A AS +F +E+ M DAF+ Sbjct: 423 ADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQ 456 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 71/218 (32%), Gaps = 46/218 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D ++++ + + LE + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWNDEDVSRLSLVQHLLGDFLESRDGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAYLQAPLTFDRRTVRVWLDEARIGIAGKNTAIGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 ++++ +TDG N + D + A G +Y IGI Sbjct: 193 QSRVLILVTDGANNGGEIDPLTA--AKLAASEGVKIYPIGIGADPEESGATALLGGNPTL 250 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+A A + ++ + + A + Sbjct: 251 DLDEPALKAIAEVTGGRYFRARDGKQL-QAIKDTLDQL 287 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 75/194 (38%), Gaps = 28/194 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 + +++++D S SM ++ ++K MA ++ E ++ + G++ F Sbjct: 388 ERSDTTLLLIIDQSASMGP--ETGLSKFTMAKEAAIMATESLRAED------RIGVLAFD 439 Query: 227 --SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + +F + G+S +QR+I L G T+ L+ ++ G +H Sbjct: 440 VSTRWVVDFQPVGTGLSLADIQRRISTLP-LGGGTDIYNALQTGLPELARQPGRVRHA-- 496 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 V +TDG + T + Q EA+ R + I I + L+ A Sbjct: 497 ---------VLLTDGRSF-TDDRQAYQALIEEARSRNITLSTIAIGTDADIDLLQTLARW 546 Query: 341 SPNSFYLVENPHSM 354 +Y P + Sbjct: 547 GAGRYYFAAEPGDI 560 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 66/211 (31%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVSRSMES D + ++ A ++ + + GL+ ++ Sbjct: 100 VMLVIDVSRSMES-TDVAPNRLGAAKEAGKEFARNLTPGIN------LGLIAYAGTATVL 152 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L + T + G+ A I + + D IV Sbjct: 153 VSPTTNRDATVNALDNL-QLADRTATGEGIFTALQAIATVGAV---IGGGDKPPPARIVL 208 Query: 294 MTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFL 336 M+DG+ + AK + + I + + + Sbjct: 209 MSDGKETVPSNPDNPKGAYTAARTAKDQQVPISTIAFGTKDGYVEINGQRQNVPYAPDMM 268 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A S Y + + ++++ + I Sbjct: 269 EKVAKLSGGETYTASTLGQLKEVYANLQQQI 299 >gi|301788516|ref|XP_002929674.1| PREDICTED: complement C2-like isoform 3 [Ailuropoda melanoleuca] Length = 617 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 83/250 (33%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 T + + K++ Q L++ ++LD S+S+ Sbjct: 83 DVAPALGTSFSHLLGATNPTQKKTKGVGR----KIHIQRSGHLNLYLLLDASQSVAE--- 135 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S M++ + V ++TF++K + L + I Sbjct: 136 ---DDFQIFKESAILMVDRIFSFEIN---VSVAIITFASKPQVVMSVLYDNSRDVTEVIN 189 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L+ + G TN+ L + + + + I+ +TDG++ Sbjct: 190 SLNNINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNM 249 Query: 302 ------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + N+ + +YAIG+ + +E +++++ Sbjct: 250 GGSPKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQD 309 Query: 351 PHSMYDAFSH 360 ++Y F H Sbjct: 310 TEALYQVFEH 319 >gi|301788512|ref|XP_002929672.1| PREDICTED: complement C2-like isoform 1 [Ailuropoda melanoleuca] gi|281345620|gb|EFB21204.1| hypothetical protein PANDA_019912 [Ailuropoda melanoleuca] Length = 748 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 83/250 (33%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 T + + K++ Q L++ ++LD S+S+ Sbjct: 214 DVAPALGTSFSHLLGATNPTQKKTKGVGR----KIHIQRSGHLNLYLLLDASQSVAE--- 266 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S M++ + V ++TF++K + L + I Sbjct: 267 ---DDFQIFKESAILMVDRIFSFEIN---VSVAIITFASKPQVVMSVLYDNSRDVTEVIN 320 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L+ + G TN+ L + + + + I+ +TDG++ Sbjct: 321 SLNNINYKDHENGTGTNTYAALNSVHIMMNNQMQRLGMKTAAWQEIRHAIILLTDGKSNM 380 Query: 302 ------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + N+ + +YAIG+ + +E +++++ Sbjct: 381 GGSPKLAVDNIREILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQD 440 Query: 351 PHSMYDAFSH 360 ++Y F H Sbjct: 441 TEALYQVFEH 450 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 79/235 (33%), Gaps = 40/235 (17%) Query: 154 IVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I++ + +D+M+ +DVS SM + D + +++ A + + + Sbjct: 69 IILARPQTYTSWGNKQVEGIDIMLAMDVSTSMLAE-DLTPNRMEAAKDVAAEFIAD-RPN 126 Query: 213 PDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ + +G + L + ++ I T GL A +++ Sbjct: 127 DNIGLTIFAGEAFTQCPMTTDHTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVSRLK 186 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D + K+++ +TDG N D + N AK G VY IG+ + Sbjct: 187 DSKAKS-----------KVVILLTDGSNNM--GDLSPMTSANIAKSLGIRVYTIGVGTNK 233 Query: 332 SHE---------------------FLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A ++ +FY N + + I K Sbjct: 234 VARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGNFYRATNNQELKQIYKDIDK 288 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 55/274 (20%), Positives = 103/274 (37%), Gaps = 41/274 (14%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V D ++ + + +I P+ + S+ + P + S Sbjct: 207 VFDNLNVRSTKTAEINTYPEFSTIP-QSSSKDDFAILIHLKAPSANPDQGTGKLANESSA 265 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S+ A +D++ VLDVS SM TK+ + +++ +++ + + + Sbjct: 266 GSSRNRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVVQHLGPSD------RLSV 313 Query: 224 VTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS+ F L+ G + I L G TN LK A I R + Sbjct: 314 IAFSSSARRLFHLQRISHHGRQQALQAINSLGASG-GTNIADALKKAMKVI----EDRSY 368 Query: 280 CNTEDANYKKIIVFMTDGENLST--------KEDQQSL---YYCNEAKKRGAIVYAIGIR 328 N+ + I+ ++DG++ D +SL N+A+ ++A G Sbjct: 369 KNSVCS-----IILLSDGQDTYNISSSVQGASPDYKSLVPSSIINDARHT-VPLHAFGFG 422 Query: 329 VIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 + L A AS +F +E+ M DAF+ Sbjct: 423 ADHDSDSLHSIAQASGGTFSFIEDEGVMQDAFAQ 456 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 85/225 (37%), Gaps = 22/225 (9%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQTDARLD 173 + + + Y S S + P T + + H ++ +V+ + Sbjct: 210 VRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNDHKMLLRIGLKGYEVDKSQLGASN 269 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LDVS SM S K+ + S+ + +++ V+ VV +G + + Sbjct: 270 LVFLLDVSGSMNS-----RDKLPLLKTSLKMLSQQLSEQDHVSIVVYAGA----SGVVLD 320 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + + L G STN G++ AY ++H N ++ Sbjct: 321 GVKGNDIYAINQALNNLKA-GGSTNGGAGIQQAY------GLAQKHFIQGGVNR---VIL 370 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 TDG+ N+ T + Q + + +G + +G ++ L Sbjct: 371 ATDGDFNVGTTDHQALMDLIASKRDQGIALTTLGFGQGNYNDHLM 415 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 75 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFITKREGDRMGLILFADHAYL 131 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 132 QTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 180 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 181 MILLSDGSNTA--GVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLD 238 Query: 336 ---LRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ N + + I + Sbjct: 239 EKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 271 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 71/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM F I ++ ++ +E K + GLV F++ Sbjct: 94 DLMLVVDLSYSMSQEDMQFNGEYIDRLSAVKHVLSDFIERRKGD-------RVGLVLFAD 146 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L L +++ L G T G+ A D Sbjct: 147 HAYLQTPLTLDRDTLSQQLNQAVLRLIGNQTAIGDGIGLATKTFVDSDA----------- 195 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++V ++DG N + L + AKK A +Y +G+ Sbjct: 196 PQRVMVLLSDGSNTA--GVLDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTA 253 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ + + + I + Sbjct: 254 QDLDERTLMEIAKRTGGQYFRARDSKELATIYDTINQ 290 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 36/201 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + LD++ ++D S S+ + ++ + ML + + P + LVT Sbjct: 70 RERGGCLDLVFLVDESSSVG------ASNFKSELRFVRKMLSDFPVAP---EATRVALVT 120 Query: 226 FSNK---------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 FS+K + + S ++I ++ G T + + A + Sbjct: 121 FSSKSHVVTRADYVSAPKAHQHKCSLFSKEIPSITYRGGGTYTRGAFQRAAQILRQS--- 177 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 N K+I +TDG + ++RG ++ +GI E L Sbjct: 178 -------RENATKVIFLITDGYSNG----GDPRPVAAALRERGVEIFTLGIWQGNIRE-L 225 Query: 337 RACAS---PNSFYLVENPHSM 354 AS + V N Sbjct: 226 HEMASQPKDQHCFFVHNFAEF 246 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 77/226 (34%), Gaps = 22/226 (9%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 ++ + + + D+ +VLD S S+ + + + Sbjct: 51 HQRQTQQMRQNVPQGQSGDDC-QGIFDLYLVLDKSGSVADNWIHIYSFAEGL-------- 101 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 VK + N ++ ++T+S + E L + + + L GL + Sbjct: 102 --VKKFTNPN--LRISIITYSTEAEVILPLTSDSKEINKSLLVLKNIVPQ-----GLTHM 152 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + +I+ +TDG L K ++ +A++ GAIVY +G Sbjct: 153 QKGLRKANEQIRKSTLGGRIVNSVIIALTDGL-LLLKPYLDTMEEAKKARRMGAIVYTVG 211 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + + + P+ + V+ + A + + +K Sbjct: 212 VFMYSKQQLVNIAGDPDRCFGVD---EGFSALEGVVDPLTSKSCTE 254 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/266 (15%), Positives = 99/266 (37%), Gaps = 33/266 (12%) Query: 83 LCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKI 137 + R+ +F ++ + N + +L + + Y + + + Sbjct: 134 VKRVAEEPVSTFSIDVDTAAYANVRRFLNEGAAPPHDALRVEELINYFDYGYARPTAQEP 193 Query: 138 PLKFCT---FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 P K PW + + + + + + L+++ ++D S SM S + Sbjct: 194 PFKPTVTVVPSPWSQDRQLMHIGVQGYATPRAG-QPPLNLVFLIDTSGSM-----SGPDR 247 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSK 252 + +A K++N ++++++ + +V ++ G S ++ + L Sbjct: 248 LPLAKKALNVLIDQLRPQD------RVSMVAYAGSAGAVLSPTDGKSKLKMRCALTALRS 301 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY 311 G ST GL+ AY RQ+ + + N ++ MTDG+ N+ + + + Sbjct: 302 -GGSTAGGQGLELAY------ALARQNLDPKAVNR---VILMTDGDFNVGIADPTRLKDF 351 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +K G + G ++ + Sbjct: 352 VADQRKSGVYLSVYGFGRGNYNDTMM 377 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++V+D S SM+ D + + A IN M + + ++ F + Sbjct: 114 DVVLVIDNSGSMKD-TDPNQDRYTAAKNLINRM----------DRDNRVSVIVFDHATTL 162 Query: 233 FFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + +I L+ T+ + L+ + I + Q + Sbjct: 163 LQPFTRVKNQEIKDEIMAEIDGLATTDGGTDISLALEDTMSHIQESQDAGRSAM------ 216 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR--ACASPN 343 ++ ++DG S + + L + K++ V IG+ L+ A + Sbjct: 217 ---VIMLSDG--FSETDHDRVLA---DYKQQQIAVNTIGLSLVYKDGANLLQTIAAETGG 268 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +Y V+N + F I D+ + + + Sbjct: 269 QYYDVQNAADLSFVFQKIYDDVGDRSLLTKR 299 >gi|315122852|ref|YP_004063341.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496254|gb|ADR52853.1| hypothetical protein CKC_05540 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 494 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/138 (23%), Positives = 60/138 (43%), Gaps = 11/138 (7%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE-- 68 F++ KG +++AI +P + L+LG+++ S+ K + S + +L H + I + Sbjct: 30 LFFSKKGNFAMISAIMIPSLALLLGIVLVTSNYLLHKYSVESASEEALSHGMSLICYQND 89 Query: 69 -GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 N K + I+ KN + + + N ++ I ST++ + Y Sbjct: 90 IERDNLAKIILNDLIVSLKKNNFTKQEADLVAKNSKIDITTLINDSTNV--------KSY 141 Query: 128 SISAISRYKIPLKFCTFI 145 S YK+PL T I Sbjct: 142 HFYIKSVYKMPLNKITKI 159 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 71/216 (32%), Gaps = 38/216 (17%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+++ +D+S SM + + A K++ ++ + GL+ F + Sbjct: 108 DLLLAVDISGSMQIKDFEMNGQQVSRLTATKAVARQFISRRVGD------RVGLILFGSN 161 Query: 230 IEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + + + G +T + A ++ ++ + Sbjct: 162 AYVQTPLTFDRKTVITLLDEAAVGLAGKATAIGDAIGLAVKRLEQSNRDKRIASK----- 216 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------------- 331 +++++ +TDG N + + A + G +Y IGI Sbjct: 217 EQVLILLTDGVNTA--GQLSAPQAAELAAEHGLTIYTIGIGADAMTVQSFFGTQRVNPSA 274 Query: 332 --SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ + + ++ I K Sbjct: 275 DLDEKMLTDIATKTGGRYFRAHDTQELQKIYAMIDK 310 >gi|73972314|ref|XP_860410.1| PREDICTED: similar to complement component 2 precursor isoform 5 [Canis familiaris] Length = 748 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/250 (14%), Positives = 83/250 (33%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 L T + + K+ Q L++ ++LD S+S++ Sbjct: 214 DVAPALGASLSHLLGATNPTQKKRENLGR----KIQIQRSGHLNLYLLLDASQSVKE--- 266 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S M++ + V ++TF++K + L I Sbjct: 267 ---EDFHVFKESAILMVDRIFSFEIN---VSVAIITFASKPKIIMSVLNDNSRDATEVIN 320 Query: 249 YLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L+K G TN+ L + + + + I+ +TDG++ Sbjct: 321 SLNKVNYKDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAIILLTDGKSNM 380 Query: 302 ------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + N+ + +YAIG+ + +E +++++ Sbjct: 381 GGSPKLAVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQD 440 Query: 351 PHSMYDAFSH 360 ++Y F H Sbjct: 441 TEALYQVFEH 450 >gi|73972312|ref|XP_860371.1| PREDICTED: similar to complement component 2 precursor isoform 4 [Canis familiaris] Length = 682 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/250 (14%), Positives = 83/250 (33%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 L T + + K+ Q L++ ++LD S+S++ Sbjct: 148 DVAPALGASLSHLLGATNPTQKKRENLGR----KIQIQRSGHLNLYLLLDASQSVKE--- 200 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S M++ + V ++TF++K + L I Sbjct: 201 ---EDFHVFKESAILMVDRIFSFEIN---VSVAIITFASKPKIIMSVLNDNSRDATEVIN 254 Query: 249 YLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L+K G TN+ L + + + + I+ +TDG++ Sbjct: 255 SLNKVNYKDHENGTGTNTYAALNSVHIMMNNQMDRLGMKTAAWQEIRHAIILLTDGKSNM 314 Query: 302 ------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + N+ + +YAIG+ + +E +++++ Sbjct: 315 GGSPKLAVDNIKEILNINQQRSDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQD 374 Query: 351 PHSMYDAFSH 360 ++Y F H Sbjct: 375 TEALYQVFEH 384 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 79/217 (36%), Gaps = 32/217 (14%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 +L A+FL + ++G I++ + + + D +++ + N + Sbjct: 1 MLWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNG--------IVDD 52 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 + +N +F+ + + ++ +V+ Sbjct: 53 GVRAEATKFFNFNFQQGMFGSKPFTPSISSATTSKTTVVINAATT--------------- 97 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMA 198 T + S + + ++ + SQ D++ VLD + SM S S TKI+ Sbjct: 98 VPTSLMRIFGSDELPVSVSCNA---SQDFVNTDIVFVLDTTGSMRDKATSSDSQTKIEAL 154 Query: 199 IKSINAMLEEVKLIPD----VNNVVQSGLVTFSNKIE 231 ++ A+ +++ + + ++ G+V +++ + Sbjct: 155 RSAVLALYDQLAPVQNQLAASGMRLRYGVVPYASAVN 191 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 59/180 (32%), Gaps = 34/180 (18%) Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTE 283 F+ + W L++ + L+ G + + + A + + G Sbjct: 351 QFACPSPAASMQGWSRDDLKKYLNTLTPDGGTYHDNGMMWGARWASSGGIFGGNNPEKYN 410 Query: 284 DANYKKIIVFMTDG-------------------------------ENLSTKEDQQSLYYC 312 KK I+FMTDG ++ + Q+ C Sbjct: 411 MMPVKKYIIFMTDGLFETGYSRLYSSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLLC 470 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++AK G ++ +G L CAS P+ N ++ F IGK+I R+ Sbjct: 471 SKAKSMGYSIWVLGF-ATTLDASLTNCASTPSQASTSSNQAALMARFVEIGKNIGALRLT 529 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 55/422 (13%), Positives = 115/422 (27%), Gaps = 83/422 (19%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +GG+T+L A LP++ L+ I ++ + +KT L D + E Sbjct: 82 RRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQTVEAA 141 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 K + N F + + + + Sbjct: 142 KAAARLTAAMNEVAGEPYQLNTDDSANEFEFG---------VSAQTDGNTGRFYFTKVPT 192 Query: 135 YKIPLKFCTF----IPWYTNSRHIVMPIT------------SSVKVNSQTDARLDMMIVL 178 + I ++ P+ S V + D+ +VL Sbjct: 193 SDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISLVL 252 Query: 179 DVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 D S SM+ D+ D I + +A + +++ V+++ +E+ Sbjct: 253 DRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYDEY 312 Query: 234 FLLEWGVSHL------------------------------QRKIKYLSKFGVST------ 257 L + + ++ S T Sbjct: 313 DLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLDCWLL 372 Query: 258 NSTPGLKYAYNQI--FDMQGMRQHCNTEDANYK---------KIIVFMTDGE-NLSTKED 305 + ++ A Q+ G+ N+ + K +V MTDG N T+ + Sbjct: 373 DDFDSVRAAVAQLGPNGSTGIGNGMNSGKTAFTHENARPYASKTMVVMTDGNHNYGTQPN 432 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGK 363 + + + E ++ A Y ++ + AF I Sbjct: 433 TVAQQL---MSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFEEIAN 489 Query: 364 DI 365 ++ Sbjct: 490 NL 491 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 78/219 (35%), Gaps = 30/219 (13%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + ++ + + S S L++ +V+D S SM KI A K+ +++ Sbjct: 21 KQNFYLYLELQGSEAPASADRVPLNLSLVIDRSGSMAG------DKIAYAKKAAQFIVDN 74 Query: 209 VKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + +V + + +E L+++I + + TN + G+ Sbjct: 75 LSPED------RVSIVQYDDIVEVLSPSAPVLNKQELRQRIALMEARNM-TNLSGGMLAG 127 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYA 324 Y Q+ + R ++ ++DG D L + + G V Sbjct: 128 YEQVERTKQARFVNR---------VLLLSDGLANHGITDPTVLQQMVQEKFRNAGIAVST 178 Query: 325 IGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSH 360 G+ +E L S ++Y +E+P + F+ Sbjct: 179 FGVGA-DFNELLMTSLSEYGGANYYFIESPDKIPGIFAE 216 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 49/362 (13%), Positives = 110/362 (30%), Gaps = 66/362 (18%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F ++ +G + +L AI + +G I+ S ++T L ID +++ Sbjct: 8 LRKFGHDQRGNIAVLFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLAGVAA--- 64 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + ++ + F + I + +++ + + Sbjct: 65 ----------GKAALDSGATQAAAIAMAQAASSSYFTGNTAKIDATPTINFTTMGRTLSG 114 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 + SA S P M + +S ++ L++ +++D+S SM Sbjct: 115 TGSATSVMNTSFMRLVGFP--------TMTLNASSASSATMQPYLNVYLLVDISSSMLLP 166 Query: 185 ------------------------------ESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + ++ + + + + + +L + Sbjct: 167 ATQAGITQMRNGTGCALACHETTNGTDSYSYALKNNVLLRYQVVNQGVQNLLTYLNSSAV 226 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N V+ GL +F N++ + L S + + Sbjct: 227 YKNYVKVGLWSFDNQLTQLSSLTSSFSSVAANFPAPGLAYNDAAAATPFDSLIGSFVSSV 286 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ-----------SLYYCNEAKKRGAIVY 323 G + A +K+++ TDG N T+ + +CN K G V Sbjct: 287 GT-AGDGSTSATPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFNTAFCNTFKSNGVTVA 345 Query: 324 AI 325 I Sbjct: 346 II 347 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKLNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|260799304|ref|XP_002594637.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] gi|229279872|gb|EEN50648.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] Length = 181 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 70/176 (39%), Gaps = 28/176 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D S S+ + D + + ++ + ++ PD V G++ +S+ + Sbjct: 1 FDLVFLVDGSTSVGPN------EFDKSKAFLRNVINQFQIGPDATKV---GVIQYSSTVR 51 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F L + R I + G T + + + M+QH N K Sbjct: 52 EEFSLNRYLTKQDVMRAIDRIPFLGGFTRTGEAITF----------MKQHSQF-RGNVPK 100 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 I + +TDG + Y A G I+YAI + + +E L+ AS + Sbjct: 101 IAIVITDG-----RAQDDVSYPSQMAHGAGVIMYAISVGSVNMNE-LKLIASAERY 150 >gi|198426950|ref|XP_002125765.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 918 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 81/245 (33%), Gaps = 33/245 (13%) Query: 128 SISAISRYKIPLKFCTFIPW-YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S + L F N +P + A +D++IV+D S S+ Sbjct: 673 SYQCDASQGYTLFPRNFTHNTCLNDTTWSLPRPCCARSCP-PYAVMDLVIVMDSSSSIRP 731 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVS 241 +TK + + + NV Q + ++ +++ + Sbjct: 732 NNWRKMTKF---------TRDLISMFEISPNVTQVSVFRYNKNVDKNTQILLKDHPNSKE 782 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L + + G T + LK+ ++ I +H + ++V +TDG Sbjct: 783 ELLQAFDRIPYNGSGTQTGKALKHVHDVILAEGNGNRHG------VRDVVVTITDG---- 832 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----EFLRACASPNSFYLVENP-HSMY 355 K + + + GA+ YAIGIR E S + +L+ + Sbjct: 833 -KAADNVTQHSLDIRGTGALTYAIGIRTPGRRGPDRAELGVIAGSEKNLFLLTSGFEGFN 891 Query: 356 DAFSH 360 + F+ Sbjct: 892 EVFAR 896 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 71/207 (34%), Gaps = 45/207 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + LE + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMNWQGEDVSRLSLVKHLLGDFLE-------GREGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 ++++ +TDG N + + D + A + G +Y IGI Sbjct: 193 QSRVLILVTDGANNAGQIDPLTA--ARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSL 250 Query: 335 -----FLRAC--ASPNSFYLVENPHSM 354 L+A A+ ++ + + Sbjct: 251 DLDEPALKAIAEATGGQYFRARDGEEL 277 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 72/195 (36%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQATGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGISS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ S+ DAFS I Sbjct: 463 NKHVSSLIDAFSRIS 477 >gi|260800509|ref|XP_002595172.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] gi|229280415|gb|EEN51183.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] Length = 299 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 68/187 (36%), Gaps = 23/187 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++D++ VLD + S+ T + + M+ + +L P + G+V +SN+ Sbjct: 17 QVDLVFVLDGTGSVG------ATNFERMKTFVQKMISDFELGP---EATRIGVVVYSNRA 67 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ LQ + ++ G T + + Y N + Sbjct: 68 SLEISLDAFEDQESLQDAVAGIAYPGGYTLTGAAIDY------TTTFAFSTRNGAREGVR 121 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYL 347 K+ V +TDG + +K I YA+GI + P++ ++ Sbjct: 122 KVAVILTDGVSYDDPA-----EPAQSMRKAAIITYAVGIGSNLDRDQLDVIAGVPDNLFV 176 Query: 348 VENPHSM 354 +++ + Sbjct: 177 LDDFSML 183 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 73/211 (34%), Gaps = 22/211 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + M + +DM+IVLD S SM K+ A ++ ++E ++ Sbjct: 75 LTMTADEVLAPEQTKTKPVDMVIVLDRSGSMGG------QKVRDAKAAVKGLVEGLRSQD 128 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + LVT+SN + L + + + + ++ +S P Sbjct: 129 ------RFSLVTYSNSVNGGDGLHYLTADKRNSLNWMV------DSIPAGGGTNLGGGLE 176 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIR 331 +G+ + ++ ++DG+ D L + G + V +GI Sbjct: 177 KGVGVLRAYGAPDRMGKVILISDGQANQGVTDPNQLAAMAALRDDGLVYSVTTVGIGQDF 236 Query: 332 SHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + + + A +Y +ENP F Sbjct: 237 NEQLMATVADGGRGRYYYLENPGDFLAVFQE 267 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L + AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IGI Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGIGA 233 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IGI Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGIGA 233 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IGI Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGIGA 233 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 48/145 (33%), Gaps = 33/145 (22%) Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---------ENLSTKEDQQS 308 N+ L A + + + + +K IV MTDG + K D+ + Sbjct: 325 NTFLALNAA--GLVPDGFKNRPADFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIA 382 Query: 309 LY---------------------YCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFY 346 L CN+AK G VY I + +R CA+ +F+ Sbjct: 383 LNQRTGDRDTYSTQSTNVTNFYSVCNKAKAEGITVYTIAFEAPADAVTQMRTCATSPAFF 442 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIW 371 + AF I + I R+ Sbjct: 443 YKVEGVQIKTAFKSIARQINELRLT 467 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 69/199 (34%), Gaps = 35/199 (17%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F G MTI I + ++ LV G+ +++ M+T + + +DR+++ A ++ + Sbjct: 3 FKREEDGAMTIFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAA-----SDLD 57 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 D + + + + + + I Sbjct: 58 QPLPADEVVDDYFAKA-------------------GMTEFLNDVRITPGSDLPTTNFRI- 97 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + + + + T R + + ++ + + + +++ +VLD+S SM + Sbjct: 98 --VQAEARTRTPSIYMAMTGVRTLPVYVSGTAEETIEK---IEISLVLDISGSMRNNG-- 150 Query: 191 SITKIDMAIKSINAMLEEV 209 KI + + V Sbjct: 151 ---KIGNLRTAAKDFIGAV 166 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 78/229 (34%), Gaps = 45/229 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEV 209 +V+P T+ + +M+ LD+S SME S +++++ + + Sbjct: 88 LVLPATALPASGRE------IMLALDLSGSMERVDFSIDGRNVSRLAAVKQVGADFIRR- 140 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAY 267 + + GLV F+++ + L + + + ++ G ST GL A Sbjct: 141 RAGD------RIGLVIFADQADVAASLSFDTAAVVHALEEAQIGLVGRSTGIGDGLGLAL 194 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ +K+++ ++DG N + A+ G V+ I + Sbjct: 195 KRLDAA-----------TAREKVVILLSDGANNA--GQTTPHDVAGLARDLGIRVHTIAL 241 Query: 328 RVIR-----------SHEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 E LR + S F+ V + I + Sbjct: 242 GPRDLSDADGDPDVVDTEALRDVSATSGGRFFRVRTTDDLAAVADAIDE 290 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVASQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 63/165 (38%), Gaps = 26/165 (15%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ I+++D ++ + + + GLV F + Sbjct: 82 DIVLAIDLSGSMQKEDFQLDGQPISRLDAVKRTASRFVA-------ARRGDRIGLVIFGD 134 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + V + R + G +T + GL A ++ + Sbjct: 135 RAYFAQPLTFDVDAVARAVDEAQIGISGRATAISDGLGLAMKRLAASEA----------- 183 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++V M+DG + T + Q++ A G ++ I + Sbjct: 184 PTRVVVLMSDG--VDTSGNVQAVDAARLAAGHGIRIHTIALGPED 226 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 75/202 (37%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ N ++ FSN+I+ Sbjct: 253 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFN------VIGFSNRIKVW 300 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V + I ++S G T+ L+ A + H + ED + Sbjct: 301 KDHLVSVTPDSVRDGKVYIHHMSPTG-GTDINGALQRAIRLL---NNYVAHNDIEDRSVS 356 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEF-----LRACA 340 +I+F+TDG+ + +L N K+ ++ IGI + L C Sbjct: 357 -LIIFLTDGKPT--VGETNTLKILNNTKEATRGQICIFTIGIGDDVDFKLLEKLSLENCG 413 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + ++ + + I Sbjct: 414 LTRRVHDEDDAGSQLIGFYDEI 435 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 72/195 (36%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQATGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C EA R GAI++ I + + E L FY Sbjct: 410 IVLLTDGEDNGISS-------CFEAVSRSGAIIHTIALGPSAARELETLSDMTGGLRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ S+ DAFS I Sbjct: 463 NKHVSSLIDAFSRIS 477 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 56/453 (12%), Positives = 119/453 (26%), Gaps = 113/453 (24%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE-- 68 F ++ +G + IL A L + + +G I+ + +++ + S D + V + ++ Sbjct: 12 FHHDRRGNIAILFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASVGSVSKASPAFL 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 G+ N +N + + + LD V + Sbjct: 72 AAGSMTTDGPIAVGSTDATNIFNGNMAS-----------QSGYTLSKLDAAVTKSGATLT 120 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME--- 185 + + F T I + I + S +D ++LD S SM Sbjct: 121 STVTFSASVATTFLTII------GKTALAIGGTSVSTSSMPVYIDFYLLLDNSPSMGVGA 174 Query: 186 --------------------------------SFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ID+ + +++ Sbjct: 175 TPTDVATMVDNTSDKCAFACHDVNDEHNYYELAKTLGVKTRIDVLRDATQQLMDTAAATA 234 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK------YAY 267 N + + F + L + + + G +Y Sbjct: 235 TYPNQFRMAIYDFGASAQSAALRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDDRDTSY 294 Query: 268 NQIFDM--QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL---------------- 309 +++ + + +K ++F++DG T Sbjct: 295 SKLLPAIDKQITAAGAGTSDAPQKYLLFVSDGVADETNAGCAKTMKNAFWGNKSPRCQSP 354 Query: 310 ---YYCNEAKKRGAIV---YAIGIRVIRSHE----------------------------- 334 C RG V Y + + Sbjct: 355 IDPALCKAMTDRGVKVAVLYTTYLALPLKQANGDPSWYASWIAPFNVGPYGPSPNSEIAN 414 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 ++ACASP ++ V + DA + I + V Sbjct: 415 NMKACASPGFYFEVSPTDGIADAMNAIFRKAVA 447 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 74/222 (33%), Gaps = 54/222 (24%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ + + + M ++ +E + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMELGNRVVDRFTMVRHVLSDFIERRDGD-------RLGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + R + L G T + + D++ Sbjct: 138 QAYLQAPLTFDRFAVARFLDEAVLGLVGQQTAIGDAIALGVKRFNDLEQSS--------- 188 Query: 287 YKKIIVFMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------ 331 +++V +TDGEN + T SL A++ G +Y IGI Sbjct: 189 --RVLVLLTDGENNAGRFTPAQAVSL-----ARQSGVKLYTIGIGSAEIRRRGLLGTRTV 241 Query: 332 ---------SHEFLRACAS-PNSFYLVENPHSMYDAFSHIGK 363 F++ S ++ + + + + + Sbjct: 242 NPSSDLDQAEKSFIQLSESTGGRYFRARSTEELESIYQELDQ 283 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 74/231 (32%), Gaps = 30/231 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +++V+DVS SM++ D T++ A + + + + Sbjct: 73 FSVALAGPTEDKRVPRNRATVILVIDVSLSMKA-TDVEPTRLAAAQDAAKSFADGLTPGI 131 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV F+ + I L + T + + + + + Sbjct: 132 N------LGLVAFAGTASVLVSPTTNREATKVAIDNL-QLSERTATGEAI---FTSLQSI 181 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 + D IV ++DG+ + + + +AK + + I Sbjct: 182 DTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRGGFTAARQAKDKDVPISTISFGTS 241 Query: 329 ------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V LR A S SF+ + + D + + + I Sbjct: 242 YGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSLEELRDVYDTLEEQI 292 >gi|99031847|pdb|2B2X|A Chain A, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab gi|99031850|pdb|2B2X|B Chain B, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab Length = 223 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 78/218 (35%), Gaps = 35/218 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + + I +N +L+ + + P Q G+V + Sbjct: 21 TQLDIVIVLDGSNSIYPW--------ESVIAFLNDLLKRMDIGP---KQTQVGIVQYGEN 69 Query: 230 IEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L S + + + + G T + G+ A + F Sbjct: 70 VTHEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFT------EARGARRG 123 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF--------LR 337 KK++V +TDGE+ +Q + C ++I I ++ Sbjct: 124 VKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIK 180 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + AS F+ V + ++ +G+ I Sbjct: 181 SIASEPTEKHFFNVSDELALVTIVKALGERIFALEALE 218 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 77/210 (36%), Gaps = 34/210 (16%) Query: 173 DMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM S I+++D A + ++ + K + GL+ F Sbjct: 107 DVMVVVDLSGSMAEQDFTSKNGEKISRLDAAKEVLSDFAKTRKGD-------RLGLILFG 159 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + ++ L++ G ST+ + A + R Sbjct: 160 ---DAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKSRTDV-- 214 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 + + +K+ + +TDG T + + AK + ++ I + + L Sbjct: 215 -EESKEKVAIVLTDG--NDTGSFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNT 271 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S + N + A+ IGK Sbjct: 272 IKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|148225160|ref|NP_001089228.1| hypothetical protein LOC734275 [Xenopus laevis] gi|58047691|gb|AAH89181.1| MGC98917 protein [Xenopus laevis] Length = 1014 Score = 64.0 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 68/184 (36%), Gaps = 24/184 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ ++D S S+ ++ + + ++ + L + P + + G+V +S++ Sbjct: 605 GALDIVFIIDSSESIGY---TNFSLEKNFVINVVSRLGSIAKDPKSDTGARVGVVQYSHE 661 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + ++ L T + L++AYN++ T Sbjct: 662 GTFEAIQLDDPRIDSLSSFKEAVRRLEWIAGGTWTPSALQFAYNKLI--------KETRR 713 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACA 340 K V +TDG + D++ C V AIG E L + A Sbjct: 714 DKAKVFAVVITDGRHDPRDPDERLQVLCG----GDVDVNAIGIGDMFNKPEEDETLTSIA 769 Query: 341 SPNS 344 N Sbjct: 770 CSNK 773 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 69/180 (38%), Gaps = 15/180 (8%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + V T +D++ +LD SM + + + ++ L K D Sbjct: 808 DLPCQTELSVAQCTQRPVDLVFLLD--GSMRT-GEQNFKYAAGFVEEAAQRLSLAKKHDD 864 Query: 215 VNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +N + LV + + ++ F L + ++ + + I+ L G S+ P + +A N + Sbjct: 865 PSN-ARVSLVQYGGQDKQSVAFPLAFDLTEVSQAIEKLRYNGSSSVIGPAIIHAINNVLQ 923 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G E + VF+TDGE ++ +K + + + S Sbjct: 924 NTGRPARRYAEPS-----FVFITDGETGQEGLEEAVTA----MRKNNIVSTLVSVGPEDS 974 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 59/159 (37%), Gaps = 14/159 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGLVTFS 227 +++ ++D S S+ +D I L++++ ++ V+ Q G + +S Sbjct: 36 PINVFFIIDTSESIILQTAPIEILLDNMKVFIPRFLDKLEDAAYLDQVILNWQYGGLHYS 95 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ F + + K+ ++ G T + L M + + Sbjct: 96 DEVIIFSDITTNKQEYKSKLNAVTYIGRGTFTDCAL----------SNMTALIQRQGGDA 145 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 V +TDG + ++ + A+ G ++++ Sbjct: 146 INFAVVITDGHVTGSPCGGM-MHQADRARNAGIKLFSVA 183 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 57/146 (39%), Gaps = 19/146 (13%) Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + E F + L + I+ L G +T + GL+YA + F N Sbjct: 66 TRTEFTFTTHRNGTELVQAIRNLGYKGGNTRTGTGLRYAADNFF-------GPTIIRPNV 118 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSF 345 K+ + +TDG K K +G ++A+GI+ S E R ++P + F Sbjct: 119 PKVAILITDG-----KSQDDVDPPTQRLKSQGIKMFAVGIKNADSRELTRVASTPTEDFF 173 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIW 371 + V + + +VT+++ Sbjct: 174 FYVNDFRILGSLL-----PVVTRKVC 194 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 45/288 (15%), Positives = 104/288 (36%), Gaps = 37/288 (12%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 R+ T +F ++ + N + + ++ + + Y A+ P Sbjct: 97 RVAETPVSTFSVDVDTGSYANVRRFLNQGSLPPEGAVRLKEMVNYFPYHY-ALPTDGSPF 155 Query: 140 KFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T + PW ++ + + I +S A +++ ++DVS SM+ + Sbjct: 156 GVTTEVAATPWNPRTQLLRIGIKAS-DRAVAELAPANLVFLVDVSGSMD-----RREGLP 209 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFG 254 + ++ ++++++ + LV ++ + G + ++ I L+ G Sbjct: 210 LVQSTLKLLVDQLREQD------RVSLVVYAGESRVVLPPTSGRDKAKIRTAIDQLTA-G 262 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCN 313 ST G++ AY Q+ + + N I+ TDG+ N+ + Sbjct: 263 GSTAGASGIQLAY-QMAREGFIDKGINR--------ILLATDGDFNVGISDFDSLKQMAA 313 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 E +K G + +G V +E L A A ++ ++ Sbjct: 314 EQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDTLREARKVL 361 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/207 (19%), Positives = 74/207 (35%), Gaps = 19/207 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D S SM + +KID A S + ++ V+ + + GLV+FS Sbjct: 375 DIMLVIDRSGSMSGDAGTGQSKIDEAKDSASLFVQLVEA----SAGHRMGLVSFSTSASI 430 Query: 233 FFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + I G + + + I D Q T AN K I Sbjct: 431 DEGIGNLNPGKKNQLIGPAPYSGGAV--GGLIPDGWTSIGDGIDKAQSELTGGANP-KTI 487 Query: 292 VFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRAC--ASPNSFY 346 + +TDG +N + + + ++AIG+ + L ++ ++ Sbjct: 488 LLLTDGLQNTPPMIETATNDIGDT------RIHAIGLGTEANLNGGLLSDLTQSTGGAYT 541 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + F+ DI D Sbjct: 542 RAGDGLELKKFFALAFGDIFEDGTLID 568 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 67/196 (34%), Gaps = 34/196 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L ++IVLD S SM K D+A ++ +E +K G + F + Sbjct: 404 PSLGLIIVLDRSGSM------MGEKFDLAKEAAARSVELLKEEDTF------GFIAFDTE 451 Query: 230 ---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E ++ + I+ + G T+ P L AY Q+ +M R+H Sbjct: 452 AWTVVETEPIK-NKDEVIETIRS-TALGGGTDIFPALNQAYQQLNEMDLKRKH------- 502 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNS 344 I+ +TDG++ + E + + I L A Sbjct: 503 ----IILLTDGQSNDGPYE----EIIEEGLTNNVTLSTVAIGGDADTSLLEELAEIGTGR 554 Query: 345 FYLVENPHSMYDAFSH 360 FY V ++ S Sbjct: 555 FYEVYEASAVPSILSR 570 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 85/223 (38%), Gaps = 28/223 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW ++R + + I +S A +++ ++DVS SM+ + + + Sbjct: 171 LAPSPWNPHTRLLRIGIKAS-DRAVAELAPANLVFLVDVSGSMD-----RREGLPLVKST 224 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNS 259 + ++++++ + LV ++ + G + ++ I L+ G ST Sbjct: 225 LKLLVDQLRDQD------RVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQLTA-GGSTAG 277 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY Q+ + Q N I+ TDG+ N+ + E +K Sbjct: 278 ASGIQLAY-QMAQQGFIDQGINR--------ILLATDGDFNVGVSDFDSLKAMAAEKRKS 328 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 G + +G V +E L A A ++ ++N Sbjct: 329 GVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLREARKVL 371 >gi|291395815|ref|XP_002714336.1| PREDICTED: complement component 2 isoform 2 [Oryctolagus cuniculus] Length = 613 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 83/239 (34%), Gaps = 31/239 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + TN + K+ Q L++ ++LD S+S+ + S Sbjct: 92 LAHLFGATNPTQVKKGSLGR-KIQIQRSGHLNLYLLLDASQSVSE------EDFKIFKDS 144 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 M++ + + ++TF+++ + L + I L + Sbjct: 145 ATHMVDRIFSFEIN---LSVAVITFASQPKVIMSVLHDNSRDMTEVINSLENAKYTDHEN 201 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED-ANYKKIIVFMTDGENL---STKEDQQSL 309 G TN L A N + + Q R T + I+ +TDG++ S K S+ Sbjct: 202 GTGTNIYKALN-AVNIMMNNQMQRLGMETTGWQEIRHAIILLTDGKSNMGGSPKPAVDSI 260 Query: 310 YYCNEAKKRG---AIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 K++ +YAIG+ + +E +++++ ++ F H Sbjct: 261 KDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKKDGERHAFILQDARALQQVFEH 319 >gi|291395813|ref|XP_002714335.1| PREDICTED: complement component 2 isoform 1 [Oryctolagus cuniculus] Length = 744 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/239 (16%), Positives = 83/239 (34%), Gaps = 31/239 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + TN + K+ Q L++ ++LD S+S+ + S Sbjct: 223 LAHLFGATNPTQVKKGSLGR-KIQIQRSGHLNLYLLLDASQSVSE------EDFKIFKDS 275 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 M++ + + ++TF+++ + L + I L + Sbjct: 276 ATHMVDRIFSFEIN---LSVAVITFASQPKVIMSVLHDNSRDMTEVINSLENAKYTDHEN 332 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED-ANYKKIIVFMTDGENL---STKEDQQSL 309 G TN L A N + + Q R T + I+ +TDG++ S K S+ Sbjct: 333 GTGTNIYKALN-AVNIMMNNQMQRLGMETTGWQEIRHAIILLTDGKSNMGGSPKPAVDSI 391 Query: 310 YYCNEAKKRG---AIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 K++ +YAIG+ + +E +++++ ++ F H Sbjct: 392 KDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELASKKDGERHAFILQDARALQQVFEH 450 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 244 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 300 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 301 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 349 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 350 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 407 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 408 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 440 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 70/215 (32%), Gaps = 44/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +DVS SM ++ ++ A + L+ + + GL+ F + Sbjct: 100 QMMLAMDVSGSMGEGDMVLGGQAVDRLTAAKAVLADFLDR-RAGD------RIGLLIFGD 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + A ++ Sbjct: 153 RAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG---------- 202 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------I 330 +++++ +TDG + + + L A+ G ++ + Sbjct: 203 -QRVLILLTDGVSNA--GVLEPLRAAEVARAEGVRIHTVAFGGDGSMRVFGISISADQDP 259 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + F+ + + ++ + + Sbjct: 260 VDEATLKKIAGMTGGQFFRARDTAQLAGIYAELDR 294 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 61/163 (37%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ V+DVS SM K+ A ++ A + K N+ + L+ FS+ Sbjct: 194 RPPIDVVFVMDVSGSMT------TMKLQSAKSALQAAVNYFKTNYHPND--RFALIPFSD 245 Query: 229 KIEEFFLLEWGV-SHLQRKIKYLSKFG------VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ ++ +G S++ ++ + G TN + L A + Sbjct: 246 DVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYF----------- 294 Query: 282 TEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIV 322 D KK I+F+TDG +T K G + Sbjct: 295 -NDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKI 336 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 88/236 (37%), Gaps = 24/236 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSV 162 ++ ++ + + Y + P K T PW +R + + I Sbjct: 167 MAGEMPNPDAVRVEEMVNYFPYDWPRPTTAAEPFKATVTVTPTPWNAGTRLMHVAIKG-Y 225 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +V + R +++ ++DVS SM+ K+ + + +++ ++ V+ V +G Sbjct: 226 EVVQKEAPRANLVFLIDVSGSMDE-----PDKLPLLKNAFRLLVDRLRPDDTVSIVTYAG 280 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + E ++ + + I L G ST G+ AY Q+ + +R N Sbjct: 281 NAG---TVLEPTAVKD-KTKILSAIDTLQP-GGSTAGAAGIDAAY-QLAEKAFVRDGVNR 334 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I+ TDG+ N+ D++ ++ G + +G ++ L Sbjct: 335 --------ILLATDGDFNVGPSSDEELKRMVETKRRSGIFLSVLGFGRGNYNDALM 382 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 85/236 (36%), Gaps = 34/236 (14%) Query: 132 ISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y+I L K + T M V +D+++ LD S SM+ F + Sbjct: 257 DPAYRIDLDKTSMNMVRKTFLNK-PMSTRDIVVREYADVKLMDIVLCLDTSGSMKEFSGA 315 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + K+D+A ++I + + D + +V F+ + + + WG +++ I + Sbjct: 316 YM-KMDIAKEAIVKYIRYLSRTND-----RLSMVLFNFRAD----ILWGPHSVKKYINEM 365 Query: 251 SKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + G TN L+ A + +H I+ +TDG ++ Sbjct: 366 EEMSRYIYPGGGTNIANALEKARIILSKSNYPNKH-----------IICITDGRTVNASS 414 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + + ++ G + + + + L + + F + + ++ A Sbjct: 415 ---CIKEAVKLRRMGVTLSTVAVGDNSDFDLLMRLSKIGNGLFIKINDISNLDKAL 467 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 61/163 (37%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ V+DVS SM K+ A ++ A + K N+ + L+ FS+ Sbjct: 194 RPPIDVVFVMDVSGSMT------TMKLQSAKSALQAAVNYFKTNYHPND--RFALIPFSD 245 Query: 229 KIEEFFLLEWGV-SHLQRKIKYLSKFG------VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ ++ +G S++ ++ + G TN + L A + Sbjct: 246 DVKATSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYF----------- 294 Query: 282 TEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIV 322 D KK I+F+TDG +T K G + Sbjct: 295 -NDPERKKYIIFLTDGMPTVLNTTSSITHKEIKKGFKDDGEKI 336 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 79/234 (33%), Gaps = 48/234 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T + ++T +D+++ +DVS SM + D ++ K + K P+ Sbjct: 76 PQTEDISTKTKTTKGIDIVMAIDVSSSMLA-RDLKPNRLASLKKVAADFI---KKRPND- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQ 274 + GLV ++ + + + +K ++ T GL + N++ Sbjct: 131 ---RIGLVVYAGESYTKTPITSDKGIVLNALKEITYGSLEDGTAIGMGLATSVNRL---- 183 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-- 332 E K+I+ +TDG N S + Q+ A + Y IG+ + Sbjct: 184 -------KESKALSKVIILLTDGINNSGFIEPQTA--AELAVEYDIKTYTIGLGTNGNAL 234 Query: 333 ---------------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + +++ N S+ + I K Sbjct: 235 SPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYFRATNNESLASIYDEINK 288 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 27/212 (12%), Positives = 68/212 (32%), Gaps = 46/212 (21%) Query: 174 MMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +M+ +D+S SME + ++ +++M + +E K + GL+ F++ Sbjct: 88 IMLAVDLSGSMEIADMTLDGRNVDRLEMVKAVLGDFIERRKGD-------RLGLILFADT 140 Query: 230 IEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + +Q+ + L G T + A + Sbjct: 141 AFLQTPITYDRNTVQQMLDESVLGLVGERTAIGDAIALAVKRF-----------KGKQQT 189 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHE------ 334 +++V +TDG+N + + AK +Y I + Sbjct: 190 NRVLVLLTDGQNTA--GNLSPEQALELAKAYDVRIYPIAVGAEEVVVDSVFGRRKVNPSR 247 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFS 359 ++ A + ++ + + + Sbjct: 248 DLDVPLMQNLADETGGEYFRARSTEELERIYQ 279 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDDEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IG+ Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGVGA 233 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 73/198 (36%), Gaps = 24/198 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++D+ +LD S S+ + T + + ++ E + + ++ Sbjct: 138 GQIDIAFLLDGSGSIGIY---DFTVMKGFVTNVIRRFIERDAQFAIAQYSNDCDIHYNFN 194 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + W + K+ + T + ++ N +F G + KK Sbjct: 195 DLKLDDGTW-----ESKVANIPYHEGGTFTASAIQKLVNYLFTPNG------GTRPSAKK 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACASP--- 342 I+V +TDGE+ + + ++A+K + +AIG+ + E L AS Sbjct: 244 ILVVITDGESHDRNLLKDA---ASQAEKNSIVRFAIGVGKAFDYYNAREELNTIASDPDT 300 Query: 343 NSFYLVENPHSMYDAFSH 360 + + V + +++ + Sbjct: 301 DYVFKVTDFNALKNILQK 318 >gi|170594383|ref|XP_001901943.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158590887|gb|EDP29502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 415 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 79/222 (35%), Gaps = 31/222 (13%) Query: 151 SRHIVMPITSSVKV-NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 S VM TS+ V N + D++ ++D S + D I + + I+S+ Sbjct: 212 SSSAVMSTTSTKDVVNIKPGCLFDVVFLMDFSGGVSDKRDIYIDYVSILIRSL------- 264 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS----HLQRKIKYLSKFGVSTNSTPGLKY 265 V + +S L + + ++ G +T + + Y Sbjct: 265 ---DLNRTAVHVAAIYYSGPKRARTLFHLRKHSRAENAIKDLQRAPSNGGTTRTGEAIYY 321 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A N+ + G R+ + +K+I+ TDG + + A+ +G + A+ Sbjct: 322 ATNEFNEKFGARK-------DARKMIIIFTDGHSQDNPTEAS-----RTARNKGIELKAV 369 Query: 326 GIR----VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 I +++ + P+ Y +N + + F + Sbjct: 370 SIEDENIPPDTNQIIAITGDPSDAYSSKNFNKLQSFFDEYSR 411 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 86/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 502 QTYMDIIIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 550 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 551 DVVHEFHLNDYRS-VKEVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 603 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + + K YA+ + R I FL Sbjct: 604 --AKKVMIVITDGESHDSPDLEKVIE--DSEKDN-VTRYAVAVLGYYNRRGINPEAFLNE 658 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 659 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 692 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 79/230 (34%), Gaps = 27/230 (11%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSI 202 + W I P + + AR D+++ +D+S SM F D+S + + Sbjct: 68 LIWILLVIAISQPERLGAPIETSKSAR-DLILAIDISGSMDTRDFTDASNENLQ----RL 122 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 + + V+ + + L+ F +K L + + + G Sbjct: 123 AGVRDVVRAFVEGREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQ---------TEVG 173 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + + I D G+ +++++ ++DG T L A+ G + Sbjct: 174 MAGPHTAIGDAIGLSIRTFEASEIEQRLLILLSDG--ADTASRMSPLNAAEIARGAGVEI 231 Query: 323 YAIGIRVIRSH-------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + I + L+ A + S++ + ++ + ++ I + Sbjct: 232 FTIAVGDPDGTGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 82/232 (35%), Gaps = 31/232 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEE---VKLIPDV 215 + + T+ D+ I +D+S S+ FF ++ + + + + DV Sbjct: 596 TARPTCPTEEDFDLCIAVDMSGSVCNSGFFGNNCVGCSPFVFCQSLFVSQETCCANFGDV 655 Query: 216 NNVVQS---GLVTFSNKIEEFFLLEW--------GVSHLQRKIKYLSKF---GVSTNSTP 261 + L F +K F ++ + G+S + I L + G STN Sbjct: 656 QQFARLMVYNLSQFGDKNTSFSVVSFASDAEILSGLSSADKTINVLDQLIYSGGSTNHGQ 715 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN--LSTKEDQQSLYYCNEAKKRG 319 + +F ++ N KK I+ +TDG + + ++ AK G Sbjct: 716 AINACQESLFTSD--------QNINRKKFIMLITDGVSATDDLNPEADAIDAAETAKSSG 767 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + I I + + +S ++ V + +D+ + +V + Sbjct: 768 ITIIPIFISPYNDIDAVSFMSSLSNDGEVYVTN--FDSLDSLKDQLVEQVSC 817 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 75/211 (35%), Gaps = 34/211 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++V+D S SM D + + A IN M + + ++ F + Sbjct: 114 DIVLVIDNSGSMNE-TDPNQDRYTAAKNLINRM----------DRDNRVSVMVFDHATTL 162 Query: 233 FFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + +I L+ T+ + L+ + I + + + Sbjct: 163 LQPFTRVKNQETKDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRDAGRSAM------ 216 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR--ACASPN 343 ++ ++DG S + + L E K++ V IG+ + L+ A + Sbjct: 217 ---VIMLSDG--FSETDHDRVLA---EYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETGG 268 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +Y V++ + F I D+ + + + Sbjct: 269 QYYDVQHAEDLSFVFQKIYDDVGDRSLLTKR 299 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 68/216 (31%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + D + I+ + + E + + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMVIDGKVVDRFTLIQHVVSEFIERRKGD------RIGLILFADH 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + A + M Sbjct: 139 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN---------- 188 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----------------- 330 +++V +TDG N + + + A R +Y +G+ Sbjct: 189 -RVLVLLTDGSNNA--GNIEPQQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSM 245 Query: 331 --RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGK 363 ++ + A+ ++ N + + I K Sbjct: 246 DLDENQLQQIADATHGRYFRARNSEELEQIYQEIDK 281 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 67/191 (35%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM+S ++ ++ L I + G+VTF + Sbjct: 364 VCLVLDKSGSMDSE-----DRLLRMNQAAALYL-----IQIIERESLVGMVTFESTAKIQ 413 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ + G T+ GLK + I Sbjct: 414 NNLTKITDDDTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNTSGSE--------- 464 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ C E K+ GAI++ I + + E L + FY Sbjct: 465 IVLLTDGEDNGIHS-------CFEEVKQSGAIIHTIALGPSAAKELEILSSMTGGYRFYA 517 Query: 348 VENPHSMYDAF 358 ++ + + DAF Sbjct: 518 NKDINGLIDAF 528 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 74/216 (34%), Gaps = 39/216 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D + +++ A + + + ++ + +G + Sbjct: 88 IDIMLAMDVSTSMLAE-DLTPNRMEAAKDVAAEFIAD-RPNDNIGLTIFAGEAFTQCPMT 145 Query: 232 EF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ I T GL A +++ D + K+ Sbjct: 146 TDHTSLLNMLQTVRTDIAAKGLIQDGTAIGMGLANAVSRLKDSKAKS-----------KV 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE---------------- 334 ++ +TDG N D + N AK G VY IG+ + Sbjct: 195 VILLTDGSNNM--GDLSPMTSANIAKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPV 252 Query: 335 -----FLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A + +FY N + + I K Sbjct: 253 EIDTKVLKDIAATTDGNFYRATNNQELKQIYKDIDK 288 >gi|26325991|dbj|BAC26739.1| unnamed protein product [Mus musculus] Length = 415 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 70/180 (38%), Gaps = 22/180 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +S + ++ +D + D++ +LD S S+ + S + + + ++ + P Sbjct: 15 LALARSSIERGSTASDPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVATMSFGP 68 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +++ LV ++ F + S +Q I+ + TN+ L YA Q+F Sbjct: 69 GA---LRASLVHVGSQPHTEFTFDQYSSGQAIQDAIRVAPQRMGDTNTGLALAYAKEQLF 125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K++V++TDG + E K GA ++ + Sbjct: 126 AEE------AGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGATIFIVSTGRGN 174 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDYEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IGI Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGIGA 233 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 50/227 (22%), Positives = 92/227 (40%), Gaps = 31/227 (13%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 I S S + +D++ VLD S SM+ S +I ++I L+ ++ + Sbjct: 20 GFIPSQKVYASTSSPNVDVVFVLDSSGSMK---QSDPEEIRT--EAIKMFLDMGQVQGNK 74 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIF 271 GLV +S+ I + L+ + +I+ ++ G T+ GL Sbjct: 75 A-----GLVAYSDTIIKEHNLDAINSESDKERIEEMASNIPLGQKTDIGRGL-------- 121 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGI 327 ++G + + D+N + +I+ ++DG+N S + +SL N A K +G VY IG+ Sbjct: 122 -LEGAKVLDSGHDSNNRPLIILLSDGKNDSQRSASESLKDLNSAISTCKSKGYPVYTIGL 180 Query: 328 RVIRS--HEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L AS Y+ + D I D ++ Sbjct: 181 NYDGTVDKAQLTQIASETKGKNYITNKASDLTDILKDIYGDSANVKV 227 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 103/270 (38%), Gaps = 38/270 (14%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +I S +++I + +I S+ + P+ + ++++ Sbjct: 197 EAANIGSSRTVEIKTYSEFS--AIQQSSQDDFAVLIHLKAPYANPEQVTGRSVSATSVGY 254 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + A +D++ VLDVS SM TK+ + +++ +++ + + ++ Sbjct: 255 PTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RLSVIA 302 Query: 226 FSNKIEEFFLLEWGV-SHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ F L S Q+ ++ ++ G TN LK A I R H N Sbjct: 303 FSSTARRLFHLRRMSHSGRQQALQAVNSLGASGGTNIADALKKAAKVI----EDRSHQNP 358 Query: 283 EDANYKKIIVFMTDGENLST--------KEDQQSLYYCNEAKKRG--AIVYAIGIRVIRS 332 + I+ ++DG++ + D SL + V+ G V Sbjct: 359 VCS-----IILLSDGQDTYNIPSNIRGARPDYSSLVPSSILNHTFRLVPVHGFGFGVDHD 413 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 + L + AS +F +E+ + DAF+ Sbjct: 414 SDALHSIAEASGGTFSFIEDEGVIQDAFAQ 443 >gi|198435588|ref|XP_002122129.1| PREDICTED: similar to von Willebrand factor precursor (vWF) [Ciona intestinalis] Length = 3684 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 90/274 (32%), Gaps = 32/274 (11%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQ- 167 + +T++ V S+ ++ IP T IP +P + Sbjct: 209 VSTTTTIPSVPGATTTFSSVPGVTTTTIPSVPGATTIPSVPGVTTTTVPFLPGLITTPPL 268 Query: 168 ----------TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T +D+++V+D S S+ D IT + ++++ + N+ Sbjct: 269 LPQPHTTLGLTGCTVDLVLVVDSSYSIG--IDGFIT----LKDILGNLIKKFDV---PND 319 Query: 218 VVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + LV +S + + +LL + + I + T + LK + D Sbjct: 320 ETRVSLVQYSKRSQVEWLLSTYPGNLDGMLHTIAGMQMLQGVTYTYHALKLVLQTVIDGS 379 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY---AIGIRVIR 331 + + +IV +TDG ++ L + + IG+ + Sbjct: 380 DSGR-----RPDVPFVIVLITDGRAKDEDIREEVLNKIQVVHDKWNETFHLITIGVGSVD 434 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 H+ + + Y VE+ + + I Sbjct: 435 PHQLEGIASHDDYVYTVEDMNEASSLVDEVAASI 468 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 48/286 (16%), Positives = 92/286 (32%), Gaps = 28/286 (9%) Query: 11 FFYNYKGGM-TILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 Y G + +L I + + +K VL ++ID S + Sbjct: 337 LLSTYPGNLDGMLHTIAGMQMLQGVTYT-----YHALKLVLQTVIDGSDSGRRPDVPFVI 391 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + K DI + N + + V L+ + + Y++ Sbjct: 392 VLITDGRAKDEDIREEVLNKIQVVHDKWNETFHLITIGVGSVDPHQLEGIASHDDYVYTV 451 Query: 130 S-----AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ C + T R + I SSV+ D++ VLD SRS Sbjct: 452 EDMNEASSLVDEVAASICRPLARETYPRILTTTI-SSVQPECSYMPPTDIIFVLDGSRSE 510 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF----FLLE--W 238 S KI ++ + + K+ P Q G+ + +++ F L Sbjct: 511 NSELRLMQHKI-----LLSNLTKLFKIGPSD---TQVGVFLYDDELHGSGLNGFPLSEKS 562 Query: 239 GVSHLQRKIKYL-SKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNT 282 ++ +Q I L + TP ++ A ++ + G R Sbjct: 563 NINEVQDAIDKLENPNNDVMFFTPAIQEALSQKLLESAGGRPDARK 608 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQTVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYNTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|198429401|ref|XP_002121222.1| PREDICTED: similar to integrin alpha 9 [Ciona intestinalis] Length = 1242 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 39/314 (12%), Positives = 101/314 (32%), Gaps = 36/314 (11%) Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 A+ ++ + N++ + + + + +F + ++ ++ I + Sbjct: 50 GASVVVKKTASNSQAFVGAPLDERKNGSLYKCTFDGRTSGSSLCIRETAFDKTGAMGISI 109 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + +I Y L + N+ ++ + + D++ V+D Sbjct: 110 SMNTDDENIYVSVAYAQRL-----LEGDINTLYLSIACPT-------EGLTADIIFVVDE 157 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---- 236 S S++ D ++ + I++ + V G++ FS + Sbjct: 158 SGSVD--VDEYRDSLNWMKQVISSFRSYIDKGD-----VHVGVIGFSRLNNIDTKVRIRL 210 Query: 237 -EWGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 W + L +I + V+ S GL Y Y + H + K ++ + Sbjct: 211 QAWSYTSLTSQINNM----VNVRSLNGLTYIGYAINLTITEFDDH---GRESVPKEMILL 263 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHS 353 TDG + A+ G + ++G+ + + L + + + N + Sbjct: 264 TDGAATKP---ENVKPAAERARANGIVTVSVGVGSRVDETQLLTIAGNASRVFKATNYDN 320 Query: 354 MYDAFSHIGKDIVT 367 + + I Sbjct: 321 LDSVVEGVKSTIQD 334 >gi|221133174|ref|XP_002171310.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 221 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 71/197 (36%), Gaps = 21/197 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSG 222 V + +A +D+ +LD S S+E+ + L+++ + ++ N + G Sbjct: 9 VQPRCEAVVDVAFILDSSHSLEASYQKE-----------KNFLKKLAAVFGISSNGSRVG 57 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++TFS + + L S + + +T L+ A +F Sbjct: 58 VITFSYRAKLSVKLNSFTDLSSFNEAVDKIPLMNFTTRIDRALRLAQKDMFTS------A 111 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRAC 339 N K+I+ +TDG + + +E + G ++ +GI + E Sbjct: 112 NGGRVGVSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHIT 171 Query: 340 ASPNSFYLVENPHSMYD 356 + Y S+ D Sbjct: 172 GGKKNAYTAATFDSLTD 188 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 57/451 (12%), Positives = 130/451 (28%), Gaps = 106/451 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + +G + + AI + +G ++ S ++ L S +D + V A ++ + Sbjct: 8 LARFRSDIQGNVAVTFAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASVGAVSR-TS 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G I N + N + ++ + Sbjct: 67 PAFIAAGAMTADGIITA--GNDDARNIFNGNMNGTTGYTLNSVTPEVK------KTGSVL 118 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 + + +P+ F + + + + S + +D ++LD S SM Sbjct: 119 TATVSFSASVPMMFMNIV----GIKTMTLQGMSKATASMPK--YIDFYLLLDNSPSMGVA 172 Query: 185 ----------------------------ESFFDSS-----------ITKIDMAIKSINAM 205 + DS+ T+ID+ + + Sbjct: 173 ATPDDVTKMVNATSDAKYGSNRYCAFACHDYNDSNNFYNLAKSIGVTTRIDVLRSATQQL 232 Query: 206 LEEVKLIPDVNNVVQS-----GLVTFSNKIEEFFLLEWGVSHLQRK---IKYLSKFGVST 257 ++ N + G + + + F L +S + I + +G + Sbjct: 233 MDTATQTQTYPNQFRMAIYDFGAASKTIGLRALFALSANLSSAKSAAGNIDLMGVYGNND 292 Query: 258 NSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKIIVFMTDGENLSTKED---------- 305 T + +F + + + K + F++DG + Sbjct: 293 AYTADKDTPFTAVFPAVNNEISTPGDGTTGSPLKYLFFVSDGVADESNAACLKPKASGNR 352 Query: 306 ---QQSLYYCNEAKKRGAIV---YAIGIRVIRSH-----------------------EFL 336 + C K RG + Y +++ + + + Sbjct: 353 CQSPINPALCTTLKNRGIKIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNM 412 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +ACASP ++ V + DA + + K V Sbjct: 413 QACASPGFYFEVSPTQGIADAMNALFKKAVA 443 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 79/232 (34%), Gaps = 32/232 (13%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++ K + A +++V+DVS SM++ D +++ A ++ + + + Sbjct: 72 TVALAGPTADKRVPRNRA--TVVLVIDVSLSMKA-TDVEPSRLAAAQEAAKSFADGLTPG 128 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV F+ + I L+ T + + + + Sbjct: 129 IN------LGLVAFAGTASVLVSPTPNRDETKAAIDNLT-LSERTATGEAI---FTSLQS 178 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR- 328 + + + IV ++DG+ S + + +AK +G + I Sbjct: 179 IDTLAAVLGGSEQAPPARIVLLSDGKQTVPESPDDPRGGFTAARQAKDKGVPISTISFGT 238 Query: 329 -------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V LR A S +F+ + + D + + + I Sbjct: 239 GYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTASSLEELRDVYDTLEEQI 290 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 79/232 (34%), Gaps = 32/232 (13%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++ K + A +++V+DVS SM++ D +++ A ++ + + + Sbjct: 72 TVALAGPTADKRVPRNRA--TVVLVIDVSLSMKA-TDVEPSRLAAAQEAAKSFADGLTPG 128 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV F+ + I L+ T + + + + Sbjct: 129 IN------LGLVAFAGTASVLVSPTPNRDETKAAIDNLT-LSERTATGEAI---FTSLQS 178 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR- 328 + + + IV ++DG+ S + + +AK +G + I Sbjct: 179 IDTLAAVLGGSEQAPPARIVLLSDGKQTVPESPDDPRGGFTAARQAKDKGVPISTISFGT 238 Query: 329 -------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V LR A S +F+ + + D + + + I Sbjct: 239 GYGTVEIEGDRIPVPVDDPSLREIANLSGGNFFTASSLEELRDVYDTLEEQI 290 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 63.6 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 75/202 (37%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 253 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 300 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V + I ++S G T+ L+ A + D H + ED + Sbjct: 301 KDHLVSVTPDNVRDGKIYIHHMSPTG-GTDINGALQRAIKLLND---YVAHNDIEDRSVS 356 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +IVF+TDG+ + +L N K+ ++ IGI + L C Sbjct: 357 -LIVFLTDGKPT--VGETHTLKILNNTKEAARGQVCIFTIGIGDDVDFKLLEKLSLENCG 413 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 E+ + + I Sbjct: 414 LTRRVLEEEDAGSQLIGFYDEI 435 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 73/218 (33%), Gaps = 49/218 (22%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + +T++++ + +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYPDMQWQGEELTRLELVKVLLGDFIEQ-------RHGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L + ++ + S G +T + A ++ E Sbjct: 144 KAYLQSPLTFDRRTVRVWLDEASVGIAGSNTAIGDAIGLALKRL-----------RERPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA----- 340 +++V +TDG N + + L A + ++ IGI L Sbjct: 193 NSRVLVLVTDGANNG--GEIEPLLAATLAAEENVRIHTIGIGAVPEEGGVLSRFGFNPGL 250 Query: 341 -------------SPNSFYLVENPHSMYDAFSHIGKDI 365 + ++ + + IG+ + Sbjct: 251 DLDEPTLRAIAEQTGGEYFRAASSEQL----QAIGEAL 284 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/297 (13%), Positives = 100/297 (33%), Gaps = 36/297 (12%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIMNEGNGNN 73 +G IL LPI+ + + +++S + + + + A+ + + N Sbjct: 8 RGVAGILFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDDNVKY 67 Query: 74 RKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++L ++ I + + R +G V ++ + + +++ + Sbjct: 68 ARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQASGEVAPFSDFVVSAKAEHKSWIAYEE 127 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + K + +D+ + D S SM + Sbjct: 128 AELKPEFEVAGKSVTRKYLPQ-----------------PVDVYFIGDFSGSMTGHWKGGK 170 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 TK+D+ ++I ++E++ + + L+ ++ + Q YL+ Sbjct: 171 TKLDVVKQTIERVVEDI-ADFNTEEKSRVALLGYNP-----------LHVKQSGTVYLNA 218 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA--NYKKIIVFMT--DGENLSTKED 305 +G + + Y Y + Q + + + ++ + M+ D ENL D Sbjct: 219 YGYRRSWPKKIAYDYARGTSAQTVAKMFDPPSVYSRVQEYVRGMSRLDVENLVVNND 275 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 36/234 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++ K + A +++V+DVS SM++ D T++ A ++ + + + Sbjct: 72 TVALAGPTADKKVPRNRA--TVILVIDVSLSMQA-TDVEPTRLAAAQEAAKSFADGLTPG 128 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV F+ + I L T + + + + Sbjct: 129 IN------LGLVAFAGTASVLVSPTTNRDATKVAIDNLK-LSERTATGEAI---FTSLQS 178 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + D IV ++DG EN + +AK +G + I Sbjct: 179 IDTLSAVLGGSDQAPPARIVLLSDGKQTVPENSDDPRGGFTA--ARQAKDKGVPISTISF 236 Query: 328 R--------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V L+ A S SF+ + + + + + I Sbjct: 237 GTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTASSLEELRQVYDTLEEQI 290 >gi|74189919|dbj|BAE24585.1| unnamed protein product [Mus musculus] Length = 319 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 84/236 (35%), Gaps = 23/236 (9%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + ++ +I + + +V +G ++ Y + + Sbjct: 93 VKLDLPVNTSIPNVTEIKENMTFGSTLVTNPKGGFLACGPLYAYRCGHLHYTTGICSDVS 152 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S + +LD++IVLD S S+ + + +N +L+ + + P Sbjct: 153 PTFQVVNSFAPVQECSTQLDIVIVLDGSNSIYPW--------ESVTAFLNDLLKRMDIGP 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQI 270 Q G+V + + F L + + + G+ T + G+ A + Sbjct: 205 ---KQTQVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTARKEA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 F KK++V +TDGE+ +Q + C ++I Sbjct: 262 FT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIA 308 >gi|15779150|gb|AAH14640.1| COL14A1 protein [Homo sapiens] Length = 759 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 72/183 (39%), Gaps = 21/183 (11%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKY 249 + + I + + + + I + Q +V F++ F L + L IK+ Sbjct: 8 MKNFNKIISFLYSTVGALNKI--GTDGTQVAMVQFTDDPRTEFKLNAYKTKETLLDAIKH 65 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 +S G +T + +KY + +F + + K+IV +TDG + Sbjct: 66 ISYKGGNTKTGKAIKYVRDTLFTAESGTRRG------IPKVIVVITDG-----RSQDDVN 114 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVT 367 E + G ++AIG+ E + + P+ + V++ +DAF I +++T Sbjct: 115 KISREMQLDGYSIFAIGVADADYSELVSIGSKPSARHVFFVDD----FDAFKKIEDELIT 170 Query: 368 KRI 370 Sbjct: 171 FVC 173 >gi|328784200|ref|XP_003250409.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Apis mellifera] Length = 2258 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 72/211 (34%), Gaps = 37/211 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +++++ ++D S S+ S + + +L + +IP Sbjct: 89 THIDLLRNETDKVELVFLVDSSGSVGS------KNFHSELNFVKHVLSDFPVIPSA---T 139 Query: 220 QSGLVTFSNK---------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +VTF + I + L +++ +S G T + L AY + Sbjct: 140 RIAIVTFGGRRHIRRNVDQISRTNENDNKCYLLNKQLTNISYTGGGTYTRGALLEAYRIL 199 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +N KK + +TDG + N K GA ++ GIR Sbjct: 200 ----------EKARSNAKKAVFLITDGFSNG----GDPRPAANLLKGAGATIFTFGIRTG 245 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 E +SP Y D+F+ Sbjct: 246 NVDELHDIASSPKDTY-----SYFLDSFTEF 271 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 52/235 (22%), Positives = 84/235 (35%), Gaps = 33/235 (14%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 + K F + S + P+ S + N+ +++I LD S SM F Sbjct: 39 KSKNRFSLLEFFIIFFLSVALASPVKSKIITNTHKKG-YNIVIDLDTSGSMAEF-----N 92 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLS 251 KID A ++ K GLV F N L + + +K Y+S Sbjct: 93 KIDAAKAVSLDFAKKRKNDA-------LGLVVFGNIAYIASPLTFDKKTFEDILKRIYVS 145 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLY 310 G T L + N + G +KII+ +TDG +N+S + Sbjct: 146 IAGGKTAIYDALFLSSNLFKNANG------------EKIIILLTDGMDNMSITPLDVVIK 193 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + KK VY+I I L+ + + FY+ + + +S I K Sbjct: 194 ---KLKKEHIKVYSIAIGGDADLSVLKKISKETNGKFYIASSLEDLKKIYSDINK 245 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 48/320 (15%), Positives = 107/320 (33%), Gaps = 40/320 (12%) Query: 65 IMNEGNGNN-RKKLKGGDILCRIKNTWNMSFRNELR----DNGFVNDIDDIVRSTSLDIV 119 ++NE + +K + ++ + + N L + + I+ Sbjct: 161 LVNEEDITPFQKAIYHPTDFSQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAPII 220 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV-NSQTDARLDMMIVL 178 +G+ I K L W + +M + KV + + + ++ +++ Sbjct: 221 ASDLKDGFIIDKQYLLKYLLPIFNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLI 280 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLL 236 D+S SME F + +I +L+++ IP+ Q +V F+++ F Sbjct: 281 DISGSMEKDFS-------VYKNNILKILDKLAEIPNW----QINIVVFNDESTARSFSNQ 329 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 E + ++ I L G Y +++ + ++ TD Sbjct: 330 ENNIEDIKVYINNLKANG------------YTKLYGTIKEALESFKGKIDESSTLIVFTD 377 Query: 297 GENLSTKEDQQSLYYCNE----AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE--- 349 G++ T + + K +Y +G + EF A+ F V Sbjct: 378 GKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQYYNQEFFEQVATRGGFTHVSLND 437 Query: 350 --NPHSMYDAFSHIGKDIVT 367 H + +I + ++T Sbjct: 438 PTGMHQLQQYIDNIEQKVMT 457 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 78/212 (36%), Gaps = 31/212 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 V + + + + S K P + + +T +V++ Sbjct: 23 VPAVAFGADNGSSTAALNFTITSSQTEYAKP----PNADAQGRLDITLTPQGRVDNIIRP 78 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ V DVS SM K+D A ++ + ++ K + N+ + LV FS+ + Sbjct: 79 PIDVVFVFDVSGSM----VMPSLKLDSAKYALQSAVDYFKANANPND--RFALVPFSDDV 132 Query: 231 E--EFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + G +++ + +++ + TN T L+ A Sbjct: 133 QYSKVVPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQA------------QSFF 180 Query: 283 EDANYKKIIVFMTDGENL-STKEDQQSLYYCN 313 D KK I+F+TDG S ++ + C Sbjct: 181 NDPARKKYIIFLTDGMPTVSIAKEPITYKVCE 212 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 83/225 (36%), Gaps = 28/225 (12%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + R + + + + + + + +A L++ ++LD S SM ++ ++ + Sbjct: 16 PTRVSQRQLEISVAAIAQASGERNAPLNLGLILDHSGSMAGQP------LETVKRAAQKL 69 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ + + ++ F + K+ ++ +I +L+ G T GL Sbjct: 70 VDRLLPSD------RLAVIVFDHVAKVLIPNQPVTDRDKIKTRISHLAAMG-GTAIDEGL 122 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + ++ + I +TDGEN + + L EA K + Sbjct: 123 QLGLTELIAAKAGAISQ----------IFLLTDGENEHGN-NSRCLQLAEEAAKENITLN 171 Query: 324 AIGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIV 366 +G + + L A A+ S +E P + F + I+ Sbjct: 172 TLGFGYHWNQDVLEQIADAAGGSLMFIEYPQDVLIGFERLFNQII 216 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 253 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 300 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V + I ++S G T+ L+ A + D H + ED + Sbjct: 301 KDHLVSVTPDNVRDGKVYIHHMSPTG-GTDINGALQRAIKLLND---YVAHNDIEDRSVS 356 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +I+F+TDG+ + +L N K+ ++ IGI L C Sbjct: 357 -LIIFLTDGKPT--VGETHTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCG 413 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 ++ + + I Sbjct: 414 LTRRVLEEDDAGAQLIGFYDEI 435 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 84/210 (40%), Gaps = 37/210 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + + + ++ ++ +Q G++ + + Sbjct: 166 MDIVIVLDGSNSIYPWVEVQ-----------SFLISILQKFYIAPGQIQVGVLQYGETVV 214 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L V+ + K + + G + T + G++ A + F G + K Sbjct: 215 HEFYLNNYRSVTDVVEAAKRIEQRGGTETRTALGIEKAVTEAFQRGGRKG--------AK 266 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL---RAC 339 K+++ +TDGE+ + + Q+ + + YA+ + R I FL + Sbjct: 267 KVMIVITDGESHDSPDLQRVIESSEK---DNITRYAVAVLGYYNRRGINPEAFLNEIKYI 323 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ D +G+ I Sbjct: 324 ASDPDDKHFFNVTDEAALKDIVDALGERIF 353 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 251 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 298 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V + I ++S G T+ L+ A + D H + ED + Sbjct: 299 KDHLVSVTPDNVRDGKVYIHHMSPTG-GTDINGALQRAIKLLND---YVAHNDIEDRSVS 354 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +I+F+TDG+ + +L N K+ ++ IGI L C Sbjct: 355 -LIIFLTDGKPT--VGETHTLKILNNTKEAARGQICIFTIGIGNDVDFMLLEKLSLENCG 411 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 ++ + + I Sbjct: 412 LTRRVLEEDDAGAQLIGFYDEI 433 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D I+++++ K +E + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQDDEISRLELVKKLFGDFIE-------GRRGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLALKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N + Q+ A + +Y IGI Sbjct: 193 ESRVLVLITDGANTGGQISPQTA--ARLAAEERVKIYTIGIGA 233 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 80/218 (36%), Gaps = 28/218 (12%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 +T + + + L M ++ + + K + + +D + A ++ EG +++ + D Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLT-EGATDDQLRAYALD 59 Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG---YSISAISRYKIP 138 N N +T+L + + G ++A YK Sbjct: 60 FFNANLNKIN-------------------PANTTLTVTLPSNTTGGGLLKMTARLDYKPY 100 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDM 197 + I+ ++ + L++ +VLD S SM S T+ID+ Sbjct: 101 FYPVFGQLVGKSETDANQRISFNITSEVRLKNTLEVALVLDNSGSMTKTGTGSGQTRIDL 160 Query: 198 AIKSINAMLE----EVKLIPDVNNVVQSGLVTFSNKIE 231 + +++ + +I V+ VQ GLV F+ + Sbjct: 161 LKTAAKQLVDTLAQQAAMIKQVDRPVQFGLVPFAASVN 198 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/270 (14%), Positives = 78/270 (28%), Gaps = 83/270 (30%) Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEW-- 238 ++ S+++ T + M + + P + +G +S L Sbjct: 374 GAINSWWNDDPTS-STGQSRLRNMSKYFQPRPINAPALAAGNGPNYSCSTNPITPLTDVS 432 Query: 239 ---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 G++ ++ I + G TN G+ + + + + Q + K+++ +T Sbjct: 433 VTTGLTAIKAAIDLMKPDG-GTNVPEGMAWGWRVVSSGEPFTQGRPETERGNDKVVIVLT 491 Query: 296 DGENL---------STKEDQQS-------------------------------------- 308 DG N S D +S Sbjct: 492 DGANTYYTPSSLSHSDPADSKSTYASFGYLNPGYNGTSVGRLFMGTSSAIGQFDYSNGNY 551 Query: 309 --------LYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACAS------------ 341 CN AK +V + + + + E L++C+S Sbjct: 552 TNALNEQMATLCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSCSSNSRFRKDPTDAS 611 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 P + S+ + F IG ++ RI Sbjct: 612 KPAKLFWNATGASLSNDFKEIGNELSNLRI 641 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 76/234 (32%), Gaps = 36/234 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++ K + A +++V+DVS SM++ D T++ A ++ + + + Sbjct: 72 TVALAGPTADKKVPRNRA--TVILVIDVSLSMQA-TDVEPTRLAAAQEAAKSFADGLTPG 128 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLV F+ + I L T + + + + Sbjct: 129 IN------LGLVAFAGTASVLVSPTTNRDATKVAIDNLK-LSERTATGEAI---FTSLQS 178 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + D IV ++DG EN + +AK +G + I Sbjct: 179 IDTLSAVLGGSDQAPPARIVLLSDGKQTVPENSDDPRGGFTA--ARQAKDKGVPISTISF 236 Query: 328 R--------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V L+ A S SF+ + + + + + I Sbjct: 237 GTTYGRVEIEGDRIPVPVDDASLKEIANLSGGSFFTASSLEELRQVYDTLEEQI 290 >gi|326923981|ref|XP_003208211.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Meleagris gallopavo] Length = 776 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 43/230 (18%), Positives = 85/230 (36%), Gaps = 28/230 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 ++ + +P + ++ + S+ A +D++ +LD S S+ Sbjct: 26 FQSLCAFLLPQVWLALSMQEIHADQETIAKISAAGQLMWCSASVDILFLLDGSYSVG--- 82 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRK 246 + + + + + + P V+ G+V FS+ F L+ ++ + Sbjct: 83 ---RGSFERSKHFAGKLCDALDIHP---GRVRVGMVQFSSAPHLEFSLDSYLTKQEVKER 136 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 IK + G ST + LKY ++ F + KI++ ++DG K Sbjct: 137 IKRTAFRGGSTETGRALKYILSKGFPGGR--------NLTVPKILIIISDG-----KSQG 183 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHS 353 + + K+RG V+A GI+ R E L A AS + Sbjct: 184 STAMPAMQVKERGTTVFAAGIKFPRWEE-LHALASEPTEQHVLFAGDVSD 232 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 69/226 (30%), Gaps = 48/226 (21%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN- 216 I S LD++ +D S + + L+ + + Sbjct: 537 ICSVNNPAGCQVQSLDLVYAVDASAGVG----------------LVNFLQLRDFVRSSSW 580 Query: 217 ------NVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYN 268 +V Q L + ++ F L+ ++ L + I + G +++ L + Sbjct: 581 HFTINWDVTQIALAAYGSRAYTVFALDTHTNNSALLQAINQMPFLGDVASASSALLHINT 640 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +Q K++V +T+G + + + + + G +V+ + I Sbjct: 641 DVMTVQ------KGARPGVSKVVVVLTNGGGM-----EDAAAPAQQLRDNGVVVFVVVIG 689 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 LR C Y E+ H I + K Sbjct: 690 DTERDTLLRVC-----MYSYED-------LQHYQDLITERICEEAK 723 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D L++ +LD S + + I L+ ++L ++ L+ +S Sbjct: 56 EDCSLELAFLLDSSETAKDNHQQE----KKFTMDIIEGLQSIRLDTGRKLSWRAALLQYS 111 Query: 228 NKIEEFFLL-EW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L +W G + + I ++ G T +T + + Sbjct: 112 SHVIIEQTLKQWKGTENFKSSIAPMAYIGHGTYTTYAI---------TNMTKIFVEESSP 162 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRA 338 KI + +TDG + +AK +G V+ IGI + L + Sbjct: 163 ERIKIALLLTDG--FFHPRNPDIFSAMADAKNQGVKVFTIGITRTANDPVNAANLRLLSS 220 Query: 339 CASPNSFYLVENPHSMYDAFSHIGK 363 + Y +++ + M + I + Sbjct: 221 TPASRFLYNLQDTNVMEKVITQIAQ 245 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 80/199 (40%), Gaps = 30/199 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + A+++ V + N + GL+ + Sbjct: 810 CKERPMELVFVIDSSESVGP------ENFEIIKDFVIALVDRVTV---GRNATRIGLLLY 860 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S ++ F L +++ ++ + G T + ++ A + F Sbjct: 861 SLEVHLEFNLARYMTKQDVKKAVRNMPYMGEGTYTGTAIRKATQEAFFSARNG------- 913 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRAC-- 339 +K+ + +TDG+ +E + EA+ +YA+GI EFLR Sbjct: 914 --VRKVAIVITDGQ-TDKREPVKLDIAVREAQVANIEMYALGIVNTSDPTQAEFLRELNL 970 Query: 340 -AS---PNSFYLVENPHSM 354 AS +L+++ +++ Sbjct: 971 IASDPDSEHMFLIDDYNTL 989 >gi|94482904|gb|ABF22495.1| anthrax toxin receptor 2 [Rattus norvegicus] Length = 487 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 77/230 (33%), Gaps = 30/230 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + S+ + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLTVGGPGSLLQAQEQPSCKKAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 + ++ V+ ++ + FS++ L + + ++ L T G Sbjct: 65 FVHQLTERF-VSPEMRLSFIVFSSQATIILPLTGDRYKISKGLEDLKAVQPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK A QI + G++ II+ +TDG+ +++ GA V Sbjct: 124 LKLANEQIQNAGGLKTSS---------IIIALTDGKLDGLVPSYAEKE-AKKSRSLGASV 173 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y +G+ + R S + + V+ + A I I+ + Sbjct: 174 YCVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|149641794|ref|XP_001509032.1| PREDICTED: similar to integrin alpha E2 [Ornithorhynchus anatinus] Length = 1679 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 81/212 (38%), Gaps = 24/212 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D ++ IVLD S S+ + + A I ++ Sbjct: 149 EQEMDEDTGTEIAIVLDGSGSI------NPEDFEKAKDFIFNIMTRFYKKCFECEF---A 199 Query: 223 LVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V + + I+ F L + +K+K + + G T + + + +IF+++ + Sbjct: 200 VVQYGSVIQTEFDLLASRDANSSLQKVKNIKQVGNVTKTASAIYHVLKEIFNVEKGSRRQ 259 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRA 338 N+ KII+ +TDG+ ++ ++ E K G +AIG+ S + L Sbjct: 260 NS------KIIIVLTDGDVFQDPKNLATVMNLPEMK--GIERFAIGVGNEFSASKKTLEL 311 Query: 339 CASPNSF---YLVENPHSMYDAFSHIGKDIVT 367 AS + VE + + + I+T Sbjct: 312 IASEPHHSHKFRVEKFSGLDGLLRGLEQKIIT 343 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 73/195 (37%), Gaps = 26/195 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 D+++V+D S SME K+ M I L + + GL+TF++ Sbjct: 372 DIVLVVDTSGSMEG------DKLTMVKAGIETFLMRILPED------RLGLITFASAARL 419 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L LQ ++ + G + ++ + + + + D + + Sbjct: 420 VVPMAPLSDNRIALQDAVQAMRASGRTAL--------FDALVLGKQVLEQLPPADDDRIR 471 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLV 348 IV ++DG + S++ + + + G ++ + + L A A + +V Sbjct: 472 AIVLLSDGADNSSQASLDQIRTLFD--ESGISIFPVAYGNDADRQVLDAIAEFSRTIVVV 529 Query: 349 ENPHSMYDAFSHIGK 363 + + F ++ + Sbjct: 530 GDTGDIAQIFENLSR 544 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 83/243 (34%), Gaps = 26/243 (10%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +P + + + + T A D+++++D S S+ Sbjct: 211 LPSPIEFELDYLAQEKTTRESFRRVSHFHCDTTAMTDIILLVDGSWSIGRN------NFG 264 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFG 254 + + + +++ + D ++ GL +S+ + L + + ++ L G Sbjct: 265 LIREFLASLVAPFNVAMDK---IRVGLTQYSSDPRTEWDLNTYATRDEVLEALRSLRYKG 321 Query: 255 VSTNSTPGLKYAYN-QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +T + L + + G R K+I+ +TDG K + Sbjct: 322 GNTFTGLALTHVLEHNLKADTGARS-------EAPKLIILLTDG-----KSQDDANPPAQ 369 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 K G ++A+G++ E + + P + Y V + + + + + + Sbjct: 370 VLKNMGIQIFAVGVKNADETELRQVASDPLELNVYNVLDFPLLSSLVGRLTRVLCAQIKV 429 Query: 372 YDK 374 +K Sbjct: 430 KNK 432 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 40/201 (19%), Positives = 67/201 (33%), Gaps = 36/201 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + M+ VLD S SM+ ++ +A + + LE + Q G++ Sbjct: 260 PREAPEVTMVFVLDRSGSMQQAV-GDSNRLGVAKNATLSALELLNPQS------QIGVIV 312 Query: 226 FSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F + L + Q + + G T PGL AY ++ Sbjct: 313 FDTEETTVVPLSTLDIPAAQIALDRVDT-GGGTAIYPGLVAAYREL-----------QRS 360 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K I+ MTDG LS D + + A AIG+ Sbjct: 361 ESPAKHIIVMTDG--LSQPGDWEGILRQITADGTTVSAVAIGVGADTG------------ 406 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 EN + + +HI +D Sbjct: 407 --AAENIARLGNGVAHISRDF 425 >gi|295688686|ref|YP_003592379.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] gi|295430589|gb|ADG09761.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] Length = 583 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 40/295 (13%), Positives = 107/295 (36%), Gaps = 35/295 (11%) Query: 83 LCRIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKI 137 + R+ +F ++ + N + ++ + + Y + + Sbjct: 123 VKRVAEEPVSTFSIDVDTAAYANTRRFLNDGAVPPRDAIRVEELVNYFDYGYPRPTSAQT 182 Query: 138 PLKFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 P + I PW + + + + + S+ L+++ ++D S SM + Sbjct: 183 PFRATVAIAPSPWSSQRQILHIGLQGYAAPRSEA-PPLNLVFLVDTSGSM-----MGPDR 236 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSK 252 + +A K++N ++++++ + +V ++ G S ++ + L Sbjct: 237 LPLAQKALNVLIDQLRPQD------RVAMVAYAGSAGAVLAPTDGRSKLKMRCALGALQA 290 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY 311 G ST GL+ AY + ++ N ++ +TDG+ N+ + + + Sbjct: 291 -GGSTAGGRGLELAY-GLAKQNFDKKAVNR--------VILITDGDFNVGIADPSRLKDF 340 Query: 312 CNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + +K G + G ++ ++A A + ++ +A + D Sbjct: 341 VADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGVAAY-VDTLNEARKLLRDDF 394 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 80/215 (37%), Gaps = 26/215 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI-KSINAMLEEVKLIPDVNNVVQSG 222 ++ D+M+V+D+S SM +SI I ++ ++ +L + + + G Sbjct: 93 PQTREQFGRDVMVVVDLSGSMAEKDFTSIDGIKISRLDAVKKVLNDFAKTRKGD---RLG 149 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMR 277 L+ F + F+ + + + L++ G ST+ + + + Sbjct: 150 LILFG---DAAFVQTPFTADHEVWLDLLNQTRVEMAGKSTHLGDAIGLTIKRF---EEND 203 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEF 335 + KK+ + +TDG T + AK +G ++ I I + + Sbjct: 204 NSQPLSTTSRKKVAIILTDG--NDTDSYVPPMDAAKVAKVKGIRIHMIAIGDPQTVGEQA 261 Query: 336 LR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 L A AS + N + +A++ I K Sbjct: 262 LDMDTINTIADASGGQAFQALNQDELINAYAEISK 296 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 72/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ S + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTSQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L++ A + ++ N + + + I + Sbjct: 249 EKTLQSIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 55/422 (13%), Positives = 115/422 (27%), Gaps = 83/422 (19%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +GG+T+L A LP++ L+ I ++ + +KT L D + E Sbjct: 19 RRGGITVLMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQTVEAA 78 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 K + N F + + + + Sbjct: 79 KAAARLTAAMNEVAGEPYQLNTDDSANEFEFG---------VSAQTDGNTGRFYFTKVPT 129 Query: 135 YKIPLKFCTF----IPWYTNSRHIVMPIT------------SSVKVNSQTDARLDMMIVL 178 + I ++ P+ S V + D+ +VL Sbjct: 130 SDVAANLVAVSSVRINGKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISLVL 189 Query: 179 DVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 D S SM+ D+ D I + +A + +++ V+++ +E+ Sbjct: 190 DRSGSMDWKTYDWPDDADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYDEY 249 Query: 234 FLLEWGVSHL------------------------------QRKIKYLSKFGVST------ 257 L + + ++ S T Sbjct: 250 DLYDHAWEEVFGLGPAPNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLDCWLL 309 Query: 258 NSTPGLKYAYNQI--FDMQGMRQHCNTEDANYK---------KIIVFMTDGE-NLSTKED 305 + ++ A Q+ G+ N+ + K +V MTDG N T+ + Sbjct: 310 DDFDSVRAAVAQLAPNGSTGIGNGMNSGKTAFTHENARPYASKTMVVMTDGNHNYGTQPN 369 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGK 363 + + + E ++ A Y ++ + AF I Sbjct: 370 TVAQQL---MSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFEEIAN 426 Query: 364 DI 365 ++ Sbjct: 427 NL 428 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 80/219 (36%), Gaps = 34/219 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + D+M+V+D+S SM S I+++D K +N ++ K Sbjct: 93 PQIREQFGRDVMVVVDLSGSMAEKDFTSTNGLKISRLDAVKKVLNDFVKTRKGD------ 146 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDM 273 + GL+ F + F+ + + + L++ G ST+ + + D Sbjct: 147 -RLGLILFG---DAAFVQTPFTADHKVWLDLLNQTHVEMAGKSTHLGDAIGLTIKRFEDS 202 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-- 331 + T +K+ + +TDG T + AK +G ++ I I + Sbjct: 203 NNSQPLSTTSR---EKVAIILTDG--NDTDSYVPPMDAAKVAKVKGVRIHMIAIGDPQTV 257 Query: 332 SHEFLR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 + L A AS + N + +A++ I K Sbjct: 258 GEQALDMDTINTIADASGGQAFQALNQDELINAYAEISK 296 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 71/195 (36%), Gaps = 26/195 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + L +++V+D S SM D T++D+A +++ + + Q L+ Sbjct: 405 REERPDLALVLVIDRSGSMRELVDDGRTQLDLAREAV------YQASRGLTQRDQIALIA 458 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F + + L+ ++ + L G TN G+ A I Q +H Sbjct: 459 FDSIADTLLPLQPLPGLFTIEDALSRLVA-GGGTNIRSGIALAAETIATSQARIRH---- 513 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SP 342 ++ +TDG + + D + + + +G V AI I + R Sbjct: 514 -------VILLTDGVSETEYADLVA-----DLRAQGITVSAIAIGLDTDPALERVAQIGG 561 Query: 343 NSFYLVENPHSMYDA 357 +YLV+ + Sbjct: 562 GKYYLVQRVPDLPQV 576 >gi|148656912|ref|YP_001277117.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569022|gb|ABQ91167.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 543 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 72/202 (35%), Gaps = 26/202 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S R D+++V+DVS SME K++ A + L + + GL+ Sbjct: 362 SINRKRADILLVVDVSGSMEG------EKLEAAKSGLGTFLSRILPED------RVGLIV 409 Query: 226 FSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS + L L I L+ G + Y+ + + + Sbjct: 410 FSTDARVVVPPAPLSEARIALDDAIAQLNARGKTAL--------YDALITGKQVFDDLPP 461 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 D + IV ++DG + +++ + + + G ++ + L A+ Sbjct: 462 PDEERIRAIVLLSDGLDNASRTTLDQVRLAFD--ETGISIFPVAYGSDADLAALEQIATF 519 Query: 343 NSFYLVE-NPHSMYDAFSHIGK 363 + +V+ + + F ++ + Sbjct: 520 SRTIVVQGDTGDIGQIFENLSR 541 >gi|260785816|ref|XP_002587956.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] gi|229273111|gb|EEN43967.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] Length = 1412 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 63/161 (39%), Gaps = 26/161 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++VLDVS S+ + +A + A ++ +N ++ G++ ++ + Sbjct: 389 AIDIVLVLDVSSSI------PQDQFLLARDFMMAFVDCA-AFQGLN--IRIGVICYNCEA 439 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +F L+ + + I Y+ G T + + Y + ++ Sbjct: 440 KTYFGLQPIYNGMSYSIHYVMYKGGETRTGHAIY--YMTCTSDFEAK----------PRV 487 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 V +TDG + + A+ G +YA+ I R Sbjct: 488 AVILTDG-----RSGDNEVAEAENARDMGITLYAVRIGDPR 523 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 56/152 (36%), Gaps = 22/152 (14%) Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDM 273 Q +++++ + +F L + +I++L + G T + + + Sbjct: 100 TGPFQIRVISYTCEAHTYFSLTPITMGMSYEIEHLMRGDGGGETRTGHAIYH-------- 151 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +H + A V +TDG+ D ++ +A+ G +YA+ + ++ Sbjct: 152 ---MRHTSKFGAESHHAAVILTDGQ-----SDDDAIAEAEDARDAGIDLYAVVVGNFQNR 203 Query: 334 EFLRACAS-PNSFYLVENPHSMYDAFSHIGKD 364 A + P+ + + DA I D Sbjct: 204 ASFPAMTNDPDRVF---DTSQACDAAQKIVDD 232 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + I+++D+ + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWQNEDISRLDLVKALMGDFLQD-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ E Sbjct: 144 QAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRL-----------RERPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N L A + G +Y IGI Sbjct: 193 QSRVLVLITDGANNG--GQIHPLTAARLAAQEGVRIYTIGIGA 233 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 78/212 (36%), Gaps = 31/212 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 V + + + + S K P + + +T +V++ Sbjct: 23 VPAVAFGADNGSSTAALNFTITSSQTEYAKP----PNADAQGRLDITLTPQGRVDNIIRP 78 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ V DVS SM K+D A ++ + ++ K + N+ + LV FS+ + Sbjct: 79 PIDVVFVFDVSGSM----VMPSLKLDSAKYALQSAVDYFKANANPND--RFALVPFSDGV 132 Query: 231 --EEFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ G +++ + +++ + TN T L+ A Sbjct: 133 QSDKVVPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQA------------QSFF 180 Query: 283 EDANYKKIIVFMTDGENL-STKEDQQSLYYCN 313 D KK I+F+TDG S ++ + C Sbjct: 181 NDPARKKYIIFLTDGMPTVSIAKEPITYKVCE 212 >gi|297265178|ref|XP_002799142.1| PREDICTED: collagen alpha-3(VI) chain [Macaca mulatta] Length = 2568 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 51/312 (16%), Positives = 115/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + T++ P ++ + Sbjct: 968 DRNIDRAELQTITNDPRLVFTVREFRELPNIEERIMTSFGTSAATPAPPGVATPSPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 86/257 (33%), Gaps = 24/257 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVNTFRELPSLEQKLLTPITTLTSEQIQQLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGRALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRAELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHI 361 + V + + I Sbjct: 986 VFTVREFRELPNIEERI 1002 >gi|109101588|ref|XP_001084624.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Macaca mulatta] Length = 2969 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 51/312 (16%), Positives = 115/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + T++ P ++ + Sbjct: 1369 DRNIDRAELQTITNDPRLVFTVREFRELPNIEERIMTSFGTSAATPAPPGVATPSPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 83/212 (39%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V++ D++ +LD S ++ T + ++ + + N+ Sbjct: 422 TGTPEVHANKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDV---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSY----VHANHF 525 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ +T G+ + L N + G + + +G E Sbjct: 526 TEAGGSRIREHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAEL 580 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 581 EQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 67/204 (32%), Gaps = 21/204 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F R + + +++V ++ G + K+ + +G + Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVA-----LKQASVFSFGLGAQAA 178 Query: 331 RSHEFLRACASPNSFYLVENPHSM 354 E N + V HS Sbjct: 179 SRAELQHIATDDNLVFTVPEFHSF 202 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 86/257 (33%), Gaps = 24/257 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVNTFRELPSLEQKLLTPITTLTSEQIQQLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGRALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRAELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHI 361 + V + + I Sbjct: 1387 VFTVREFRELPNIEERI 1403 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 63.3 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 62/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + I+++D+ + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWKNEDISRLDLVKALMGDFLQD-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ E Sbjct: 144 QAYLQAPLTFDRRTVRTFLIEAQIGIAGKNTAIGDAIGLAVKRL-----------RERPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N L A + G +Y IGI Sbjct: 193 QSRVLVLITDGANNG--GQIHPLTAARLAAQEGVRIYTIGIGA 233 >gi|156350148|ref|XP_001622163.1| hypothetical protein NEMVEDRAFT_v1g221080 [Nematostella vectensis] gi|156208611|gb|EDO30063.1| predicted protein [Nematostella vectensis] Length = 2040 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 73/190 (38%), Gaps = 22/190 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + ++D+ +D S + E+ + K+ +K + + P + + Sbjct: 189 GSRRQQTAPALQMDIAFAVDGSDATENQGSGNFRKLLQFVKDV------MSSFPLSESGI 242 Query: 220 QSGLVTFSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +G+V++ K + F + + ++ I ++ G +T++ G++ +F R Sbjct: 243 HTGVVSYGTKAKTNLGFDVHFSQANFNSAIDGINFPGGATDTGNGIRKVMKSLFKTSKRR 302 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFL 336 I+V +T G++ E + G ++ +G+ + Sbjct: 303 S--------IPHILVLITAGKSTGDPSLN-----AEELRASGVRLFCVGVGGAYDRTQLD 349 Query: 337 RACASPNSFY 346 +SP++ Y Sbjct: 350 AIASSPSTTY 359 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 104/317 (32%), Gaps = 46/317 (14%) Query: 44 FFMKTVLHSMIDRS-------------LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 +F K L S +D + + + +++ G+ +L + Sbjct: 1512 YFTKPALDSAVDSARKPDPCVQNLIGAGIALSKKMVFGGSARPGSHKILIVLLAGNADDD 1571 Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSL----DIVVVPQNEGYSISAISRYKIPLKFCTFIP 146 ++ ELRD G + + +T ++V PQNE +S+ + L Sbjct: 1572 AVTPSRELRDQGVTVFVIALGSNTDFGQLPEVVSDPQNEHLFVSSPNELMKQLAPTVENI 1631 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 S+ + + +V + LD+ F ++ + + M Sbjct: 1632 NKVLSKAAPL---TPTEVKMELGFALDIT---------RDAFPAN-----LFASVLRFMK 1674 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKY 265 ++ + G+V F++ +F L + + I +++ S+NS PG Sbjct: 1675 ITCSAFVISSDKTRVGVVVFNSPSVQFPLDNYSSKEEVYSAIDTITQ---SSNSGPG--- 1728 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + KI++ +T S K +AK+ + + Sbjct: 1729 TVGEAIAFANTNLFKGKTRSQTPKILIVIT-----SKKSGDDVTSPSQDAKRENITMLVV 1783 Query: 326 GIRVIRSHEFLRACASP 342 G+ S + A+P Sbjct: 1784 GVGPGASKSDMDVIAAP 1800 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 ++ ++ ++ C+ AK +G IVY IG L+ CAS +S Y + + DAF Sbjct: 458 NSTKNTRTSNVCSAAKAQGIIVYTIGFEAPSNGVAVLQDCASSDSHYFDVDGLEIRDAFE 517 Query: 360 HIGKDIVTKRIW 371 I I R+ Sbjct: 518 SIATSIRKLRLT 529 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/261 (13%), Positives = 95/261 (36%), Gaps = 44/261 (16%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 N+R+F + G + T F + V G+ +++ + +TVL +DR+++ AA Sbjct: 22 NLRSFRSDESGALAYPTIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAA---- 77 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + + ++ + ++N + ++ + Sbjct: 78 ------DLDQTQEPAVVVQ----------------DYLNKAGLGEYYEAPEVETGLGYKK 115 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + + + L + + + +S+ + ++ L++ +VLDVS SM S Sbjct: 116 VKATIDATFDAHL------LQFAGGSDLPVYASSTAE---ESIDGLEISLVLDVSGSMNS 166 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQ 244 +++ + ++ + + + + +V ++ ++ L E+ S Sbjct: 167 N-----SRLSNLKVAARDFIDTM-VENTTDGRMSISIVPYATQVSVSDELFDEYTTSGTN 220 Query: 245 RKIKYLSKFGVSTNSTPGLKY 265 ++ F S ST L Sbjct: 221 NFANCIN-FETSDYSTTALST 240 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/185 (22%), Positives = 70/185 (37%), Gaps = 24/185 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+VLD S+SME + MA+ ++A+ + V + D + + GL+ F + Sbjct: 90 DIMLVLDTSQSMEIADIEGQSGQAMALSRLDAVKQGVMSLLDASEGNRIGLIAFGEQSFV 149 Query: 233 FFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L ++ + L G ST G+ YA + + D+ R I Sbjct: 150 MSDLTAYGDTVRYMVSQLETGFAGDSTRLGDGVGYAVSLLADVDSERA-----------I 198 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------IRSHEFLRACA--S 341 +V +TDG T D + AK +Y + LR A + Sbjct: 199 VVLITDG--NDTGSDLPPVEAARLAKALDVKLYVAAVGADVSTDREPIDEALLRRLAERT 256 Query: 342 PNSFY 346 +F+ Sbjct: 257 GGAFF 261 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/254 (14%), Positives = 88/254 (34%), Gaps = 29/254 (11%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y S + +F + N + ++K+ D L++ +LD SRS+ Sbjct: 134 QYPYSFDLPEDVQEQFKASLSGILNIKERSASFGRTIKIKR--DGILNVYFLLDASRSVG 191 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 D+ + +++E+ +Q G+++++ + + S Sbjct: 192 E------ANFDIYKECSVYLVDELASFD---MTIQFGIISYATVPKVIIPIYDENSDNDA 242 Query: 246 KI-----KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA---NYKKIIVFMTDG 297 + L + +K A ++++M ++ ++ + II+ +TDG Sbjct: 243 HVFEVIENDLKYSDHKDKTGTNIKTALEEVYNMMSFQKETYKNESVWNSIHHIIILLTDG 302 Query: 298 ENLSTKEDQQSLYYCN------EAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYL 347 + ++ + + ++ VY GI L AS + + Sbjct: 303 KANIGGRPADTIKHIEEFLDIKKKREDYLDVYTFGIGPDVDMADLSEIASKKDGESHVFR 362 Query: 348 VENPHSMYDAFSHI 361 +E+ + M F I Sbjct: 363 MESANEMKTVFQKI 376 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/283 (14%), Positives = 88/283 (31%), Gaps = 56/283 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + ++N G + +TA + ++G +++ I+ +KT L S D + + + Sbjct: 12 LMRLYHNQAGNILAITAAAIIPTIGLVGGAFDMARIYAVKTRLQSACDAGALAGRRIMGS 71 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 +N + + N SF E R + + + S D+ Sbjct: 72 GRWTDNNGRPNTTALATFDLNFAQNSFGAENRTRSYSESDGTVSGTASADV--------- 122 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ + +P ++ + + +++ D+M VLD S SM Sbjct: 123 PMTLMRVLNVP------------TKRVEVTCEGQMRI-----PNTDVMFVLDNSGSMNEV 165 Query: 188 FDSSIT---KIDMAIKSINAMLEEVK--------------LIPDVNNV-----VQSGLVT 225 T K+ +I E + ++ ++ G V Sbjct: 166 IPGDTTGLKKMAGLQLAIRCFYEALARQNITDVSAAQCGTSADPTGDLSSQVQLRFGFVN 225 Query: 226 FSNKIEEFFLLEWG--------VSHLQRKIKYLSKFGVSTNST 260 + N + LL S I+ + + T S Sbjct: 226 YDNMVNVGKLLPNNYLADSWTYRSRTATGIQTVYAWTAGTQSA 268 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 39/125 (31%), Gaps = 36/125 (28%) Query: 283 EDANYKKIIVFMTDG---------------------------------ENLSTKEDQQSL 309 A ++ ++FMTDG +NL+ + +S Sbjct: 646 GGAQIQRHLIFMTDGATATTVNNYASYGLEWWDRRQIAPAGPNDANYDDNLNAVNNARSN 705 Query: 310 YYCNEAKKRGAIVYAIGIRVIRS--HEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIV 366 C K + ++ I + L CA SP+ FY N + F I I Sbjct: 706 ALCTAIKNKNITLWVIYYGSSDTATKTRLTNCATSPSYFYEARNTTLLIGKFREIADRIS 765 Query: 367 TKRIW 371 R+ Sbjct: 766 NLRLT 770 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 59/460 (12%), Positives = 134/460 (29%), Gaps = 132/460 (28%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +++ G +L AI +P +F V + + + K L + +++ + + Sbjct: 1 MKHAMRKQSGHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAK--- 57 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + L + + ++ + D++ + + Y Sbjct: 58 DEQDHQLAERYIQHYLYDMDSILDIEVKK--------LGCDEMPECIAATERGEARYFEY 109 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMP-----ITSSVKVNSQTDARLDMMIVLDVSR 182 ++ + +K W+ + I +T S K +D+ ++D S Sbjct: 110 RVAGQTLHK---------SWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSE 160 Query: 183 SMESFFDSSI-TKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKI--------- 230 SM + +K++ I + +E+ D+ + + L F+ + Sbjct: 161 SMNDSWSGGRHSKLNDLKDIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNL 220 Query: 231 --------------EEFFLLEWGVSHLQRKI----------------------------- 247 ++ + + + Q+ I Sbjct: 221 VVRDQRVVSREGEYDKDDTVNFNKTIAQQFIVKGEASRVPNGDDDARFYDLYFTTDFSSF 280 Query: 248 -KYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 K + KF G T S G+ A + M N K++I+ ++DGE+ + Sbjct: 281 TKKVKKFKAGGGTASLQGIIRAGQIVTSM----------SKNQKQLIIILSDGEDWNHYA 330 Query: 305 DQQ----SLYYCNEAKK--RGAIV--------------YAIGIRVIRSHEF--------- 335 Q S C+ G V + G+ Sbjct: 331 GQTNKLVSKGMCSNILNMVNGGKVSADNTHDDVEVIGGVSQGMMTPDGERMNARMAVIGF 390 Query: 336 ---------LRACASPNSFYLVENPHS-MYDAFSHIGKDI 365 LR C ++ Y EN + I +++ Sbjct: 391 DYELNKNVGLRNCVGRDNVYKAENKEDILNKILGLITEEV 430 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 59/451 (13%), Positives = 130/451 (28%), Gaps = 132/451 (29%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G +L AI +P +F V + + + K L + +++ + + + + + Sbjct: 10 GHAAMLFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVSAK---DEQDHQLAE 66 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 L + + ++ + D+I + + Y ++ + +K Sbjct: 67 RYIQHYLYDMDSILDIEVKK--------LGCDEIPECIAATERGEARYFEYRVAGQTLHK 118 Query: 137 IPLKFCTFIPWYTNSRHIVMP-----ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 W+ + I +T S K +D+ ++D S SM + Sbjct: 119 ---------SWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGG 169 Query: 192 I-TKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKI------------------ 230 +K++ I + +E+ D+ + + L F+ + Sbjct: 170 RHSKLNDLKDIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVS 229 Query: 231 -----EEFFLLEWGVSHLQRKI------------------------------KYLSKF-- 253 ++ + + + Q+ I K + KF Sbjct: 230 REGEYDKDDTVNFNKTIAQQFIVKGEASRVPNSDDDARFYDLYFTTDFSSFTKKVKKFKA 289 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ----SL 309 G T S G+ A + M N K++I+ ++DGE+ + Q S Sbjct: 290 GGGTASLQGIIRAGQIVTSM----------SKNQKQLIIILSDGEDWNHYAGQTNKLVSK 339 Query: 310 YYCNEAKK-------------RGAIV---YAIGIRVIRSHEF------------------ 335 C+ V + G+ Sbjct: 340 GMCSNILNMVNGGKVSADNTHDDIEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVG 399 Query: 336 LRACASPNSFYLVENPHS-MYDAFSHIGKDI 365 LR C ++ Y EN + I +++ Sbjct: 400 LRNCVGRDNVYKAENKEDILNKILGLITEEV 430 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 80/224 (35%), Gaps = 40/224 (17%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + + D + ++IV D S SM K+ + ++ A L + + Sbjct: 96 VEQEITTFASEDVPVSIVIVFDCSGSMGP-------KLAKSRAAVAAFLSSANPEDEFS- 147 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 LV F+++ + LQ K+ Y G T + A +Q+ Sbjct: 148 -----LVLFNDRAQLVSGFNRQTDELQSKLFYAQSKGR-TALLDAIYLAMDQM------- 194 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--- 334 + +K ++ ++DG + ++ + + N K+ A +Y+IGI Sbjct: 195 ----KHAKHSRKAVLVISDGGDNCSRYSMREVK--NRVKEGDAQIYSIGILEAMGFRGRS 248 Query: 335 --------FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + ++N + + D S IG + + Sbjct: 249 AEELAGPALLDDIASQSGGRLFEIDNLNELSDVASKIGMALRNQ 292 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 74/215 (34%), Gaps = 42/215 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + T A D+++ +D+S SM+ + + ++ + A + E + Sbjct: 89 TITSAARDLVLAVDISGSMDDRDMTAPDGTRLQRLQAVKDVVGAFVAE-------REGDR 141 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQG 275 L+ F K F+ L ++ L++ G +T + A D + Sbjct: 142 ISLIVFGAK---PFIQAPFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFEDSEI 198 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-- 333 +++++ ++DG T + A + G +Y IG+ Sbjct: 199 E-----------ERLLILLSDG--ADTASTMTPINAAQIAAQEGITIYTIGVGNPDGSGE 245 Query: 334 -----EFLRACAS--PNSFYLVENPHSMYDAFSHI 361 L A+ +FY ++ + + ++ I Sbjct: 246 ERLDPATLEDIATRGGGAFYFADDVEGLSEIYAEI 280 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 64/163 (39%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSM---ESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + D ++ +++ + +A +E + + GL+ F Sbjct: 91 DLMLAVDLSGSMDIEDFVVDGEVSNRLEAVKRVASAFIER-RSGD------RIGLILFGE 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + +++ + + G T + A ++ D Sbjct: 144 QAYLQVPLTFDRKTVEKLLDEAAIGLAGDKTAIGDAIGLAIKRLRDNPAD---------- 193 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++++ ++DG N + Q L A + G +Y IG+ Sbjct: 194 -QRVLILLSDGANTA--GQVQPLQAAELAAREGLKIYTIGVGA 233 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 67/168 (39%), Gaps = 26/168 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++MI LDVS SM + D ++++ A +I+ M+E++ + GL+ F+ Sbjct: 86 KKKGVELMIALDVSNSMMAQ-DIKPSRLEKAKMAISRMVEKLSND-------KIGLIVFA 137 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + S + + +S T + A T + Sbjct: 138 GDAYVQLPITTDYSSAKLFLSNISTDIVPVQGTAIGSAIDLA-----------ARSFTPE 186 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ +TDGEN ++ +A ++G +++ IG+ + + Sbjct: 187 TETSKAIIVITDGENH----QDDAVAAAKQAHEKGIVIHTIGMGLEQG 230 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 101/297 (34%), Gaps = 38/297 (12%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 I +K + + L + +D R +LD + + P Sbjct: 291 ISVELKPGFPLGKVESLYHKVRIETTNDATREITLDGTAAADRDFVLEWSAVANDAP--Q 348 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + V+ + V S A+ +++ V+D S SM T I+ A S Sbjct: 349 VGLFREHVGKDDYVLAYVTPPAVASAKKAQREVVFVIDNSGSMGG------TSIEQAKAS 402 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVS 256 ++ L ++ + ++ F + + FF + ++ + + L G Sbjct: 403 LDYALSHLQPGD------RFNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQG-G 455 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T P L A + G+RQ IVF+TDGE + ++ ++ A+ Sbjct: 456 TAMLPALHAALDDSHQGNGLRQ------------IVFLTDGEISNEQQLLDAIA----AR 499 Query: 317 KRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + ++ +GI + + A +F + + + + + + + Sbjct: 500 RGRSRIFMVGIGTAPNSYLMNHAAELGRGTFTHIGSAAEVDERMRALFDKLENPAVT 556 >gi|291394751|ref|XP_002713732.1| PREDICTED: collagen, type XXVIII [Oryctolagus cuniculus] Length = 1132 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 68/177 (38%), Gaps = 24/177 (13%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ I L+++ V+D S S+ + + +K++ + V L Sbjct: 780 KLIVEICGC--GPKCQVPSLELVFVIDSSESVG---PKNFQIVKDLVKTLADRIA-VDLA 833 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G++ +S+++EE L+ S + + + G T + L+ A N Sbjct: 834 -----TTRIGIINYSHQVEEVANLKHFSSKDDFKLAVDNMQYLGEGTYTATALQAANNMF 888 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 D +K+ + +TDG+ S ++ + EA+ + IG+ Sbjct: 889 KDA----------RPGIEKVALVITDGQRDS-RDPKNLTEVVKEARNISVETFVIGV 934 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 66/168 (39%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES K ++ ++ + +V + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIVHFDKQKDFVERLSDKIFQVTPGHSLKYDIRLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + +++ K L+ G T S + ++F +G + K Sbjct: 104 IDPPFSSWKDLKTFKQRAKSLNLIGQGTFSYYAISN-VTRLFKREGRKNGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D +S+ +A+ G IG + LR Sbjct: 155 VALLMTDGIDHPKNPDVKSIS--EDARTSGISFITIGHSTDVNEAKLR 200 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 63/171 (36%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKNDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFLQAPLTYDRRTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + D + A G +Y +GI L+ Sbjct: 193 NSRVLVLVTDGANNGGQIDPITA--ARLAADEGVKIYTVGIGSDPDKNALQ 241 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 74/211 (35%), Gaps = 34/211 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++V+D S SM D + + A IN M + + ++ F + Sbjct: 114 DIVLVIDNSGSMNE-TDPNQDRYTAAKNLINRM----------DRDNRVSVIMFDHATTL 162 Query: 233 FFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + +I L+ T+ + L+ + I + + + Sbjct: 163 LQPFTRVNNQETKDEIIAEIDGLATTDGGTDISLALEDTMSHIQESRDAGRSAM------ 216 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR--ACASPN 343 ++ ++DG S + + L E K++ V IG+ + L+ A + Sbjct: 217 ---VIMLSDG--FSETDHDRVLA---EYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETGG 268 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +Y V++ + F I D+ + + + Sbjct: 269 QYYDVQHAEDLSFVFQKIYDDVGDRSLLTKR 299 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 70/215 (32%), Gaps = 44/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +DVS SM ++ ++ A + L+ + + GL+ F + Sbjct: 100 QMMLAMDVSGSMGEGDMVLGGQAVDRLTAAKAVLADFLDR-RAGD------RIGLLIFGD 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + A ++ Sbjct: 153 RAYTLTPLTADLASVRDQLRDSVVGLAGRETAIGDAIGLAVKRLRSQPEG---------- 202 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------I 330 +++++ +TDG + + + L A+ G ++ + Sbjct: 203 -QRVLILLTDGVSNA--GVLEPLRAAEVAQAEGVRIHTVAFGGDGSMRFLGIPISADQDP 259 Query: 331 RSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS F+ + + ++ + + Sbjct: 260 VDEATLKKIASLTGGQFFRARDTAQLAGIYAELDR 294 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 103/288 (35%), Gaps = 37/288 (12%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 R+ T +F ++ + N + + ++ + + Y A+ P Sbjct: 102 RVAETPVSTFSVDVDTGSYANVRRFLNQGSLPPEGAVRLEEMVNYFPYHY-ALPTDGSPF 160 Query: 140 KFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T + PW ++ + + I +S A +++ ++DVS SM+ + Sbjct: 161 GVTTEVAATPWNPRTQLLRIGIKAS-DRPVAELAPANLVFLVDVSGSMD-----RREGLP 214 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFG 254 + ++ ++++++ + LV ++ + G ++ I L+ G Sbjct: 215 LVKSTLKLLVDQLREQD------RVSLVVYAGESRVVLKPTSGRDKVKIRNAIDQLTA-G 267 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCN 313 ST G++ AY R+ N I+ TDG+ N+ + Sbjct: 268 GSTAGASGIELAYQM------AREGFIDNGINR---ILLATDGDFNVGISDFDSLKQMAV 318 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 E +K G + +G V +E L A A ++ ++N + Sbjct: 319 EQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLREAHKVL 366 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 80/215 (37%), Gaps = 27/215 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S ++ N+++D L + I++D S S+ + + + ++ V + Sbjct: 75 SIMEFNAESDLPLRLGILIDTSNSIRD-------RFKFEQDAASEFIKGVVHANQDKAM- 126 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 LV+F K E L L I+ L G + Y+ IF + Sbjct: 127 ---LVSFDTKAELVSDLIGDTEKLDHAIRSLRPGGGTAL--------YDAIFFACRDKLS 175 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----SHEF 335 + +++ IV ++DG++ ++ + A+K ++Y+I + + + Sbjct: 176 QDQPKHKFRRAIVIVSDGDDNQSQYTRD--QALEMAQKADVVLYSISTNISKIESDGDKV 233 Query: 336 LRA--CASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L+ + + + +F +I ++ + Sbjct: 234 LKYYAAETGGKAFFPFKVEDLEQSFENIANELRHQ 268 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 81/219 (36%), Gaps = 41/219 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT +++++V+D S SM++ D S +++ + ++ ++ G+V F Sbjct: 83 QTKEGVNVVLVIDDSGSMQA-TDYSPNRLEATKSAAEELINDLDPKD------YVGIVVF 135 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + +T GL N + N Sbjct: 136 ESGASTASYLSPDKDSVIENLENIMEKDGATAIGDGLSLGINMADSI-----------PN 184 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 KK+++ ++DG N +++ + AK V+ IG+ + Sbjct: 185 RKKVVILLSDGVNNAGVISPDEAIQF---AKDSDIQVFTIGMGSEQPVVMGYDWFGNPQY 241 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L+ A + ++ + ++ + +S+I +I Sbjct: 242 AELDEATLKEIADETGGKYFKSVDDQTLNEIYSNINSEI 280 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVASEFING---RPNDN----IGLTIFAGESF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + ++ I T G+ A ++ D + Sbjct: 140 TQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G +Y IG+ Sbjct: 193 ----KVIILLTDGSNN--RGDISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + +++ + + + + I K Sbjct: 247 YVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 73/204 (35%), Gaps = 37/204 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 295 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 342 Query: 232 -----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 V + I ++S G T+ L+ A + H ++ D + Sbjct: 343 KDHLISVTPDS--VRDGKVYIHHMSPTG-GTDINEALQTAIRLL---NKYVAHSDSGDRS 396 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRA 338 +I+F+TDG+ + +L N ++ ++ IGI L Sbjct: 397 VS-LIIFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLEN 453 Query: 339 CASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 454 CGLTRRVHEEEDAGSQLIGFYDEI 477 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 70/216 (32%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ---SGLVTFSNK 229 D+M+ +D+S SM I + + K++N + ++ D + GL+ F++ Sbjct: 84 DLMVAVDLSGSM------QIEDMVLDGKAVNRFIMVQSVVSDFIERRKGDKLGLILFADH 137 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + + + + Sbjct: 138 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALSVKRFDLVDESN---------- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----------------- 330 +I+V +TDG N S + + A KRG +Y+IG+ Sbjct: 188 -RILVLLTDGSNNSGSISPE--QAADIAAKRGIKIYSIGVGADVMERRTLFGKERVNPSM 244 Query: 331 --RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGK 363 + + ++ N + + I K Sbjct: 245 DLDEEQLTSLAQTTGGRYFRARNAQELEQIYQEIDK 280 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 72/237 (30%), Gaps = 37/237 (15%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-----SFFDSSITKIDM 197 + I W + P+ + + AR D+M+ +D+S SME ++ Sbjct: 66 SLIGWCLLVLALARPVWVGEPIKIEKTAR-DLMLAVDISGSMEATDFVDATGKQTDRLSA 124 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGV 255 A + + + + GL+ F + + ++ G Sbjct: 125 AKQVLKQFVA-------GREGDRLGLIVFGSAAYLQAPFTDDRETWLALLDESIVNMAGP 177 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 ST + + R+ ++++ +TDG T L A Sbjct: 178 STALGDAIG------LSIAHFRESKTKN-----RVLIVLTDG--NDTGSKVPPLDAAQVA 224 Query: 316 KKRGAIVYAIGIRVIR--SHEFLR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 K G +Y + + E L A + + + ++ + + I + Sbjct: 225 KAEGVTIYTVAVGDPETVGEEALDLEVLDSIAQTTGGVSFNAADLKALQETYQRIDE 281 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVASEFING---RPNDN----IGLTIFAGESF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + ++ I T G+ A ++ D + Sbjct: 140 TQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G +Y IG+ Sbjct: 193 ----KVIILLTDGSNN--RGDISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + +++ + + + + I K Sbjct: 247 YVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 73/214 (34%), Gaps = 34/214 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LD+S SME+ D +I A ++ +E D+ G+V F+ Sbjct: 88 IIIALDISLSMEAA-DVQPNRITAAKEAAQKFVE------DLPERFNVGVVAFARSASVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIV 292 + + L +T + + A +N + ++ Q T+ IV Sbjct: 141 VSPTTDHQAVSASLGNL-----TTRAGTAIGEAVFNSLDAVRSFDQQAVTD--PPPAAIV 193 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRA 338 ++DG+N S + +++ A+ + I L+ Sbjct: 194 LLSDGDNTSGRSVAEAVDAAMSAR---VPISTIAYGTQEGTVSIDGRDVNVPVNKATLQT 250 Query: 339 C--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + Y E+ + + + IG + + + Sbjct: 251 LSEGTSGRAYEAESGSQLREVYEQIGTSLGYRTV 284 >gi|187607706|ref|NP_001120436.1| hypothetical protein LOC100145523 [Xenopus (Silurana) tropicalis] gi|170284610|gb|AAI61198.1| LOC100145523 protein [Xenopus (Silurana) tropicalis] Length = 911 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 67/180 (37%), Gaps = 24/180 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ ++D S S+ ++ + + ++ + L + P + + G+V +S++ Sbjct: 605 GALDIVFIIDSSESIGY---TNFSLEKNFVINVVSRLGSIAKDPKSDTGARVGVVQYSHE 661 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ +S + ++ L T + L++AYN++ T Sbjct: 662 GTFEAIQLDDPRIDSLSSFKEAVRRLEWIAGGTWTPSALQFAYNKLI--------KETRR 713 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACA 340 K V +TDG + D++ C V AIG E L + A Sbjct: 714 DKAKVFAVVITDGRHDPRDPDERLQVLCG----GDVDVNAIGIGDMFNKPEEDETLTSIA 769 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 57/159 (35%), Gaps = 14/159 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS---GLVTFS 227 +++ ++D S S+ +D I L++++ ++ V + G + +S Sbjct: 36 PINVFFIIDTSESIILQTAPIEILLDNMKVFIPRFLDKLEDAAYLDQVTLNWLYGGLHYS 95 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ F + + K+ ++ G T + L M + Sbjct: 96 DEVIIFSDITTSKQEYKSKLNAVNYIGRGTFTDCAL----------SNMTALIQKHGGDA 145 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 V +TDG + ++ + A+ G ++++ Sbjct: 146 INFAVVITDGHVTGSPCGGM-MHQADRARNAGIKLFSVA 183 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 57/168 (33%), Gaps = 33/168 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +D SRSM S D +++ A ++ + + + GL+ F+ Sbjct: 95 IDILFAIDTSRSMLSQ-DLKPNRLERARYAVMDFVATLGGD-------RVGLIPFAGSSY 146 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGV------STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L Q L TN + A + D Sbjct: 147 LMCPL---TLDYQAFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSNH--------- 194 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 KI++ +TDGENL L + AKK G +Y IG+ Sbjct: 195 ---KILIILTDGENL----QGDVLKAADLAKKNGLTIYTIGVGTAAGE 235 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 82 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 129 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 130 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 181 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 182 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 86/270 (31%), Gaps = 46/270 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + ++ + Y + F I ++ ++ + +D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHAPFALRIIALALIIIVLSRPQTTDSWQNSEIEGIDIMLAI 94 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 95 DVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----VGITLFAGESFTQCPLTV 146 Query: 239 GVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L IK + T G+ A ++ D + K+I+ +TD Sbjct: 147 DHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTD 195 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSHE 334 G N + + L AK G VY IG+ I Sbjct: 196 GTNN--RGEISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKT 253 Query: 335 FLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + +++ + + + + I K Sbjct: 254 LTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 82 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 129 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 130 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 181 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 182 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 66/165 (40%), Gaps = 27/165 (16%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK-----IDMAIKSI 202 + + +T A +D+++V+D S SM + + K++ Sbjct: 525 KDKDYTLTLDVTG----KRGKKAGVDVLLVIDKSGSMGLNDNGRTDSNYFNLMPTLKKTV 580 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--FFLLEW----GVSHLQRKIKYLSKFGVS 256 +++ + +PD ++V + ++FS+ +W G S RKI+ L G Sbjct: 581 PTLVDTI--LPDSDSVNRVAAISFSSDDYTGNDISTDWVDYNGKSGFNRKIEGLGTKG-G 637 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 TN ++ A ++ + KK++VF++DGE Sbjct: 638 TNWQLAMRNADKKLKPRAESQN---------KKVVVFLSDGEPTY 673 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 76/253 (30%), Gaps = 52/253 (20%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I + ++ + +++VLD S SM+ K+ + A + K Sbjct: 1048 QIELSASTKGRDEGIAAQAASIVLVLDASASMQENG----KKLKDIQDAAKAFVNTTKEK 1103 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG---VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 ++ ++ + + + +++F + N++ G Sbjct: 1104 SPISE---IAVIWYQGSEGSSSTITDSGFYTLDTSDNVDAINRFISNKNASGG-----TP 1155 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ-----SLYYCNEAKKRG--AIV 322 + D + N K + TDG + + + NEAK+ A + Sbjct: 1156 MGDALEEANSILSGRPNSSKYALLFTDGMPGYNSSNNSFNCMVANHANNEAKEIKEYAKL 1215 Query: 323 YAIGIRVIRSH------------------------EFLRA--CASPN----SFYLVENPH 352 Y IG ++ S +FL+ +SP Y +N Sbjct: 1216 YTIGYKLSGSFKWEEGHSQDSTNNHGSHKTETKAADFLKNYLASSPEGDRTYAYTTDNTD 1275 Query: 353 SMYDAFSHIGKDI 365 + F I I Sbjct: 1276 GLTKIFEDIAGQI 1288 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 102 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 149 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 150 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 201 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 202 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 258 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 259 ENCGLTRRVHEEEDAGSQLIGFYDEI 284 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 82 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 129 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 130 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 181 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 182 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 238 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 239 ENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 79 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVR 126 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 127 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 178 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 179 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 235 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 236 ENCGLTRRVHEEEDAGSQLIGFYDEI 261 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 178 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 225 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 226 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 277 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 278 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 334 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 335 ENCGLTRRVHEEEDAGSQLIGFYDEI 360 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 79/225 (35%), Gaps = 29/225 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P+ + V + + +++VLDVS SM D + +K+ +A+ +++E + Sbjct: 318 SPLADEIPVRPAGRSGVALVLVLDVSGSM---ADGNPSKLALAVAGALSLVETARPED-- 372 Query: 216 NNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + G+VTFS+ F G + + L G ST + AY Q Sbjct: 373 ----RLGIVTFSSGPRWLFPPRPMTARGKLEAKTLLDRLRP-GGSTR----MLEAYRQAI 423 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + + I+ +TDG+ +L A+ +G ++ + Sbjct: 424 EALEALELETKQ-------ILVLTDGQVEEDPAALVALAE--AARAQGIRTNSVALGGDA 474 Query: 332 SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L + F+ V P + F + + + Sbjct: 475 DRALLARMSRVGEGRFWDVPTPEDLPRLFLEEAERTFGREALEGR 519 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 63.3 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 80/228 (35%), Gaps = 31/228 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + A + ++ +DVS SM + D + +++ A + A ++++ Sbjct: 71 FSLARPARAERVPRERATI--ILAIDVSNSM-AATDIAPNRLEAAKQGAEAFVDQLPP-- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV+F+ ++ I+ L + G +T G+ + I Sbjct: 126 ----RINLGLVSFAGSATVLVPASTDRESVRAGIRGL-QLGPATAIGEGIFASLQAINTA 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 ++ + + IV ++DGE + + Q+ + A++ V I Sbjct: 181 G--KRFSDAGQSPPPAAIVLLSDGETTRGRPNTQAT---DAARQAHVPVDTIAYGTSDGT 235 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + L A + S++ + + +G I Sbjct: 236 LDVGGQEVPVPVNEQALNEIADQTEGSYHRAATGDELRSVYKGLGSSI 283 >gi|15893764|ref|NP_347113.1| heat shock protein DnaK [Clostridium acetobutylicum ATCC 824] gi|15023331|gb|AAK78453.1|AE007561_14 DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum ATCC 824] gi|325507887|gb|ADZ19523.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum EA 2018] Length = 698 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 40/194 (20%), Positives = 75/194 (38%), Gaps = 28/194 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I +D+S SM ++ AI++ ++ + L+ F++K++ Sbjct: 514 IVIAIDLSGSMRG------KPLEEAIEASKTFVDSID-----EGSFSLALIGFADKVKTL 562 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS-TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + R I L K V T++ + AYN + D G +V Sbjct: 563 INLTEDREEIFRAIDGLKKADVGTSTMSEPFSEAYNILKDAYGDC------------FVV 610 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG+ K+ + C K+ + AIG + +FL A+ + Sbjct: 611 VLTDGQWYGKKDIMAEVNKC---KEYEIEIAAIGFGNAK-KDFLDKIATCEENSIFTEVS 666 Query: 353 SMYDAFSHIGKDIV 366 ++ +FS I K I Sbjct: 667 NLKQSFSRIAKVIS 680 >gi|268574618|ref|XP_002642288.1| C. briggsae CBR-CUT-6 protein [Caenorhabditis briggsae] gi|187025290|emb|CAP35757.1| CBR-CUT-6 protein [Caenorhabditis briggsae AF16] Length = 357 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 84/235 (35%), Gaps = 29/235 (12%) Query: 132 ISRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I RY + TF IP + + + S + T ++++++LD S S+ Sbjct: 4 IPRYDFIIPIFTFLSLIPTHLANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGD-- 61 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 D K ++ + + L+ ++ + F L + L+ Sbjct: 62 -------DTFKKQLSFAMHLASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWA 114 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I+ + +TN+ L+ ++ Q T K+ + +TDG+ Sbjct: 115 IQRIEYQSGATNTGQALR------LTLEKGLQGARTG---IPKVAIVITDGQ-----SQD 160 Query: 307 QSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + +VYAIG+ ++ H+ + +P + VE+ + A + Sbjct: 161 DVSEPSQLLRDADVMVYAIGVTNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 215 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 71/215 (33%), Gaps = 42/215 (19%) Query: 173 DMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+++D+S SM+ + I ++ + ++ +E+ + G+V F + Sbjct: 98 DMMLLVDLSGSMQKEDMNDNGEYIDRLTAVKRVLSDFVEK-------RQGDRLGVVLFGD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + ++I V + G D G+ + + Sbjct: 151 HAYLQTPLTADRRTVMQQINQAVIGLVGERTAIG---------DGIGLGTKTFVDSDAPQ 201 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----------------- 331 ++++ ++DG N + + L A+K A +Y +G+ Sbjct: 202 RVMILLSDGSNTA--GVLEPLEATEIAQKYNATIYTVGVGAGEMMVKDFFMTRKVNTASD 259 Query: 332 --SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + I + Sbjct: 260 LDEQTLTKIAEMTGGKYFRARDAKELEAIYDTINQ 294 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 49/340 (14%), Positives = 111/340 (32%), Gaps = 75/340 (22%) Query: 2 FSLNL---NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL 58 FSL +R + G + I+ A+ L + + +G + + ++ + S +D +L Sbjct: 15 FSLGRYFHTLRGLRRDRTGNVAIVVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAAL 74 Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + A QI N + + K ++ N++ + Sbjct: 75 IAAVKQINNTADADALKAKVSDWFHAQVDNSYTLGE------------------------ 110 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + ++I+A + +P F N + + + S+VK + L++ IV+ Sbjct: 111 -IDIDTANHNITATASGTVPTTFMKIA----NIDTVPVSVGSAVK--GPATSYLNVYIVI 163 Query: 179 DVSRSM-------------------------ESFFDSSIT---------------KIDMA 198 D S SM ++ T + D+A Sbjct: 164 DTSPSMLLAATTSGQSTMYSGIGCQFACHTGDAHTVGKKTYANNYEYSTAKNIKLRADVA 223 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 ++ +L + + ++ GL + + + E + ++ S S Sbjct: 224 GDAVKDVLSLIDTSDSNHERIKVGLYSLGDTLTEVLAPTLSTDTARTRLSTASYGLTSAT 283 Query: 259 STPGLKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDG 297 S + + Q + + T K+++ +TDG Sbjct: 284 SKAATYFDVSLATLKQKVGAGGDGTTSGTPLKLVLLLTDG 323 >gi|312080770|ref|XP_003142742.1| hypothetical protein LOAG_07160 [Loa loa] gi|307762094|gb|EFO21328.1| hypothetical protein LOAG_07160 [Loa loa] Length = 634 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 70/204 (34%), Gaps = 21/204 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML-EEVKLIPDVNNVVQSGLVT 225 D++ ++D S S+ S D+ K + + + ++L ++ Sbjct: 440 CPAQSTDLVFLVDGSGSIGS---------DVFHKEVLRFVKDFIELFDVSPQQTHVAVIQ 490 Query: 226 FSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S+ I L S L++ I + T + +++ ++ F R+ Sbjct: 491 YSDVIRHEIDLNQYSSMEQLKQAINGIEYLTGLTRTGAAIQHVTDEGFS---ERRGARPI 547 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-ACASP 342 +I++ +TDG +D ++ A+ + ++A+G+ L S Sbjct: 548 GNGVPRILIVITDGR----SQDDVTIAV-RNARMKQIQLFAVGVTNHALDSELEVISGST 602 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 + V + + + Sbjct: 603 KRTFHVNAFEDLNARLRSAIQKVT 626 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|7495466|pir||T32949 hypothetical protein C05G6.3 - Caenorhabditis elegans Length = 341 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 48/283 (16%), Positives = 102/283 (36%), Gaps = 29/283 (10%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + ++ +G + K G IL NTW + + ++ DI +++ Sbjct: 8 IPTLVMVVTDGRSADDPK-GPGQILQAQPNTWVFAAATGDPEKVDTRELMDITG--NINH 64 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 +V+ + + R + + T++ + T LD+++VL Sbjct: 65 IVMHRGRDLATDITRRLLRQAQ-----DKCRTTTTTTTTTTTTTTTDPITGCELDLVLVL 119 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S + + ++S ++ ++K+ P V T F L ++ Sbjct: 120 DFSTTTDPVYNSY-------KDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFNLKKY 172 Query: 239 GV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + R I L G +T G++ A Q+ + +G R T K++V TDG Sbjct: 173 QTQEEVLRGIDNLKSRGGTTAIGAGIEKALTQLDESEGARPGIAT------KVMVVFTDG 226 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 N ++++ +A G +Y + + + + C Sbjct: 227 WSNKGPDPEKRAR----DAVSSGFEMYTVAYTRVETRQT--YC 263 >gi|260824533|ref|XP_002607222.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] gi|229292568|gb|EEN63232.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] Length = 1897 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 60/166 (36%), Gaps = 24/166 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +++D S+S+ + I + ++ P+ + G+ F+ ++ Sbjct: 1585 DIHVLVDGSKSV------KTRNFPAVRQFILKLAAGFEIGPNKA---RFGVYQFAKDMQT 1635 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + L IK + T + LK Y + G R +K Sbjct: 1636 EFKMNQYNNREALLDAIKKIEYMNQYQTKTGQSLKAVYEEFTKANGARDG-------VEK 1688 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG K Q K +GA V+ +G+ + E Sbjct: 1689 IIILITDG-----KATDQVRQPAQYVKNKGAHVFTVGVAKYKISEL 1729 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 81/256 (31%), Gaps = 46/256 (17%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 Y + F I ++ ++ + +D+M+ +DVS SM + D Sbjct: 49 KNYLLHAPFALRIIALALIIIVLARPQTTDSWQNSEIEGIDIMLAIDVSTSMLAE-DLKP 107 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 +++ A + P+ N G+ F+ + L + L IK + Sbjct: 108 NRLEAAKDVAAEFING---RPNDN----VGITLFAGESFTQCPLTVDHAVLLNLIKDVKC 160 Query: 253 --FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 T G+ A ++ D + K+I+ +TDG N + D L Sbjct: 161 GLIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTDGTNN--RGDISPLT 207 Query: 311 YCNEAKKRGAIVYAIGIRV----------------------IRSHEFLRACA-SPNSFYL 347 AK G VY IG+ I + + +++ Sbjct: 208 AAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFR 267 Query: 348 VENPHSMYDAFSHIGK 363 + + + + I K Sbjct: 268 ATSNSKLKEVYEEIDK 283 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 63.3 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 59/174 (33%), Gaps = 46/174 (26%) Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK---KIIVFMTDGEN 299 ++ K+ + G +TN GL + ++ + + K K+IV +TDG+N Sbjct: 389 VKGKLDQMIASG-NTNVAMGLIWGWHTLSKNAPFADGVDPATTVGKRTTKVIVLLTDGDN 447 Query: 300 LSTKE----------------------------------------DQQSLYYCNEAKKRG 319 + D + C AK G Sbjct: 448 TNDTYNNPNASIYTGYGYITQGRLLNASNSPLGATSTATNRRDAIDSREARACTNAKAAG 507 Query: 320 AIVYAIGIRVIR-SHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +YAIG+ V S L+ CAS P +Y V + + F+ I I RI Sbjct: 508 VQIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNTIAGSIQNLRIT 561 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/235 (16%), Positives = 81/235 (34%), Gaps = 32/235 (13%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + +G + + A+ L I ++ +I++S K L +D + + AA Sbjct: 16 RLGADERGAIAVQFALLLIPIAVLTFGLIDISRASVQKRQLQDALDAATLMAARS----- 70 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 L + ++ FV ++ V T ++V+ P Sbjct: 71 TATTNADLDTIGDAALATEMAGLGVTFGPGNSSFVLGDNNTVVGTIQNVVIKP------- 123 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 I +S + + T++V + L++ +VLD + SM S Sbjct: 124 --------------IISNLWSSTNTPVSATATVMRSINH---LEVALVLDNTGSMASSLG 166 Query: 190 SSITKIDMAIKSINAMLEEVK---LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 S +KI I + ++++ + + V+ +V FS + + S Sbjct: 167 SGGSKITALITASKSLVDVLSAAAARATEADAVKISVVPFSMTVNIGSTYQTQTS 221 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 171 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 218 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 219 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 270 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 271 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 327 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 328 ENCGLTRRVHEEEDAGSQLIGFYDEI 353 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 69/216 (31%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + D I+++ + E + + GL+ F++ Sbjct: 84 DLMMAVDLSGSMQIEDMVLDGKTVDRFTMIQAVVSDFIERRKGD------KLGLILFADH 137 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + A + + Sbjct: 138 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDRVDESN---------- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEF---------- 335 +I+V +TDG N S + A KRG +Y+IG+ Sbjct: 188 -RILVLLTDGSNNSGSISPE--QAAAIAAKRGVKIYSIGVGAEVMERRTLFGKERVNPSM 244 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A A + ++ N + + I K Sbjct: 245 DLDETQLTALAQTTGGLYFRARNAQELESIYQEIDK 280 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 74/180 (41%), Gaps = 25/180 (13%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S K +D+++VLD S+SM + D ++++ A +N +L+E+K Sbjct: 90 PQCGS-KSEMTKRKGIDVVVVLDASKSMLA-RDVQPSRLERAKLELNTLLDELKGD---- 143 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GLV F+ L S ++ ++ + +N LK A NQ+ Sbjct: 144 ---RVGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEQMPQGGSNIGAALKLA-NQVLS- 198 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + ++ +V ++DGE+L + + + K G V A+G+ Sbjct: 199 -------NADRGAKERAVVLLSDGEDLFGEVGEAT----EALKDGGVQVLAVGVGSESGE 247 >gi|87308177|ref|ZP_01090319.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] gi|87289259|gb|EAQ81151.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] Length = 1032 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 75/201 (37%), Gaps = 32/201 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+VLD S SM+ K+ M + A + + +G++ F ++ + Sbjct: 459 LMLVLDKSGSMQG------EKMQMTQGAALAAIRAMGAAD------FAGVIGFDSQAQRI 506 Query: 234 FLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + +++ LS G TN TPG+ + + ++ +H + Sbjct: 507 VPIRKVDNPGMFVAQVRKLSASG-GTNMTPGVALGFRDLQNVDAGVKH-----------M 554 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVE 349 + ++DG+ E ++ KK G V A+ + + + A FY V Sbjct: 555 IVLSDGQ----TEPGNVAQIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAVN 610 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 NP ++ F + + + Sbjct: 611 NPKAIPRIFMREARRVAQPLV 631 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 61/340 (17%), Positives = 116/340 (34%), Gaps = 20/340 (5%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN-NRKKLKGGDILCRIKNTWNMSFRNEL 98 VS K + S+ ++ NG N K G + + + Sbjct: 47 VSQKESKKIDNEKQVVHSIPVTFEEMTKYPNGKYNGKYKVGLLDISKESEPIMKEAARDT 106 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 N D+ + S D+ V I K+PL+ + Sbjct: 107 ESNIKSADLKNKSNSEQADLYVHMMYSLLKQEIIPFDKMPLQILEIGRVEDEEKK--SNG 164 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + N + ++ I+LD S SM D + K+D+A ++I + ++ +V+ Sbjct: 165 TKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEAVNVSLR 223 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST--NSTPGLKYA-YNQIFDMQG 275 V + +K + I L K+ +T S G + + + + Sbjct: 224 V------YGHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAIK 277 Query: 276 MRQHCNTEDANYKKIIVFM-TDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RS 332 K I+++ +DG E +++ N K + IG +V + Sbjct: 278 KSTETFQSAKENDKNIIYIVSDGVETCGGNPVEEAQKVSNSNIKPIMNI--IGFQVDHEA 335 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ A S + L + + D F GKDI ++R+ Sbjct: 336 EKQLKEIAEVSKGKYVLANSAKELQDQFKETGKDITSRRL 375 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 83/234 (35%), Gaps = 33/234 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F G + ++ + LP++ L +++S I+ L D Sbjct: 23 IMRFRREKAGNVAVVFGLTLPVLALCFATAVDLSGIYGANRSLQQAADV---------AA 73 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G + + D L + + F N D+ +T V + +G Sbjct: 74 LAAGREYGRTQDADYLSSVSEAF-----------FFHNAGDETRGTTQFSYDGVFREDGL 122 Query: 128 SI-SAISRYKIPLKFCTFIPWYTNSR----HIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 +I +R ++P F + W T + + S + V ++ +++ +VLD S Sbjct: 123 TILKVTARRQLPTFFGDALMWVTGGKLDWRQFPLYAKSEIVVQNR---SIELALVLDNSG 179 Query: 183 SM--ESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNNVVQSGLVTFSNKIE 231 SM S +KID+ + + ++ VQ +V FS+ + Sbjct: 180 SMQDRPRSGGSKSKIDIIKDAAEDLAKQFLSSDKGSTEEFPVQFAVVPFSSSVN 233 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 23/140 (16%) Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + ST + G AY+ + + + N K V ++ DQ Sbjct: 500 EIRDYNRSTYGSYGYGAAYSHGSSSRKPGRIYDGTTGNAKDYSV-----DSYVAAMDQNV 554 Query: 309 LYYCNEAKKRG--------AIVYAIGIRVIRS---HEFLRACAS-------PNSFYLVEN 350 C K G +++ I + + + CAS +Y ++ Sbjct: 555 AKVCENVKADGRKPGGTDGILIFTIAFDLRDGEPVKKLMEDCASNGLIDASEKLYYDAQS 614 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 + AF I + I + RI Sbjct: 615 QEELAAAFQSITEQISSLRI 634 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 31/85 (36%), Gaps = 2/85 (2%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + I + G TN GL + + + + + ++ + K++V Sbjct: 424 TPLTKTLKTVTDAIDVMGAQGA-TNIPHGLAWGWRLLTARPPFTEGRSHDEPDNLKVLVL 482 Query: 294 MTDGENLST-KEDQQSLYYCNEAKK 317 MTDG N + L + + Sbjct: 483 MTDGNNTYNLNSGGRPLEIRDYNRS 507 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 81/235 (34%), Gaps = 47/235 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T++ NS+ + +D+M+ +DVS SM + D +++ A + P Sbjct: 71 LARPQTTNSWQNSEIEG-IDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RP 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIF 271 + N G+ F+ + L + L IK + T G+ A ++ Sbjct: 126 NDN----IGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRLK 181 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-- 329 D + K+I+ +TDG N K D L AK G VY IG+ Sbjct: 182 DSKAKS-----------KVIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNG 228 Query: 330 --------------------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 229 MAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|46362531|gb|AAH68979.1| Slc35c2 protein [Danio rerio] Length = 1816 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 78/230 (33%), Gaps = 24/230 (10%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 T T +++ +A+ ++++++D S S+ T + + Sbjct: 546 TSKLRKTTPTQPGTAVSTRKPFECSAEAQAELVLLVDGSWSIG------RTNFRKVREFL 599 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNST 260 + + P+ VQ L +S + L S L ++ G +T + Sbjct: 600 QGLAVPFHIGPEG---VQIALSQYSGDPRTEWHLNNFTSKEPLLEAVRNFRYKGGNTFTG 656 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L I ++ + N + ++ +TDG K ++ N K G Sbjct: 657 QAL------IHALENNLKEEVGARPNTPQFLLLLTDG-----KSQDDAIAAANRLKNAGV 705 Query: 321 IVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 + AIG++ E + + P + Y V + + + + + K Sbjct: 706 EIIAIGVKNADEAELRQVASEPLELNVYKVNDFPLLSKLVGKLARILCGK 755 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 84/238 (35%), Gaps = 33/238 (13%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 PW + +++ I +V Q +++++++D S SM + K+ + ++ Sbjct: 182 PAPWNPQRQLLLVGIQG-YRVAPQDIPAVNLVLLIDTSGSM-----ADRAKLPLLKSALR 235 Query: 204 AMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ +++ + +V + S + + + I L G STN Sbjct: 236 QLVTQMRAQD------RVAIVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASG-STNGGA 288 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGA 320 GL+ AY + + N IV +DG+ + D L Y +KRG Sbjct: 289 GLELAY------AEAAKGLVKDGVNR---IVLASDGDFNVGRTDLAQLKDYVGSQRKRGI 339 Query: 321 IVYAIGIRVIRSHEF----LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +G+ ++ L + Y S+ A ++ + K Sbjct: 340 ALTTLGLGSGNYNDAMAMQLANAGDGSYHY----IDSLLQARKVFASELSATLLTIAK 393 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 77/211 (36%), Gaps = 29/211 (13%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S+++ S T D++ VLD S S+ + A + M+ + ++ Sbjct: 31 FVSTIERYSTTKN--DIVFVLDESGSIGADVFP-------AELAFTEMVARLLVVSP--E 79 Query: 218 VVQSGLVTFSNK-----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++TFSN + + + ++ + T + L YA +++ Sbjct: 80 YSRLTVMTFSNDNLVHIDQVGSSGDTNMCKFVHELNQIPYRSGGTRTREALGYAGEILWN 139 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +I+V ++DG+ S E + +G +++ +G+ I Sbjct: 140 A----------RQEANRIVVLISDGQANSGSEPS---EIARLLRVKGIVIFGVGVAHINK 186 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 E L +SP Y++ N + + K Sbjct: 187 DELLDVASSPAHTYMLRNFEYIKKVNKDLRK 217 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 79/219 (36%), Gaps = 36/219 (16%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + SV+ + R+D++ ++D S S+ IK + +L + + Sbjct: 115 DRLSGSFKKSVEKIKTKNKRVDIVFLIDASSSVG------RQNFASEIKFVKKLLSDFNV 168 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR---------KIKYLSKFGVSTNSTPG 262 N + ++TFS++ + F ++ ++ ++ ++ G T + Sbjct: 169 SY---NYTRVAVITFSSQKKIFRHIDQISQSVEDNDKCLLLNYQVPRIAFSGGGTYTYGA 225 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK A + + KKII +TDG + + K ++ Sbjct: 226 LKEAEEIFKNA----------RLDSKKIIFLITDGFSNGRDPIPLAGRL---KKDNNVVI 272 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 Y+IGI+ E ++P + + D+F H Sbjct: 273 YSIGIQSGNYAELHAIASAPE-----GDHCYLLDSFDHF 306 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + D ++++D+ + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWKDEEVSRLDLVKALLGDFLQD-------REGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N + + A + +Y IGI Sbjct: 193 QSRVLVLITDGANNGGRIHPLTA--ARLAAQEDVRIYTIGIGA 233 >gi|74217197|dbj|BAC31374.2| unnamed protein product [Mus musculus] Length = 338 Score = 62.9 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 79/207 (38%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 + D +++ VLD S S+ + + + + + + +++ ++ V + G Sbjct: 50 KADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVPQFISQLQNEFYLDQVALSWRYGG 108 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ + F T + L QI RQH Sbjct: 109 LHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGKG 162 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N V +TDG + + A++ G ++A+ + + LR A+ Sbjct: 163 VVN---FAVVITDGHVTGSPCGGIKMQ-AERAREEGIRLFAVAPNRNLNEQGLRDIANSP 218 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 N +M + I +D + + I Sbjct: 219 HELYRNNYATMRPDSTEIDQDTINRII 245 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 85/242 (35%), Gaps = 30/242 (12%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + L + + ++ + +T S A +++ IV+D S SM K+ Sbjct: 5 VELAHPQILAGRKMTTYLKVGLTGQELEASAKRAPVNVTIVIDKSGSMGG------DKMV 58 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFG 254 A ++ L+ + V+ +V + + + ++ I + G Sbjct: 59 HAREAAKQALDRLGAGDMVS------VVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGG 112 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN 313 + + K A + +R++ N +V ++DG N+ Q Sbjct: 113 STALFSGISKGA-------EELRRNKRPNQVNR---VVLLSDGMANVGPSSPQDLGRLGA 162 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS-HIGK--DIVTK 368 K G V +G+ + + + + A S + +EN ++ F G +V + Sbjct: 163 SLAKEGITVTTLGLGLGYNEDLMTELALRSDGNHAFIENSQNLAGIFQTEFGDILSVVAQ 222 Query: 369 RI 370 RI Sbjct: 223 RI 224 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 67/181 (37%), Gaps = 27/181 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 I + + ++ D+M+VLDVSRSM + D + T++ A + + ++ Sbjct: 81 IPGRTETVTSSEVSADIMVVLDVSRSMLAD-DVAPTRLARAKAEVAELSSALRG------ 133 Query: 218 VVQSGLVTFSNKIEEFFLLE--WGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + GLV F+ + L +G + + S T L+ A Sbjct: 134 -HRIGLVAFAGRASVLAPLTPDYGFFRMILDGVDTKSVSRGGTEIGQALRKAVRSFDPGP 192 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G K+I+ +TDGE+ + + A + G V AIG + + Sbjct: 193 GA------------KMILLITDGEDHGGYAEDAARE----ALEAGVRVVAIGFGSEQGSQ 236 Query: 335 F 335 Sbjct: 237 I 237 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/255 (14%), Positives = 74/255 (29%), Gaps = 52/255 (20%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI----TK 194 L + + P + ++ +D+++ LD+S SM + ++ Sbjct: 55 LPYLRLAVLALGIAALARPQAVERESRVRSRG-MDLVLALDLSTSMLAEEQGREGRGESR 113 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--K 252 + A + + + + GLV F+ + L LQ + L Sbjct: 114 LAAAKRVLAGFIG-------GRKDDRIGLVAFAGRPYPAAPLTSDHQWLQGVVDRLDTGA 166 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + N++ + ++ +TDG N + Sbjct: 167 VEDGTALGDAILSGVNRL-----------RRRPAESRALILITDGRNN---AGAEPQLAA 212 Query: 313 NEAKKRGAIVYAIGIRVIRS----------------------HEFLRACA--SPNSFYLV 348 AK G V+AIGI S L+ A + ++ Sbjct: 213 QAAKALGIRVHAIGIGSRGSAVIPVPSPLGGTIYRRLDAELDAATLKGVAELTGGRYFEA 272 Query: 349 ENPHSMYDAFSHIGK 363 + + F+ I + Sbjct: 273 GDATVLSRVFAEIDR 287 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 47/237 (19%), Positives = 86/237 (36%), Gaps = 36/237 (15%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 +R K+ P + +++ +T +S LD++ VLDVSRSME Sbjct: 60 VPARVKLRSINNDMAPLEESKLKVMLELTGG-DSSSYGRPGLDLVAVLDVSRSMEG---- 114 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK---- 246 K++ ++ +++++ + +VTFS L Q + Sbjct: 115 --DKMEKMKTAMLFIIKKLGPTD------RLSIVTFSGGANRLCPLRQTTGKSQEEFENL 166 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I L+ G TN T GL+ A + + I+ M+DGE + + Sbjct: 167 INGLNADGA-TNITAGLQTALKVLKGRSFNGERVVG--------IMLMSDGEQNAGSDAT 217 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 ++ G + + L+A A +F V+N S+ AF+ Sbjct: 218 GVSV-------GNVPIHTFGFGINHEPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQ 267 >gi|50355939|ref|NP_598499.1| anthrax toxin receptor 2 precursor [Mus musculus] gi|68052321|sp|Q6DFX2|ANTR2_MOUSE RecName: Full=Anthrax toxin receptor 2; Flags: Precursor gi|49901393|gb|AAH76595.1| Anthrax toxin receptor 2 [Mus musculus] gi|74141652|dbj|BAE38584.1| unnamed protein product [Mus musculus] gi|74217872|dbj|BAE41940.1| unnamed protein product [Mus musculus] Length = 487 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 78/230 (33%), Gaps = 30/230 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + S+ + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLAVGGPGSLLQAQEQPSCKKAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 + ++ V+ ++ + FS++ L + + ++ L T G Sbjct: 65 FVHQLTERF-VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK A QI + G++ II+ +TDG+ + +++ GA V Sbjct: 124 LKLANEQIQNAGGLKASS---------IIIALTDGKLDGLVP-SYAENEAKKSRSLGASV 173 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 Y +G+ + R S + + V+ + A I I+ + Sbjct: 174 YCVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 20/72 (27%), Positives = 34/72 (47%), Gaps = 1/72 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 ++ ++ ++ C AK +G +VY IG LR CAS ++ Y + + DAF+ Sbjct: 471 NSTKNARTRSVCEAAKAKGIVVYTIGFEAPSNGVAVLRDCASSDAHYFDVDGLEIKDAFA 530 Query: 360 HIGKDIVTKRIW 371 I I R+ Sbjct: 531 SIATSIRQLRLT 542 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 80/224 (35%), Gaps = 41/224 (18%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++R F + G M T F + V G+ +++ + +TVL +DR+++ AA Sbjct: 22 SLRAFRRDDSGVMAYPTVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLAAA---- 77 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + N+ + + +V Sbjct: 78 -------------DLDQTQPPAVVVQDYLNKAGLGEYYQE-----------PIVESGLGY 113 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + A I F + ++N + + TS + ++ L++ +VLDVS SM S Sbjct: 114 KRVQAT----IDATFEAHLLRFSNGNDLPVFATSKAE---ESIDGLEISLVLDVSGSMNS 166 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ + ++ + + + + +V ++ ++ Sbjct: 167 N-----SRLSNLKVAAKDFIDTM-VANTTDGKMSISVVPYATQV 204 >gi|71988814|ref|NP_499400.2| CUTiclin family member (cut-6) [Caenorhabditis elegans] gi|35210138|emb|CAA97806.2| C. elegans protein M142.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 572 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/234 (15%), Positives = 82/234 (35%), Gaps = 29/234 (12%) Query: 133 SRYKIPLK---FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 Y I L F + I + + + S + T ++++++LD S S+ Sbjct: 4 IPYDISLSITSFLSLILICSANPIDNGLVDSELIHECVTHKAVEVILLLDASGSIGD--- 60 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKI 247 D K ++ + + + L+ ++ + F L + L+ I Sbjct: 61 ------DTFKKQLSFAMHLASRLNISEDGSHMALIQYAETPKLEFSLGQFNHPTQLEWAI 114 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + +TN+ L+ ++ Q K+ + +TDG+ Sbjct: 115 QRIEYQSGATNTGQALR------LTLEKGLQGARPG---IPKVAIVITDGQ-----SQDD 160 Query: 308 SLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + +VYAIG+ ++ H+ + +P + VE+ + A + Sbjct: 161 VSEPSQLLRDADVMVYAIGVTNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 214 >gi|163849338|ref|YP_001637382.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527332|ref|YP_002571803.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670627|gb|ABY36993.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451211|gb|ACM55477.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 418 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 85/226 (37%), Gaps = 30/226 (13%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P + + ++ +++ L++ +V+D S SM ++ Sbjct: 10 PSRSVLPASQEPQLLYALVELSAQSGATKMPRLPLNLCLVIDRSSSMRG------ERLQQ 63 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGV 255 ++ +L+ + + LVTF+++ E L + ++R+I + G Sbjct: 64 VKQAAMQILDLL------GDHESFALVTFNDRAEVVVSAQLARARAEIKRQISAIEAAGG 117 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + +T GL ++ Q A ++ ++ +TDG ++ + + A Sbjct: 118 TEMAT-GLALGVQEL-------QRAMMPRAVHR--LLLLTDGRTYG--DEGRCVEIARRA 165 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNS---FYLVENPHSMYDAF 358 + RG + A+GI + + L A+ + Y + + + F Sbjct: 166 QSRGIGITALGIGSEWNEDLLETIAARENSRTHY-ITSAAEITKIF 210 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 66/163 (40%), Gaps = 27/163 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M +LD S+SM + D +++ + +I +++ + + GLV F+ + Sbjct: 96 DVMFLLDCSKSMLAA-DVQPSRLSRSKYAILDFVQQ-------HGRGRVGLVAFAGQAFL 147 Query: 233 FFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + + + T+ L AY + D + K Sbjct: 148 QCPLTFDYDAFRDALLAIDEQTIPVGGTDIGRALDEAYRAM----------EKNDRH--K 195 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 I+V +TDGE+L ++ + ++G +VY IG+ Sbjct: 196 ILVLITDGEDL----EKAGIKTAQALAEKGIVVYTIGVGTAAG 234 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 71/213 (33%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L N AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAANIAKQYHTTIYTLGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + + I+++++ K +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMDYRDMRWQEDEISRLELIKKLFGDFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++V +TDG N A ++ +Y IGI Sbjct: 193 ESRVLVLITDGANTG--GQIAPQIAAQLAAEQQVKIYTIGIGA 233 >gi|149732066|ref|XP_001492602.1| PREDICTED: similar to complement factor B [Equus caballus] Length = 768 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 77/221 (34%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---- 228 IVLD S SM + S ++ N + + ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYMVLDGSDSIGAHNFTRAKNCLRDFIEKVASYGVKPRYGLVTYATVPKV 320 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L K TN+ L+ Y+ + +G Sbjct: 321 LIRVSQERSSDADWVTEKLNEISYEDHKLKTGTNTRKALQAVYS-MMSWEGNAPPEGWNR 379 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG + + ++ + ++ +Y G+ + + E Sbjct: 380 TRH--VILLMTDGLHNMGGDPVPVIHEIRDLLDIGRDRKNPREDYLDIYVFGVGPLVNQE 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + T + Sbjct: 438 NINALASKKDGEQHVFKVKDMENLEDVFFQMLDETRTLGLC 478 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 103/321 (32%), Gaps = 47/321 (14%) Query: 45 FMKTVLHSMI--DRSLVHAATQIMNEGNGNNRKKLKGGDILCRI---KNTWNMSFRNELR 99 + + + + V A Q+ + G I +N + +N + Sbjct: 69 QRQAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIM 128 Query: 100 DNG------FVNDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RYKIPLKFCTFIPW 147 G F D+D +T ++ P+ + + Y PL P+ Sbjct: 129 VAGEIPVSTFSIDVDTGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYDYPLPAKNAAPF 188 Query: 148 YTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + P + +++ +LDVS SM +S+ K+ + Sbjct: 189 SVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSM-----ASVDKLPL 243 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ + ++ V+ VV +G + L ++ LS G ST Sbjct: 244 LQTALKLLTAQLSAQDKVSIVVYAGAAG----VVLDGASGNDTQTLNYALEQLSA-GGST 298 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 N G+ AY ++H N ++ TDG+ N+ + + + K Sbjct: 299 NGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLIALIEKEK 349 Query: 317 KRGAIVYAIGIRVIRSHEFLR 337 G + +G + ++ L Sbjct: 350 DHGIGLTTLGFGLGNYNDQLM 370 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 63/208 (30%), Gaps = 28/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A ++ ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLA-GDVEPDRLTAAKEAARRFVDGLP------DEFNVGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L I L + + A N +D + IV Sbjct: 141 VPPDTDREALDEGIDRLVEGATGVQ-GTAIGEAINTSLGAVKALDGEAAKDPPPAR-IVL 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + +A V+ I + L Sbjct: 199 LSDGANTS---GMDPMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDEV 255 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + F+ ++ + + IG + Sbjct: 256 ARETGGQFHEADSAKELRAVYDDIGSSV 283 >gi|148681993|gb|EDL13940.1| mCG48880 [Mus musculus] Length = 249 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 64/163 (39%), Gaps = 22/163 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ V+D S S+ + + + + V L + G++ + Sbjct: 5 CKETPLELVFVIDSSESVGP------ENFQIIQSFVKTLADRVAL---DLGTARIGIINY 55 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+K+E+ L+ S + + + G T + L+ A N +F Sbjct: 56 SHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAA-NDMF---------KEAR 105 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+ + +TDG+ +++ ++ +A ++ IG+ Sbjct: 106 PGVKKVALVITDGQ-TDSRDKKKLADVVKDANDSNVEIFVIGV 147 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 61/340 (17%), Positives = 116/340 (34%), Gaps = 20/340 (5%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN-NRKKLKGGDILCRIKNTWNMSFRNEL 98 VS K + S+ ++ NG N K G + + + Sbjct: 47 VSQKESKKIDNEKQVVHSIPVTFEEMTKYPNGKYNGKYKVGLLDISKESEPIMKEAARDT 106 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 N D+ + S D+ V I K+PL+ + Sbjct: 107 ESNIKSADLKNKSNSEQADLYVHMMYSLLKQEIIPFDKMPLQILEIGRVEDEEKK--SNG 164 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + N + ++ I+LD S SM D + K+D+A ++I + ++ +V+ Sbjct: 165 TKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEAVNVSLR 223 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST--NSTPGLKYA-YNQIFDMQG 275 V + +K + I L K+ +T S G + + + + Sbjct: 224 V------YGHKGSNDEKDKTASCGAIENIYTLQKYDQTTFRQSLDGFQPVGWTPLAEAIK 277 Query: 276 MRQHCNTEDANYKKIIVFM-TDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RS 332 K I+++ +DG E +++ N K + IG +V + Sbjct: 278 KSTETFQSAKENDKNIIYIVSDGVETCGGNPVEEAQKVSNSNIKPIMNI--IGFQVDHEA 335 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ A S + L + + D F GKDI ++R+ Sbjct: 336 EKQLKEIAEVSKGKYVLANSAKELQDQFKETGKDITSRRL 375 >gi|157151714|ref|NP_001096685.1| collagen alpha-3(VI) chain [Canis lupus familiaris] gi|70571939|tpe|CAI77244.1| TPA: collagen, type VI, alpha 3 [Canis lupus familiaris] Length = 3169 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 76/218 (34%), Gaps = 24/218 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + A D++ ++D S S+ + + + +++ Sbjct: 14 CLFLSGFSLTRAQQQQADVKNGAAADIIFLVDSSWSIGK------EHFQLVREFLYDVIK 67 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV F+ FLL + + I +S G S + GL+Y Sbjct: 68 SLAVGDSD---FRFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNETGKGLEY 124 Query: 266 AYNQ-IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + G R ++IV +T G D + E K A V+A Sbjct: 125 VMQHHLTEAAGSRA-----GDGVPQVIVVLTHG-----HSDDRLALPSAELKSADANVFA 174 Query: 325 IGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 IG+ + P + + +EN S++D + Sbjct: 175 IGVEDADEGALKEIASEPPNMHVFNLENFTSLHDIVGN 212 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/271 (14%), Positives = 96/271 (35%), Gaps = 24/271 (8%) Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 F + +G V+ ++ + N + ++P + Sbjct: 1556 TSRFSQVISSSGIVSLGVGNRNIDRAELQTITNNPRLVFTVREFRELP-NIEDRVMHAFG 1614 Query: 151 SRHIVM--PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + P + + + D++ +LD S+ DS + + + E Sbjct: 1615 PSGVTPAPPGVDTPSPSRPEKKKADVVFLLD--GSINFRRDS-------FQEVLRFVSEI 1665 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKY 265 V + + + +Q GLV +++ + F L+ + + I + G N+ G+++ Sbjct: 1666 VDTLYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGIEH 1725 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + + + D +I +T G K + + ++G V+A+ Sbjct: 1726 ----LRLNHFVPEAGSRLDQRVPQIAFVITGG-----KSVEDAQEASLALTQKGVKVFAV 1776 Query: 326 GIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 G++ I S E + ++ + + V N + + Sbjct: 1777 GVKNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 85/251 (33%), Gaps = 26/251 (10%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + +V + + Y S + ++P K T I T+ + + ++ Sbjct: 1371 RDADQEQLVKISLSPEYVFSVNTFRELPSLEQKLLTPITTLTSEQIQQILASTRYPSPGI 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S S++ I + ++ + + P+ V+ G V FS Sbjct: 1431 ESDAADIIFLIDSSDSVKP------DDIAYIRDFVIKIVRRLNIGPNK---VRIGAVQFS 1481 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 N++ F L+ S I L + G N+ L++ +F E Sbjct: 1482 NEVFPVFYLKTHKSQA-SVINALRQLRHRGGSPLNTGKALEFVARNLFVKS---AGSRIE 1537 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 D + +++F+ K + + G + +G R I E +P Sbjct: 1538 DGVPQHLVLFL------GGKSQDDTSRFSQVISSSGIVSLGVGNRNIDRAELQTITNNPR 1591 Query: 344 SFYLVENPHSM 354 + V + Sbjct: 1592 LVFTVREFREL 1602 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 52/330 (15%), Positives = 109/330 (33%), Gaps = 34/330 (10%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + + L D +A EG + GG L I EL+ Sbjct: 520 GSVLYTGSALDFARDNLFTSSAGYRAAEGVPKLLVLITGGKSLDEISQPAQ-----ELKR 574 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 N ++ + ++ + + IP +F + T Sbjct: 575 NSIMSFAVGSKAADQAELEEIAFDSSLVF-------IPTEFRAAPLQGLLPGLLAPLRTL 627 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + R D++ +LD S F+ + ++ + + D ++ Sbjct: 628 SGTSEVHANRR-DIIFLLDGS------FNVGKANFPYVRDFVMNVVNNLDVGSDN---IR 677 Query: 221 SGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV FS+ + EF L + S L ++ L K G N+ L Y + Sbjct: 678 VGLVQFSDTPVTEFSLNTYQTKSDLLAHLRQLQLKGGSGLNTGSALSY----VHANHFTE 733 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + ++++ +T G + + L N + G + + +G E + Sbjct: 734 AGGSRIREHVPQLLLLLTAG-----RSEDAYLPAANALARAGVLTFCVGANQANRAELEQ 788 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +P+ YL+++ S+ + + + T Sbjct: 789 IAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/271 (13%), Positives = 85/271 (31%), Gaps = 39/271 (14%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPPNMHVFNLENFTSLHDIVGNLVSCVQSSVAPEGAGGTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + +LE + + ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSAHFAVIRDFLVNLLERLSV---GAQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN 281 +S++ F L+ + Q +K L+ G N L + F R + Sbjct: 285 QYSDEPRTVFSLDTYSTKAQVLDAVKALAFTGGELANVGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +++V ++ G + D K+ + +G + E + Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRDGVVA-----LKQASVFSFGLGAQAASRAELQHIATN 395 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 N + V F G D+ + + Y Sbjct: 396 DNLVFTVPE-------FRSFG-DLQEQLLPY 418 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 78/213 (36%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 2605 RRAAGSDVDIDMAFILDSSESTSLF------QFNEMKKYIGYLVRQLDISPDPKASQHFA 2658 Query: 220 QSGLVT---FSN-------KIEEFFLLE-WGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++ F L +G + + +++ + + ++Y Sbjct: 2659 RVAVVQHAPYESLGNPSLPPVKVEFSLTDYGSKEKLVDFLSRRMTQLEGTRDLGSAIEYT 2718 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +F+ + KIIV M GE + ++ +AK +G +G Sbjct: 2719 IENVFES--------APNPRDLKIIVLMLTGEVEKEQLEEAHRVIL-QAKCKGYFFVILG 2769 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + PN + ++ P + + Sbjct: 2770 IGRKVNVKEVYSFASEPNDVFFKLLDKPTELNE 2802 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + PD V+ +V +S++ Sbjct: 1029 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R Sbjct: 1079 EFYLNSYMDQQSVVSAIRRLTILGGPTPNTGAALDFVLRNILISSAGSRI-----AEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRGPSVVLKRGGAVPIGIGIGNAD 1171 >gi|8569519|pdb|1QC5|B Chain B, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 80/211 (37%), Gaps = 35/211 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + D +N +LE + + P Q G+V + Sbjct: 2 TQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP---KQTQVGIVQYGEN 50 Query: 230 IEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L S + + + + G T + G+ A + F Sbjct: 51 VTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGIDTARKEAFT------EARGARRG 104 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH-------EFLR 337 KK++V +TDGE+ ++ + C ++I I R + E ++ Sbjct: 105 VKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIK 161 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDI 365 + AS F+ V + ++ +G+ I Sbjct: 162 SIASEPTEKHFFNVSDEIALVTIVKTLGERI 192 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 2/74 (2%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDA 357 N STK D ++ CN AK +G +VY IG L+ CAS ++ + + DA Sbjct: 485 NTSTK-DARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDA 543 Query: 358 FSHIGKDIVTKRIW 371 F+ I I R+ Sbjct: 544 FASIATSIRQLRLT 557 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 89/258 (34%), Gaps = 43/258 (16%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 +R F + G + FL + V G+ +++ + +T+L +DR+++ AA Sbjct: 22 GAQLRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAADL 81 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + L + + + + +T Sbjct: 82 DQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKK------VQSTIDTT---------- 125 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 F T + +++ + + + TS + ++ L++ +VLDVS SM Sbjct: 126 ----------------FETHMLKFSSGQDMPLYATSRAE---ESIDGLEISLVLDVSGSM 166 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSH 242 S +++ + ++ + + ++N + ++ ++ ++ L ++ + Sbjct: 167 GSN-----SRLANLKVAAKDFVDTM-IANTIDNKMSISIIPYATQVSLPTELMDQYNTTD 220 Query: 243 LQRKIKYLSKFGVSTNST 260 ++ G +T Sbjct: 221 EHAYSNCVNFVGSHFQTT 238 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 2/74 (2%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDA 357 N STK D ++ CN AK +G +VY IG L+ CAS ++ + + DA Sbjct: 487 NTSTK-DARTRAVCNAAKNQGIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGLEIRDA 545 Query: 358 FSHIGKDIVTKRIW 371 F+ I I R+ Sbjct: 546 FASIATSIRQLRLT 559 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 89/258 (34%), Gaps = 43/258 (16%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 +R F + G + FL + V G+ +++ + +T+L +DR+++ AA Sbjct: 24 GAQLRAFRQDDSGVLLKPMVGFLLSMLAVGGIGVDLMRMERDRTILQYTLDRAVLAAADL 83 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + L + + + + +T Sbjct: 84 DQPLPPAAVVQDYLSKAGLNKYYTPPVAETGLGFKK------VQSTIDTT---------- 127 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 F T + +++ + + + TS + ++ L++ +VLDVS SM Sbjct: 128 ----------------FETHMLKFSSGQDMPLYATSRAE---ESIDGLEISLVLDVSGSM 168 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSH 242 S +++ + ++ + + ++N + ++ ++ ++ L ++ + Sbjct: 169 GSN-----SRLANLKVAAKDFVDTM-IANTIDNKMSISIIPYATQVSLPTELMDQYNTTD 222 Query: 243 LQRKIKYLSKFGVSTNST 260 ++ G +T Sbjct: 223 EHAYSNCVNFVGSHFQTT 240 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 60/163 (36%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + I+++D+ + L++ + GL+ F + Sbjct: 91 DLLVAVDVSGSMDFPDMQWQGDEISRLDLVKALMGDFLQD-------RQGDRVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + G +T + A ++ + Sbjct: 144 QAYLQAPLTFDRRTVRTFLDEAQIGIAGKNTAIGDAIGLAVKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++++ +TDG N L A + G +Y IGI Sbjct: 193 QSRVLILITDGANNG--GQIHPLTAARLAAQEGVRIYTIGIGA 233 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 76/214 (35%), Gaps = 36/214 (16%) Query: 166 SQTDARLDMMIVLDVSRSM--ESFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + D+M+ LD+S SM F D + ++ A + + + + Sbjct: 88 SVEKSARDLMLALDLSGSMDARDFRDQNGQQQNRLSAAKQVLKDFAAQRDGD-------R 140 Query: 221 SGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F N ++ F + + ++ G ST + A + +F R Sbjct: 141 LGLIVFGNAAYLQAPFTDDIATWLTLLEESEVAMAGPSTALGDAIGLAIS-LFQASETRN 199 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 +++V +TDG T + + A +Y + + + Sbjct: 200 ----------RVLVVLTDG--NDTGSRVPPVDAASIAAANDVTIYTVAVGDPSTIGEEAL 247 Query: 333 -HEFLRACASP--NSFYLVENPHSMYDAFSHIGK 363 E L A AS + +L + ++ DA+ I + Sbjct: 248 DLETLDAIASSTRGASFLALDTLALKDAYEQINR 281 >gi|108995558|ref|XP_001094724.1| PREDICTED: von Willebrand factor A domain-containing protein 1 isoform 2 [Macaca mulatta] Length = 438 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 27/218 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + R D+M +LD S S+ + S + + + + V +P + Sbjct: 21 GAESGPPASAPRGDLMFLLDSSASVSHYEFSRV------REFVGQL---VAPLPLGTGAL 71 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ LV + F S Q ++ + T++ L YA Q+F Sbjct: 72 RASLVHVGTRPYTEFPFGQHSSGEAAQDAVRASVQRMGDTHTGLALAYAKEQLF------ 125 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K++V++TDG + E K G V+ + E Sbjct: 126 AEAAGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNFLELSA 180 Query: 338 ACASPN----SFYLVENPHSMYDAF-SHIGKDIVTKRI 370 A ++P F V++ H + I + +++ Sbjct: 181 AASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQL 218 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 81/239 (33%), Gaps = 34/239 (14%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 S++ SR P S S +V + +++ +V+D S SM + Sbjct: 57 SLTLTSRLSHPYVPAG------TSEVFATFDLSGAQVPGAQRSPVNLALVIDRSGSMSGY 110 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---EWGVSHLQ 244 K+ A ++ ++ + + ++ + + ++ L + Sbjct: 111 ------KLAQAKQAARHLIGLLNDQD------RLAIIHYGSDVKSLPSLEATAANRERMF 158 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STK 303 + + + G TN GL Q+ Q ++ M+DG+ Sbjct: 159 QYVDGIWDEG-GTNIGAGLSAGRYQLSTAQRTYGVNR---------LILMSDGQPTEGLT 208 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 D++ E + G + AIG+ + + ++A A ++ +E+ + F Sbjct: 209 ADEELTRMARELRATGLTLSAIGVGTDFNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQK 267 >gi|306922629|gb|ADN07507.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 65/172 (37%), Gaps = 21/172 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ + Sbjct: 39 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ R I+ LS G +T + L++ + +F H K+ Sbjct: 92 EFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVF-----LPHLTRPG--IPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + +TDG K + K +G ++A+GI+ E R + P Sbjct: 145 CILITDG-----KSQDLVDPAAQKLKGQGVKLFAVGIKNADPEELKRVASQP 191 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 75/228 (32%), Gaps = 33/228 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P S + ++D + +L +R + ++ + ++ + + Sbjct: 1036 IRGPEVSVTQKPVCPHGQVDAVFLLHTTRDNAHNAEVVRRVLERLVSALGPLGPQA---- 1091 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYA 266 Q GL+++S++ F L SH R I Y+ G N + A Sbjct: 1092 -----AQVGLLSYSHRPSPLFPL--NSSHDLGIILQKIRDIPYVDPSGN--NLGTAVITA 1142 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++ + + ++V + D ++A+ G V +G Sbjct: 1143 HSHLLAPNAPGR-----RQRVPGVMVLLVD-----EPLRGDIFSPIHDAQASGLKVMILG 1192 Query: 327 IRVIRSHEFLRACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + R + +F+ ++N S+ A S + + + Sbjct: 1193 MAGADPEQLRRLAPGTDTIQNFFALDNGPSLDRAVSDLAVALCQAAVT 1240 >gi|306922621|gb|ADN07500.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 65/172 (37%), Gaps = 21/172 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ ++ ++ + V+ V +S+ + Sbjct: 39 DIVFLLDGSSSIG---RSNFREVRGFLEGLVLPFSGAASAQG----VRFATVQYSDDPQT 91 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ R I+ LS G +T + L++ + +F H K+ Sbjct: 92 EFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVF-----LPHLTRPG--IPKV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + +TDG K + K +G ++A+GI+ E R + P Sbjct: 145 CILITDG-----KSQDLVDPAAQKLKGQGVKLFAVGIKNADPEELKRVASQP 191 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 31/226 (13%), Positives = 74/226 (32%), Gaps = 33/226 (14%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P S + ++D + +L +R + ++ + ++ + + Sbjct: 1038 GPEVSVTQKPVCPHGQVDAVFLLHTTRDNAHNAEVVRRVLERLVSALGPLGPQA------ 1091 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 Q GL+++S++ F L SH R I Y+ G N + A++ Sbjct: 1092 ---AQVGLLSYSHRPSPLFPL--NSSHDLGIILQKIRDIPYVDPSGN--NLGTAVITAHS 1144 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + ++V + D ++A+ G V +G+ Sbjct: 1145 HLLAPNAPGR-----RQRVPGVMVLLVD-----EPLRGDIFSPIHDAQASGLKVMILGMA 1194 Query: 329 VIRSHEFLRACASPN---SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + R + +F+ ++N S+ A S + + + Sbjct: 1195 GADPEQLRRLAPGTDTIQNFFALDNGPSLDRAVSDLAVALCQAAVT 1240 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 62.9 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 74/231 (32%), Gaps = 30/231 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +++V+DVS SM++ D T++ A + + + + Sbjct: 71 FSVALAGPTEDKRVPRNRATVILVIDVSLSMKA-TDVEPTRLAAAQDAAKSFADGLTPGI 129 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV F+ + I L + T + + + + + Sbjct: 130 N------LGLVAFAGTASVLVSPTTNREASKVAIDNL-QLSERTATGEAI---FTSLQSI 179 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-- 328 + D IV ++DG+ + + + +AK + + I Sbjct: 180 DTLAAVLGGSDQAPPARIVLLSDGKQTVPENPDDPRGGFTAARQAKDKDVPISTISFGTS 239 Query: 329 ------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V LR A S SF+ + + D + + + I Sbjct: 240 YGKVEIEDERIPVPVDDPSLREIANLSGGSFFTASSLEELRDVYDTLEEQI 290 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 85/270 (31%), Gaps = 52/270 (19%) Query: 124 NEGYSISAISR----YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + GY+ Y F + ++ S + +D+++ LD Sbjct: 36 SSGYAFEKAPSTLRVYMRHFPFLLRVIAIALVIIVLARPQSVNSSDISKSEGIDIIMALD 95 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 +S +M + D S T+++ A K + + + + + GLV F+ + L Sbjct: 96 ISGTMMAN-DFSPTRLEAAKKVASEFINDRQSD-------RIGLVIFAGESFTQCPLTTD 147 Query: 240 VSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 L + + T GL + N++ + + ++++ +TDG Sbjct: 148 HRVLLNLLSEVKFGMIEDGTAIGLGLANSVNRL-----------KDSQSKSRVVILLTDG 196 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA----------------- 340 N + L A G VY IGI R R Sbjct: 197 SNNA--GQIAPLTAAELAASYGIRVYTIGIGS-RGTSVARVMTPYGMQSMNVSGDFDERT 253 Query: 341 -------SPNSFYLVENPHSMYDAFSHIGK 363 + S++ + S+ + I + Sbjct: 254 LTEIASKTGGSYFRATDNTSLSGIYDEIDQ 283 >gi|73695954|gb|AAZ80783.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 83/210 (39%), Gaps = 27/210 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ IVLD S S+ + AIK + L + ++ D N V+ ++++ Sbjct: 42 CRQEPLELGIVLDSSSSLRN------RDFKTAIKFLQEFLSQYEISSDPNG-VRVSIISY 94 Query: 227 SNKIEEFFLLEWGVSHLQ----RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 I + I + K G+ T++ ++Y + +R Sbjct: 95 GKGIYPEIGFNLTTYDTKDEVIEAIGRIPHKAGLRTDTGRAIQYMHEAQLANGVVRPG-- 152 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 K+ + +TDG +++E + + EA+K +++AIG+ IR+ E L Sbjct: 153 -----VTKVSIVITDG---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELLNIAG 204 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + V+N + + S I + + + Sbjct: 205 DQSRVTKVDNYNQL----SSIKESLAHQTC 230 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 84/223 (37%), Gaps = 33/223 (14%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P S+V S ++++LD S SM++ S +TKI+ A K++ ++ + Sbjct: 20 LPTVSAVADVSDDPVPGKLLLMLDASGSMKAKDPSGLTKIEAAKKALTGVVGALP----- 74 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + Q GL + ++ + ++ G T P L A + I + Sbjct: 75 -DTAQVGLRVYGATVDGKGKP---TPAACADTQLITPIG--TIDKPALTSAISAINALGE 128 Query: 276 M------RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-----RGAIVYA 324 + K+ IV ++DGE T + C KK + Sbjct: 129 TPIAHSLTEALKDLGPTGKRNIVLVSDGEESCTPD------PCPAVKKLTAGGVDLQIDT 182 Query: 325 IGIRV-IRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGK 363 +G V ++ L+ C A ++Y ++ ++ + S + + Sbjct: 183 VGFGVNAKARSQLQ-CIADAGKGTYYDAKDAPALAASLSKLSQ 224 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 74/222 (33%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVAAEFING---RPNDN----IGLTVFAGEAF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + ++ I T GL A +++ D + Sbjct: 140 TQCPLTVDHGVLLNLFNSIKGDIAQRGMIEDGTAIGMGLANAISRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G VY IG+ Sbjct: 193 ----KVIILLTDGSNN--RGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 YVNVPVEIDEQTLTQIAGTTNGNYFRATSNSKLEEVYREIDK 288 >gi|254776723|ref|ZP_05218239.1| hypothetical protein MaviaA2_18931 [Mycobacterium avium subsp. avium ATCC 25291] Length = 309 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 67/219 (30%), Gaps = 37/219 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+D+S+SM + D ++ A ++ + ++ + GLV F+ Sbjct: 74 IMLVIDMSQSMRA-TDVEPNRLKAAEQAASQFASQLTPGIN------LGLVGFAGTPYLL 126 Query: 234 FLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 Q I L K F ST + + A + I D I Sbjct: 127 VPP---TPQHQATIDALKKLDFADSTATGEAIFTALHAISATA-----VAGGDTPPPARI 178 Query: 292 VFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRSH-------------- 333 V ++DG AK G + I Sbjct: 179 VLLSDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVST 238 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ A S Y N + +++ I +I + + Sbjct: 239 DQMKMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 277 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 69/223 (30%), Gaps = 45/223 (20%) Query: 173 DMMIVLDVSRSM----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM I ++ I A +E + G+V FS Sbjct: 112 DLMLCIDLSGSMLSEDYERGGDRINRLQAIKPVIQAFIERRPSD-------RIGIVLFSG 164 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + L +++ + T GL ++ + + Sbjct: 165 RAYTMAPLTFDHRWLGSQLERIKVGLIEDGTAIGDGLGVGLTRL-----EQAQRESGGKR 219 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------IRSHEF- 335 +V +TDG N + AK RG VY IG + + Sbjct: 220 QGAFVVLLTDGANN--RGSLTPQQAAELAKARGIPVYTIGAGQDGIVPFPVFDDKGRKLG 277 Query: 336 ------------LRACA--SPNSFYLVENPHSMYDAFSHIGKD 364 LR A + F+ + ++ AF I + Sbjct: 278 YRRIMSDLDEGALRDIAEMTGGHFFRAADVGTVESAFRAIDRA 320 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 32/73 (43%), Gaps = 1/73 (1%) Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYDAFS 359 ++ D + C++ K + +++ IG R + +R CAS + Y + +AFS Sbjct: 481 NSTADGYTEQICDQLKAQDVVIFTIGFEAPQRGQDLMRYCASSSGHYFDVEGVEISEAFS 540 Query: 360 HIGKDIVTKRIWY 372 I I R+ Sbjct: 541 SIANTIQQLRLSL 553 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 70/231 (30%), Gaps = 45/231 (19%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G +T + ++ ++ G+ I++ ++T L + DR+++ AA Sbjct: 24 FCRDEAGVLTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAADL------ 77 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + G +D + S L+ + Sbjct: 78 --------------DQTTNAKAVVEDYFAKAGMSQYLDGVQVSKGLN--------FKEVE 115 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A IP F + + S + Q +++ +VLD+S SM Sbjct: 116 AQVSATIPTWFM----NMSGIETLDAFARSKAEERIQN---IEVSLVLDISGSMGWDG-- 166 Query: 191 SITKIDMAIKSINAMLEEV-----KLIPDVNNVVQSGLVTFSNKIEEFFLL 236 K+ + + + + + D + ++ + + L Sbjct: 167 ---KLANMRTAADQFVRTMMAGNDNVAADGTGLTSVSIIPYHAVVNVPDEL 214 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 80/213 (37%), Gaps = 33/213 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 LD++ +LD S S+ + ++ +K++ D++ G++ +S+ Sbjct: 175 DLDLIFLLDGSGSVTVPDPLNFDRVKQWVKNVTDRF-------DISTFANVGVIQYSHYY 227 Query: 230 ----IEEFFLLEWGVSHLQRK------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + +E G+ + + + + G +T + L + + Sbjct: 228 ETRTVQPYMKVEIGLGQYKTQAEFQMAVDSIQFQGFTTFTAHALNRTVEEFMNSTRYS-- 285 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D +K+IV +TDG++ + +++ Y A+ G ++A+G+ E L+ Sbjct: 286 ----DPTTRKVIVLLTDGQSNDREFLEETSAY---ARGLGITIFAVGVEGYSEEE-LQII 337 Query: 340 ASP-----NSFYLVENPHSMYDAFSHIGKDIVT 367 S + ++ + + I Sbjct: 338 TSGELGNNERVFGLDTFSDLNKVVDSLQLSITG 370 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 75/228 (32%), Gaps = 37/228 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ------- 220 D +D+ VLD S S+ + N ++++ + Sbjct: 32 QDCPVDLFFVLDTSESVALRVKPFGDLVAQVKDFTNRFIDKL-----TERYFRCDRFLAW 86 Query: 221 -SGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +G + +S+ + L G + L+ + ++ G T++ +K ++ Sbjct: 87 NAGALHYSDSVVIIKDLTAMPSGRAELKNSVSAINYIGKGTHTDCAIKQGIERLLLGG-- 144 Query: 277 RQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 H K ++ +TDG L + NEAK G V+++ I + Sbjct: 145 -SHLKEN-----KYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAISPHHLDQ 198 Query: 335 FLRACASPNSF---YLVE--------NPHSMYDAFSHIGKDIVTKRIW 371 L A+ +++ + + + + KD + + Sbjct: 199 RLNIIATDHAYRRNFTATSLKPTRDLDVEETINNIIEMIKDNMEQSCC 246 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 59/163 (36%), Gaps = 19/163 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFS 227 D+M+++D S S+ S D + + E ++ + V+ +V +S Sbjct: 820 ANPADIMLLVDSSTSVGS------KNFDTTKNFVKRLAERFLEASKPAEDSVRVSVVQYS 873 Query: 228 NKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ + + + + + + +T+ L+Y ++ Sbjct: 874 GRNQQKVEVPFQRNYTVIAKAVDNMEFMNEATDVNAALQYIM--------GLYQRSSRSG 925 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 KK++VF +DG + E ++ G VY + + Sbjct: 926 AKKKVLVF-SDGNSQGITARAIERTV-QEVQQAGIEVYVLAVG 966 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 58/155 (37%), Gaps = 24/155 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + +A I +++ + V + G+ Sbjct: 607 CTCGPVDLLFVLDSSESIG------LQNFQIAKDFIIKVIDRLSKDERVKFEPGESRVGV 660 Query: 224 VTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + L++ +K L T++ L+++ + R+ Sbjct: 661 VQYSHNNTQELVAMGDANIDNIGALKQAVKNLKWIAGGTHTGEALQFSKENLL-----RR 715 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + + + + +TDG + + ++ C Sbjct: 716 FTSNNN-----VAIVITDGRSDTLRDRTPLTSLCE 745 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 75/228 (32%), Gaps = 37/228 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ------- 220 D +D+ VLD S S+ + N ++++ + Sbjct: 32 QDCPVDLFFVLDTSESVALRVKPFGDLVAQVKDFTNRFIDKL-----TERYFRCDRFLAW 86 Query: 221 -SGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +G + +S+ + L G + L+ + ++ G T++ +K ++ Sbjct: 87 NAGALHYSDSVVIIKDLTAMPSGRAELKNSVSAINYIGKGTHTDCAIKQGIERLLVGG-- 144 Query: 277 RQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 H K ++ +TDG L + NEAK G V+++ I + Sbjct: 145 -SHLKEN-----KYLIVVTDGHPLEGYKEPCGGLDDAANEAKHLGIKVFSVAISPHHLDQ 198 Query: 335 FLRACASPNSF---YLVE--------NPHSMYDAFSHIGKDIVTKRIW 371 L A+ +++ + + + + KD + + Sbjct: 199 RLNIIATDHAYRRNFTATSLKPTRDLDVEETINNIIEMIKDNMEQSCC 246 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 59/161 (36%), Gaps = 19/161 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNK 229 D+M+++D S S+ S D + + E ++ + V+ +V +S + Sbjct: 801 PADIMLLVDSSTSVGS------KNFDTTKSFVKRLAERFLEASKPAEDSVRVSVVQYSGR 854 Query: 230 IEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + + + + + + +T+ + L+Y + + A Sbjct: 855 NQQKVEVPFQRNYTVIAKAVDNMEFMNEATDVSAALQYV---------TGLYQRSSRAGA 905 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 KK ++ +DG + E ++ G VY + + Sbjct: 906 KKKVLVFSDGNSQGITARAIERTV-QEVQQAGIEVYVLAVG 945 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 58/155 (37%), Gaps = 24/155 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + +A I +++ + V + G+ Sbjct: 586 CTCGPVDLLFVLDSSESIG------LQNFQIAKDFIIKVIDRLSKDERVKFEPGESRVGV 639 Query: 224 VTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + L++ +K L T++ L+++ + R+ Sbjct: 640 VQYSHNNTQELVAMGDANIDNIGALKQAVKNLKWIAGGTHTGEALQFSKENLL-----RR 694 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + + + + +TDG + + ++ C Sbjct: 695 FTSNNN-----VAIVITDGRSDTLRDRTPLTSLCE 724 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 76/209 (36%), Gaps = 32/209 (15%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ VK L +MIV+D S SM K+++A ++ +E ++ Sbjct: 391 LPVEMEVKGK-HELPSLGLMIVMDRSGSM------MGLKMELAKEAAARSVELLRSDDT- 442 Query: 216 NNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 G++ F ++ E KI ++ G T L+ AY ++ D+ Sbjct: 443 -----LGVIAFDDQPWEILPTGKVDDPKKAADKILSITP-GGGTEIYRSLEQAYTELEDL 496 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + R+H I+ +TDG++ ++ + + K + + I Sbjct: 497 ELQRKH-----------IILLTDGQSSTSNDYDALI---ENGKDHNITLSTVSIGQDADR 542 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSH 360 L A FY V + ++ S Sbjct: 543 NLLEQLAGTGSGRFYDVTDATTIPAILSR 571 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 62.9 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 70/196 (35%), Gaps = 32/196 (16%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V + ++IV+D S SM +T +++A ++ +++ + + + Sbjct: 183 VQANVPKPKQIVIVIDKSGSMG------VTNMNLAKEAAKSVVNTLNPQD------RFAV 230 Query: 224 VTFSN---------KIEEFFLLEW------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + FS+ ++ F + ++ + +S G TN P L+ A++ Sbjct: 231 MAFSSIFVPFQSTVASDQCFATTFADASPQNKKKVEDFVDTISS-GGGTNYAPALQKAFS 289 Query: 269 QIFDMQGMRQHCNT--EDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAI 325 + + + ++I+FM+DG S NE I+ Sbjct: 290 FFQQEPSVSDFNIKKIDPSEIDRVILFMSDGIPNDPGSTILSAQIRANEQLNNSVIILTY 349 Query: 326 GIRVIRSHEFLRACAS 341 G+ LR A+ Sbjct: 350 GLGNAD-FGVLRNMAT 364 >gi|326433400|gb|EGD78970.1| hypothetical protein PTSG_11807 [Salpingoeca sp. ATCC 50818] Length = 2673 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 55/316 (17%), Positives = 100/316 (31%), Gaps = 25/316 (7%) Query: 61 AATQIMN-EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS----TS 115 AT+ + + +N L R T R D+ +V ++ V + Sbjct: 536 TATETVKPDPPTSNGDPSVMSKRLRRHITTLASVVRGVFGDDAYVRVLEAYVEPPADISK 595 Query: 116 LDIVVVPQNEGYSISAIS--RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA--- 170 + V + +I + L + ++ S+ V+ D Sbjct: 596 ASLHNVGRAARITIEGVPDDFASDRLGVLGGLAVEAGFDYVAYTSRDSLYVSVIADQCRQ 655 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN-VVQSGLVTFSNK 229 LD++ +LD S S+ES + A + +V + + + TF++ Sbjct: 656 NLDLVFLLDGSGSIESTALGGAP--GTFQDRVLAFVSQVTTYFTIGEHDTRVAVATFASG 713 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L L+ I + T ++ GL+ I Sbjct: 714 ATVNIRLNDHFDGDALRDAIADIPYPQGQTYTSLGLRAVRQDILTEAN---GMRPASEGV 770 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +++V +TDG N D + + + V+AIG+ S L AS Sbjct: 771 PRVLVVLTDG-NSQPSYDPATE--ASILHDQNVNVFAIGVGSSISQSQLEDIASDPDARH 827 Query: 348 VENPHSMYDAFSHIGK 363 V N +FS IG Sbjct: 828 VFN----LRSFSLIGD 839 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 59/195 (30%), Gaps = 61/195 (31%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS----------TPGLKYAYNQIFDMQGMRQHC-NT 282 +L + L+ + L+ + +P L+ A D + H + Sbjct: 127 YLNRNNLGGLKAAVDRLTLSDGTGMDIGLLWEAKALSPKLRTA--AALDGGLLPGHPTDW 184 Query: 283 EDANYKKIIVFMTDG-------------------------------------------EN 299 D +K+IV MTDG N Sbjct: 185 SDKQTQKVIVLMTDGGITAQYRPKDPWKGLNPKDMRRGIVNARRNVQYVTTRGNMNSPAN 244 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF----LRACASPNSFYLVENPHSMY 355 C++AK +G I+Y +G + IR + L CA+ S Y + Sbjct: 245 SKHNSVAYMKTMCDQAKAKGIIIYTVGFQ-IRRNTLPDLSLSYCATSPSHYYFVESSDLS 303 Query: 356 DAFSHIGKDIVTKRI 370 AF I I + RI Sbjct: 304 AAFKAIASSIKSLRI 318 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 76/210 (36%), Gaps = 33/210 (15%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 I + D R D+ I+ D S S+E F + K ++E++ + P+ Sbjct: 375 IVQKTEQLPVKDCRYDIGIIFDSSGSLEKNFQKQL-------KFATTLVEQMPISPNA-- 425 Query: 218 VVQSGLVTFSNKIE----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + ++ F+ K + F + + L+ I F +T + LK Sbjct: 426 -TRVAIIQFAGKTKLRVLADFAQKKSAAELKTIIGRSHFFSGTTFTNGALKTM------- 477 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++ A+ K +V TDG + + K +G +VY +GI +S Sbjct: 478 --ADLFQKSKRADAKLKVVLFTDG-----YSAEDTSEGAEALKSQGVVVYTVGISTEKST 530 Query: 334 EF----LRACA-SPNSFYLVENPHSMYDAF 358 L A SPN F+ + + F Sbjct: 531 GLNMKELHGMATSPNHFFNASDFVELSKNF 560 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 52/131 (39%), Gaps = 24/131 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++ ++D + S+ +FF+ + + ++E + + P V+ + G + +S+ Sbjct: 160 PSDVIFIIDATSSVRTFFEQYV-------HFVEKVIEGLDIQPSVD---RVGAIVYSSAH 209 Query: 231 EEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + G L ++ L F T + LK+A N H N Sbjct: 210 KQRVKISLGEHKDKGSLIAAVESLPFFSGITATGEALKFAAN----------HTEGRRQN 259 Query: 287 YKKIIVFMTDG 297 V +TDG Sbjct: 260 LTLTFVVLTDG 270 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM K+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGAKLRQTKDALFTILHDLRPQD------RFNIIGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + I ++S G T+ L+ A + H + ED + Sbjct: 344 KDNLISVTPNSIRDGKIYIHHMSPTG-GTDINGALQTAIRLL---NNYVAHNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +IVF+TDG+ + +L N K+ ++ IGI + L C Sbjct: 400 -LIVFLTDGKPT--VGETHTLKILNNTKEAAQGRVCIFTIGIGNDVDFKLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + EN + + I Sbjct: 457 LTRRVHEEENAGAQLIGFYDEI 478 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 76/204 (37%), Gaps = 35/204 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D++ VLDVS SM +K+ + +++ +++ + + +V+FS Sbjct: 201 DRAPIDLVAVLDVSGSMAG------SKLSLLKRAVCFLIQNLGPSD------RLSIVSFS 248 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G I L+ G TN GLK + + Sbjct: 249 STARRIFPLRRMSDNGREAAGLAINSLTSSG-GTNIVEGLKKGVRVLEERSE-------- 299 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-------KKRGAIVYAIGIRVIRSHEFL 336 N I+ ++DG++ ++ + A ++ V+ G + Sbjct: 300 -QNPVASIILLSDGKDTYNCDNVNRRQTSHCASSNPRQGRQAIIPVHTFGFGSDHDSTAM 358 Query: 337 RACA--SPNSFYLVENPHSMYDAF 358 A + S +F +E+ ++ DAF Sbjct: 359 HAISDESGGTFSFIESVATVQDAF 382 >gi|156257452|gb|ABU63134.1| microneme 1 precursor [Eimeria tenella] Length = 675 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 61/154 (39%), Gaps = 22/154 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+M+V+D S S+ + S+ K+ I+ + V +P V+ GL+TF+ + + Sbjct: 13 LDVMLVVDESGSIGT---SNFRKVRQFIE------DFVNSMPISPEDVRVGLITFATRSK 63 Query: 232 E----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 S + LS T + GL+ A ++D N Sbjct: 64 VRWNLSDPKATNPSLAISAARSLSYSTGVTYTHYGLQDAKKLLYDTN------AGARNNV 117 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K+++ MTDG + Q+ + GAI Sbjct: 118 PKLVLVMTDGASNLPS---QTRSSAAALRDAGAI 148 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 96/273 (35%), Gaps = 52/273 (19%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +IR F+ N G T++TA+ + +++G ++ + + K L +D++ + Sbjct: 13 SIRKFYENELGVYTVMTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVA--- 69 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMS--------FRNELRDNGFVNDI-------DDIV 111 N + R+ K GD+ ++ + + + + E R+ + I ++ Sbjct: 70 --ENNDYRENKKHGDVNRQVVSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENAN 127 Query: 112 RSTSLDIVVV------------PQNEGYS--------ISAISRYKIPLKFCTFIPWYTNS 151 S+ I + P Y+ + K L + Sbjct: 128 ASSDAPITIDKPFHYSCEELDLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLVSADKL 187 Query: 152 RHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFD---------SSITKIDMAI 199 + + + S + +++M+V D S SM S + IT + + Sbjct: 188 KQDRIRMESDTSYAIKEKGIVIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVV 247 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 I+ +L + V+ + G TFS + + Sbjct: 248 SEISKILLPEDVSEGVSPFNRIGFTTFSGGVRQ 280 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 49/140 (35%), Gaps = 20/140 (14%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + R++ + G ++ S+ L A + + + + N +++I+ ++DGE+ Sbjct: 433 KDITRELNIVRPSGWTSASSGLLVGANIMMDENKSPDAKPSKLGTNIQRVILVLSDGEDN 492 Query: 301 STKEDQQSLYY----CNEAKKR--------------GAIVYAIGIRVIRSH-EFLRACAS 341 + C++ +++ A G + + C Sbjct: 493 WPTYSTLTTLLNNGMCDKIREQLGKLQDPNLRELPGRIAFVAFGYSPPANQVAAWKKCV- 551 Query: 342 PNSFYLVENPHSMYDAFSHI 361 + +Y + + ++F I Sbjct: 552 GDQYYTAYSKEELLESFKQI 571 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 80/219 (36%), Gaps = 31/219 (14%) Query: 147 WYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 TN++ + I ++ + L++ +VLD S SM ++ ++ + Sbjct: 17 NQTNTQRQFSLAICATGDQD--KTLPLNLCLVLDHSGSMAG------KPLETVKQAAIEL 68 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++++ + + ++ F ++ + + ++ + +I L G T GL Sbjct: 69 VKQLNVED------RLSIIAFDHRAKVLVPNQGIDNLNTIIEQINSLKPAG-GTAIDEGL 121 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K + + + R I +TDGEN ++++ L + A + Sbjct: 122 KLGIQESANGKKDRVSQ----------IFLLTDGENEH-GDNERCLKLAHVASDYNITLN 170 Query: 324 AIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 +G + + L + + +E P + FS Sbjct: 171 TLGFGNHWNQDVLEKISDSAGGTLCYIETPDKAIEEFSR 209 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 80/234 (34%), Gaps = 28/234 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + I I LP ++ + E+SH K L I+++ + A I N Sbjct: 17 FKKNEQGTILISFMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATL--ALTIENNAI 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + +++K ++ N + L F I +I +T Y+ + Sbjct: 75 PDEPQQIKNNALVLSYANAY-------LPSKEFSVPIININDNTY--------YLEYNAA 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 Y T + +I + + D++ V D S SM FD Sbjct: 120 VTMAYPAKFLTQTSLTNAITDINITDNGVAIKNKAIEASDLTDVIFVADYSGSMLYNFDV 179 Query: 191 SITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + I ++ + ++ +I + +N+ G + WG + Sbjct: 180 NEPNDHERINALRSAFRKLHDIIMNNSNINAIG----------YIPFSWGTKRI 223 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 62.5 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 95/240 (39%), Gaps = 28/240 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS------------SITKIDMA 198 ++ + + + + + +L++ +VLD+S SM S FD + T+ Sbjct: 156 GYYLSVGLNTGIIESDFQRKKLNLALVLDISGSMGSSFDEYYYDRFGNHVAVNDTEDAEK 215 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFG 254 K A V L+ + + + GLV F+ E + + L+ + +S G Sbjct: 216 SKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATG 275 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + G++ A +++D + + Y+ I+F+TD S + ++SL E Sbjct: 276 -GTRLSAGMQMA-TELYD-----EFLEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIE 328 Query: 315 AKKRGAIVYA--IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A VY IGI V + E + ++Y V + + + +VT + Sbjct: 329 ANANK-NVYTTFIGIGVDFNTELVEYITKIRGANYYSVHSATQFKERMDDEFEYMVTPLV 387 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 61/340 (17%), Positives = 118/340 (34%), Gaps = 20/340 (5%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN-NRKKLKGGDILCRIKNTWNMSFRNEL 98 VS K + S+ ++ NG N K G + + ++ Sbjct: 26 VSQKESKKIDNEKQVVHSIPVTFEEMTKYPNGKYNGKYKVGLLDISKESEPIMKEVAHDT 85 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 +N D+ + S D+ V I K+PL+ + Sbjct: 86 EENVKSADLKNKSNSEQADLYVHTMYSLLKQEIIPFDKMPLQILGIGRVEDEEKK--SNG 143 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + N + ++ I+LD S SM D + K+D+A ++I + ++ +V+ Sbjct: 144 TKGEQKNKEDRGNYNIEILLDASGSMAGKIDGKM-KMDIAKEAIQQFVSDLPEAVNVSLR 202 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST--NSTPGLKYA-YNQIFDMQG 275 V + +K + + L K+ +T S G + + + + Sbjct: 203 V------YGHKGSNDEKDKTASCGAIENVYTLQKYNQTTLRQSLDGFQPVGWTPLAEAIK 256 Query: 276 MRQHCNTEDANYKKIIVFM-TDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RS 332 K I+++ +DG E +++ N K + IG +V + Sbjct: 257 RSTETFQSAKANDKNIMYVVSDGVETCGGNPVEEAQKVSNSNIKPIMNI--IGFQVDHEA 314 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L+ A S + L N + D F GKDI ++R+ Sbjct: 315 EKQLKEIAEVSKGKYVLANNAKELQDQFKETGKDITSRRL 354 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVASEFING---RPNDN----IGLTIFAGESF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + ++ I T G+ A ++ D + Sbjct: 140 TQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIANAVTRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G +Y IG+ Sbjct: 193 ----KVIILLTDGSNN--RGDISPLTAAEIAKQFGIRIYTIGVGTNGTAPYPMQTYAGTQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + +++ + + + + I K Sbjct: 247 YVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 80/228 (35%), Gaps = 23/228 (10%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + W I P + + AR D+++ +D+S SM++ + + + ++ + Sbjct: 68 LIWILLVIAISQPERLGAPIETSKSAR-DLILAIDISGSMDTRDFTDAS--NENVQRLAG 124 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + V+ + + L+ F +K L + + + G+ Sbjct: 125 VRDVVRAFVEGREGDRMALIVFGSKAYLQSPLTEDTGTIVELLDQ---------TEVGMA 175 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + I D G+ +++++ ++DG T L A+ G ++ Sbjct: 176 GPHTAIGDAIGLSIRTFEASEIEQRLLILLSDG--ADTASRMSPLNAAEIARGAGVEIFT 233 Query: 325 IGIRVIR--SHEFLRACA-------SPNSFYLVENPHSMYDAFSHIGK 363 I + + A + S++ + ++ + ++ I + Sbjct: 234 IAVGDPDATGENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|156383257|ref|XP_001632751.1| predicted protein [Nematostella vectensis] gi|156219811|gb|EDO40688.1| predicted protein [Nematostella vectensis] Length = 161 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 +D+ VLD S S+ + + + N M+ VK + GLV +S+ Sbjct: 1 AVDLAFVLDGSTSINNADPGNFQLLK------NFMINIVKSFKISSERTHVGLVLYSSFT 54 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +++ F + + + I T + LK A +Q+F + Sbjct: 55 QLKFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFGASM---------RSVP 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 K+++ +TDG + + K G +++A+G+ L AS + Sbjct: 106 KVLIVLTDGRSSD-----KVEAPSKALKDEGVVIFAVGVGDQIDPSELNVMASDSK 156 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 82/224 (36%), Gaps = 43/224 (19%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 IT +V S+ +D++ LD+S SM D +++D A K + E + Sbjct: 84 ITRAVTEASEKG--IDIVFALDISESMLEE-DFEGSRLDAAKKIALRFIRE-RPQD---- 135 Query: 218 VVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + GLV F K L + L R++ + T + + Sbjct: 136 --RFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAISDKGTAIGSAI------LVGTN 187 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-- 332 +R + +++++ +TDGE+ S + + A+ G +Y IG+R Sbjct: 188 RLRASVSK-----ERVLLLLTDGEHNSGEVGPVTAS--EIAQSEGIRIYVIGVRNEEEAG 240 Query: 333 -------------HEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + L A + ++ + +S+ DAF I Sbjct: 241 SPESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLKDAFGEI 284 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 75/203 (36%), Gaps = 25/203 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++++L+ S+SM+ +T K + + ++ + ++ G + + + Sbjct: 45 DIILLLENSQSMQENDPEFLT-----QKFVLNFINGLRGDSQIAIILFDGGTSVAMPLVP 99 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + L+ G NS G++ A ++ N +K I+ Sbjct: 100 VSQKT--RDALVTNLDKLTYTGRFRNSAAGMERALYEL---------KNFGRPEAEKAII 148 Query: 293 FMTDG--ENLSTKEDQQSLYY-----CNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 +T+G E K D+ + +EA + G V+ I L+ A + Sbjct: 149 LLTNGPIETGDEKRDRDFSRWMQEYLAHEAAEAGIKVFGIAFTEAADFHLLQILAHTTGG 208 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 ++Y + AF+ I K I Sbjct: 209 TYYRAPQAVDLQSAFNRIRKVIT 231 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 64/215 (29%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+VLD+S SM + ++ K ++ + GLV F++ Sbjct: 86 DLMLVLDLSYSMSKEDMLDDGDYVDRLTAVKKVVSDF-------ASKREGDRLGLVLFAD 138 Query: 229 KIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ L G T G+ A D Sbjct: 139 HAYLQTPLTLDRKTIAEQVNQLVLRLIGEKTAIGEGIGLATKTFVDSDA----------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 ++++V ++DG N L AKK A +Y IG+ Sbjct: 188 PQRVMVLLSDGSNT--SGVLDPLEAAKIAKKYNATIYTIGVGAGEMVVKEFFMTRKVNTA 245 Query: 332 ---SHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L A + ++ + + + I Sbjct: 246 QDLDERTLMDIAQVTGGQYFRARDAKELATIYDTI 280 >gi|74002027|ref|XP_544943.2| PREDICTED: similar to anthrax toxin receptor 2 [Canis familiaris] Length = 646 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 75/220 (34%), Gaps = 31/220 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S + D+ VLD S S+ + + I ++++ V+ Sbjct: 186 GGPVSAQEQPSCRGAFDLYFVLDKSGSVANNW-----------IEIYNFVQQLTERF-VS 233 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQ 274 ++ + FS++ L S + + ++ L T GLK A QI Sbjct: 234 PQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKNVSPVGETYIHEGLKLANEQIQKAG 293 Query: 275 GMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ II+ +TDG + L ++ ++ GA VY +G+ Sbjct: 294 GLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSFGARVYCVGVLDFEQ 341 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + V+ + A I I+ + Sbjct: 342 AQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 378 >gi|324503617|gb|ADY41568.1| Cuticlin-1 [Ascaris suum] Length = 786 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 67/184 (36%), Gaps = 22/184 (11%) Query: 154 IVMPITSSVKVNSQT--------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + I +++ D +LD++ +LD S S+ S + D + +N + Sbjct: 9 LSVSICNAIDYEDNDLAKTCRPLDRQLDLLFILDGSGSV------SGSTFDTQMSMLNRI 62 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +E V++ PD + +++ ++E F L+ ++ + +T + L Sbjct: 63 VEMVEIGPDRTQIAVMQYSSYT-RVEFGFTANPNKEKLRLALQKIRHISGTTRTGKALDK 121 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A G + +I V ++DG + ++ G + + Sbjct: 122 ALQVF--KHGETSGARVNQEDVAQIAVVVSDG-----HSHDDPVPAAQRLRRAGVQILTL 174 Query: 326 GIRV 329 GI Sbjct: 175 GIGA 178 >gi|156403935|ref|XP_001640163.1| predicted protein [Nematostella vectensis] gi|156227296|gb|EDO48100.1| predicted protein [Nematostella vectensis] Length = 247 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 80/220 (36%), Gaps = 19/220 (8%) Query: 154 IVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 +++P+ + S++ R+D+ V+D S SM I + K + L Sbjct: 11 VIVPLLALFVNRSESSPCRRRMDVAFVIDRSASMGDENFGYIKQF--IKKVSHEFLLS-- 66 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + VQ G+V FS+ F L +H L I + G + + LK Sbjct: 67 -----KDSVQVGVVPFSHHYALEFGLTNYTNHKALDAAIDKIQFEGSFSMLSGALKVVQQ 121 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++F Q + + + +V + DG NLS + K G ++ +G+ Sbjct: 122 KLFMPQLVEVKSKAKKDKPLQAVVIVGDGGNLSGSD--ALYESSLALKDTGKRLFVVGLG 179 Query: 329 VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDI 365 + +R AS F+ ++ + + I Sbjct: 180 RLEYEASMRMLASEPPKTHFFNAGTGKNLKNFVKRLANSI 219 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 74/202 (36%), Gaps = 39/202 (19%) Query: 174 MMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SM + D T+++ + + +EE + GLV F+N+ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGLVLFANR 142 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + ++ + R I+ S G ST GL A + + Sbjct: 143 AYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKSVTESSAAS---------- 192 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EFL 336 ++IV ++DG++ + + D + A + G ++ I + L Sbjct: 193 -RVIVLLSDGQDNAHQID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATL 249 Query: 337 RAC--ASPNSFYLVENPHSMYD 356 RA AS Y V + Sbjct: 250 RAIAEASGGRSYRVRGMEDLRA 271 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 75/202 (37%), Gaps = 39/202 (19%) Query: 174 MMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SM + D T+++ + + +EE + GLV F+N+ Sbjct: 103 IVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGLVLFANR 155 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + ++ + R I+ S G ST GL A ++ + Sbjct: 156 AYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGAAS---------- 205 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EFL 336 ++IV ++DG++ + + D + A + G ++ I + L Sbjct: 206 -RVIVLLSDGQDNAHQID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATL 262 Query: 337 RAC--ASPNSFYLVENPHSMYD 356 RA AS Y V + Sbjct: 263 RAIAEASGGRSYRVRGMEDLRA 284 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 81/212 (38%), Gaps = 30/212 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ +Q A +++ V+D S SM KID ++ +E + V+ Sbjct: 35 QIMTQVRAPVNVCFVIDRSGSM------KGEKIDRVRRATIRAIEMLDAQDVVS------ 82 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V F ++ E + L ++ + G T P ++ +I Sbjct: 83 VVIFDHRTEVLIPATPVAKPAELADRVNRVRDSG-GTRIAPAIEAGLREI---------- 131 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + ++ + ++ +TDG+ T+ + L +A +R + A+G+ + + L A Sbjct: 132 DKGPSHMVRRLILLTDGQ---TESESDCLRRAEDAGRRNVPITALGVGKDWNEDLLIEMA 188 Query: 341 --SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + ++ P + D F + + + Sbjct: 189 NRSGGTADYIDRPEKIVDYFQNTIQRAQATTV 220 >gi|73695956|gb|AAZ80784.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 83/210 (39%), Gaps = 27/210 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ IVLD S S+ + AIK + L + ++ D N V+ ++++ Sbjct: 42 CRQEPLELGIVLDSSSSLRN------RDFKTAIKFLQEFLSQYEISSDPNG-VRVSIISY 94 Query: 227 SNKIEEFFLLEWGVSHLQ----RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 I + I + K G+ T++ ++Y + +R Sbjct: 95 GKGIYPEIGFNLTTYDTKDEVIEAIGKIPHKAGLRTDTGRAIQYMHEAQLANGVVRPG-- 152 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 K+ + +TDG +++E + + EA+K +++AIG+ IR+ E L Sbjct: 153 -----VTKVSIVITDG---NSQEWKLTKEAAEEARKDNIVMFAIGVGTDIRNSELLNIAG 204 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + V+N + + S I + + + Sbjct: 205 DQSRVTKVDNYNQL----SSIKESLAHQTC 230 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 166 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 214 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 215 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 267 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 268 --AKKVMIVITDGESHDSPDLEKVIQ---QSEKDNVTRYAVAVLGYYNRRGINPEAFLNE 322 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 323 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 356 >gi|126314598|ref|XP_001371349.1| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 [Monodelphis domestica] Length = 3274 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 79/213 (37%), Gaps = 25/213 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + QT A D++ ++D S S+ + + + ++E + + Sbjct: 17 VSGFGVTDAQEAVQTSAAADIIFLVDSSWSIGK------EHFQLVREFLYDVIESLAVAG 70 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAY-NQI 270 + LV F+ FLL + + + I ++ G + + GL+Y N + Sbjct: 71 SD---FRFALVQFNGNPHTEFLLNTYRTNQEVLSHIANMTYLGGDSKTGRGLRYVIQNHL 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G R ++IV +TDG + + A V AIG++ Sbjct: 128 TPAAGSRA-----RDGVPQVIVVLTDGRSQDDVAGPSAELKSAAA----VDVVAIGVQDA 178 Query: 331 RSHEFLRACASPN---SFYLVENPHSMYDAFSH 360 E L+ A+ + +EN +++D + Sbjct: 179 EEGE-LKEMATEPLDLHVFNLENFTALHDIVGN 210 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 69/187 (36%), Gaps = 21/187 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S+ D+ + + + V I + + +Q GLV +++ + Sbjct: 1636 DIVFLLD--GSINFKRDN-------FQEVLRFVSGIVDTIYEGGDSIQVGLVQYNSDPTD 1686 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ + + I + G N+ GL++ + + D + Sbjct: 1687 EFFLKDFSTKEEILDAINKVVYKGGRQANTLVGLEH-LRKNHFVPEAGSRI---DQRIPQ 1742 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I +T G + ++G V+A+G+R + E R ++ + V Sbjct: 1743 IAFVITGG-----SSVEDVEEATRALSQKGVKVFAVGVRNVDLREVSRIASNSAIAFRVS 1797 Query: 350 NPHSMYD 356 N + + Sbjct: 1798 NVQELSE 1804 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 78/235 (33%), Gaps = 27/235 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + I + P+ Y S + ++P K T I T+ + + + Sbjct: 1368 ARNVDPEELVRISLSPE---YVFSVSTFRELPSLEQKLLTPITTLTSEQIKSILSQTPHP 1424 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 D++ ++D S S+ I I ++ + + P N ++ GL Sbjct: 1425 RPDVEGNAADIVFLVDSSDSVRPEG------IVHIRDFIRNIVRKFDVGP---NRIRIGL 1475 Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHC 280 FSN++ F L S + I+ L G S N+ L++ F Sbjct: 1476 AQFSNEVFPEFYLRTHKSQSDVLEAIRRLRFRGGSPLNTGKALEFVAKNFFVKS---AGS 1532 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ED + +++F+ K G I +G R + E Sbjct: 1533 RIEDGVPQHLVLFL------GGKSQDDVSRAARVMSSSGIIRVGVGSRNVDRTEL 1581 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 15/165 (9%) Query: 189 DSSITKIDMAIKSINAML-EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQR 245 D + T+ +I L + V+ + +++Q + +++ ++ F + S + Sbjct: 449 DGTSTRGSANFNAIRDFLYKVVQKLEIGQDLIQVAVAQYADTVKLGFHFKDFQSRRDVLS 508 Query: 246 KIKYLSKFGVST-NSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I+ L + + L Y N F+ G R K++V +T G++L Sbjct: 509 AIRRLRPINGTVLYTGAALDYVRNNFFNGSVGYRA-----SEGVPKLLVLITGGKSLDDI 563 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 Y E K+ G + +AIG + E P+ + Sbjct: 564 S-----YPAQELKRSGIMAFAIGSPMADRTELEEVAFDPSLVFEP 603 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 85/236 (36%), Gaps = 27/236 (11%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL---DMMIVLDVSRSMESFFDSSITKI 195 + F P + + ++ ++ T + D++ +LD S ++ + + Sbjct: 601 FEPAEFRPGPMQAI-LPGFLSPLRTLSGTTQVPVNKRDIIFLLDGSLNVGNANFPYV--- 656 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKF 253 + ++ + + D ++ GLV FS+ E F L+ S L +++ L Sbjct: 657 ---RDFVMNLVNSLDVGYDN---IRVGLVQFSDTPETEFSLDTYQSKSDLLARLRQLQLK 710 Query: 254 GVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 G ST N L + + N + G R H N + + L Sbjct: 711 GGSTLNIGSALDFVHSNHFTEAGGSRIHENIPQLLLL----------LAAGQSGDSYLQA 760 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 N + G + + +G E + +P+ YL+++ S+ + + + T Sbjct: 761 SNALARAGVLTFCVGASQANKAELEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 816 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 60/184 (32%), Gaps = 24/184 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + + LE + P +Q G+V +S Sbjct: 235 AQESADLIFLIDGS------DKTGSAHFAAIRDFLVNFLERL---PIGARQIQVGVVQYS 285 Query: 228 NKIEEFFLL-EWGVS-HLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 N F L + + +K L G TN L + F G + + Sbjct: 286 NGPRTAFSLNSYSTKADVLDAVKALRVIGGEETNVGAALDFVVENHFTQAGGSR----VE 341 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPN 343 +++V ++ ++ D + A +++ G+ + L+ A+ Sbjct: 342 EGVPQVLVLISGSQSTDDIRDGVVAL-------KQASIFSFGLGAQGADRAELQHIATDE 394 Query: 344 SFYL 347 +F Sbjct: 395 NFVF 398 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 76/216 (35%), Gaps = 28/216 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + R D++ ++D S + S F + ++E + + PD V+ Sbjct: 1018 GPSPGPGERKDIVFLIDGSDRVRSDFP-------FLKDFVRKVVESLDVGPD---RVRVA 1067 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS + F L S + ++ + + G S ++ L + + + Sbjct: 1068 VVQFSERARPAFYL-NSYSDQRAVLQAVRQLSPLGGASLHTGSALDFVLRNVLVESAGSR 1126 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 ++++ ++ G + + L K+ GA+ + IGI E Sbjct: 1127 MAEG----VPQVLIVLSAGRSQDDIQGPAVL-----LKRSGAVPFGIGIGHSDDAEMRTI 1177 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 P+ + P + +H+ + + K I + Sbjct: 1178 SYVPD--FAAVIPT--FQQLAHVQQLVSEKVILLSQ 1209 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 71/216 (32%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 MM+ +D+S SM + ++ A ++ L+ + GL+ F Sbjct: 100 QMMLAVDLSGSMNEPDMVLGGKVVDRLTAAKAVLSDFLDRRDGD-------RVGLLVFGQ 152 Query: 229 KIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L ++ ++ ++ + G T + + ++ E Sbjct: 153 RAYALTPLTADLTSVRDQLRDSVVGLAGRETAIGDAIALSVKRL-----------REQKQ 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-----------IRVIRS--- 332 ++++V +TDG N + L AK G ++ I + + Sbjct: 202 GQRVVVLLTDGVNTA--GVLDPLKAAELAKAEGVRIHTIAFGGGGGSSLFGVPIPAGGND 259 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + LR A + F+ + + ++ + + Sbjct: 260 DIDEDGLRKIAQQTGGRFFRARDTEELAGIYAELDR 295 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 72/202 (35%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + I ++S G T+ L+ A + H D + Sbjct: 344 KDHLISVTPDSIRDGKVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +IVF+TDG+ + +L N ++ ++ IGI L C Sbjct: 400 -LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E+ + + I Sbjct: 457 LTRRVHEEEDAGSQLIGFYDEI 478 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 78/213 (36%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +D+S SM + D +I++A + + + P+ N GL F Sbjct: 83 KDAEGIDIMLTMDISASMLTE-DVFPNRIEVAKEVASDFI---SGRPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + +++ + T GL + +++ D + Sbjct: 135 AGEAFTQCPMTVDHAALLNLLHNVRTDLVVKGLIQDGTAIGMGLANSVSRLKDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 K+I+ +TDG N + + AKK G +Y IG+ + Sbjct: 193 ---------KVIILLTDGSNNVGSISPMTA--ASIAKKYGIRIYTIGLGKESEGDLGAID 241 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + FY ++ + + I K Sbjct: 242 YKTLQNIAVSTNGEFYRAQSQAELSKIYQDIDK 274 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 50/310 (16%), Positives = 102/310 (32%), Gaps = 45/310 (14%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRI---KNTWNMSFRNELRDNG------FV 104 ++ + V A Q+ + G I +N + +N + G F Sbjct: 95 VEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVSTFS 154 Query: 105 NDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RYKIPLKFCTFIPWYTNSRHIVMP- 157 D+D +T ++ P+ + + Y PL P+ + P Sbjct: 155 IDVDTGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYDYPLPAKNAAPFSVTTELAPSPY 214 Query: 158 ---------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + +++ +LDVS SM +S+ K+ + ++ + + Sbjct: 215 NDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGSM-----ASVDKLPLLQTALKLLTAQ 269 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + V+ VV +G + L ++ LS G STN G+ AY Sbjct: 270 LSAQDKVSIVVYAGAAG----VVLDGASGNDTQTLNYALEQLSA-GGSTNGGQGITQAYQ 324 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++H N ++ TDG+ N+ + + + K G + +G Sbjct: 325 L------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLIALIEKEKDHGIGLTTLGF 375 Query: 328 RVIRSHEFLR 337 + ++ L Sbjct: 376 GLGNYNDQLM 385 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 78/219 (35%), Gaps = 32/219 (14%) Query: 168 TDARLDMMIVLDVSRSME--SFFDSSITK---IDMAIKSINAMLEEVKLIPDVNNVVQSG 222 D+M+ +D+S+SME + D++ K + + + K + G Sbjct: 86 EQPIRDLMLAIDISQSMEATDYTDANGAKSDRLSAVKSVVRDFIARRKDD-------RIG 138 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L+ F L + + S + G+ + D G+ Sbjct: 139 LIVFGTGAYPQAPLT---------LDHASLLLLLDEVGIGMAGPNTALGDAIGLTIKALE 189 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EF 335 + +K+++ +TDG T + + A+ G +V+ IGI ++ Sbjct: 190 KTPEQEKVLILLTDG--NDTSSAITPDHAAHLAQANGIVVHTIGIGDPQATGDAKVDLTT 247 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 L+A A + F+ ++ ++ ++ + + K + Sbjct: 248 LQAIARTTGGQFFRADDRQALQQVYATLDRLTPHKVQTF 286 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 76/205 (37%), Gaps = 37/205 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ ++D S S+ ++ +ID + ++L+ L P N ++ FS Sbjct: 112 QGVMDVVFLIDSSSSITD--ENYRKEID----FVKSILDYYYLHP---NYTLVSILEFST 162 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + +++ I + TN ++ A+ + + ++ Sbjct: 163 DVRVLQELTYDACDVRKAIDS-DRMSGLTNIAKAIEEAHRILKNS----------RSDIP 211 Query: 289 KIIVFMTDG---------ENLSTKE-DQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFL 336 IV +TDG N + + ++ AK +Y IG+ + Sbjct: 212 DQIVLITDGFQTVHSSINCNDHPHDCNAYAIEKARAAKADDIQIYTIGVGAASYYEDDLR 271 Query: 337 RACASPNSFYLVENPHSMYDAFSHI 361 + +SP+ Y S+ D +S I Sbjct: 272 QIASSPSDQYF-----SLVDDYSSI 291 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 78/221 (35%), Gaps = 30/221 (13%) Query: 172 LDMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ--SGLVTF 226 + + +VLD S SM F +ITK+D+ + + + Q GLVTF Sbjct: 99 IAIYLVLDQSGSMSEEVKVFRKTITKMDLLKEVTKGFV-LGNKQEGLTGRPQDMMGLVTF 157 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + L + ++ L + YA + ++ +H E Sbjct: 158 ARGAQVLAPLTLDHQAIIDQLSKLQYTTDLEQDGTAIGYAIYKTANLIAATRHYAEELEG 217 Query: 287 YKK--------IIVFMTDGENLSTKEDQ-------QSLYYCNEAKKRGAIVYAIGIRVI- 330 K I++ +TDG DQ + L AKK G VY I + Sbjct: 218 AGKPAYTIKNSIMILVTDGLQAPNPLDQGKEFRNVELLDAAVYAKKLGVKVYIINVEPRI 277 Query: 331 -----RSHEFLRACASP---NSFYLVENPHSMYDAFSHIGK 363 +H L + FY+V+N ++ +S I + Sbjct: 278 ASEEFSAHRLLMKKITELTGGRFYMVDNSLNLSSIYSEIDQ 318 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 76/213 (35%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +DVS SM + D ++ +A + + + P+ N GL F Sbjct: 83 KDTEGIDIMLTMDVSASMLTE-DVYPNRMVVAKEVASEFI---SGRPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + ++ + T GL A +++ D + Sbjct: 135 AGEAFTQCPMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 KI++ +TDG N + + A+K G +Y IG E Sbjct: 193 ---------KIVILLTDGSNNAGSISPMTA--AAIARKFGIRIYTIGFGKETGEEIGAID 241 Query: 336 ---LR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A ++ FY ++ + + I K Sbjct: 242 YKTLQDIAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 91/305 (29%), Gaps = 36/305 (11%) Query: 49 VLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM-------SFRNELRDN 101 + + + EG + N +F ++ Sbjct: 201 QRQFAEPMAAIAPSPVPPAEGRMQMQLDPNRERFANAAANPIKSVATDPVSTFSADVDSA 260 Query: 102 GFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRH 153 + + S+ + + Y + P K PW ++ Sbjct: 261 SYAFVRRSLTGGAMPDPLSVRVEEMINYFPYDWPGPNNADQPFKATVTVMPTPWNRDTEL 320 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + I + T R +++ ++DVS SM+ K+ + + M+ +K Sbjct: 321 MHVAIKG-YDIAPATTPRANLVFLIDVSGSMDE-----PDKLPLLKSAFRLMVNRLKADD 374 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V+ V +G + S + I L G ST G++ AY + Sbjct: 375 TVSIVTYAG----NAGTVLAPTRVAEKSKILSAIDRLEP-GGSTGGAEGIEAAY-DLAKQ 428 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ N ++ TDG+ N+ D E +K G + +G Sbjct: 429 GFVKDGVNR--------VMLATDGDFNVGPSSDGDLKRIIEEKRKDGIFLTVLGFGRGNL 480 Query: 333 HEFLR 337 ++ L Sbjct: 481 NDSLM 485 >gi|116876155|gb|ABK30937.1| complement component 2/factor B variant 2 [Carcinoscorpius rotundicauda] Length = 889 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 25/244 (10%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I N + +S K+ + + + +++ RL + V Sbjct: 371 IFKTSSNNNFRMSTNKVSKMLANKGFEMSRRSIFQKSPRIEPRRRRIDLNFPGRLVIYFV 430 Query: 178 LDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFL 235 D S S+ +F+SSI + + V + G V+FS+ + FL Sbjct: 431 FDASGSIGRKYFNSSIKFAKGLVTRMG-----------VKEFGTRFGAVSFSSTVSASFL 479 Query: 236 LEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + T + L + Q+ + H + A K II Sbjct: 480 PQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSK---HTFADRA-MKTIIFL 535 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASP--NSFYLVEN 350 +TDG+ E K A +Y+I + LR AS + Y++++ Sbjct: 536 LTDGK---ANMRGDPKQVAKELKADVKAEIYSIALTGDYDINKLREVASSKKDHVYILKD 592 Query: 351 PHSM 354 ++ Sbjct: 593 YETL 596 >gi|326426681|gb|EGD72251.1| hypothetical protein PTSG_11571 [Salpingoeca sp. ATCC 50818] Length = 1748 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 48/230 (20%), Positives = 83/230 (36%), Gaps = 37/230 (16%) Query: 155 VMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 P +S V T D+++VLD S S+ ++ ++ V+ + Sbjct: 28 SGPSSSLVAATPDCTRTDFDLVLVLDESGSVG---------MEDWQHTLTFASHFVRALD 78 Query: 214 DVNNVVQSGLVTFS-NKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N +Q VTFS + + +F L + H+ + L G ST++ L+ + + Sbjct: 79 TETNSIQVAAVTFSTDPVLQFSLNTYSTEEHITNALTSLPYAGKSTDTGAALELVLSGVL 138 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VI 330 D ++V MTDG + L K A V+A+G+ I Sbjct: 139 DNPAGGYRGGRA------VVVVMTDGHTQDP----ERLQLAAPLLKARADVFAVGVGDNI 188 Query: 331 RSHEFLRACASP-------NSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P +S + V N + +DIVT D Sbjct: 189 NVPELFLIASAPTESHVRWSSDFTVVNANEFV-------QDIVTAVRCED 231 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 68/181 (37%), Gaps = 21/181 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D D++ LD SR++ S S ++ + ++ + V+ Sbjct: 407 RCRDTAFDVVFALDASRAVSSADWSY---------TLEHLRTTMRAFDFGEGATRVAAVS 457 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ F LE + + I L+ G +T++ L+ ++ + ++ G Sbjct: 458 FGDRTRVGFTLEHVASPADIDSAIARLAYIGSATDTGLALEVVHSSVLELVG-------G 510 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 ++V +T ++ + L + G +V +IGI L + ++P+ Sbjct: 511 RVTRPTMVVLVT---AAPASDEARLLSAAGALMRSGVLVSSIGIGAGVDAAQLTSISTPD 567 Query: 344 S 344 + Sbjct: 568 N 568 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 71/177 (40%), Gaps = 26/177 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--I 230 D++ +LD S S+ ++ ++A L+ + P N + +++FS + Sbjct: 50 DLVFLLDRSGSVGQAG------FEVETGFVHAFLKGFDVAP---NTTRVAVISFSEDAVV 100 Query: 231 EEFFLLEWGVS-HLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 FL + G HL RK++ + S +TN+ GL+ A+ K Sbjct: 101 HADFLKDPGNKCHLSRKMQGVHSANQGATNTGAGLQAAWEVF----------QRSRPTAK 150 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K+++ +TDG + + + K G ++ GI + + ++P + Sbjct: 151 KLLILVTDGMATMGPDPVKK---AEKLKNMGVDIFVFGIGRMLKQHLEQLASTPANV 204 >gi|115315541|gb|AAV65032.2| complement component 2/factor B variant 1 [Carcinoscorpius rotundicauda] Length = 889 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 25/244 (10%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I N + +S K+ + + + +++ RL + V Sbjct: 371 IFKTSSNNNFRMSTNKVSKMLANKGFEMSRRSIFQKSPRIEPRRRRIDLNFPGRLVIYFV 430 Query: 178 LDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFL 235 D S S+ +F+SSI + + V + G V+FS+ + FL Sbjct: 431 FDASGSIGRKYFNSSIKFAKGLVTRMG-----------VKEFGTRFGAVSFSSTVSASFL 479 Query: 236 LEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + T + L + Q+ + H + A K II Sbjct: 480 PQDYTTEEEVLNALDKFDFTEGGTAISSALDFVKTQMIPLSK---HTFADRA-MKTIIFL 535 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASP--NSFYLVEN 350 +TDG+ E K A +Y+I + LR AS + Y++++ Sbjct: 536 LTDGK---ANMRGDPKQVAKELKADVKAEIYSIALTGDYDINKLREVASSKKDHVYILKD 592 Query: 351 PHSM 354 ++ Sbjct: 593 YETL 596 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 83/224 (37%), Gaps = 19/224 (8%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + +G +T++ A + I +G ++ ++I + L +D + A Sbjct: 8 LRKFSADIRGNVTMIVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIAATIDPTA 67 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID-DIVRSTSLDIVVVPQNEG 126 R+ + IK + N F N D + + S + N Sbjct: 68 TPTQTTREAVAKKAFCGNIKQ-------SGGLQNSFCNTTTLDTLGTASATLSTATSNNI 120 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 +++ + +P T++ + + + V++ T ++ VLD + SM S Sbjct: 121 MTVTYSATAHVP----TYLLGLVGIDTVDIDAVAKSGVSTSTA---EVAFVLDNTGSMSS 173 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ +T + ++ ++ + + + LV F ++ Sbjct: 174 --NNKMTYLKSSLDAV--LASMLDSTGKNYAKTKVALVPFDTQV 213 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 59/153 (38%), Gaps = 10/153 (6%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +N + L ++ + ++ G +TN T G+++ + + D Sbjct: 424 TNTLLPVMALTQDIAAARTYAARMAPAG-NTNVTIGVQWGMEVLSPTAPFSEGGAFTDKA 482 Query: 287 YKKIIVFMTDGEN-------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG N +++ + + C AK G V+ + + S L+ C Sbjct: 483 VLKYMIVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVEQGDS-TTLQNC 541 Query: 340 AS-PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 AS +Y + N + S I K I R+ Sbjct: 542 ASQTAYYYNLSNADQLPATMSKIMKSIRKVRLT 574 >gi|326918656|ref|XP_003205604.1| PREDICTED: anthrax toxin receptor 2-like, partial [Meleagris gallopavo] Length = 480 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 70/213 (32%), Gaps = 27/213 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 DM VLD S S+ + +N + E V+ ++ Sbjct: 38 ADEQPSCHGAFDMYFVLDKSGSVAQNW-------HEIFDFVNQLTERF-----VSPKMRL 85 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS++ + L ++ +K LS+ T GLK A Q Sbjct: 86 SFIVFSSQAQVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQA---------NLQI 136 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + + II+ +TDG+ + + +++ GA VY +G+ + R Sbjct: 137 EKQGASRFSSIIIALTDGKLDG-QIPLYAEKEAKTSRQLGARVYCVGVLDFVQAQLERIA 195 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + V + A I ++ + Sbjct: 196 DTKEQVFPVTGG---FQALKGIINSVLKQSCTE 225 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/217 (17%), Positives = 71/217 (32%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N G+ F+ + Sbjct: 88 IDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGETF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L I + T G+ A ++ D + K Sbjct: 140 TQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------------------- 329 +I+ +TDG N K D L AK G VY IG+ Sbjct: 189 VIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMP 246 Query: 330 --IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 VEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 85/233 (36%), Gaps = 44/233 (18%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + N+Q ++M+ +DVS SM+ D ++ D A + ++ + + Sbjct: 101 SNETNTQYTEGINMIFAIDVSESMK-ITDIHPSRFDAAKQICTDIINKRSND-------R 152 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V FS + L L+ ++ L + T L A N++ Sbjct: 153 IGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSGTAIGTALGTAINRL------- 205 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-- 335 ++IIV ++DGEN S D + + + +Y IG+ +H+F Sbjct: 206 ----KNAETKERIIVLISDGENTSGLMDPITA--ADLCLEYNIKIYCIGLGKDGTHQFKD 259 Query: 336 ----------------LRA--CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L+ + FY + S+ D ++I + K + Sbjct: 260 DNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSLDDVIANIDQLEKGKIV 312 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 55/168 (32%), Gaps = 33/168 (19%) Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF----DMQGMRQHCNTEDANYKKI 290 L S LQ I ++ G ST G+ + + G KK+ Sbjct: 313 PLTSSFSTLQSAIADMTSEG-STRLDAGMLAGWYTLSPKWRSAWGGGTAPADYSEKVKKV 371 Query: 291 IVFMTDGENL----------------------------STKEDQQSLYYCNEAKKRGAIV 322 IVFMTDGE + L C+ K + Sbjct: 372 IVFMTDGEMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEI 431 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 YAI + L+ C+S +Y + ++ D ++ I K+I+ + Sbjct: 432 YAISYSSEADVQNLQTCSSGTKYYFSASTTNIKDVYTAISKNIIGSTV 479 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 25/222 (11%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + G I++AI L + L +G ++ S + + + Sbjct: 7 LSKFIADDNGNFAIMSAILLMPLLLAVGAAVDYSSARDHRNDIQVTA------------D 54 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + ++L D G + V + + Sbjct: 55 SAILAAASSYSSSSGVDSLAAGIDSYLDSKLTDQGSNDVDTAAVPKRLSGPTLSADGKEI 114 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 I +P F + + + S + D L++ +VLDVS SM Sbjct: 115 CIVVGE--GVPTSFMQLA----GVKTVDVSAKSCAALPGNID--LEVSLVLDVSSSMIEE 166 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++ + L V + + FS++ Sbjct: 167 G-----RFVPMQTAVKSFLTSFANDATVAKRSKIAIAPFSSR 203 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 1/73 (1%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSFYLVENPHSMYDAF 358 ++ + + C A+ +G ++Y + + L+ CAS +S Y + + AF Sbjct: 517 NGSEANTRLSNICAAARAQGIVIYTVAFEAPSGGQTALQDCASSSSHYFDVDGTDISGAF 576 Query: 359 SHIGKDIVTKRIW 371 S I DI ++ Sbjct: 577 SAIASDIRNLKLT 589 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 82/233 (35%), Gaps = 34/233 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + G + I + + + +I GM +++ + L +DR+ + AA Sbjct: 19 FQRFRKDEDGALIIFSLMMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 73 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 ++ + + + F D +V Sbjct: 74 ---DLDQVMAPADVVRDYMDKAGMLHFL-------------------QGDPIVDQGINYR 111 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--ARLDMMIVLDVSRSME 185 ++A + +PL F +++ M + ++ + + +++ +VLDVS SM Sbjct: 112 IVTANASAPMPLFFYDLPKVFSSPFTPGMSSLTVSGSSTAEERVSDVEISLVLDVSSSMN 171 Query: 186 SFFDSSITKID-MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 S ++ +T + A + + +L P ++ ++ +S + + Sbjct: 172 S--NNRMTNLRPAAREFVTTVLANNTNAPQ--GLITISMIPYSAVVNPGTDIA 220 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 47/226 (20%), Positives = 89/226 (39%), Gaps = 31/226 (13%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I S + +Q+ +D++ VLD S SM S KI ++I L+ ++ + Sbjct: 21 FIPFSGVLAAQSGTSMDVIFVLDSSGSMAE---SDPEKIRE--EAIKMFLDMGQIQGNKA 75 Query: 217 NVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFD 272 GLV +S+ I L S + +IK ++ G T++ GL A + Sbjct: 76 -----GLVAYSDSIVREHNLDSINSSEDKDRIKSMASDISLGQRTDTGRGLLEAVKLM-- 128 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGIR 328 G + N +I+ ++DG+N + +SL A K +G VY IG+ Sbjct: 129 ESGHKSGNNP-------VIILLSDGKNDPERSQDESLNDLKNALQICKSKGYPVYTIGLN 181 Query: 329 VIRSHEFLR----ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + + ++ Y+ + + I D ++ Sbjct: 182 YNGTVDKTQLGDISSSTGGKDYITNTASDLPAILTDIYADNSKLKV 227 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 69/221 (31%), Gaps = 32/221 (14%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F +G + ++ + + L+LG ++ K+ L D + + A I+ Sbjct: 15 TFSSCDRGNVAVIFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATT 74 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + +L ++N + + E+ + ++ Sbjct: 75 TNQQAQSQAQVYLLTNVRNAVAVVTKAEISAD------------------------RLTL 110 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S +IP + I T+ + + ++ +VLD S SM Sbjct: 111 CLDSTAQIPTTIMKIA----HIETITTKATTCAQTPGGMNGTYEIALVLDNSGSMSKSAG 166 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +KI + + + I V+ +V FS + Sbjct: 167 GK-SKIAALRDAATSFVNN---IYSKTTDVKMSIVPFSAGV 203 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 36/76 (47%), Gaps = 8/76 (10%) Query: 295 TDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVI------RSHEFLRACASP-NSFY 346 TD + S +D+ +L C AK G +Y IG V + + ++ CA+ N ++ Sbjct: 490 TDYHDTSRNMQDELTLEACTNAKTAGVEIYTIGFSVPVDPIDAQGLKMMQDCATDANHYF 549 Query: 347 LVENPHSMYDAFSHIG 362 + S+ AF+ IG Sbjct: 550 AATDVDSLNAAFASIG 565 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 31/89 (34%), Gaps = 1/89 (1%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + L S + KI L++ G TN G + + I + Sbjct: 364 TSSSQTVLQLTATQSTITTKISGLTENGY-TNLHEGFMWGWRTISPTGPFAAGRAYATKD 422 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 KIIVFMTDG N +A Sbjct: 423 NHKIIVFMTDGFNNWQSATSTVTGSAYQA 451 >gi|30749469|pdb|1MHP|A Chain A, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody gi|30749472|pdb|1MHP|B Chain B, Crystal Structure Of A Chimeric Alpha1 Integrin I-Domain In Complex With The Fab Fragment Of A Humanized Neutralizing Antibody Length = 192 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 78/212 (36%), Gaps = 35/212 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + + I +N +L+ + + P Q G+V + Sbjct: 1 TQLDIVIVLDGSNSIYPW--------ESVIAFLNDLLKRMDIGP---KQTQVGIVQYGEN 49 Query: 230 IEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L S + + + + G T + G+ A + F Sbjct: 50 VTHEFNLNKYSSTEEVLVAANKIVQRGGRQTMTALGIDTARKEAFT------EARGARRG 103 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF--------LR 337 KK++V +TDGE+ +Q + C ++I I ++ Sbjct: 104 VKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIK 160 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ +G+ I Sbjct: 161 SIASEPTEKHFFNVSDELALVTIVKALGERIF 192 >gi|48425220|pdb|1PT6|A Chain A, I Domain From Human Integrin Alpha1-Beta1 gi|48425221|pdb|1PT6|B Chain B, I Domain From Human Integrin Alpha1-Beta1 Length = 213 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 81/218 (37%), Gaps = 35/218 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+ +LD++IVLD S S+ + D +N +L+ + + P Q G+V Sbjct: 1 GSECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLKRMDIGP---KQTQVGIV 49 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + + F L S + + + + G T + G A + F Sbjct: 50 QYGENVTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFT------EAR 103 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH------ 333 KK++V +TDGE+ ++ + C ++I I R + Sbjct: 104 GARRGVKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSYNRGNLSTEKF 160 Query: 334 -EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 E +++ AS F+ V + ++ +G+ I Sbjct: 161 VEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFA 198 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 83/261 (31%), Gaps = 30/261 (11%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 N S K +++ ++ + R +++ +DVS S Sbjct: 42 NTSVLDSVAPPGTNRWKHVPIALLAIGLVLLMVALSGPQAMRKVPRNRATVVLAIDVSLS 101 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 ME+ D ++ A ++ + E+ N V G+V+F+ + Sbjct: 102 MEA-RDVEPDRLTAAKEAAKKFVTELP------NGVNLGIVSFAGTASLLVSPTPDRTLA 154 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + L + T + G+ Y I ++ +R ED I+ +DG+ Sbjct: 155 LNAVDKL-ELAQRTATGEGI---YTSIQSIKNIRDVLGGEDNAPPARIILESDGKQTVPT 210 Query: 304 EDQQSL---YYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRACA--SPNS 344 + +AK+ G + I L+ A S Sbjct: 211 DLDDPRGGFTAARKAKEEGIPISTISFGTTSGSVNIGGQNIPVPVDDASLKRIAELSGGQ 270 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 F+ + + + +A+ + +I Sbjct: 271 FFAASSLNDLNEAYGSLRDEI 291 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 62/456 (13%), Positives = 130/456 (28%), Gaps = 99/456 (21%) Query: 11 FFYNYKGGMTILTAI-FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G + ILTA+ F+P++ + +G ++V + + L S +D + + AA+ + N Sbjct: 1 MGADRSGNVAILTALAFVPLMLITIG-SLDVVRMTTAQAKLQSTLDSATLAAAS-LSNTA 58 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + L + + +I + +++ ++ Sbjct: 59 DIEDTVDEYIQANLPDTAPWTTLKLTMGDVTDSLNAKSVEITATVDIEMTILKLAGIDKT 118 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD-----VSRSM 184 S ++ M + + +D M+ D S S+ Sbjct: 119 SVLASSVAQQAAQNIEVSVVLDISSSMGGSKITSLREAAKGFIDTMLKEDEDKEYTSLSI 178 Query: 185 ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVV--------------QSGLVTF--- 226 F + +I ++ + N V + G + + Sbjct: 179 IPFGGTVNIGDFYDTYAVNSSTPGVIDSPSSANYYVNKNVPYGKFMFSTEREGCIEYTDD 238 Query: 227 ---------SNKIEEFFLLEW-------------------GVSHLQRKIKYLSKFGVSTN 258 +++ + +W + L+ I + + Sbjct: 239 DFDMAAIPANSRPQVPDFTKWVATNPWCPSEDSAMVLNSNNTTDLKALIDDMDLSDGTGM 298 Query: 259 STPGLKYA------YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--------------- 297 L A + + D + K+ V MTDG Sbjct: 299 DIGALWGAKVLSGSMRGQLGGDFSDRPADFNDEDTLKVAVIMTDGAITAQFRPRDYTTTG 358 Query: 298 ----------------ENLSTKEDQQS---LYYCNEAKKRGAIVYAIGIRVIRS---HEF 335 STK D C VY IG ++ + Sbjct: 359 KIKNKTQQTIVSKGNINTASTKADDAVAYFKRVCEYLNDNNVQVYTIGFQINSGSLPDQL 418 Query: 336 LRACASP-NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L+ CAS +++Y VE + + DAF+ I + R+ Sbjct: 419 LKYCASSLSNYYFVEGLN-IEDAFNAIASAVNNLRV 453 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 62/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SM+ SI ++ + ++ ++ + GL+ F Sbjct: 90 DLLLAVDISPSMDEQDMVIQGRSINRLQAVKRVLDDFIQR-------REGDRLGLLLFGT 142 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ ++ + + G +T L A ++ D Sbjct: 143 EPYIQAPLTFDLATVRTLLHEAGIGMAGRATAIGDALGLAVKRLRD-----------RPQ 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++++ +TDG N + + AK G +Y IGI Sbjct: 192 EQRVVILLTDGANTA--GEIAPDKAAEIAKAAGVRIYTIGIGA 232 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 49/240 (20%), Positives = 95/240 (39%), Gaps = 28/240 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS------------SITKIDMA 198 ++ + + S + + +L++ +VLD+S SM S FD + T+ Sbjct: 930 GYYLSVGLNSGIIESDFQRKKLNLALVLDISGSMGSSFDEYYYDRFGNHVAVNDTEDAEK 989 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFG 254 K A V L+ + + + GLV F+ E + + L+ + +S G Sbjct: 990 SKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATG 1049 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + G++ A +++D + + Y+ I+F+TD S + ++SL E Sbjct: 1050 -GTRLSAGMQMA-TELYD-----EFLEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIE 1102 Query: 315 AKKRGAIVYA--IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A VY IGI V + E + ++Y V + + + +VT + Sbjct: 1103 ANANK-NVYTTFIGIGVDFNTELVEYITKIRGANYYSVHSATQFKERMDDEFEYMVTPLV 1161 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVAAEFING---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L ++ I T GL A +++ D + Sbjct: 140 TQCPLTVDHGVLLNLFQSIKCDIAQKGLIMDGTALGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G VY IG+ Sbjct: 193 ----KVIILLTDGVNN--RGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 YVQVPVEIDEQTLTQIAGTTNGNYFRATSNSKLKEVYQEIDK 288 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/173 (23%), Positives = 78/173 (45%), Gaps = 27/173 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S T + LD++ VLD S SM+ S +I ++I L+ ++ + + GLV Sbjct: 29 SSNTSSNLDVVFVLDSSGSMKE---SDPEEIRT--EAIKMFLDMSQVQGN-----KFGLV 78 Query: 225 TFSNKIEEFFLLEW-GVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +S+ + L+ + + +IK ++ G T++ G+ A N + + + H Sbjct: 79 AYSDNVVREHNLDTINSNDDKERIKNMALNIPLGQKTDTGAGILEAVN-LMNSGHDKNH- 136 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN----EAKKRGAIVYAIGIRV 329 K +I+ ++DG+N ++ + SL K +G VY IG+ Sbjct: 137 -------KPVIILLSDGKNDPQRKTEDSLKDLKSSISTCKDKGYPVYTIGLNY 182 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 51/249 (20%), Positives = 88/249 (35%), Gaps = 35/249 (14%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 +V PQ+ G + I ++ L + + + + ++ LD++ VL Sbjct: 9 IVAPQDSGSRPTPIVPGRVQLTSINNNTAPLEESKLKVMLELTGGDSTNDRPGLDLVAVL 68 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE- 237 DVS SM KI ++ +++++ I + +V FS L Sbjct: 69 DVSGSMAG------DKIAKVKTAMLFVIKKLSPID------RLSVVKFSADASRLCPLRQ 116 Query: 238 ---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L+ I L+ G TN T GL+ + D I+ M Sbjct: 117 ITEDSQKDLENLINGLNADGA-TNITAGLQTGLKVLNDRSLSSGRVVG--------IILM 167 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENP 351 +DGE + D + N VY G + L+A A+ +F V+N Sbjct: 168 SDGEQNA-GGDAAQVPIGN------VPVYTFGFGINHEPRVLKAIANNSMGGTFSDVQNT 220 Query: 352 HSMYDAFSH 360 ++ AFS Sbjct: 221 DNLSLAFSQ 229 >gi|47228041|emb|CAF97670.1| unnamed protein product [Tetraodon nigroviridis] Length = 1009 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 68/193 (35%), Gaps = 33/193 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN- 228 LD++ V+D S S+ ++ T + + L + P + G+V +S+ Sbjct: 706 GALDIVFVIDSSESVG---LTNFTLEKNFVINTINRLGSLAKDPKSETGTRVGVVQYSHS 762 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I ++ + +K + T + LKYAY+ + ++ Sbjct: 763 GTFQAIRPDDPKIDSLTSFKEAVKQMEWIAGGTWTPSALKYAYDNLI--------RDSRR 814 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A +V +TDG +D Y C++ K V AIGI + Sbjct: 815 AKASVSVVVITDGRFDPRDDDSLLTYLCSDPK---VDVNAIGIG--------------DM 857 Query: 345 FYLVENPHSMYDA 357 FY VE + Sbjct: 858 FYQVEENEILRSI 870 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 53/185 (28%), Gaps = 16/185 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----SG 222 Q + +++ +D S ++ + ++ ++ + + V+ G Sbjct: 45 QDECNIEVYFTIDTSETIALQESPPGSLVESIKDFTIEFVKRL-ADEEYRGAVRLSWKMG 103 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + FS + F L + I + G T L ++ Sbjct: 104 GLHFSQEQRVFSRLGT-KAQFINGISGIRYLGKGTYIDCALTNMTQEMTQS--------P 154 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-S 341 + V +TDG A+ G ++A+ +R A S Sbjct: 155 SPFKPLRFAVVITDGHVTGNPCGGI-KVSAERARDAGIRIFAVAASRNIDETGMREIANS 213 Query: 342 PNSFY 346 P Y Sbjct: 214 PAMVY 218 >gi|47216852|emb|CAG11659.1| unnamed protein product [Tetraodon nigroviridis] Length = 1042 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 71/188 (37%), Gaps = 25/188 (13%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKY 249 ++ +NA++E V L + + G+V FS+ L+ +S L+ ++ Sbjct: 696 PKNFELVKDFVNALVERVWLSQEAA---RMGVVLFSHTSLAVASLQPRSSLSELKATVRS 752 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + G T + + A+ +K+ + +TDG+ ++ Q Sbjct: 753 MPYLGEGTFTGSAIHRAHQLFQAS----------RPGVRKVALVLTDGQ-ADPRDVVQVG 801 Query: 310 YYCNEAKKRGAIVYAIGIRV------IRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 EA+ RG ++ IG+ ++A AS +L+++ S++ Sbjct: 802 VSAAEAQARGIEMFVIGVMNESHPLYPDFSAEMKAIASNPKEGHVHLIQDFGSLHVLEKV 861 Query: 361 IGKDIVTK 368 I I + Sbjct: 862 IVNQICEQ 869 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 84/238 (35%), Gaps = 33/238 (13%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 PW + +++ I +V Q +++++++D S SM + K+ + ++ Sbjct: 182 PAPWNAQRQLLLVGIQG-YRVAPQDIPAVNLVLLIDTSGSM-----ADRAKLPLLKSALR 235 Query: 204 AMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ +++ + +V + S + + + I L G STN Sbjct: 236 QLVTQMRAQD------RVAIVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASG-STNGGA 288 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGA 320 GL+ AY + + N IV +DG+ + D L Y +KRG Sbjct: 289 GLELAY------AEAAKGLVKDGVNR---IVLASDGDFNVGRTDLAQLKDYVGSQRKRGI 339 Query: 321 IVYAIGIRVIRSHEF----LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +G+ ++ L + Y S+ A ++ + K Sbjct: 340 ALTTLGLGSGNYNDAMAMQLANAGDGSYHY----IDSLLQARKVFASELSATLLTIAK 393 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 80/230 (34%), Gaps = 39/230 (16%) Query: 161 SVKVNSQTDARL--DMMIVLDVSRSM-----------ESFFDSSITKIDMAIKSINAMLE 207 ++ QT L D+M+V+D+S SM +S I++++ + + ++ Sbjct: 85 TILGEPQTRESLGRDVMVVVDLSGSMAEQDFVSKQSSDSGTVKKISRLEATKEVLADFVK 144 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPG 262 K + GL+ F + F+ + ++ L++ G ST+ Sbjct: 145 TRKGD-------RLGLILFG---DAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDA 194 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + A + +K+ + +TDG T + + A +G + Sbjct: 195 IGLAIKVFEQSSEDKASAEENAKPREKVAIVLTDG--NDTGSYVEPIDAAKVAAAKGVRI 252 Query: 323 YAIGIRVIR--SHEFLR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 + I + R + L A S + N + A+ IG+ Sbjct: 253 HMIAMGDPRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|156367146|ref|XP_001627280.1| predicted protein [Nematostella vectensis] gi|156214185|gb|EDO35180.1| predicted protein [Nematostella vectensis] Length = 244 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 69/206 (33%), Gaps = 22/206 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D+ ++DVS M D T + + I +L L + + G++T+S K Sbjct: 11 RIDLGFLIDVSHGMR--RDGGST-LPQLKRFIRQILRRFAL---GSRHTRVGIITYSTKS 64 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + G L R + + L+YA +F + Sbjct: 65 RAMISIHRGITTRTLTRVTSRIRFLRGPRRTGRALRYARKYLF--------RGKPRCGRR 116 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T G + + +E + G + A+ I + R AS S V Sbjct: 117 RVLIVVTSG-----RSGDRVRKPSSELQAVGVEIVAVSIGGGGRRQVTRI-ASTRSHVFV 170 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + T + + Sbjct: 171 VGIKRLVKIVGRIQARVCTASVCRQR 196 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 62.5 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 72/202 (35%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + I ++S G T+ L+ A + H D + Sbjct: 344 KDHLISVTPDSIRDGKVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +IVF+TDG+ + +L N ++ ++ IGI L C Sbjct: 400 -LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E+ + + I Sbjct: 457 LTRRVHEEEDAGSQLIGFYDEI 478 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/125 (21%), Positives = 52/125 (41%), Gaps = 11/125 (8%) Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V ++++ F L+ + + IK + G T + LK+A N +F + + Sbjct: 2 VLYNDRPSAEFYLDTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQK----RGS 57 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + K+I + MTDGE + E ++ G VYA+G++ E + + Sbjct: 58 RKALGVKQIAIVMTDGE-----SEDDVTTTAAELRRSGVTVYALGVKNASVEELKKIGSY 112 Query: 342 PNSFY 346 P + Sbjct: 113 PEHEF 117 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 37/316 (11%), Positives = 101/316 (31%), Gaps = 47/316 (14%) Query: 55 DRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 + + V A NR+K+ + + + +N + +I S+ Sbjct: 1865 ETTAVAGARIPRPSAKSVNRQKIAKEKSVLAVSKAEKQNEALLNGENLKDKQLKEIPGSS 1924 Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 ++ +I I + + P T+++ +D+ Sbjct: 1925 R-------SSKLSTIQIIPSSNFHRQP---------TSTTFSPNTATIFSPGCE---IDV 1965 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++++D S S+E F + M+ +++ P + +V F+ K + Sbjct: 1966 IMLIDSSGSVEKTFSRE-------KELAAEMINRLRIGPKNA---RVAIVKFAAKEKVKT 2015 Query: 235 LLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + + ++ + +T L A + +G R T Sbjct: 2016 IWSFDKPQEQQKVLQALQEIPFSSGTTAIHAALLQAVTEYSAAKGARPKLATP------F 2069 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRACASPNSFYL 347 ++ TDG + ++ + ++A+ I E ++ S + ++ Sbjct: 2070 VIVFTDG-----FGQKDTMEAATLLRNLIPSIFAVAISRQHPVNEEELIKIAGSKDRVFI 2124 Query: 348 VENPHSMYDAFSHIGK 363 + +++ I + Sbjct: 2125 DNDIRKLFEMLEKITR 2140 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 54/319 (16%), Positives = 111/319 (34%), Gaps = 52/319 (16%) Query: 46 MKTVLHSMIDRSLVHAATQI------MNEGNGNNRKKLKGGD--ILCRIKNTWNMSFRNE 97 + + ++ + T + E + L GD IL + F N Sbjct: 761 TQDSMQTITTAAAKLRQTDAEVYVVPITENYSKDELSLYVGDQGILITHPKAYKKKFANY 820 Query: 98 LRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 N + D V +D + ++ + K + + T Sbjct: 821 APLNDNNTVVKDFVEIARMDEPFTQIIVTTLFNDQSHVTNKPDMIVKSIAENETRHPLFD 880 Query: 156 MPITSS-VKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++S+ +K +Q +D++ ++D S+S+E K + + +K Sbjct: 881 FDLSSAEIKSKAQIQDPNCLVDLIFIVDTSQSVE----------KTFQKQLQFAVTLIKQ 930 Query: 212 IPDV--NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 IP NN ++ ++FS++ + F + + + L+ G +T+S G+ A Sbjct: 931 IPPSAFNNRIRVAAISFSSEAQINFQFNEFNNRTEILNALLSLTHSGGNTSSVSGINLAI 990 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +I + +++IV M+DG + ED L + IVYAI Sbjct: 991 KEIL---------ERGREDVRRMIVLMSDGNSQDCWEDL--LDASDRLHATNTIVYAIA- 1038 Query: 328 RVIRSHEFLRACASPNSFY 346 A+P+ ++ Sbjct: 1039 ------------ANPDYYF 1045 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 71/205 (34%), Gaps = 22/205 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + S++ N +++D+ I+LD S S + F+ + + L E I Sbjct: 1369 SALVAKSLESNRCKYSKMDLEIILDASASRQQVFEH--------QRELALSLIERLPIDA 1420 Query: 215 VNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V G+ +F++ + L ++ I+ + G ST + ++ + + Sbjct: 1421 GETHVAVGINSFTSVPTLRQTLGLGRDKQMVRHAIEDIKYIGGSTFTAQAVELSVQDL-- 1478 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIR 331 + +++V M DG + ++ L K GA ++ + + I Sbjct: 1479 -------ERGRRPDAIQVVVLMNDGMSQDP--WEKVLEASQLLKATGAKLFGVALGESID 1529 Query: 332 SHEFLRACASPNSFYLVENPHSMYD 356 E S + Y + Sbjct: 1530 LRELKHYIGSTDRIYRDNSTERFLA 1554 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 42/294 (14%), Positives = 93/294 (31%), Gaps = 42/294 (14%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 N + + + N L +G D+ V + + +P+ Y+ Sbjct: 521 SNSSAVTGITLATQYAQKSRRSNARLIIILVSDGSSQDLWHSVVQAAKKLHELPETIIYA 580 Query: 129 ISAISRYKIPL-------------------------KFCTFIPWYTNSRHIVMPITSSVK 163 ++ +YK K N I + ++ Sbjct: 581 VTTSKQYKFAELEAFTKDKWKVYVDGRVRRFITDASKHLMKKEGDENDEIIDAGFSPIIE 640 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D++I++D S + F +++ K + +L +++I + V+ L Sbjct: 641 SIKAHNDPVDLIILVDTSTPADQDF-------EVSKKFLAELLRSLQVIDFQSR-VRISL 692 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TF++ L + I Y + S + A + Q Sbjct: 693 TTFTDNAHIEIELR--KPTAKENILYAVGKLQNEYSNASVSAAVDVALA-----QISVPG 745 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + ++I V +TDG + + + + ++ A VY + I S + L Sbjct: 746 EGPRQRIFVILTDGSTQDSMQTITTA--AAKLRQTDAEVYVVPITENYSKDELS 797 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 75/204 (36%), Gaps = 21/204 (10%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + ++D+MI LD S S+ + F+ K + +++ + + + +Q ++ Sbjct: 116 TCKAGKIDIMIALDSSGSVFNVFEDE-------RKLAHDLIDSLVPVTLKDGRIQVSVMR 168 Query: 226 FSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F++ ++ F + + + K+ + G ST + A + D+ R++ Sbjct: 169 FASSAEVVIPFKISRTPNEIMEKLDKIKFTGGSTRIAKAVDLA---LTDLSRWRRNDAI- 224 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASP 342 +I + ++DG + + + + V+A+ + + +E + Sbjct: 225 -----QIFILISDG--NGHELWHVAQTAGRKLQNANIEVFAVPVSQDHNLNELILYTGDA 277 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 Y+ S + V Sbjct: 278 KRVYVGAKQSQFVHTISSLINKCV 301 Score = 37.8 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 78/217 (35%), Gaps = 31/217 (14%) Query: 95 RNELRDNGFVNDIDDIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + L FVN +DD+ + +I + +S K K + Sbjct: 363 KGNLNLEAFVNSLDDLSSPIEIPNIRGQLMFKADGTPLLSDNKQNAKLSKNLKSELIMNE 422 Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + ++ + ++ LD++ +D+S +F ++ + +K + ++ +V Sbjct: 423 -AVEKQTAANITTKISPPVSYLDLLFAIDLS----PYFPAN---LKNQLKLVTELVNKVT 474 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK------YLSKFGVSTNSTPGLK 264 ++ G+VTF+ E LEWG + + ++ S + L Sbjct: 475 DEDINEERIRIGVVTFT--QEARLDLEWGKATTKPQVLQHFESIEYVASNSSAVTGITLA 532 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 Y Q + +N + II+ ++DG + Sbjct: 533 TQYAQ-----------KSRRSNARLIIILVSDGSSQD 558 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 81/265 (30%), Gaps = 68/265 (25%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSI------------------TKIDMAIKSINAMLEE 208 +++++VLDVS SM D+ +++D+ +++ L Sbjct: 32 GQSQPVNLLLVLDVSGSMAWTTDACRLNRWGQPYPSCYPGNGEKSRLDIMKEALELFL-- 89 Query: 209 VKLIPDVNNVVQSGLVTFS-----NKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTP 261 D+ + V+ G++T+S + + E L I L G T + Sbjct: 90 ----DDLPDNVKVGILTYSAGNNIDLLHEVKQLSDNNHKATLLTTIDGLEANG-GTLTAG 144 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ--------------- 306 L A + T + IVF+TDG+ S + Sbjct: 145 ALYEAGSYFRGQYDNLPSPITPGCSNASNIVFLTDGQPNSMSYNGYSYRNSIINMTGSSC 204 Query: 307 -----------------QSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACASPNS--F 345 ++ + + I + ++ FL A + Sbjct: 205 ARSDDGKECSEKLAGFLSTVDQIEDLTPSKVKTHTIAFALEDNNARTFLENVADAGNGQS 264 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 Y ++ + DAF + + + + Sbjct: 265 YTADSTDGLVDAFKSSIQTDIEQSM 289 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 66/215 (30%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 DMM+V+D+S SM F I ++ + ++ + + GLV F++ Sbjct: 35 DMMLVVDLSYSMSKEDMQFNGDYIDRLSAVKQVLSDFI-------SKRQGDRLGLVLFAD 87 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + ++ L G T G+ A D Sbjct: 88 HAYLQTPLTLDRHTVAEQLNQTVLRLIGTKTAIGEGIGLATKTFVDSDA----------- 136 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++++ ++DG N + + AKK A +Y +G+ Sbjct: 137 PQRVMILLSDGSNTA--GVLDPIEAAKIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTA 194 Query: 332 ----SHEFLRACA-SPNSFYLVENPHSMYDAFSHI 361 + + ++ + + + I Sbjct: 195 QDLDEKSLMEIAKLTGGQYFRARDSKELATIYDTI 229 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 66/217 (30%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + M Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N + + A R +Y +G+ Sbjct: 189 --RVLILLTDGSNN--SGNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 331 ---RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 ++ + ++ N + + I K Sbjct: 245 MDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDK 281 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 74/238 (31%), Gaps = 53/238 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D +++ A + A L E+ V+ GLV F+ + Sbjct: 88 IMLAIDVSGSMRA-TDVQPSRLVAAQTAAKAFLTELP------RTVKVGLVAFAGSAQVA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 + L I + T G+ + +F G+ ++ Sbjct: 141 QIPTVNREDLVSAIDRF-QLQRGTAIGNGIVMSLATLFPDAGIDLQSMQSGRERQRGFAI 199 Query: 290 ---------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I+ +TDG+ SL A RG VY +GI Sbjct: 200 DQEKKEAKEFTPVAPGSYNSAAIILLTDGQRT---TGVDSLDAAKLAADRGVRVYTVGIG 256 Query: 329 VIRS---------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + E L+ A A+ ++ + + + + ++ Sbjct: 257 TVDGETIGFEGWSMRVRLDEETLKGIARATQAEYFYAGTATDLKKVYETLSSRLTVEK 314 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 76/222 (34%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRMEAAKDVAAEFI---SGRPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + +++ + T GL A +++ D + Sbjct: 140 TQCPMTTDHQSLLNLLQNVRTDLSARGLIEDGTAVGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------ 332 K+++ +TDG N + D + N AK G VY IG+ + Sbjct: 193 ----KVVILLTDGSNN--RGDLSPMTSANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQ 246 Query: 333 ---------HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 + L AS +FY N + + + I K Sbjct: 247 YVNIPVEIDTKTLSGIASVTHGNFYRATNNNELKQIYKDIDK 288 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 61/417 (14%), Positives = 126/417 (30%), Gaps = 64/417 (15%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F N G + ++ AI L I +G I+ S K L + ID +L+ A Sbjct: 11 LSRFRRNDSGNIAVIFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLAATGYTAM 70 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G+ ++ K ++ S + V+ +T + GY Sbjct: 71 RGSSSDAKTAATNMFNGQMSAHKLTSNSLSIDITDSVSARTVTGSATVVVKTSFMYMFGY 130 Query: 128 -SISAISRYKIPLKFCTFIPWY----------TNSRHIVMPITSSVKVNSQTDARLDMMI 176 +++ + F T++ +Y + M + ++ A D Sbjct: 131 PTMTVSASSSASASFPTYMDFYVLVDNSPSQGLGATTADMTTLQNATTDTCAFACHDTY- 189 Query: 177 VLDVSRS-----MESFFDSSIT-----KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 S S S++D + +ID+ + ++ + V+N + + + Sbjct: 190 ---TSSSKKTLQTNSYYDKAKKLGVTMRIDVVRSATQSLTDTATSSQIVSNQYRMAVYSM 246 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ-------- 278 L S + + P Y + D G Sbjct: 247 GADCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGTMSAMSGVIPT 306 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ--------------QSLYYCNEAKKRGAIV-- 322 + N +K + F++DG + C K RG + Sbjct: 307 QGDGSSTNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVEPLTTTTCTALKARGIKIAV 366 Query: 323 -YAIGIRVIRS--------------HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 Y + + + +++CASP +Y V++ S+ A + + + Sbjct: 367 LYTTYLAITSNGYYNTWVKPWRDSIGTIMKSCASPGYYYEVDSSGSIGSALTALFQQ 423 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 90/220 (40%), Gaps = 27/220 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + IT +V + +++ +++D S SM + K++ A ++ ++ +K Sbjct: 78 VTVDITG-QEVPGARRSPVNLALIIDRSGSMSGY------KLEQAKQAARHLVTLLKDDD 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIF 271 + +V + + ++ L+ ++ +R I+Y+ TN + GL Q+ Sbjct: 131 ------RLAIVHYGSDVKSLPGLQATPANRERMIQYIEGIWDEGGTNISAGLLAGQAQVE 184 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVI 330 + + ++ ++DG+ D+ SL + + RG V +IG+ Sbjct: 185 TARSDYRVNR---------LILISDGQPTEGSTDEGSLKQVVKDIRTRGITVSSIGVGTD 235 Query: 331 RSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + ++A A S+ +E+ + F + T+ Sbjct: 236 FNEDLMQAFAEYGAGSYGFLEDAGKLATLFQKDLQQASTQ 275 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 68/222 (30%), Gaps = 47/222 (21%) Query: 169 DARLDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +M+ +D+S SM + ++ A + L+ + GL+ Sbjct: 102 QQARQLMLAVDLSGSMSDPDMRLGGRVVDRLTAAKAVLADFLDRRDGD-------RIGLL 154 Query: 225 TFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F + L ++ ++ ++ + G T + A ++ Sbjct: 155 VFGQQAYALTPLTADLATVRDQLRDSVVGLAGRETALGDAIALAVKRL-----------R 203 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---------- 332 E +++++ +TDG N + L AK VY I + Sbjct: 204 EQPQGERVLILLTDGVNTA--GVLDPLKAAELAKAEHVRVYTIALGGDGGGMSLFGMPIP 261 Query: 333 ---------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + LR A + F+ + + ++ + + Sbjct: 262 GSGGDDEVDEDTLRKIAQDTGGRFFRARDTAQLASIYAELDR 303 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 75/217 (34%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+MI +D+S SM+ + ++ M ++ ++ ++ + GL+ F++ Sbjct: 88 DLMIAVDLSGSMKIDDMQVNGRQVDRLQMIKSVLHDFIQR-RIGD------RLGLIFFAD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + + + G T + A + + Sbjct: 141 TAYLQAPLTYDRETVSQLLNESLIGLVGEQTAIGDAIGLAIKRFKSKEESN--------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------S 332 K+++ +TDG+N + + A G +Y IG+ + S Sbjct: 192 --KVLILLTDGQNTAGNITPEQAN--ELAINNGVTLYTIGVGADQMLVQSIFGSRQVNPS 247 Query: 333 HEF----LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L A + ++ + S+ + +S + + Sbjct: 248 QELDEGMLTTLAESTGGRYFRARDAQSLTEIYSKLDE 284 >gi|302527162|ref|ZP_07279504.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436057|gb|EFL07873.1| von Willebrand factor [Streptomyces sp. AA4] Length = 326 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 77/231 (33%), Gaps = 30/231 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + R +M+V+DVS SME+ D + T++ A + Sbjct: 71 LTVSLAGPTAEQKVPRNRATVMLVIDVSLSMEA-TDVAPTRLKAAQDAAKQF------AQ 123 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ V GL++F+ + + + I L ST + G+ A I Sbjct: 124 NMTPGVNLGLISFAGTATVLVNPTTDRAGVTKAIDNLK-LAQSTATGEGIYAAMQSIQSF 182 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + IV M+DG+ + + + AK+ + +I Sbjct: 183 SAVVGGADGPPPAR---IVLMSDGKQTVPEDLYAPRGAYTAAQAAKQAQMPISSISFGTE 239 Query: 331 RS--------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 E LR A S FY + + ++ +G+ I Sbjct: 240 HGSVDIEGKQQDVRVDDESLREIARLSGGEFYKAASADELKRVYADLGEQI 290 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 77/247 (31%), Gaps = 62/247 (25%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A + A + E+ V+ G+V F+ + Sbjct: 88 IMLAMDVSGSMRAA-DVHPDRLTAAQDAAKAFIAELP------RHVRVGIVAFAGSAQLA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 L L + I + T + G+ + +F G+ + Sbjct: 141 QLPTQNHEDLFKAIDSF-QLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSM 199 Query: 290 ------------------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 I+ +TDG+ + + ++ + A RG Sbjct: 200 DEIGRPPHRGSNGRGADRPAPVAPGSYSSAAIIMLTDGQRTTGVDPMEAAQW---AADRG 256 Query: 320 AIVYAIGIRVIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 VY +G+ + + L+A A + ++ + + + Sbjct: 257 VRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLS 316 Query: 363 KDIVTKR 369 + ++ Sbjct: 317 SRLTVEK 323 >gi|150251392|gb|ABR68008.1| matrilin-like 85 kDa protein [Lehmannia valentiana] Length = 716 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 67/194 (34%), Gaps = 20/194 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++++ D S S+ + K + K + +K +N V+ G V FS K + Sbjct: 529 DIIMLFDASNSI---LLENFDKQFIFAKRL------IKNFKIGSNDVRFGGVVFSQKTQL 579 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ L + + + ST + N + D I Sbjct: 580 LFNLKDHDDFDGLSKGLTNVKYLDSSTKTDEAFNLVVNDKMFSVEKGGRVSAPD-----I 634 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVE 349 ++ TDG S +++ + KK G + ++ I + LR +S P Sbjct: 635 VLLFTDGNPTSP---TKTITSADTVKKNGISIISLAIGKDLDMDILRTISSKPEFAIEAT 691 Query: 350 NPHSMYDAFSHIGK 363 N + + + Sbjct: 692 NYDMLDYVEKKLAQ 705 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 69/203 (33%), Gaps = 34/203 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +V+D S SM ++I + E + N V G+ F Sbjct: 221 EAGDSVCVQLVIDTSGSMGG------SRIANTKSGAKQLAETILDANPDNQV---GVTRF 271 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +N L + ++ I LS G TN+ G+ ++ H N Sbjct: 272 NNGASTPQQLTDDLDDVEAAIDGLSASG-GTNAQAGVDAGQAEL----ENCPHDN----- 321 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 +++V DG+ + AK G ++AIG+ S L AS Sbjct: 322 --RVMVVFGDGDINTDGS---------AAKVAGTEIFAIGVG-GASFSDLEDLASDPADE 369 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 + + ++ F + + I Sbjct: 370 HVFFAIDDGAIEQIFGQVAETIT 392 >gi|17538702|ref|NP_499959.1| hypothetical protein C18H7.1 [Caenorhabditis elegans] gi|14573846|gb|AAF98615.2| Hypothetical protein C18H7.1 [Caenorhabditis elegans] Length = 425 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 47/273 (17%), Positives = 98/273 (35%), Gaps = 27/273 (9%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + ++ +G + K G IL NTW + + ++ DI +++ Sbjct: 133 IPTLVMVVTDGRSADDPK-GPGQILQAQPNTWVFAAATGDPEKVDTRELMDITG--NINH 189 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 +V+ + + R + + T++ + T LD+++VL Sbjct: 190 IVMHRGRDLATDITRRLLRQAQ-----DKCRTTTTTTTTTTTTTTTDPITGCELDLVLVL 244 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S + + ++S ++ ++K+ P V T F L ++ Sbjct: 245 DFSTTTDPVYNSY-------KDLSKRLVSQLKIGPHYTQVAAVTFATVGRTRVRFNLKKY 297 Query: 239 GV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + R I L G +T G++ A Q+ + +G R T K++V TDG Sbjct: 298 QTQEEVLRGIDNLKSRGGTTAIGAGIEKALTQLDESEGARPGIAT------KVMVVFTDG 351 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 N ++++ +A G +Y + Sbjct: 352 WSNKGPDPEKRAR----DAVSSGFEMYTVAYTA 380 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 55/136 (40%), Gaps = 24/136 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++IVLD S S++ F+ S K + + EE+++ P + L+ +S Sbjct: 34 LNVLIVLDRSDSVKGGFNKS-------RKFVTDVSEELQIGP---EKHRVALIVYSGLSY 83 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ W + RK+ L G +TN+ L+ + + + Sbjct: 84 RREVMPWNFAKSNEEFVRKVNALRAIGGTTNTKKALEVGLELM----------AQRNVSI 133 Query: 288 KKIIVFMTDGENLSTK 303 +++ +TDG + Sbjct: 134 PTLVMVVTDGRSADDP 149 >gi|7766811|pdb|1CK4|A Chain A, Crystal Structure Of Rat A1b1 Integrin I-Domain. gi|7766812|pdb|1CK4|B Chain B, Crystal Structure Of Rat A1b1 Integrin I-Domain Length = 198 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 77/213 (36%), Gaps = 35/213 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + + I +N +L+ + + P Q G+V + Sbjct: 3 TQLDIVIVLDGSNSIYPW--------ESVIAFLNDLLKRMDIGP---KQTQVGIVQYGEN 51 Query: 230 IEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L + + + G+ T + G+ A + F Sbjct: 52 VTHEFNLNKYSSTEEVLVAANKIGRQGGLQTMTALGIDTARKEAFT------EARGARRG 105 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF--------LR 337 KK++V +TDGE+ +Q + C ++I I ++ Sbjct: 106 VKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAILGHYNRGNLSTEKFVEEIK 162 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 + AS F+ V + ++ +G+ I Sbjct: 163 SIASEPTEKHFFNVSDELALVTIVKALGERIFA 195 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 79/229 (34%), Gaps = 51/229 (22%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S+T +D+M+ +DVS SM + D +I+ A + + + + +N GL Sbjct: 81 DSKTVEGIDIMLAMDVSTSMLAE-DLRPNRIEAAKQVASEFI----IGRPNDN---IGLA 132 Query: 225 TFSNKIEEFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 F+ + + + +++ I T GL A +++ D + Sbjct: 133 IFAGESFTQCPMTTDHASLLNLLQNVRTDIAARGLIEDGTAIGMGLANAVSRLKDSKAKS 192 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----- 332 K+++ +TDG N + D AK G VY IG+ + Sbjct: 193 -----------KVVILLTDGSNN--RGDISPSTAAEIAKSLGIRVYTIGVGTNKVAPYPM 239 Query: 333 ----------------HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 + L AS FY N + + + I + Sbjct: 240 PVAGGVQYVNVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKEIDQ 288 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 67/217 (30%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + M Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N + + + A R +Y +G+ Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 331 ---RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 ++ + ++ N + + I K Sbjct: 245 MDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDK 281 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 77/181 (42%), Gaps = 32/181 (17%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + + +DA+LD ++V+D S SM+ + + ++ + ++ + L + Q Sbjct: 22 SMLMAATSDAQLDAILVIDASGSMKETDPN-----KLGLEGVKLFVDMLGLTDN-----Q 71 Query: 221 SGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 G+VT+ + + + + + +++ + +++ T+ T GLK A + Sbjct: 72 VGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAVKML----- 126 Query: 276 MRQHCNTEDANYKKIIVFMTDGEN--------LSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 Q + ++ IV TDG N D+ ++A+ G +Y IG+ Sbjct: 127 -NQRNASGNSPL---IVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPIYTIGL 182 Query: 328 R 328 Sbjct: 183 N 183 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 40/224 (17%), Positives = 90/224 (40%), Gaps = 24/224 (10%) Query: 113 STSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQ-TD 169 T + + ++ S++ + K K FI I + ++V ++ Sbjct: 24 KTRKNHLYSVKHPRVSMAHTLKSKYYAKDIPFILLMLGLTFSIIGLARPATVDSSANING 83 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + + +V+D+S SM + D T+++ + K++ +++ N + LV F+ + Sbjct: 84 EGIYISMVVDISPSMMAE-DMLPTRLEASKKTMADFIKK-------RNFDKISLVAFALR 135 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L+++I + ST+ G+ A + + ++ + + Sbjct: 136 ASVLSPSTFDYTSLEKEIGNIKIDEEGSTSIGLGIATAVDMLRSVK----------DDAE 185 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K+I+ +TDGEN S + D + A +Y IGI Sbjct: 186 KVIILLTDGENNSGEIDPKLAS--EIASNFNIKIYTIGIGDAAG 227 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 82/230 (35%), Gaps = 39/230 (16%) Query: 148 YTNSRHIVM-PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 N + P ++ + A +D++ VLDVS SM TK+ + +++ ++ Sbjct: 240 CANPEQVTGRPFNATSIGYPTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVI 293 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPG 262 + + + ++ FS+ F L G + + L G TN Sbjct: 294 QHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGA-GGGTNIADA 346 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST-KEDQQ-SLYYCNEAKKRGA 320 LK A I R + N + I+ ++DG++ + + + Sbjct: 347 LKKAAKVI----EDRNYKNPVCS-----IILLSDGQDTYNISSNVRGTRPDYRSLVPSSI 397 Query: 321 IVYAI--------GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + I G + L + A S +F +E+ + DAF+ Sbjct: 398 LNHTICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQ 447 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 82/230 (35%), Gaps = 39/230 (16%) Query: 148 YTNSRHIVM-PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 N + P ++ + A +D++ VLDVS SM TK+ + +++ ++ Sbjct: 240 CANPEQVTGRPFNATSIGYPTSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVI 293 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPG 262 + + + ++ FS+ F L G + + L G TN Sbjct: 294 QHLGPSD------RLSVIAFSSTARRLFHLRRMSHSGRQQALQAVNLLGA-GGGTNIADA 346 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST-KEDQQ-SLYYCNEAKKRGA 320 LK A I R + N + I+ ++DG++ + + + Sbjct: 347 LKKAAKVI----EDRNYKNPVCS-----IILLSDGQDTYNISSNVRGTRPDYRSLVPSSI 397 Query: 321 IVYAI--------GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + I G + L + A S +F +E+ + DAF+ Sbjct: 398 LNHTICTVPVHGFGFGADHDSDALHSIAESSGGTFSFIEDESVIQDAFAQ 447 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 43/283 (15%), Positives = 84/283 (29%), Gaps = 33/283 (11%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT----FIPWYT--NSRHIVMPITSSVK 163 +V S + + PQ ++ ++ KI W Sbjct: 410 LVDSHNFTLYAAPQTFTINVPSVENLKIEATPVNGTKELHLWVEDGGLYGPYSSSNGEAY 469 Query: 164 VNSQTDARLDMMIVLD-VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +V D E + + I KID A + +K V G Sbjct: 470 ETTNAGGTYTAYVVADFPYGEQEFYLNVYIAKIDAAKIAAKTFNGFLKSSDQVGVAYFGG 529 Query: 223 LV--TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V ++ + + L I L +G T G+K A ++ Sbjct: 530 DVPGGYTPRYDVSQTLTNDTLSANNSIDDLWAYG-GTPMGGGIKVARQELVANTAPGNIP 588 Query: 281 NTEDANYKKIIVFMTDGENLSTKED--------QQSLYYCNEAKK-----RGAIVYAIGI 327 +++ ++DG T + Q+++ K+ ++Y IG Sbjct: 589 ---------VMIVLSDGNPTLTSDGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGF 639 Query: 328 RVIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + L+ A SP+ +Y + + I K++ K Sbjct: 640 GNDANETLLKQIATSPDYYYFAATSEELSSIYRQIAKELKEKA 682 >gi|324231484|emb|CBZ42120.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|324231581|emb|CBZ42125.1| C. elegans protein C29A12.6b, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 980 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 89/250 (35%), Gaps = 34/250 (13%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 S+ K P + + + + S T D+ ++DVS Sbjct: 756 RSRSVQFAMTEKPPFTTVMNPMKFFTTTRTPITKPKPLIPYSCTA---DVFFLVDVS--- 809 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----IEEFFLLEWGV 240 + D S +D+A +I+++ P + V+ GL+++S + F Sbjct: 810 QGTGDKSQQYLDIAASAISSL-------PISQDTVRVGLISYSGPGRTHVRVFLDKHNEK 862 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L ++ + + G +T + ++YA + + H N KK++V TDG + Sbjct: 863 EKLIEEMFLMERHGGTTRTADAIRYA----TKIFEGKAHPA--RKNVKKVLVVFTDGYSQ 916 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACASPNSFYLVENPHSMYD 356 ++ + A+ +G + A+ + + + + S ++ + + + Sbjct: 917 DNPKEASRM-----ARAKGIQLIAVAVKDRLAPPDTEQLTEIGGNGRSIFISPSGRDLRE 971 Query: 357 AFSHIGKDIV 366 IG Sbjct: 972 --KIIGTQCQ 979 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/214 (19%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 166 QTYMDIIIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 214 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 215 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAYGIEFARSEAFQKGGRKG------ 267 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + +Q + + ++K YA+ + R I FL Sbjct: 268 --AKKVMIVITDGESHDSPDLEQVI---DASEKDNITRYAVAVLGYYNRRGINPEAFLSE 322 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 323 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 356 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 76/223 (34%), Gaps = 25/223 (11%) Query: 161 SVKVNSQTDARL--DMMIVLDVSRSM--ESFFDSSITKIDMAIK--SINAMLEEVKLIPD 214 ++ QT L D+M+V+D+S SM + F + + K + A E + Sbjct: 85 TILGEPQTRESLGRDVMVVVDLSGSMAEQDFVSKQSSDLGAVKKISRLEATKEVLADFVK 144 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQ 269 + GL+ F + F+ + ++ L++ G ST+ + A Sbjct: 145 TRKGDRLGLILFG---DAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDAIGLAIKV 201 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + +K+ + +TDG T + + A + ++ I + Sbjct: 202 FEQSSEDKASAEENAKPREKVAIVLTDG--NDTGSYVEPIDAAKVAAAKDVRIHMIAMGD 259 Query: 330 IR--SHEFLR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 R + L A S + N + A+ IG+ Sbjct: 260 PRTVGEQALDMNIINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 77/228 (33%), Gaps = 33/228 (14%) Query: 151 SRHIVMPITS----SVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAM 205 ++ + +P ++ + +++V+DVS SM + TK+++A ++ Sbjct: 392 AKTLSVPAPRVLDRILRSWDELRKPAHVLMVIDVSGSMGADVPGTGQTKLELAKQAAINA 451 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEE-------FFLLEWGVSHLQRKIKYLSKFGVSTN 258 L ++ Q GL FS + + L+++I+ L G T Sbjct: 452 LPQLGPND------QVGLWMFSTNQDGGKDYRELVPMGRNNRDLLKKRIQGLIP-GGGTG 504 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + AY + + +V +TDG+N + Sbjct: 505 LYDTTRAAYRTVL---------ERHSNDVINAVVVLTDGKNEDDNSISLEDLLAELRTET 555 Query: 319 G---AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 G V+ I E LR + + + Y P S+ F+ + Sbjct: 556 GQETVRVFTIAYGNDADLEVLRQISQVTDAAAYDSREPGSIDQVFTAV 603 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 61/175 (34%), Gaps = 37/175 (21%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + I + G +TN G+ + I + +D Y K + Sbjct: 345 PVTPLTSTERTVLNAIGDMGASG-TTNIPNGVGWGIRLISPGAPFTEGSAWDDDEYIKAM 403 Query: 292 VFMTDGEN---------------------------------LSTKEDQQSLYYCNEAKKR 318 V +TDG+N LS + D ++ C A+ Sbjct: 404 VILTDGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSNVLSNELDDRTEAACAYARSL 463 Query: 319 GAIVYAIGIRVIRS--HEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 G VY I +V S ++ CAS P+ ++ + ++ DAF I D+ R+ Sbjct: 464 GIRVYTITFQVNSSSTRSLMQNCASNPSLYFDSPSSEALEDAFEMIAGDLTNLRL 518 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/247 (14%), Positives = 84/247 (34%), Gaps = 47/247 (19%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAI-FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV 59 +F+L +G + + A+ +P+ L G +++S + Sbjct: 6 LFNLGA---RLCRETRGNVATIFALTLVPVALLSGG-AVDLSQSMNAR------------ 49 Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 + + + ++ N+ + +V++ ++ I Sbjct: 50 -------SRLAQALDAAALAVGVNTNLSSSEATGIANDFIAANYPGRELGVVQNVNVYI- 101 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + ++S +R T + +I + S V+ Q RL++++VLD Sbjct: 102 -DDETDTVTVSGEAR------VRTTMLGMIGLDYITVHWESEVQRARQ---RLELVMVLD 151 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 + SM +KI +S + + D + V+ GLV F+ + G Sbjct: 152 NTGSMGG------SKIRNLRESAELLTGILFDAADDPSDVKIGLVPFAATVNV------G 199 Query: 240 VSHLQRK 246 +H + Sbjct: 200 TNHARAW 206 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 55/418 (13%), Positives = 124/418 (29%), Gaps = 66/418 (15%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F N G + I+ A+ L I +G I+ S K L + +D +++ A Sbjct: 11 LSRFRRNESGNIAIIFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAAMLAATGYTAM 70 Query: 68 EGNGNNRKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 G + K ++ + + N + S ++ + Sbjct: 71 RGTAADAKTSATNMYNGQMSSHKLTSNSLNITVTDSVTARTVTGTASVVVNTAFMYMFGF 130 Query: 127 YSISAISRYKIPLKFCTFIPWY----------TNSRHIVMPITSSVKVNSQTDARLDMMI 176 +++ + F T++ +Y + M + + A D Sbjct: 131 PTMTVTASSSASASFPTYMDFYVLVDNSPSQGLGATTADMTTLQNATSDKCAFACHDTY- 189 Query: 177 VLDVSRSMESFFDSS-----------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + S + ++ +ID+ + ++ + V+N + + + Sbjct: 190 ----TSSTKKTLQTNSYYQIAKNKGVTMRIDVVRSATQSLTDTATSSQVVSNQYRMAVYS 245 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT--- 282 + L S + + P Y + D G N Sbjct: 246 LGSDCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGAMSGMNGVIP 305 Query: 283 -----EDANYKKIIVFMTDG--ENLSTKEDQQSL------------YYCNEAKKRGAIV- 322 + +K + F++DG + +++ C+ K RG + Sbjct: 306 AQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQEPLNTTTCDTLKARGIKIA 365 Query: 323 --------------YAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 Y I R +++CASP +Y V++ S+ A + + + Sbjct: 366 VLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCASPGYYYEVDSSGSIGAALTALFQQ 423 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 45/276 (16%), Positives = 94/276 (34%), Gaps = 45/276 (16%) Query: 123 QNEGYSISAISRY-KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 ++ +++ + Y P K I W S + + + A +D+++V+D S Sbjct: 2 NDKKLTVACSTEYKDYPFKEKLDI-WTLISLKAPSLGMTLDEKEHRKRAPIDLVVVIDKS 60 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE---- 237 SM K+ + K++ ++ ++ N + L+TF + F L Sbjct: 61 GSMAG------EKLALVKKTLEFVVSQL------NEKDRLCLITFDTSVYLDFKLTPMTP 108 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + IK +S ++ +K I + + ++ TDG Sbjct: 109 MNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVIDRADEEKNEVAS--------VLLFTDG 160 Query: 298 -------ENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFY 346 N+ Q + Y K A +Y G + + L+ A +Y Sbjct: 161 FANKGGLTNIYCSSSQTAKYTIGIVGPKTADASIYTFGFGSNHNAQMLKEISDAGSGMYY 220 Query: 347 LVENPHSMYDAFSH--------IGKDIVTKRIWYDK 374 +EN + +AF + + I + + +K Sbjct: 221 YIENVDMIAEAFGQCLGGLLSTVAQGIQVEIMMENK 256 >gi|149732356|ref|XP_001492551.1| PREDICTED: similar to complement component 2 [Equus caballus] Length = 751 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 78/250 (31%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 L T + + K+ Q L++ ++LD S+S+ Sbjct: 215 DVAPALGTSLSHLLGATNPTQKKKENVGR----KIQIQRSGHLNLYLLLDASQSVSE--- 267 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + S M++ + V ++TF+++ L + I Sbjct: 268 ---KDFGIFKNSAILMVDRIFSFEIN---VSVAIITFASRPRIVMSVLHHNSRDVMEVIN 321 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L + G TN+ L Y + + + ++ +TDG++ Sbjct: 322 SLDNIHYKDHENGTGTNTYEALNSVYIMMNNQMQRLGMNTVAWQEIRHAVILLTDGKSNM 381 Query: 302 ------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + L + + +YAIG+ + +E +++++ Sbjct: 382 GGSPKLAVDNIKELLNIKQKRNDYLDIYAIGVGNLDVDWRELNELGSKKDGERHAFILKD 441 Query: 351 PHSMYDAFSH 360 ++ F H Sbjct: 442 AEALSQVFEH 451 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 60/173 (34%), Gaps = 37/173 (21%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + I+ + G +TN G+ + + + + ++ K +V Sbjct: 327 TPLTNQRNVIDDAIEDMIASG-TTNIPIGISWGVRVLSPGMPFTEGVSYDEEGTIKAMVV 385 Query: 294 MTDGEN-----LSTKE----------------------------DQQSLYYCNEAKKRGA 320 +TDGEN + + ++ C AK G Sbjct: 386 LTDGENYLDGRNNPNYSHYSGYGYMRDGRLGIQTSSDSTIRNALNDRTEAACEYAKSLGI 445 Query: 321 IVYAIGIRVIRS--HEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 VY I +V S + +R CA+ P ++ + ++ AF I D+ R+ Sbjct: 446 RVYTITFQVNSSSTRDMMRDCATHPTLYFDSPSDDALRSAFEMIAGDLTNLRL 498 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 78/222 (35%), Gaps = 39/222 (17%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + +L ++ L I ++ G ++++ + + Sbjct: 12 FPRNRRGNVAMLFSLMLIPITVLSGGAVDINQALNAR-------------------ARLS 52 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNE-LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + T +E + N ++ + + + + P + ++ Sbjct: 53 AALDAAALAVGVHTSVSETEAAGIASEFIAANYPDRELGLVGN---IVVQLDPDQDRVTV 109 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 A SR + T + +I + S V+ + L++++VLD + SM Sbjct: 110 GAESRVE------TIVLGLIGIEYITVHWESEVQR---ARSSLELVMVLDNTGSMGG--- 157 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +KI + + + + D N ++ GLV FS + Sbjct: 158 ---SKISSLRSAGLLLTDILFDGADPNR-LKIGLVPFSATVN 195 >gi|47218626|emb|CAG04955.1| unnamed protein product [Tetraodon nigroviridis] Length = 320 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 65/163 (39%), Gaps = 21/163 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ +NA+++ V + N + GLV + Sbjct: 6 CKERPMELVFVIDSSESVGP------ENYEIIKDFVNALVDRVTV---GRNATRIGLVLY 56 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S ++ F L +++ I+ + G T + ++ A + F Sbjct: 57 SLEVNLVFNLARYSNQQDIKQAIRNIPYMGEGTYTGTAIRKATQEAFHGS---------R 107 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 K+ + +TDG+ +E + EA ++A+GI Sbjct: 108 LGVSKVAIVITDGQ-TDKREPVKLDIAVREAHAANIEIFALGI 149 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 75/202 (37%), Gaps = 39/202 (19%) Query: 174 MMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SM + D T+++ + + +EE + GLV F+N+ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGLVLFANR 142 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + ++ + R I+ S G ST GL A ++ + Sbjct: 143 AYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESAAAS---------- 192 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EFL 336 ++IV ++DG++ + + D + A + G ++ I + L Sbjct: 193 -RVIVLLSDGQDNAHQID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATL 249 Query: 337 RAC--ASPNSFYLVENPHSMYD 356 RA AS Y V + Sbjct: 250 RAIAEASGGRSYRVRGMEDLRA 271 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 175 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 223 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 224 DVVHEFHLNDYRS-VRDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 276 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK++V +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 277 --AKKVMVVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPEAFLNE 331 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 332 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 365 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 67/208 (32%), Gaps = 28/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A ++ ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLA-GDVDPDRLTAAKEAGRRFVDGLP------DEFNVGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L I L G++ + A + +D +II+ Sbjct: 141 VPPSTDREALHDGIGRL-AEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPARIII- 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + ++A V+ I + LRA Sbjct: 199 LSDGANTS---GMDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAV 255 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + F+ + D + IG + Sbjct: 256 AEQTGGGFHEASTTAELKDVYEDIGTSV 283 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 144 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 192 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 193 DVVHEFHLNDYRS-VRDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 245 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK++V +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 246 --AKKVMVVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPEAFLNE 300 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 301 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 334 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 67/208 (32%), Gaps = 28/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A ++ ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLA-GDVDPDRLTAAKEAGRRFVDGLP------DEFNVGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L I L G++ + A + +D +II+ Sbjct: 141 VPPSTDREALHDGIGRL-AEGITGVQGTAIGEAISTSLGAVKSLDATAAKDPPPARIII- 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + ++A V+ I + LRA Sbjct: 199 LSDGANTS---GMDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVDGQTLRAV 255 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + F+ + D + IG + Sbjct: 256 AEQTGGGFHEASTTAELKDVYEDIGTSV 283 >gi|340082|gb|AAA36794.1| undulin 1 [Homo sapiens] Length = 843 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 57/167 (34%), Gaps = 15/167 (8%) Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + + GL +S + L + + ++ L G +T + L Y + Sbjct: 1 FDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFEN 60 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F + KI + +TDG K + ++ G ++AIG++ Sbjct: 61 SFKPE------AGSRTGVSKIGILITDG-----KSQDDIIPPSRNLRESGVELFAIGVKN 109 Query: 330 IRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +E + P+ Y V M+ + + + ++ D+ Sbjct: 110 ADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRVEEQDR 156 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 53/301 (17%), Positives = 100/301 (33%), Gaps = 56/301 (18%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 E+ + D ++RS + V +N+ SI + T + + Sbjct: 338 GEIYLDLSEFDSQPMMRSFQMMTVNPYENDHLSIDKTA---------TTLDLEERMYQVD 388 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 IT + + +D+++V+D S SM + + A + I D Sbjct: 389 FEITG-----TPPEKPVDVILVIDTSGSMGTRIPGD--SKAPLYYAKLAAINFANSIIDE 441 Query: 216 NNVVQSGLVTFSNKIEEFFL-------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N + G++ FS + L ++L I L+ + TN G + AYN Sbjct: 442 NPDSRVGVIEFSGGYYGYASDASTVINLTNNKANLASSINGLTTHNM-TNIQAGFRLAYN 500 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG------ENLSTKEDQQSLYYCNEAKKRGAIV 322 +I + R K +VF+TDG N S+ + A G + Sbjct: 501 KISAISSTRDSV--------KSVVFLTDGVANVSIGNWSSSNPVVHNTHTIAAYTEGQSL 552 Query: 323 Y--------AIGIRVIRSHEFLRACASP----------NSFYLVENPHSMYDAFSHIGKD 364 Y IG+ S+ +++ A +Y + + + I + Sbjct: 553 YSYINGNLFTIGLFGAISNSSVKSIARDTLQKAVYDDLEKYYEASSAVDLGPVYETISQK 612 Query: 365 I 365 + Sbjct: 613 L 613 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 89/274 (32%), Gaps = 51/274 (18%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT----FIPWYT 149 LR + I I S + + +G + Y L I +T Sbjct: 504 IATALRKRNSAHTIT-IGASVVANTIAEENCKGLMSKLWTTYSHQLPSYRSSSSQIVQFT 562 Query: 150 NSRHIVMPITSSVKVN-------------SQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +SR T+ KV + LD++ ++D S S+ + Sbjct: 563 DSRKQQSSETTENKVEKLGQYFKKNIRRLREKSLSLDLVFLVDESSSVGH------SNFV 616 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKI 247 ++ + +L + ++P + ++TFS+K +++ S L R+I Sbjct: 617 NELRFVKKLLSDFPVVPSA---TRVAIITFSSKTNVQTRVDYISSSEPHQHKCSLLNREI 673 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 ++ G T + + A + +N K+I +TDG + Sbjct: 674 PAITYKGGGTFTKGAFQQAAQILRYS----------RSNSTKVIFLITDGYSNG----GD 719 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + G ++ +GI E L AS Sbjct: 720 PRPIAANLRDLGVEIFTVGIWQGNIRE-LHDMAS 752 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 64/208 (30%), Gaps = 28/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A +S ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLA-GDVDPDRLTAAKESARRFVDGLP------DEFNVGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L I L + + A N +D + IV Sbjct: 141 VPPDTDREALDEGIDRLVEGATGVQ-GTAIGEAINTSLGAVKALDGEAAKDPPPAR-IVL 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + +A + V+ I + L A Sbjct: 199 LSDGANTS---GMDPMEAAADAVEMEVPVHTIAFGTASGYVDRGGRPIQVPVDGQTLDAV 255 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + F+ + + + IG + Sbjct: 256 ARETGGQFHEAVSVEELRAVYDDIGSSV 283 >gi|311273688|ref|XP_003133992.1| PREDICTED: integrin alpha-2 [Sus scrofa] Length = 1186 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 75/210 (35%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 178 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 226 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ G TN+ ++YA + + Sbjct: 227 VVFNLNTFKTKAEMVEATSHTTQYGGDLTNTFKAIQYARDSAYSA------AAGGRPGAT 280 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 281 KVMVVVTDGESHDGSMLKAVIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 337 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + + + +G+ I Sbjct: 338 ASIPTERYFFNVSDEADLLEKAGTLGEQIF 367 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 73/189 (38%), Gaps = 32/189 (16%) Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 +++ D + L+ ++ DVN +VT++ + S ++ Sbjct: 298 SCYVSRADKVKDASYTFLDTLQEKEDVN----ISVVTYAGTASKVT-NSNLKSGIESAYN 352 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 L G TN+ G++ A + + K+IV ++DGE+ + Sbjct: 353 VLGTDG--TNTGRGIEIASQIL-----------SNSTAPNKMIVVLSDGESNAGNS---- 395 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRA---CASPN------SFYLVENP-HSMYDAF 358 N AK +G IVY IG + + CAS + FYL ++ +++ + F Sbjct: 396 RTAANSAKNKGCIVYTIGAGIASGSNGAKELFDCASVDQSTNKAKFYLADDTGNALNEIF 455 Query: 359 SHIGKDIVT 367 + I +I Sbjct: 456 AEIAGEIQE 464 >gi|118096903|ref|XP_425157.2| PREDICTED: similar to type VII collagen [Gallus gallus] Length = 1645 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 33/147 (22%), Positives = 61/147 (41%), Gaps = 23/147 (15%) Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++E F + +Q+ I+ LS G +T + GL+Y + F +R Sbjct: 7 RVEFGFSQHINGTGIQKAIQQLSYKGGNTRTGAGLRYISDNFFGPTQLRPG-------VP 59 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFY 346 KI + +TDG K + K G V+A+GI+ E +R + + + F+ Sbjct: 60 KICILITDG-----KSQDDAEQPALRLKSLGIKVFAVGIKNADRQELIRVASLPTDSFFF 114 Query: 347 LVENPHSMYDAFSHIGKDI--VTKRIW 371 V + F +G + +T+R+ Sbjct: 115 YVGD-------FKLLGTLVPLMTRRVC 134 >gi|120609754|ref|YP_969432.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] gi|120588218|gb|ABM31658.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] Length = 355 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 77/247 (31%), Gaps = 62/247 (25%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A + A + D+ V+ G+V F+ + Sbjct: 88 IMLAMDVSGSMRAA-DVHPDRLTAAQDAAKAFIA------DLPRHVRVGIVAFAGSAQLA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 L L R I + T + G+ + +F G+ + Sbjct: 141 QLPTQNHEDLFRAIDSF-QLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSM 199 Query: 290 ------------------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 I+ +TDG+ + + ++ + A RG Sbjct: 200 DEIGRPPHRGGNGKGADRPAPVAPGSYTSAAIIMLTDGQRTTGVDPMEAAQW---AADRG 256 Query: 320 AIVYAIGIRVIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 VY +G+ + + L+A A + ++ + + + Sbjct: 257 VRVYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLS 316 Query: 363 KDIVTKR 369 + ++ Sbjct: 317 SRLTVEK 323 >gi|284052945|ref|ZP_06383155.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569123|dbj|BAI91395.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 489 Score = 62.1 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 70/199 (35%), Gaps = 28/199 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++++D S SM S +K+ ++ + + L D +V FS++ Sbjct: 54 VVMLIDTSGSM------SGSKLPEVQRAASEFVSRQNLKRDD-----LAVVEFSSRASVV 102 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 LQ+ I LS +G TN + G A + + + I+ Sbjct: 103 ADFTRDERELQQAIARLSAWG-GTNLSEGFNLATSVLQNSDRPGN------------ILL 149 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 TDGE + + + + + G + A+G + P+ + N Sbjct: 150 FTDGEPNNRR---MAASIAQQIRASGINLVAVGTGDAPVNYLTALTGDPDLVFYA-NFGD 205 Query: 354 MYDAFSHIGKDIVTKRIWY 372 + AF K I +++ Sbjct: 206 LDSAFRGAEKAIYGQQLVE 224 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 72/217 (33%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N G+ F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L +K + T G+ A ++ D + K Sbjct: 140 TQCPLTVDHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------------------- 329 +I+ +TDG N K D L AK G +Y IG+ Sbjct: 189 VIILLTDGVNN--KGDISPLTAAEIAKSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMP 246 Query: 330 --IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 VEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|294997269|ref|NP_001025043.3| integrin alpha-D isoform 1 [Mus musculus] Length = 1202 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 85/241 (35%), Gaps = 56/241 (23%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 151 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 199 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLK-------------------- 264 SN ++ F S L Q + + + T + G++ Sbjct: 200 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVDSNRLLLLRVWSALKPG 259 Query: 265 ---------YAYNQI-FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + + ++ F + + N + KKI++ +TDG+ + + E Sbjct: 260 CHYNTHMPWWVWQRVRFSWKELFHSKNGARKSAKKILIVITDGQKFRDP--LEYRHVIPE 317 Query: 315 AKKRGAIVYAIGIR-------VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+K G I YAIG+ ++ + + S + + V N + A I + I Sbjct: 318 AEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGN----FVALRSIQRQIQE 373 Query: 368 K 368 K Sbjct: 374 K 374 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 110/341 (32%), Gaps = 61/341 (17%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFR----NELRDNGFVNDIDDIVRSTSLDIV-V 120 + EG+ N + S E R + + N I + + I Sbjct: 450 LKEGDVINPADYAWNKDFLTYAGSSQDSIVISSQTENRVDLYYNAIAEEKSEDEVTITGA 509 Query: 121 VPQNEGYSISA----ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 +P++ S+ A P K ++ I I + D+++ Sbjct: 510 IPRSYFRSVKADDNTTPGKVFPTKTAEWVDEANGIGKINFTIYGNPIRRGS-----DVIL 564 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSN------- 228 V+D S SME K A + ++ + + V + + +V F + Sbjct: 565 VIDSSGSMEG------EKWSTAKTAAKGFIDNLYQNKDGVVSDDRIAIVDFDSSAKAYPG 618 Query: 229 ---------KIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPGLKYAYNQIFDMQ 274 K+++ ++ + +K K T+ L+ A + I Sbjct: 619 TNSGSETFLKVDDKITIKNKTYSAKDYLKSYVLDSQMKDTGGTDYNKALQTAQSVI---- 674 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLS-----TKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 N D++ IVFM+DGE T Y E K G +Y++G+ + Sbjct: 675 -----NNRRDSSRPAYIVFMSDGEPNGYWDWLTYRYYDGQKYATELKSDGVTIYSLGLNI 729 Query: 330 IRS--HEFLRACASPNSFYLVEN---PHSMYDAFSHIGKDI 365 + ++F+ AS + +N + + I I Sbjct: 730 GSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIYDAIASSI 770 >gi|74220882|dbj|BAE33629.1| unnamed protein product [Mus musculus] Length = 1202 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 85/241 (35%), Gaps = 56/241 (23%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S++ S T++ +K++ L L+ + Sbjct: 151 CPGQEMDIAFLIDGSGSID---QSDFTQMKDFVKALMGQLASTSTSFS--------LMQY 199 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLK-------------------- 264 SN ++ F S L Q + + + T + G++ Sbjct: 200 SNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTASGIQKVVDSNRLLLLRVWSALKPG 259 Query: 265 ---------YAYNQI-FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + + ++ F + + N + KKI++ +TDG+ + + E Sbjct: 260 CHYNTHMPWWVWQRVRFSWKELFHSKNGARKSAKKILIVITDGQKFRDP--LEYRHVIPE 317 Query: 315 AKKRGAIVYAIGIR-------VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+K G I YAIG+ ++ + + S + + V N + A I + I Sbjct: 318 AEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGN----FVALRSIQRQIQE 373 Query: 368 K 368 K Sbjct: 374 K 374 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 62.1 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 70/213 (32%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMQVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFIAKREGDRIGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D +++ Sbjct: 142 QTPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSNA-----------PQRV 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHEF- 335 ++ ++DG N + L + AK+ +Y +G+ + + Sbjct: 191 MILLSDGSNTA--GVLDPLEAADIAKQYHTTIYTVGVGAGEMVVKDFLFSRKVNTAQDLD 248 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + + I + Sbjct: 249 EKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 83/211 (39%), Gaps = 21/211 (9%) Query: 20 TILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIMNEGNGNNRKKL 77 IL LPI+ +++ +++S + + + + A+ + + N ++L Sbjct: 3 GILFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLALIASPKESEDENVKYARQL 62 Query: 78 KGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 ++ I + + R +G V + ++ + + +++ + I+ K Sbjct: 63 VDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAPFSDFVVSAKAEHKSW----IAYEK 118 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + LK + + +R + +D+ + D S SM + TK+D Sbjct: 119 VDLKPEFEVAGKSVTRKYL-------------PQPVDVYFIGDFSGSMNGHWKGGKTKLD 165 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++I ++E+++ + + L+ ++ Sbjct: 166 VVKQTIERVVEDIENF-NTEEKSRVALLGYN 195 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 81/236 (34%), Gaps = 29/236 (12%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSS--VKVNSQTDARLDMMIVLDVSRSMESFFD 189 ++ + L + + + V L++ +V+D S SM+ Sbjct: 1 MTTPQFQLLPLRAAVASDQPTTLDVLVKIIPPVPETRPQRPPLNLGLVIDRSGSMQG--- 57 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKI 247 K+++A ++ +E++ + + F +++E L + + R I Sbjct: 58 ---AKMEVARQAACFAVEQLLPSD------RLSVTIFDDRVECPVPSTLVRDKATIIRTI 108 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQ 306 + + G + L + + + QH N N ++ ++DG N+ Sbjct: 109 QGIHSRGST-----ALHDGW--VQGGIQVSQHLNPAHLNR---VILLSDGLANVGETNPD 158 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + + +RG +GI + L A A SFY +E + F Sbjct: 159 AIAQHVHGLAQRGVSTSTMGIGEDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQA 214 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/270 (15%), Positives = 85/270 (31%), Gaps = 46/270 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + ++ + Y + + F ++ ++ K + +D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFLLRCLALVLVILVLARPQTTNKWQNSEIEGIDIMLAI 94 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 95 DVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGETFTQCPLTV 146 Query: 239 GVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L I + T G+ A ++ D + K+I+ +TD Sbjct: 147 DHAVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTD 195 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSHE 334 G N K D + AK G VY IG+ I Sbjct: 196 GTNN--KGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKT 253 Query: 335 FLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + +++ + + + + I K Sbjct: 254 LTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/324 (15%), Positives = 113/324 (34%), Gaps = 51/324 (15%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFR-NELRDNGFVNDIDDIVRSTS 115 + + I + + + L + + + + D+ Sbjct: 173 ARLDTNRHISSLFHAQEPPIFDDDEALDQQHEIAHRNLSTKNDSSDSHSLGTIDVKTYPE 232 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + V + + + K P+ T I ++S H+ +P +++ + A +D++ Sbjct: 233 VSAVSRSASHD-NFCVLIHLKAPV---TSIRHNSSSNHMELP-----QMSQNSRAPVDLV 283 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 VLDVS SM TK+ + +++ +++ + + ++ FS+ F Sbjct: 284 TVLDVSGSMAG------TKLALLKRAMGFVIQNLGPSD------RLSVIAFSSTARRLFP 331 Query: 236 LE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L G + L G TN GL+ +G + + + N I Sbjct: 332 LRCMTEAGRQEALLSVNSLVSNG-GTNIAEGLR---------KGAKVIVDRKWKNPVASI 381 Query: 292 VFMTDGENLS---------TKEDQQSLYYCNEAKKRG----AIVYAIGIRVIRSHEFLRA 338 + ++DG++ + D +SL + + G V++ G + + Sbjct: 382 ILLSDGQDTYTVTSPSGMNPRADYKSLLPISIHRNGGTGLKIPVHSFGFGADHDAASMHS 441 Query: 339 CA--SPNSFYLVENPHSMYDAFSH 360 + S +F +E + DAF+ Sbjct: 442 ISEISGGTFSFIEAEGVIQDAFAQ 465 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 65/216 (30%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + + + I+ + + E + + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFTLIQHVVSEFIERRKGD------RIGLILFADH 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + A + M Sbjct: 139 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN---------- 188 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----------------- 330 ++++ +TDG N + D A R +Y +G+ Sbjct: 189 -RVLILLTDGSNNAGNIDPD--QAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSM 245 Query: 331 --RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 ++ + ++ N + + I K Sbjct: 246 DLDENQLKHIAEVTHGRYFRARNSQELEQIYQEIDK 281 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 75/222 (33%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKNVASEFIAD---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + ++ I T GL A +++ D + Sbjct: 140 TQCPMTTDHVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------ 332 K+++ +TDG N D + A+ G VY IGI + Sbjct: 193 ----KVVILLTDGSNNM--GDISPMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQ 246 Query: 333 ---------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + +FY N + + I + Sbjct: 247 YVNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|40068485|ref|NP_073745.2| von Willebrand factor A domain-containing protein 1 isoform 1 precursor [Homo sapiens] gi|74749155|sp|Q6PCB0|VWA1_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|37747866|gb|AAH59409.1| Von Willebrand factor A domain containing 1 [Homo sapiens] gi|56205506|emb|CAI22657.1| von Willebrand factor A domain containing 1 [Homo sapiens] gi|119576602|gb|EAW56198.1| von Willebrand factor A domain containing 1, isoform CRA_b [Homo sapiens] Length = 445 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 78/218 (35%), Gaps = 27/218 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + R D+M +LD S S+ + S + + + + V +P + Sbjct: 21 GAERGPPASAPRGDLMFLLDSSASVSHYEFSRV------REFVGQL---VAPLPLGTGAL 71 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ LV ++ F S Q ++ ++ T++ L YA Q+F Sbjct: 72 RASLVHVGSRPYTEFPFGQHSSGEAAQDAVRASAQRMGDTHTGLALVYAKEQLF------ 125 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K++V++TDG + E K G V+ + E Sbjct: 126 AEASGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNFLELSA 180 Query: 338 ACASPN----SFYLVENPHSMYDAF-SHIGKDIVTKRI 370 A ++P F V++ H + I + +++ Sbjct: 181 AASAPAEKHLHFVDVDDLHIIVQELRGSILDAMRPQQL 218 >gi|313241793|emb|CBY34008.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 68/212 (32%), Gaps = 29/212 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + LD++ V+D S S+ + + + + + Sbjct: 171 ITQDQCRTNALDIVFVVDESGSIGTN------NFQLIKDFLEHF---ASDSTIAADATRI 221 Query: 222 GLVTFSNKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + FS+ +F L ++ +IK + TN+ L A + D R Sbjct: 222 AIRPFSSSNYLYFSLNDFKTKNIINEIKNMPYNEGGTNTADALDAA---LTDYGTDR--- 275 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 K++V +TDG + S + + K R +AIG+ E Sbjct: 276 ----PESVKVMVTITDGASNSF---LSTSAAADRVKNDLRNIQSFAIGVSGANMAELNAI 328 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S +++ F I +++ K Sbjct: 329 AISAKHVFMLNGWAD----FGPIKSNLLQKVC 356 >gi|260808845|ref|XP_002599217.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] gi|229284494|gb|EEN55229.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] Length = 600 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 27/205 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ VLD S S+ DS KI + ++ V Q G++ +++ + Sbjct: 222 DIVFVLDYSGSI---PDSEFVKIKNFVAAL------VDRFQVGVLDAQIGVIRYNHAVIH 272 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + + + + +T + G + D + Sbjct: 273 EFHLNTHDNKADVLSDVSAMPTATTGGTNTAAALTYVASTMLLPGNGNRPDAPD-----V 327 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVE 349 ++ +TDG + L+ G +AIG+ S + + + P+ Y + Sbjct: 328 VIVLTDGYSSGVVGPASVLH------GMGVQTFAIGVGACANSAQLTQIASCPDYIYRLP 381 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + + A I + + +++ Sbjct: 382 D----FSALRSITAGMHDQICCHNQ 402 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 67/217 (30%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + M Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N + + + A R +Y +G+ Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 331 ---RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 ++ + ++ N + + I K Sbjct: 245 MDLDENQLKHIAEVTHGRYFRARNSQELDQIYKEIDK 281 >gi|313238855|emb|CBY13854.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 83/242 (34%), Gaps = 41/242 (16%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I++ K + F I+ + + D D++ V+D S S+ Sbjct: 319 IASKPADKYVFEINNFDQLILKRIGILGDVCQEAE---CPDIFADIVFVVDSSGSIGP-- 373 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKI 247 + D + ++ K+ N + +V ++ + + +S + +KI Sbjct: 374 ----KRFDYMKNWVKSIAASFKVGE---NFARFSVVQYTKTAKTVVDFQTLDLSSISQKI 426 Query: 248 KYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + F G T + L+ A+ + + + R K+I++ +TDG Sbjct: 427 DSMIYFQGRNGRGGKTFTGNALERAHTLLKESEPGR----------KRIVLLLTDG---- 472 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 D + + ++A+G+ R E + A + F++I Sbjct: 473 -SSDDEYGSVAKAIRDDKVDIFAVGVGRARKDELVEITADEQRVWQ-------TRTFNNI 524 Query: 362 GK 363 G+ Sbjct: 525 GQ 526 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 36/206 (17%) Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--------I 230 D+S ++S +I+ + +++E++++ + + + ++ FS + I Sbjct: 152 DLSFIVDSSSSITISDYQKLKTWMKSIIEKLEIGDNAS---RVSILQFSGQSARPQGRWI 208 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + S + I + K T L Y Y +F Q + ++ + Sbjct: 209 NPVLTFDRSTSKEAVIGAIDGMKKLNGDTCIGEALDYFYRNMFTSQ------AGQRSDVE 262 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--------- 339 + ++ MTDG+ + + + A +YAIGI + C Sbjct: 263 QRVIVMTDGKRNCP---AEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEI 319 Query: 340 AS---PNSFYLVENPHSMYDAFSHIG 362 AS + + N + IG Sbjct: 320 ASKPADKYVFEINNFDQLI--LKRIG 343 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 69/189 (36%), Gaps = 23/189 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S++S + + S N +G+VTFS+ Sbjct: 38 GFFDVGFILDSSGSLKSQYWKEKDFLKKLANSFGI----------SNKGSHAGVVTFSHY 87 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E L+ + + +S T L A G R + Sbjct: 88 AELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKALELFDIKNGARN-------DV 140 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFY 346 ++ +TDG+ + + + +E K++G ++A+GI + E + +P + + Sbjct: 141 PNLLFLLTDGK---QEPEMPLTHISDEIKQKGIQLFAVGIGAGANKTELEKIVGNPENVF 197 Query: 347 LVENPHSMY 355 +V++ + Sbjct: 198 MVDDFDKLL 206 >gi|156377912|ref|XP_001630889.1| predicted protein [Nematostella vectensis] gi|156217919|gb|EDO38826.1| predicted protein [Nematostella vectensis] Length = 372 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 106/326 (32%), Gaps = 49/326 (15%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 ++G+ + KK K R+ V + + P G Sbjct: 63 SDGHLIDWKKWKESMHRTFTSTVDTPRGRDGQGVARPVVTGRRVASEHAPT----PPPRG 118 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV-----------NSQTDARLDMM 175 A P + C + W ++ H+ +S V +++ LD+ Sbjct: 119 TVAPASLVLDPPPRPCN-VDW--STVHVSSSTPASQGVLIGFPVVGFPMDAKCKRVLDVG 175 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--- 232 IV+D S S+ + ++ ++ E+++ P ++V GLV ++ + Sbjct: 176 IVVDSSASVH------RENFFKVKEFLDKLVSELEIGPSKSHV---GLVRYNQVADTLWD 226 Query: 233 -FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L+ I+ + T + LK IF G + +++ Sbjct: 227 FNGAENNNLKSLKDAIEKIEYLPGGTRTDLALKKVNEDIFSPMG------GARNDVPQVL 280 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAK----------KRGAIVYAIGIRVIRSHEFLRACAS 341 V +TDG+ E S+ + + + + A+G+ + L A Sbjct: 281 VVITDGKTNQRSEPYSSVLQPLKVRILYPRDVTEQDKDVKIVAVGVGHSIGKKELDIIAL 340 Query: 342 PN--SFYLVENPHSMYDAFSHIGKDI 365 + + +++E + + I Sbjct: 341 GDASNVFMLETFDDLVRRLNAIMNSF 366 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + M Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKMDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF--------- 335 ++++ +TDG N + + + A R +Y +G+ + Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + I K Sbjct: 245 MDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDK 281 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 59/165 (35%), Gaps = 26/165 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ LD+S SME S +++++ + + + GLV F + Sbjct: 88 DIVLALDLSGSMEREDFSLNGQTVSRLAAVQGVAADFVRG-RTGD------RVGLVVFGD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + V+ + R I L G +T GL A ++ + Sbjct: 141 RAYVAAPQTHDVASVARLIDGLQIGVSGKATAIADGLGLAIRRLRERDAKS--------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++I+ ++DG+ T + A+ G VY I + Sbjct: 192 --RVILLLSDGQ--DTTGMVDPVAAAQTARDLGMRVYTIALGPAD 232 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 80/216 (37%), Gaps = 19/216 (8%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 T++ ++ ++++LD S SM++ S +TKI+ A K++ ++ + Sbjct: 23 TGGTAAATSSADEPVPGKLLLMLDASGSMKAKDPSGLTKIEAAKKALTGVVGALP----- 77 Query: 216 NNVVQSGLVTFSNKIE---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + Q GL + ++ + + L I L K ++T T A + Sbjct: 78 -DTAQVGLRVYGATVDGKGKPTPAACADTQLIHPIAALDKTKLTT--TIAAIKALGETPI 134 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVI 330 + + + K+ IV ++DGE + + G + +G V Sbjct: 135 AHSLTEALKDLGTSGKRNIVLVSDGEESCVPD---PCPIVKKLTAAGVDLQIDTVGFGVN 191 Query: 331 RSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGK 363 C A ++Y ++ ++ + + + + Sbjct: 192 AKARTQLQCIADAGKGTYYDAKDAGALATSLNKLSQ 227 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/246 (12%), Positives = 78/246 (31%), Gaps = 41/246 (16%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 G+ +++S ++ L + +D + + + Q+ ++ + N K Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQLPDDPDVTNGK--------------- 45 Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 ++ N + +T + + + A + + L Sbjct: 46 -------VKAAVTANLALNDPDATDIQVTSGGATRSVCVDAKANVDMTL----------- 87 Query: 151 SRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 ++ ++V + +++++VLD + SM + ID A + ++ + Sbjct: 88 -TKVIGIGDTTVTAEACAGYNDIELVLVLDSTGSM------KGSPIDSAKDAARDLVNLI 140 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + + GLV F K+ + + T +T LK Y++ Sbjct: 141 MPASTSSTRSKIGLVPFQGKVRIDGSDPVTAERNPDGVGPGCRNADGTLNTGKLKVEYSR 200 Query: 270 IFDMQG 275 Sbjct: 201 TATSTN 206 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 72/215 (33%), Gaps = 30/215 (13%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRS 183 Y PL P+ + P + +++ +LDVS S Sbjct: 175 AYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGS 234 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M +S K+ + ++ + ++ V+ VV +G + L Sbjct: 235 M-----ASTDKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAG----VVLDGASGNDTQTL 285 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 ++ LS G STN G+ AY ++H N ++ TDG+ N+ Sbjct: 286 TYALEQLSA-GGSTNGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGV 335 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + + K G + +G + ++ L Sbjct: 336 TDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 370 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 84/226 (37%), Gaps = 29/226 (12%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 I ++ R + M + S++ + L++ ++LD S SM S + ++ K+ Sbjct: 16 ITQGSSQRQLSMSV-SAIPNSVDRHVPLNLCLILDHSGSM------SGSPLETVKKAAGE 68 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPG 262 +++ + + +V F ++ + + ++++I L G T+ G Sbjct: 69 LIDRLNPGD------RISVVVFDHRAKVLIPNQDIDDPESIKKQINRLRTSG-GTSIDEG 121 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK ++ + R +TDGEN ++ + L A + Sbjct: 122 LKLGIEELGKGKVERISQA----------FLLTDGENEH-GDNNRCLKLAKLATDYNLTL 170 Query: 323 YAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIV 366 ++G + + L A + +E P + D FS + + Sbjct: 171 NSLGFGNDWNQDILEKIADEGGGTLAYIEYPEQVIDEFSRLFNRMQ 216 >gi|116751034|ref|YP_847721.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116700098|gb|ABK19286.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 479 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 84/205 (40%), Gaps = 28/205 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 N + LDM++V+D S SM + K+ A +++ +L + + Sbjct: 83 GGNVEARRELDMVVVMDRSGSM-----ADAGKLTHARQAVLNLLSRLSETD------RFA 131 Query: 223 LVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV++S+ ++ L + L+R ++ + G +TN GL+ +Q+ ++Q + Sbjct: 132 LVSYSDHVQRHGGLLPITPANRATLERIVRGIQP-GGATNLGGGLQEGISQLAELQQNGR 190 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ ++DG D +L + A +RG V +G+ + + + Sbjct: 191 LSR---------LILISDGLANRGVTDPSALGTMASVAAERGYAVSTVGVGLDFNEHLMT 241 Query: 338 ACAS--PNSFYLVENPHSMYDAFSH 360 + A ++ +E+ + F Sbjct: 242 SIADKGAGNYTFMESASAFAQVFDK 266 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 75/209 (35%), Gaps = 30/209 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 RLD+ IV+D S S+ S + + + N + I N V+ GL +FS Sbjct: 1398 GRLDIQIVIDTSGSLTSAPNKD----QVLMNFTNNLANMYDTI----NQVKIGLTSFSES 1449 Query: 230 IEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ + LQ + ++ G TN T G++ A N + Sbjct: 1450 SVLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDMDTSDA-----------VD 1498 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASPNS 344 +++ +TDG + ++AK G + A+G + +L N Sbjct: 1499 DVMILITDG--FQSTNTTLMFQMIDQAKADGVRLIALGFFGDFAFYSPNLYLMT----NE 1552 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 Y N + + I + I + + D Sbjct: 1553 VYHAANYAELLAIDNTIFETICSDGVIPD 1581 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 72/215 (33%), Gaps = 30/215 (13%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRS 183 Y PL P+ + P + +++ +LDVS S Sbjct: 175 AYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGS 234 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M +S K+ + ++ + ++ V+ VV +G + L Sbjct: 235 M-----ASADKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAG----VVLDGASGNDTQTL 285 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 ++ LS G STN G+ AY ++H N ++ TDG+ N+ Sbjct: 286 TYALEQLSA-GGSTNGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGV 335 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + + K G + +G + ++ L Sbjct: 336 TDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 370 >gi|66805723|ref|XP_636583.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] gi|60464969|gb|EAL63080.1| hypothetical protein DDB_G0288697 [Dictyostelium discoideum AX4] Length = 585 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 50/269 (18%), Positives = 92/269 (34%), Gaps = 24/269 (8%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ- 167 I STS+ + G ++ + K+ L + + T +K + Sbjct: 95 SIEPSTSVPNQASDNSSGENLISTQLNKLDLNNNNNNNQQQQQQKDSLNSTEPIKKQVEL 154 Query: 168 ---TDARLDMMIVLDVSRSMESF--------FDSSITKIDMAIKSINAMLEEVKLIPDVN 216 +LD+ IVLD+S SM+S + +++I++A + Sbjct: 155 YESEIKQLDV-IVLDLSGSMKSAAFKGSRVPGELEMSRIELAQT---LFQTFTDKAISLE 210 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V GLVTF +IE F L ++ + T ++ A + + Sbjct: 211 VPVAVGLVTFGERIELTFDLTRNFDSFSTELGEVVANQCKTRLFEAIQLAAETLVKFKES 270 Query: 277 RQHCNTEDANY---KKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + VF +TDGE+ S + Y KK I+ +I I + Sbjct: 271 CDAAATGGLKLSSNPMLRVFCLTDGEDNSNFDPYPVYQY---MKKHNIILDSIPIGLDGR 327 Query: 333 HEFLRAC-ASPNSFYLVENPHSMYDAFSH 360 A+ S ++ ++ + F Sbjct: 328 ERLSSFSQATGGSCFIADSSLEGVELFER 356 >gi|225543438|ref|NP_001139375.1| complement C2 isoform 2 preproprotein [Homo sapiens] Length = 620 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 92 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 144 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 145 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 201 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 202 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 261 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 262 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 319 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|194374835|dbj|BAG62532.1| unnamed protein product [Homo sapiens] Length = 620 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 92 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 144 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 145 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 201 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 202 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 261 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 262 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 319 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ FSN+I+ Sbjct: 48 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFS------IIGFSNRIKVW 95 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 96 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 147 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 148 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 204 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 205 ENCGLTRRVHEEEDAGSQLIGFYDEI 230 >gi|34628|emb|CAA28169.1| unnamed protein product [Homo sapiens] Length = 752 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|15277207|dbj|BAB63292.1| C2 [Homo sapiens] Length = 577 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|14550407|ref|NP_000054.2| complement C2 isoform 1 preproprotein [Homo sapiens] gi|3915642|sp|P06681|CO2_HUMAN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|298124|gb|AAB97607.1| complement component C2 [Homo sapiens] gi|2347131|gb|AAB67975.1| complement component C2 [Homo sapiens] gi|28175369|gb|AAH43484.1| Complement component 2 [Homo sapiens] gi|33346923|gb|AAQ15273.1| complement component 2 [Homo sapiens] gi|55961814|emb|CAI17451.1| complement component 2 [Homo sapiens] gi|57209923|emb|CAI41858.1| complement component 2 [Homo sapiens] gi|119623954|gb|EAX03549.1| complement component 2, isoform CRA_b [Homo sapiens] gi|123857990|emb|CAM25860.1| complement component 2 [Homo sapiens] gi|168983782|emb|CAQ06833.1| complement component 2 [Homo sapiens] gi|168984416|emb|CAQ09272.1| complement component 2 [Homo sapiens] gi|168985077|emb|CAQ07481.1| complement component 2 [Homo sapiens] gi|168985955|emb|CAQ07111.1| complement component 2 [Homo sapiens] gi|189069137|dbj|BAG35475.1| unnamed protein product [Homo sapiens] Length = 752 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|302632554|ref|NP_001181863.1| integrin alpha-M [Pan troglodytes] Length = 1153 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 82/210 (39%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ + ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSINPY---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG+ +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFHSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 79/207 (38%), Gaps = 26/207 (12%) Query: 166 SQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + A D+++ +D+S SM+S F T+ + + + + V+ + + + L Sbjct: 88 TIETAARDVVLAIDISGSMDSRDFTAPDGTR----KQRLAGVRDVVRGFVEGRDGDRMAL 143 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + L + + + + G+ + + D G+ Sbjct: 144 IVFGSAAYLQAPLTADLDAILALLDR---------TQVGMAGPHTALGDSIGLAIRTFES 194 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFL 336 +++++ ++DG + +++ D + A R ++ +G+ + + L Sbjct: 195 SEIDQRLLILLSDGSDTASRMDP--VNAAEIAAGRDVEIFTVGVGDPDATGENRVDLDTL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHI 361 RA A + +++ + ++ + I Sbjct: 253 RAIADRTGGAYFFAADEAALTAVYESI 279 >gi|33340728|gb|AAQ14925.1| Mac-1 alpha subunit [Pan troglodytes] Length = 1144 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 82/210 (39%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ + ++ + ++ L++ K + + + + Sbjct: 134 GCPQEDSDIAFLIDGSGSINPY---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 190 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 191 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 238 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG+ +S + L AS Sbjct: 239 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFHSEKSRQELNTIAS 296 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 297 KPPRDHVFQVNN----FEALKTIQNQLREK 322 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 70/200 (35%), Gaps = 26/200 (13%) Query: 173 DMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+M+ +D+S SME F KID ++ +L E D + + GL+ F + Sbjct: 105 DLMVAVDLSGSMEAQDFTTEQGEKIDRL-TAVKQVLTEFSQRRDGD---RLGLIVFGSAA 160 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + L + + + A I D G+ ++ Sbjct: 161 YLQAPF---TADKDTWLTLLQE------TEIAMAGASTSIGDAIGLSISTFEHSDTDNRV 211 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA--S 341 ++ +TDG T + A R +Y I I + + L+ + + Sbjct: 212 LIVLTDG--NDTGSRVPPVDAARVANARDVKIYTIAIGDPETIGEDAMDVDTLKQVSDIT 269 Query: 342 PNSFYLVENPHSMYDAFSHI 361 +++ + ++ A+ I Sbjct: 270 GGAYFEALDRQALERAYLDI 289 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 75/222 (33%), Gaps = 41/222 (18%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F+ + G M+ + ++ + G+ I++ H ++ + + +DR+++ AA N Sbjct: 25 RFWADTSGSMSYVALAGSLVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAA----NLS 80 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N + + + T D+ + Sbjct: 81 NTRDPQTVVEDYFRAMKLE------------------------DTLGDVQTGDSLGAKRV 116 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 A I F I + + ++ + A L++ +VLDVS SM+ Sbjct: 117 RAEGNGSINSHFLGLI----GVDQLDVYGAATAE---NATAPLEISLVLDVSGSMQG--- 166 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 KI ++ A ++ V N+ V L+ ++ + Sbjct: 167 ---QKIRDLKEAAKAFVDAVLGEGGDNSRVTVSLIPYNATVN 205 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAF 358 +++ + C +AK++ ++ IG+ + +R CAS S Y + + + D F Sbjct: 541 DASQANTNLATICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYNVSSNQLVDTF 600 Query: 359 SHIGKDIVTKRIWY 372 I +V R+ Sbjct: 601 RSISDVVVELRLTE 614 >gi|62896633|dbj|BAD96257.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 82/250 (32%), Gaps = 32/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 T + K+ Q L++ ++LD S+S+ Sbjct: 215 DVAPALGTSFSHMLGATNLTQKTKESLGR----KIQIQRSGHLNLYLLLDCSQSVSEN-- 268 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S + M++ + V ++TF+++ + L + I Sbjct: 269 ----DFLIFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVIS 321 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL- 300 L + G TN+ L Y + + + + I+ +TDG++ Sbjct: 322 SLENANYKDHENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNM 381 Query: 301 -----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 + + + + N+ + +YAIG+ + +E +++++ Sbjct: 382 GGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQD 441 Query: 351 PHSMYDAFSH 360 +++ F H Sbjct: 442 TKALHQVFEH 451 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/222 (15%), Positives = 73/222 (32%), Gaps = 44/222 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLDVS SM + + +++++ A +++ ++ GLV +++ Sbjct: 93 ILFVLDVSPSMAAKDIAGMSRLEAAKQAVRVIV-------PEAGGTAFGLVALASEAALM 145 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 ++ L + S G+ + H A KK IV Sbjct: 146 VPPTLDREAFFARLNSLQAGELGDGSAIGM--------GVSTAAYHL-ISSAAPKKSIVL 196 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VIRSHEFLRACAS- 341 +TDGEN + + A + G +Y +G+ + S Sbjct: 197 ITDGENNAGSVHPGTA--AQLAFENGITLYVLGVGTRGSVPLEYVDPATGKTYSGYLDSR 254 Query: 342 -------------PNSFYLVENPHSMYDAFSHI-GKDIVTKR 369 ++ VE+ + A S + G++ + Sbjct: 255 FDESPLQEIALTAGGRYFGVESMGELTAAVSAVTGREQTVQS 296 >gi|47218057|emb|CAG09929.1| unnamed protein product [Tetraodon nigroviridis] Length = 441 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 76/201 (37%), Gaps = 30/201 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ V+D S S+ ++ + + V N + GLV + Sbjct: 6 CKERPMELVFVIDSSESVGP------ENFEIIKDFV---IRLVDRTTVGRNATRIGLVLY 56 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S ++ F L V+ +++ I+ + G T + ++ A + F Sbjct: 57 SLEVRLEFNLARYVTKQDIRQAIRKIPYMGEGTYTGTAIRKATQEAFLNA---------R 107 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRAC-- 339 K+ + +TDG+ +E + EA +YA+GI EFL+ Sbjct: 108 RGVSKVAIVITDGQ-TDKREPVKLDLAVREAHAANIEMYALGIVNASDPTQAEFLQELNL 166 Query: 340 -AS---PNSFYLVENPHSMYD 356 AS YL+++ +++ Sbjct: 167 IASDPDSEHMYLIDDFNTLTA 187 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 69/189 (36%), Gaps = 23/189 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ +LD S S++S + + S N +G+VTFS+ Sbjct: 11 GFFDVGFILDSSGSLKSQYWKEKDFLKKLANSFGI----------SNKGSHAGVVTFSHY 60 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E L+ + + +S T L A G R + Sbjct: 61 AELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKALELFDIKNGARN-------DV 113 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFY 346 ++ +TDG+ + + + +E K++G ++A+GI + E + +P + + Sbjct: 114 PNLLFLLTDGK---QEPEMPLTHISDEIKQKGIQLFAVGIGAGANKTELEKIVGNPENVF 170 Query: 347 LVENPHSMY 355 +V++ + Sbjct: 171 MVDDFDKLL 179 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 74/195 (37%), Gaps = 26/195 (13%) Query: 145 IPWYTNSRHIVMPITSS--VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW T +R I + I + T +++ ++DVS SM K+ + S+ Sbjct: 162 APWRTANRIIKVAIKADDPTITKQSTLPPANLVFLVDVSGSMSDN-----DKLPLVKSSL 216 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNST 260 + ++++ ++ +VT++ + + G + I L G STN Sbjct: 217 KMLTKQLRPQDTIS------IVTYAGRTQVTLPATRGSDTDKILAAIDSLDASG-STNGE 269 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRG 319 +K AY Q + H + N I+ MTDG+ N+ + + L + G Sbjct: 270 AAIKLAYQQ------AKIHYKKDGINR---ILMMTDGDFNVGVSDVDEMLDIIRRERDSG 320 Query: 320 AIVYAIGIRVIRSHE 334 + G ++ Sbjct: 321 VSLSTFGFGEGNLND 335 >gi|320010752|gb|ADW05602.1| Protein of unknown function DUF3520 [Streptomyces flavogriseus ATCC 33331] Length = 528 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/208 (17%), Positives = 79/208 (37%), Gaps = 29/208 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + N+ + V+D+S SM + ++D+A S+ + +E++ Sbjct: 152 VRVGLATRAASNTGERPPAALTFVVDISGSM-----AEPGRLDLAKTSLGILADELRDDD 206 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 V+ LVTFS + E + + L+ I+ + STN G++ Y Sbjct: 207 SVS------LVTFSEEAETRLPMTRLRGNRTKLRDAIEEMEP-ADSTNVAAGVERGY--- 256 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIR 328 + + N +V ++D N E L +A++ G ++ +G+ Sbjct: 257 ---EEAVEGHRKGATNR---VVLLSDALANTGETEADAILERVGDARQEYGITLFGVGVG 310 Query: 329 VIRSHEF---LRACASPNSFYLVENPHS 353 E L N+ Y+ + + Sbjct: 311 SDYGDELMERLTNKGDGNTTYIADEAQA 338 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 166 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 214 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 215 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 267 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 268 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPEAFLNE 322 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 323 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 356 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 155 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 203 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 204 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 256 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 257 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPEAFLNE 311 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 312 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 345 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 75/222 (33%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKNVASEFIAD---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + ++ I T GL A +++ D + Sbjct: 140 TQCPMTTDHVSLINLLQSVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------ 332 K+++ +TDG N D + A+ G VY IGI + Sbjct: 193 ----KVVILLTDGSNNM--GDISPMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQ 246 Query: 333 ---------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + +FY N + + I + Sbjct: 247 YVNIPVEIDSKTLKDIAATTEGNFYRATNNRQLKQIYKDIDQ 288 >gi|34810098|pdb|1QCY|A Chain A, The Crystal Structure Of The I-Domain Of Human Integrin Alpha1beta1 Length = 193 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 79/213 (37%), Gaps = 35/213 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + D +N +L+ + + P Q G+V + Sbjct: 1 TQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLKRMDIGP---KQTQVGIVQYGEN 49 Query: 230 IEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L S + + + + G T + G A + F Sbjct: 50 VTHEFNLNKYSSTEEVLVAAKKIVQRGGAQTMTALGTDTARKEAFT------EARGARRG 103 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH-------EFLR 337 KK++V +TDGE+ ++ + C ++I I R + E ++ Sbjct: 104 VKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIK 160 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 + AS F+ V + ++ +G+ I Sbjct: 161 SIASEPTEKHFFNVSDELALVTIVKTLGERIFA 193 >gi|327260894|ref|XP_003215268.1| PREDICTED: collagen alpha-3(VI) chain-like [Anolis carolinensis] Length = 3053 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 78/209 (37%), Gaps = 24/209 (11%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 Q + D++ ++D S S+ + + + +++++ + N + Sbjct: 23 AAAQQQNNGAADIIFLVDSSWSIGK------EHFQLVREFLYDVVKQLDV---GGNDFRF 73 Query: 222 GLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQ 278 GLV FS F L + + I ++ G T + GL++ N + + G R Sbjct: 74 GLVQFSGNPHTEFQLNTYHTLQDVLSHISHMPYMGGGTKTGQGLEFLIRNHLTKVSGSRV 133 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 T +++V +TDG + + K ++AIG++ E Sbjct: 134 SDGTP-----QVVVVLTDG-----RSQDDVTLPSSVLKSADVNMFAIGVQDAVEGELKEI 183 Query: 339 CASP--NSFYLVENPHSMYDAFSHIGKDI 365 + P + +EN +++ + + I Sbjct: 184 ASEPLEIHLFNLENFTALHGIVGDLVESI 212 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 33/264 (12%), Positives = 92/264 (34%), Gaps = 22/264 (8%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 N + +G + + S ++ + ++ + ++P F + + Sbjct: 1558 SNVILSSGIKSLGVGAKNADSAELQRITNDQRTAFIVREFAELPTIEKRFFASFEAPQE- 1616 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P ++ + + D++ +LD S + + + ++ I + Sbjct: 1617 --PPLEETEIPLDSKKQADIVFLLDSS------INFGRDNFQEVVDFVYGII---DAIYE 1665 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++ GLV +++ + + F L+ + +K ++ G T +T A I Sbjct: 1666 EGDSIKVGLVQYNSDVSDEFFLKDFTDKEQILEAVKRIAYKGGRTANTG---TAIKHIKA 1722 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +++ + D +I +T G +L + G V+A+G+R I Sbjct: 1723 KHFVKEAGSRVDQKVPQIAFIITGGRPEDD-GQTAALALAQQ----GVKVFAVGVRNIDL 1777 Query: 333 HEFLRACASPNSFYLVENPHSMYD 356 + + + + + + + Sbjct: 1778 GDIAKLSSDSTTGFRAATAQELSE 1801 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 72/223 (32%), Gaps = 21/223 (9%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + T ++ + + + D++ ++D S D++ + + + Sbjct: 1002 GEIQPEIVNLLKTIEIRESVCDEIQRKDVVFLIDGS-------DATRSSFPELKSFVQRV 1054 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ + + P V+ +V +SN F L + + ++ ++ G +T Sbjct: 1055 VDSLDVGP---GKVRVAVVQYSNDANTEFNLNEYSDKASVITAVQRMTAMGGYAVNTGA- 1110 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A N + R+ + + ++ +T + + E K RGA+ Sbjct: 1111 --ALNYLISNVFTREAGSRVQEGVPQFVILLT-----AERSRDDVRRPALELKTRGAVPL 1163 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 IG + P V + I + + Sbjct: 1164 GIGFGNADITQLQTISFVPEFAVFVSGVSELGRIQQLIAERVT 1206 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 85/219 (38%), Gaps = 24/219 (10%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 R + +T + V D++ +LD S ++ + + + + ++ + + Sbjct: 620 RTLSGSVTETTDVLINKR---DIIFLLDGSANVGNANFPYVRDL------VVNLVNSLDV 670 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKYAYN 268 D ++ GLV FS + F L ++ + + L G S N+ L + ++ Sbjct: 671 GIDD---IRIGLVQFSENPKTEFFLNSFLTKAEVLSHLSGLKLLGGSVLNTGSALDFVFS 727 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + + D N +++VF+T G ++ + ++ + + ++IG+R Sbjct: 728 NHFT----KAGGSRIDENVPQVLVFLTAGPSVDSFQEAARALA-----RARVLTFSIGVR 778 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 E + +P Y S+ + + + + T Sbjct: 779 NADPAELQQISFNPQIVYFRNEFRSLANLPQEMIRPLTT 817 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 83/252 (32%), Gaps = 23/252 (9%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN-SRHIVMP 157 + F I + + ++++ + + + + + + +P ++ ++ Sbjct: 366 QAGVFSFSIG-VKNADNVELQQIATDGSFVFTILDTRNLGDLEDLLLPNIVGVAQRFILL 424 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T +V R D++ ++D + ++ + I ++ +++ P Sbjct: 425 DTPTVFTEVVEVNRKDVVFLIDGTTALG------TASFNAIRDFIGKIISRLEIGPHA-- 476 Query: 218 VVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQ 274 +Q + + + + L + ++ + G + N+ L+ N F Sbjct: 477 -IQVAVAQYGDTVRPEIYLNTYPNKKDISTNLRKIRPAGGAVLNTGAALRTVKNSFFTSS 535 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + +V +T G K + E K+ G +V AIG + E Sbjct: 536 -VGSRIEEGVLPV---LVLVTGG-----KSRDDVVQPVQELKRGGIMVLAIGAQNADRAE 586 Query: 335 FLRACASPNSFY 346 + P + Sbjct: 587 LEQIAFEPGLVF 598 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/202 (13%), Positives = 74/202 (36%), Gaps = 21/202 (10%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 M V + D++ ++D S ++ + ++E + + Sbjct: 220 MAGEKGVVKDITAQESADIIFLIDGSNNIGPVI------FATVRDFVANVIERLSV---G 270 Query: 216 NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFD 272 + ++ G+VT+S++ F L + + +K LS G N L++ F+ Sbjct: 271 SESIRVGVVTYSDQSRTAFFLNSHTRKADVLEAVKALSFPGGEEANIGEALEFVVQNHFN 330 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R + + + +++V + S++ K+ G ++IG++ + Sbjct: 331 ----RSGGSRIEEHVPQVLVLI-----SSSESSDDIREGVLAMKQAGVFSFSIGVKNADN 381 Query: 333 HEFLRACASPNSFYLVENPHSM 354 E + + + + + ++ Sbjct: 382 VELQQIATDGSFVFTILDTRNL 403 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 73/242 (30%), Gaps = 24/242 (9%) Query: 118 IVVVPQNEGYSISAISRYKIPL--KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + + + Y + +P + ++ I I + D++ Sbjct: 1378 MQTIALSPQYVFQVSTYQDLPTLEQQLISPVTTLTTQQIQGLIADTSSPTDIDSEAKDIV 1437 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S + I I + ++ + + V+ G+V FSN + F Sbjct: 1438 FLIDSS----DNVGADFAHIRDFI------IRIIQQLDVRSRKVRIGVVQFSNNVFPEFF 1487 Query: 236 LEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L+ + + + I+ + G + N L Y F + + + +V Sbjct: 1488 LKTHPTKNAVLQAIRRMRPRGGTPLNVGKALDYVVKNHFIKSAGSRREDG----VPQHLV 1543 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 + G + N G +G + S E R + ++V Sbjct: 1544 LLLGG-----RSQDDVGRPSNVILSSGIKSLGVGAKNADSAELQRITNDQRTAFIVREFA 1598 Query: 353 SM 354 + Sbjct: 1599 EL 1600 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 46/271 (16%), Positives = 91/271 (33%), Gaps = 48/271 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + + ++ + Y + + F I + P T++ NS+ + +D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIEG-IDIMMA 93 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 94 IDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESFTQCPLT 145 Query: 238 WGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + L + + T G+ A ++ D + K+I+ +T Sbjct: 146 VDHAVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLT 194 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSH 333 DG N K D L AK G VY IG+ I Sbjct: 195 DGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEK 252 Query: 334 EFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + A + +++ + + + + I K Sbjct: 253 TLTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 65/196 (33%), Gaps = 35/196 (17%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSSITK 194 L F +I I SS +D+M+ LD S SM+ +T+ Sbjct: 62 LIFLVIAAARPQLYNISRDIRSS---------GVDIMLCLDTSGSMQALDFELDGKPVTR 112 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--K 252 + + K + ++E + GLV F + L L + + Sbjct: 113 LTVVKKVVADFIKE-------RETDRIGLVVFGQEAFTQSPLTMDKGLLLSLVDKMEIGM 165 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G ST + A ++ D++ KI++ +TDG + D Sbjct: 166 AGDSTAIGNAIAVAGKRLKDLKAKS-----------KIMIILTDGRSN--TGDITPEEAA 212 Query: 313 NEAKKRGAIVYAIGIR 328 A G +Y IG+ Sbjct: 213 GAAAALGIKIYTIGVG 228 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 72/235 (30%), Gaps = 50/235 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ + + ++ D+ V+ G+V+F+ Sbjct: 95 LLLAMDVSGSMRA-TDVEPNRLVASQVAAKNFVK------DLPRHVRVGVVSFAGTAAVV 147 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 + I + T G+ + +F G+ T + K Sbjct: 148 QAPTHSRDDVFAAIDRF-QLQRGTAIGSGIVLSLATLFPEAGIDLSDITGERRMPKGIGE 206 Query: 290 ------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ +TDG+ + + + A RG VY +G Sbjct: 207 SDKQDDFKPVAPGSYGSAAVILLTDGQRTTGPDPMDAAKM---AADRGVKVYTVGFGTTS 263 Query: 332 S---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 L+ A + ++ + + + + +V +R Sbjct: 264 GEIIGFEGWSMRVRLDEATLKNIANLTQAEYFYAGSATDLQKVYDTLSSRLVFER 318 >gi|260800507|ref|XP_002595171.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] gi|229280414|gb|EEN51182.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] Length = 352 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/171 (23%), Positives = 67/171 (39%), Gaps = 22/171 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ +LD S S+ S+ +M + + + + + D Q GLV +S++ E Sbjct: 3 LDLVFILDSSDSV------SVQDFEMMRQFLKKTVGDFNIGYDA---TQIGLVQYSDEAE 53 Query: 232 EFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S L+ I+ + G +TN+ L Y +F N + K Sbjct: 54 TIFALDSFSSPTSLRDAIETIQYTGGATNTGNALDYMVQYMFAS------RNGARQDSTK 107 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I + +T G +K I YAIGI ++ L A Sbjct: 108 IAIVLTGG-----ASSDDIKAAAQRMRKSSVITYAIGIGSELDYDQLDYIA 153 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 22/172 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S+ + ++D++ VLD + S+ T + + M+ + L P Sbjct: 188 PSTAGPSPLCSVQVDLVFVLDGTGSVG------ATNFEKMKTFVQKMISDFDLGP---EA 238 Query: 219 VQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + G+V +SN+ L+ LQ + ++ G T + + Y Sbjct: 239 TRIGVVVYSNRASLEISLDAYDDQEALQDAVADIAYPGGYTLTGAAIDY------TTTFA 292 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N +K+ V +TDG + +K I YA+GI Sbjct: 293 FSTRNGARDGVRKVAVILTDGVSYDDPA-----EPAQSMRKAAIITYAVGIG 339 >gi|221505369|gb|EEE31023.1| microneme protein, putative [Toxoplasma gondii VEG] Length = 723 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 68/173 (39%), Gaps = 22/173 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S++ +LD+ ++D S S+ I + + ++ L + + P+ V Sbjct: 17 SAIGAAEGCTNQLDICFLIDSSGSIG------IQNFRLVKQFLHTFLMVLPIGPEE---V 67 Query: 220 QSGLVTFSNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + +VT+S + + + + STN++ GLK +F Sbjct: 68 NNAVVTYSTDVHLQWDLQSPNAVDKQLAAHAVLEMPYKKGSTNTSDGLKACKQILFTGS- 126 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + K+++ MTDGE+ S D +++ E ++ G IV + + Sbjct: 127 -----RPGREHVPKLVIGMTDGESDS---DFRTVRAAKEIRELGGIVTVLAVG 171 >gi|1923217|gb|AAB63303.1| micronemal protein MIC2 [Toxoplasma gondii] gi|221484107|gb|EEE22411.1| microneme protein, putative [Toxoplasma gondii GT1] Length = 769 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 68/173 (39%), Gaps = 22/173 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S++ +LD+ ++D S S+ I + + ++ L + + P+ V Sbjct: 63 SAIGAAEGCTNQLDICFLIDSSGSIG------IQNFRLVKQFLHTFLMVLPIGPEE---V 113 Query: 220 QSGLVTFSNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + +VT+S + + + + STN++ GLK +F Sbjct: 114 NNAVVTYSTDVHLQWDLQSPNAVDKQLAAHAVLEMPYKKGSTNTSDGLKACKQILFTGS- 172 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + K+++ MTDGE+ S D +++ E ++ G IV + + Sbjct: 173 -----RPGREHVPKLVIGMTDGESDS---DFRTVRAAKEIRELGGIVTVLAVG 217 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 84/223 (37%), Gaps = 37/223 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +++D ++V+D S SM+ +S +I +++ ++ + D + G+V Sbjct: 31 AASQGSKIDAVLVMDASNSMK---NSDPDRIGS--EAMKMFIDMLSTTGD-----KVGVV 80 Query: 225 TFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +++++I+ L E + L+ I L + G T+ + GL A + +Q Sbjct: 81 SYTDRIQREKALLEIQSEADKTALKEFIDQLDR-GPYTDISVGLDEAVKVL------KQG 133 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC--------NEAKKRGAIVYAIGIRVIR 331 A IV + DG N ++ EAK G +Y IG+ Sbjct: 134 MEPAHAPM---IVVLADGNNDLDPNTGKTSQEASDHLNQAVQEAKGSGIPIYTIGLNADG 190 Query: 332 --SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E L A + + + + S I +I Sbjct: 191 KLNKEALAELANQTGGKSFTTSSADDLPQILSEIFASHQQLKI 233 >gi|194211467|ref|XP_001916467.1| PREDICTED: similar to collagen, type VI, alpha 3 [Equus caballus] Length = 3165 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 75/210 (35%), Gaps = 24/210 (11%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + + + A D++ ++D S S+ + + + +++ + + + Sbjct: 21 SFTRAQQQADVKNGAAADIIFLVDSSWSIGK------EHFQLVREFLYDVIKSLAVGEND 74 Query: 216 NNVVQSGLVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFD 272 LV F+ FLL + + I +S G S + GL+Y + Sbjct: 75 ---FHFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSNQTGKGLEYVMQTHLTQ 131 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G R ++IV +TDG ED +L K V+AIG+ Sbjct: 132 AAGSRA-----SDGVPQVIVVLTDGH----SEDGLALPTAE-LKSADVNVFAIGVEDADE 181 Query: 333 HEFLRACASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 182 GALKEIASEPLNMHVFNLENFTSLHDIVGN 211 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 113/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG I + + +G V+ Sbjct: 1519 LEHVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDISR-----YSQVISSSGIVSLGVG 1573 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P I + + P + + Sbjct: 1574 DRNIDRTELQTITNDPRLVFTVREFRELPNIEEKIITSFGPSGVTPAPPGVDTPSPSRPE 1633 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1634 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTLYEGGDSIQVGLVQYNS 1684 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L+ + + I + G N+ G+++ + + + + D Sbjct: 1685 DPTDEFFLKDFSTKQEIIDAINKVVYKGGRHANTKVGIEH----LRLNHFVPEAGSRLDQ 1740 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ ++ +L +RG V+A+G+R I S E + ++ + Sbjct: 1741 RVPQIAFVITGGKSVEDAQEASALT------QRGVKVFAVGVRNIDSEEVGKIASNSATA 1794 Query: 346 YLVENPHSMYD 356 + V N + + Sbjct: 1795 FRVGNVQELSE 1805 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+ D++ +LD S ++ T + ++ + + D Sbjct: 627 SGTTEVHVNKR---DIIFLLDGSSNVGE------TNFPYVRDFVMNIVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSN-KIEEFFLLEWGVS-HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + L ++ L + G N+ L Y + Sbjct: 675 IRVGLVQFSDTPVTEFSLNTYQTKAELLAHLRQLQLQGGSGLNTGSALSY----VHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + ++++ +T G+ + L N + G + + +G R E Sbjct: 731 TEAGGSRIRERVPQLLLLLTAGQ-----SEDSYLQAANALARAGILTFCVGARQANKAEL 785 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 EQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 817 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/268 (11%), Positives = 86/268 (32%), Gaps = 33/268 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 173 AIGVEDADEGALKEIASEPLNMHVFNLENFTSLHDIVGNLVSCVHSSVTPGRAGDTGTLK 232 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + +LE + + ++ G+V Sbjct: 233 DITAQDSADIIFLIDGSN------NTGSVNFAVIRDFLVNLLERLSI---GTQQIRVGVV 283 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +S++ F L+ + Q + + G + + A + + + R + + Sbjct: 284 QYSDEPRTMFSLDTYSTKAQ-VLDAVKALGFTGGELANVGLALDFVVENHFTRAGGSRVE 342 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +++V ++ G + D K+ + +G + E + + N Sbjct: 343 EGVPQVLVLISAGPSSDEIRDGVVA-----LKQASVFSFGLGAQAASKAELQQIATNDNM 397 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + V F G D+ + + Y Sbjct: 398 VFTVPE-------FRSFG-DLQEQLLLY 417 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 55/345 (15%), Positives = 112/345 (32%), Gaps = 43/345 (12%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNN-RKKLKGGDILCRIKN--TWNMSFRN 96 + + K + + + R QI G K + + RI+ + + Sbjct: 1284 LLNAHSSKDEVQNAVRRLRPKGGRQINIGGALEYVSKNIFKRPLGSRIEEGVPQFLVLIS 1343 Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEG----YSISAISRYKIPLKFCTFIPWYTNSR 152 + N V+D ++ + + +N IS Y + +P + Sbjct: 1344 SGKSNDEVDDSAAELKQFGVAPFTIARNADPEELVKISLSPEYVFSVSTFRELPSL--EQ 1401 Query: 153 HIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 ++ PIT+ Q D++ ++D S S+ I Sbjct: 1402 KLLTPITTLTSEQIQRILASTPYPPPAVESDAADIVFLIDSSDSVRPDG------IAHIR 1455 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST 257 I+ +++ + + P+ V+ G+V FSN++ F L+ + + I+ L G S Sbjct: 1456 DFISKIVQRLNIGPNK---VRIGVVQFSNEVFPEFYLKTYKSRTAVLDAIRRLRFKGGSP 1512 Query: 258 -NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 N+ L++ +F ED + +++F+ K Y Sbjct: 1513 LNTGKALEHVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQDDISRYSQVIS 1563 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 G + +G R I E P + V + + I Sbjct: 1564 SSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFRELPNIEEKI 1608 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 86/244 (35%), Gaps = 26/244 (10%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA---ISRYKIPLKFCTFIPWYTNSRHI 154 L+ + ++ ++ + N+ + S + + +I R + Sbjct: 368 LKQASVFSFGLGAQAASKAELQQIATNDNMVFTVPEFRSFGDLQEQLLLYIVGVAQRRIV 427 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + P T + + D++ ++D S ++ + I +++ +++ D Sbjct: 428 LQPPT--IVTQVIEVNKRDIVFLVDGSSALGQN------NFNAIRDFIARVIQRLEIGQD 479 Query: 215 VNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIF 271 ++Q + +++ + E +F + ++ + G + + L + N +F Sbjct: 480 ---LIQVAVAQYADTVRPEFYFNSYPSKREVVNAVRKMKSLEGPALYTGSALDFVRNNLF 536 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + KI+V +T G++L E K+ G + +AIG + Sbjct: 537 TSAAGYRAAEG----VPKILVLITGGKSLDGIS-----QPAQELKRNGIMAFAIGNKAAD 587 Query: 332 SHEF 335 E Sbjct: 588 KAEL 591 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 74/213 (34%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + ++A LD+ +LD S S F + + K + ++ ++ + PD + Sbjct: 2607 RRAAGSNADLDVAFILDSSESTTPF------QFNEMRKYVGYLVRQLDVSPDPKASQHFA 2660 Query: 220 QSGLVT---FSN-------KIEEFFLLE-WGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + LV + + ++ F L +G + + + + + Y Sbjct: 2661 RVALVQHAPYESVGNSSVPPVKVEFSLTDYGSKEKLVNFLSSRMMQLQGTRALGSAIDYT 2720 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2721 IENIFES--------APNPRDLKIVVLMLTGEVQKEQLEEAQRVIL-QAKCKGYFFVILG 2771 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + PN + V+ + + Sbjct: 2772 IGRKVNVKEVYSFASEPNDVFFKLVDKSTELNE 2804 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 66/165 (40%), Gaps = 28/165 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + PD V+ +V +S++ Sbjct: 1028 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD---RVRVAVVQYSDRTRP 1077 Query: 233 FFLLEWGVSHLQRK-----IKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F L S++ ++ I+ L+ G T N+ L + I +R + + Sbjct: 1078 EFYLN---SYMDQQSVVGAIRRLTLLGGPTPNTGAALNFVLRNIL----IRSAGSRIEEG 1130 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1131 VPQLLIVLT-----AERSGDDVRGPSVVLKREGAVPIGIGIGNAD 1170 >gi|197098872|ref|NP_001126081.1| complement C2 [Pongo abelii] gi|55730287|emb|CAH91866.1| hypothetical protein [Pongo abelii] Length = 752 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 81/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 G TN+ L Y + + + + I+ +TDG N+ + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 ELLNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|260814488|ref|XP_002601947.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] gi|229287250|gb|EEN57959.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] Length = 1386 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 72/197 (36%), Gaps = 28/197 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ S++ + + + A+ V + G++ +S Sbjct: 268 DLFFVLDGSGSV------SVSDFETVKQFVVAV---VSAFTIGLADTRVGVLQYSTSSSL 318 Query: 233 FFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L I L + G +T + +++A RQ+ A K Sbjct: 319 ECNLGDHPDEASFVSAINTLVYQKGGNTYTGAAMEFA----------RQNAAWRPAPVPK 368 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLV 348 I++ +TDG++ + ++ V+AIG+ E L + P + + Sbjct: 369 IMIVLTDGKSSDSVVAAAQALAADQV-----AVFAIGVGSFDHSELLEITNNKPGRVFEL 423 Query: 349 ENPHSMYDAFSHIGKDI 365 ++ + + + I + + Sbjct: 424 DDFDVLAQSINRIVRAV 440 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 75/226 (33%), Gaps = 32/226 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + P+ + D+ VLD S S+ S++ + + + A+ V Sbjct: 456 PQTTEDPMPTFPTPEPCHLTP-DLFFVLDGSGSV------SVSDFETVKQFVVAV---VS 505 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAY 267 + G++ +S L I ++ + G ST + L++A Sbjct: 506 AFTIGLAETRVGVLQYSTSSTLACNLGDHPDEASFVSAINTMTYQKGGSTYTGAALEFA- 564 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 RQ+ A +I++ +TDG++ + ++ V+AIG+ Sbjct: 565 ---------RQNAAWRPAPVSRIMIVLTDGQSHDSVVAAAQALAADQ-----VTVFAIGV 610 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E L + N V +I + + + + D Sbjct: 611 GSFDHSELLEI--TSNKLGHVFELDDFNAMAQNITQ--IVRAVCKD 652 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 32/204 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L +++ LD S SM + A +++ L + +N F + I Sbjct: 96 PLALVLALDSSGSMLEALP----QTQAAARALVGSLADGDRAHILN---------FGDSI 142 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + L + I L +G T +F G A Sbjct: 143 NVAVGMTADRNRLDQAISGLRAWGA-TRLNDA-------VFASAGALAGAEGRGA----- 189 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGIRVIRSHEFLRAC--ASPNS 344 IV +++G + S+ A + GA VYA+G+ LR A+ + Sbjct: 190 IVLLSEGPDADPSGAPLSVVDSEAALAAVVESGAPVYAVGLGPGADAALLRRLAEATGGA 249 Query: 345 FYLVENPHSMYDAFSHIGKDIVTK 368 ++ V + ++ FS I + + Sbjct: 250 YFPVADAAALPATFSDIATRLRHQ 273 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 75/216 (34%), Gaps = 39/216 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + + + ++ + +G + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKNVASEFISD-RPNDNIGLTIFAGEAFTQCPMT 145 Query: 232 EF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ I T GL A +++ D + K+ Sbjct: 146 TDHASLINMLRSVRTDIAARGLISDGTAIGMGLANAVSRLKDSKAKS-----------KV 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------------ 332 ++ +TDG N D L AK G VY IG+ + Sbjct: 195 VILLTDGSNNM--GDISPLTSAQIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPV 252 Query: 333 ---HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L+ A + ++Y + + + + I K Sbjct: 253 EIDSKTLKNIAETTDGNYYRATSNNQLKQIYKDIDK 288 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 75/208 (36%), Gaps = 29/208 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T D++ ++D S S+ + ++ + L + L+ F Sbjct: 141 CTTRPTDIVFLIDGSGSINK---ADFERMKQFVSETTKRLSGRDT--------RFALIQF 189 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++ E F ++ + ++ + G T + ++ ++F Q ++ T Sbjct: 190 SDRYLEHFNF--NSEDPEQLVLHIHQVGGWTETATAIRRVVRELFTSQKGSRNGAT---- 243 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP 342 +I++ +TDG + Q + EAK+ G I YAIG + L AS Sbjct: 244 --RILIVITDGVKTDRLQYSQVIP---EAKQAGIIRYAIGVGDAFSSPDAKRELDDIASE 298 Query: 343 ---NSFYLVENPHSMYDAFSHIGKDIVT 367 + V N +++ + I Sbjct: 299 PKAEHIFTVYNFNALRGIQDQLKDKIFA 326 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 48/330 (14%), Positives = 111/330 (33%), Gaps = 50/330 (15%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 + +++ + + G ++ L + + +E ++ + Sbjct: 172 NGYQNGGHMNILRPLPRARSSGRLHHLATLLPDTDPSTFNDDEPLDLSSEEANDNHQGCL 231 Query: 108 DDIVRSTSLDIVVVPQNEG-YSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVN 165 + T + VP+N + + + K P+ + + + Sbjct: 232 RTVEIKTYPEFTEVPENTSERNFTVLIHLKAPVAQHLQPSSNLGDGNGLST--------- 282 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A +D++ VLDVS SM TK+ + +++ +++ + + ++ Sbjct: 283 --ARAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIA 328 Query: 226 FSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ F L +++ + L+ G TN L+ I RQ N Sbjct: 329 FSSSARRLFPLRRMTESGRQQSLLAVNSLTSNG-GTNIAEALRKGSKVI----EERQAKN 383 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSL-----YYC----NEAKKRGAIVYAIGIRVIRS 332 + I+ ++DG++ T + YC + + V+ G Sbjct: 384 PVCS-----IILLSDGQDTYTVSPTAGVHKGAPEYCALLPSTNGNQQVPVHVFGFGADHD 438 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 439 SVSLHSISQTSGGTFSFIETEAAIQDAFAQ 468 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 69/195 (35%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM + + +++ A E+ LI + G+VTF + Sbjct: 310 VCLVLDKSGSM--NREDRLFRMNQAA--------ELYLIQIIEKGSLVGMVTFDSSAEIQ 359 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ + K T+ GLK + I Sbjct: 360 NNLTKIIDENTYQKITANLPQKPSGGTSICGGLKAGFQAISQSNQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C E K+ GAI++ I + E L FY Sbjct: 411 IILLTDGEDNQMSS-------CFEEVKQSGAIIHTIALGPSADRELETLSNMTRGRRFYA 463 Query: 348 VENPHSMYDAFSHIG 362 ++ + + DAFS I Sbjct: 464 HKDINGLIDAFSRIS 478 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 71/463 (15%), Positives = 130/463 (28%), Gaps = 107/463 (23%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT---QIMN 67 F + G + I+T + L + ++ GM ++ + L S+ DR+++ AA+ + + Sbjct: 25 FGRDEDGSVIIMTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAASLNQTLDS 84 Query: 68 EGNGNN-------RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 + L G I+ N+ ++ R+ L N F + + R T+ Sbjct: 85 RDVVEDYFAKAGFPNALVGAPIVVDNGNSRTVTVRSALDVNTFYLRLAGMDRLTAPARSS 144 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI---- 176 + G + + + + ++ + + I Sbjct: 145 ATEGVG---KVEISLVLDISGSMRFSNRFVNMQAAAIAFAEEVLDPANGGTVSLTIIPYA 201 Query: 177 -VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD---------VNNVVQSGLVTF 226 + M +F + + E ++ +GL Sbjct: 202 GATNPGPEMFAFMGGVRYPDTLLAGDDGILGTEDDYFFPQVSSCVEMVGSDWSSAGLPGA 261 Query: 227 SNKIEEFFL--------LEWG---------------VSHLQRKIKYLSKFGVSTNSTPGL 263 F ++WG + + I L T + + Sbjct: 262 GRAQVPHFQVWDIARSVMDWGWCPQDRSSIQYAMATPAQARSFINGLRMH-DGTGTHYAM 320 Query: 264 KYAY------------------NQIFDMQGMRQHCNTEDANYKKIIVFMTDGE------- 298 KYA + Q + +D KKIIV MTDG+ Sbjct: 321 KYALATLDPSSQPAFMHLSHPGRGLVPPQFANRPAAWDDPETKKIIVLMTDGDITQQERP 380 Query: 299 ---------------------NLSTKEDQQS-----LYYCNEAK--KRGAIVYAIGIRV- 329 N D + C A R VY + V Sbjct: 381 RIAQQERDIDYIISRSINGRDNRGQFVDAATNVGRFEAICTLANQPARSVDVYTVAFEVQ 440 Query: 330 IRSHEFL--RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S L R CAS S + + + D FS I + I R+ Sbjct: 441 PNSAADLQMRNCASDPSMFFRTSGAELIDVFSGIAERITDLRL 483 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 92/222 (41%), Gaps = 39/222 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + AS F+ V + ++ D +G I + ++K Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIFSLEGEHNK 358 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 42/225 (18%), Positives = 80/225 (35%), Gaps = 37/225 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPD 214 + +S + + +M+VLD S SM + S T++D A +++ +M++ + Sbjct: 19 TFVNASTAAAAGELSP--VMVVLDSSGSMTARDAGGSGTRMDAAKRAVGSMVDGLPAG-- 74 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q GL + G + + ++ + G P LK A Sbjct: 75 ----AQVGLAIYGAGTGSS-----GAEKVAGCKDVRVVQPVGPVNK--PALKRAVTATKA 123 Query: 273 M------QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK---KRGA--I 321 Q +R ++ IV ++DGE+ C AK K+G Sbjct: 124 SGYTPIGQALRTAAAQLPKEGQRSIVLVSDGEDTCAPP-----QPCEVAKELSKQGVDLH 178 Query: 322 VYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGK 363 V+ IG RV AC + +++ + S+ + + Sbjct: 179 VHTIGFRVDAKARAQLACIAQNTGGTYHDASDADSLLGVLGRVTE 223 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 77/211 (36%), Gaps = 34/211 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 RLD+ IV+D S S+ S + + + N + I N V+ GL +FS Sbjct: 1239 GRLDIQIVIDTSGSLTSAPNKD----QVLMNFTNNLANMYDTI----NQVKIGLTSFSES 1290 Query: 230 IEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ + LQ + ++ G TN T G++ A N + Sbjct: 1291 SVLEMPLDFYNQLELQDGVSNMTWQGSFTNITSGVETALNDMDTSDA-----------VD 1339 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF--- 345 +++ +TDG + ++AK G + A+G EF SPN + Sbjct: 1340 DVMILITDG--FQSTNTTLMFQMIDQAKAEGVRLIALGFF----GEFAFY--SPNLYLMT 1391 Query: 346 ---YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 Y N + + I + I + + D Sbjct: 1392 NEVYHAANYAELLAIDNTIFETICSDGVIPD 1422 >gi|148655977|ref|YP_001276182.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568087|gb|ABQ90232.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 420 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 74/209 (35%), Gaps = 30/209 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q A +++ V+D S SM KID ++ +E + V+ +V Sbjct: 38 TQVRAPVNVCFVIDRSGSM------KGEKIDRVRRATIRAIEMLDAQDVVS------VVI 85 Query: 226 FSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ E + L +I + G T P ++ +I Sbjct: 86 FDHRTEVLIPATPVTRPAELIDRINRVRDSG-GTRIAPAIEAGLREI----------EKG 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 + ++ +TDG+ + + L +A +R + A+G+ + + L A S Sbjct: 135 PPQMVRRLILLTDGQTENESD---CLRRATDAGQRNVPITALGVGKDWNEDLLIEMANRS 191 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ P + + F + + Sbjct: 192 GGTADYIDRPEKIVEYFQSTIQRAQATAV 220 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/254 (16%), Positives = 87/254 (34%), Gaps = 44/254 (17%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFD 189 +K+ L + +V+ + + + +++M+ +D+S SM S D Sbjct: 46 KSWKMRLIHLPMLLRCLVYALVVIVLARPQTTTPIDNGQVEGINIMLAVDISASMLSD-D 104 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-------VSH 242 +I++A + P+ N GL F+ + + ++ Sbjct: 105 VDPNRIEVAKDVAKEFIAS---RPNDN----IGLTIFAGEAFTQCPMTTDHASLLNLLAG 157 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 ++ + T GL A ++ D++ K+++ +TDG N Sbjct: 158 IRADLSVNHLIQDGTAIGMGLANAVGRLKDVKEGS-----------KVVILLTDGSNN-- 204 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGI--------RVIRSH---EFLR--ACASPNSFYLVE 349 D L + A+K G VY IG+ + L+ A + FY + Sbjct: 205 VGDISPLTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYKTLQDIAMQTDGEFYRAQ 264 Query: 350 NPHSMYDAFSHIGK 363 + + + I K Sbjct: 265 SRAELSQIYKDIDK 278 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 76/232 (32%), Gaps = 36/232 (15%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL---DMMIVLDVSRSMESFFDSSITKI 195 W + + + + + D + V D S SM S T Sbjct: 498 FSVFRIEEWEDETSDTITLDGNETDGEIGNGSGIETADFVFVNDESGSMS----GSPTHY 553 Query: 196 DMA--IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 + + A+ + ++G V +++ L + ++ LS Sbjct: 554 AELAGKRFVGALTDS----------ERAGRVGYASGANLDQPLTTDHDAVNSSLERLSAS 603 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G TN+ GL+ N + N +++ ++DG++ S L Sbjct: 604 G-GTNTRAGLRVGLNHL---------EEEGWENRSAVMILLSDGKSGSDP-----LPVAE 648 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +A + G + +G+ + LR A + FY VE + D F + + Sbjct: 649 DAAEAGVEISTVGLGNNINENELREIAAITGGDFYHVEREEDLPDTFERVAE 700 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 88/262 (33%), Gaps = 29/262 (11%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 + +F ++ + + S+ + + Y + + P Sbjct: 237 SVATDPVSTFSADVDSASYSFVRRSLTGGAMPDPQSVRVEEMINYFPYDWAGPEKADQPF 296 Query: 140 KF---CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K PW ++ + + I + T +++ ++DVS SM+ K+ Sbjct: 297 KATVTVMPTPWNHDTELMHVAIKG-YDIAPATAPHANLVFLIDVSGSMDE-----PDKLP 350 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + + ++ ++K V+ V +G + S + I L G S Sbjct: 351 LLKSAFRLLVSKLKADDTVSIVTYAGNAGTVLEPTRVAE----KSKILSAIDRLEA-GGS 405 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEA 315 T G++ AYN + ++ N ++ TDG+ N+ D+ E Sbjct: 406 TGGAEGIEAAYN-LAKQAFVKDGVNR--------VMLATDGDFNVGPSSDEDLKRIIEEK 456 Query: 316 KKRGAIVYAIGIRVIRSHEFLR 337 +K G + +G ++ L Sbjct: 457 RKDGIFLTVLGFGRGNLNDSLM 478 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 73/213 (34%), Gaps = 30/213 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + DA +++VLD S SM + T+++ A +++ A+++ + Sbjct: 10 AGALPVAAVPAVTDDAGGSLVMVLDSSGSMGEDDGTGSTRMESARRAVGAVVDALPDGYP 69 Query: 215 VNNVVQSGLVTF----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +GL + + + L+ + ++ + + G T L+ Sbjct: 70 ------TGLRVYGADRPQGCADTRLVRPVRPLD--RAAVKSAVAGVRPTG-DTPIGLSLR 120 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + + A + IV ++DGE+ + + G + Sbjct: 121 KAAEDL--------PAPRDGAARTRTIVLVSDGEDTCGTPPPCEVAARLAGQGAGLRIDT 172 Query: 325 IGIRVIRSHEFLRAC---ASPNSFYLVENPHSM 354 +G +V + C A +Y + ++ Sbjct: 173 VGFQVKGAAREQLECVAEAGNGRYYDAPDADAL 205 >gi|291295619|ref|YP_003507017.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470578|gb|ADD27997.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 412 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 84/231 (36%), Gaps = 31/231 (13%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 IPLK + +++ I + L++ +VLD S SM + Sbjct: 16 IPLKP-GVSATRPTRQQVLLRIHTPTPQARPERPLLNLALVLDRSGSMGGSKLKYTK--E 72 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGVSHLQRKIKYLSKF 253 AI +++ +L E + +V + + +E + G + + I+ + Sbjct: 73 AAIYAVHNLLPE----------DRVAVVIYDDAVEVLVPSTPVADGRAAIANLIRTIRTG 122 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYY 311 G + L + ++G Q ++A +V ++DG T + Sbjct: 123 GST-----ALHAGW-----LEGATQVAAYQEAGRLNRVVLLSDGLANRGETNPGVIAEQV 172 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 A+ RG +G+ + + + + A ++Y +E+P + F+ Sbjct: 173 RELAR-RGVSTSTLGVGLDYNEDLMTTMADAGEGNYYFIESPADLPRIFAQ 222 >gi|47226324|emb|CAG09292.1| unnamed protein product [Tetraodon nigroviridis] Length = 752 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 65/165 (39%), Gaps = 21/165 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + + L + P + G+V FS+ Sbjct: 343 GALDIVFVIDSSESVG---LTNFTLEKNFVINTINRLGSMASDPAATTGTRVGVVQFSHN 399 Query: 230 -----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +S + +K L T + LK+AY+ + N++ Sbjct: 400 GTFEAVRLDDPNINSMSAFKAAVKRLQWIAGGTFTPSALKFAYDNLI--------RNSKR 451 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR 328 A K +V +TDG ++D+ L Y C+ V AIG+ Sbjct: 452 ARAKISVVVITDGR-FDPRDDEDLLKYLCDH---DNVAVNAIGVG 492 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 18/215 (8%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P S V + +D++ +LD S E S++ + ++ ++ L + D Sbjct: 547 DLPCKSEAHVAPCVERPVDLVFLLDGS---ERLGQSNLQYVREFVQKVSDRLVLARSRTD 603 Query: 215 VNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + L+ F + E F L + + I L F S++ P + + +++ Sbjct: 604 -RMRARMALIEFGKENETHVAFPLTHDQAAISDGIARLPYFDSSSSVGPAIIHTIDKVLG 662 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R N + VF+TDG S ++ + + + I Sbjct: 663 KGPAR----KTRRNVEISFVFITDGFTDSRNLEEAV----SAMRGAQVVPTVIATGNDVD 714 Query: 333 HEFLRACASPN--SFYLVENPHSMYD--AFSHIGK 363 E L A + + + + + F + Sbjct: 715 EEVLMKLAMGDAEAIFKAKAVTDLSQPALFERFIQ 749 >gi|62286486|sp|Q863A0|CO2_GORGO RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|29690202|gb|AAM10001.1| complement C2 [Gorilla gorilla] Length = 752 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF++K + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASKPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRILGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDRIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 90/266 (33%), Gaps = 52/266 (19%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSME- 185 ++ + P +++ + + + V + +D++ VLDVS SM Sbjct: 39 TVKVSTTPIFPQIPLGQA---RKDFQVLLRVEAPTAAVRPEARVPIDVVAVLDVSGSMND 95 Query: 186 -----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLL-- 236 + +++D+ + M+ +++ + +V F+++ E L Sbjct: 96 PAAVPPERRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLY 149 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G + + L G T P + A + G ++ IV + Sbjct: 150 MSADGRRKAMKSVDQLEARG-GTALVPAFEEAVKVLDGRVGDGRNRLG-------FIVLL 201 Query: 295 TDGENLSTKEDQQSLYYCNEAKK--RGAI----VYAIGIRVIRSHEFLRACA--SPNSFY 346 TDGE D ++ RGA+ V+ +G+ E L A S ++ Sbjct: 202 TDGE------DTSGFTLSERRREVIRGALGRYPVHTLGLGRAHDPEVLLYLAQESHGTYS 255 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V++ +IG+ + Sbjct: 256 FVDD--------DNIGEVAGALAVCL 273 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 46/271 (16%), Positives = 91/271 (33%), Gaps = 48/271 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + + ++ + Y + + F I + P T++ NS+ + +D+M+ Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFMLRIIALALIIVVLARPQTTNSWQNSEIEG-IDIMMA 93 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 94 IDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESFTQCPLT 145 Query: 238 WGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + L + + T G+ A ++ D + K+I+ +T Sbjct: 146 VDHAVLLNLFQGIKCGIIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLT 194 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSH 333 DG N K D L AK G VY IG+ I Sbjct: 195 DGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEK 252 Query: 334 EFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + A + +++ + + + + I K Sbjct: 253 TLTQIAATTEGNYFRATSNSKLKEVYEEIDK 283 >gi|209527391|ref|ZP_03275898.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492182|gb|EDZ92530.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 488 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 66/199 (33%), Gaps = 28/199 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++++D S SM S K+ + + + L +V FS++ Sbjct: 53 VVLLIDTSGSM------SGQKLREVQTAASEFVSRQNL-----KRHDLAVVEFSSRASVV 101 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + LQ+ I LS G TN + G A + + + I+ Sbjct: 102 ADFTRNETELQQAIARLSARG-GTNLSEGFNLATSVLQNSDRTPN------------ILL 148 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 TDG + + + + G + A+G + + P+ + N Sbjct: 149 FTDGVPNNPP---MAASIAQQIRASGINLVAVGTGDAQINYLTALTGDPDLVFYA-NFGD 204 Query: 354 MYDAFSHIGKDIVTKRIWY 372 + AF K I +++ Sbjct: 205 LDRAFRGAEKAIYGQQLVE 223 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 90/266 (33%), Gaps = 52/266 (19%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSME- 185 ++ + P +++ + + + V + +D++ VLDVS SM Sbjct: 31 TVKVSTTPIFPQIPLGQA---RKDFQVLLRVEAPTAAVRPEARVPIDVVAVLDVSGSMND 87 Query: 186 -----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLL-- 236 + +++D+ + M+ +++ + +V F+++ E L Sbjct: 88 PAAVPPERRPTTSRLDLLKTAAKFMVAKLEDGD------RLSIVAFNDRPVKELSSGLLY 141 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G + + L G T P + A + G ++ IV + Sbjct: 142 MSADGRRKAMKSVDQLEARG-GTALVPAFEEAVKVLDGRVGDGRNRLG-------FIVLL 193 Query: 295 TDGENLSTKEDQQSLYYCNEAKK--RGAI----VYAIGIRVIRSHEFLRACA--SPNSFY 346 TDGE D ++ RGA+ V+ +G+ E L A S ++ Sbjct: 194 TDGE------DTSGFTLSERRREVIRGALGRYPVHTLGLGRAHDPEVLLYLAQESHGTYS 247 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V++ +IG+ + Sbjct: 248 FVDD--------DNIGEVAGALAVCL 265 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 67/217 (30%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVINGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + M Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKMDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N + + + A R +Y +G+ Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 331 ---RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 ++ + ++ N + + I K Sbjct: 245 MDLDENQLKHIAEVTHGRYFRARNSQELDQIYQEIDK 281 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/317 (13%), Positives = 107/317 (33%), Gaps = 43/317 (13%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGY 127 R + + + + +F ++ + N + + + ++ + + Y Sbjct: 27 ERYQHIDDNAIVQAAQHPISTFSIDVDTGSYSNVRRFLNAGSLPPADAVRVEELINYFRY 86 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--KVNSQTDARLDMMIVLDVSRSME 185 P T + ++ ++ + + +++ ++DVS SM+ Sbjct: 87 D-DPAPTNGQPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMD 145 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHL 243 + K+ + S+ ++ +++ + LVT++ I G + Sbjct: 146 A-----PDKLPLLQSSLKLLVRQLRAQD------RITLVTYAGNISVVLPPTPGDQQGRI 194 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 I L + G ST G++ AY + +R N I+ TDG+ N+ Sbjct: 195 VEAIDAL-QSGGSTAGASGIELAY-KAAQQGYLRGGINR--------ILLATDGDFNVGV 244 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP-------- 351 Q E ++ G + +G ++ L A A ++ ++ Sbjct: 245 TNFDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLT 304 Query: 352 HSMYDAFSHIGKDIVTK 368 H + + I +D+ + Sbjct: 305 HELGATLATIARDVKIQ 321 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/211 (20%), Positives = 82/211 (38%), Gaps = 37/211 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +D ++V+DVS SM + K ++ +++ ++ + + + G+V ++ Sbjct: 35 SGNNMDAVLVVDVSNSM-----TQSDKNKVSNEAMKMFVDMTSIQAN-----KVGVVAYT 84 Query: 228 NKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +KIE L E + ++ I L K G T+ G+ A + N Sbjct: 85 DKIEREKALLEINSEEDKNDIKAFIDSLQK-GAYTDIAVGVTEAVKIL------DAGRNP 137 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNE--------AKKRGAIVYAIGIRVIR--S 332 +A IIV + DG N K ++ ++ AK +G VY IG+ + Sbjct: 138 NNAP---IIVLLADGNNFLNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADGQLN 194 Query: 333 HEFLR--ACASPNSFYLVENPHSMYDAFSHI 361 L+ A + F+ + S I Sbjct: 195 RTTLQQIAAETNGKFFETSTADKLPQILSEI 225 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|149918184|ref|ZP_01906676.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] gi|149820944|gb|EDM80351.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] Length = 522 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 80/237 (33%), Gaps = 27/237 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW +R + + + + +++ +LDVS SM K+ + S Sbjct: 125 VGPCPWSDEARLVHIGLQGKSIPERE-LPPRNLVFLLDVSGSMH-----DQDKLPLLTDS 178 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNS 259 + ++ ++ V VV +G G + I L G STN Sbjct: 179 LRVLVNQLGERDHVAIVVYAGASG------VVLPPTRGSDRGTILAAISELRA-GGSTNG 231 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY R+H + + N ++ TDG+ N+ T + ++ Sbjct: 232 GEGIQKAY------ALAREHFDPQGINR---VILATDGDFNVGTTTESGLENLIERERES 282 Query: 319 GAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 G + +G + A + S+ +A +G + + + K Sbjct: 283 GVFLTVLGFGRGNLGDRTMEMLADKGNGNYAY-IDSLAEARKVLGTEAGSTLVTIAK 338 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 26/173 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAIDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL-----------RLRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +++V +TDG N + + D + A G +Y IGI + L++ Sbjct: 193 NSRVLVLVTDGANNAGQIDPITA--ARLAADEGVKIYPIGIGSDPDKDALQSA 243 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 71/222 (31%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + V P N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKDVASEF---VSGRPSDN----IGLTIFAGESF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L +++++ I T GL A +++ Sbjct: 140 TQCPLTIDHGSLLNLLNNVRTDIAARGLIQDGTAVGMGLANAVSRL-----------KGS 188 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+++ +TDG N D L A+ G VY IG+ Sbjct: 189 KAKSKVVILLTDGSNNM--GDISPLTAAQIARSLGIRVYTIGVGTNKVAPYPMPVAGGVQ 246 Query: 330 -------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + A + +FY N + + I K Sbjct: 247 YVNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIYRDIDK 288 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 79/225 (35%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P N++ +++ +T ++ + LD++ VLDVS SM+ I K+ A+K + Sbjct: 35 NMAPLEENTQKVLLELTG--GDSTSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFV 89 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTN 258 L + + +VTF + L Q + I L G +TN Sbjct: 90 VKKLSSID---------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQP-GGNTN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + GL+ + D + ++ M+DG+ K Sbjct: 140 ISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRG-------EPAANVKIG 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 VY G L A A +F +V + + + AFS Sbjct: 185 NVPVYTFGFGADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 87/270 (32%), Gaps = 46/270 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + ++ + Y + + F I ++ ++ + +D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAM 94 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 95 DVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESFTQCPLTV 146 Query: 239 GVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L +K + T G+ A ++ D + K+I+ +TD Sbjct: 147 DHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS-----------KVIILLTD 195 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSHE 334 G N K D L AK G VY IG+ I Sbjct: 196 GVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKT 253 Query: 335 FLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + +++ + + + + I K Sbjct: 254 LTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 76/213 (35%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +DVS SM + D ++ +A + + + P+ N GL F Sbjct: 83 KDTEGIDIMLTMDVSASMLTE-DVYPNRMVVAKEVASEFI---SGRPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + ++ + T GL A +++ D + Sbjct: 135 AGEAFTQCPMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRLKDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 KI++ +TDG N + + A+K G VY IG E Sbjct: 193 ---------KIVILLTDGSNNAGSISPMTA--AAIARKFGIRVYTIGFGKETGEEIGAID 241 Query: 336 ---LR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A ++ FY ++ + + I K Sbjct: 242 YKTLQDIAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/246 (17%), Positives = 83/246 (33%), Gaps = 35/246 (14%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 P + ++ S + I + F + P + + S+ + +VL Sbjct: 258 TEAPNLQNKMCNSRSTWDIIMNSEDFQNASPMTGTDPPPHPTFSLLKSKQRV---VCLVL 314 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---IEEFFL 235 D S SM S ++ ++ L I + +G+VTF + Sbjct: 315 DKSGSMSSE-----DRLFQMNQAAELFL-----IQIIEKGSLTGMVTFESSATIQNYLTE 364 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + ++ + G T+ GL+ + I IV +T Sbjct: 365 ITDHNAYEKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNTSGSE---------IVLLT 415 Query: 296 DGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVENPH 352 DGE+ + C E KK G++++ I + + E L + FY ++ + Sbjct: 416 DGEDDNIS-------LCFEEVKKSGSVIHTIALGPSAAKELEILSNMTGGHRFYANKDIN 468 Query: 353 SMYDAF 358 + DAF Sbjct: 469 GLIDAF 474 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/286 (14%), Positives = 86/286 (30%), Gaps = 22/286 (7%) Query: 55 DRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 D S + ++ +G +L G T VN + Sbjct: 763 DDSDSSKVYNVTSDSDGRVSVRLHEGTYTMTETQTPTGYLAASGTWKITVNADGTYTITK 822 Query: 115 SLDIVVVPQNEGYSISAISRYK------IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + + + Y I +++ K + I + ++S + Sbjct: 823 NGRPIDKGSDSVYKIVNKGQHEDAEANLTTSKTVKVTDYNKREYEITLGASTSGREAGTE 882 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++VLD S SM + + + + ++ +K + V +V F+ Sbjct: 883 AKAASVVLVLDRSGSMG------ADGMTALVNAADTFIDTLKTASPDSQV---AVVYFNG 933 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANY 287 +E + + K S G Y + D + + + + Sbjct: 934 TQDEDDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYSYGGTPMGDALEKAKGLLDADQTGN 993 Query: 288 KKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAIVYAIGI 327 +K ++F TDG + +D S C K A +Y +G Sbjct: 994 QKYVLFFTDGLPGHSSDDAFNCMVANSAVNCATDIKANATIYTVGY 1039 >gi|309362033|emb|CAP28676.2| hypothetical protein CBG_09117 [Caenorhabditis briggsae AF16] gi|309362045|emb|CAP28694.2| hypothetical protein CBG_09097 [Caenorhabditis briggsae AF16] Length = 787 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 73/202 (36%), Gaps = 25/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K ++ +L + + P V+ L+T+ Sbjct: 243 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVDRVLGNINIHP---EAVRVALITY 293 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 294 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSTG 349 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRAC 339 K+ + +TDG + + +D + + G I+ A+ + E Sbjct: 350 IREGVPKMALVLTDGHSHRSPKD-----MAEKMRAAGIIMIAVSVTPRPLVDEAELRLIA 404 Query: 340 ASPNSFYLVENPHSMYDAFSHI 361 S + N H F Sbjct: 405 GSEKRAFTPPNLHEFESEFMKY 426 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + L+ FS Sbjct: 49 APPLDLIFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKTATRVMLIQFSG 98 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D Sbjct: 99 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKTPGSGLR-----DGT 153 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 154 VPKIVYLLSDGRTHDYPKD 172 >gi|268572089|ref|XP_002641231.1| Hypothetical protein CBG09097 [Caenorhabditis briggsae] gi|268572157|ref|XP_002641249.1| Hypothetical protein CBG09117 [Caenorhabditis briggsae] Length = 772 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 73/202 (36%), Gaps = 25/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K ++ +L + + P V+ L+T+ Sbjct: 243 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVDRVLGNINIHP---EAVRVALITY 293 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 294 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSTG 349 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRAC 339 K+ + +TDG + + +D + + G I+ A+ + E Sbjct: 350 IREGVPKMALVLTDGHSHRSPKD-----MAEKMRAAGIIMIAVSVTPRPLVDEAELRLIA 404 Query: 340 ASPNSFYLVENPHSMYDAFSHI 361 S + N H F Sbjct: 405 GSEKRAFTPPNLHEFESEFMKY 426 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + L+ FS Sbjct: 49 APPLDLIFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKTATRVMLIQFSG 98 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D Sbjct: 99 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKTPGSGLR-----DGT 153 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 154 VPKIVYLLSDGRTHDYPKD 172 >gi|123718336|emb|CAJ77151.1| collagen type VI alpha 4 [Mus musculus] Length = 229 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 63/155 (40%), Gaps = 24/155 (15%) Query: 154 IVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 ++ I S + ++Q + D+ ++D S S++ ++ +K + + Sbjct: 93 LMSFIPESTRQSTQEGCESVEKADIYFLIDGSGSIKPN---DFIEMKDFMKEV------I 143 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 K+ + V+ G+V +S+KI F L ++ L I + + G T + L Sbjct: 144 KMFHIGPDRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKAL---- 199 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + + NT + + ++ +TDG++ Sbjct: 200 -----SKMVPVFQNTARIDVARYLIVITDGQSTDP 229 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 74/213 (34%), Gaps = 36/213 (16%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D +++ +D+S SM + D T+ + A + + + + ++ GLV+F+ Sbjct: 82 DPTATVIVTIDISLSMRAQ-DIQPTRFEAAKQEAKNFVRSLP------DGIKVGLVSFAG 134 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +I+ L + T GL + + Sbjct: 135 YATLEAEPTTDHQRVIDQIE-LLQMARRTAIGDGLL---ESLRAIPKDENGKPLG----P 186 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---------------- 332 +V ++DG T + A+ G +V+ IG+ + Sbjct: 187 STVVLLSDGR---TNSGVDPMEVAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYWMQFD 243 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFSHIGK 363 E LRA A+ +Y + ++ A+ ++G+ Sbjct: 244 EETLRAIAEATGGQYYAAGSAEALRQAYRNLGR 276 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 92/287 (32%), Gaps = 56/287 (19%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 +D D+ + L + + + S PL + + + S++ Sbjct: 2 SDDDEFGNAVRLTTLSLSNSPQPVFS-------PLFPTMGTNLEIRPQRVAVSTQSTM-- 52 Query: 165 NSQTDARLDMMIVLDVSRSMESFFD----------------------SSITKIDMAIKSI 202 D+ +V+D S SM D ++ + S+ Sbjct: 53 --------DVALVIDRSGSMAYASDETPDPYVNPASAPPGWTYGDPVPPNSRWLDLVASV 104 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNST 260 NA + P + L T+S+ L + + ++ +S G T+ Sbjct: 105 NAFNGFLVDSP---QYEKLCLATYSSTASRDCDLTHTYAEISNELDAISYQFDGGGTSVG 161 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 GL++ + D R+ +++V MTDG + + K + +Y + G Sbjct: 162 YGLEHGLAVLTDATHARKFAV-------RVMVLMTDGHHNTGKSPESMMY---HLQNHGV 211 Query: 321 IVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDI 365 ++ I + A+ + + + +AF I K + Sbjct: 212 TLFTITFSDDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKL 258 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 77/202 (38%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLD S SM TK+ +++ +L++++ N ++ FSN+I+ + Sbjct: 312 VVFVLDSSASM------VGTKLRQTKEALFTILQDLRPEDHFN------IIGFSNRIKVW 359 Query: 234 FLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ I +S G TN L+ ++ + + + + + Sbjct: 360 QQDRLVPVTPNNIRDAKKYIHNMSPTG-GTNINSALQTG-AKLLNDYIAQNNIDARSVSL 417 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRACAS 341 I+F+TDG + + L +A + ++ IGI ++ L C Sbjct: 418 ---IIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGM 474 Query: 342 PNSFYLVENPHSMYDAF-SHIG 362 F E+ S F IG Sbjct: 475 VRHFQEDEDAASHLKGFYDEIG 496 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 70/207 (33%), Gaps = 29/207 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q D + +++VLDVS SM K+ +A+ ++ + G+V F Sbjct: 300 QEDGGVGLVLVLDVSGSMLE-----ADKLGLAVAGSLELIRSARDQD------YIGVVAF 348 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+ F +R+ + L + G T A + ++ Sbjct: 349 SSSARWVFRPRPMTPQGRREAESLLLSVRAGGGTEIGEAYAEALQALRGLKTE------- 401 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-- 341 K ++ +TDG +L ++A+ A+ + FLR A Sbjct: 402 ----DKQVLVLTDGLVQDPT--LPTLQAAHQAQANKIRTNAVALGSDADRAFLRELAKQG 455 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTK 368 +FY V +P + F + + Sbjct: 456 GGTFYDVPSPKDLPRFFLEEAQRAFQR 482 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 54/250 (21%), Positives = 89/250 (35%), Gaps = 35/250 (14%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 ++V P + G + I ++ L + + + + +S LD++ V Sbjct: 8 MLVSPTDSGSRPTPIVPGRVQLISINNNTVPLEETKLKVMLELTGGDSSNDRPGLDLVAV 67 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LDVS SME KI ++ M++++ I + +VTFS L Sbjct: 68 LDVSGSMEG------EKISKVKTAMLFMIKKLSSID------RLSIVTFSGDARRLCPLR 115 Query: 238 ----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ I L G TN T GLK N + D + I+ Sbjct: 116 QISETSQRELENLINGLKAEGA-TNITAGLKTGLNVLNDRRLSGGRVVG--------IML 166 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVEN 350 M+DGE + + Q V+ G + L+A A +F V+N Sbjct: 167 MSDGEQNAGGDAAQVPV-------GNVPVHTFGFGINHEPRVLKAIAQNSVGGTFSDVQN 219 Query: 351 PHSMYDAFSH 360 ++ AFS Sbjct: 220 TDNLSKAFSQ 229 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 62/169 (36%), Gaps = 26/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +D++I LDVS SM D +++ A + I L+ VK + GLV FS Sbjct: 85 TQKGVDIVIALDVSPSMLVE-DIKPDRLERAKREITDFLKVVKGD-------RVGLVAFS 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L L + L + T+ + A Sbjct: 137 GAAYTQCPLTLDYGALMMFLNILHPNNIPHPGTDLGAAVLGAIK-----------AFDPK 185 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + K+I+ +TDGE+ +++ L EA ++G ++ G+ Sbjct: 186 SETDKVILLITDGEDN----EKRGLDAAREAVRKGIKIFVFGMGDPAGG 230 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 87/270 (32%), Gaps = 46/270 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + ++ + Y + + F I ++ ++ + +D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHVPFVLRIIALILIILVLARPQTTDSWQNSEIEGIDIMLAM 94 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 95 DVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESFTQCPLTV 146 Query: 239 GVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L +K + T G+ A ++ D + K+I+ +TD Sbjct: 147 DHAVLLNLLKDMKCGLIEDGTAIGMGIANAVTRLKDSKAKS-----------KVIILLTD 195 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSHE 334 G N K D L AK G VY IG+ I Sbjct: 196 GVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKT 253 Query: 335 FLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + +++ + + + + I K Sbjct: 254 LTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 71/206 (34%), Gaps = 41/206 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ VLD S SM TK+ ++ +L +++ + ++ F N+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQD------RFSIIGFPNRIKVW 343 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G + I ++S G T+ L+ A + H D Sbjct: 344 KDHLISVTPDSIRDG----KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGD 395 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----L 336 + +IVF+TDG+ + +L N ++ ++ IGI L Sbjct: 396 RSVS-LIVFLTDGKPT--VGETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSL 452 Query: 337 RACASPNSFYLVENP-HSMYDAFSHI 361 C + E+ + + I Sbjct: 453 ENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 80/225 (35%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P N++ +++ +T ++ + LD++ VLDVS SM+ I K+ A+K + Sbjct: 35 NMAPLEENTQKVLLELTG--GDSTSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFV 89 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTN 258 L + + +VTF + L+ Q + I L G +TN Sbjct: 90 VKKLSSID---------RLSIVTFLDTANRICPLQQVTEDSQPQLLKLIDALQP-GGNTN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + GL+ + D + ++ M+DG+ K Sbjct: 140 ISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRG-------EPAANVKIG 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 VY G L A A +F +V + + + AFS Sbjct: 185 NVPVYTFGFGADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|254414936|ref|ZP_05028700.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178425|gb|EDX73425.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 576 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 77/206 (37%), Gaps = 24/206 (11%) Query: 133 SRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + P T PW + + + + + ++++ +++ +LDVS SM Sbjct: 172 PQADKPFSITTEVAEAPWNSKHKLVHIGLQGK-SISTENLPPSNLVFLLDVSGSM----- 225 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 S K+ + ++ ++++++ V+ VV +G + I Sbjct: 226 SDANKLPLLKEAFRLLVDQLRDEDKVSIVVYAGAAG----TVLPPTPGNQKDTILAAIDK 281 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQS 308 L G ST G+K AY ++ + N ++ TDG+ N+ D+Q Sbjct: 282 LEA-GGSTAGGQGIKLAY-KLAQDNFIESGNNR--------VILATDGDFNVGISSDEQL 331 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHE 334 + E +++ + +G + Sbjct: 332 VSLIEEKREQDIFLTVLGFGTGNLQD 357 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 50/346 (14%), Positives = 115/346 (33%), Gaps = 54/346 (15%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 +++S+ + E R + + R+ + +F ++ + N + Sbjct: 75 QWALNKSVARMGIRGTVEVQNRERYAHSDANPVHRVSDAPVSTFSIDVDTGSYSNIRRML 134 Query: 111 VRSTSL---DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS------- 160 R L D V V + Y Y PL P+ +++ + P + Sbjct: 135 TRENRLPPADAVRVEEILNY-----FAYGYPL-PQDGKPFAVHTQTVDSPWQADAKLIRI 188 Query: 161 ---SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + + +++ ++D S SM+ K+ + K++ E ++ Sbjct: 189 AIQAADLAPEKRPPANLVFLIDTSGSMD-----DPDKLPLVKKTVCHFAEALRADD---- 239 Query: 218 VVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + L+T+S E G + +K L G T L+ AY+ Sbjct: 240 --RISLITYSGSTAEILPPTAGDQKETIIAALKPLRAHGA-TAGGEALRMAYDAA-AKNY 295 Query: 276 MRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N I+ TDG+ N+ + Y + +K G + +G ++ Sbjct: 296 RKDGINR--------ILLATDGDFNVGISDPATLKNYVADKRKSGISLTTLGYGSGNYND 347 Query: 335 FLR---ACASPNSFYLVEN--------PHSMYDAFSHIGKDIVTKR 369 + A A ++ +++ + + + +DI + Sbjct: 348 EMMEQLADAGDGNYSYIDSEAEAKKVLVRQLTSTLATVARDIKIQL 393 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 79/225 (35%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P N++ +++ +T ++ + LD++ VLDVS SM+ I K+ A+K + Sbjct: 35 NMAPLEENTQKVLLELTG--GDSTSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFV 89 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTN 258 L + + +VTF + L Q + I L G +TN Sbjct: 90 VKKLSSID---------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQP-GGNTN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + GL+ + D + ++ M+DG+ K Sbjct: 140 ISDGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRG-------EPAANVKIG 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 VY G L A A +F +V + + + AFS Sbjct: 185 NVPVYTFGFGADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQ 229 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 88/214 (41%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 39/202 (19%) Query: 174 MMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SM + D T+++ + + +EE + GL F+N+ Sbjct: 90 IVLTLDMSGSMLIEDFDIDGVQSTRLEAVKRVARSFVEE-------RQGDRIGLALFANR 142 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + ++ + R I+ S G ST GL A ++ + Sbjct: 143 AYVAAPLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGAAS---------- 192 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EFL 336 ++IV ++DG++ + + D + A + G ++ I + L Sbjct: 193 -RVIVLLSDGQDNAHQID--ARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATL 249 Query: 337 RAC--ASPNSFYLVENPHSMYD 356 RA AS Y V + Sbjct: 250 RAIAEASGGRSYRVRGMEDLRA 271 >gi|218672104|ref|ZP_03521773.1| von Willebrand factor type A [Rhizobium etli GR56] Length = 366 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 79/236 (33%), Gaps = 24/236 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSV 162 + S+ + + Y + P K PW ++ + + I Sbjct: 8 TGGAMPDPQSVRVEEMINYFPYDWPRPDKADQPFKATVTVMPTPWNHDTELMHVAIKG-Y 66 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + T +++ ++DVS SM+ K+ + + ++ +K V+ V +G Sbjct: 67 DIAPATAPHANLVFLIDVSGSMDE-----PDKLPLLKSAFRLLVNRLKPDDTVSIVTYAG 121 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + S + I L G ST G++ AY + ++ N Sbjct: 122 NAGTVLEPTRVAE----KSKILSAIDRLEA-GGSTGGAEGIEAAY-DLAKKAFVKDGVNR 175 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ D+ + ++ G + +G ++ L Sbjct: 176 --------VMLATDGDFNVGPSSDEDLKRIIEDKREEGIFLTVLGFGRGNLNDSLM 223 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + + Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF--------- 335 ++++ +TDG N + + + A R +Y +G+ + Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + I K Sbjct: 245 MDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDK 281 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 75/238 (31%), Gaps = 33/238 (13%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F N KG + + I LP +L + E+S + K L I+++ + Sbjct: 31 RKFIKNDKGAILLPFIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQATLA-------- 82 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 ++ D+ ++ N+ + + + +DI Y+ Sbjct: 83 ------LTVENDDLPDELQMRKNVDLVSNFSSAYLPLEHFSVP---EIDIKNNCGQLTYN 133 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 Y T + + + +V++ D D++ V D S SM F Sbjct: 134 AKITMSYFANFLSKTAMTNAITTIGTEDNGAAIKQVSTIQDKATDVIFVADYSGSMNEGF 193 Query: 189 DSSI---TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + KI+ N + + ++N L+ F WG + Sbjct: 194 HGKVPRGEKINALRDVFNRLNGSILKNSNIN------LIGF-------VPFSWGTKRI 238 >gi|198418440|ref|XP_002122148.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1270 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 72/198 (36%), Gaps = 21/198 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++I+LD S S+ + ++ I +L+ D +NVV GL +FSN I+ Sbjct: 179 VDIIILLDGSTSVFP------SNFELGKSWIKNLLQSFSSDIDKHNVV-VGLYSFSNIIK 231 Query: 232 EFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S L I + T + A ++ A+ K+ Sbjct: 232 REIPLSARTYSTLSGMIDAVRYPYGQTFIHTAINEAVQ---------EYQRAGRASVPKL 282 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVE 349 ++ +TDGE S G I+ A+GI +E + + V Sbjct: 283 LIVITDGEATVPSAVAPSANAARA---AGIILTAVGIGSSVNENELTTIAGAAERVFRVS 339 Query: 350 NPHSMYDAFSHIGKDIVT 367 + S+ + + K I Sbjct: 340 DFSSLGSILAPLQKVITD 357 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 85/230 (36%), Gaps = 34/230 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 S + P + D+ ++D S S++ ++ + ++ + ++ Sbjct: 126 FGSNLLQKPRRIPAALRECPQQDSDIAFLIDGSGSID---PVDFDRMKKFVSTVMSRFQK 182 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA 266 K + L+ +S+ F S L+ ++ + + T++ G++ Sbjct: 183 SKTLF--------ALMQYSDDFRTHFTFNDFKRNSDLELLVRPIGQLFGRTHTATGIRKV 234 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 ++F G R H KI++ +TDGE + + + EA ++G I Y I Sbjct: 235 VRELFHSSNGARNHAL-------KIMIVITDGEKYLDPLEYRDVIP--EADRKGIIRYVI 285 Query: 326 G----IRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTK 368 G +S + L AS + + V N ++A I + K Sbjct: 286 GVGDAFNSKKSRKELDTIASKPPADHVFQVNN----FEALKTIQNQLQEK 331 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 69/210 (32%), Gaps = 36/210 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +Q + D+M+++D+S+SME+ + I +++ ++ +E + Sbjct: 88 TQVQSARDLMLLVDLSQSMEAKDFQDQQGNKIDRLEAVKLVVDDFIER-------REGDR 140 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F K + + + G T + A D + Sbjct: 141 IGLILFGTKAYLQVPFTQDLETARFLLDEAQIGMAGAQTMLGDAIGLAIQTFEDSKTEN- 199 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 ++++ +TDG T A +R ++Y I I + Sbjct: 200 ----------RVLILLTDG--NDTGSQVPPDQAAKFAAQRNIVIYTIAIGNPETEGTEKI 247 Query: 333 -HEFLRACA--SPNSFYLVENPHSMYDAFS 359 E L+ A + F+ + + + Sbjct: 248 DEETLQLIADQTGGQFFRGSDRQGLIQIYD 277 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/217 (13%), Positives = 69/217 (31%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + ++ +E K + GL+ F++ Sbjct: 85 DLMMAVDLSGSMQIEDMVVNGKTVDRFTLIQHVVSDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + + Sbjct: 138 HAYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDKIDESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF--------- 335 ++++ +TDG N + + + A R +Y +G+ + Sbjct: 189 --RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPS 244 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + I K Sbjct: 245 MDLDENQLKHIADVTHGRYFRARNSQELDQIYQEIDK 281 >gi|17555634|ref|NP_498247.1| CUTiclin-Like family member (cutl-23) [Caenorhabditis elegans] gi|13592472|gb|AAK31565.1| Hypothetical protein Y37B11A.1 [Caenorhabditis elegans] Length = 789 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 72/202 (35%), Gaps = 25/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K + +L + + P V+ L+T+ Sbjct: 244 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVERVLGNINIHP---EAVRVALITY 294 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 295 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSIG 350 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRAC 339 K+ + +TDG + + +D + + G I+ A+ + E Sbjct: 351 VREGVPKMALVLTDGHSHKSPKD-----ISEKMRAAGIIMIAVSVTPRPLVDEAELRLIA 405 Query: 340 ASPNSFYLVENPHSMYDAFSHI 361 S + N H F Sbjct: 406 GSEKRAFTPPNLHDFESEFMKY 427 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + L+ FS Sbjct: 50 APPLDLVFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKTATRVMLIQFSG 99 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D Sbjct: 100 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKTPGSGLR-----DGT 154 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 155 VPKIVYLLSDGRTHDFPKD 173 >gi|313238339|emb|CBY13421.1| unnamed protein product [Oikopleura dioica] Length = 501 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 73/193 (37%), Gaps = 28/193 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++M+ + D S S+ S D+ K I +++ + + +V F Sbjct: 31 CAGVQMEMVFLADGSSSV------SQVSFDLMKKWIADFVDKFDVSQTA---TRVSVVQF 81 Query: 227 SNKIE----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ + F L + ++ + L +T++ L+Y N + Sbjct: 82 TDNVSDRNGFGFPLSGQAARVKSDLANLRYMTGNTHAGNALRYVKNNVLS---------R 132 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACAS 341 + K+++ +TDG + E S ++ +V+++GI ++E + Sbjct: 133 ARKSASKVLIVLTDGVSQDEIEKAASDLIGDK-----VLVFSVGIGNSVDANELEKIAGL 187 Query: 342 PNSFYLVENPHSM 354 P + N +++ Sbjct: 188 PEYVFKTANYNAL 200 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 36/206 (17%) Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--------I 230 D+S ++S +I+ + +++E++++ + + + ++ FS + I Sbjct: 227 DLSFIVDSSSSITISDYQKLKTWMKSIIEKLEIGDNAS---RVSILQFSGQSARPQGRWI 283 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + S + I + K T L Y Y +F Q + ++ + Sbjct: 284 NPVLTFDRSTSKEAVIGAIDGMKKLNGDTCIGEALDYFYRNMFTSQ------AGQRSDVE 337 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--------- 339 + ++ MTDG+ + + + A +YAIGI + C Sbjct: 338 QRVIVMTDGKRNCP---AEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEI 394 Query: 340 AS---PNSFYLVENPHSMYDAFSHIG 362 AS + + N + IG Sbjct: 395 ASKPADKYVFEINNFDQLI--LKRIG 418 >gi|313212349|emb|CBY36340.1| unnamed protein product [Oikopleura dioica] Length = 2306 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 74/196 (37%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ VLD S S +I + + + ++ +Q + +S+ I E Sbjct: 578 DIIFVLDGSFSTR---QDGFDRILSFVSA---------TVDALSGNIQYAAIQYSDVITE 625 Query: 233 FFLLEWG-VSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F ++ L +IK + G ST + ++ A++ +FD Q KI Sbjct: 626 EFNFKYRLKKDLIEEIKAMKYDSGWSTYTGLAMEKAWSMLFDQQ------FGARNAVTKI 679 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +V +TDG TK+D + K G V A+G+ E L S + + Sbjct: 680 MVILTDGR---TKDDIE--KISENIKNAGDTTVLAVGLNSAPLDELLTLATSKDLAFYTH 734 Query: 350 NPHSMYDAFSHIGKDI 365 ++ + + I Sbjct: 735 ELADIFHLLAQLIDKI 750 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 76/208 (36%), Gaps = 29/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVSRSME+ D + ++D A ++ + +V LVTF+ Sbjct: 91 IVVTIDVSRSMEA-TDVTPNRLDAAKSGAKDFVDSLPSAFNVA------LVTFAGTANVK 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L+ I + + ST G+ + + + + Q + D IV Sbjct: 144 MPPTTDRTQLKAAIDAI-RLAPSTAIGEGIYTSLDVL--EKLAPQDPDHPDDPAPGAIVL 200 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--------------LRAC 339 ++DG +T + S EAKK+ +Y I + L Sbjct: 201 LSDG---ATNMGRDSADAATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAVNHAELSQV 257 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A S Y ++ ++ + I + I Sbjct: 258 AKLSGGKKYSADSMKNLQAVYQTISRQI 285 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 75/229 (32%), Gaps = 46/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ K + +D+M+ +DVS SM + D +++ A + P+ N Sbjct: 76 STNKWQNSEIEGIDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN--- 128 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMR 277 G+ F+ + L + L I + T G+ A ++ D + Sbjct: 129 -IGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAKS 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------- 329 K+I+ +TDG N K D L AK G VY IG+ Sbjct: 188 -----------KVIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 330 --------------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 235 PVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 75/229 (32%), Gaps = 46/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ K + +D+M+ +DVS SM + D +++ A + P+ N Sbjct: 76 STNKWQNSEIEGIDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN--- 128 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMR 277 G+ F+ + L + L I + T G+ A ++ D + Sbjct: 129 -IGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGTAVGMGVANAVTRLKDSKAKS 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------- 329 K+I+ +TDG N K D L AK G VY IG+ Sbjct: 188 -----------KVIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPY 234 Query: 330 --------------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 235 PVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 77/202 (38%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLD S SM TK+ +++ +L++++ N ++ FSN+I+ + Sbjct: 308 VVFVLDSSASM------VGTKLRQTKEALFTILQDLRPEDHFN------IIGFSNRIKVW 355 Query: 234 FLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ I +S G TN L+ ++ + + + + + Sbjct: 356 QQDRLVPVTPNNIRDAKKYIHNMSPTG-GTNINSALQTG-AKLLNDYIAQNNIDARSVSL 413 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRACAS 341 I+F+TDG + + L +A + ++ IGI ++ L C Sbjct: 414 ---IIFLTDGRPTVGETQSSKILSNTKDAIRDKFCLFTIGIGNDVDYKLLERMALENCGM 470 Query: 342 PNSFYLVENPHSMYDAF-SHIG 362 F E+ S F IG Sbjct: 471 VRHFQEDEDAASHLKGFYDEIG 492 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 74/195 (37%), Gaps = 32/195 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LDVS SM + KI++A ++ ++ + D + +V F+ + + Sbjct: 317 IDIVLCLDVSGSMRELSNGMP-KIEIAKDAVAQYIQFLSKTND-----RLAMVLFNFRAD 370 Query: 232 EFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 WG+ ++R + ++ G TN L+ + + + +H Sbjct: 371 VL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLTRSRSNSKH------ 420 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PN 343 ++ +TDG +++ + ++ G + I I E L + Sbjct: 421 -----VICVTDGRTVNSS---MCIKEAVRLRRSGTTISTIAIGENSDDELLMRLSKIGGG 472 Query: 344 SFYLVENPHSMYDAF 358 F + + H + A Sbjct: 473 LFIKISSIHDLGKAL 487 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 89/266 (33%), Gaps = 49/266 (18%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV------- 59 +++F+ KG ++TA+ + +++ ++ + I K L D++ + Sbjct: 23 KLKDFYQEEKGVYAVMTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALLLVAENN 82 Query: 60 ----------HAATQIMNEG-------NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG 102 + E + +K + +++ + + S +DN Sbjct: 83 AYRKNPMHDDVTKQSVSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENKAQKDNH 142 Query: 103 FVNDID----------DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 ID D++ + P S + IPL N R Sbjct: 143 LPVTIDQPFDYKCEELDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVKTHTKNGR 202 Query: 153 HIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSIN 203 +PI S + + A +D+M+V D S SM ++ KID+ ++ Sbjct: 203 ---LPINSGISYAVKEKAIVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILRSVVS 259 Query: 204 AMLEEV---KLIPDVNNVVQSGLVTF 226 + + KL D + + G F Sbjct: 260 DIQNILFPTKLSEDASPYNRMGFAAF 285 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 66/208 (31%), Gaps = 35/208 (16%) Query: 181 SRSM----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 S SM E D T I + + KL + + G + +E Sbjct: 368 STSMKLIFEDVLDVDKT-IKQVENFDGNRVNDYKLTYNNPDHCLGG----NEGVETSQ-- 420 Query: 237 EWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG-MRQHCNTEDANYKKIIV 292 W + + + G + S+ + A N + D N ++I++ Sbjct: 421 AWFTKSKPKVAEALSKIKPTGSTAASSGFIIGA-NLLMDKNTVPEAQPAKLGTNTQRILM 479 Query: 293 FMTDGENLSTKEDQQSLYY----CNEAKKR--------------GAIVYAIGIRVIRSHE 334 ++DGE+ D + C+ +K+ A G + + Sbjct: 480 VLSDGEDNRPTFDTLTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQPPPEQK 539 Query: 335 FL-RACASPNSFYLVENPHSMYDAFSHI 361 + C N++Y + ++ DAF I Sbjct: 540 AAWQKCVGENNYYEPSSKEALLDAFKQI 567 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 64/163 (39%), Gaps = 25/163 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++M +LDVSRSM + D S +++D A + I M++E+ + GLV F+ Sbjct: 88 QKGIEVMFLLDVSRSMLAE-DVSPSRLDRAKQQIKDMVDEMSGD-------RVGLVVFAG 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L ++ + + + ++ A D Sbjct: 140 ETRQSVPLTSHYEDFKQSLDAVGPHSVRRGGSLLGDAIRSATAGFIDKTNDH-------- 191 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K IV TDGE+ +K + + + G ++ +G+ Sbjct: 192 ---KAIVVFTDGEDQESKPVEAAKEAFTK---NGIRIFTVGLG 228 >gi|312072174|ref|XP_003138945.1| hypothetical protein LOAG_03360 [Loa loa] gi|307765891|gb|EFO25125.1| hypothetical protein LOAG_03360 [Loa loa] Length = 1596 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/370 (12%), Positives = 134/370 (36%), Gaps = 54/370 (14%) Query: 27 LPIIFLVLGMIIEVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILC 84 +P+ E+ + + ++L+ A+ + + G+ + ++ Sbjct: 745 IPVSLGDYNTQTELLSRISETRRMRGEANLGQALLDASGEFLIFGS-KDIPRIVIIFSYG 803 Query: 85 RIKNTWNMS---FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + + W + R++ + + F+ D+ + T +V N I+ + + + Sbjct: 804 QPRGEWKENARLLRDDTKAHIFLVDVGNQGDKTQNLAIVGESNPHRIITIDEWHGVNSEI 863 Query: 142 CTFIPWYT-----------------NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + +R M + + V++ ++ D + D+M +LD S ++ Sbjct: 864 LSPFMDELCRLLPQRQDKTSRDGTWPTRQTEMRVATPVRICNRVDFQADVMFILDSSDNV 923 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 S S+ + I+ +++E+ ++V+ G V +S+K L + + +Q Sbjct: 924 TSEEYSN------LKEGISMLIDEI--FDLSPDIVRVGFVEYSDKASVPVPLGYYDNKVQ 975 Query: 245 --RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 I + G + GL+ A Q N +I++ +T G N Sbjct: 976 LLADISNSEQLGGTPIILRGLRAA---------KEQFQRHGRDNVSRILLLVTSGANRGN 1026 Query: 303 KEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASPNSF-----YLVENPHSMYD 356 + ++ ++ ++ + + + + + + + + + + + + + Sbjct: 1027 -----VAFAADDLREHLNVSIFVLVVNASQGAQIMLNRLTSDEYTQQRVISISSANKLQE 1081 Query: 357 A-FSHIGKDI 365 A IG+ + Sbjct: 1082 AELLQIGQAL 1091 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 70/222 (31%), Gaps = 45/222 (20%) Query: 121 VPQNEGY--SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 ++ Y S+ + Y W ++S + V+ + +D++++L Sbjct: 652 EGEHRSYMWSMETTAGYHASTPSGRISTWTSSSPYCVVS------------SNVDILLIL 699 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-E 237 D S S+ + + I L V N V+ G++ + +K++ L + Sbjct: 700 DSSSSVR------VVDYRIMKDFIKNFL--VNYFNLQRNYVRVGVMKYGDKVQIPVSLGD 751 Query: 238 WGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + + L +I + N L A + + +I++ + Sbjct: 752 YNTQTELLSRISETRRMRGEANLGQALLDA---------SGEFLIFGSKDIPRIVIIFSY 802 Query: 297 GENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHE 334 G+ + A+ A ++ + + Sbjct: 803 GQPRG--------EWKENARLLRDDTKAHIFLVDVGNQGDKT 836 >gi|301778757|ref|XP_002924796.1| PREDICTED: integrin alpha-M-like [Ailuropoda melanoleuca] Length = 1153 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 32/215 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ D+ ++D S S+ ++ + ++ + K + Sbjct: 141 KIRECPRQDSDIAFLIDGSGSINPI---DFQRMKEFVSTVMDRFKNSKTLFS-------- 189 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FS + F ++ + + + + T++ G+ ++F Sbjct: 190 LMQFSEDFQTHFTFNEFKANPNPRFLVNAIIQLYGRTHTATGILKVVRELFHSS------ 243 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFL 336 + N KI+V +TDGE D + + EA + G I Y IG+ + E L Sbjct: 244 SGARENALKILVVITDGEKFGDPLDYKDVIP--EADREGIIRYVIGVGEAFDNPKHREEL 301 Query: 337 RACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 AS + + V N ++A I + K Sbjct: 302 NTIASKPARDHVFRVNN----FEALKTIQNQLQEK 332 >gi|281352695|gb|EFB28279.1| hypothetical protein PANDA_014199 [Ailuropoda melanoleuca] Length = 1110 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/215 (17%), Positives = 76/215 (35%), Gaps = 32/215 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ D+ ++D S S+ ++ + ++ + K + Sbjct: 96 KIRECPRQDSDIAFLIDGSGSINPI---DFQRMKEFVSTVMDRFKNSKTLFS-------- 144 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FS + F ++ + + + + T++ G+ ++F Sbjct: 145 LMQFSEDFQTHFTFNEFKANPNPRFLVNAIIQLYGRTHTATGILKVVRELFHSS------ 198 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFL 336 + N KI+V +TDGE D + + EA + G I Y IG+ + E L Sbjct: 199 SGARENALKILVVITDGEKFGDPLDYKDVIP--EADREGIIRYVIGVGEAFDNPKHREEL 256 Query: 337 RACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 AS + + V N ++A I + K Sbjct: 257 NTIASKPARDHVFRVNN----FEALKTIQNQLQEK 287 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 65/171 (38%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +V +TDG N + + D + A + G +YAIGI + L+ Sbjct: 193 TSRALVLVTDGANNAGQIDPVTA--ARLAAEEGVKIYAIGIGSDPDKDALQ 241 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 30/198 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ +V+D S SM S T + + +L +V ++ +V F + Sbjct: 499 PVDVTLVMDTSGSMSSSVKLRNT---AGQRFVAGLL----------DVDRAAVVDFDSSA 545 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L G T+ GL A +Q + ++ Sbjct: 546 YVAQDLTSDFGAANSTLDNLGS-GGGTDIGSGLSTANSQFASNSNDSR---------AQV 595 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVE 349 ++ +TDG + A + VY +G + + F V Sbjct: 596 MILLTDGRGNG------GISEAQTAANQNTTVYTVGFDNANRDKLRDIANITDGEFNYVT 649 Query: 350 NPHSMYDAFSHIGKDIVT 367 + + + FS I ++ Sbjct: 650 DRSELPNVFSRIAENTTE 667 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 65/171 (38%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +V +TDG N + + D + A + G +YAIGI + L+ Sbjct: 193 TSRALVLVTDGANNAGQIDPVTA--ARLAAEEGVKIYAIGIGSDPDKDALQ 241 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 63/148 (42%), Gaps = 14/148 (9%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQI-------FDMQGM 276 F + ++ +S + IK L +TN+ G+ +A+ + +D Sbjct: 313 FDSPDCGVSQIQPLLSTRRAFIKALDTLYPEFNTNNAEGVMWAWRLLSPHWRGYWDKGKS 372 Query: 277 RQHCNTEDANYKKIIVFMTDGEN--LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + N +K+++ TDG + K D++ + C E KK+G + +I + Sbjct: 373 ELPRDYQHPNNRKVMLLFTDGNHLVDVAKRDRKQVALCREMKKQGIEIISIDFN--NRSQ 430 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIG 362 +++CAS +Y+ +N ++ + Sbjct: 431 VMKSCASAGQYYIADN-RTIRSVLKQVA 457 >gi|48427894|sp|Q8SQ75|CO2_PONPY RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110309|gb|AAL82820.1| complement C2 [Pongo pygmaeus] Length = 752 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTDVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 85/214 (39%), Gaps = 22/214 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D++I++D S S+ + +++ M++ + I N + ++ +S+ ++ Sbjct: 220 DLLILVDESTSIG------AENFEHVKRTLGLMIDNLCDGISPDTN--RVAMLRYSSDVK 271 Query: 232 EFFLLEWGVS--HLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQ-GMRQHCNT 282 E G + + R I+ L ST + + A IF + G R Sbjct: 272 EDLNFIEGSNEPKVMRNIQRLKYKPITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTE 331 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + + +V +TDGE+ E + + G VYA+G+ I+ E R S Sbjct: 332 DGIKVRTEVVIITDGESNDPDETFTIQGQKVKYDEYGIKVYALGVGDIKKDEI-RQLTSM 390 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S + + + + AF+ I + ++ D+ Sbjct: 391 DDESIFYLMSWKDL-AAFNRIIETLIETHAEEDR 423 >gi|194374891|dbj|BAG62560.1| unnamed protein product [Homo sapiens] Length = 723 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 195 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 247 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 248 ASPMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 304 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 305 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 364 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 365 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 422 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 71/232 (30%), Gaps = 52/232 (22%) Query: 172 LDMMIVLDVSRSMESF------FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----S 221 + +++ LDVS SM + S+++++ A +++ L + Sbjct: 89 IALVVALDVSGSMGAEDVVWTPGAPSVSRLEAARRALKLFLAGGAAPDGTAFDPRPGDAV 148 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQ 278 GLV F+ E S L + L G TN L A ++ Sbjct: 149 GLVAFAAVPETVCPATLNHSVLFKVADALQPKGGADAGTNIGDSLAEAVIRLDAADQKS- 207 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQ------------SLYYCNEAKKRGAIVYAI- 325 ++++ ++DGE+ KED + A G VY I Sbjct: 208 ----------RVLILLSDGEHNILKEDVRDAQRPGIDRTLKPREAAQLAANLGVRVYTID 257 Query: 326 -----GIRVI--------RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 + + L+ A + + + + A+ I Sbjct: 258 AGGDPPLGAPPDAVAQRFAGRKALKDVAEMTGGKSFQATSGAELLSAYREIS 309 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 71/220 (32%), Gaps = 50/220 (22%) Query: 172 LDMMIVLDVSRSM---ESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D+++ +D+S SM + D ++ + + + + + P GLV FS Sbjct: 86 IDIVLAVDISGSMLAEDFEMDGKRQNRLYVVKQVVKDFINKRSTDP-------IGLVVFS 138 Query: 228 NKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L L + ++ + T + + +++ + + Sbjct: 139 ANAYTQCPLTLDYGILLQFLEKTEIGLLEDGTAIGSAIASSVDRLRNTKAQS-------- 190 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----------SHEF 335 K+IV +TDG N S + D + A+ +Y IG + Sbjct: 191 ---KVIVLLTDGRNNSGQIDPLTA--AELAQAFNIKIYTIGAGSKGLVPYPARDLFGNRV 245 Query: 336 LRACA--------------SPNSFYLVENPHSMYDAFSHI 361 +R + +Y + S+ + + I Sbjct: 246 MRQVKIDIDDESLAEIANITGGRYYRATDTGSLKEIYQQI 285 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 85/214 (39%), Gaps = 22/214 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D++I++D S S+ + +++ M++ + I N + ++ +S+ ++ Sbjct: 220 DLLILVDESTSIG------AENFEHVKRTLGLMIDNLCDGISPDTN--RVAMLRYSSDVK 271 Query: 232 EFFLLEWGVSH--LQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQ-GMRQHCNT 282 E G + + R I+ L ST + + A IF + G R Sbjct: 272 EDLNFIEGSNEPTVMRNIQRLKYKPITDDRHGSTYTAHAMDKALKTIFTSEAGWRNGTTE 331 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + + +V +TDGE+ E + + G VYA+G+ I+ E R S Sbjct: 332 DGIKVRTEVVIITDGESNDPDETFTIQGQKVKYDEYGIKVYALGVGDIKKDEI-RQLTSM 390 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S + + + + AF+ I + ++ D+ Sbjct: 391 DDESIFYLMSWKDL-AAFNRIIETLIETHAEEDR 423 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 48/245 (19%), Positives = 85/245 (34%), Gaps = 32/245 (13%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARLDMMIVLDVSRS 183 +Y + P+ TN + P S K V +++ ++D S S Sbjct: 185 QYDYKVPDSREAPFSTNVSVVETPWNSDTKLLHIGLKGYTVPLDDLPPQNLVFLIDTSGS 244 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M S K+ + ++ +L ++ + V +G + E L S + Sbjct: 245 M-----SDENKLPLLQQAFRLLLSTLREDDTIAIVTYAGNAG---VLLEPTALSD-KSKI 295 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 I L+ G ST GLK AY MQ D + K I+ TDG+ N+ Sbjct: 296 AEAIAALTS-GGSTAGHAGLKEAYRLAETMQ---------DDDTKSRIILATDGDFNVGL 345 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + E + G + +G + E ++A A + ++ +A + Sbjct: 346 SSADDMKRFVKEKRDSGITLSVLGFGRGNYNDELMQALAQNGNGVAAY-IDTLSEARKVL 404 Query: 362 GKDIV 366 IV Sbjct: 405 VDQIV 409 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/204 (17%), Positives = 71/204 (34%), Gaps = 34/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I +DVS SM + F T+ D A++++ A GL F N++ + Sbjct: 85 IEICVDVSGSMNNPF-GRATRYDGAMEAVTAF-------TSYRQGDAFGLTFFGNEVLHW 136 Query: 234 FLLEWGVSHLQRKIKYLSKFG-----VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L VS + ++ T L+ ++ + Sbjct: 137 CPLTTDVSAINCATPFMRPGQLPPWFGGTLIAKALRACKAEL-----------IKRPEGD 185 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRA-------C 339 ++IV +TDG++ + E K G V+A+ I R + +R Sbjct: 186 RMIVLITDGDSQDF-ANGADAEVAEELKAEGITVFAVVIGNDRQFQNPIIRNGSVQTVTA 244 Query: 340 ASPNSFYLVENPHSMYDAFSHIGK 363 + + +P+++ F I + Sbjct: 245 RTGGESFEAGDPNALATVFKRIDE 268 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 91/266 (34%), Gaps = 31/266 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI--- 65 + F N G I A+ I +L + +EV+ I K L+ ++++++ + Sbjct: 12 KRFIQNQSGVYIIFGALLTLPIVALLFVSLEVAGIIQDKARLNDALEQAVLSLTAENNSG 71 Query: 66 ----------MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS 115 + NG + G +I T+ + ++ +N + ++ + Sbjct: 72 RKSYDYALTNAEKANGKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMKFEPICTTQNNA 131 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA----- 170 + Q S + + P I+ S + Sbjct: 132 ITPKNGKQYAYSSSHVTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSGSMAQKINNVNL 191 Query: 171 RLDMMIVLDVSRSMESFF-------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 LD+M+V D+S SM+ ++ +K+ + + + + ++ L + N + + Sbjct: 192 PLDLMVVADLSGSMDYNINNHKVYSNTEASKLTLLKQVLEELTDKYLLSEEANPNNRISM 251 Query: 224 VTFSNKIEEFF------LLEWGVSHL 243 + F+ + EW SH+ Sbjct: 252 IPFAMGAQHPIRNSCVLPFEWNQSHI 277 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 58/173 (33%), Gaps = 18/173 (10%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 I ++N ML E + + ++ S+ + + Q++I +++ ++ Sbjct: 378 LIVAVNNMLNEPARSDVLKQQTRRTILILSDGSDSIGDDSGENNWYQKEIPFMNFSRITE 437 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N G + +N+ + + H NY + + E Q+ C+ + Sbjct: 438 NLILGKQELFNKSPQSKNLENHIYGYRYNYPIYL--------TNNTEKIQTKGLCDVIRD 489 Query: 318 R----------GAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + I +G +L ++Y + S+ +AF Sbjct: 490 KLNTKNKDNNTKIIFVELGYNSSSKDTWLHCVGGTQNYYSATSKESLLEAFKQ 542 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 75/222 (33%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRLEAAKQVAAQFING---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L ++ I T GL + +++ D + Sbjct: 140 TQCPLTIDHGVLLNLFGSIKGDIAQRGLIEDGTAIGMGLANSISRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+I+ +TDG N + D L AK+ G VY IG+ Sbjct: 193 ----KVIILLTDGSNN--RGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPMPTYAGVQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I ++ ++ +++ + + + + I K Sbjct: 247 YVNVPVEIDEQTLIQIASTTNGNYFRATSNSKLKEVYEEIDK 288 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 77/220 (35%), Gaps = 40/220 (18%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +++ + +DM++ +D+S SM + ++ ++A ++ ++E+ + Sbjct: 86 TLRSTTAAARGIDMVLAIDISESMMQSQTDTQSRFEIARQAARNVVEQRSND-------R 138 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F + L + L + LS T L A N++ + Sbjct: 139 IGLVVFRGEAYTLSPLTRDHTVLSLLLDNLSSRIIQDDGTAIGSALLVALNRLQASESEL 198 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------- 328 Q +++ +TDGEN + + L A +RG Y + + Sbjct: 199 Q-----------MVILLTDGENNA--GEVSPLTAAALAARRGVRFYVLNVAFESVKDENA 245 Query: 329 ------VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 E R + S++ V N + + I Sbjct: 246 PRSALYAAELQEVARR--TGGSYFTVNNKTELETTIASIA 283 >gi|332823604|ref|XP_003311225.1| PREDICTED: complement C2 isoform 2 [Pan troglodytes] Length = 620 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 92 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 144 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ L + I L + Sbjct: 145 ASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYKDHEN 201 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 202 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 261 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 262 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 319 >gi|332823602|ref|XP_003311224.1| PREDICTED: complement C2 isoform 1 [Pan troglodytes] Length = 752 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|62897125|dbj|BAD96503.1| complement component 2 precursor variant [Homo sapiens] Length = 752 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|38257345|sp|Q8SQ74|CO2_PANTR RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|19110330|gb|AAL82821.1| complement C2 [Pan troglodytes] Length = 752 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPRVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 72/235 (30%), Gaps = 31/235 (13%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G + IFLP+ +L + E+SH L I+++ + + E N Sbjct: 17 FIKDEIGAILWPFIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLA----LTIENN 72 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 NN + + + + R L F + +DI+ P Y + Sbjct: 73 TNNPDETQTEKNISLVN----AYARAYLPSESFSAPV--------IDIISHPNYIEYRAA 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF-- 188 Y I + + + D++ V+D S SM+ F Sbjct: 121 TTLNYTPKFLTKELITNIDRRIIVSDNGVAIKNKFTSPGEITDVVFVVDYSVSMDGNFGD 180 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + TKI + + I NN G V WG + Sbjct: 181 EKKTTKIQELRR---IFEDLNNTILKNNNTHTIGFV----------PFSWGTKKI 222 Score = 37.4 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 32/102 (31%), Gaps = 13/102 (12%) Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRVIRS 332 + N K+++ ++DG + D C K+ + IGI Sbjct: 344 KDSKNKDKLMIIISDGNDQEISSDLTQEKITKTLIEKGMCERIKENNIRMVFIGIAYTVK 403 Query: 333 HEFLRACASPNSFYLVENPH----SMYDAFSHIGKDIVTKRI 370 C ++Y +N H + A I V + I Sbjct: 404 EIKWEDCVGKRNYYEAQNAHELEADLRQALGTIEASEVGRNI 445 >gi|237836353|ref|XP_002367474.1| microneme protein 2 [Toxoplasma gondii ME49] gi|211965138|gb|EEB00334.1| microneme protein 2 [Toxoplasma gondii ME49] Length = 723 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 70/178 (39%), Gaps = 22/178 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+ S++ +LD+ ++D S S+ I + + ++ L + + P+ Sbjct: 12 VIQSESAIGAAEGCTNQLDICFLIDSSGSIG------IQNFRLVKQFLHTFLMVLPIGPE 65 Query: 215 VNNVVQSGLVTFSNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 V + +VT+S + + + + STN++ GLK + Sbjct: 66 E---VNNAVVTYSTDVHLQWDLQSPNAVDKQLAAHAVLDMPYKKGSTNTSDGLKACKQIL 122 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 F + K+++ MTDGE+ S D +++ E ++ G IV + + Sbjct: 123 FTGS------RPGREHVPKLVIGMTDGESDS---DFRTVRAAKEIRELGGIVTVLAVG 171 >gi|149546336|ref|XP_001514218.1| PREDICTED: similar to Procollagen, type VI, alpha 2 [Ornithorhynchus anatinus] Length = 1023 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 60/167 (35%), Gaps = 26/167 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI--PDVNNV-VQSGLVTF 226 +D++ V+D S S+ T + + ++ + I + + G+V + Sbjct: 613 GAVDIVFVIDSSESIG------FTNFSLEKNFVINVVSRLGAIAKDPKSETGARVGVVQY 666 Query: 227 SNK-----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ I+ +S + +K L T + L++AYN++ Sbjct: 667 SHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSALQFAYNKLI--------KE 718 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + K V +TDG + D C+ V AIGI Sbjct: 719 SRRKKTKVFAVVITDGRHDPRDNDANLRALCHV----DVTVNAIGIG 761 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 65/176 (36%), Gaps = 12/176 (6%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + V T +D++ +LD S + + K ++ + L + D Sbjct: 816 ELPCQTELYVAQCTQRPVDIVFLLDGSERVGEN---NFHKAQSFVEEVTQRLTLARSDDD 872 Query: 215 VNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N + L+ + + E+ F L ++ + + L S+N G+ +A N I Sbjct: 873 NMN-ARVALLQYGGEAEQEVVFPLTSNLTVISSALANLRYLDSSSNVGNGIVHAINHIVI 931 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 Q +A VF+TDG S + + KK+ + + + Sbjct: 932 PPVDNQKKARPNAELS--FVFLTDGVTESKSLKEAT----EAMKKQDVVPTVVALG 981 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 17/166 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKI-DMAIKSINAMLEEVKLIPDVNNVV---QSG 222 +TD +++ V+D S S+ I + D + +++ N V G Sbjct: 45 KTDCPINVYFVIDTSESV--AMQPPIESLVDHIKDFVIQFTSQLENEFYQNQVAISWHYG 102 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + FS+ +E F L + K+ + FG T + + Q+ Q + Sbjct: 103 GLHFSDLVEIFSPLPSNKEAFRNKLSGVKYFGRGTFTDCAIANMTEQVTQSQVAGVN--- 159 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 V +TDG + L A+ G ++++ + Sbjct: 160 -------FAVVITDGHVTGSPCGGMKLQ-AERARDAGIKLFSVAVN 197 >gi|332267269|ref|XP_003282606.1| PREDICTED: von Willebrand factor A domain-containing protein 1 isoform 1 [Nomascus leucogenys] Length = 444 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 69/192 (35%), Gaps = 26/192 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M +LD S S+ + S + + + + V +P +++ LV ++ Sbjct: 35 DLMFLLDSSASVSHYEFSRV------REFVGQL---VAPLPLGTGALRASLVHVGSRPYT 85 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F S Q ++ ++ T++ L YA Q+F + K+ Sbjct: 86 EFPFGQHSSGKAAQDAVRASAQRMGDTHTGLALVYAKEQLF------AEASGARPGVPKV 139 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN----SFY 346 +V++TDG + E K G V+ + E A ++P F Sbjct: 140 LVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNFLELSAAASAPAEKHLHFV 194 Query: 347 LVENPHSMYDAF 358 V++ H + Sbjct: 195 DVDDLHIIVQEL 206 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/306 (15%), Positives = 109/306 (35%), Gaps = 26/306 (8%) Query: 32 LVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN 91 +V+G +E + T + ++ SL++ E K + + Sbjct: 109 VVIGYGVETQKMICGATPI--AMNHSLLYNVEMSAEEYKAIQENGFKKVWDNPLSTFSID 166 Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + F+N ++ + ++ + Y+ + + P++ + I + Sbjct: 167 VDVASYSNMRRFLNKG-ELPPADAIRTEELINYFSYNYAQPTGND-PVRITSEIGTCPWN 224 Query: 152 RH--IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +V + ++ ++ +++ ++DVS SM ++D+ S+ ++ + Sbjct: 225 EQHRLVRIGLKAKEIPTENLPASNLIFLIDVSGSMY-----GPERLDLVKSSLKLLVNNL 279 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + V VV SG + ++ I L G ST G+K AY + Sbjct: 280 RDKDKVAIVVYSGAAG----EKLASTPGSDKQKIREAIDELEA-GGSTAGGEGIKLAY-K 333 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I + N I+ TDG+ N+ DQ+ + +K G + +G Sbjct: 334 IARKNFITGGNNR--------IILCTDGDFNMGVSSDQELKKLIEQKRKSGVFLTVLGYG 385 Query: 329 VIRSHE 334 + + Sbjct: 386 MGNYKD 391 >gi|156350127|ref|XP_001622153.1| hypothetical protein NEMVEDRAFT_v1g828 [Nematostella vectensis] gi|156208600|gb|EDO30053.1| predicted protein [Nematostella vectensis] Length = 339 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 65/189 (34%), Gaps = 20/189 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S S+ + K N + + V + + G+V +S++ Sbjct: 1 LIFLLDSSGSVG---------LQNYQKEKNFIKDIVSKQNIGRELTRVGVVVYSHEAIVA 51 Query: 234 FLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA--YNQIFDMQGMRQHCNTEDANYKK 289 L L + I + +G +T GL A +Q+F + + Sbjct: 52 IKLTDYSSTEDLNKAIDGIDYYGKTTRIDKGLMTADRIDQVFTV------AAGSRPYLPR 105 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLV 348 ++V +TDG + KK+ A+G+ I + E + +V Sbjct: 106 VLVLLTDGRQTRAPGYKPLKTAVCPLKKKQVKTLAVGVGRGIDARELRELVDHDDDMVMV 165 Query: 349 ENPHSMYDA 357 + + A Sbjct: 166 SDFDDLRTA 174 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 73/217 (33%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+MI +D+S SM+ + ++ M ++ ++ ++ + GL+ F++ Sbjct: 88 DLMIAVDLSGSMKIDDMQVNGRQVDRLQMIKSVLHDFIQR-RVGD------RLGLIFFAD 140 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + + + G T + A + + Sbjct: 141 TAYLQAPLTYDRETVSQLLGESLIGLVGEQTAIGDAIGLAIKRFQSKKESN--------- 191 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------S 332 K+++ +TDG+N + + A G +Y IG+ + S Sbjct: 192 --KVLILLTDGQNTA--GNISPQQANELAINNGVTLYTIGVGADQMMVQSIFGSRQVNPS 247 Query: 333 HEFLRAC------ASPNSFYLVENPHSMYDAFSHIGK 363 E + ++ ++ + S+ + + + Sbjct: 248 QELDESMLTQLAESTGGRYFRARDAESLKAIYDKLDE 284 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 59/164 (35%), Gaps = 24/164 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++MM+V+DVS SM + D +++ +I+ + + +K + GLV F+ Sbjct: 90 IEMMLVVDVSNSMLAE-DFQPNRLERTKYAIDKLFDGLKQD-------RVGLVVFAGDAV 141 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + +S T+ L A D Sbjct: 142 VQLPITSDYRMAKAFARRISPSMVSVQGTDIGQALSLA---------TMSFSEKGDNPAG 192 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++IV +TDGE + + + + G ++ IGI Sbjct: 193 RVIVLITDGEGHDSGAIEAAERAAEQ----GIRIFTIGIGTPEG 232 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 53/298 (17%), Positives = 103/298 (34%), Gaps = 46/298 (15%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R+F + +GG++++ + L + V+G+ + +++ L++ D + + AA I N Sbjct: 3 RSFPHAQRGGISLMVIVSLTTLLAVVGLAFSAGLSYLVRSKLNAATDAAGLAAARAISNG 62 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 ++ + ++ L N +NDI + + I Sbjct: 63 TTQADQIANAKAAGQRFFHANFPSNY---LMSNATLNDISVTFSGSEVTI---------- 109 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + +P + + + + K LDM++V+D S S+ Sbjct: 110 -GVSASASLPAALFGGF----GTSALAPAVVTETKRK-----DLDMIVVMDTSGSLSP-- 157 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-----NKIEEFFLLEWGVSHL 243 S L + D + GLV F+ + + + + Sbjct: 158 -----SAANVRSSAITFLNQFNATRD-----RVGLVHFAFGAIVDDAIRQTARGFDRASM 207 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 IK F ST S G+ A QI + N +N ++IVF +DG S Sbjct: 208 TNHIKAY-AFSGSTASAEGMYTARQQINSVPTA----NLNRSNM-RVIVFFSDGAPNS 259 >gi|89094518|ref|ZP_01167457.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] gi|89081254|gb|EAR60487.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] Length = 445 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 27/211 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + +Q ++ IVLD S SM+ D + AI +IN + Sbjct: 52 IKISLEGHKLEQTQARIPANIAIVLDKSGSMQ--GDKLFRAKEAAIMAINRL-------- 101 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N++V +V++ +++ + + R I + G + K A Sbjct: 102 SQNDIV--SVVSYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGA----- 154 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +R+ + N ++ ++DG N+ + K G V IG+ + Sbjct: 155 --NELRKFLDLNKVNR---VILLSDGLANIGPSTPNELGKLGLSLAKEGMSVTTIGLGLG 209 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + + + A S + VEN + F Sbjct: 210 YNEDLMTQLAGFSDGNHAFVENADDLARVFQ 240 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + M + ++ +E K + GL+ F++ Sbjct: 91 DLMLAVDLSGSMQIEDMVLNGKAVDRFAMVQQVMSEFIERRKGD-------KLGLILFAD 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + G T + A + + Sbjct: 144 HAYLQAPLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIALAVKRFDKAKQSN--------- 194 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 ++++ +TDG N S + + A KRG +Y IG+ Sbjct: 195 --RVLILLTDGSNNSGSITPE--QAADIAAKRGVTIYTIGVGAEVMERRTLFGKERVNPS 250 Query: 331 ----RSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 + L A + ++ N + + I K Sbjct: 251 MDLDEAQLTLLAQKTKGRYFRARNSDELEQIYQEIDK 287 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 73/194 (37%), Gaps = 17/194 (8%) Query: 172 LDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++++ LD S SM F+ + + + N + + ++ + N GLV F + Sbjct: 84 INIILDLDTSGSMRERGFNPNDLQQNRWNVVNNVVQDFIEKRVNDN----IGLVVFGTSV 139 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + + IKY+ V + +FD + I Sbjct: 140 LTASPLSFDKNSQKEIIKYIDIGIVGEQTA---------MFDSLATSINILKNSKAKSNI 190 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVE 349 I+ +TDGE+ ++K Q + AKK +Y IGI + +L Sbjct: 191 IILLTDGEDNASKIPPQIIL--KLAKKYKIKIYTIGIGESNRQMLSTISQETGAKSFLAN 248 Query: 350 NPHSMYDAFSHIGK 363 + + + ++ I K Sbjct: 249 SKDDLVEVYNTINK 262 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 74/216 (34%), Gaps = 45/216 (20%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SME + SI ++++ + +E+ + + GL+ F + Sbjct: 91 DLLLAVDVSGSMEYADMHWQGESIGRLELVKHLLGQFIEDRRGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ ++ + G T + ++ + Sbjct: 144 QAYLQAPLTFDRRTVRTWLEEAAIGIAGKDTAIGDAIGLGLKRL-----------RQRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE------------ 334 ++++ +TDG N + + A G ++ IGI + Sbjct: 193 QSRVLILVTDGANTAGEIAPSVA--ARLAAAEGVRIHTIGIGADPRQDGPPGLLGLTPGL 250 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LRA A + S++ + + + + Sbjct: 251 DLDEPTLRAIAEETGGSYFRARSSEELRAIEETLAR 286 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 68/201 (33%), Gaps = 31/201 (15%) Query: 174 MMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +M++LDVS SME +T++ +S+ + K + GLV F+N Sbjct: 98 IMLILDVSGSMEKNDVAGGLTRLQAVQQSVKKFVAARKSD-------RIGLVIFANSAWP 150 Query: 233 FFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + L+ +I L+ G T L ++ D G + K+ Sbjct: 151 FAPVSEDKQALETRISQLTPGMAGQQTAIGDALGVTV-KLLDSTGDK--------EASKL 201 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACA--S 341 + +TDG T A ++ I + S L+ A + Sbjct: 202 AILLTDG--NDTASQLTPRLAAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLARMT 259 Query: 342 PNSFYLVENP-HSMYDAFSHI 361 + EN S+ + I Sbjct: 260 GGRSWTAENSGASLDAVWKEI 280 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 49/145 (33%), Gaps = 16/145 (11%) Query: 221 SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+T+S + F + + R + L +T++ L AY + D Sbjct: 1 LALITYSGQAYIHFKFNDPQIGNNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDTDNE 60 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSH 333 N KK+I+ TDG + + +D K +G ++AI + Sbjct: 61 ----NGVREGVKKMIIIFTDGHSQRSPQDMALR-----LKDKGVEIFAITLTPAPYADEG 111 Query: 334 EFLRACASPNSFYLVENPHSMYDAF 358 E L + + + N F Sbjct: 112 ELLSITQNTDHIFTPVNLKDFEIKF 136 >gi|296474257|gb|DAA16372.1| complement component 2 precursor [Bos taurus] Length = 750 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 83/218 (38%), Gaps = 30/218 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ Q L++ ++LD S+S+ ++ S + M++ + V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVSK------DDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQ 274 ++TF++K + LE + L + G TN L Y + + Q Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQ 354 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGI 327 R H N + I+ +TDG++ ++ + + N+ +K +YAIG+ Sbjct: 355 MNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGV 414 Query: 328 RVIRSH-EFLRACASPN----SFYLVENPHSMYDAFSH 360 + + L S +++++ ++ F H Sbjct: 415 GSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFEH 452 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 77/225 (34%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P +++ IT +S LD++ VLDVS SM KI ++ Sbjct: 35 DMAPLQETKLKVMLEITG--GDSSNDRPGLDLVAVLDVSGSMAG------EKIAKLKMAM 86 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTN 258 M++++ I + +VTFS L + I L G TN Sbjct: 87 LFMIKKLSPID------RLSIVTFSTDSTRLCPLRQITENSQKEFENLINRLKADGW-TN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 T GL+ A + D I+ M+DGE+ + + + Sbjct: 140 ITAGLETALKVLNDRSFNGGRVVG--------IMLMSDGEHNTDGDPAEVPL-------G 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 V+ G L+A A + V+N +++ AFS Sbjct: 185 NVPVHTFGFGRNYEPRVLKAVAHKSIGGTLSDVQNTNNLGKAFSQ 229 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 75/217 (34%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +D+S SM + D +++ A + + P+ N GLV FS + Sbjct: 89 IDIMLAMDISGSMLAQ-DLKPNRLEAAKDVAASFING---RPNDN----IGLVVFSAESF 140 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L K + T GL A ++I D K Sbjct: 141 TQCPLTTDHTVLLNLFKDIQSGMIQDGTAIGLGLANAVSRIKDSHAKS-----------K 189 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------ 331 +I+ +TDG N + + + AK G VY IG+ Sbjct: 190 VIILLTDGSNNA--GEIAPVTAAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIP 247 Query: 332 ---SHEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ + S+ + +S I + Sbjct: 248 VEIDEATLKQIASTTGGQYFRATDNASLKEIYSEIDQ 284 >gi|111120280|gb|ABH06325.1| complement component 2 precursor [Bos taurus] Length = 787 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 83/218 (38%), Gaps = 30/218 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ Q L++ ++LD S+S+ ++ S + M++ + V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVSK------DDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQ 274 ++TF++K + LE + L + G TN L Y + + Q Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQ 354 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGI 327 R H N + I+ +TDG++ ++ + + N+ +K +YAIG+ Sbjct: 355 MNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGV 414 Query: 328 RVIRSH-EFLRACASPN----SFYLVENPHSMYDAFSH 360 + + L S +++++ ++ F H Sbjct: 415 GSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFEH 452 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 1/76 (1%) Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSM 354 DG ++ D + C+ A G IVYAIG R + CAS ++ Y + Sbjct: 535 DGVVGQSQADTNLMAICDVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVEGREI 594 Query: 355 YDAFSHIGKDIVTKRI 370 +AF+ I + I R+ Sbjct: 595 SEAFASIARSINQLRL 610 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/223 (13%), Positives = 80/223 (35%), Gaps = 37/223 (16%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F +G +T + ++ G+ I+V ++ L +DR+++ AA+ Sbjct: 42 RQFMRREEGTITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAASL---- 97 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + D +++ + ++ + R + V + + R + Sbjct: 98 --------TQPYDPEGVVRDYFAIAGIDGYRLDVRVEEGLNFRRVHAY------------ 137 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 L+ + R + P + + + R+++ +VLD+S SM Sbjct: 138 --------AELEVRSIFMQMFGVRAMTSPAIGAAE---ERVRRIEVSMVLDISGSMGE-- 184 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++ +T + A + + + +V +V ++ ++ Sbjct: 185 NNRMTNMRPAAREFVTEVLSANENVNNELLVSVSIVPYNGRVN 227 >gi|77735935|ref|NP_001029664.1| complement C2 precursor [Bos taurus] gi|115311857|sp|Q3SYW2|CO2_BOVIN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|74267667|gb|AAI03358.1| Complement component 2 [Bos taurus] Length = 750 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 83/218 (38%), Gaps = 30/218 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ Q L++ ++LD S+S+ ++ S + M++ + V Sbjct: 245 KIQIQRSGHLNLYLLLDASQSVSK------DDFEIFKDSASRMVDRIFSFEIK---VSVA 295 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQ 274 ++TF++K + LE + L + G TN L Y + + Q Sbjct: 296 IITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKDHENGTGTNIYEALHAVY-IMMNNQ 354 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGI 327 R H N + I+ +TDG++ ++ + + N+ +K +YAIG+ Sbjct: 355 MNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVDNIKEVLNINQKRKDYLDIYAIGV 414 Query: 328 RVIRSH-EFLRACASPN----SFYLVENPHSMYDAFSH 360 + + L S +++++ ++ F H Sbjct: 415 GSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFEH 452 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 73/205 (35%), Gaps = 36/205 (17%) Query: 166 SQTDARLDMMIVLDVSRSM--ESFFD---SSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ D+++++D+S SM E F + ++++D + ++ L + K + Sbjct: 86 TKEIPTRDLLLLVDLSGSMAQEDFQNDAGKKVSRLDAVKEVLDGFLAKRKGD-------R 138 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV F + + Q + + G T + N +FD R Sbjct: 139 VGLVVFGDAAYLQAPFTTDLQLSQELLGECEVGMAGPRTAFGDAIGLGVN-LFDEDTERA 197 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 K I+ +TDG TK + A +R +Y + I + Sbjct: 198 ----------KTIIALTDG--NDTKSKVPPVEAARVATQRDIKIYTVAIGDPTTVGEDKL 245 Query: 333 -HEFLRACASP--NSFYLVENPHSM 354 + L+ AS ++ P ++ Sbjct: 246 DEQSLKDVASETGGKYFFRRGPRAL 270 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 62/174 (35%), Gaps = 23/174 (13%) Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFG 254 ++ ++ + LVTF+ E F + +++ I + G Sbjct: 80 LKQAATNFTTQLAQSSPNSE---IALVTFNKTATEQFDFKNVGKDSAYITETINAMETSG 136 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T+ GL AY + + + +N K+ +V +TDG DQ + N+ Sbjct: 137 -GTHQNEGLDRAYKILNN--------DQNTSNLKRYVVLLTDGCPNGVTYDQITTSI-NK 186 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRAC-------ASPNSFYLVENPHSMYDAFSHI 361 K + +G+ + ++ L+A A N Y + + F+ I Sbjct: 187 IKSTNTKLITVGVGLDETNTGLKAAKDYLQANADDNMAYNANDASHLNTIFTQI 240 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 81/227 (35%), Gaps = 26/227 (11%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + +G ++ A+ L + L + ++ +++ ++ + + +D + + + Sbjct: 4 RFAKDERGNFAMIMALVLVPLLLAGMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGE 63 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + + + L + D+ T+ + Y Sbjct: 64 SQT--------VVQVYGAQVFTANLTA-LSADAVNFDVAFPKDKTTDQQIQATAGFTY-- 112 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S + + T W N + + +++ +VLD SRSM+ Sbjct: 113 --KSLFGVIASRLTGDDWDQNQYTLSSSVR--------LKNTIEVALVLDNSRSMDETRS 162 Query: 190 SSITK-IDMAIKSINAMLEEVK----LIPDVNNVVQSGLVTFSNKIE 231 S K ID+ ++ + ++E + LI V N VQ LV F+ + Sbjct: 163 GSTKKRIDLLKEAASQLVETMAAQSTLITHVENPVQFSLVPFAGSVN 209 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 60/203 (29%), Gaps = 72/203 (35%) Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG- 297 G+ +Q IK + G TN + + + I + + + K+++ +TDG Sbjct: 402 GMKTIQTAIKAMVPSG-GTNVPEAMAWGWRTIVRGAPFTEARPSTERGNDKVVIVLTDGA 460 Query: 298 --------------------------------------------------ENLSTKE--- 304 +N ST Sbjct: 461 NTYYKYDGLAGSGPDRAANFSYYSAHGYTARITKHYSQARLFQESGVSVSQNNSTYTKAM 520 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACAS----------PNSFYL 347 + + C+ AK IV + + + ++ + LR+C+S P + Sbjct: 521 NARFAKLCDNAKSANIIVMTVALDLSETNSTEKAQIDLLRSCSSNSRVRTESGRPAKLFW 580 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + + F IG ++ RI Sbjct: 581 NSTGGELSETFRQIGDELSNLRI 603 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 55/159 (34%), Gaps = 21/159 (13%) Query: 173 DMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++++LDVS SM IT++ S++ + + + GLV F+N+ Sbjct: 97 DLVLILDVSGSMAKNDVPGGITRLQAVKNSVSKFVAARQSD-------RIGLVIFANQAW 149 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + LQ +I LS V T L A + + + K Sbjct: 150 PFAPVSEDKQALQTRITQLSPGMVGEQTAIGDALGVAVKLLDSSA---------NQDASK 200 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + +TDG T A V+ I Sbjct: 201 LAILLTDG--NDTASQLAPPLAAQLAAAHHVQVHTIAFG 237 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 73/499 (14%), Positives = 147/499 (29%), Gaps = 150/499 (30%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +I+ FF N G I+TA+ + L++ ++ + I K L D++ + + Sbjct: 14 SIKQFFQNEHGVYAIITALLAFQLLLLVAFTVDGTGILLDKARLAQATDQAALLLIAEDN 73 Query: 67 NEGNGNNRKKLK---------GGDILCRIKNTWNM---SFRNELRDNGFVNDIDDIVRST 114 + + + +++ W L +D + +++ Sbjct: 74 KYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDDKNGQKNS 133 Query: 115 SLDIVVVP---------------QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 S I+ P SI+ + + + KF W + +V Sbjct: 134 SPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKF-----WLPWGQTLVSSSR 188 Query: 160 ---SSVKVNSQT---------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 V +NS +D+M+V D+S SM S D I + I ++ +++ Sbjct: 189 LHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVK 248 Query: 208 EV-------KLIPDVNNVVQSGLVTF--------------------------------SN 228 ++ D + + G V F +N Sbjct: 249 DIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYRNN 308 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVS-------------------------------- 256 K+++ L ++R I + +F S Sbjct: 309 KLDQASKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEGKETTRAWFDKKNL 368 Query: 257 --------------TNSTPGLKYAYNQIFDMQGM-RQHCNTEDANYKKIIVFMTDGENLS 301 T T G+ N + D + + N ++I++ ++DGE+ Sbjct: 369 GVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDNR 428 Query: 302 TKEDQQSLY----YCNEAK--------------KRGAIVYAIGIRVIRSH-EFLRACASP 342 E CN+ K + A+G + + C Sbjct: 429 PTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQCV-G 487 Query: 343 NSFYLVENPHSMYDAFSHI 361 +Y V + + DAF I Sbjct: 488 KQYYTVFSKQGLLDAFRQI 506 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 110/306 (35%), Gaps = 52/306 (16%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 R + +G + +++ I LP I LV+G+ I+ + +K+ L + +D + + AA + Sbjct: 4 GTRTKYRYSRGLVVLMSVIALPFILLVVGLSIDAGRAYIVKSKLFAAVDAASIAAARAVA 63 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 N + + + + + + +I Sbjct: 64 NGE----------DAGRAAAQKYFAANIPADFYSATPNLGDVNFAYDSFGNI-------- 105 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 SI + ++P F I T + P S+ + +D+++V+D + S+ Sbjct: 106 -SIDISATAQVPTVFLPLIGLDTFN-----PGVSAQSIRR----PVDLVLVIDNTTSLRL 155 Query: 187 FFDSSITK--IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSH 242 +T+ ID + I E I V F +++ F G S Sbjct: 156 GSIGDVTQDVIDRSKSFIENFHEGFDRISLVK-------FAFGSEVPVGFNATRGHSRST 208 Query: 243 LQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 ++ +I + TN++ G+ A+N++ + D K+IVF TDG Sbjct: 209 IKSEIDSFNFGSTSNAQYTNASEGMYRAFNELRTVT---------DPANLKVIVFFTDGA 259 Query: 299 NLSTKE 304 + Sbjct: 260 PNTFAS 265 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 73/499 (14%), Positives = 147/499 (29%), Gaps = 150/499 (30%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +I+ FF N G I+TA+ + L++ ++ + I K L D++ + + Sbjct: 14 SIKQFFQNEHGVYAIITALLAFPLLLLVAFTVDGTGILLDKARLAQATDQAALLLIAEDN 73 Query: 67 NEGNGNNRKKLK---------GGDILCRIKNTWNM---SFRNELRDNGFVNDIDDIVRST 114 + + + +++ W L +D + +++ Sbjct: 74 KYRKNKDHSDVSRQHVSQQDINREGNSKVQAQWKKRNQELVQGLVKLYLRSDDKNGQKNS 133 Query: 115 SLDIVVVP---------------QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 S I+ P SI+ + + + KF W + +V Sbjct: 134 SPAIIKDPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKF-----WLPWGQTLVSSSR 188 Query: 160 ---SSVKVNSQT---------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 V +NS +D+M+V D+S SM S D I + I ++ +++ Sbjct: 189 LHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLSGSMVSPIDKRIPSSSIRIDALRDVVK 248 Query: 208 EV-------KLIPDVNNVVQSGLVTF--------------------------------SN 228 ++ D + + G V F +N Sbjct: 249 DIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKTEKNDCVLPYYAQQSKKEEISNLYRNN 308 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVS-------------------------------- 256 K+++ L ++R I + +F S Sbjct: 309 KLDQASKLLDQYMDIERTINQIDQFNGSNISYDFINTTKKCLGKSEGKETTRAWFDKKNL 368 Query: 257 --------------TNSTPGLKYAYNQIFDMQGM-RQHCNTEDANYKKIIVFMTDGENLS 301 T T G+ N + D + + N ++I++ ++DGE+ Sbjct: 369 GVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKDPEAAPSKLNTNTRRILLVLSDGEDNR 428 Query: 302 TKEDQQSLY----YCNEAK--------------KRGAIVYAIGIRVIRSH-EFLRACASP 342 E CN+ K + A+G + + C Sbjct: 429 PTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVEARVAFVALGYNPPQDQVNVWKQCV-G 487 Query: 343 NSFYLVENPHSMYDAFSHI 361 +Y V + + DAF I Sbjct: 488 KQYYTVFSKQGLLDAFRQI 506 >gi|260818212|ref|XP_002604277.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] gi|229289603|gb|EEN60288.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] Length = 1119 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 27/195 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ + + + + ++ + + N + G++ +S+ Sbjct: 941 DLFFVLDGSGSVGLY------NFNTVKQFVVTLVSAFTIGLNDVNDTRVGVLQYSSSNTL 994 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L + ++ G ST + L+ A Q A Sbjct: 995 GCNLGDHP-DLSSFVNAMNAMRYHYGPSTQTGAALQAA----------GQIAAWRPAPVP 1043 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN-SFYL 347 +I+V +TDG + V+AIG+ E L+ + + Sbjct: 1044 RIMVVVTDG-----MAHDSVVAPSQGLAADQVNVFAIGVGNYVRSELLQIANNNQARVFE 1098 Query: 348 VENPHSMYDAFSHIG 362 + + +++ D + I Sbjct: 1099 LADFNAIRDNINDIA 1113 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 78/203 (38%), Gaps = 34/203 (16%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE--EVKLIPDVNNVVQSGLVTF 226 D +D+ VLD S S+ S+ D+ + + A++ + L + G++ + Sbjct: 748 DESVDLFFVLDGSDSV------SLADFDIVKEFVVAVVSGFTISLTD-----TRVGVLQY 796 Query: 227 SNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S+ L +W S + +++ G T++ L++A ++ Sbjct: 797 SDGSTLECNLGDHPDW--SSFVNSMNTMARQGGGTSTGAALEFA--RLIA-------AWR 845 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I++ +TDG+ + + V+AIG+ E L+ + Sbjct: 846 PAPVVPRIMIVLTDGD-----SEDSVVTPAQALATEQVTVFAIGVGSFNRSELLQITNNN 900 Query: 343 -NSFYLVENPHSMYDAFSHIGKD 364 + + + + +++ + + I + Sbjct: 901 QDRVFELADFNAIANIMNRIIQA 923 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 76/195 (38%), Gaps = 32/195 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LDVS SM S + KI++A +++ ++ + D + +V F+ + + Sbjct: 320 IDIVLCLDVSGSMRE-LSSGMPKIEIAKDAVSQYIQFLSKTND-----RLAMVLFNFRAD 373 Query: 232 EFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 WG+ ++R + ++ G TN L+ + + + +H Sbjct: 374 VL----WGLHQVRRYWQQMNYMLKYVYAGGGTNLANALERSREVLTRSKSNSKH------ 423 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PN 343 ++ +TDG +++ + ++ G + I I E L + Sbjct: 424 -----VICVTDGRTVNSS---MCVKEAVRLRRNGTTISTIAIGENSDDELLMRLSKIGGG 475 Query: 344 SFYLVENPHSMYDAF 358 F + + H + A Sbjct: 476 LFIKISSIHDLGKAL 490 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 67/208 (32%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD+S+SM + D S +++ + + + +V+ +VT S Sbjct: 97 IVLVLDISQSMMA-TDVSPSRLAAEKDAATKFVAALPAQYNVS------VVTLSGHPNTL 149 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + IK L + T + + +V Sbjct: 150 VPPTTDRAPVNQGIKTL-ELADGTAIASSIDVGLEALKQAPAGDDGKQAPGL-----MVL 203 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFLRAC 339 ++DG S + ++AK++ +Y I + + L+ Sbjct: 204 LSDG---SETGGGDPVASADKAKQQNVPIYTIAFGTQNGYVDLDGQRFNVAPDTDMLKRI 260 Query: 340 --ASPNSFYLVENPHSMYDAFSHIGKDI 365 AS + + D + + D+ Sbjct: 261 ADASSGKALDAASASQLDDVYKTLTSDV 288 >gi|260810969|ref|XP_002600195.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] gi|229285481|gb|EEN56207.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] Length = 323 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 78/215 (36%), Gaps = 39/215 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL-IPDVNNVVQSGLVTFSNKIE 231 D++I+LD S S+ A SI +L+ + I + Q L+TF + Sbjct: 116 DLLIILDDSGSIG------FDAFQKAKASIATVLDYICPGIGLYSPYHQVALMTFHSTPT 169 Query: 232 EFFLLEWGVSH--LQRKIKYLS-----------KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + F S+ L+ I + + G T++ L YA +F R Sbjct: 170 KQFDFNDHGSYAELKEAILAVPYEVLMTNFVNKQGGPRTDTHEALDYARTTMFTS---RT 226 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +TDG+ ED ++ + G V+A+GI +E L Sbjct: 227 GLRPGSL---REVLLLTDGQPN---EDDLTVQAAERLRNSGITVFALGIADGVDNEHLEQ 280 Query: 339 CASPNSFYLV------ENPHSMY----DAFSHIGK 363 S + + E+ M +A+ G+ Sbjct: 281 LVSDPEYKHIFHLNTFEDLADMVANVEEAYDTYGR 315 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 30/83 (36%), Gaps = 1/83 (1%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R G IL A+ LP++ + ++++ I +K L + D + + A + + Sbjct: 9 RRLHRQRGGT-AILFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHSLSDA 67 Query: 69 GNGNNRKKLKGGDILCRIKNTWN 91 G +++ Sbjct: 68 GGQPYNWSAAVNAAQNVVQSNVA 90 >gi|309362046|emb|CAP28695.2| hypothetical protein CBG_09096 [Caenorhabditis briggsae AF16] Length = 516 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 73/202 (36%), Gaps = 25/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K ++ +L + + P V+ L+T+ Sbjct: 243 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVDRVLGNINIHP---EAVRVALITY 293 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 294 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSTG 349 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRAC 339 K+ + +TDG + + +D + + G I+ A+ + E Sbjct: 350 IREGVPKMALVLTDGHSHRSPKD-----MAEKMRAAGIIMIAVSVTPRPLVDEAELRLIA 404 Query: 340 ASPNSFYLVENPHSMYDAFSHI 361 S + N H F Sbjct: 405 GSEKRAFTPPNLHEFESEFMKY 426 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + L+ FS Sbjct: 49 APPLDLIFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKTATRVMLIQFSG 98 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D Sbjct: 99 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKTPGSGLR-----DGT 153 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 154 VPKIVYLLSDGRTHDYPKD 172 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N G+ F+ + Sbjct: 88 IDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGETF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L I + T G+ A ++ D + K Sbjct: 140 TQCPLTVDHAVLLDMIHNIKCGLITDGTAVGMGIANAVTRLKDSKAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------------------- 329 +I+ +TDG N K D + AK G VY IG+ Sbjct: 189 VIILLTDGTNN--KGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMP 246 Query: 330 --IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + +++ + + + + I K Sbjct: 247 VEIDEKTLTEIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 65/215 (30%), Gaps = 70/215 (32%) Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ ++ L ++G + ++WG + L F +T S Sbjct: 284 SLKAQIDLFQP-----RAGTAIY-------MGMKWGTALLD------PSFRETTASLVS- 324 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN------------------------ 299 + + + + + D K IV MTDG+N Sbjct: 325 ----DSVVESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTAYYNSSSEVVHWSKWN 380 Query: 300 ----------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFL 336 + K + C+ AK G +++ IG V + + Sbjct: 381 FNYYLSQYIKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVIWTIGFEVNDTGADVM 440 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + CAS S + + DAFS I I R+ Sbjct: 441 KKCASSPSHFFRVEGVELTDAFSAIASQINQLRLT 475 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 78/229 (34%), Gaps = 41/229 (17%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 +L + + F + G +TILT + ++ +V G+ ++ +T L + D Sbjct: 14 TLGVRLGRFARHEDGSITILTIFIIIMMVMVGGIQLDFMRHEMERTKLQAAAD------- 66 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP 122 D + + F + +S+ I Sbjct: 67 ---------RAVLAAADLDQTLAPADVVDEYFAKSGMSDYL----------SSVTIENGL 107 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 ++ A + + T + +P S + + +++ +VLDVS Sbjct: 108 NFRTVTVKANN------EMKTQFLGRFGFPTLDVPALSKAEERVEK---VEISLVLDVSG 158 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 SM++ +K+ + ++ V L P+ N V L+ +S ++ Sbjct: 159 SMKNN-----SKLTTMKDAAKTFIDTV-LRPETKNNVSLSLIPYSEQVN 201 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 71/215 (33%), Gaps = 44/215 (20%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ I ++ + + +++ + GL+ F + Sbjct: 89 DLMLVVDISPSMDEPDMVRQGRRINRLQAVKQVLAEFIDQ-------RQGDRLGLILFGS 141 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + + ++ L G +T + A ++ E Sbjct: 142 QAYVQAPLTFDRTTVNILLQEAGLGMAGNATAIGDAVGLAVKRL-----------RERPL 190 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------- 331 +++ + +TDG N + + A+ +Y IGI Sbjct: 191 EQRVAIVLTDGANTAGEITPD--KASELAQASAVRLYTIGIGAGADSAITGLLQRNPSRD 248 Query: 332 -SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ N + ++ I + Sbjct: 249 LDEALLTRMAQQTGGQYFRARNLAELGGIYTSINQ 283 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 99/297 (33%), Gaps = 35/297 (11%) Query: 65 IMNEGNGNN-RKKLKGGDILCRIKNTWNMSFRNELR----DNGFVNDIDDIVRSTSLDIV 119 ++NE + +K + ++ + + N L + + I+ Sbjct: 161 LVNEEDITPFQKAIYHPTDFSQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAPII 220 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV-NSQTDARLDMMIVL 178 +G+ I K L W + +M + KV + + + ++ +++ Sbjct: 221 ASDLKDGFIIDKQYLLKYLLPIFNGFIWNEEGKFPIMFAPKNPKVLDGENNYAHNISLLI 280 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLL 236 D+S SME F + +I +L+++ IP+ Q +V F+++ F Sbjct: 281 DISGSMEKDFS-------VYKNNILKILDKLAEIPNW----QINIVVFNDESTARSFSNQ 329 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 E + ++ I L G Y +++ + ++ TD Sbjct: 330 ENNIEDIKVYINNLKANG------------YTKLYGTIKEALESFKGKIDESSTLIVFTD 377 Query: 297 GENLSTKEDQQSLYYCNE----AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 G++ T + + K +Y +G + EF A+ F V Sbjct: 378 GKDEGTNSNVTEKDVVDVTSEVIKNPQFNMYTVGFGQYYNQEFFEQVATRGGFTHVS 434 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 82/219 (37%), Gaps = 28/219 (12%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW ++R + + I +S A +++ ++DVS SM+ + + ++ + Sbjct: 168 PWNPHTRLLRIGIKAS-DRAVAELAPANLVFLVDVSGSMD-----RREGLPLVKSTLKLL 221 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGL 263 +++++ + LV ++ + G ++ I+ L+ G ST G+ Sbjct: 222 VDQLREQD------RVSLVVYAGESSVVLEPTSGREKAKIRTAIERLTA-GGSTAGASGI 274 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 + AY Q I+ TDG+ N+ T + + +K G + Sbjct: 275 ELAYQM---AQQAFIPKGINR------ILLATDGDFNVGTSDFDSLKQMAVDKRKTGISL 325 Query: 323 YAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 +G V +E L A A ++ ++N Sbjct: 326 TTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLREARKVL 364 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 63/193 (32%), Gaps = 47/193 (24%) Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + L+R++ + G +TN G+ + ++ + Sbjct: 359 AYGCLSRPITPLSNDYAALKREVSRFTADG-NTNIMEGVAWGM-RVLSPREPFTEGKEPA 416 Query: 285 ANYKKIIVFMTDGENL-----------------------------------STKEDQQSL 309 ++ +KI++ +TDG N + + ++++L Sbjct: 417 SDVEKIMIVLTDGANNMGLSNNRNHALGSSYSSFGYLVEDRLTRERSQRRVTEEMNRRTL 476 Query: 310 YYCNEAK-------KRGAIVYAIGIRVIR--SHEFLRACAS-PNSFYLVENPHSMYDAFS 359 C AK + +Y I + + L+ CA+ P ++ + + F Sbjct: 477 AACENAKREYTPSKEDDVTIYTIRLEEPDVATGTLLQECATGPGYYFDSPSRTQLNAIFK 536 Query: 360 HIGKDIVTKRIWY 372 I I R+ + Sbjct: 537 EIRDGITKLRLSF 549 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 80/233 (34%), Gaps = 48/233 (20%) Query: 6 LNIRNF-------FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL 58 L+IR + +G ++ A+ +F G+ ++ +++ ++ L + +D Sbjct: 2 LSIRRLRSACSALCRDRRGNFAVILALSALPVFGAAGLAVDYTNMSRTRSELQNALD--- 58 Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 +G I + SF + + N Sbjct: 59 -----------AAVLAVAQRGDKISDAEARSIAASFLTGNLSSAYKNMA----------- 96 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 V +SA + +PL F I + + +S+ + A ++ +VL Sbjct: 97 -VERNGTSVKLSAEAT--MPLSFGGLI----GRKEATVGASSTADMAF---AYYEIALVL 146 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D + SM K+ +++N +++++ ++ LV F++ + Sbjct: 147 DTTGSMRG------GKLQAMKEAVNGLIDDLSSRVTDKERLKFALVPFASFVN 193 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D ++I+ A + I+ ++E + + Sbjct: 79 GTKLETVKREGVDIVFAIDVSKSMLAE-DVKPSRIEKAKRIISELIELLHGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 + ++ + L S + ++ ++ T ++ A N D Sbjct: 131 IAFIPYAAQAYPQLPLTSDYSAAKIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSNQSS 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 KI++ ++DGE+ Q + E K++G ++ IG+ + Sbjct: 191 -----------KILIILSDGEDH----QQGATEMIQEVKEKGIRIFTIGLGTTQG 230 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 83/224 (37%), Gaps = 54/224 (24%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + A +D++ V+D+S SM TK+ + +++ +++ + + Sbjct: 251 SSVRSPSGRAPVDLVTVIDISGSMAG------TKLALLKRAMGFVIQHLGPSD------R 298 Query: 221 SGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ FS+ F L+ +G + I L G TN LK A I Sbjct: 299 LSVIAFSSSARRLFHLQRMSHYGRQQALQAINSLGA-GGGTNIADALKKATKVI----ED 353 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK------------------KR 318 R + N+ + I+ ++DG Q + C+ + +R Sbjct: 354 RSYKNSVCS-----IILLSDG--------QDTYNICSNVRGGSKDYSSLVPPSILSDTRR 400 Query: 319 GAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 ++A G + L + AS +F +E+ M DAF+ Sbjct: 401 MLPIHAFGFGADHDSDSLHSIAEASGGTFSFIEDEGVMQDAFAQ 444 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 60.9 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 83/224 (37%), Gaps = 54/224 (24%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + A +D++ V+D+S SM TK+ + +++ +++ + + Sbjct: 251 SSVRSPSGRAPVDLVTVIDISGSMAG------TKLALLKRAMGFVIQHLGPSD------R 298 Query: 221 SGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ FS+ F L+ +G + I L G TN LK A I Sbjct: 299 LSVIAFSSSARRLFHLQRMSHYGRQQALQAINSLGA-GGGTNIADALKKATKVI----ED 353 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK------------------KR 318 R + N+ + I+ ++DG Q + C+ + +R Sbjct: 354 RSYKNSVCS-----IILLSDG--------QDTYNICSNVRGGSKDYSSLVPPSILSDTRR 400 Query: 319 GAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 ++A G + L + AS +F +E+ M DAF+ Sbjct: 401 MLPIHAFGFGADHDSDSLHSIAEASGGTFSFIEDEGVMQDAFAQ 444 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 60.6 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 68/209 (32%), Gaps = 31/209 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM++ D ++ + + ++ + + + GLV+F+ Sbjct: 88 VVMAVDVSLSMQA-TDIEPSRFEAMQVAAKEFVDVLP------DRINLGLVSFAGTATTV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IV 292 + I L + ST + + ++ + + + IV Sbjct: 141 VTPTTDRGQVSTAIDNL-ELAESTAIGEA---VFTSLTAIENFQSSLDADGEEVPPARIV 196 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFLRA 338 ++DG N + D Q++ A G V I V L Sbjct: 197 LLSDGYNTVGRPDTQAVS---AALDAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATLEE 253 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + S+ + + + +G I Sbjct: 254 IADQTGGSYSEAASAAELEQVYQDLGSQI 282 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 51/321 (15%), Positives = 98/321 (30%), Gaps = 54/321 (16%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R+F N +GG+ I + L + L+ G ++ S I K+ L+ D ++ A Sbjct: 3 RSFLQNRRGGVAIFFGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVLTA------- 55 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 LK + S + F N+ I + + G Sbjct: 56 --------LKEAREQLKQGKPDWQSIAEKQGGKAFTNNASKIGGVSGTGATINLSLSGGV 107 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM---- 184 +S Y T I + S D+ V+DVS SM Sbjct: 108 LSGSLNYA--ANAPTHFLRIAGLNTINL--KGSASATMSAAQYRDIHFVIDVSASMGIGA 163 Query: 185 -------------------ESFFDSSITKIDMAIKSINAML----------EEVKLIPDV 215 + T A+++I A L + + IP+ Sbjct: 164 TKADQQAMQNSVGCAVACHHAEAADPATDNLAAVRAIGATLRIDVVRKAVMDALAKIPN- 222 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + + + +FSN ++ F L ++ + + + Y+ NQ+ ++ Sbjct: 223 DGSTRVAIHSFSNSLKTVFPLSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNL-L 281 Query: 276 MRQHCNTEDANYKKIIVFMTD 296 + + ++ TD Sbjct: 282 ASAGNGLTASQPRGFVLLATD 302 >gi|290960274|ref|YP_003491456.1| lipoprotein [Streptomyces scabiei 87.22] gi|260649800|emb|CBG72916.1| putative lipoprotein [Streptomyces scabiei 87.22] Length = 537 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 74/217 (34%), Gaps = 28/217 (12%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + +V ++ + + V+DVS SM + ++D+A ++ M Sbjct: 163 PGNWSLVRVGLATRAAGDRQRPPAALTFVIDVSGSM-----AEPGRLDLAQDALRTMTNR 217 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 ++ V +VTFS++ E + ++ + L STN G++ Sbjct: 218 LRDDDSVA------VVTFSDEAETVLPMTRLDGNRGEIREAVAGLEPT-DSTNLAAGVET 270 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVY 323 Y ++G+R+ +V ++D + D ++ E ++ G ++ Sbjct: 271 GYE--TAVEGLRKGATNR-------VVLLSDALANTGSTDADTILERIAGERREHGITLF 321 Query: 324 AIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 +G+ + A V F Sbjct: 322 GVGVGSDYGDALMEQLADKGDGHTTYVSTEEEAEKVF 358 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 85/224 (37%), Gaps = 37/224 (16%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + +D ++V+D S SM+ +S +I + +++ ++ + D + G+ Sbjct: 30 AAASQGSNIDAVLVMDASNSMK---NSDPERI--SGEAMKMFIDMLATTGD-----KVGI 79 Query: 224 VTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V+++++I+ L E + L+ I L + G T+ + GL A + +Q Sbjct: 80 VSYTDRIQREKALLEIQSEADKTALKEFIDQLDR-GPYTDMSVGLDEAVKVL------KQ 132 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC--------NEAKKRGAIVYAIGIRVI 330 + A IV + DG N ++ EAK G +Y IG+ Sbjct: 133 GMDPAHAPM---IVVLADGNNDLDPNTGRTSKEASEQLNQAVKEAKGSGIPIYTIGLNAD 189 Query: 331 R--SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E L A + + + + S I +I Sbjct: 190 GKLNKETLAELAKQTGGKSFTTSSADDLPQILSEIFASHQQLKI 233 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 84/249 (33%), Gaps = 41/249 (16%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 I+TA ++ + +G+ ++ S +KT + +D + + AA + G+ + +K Sbjct: 2 IITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEFSKLGSEQDARKAAKK 61 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 ++ ++ L+IV + S+ A+ + Sbjct: 62 AFDAQVSQ---LNLHGAKLK--------------KLNIVTDDETMKVSVDAV------FE 98 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS---SITKIDM 197 T + + + + S Q LD+++ +D + SM + S + + Sbjct: 99 LPTTLMQIAGFKTLEVATRSDAVGGGQE-VILDIVMCIDATGSMGATLRSVQRNALSFEA 157 Query: 198 -AIKSINAMLEEVKLI---------PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + + +V +I D + + ++ K F L + + + Sbjct: 158 NLKNRLKELGRQVDIIRVRPIYYWDYDYDGWSR----SYGLKKSTFLKLPDQRTQFKNFV 213 Query: 248 KYLSKFGVS 256 S +G Sbjct: 214 DSESAYGGG 222 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 74/199 (37%), Gaps = 31/199 (15%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 I L + P T + + I+ V+ + AR D+++VLD S SM K+ Sbjct: 236 DIGLSPLVYKPIQTEDGYFMFLISPQVEAEKKRVAR-DLVLVLDTSSSMSDI------KM 288 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQ---RKIKYL 250 A K++ L +++ + G+V FS + +F + +L + I L Sbjct: 289 QQAKKAVKFCLSQLQPED------RFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDGL 342 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G T P L A + D + +VF TDG+ + + + Sbjct: 343 KTSG-GTAIWPALNDAL-----------AMRSSDPSRPFTMVFFTDGQPTVDETNADKIV 390 Query: 311 YCNEAKKRG-AIVYAIGIR 328 AK G ++ G+ Sbjct: 391 KNVLAKNTGNTRIFTFGVG 409 >gi|126316414|ref|XP_001380743.1| PREDICTED: similar to integrin alpha 2 subunit [Monodelphis domestica] Length = 1214 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 79/210 (37%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + ++ + + PD Q GL+ + N Sbjct: 207 IDVVVVCDESNSIYPW--------SAVKNFLVKFVQGLDIGPDK---TQVGLIQYGNYPR 255 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + ++ + + G TN+ +++A + + + Sbjct: 256 VVFNMSTFKTKEEMVKATSQTIQHGGDLTNTFKAIQFA------REFAYSEASGGRPSAT 309 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 K++V +TDGE+ ++ + CN + + I + + + ++ Sbjct: 310 KVMVVVTDGESHDGSFLKEVIGQCN---DDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 366 Query: 339 CASP--NSFYLVENPHSMYDAFSHIGKDIV 366 ++P F+ V + ++ + +G+ I Sbjct: 367 ASTPTERYFFNVSDEDALLEKAGTLGERIF 396 >gi|8569518|pdb|1QC5|A Chain A, I Domain From Integrin Alpha1-Beta1 Length = 192 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 79/211 (37%), Gaps = 35/211 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +LD++IVLD S S+ + D +N +LE + + P Q G+V + Sbjct: 2 TQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP---KQTQVGIVQYGEN 50 Query: 230 IEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L S + + + + G T + G A + F Sbjct: 51 VTHEFNLNKYSSTEEVLVAAKKIVQRGGRQTMTALGTDTARKEAFT------EARGARRG 104 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH-------EFLR 337 KK++V +TDGE+ ++ + C ++I I R + E ++ Sbjct: 105 VKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSYNRGNLSTEKFVEEIK 161 Query: 338 ACAS---PNSFYLVENPHSMYDAFSHIGKDI 365 + AS F+ V + ++ +G+ I Sbjct: 162 SIASEPTEKHFFNVSDEIALVTIVKTLGERI 192 >gi|114684811|ref|XP_531503.2| PREDICTED: collagen, type VI, alpha 1 isoform 3 [Pan troglodytes] Length = 997 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 578 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 629 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 630 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 684 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 685 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 726 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 70/203 (34%), Gaps = 38/203 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D +D+ VLD S S+ + +K A++++VK +F+ Sbjct: 32 QDCPVDLFFVLDTSESV-----------ALRLKPYGALVDKVK--------------SFT 66 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + + + FG T + +K Q+ H Sbjct: 67 KRFIDNLRDRYYRCDRNLVVDAVKYFGKGTYTDCAIKKGLEQLLVGG---SHLKEN---- 119 Query: 288 KKIIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K ++ +TDG KE L NEAK G V+++ I L A+ +++ Sbjct: 120 -KYLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIATDHTY 178 Query: 346 ---YLVENPHSMYDAFSHIGKDI 365 + + DA I + I Sbjct: 179 RRNFTAADWGQSRDAEEAISQTI 201 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 795 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 850 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 851 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 901 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 902 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 960 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 961 AEYDVAYGERHLFRVPSYQALLRGVFHQTV 990 >gi|38197044|gb|AAH05159.2| COL6A1 protein [Homo sapiens] Length = 445 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 26 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 77 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 78 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 132 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 133 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 174 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 243 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 298 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 299 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 349 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 350 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 408 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 409 AEYDVAYGESHLFRVPSYQALLRGVFHQTV 438 >gi|30030|emb|CAA33889.1| alpha-1 collagen VI (AA 574-1009) [Homo sapiens] Length = 436 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 17 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 68 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 69 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 123 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 124 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 165 Score = 42.8 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 21/210 (10%), Positives = 63/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+L+ + S + K + ++ V+ +V +S Sbjct: 234 SPADITILLEPPPDVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 289 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 290 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 340 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 341 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 399 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 400 AEYDVAYGESHLFRVPSYQALLRGVFHQTV 429 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/217 (19%), Positives = 75/217 (34%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +D+S SM + D +++ A + + P+ N GLV FS + Sbjct: 89 IDIMLAMDISGSMLAQ-DLKPNRLEAAKDVAASFING---RPNDN----IGLVVFSAESF 140 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L K + T GL A ++I D K Sbjct: 141 TQCPLTTDHTVLLNLFKDIQSGMIQDGTAIGLGLANAVSRIKDSHAKS-----------K 189 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------ 331 +I+ +TDG N + + + AK G VY IG+ Sbjct: 190 VIILLTDGSNNA--GEIAPVTAAEIAKTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIP 247 Query: 332 ---SHEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ + S+ + +S I + Sbjct: 248 VEIDEATLKQIASTTGGQYFRATDNASLKEIYSEIDQ 284 >gi|219847682|ref|YP_002462115.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541941|gb|ACL23679.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 84/229 (36%), Gaps = 28/229 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 T +P + + + + + + + + L++ VLD S SM+ + + + A + + Sbjct: 14 TPVPTSSTPQVVYLLVEAVAPASPTSALPLNLCFVLDRSGSMQ---GAKLESMKAATRRV 70 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNST 260 +L + +V F + ++ S L ++ +++ G + S Sbjct: 71 IELLRPHDVA---------AIVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMS- 120 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 G++ A +++H + + ++ +TDG+ + + G Sbjct: 121 LGMQAA------QTELQKHLGPDRISR---MLLLTDGQTWGDEP--ICRDLARTLGQAGV 169 Query: 321 IVYAIGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + A+G+ + + L A AS + +P + F K+ Sbjct: 170 RITALGLGTEWNEQLLDDIAAASDGYSDYIADPAQIETFFQQAVKEAQA 218 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 118/330 (35%), Gaps = 61/330 (18%) Query: 54 IDRSL-----VHAATQIMNEGNGNNRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVND 106 D ++ + + +MN+G G + D + S+ + L +N V Sbjct: 141 NDAAISLVHRLPRSRGVMNQGRGLAPEPSMFDDDERLEQQLVFSGKSYSDALENNHPVRM 200 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV-KVN 165 +D+ + P+ +SA+ R KF + + + S +++ Sbjct: 201 ---------MDLKIYPE-----VSAVPRADSREKF-DVLVHLRAAAMVTGNANSLNNQIS 245 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A +D++ VLD+S SM TK+ + +++ +++ + + ++ Sbjct: 246 RYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSVIA 293 Query: 226 FSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ F L G + + + G TN GL+ + R+ N Sbjct: 294 FSSTARRLFPLTKMSDAGRQRALQAVNSVVANG-GTNIAEGLRKGVKVM----EDRRDKN 348 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQ---------SLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ ++DG + T S++ C K+ V++ G Sbjct: 349 PVAS-----IILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHD 402 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 403 ASLMHSVSETSGGTFSFIESESVIQDALAQ 432 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/224 (18%), Positives = 85/224 (37%), Gaps = 37/224 (16%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + +D ++V+D S SM+ +S +I + +++ ++ + D + G+ Sbjct: 30 AAASQGSNIDAVLVMDASNSMK---NSDPERI--SGEAMKMFIDMLATTGD-----KVGI 79 Query: 224 VTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V+++++I+ L E + L+ I L + G T+ + GL A + +Q Sbjct: 80 VSYTDRIQREKALLEIQSEADKTALKEFIDQLDR-GPYTDMSVGLDEAVKVL------KQ 132 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC--------NEAKKRGAIVYAIGIRVI 330 + A IV + DG N ++ EAK G +Y IG+ Sbjct: 133 GMDPAHAPM---IVVLADGNNDLDPNTGRTSKEASEQLAQAVKEAKGSGIPIYTIGLNAD 189 Query: 331 R--SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + E L A + + + + S I +I Sbjct: 190 GKLNKETLAELANQTGGKSFTTSSADDLPQILSEIFASHQQLKI 233 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 71/222 (31%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +I+ A + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSASMLAE-DLKPNRIEAAKDVAAEFI---SGRPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L ++ + T GL A +++ + Sbjct: 140 TQCPMTTDHASLLTLLQDVRTDMATRGLINDGTAIGMGLANAVSRL-----------KDS 188 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 ++++ +TDG N + D L AK G VY I + Sbjct: 189 KTKSRVVILLTDGANNA--GDISPLTAAQMAKSLGIRVYTIAVGTSKVAPYPIEVGGRVQ 246 Query: 330 -------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + A + +FY N + + I + Sbjct: 247 YISRPADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHDIDQ 288 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 67/219 (30%), Gaps = 48/219 (21%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+MI +D+S SME D + ++ + + K + G++ F Sbjct: 87 DLMISIDISGSMEMPDMVIEDKEVDRLVAVKALLTDFIARRKGD-------RVGMILFGE 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + +Q + + T G+ A ++ + Sbjct: 140 QAYLQTPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRLRERDANN-------- 191 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 ++++ +TDG+N L A+ G +Y IG+ Sbjct: 192 ---RVLILLTDGQNN--TGALNPLQAAELAEHAGITIYTIGVGADEMIVKNRFFGNRRIN 246 Query: 332 ------SHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + M + + I + Sbjct: 247 PSLELDEESLIAVAEKTGGRYFRARDTKEMEEIYQIIDE 285 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 87/270 (32%), Gaps = 46/270 (17%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + ++ + Y + F + T ++ ++ K + +D+M+ + Sbjct: 35 ISDARVYAHTPKSYKNYLLHAPFLLRLFALTLVILVLARPQTTNKWQNSEIEGIDIMLAV 94 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM + D +++ A + P+ N G+ F+ + L Sbjct: 95 DVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESFTQCPLTV 146 Query: 239 GVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L I + T G+ A ++ D + K+I+ +TD Sbjct: 147 DHAVLLDMIHNIKCGLIEDGTAVGMGIANAVTRLKDSKAKS-----------KVIILLTD 195 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------------------IRSHE 334 G N K D + AK G VY IG+ I Sbjct: 196 GTNN--KGDISPMTAAEIAKSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKT 253 Query: 335 FLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + +++ + + + + I K Sbjct: 254 LTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|327266508|ref|XP_003218047.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 767 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 87/243 (35%), Gaps = 35/243 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + T K+ + D L++ IVLD SRS+ K D + N ++ ++ I Sbjct: 240 NVSTTGKRKIKIEKDGSLNIYIVLDASRSI---------KKDQFKHAQNMSIKLIEKISS 290 Query: 215 VNNVVQSGLVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + ++TF+ ++ ++ W + L+ K TN GL Sbjct: 291 YDISPRYAVITFATEVKELVRTTDDQSTDASWVIEKLEGMKYTEHKQKPGTNIQKGLSSV 350 Query: 267 YNQIFDMQGM--RQHCNTEDANYK--KIIVFMTDGENLSTKEDQQSLYYCNE-------- 314 Y+ + Q R+ N + K +IV ++DG+ + + + E Sbjct: 351 YSMMITQQAAERRRGLNPPPVSEKTRHVIVLLSDGDYNMGGDPIRVIRQIREFLNIGRNR 410 Query: 315 --AKKRGAIVYAIGIRVIRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKDIVTK 368 ++ VY + E + AS + +++ + AF + + T Sbjct: 411 THPREDFLDVYVFAVGGTVVMENVNKIASQKSGERHAFKIKDYSDLQLAFEEMIDESETL 470 Query: 369 RIW 371 + Sbjct: 471 SMC 473 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 47/330 (14%), Positives = 114/330 (34%), Gaps = 50/330 (15%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 + +++ + + G ++ L + + E ++ + Sbjct: 178 NGYQNGGHMNILRPLPRARSSGRLHHLATLLPDTDPSTFNDDEPLDLLCEEANDTQQGCL 237 Query: 108 DDIVRSTSLDIVVVPQNEG-YSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVN 165 + T + VP+N + + + K PL + + + Sbjct: 238 RTVEIKTYPEFTEVPENTSERNFTVLIHLKAPLAQHLQPSSNLGDGNGLST--------- 288 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 T A +D++ VLDVS SM TK+ + +++ +++ + + ++ Sbjct: 289 --TRAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIA 334 Query: 226 FSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ F L +++ + L+ G TN GL+ +++ + + + Sbjct: 335 FSSSARRLFPLRRMTESGRQQSLLAVNSLTSNG-GTNIAEGLRKG-SKVIEERQAKNPVC 392 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSL-----YYC----NEAKKRGAIVYAIGIRVIRS 332 + I+ ++DG++ T + YC + + V+ G Sbjct: 393 S--------IILLSDGQDTYTVSPTAGVHKGAPEYCALLPSTNGNQQIPVHVFGFGADHD 444 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 445 SVSLHSISQTSGGTFSFIETEAAIQDAFAQ 474 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 84/242 (34%), Gaps = 23/242 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + R + + W + P + +T D+++ LD+S+SM++ Sbjct: 54 VVPRSNWLQRILAPLAWALLVTALARPQFLEAPIE-KTQPVRDLLLALDLSQSMDTRDFR 112 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + I + A+ E V + GL+ F + + +Q I+ L Sbjct: 113 DPS--GALIPRVQAVREVVSSFVARRPGDRIGLIVFGDAPYPLAPFTLDHALVQTMIRDL 170 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 ++ G D G+ + +K+++ +TDG T Sbjct: 171 LPGMAGPSTALG---------DAVGLGIKMFDQSPAPEKVLIVLTDG--NDTASKMPPER 219 Query: 311 YCNEAKKRGAIVYAIGIRVIRSH-------EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + AK+R V+ IGI + L+ A + ++ + +S+ ++ + Sbjct: 220 AADIAKQRHVTVHTIGIGDPSAEGEQRVDLGVLQRMAAQTGGRYFFGADQNSLESIYATL 279 Query: 362 GK 363 + Sbjct: 280 DR 281 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 69/215 (32%), Gaps = 45/215 (20%) Query: 174 MMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 MM+ +D+S SM S ++ + + + + + + GL+ FS + Sbjct: 93 MMLAVDLSASMSSPDLVQSGVPANRLQVVKRVADDFIAR-RTGD------RIGLILFSTR 145 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + +++ + S G +T+ + A + D Sbjct: 146 AYVQAPLTLDRNVVRQLLAEASIGMTGRNTSIGDAIGLAVKTLRD-----------RPAK 194 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------IRS- 332 ++++ +TDG N + A K ++ IG+ Sbjct: 195 DRVLILLTDGANT--SGVLDPMEAAAIAAKENVRIHTIGVGADSNFTDIQPGMLMNPSGD 252 Query: 333 --HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 E L+ A + ++ N + ++ I + Sbjct: 253 LDEEALKKIAGLTGGQYFRARNDKGLAAIYADIDR 287 >gi|291452634|ref|ZP_06592024.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] gi|291355583|gb|EFE82485.1| von Willebrand factor type A domain-containing protein [Streptomyces albus J1074] Length = 658 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 75/219 (34%), Gaps = 25/219 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + ++ + +++VLD S SM T+I+ A ++ +++ + Sbjct: 35 LAAGPPATAEPGPGAGG---LVMVLDSSGSMADEAGGGRTRIEAARDAVGTVVDSLPDGY 91 Query: 214 DVNNVV----QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V ++ T + L+ +++ + + G T L+ A Sbjct: 92 PTGLRVYGADRTSGCTDTRLARPVEPLD--RDAMKKAVAGVEPKG-DTPIGLSLRKAVAD 148 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + A ++ ++ ++DGE+ S + + + AIG Sbjct: 149 LPEPE--------PGAVGRRTVLLISDGEDNCGSPPPCEAAEELAESGLD--LRIDAIGF 198 Query: 328 RV-IRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGK 363 +V ++ E L A +Y + ++ + Sbjct: 199 QVEGKAREELTCVAEAGHGAYYDAPDAEALARQLQRAAE 237 >gi|239980776|ref|ZP_04703300.1| hypothetical protein SalbJ_15127 [Streptomyces albus J1074] Length = 628 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 75/219 (34%), Gaps = 25/219 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + ++ + +++VLD S SM T+I+ A ++ +++ + Sbjct: 5 LAAGPPATAEPGPGAGG---LVMVLDSSGSMADEAGGGRTRIEAARDAVGTVVDSLPDGY 61 Query: 214 DVNNVV----QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V ++ T + L+ +++ + + G T L+ A Sbjct: 62 PTGLRVYGADRTSGCTDTRLARPVEPLD--RDAMKKAVAGVEPKG-DTPIGLSLRKAVAD 118 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + A ++ ++ ++DGE+ S + + + AIG Sbjct: 119 LPEPE--------PGAVGRRTVLLISDGEDNCGSPPPCEAAEELAESGLD--LRIDAIGF 168 Query: 328 RV-IRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGK 363 +V ++ E L A +Y + ++ + Sbjct: 169 QVEGKAREELTCVAEAGHGAYYDAPDAEALARQLQRAAE 207 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 70/208 (33%), Gaps = 38/208 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 M++++DVS SM++ D ++I++ + A L+ + + G FS+ +++ Sbjct: 395 MLVLIDVSGSMQTKIDGGQSRIELMESTAIAALDVLPK------TTRLGAWAFSSNLQKN 448 Query: 233 ---FFLLEWGVSHL------------QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + L G + + L+ T + AY + D Sbjct: 449 HVDYLPLTNGEQPILDDTYRNGLIAKAHTLPGLAAKNGDTALYDTIAAAYKSVTD----- 503 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-----KRGAIVYAIGIRVIRS 332 T D NY +V +TDG N K + + I + Sbjct: 504 ----TYDPNYVNSVVVLTDGTNDDPNGGLALDQLLARLKSQYSADKPVKIVTISLGTGTD 559 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAF 358 + L+ A+ Y + P + F Sbjct: 560 PDALKRIAKATDGLSYQTKTPEQISGVF 587 >gi|156358436|ref|XP_001624525.1| predicted protein [Nematostella vectensis] gi|156211311|gb|EDO32425.1| predicted protein [Nematostella vectensis] Length = 1323 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 74/195 (37%), Gaps = 21/195 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD+ I LD S S+ T ++A + ++E +K+ + ++ GL+ +S+ Sbjct: 1134 SLDIGIALDRSTSVGP------TNFNIAKTFLKILVERMKISTNGSHF---GLIAYSSSA 1184 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + R+I + G T + L+ A +F + N Sbjct: 1185 SRVISFRFSQKAADINRQIDAIEFTGGKTRTDFALQVAITDLFTNS------AGDRENVT 1238 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASP-NSFY 346 +++ MT+G +++ K++ V A+GI + E L + Sbjct: 1239 DVLIVMTNGR--TSQGSLPYKDVMKPLKEKKVDVIAVGIGPDVNEAELLEIAEGGLDHVI 1296 Query: 347 LVENPHSMYDAFSHI 361 V++ ++ + I Sbjct: 1297 RVDDYEALATKLNSI 1311 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 27/234 (11%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 A K L + + ++ + +T K + + +++ +V+D S SM Sbjct: 30 EAKIITKAELSAPIILENTQENNYLKISLTG-FKQDETEKSPINLSLVIDRSGSM----- 83 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKI 247 S +I+ A ++ + +K V+ ++ +S+ + + + I Sbjct: 84 -SGDRIEKAREAAIMAINMLKDDDIVS------VIAYSDNAYLIIPATKVKNKNEMIKII 136 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQ 306 K G ST L ++ + + + N I+ ++DG+ N+ + Sbjct: 137 NDTIKPGGST----ALFAGVSK--GITEVNKFIKKNQVNR---IILLSDGQANIGPSTTK 187 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 + A K+G V IG+ + + + A A S + VEN + AF Sbjct: 188 ELADLGQVAGKQGIAVTTIGLGNGYNEDLMTALAGFSDGNHAYVENSADLETAF 241 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 64/166 (38%), Gaps = 27/166 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + + + V+ +D++ V+DVS SM + K+ A ++ A Sbjct: 172 PNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAA 225 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV-SHLQRKIKYL------SKFGVSTN 258 + K + N+ + L+ FS+ + E ++ +G S++ ++ + G TN Sbjct: 226 VNYFKSNYNQND--RFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTN 283 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + L A + D KK I+F+TDG Sbjct: 284 YSAALSLAKSYFT------------DPTRKKYIIFLTDGMPTVLNA 317 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 69/208 (33%), Gaps = 32/208 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + + + ++E ++ V V F+ Sbjct: 507 SVNIAFLIDGSSSVGD------SNFRLMLDFVARVVETFEISDIGTKV---AAVQFTYDQ 557 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + I+ + T + + + +F +R N Sbjct: 558 RTEFSFTDYTTKENVLAVIRQIRYMSGGTATGDAVAFTVRNVFGP--LRDSPNKN----- 610 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS----PNS 344 +V +TDG+ A K G V++IG+ + L+ AS ++ Sbjct: 611 -FLVVLTDGQ-----SYDDVRGPAAAAHKAGITVFSIGMAWAPLDD-LKDMASEPKESHT 663 Query: 345 FY---LVENPHSMYDAFSHIGKDIVTKR 369 F+ + D I +D V R Sbjct: 664 FFTREFAGLEQIVTDVIRGICRDFVDSR 691 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 75/221 (33%), Gaps = 26/221 (11%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I S + V ++ S Y I L F I ++ S+ S Sbjct: 29 IDASLQVSDTRVYEHAPKSFKV---YLIHLPFILRIAALALLIIVLARPQSTDNWKSSEV 85 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+M+ +DVS SM + D +++ A + + P+ N G+ F+ + Sbjct: 86 EGIDIMLAIDVSGSMLAE-DLQPNRLEAAKDVAAKFIND---RPNDN----IGITLFAGE 137 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + L + T G+ A ++ + Sbjct: 138 SFTQCPLTIDHTALLNLFGNIQTGVIEDGTAIGMGVSNAVARL-----------KDSQAK 186 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K+I+ +TDG N + D L AK G VY +GI Sbjct: 187 SKVIILLTDGSNNA--GDISPLTSAEIAKTYGIRVYTVGIG 225 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/288 (15%), Positives = 100/288 (34%), Gaps = 37/288 (12%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 + T +F ++ + N + + ++ + + YS A+ P Sbjct: 102 SVAETPVSTFSVDVDTGSYANVRRLLNQGSLPPEGAVRLEEMVNYFPYSY-ALPTDGSPF 160 Query: 140 KFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T + PW ++R + + I +S A +++ ++DVS SM+ + Sbjct: 161 GVTTEVAPSPWNPHTRLLRIGIKAS-DRAVADLAPANLVFLVDVSGSMD-----RREGLP 214 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFG 254 + ++ ++++++ + LV ++ + G + + I L G Sbjct: 215 LVKSTLKLLVDQLRDQD------RVSLVVYAGESRVVLKPTSGRDKVTIRNAIDQLDA-G 267 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCN 313 ST G++ AY + I+ TDG+ N+ + Sbjct: 268 GSTAGASGIELAYQM-----ARESFIDKGINR----ILLATDGDFNVGVSDFDSLKQMAV 318 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 + +K G + +G V +E L A A ++ ++N Sbjct: 319 DQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDNLLEARKVL 366 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 66/173 (38%), Gaps = 26/173 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 67 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 119 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 120 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL-----------RLRPA 168 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +++V +TDG N + + D + A + G +Y IGI + L++ Sbjct: 169 NSRVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQSA 219 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 66/173 (38%), Gaps = 26/173 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL-----------RLRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +++V +TDG N + + D + A + G +Y IGI + L++ Sbjct: 193 NSRVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQSA 243 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 70/217 (32%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ + ++ + + ++ +E ++ + GL+ F + Sbjct: 84 DLMVAVDLSGSMQIEDMVLDNKTVDRFTLVQHVVSDFIER-RVGD------RIGLILFGD 136 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + ++ G T + A + +++ Sbjct: 137 HAYLQSPMTQDRRSVAQYLREAQIGLVGKQTAIGESIALAVKRFENLEESN--------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 +++V +TDG N + A +R +Y IG+ Sbjct: 188 --RVLVLLTDGTNNAGSISPD--KAAAIAAERKVTIYTIGVGAEMMERRSFFGRDRVNPS 243 Query: 331 ---RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGK 363 + R A+ ++ + + + I K Sbjct: 244 MDLDEEQLQRIANATQGKYFRARSSEDLAAIYQEIDK 280 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 73/197 (37%), Gaps = 25/197 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ +LDVS SM S K+ + +++ + + V VV +G Sbjct: 361 EERPASNLVFLLDVSGSM-----SQPNKLPLLKEALMLLTRRLDSRDRVAIVVYAGA--- 412 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + ++ + L G STN+ G++ AY R+H + N Sbjct: 413 -SGLVLPSTTANNTATIEHALTQLQA-GGSTNAGAGIELAYQV------AREHFIEDGNN 464 Query: 287 YKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPN 343 ++ TDG+ T + ++AK G + +G + + L ++ Sbjct: 465 R---VILCTDGDFNVGQTNRGDLAQIVADQAKD-GVSLTVLGFGMGNYKDNMLEELSNKG 520 Query: 344 S--FYLVENPHSMYDAF 358 + V++ F Sbjct: 521 KGTYAYVDSEAEARKVF 537 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/223 (18%), Positives = 74/223 (33%), Gaps = 41/223 (18%) Query: 165 NSQTDARLDMMIVLDVSRSME-----------SFFDSSITKIDMAIKSINAMLEEVKLIP 213 N + L ++ V+D S SM+ SS KID+A ++ + Sbjct: 403 NRELRPDLAIVFVIDKSGSMDACHCANPDRGGPITSSSERKIDIAKDAVAQATALLSPQD 462 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 G+VTF F+ G V + + + G TN GL A + Sbjct: 463 T------VGVVTFDGAAFPTFVATRGATVEQVMDAVSGVEPRGP-TNIRAGLLRAEEMLQ 515 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +H ++ +TDG + DQ L +++G + + Sbjct: 516 QVDARIKH-----------MILLTDG--WGSGGDQ--LDIAARLREQGITLTVVAAGSGS 560 Query: 332 SHEFLRACAS-PNSFYLVENPHSMYDAF-----SHIGKDIVTK 368 + + A +Y + + F + IG IV + Sbjct: 561 ATYLQQLAAEGGGRYYPAADMADVPQIFVQETITAIGNYIVEQ 603 >gi|17558302|ref|NP_505768.1| hypothetical protein C29A12.6 [Caenorhabditis elegans] gi|3874606|emb|CAA98248.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] gi|3877573|emb|CAB01217.1| C. elegans protein C29A12.6a, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 643 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 89/250 (35%), Gaps = 34/250 (13%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 S+ K P + + + + S T D+ ++DVS Sbjct: 419 RSRSVQFAMTEKPPFTTVMNPMKFFTTTRTPITKPKPLIPYSCTA---DVFFLVDVS--- 472 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----IEEFFLLEWGV 240 + D S +D+A +I+++ P + V+ GL+++S + F Sbjct: 473 QGTGDKSQQYLDIAASAISSL-------PISQDTVRVGLISYSGPGRTHVRVFLDKHNEK 525 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L ++ + + G +T + ++YA + + H N KK++V TDG + Sbjct: 526 EKLIEEMFLMERHGGTTRTADAIRYA----TKIFEGKAHPA--RKNVKKVLVVFTDGYSQ 579 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACASPNSFYLVENPHSMYD 356 ++ + A+ +G + A+ + + + + S ++ + + + Sbjct: 580 DNPKEASRM-----ARAKGIQLIAVAVKDRLAPPDTEQLTEIGGNGRSIFISPSGRDLRE 634 Query: 357 AFSHIGKDIV 366 IG Sbjct: 635 --KIIGTQCQ 642 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/142 (22%), Positives = 53/142 (37%), Gaps = 13/142 (9%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN- 299 S L+ KI L G +TN GL +A+ + + + Y I+ ++DG+N Sbjct: 337 STLKGKINKLDAEG-NTNQPIGLFWAWMSLQTGVPLNTPAKDTEYKYTDAIILLSDGDNT 395 Query: 300 -----LSTKE-DQQSLYYCNEAKK--RGAI-VYAIGIRVI--RSHEFLRACASPNSFYLV 348 S D + C+ K G ++ I + L+ CAS F+ Sbjct: 396 QSGNSNSVSAIDARQKKLCDNIKDPLNGTTTIFTIQVNTDGDDESAVLKYCASDGQFFQS 455 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 + AF IG + R+ Sbjct: 456 TTADQIEIAFQSIGSSLTKLRL 477 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 70/221 (31%), Gaps = 29/221 (13%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F G + +L I + + +G+ ++ S ++ + D + + Sbjct: 17 FHAADDGNIAVLFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATL----------- 65 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 K G I + L + +N++ + N ++ Sbjct: 66 -MVSKDYAAGVIRASDIQATAEKYFKALYTSPGINNVT-----VTATYTARSANGSSTVV 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + +P F +T +P T+S RL + + LDV+ SM+ Sbjct: 120 MNTSGSMPTSFLKVA-GFT-----ALPFTASSTSTWGA-TRLRVAMALDVTGSMDWD--- 169 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 K+ + ++ +K + V ++ F+ + Sbjct: 170 --DKLTAMKTAAIKLVNTLKATASTDADVYISIIPFNVMVN 208 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 24/232 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNS 166 + S+ + + Y P K PW +++ + + I + Sbjct: 1 MPDPQSVRVEEMINYFPYDWPGPETADQPFKATVTVMPTPWNHDTKLMHVAIKG-YDIAP 59 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T +++ ++DVS SM+ K+ + + ++ +K V+ V +G Sbjct: 60 ATAPHANLVFLIDVSGSMDE-----PDKLPLLKSAFRLLVNRLKADDTVSIVTYAGNAGT 114 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + S + I L G ST G++ AY + ++ N Sbjct: 115 VLEPTRVAE----KSKILSAIDKLEA-GGSTGGAEGIEAAY-DLAKKAFVKDGVNR---- 164 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ D+ E +K G + +G ++ L Sbjct: 165 ----VMLATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLM 212 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 66/175 (37%), Gaps = 24/175 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++MM+ +DVS SM + D +++ +IN + + + + Sbjct: 79 GSKLREEKTQGVEMMLAVDVSNSMLAE-DFEPNRLERTKYAINKLFDGLHQD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + + + K + T L A + G Sbjct: 131 VGLIVFAGEPKVQLPITSDYRMAKAFAKRIDPSLVPVQGTAIGKALSQA---LMSFSGET 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + N+ ++++ +TDGEN + +L A + G +Y IGI Sbjct: 188 E------ENHSRVVILITDGENH----EDDALAAARHAAEMGIRIYTIGIGTPEG 232 >gi|148694465|gb|EDL26412.1| procollagen, type XII, alpha 1, isoform CRA_a [Mus musculus] Length = 1722 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 70/205 (34%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E ++ P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSKIVDDLTINLC 212 >gi|66805993|ref|XP_636718.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] gi|60465117|gb|EAL63216.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] Length = 549 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 77/206 (37%), Gaps = 14/206 (6%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK-----IDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 ++DM IVLDVS SM + + TK ++ + S + V +GLV Sbjct: 121 QIDM-IVLDVSCSMTAVAYNGSTKPGLLEMNRSEVSQALFQTMLDKYVSFEVPVVAGLVL 179 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F KI+ F + +++ + ST + +A N+I + + D Sbjct: 180 FGQKIDTAFEISKNFDSFSQELGEVVANQGSTRLYEAIYHAANEIEKYRNNPKEKLAPDV 239 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPN 343 + I +TDG++ S + K I+ AI I + L A+ Sbjct: 240 CCR--IFLLTDGQDTS---NINPYNVYQYLKPLNIILDAIPIGR-DDNSTLSTLTKATGG 293 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKR 369 S ++ + + F I T+R Sbjct: 294 SCFMANSTQEGVELFEREALLIPTQR 319 >gi|73973310|ref|XP_867431.1| PREDICTED: similar to alpha 1 type XII collagen short isoform precursor isoform 2 [Canis familiaris] Length = 1901 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 70/205 (34%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E ++ P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEDELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSKIVDDLTINLC 212 >gi|260837282|ref|XP_002613634.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] gi|229299020|gb|EEN69643.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] Length = 1460 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 64/167 (38%), Gaps = 21/167 (12%) Query: 174 MMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++IV+DVS SM ++++A ++ +L+ + +G+V+FS + + Sbjct: 210 VVIVMDVSGSMREPHGVPEEQNRLNLAKQAALTVLDTLTPRD------WAGVVSFSARAK 263 Query: 232 EFF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ I T G K A+N F+ + + Sbjct: 264 APEGCLGDSLGEANPTNIGIMKDFINQRVP-ETITVYAEGFKKAFNMFFESKNKKPEQF- 321 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + II+F+TDG+ T + + +R ++ G+ Sbjct: 322 --EDCQNIIIFLTDGQPTDTYFTLDDIVKGQDLMERSVHIFTYGLGA 366 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 60/166 (36%), Gaps = 20/166 (12%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++IV+DVS SM E ++++A ++ +L+ + G+V+FS + E Sbjct: 1167 VVIVMDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRDWG------GVVSFSARAET 1220 Query: 233 FF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ I T G + A++ + + + Sbjct: 1221 PEGCLGDSLGEANPTNIGIMKDFINQRVP-ETITMYGVGFRKAFDMFAEARNKKPEQF-- 1277 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + II+F++DG ++ E R ++ G+ Sbjct: 1278 -EDCYNIIIFLSDGSPTDKDFALNAITQGQELMDRSVYIFTYGLGA 1322 >gi|218296567|ref|ZP_03497295.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] gi|218243109|gb|EED09641.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] Length = 706 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 68/199 (34%), Gaps = 35/199 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SME + ++ LE V+ + G+V FS+ Sbjct: 309 LVLVLDVSGSMEG---------EKLAMAVAGALELVRSAAPED---YLGVVLFSSSPRVL 356 Query: 234 FLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F G + + L G T + A + D+ R K Sbjct: 357 FPPRPMTAQGKKEAESLLLSLRA-GGGTVLGGAFREALRLLQDVPVER-----------K 404 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYL 347 ++ ++DG KE +L A G V A+ + FL A A FY Sbjct: 405 ALLVLSDGIIFDPKEPILAL-----AATAGVEVSALALGPDADAAFLEALAQRGGGRFYR 459 Query: 348 VENPHSMYDAFSHIGKDIV 366 P + F G+++ Sbjct: 460 AATPKELPRLFLKEGQEVF 478 >gi|119572528|gb|EAW52143.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_a [Homo sapiens] Length = 1153 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|119572529|gb|EAW52144.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_b [Homo sapiens] Length = 1152 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|64654539|gb|AAH96347.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|64654595|gb|AAH96346.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|224831239|ref|NP_001139280.1| integrin alpha-M isoform 1 precursor [Homo sapiens] gi|307148|gb|AAA59544.1| glycoprotein Mac-1 [Homo sapiens] Length = 1153 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|386975|gb|AAA59903.1| neutrophil adherence receptor alpha-M subunit [Homo sapiens] Length = 1145 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 135 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 191 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 192 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 239 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 240 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 297 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 298 KPPRDHVFQVNN----FEALKTIQNQLREK 323 >gi|307114|gb|AAA59491.1| leukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1152 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|88501734|ref|NP_000623.2| integrin alpha-M isoform 2 precursor [Homo sapiens] gi|1708572|sp|P11215|ITAM_HUMAN RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: Full=Neutrophil adherence receptor; AltName: CD_antigen=CD11b; Flags: Precursor gi|263049|gb|AAB24821.1| leukocyte integrin alpha chain [Homo sapiens] gi|64653358|gb|AAH96348.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|68563402|gb|AAH99660.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|168275740|dbj|BAG10590.1| integrin alpha-M precursor [synthetic construct] Length = 1152 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 200 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|268572085|ref|XP_002641230.1| Hypothetical protein CBG09096 [Caenorhabditis briggsae] Length = 564 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 73/202 (36%), Gaps = 25/202 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K ++ +L + + P V+ L+T+ Sbjct: 243 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVDRVLGNINIHP---EAVRVALITY 293 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 294 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSTG 349 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRAC 339 K+ + +TDG + + +D + + G I+ A+ + E Sbjct: 350 IREGVPKMALVLTDGHSHRSPKD-----MAEKMRAAGIIMIAVSVTPRPLVDEAELRLIA 404 Query: 340 ASPNSFYLVENPHSMYDAFSHI 361 S + N H F Sbjct: 405 GSEKRAFTPPNLHEFESEFMKY 426 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 44/139 (31%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + L+ FS Sbjct: 49 APPLDLIFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKTATRVMLIQFSG 98 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D Sbjct: 99 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKTPGSGLR-----DGT 153 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 154 VPKIVYLLSDGRTHDYPKD 172 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 45/223 (20%), Positives = 79/223 (35%), Gaps = 37/223 (16%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 P N++ +++ +T ++ + LD++ VLDVS SM+ I K+ A+K + Sbjct: 2 APLEENTQKVLLELTG--GDSTSDRSGLDLVAVLDVSGSMQ---GEKIEKMKTAMKFVVK 56 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNST 260 L + + +VTF + L Q + I L G +TN + Sbjct: 57 KLSSID---------RLSIVTFLDTANRICPLRQVTEDSQPQLLKLIDALQP-GGNTNIS 106 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 GL+ + D + ++ M+DG+ K Sbjct: 107 DGLQTGLKVLADRKLSSGRVVG--------VMLMSDGQQNRG-------EPAANVKIGNV 151 Query: 321 IVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 VY G L A A +F +V + + + AFS Sbjct: 152 PVYTFGFGADYDPTVLNAVARNSMGGTFSVVNDVNLLSMAFSQ 194 >gi|282897673|ref|ZP_06305672.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] gi|281197352|gb|EFA72249.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] Length = 474 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 66/207 (31%), Gaps = 28/207 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 S T +++++D S SM K+ + + ++ L D Q + Sbjct: 42 GKSSTIKPQAIVLLIDTSSSMSD------GKLAEVKTAASQFIQRRNLESD-----QIAV 90 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V F ++ L ++ L I L + G ST G+ A +Q+ + Sbjct: 91 VNFGATVQTPAPLTNDINTLNNAIDQLLEIG-STPMGEGINTAQDQL------------Q 137 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K I+ TDG + + G + A+ ++ + + Sbjct: 138 ATTLNKNIILFTDGLPDDPNF---AYNSALSVRNAGIKLIAVATGGADTNYLTQITGDRS 194 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + N AFS I + I Sbjct: 195 LVFYA-NSGQFDQAFSQAEAVIYKQLI 220 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 80/231 (34%), Gaps = 53/231 (22%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +++ +++M+ +DVS SM + D ++I+ A + + PD N GL Sbjct: 81 SNKETEGINIMMAIDVSTSMLT-PDLPPSRIETAKQVAYEFINN---RPDDN----IGLT 132 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQGMR 277 F + L S L K ++ T GL A + + Sbjct: 133 VFGGEAYTQCPLTTDHSALLNMFKQVNCDLQKEGVISPGTAIGMGLSSAVSHL------- 185 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------- 328 + + K+I+ +TDGEN + + L AK+ G +Y I + Sbjct: 186 ----EQSKSKSKVIILLTDGENNA--GEISPLTAAEMAKRLGIRIYTISVGTDAAVNQTV 239 Query: 329 --VIRSH------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + + L A A + FY + + D + +I + Sbjct: 240 ATLPNGETYEAAIKQNTDPKTLEAIANSTGGKFYQARSKAKLRDIYQNIDR 290 >gi|18490111|gb|AAH22236.1| Unknown (protein for IMAGE:4178997) [Homo sapiens] Length = 439 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 20 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 71 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 72 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 126 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 127 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 168 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/210 (11%), Positives = 65/210 (30%), Gaps = 23/210 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 237 SPADITILLDGSASVGSH---NFDTTKHFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 292 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 293 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 343 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KK ++ +DG + EA++ G ++ + + + +R + + Sbjct: 344 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGRQVNEPHIRVLVTGKT 402 Query: 345 --FYLVENPHSMYDA--FSHIGKDIVTKRI 370 + + ++ + + + + + + Sbjct: 403 AEYDVAYGESHLFRVPSYQALLRGVFHQTV 432 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/285 (15%), Positives = 100/285 (35%), Gaps = 30/285 (10%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRS 113 + A Q + E + + R+K+ +F ++ + N + + + Sbjct: 96 LLTAEQAIPEASNRENYSAIDENAFKRVKHNPLSTFAIDVDTASYSNLRRFLNNGQLPPT 155 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVKVNSQTDA 170 ++ I + Y P T I PW + + + I K+ + Sbjct: 156 NAIRIEELINYFNYDYP-EPESDRPFSITTEISQAPWNPTHQLVHIGIQGE-KMAIEDLP 213 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +LDVS SM + K+ + + ++ E++ V+ VV +G + Sbjct: 214 PSNLVFLLDVSGSMNT-----PNKLPLLKDAFRMLVNELREEDQVSIVVYAGAAG----V 264 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + I+ L+ G + G+K AY ++ ++ N Sbjct: 265 VLPPTPGNEKDKILTAIENLNAGGST-AGGAGIKLAY-KLAQDNFIKSGNNR-------- 314 Query: 291 IVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ D + + + + +G + +G + Sbjct: 315 VILATDGDFNVGVSSDTELVKLIEQKRNKGVFLTVLGFGSGNLQD 359 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/249 (12%), Positives = 82/249 (32%), Gaps = 39/249 (15%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 ++ A L + +G+ +++ ++ L + +D + + + Q+ ++ + +N K Sbjct: 1 MVVAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPDVDNGK----- 55 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 + N + + + DI ++A + + L Sbjct: 56 -----------------VSQAVTTNLAANDPEAKATDISSGGATRSVCVTAEADVDMTL- 97 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 S+ + + T+ +++++VLD + SM T I + Sbjct: 98 ----------SKVVGLDATTVTAEACAGYNDIELVMVLDATGSMRG------TPIANVKE 141 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 + +++ + N + GLV F K+ + + T + Sbjct: 142 AAANLVDLIMPDSGANTRSKIGLVPFQGKVRIDGNDPVTAERDPDGVGAGCRNADGTLND 201 Query: 261 PGLKYAYNQ 269 LK Y+ Sbjct: 202 GKLKTEYSD 210 >gi|223462031|gb|AAI46869.1| Collagen, type XII, alpha 1 [Homo sapiens] Length = 1899 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSRIVDDLTINLC 212 >gi|119569134|gb|EAW48749.1| collagen, type XII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1899 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSRIVDDLTINLC 212 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 73/201 (36%), Gaps = 28/201 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 A L+ +VLD S SM+ K+ ++ +++ + V+ + Sbjct: 34 SAVAEKAPLNFCLVLDRSGSMQG------AKLAALKEATRRVIDTLTPQDIVS------I 81 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F + ++ + L+ ++ + + G + S + +R+H + Sbjct: 82 VLFDDTVQTLVPATFATDRDALKAQVDAIEEAGGTAMSGG-------MAAGIVELRKHHD 134 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--AC 339 + ++ +TDG+ ++ + E + V A+G+ + + L A Sbjct: 135 PGRVSA---MLLLTDGQTWG--DEDRCRALAQELARDHVRVTALGLGAEWNEKLLDDIAD 189 Query: 340 ASPNSFYLVENPHSMYDAFSH 360 A+ + +P + F H Sbjct: 190 ATGGLSDYIADPSQITTFFQH 210 >gi|114608142|ref|XP_001142833.1| PREDICTED: collagen alpha-1(XII) chain isoform 1 [Pan troglodytes] Length = 1899 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSRIVDDLTINLC 212 >gi|93141049|ref|NP_542376.2| collagen alpha-1(XII) chain short isoform precursor [Homo sapiens] Length = 1899 Score = 60.6 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSRIVDDLTINLC 212 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 31/219 (14%), Positives = 71/219 (32%), Gaps = 50/219 (22%) Query: 173 DMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+V+D+S SM+ SI ++ + ++ + + GL+ F Sbjct: 90 DLMLVVDISPSMDEQDMVLQGRSINRLQAVKRVLDDFISR-------RQGDRLGLILFGT 142 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + ++ ++ L G +T + A ++ Sbjct: 143 EPYVQAPLTFDLETVRTLMREAGLGMAGRATAIGDAVGLATKRL-----------RNRPQ 191 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------- 331 ++++V +TDG N + D+ + A +Y IGI Sbjct: 192 DQRVVVLLTDGANTAGEITPDKAT----EIAAAASIRLYTIGIGAESMVQRGLLGSRRVN 247 Query: 332 -----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ + + + I + Sbjct: 248 PSRDLDENLLTRMAQQTGGEYFRARSLPELELIYESIDR 286 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 60.6 bits (145), Expect = 5e-07, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 66/217 (30%), Gaps = 46/217 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM+ ++ + + + +E N + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKTVDRFSLIQNVLGEFIER-------RNGDRLGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + G T + A + + Sbjct: 138 HAYLQAPLTQDRRSIATFLADAQIGLVGKQTAIGEAIALAVKRFDQVSESN--------- 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI---------------- 330 +++V +TDG N + + A KR +Y +G+ Sbjct: 189 --RVLVLLTDGSNNAGNIEPDVA--AEIAAKRNVTIYTVGVGAELMERRTIFGKERVNPS 244 Query: 331 ---RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + R + ++ +N + + I + Sbjct: 245 MDLDEAQLQRLATMTNGYYFRAKNSEDLAQIYQKIDQ 281 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 80/205 (39%), Gaps = 25/205 (12%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S +K PLKF + ++ + K+ + +D++ +DVS+SM + D Sbjct: 49 KKSAFKAPLKFSLQLLGVAAIVIALVNPKAGTKLETVKREGVDIVFAVDVSKSMLAE-DI 107 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + +++ A + ++ ++ E+ + G++ ++ + + + ++ + Sbjct: 108 APNRMEKAKRLVSEIINELASD-------RIGIIAYAAQAYPQLPITTDFGAAKMFLQGM 160 Query: 251 SKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + T + ++ A D +++ ++DGE+ S Sbjct: 161 NTDMLSSQGTAISDAIELATTYYNDAAQTN-----------RVLFIVSDGEDHSEGGAVN 209 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ +A + G ++ IG+ + Sbjct: 210 AVS---KATEAGIKIFTIGVGTEKG 231 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 65/152 (42%), Gaps = 21/152 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSS--ITKIDMAIKSINAMLEEV 209 +++ + + V + A +D++ VLDVS SM E F +++D+ ++ +++ V Sbjct: 46 VLVTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKLV 105 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKY 265 + + +V+F++ + + L + ++K + L G +T+ P LK Sbjct: 106 RDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASG-NTDFRPALKK 158 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 A + + I+ ++DG Sbjct: 159 AVEILDGRGKEEKKKRVG------FILLLSDG 184 >gi|149043685|gb|EDL97136.1| procollagen, type VI, alpha 2, isoform CRA_b [Rattus norvegicus] Length = 369 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 14/207 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGL 223 + D +++ VLD S S+ + + + + + + +++ + V + G Sbjct: 48 KADCPVNVYFVLDTSESVAMQSPTD-SLLYHMQQFVPQFISQLQNEFYLEQVALSWRYGG 106 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + FS+++E F + + ++ + F T + L QI RQH Sbjct: 107 LHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI------RQHVGRG 160 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N V +TDG + A++ G ++A+ + + LR A+ Sbjct: 161 VVN---FAVVITDGHVTGNPCGGIKMQ-AERAREEGIRLFAVAPNRNLNEQGLRDIANTP 216 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRI 370 N +M + I +D + + I Sbjct: 217 HELYRNNYATMRPDSTEIDQDTINRII 243 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 112/345 (32%), Gaps = 64/345 (18%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 SL ++ F+++ KG ++TA+ + ++ ++ S I + L +++ + Sbjct: 9 SLFYMLKRFYHDEKGVYAVMTALLAFPLLFLIAFAVDGSGILLDRARLAQATEQAALLLT 68 Query: 63 TQ---------------IMNEGNGNNRKKLKGGDILCRIKNTWNMS-------------- 93 T+ + +E N + K + + Sbjct: 69 TENNQYRADKSNLSNVQVTDEEIKNAKGSFKTAQDRKKGAQALKRNQELVQGMVKLYLRS 128 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + E + + + D V E S++ + + KF W S Sbjct: 129 YDKEQKSSSPITIPKDFVAECRTQTSTRTNGESSSVACLVEGDVKRKF-----WLPWSYT 183 Query: 154 IVMPITSSVKVNSQTD---------ARLDMMIVLDVSRSM------ESFFDSSITKIDMA 198 + ++V +NS +D+M+V D+S SM + I + Sbjct: 184 LTSNNRNTVDINSGKSYAVKEKDILIPIDLMLVNDISTSMFKPPKDDPQGPKKIDSLKTV 243 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--------VSHLQRKIKYL 250 +K++ +L + +++ + G+ +F ++ + + + + Sbjct: 244 VKAVANILIPDEPPKNISKYNRIGITSFGLGAQQAGKQDANQIKKCVLPFRGVDKTVNIK 303 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA---NYKKIIV 292 K+ AYN + + ++ ++ N +K V Sbjct: 304 VKYDGKVYPGS----AYNVLKEQHKLKLFEDSSHTQEFNPQKTTV 344 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 53/153 (34%), Gaps = 22/153 (14%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + W S ++ + G T ++ G+ N + ++ + N Sbjct: 422 QKSTATDIWFSSKKSSELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKD-ATPDKIKTN 480 Query: 287 YKKIIVFMTDGENLSTKEDQQSLY---YCNEAKKR------------GAIVYAIGIRVIR 331 ++I++ ++DG + + Q L CN+ + + + + + Sbjct: 481 TQRILLVLSDGVDTALPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFGYEQ 540 Query: 332 SHEFLR---ACASPNSFYLVENPHSMYDAFSHI 361 E + C P +++ +N ++ + F I Sbjct: 541 DSELRKEWENCVGPGNYHQAKNEKALLEVFKQI 573 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 55/330 (16%), Positives = 116/330 (35%), Gaps = 61/330 (18%) Query: 54 IDRSL-----VHAATQIMNEGNGNNRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVND 106 D ++ + + +MN+G G + D + S+ L +N V Sbjct: 141 NDAAISLVHRLPRSRGVMNQGRGLAPEPSMFDDDERLEQQLVFSGKSYSGALENNHPVRM 200 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV-KVN 165 +D+ + P+ +SA+ R KF + + + S +++ Sbjct: 201 ---------MDLKIYPE-----VSAVPRADSREKF-DVLVHLRAAAMVTGNANSLNNQIS 245 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A +D++ VLD+S SM TK+ + +++ +++ + + ++ Sbjct: 246 RYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSVIA 293 Query: 226 FSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ F L G + + + G TN GL+ + D + Sbjct: 294 FSSTARRLFPLTKMSDAGRQRALQAVNSVVANG-GTNIAEGLRKGVKVMEDQR------- 345 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQ---------SLYYCNEAKKRGAIVYAIGIRVIRS 332 D N I+ ++DG T S++ C K+ V++ G Sbjct: 346 --DKNPVASIILLSDGRATYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHD 402 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 403 ASLMHSVSETSGGTFSFIESESVIQDALAQ 432 >gi|123283202|emb|CAM24859.1| complement component 2 [Homo sapiens] gi|123857989|emb|CAM25859.1| complement component 2 [Homo sapiens] gi|168983781|emb|CAQ06832.1| complement component 2 [Homo sapiens] gi|168984348|emb|CAQ08707.1| complement component 2 [Homo sapiens] gi|168984415|emb|CAQ09271.1| complement component 2 [Homo sapiens] gi|168985076|emb|CAQ07480.1| complement component 2 [Homo sapiens] gi|168985954|emb|CAQ07110.1| complement component 2 [Homo sapiens] Length = 353 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 84/238 (35%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 92 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 144 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 145 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 201 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 202 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 261 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 262 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 319 >gi|118138230|pdb|2I6Q|A Chain A, Complement Component C2a gi|118138231|pdb|2I6S|A Chain A, Complement Component C2a Length = 517 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 80/219 (36%), Gaps = 28/219 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ Q L++ ++LD S+S+ + +S + M++ + V Sbjct: 7 GSKIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKESASLMVDRIFSFEIN---VS 57 Query: 221 SGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFD 272 ++TF+++ + L + I L + G TN+ L Y + + Sbjct: 58 VAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNN 117 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIG 326 + + I+ +TDG++ + + + + N+ + +YAIG Sbjct: 118 QMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIG 177 Query: 327 IRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +E +++++ +++ F H Sbjct: 178 VGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 216 >gi|62078605|ref|NP_001013960.1| von Willebrand factor A domain-containing protein 1 precursor [Rattus norvegicus] gi|81884089|sp|Q642A6|VWA1_RAT RecName: Full=von Willebrand factor A domain-containing protein 1; Flags: Precursor gi|51980318|gb|AAH81983.1| Von Willebrand factor A domain containing 1 [Rattus norvegicus] gi|149024826|gb|EDL81323.1| von Willebrand factor A domain containing 1 [Rattus norvegicus] Length = 415 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 65/180 (36%), Gaps = 22/180 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + + D++ +LD S S+ + S + + + ++ + P Sbjct: 15 LALAQSGIERGPTASAPQGDLLFLLDSSASVSHYEFSRV------REFVGQLVATMPFGP 68 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +++ LV ++ F + S +Q ++ + TN+ L YA Q+F Sbjct: 69 GA---LRASLVHVGSRPHTEFTFDQYSSGQAIQDAVRVAPQRMGDTNTGLALAYAKEQLF 125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K++V++TDG E K G ++ + Sbjct: 126 AEE------AGARLGVPKVLVWVTDG-----ASSDSVGPPMQELKDLGVTIFIVSTGRGN 174 >gi|1915902|emb|CAA67576.1| collagen (VI) alpha-1 chain [Homo sapiens] Length = 436 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 33/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +D++ VLD S S+ + ++A + +++ + + Sbjct: 17 CKCGPIDLLFVLDSSESIG------LQNFEIAKDFVVKVIDRLSRDELVKFEPGQSYA-- 68 Query: 222 GLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G+V +S+ + V L+ IK L T + L+Y +Q+ Sbjct: 69 GVVQYSHSQMQEHVSLRSPSIRNVQELKEAIKSLQWMAGGTFTGEALQYTRDQLL----- 123 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +I + +TDG + + ++ C G V ++GI+ Sbjct: 124 ------PPSPNNRIALVITDGRSDTQRDTTPLNVLC----SPGIQVVSVGIK 165 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 50/165 (30%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 234 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGT 289 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L + + +T+ L Y R + Sbjct: 290 GQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGYV---------TRFYREASS 340 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + EA++ G ++ + + Sbjct: 341 GAAKKRLLLFSDGNSQGATPAAIEKAV-QEAQRAGIEIFVVVVGR 384 >gi|86147474|ref|ZP_01065786.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] gi|85834767|gb|EAQ52913.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] Length = 460 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 50/295 (16%), Positives = 105/295 (35%), Gaps = 56/295 (18%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G + +++ I LP I LV+G+ ++ + +K+ L + +D + + AA + N Sbjct: 13 RGLVALMSIIALPFILLVVGLSVDAGRAYIVKSKLFAAVDAASIAAARAVANGE------ 66 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 + ++ + + + + +I SI + Sbjct: 67 ----DAGRAAAQKYFSANIPADFYSATPSLGAVNFAYDSFGNI---------SIDISATA 113 Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK- 194 ++P F I T + P S+ + +D+++V+D + S+ +T+ Sbjct: 114 QVPTIFLPLIGLDTFN-----PGVSAQSIRR----PVDLVLVIDNTTSLRLGSIGDVTQD 164 Query: 195 -IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKY 249 ID + + E + LV F+ E S ++ +I Sbjct: 165 VIDRSKSFVENFHEGFD---------RISLVKFAFGAEVPVGFNATRGHSRSSIKSEIDS 215 Query: 250 LS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + TN++ G+ A N++ + D K+IVF TDG Sbjct: 216 FNFGSTSNAQYTNASEGMYRALNELRTVT---------DPANLKVIVFFTDGAPN 261 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 87/243 (35%), Gaps = 38/243 (15%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARLDMMIVLDVSRS 183 Y PL P+ +++ + P S K + +++ ++DVS S Sbjct: 149 PYSYPL-PQDGRPFAVHTQTVDSPWQSEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGS 207 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH- 242 M + K+ + K++ + E+++ V L+T+++ + G Sbjct: 208 M-----TDPDKLPLVKKTLRILTEQLRPQDKVT------LITYASGEQLVLPPTSGKDKD 256 Query: 243 -LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + R + L G +T+ L+ AY Q + N I+ TDG+ Sbjct: 257 TILRALNALHA-GGATSGERALRMAYEQ------AEKAYVKNGINR---IILATDGDFNV 306 Query: 302 TKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDA 357 D ++L E +K G + +G +E + A A ++ +++ Sbjct: 307 GVSDTETLKSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIADAGDGNYSYIDSEKEARKV 366 Query: 358 FSH 360 H Sbjct: 367 LRH 369 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 101/280 (36%), Gaps = 31/280 (11%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVV 120 + N N +++ + + + + + N + + + ++ + Sbjct: 84 TSISNSENYAEIERNAVHATHEQAFAT-LSIDTDTGSYANVRRFLNNGSLPPTDAVRVEE 142 Query: 121 VPQNEGYSIS-AISRYKIPLKFCTFI---PWYTNSRHIVMPITSS-VKVNSQTDARLDMM 175 + Y A + P T + PW+ +R + + I + V Q +++ Sbjct: 143 LINYFNYDFKNAKKQGNAPFLVTTEMVKSPWHATNRIVKVGIKAEDVLAAKQNQPAANLV 202 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++DVS SM S K+ +A S+ + ++++ + + +G + ++ Sbjct: 203 FLVDVSGSMNSD-----DKLQLAKASLKMLTKQLRAQDTITLITYAG----NTEVVLPAT 253 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + I LS G STN +K AY Q + +Q N I+ +T Sbjct: 254 SGNQTQKILNAIDNLSANG-STNGEAAIKLAYQQA-EENFKKQGINR--------ILMLT 303 Query: 296 DGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 DG+ N+ + L + +G + +G ++ Sbjct: 304 DGDFNVGVSNVKDMLDIIRNNRDKGISLSTLGFGQGNYND 343 >gi|320334211|ref|YP_004170922.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319755500|gb|ADV67257.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 609 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 71/197 (36%), Gaps = 27/197 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T L++ +V+D S SM + + A+K+ ++ + ++ VV + Sbjct: 34 TRRPLNVALVIDRSGSMAG------SPLRYALKAAADFVDRLTETDVLSIVV------YD 81 Query: 228 NKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ + ++ +K + G++ S L+ + G R+ Sbjct: 82 DDVDTLLDAQPVRDKAAIKDLLKGVRAGGITNLSGGWLR----GCELVAGARRADAVNR- 136 Query: 286 NYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASP 342 ++ +TDG+ N + + + G +G + L A AS Sbjct: 137 -----VLLLTDGQANHGVTDTGVLIKTAASKAEAGVSTTTLGFGSSFEEDLLIGMARASG 191 Query: 343 NSFYLVENPHSMYDAFS 359 +FY +++ D F Sbjct: 192 GNFYFIQSMDDAADVFG 208 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 76/213 (35%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +D+S SM + D ++++A + + + + + GL F Sbjct: 83 KDTEGIDIMLTMDISASMLTE-DVFPNRMEVAKEVASEFISS-RPSDN------IGLTIF 134 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + +++ + T GL + +++ D + Sbjct: 135 AGEAFTQCPMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLKDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 K+++ +TDG N + AKK G VY IG+ + Sbjct: 193 ---------KVVILLTDGSNNVGSISPMTAATI--AKKFGIRVYTIGLGRETGEDIGAID 241 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + FY ++ + + I K Sbjct: 242 YKTLQDIAVLTNGEFYRAQSQAELSKIYQDIDK 274 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 65/171 (38%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL-----------RLRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 193 NSRVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 65/171 (38%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRIWLDEARIGIAGKNTALGDAIGLALKRL-----------RLRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 193 TSRVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 74/227 (32%), Gaps = 46/227 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +D S SM + D + A++ + P+ N GLV F+ + Sbjct: 86 IDIMLAIDASGSMMA-MDLQPNRFVAAVEVAQKFIGN---RPNDN----IGLVMFAGESF 137 Query: 232 EFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L ++ + T G+ A N++ E K Sbjct: 138 TQCPLTTDHATLLNRLSEVEIGYLEDGTAIGLGIATACNRL-----------KESHAKSK 186 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------------- 332 IIV +TDG N + + A+ G +Y + + Sbjct: 187 IIVLLTDGTNNAGSIAPSMA--ASLAESLGIRIYTVAVGTRGEAPYPHATAFGTVIDNVK 244 Query: 333 ----HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 L+ A + S++ + S+ + I +K + + Sbjct: 245 VEIDEASLKEIAQTTGGSYFRATDNESLNQIYDEIDSLEKSKLMTQN 291 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 47/305 (15%), Positives = 98/305 (32%), Gaps = 41/305 (13%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN- 105 + L S + A E + + + R+ T +F ++ + N Sbjct: 102 RPALQSSVQMKRSAAPAYEAREAPNTEKYNGESVSSVMRVTETPVSTFSVDVDTGAYANV 161 Query: 106 --DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV- 162 ++D T V + Y RY PL P+ + P + Sbjct: 162 RRMLNDGTTPTEAA-VRTEELLNY-----FRYDYPLPQDRSKPFSITTDVAQTPWNAQTR 215 Query: 163 ---------KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 V +++ ++DVS SM K+ + +++ + + ++ Sbjct: 216 LMRVGLRAYDVPRSERPAANLVFLVDVSGSMN-----DPDKLPLVKTALSMLSDNLRPDD 270 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V+ VV +G ++++ ++ LS G ST G+ AY Sbjct: 271 KVSIVVYAGAAG------MVLAPTHEGKYVKQALECLSA-GGSTAGGQGMALAY------ 317 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + N ++ TDG+ N+ + + ++ G + A+G Sbjct: 318 ATAEANFIKGGINR---VILATDGDFNVGISSIGEVEALVKQNRESGVTLTALGFGTGNY 374 Query: 333 HEFLR 337 +E L Sbjct: 375 NEALM 379 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 74/212 (34%), Gaps = 46/212 (21%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM++ + + ++ M + ++ ++ + GL+ F++ Sbjct: 98 DLMVAVDLSGSMQTQDMVVNGNEVDRLVMVKTVLGDFIQR-RVGD------RIGLILFAD 150 Query: 229 KIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +++ + + G ST + A ++ + + N Sbjct: 151 TAYLQAPLTFDRTTVEQLLSETVIGLVGDSTAIGDAIGLA---------AKRFSDKPNVN 201 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHE----- 334 K++V +TDG+N + + A + +Y IG+ Sbjct: 202 --KVLVLLTDGQNTAGNITPD--QALSLAVDQNIKIYPIGVGADAMTVNSLFGQRQVNPS 257 Query: 335 ------FLRACA--SPNSFYLVENPHSMYDAF 358 L A + ++ + + + Sbjct: 258 ADLDEGLLTRLAKDTGGQYFRARDTQELEQIY 289 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 82/198 (41%), Gaps = 19/198 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++++LD S SM++ S +TKI+ A +++ ++ + + Q GL + K++ Sbjct: 39 LLLMLDASGSMKAKDPSGLTKIEAAKRALTGVVGALP------DTAQVGLRVYGAKVDGK 92 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + L I L K ++ ST A + + + + K+ Sbjct: 93 GKPTPAACADTQLVHPIATLDKPKLT--STIAAIKALGETPIAHSLTEALKDLGTSGKRN 150 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVI-RSHEFLR--ACASPNSF 345 IV ++DGE + ++ + G + +G V ++ L+ A A ++ Sbjct: 151 IVLVSDGEESCVPDPCPAI---TKLTAAGVDLQIDTVGFGVNTKARAQLQCIAAAGKGTY 207 Query: 346 YLVENPHSMYDAFSHIGK 363 Y ++ ++ + S + + Sbjct: 208 YDAKDASALTTSLSKLSQ 225 >gi|313231534|emb|CBY08648.1| unnamed protein product [Oikopleura dioica] Length = 959 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 69/200 (34%), Gaps = 30/200 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + V + + + TFS K Sbjct: 500 DIVFLVDESGSVG------APNFQLMKSFLAQFASVVDP-----DRTRIAVRTFSTKTRL 548 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +F L+ + +Q I L TN++ L A D D N KI++ Sbjct: 549 YFGLKEPKNKVQ-LINELPYLTGWTNTSLALLGALRDFED-----------DRNSVKIVI 596 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSFYLVENP 351 +TDG + + ++ K + + I V + F L AS N Sbjct: 597 TITDGISQDPAA---TKRAADKLKADPRNIQSFAIGVAGAQMFELENIASSNDHVE---M 650 Query: 352 HSMYDAFSHIGKDIVTKRIW 371 +DAF+ I ++ K Sbjct: 651 LENFDAFTKILSTVLDKVCT 670 >gi|156358483|ref|XP_001624548.1| predicted protein [Nematostella vectensis] gi|156211335|gb|EDO32448.1| predicted protein [Nematostella vectensis] Length = 186 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 22/170 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+E + + K +K + ++ G + +S + Sbjct: 1 DLGFLIDASGSVEMYGTGTFKKCINFVKRV------LRAFDVSEKGTHVGAIIYSTDTKL 54 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F ++ + G +T + GLK A ++++ T + + Sbjct: 55 AFDFNTYKERGDIEAAFDRVKFLGETTFTGKGLKMALSELY---------KTARKDVSNL 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +V +TDG + + + G + +G+ L+A A Sbjct: 106 LVVITDG-----RSHDDVVKPSEMLRNAGVRIVTVGLGNSFDINQLKAMA 150 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 77/216 (35%), Gaps = 33/216 (15%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 RH + V + + + ++IVLD S SM + T ++++ + Sbjct: 409 QRGYRHTDLERILPVTFDRDDEPTVALVIVLDRSWSM------NGTAMELSKSAAEGAAN 462 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLK 264 + G++TF++ L L I + G T P L+ Sbjct: 463 ALAPSQ------MLGVLTFNDASNWDIPLGRVRESRPELHDAIGRIKASGP-TAIFPALR 515 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY+ + +++ +H ++ ++DG+ S ED + L + V Sbjct: 516 NAYDALANVRVRAKH-----------VILLSDGQ--SDPEDFEGLV--RKMSAAHITVST 560 Query: 325 IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 + + LR AS Y+V++ + + F Sbjct: 561 VALGPDADAALLRNLASWGGGRSYVVQDAQQIPEIF 596 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 26/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +D+M+++DVS SM D +++ A + + L V+ + GL+ FS Sbjct: 65 TQKGVDIMVLVDVSPSMMVE-DIKPNRLERARREVLDFLNVVQGD-------RIGLIAFS 116 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L +Q + L T+ ++ + Sbjct: 117 GVAFVQCPLTLDYGAIQMFLDELKPELIPVAGTDLGAAIEAGISSFDFKS---------- 166 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K+I+ +TDGE+ + + L +AK++G ++ G+ Sbjct: 167 -VTDKVIMLITDGEDN----EGKGLIAAQKAKEKGVKIFVFGMGDPSGG 210 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 108/301 (35%), Gaps = 27/301 (8%) Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 + G I ++++ S +EL N + D D+ T + + + + + + Sbjct: 121 RNHHGVLICTQMQSRQPHSLSSELTGNCTLLD-SDLRPRTQVSFIDLENHLDPHVRVDAG 179 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 K + +R + + + +A ++ IVLD S S++ Sbjct: 180 DCYQNKEGGMGEEESTARWRRALEKAEEEEEEEEEAGTEIAIVLDGSGSIDP------PD 233 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSK 252 A I M+ + LV + I+ F L+ + K++ +++ Sbjct: 234 FQRAKDFIANMMTNFSEKCFECSF---ALVQYGRVIQTEFNLQDSQNMTASLAKVQNITQ 290 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + +++ + IF + K+IV +TDG+ + + Sbjct: 291 VRNVTRTASAIQHVLDDIFT------QRHGSRKKASKVIVVLTDGDTFEDPLNLTVVI-- 342 Query: 313 NEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDI 365 N K +G +AIG+ ++ + L+ AS + V N ++ S + + I Sbjct: 343 NSPKMQGIERFAIGVGDAFKKHQTEQELKLIASDPDETHAFKVTNYSALDGLLSRLQQSI 402 Query: 366 V 366 + Sbjct: 403 I 403 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 95/266 (35%), Gaps = 37/266 (13%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 + A I + N + L T + + D G VN + + T Sbjct: 158 ARTDANRHISSHYQANEPAIFDDDEDLDPQHETAEGNTSTQ--DAGDVNSVRTVEVFTYT 215 Query: 117 DIVVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 ++ VP++ Y + + + K PL N H P + + + A +D++ Sbjct: 216 EVSAVPKSVSYDNFTILIHLKAPLTSGRQ---NRNWNHAESP-----QSSQDSRAPVDLV 267 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 VLDVS SM S TK+ + +++ +++ + + ++ FS+ F Sbjct: 268 TVLDVSGSM------SGTKLALLKRAMGFVIQNLGPSD------RLSVIAFSSTARRHFP 315 Query: 236 LE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L G + + L G TN GL+ + + D + C+ I Sbjct: 316 LRRMTETGKLEALQAVNSLVSSG-GTNIAEGLRKGFKVVVDRKWKNPVCS---------I 365 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKK 317 + ++DG++ T + K Sbjct: 366 ILLSDGQDTYTISGTSMTRPQADYKS 391 >gi|313226592|emb|CBY21738.1| unnamed protein product [Oikopleura dioica] Length = 766 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 67/212 (31%), Gaps = 29/212 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + LD++ V+D S S+ + + + + Sbjct: 243 ITQDQCRTNALDIVFVVDESGSIG------TPNFQLIKDFLEHF---ASDSTIAADATRI 293 Query: 222 GLVTFSNKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + FS +F L ++ +IK + +TN+ L A + D R Sbjct: 294 AIRPFSTSNYLYFSLNDFKTKNIINEIKNMPYNSGNTNTADALDAA---LTDYGTDR--- 347 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 K++V +TDG + S + + K R +AIG+ E Sbjct: 348 ----PESVKVMVTITDGASNSF---LSTSAAADRVKNDLRNIQSFAIGVSGANMAELEAI 400 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +++ F I +++ K Sbjct: 401 AITDKHVFMLNGWAD----FEPIKSNLLQKVC 428 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 69/206 (33%), Gaps = 24/206 (11%) Query: 133 SRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + P T PW + + + + + + +++ +LDVS SM Sbjct: 137 PQAEEPFSITTEVSSAPWSPQHQLVHIGLQGK-TLAIEELPPSNLVFLLDVSGSMN---- 191 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 ++ + + +++++ V V +G + + I Sbjct: 192 -QPNRLPLLKEGFKLLVDQLTEQDTVAIAVYAGAAG----VVLPPTPGNEKQKIIAAIDG 246 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQS 308 L G ST G+K AY M ++ ++ TDG+ N+ D + Sbjct: 247 LQAQG-STAGGEGIKLAYELATRMLSEGKNNR---------VILATDGDFNVGVSSDAEL 296 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + RG + +G + + Sbjct: 297 VRLIESYRDRGIYLTVLGFGMGNYKD 322 >gi|57956|emb|CAA79152.1| collagen alpha 1 chain type VI [Mus musculus] Length = 583 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 60/170 (35%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 167 CTCGPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAGV 220 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + +K L T + L+YA +++ Sbjct: 221 VQYSHNQMQEHVTLRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYARDRLL------- 273 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 274 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCG----ADIQVVSVGIK 315 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 56/165 (33%), Gaps = 20/165 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + + V+ +V +S + Sbjct: 382 SPADITILLDSSASVGSH---NFETTKVFAKRLAERFLSAGRADPSQD-VRVAVVQYSGQ 437 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + +T+ L Y R + Sbjct: 438 GQQQPGRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYREASS 488 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG +T E + EA++ G ++ + + Sbjct: 489 GATKKRVLLFSDGSQGATAEAIE--KAVQEARRAGIEIFVVVVGP 531 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 71/217 (32%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N G+ F+ + Sbjct: 88 IDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L + + T G+ A ++ D + K Sbjct: 140 TQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRLKDSKAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------------------- 329 +I+ +TDG N K D L AK G VY IG+ Sbjct: 189 VIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPVRVGGTTQYINTP 246 Query: 330 --IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 VEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDK 283 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 54/386 (13%), Positives = 124/386 (32%), Gaps = 38/386 (9%) Query: 11 FFYNYKGGMTILT-AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G +++ A+ +P++ L I+ + + +D + + A ++ Sbjct: 1 MLKQHTGSVSLSFLALLIPLVVLSAATIMIGFQVQLSSRAMQ-AVDAASLACAFADYSDP 59 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG-----FVNDIDDIVRSTSLDIVVVPQN 124 + N + +K+ + EL F + ++ V + Sbjct: 60 SVNQAYLEYYQPNVKLVKSEIYSASGCELNMGYQLTGLFSSLKFAQASYSAQSGSVEQAH 119 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS--- 181 S S ++ L + + +T +++ Q + ++D + +S Sbjct: 120 VNQSASVTPT-EMTLVLDISSSMAGSIDTLKSILTRAIERIEQDNVQIDGRRAISISIVP 178 Query: 182 -------RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--- 231 R+ + D + ID K + + + +++ + V+ E Sbjct: 179 FSDGVSARNADWLDDKGVFCIDGLTKESGGSVLVNETVQNLDRIHSEKAVSHRAPDEFLA 238 Query: 232 ------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L +S ++ I L+ G T S G+ + Q+ N Sbjct: 239 DCSASATLVPLTDNMSEVKTAINALTTTG-GTRSYQGVIWGARQLIPRWRQEWGYNPYSL 297 Query: 286 NYKKIIVFMTDG-ENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRVIRS-----HEFLRA 338 K+ ++ MTDG ++ +D C+ A + + IG V S + A Sbjct: 298 APKQKLILMTDGVDSGYVLDDLIDAGLCDRLANEFAIELNFIGFNVQDSRLAQFQSCINA 357 Query: 339 CASP---NSFYLVENPHSMYDAFSHI 361 + + N + + FS I Sbjct: 358 ANTDGIKGQVFSATNTEKLDEYFSKI 383 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 80/219 (36%), Gaps = 35/219 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +KV Q D + + +V+D S SM + D + +K+ N + V Sbjct: 75 EIKVFRQEDVPISLGLVIDTSASMSNKRDRVNSAALAMVKASN----------PEDEVF- 123 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++FS EE F+ + S +++ L K G + ++ A + D H Sbjct: 124 --VISFS---EEAFITQDFTSDVKQLESSLRKLGSKGET--AMRDALSLGLD------HL 170 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC- 339 KK++V +TDGE+ Q+ A I+Y IG+ + + Sbjct: 171 RAPARKDKKVLVVITDGEDN--SSIQKQENLIRAAHLSNVIIYGIGLLAAEAPASAQRAK 228 Query: 340 --------ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A+ + EN + I +I + + Sbjct: 229 ASLDVLTLATGGRSWYPENVADIEKITPEIAHEIRNQYV 267 >gi|326504154|dbj|BAK02863.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 651 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 87/228 (38%), Gaps = 43/228 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I T + + A +D++ VLDVS SM +K+ + +++ +++ + Sbjct: 151 ITGSGTEAAGDDPAQRAPVDLVTVLDVSSSMHG------SKLALLKQAMRFVIDILGPDD 204 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +V+FS++ L G + R ++ L+ TN GL+ A Sbjct: 205 ------RLSVVSFSSRARRVTRLTRMSDAGKALCVRAVESLTAR-TGTNIAEGLRTAAKV 257 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK---------EDQQSLYYCNEAK---- 316 + R+H N +V ++DG++ T + +L + A+ Sbjct: 258 L----DERRHRNGVSC-----VVLLSDGQDNYTPMRQAFGRGLPNYAALLPPSFARTGTG 308 Query: 317 --KRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 R V+ G + A A+ +F +EN + DAF+ Sbjct: 309 AGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENEAVIQDAFAQ 356 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 62/188 (32%), Gaps = 40/188 (21%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFD---------------SSITKIDMAIKSINA 204 S ++ D +D+++ LD+S SM + D +T++D A + Sbjct: 111 SVLREQRSDDKGIDIVVALDLSGSMRAILDARASDLPGQPKLPRGKRLTRLDTAKLVLQD 170 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTP 261 + + + G+V F L + + ++ G +T Sbjct: 171 FISRRRTD-------RLGVVVFGKAAYVLSPPTLDYHLLTQMVSQMTLNVIDGSATAIGD 223 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGA 320 L A ++ K+++ +TDG++ S Y A GA Sbjct: 224 ALGTAVARLRRSDAQS-----------KVVILLTDGDSN---AGAISPEYATHLATSLGA 269 Query: 321 IVYAIGIR 328 VY I I Sbjct: 270 KVYTIQIG 277 >gi|118090156|ref|XP_420539.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 606 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 71/210 (33%), Gaps = 19/210 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 DM VLD S S+ + ++ S + + V+ ++ + Sbjct: 147 QPSCHGAFDMYFVLDKSGSVAQNWHEIFDFVNQLDGS--GFVRLTERF--VSPKMRLSFI 202 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ L ++ +K LS+ T GLK A QI R Sbjct: 203 VFSSQAHVIMPLTGDREKIKEGLKNLSEVKPAGDTYIHEGLKQANMQIEKQGASR----- 257 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + II+ +TDG+ + + +++ GA VY +G+ + R + Sbjct: 258 ----FSSIIIALTDGKLDG-QIPLYAEKEAKISRELGARVYCVGVLDFVQAQLERIADTK 312 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + V + A I ++ + Sbjct: 313 EQVFPVTGG---FQALKGIINSVLKQSCTE 339 >gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis] Length = 1225 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 67/197 (34%), Gaps = 16/197 (8%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D+S SM TK+ K+++ +L ++ N V S V + Sbjct: 277 DVVFVIDISGSMYG------TKMKQTKKAMHVILSDLHQDDFFNIVTFSDTVNVWKPSQS 330 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + + G T+ L A +F+ D +I+ Sbjct: 331 IQATPQNIKKAKDYVSKMEADGW-TDINAALLAA-ASVFNHSSPMAGKIMRDQRIP-LII 387 Query: 293 FMTDGENLS-TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS------F 345 F+TDGE S + L +A K ++ + + LR + N + Sbjct: 388 FLTDGEPTSGVTTGSRILSNAQQALKGTISLFGLAFGDDADYGLLRRLSLENRGVARRIY 447 Query: 346 YLVENPHSMYDAFSHIG 362 + + + I Sbjct: 448 EDADATLQLKGFYDEIA 464 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 13/125 (10%), Positives = 38/125 (30%), Gaps = 2/125 (1%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M + ++ + G ++TA+ +P++FL + + +++ + + +D + + Sbjct: 1 MIKVLTAMQRLLKDQSGNFGLMTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIK 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV--NDIDDIVRSTSLDI 118 A + N + + N + + T Sbjct: 61 AVRSYGEGESENAVRTEANRLFFANFQTPSATDGYNSASPESPAVEFTFSETGQETRASA 120 Query: 119 VVVPQ 123 Q Sbjct: 121 SYAAQ 125 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 60.2 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 61/452 (13%), Positives = 135/452 (29%), Gaps = 108/452 (23%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F ++ G I+TA+ + + G ++ + ++T L++ D + V + T + Sbjct: 2 LLRFIFDRSGNFGIMTALLVVPLLGAAGTAVDFASALSLRTELYAAADAAAVGSITP-TS 60 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E G + + F + G ++DI V + + Sbjct: 61 EAAAQANTMSGDGSLTLGKSEAQKIFFSQMSKKQG--------DAPVTVDISVQKKGDTL 112 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-- 185 S + +P F + + + +T + QT + +D ++LD + SM Sbjct: 113 SSTVSFNATMPTTFMQVMGFD------EIAVTGAATAQYQTPSYMDFFMLLDNTPSMGVA 166 Query: 186 ----------------------------------------------SFFDSSITKIDMAI 199 + + +ID+ Sbjct: 167 ATTDDITAMKKATANGHDGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVA 226 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSN--------KIEEFFLLEWGVSHLQRKIK--- 248 ++ A++ + K + + + T K+ + L + + + Sbjct: 227 SAVKALMAKAKDTQSMPSQFRVAAYTSGKTAQDAKAAKLFKVSDLNYDLGAVAAAANMIK 286 Query: 249 --YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + ++ A I T +A+ +KI+ F+ DG S K Sbjct: 287 LMSIPYQNYYSDQQTSFDEALKGIEGEIKGNIGTGTSNADRQKIVFFVADGVGDSYKPTG 346 Query: 307 QS---------------LYYCNEAKKRGAIV---YAIGIRVIRSH--------------E 334 + YC + K RG V Y + + + Sbjct: 347 CTSPKGANGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETRIAA 406 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + CA+P ++ V + +A + + IV Sbjct: 407 KMEECATPGFYFAVSPTEGIEEAMEALFRKIV 438 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 36/204 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q ++++VLD S SM+S + K+ M++ + + +VTF Sbjct: 109 QASGANNIVMVLDTSGSMQSSDPDNQ-----LFKAAADMVQRMDSDMN------IAVVTF 157 Query: 227 SNKIEEFFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ L S + +K+ + T L+ +Q+ Q Sbjct: 158 HDQTNVLQPLTELSSQSVKDEVVKKLLQFPRTDGGTRIDLALQAGLDQLQANQMANS--- 214 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRAC 339 +V M+DG D K+ IV+ +G+ L+ Sbjct: 215 --------TVVLMSDG-----YSDLDVPAALAPYKQNQVIVHTVGMSQIDADGTALLQKI 261 Query: 340 A--SPNSFYLVENPHSMYDAFSHI 361 A + S++ VE+ M F I Sbjct: 262 AAETGGSYFNVEHADQMTGIFGQI 285 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 68/216 (31%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + + + I+ + + E + + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFTLIQHVVSEFIERRKGD------RIGLILFADH 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + A + M Sbjct: 139 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN---------- 188 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF---------- 335 ++++ +TDG N + + + A R +Y +G+ + Sbjct: 189 -RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSM 245 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + I K Sbjct: 246 DLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDK 281 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 68/216 (31%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + + + I+ + + E + + GL+ F++ Sbjct: 85 DLMLAVDLSGSMQIEDMVINGKVVDRFTLIQHVVSEFIERRKGD------RIGLILFADH 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + A + M Sbjct: 139 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKMDESN---------- 188 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF---------- 335 ++++ +TDG N + + + A R +Y +G+ + Sbjct: 189 -RVLILLTDGSNNA--GNIEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSM 245 Query: 336 ------LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ N + + I K Sbjct: 246 DLDENQLKHIADVTHGRYFRARNSQELEQIYQEIDK 281 >gi|113476847|ref|YP_722908.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167895|gb|ABG52435.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 477 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 69/196 (35%), Gaps = 29/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++++D S SM + + ++ + +E L + +V FS+ + Sbjct: 48 VVLLIDTSSSM---WGGKLPEVQA---AATGFVERQNLTVNN-----LAIVEFSSNSQVL 96 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L++ I L+ G TN + GLK + + + I+ Sbjct: 97 TNFDADKTELKQAIANLTPSG-GTNLSQGLKTVASLLRNSNTPN-------------ILL 142 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 TDG+ + S E ++ G + +G S+ +P+ + N Sbjct: 143 FTDGQPNDPRA---SKSIAREIREAGINLVTVGTGDANSNYLTSLTENPDLVFFA-NSGE 198 Query: 354 MYDAFSHIGKDIVTKR 369 + AF K I Sbjct: 199 IDQAFRAAEKAISQLS 214 >gi|313212379|emb|CBY36366.1| unnamed protein product [Oikopleura dioica] Length = 955 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 69/200 (34%), Gaps = 30/200 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + V + + + TFS K Sbjct: 496 DIVFLVDESGSVG------APNFQLMKSFLAQFASVVDP-----DRTRIAVRTFSTKTRL 544 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +F L+ + +Q I L TN++ L A D D N KI++ Sbjct: 545 YFGLKEPKNKVQ-LINELPYLTGWTNTSLALLGALRDFED-----------DRNSVKIVI 592 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSFYLVENP 351 +TDG + + ++ K + + I V + F L AS N Sbjct: 593 TITDGISQDPAA---TKRAADKLKADPRNIQSFAIGVAGAQMFELENIASSNDHVE---M 646 Query: 352 HSMYDAFSHIGKDIVTKRIW 371 +DAF+ I ++ K Sbjct: 647 LENFDAFTKILSTVLDKVCT 666 >gi|297678516|ref|XP_002817116.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 2 [Pongo abelii] Length = 1899 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E + P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFDIGP---KRVQVALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSRIVDDLTINLC 212 >gi|31615583|pdb|1MF7|A Chain A, Integrin Alpha M I Domain Length = 194 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 81/206 (39%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 7 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 63 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 64 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 111 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 112 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 169 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIW 371 + V N ++A I + K Sbjct: 170 FQVNN----FEALKTIQNQLREKIFC 191 >gi|327262910|ref|XP_003216266.1| PREDICTED: integrin alpha-2-like [Anolis carolinensis] Length = 1302 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 81/212 (38%), Gaps = 35/212 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +D+++V D S S+ + D + ++ + + P Q GL+ + N Sbjct: 292 SAVDVVVVCDESNSIYPW--------DAVKAFLKKFVQGLDIGPTK---TQVGLIQYGNN 340 Query: 230 IEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + +Q + G TN+ ++YA F + + + Sbjct: 341 PREVFNLNTYTNKEDAVQAMSETYQNGGEYTNTFRAIEYARRYAFSKE------SGGRPS 394 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR--- 337 K++V +TDGE+ + Q+ + CN + + I + + + ++ Sbjct: 395 ASKVMVVVTDGESHDGSKLQEVIAKCN---EDNITRFGIAVLGYLIRNELDTKNLIKEIK 451 Query: 338 ---ACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + F+ V + ++ + +G+ + Sbjct: 452 GIASLPTSKYFFNVSSEAALLEEAGTLGERLF 483 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 69/175 (39%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ N +V FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFN------IVGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + I ++S G T+ L+ + D H + ED + Sbjct: 344 KDHLVSVTPNSIRDGKVYIHHMSPSG-GTDINGALQRGIQLLND---YVAHNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEFLRACA 340 ++VF+TDG+ + +L N ++ ++ +GI + L + Sbjct: 400 -LVVFLTDGKPT--VGETHTLKILNNTREAARGRVCIFTVGIGADVDFKLLEKLS 451 >gi|149019071|gb|EDL77712.1| procollagen, type XII, alpha 1, isoform CRA_c [Rattus norvegicus] Length = 1721 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 70/205 (34%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A D+++++D S S+ I+ ++E ++ P VQ L +S Sbjct: 30 TRAEADIVLLVDGSWSIG------RANFRTVRSFISRIVEVFEIGP---KRVQIALAQYS 80 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L L + + L G +T + L + Q F Q Sbjct: 81 GDPRTEWHLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKTQ------AGMRP 134 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +KI V +TDG K + K G ++AIGI+ E P+ Sbjct: 135 RARKIGVLITDG-----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDI 189 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 N +++ S I D+ Sbjct: 190 -HAYNVAD-FESLSKIVDDLTINLC 212 >gi|94264779|ref|ZP_01288557.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] gi|93454768|gb|EAT05023.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] Length = 771 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 40/233 (17%), Positives = 80/233 (34%), Gaps = 34/233 (14%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + N ++ + S S+ + I+LD S SM I A ++I+ Sbjct: 237 LLERDPNGGYVALASFSPRLPPSEQPIPTSLAILLDCSGSMAG------DSIAQAKQAIS 290 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTN 258 ML ++ N L+ F ++++ F + ++ L+R I+ + T Sbjct: 291 DMLNLLRPEDYCN------LIMFGSEVKSVFPCQVAADKTNITTLRRAIRAIDADMGGTE 344 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTE--DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 ++ A + M + + E A + I+ +TDG+ K Sbjct: 345 ----MQKALVETLKMSPIYKPPEVEVVPARISRNILLITDGQVWGDK------QILRRMA 394 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 K V+ +G+ FL AS + + +G+ I + Sbjct: 395 KSDHRVFTVGVGGAVCEAFLHGLASQS-----GGACELVAPNEEMGEKIARQS 442 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 71/217 (32%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N G+ F+ + Sbjct: 88 IDIMLAIDVSTSMLAE-DLKPNRLEAAKDVAAEFING---RPNDN----IGITLFAGESF 139 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + L + + T G+ A ++ D + K Sbjct: 140 TQCPLTVDHAVLLNLFQGIKCDIIEDGTAVGMGIANAVTRLKDSKAKS-----------K 188 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------------------- 329 +I+ +TDG N K D L AK G VY IG+ Sbjct: 189 VIILLTDGTNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPVPVGGTVQYINTP 246 Query: 330 --IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 VEIDEKTLTQIAGITDGNYFRATSNSKLKEVYEEIDK 283 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 84/241 (34%), Gaps = 27/241 (11%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 + + F N G + + FLPI ++ + E+S K L ++++ + A Sbjct: 11 GNSFQFFIKNENGTILMSFIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATL--ALT 68 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + N ++ +++K ++ + + L + F + + I + Sbjct: 69 VENNNIPSSEQEVKNNILISSFAHAY-------LPEETFSEPV--------ITINSSASH 113 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRS 183 Y Y P KF + I + ++ K N+ T D++ V D S S Sbjct: 114 MDYHADITMSY--PAKFLNKAFNLISISDIKLDESAIAKKNTSITAIPTDVVFVTDYSGS 171 Query: 184 M-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 M F + I D++ I A+ K + + + + + F WG Sbjct: 172 MNRDFDGTDIDSTDISKVRIVALRRIFKNLHNEIQQNE------NINLVGFVPFTWGTKR 225 Query: 243 L 243 Sbjct: 226 T 226 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 30/89 (33%), Gaps = 18/89 (20%) Query: 284 DANYKKIIVFMTDGE-NLSTKEDQQSLY----------------YCNEAKKRGAIVYAIG 326 + N +KI+V ++DG+ N +T + C K + I Sbjct: 410 NNNNRKIMVILSDGDDNDNTHAGDNRINKDAPYLNITKKLIDNGMCERIKDNDIRMVFIA 469 Query: 327 IRV-IRSHEFLRACASPNSFYLVENPHSM 354 I + + C +FYL N + Sbjct: 470 IGYTPDENIDWKKCVGEGNFYLASNAQEL 498 >gi|332256817|ref|XP_003277513.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Nomascus leucogenys] Length = 3172 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1572 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1683 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1737 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1738 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1792 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1793 AFRVGNVQELSE 1804 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 70/202 (34%), Gaps = 24/202 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + A D++ ++D S ++ + + + +++ + + + L Sbjct: 27 PDVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFAL 77 Query: 224 VTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQHC 280 V F+ FLL + + I +S G + + GL+Y + G R Sbjct: 78 VQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA-- 135 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++IV +TDG E K V++IG+ + Sbjct: 136 ---GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFSIGVEDADEGALKEIAS 187 Query: 341 SP--NSFYLVENPHSMYDAFSH 360 P + +EN S++D + Sbjct: 188 EPLNMHVFNLENFTSLHDIVGN 209 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 83/212 (39%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V++ D++ +LD S ++ T + ++ + + N+ Sbjct: 625 SGTPEVHANKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDV---GNDN 672 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y + Sbjct: 673 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSY----VHANHF 728 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ +T G+ + L N + G + + +G E Sbjct: 729 TEAGGSRIREHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAEL 783 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 784 EQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 815 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 S+ + + I++ F + ++V + SSV Sbjct: 171 SIGVEDADEGALKEIASEPLNMHVFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 230 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 231 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLINLLEKL---PIGTQQIRVGVV 281 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 282 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 337 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 338 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 390 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 391 TDDNLVFT------VPEFRSFG-DLQEKLLPY 415 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1368 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1427 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1428 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1478 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1479 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1536 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1537 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1589 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1590 VFTVREFRELPNIEERIMNSF 1610 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1026 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ + + + Sbjct: 1076 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNMLVSSAGSRITEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 1132 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1168 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 76/213 (35%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +D+S SM + D ++++A + + + + + GL F Sbjct: 83 KDTEGIDIMLTMDISASMLTE-DVFPNRMEVAKEVASEFISS-RPSDN------IGLTIF 134 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + +++ + T GL + +++ D + Sbjct: 135 AGEAFTQCPMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRLKDSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---- 335 K+++ +TDG N + AKK G VY IG+ + Sbjct: 193 ---------KVVILLTDGSNNVGSISPMTAATI--AKKFGIRVYTIGLGRETGEDIGAID 241 Query: 336 ---LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + FY ++ + + I K Sbjct: 242 YKTLQDIAVLTNGEFYRAQSQAELSKIYQDIDK 274 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGADPDKDALQ 241 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGADPDKDALQ 241 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 4 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 56 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 57 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 107 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 108 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGADPDKDALQ 154 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 69/212 (32%), Gaps = 44/212 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVV--QSGLVTFSNKI 230 MM+ +D+S SM I + + + +N + ++ D + V + GL+ F++ Sbjct: 90 MMLAVDLSGSM------KIDDMQLNGRQVNRLTMTKSVVYDFIQRRVGDRIGLILFADTA 143 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + + G T + A + + + Sbjct: 144 YVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDEREESNN---------- 193 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHE 334 +++ +TDG+N + + A +G VY IG+ + S E Sbjct: 194 -VLILLTDGQNTAGNITPEQAK--ELAISKGVKVYTIGVGADKMLIQSFFGSRQINPSQE 250 Query: 335 F----LRACASP--NSFYLVENPHSMYDAFSH 360 L A+ ++ N + + Sbjct: 251 LDEGMLTNIATSTGGQYFRARNAQELQAIYQQ 282 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 68/218 (31%), Gaps = 33/218 (15%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + D + + +V+D S SM A+ + + VK + V Sbjct: 176 PQQITSFRHEDIPVALGVVIDNSGSMRDKRP--------AVNAA--TINLVKASNPEDEV 225 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V F++ V+ L+ ++ G + Y+ + Sbjct: 226 F---VVNFNDDYYLDQDYTDSVAKLKEALEKYETRGGTAL--------YDAVLASN---A 271 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRS 332 H KK++ +TDGE + ++ G +Y IGI R+ Sbjct: 272 HLMKAPKLEKKVLFIVTDGE-DDASLNTLEQTIRKVQQENGPTIYTIGILDETGGHKRRA 330 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 LR A + + ++ + I DI + Sbjct: 331 QRALREMAESTGGVAFFPQSLDEVSRITQQIAHDIRNQ 368 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGADPDKDALQ 241 >gi|326530406|dbj|BAJ97629.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 657 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 87/228 (38%), Gaps = 43/228 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I T + + A +D++ VLDVS SM +K+ + +++ +++ + Sbjct: 157 ITGSGTEAAGDDPAQRAPVDLVTVLDVSSSMHG------SKLALLKQAMRFVIDILGPDD 210 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +V+FS++ L G + R ++ L+ TN GL+ A Sbjct: 211 ------RLSVVSFSSRARRVTRLTRMSDAGKALCVRAVESLTAR-TGTNIAEGLRTAAKV 263 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK---------EDQQSLYYCNEAK---- 316 + R+H N +V ++DG++ T + +L + A+ Sbjct: 264 L----DERRHRNGVSC-----VVLLSDGQDNYTPMRQAFGRGLPNYAALLPPSFARTGTG 314 Query: 317 --KRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 R V+ G + A A+ +F +EN + DAF+ Sbjct: 315 AGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENEAVIQDAFAQ 362 >gi|119494080|ref|ZP_01624623.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] gi|119452182|gb|EAW33385.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] Length = 608 Score = 60.2 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 84/230 (36%), Gaps = 27/230 (11%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 + L + + +++ + + + L++ +V+D S SM Sbjct: 2 QANYSLSYPLIATNTPATIDLILNFNAQTQAETSPRRPLNLSLVIDRSGSMAG------Q 55 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS 251 + AIK+ ++E + V+ +V + ++ E + + + ++I + Sbjct: 56 SLRYAIKAAQQLVESLTADDIVS------VVIYDDQPETILTPQTVEDKAAICKQIGRIR 109 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLY 310 G + S + ++ ++ N ++ +TDG+ N+ + + Sbjct: 110 AGGCTNLSGG-------WLMGCDCVKSRQTSDRLNR---VLLLTDGQANMGITDPKVITK 159 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAF 358 + G I +G + + L + A+ +FY +++P + F Sbjct: 160 TAQNQAETGIITTTLGFGSYFNEDLLISMADAAGGNFYFIQSPDDVAQVF 209 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 76/207 (36%), Gaps = 30/207 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 V T ++ ++D+S S+ S T+ + +IN +L + I + V+ G Sbjct: 1144 DVGGVTSGSYNLTFMIDMSGSI------SGTEFQLMKDAINNLLAKFSGISQLQ--VEIG 1195 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ + + V+ Q+ + L++ G TN L N + + + H Sbjct: 1196 TFADNSNVVGTYS---SVTAAQQAVSNLTRSGGGTNYQAALTT-LNTMMTVDPVADH--- 1248 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQS-----LYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K + F+TDGE + + N G ++ A+GI V F Sbjct: 1249 ------KYVYFLTDGEPTVGSWTNSTQIANGMAALNALTAPGVVINAVGIGVPSGASFGN 1302 Query: 338 AC----ASPNSFYLVENPHSMYDAFSH 360 +P+++ V+N + Sbjct: 1303 NLNAIDNTPDNYLAVDNFDDLSSGLGS 1329 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 63/180 (35%), Gaps = 27/180 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P + + S T+ +++++VLD S S+ T + + ++ ++ Sbjct: 798 DGPSAVADTITSGTNGTVNLVLVLDSSGSIGD------TNMQVIKDAVTNLMNSY----- 846 Query: 215 VNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N++V+ LV F S ++ W G T + G Y+ Sbjct: 847 GNSLVKVMLVDFAGSATVKSVGSQVWLTKD--------QATGQLTTISSGGSTDYDDALQ 898 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL---YYCNEAKKRGAI-VYAIGIR 328 N + F++DG ST + + + N ++G YA+GI Sbjct: 899 AVQDNYGTPPSADNT--FVFFISDGVPSSTDDAINTEERNEWTNFLTQKGIDGAYAVGIG 956 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 73/207 (35%), Gaps = 25/207 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + ++ ++LD S SM++ + +T+ D+ + ++ + K Sbjct: 77 EIEPKKGYEIALILDASESMKAKGFDEKNRDLTRFDVVKEIVSNFISSRKNDN------- 129 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G+V F L + + L+ + L G+ + +F+ + Sbjct: 130 MGIVVFGAYSFIASPLTYDSNILKGVVSNL---------YIGMAGKFTALFESLAQGVNL 180 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRA 338 + KI + +TDG N E + A K+G VY IGI + + L Sbjct: 181 LKTSKSKTKIAILLTDGYNTPDSEFPFDAAI-DFANKQGVKVYPIGIGKSDEYNQKMLEK 239 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGK 363 A + + N + ++ I + Sbjct: 240 IAEQTGGVAFGASNASELAIVYAKINE 266 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 64/168 (38%), Gaps = 26/168 (15%) Query: 173 DMMIVLDVSRSM---ESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+ I LD+S SM + + D IT+++ + ++ A + + L+ F + Sbjct: 92 DLAIALDLSGSMVRDDFYLDGQPITRLEA-VTTVGAEFARRRAGD------RVALIVFGS 144 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + R+I+ + G +TN + L A ++ Sbjct: 145 EAYYAAPFTFDTEAIARRIEEATIGISGRATNISDALGLALKRMAGSDADT--------- 195 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ ++DG N + + + A + G V+ I + E Sbjct: 196 --RVVILLSDGANNAGATNPRG--VAQLAAQMGVRVHTIAMGPKSVDE 239 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 70/199 (35%), Gaps = 25/199 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM---ESFFDSSITK 194 P ++ W I P + + R D+ IVLD+S SM + D Sbjct: 58 PRNAALWLIWALVLLAISGPRDLAPVSALKVSGR-DLAIVLDLSGSMVRDDFNLDDRAVT 116 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--K 252 A+K++ A + + LV F ++ + + R+I+ + Sbjct: 117 RLEAVKAVGADFARRRAGD------RLALVVFGSEAYFASPFTFDTESVARRIEEATIGI 170 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G +T+ + GL A ++ ++++ ++DG N + + + Sbjct: 171 SGRATSISDGLGLALKRLSTSTATS-----------RVVILLSDGINNAGATNPRG--VA 217 Query: 313 NEAKKRGAIVYAIGIRVIR 331 A + G V+ I + Sbjct: 218 ELAARYGVRVHTIALGPKD 236 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 52/336 (15%), Positives = 117/336 (34%), Gaps = 51/336 (15%) Query: 44 FFMKTVLHSMIDRSLV---------HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF 94 + K ++ + + + I N +R D + + + + Sbjct: 176 YLSKGKPQAVRCSAAITGKNQVRRCQGGSCITNGKCVIDRVTGLYKDNCVFVPDP-HQNE 234 Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 + + N +N + + + + P ++ + S +++ + F + Sbjct: 235 KASIMFNQNINSVVEFCTEKNHN-QEAPNDQNQRCNLRSTWEVIQESEDFKQTTPMTAQP 293 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P S +++ + + +VLD S SM + +++ ++ L + Sbjct: 294 PAPTFSLLQI-----GQRIVCLVLDKSGSM-----LNDDRLNRMNQASRLFL-----LQT 338 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSK-FGVSTNSTPGLKYAYNQIF 271 V G+VTF + L+ S R IK+L T+ GL+ A+ I Sbjct: 339 VEQGSWVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVI- 397 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI 330 +++ IV +TDGE+ ++ C + K+ GAI++ + + Sbjct: 398 ----KKKYPTDGSE-----IVLLTDGEDN-------TISSCFDLVKQSGAIIHTVALGPA 441 Query: 331 RSHEF--LRACASPNSFYLVENPHS--MYDAFSHIG 362 + E L Y + + + DAF+ + Sbjct: 442 AAKELEQLSKMTGGLQTYSSDQVQNNGLVDAFAALS 477 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 30/213 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+DVS SM++ D + T++ A + + +++ + GLV+F+ Sbjct: 88 VVLVIDVSLSMQA-TDVAPTRLAAAQAAAKSFADQLTPGIN------LGLVSFAGTAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + +++ + L ST + + A I TE + IV Sbjct: 141 VSPTTDRTAVKQAVDGLK-LSESTATGEAIFAALQSIDSFSRTVAASGTEGPPPAR-IVL 198 Query: 294 MTDGENLSTKEDQQ-----SLYYCNEAKKRGAIVYAIGIR---------------VIRSH 333 M+DG+ D + S +A V I + Sbjct: 199 MSDGKQTVPGPDGENDPRGSFTAAKQAAAEKIPVSTISFGTDYGTIDIEGGRTRVAVDDA 258 Query: 334 EFLRACA-SPNSFYLVENPHSMYDAFSHIGKDI 365 + + S F+ + + +S +G+ I Sbjct: 259 SMQQIASLSGGQFFTAASESQLRQVYSELGEQI 291 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 84/204 (41%), Gaps = 37/204 (18%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 N I I T ++ VP++ ++ + + K PL T+ R ++++ Sbjct: 217 NSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPL---------TSGRQNSGTNQTNMQ 267 Query: 164 VNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 SQ+ A +D++ VLDVS SM TK+ + +++ +++ + + Sbjct: 268 PTSQSCRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQSLGPCD------RLS 315 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +++FS+ F L G + + L G TN GL+ +G + Sbjct: 316 VISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNG-GTNIAEGLR---------KGAKV 365 Query: 279 HCNTEDANYKKIIVFMTDGENLST 302 + + N I+ ++DG++ T Sbjct: 366 MLDRKWKNPVSSIILLSDGQDTYT 389 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 84/204 (41%), Gaps = 37/204 (18%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 N I I T ++ VP++ ++ + + K PL T+ R ++++ Sbjct: 217 NSIGAIEVKTYPEVSAVPRSTSHNNFTVLIHLKAPL---------TSGRQNSGTNQTNMQ 267 Query: 164 VNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 SQ+ A +D++ VLDVS SM TK+ + +++ +++ + + Sbjct: 268 PTSQSCRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQSLGPCD------RLS 315 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +++FS+ F L G + + L G TN GL+ +G + Sbjct: 316 VISFSSTARRLFPLRRMTDTGRQQALQAVNSLVSNG-GTNIAEGLR---------KGAKV 365 Query: 279 HCNTEDANYKKIIVFMTDGENLST 302 + + N I+ ++DG++ T Sbjct: 366 MLDRKWKNPVSSIILLSDGQDTYT 389 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 63/166 (37%), Gaps = 22/166 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ +LD S S+ S +I +KSI V + + G++TFS Sbjct: 107 PIDIAFLLDASGSIG---RRSWEEIKNFVKSI------VDMCDISDQGTHVGIITFSTDP 157 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +++ +E +++R I L + T L A +F + Sbjct: 158 VIDIAFDKYKGVEMNAVNIKRDIDELRRKKGYTFIDKALTLADKSLFTQE------AGMR 211 Query: 285 ANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRV 329 + +K+ V M+DG + + N K +G VY +GI Sbjct: 212 EDSQKVAVLMSDGIQTKDRGPFTPTDIAANPLKMKGVQVYTVGIGA 257 >gi|13928960|ref|NP_113879.1| integrin alpha-D precursor [Rattus norvegicus] gi|48428189|sp|Q9QYE7|ITAD_RAT RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|6648592|gb|AAF21241.1|AF021334_1 alpha D integrin [Rattus norvegicus] Length = 1161 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 34/212 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S+ ++ +K++ + L+ + Sbjct: 146 CPRQEMDIAFLIDGSGSIN---QRDFAQMKDFVKALMGEFASTSTLFS--------LMQY 194 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F + L Q + + + T + G++ ++F N Sbjct: 195 SNILKTHFTFTEFKNILDPQSLVDPIVQLQGLTYTATGIRTVMEELF------HSKNGSR 248 Query: 285 ANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRAC 339 + KKI++ +TDG+ E + ++A G I YAIG + + + L Sbjct: 249 KSAKKILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQEPTALKELNTI 305 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 S + + V N + A I + + K Sbjct: 306 GSAPPQDHVFKVGN----FAALRSIQRQLQEK 333 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 70/186 (37%), Gaps = 27/186 (14%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + + + V+ +D++ V+DVS SM + K+ A ++ A Sbjct: 55 PNGDAQGRLDVTLVPQGAVSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAA 108 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLS------KFGVSTN 258 + K + N+ + L+ FS+ + E ++ +G S++ ++ + G TN Sbjct: 109 VNYFKSNYNQND--RFALIPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTN 166 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L A + D KK I+F+TDG Y + K R Sbjct: 167 YSAALSLAKSYFTD------------PTRKKYIIFLTDGMPTVLNTVDTITYREVKPKFR 214 Query: 319 GAIVYA 324 G Y Sbjct: 215 GGYQYT 220 >gi|149701450|ref|XP_001492635.1| PREDICTED: anthrax toxin receptor 2 [Equus caballus] Length = 488 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 30/224 (13%) Query: 154 IVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 +V+ + + D+ VLD S S+ + + I ++++ Sbjct: 22 LVLSGPGGLVSAQEQPSCRGAFDLYFVLDKSGSVANNW-----------IEIYDFVKQLT 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 V+ ++ + FS++ L + + ++ L T GLK A + Sbjct: 71 ERF-VSPQMRLSFIVFSSQATIILPLTGDRGKISQGLEDLKHVRPVGETYIHEGLKLAND 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI G++ II+ +TDG+ ++ GA VY +G+ Sbjct: 130 QIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKE-AKLSRSLGARVYCVGVL 179 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + V + A I I+ + Sbjct: 180 DFEQAQLERIADSKEQVFPVTGG---FQALKGIINSILAQSCTE 220 >gi|149037631|gb|EDL92062.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_c [Rattus norvegicus] Length = 2862 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 112/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 1518 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGI--GDRN 1575 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I R+ I P+ + ++P + + S P + Sbjct: 1576 IDRTDLQTITNDPR---LVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEK 1632 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1633 KKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNSD 1683 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + ++ I ++K G N+ G+++ + + + + D Sbjct: 1684 PTDEFFLRDFSTK-RQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 79/203 (38%), Gaps = 25/203 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + + A D++ ++D S + + + + + ++ ++E + + + Sbjct: 25 AQQQSDVKNGAAADILFLVDSS------WSAGKDRFLLVQEFLSDVVESLSVGDND---F 75 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGM 276 LV + FLL + I +S G S + GL+Y ++ + + G Sbjct: 76 HFALVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGS 135 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R ++IV +TDG+ E+ E K V+A+G+ L Sbjct: 136 RA-----ADGVPQVIVVLTDGQ----SEEDGFALPSAELKSADVNVFAVGVEDAD-ERTL 185 Query: 337 RACAS---PNSFYLVENPHSMYD 356 R AS + +EN S++D Sbjct: 186 REIASEPLSMHVFNLENVTSLHD 208 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 628 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 675 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y + Sbjct: 676 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSY----VHANHF 731 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 732 TEAGGSRIREHVPQLLLLLMAG-----PSEDVYLQAANALVRSGVLTFCVGTNQADKAEL 786 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 R +P+ YL+++ ++ + + + T Sbjct: 787 ERIAFNPSLVYLMDDFSALPALPQQLIQPLTT 818 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 86/243 (35%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P + T I T+ + + ++ + D+ Sbjct: 1376 LVKISLSPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADI 1435 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +M+ I ++ ++ + + P+ V+ G+V FSN + F Sbjct: 1436 VFLIDSSDAMKPDG------IAHIRDFVSRIVRRLNIGPNK---VRIGVVQFSNDVFPEF 1486 Query: 235 LLEWGVS--HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 1487 YLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 1543 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 1544 LFL------GGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREF 1597 Query: 352 HSM 354 + Sbjct: 1598 REL 1600 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 81/222 (36%), Gaps = 33/222 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S + +Q A D++ ++D S ++ D+ + +LE + + Sbjct: 226 AGDKESLKDITAQDSA--DIIFLIDGS------QNTGKANFDVIRDFLVNVLERLSV--- 274 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIF 271 N V+ G+V +S++ F L+ S + +K LS G N L + F Sbjct: 275 GNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANIGQALDFVVENHF 334 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R + + +++V ++ G + D A K+G V++ G+ Sbjct: 335 T----RTGGSRVEEGVPQVLVLISAGPSSDEIRDAVV------ALKQG-SVFSFGLGAQA 383 Query: 332 SHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + L+ A+ ++ F G D+ + + Y Sbjct: 384 ASRVELQHIATDDNLVFT------VPEFRSFG-DLQEQLLPY 418 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + P+ V+ LV +S++ Sbjct: 1027 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPNQ---VRVALVQYSDRTRP 1076 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R + Sbjct: 1077 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRI-----EEGVP 1131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 1132 QLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAI 1186 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 PTFREL----GTIQQVISERVIQLNR 1208 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 75/213 (35%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +D+ +LD S + F + + K I+ M+ ++ L PD + Sbjct: 2606 RRAAGSDVDIDLAFILDSSEATTLF------QFNEMKKYISYMVRQLDLSPDPKASQHFA 2659 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++G L + +++ + + ++Y Sbjct: 2660 RVAVVQQATYESVDNTSVPPVKVEFSLTDYGAKEKLLDFLSRRMTQLQGTMDLGNAIEYT 2719 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2720 IENIFES--------APNPRDLKIVVLMLTGEIPREQLEEAQRAIL-QAKCKGYFFVVLG 2770 Query: 327 IRVI-RSHEFLRACASPNSFY--LVENPHSMYD 356 I E + PN + ++ + + Sbjct: 2771 IGRKVNVKEVYSFASEPNDVFFKFMDKSTELNE 2803 >gi|149037630|gb|EDL92061.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_b [Rattus norvegicus] Length = 2867 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 112/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 1518 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGI--GDRN 1575 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I R+ I P+ + ++P + + S P + Sbjct: 1576 IDRTDLQTITNDPR---LVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEK 1632 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1633 KKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNSD 1683 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + ++ I ++K G N+ G+++ + + + + D Sbjct: 1684 PTDEFFLRDFSTK-RQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 79/203 (38%), Gaps = 25/203 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + + A D++ ++D S + + + + + ++ ++E + + + Sbjct: 25 AQQQSDVKNGAAADILFLVDSS------WSAGKDRFLLVQEFLSDVVESLSVGDND---F 75 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGM 276 LV + FLL + I +S G S + GL+Y ++ + + G Sbjct: 76 HFALVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGS 135 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R ++IV +TDG+ E+ E K V+A+G+ L Sbjct: 136 RA-----ADGVPQVIVVLTDGQ----SEEDGFALPSAELKSADVNVFAVGVEDAD-ERTL 185 Query: 337 RACAS---PNSFYLVENPHSMYD 356 R AS + +EN S++D Sbjct: 186 REIASEPLSMHVFNLENVTSLHD 208 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 628 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 675 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y + Sbjct: 676 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSY----VHANHF 731 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 732 TEAGGSRIREHVPQLLLLLMAG-----PSEDVYLQAANALVRSGVLTFCVGTNQADKAEL 786 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 R +P+ YL+++ ++ + + + T Sbjct: 787 ERIAFNPSLVYLMDDFSALPALPQQLIQPLTT 818 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 86/243 (35%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P + T I T+ + + ++ + D+ Sbjct: 1376 LVKISLSPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADI 1435 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +M+ I ++ ++ + + P+ V+ G+V FSN + F Sbjct: 1436 VFLIDSSDAMKPDG------IAHIRDFVSRIVRRLNIGPNK---VRIGVVQFSNDVFPEF 1486 Query: 235 LLEWGVS--HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 1487 YLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 1543 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 1544 LFL------GGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREF 1597 Query: 352 HSM 354 + Sbjct: 1598 REL 1600 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 81/222 (36%), Gaps = 33/222 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S + +Q A D++ ++D S ++ D+ + +LE + + Sbjct: 226 AGDKESLKDITAQDSA--DIIFLIDGS------QNTGKANFDVIRDFLVNVLERLSV--- 274 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIF 271 N V+ G+V +S++ F L+ S + +K LS G N L + F Sbjct: 275 GNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANIGQALDFVVENHF 334 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R + + +++V ++ G + D A K+G V++ G+ Sbjct: 335 T----RTGGSRVEEGVPQVLVLISAGPSSDEIRDAVV------ALKQG-SVFSFGLGAQA 383 Query: 332 SHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + L+ A+ ++ F G D+ + + Y Sbjct: 384 ASRVELQHIATDDNLVFT------VPEFRSFG-DLQEQLLPY 418 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + P+ V+ LV +S++ Sbjct: 1027 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPNQ---VRVALVQYSDRTRP 1076 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R + Sbjct: 1077 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRI-----EEGVP 1131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 1132 QLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAI 1186 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 PTFREL----GTIQQVISERVIQLNR 1208 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 75/213 (35%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +D+ +LD S + F + + K I+ M+ ++ L PD + Sbjct: 2606 RRAAGSDVDIDLAFILDSSEATTLF------QFNEMKKYISYMVRQLDLSPDPKASQHFA 2659 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++G L + +++ + + ++Y Sbjct: 2660 RVAVVQQATYESVDNTSVPPVKVEFSLTDYGAKEKLLDFLSRRMTQLQGTMDLGNAIEYT 2719 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2720 IENIFES--------APNPRDLKIVVLMLTGEIPREQLEEAQRAIL-QAKCKGYFFVVLG 2770 Query: 327 IRVI-RSHEFLRACASPNSFY--LVENPHSMYD 356 I E + PN + ++ + + Sbjct: 2771 IGRKVNVKEVYSFASEPNDVFFKFMDKSTELNE 2803 >gi|149037634|gb|EDL92065.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_e [Rattus norvegicus] Length = 2254 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 112/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 910 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGI--GDRN 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I R+ I P+ + ++P + + S P + Sbjct: 968 IDRTDLQTITNDPR---LVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEK 1024 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1025 KKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNSD 1075 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + ++ I ++K G N+ G+++ + + + + D Sbjct: 1076 PTDEFFLRDFSTK-RQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDE 1130 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1131 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1185 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1186 FRVGSVQELSE 1196 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 86/243 (35%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P + T I T+ + + ++ + D+ Sbjct: 768 LVKISLSPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADI 827 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +M+ I ++ ++ + + P+ V+ G+V FSN + F Sbjct: 828 VFLIDSSDAMKPDG------IAHIRDFVSRIVRRLNIGPNK---VRIGVVQFSNDVFPEF 878 Query: 235 LLEWGVS--HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 879 YLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 935 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 936 LFL------GGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREF 989 Query: 352 HSM 354 + Sbjct: 990 REL 992 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + P+ V+ LV +S++ Sbjct: 419 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPNQ---VRVALVQYSDRTRP 468 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R + Sbjct: 469 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRI-----EEGVP 523 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 524 QLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAI 578 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 579 PTFREL----GTIQQVISERVIQLNR 600 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 75/213 (35%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +D+ +LD S + F + + K I+ M+ ++ L PD + Sbjct: 1998 RRAAGSDVDIDLAFILDSSEATTLF------QFNEMKKYISYMVRQLDLSPDPKASQHFA 2051 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++G L + +++ + + ++Y Sbjct: 2052 RVAVVQQATYESVDNTSVPPVKVEFSLTDYGAKEKLLDFLSRRMTQLQGTMDLGNAIEYT 2111 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2112 IENIFES--------APNPRDLKIVVLMLTGEIPREQLEEAQRAIL-QAKCKGYFFVVLG 2162 Query: 327 IRVI-RSHEFLRACASPNSFY--LVENPHSMYD 356 I E + PN + ++ + + Sbjct: 2163 IGRKVNVKEVYSFASEPNDVFFKFMDKSTELNE 2195 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 39/255 (15%), Positives = 85/255 (33%), Gaps = 29/255 (11%) Query: 92 MSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CT 143 +F ++ + + S+ + + Y + P K Sbjct: 216 STFSADVDSASYSFVRRSLTAGAMPDPQSVRVEEMINYFPYDWPGPEKADQPFKATVTVM 275 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 PW +++ + + I + T +++ ++DVS SM+ K+ + + Sbjct: 276 PTPWNHDTQLMHVAIKG-YDIAPATAPHANLVFLIDVSGSMDE-----PDKLPLLKSAFR 329 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ ++K V+ V +G + S + I L G ST G+ Sbjct: 330 LLVNKLKADDTVSIVTYAGNAGTVLEPTRVAE----KSKILSAIDRLEA-GGSTGGAEGI 384 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 AY + ++ N ++ TDG+ N+ D+ E +K G + Sbjct: 385 AAAY-DLAKKAFVKDGVNR--------VMLATDGDFNVGPSIDEDLKRIIEEKRKDGIFL 435 Query: 323 YAIGIRVIRSHEFLR 337 +G ++ L Sbjct: 436 TVLGFGRGNLNDSLM 450 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 41/317 (12%), Positives = 106/317 (33%), Gaps = 43/317 (13%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGY 127 R + + + + +F ++ + N + + + ++ + + Y Sbjct: 128 ERYQHIDDNAIVQAAQHPISTFNIDVDTGSYSNVRRFLNAGSLPPADAVRVEELINYFRY 187 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--KVNSQTDARLDMMIVLDVSRSME 185 P T + ++ ++ + + +++ ++DVS SM+ Sbjct: 188 D-DPAPTNGQPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFLVDVSGSMD 246 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHL 243 + K+ + S+ ++ +++ + LVT+ + + + Sbjct: 247 A-----PDKLPLLQSSLKLLVRQLRAQD------RITLVTYAGNTSVVLPPTPGDQQGRI 295 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 I L + G ST G++ AY + +R N I+ TDG+ N+ Sbjct: 296 VEAIDAL-QSGGSTAGASGIELAY-KAAQQGYLRGGINR--------ILLATDGDFNVGV 345 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP-------- 351 Q E ++ G + +G ++ L A A ++ ++ Sbjct: 346 TNFDQLKGMVAEKRRSGIALSTLGFGTGNYNDNLMEQLADAGDGAYAYIDTALEARKVLT 405 Query: 352 HSMYDAFSHIGKDIVTK 368 H + + I +D+ + Sbjct: 406 HELGATLATIARDVKIQ 422 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 79/214 (36%), Gaps = 36/214 (16%) Query: 166 SQTDARLDMMIVLDVSRSMES-----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + T+ D+++ LD+S+SM+S D +++ + + ++ K D + Sbjct: 86 THTEPVRDILLALDISQSMDSQDFRDAQDRQVSRWTVVKAVVADFID--KRTDD-----R 138 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F L L+ + + G +T + + Q Sbjct: 139 LGLIVFGTGAFPQAPLTRDHKSLRLLLDHTAVGMAGPNTAIGDAIGMGIRMLDSAQE--- 195 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------ 332 K+++ +TDG T L N A + V+ IGI + Sbjct: 196 --------RDKVLILLTDG--NDTGSAVPPLRAANLAAQHHVTVHTIGIGSPTASGDDQV 245 Query: 333 -HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 + LR + AS F+ ++ +++D ++ + + Sbjct: 246 DFDTLRGISSASGGQFFQAQDGAALHDVYATLDR 279 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 59 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 116 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 117 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 176 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 177 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 230 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 231 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 285 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 286 LELAYQQA-TKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 336 Query: 322 VYAIGIRVIRSHE 334 + G+ +E Sbjct: 337 LSTFGVGNSNYNE 349 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 68/208 (32%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +D+S SM + D S + +A ++ + + + GLV+F+ Sbjct: 88 VMVAMDISNSM-AATDVSPDRFTVAKEAATEFVRNLPEQFN------VGLVSFARTATVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 I+ L+ ST + + + + + IV Sbjct: 141 APPSTNHQAAVDAIEQLT-LTDSTAIGEAVLTSLQAVRSLD-----AQAAEDPPPARIVL 194 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + + EA G V I E LR Sbjct: 195 LSDGGNTSGRPIDEGARAATEA---GVPVSTIAYGTPEGTIDLEGRSIPVPADTESLRGL 251 Query: 340 --ASPNSFYLVENPHSMYDAFSHIGKDI 365 A+ SFY E+ + D +S + I Sbjct: 252 ADATSGSFYAAESDEELRDVYSDLQSSI 279 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 58/179 (32%), Gaps = 44/179 (24%) Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM----QGMRQHCNTEDANYKKI 290 L L + ++ G ST GL + + + G+ + + K Sbjct: 467 PLTNVKKTLTDAVDGMTAVG-STAGHIGLAWGWYLVSPNFGLWSGLGAPAAYDSSKTLKA 525 Query: 291 IVFMTDGENLST----------------------------KEDQQSLYYCNEAKKRGAIV 322 +V MTDGE + +Q+ C K IV Sbjct: 526 VVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCENMKAADVIV 585 Query: 323 YAIGIRVI----------RSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 Y +G + + E + CA+ P+ + + + DAF IG+DI RI Sbjct: 586 YTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRDIGRDITRLRI 644 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 39/273 (14%), Positives = 93/273 (34%), Gaps = 28/273 (10%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + +G + ++ + LP+I ++ +++ I ++ +D + + AA +E Sbjct: 19 RLRDDRRGNVAMIFGLSLPVIVMLALGGVDLHRITTARSQFQDALDAATLAAARS--SET 76 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 K + + I+ ++ D+ V ++ I Sbjct: 77 TPAGLKSVALATLHGNIQ-------------------GTEVEPINDADVEVAMNDKSVVI 117 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + L +P Y +PI++ +VN + +++ +VLD++ SM D Sbjct: 118 ATAQGRVKTLVANIVLPPYGQLLDDTLPISARSEVNRSSR-DVEVALVLDITGSMNDCAD 176 Query: 190 SSIT--KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI---EEFFLLEWGVSHLQ 244 S + KID + +++ V + L +S + + + G Sbjct: 177 SCSSGRKIDNLKSAAKELIDIVVQTNQSPFYSKVALAPYSMGVNVGDTYASRARGSLDSN 236 Query: 245 RK-IKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + I + S + + AY + Sbjct: 237 TQSITAATWLTGSVKTITSISRAYTAVVTASKH 269 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 63/189 (33%), Gaps = 30/189 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +++ +LD S S+ I S+ + K+I + + ++++S Sbjct: 38 TQPVSELVFILDSSGSVAQ---------SDFIISVEFLKFASKIISVSASTTRVAVISYS 88 Query: 228 N------KIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + ++ E + ++ T + L+ A + Sbjct: 89 SCNQIHIRVNYISSPENKNKCTFDNDLTSVNYHPGGTCTAGALEAAGRDVLSH------- 141 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +++++ +TDG N + + K G ++ IGI I+ E Sbjct: 142 --GRPGAQRVVMLLTDGASNDGGPPHANA----QKLKSEGVKIFTIGIGSIKLSELNAIA 195 Query: 340 ASPNSFYLV 348 S + + + Sbjct: 196 TSVDEYVYI 204 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 67/209 (32%), Gaps = 30/209 (14%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D+M+ LD+S SM F D+ + + + +LE + + + GL+ Sbjct: 90 EKSARDLMLALDLSGSMDARDFRDAEGHEQNRL-TAAKDVLEGFAAQREGD---RLGLIV 145 Query: 226 FSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F N Q + ++ G ST + A + Sbjct: 146 FGNAAYLQAPFTDDRETWQTLLEESEVAMAGQSTALGDAIGLAISIFQASDTTN------ 199 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFL 336 ++++ +TDG T A +Y + + + E L Sbjct: 200 -----RVLIVLTDG--NDTGSRVPPRDAATIAAANDVTIYTVAVGDPATIGEEALDLETL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGK 363 A A + + + + ++ A+ I + Sbjct: 253 NAVAETTGGASFQALDTQALEKAYDEINR 281 >gi|241983030|emb|CAZ65768.1| antrax toxin receptor 2 [Cricetulus griseus] Length = 463 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 75/224 (33%), Gaps = 30/224 (13%) Query: 154 IVMPITSS---VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + S + D+ VLD S S+ + + I + ++ Sbjct: 16 LAVGGPGSLLHAQEQPSCKKAFDLYFVLDKSGSVANNW-----------IEIYNFVHQLT 64 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 V+ ++ + FS++ L + + ++ L T GLK A Sbjct: 65 ERF-VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYIHEGLKLANE 123 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI G++ II+ +TDG+ + +++ GA VY +G+ Sbjct: 124 QIQSAGGLKTSS---------IIIALTDGKLDGLVP-SYAENEAKKSRTLGASVYCVGVL 173 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V+ + A I I+ + Sbjct: 174 DFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQSCTE 214 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 49/261 (18%), Positives = 81/261 (31%), Gaps = 55/261 (21%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS--------- 191 F F+ + P T S D +D++ LD+S SM S+ Sbjct: 57 FFEFLALSLMIFALARPRTGEENSYSYKD-SVDIVFSLDISGSMSSYDQPEDLAVNRRVI 115 Query: 192 ---------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 ++ A KSI +++ K + GLV F + + Sbjct: 116 AEAINNKELHPRLHYAKKSIADFIDKRKSD-------RLGLVVFGAEAYSVCPPTNDHEY 168 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 LQ ++K + ST N + G KKII+ +TDG + + Sbjct: 169 LQNRLKEI-----STEYLGDYNRQTNITAAISGGLARLRK-SKAPKKIIILVTDGSHTA- 221 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------------HEFLRACA-- 340 + A K A++Y IG+ + E L+ A Sbjct: 222 NSNLTPRMAAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEELLKEIAEK 281 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 + ++ V M D I Sbjct: 282 TGGLYFSVREAEQMKDVLKKI 302 >gi|119591515|gb|EAW71109.1| collagen, type VI, alpha 3, isoform CRA_g [Homo sapiens] Length = 2205 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1369 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1387 VFTVREFRELPNIEERIMNSF 1407 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 927 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 928 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|119591510|gb|EAW71104.1| collagen, type VI, alpha 3, isoform CRA_b [Homo sapiens] Length = 2210 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 427 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 474 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 475 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 534 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 535 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 584 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 585 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 617 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1319 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1373 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1374 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1433 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1434 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1484 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1485 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1540 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1594 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1595 AFRVGNVQELSE 1606 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1170 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1229 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1230 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1280 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1281 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1338 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1339 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1391 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1392 VFTVREFRELPNIEERIMNSF 1412 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 828 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 877 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 878 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 932 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 933 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 970 >gi|119591514|gb|EAW71108.1| collagen, type VI, alpha 3, isoform CRA_f [Homo sapiens] Length = 2244 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 51/300 (17%), Positives = 106/300 (35%), Gaps = 38/300 (12%) Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD------IVVVPQNEGYSI 129 K G +N +V + SL+ I + + + Sbjct: 1360 KQFGVAPFTIARNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKL 1419 Query: 130 SAISRYKIPLKFCTFIPWY---------TNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 A +RY P++ +P ++ P + + + D++ +LD Sbjct: 1420 LASTRYPPPVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLD- 1478 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S+ DS + + + E V + + + +Q GLV +++ + F L+ Sbjct: 1479 -GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFS 1530 Query: 241 SH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + + I + G N+ GL++ N G R D +I +T Sbjct: 1531 TKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LDQRVPQIAFVITG 1585 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 G+++ +D +RG V+A+G+R I S E + ++ + + V N + + Sbjct: 1586 GKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1640 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|119591511|gb|EAW71105.1| collagen, type VI, alpha 3, isoform CRA_c [Homo sapiens] Length = 2971 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1369 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1387 VFTVREFRELPNIEERIMNSF 1407 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 927 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 928 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|119591512|gb|EAW71106.1| collagen, type VI, alpha 3, isoform CRA_d [Homo sapiens] Length = 2411 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|119591513|gb|EAW71107.1| collagen, type VI, alpha 3, isoform CRA_e [Homo sapiens] Length = 3177 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|55743106|ref|NP_476508.2| collagen alpha-3(VI) chain isoform 5 precursor [Homo sapiens] Length = 2971 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1369 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1387 VFTVREFRELPNIEERIMNSF 1407 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 927 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 928 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|55743098|ref|NP_004360.2| collagen alpha-3(VI) chain isoform 1 precursor [Homo sapiens] gi|311033499|sp|P12111|CO6A3_HUMAN RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|225000446|gb|AAI72233.1| Collagen, type VI, alpha 3 [synthetic construct] gi|302313173|gb|ADL14511.1| collagen, type VI, alpha 3 [Homo sapiens] Length = 3177 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|62988748|gb|AAY24135.1| unknown [Homo sapiens] Length = 2588 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|3127926|emb|CAA36267.1| collagen type VI, alpha 3 chain [Homo sapiens] Length = 3176 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 34/212 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D+ ++D S S+ ++ +K++ + L+ + Sbjct: 146 CPRQEMDIAFLIDGSGSIN---QRDFAQMKDFVKALMGEFASTSTLFS--------LMQY 194 Query: 227 SNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 SN ++ F + L Q + + + T + G++ ++F N Sbjct: 195 SNILKTHFTFTEFKNILDPQSLVDPIVQLQGLTYTATGIRTVVEELF------HSKNGSR 248 Query: 285 ANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRAC 339 + KKI++ +TDG+ E + ++A G I YAIG + + + L Sbjct: 249 KSAKKILLVITDGQKYRDPLEYSDVIPAADKA---GIIRYAIGVGDAFQEPTALKELNTI 305 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 S + + V N + A I + + K Sbjct: 306 GSAPPQDHVFKVGN----FAALRSIQRQLQEK 333 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 77/223 (34%), Gaps = 39/223 (17%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + S+++ S D + + +V+D S SM K+ I + ++ Sbjct: 63 GVRQSIRLFSHEDIPVTVGLVIDHSGSMRP-------KMASVIAAARTFIQSSSPED--- 112 Query: 217 NVVQSGLVTFSNKIE----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q +V F+ + L I + G + Y+ ++ Sbjct: 113 ---QMFVVNFNEDVTLGLSTEIPFTNRPEDLTYAISHSPPTGKTAL--------YDAVW- 160 Query: 273 MQGMRQHCNTEDANYKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R+ + KK++V ++D G+N ST + L N+ V+ IGI Sbjct: 161 --KAREWVARGSRD-KKVLVVVSDGGDNASTHTLSEILEAANK---SNIQVFTIGIFDPD 214 Query: 332 SHE----FLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + LR A+ ++ + + I KDI ++ Sbjct: 215 DPDKNPGVLRQLARATGGEAFVPDELSEVVAICESIAKDIRSQ 257 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 66/179 (36%), Gaps = 26/179 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + R+D+M VLDVS SM + +++ A ++ +E+++ GL F Sbjct: 375 ENKKRVDVMAVLDVSGSM-----ADEARLEQAKTALRIFIEQLQDDDGF------GLTIF 423 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+ + + + + +I L+ G T AY ++ Sbjct: 424 SDSATVLTPVSPIGPKRAEILNRIAGLTPRG-GTRLLDTTVEAYQEM--------SATPP 474 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACA 340 + +V +TDG +N S + Q L + ++ V+ + L+ A Sbjct: 475 GQRI-RAVVVLTDGLDNKSQRNAQDVLNLLRQDREGYSIKVFTVAFGGDADVNLLKEIA 532 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 31/68 (45%), Gaps = 1/68 (1%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 D ++ C+ AK +G ++++ L+ C+S S+Y ++ AF+ I Sbjct: 579 DARTKKLCDLAKSKGIYIFSVAADAPSGGKTLLKYCSSGTSYYYEVQGSNLSTAFASIAA 638 Query: 364 DIVTKRIW 371 I + R+ Sbjct: 639 SISSLRLT 646 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 74/245 (30%), Gaps = 42/245 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + N G + IL+ ++ + G+ I++ + +T++ + IDR+++ AA+ Sbjct: 21 LHRLLRNEDGALIILSLQVFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAASLTQK 80 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 K L G++ I + Sbjct: 81 RDPTLVVKDYLTKAGL------------------GYIASDSSFTPKVEGSIAL----GWR 118 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +S +P F + + + Q +++ +VLD+S SM + Sbjct: 119 RVSVEVDDDMPTIF-GPLLG------VSSLAATGDTTAMQAVGNVEISLVLDLSGSMTEY 171 Query: 188 F----------DSSITKIDMAIKSINAMLEEVKLIPD---VNNVVQSGLVTFSNKIEEFF 234 SS T+ + + + V +V +S + Sbjct: 172 VKDNPSCTKNCTSSKTRFQYLQVAAKSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGS 231 Query: 235 LLEWG 239 ++ G Sbjct: 232 EMQEG 236 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 59.8 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 118/330 (35%), Gaps = 61/330 (18%) Query: 54 IDRSL-----VHAATQIMNEGNGNNRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVND 106 D ++ + + +MN+G G + D + S+ + L +N V Sbjct: 138 NDAAISLVHRLPRSRGVMNQGRGLAPEPSMFDDDERLEQQLVFSGKSYSDALENNHPVRM 197 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV-KVN 165 +D+ + P+ +SA+ R KF + + + S +++ Sbjct: 198 ---------MDLKIYPE-----VSAVPRADSREKF-DVLVHLRAAAMVTGNANSLNNQIS 242 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A +D++ VLD+S SM TK+ + +++ +++ + + ++ Sbjct: 243 RYPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSVIA 290 Query: 226 FSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ F L G + + + G TN GL+ + R+ N Sbjct: 291 FSSTARRLFPLTKMSDAGRQRALQAVNSVVANG-GTNIAEGLRKGVKVM----EDRRDKN 345 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQ---------SLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ ++DG + T S++ C K+ V++ G Sbjct: 346 PVAS-----IILLSDGRDTYTMNQADPNYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHD 399 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 400 ASLMHSVSETSGGTFSFIESESVIQDALAQ 429 >gi|332262934|ref|XP_003280513.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Nomascus leucogenys] Length = 1164 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ S + + A++ + + Q Sbjct: 140 ASRQECPRQEQDIVFLIDGSGSISS------GNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSN+ E F E S+ + + + G T + ++ +Q+F Sbjct: 189 SLMQFSNRFETHFTFEKFRRSSNPLSLLASVRQLGGLTYTATAIQKVVHQLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEF 335 + KI++ +TDG+ D + + +A G I YAIG+ S + Sbjct: 244 -YGARRDATKILIVITDGKKEGDSLDYKHVIPMADA--AGIIRYAIGVGSAFQNTNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|159900556|ref|YP_001546803.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893595|gb|ABX06675.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 423 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 86/234 (36%), Gaps = 28/234 (11%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P + ++++ + + ++ +++ ++D S SM ++ Sbjct: 13 PGRPAVAQSNDPQIVYLLVEASPAGIPDADLAIPVNLGFIVDRSSSMRG------ERLYQ 66 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGV 255 ++ N ++ ++ +V+F+++ E + ++R I + G Sbjct: 67 VKEACNNVVNQLNRQD------YFSVVSFNDRAEVVVPCQRPNDKDQIKRAIGMIEAKGG 120 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + +T + +Q + + + + +V +TDG ++ + + A Sbjct: 121 TEMATGMM-------MGLQEISRPMMSRGISR---MVLLTDGRTYG--DESRCVEIARRA 168 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENPHSMYDAFSHIGKDIVT 367 + +G + A+GI + + L AS + + N + + FS K + Sbjct: 169 QSKGIGITALGIGDEWNEDLLETIASAENSRTEYITNAQQIVNVFSDEIKRLQN 222 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 107/297 (36%), Gaps = 37/297 (12%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DI 107 + AT+I N G R + + + ++ +F ++ + N + Sbjct: 96 AYESVAKAKATRISN--LGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYANVRRFLNQ 153 Query: 108 DDIVRSTSLDIVVV----PQNEGYSISAISRYKIPLKFCT-----FIPWYTNSRHIVMPI 158 + ++ + + P + + P+ F PW + + I Sbjct: 154 GQLPPPDAVRVEEMVNYFPSDWVINDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDI 213 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + S+ +++ ++D S SM S ++ + S+ +++E++ +++ V Sbjct: 214 LA-QDLKSEALPASNLVFLIDTSGSMYSD-----ERLPLIQSSLKLLVKELREQDNISIV 267 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +G ++I + I L+ G STN GL+ AY Q + Sbjct: 268 TYAG----DSRIALPSTSGNHKDEINAAIDSLNARG-STNGGAGLEMAYQQ------AAK 316 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHE 334 N I+ TDG+ +D +S+ + ++ G + +G+ +E Sbjct: 317 GFIKGGVNR---ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGRDNYNE 370 >gi|125830338|ref|XP_692362.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 552 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 74/214 (34%), Gaps = 30/214 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA-IKSINAMLEEVKLIPDVNNVVQ 220 V+ D+ VLD S S++ + + +++ K I+ ML + Sbjct: 31 VEEERSCQGAFDLYFVLDKSGSVKHHWQEIYSFVELLEQKFISPML-------------R 77 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 + FS + L +++ + L + G T GL+ A QI+ Sbjct: 78 MSFIVFSTRGNTIMRLTENRETIRKGLNVLRREIPGGDTFMHLGLEKANEQIYQENYGTA 137 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +I+ +TDGE L + + A+ GAIVY +G++ + Sbjct: 138 S----------VIIALTDGE-LQEHQLIAAQQEAARARTLGAIVYCVGVKDFNETQLATI 186 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + V + A + I+ K Sbjct: 187 ADTSKHVFPVLGG---FQALRGMIDSIIKKSCIE 217 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 59 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 116 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 117 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 176 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 177 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 230 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 231 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 285 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 286 LELAYQQA-TKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 336 Query: 322 VYAIGIRVIRSHE 334 + G+ +E Sbjct: 337 LSTFGVGNSNYNE 349 >gi|257062762|ref|YP_003142434.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] gi|256790415|gb|ACV21085.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] Length = 2281 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 57/167 (34%), Gaps = 20/167 (11%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD--SSITKI 195 P + + I + + + ++++++LD S SM++ S T++ Sbjct: 43 PPHTKNLTDNHDGTYTISLDVVGESERKPN---PVNVIVILDNSGSMDTRTGGYGSQTRM 99 Query: 196 DMAIKSINAMLEEVKLIPDVN--NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 A ++N + + ++VQ LV FS + + L Sbjct: 100 AAAQNAVNNLARSLYAYNTTEFPDLVQMALVGFSTTGSVVQGPTNSYNTFSGAVNRLDAD 159 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 G TN L+ A N D + ++F++DG Sbjct: 160 G-GTNWEDALQDA-----------AGINFNDDDPT-YVIFVSDGNPT 193 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/181 (20%), Positives = 68/181 (37%), Gaps = 25/181 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S S+ ++ T + + L + P + G+V +S+ Sbjct: 606 GALDIVFVIDSSESVG---LTNFTLEKNFVINTINRLGSIAKDPSSETGTRVGVVQYSHN 662 Query: 230 -IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L S + +K L T + LK+AY+ + ++ Sbjct: 663 GTFQAIRLNDSKIDSMSAFKDAVKKLEWIAGGTWTPSALKFAYDNLI--------RDSRR 714 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR----VIRSHEFLRAC 339 + +V +TDG ++D + L Y C V AIGI +E L++ Sbjct: 715 SKANVTVVVITDGR-YDPRDDDKLLNYLCT---DTSIDVNAIGIGDMFDQPEENESLKSI 770 Query: 340 A 340 A Sbjct: 771 A 771 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 61/203 (30%), Gaps = 17/203 (8%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + S +K +D+ +D S ++ + + ++ E Sbjct: 15 HAQGQTPAGSSCSPIKSTEC---PIDLYFAIDTSETIALQENPPGSLVESIKDFTIRFAE 71 Query: 208 EVKLIP---DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +++ + VN G + FS E F + ++ + G T+ + Sbjct: 72 KLQNVNYRGSVNITWAIGGLHFSQLQEFFSTITT-KEKFISNLRPIRYLGRGTHIDCAIT 130 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Q+ + K+ V +TDG + A+ ++A Sbjct: 131 NMTKQLV--------RFPSRPDAKRFAVVITDGHVTANPCGGI-KVAAERARDENIRIFA 181 Query: 325 IGIRVIRSHEFLRACA-SPNSFY 346 + LR A SP Y Sbjct: 182 VASSRNLEETGLREIANSPAGVY 204 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 626 FSYHMPNWFGTTKYVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 679 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 680 FHLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 736 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 737 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 783 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 784 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 811 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 60/144 (41%), Gaps = 27/144 (18%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V+ +D++ V+DVS SM + K+ A ++ A + K + N+ + L Sbjct: 190 VSGIIRPPIDVVFVMDVSGSMTAM------KLQSAKSALQAAVNYFKSNYNQND--RFAL 241 Query: 224 VTFSNKIEEFFLLEWGV-SHLQRKIKYL------SKFGVSTNSTPGLKYAYNQIFDMQGM 276 V FS+ + E ++ +G S++ ++ + G TN + L A + Sbjct: 242 VPFSDGVREASVVPFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFT----- 296 Query: 277 RQHCNTEDANYKKIIVFMTDGENL 300 D KK I+F+TDG Sbjct: 297 -------DPTRKKYIIFLTDGMPT 313 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 74/213 (34%), Gaps = 38/213 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM S ++ + ++ +L E + + + GL+ F++ Sbjct: 15 DLMLVVDLSYSMSQEDMQSGQQMVDRLTAVKQVLSEFIAQREGD---RIGLILFADHAYL 71 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L G T G+ A D A +++ Sbjct: 72 QTPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKTFID-----------SAAPQRV 120 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------S 332 ++ ++DG N + L N AK+ +Y +G+ Sbjct: 121 MILLSDGSNTA--GVLDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDLD 178 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 + L+ AS ++ N + + I + Sbjct: 179 EKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 211 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 79/215 (36%), Gaps = 32/215 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSIT-----KIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 T D+++ LD+S+SM++ + + ++D + + + + + G Sbjct: 90 TQPVRDLLLALDLSQSMDTRDFKTPSGVLEPRVDAVRQVVADFVAR-RTGD------RIG 142 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L+ F + + ++ + + +++ G D G+ Sbjct: 143 LIVFGDAPYPLAPFTLDHALVRELLADMVPGMAGASTSLG---------DAIGLGIKMFD 193 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EF 335 + +K+++ +TDG T + AK RG +V+ +GI + + Sbjct: 194 QSHAQEKVMILLTDG--NDTASRMPPAQAADIAKTRGVVVHTVGIGDPATTGEQKVDLDA 251 Query: 336 LRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTK 368 L+ AS ++ + S+ ++ + + + Sbjct: 252 LKHIASTTGGRYFFGADQTSLASIYATLDRVTPHR 286 >gi|313235273|emb|CBY10837.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 73/209 (34%), Gaps = 25/209 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 N T LD++ V+D S S+ D+ + +++ + N + Sbjct: 172 TQNECTSKALDIVFVVDESGSVGP------DNFDLVKQF---LIDYAQDSNIAANATRIA 222 Query: 223 LVTFSNKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + T+S + F L S++ +I L TN+ + N + Sbjct: 223 IRTYSTYSDLDFSLNDFKTSNIIFEINNLVHESGGTNTADAITNGLNDFGN--------- 273 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + KI+V +TDG++ + ++ +A R +AIGI E + Sbjct: 274 -DRSESVKIMVTITDGQSNYDR--VKAAADLLKADPRNIQSFAIGIDGANMAELQAIATT 330 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + F I K++ +K Sbjct: 331 DPGHIE---MLKNWSDFGPIKKNLQSKVC 356 >gi|293357059|ref|XP_345157.4| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1269 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 99/286 (34%), Gaps = 46/286 (16%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + + +++ + + G +++ L +++ I S V + Sbjct: 185 LQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVSPDF 244 Query: 167 QTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 Q+ D+++V D S S+ + + + ++ + + P Sbjct: 245 QSLTSFSPAVQACPSLVDVVVVCDESNSIYPW--------EAVKNFLEKFVQGLDIGP-- 294 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFD 272 Q L+ ++N F L + + ++G TN+ +++A + + Sbjct: 295 -KKTQVALIQYANDPRVVFNLTTYKNKEDMVQATSETRQYGGDLTNTFKAIQFARDIAYL 353 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----- 327 + + K++V +TDGE+ + Q + CN + + I + Sbjct: 354 PE------SGGRPGATKVMVVVTDGESHDGSKLQTVIQQCN---DDEILRFGIAVLGYLN 404 Query: 328 -RVIRSHEFLR----ACASP--NSFYLVENPHSMYDAFSHIGKDIV 366 + + ++ ++P F+ V + ++ + +G+ I Sbjct: 405 RNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIF 450 >gi|241620324|ref|XP_002408644.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215503005|gb|EEC12499.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 704 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 72/196 (36%), Gaps = 22/196 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + ++ VLDVS SM KI+M ++ + LE +PD ++V G++TFS+ Sbjct: 60 QGHIRIVFVLDVSGSM-----GLENKINMLRQAASRSLE--DNVPDGSDV---GIITFSD 109 Query: 229 KIEEFFLLEWGVSHLQRKI-KYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + ++ I + ST L + + + R E+A Sbjct: 110 NATVVAGMRTLSAATRQAIKNAVPSIARGSTAIGKALMTSVQKHGPQELERNGETAENAA 169 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPNS 344 ++ MTDGE L + + V+++ + L + + Sbjct: 170 ----LLLMTDGEENEPPYINDVLPTLLQKR---LRVFSVPVGKEADDGLRVLSE-RTGEN 221 Query: 345 FYLVENPHSMYDAFSH 360 Y + N + D + Sbjct: 222 VYPITNTTKLADRLAE 237 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 66/216 (30%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+MI +D+S SM + + I+ + + E + + GL+ F++ Sbjct: 84 DLMIAVDLSGSMQIEDMVINGQTVNRFTLIQHVLSDFIERRKGD------RLGLILFADH 137 Query: 230 IEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + + G T + A + + Sbjct: 138 AYLQAPLTLDRRSVATFLDDAQIGLVGKQTAIGEAIALAVKRFDKVDESN---------- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----------------- 330 ++++ +TDG N + + + A KR +Y IG+ Sbjct: 188 -RVLILLTDGSNNAGNIEPEVA--AQIAAKRNITIYTIGVGAEILERRTIFGKERINPSM 244 Query: 331 --RSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 + + A + ++ N + + I K Sbjct: 245 DLDEDQLKKLAAMTKGRYFRARNSEELASIYQEIDK 280 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + + +++ ++D S SM+ ++ + + Sbjct: 203 LAPSPWNAQRTLLKVDVQAR-DMQTSELPPANLVFLIDTSGSMQP-----AERLPLIQSA 256 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 257 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 311 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+F TDG+ NL + + + +++G Sbjct: 312 GLRLAYEQ------AEKGFIKGGVNR---ILFTTDGDFNLGITDPKDIEALVKKEREKGI 362 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 363 TLSTLGVG 370 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 77/214 (35%), Gaps = 30/214 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+DVS SM++ D + T++ A + +++ V GLV+F+ Sbjct: 88 VMLVIDVSLSMKA-TDVAPTRLAAAQAAAKQFADQLTPG------VNLGLVSFAGTAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIV 292 + ++ + L + ST + + A I + ++ + + +V Sbjct: 141 VSPTTDRNAVKNGVDNL-QLAESTATGEAIFTAMQSIDTFSRSLQGGPDQQGTPPPARVV 199 Query: 293 FMTDGENLSTKEDQQ-----SLYYCNEAKKRGAIVYAIGIR---------------VIRS 332 ++DG D + S ++A++RG V I + Sbjct: 200 LLSDGTQTVPGPDGENEPRGSFTAASDAQRRGIPVSTISFGTSYGSIELDGGRTPVAVDD 259 Query: 333 HEFLRACA-SPNSFYLVENPHSMYDAFSHIGKDI 365 R + S F+ + +S + +++ Sbjct: 260 ASMERIASLSGGRFFTAATESELRAVYSDLSEEL 293 >gi|221117277|ref|XP_002154725.1| PREDICTED: similar to HyTSR1 protein, partial [Hydra magnipapillata] Length = 3382 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 75/201 (37%), Gaps = 20/201 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++D+ ++D+S S+ + I IN +++ + P + V + Sbjct: 2984 ECKTQVDLAFIVDMSESVTNVEPDKQEAI------INFIIKLALMFPISKDQVHFAYFPY 3037 Query: 227 SNKIEEFFLLE--------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + + + W +S Q++ ++L K + + G Y ++ + Sbjct: 3038 GTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKMIILE--SRGFSYTWSAAITARETIF 3095 Query: 279 HCNTE-DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +N K++ + TDG + Q+ Y K +G V AIGI + L Sbjct: 3096 TTQKGMRSNVKRVAILFTDGVYNGKHDTQKEWQY---VKDQGIQVVAIGIGSPINISNLE 3152 Query: 338 ACASPNSFYLVENPHSMYDAF 358 AS +F + DAF Sbjct: 3153 LWASSKNFVFNATTYEEADAF 3173 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 81/216 (37%), Gaps = 20/216 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +++ ++D+ ++D+S S+ + I IN +++ + P + V Sbjct: 2723 STECKTQVDLAFIVDMSESVTNVEPDKQEAI------INFIIKLALMFPISKDQVHFAYF 2776 Query: 225 TFSNKIEEFFLLE--------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + + + W +S Q++ ++L K + + G Y ++ + Sbjct: 2777 PYGTEPKLDIPDQFFDGAMNNWTLSDKQKQEEFLRKMIILE--SRGFSYTWSAAITARET 2834 Query: 277 RQHCNTE-DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +N K++ + TDG + Q+ Y K +G V A+GI + + Sbjct: 2835 IFTTQKGMRSNVKRVAILFTDGVYNGKHDTQKEWQY---VKDQGIQVIAVGIGSPINTDN 2891 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L AS + + DAF D V K + Sbjct: 2892 LELWASNKNSVFSATTYQEADAFISTIGDNVCKSLT 2927 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 85/255 (33%), Gaps = 31/255 (12%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y+ S + +F + N ++K+ D L++ +LD S+S+ Sbjct: 118 QYAYSFDLPEDVQEQFKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG 175 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 D+ +++E+ L +Q G+++++ + + S Sbjct: 176 ------QANFDIYKACSEYLVDELALFD---MTIQFGIISYATVPKVIIPIYDEESDNND 226 Query: 246 KIKYLSKFG---------VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L + G TN+ L+ Y+ + Q + + II+ +TD Sbjct: 227 HVLTLIRNGLKYSDHKDKTGTNTKAALEEIYS-MMSSQKETYKNESVWNSIHHIIILLTD 285 Query: 297 GENLSTKEDQQSLYYCNE------AKKRGAIVYAIGIRVIRSHEFLRACASPN----SFY 346 G+ ++ + ++ VY GI L AS + Sbjct: 286 GKANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVF 345 Query: 347 LVENPHSMYDAFSHI 361 +E+ + M F I Sbjct: 346 RMESANEMKTVFQKI 360 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 85/255 (33%), Gaps = 31/255 (12%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y+ S + +F + N ++K+ D L++ +LD S+S+ Sbjct: 209 QYAYSFDLPEDVQEQFKASLSGILNIMERPASFGRTIKIT--KDGILNVYFLLDASQSVG 266 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 D+ +++E+ L +Q G+++++ + + S Sbjct: 267 ------QANFDIYKACSEYLVDELALFD---MTIQFGIISYATVPKVIIPIYDEESDNND 317 Query: 246 KIKYLSKFG---------VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L + G TN+ L+ Y+ + Q + + II+ +TD Sbjct: 318 HVLTLIRNGLKYSDHKDKTGTNTKAALEEIYS-MMSSQKETYKNESVWNSIHHIIILLTD 376 Query: 297 GENLSTKEDQQSLYYCNE------AKKRGAIVYAIGIRVIRSHEFLRACASPN----SFY 346 G+ ++ + ++ VY GI L AS + Sbjct: 377 GKANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVF 436 Query: 347 LVENPHSMYDAFSHI 361 +E+ + M F I Sbjct: 437 RMESANEMKTVFQKI 451 >gi|313212817|emb|CBY36735.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 71/209 (33%), Gaps = 25/209 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 N T LD++ V+D S S+ + + + +++ + + + Sbjct: 172 TQNECTTKALDIVFVVDESGSVG---PDNFELVKLF------LIDYAQDSNIAADATRIA 222 Query: 223 LVTFSNKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + T+S + F L ++ +I L TN+ + N + Sbjct: 223 IRTYSTNSDLDFSLNDFKTRNIISEINNLVYASGGTNTADAITKGLNDFGN--------- 273 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + KI+V +TDG++ ++ +A R +AIGI E + Sbjct: 274 -DRSESVKIMVTITDGQSSYDH--VKAAADLLKADPRNIQSFAIGIDGANMAELQAIATT 330 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + F I +++ +K Sbjct: 331 DPGHIE---MLKDWTDFGPIKQNLQSKVC 356 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 65/198 (32%), Gaps = 20/198 (10%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEW 238 S S +KI + + + + + V + N L Sbjct: 810 SISDSDVMAVWTSKISALKDAASGFVTGISDTSPDSLVGIATFYGIGNGWNSSTEGKLNH 869 Query: 239 GVSHLQRK-----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G+S + + + L G T+ GL++AY+++ + KK ++ Sbjct: 870 GLSKVNKNEMLKSVNALFADG-GTSPQKGLEHAYSEL----------QKAEDGNKKYVIL 918 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACASPNSFYLVENP 351 +DGE + + ++ + K+ G V +G+ + AS + + Sbjct: 919 FSDGEPSDSNDKMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFTADTA 978 Query: 352 HSMYDAFSHIGKDIVTKR 369 + F +I I R Sbjct: 979 EELNKIFQNIQSTITQSR 996 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 47/249 (18%), Positives = 97/249 (38%), Gaps = 40/249 (16%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 A+ I + + + ++L + S DN N I I T ++ Sbjct: 175 DASRHISSLFHAHEPAVFDDDEVLDXQXESTERSSSTRDIDN---NSIGAIEVKTYPEVS 231 Query: 120 VVPQNEGYS-ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIV 177 VP++ ++ + + K PL T+ R ++++ SQ+ A +D++ V Sbjct: 232 AVPRSTSHNNFTVLIHLKAPL---------TSGRQNSGTNQTNMQPTSQSCRAPVDLVTV 282 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LDVS SM TK+ + +++ +++ + + +++FS+ F L Sbjct: 283 LDVSGSMAG------TKLALLKRAMGFVIQSLGPCD------RLSVISFSSTARRLFPLR 330 Query: 238 ----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G + + L G TN GL+ +G + + + N I+ Sbjct: 331 RMTDTGRQQALQAVNSLISNG-GTNIAEGLR---------KGAKVMLDRKWKNPVSSIIL 380 Query: 294 MTDGENLST 302 ++DG++ T Sbjct: 381 LSDGQDTYT 389 >gi|114584071|ref|XP_001153479.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Pan troglodytes] Length = 2971 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1369 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1387 VFTVREFRELPNIEERIMNSF 1407 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 873 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 927 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 928 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|114584069|ref|XP_001153544.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Pan troglodytes] Length = 3177 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|114584077|ref|XP_001153230.1| PREDICTED: alpha 3 type VI collagen isoform 1 [Pan troglodytes] Length = 3010 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 628 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 675 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 676 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAG 735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 736 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 785 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 786 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 818 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 51/300 (17%), Positives = 106/300 (35%), Gaps = 38/300 (12%) Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD------IVVVPQNEGYSI 129 K G +N +V + SL+ I + + + Sbjct: 1360 KQFGVAPFTIARNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKL 1419 Query: 130 SAISRYKIPLKFCTFIPWY---------TNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 A +RY P++ +P ++ P + + + D++ +LD Sbjct: 1420 LASTRYPPPVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLD- 1478 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S+ DS + + + E V + + + +Q GLV +++ + F L+ Sbjct: 1479 -GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFS 1530 Query: 241 SH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + + I + G N+ GL++ N G R D +I +T Sbjct: 1531 TKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LDQRVPQIAFVITG 1585 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 G+++ +D +RG V+A+G+R I S E + ++ + + V N + + Sbjct: 1586 GKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSATAFRVGNVQELSE 1640 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|114584075|ref|XP_516178.2| PREDICTED: alpha 3 type VI collagen isoform 5 [Pan troglodytes] Length = 2976 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 427 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 474 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 475 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFMEAG 534 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 535 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 584 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 585 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 617 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1319 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1373 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1374 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1433 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1434 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1484 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1485 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1540 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1594 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1595 AFRVGNVQELSE 1606 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1170 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1229 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1230 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1280 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1281 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1338 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1339 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1391 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1392 VFTVREFRELPNIEERIMNSF 1412 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 828 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 877 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 878 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 932 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 933 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 970 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 76/218 (34%), Gaps = 37/218 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSS----------ITKIDMAIKSINAMLEEVKLI 212 + +D+ IVLDVS SM + + + +D+ + + + Sbjct: 348 TSGGAQRSGVDIEIVLDVSGSMATESEVQDAAGNVQRHGFSTLDVCKHAARCVACSLD-- 405 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + GLV + + L + + ++ L+ G STN GL+ + Sbjct: 406 ----DTCRLGLVAYDAQARVVVGLARVTPAHVAKVHAALEKLAP-GTSTNLWGGLELGVD 460 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRG---AIVYA 324 ++ G + ++ +TDG N S E + + K G V+A Sbjct: 461 ELVGGAGDNA----------RAVLLLTDGVPNNSPPEGEVAALRAKRLTKDGSETVAVFA 510 Query: 325 IGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 G + L + A F V + + +F+H Sbjct: 511 AGFGYALRSDLLLSLAREGGGLFSFVPDAGMVGTSFNH 548 >gi|160858155|emb|CAP19997.1| collagen type VI alpha 5 precursor [Homo sapiens] Length = 591 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 59/165 (35%), Gaps = 22/165 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ I + MLE ++ + V+ G+V +S+ E Sbjct: 442 DIHFLIDGSSSIQKKQFEQIKRF---------MLEVTEMFSIGPDKVRVGVVQYSDDTEV 492 Query: 233 FFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + ++ I + + T + L Y I + R Sbjct: 493 EFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYILQIIKNGMKDRMSK------VPCY 546 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ +TDG + + + + V+A+GI E Sbjct: 547 LIVLTDGMSTD-----RVVEPAKRLRAEQITVHAVGIGAANKIEL 586 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 64/185 (34%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + IN M+ + P N + L +S++ Sbjct: 30 DVVFLVDSSDHLGP------KSFPFVKTFINKMINSL---PIEANKYRVALAQYSDEFHS 80 Query: 233 FFLLEW--GVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L G S + +K G S L+ A+ F + + + Sbjct: 81 EFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALQEAHRTYFSAPINGR----DRKQFPP 136 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+V + S + + + +K G + ++G++ L+A A+ + + + Sbjct: 137 ILVVL-----ASAESEDEVEEASKALQKDGVKIISVGVQKASEEN-LKAMATSHFHFNLR 190 Query: 350 NPHSM 354 + Sbjct: 191 TIRDL 195 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 67/178 (37%), Gaps = 26/178 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM++ D + ++ID A + + ++ + + Sbjct: 80 GTKLETVKREGVDVVFAVDVSKSMDAE-DIAPSRIDKAKQLVTQIINNLGSD-------R 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ + S + + ++ T ++ A D + Sbjct: 132 VGIIAYAGSAYPQLPITTDYSSAKMFLNAMNTDMLSSQGTAIRDAIELAKTYYNDEEQTN 191 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +++V ++DGE+ + + A ++G ++ IG+ Sbjct: 192 -----------RVLVIISDGEDHA----GEVASIAESATEQGIRIFTIGVGSEAGDRI 234 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRFWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGADPDKDALQ 241 >gi|296206489|ref|XP_002750241.1| PREDICTED: von Willebrand factor A domain-containing protein 1 [Callithrix jacchus] Length = 445 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 68/211 (32%), Gaps = 23/211 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + D+M +LD S S+ + S + + + + V +P + Sbjct: 21 GAARGPPASAPGGDLMFLLDSSASVSHYEFSRV------REFVGQL---VAPLPLDTGAL 71 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ LV ++ F S Q + ++ T++ L Y Q+F Sbjct: 72 RASLVHVGSRPYTEFSFGQHSSGKAAQDAVLASAQRMGDTHTGLALAYVKEQLF------ 125 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K++V++TDG + E K G V+ + E Sbjct: 126 AEAAGARPGVPKVLVWVTDGGSSDPVG-----PPMQELKDLGVTVFIVSTGRGNFLELSA 180 Query: 338 ACASP-NSFYLVENPHSMYDAFSHIGKDIVT 367 ++P + ++ + I+ Sbjct: 181 VASAPAEKHLHFVDVDDLHIILQELRGSILD 211 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 86/231 (37%), Gaps = 22/231 (9%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQ 167 + + +L + + Y + P T + + +++ + + Sbjct: 159 LPQKDTLRVEEMLNYFSYDYPLPGKNDAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKA 218 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ +LDVS SM +S K+ + ++ + +++ V+ VV +G Sbjct: 219 ELGASNLVFLLDVSGSM-----ASPDKLPLLQTALKMLTQQLDAQDKVSIVVYAGAAG-- 271 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L ++ LS G STN G++ AY ++H N Sbjct: 272 --VVLDGAAGNDTQTLNYALEQLSA-GGSTNGAQGIQLAYQL------AQKHFVEGGINR 322 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ T + + + K++G + +G + ++ L Sbjct: 323 ---VILATDGDFNVGTTNLDELIDLVSARKQQGIGLTTLGFGMGDYNDHLM 370 >gi|118764404|gb|AAI28589.1| VWA2 protein [Homo sapiens] Length = 421 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 57/159 (35%), Gaps = 24/159 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + L +VN +V Q GLV Sbjct: 220 GCRTQALDLVFMLDTSASVGPENFAQMQS----------FVRSCALQFEVNPDVTQVGLV 269 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++++ F L+ + + R I G ++ L + Y+++ +Q Sbjct: 270 VYGSQVQTAFGLDTKPTRAAMLRAISQAPYLGGVGSAGTALLHIYDKVMTVQ------RG 323 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +T G + + + + G Sbjct: 324 ARPGVPKAVVVLTGGRGA-----EDAAVPAQKLRNNGIS 357 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 72/187 (38%), Gaps = 22/187 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T +D +++ + V+ + ++ + G+ T+S ++ Sbjct: 37 RVDLLFLLDSSAG---------TTLDGFLRAKVFVKRFVRAVLSEDSRARVGVATYSREL 87 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L + + G T + L+ A +G T + Sbjct: 88 LVAVPVGEYQDVPDLVWSLDGIPFRGGPTLTGSALRQA-----AERGFGSATRTGQDRPR 142 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G +R+ E SP + Sbjct: 143 RVVVLLTE-----SHSEDEVAGPARHARARELLLLGVGSEAVRA-ELEEITGSPKHVMVY 196 Query: 349 ENPHSMY 355 +P ++ Sbjct: 197 SDPQDLF 203 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 72/197 (36%), Gaps = 25/197 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEF 233 MIVLD S SM D+ +ID A ++ +L + Q GL+ + ++ Sbjct: 60 MIVLDASGSMNQD-DAPGLRIDAAKAAVTDLLGTLPA------PTQVGLMVYGTSTGSTD 112 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 G ++ + L+ +++ Y I +R + I Sbjct: 113 AERAAGCQDIKTLAPVGTLNAATLTSQVAGITASGYTPI--GNALRAAAQALPNEGPRSI 170 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAK---KRGA--IVYAIGIRVIRSHEFLRAC---ASPN 343 V ++DGE+ C+ A+ ++G V+ +G +V + +C A+ Sbjct: 171 VLVSDGEDTCAPP-----APCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQATGG 225 Query: 344 SFYLVENPHSMYDAFSH 360 ++ N + DA Sbjct: 226 TYSDAGNATGLTDALQA 242 >gi|313221050|emb|CBY31881.1| unnamed protein product [Oikopleura dioica] Length = 1282 Score = 59.8 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 70/214 (32%), Gaps = 20/214 (9%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + S + + +DM+ +LD S S+ + + + Sbjct: 456 DNQITENLVELTDPLSSFICENSTKEVDMIFLLDSSGSVGK------PNFQVMKSWMRRL 509 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGL 263 + + + P Q + ++N F L + I + G T L Sbjct: 510 ISGLNIAP---GRTQVSVYLYNNIFRTIFNLNEHQNAYDMITAINKMVYSGKGTRIARAL 566 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + A +++ + N + + +TDG+ + +A + Sbjct: 567 QSAMSKVLIPESGL------RPNSEIYLYLLTDGKESDVADVNNMANDIKDAFSDRITLT 620 Query: 324 AIGIRVIRSHEFLRACASP---NSFYLVENPHSM 354 A+GI + L A AS ++ +L+EN + Sbjct: 621 AVGISRSVENAELYAIASAPKKDNVFLLENYRDL 654 >gi|293346541|ref|XP_001058131.2| PREDICTED: alpha 1 type VII collagen-like [Rattus norvegicus] gi|293358308|ref|XP_230973.5| PREDICTED: collagen, type XX, alpha 1 [Rattus norvegicus] Length = 1320 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 68/206 (33%), Gaps = 23/206 (11%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P T P + +PI + D++ ++D S S+ Sbjct: 142 PDSEKTSEPSIAFTSSRDLPILDDPQFQCTPPTPADIIFLVDGSWSIGHN------NFQQ 195 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGV 255 + +++ + PD VQ GL +S + + L + + +L G Sbjct: 196 VKDFLASIITHFAIGPDK---VQVGLTQYSGDPQTEWDLNSFHTKEQVLAAVHHLHYKGG 252 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T + L + + + K+++ +TDG K Sbjct: 253 NTFTGLALTHV------LGQNLKPAAGVRPEAAKVLILVTDG-----KSQDDVRTAARVL 301 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACAS 341 K + V+ +G++ + E L+ AS Sbjct: 302 KDQDIDVFTVGVKNVDEAE-LKLLAS 326 >gi|170079457|ref|YP_001736094.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887126|gb|ACB00840.1| Protein containing von Willebrand factor (vWF) type A domain [Synechococcus sp. PCC 7002] Length = 545 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 72/211 (34%), Gaps = 30/211 (14%) Query: 134 RYKIPLK---------FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y P + PW + + + I ++ ++ +++ + DVS SM Sbjct: 135 TYDYPAPDNQPFAINTELSQAPWQPQHQLLRIGIKG-QEIENEALPPSNLVFLFDVSGSM 193 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 K+ + + ++ E++ V+ VV +G + + Sbjct: 194 N-----DPDKLPLLKSAFRLLVNELRPEDRVSIVVYAGAAG----LVLPSTSGAEKETIL 244 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 + L G ST G++ AY + D + N I+ TDG+ N+ Sbjct: 245 AALDNLEA-GGSTAGGEGIELAYQEAAD-NFLDNGNNR--------IILATDGDFNVGMS 294 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 D + + + +++ + +G + Sbjct: 295 SDAELIRLIEQKREQDIFLTVLGFGTGNLKD 325 >gi|109474242|ref|XP_001066203.1| PREDICTED: von Willebrand factor homolog [Rattus norvegicus] gi|149049388|gb|EDM01842.1| rCG29742 [Rattus norvegicus] Length = 2812 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/306 (13%), Positives = 95/306 (31%), Gaps = 33/306 (10%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + ISR Sbjct: 1587 EHSFSPSQGDREQAPNLVYMVTGNPASDEIRRLPGDIQVVPIGVGSRANLQELERISRPI 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1647 APIFIQDFETLPREAPDLVLRTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSL---- 1702 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 + D A + + + P + Q ++ + + I+ + + ++LQ Sbjct: 1703 --PASSFDEMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVPWNVAQEKAYLQSL 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ +++ M + D Sbjct: 1758 VDLMQQEGGPSQIGNALAFAVRYVTSQ----IHGARPGASKAVVMIIM---DTSLDSVDT 1810 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI--GKD 364 + A+ V+ IG+ LR A P + V + D + + G Sbjct: 1811 AV----DAARSNRVAVFPIGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLLTMVTPGNS 1866 Query: 365 IVTKRI 370 + Sbjct: 1867 FFHRLC 1872 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 94/240 (39%), Gaps = 28/240 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS------------SITKIDMA 198 ++ + + S + + +L+++IVLD+S SM S FD + T+ Sbjct: 928 GYYLSVGLNSGIIESDFQRKKLNLVIVLDISGSMGSSFDEYYYDRFGNRVDVNETEDAEK 987 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFG 254 K A V L+ + + + GLV F+ E + + L+ + +S Sbjct: 988 SKIEIAAAAIVALLDHLEDDDRLGLVLFNTGAELAEPVSLIGAKNMQKLKGDVLEISAT- 1046 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + G++ A +++D + + Y+ I+F+TD + ++SL E Sbjct: 1047 DGTRLSAGMQMA-TELYD-----EFLEVNQSEYENRIIFLTDAMPNLGQTSEESLLGMTE 1100 Query: 315 AKKRGAIVYA--IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A VY IGI V + E + ++Y V + + + +VT + Sbjct: 1101 ANANK-NVYTTFIGIGVDFNTELVEYITKIRGANYYSVHSAKQFKERMDDEFEYMVTPLV 1159 >gi|293358946|ref|XP_342760.4| PREDICTED: von Willebrand factor [Rattus norvegicus] Length = 2763 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/306 (13%), Positives = 95/306 (31%), Gaps = 33/306 (10%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + ISR Sbjct: 1538 EHSFSPSQGDREQAPNLVYMVTGNPASDEIRRLPGDIQVVPIGVGSRANLQELERISRPI 1597 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1598 APIFIQDFETLPREAPDLVLRTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSL---- 1653 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 + D A + + + P + Q ++ + + I+ + + ++LQ Sbjct: 1654 --PASSFDEMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVPWNVAQEKAYLQSL 1708 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ +++ M + D Sbjct: 1709 VDLMQQEGGPSQIGNALAFAVRYVTSQ----IHGARPGASKAVVMIIM---DTSLDSVDT 1761 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI--GKD 364 + A+ V+ IG+ LR A P + V + D + + G Sbjct: 1762 AV----DAARSNRVAVFPIGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLLTMVTPGNS 1817 Query: 365 IVTKRI 370 + Sbjct: 1818 FFHRLC 1823 >gi|198430887|ref|XP_002124814.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 400 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 49/333 (14%), Positives = 107/333 (32%), Gaps = 43/333 (12%) Query: 51 HSMIDRSLVHAATQIM-----NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++I ++ + +++ + N + I + + + N + Sbjct: 48 QTVIVKASANGTAKLLIGAPNSRRNNFGSSQGVLYGIKFAVASGIYDIVAKDPEKNAPIV 107 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI-VMPITSSVKV 164 + + ++ I V E ++ A RY+ NS+ ++ V Sbjct: 108 VTNLRKDADAIGISVSNSKEAVTVCAPMRYRHCRSHNIVSGSCFNSKDFGESWQSAKPTV 167 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +D LD++ VLD S S+ T + + A+ + + N G++ Sbjct: 168 PFCSDQALDLLFVLDGSFSVGE------TNFGLVKNWVVALAKSFDIEKQDN----IGII 217 Query: 225 TFSN-------------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +S+ K E L + ++ + +S G +T + L Sbjct: 218 QYSHWYPGVPYSQQPYMKTEVPLGLYKNFTLFEKIAQNISLQGFTTYTAHALNKTVLDFM 277 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + N K+++ +TDG + + +G I +AIGI Sbjct: 278 ASERFM------HENVTKVMILITDGR---ADDAIDLYSSAEYVRSQGIITFAIGIGNSV 328 Query: 332 SHEFLRACASPN-----SFYLVENPHSMYDAFS 359 + L+ A+ + V S+ S Sbjct: 329 LRDQLQIVANGKLGEDTRVFEVTTFTSLNSILS 361 >gi|156370955|ref|XP_001628532.1| predicted protein [Nematostella vectensis] gi|156215511|gb|EDO36469.1| predicted protein [Nematostella vectensis] Length = 737 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 76/217 (35%), Gaps = 25/217 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + V + + LD+ I+LD S S+ T + + +++++ + P +V Sbjct: 74 PTPVDLLGKCHDELDLAILLDASYSIT------TTGWEGIATATKDLIDKLDISPAGTHV 127 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+ FS +E F++ + + L+ I + G + + L A N IF Sbjct: 128 ---SLMKFSTDVETFYMFDNQATKDRLKSLIDKMEYDGEWSRTDIALNAAKNHIF----- 179 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-------NEAKKRGAIVYAIGIRV 329 + K IV TDG+ +++ + + + ++ + Sbjct: 180 -VPSTGARGDAPKAIVLFTDGKTDGPTPIDRNIDWVASLIKPLQDLRDANVTIFCVASGN 238 Query: 330 IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDI 365 + + A + + ++ + K + Sbjct: 239 HPDYSQINWLAGDRARVFTTSDMDALVGSLGRASKKV 275 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 50/321 (15%), Positives = 101/321 (31%), Gaps = 47/321 (14%) Query: 45 FMKTVLHSMI--DRSLVHAATQIMNEGNGNNRKKLKGGDILCRI---KNTWNMSFRNELR 99 + + + + V A Q+ + G I +N + +N + Sbjct: 84 QRQAEMRDAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIM 143 Query: 100 DNG------FVNDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RYKIPLKFCTFIPW 147 G F D+D +T ++ P+ + + Y PL P+ Sbjct: 144 VAGEIPVSTFSIDVDTGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDYPLPAKNAAPF 203 Query: 148 YTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + P + +++ +LDVS SM +S K+ + Sbjct: 204 SVTTELAPSPYNDDMMLLRIGLKGYDLLKSQLGASNLVFLLDVSGSM-----ASTDKLPL 258 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ + ++ V+ VV +G + L ++ LS G ST Sbjct: 259 LQTALKLLTAQLSAQDKVSIVVYAGAAG----VVLDGASGNDTQTLTYALEQLSA-GGST 313 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 N G+ AY ++H N ++ TDG+ N+ + + K Sbjct: 314 NGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGVTDFDDLTALIEKEK 364 Query: 317 KRGAIVYAIGIRVIRSHEFLR 337 G + +G + ++ L Sbjct: 365 AHGIGLTTLGFGLGNYNDQLM 385 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-TKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 Query: 322 VYAIGIRVIRSHE 334 + G+ +E Sbjct: 347 LSTFGVGNSNYNE 359 >gi|294508602|ref|YP_003572661.1| aerotolerance-related membrane protein [Salinibacter ruber M8] gi|294344931|emb|CBH25709.1| aerotolerance-related membrane protein [Salinibacter ruber M8] Length = 358 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 65/187 (34%), Gaps = 16/187 (8%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + ++ P + +V + LD+++ LDVS SM + D +++ A I ++++ Sbjct: 67 LGAVALIGPRWGT-EVRTVERRGLDLVVALDVSASMRAQ-DVPPSRLRRAKNEIRTLVDD 124 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKY 265 + + GLV F+ L + + + T+ + + Sbjct: 125 LSGD-------RVGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPGTDVSAAVDA 177 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + + ++ ++DGEN + D A++ G + Sbjct: 178 GLQAFGAPRPTDDTTAAPEEPRPRALLIVSDGENHAGDLDAAR----QRAEEAGVTLLTA 233 Query: 326 GIRVIRS 332 G+ Sbjct: 234 GVGTEDG 240 >gi|194223901|ref|XP_001494668.2| PREDICTED: similar to integrin alpha 2 subunit [Equus caballus] Length = 1194 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 77/210 (36%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + + ++ + + P Q GL+ ++N Sbjct: 186 IDVVVVCDESNSIYPW--------EAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 234 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ + G TN+ ++YA + + Sbjct: 235 VVFNLNTFKTKAEMVEATSQTFQFGGDLTNTFKAIQYARDSAYSA------AAGGRRGAT 288 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 289 KVMVVVTDGESHDGSMLKAVIDQCN---SDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 345 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 346 ASIPTERYFFNVSDEVALLEKAGTLGEQIF 375 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 72/179 (40%), Gaps = 26/179 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + K+ + +D++ +D+S+SM + D + ++++ + + + ++ + Sbjct: 78 AGTKIETIKREGVDIVFAVDISKSMLAE-DIAPSRLEKSQQLVTQIINNLASD------- 129 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ ++ + S + ++ ++ T ++ A Sbjct: 130 RIGLIAYAGSAVPQLPITTDYSSAKMFLQSMNTDLVSSQGTAIAEAIQLA---------- 179 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + +ED K++V ++DGE+ + ++L Y A + G + IG+ + Sbjct: 180 -ESYYSEDTEASKVLVIISDGEDH----EGEALDYAEAAAENGIRIITIGVGTEKGGTI 233 >gi|46445753|ref|YP_007118.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] gi|46399394|emb|CAF22843.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] Length = 362 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 63/228 (27%), Gaps = 47/228 (20%) Query: 172 LDMMIVLDVSRSMESFFDSS--------ITKIDMAIKSINAMLEEVKLIPDVNNVVQ--- 220 + + +++D S SM S+ TKID+ + + + Sbjct: 96 IAIYLIVDQSSSMNEKISSNSFFERGRSFTKIDLLKVMAGQFI-----------LHRPSD 144 Query: 221 -SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQ 278 GLV F+ + L L ++ L + YA Y + + Sbjct: 145 LIGLVAFARVPKILSPLTLDHELLINQLNDLKAINSMEEDGTAMGYAIYKTAHLIVATKH 204 Query: 279 HCN------TEDANYKKII-VFMTDGENLSTKED----QQSLYYCNE---AKKRGAIVYA 324 K I V +TDG + D +++ AK G +Y Sbjct: 205 FAQELQKKGKPAYEIKNAIMVVLTDGFQDPNRLDYGNRLRTIELDEAIAYAKNAGIHLYI 264 Query: 325 IGIR--------VIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGK 363 I + + S YL + F I + Sbjct: 265 INVDPKFSSPQFAPHRRQIETLAESTGGQLYLANQEKDLKRVFDTIDR 312 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 56/163 (34%), Gaps = 21/163 (12%) Query: 173 DMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++++LDVS SM T++ +S+ + + + GLV F+++ Sbjct: 97 DVVLILDVSGSMAKNDVQGGATRLQAVQESVRKFVAARQSD-------RIGLVIFASQAW 149 Query: 232 EFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + LQ +I LS G T L + NT + K Sbjct: 150 PFAPVSEDKQALQTRINQLSPGMVGQQTAIGDALGVGVKLL---------DNTTNTEASK 200 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + +TDG T A V+ I I S Sbjct: 201 LAILLTDG--NDTASQLAPALAAQLAAAHHVQVHTIAFGDINS 241 >gi|307298147|ref|ZP_07577951.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] gi|306916233|gb|EFN46616.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] Length = 704 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 82/219 (37%), Gaps = 28/219 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P ++ + S +IP T TN + ++ T ++ + D++ VLD+S Sbjct: 218 PNSDVALVLTSSEDEIPSSLATHWDESTNEGYFLL--TLIPRIKEEIVIPKDVVFVLDIS 275 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM KI+ A +++ +L+ ++ + +VTF ++ S Sbjct: 276 GSMYG------EKIEQAKRALEQVLQMLRPGD------RFAIVTFDGRVHNLTGSLLDAS 323 Query: 242 HLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 I+ + + TN L+ + + + D K ++F+TDGE Sbjct: 324 EKAEWIEKVRRIQADGMTNIYGALQTSIDMF----------SKYDTGRFKALLFLTDGEP 373 Query: 300 LSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFL 336 D + EA+ R +++ G+ E L Sbjct: 374 TEGITDIGRIISDATPEARARNVHLFSFGVGTGVVAELL 412 >gi|47216962|emb|CAG04904.1| unnamed protein product [Tetraodon nigroviridis] Length = 519 Score = 59.8 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 72/204 (35%), Gaps = 28/204 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D D+ VLD S S+ + I +E++ V+ ++ + FS Sbjct: 30 DGAFDIYFVLDRSGSVSGHWP-----------EIFGFVEQL-TGRFVSPRMRVSYIVFSA 77 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + ++ L + T GLK Q+ + + Sbjct: 78 RAVVILPLTGHRAEIDEGLERLRQIKPAGETFMHEGLKAVSGQM----------KAQTSP 127 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 I++ +TDG+ L + S+ + A+ GA V+ +G+ + P + Sbjct: 128 SSSIVIVLTDGK-LEVYPYELSVQEADRARGLGARVFCVGVMDFDHKQLAEIADGPERVF 186 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 V + + A + I+T+ Sbjct: 187 PVLSG---FHALKDVVATILTQSC 207 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 76/216 (35%), Gaps = 32/216 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + +T + +++++D S SM ++ I+K MA ++ Sbjct: 365 LGAYKNTPLEETLPVEMTPPPRPERSDTT---LLLIIDQSASMGP--ETGISKFTMAKEA 419 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTF---SNKIEEFFLLEWGVS--HLQRKIKYLSKFGVS 256 E ++ + G++ F + + +F + G+S +QR+I L G Sbjct: 420 AIMATESLRQED------RIGVLAFDVSTRWVVDFQPVGVGLSLADVQRRISTLP-LGGG 472 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T+ L+ + G +H V +TDG + T + Q EA+ Sbjct: 473 TDIYNALQEGLPALAQQPGRVRHA-----------VLLTDGRSF-TDDRQAYRMLLEEAR 520 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA---SPNSFYLVE 349 + + I I L+ A + Y E Sbjct: 521 SQNITLSTIAIGTDADINLLQELARWGAGRYHYAAE 556 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 77/206 (37%), Gaps = 43/206 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +++D+S SM+ + ++ V+L+PD + Q+G+ TF + Sbjct: 48 DVRVLIDMSGSMKDNDPEN------LRIPALNLI--VQLLPDGS---QAGVWTFGQWVNM 96 Query: 233 FFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 W + ++ K ++ FG+ TN ++ A ++ QH Sbjct: 97 LIPPAEVNSEWRTNAKEKA-KMINSFGLRTNIGEAMERATWKLAADSDFEQHA------- 148 Query: 288 KKIIVFMTDG------------ENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHE 334 + +TDG +N + E Q+ L +E K G ++ I + Sbjct: 149 ----ILLTDGIVDIAADDDPQKDNKNEAERQRILTDVLSEYKNLGVKIHTIALSNAADKV 204 Query: 335 FLRACA--SPNSFYLVENPHSMYDAF 358 L A + +VEN + AF Sbjct: 205 LLEKLALETGGMAEVVENSEQLVKAF 230 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-TKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 Query: 322 VYAIGIRVIRSHE 334 + G+ +E Sbjct: 347 LSTFGVGNSNYNE 359 >gi|324983881|gb|ADY68777.1| complement factor B/C2 [Oncorhynchus mykiss] Length = 749 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 83/218 (38%), Gaps = 28/218 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +L + I LD S S+ + T +K++ + ++IP + G Sbjct: 244 SIQMRQGGKLHIYIALDNSGSIS---RDNFTVAKNCVKALINQISFFEVIP------RYG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++TF++++ E + SH ++ L+ +G + L + +I + + + Sbjct: 295 ILTFASEVNEIVDIXSPSSHSDMVLQNLNNDTYGKLNGTGSNLSRVFKKILEKMAVFKER 354 Query: 281 NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAI---------VYAIGIRVI 330 N E + ++ I+ TDG N+ + + N G +Y G+ Sbjct: 355 N-ELEDTQQAIIMFTDGNTNMGGSAEPTVIRIRNLM--EGIHGKDWQKYLDIYVFGVGAK 411 Query: 331 RSHE----FLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + E + +S FY + + + F I + Sbjct: 412 MNEEEINKLVSKKSSEKHFYKLRDDTKLTSLFKMIIDE 449 >gi|297673847|ref|XP_002814961.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pongo abelii] gi|261858584|dbj|BAI45814.1| anthrax toxin receptor 2 [synthetic construct] Length = 488 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|224809466|ref|NP_001139266.1| anthrax toxin receptor 2 isoform 2 [Homo sapiens] gi|306526289|sp|P58335|ANTR2_HUMAN RecName: Full=Anthrax toxin receptor 2; AltName: Full=Capillary morphogenesis gene 2 protein; Short=CMG-2; Flags: Precursor Length = 489 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 78/216 (36%), Gaps = 37/216 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +++++ LDVS SM S D S T+++ A S ++ + G+V F + Sbjct: 87 VNLVVALDVSASM-SASDYSPTRVEAAKGSSEILIRSLSESDTA------GVVIFESGAS 139 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + +++ +S T GL A + + + I+ Sbjct: 140 SAAYLSSDKNRVVSRLEQVSVKTGKTALGDGLALAVDMVTAIPAGTY-----------IV 188 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------------SHEFL 336 V ++DG +S AK G +VY IG+ E L Sbjct: 189 VLLSDG--VSNSGMITPQEAAEYAKNSGVVVYTIGVGSESPVEVSSDGVQQYASLDEETL 246 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 R+ A + ++ + ++ + I I+ + + Sbjct: 247 RSIAEITGGEYFRSVDEKTLVQIQNTIQTSIIREPV 282 >gi|119626254|gb|EAX05849.1| anthrax toxin receptor 2, isoform CRA_b [Homo sapiens] Length = 322 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 86/213 (40%), Gaps = 37/213 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 163 QTYMDIIIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 211 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L V + ++ + G + T + G+++A ++ F G + Sbjct: 212 DVVHEFHLNDYRSVKDVVAAASHIEQRGGTETRTAYGIEFARSEAFQKGGRKG------- 264 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL--- 336 K++++ +TDGE+ + + ++ + + K YA+ + R I FL Sbjct: 265 -AKRVMIVITDGESHDSPDLEKVIE--DSEKDN-VTRYAVAVLGYYNRRGINPEAFLNEI 320 Query: 337 RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G+ I Sbjct: 321 KFIASDPDDKHFFNVTDEAALKDIVDALGERIF 353 >gi|62089412|dbj|BAD93150.1| anthrax toxin receptor 2 variant [Homo sapiens] Length = 502 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 30 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 78 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 79 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 137 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 138 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 185 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 186 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 234 >gi|30013741|gb|AAP04016.1| capillary morphogenesis protein 2 [Homo sapiens] Length = 489 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|50513243|ref|NP_477520.2| anthrax toxin receptor 2 isoform 1 [Homo sapiens] gi|21750159|dbj|BAC03731.1| unnamed protein product [Homo sapiens] Length = 488 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 86/213 (40%), Gaps = 37/213 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 163 QTYMDIIIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 211 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L V + ++ + G + T + G+++A ++ F G + Sbjct: 212 DVVHEFHLNDYRSVKDVVAAASHIEQRGGTETRTAYGIEFARSEAFQKGGRKG------- 264 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL--- 336 K++++ +TDGE+ + + ++ + + K YA+ + R I FL Sbjct: 265 -AKRVMIVITDGESHDSPDLEKVIE--DSEKDN-VTRYAVAVLGYYNRRGINPEAFLNEI 320 Query: 337 RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G+ I Sbjct: 321 KFIASDPDDKHFFNVTDEAALKDIVDALGERIF 353 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 1/61 (1%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R G IL AI LP++ + ++++ I +K L + D + + A + + Sbjct: 9 RRLQSQRGGT-AILFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARSLSDP 67 Query: 69 G 69 G Sbjct: 68 G 68 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 82/222 (36%), Gaps = 30/222 (13%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---- 228 IVLD S SM + S I ++ N + + ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGIGNFTRAKNCLRDFIEKVASYGVKPKYGLVTYATIPKV 320 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTE 283 + + +W L + K TN+ L+ YN + + + + N Sbjct: 321 WVKLRDDNSSDADWVTRILNQISYEDHKLKAGTNTKKALQEVYNMMSWPGNALLEGWN-- 378 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGAIVYAIGIRVIRSH 333 + + +I+ MTDG + + ++ ++ VY G+ + + Sbjct: 379 --HTRHVIILMTDGLHNMGGDPVSVIHEIRDFLYIGRDRKNPREDYLDVYVFGVGPLVNQ 436 Query: 334 EFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E + A AS + V++ ++ D F + + T + Sbjct: 437 ENINALASKKDKEQHVFRVKDMENLEDVFIQMLDETRTLGLC 478 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 61/427 (14%), Positives = 130/427 (30%), Gaps = 92/427 (21%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +IR F + G I + + + +E S + L ++++ + + Sbjct: 10 SIRRFKQDRSGAFAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQAALA----LT 65 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 E NG+ ++ + ++ +D+D + + + P N+ Sbjct: 66 AEDNGDGAQRNY--------------TLSSDYFRAYMRHDVDVFKPTVIVKSGISPNNQN 111 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S L+ F + S + + + + +D++ V D S SM Sbjct: 112 LSYVEYRVSGQTLQDSWFSSTFFPSFDKQV-VIGDNGAARKFRSNMDVIFVTDFSGSMNE 170 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----------------SNKI 230 F S TK+ + + + +E+ N V G V F + Sbjct: 171 GFGGS-TKLAELKRIVLKLSDELFSYNIDNKV---GFVPFGWGGKEGIDCDFPFVSHGPV 226 Query: 231 EEFFLLEWGVSHLQRKIK-----------------YLSKFGVST------------NSTP 261 L L++ I LS ST S+ Sbjct: 227 PSDILAGGNYKALEKYIDISGSVAAIPNPVHDIQIPLSNVNGSTCLRNSHSWKVPLTSSL 286 Query: 262 GLKYAYNQIFDMQGMRQHCN--------TEDANYKKIIVFMTDG----ENLSTKEDQQSL 309 N + G +K++V ++DG +N+ + + Sbjct: 287 AEINQINGMSAEGGTLVSSGVLLGVPYLASGTASRKVMVIVSDGTDDPKNVMITPNLINA 346 Query: 310 YYCNEAK------KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM-----YDAF 358 C++ + + + IGI + ++ ++C +FYL + + F Sbjct: 347 GMCDKIRQVLSTDESVGKISFIGIAYYPTVDW-KSCVGDKNFYLPQTIDELEEDLRRAVF 405 Query: 359 SHIGKDI 365 +G +I Sbjct: 406 EEVGHNI 412 >gi|126031547|pdb|2ODP|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement gi|126031548|pdb|2ODQ|A Chain A, Complement Component C2a, The Catalytic Fragment Of C3- And C5-Convertase Of Human Complement Length = 509 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 80/217 (36%), Gaps = 28/217 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ Q L++ ++LD S+S+ + +S + M++ + V Sbjct: 1 KIQIQRSGHLNLYLLLDASQSVSEN------DFLIFKESASLMVDRIFSFEIN---VSVA 51 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQ 274 ++TF+++ + L + I L + G TN+ L Y + + Sbjct: 52 IITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMMNNQM 111 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + I+ +TDG++ + + + + N+ + +YAIG+ Sbjct: 112 RLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVG 171 Query: 329 VIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + +E +++++ +++ F H Sbjct: 172 KLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 208 >gi|109732398|gb|AAI15869.1| Vwa2 protein [Mus musculus] Length = 573 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 61/191 (31%), Gaps = 22/191 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ +LD S S+ I + PD V Q GLV Sbjct: 305 GCQAQSLDLVFLLDASASVG------RENFAQMQSFIRKCTLRFDVNPD---VTQVGLVV 355 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++++ F L+ + + R + G ++ L + +++ +Q Sbjct: 356 YGSRVQTAFGLDTHPTRAAVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RGA 409 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 K +V +T G + + + + G V + + + R + Sbjct: 410 RPGVPKAVVMLTGG-----SGAEDAAVPAQKLRGNGISVLVMSVGAVLREAVRRLAGPRD 464 Query: 344 SFYLVENPHSM 354 S V + Sbjct: 465 SLIHVAAYTDL 475 >gi|291236748|ref|XP_002738300.1| PREDICTED: complement factor B-like, partial [Saccoglossus kowalevskii] Length = 648 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 75/227 (33%), Gaps = 23/227 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 ++ LD+ +LD S S+ + + + ++E+V + D Sbjct: 226 SSEGPEGRTIDLNHAGGLDLYFMLDASASVGE------ENFKIGLNFVKRLVEKVGVSAD 279 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAY 267 + G++T+ + + F L ++ + ++ L + T + LK Sbjct: 280 -KGGTRVGVLTYGSDVIINFHLSDDLTSTELVVQALDNIDYATHQGRRGTATKDALKTVR 338 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIG 326 + + + KK + +TDG++ ++ K V+ +G Sbjct: 339 EIMIPQAAASL---VDRSFAKKALFLITDGKSN---IGGDPANEADKLKTEFHVDVHCVG 392 Query: 327 IRVIRSHEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 I + + + P+ + +E+ + S I V Sbjct: 393 ISQASKKQLVDIASKPSRQHLFFIEDYERLEWLISAITDVEVDYSPC 439 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 73/223 (32%), Gaps = 41/223 (18%) Query: 165 NSQTDARLDMMIVLDVSRSME-----------SFFDSSITKIDMAIKSINAMLEEVKLIP 213 N + L ++ V+D S SM+ SS KID+A +I + Sbjct: 403 NRELRPDLAIVFVIDKSGSMDACHCADPDRGAPITSSSERKIDIAKDAIVQAAALLGPQD 462 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 G+VTF F G V + + + G TN GL A + Sbjct: 463 T------VGVVTFDGAASATFPATRGATVEQVMDAVSGVEPRGP-TNIRAGLLRAEEMLQ 515 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +H ++ +TDG + DQ L +++G + + Sbjct: 516 QVDARIKH-----------MILLTDG--WGSGGDQ--LDLAARLREQGITLTVVAAGSGS 560 Query: 332 SHEFLRACAS-PNSFYLVENPHSMYDAF-----SHIGKDIVTK 368 + + A +Y + + F + IG IV + Sbjct: 561 AAYLKQLAAEGGGRYYPAADMAEVPQIFVQETITAIGNYIVEQ 603 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 40/204 (19%), Positives = 67/204 (32%), Gaps = 32/204 (15%) Query: 174 MMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 +++V+DVS SM +KID+A ++ L + Q GL F+ ++ Sbjct: 411 VLLVVDVSGSMGDEVKGTGKSKIDLAKQAAIDSLGQFVPRD------QVGLWQFATHLDG 464 Query: 232 --------EFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L G L ++ L+ T AY + + H + Sbjct: 465 DKDYQELLPVQPLGSNGKETLASRLSGLTPQ-SGTGLYDSSLAAYEYL------KAHLDP 517 Query: 283 EDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-- 337 N +V +TDG N D E ++ I L+ Sbjct: 518 SAINA---VVVLTDGRNEDPGGVDLDHLVPQLRPEGNAESVRLFTIAYGGDADQNVLKQI 574 Query: 338 ACASPNSFYLVENPHSMYDAFSHI 361 A A+ S Y P S+ F+ + Sbjct: 575 AEATAGSEYDSSKPDSINQVFTSV 598 >gi|149601522|ref|XP_001515070.1| PREDICTED: similar to integrin alpha 2 subunit, partial [Ornithorhynchus anatinus] Length = 329 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 84/210 (40%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + + ++ + + P Q GL+ + N+ Sbjct: 111 IDVVVVCDESNSIYPW--------NAVKNFLEKFVQGLDIGPTK---TQVGLIQYGNEPR 159 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + + ++ ++ G S TN+ +++A F + + + Sbjct: 160 VVFNMNKFKTKEEMVQETSRTNQNGGSLTNTFKAIEFARQNAFSL------ASGGRPHAT 213 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEF--------LRAC 339 K++V +TDGE+ ++ + C + I + I + + +E ++A Sbjct: 214 KVMVVVTDGESHDGSNLKKVIDQCEQ---DNIIRFGIAVLGYLLRNELDTKNLITEIKAI 270 Query: 340 ASP---NSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 271 ASSPTEKYFFNVSDEAALLEKAGTLGERIF 300 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 64/171 (37%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 193 TSRALVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 64/171 (37%), Gaps = 26/171 (15%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 193 TSRALVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|238796988|ref|ZP_04640492.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] gi|238719248|gb|EEQ11060.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] Length = 536 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 76/225 (33%), Gaps = 25/225 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N KG + + IFLPI +L + E+S K L I+++ + A + N Sbjct: 25 FIKNDKGAILLPFVIFLPIFIGLLFLSFEISQFLQKKAKLSDAIEQATL--ALTVENNDI 82 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + K D++ + L F I DI Sbjct: 83 PDANQSQKNRDLVTHYATAY-------LPSEKFSTPIIDISN----------NKGHLLYK 125 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD- 189 A + P +F P I + V D++ V+D S SM + F Sbjct: 126 AETTMSYPAQFLANSPLANTKISIADSGAARKDVAVGPSELTDVVFVVDYSSSMINPFHG 185 Query: 190 --SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 S +KID + + ++N G + FS I++ Sbjct: 186 TYGSRSKIDELRDIFYKLNGNILKNDNINT---IGFIPFSWGIKK 227 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 66/201 (32%), Gaps = 23/201 (11%) Query: 173 DMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+M+V+D+S SM F D+S + D ++ ++LE + + + GL+ F + Sbjct: 95 DLMLVVDLSGSMSERDFLDNSGKEQDRL-TAVKSVLETFAAKREGD---RLGLILFGDSA 150 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ G + N E+ +K+ Sbjct: 151 YLQSPFTADHEAWLALLDQAQVGMAGESTHLG------DAVGLTIKTYIDNPENQTVEKV 204 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA--S 341 + +TDG T + A+ G +Y + + + + A + Sbjct: 205 AIILTDG--NDTDSLVPPIDAAKVAQAYGIRLYIVAMGSPNTTGDQAIDFSTIETMATVT 262 Query: 342 PNSFYLVENPHSMYDAFSHIG 362 +L + + + I Sbjct: 263 GGQAFLAMSQEDLDAVYQTIS 283 >gi|34980918|gb|AAH57200.1| Itgax protein [Mus musculus] Length = 304 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 64/173 (36%), Gaps = 23/173 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV---NNVVQSGLVTFSNK 229 D++ ++D S S+ S + K+ +K++ + L+ + ++ + TF+N Sbjct: 152 DIVFLIDGSGSISS---TDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHF-TFNNF 207 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I L + + + T + +K+ ++F Q + + K Sbjct: 208 ISTSSPLS--------LLDSVRQLRGYTYTASAIKHVITELFTTQ------SGARQDATK 253 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +++ +TDG S+ A+ I YAIG+ L S Sbjct: 254 VLIVITDGRKQGDNLSYDSVIP--MAEAASIIRYAIGVGHKDGFPPLPPVTSS 304 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 79/232 (34%), Gaps = 23/232 (9%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P T PW ++ + + + + + L+++ ++D S SM+ K+ + Sbjct: 262 PTLSVTRTPWNPETQLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQ-----DPAKLPL 315 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +S ML ++ V V +G ++ S + + L G ST Sbjct: 316 LKQSFGLMLGRLRPEDQVAIVTYAGSAG---EVLAPTA-ANQRSTILSALDRLDA-GGST 370 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 GL AY +M G + +V TDG+ NL + ++ + Sbjct: 371 AGEEGLALAYRTASEMAGAGEVTR---------VVLATDGDFNLGISDPEELARLVAHER 421 Query: 317 KRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + +G ++A A + S+ +A + + Sbjct: 422 DTGIYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLSG 472 >gi|326922361|ref|XP_003207417.1| PREDICTED: collagen alpha-3(VI) chain-like [Meleagris gallopavo] Length = 3135 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 74/198 (37%), Gaps = 21/198 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + + ++ + + + ++ GLV FS+ + Sbjct: 644 DIIFLLDGS------LNVGNANFPFVRDFVVTLVNNLDV---GTDKIRVGLVQFSDTPKT 694 Query: 233 FFLL-EWGV-SHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + S + +++ L G S N+ L N + + + + Sbjct: 695 EFSLYSYQTKSDIIQRLGQLRPKGGSVLNTGSAL----NFVLSNHFTEAGGSRINEQVPQ 750 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++V +T G + L N+ + G + +A+G+R E + +P Y V+ Sbjct: 751 VLVLVTAGRSADP-----FLQVSNDLARAGVLTFAVGVRNADKAELEQIAFNPRMVYFVD 805 Query: 350 NPHSMYDAFSHIGKDIVT 367 + + + K I T Sbjct: 806 DFSGLTALPQELNKPITT 823 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 100/266 (37%), Gaps = 30/266 (11%) Query: 104 VNDIDDIVRSTSLDIV-----VVPQNEGYSISAISRYKIPLKFCTFIPW-----YTNSRH 153 VN +++ STS+ + V +N+ I+ + ++ T +P Sbjct: 1560 VNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERRVQNILEE 1619 Query: 154 IVMPITSSVK-VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +P T S + D++ +LD S ++ D + + + V I Sbjct: 1620 LPVPTTESPGYPGPGGKKQADIVFLLDGSINLGR---------DNFQEVLQFVYSIVDAI 1670 Query: 213 PDVNNVVQSGLVTFSNKI-EEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + +Q GL +++ + +EFFL ++ + I + G +T A + Sbjct: 1671 YEDGDSIQVGLAQYNSDVTDEFFLKDYSTKPQILDAINKVIYKGGRVANTGA---AIRHL 1727 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +++ + D +I +T G++ S+ + G V+A+G+R I Sbjct: 1728 QAKHFVKEAGSRIDQRVPQIAFIITGGKSSDD-GQGASMEVAQK----GVKVFAVGVRNI 1782 Query: 331 RSHEFLRACASPNSFYLVENPHSMYD 356 E + + + + V + + Sbjct: 1783 DLEEVSKLASESATSFRVSTAQELSE 1808 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 73/191 (38%), Gaps = 23/191 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S ++ S + ++E +++ + G+V +S Sbjct: 236 AQESADLIFLIDGSNNIGS------VNFQAIRDFLVNLIESLRV---GAQQIHIGVVQYS 286 Query: 228 NKIEEFFLL-EWGVS-HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ F L + + +K LS + G N+ L+Y +F G + + Sbjct: 287 DQPRTEFALNSYSTKADVLDAVKALSFRGGKEANTGAALEYVVENLFTQAGGSRI----E 342 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I+V ++ GE+ K+ ++IG+ S E L+ A+ S Sbjct: 343 EAVPQILVLISGGESSD-----DIREGLLAVKQASIFSFSIGVLNADSAE-LQQIATDGS 396 Query: 345 F-YLVENPHSM 354 F + + ++ Sbjct: 397 FAFTALDIRNL 407 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 76/206 (36%), Gaps = 16/206 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + ++ V +Q A + + V D+ ++S ++ + + + +++ + + Sbjct: 13 LGLLLSGFCSVGAQQQAAVRNVAVADIIFLVDSSWNIGKEHFQLVREFLYDVVKALDV-- 70 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N + LV FS F L S + I + G + + GL+Y + Sbjct: 71 -GGNDFRFALVQFSGNPHTEFQLNTYPSNQDVLSHIANMPYMGGGSKTGKGLEY----LI 125 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + + ++I+ +TDG+ + K + A+G++ Sbjct: 126 ENHLTKAAGSRASEGVPQVIIVLTDGQ-----SQDDVALPSSVLKSAHVNMIAVGVQDAV 180 Query: 332 SHEFLRACASP--NSFYLVENPHSMY 355 E + P + +EN +++ Sbjct: 181 EGELKEIASRPFDTHLFNIENFTALH 206 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 86/241 (35%), Gaps = 22/241 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP-LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 +V + + Y S ++P L+ P T S + + V V + D++ Sbjct: 1384 MVQISLSPDYVFQVSSFQELPSLEQKLLAPIETLSADQIRQLLGDVTVPDVSGEEKDVVF 1443 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 ++D S S+ + I+ +++++ + P+ V+ G+V FSN + F L Sbjct: 1444 LIDSSDSVR------TDGLAHIRDFISRIVQQLDVGPNK---VRIGVVQFSNNVFPEFFL 1494 Query: 237 EWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + S + + I+ L + G N+ L Y F ED + ++V Sbjct: 1495 KTHKSKNAVLQAIRRLRLRGGYPVNAGKALDYVVKNYFIKS---AGSRIEDGVPQHLVVI 1551 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 + D + N +G R + ++ P +V++ Sbjct: 1552 LGD------QSQDDVNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQDFTG 1605 Query: 354 M 354 + Sbjct: 1606 L 1606 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 72/205 (35%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + F + + ++E + + + V+ +V +SN I+ Sbjct: 1035 DVVFLIDGSDGVRRGFP-------LLKTFVERVVESLDI---GRDKVRVAIVQYSNVIQP 1084 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 FLL+ + L I+ L+ G S N+ L Y +F + + + Sbjct: 1085 EFLLDAYEDKADLVSAIQALTIMGGSPLNTGAALDYLIKNVFTVSSGSRIAEG----VPQ 1140 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++ +T + + K G + + IGI E P+ V Sbjct: 1141 FLILLT-----ADRSQDDVRRPSVVLKTSGTVPFGIGIGNADLTELQTISFLPDFAISVP 1195 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + + + + + + I K Sbjct: 1196 DFSQL----DSVQQAVSNRVIRLTK 1216 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 82/264 (31%), Gaps = 26/264 (9%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 +I++ ++ + +I + I L C P + Sbjct: 2560 EINNTAVGHAIVLPTSGSQLNDTIRRLLTCHICLDVCEPDPICGYGSQRPVFR----DXA 2615 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + TD DM ++D S S + + K + S + E K+ V Sbjct: 2616 APTDVDTDMAFIMDSSASTTPLQFNEMKKYISHLISNMEISSEPKISQHHARVAVLQQAP 2675 Query: 226 FSNKIEEFFLL--------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + ++ F ++G Q+ I YL T + A + Sbjct: 2676 YEHETNSSFPPVKTEFSLTDYGSK--QKIINYLHNQMTQLYGTMAMGSAV------EHTV 2727 Query: 278 QHCNTEDANYK--KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHE 334 H N + K+IV M G+ +E + +AK +G + +GI Sbjct: 2728 AHIFESAPNPRDLKVIVLMITGKME-KQELEDLREAVIDAKCKGYLFVILGIGRNVDVKN 2786 Query: 335 FLRACASPNSFYL--VENPHSMYD 356 + PN + V P +++ Sbjct: 2787 IYSLASEPNDVFFKLVSKPGELHE 2810 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 82/253 (32%), Gaps = 29/253 (11%) Query: 94 FRNELRDNGFVNDIDDIV-----RSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFI 145 F ++ + N I + ++ Y SA + P Sbjct: 157 FSIDVDTGSYSNSRRMIKMGKRPPADAVREEAFINYFDYHYSAPKSLETPFNVHTEVAPA 216 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW + + + I + +++ +LDVS SM + K+ + S+ + Sbjct: 217 PWNNQRQLLKIGIKG-FDIEKAELKAANLVFLLDVSGSMNA-----PDKLPLLKSSLTML 270 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +++ V VV +G + + + LS G STN G++ Sbjct: 271 TKQLDENDSVAIVVYAGAAG----LVLPATKGNEYQVISNALNNLSA-GGSTNGAQGIEL 325 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY QI ++ N ++ TDG+ N+ +K G + Sbjct: 326 AY-QIASQNFKKEGINR--------VILATDGDFNVGMSSVDALKKLIANKRKTGIALTT 376 Query: 325 IGIRVIRSHEFLR 337 +G ++ L Sbjct: 377 LGFGQGNYNDGLM 389 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 95/267 (35%), Gaps = 36/267 (13%) Query: 83 LCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKI 137 + R+ + +F ++ + N ++ + + Y + R Sbjct: 156 VRRVADEPVSTFSIDVDTAAYANVRRFISEGQTPPRDAVRVEEMINYFDYGYARPGRADE 215 Query: 138 PLKFCTFI---PWYT----NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 P T + PW R IV ++ + L++ ++DVS SM+S Sbjct: 216 PFAVSTAVAASPWSANAGAGGRQIVHIGLQGYELPAGERRPLNLTFMVDVSGSMQS---- 271 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIK 248 K+ +A +++N +++ ++ + + +++ + G L+ + Sbjct: 272 -PDKLGLAQQTMNLIIDRLRPED------RVAVTYYASDVGTAVGPTPGSEKLKLRCAVA 324 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ 307 L+ G ST G+ AY Q +I++F TDG+ N+ +D++ Sbjct: 325 ALNA-GGSTAGAQGMVNAYEQ--------AEAAFSPDKVNRILMF-TDGDFNVGVTDDRR 374 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHE 334 Y + + G + G + Sbjct: 375 LEDYVADKRGTGIYLSVYGFGRGNYQD 401 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 79/215 (36%), Gaps = 40/215 (18%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSME----------SFFDSSITKIDMAIKSINA 204 +P+ S ++ N + ++ +++ +D S SM + I K+D+A +S Sbjct: 389 TLPVDSQIR-NPDEEPQVAVVMAIDKSGSMAACHCEGSKLLEQYPGGIPKVDIAKESAIL 447 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 E + G+V F + + S + K+ + G TN Sbjct: 448 SSETLGPNDIF------GVVAFDTAPRWVVRPEPVTD-KSSIAEKVAGIQGSG-GTNIYG 499 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 GL A + + ++ +H ++ +TDG + D ++A++ G Sbjct: 500 GLAEAIDSLIKVKAKNKH-----------VILLTDGWSNVGNYD----ELISKARRHGIT 544 Query: 322 VYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSM 354 + + S + LR+ A +FY + + Sbjct: 545 ISTVS-AAGGSAQLLRSIAEKGGGTFYNTRDSADI 578 >gi|198421591|ref|XP_002123589.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 401 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 62/213 (29%), Gaps = 34/213 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 LDM+ VLD S S+ D N ++ + Q G++ +S+ Sbjct: 199 DLDMLFVLDGSGSVGK---------DNFEIVKNWTIKVANSFDISDGYTQVGVIQYSHYW 249 Query: 229 -----------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 K E ++ +S +T + L + + Sbjct: 250 ATEPLDKQSYIKTEVPLGKYRNKQEFSAAVRNISLHEYTTYTAHALNK------TVFDFQ 303 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 Q K+++ +TDG + + S N + +A+G+ E Sbjct: 304 QSSRWNRPKTSKVLILLTDGLSTDKQLLPSS---ANYVRSLNITTFAVGVGEANEKELQE 360 Query: 338 AC---ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + Y N + S + I+ Sbjct: 361 IANGQGTNERVYYTSNFAGLNKIVSQLRSAILN 393 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 88/230 (38%), Gaps = 29/230 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 ++S+ +M ++ T+ +++ +VLD S SM+ ++ K+ +++ Sbjct: 17 NQSSSQRQLMLSIAATSEQINTNLPINLCLVLDHSGSMQG------KPLETVKKAALSLI 70 Query: 207 EEVKLIPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 E + + + ++ F ++ I E ++ ++ KI+ L G T G+ Sbjct: 71 ESLGVND------RLSVIAFDHRAKVILPSQSREDDLTLIRSKIQQLQA-GGGTAIDEGI 123 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K + +T Y I +TDGEN +Q+ L A + G + Sbjct: 124 KLGIQE----------SSTGSKGYVSHIFLLTDGENEHGN-NQRCLKLAEVAAEYGITLN 172 Query: 324 AIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G + + L A + S +E P F+ + + + R+ Sbjct: 173 TFGFGDHWNQDILEKIADIAGGSLSYIERPEQALIEFTRLFNRLQSVRLT 222 >gi|149040468|gb|EDL94506.1| rCG57747 [Rattus norvegicus] Length = 539 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 66/183 (36%), Gaps = 25/183 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLV 224 LD++ +LD S S+ + + + + L DVN +V Q GLV Sbjct: 239 GCQAQSLDLVFLLDASASVGGENFAQMQS----------FIRKCTLRFDVNPDVTQVGLV 288 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + N+++ F L+ + R + G ++ L + +++ +Q Sbjct: 289 VYGNQVQTAFGLDTHPTRASVLRAMSQAPYLGGVGSAGTALLHIEDKVMTVQ------RG 342 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K +V +T G + + + + G V + + + E +R A P Sbjct: 343 ARPGVPKAVVMLTGG-----SGAEDAAVPAQKLRDNGISVLVMSVGAVL-KEAVRRLAGP 396 Query: 343 NSF 345 + Sbjct: 397 RDY 399 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 73/200 (36%), Gaps = 22/200 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+D++ +LD S T + ++ + V+ + ++ + G+ ++S + Sbjct: 56 RIDVLFLLDSSAG---------TTPEGFRQAKAFVKRFVQAVLSEDSRARVGVASYSRDL 106 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V L R + + G T + L G T + Sbjct: 107 MVAVPVGEYQHVPDLVRILDSIPFSGGPTLTGSALLQV-----TEHGFGSASRTGQDRPR 161 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V +T+ + + + A+ R ++ +G ++R+ E R S + Sbjct: 162 RVVVLLTE-----SHAQDEVAGPADHARARELLLLGMGNELMRA-ELERITGSRKHVIIH 215 Query: 349 ENPHSMYDAFSHIGKDIVTK 368 +P M+ + + + ++ Sbjct: 216 TDPRDMFSQIPELQRRLCSQ 235 >gi|83814223|ref|YP_446667.1| putative batB protein [Salinibacter ruber DSM 13855] gi|83755617|gb|ABC43730.1| putative batB protein [Salinibacter ruber DSM 13855] Length = 350 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 60/175 (34%), Gaps = 15/175 (8%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +V + LD+++ LDVS SM + D +++ A I +++++ + Sbjct: 70 GTEVRTVERRGLDLVVALDVSASMRAQ-DVPPSRLRRAKNEIRTLVDDLSGD-------R 121 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ L + + + T+ + + + Sbjct: 122 VGLVLFAGSGFVQSPLTTDYGAFRLFLDAAAPDQISTPGTDVSAAVDAGLQAFGAPRPTD 181 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + ++ ++DGEN + D A++ G + G+ Sbjct: 182 DTTAAPEEPRPRALLIVSDGENHAGDLDAAR----QRAEEAGVTLLTAGVGTEDG 232 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 19/101 (18%), Positives = 44/101 (43%), Gaps = 4/101 (3%) Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ + N+ + +I+ +TDG L + +++ +++ GA +Y+IG+ R Sbjct: 469 NVNEQIEEANSGGKKFPSMIIALTDGT-LMPEPYEETKIEAENSRQLGATIYSIGVMDYR 527 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + L SP+ + V+N + I + + K Sbjct: 528 RDQLLSIADSPDHVFGVDNG---FKGLQDIVEPLTAKSCIE 565 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/243 (9%), Positives = 72/243 (29%), Gaps = 43/243 (17%) Query: 3 SLNLNIRNFFYNYKGG-MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 +L + + + + +G + ++T +P + ++G ++++ + + D + Sbjct: 2 ALRMLLGRLWSDQRGNTLALMTLALIP-LVALMGSGLDMTRAYVAQNRFRQACDAGSLAG 60 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + + + S L + I T++ Sbjct: 61 RRMLAGLTLPQAARDEATKYFMFDFPQGYLQSAPYTLTMSVPTAGTLQISSQTTVPTT-- 118 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + + T S +Q D+M V D+S Sbjct: 119 -----------------------LMGLFGFDTLPISTTCSA---TQDFVNTDIMFVFDLS 152 Query: 182 RSMESF---------FDSSITKIDMAIKSINAMLEEVKLIPD----VNNVVQSGLVTFSN 228 SM + S +++ + + + ++ N ++ G V +++ Sbjct: 153 GSMNCAPGVTGYCGDVEQSGSRMGALRSAATSFYDTLETAQSQLAANNLRLRYGFVNYNS 212 Query: 229 KIE 231 + Sbjct: 213 TVN 215 Score = 56.3 bits (134), Expect = 9e-06, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 52/172 (30%), Gaps = 43/172 (25%) Query: 238 WGVSHLQRKIKYLSKFGVS------------TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S + L G + ++T K A + D+ Sbjct: 466 NNRSGFVSYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGF 525 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYY-----------------------------CNEAK 316 + KK ++FMTDG+ T D + CN AK Sbjct: 526 SVKKYMIFMTDGDMSPTWSDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAK 585 Query: 317 KRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 +G ++ I + + + CAS P + + ++ F IG I T Sbjct: 586 AKGIDIWVIAFSTTLTAD-MTNCASKPEQAAGLSSNAALIAKFKEIGSKIAT 636 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 73/210 (34%), Gaps = 38/210 (18%) Query: 174 MMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++ +DVS SM + + ++ + K + + Q GL+ F + Sbjct: 90 LLLAVDVSGSMSTQDMAGNANRLQVVQKVAGDFIRR-------RHGDQVGLILFGTQPYL 142 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L ++ + + + + G T + A ++ D + + + Sbjct: 143 QAPLTTDLNTVGQFLDEAMIGVAGTQTAIGDAIGLAIKRLRDATNVSGR--KGET----V 196 Query: 291 IVFMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------HEF 335 ++ +TDG N + ++ + G +Y IG+ ++ + Sbjct: 197 LILLTDGSNDAGAMPPDEAAKMAA-----AAGLRIYTIGVGSDQTDPFGMGGANDLDEDT 251 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + ++ ++ I + Sbjct: 252 LKLIAKTTGGEYFRATDVENLQQVYTRIDR 281 >gi|225310539|emb|CAQ19230.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 491 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 80/222 (36%), Gaps = 19/222 (8%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N R PITS+V ++ + L++ ++D S S + + + + Sbjct: 16 RNGRRKSKPITSNVIPKNKDENCNLELAFLVDSSESAKDNHGQE----KSFVTDLVNHIP 71 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 ++L ++ L+ +S+ + E+ F GV Q ++ + G T +T + Sbjct: 72 NIRLQTGQGLNFRTALLQYSSHVITEQSFKDWRGVPSFQSRVASIPFIGHGTYTTYAI-- 129 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 R + K+ + M G + D S +AK +G + + Sbjct: 130 -------TNLTRIYLEESGPGTVKVAILMYGGASHPKNPDIFSAL--ADAKNQGIKFFIV 180 Query: 326 GIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G+ + E L+ AS + V N I ++I Sbjct: 181 GLTSAANMEKLQLLASAPASRYVHNIQD-KGVVDKIIREITK 221 >gi|115511022|ref|NP_035838.3| von Willebrand factor precursor [Mus musculus] gi|37784506|gb|AAP41950.1| von Willebrand factor [Mus musculus] gi|225356464|gb|AAI48364.1| Von Willebrand factor homolog [synthetic construct] Length = 2813 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + + ISR Sbjct: 1587 EHSFSPSQGDRVEAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANMQELERISRPI 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1647 APIFIRDFETLPREAPDLVLQTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSLPE-- 1704 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQ 244 + D A + + + P + Q ++ + + W V +HLQ Sbjct: 1705 ----SSFDKMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVP--WNVVQEKAHLQ 1755 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ +I+ M Sbjct: 1756 SLVDLMQQEGGPSQIGDALAFAVRYVTSQ----IHGARPGASKAVVIIIM-------DTS 1804 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM--YDAFSHIG 362 + A+ V+ +G+ LR A P + V + + +G Sbjct: 1805 LDPVDTAADAARSNRVAVFPVGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLSTMATLG 1864 Query: 363 KDIVTKRI 370 K Sbjct: 1865 NSFFHKLC 1872 >gi|86129844|gb|ABC86574.1| VWF [Mus musculus] Length = 2813 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + + ISR Sbjct: 1587 EHSFSPSQGDRVEAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANMQELERISRPI 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1647 APIFIRDFETLPREAPDLVLQTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSLPE-- 1704 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQ 244 + D A + + + P + Q ++ + + W V +HLQ Sbjct: 1705 ----SSFDKMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVP--WNVVQEKAHLQ 1755 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ +I+ M Sbjct: 1756 SLVDLMQQEGGPSQIGDALAFAVRYVTSQ----IHGARPGASKAVVIIIM-------DTS 1804 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM--YDAFSHIG 362 + A+ V+ +G+ LR A P + V + + +G Sbjct: 1805 LDPVDTAADAARSNRVAVFPVGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLSTMATLG 1864 Query: 363 KDIVTKRI 370 K Sbjct: 1865 NSFFHKLC 1872 >gi|56404690|sp|Q8CIZ8|VWF_MOUSE RecName: Full=von Willebrand factor; Short=vWF; Contains: RecName: Full=von Willebrand antigen 2; AltName: Full=von Willebrand antigen II; Flags: Precursor gi|32454887|gb|AAN07781.2| von Willebrand factor [Mus musculus] Length = 2813 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + + ISR Sbjct: 1587 EHSFSPSQGDRVEAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANMQELERISRPI 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1647 APIFIRDFETLPREAPDLVLQTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSLPE-- 1704 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQ 244 + D A + + + P + Q ++ + + W V +HLQ Sbjct: 1705 ----SSFDKMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVP--WNVVQEKAHLQ 1755 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ +I+ M Sbjct: 1756 SLVDLMQQEGGPSQIGDALAFAVRYVTSQ----IHGARPGASKAVVIIIM-------DTS 1804 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM--YDAFSHIG 362 + A+ V+ +G+ LR A P + V + + +G Sbjct: 1805 LDPVDTAADAARSNRVAVFPVGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLSTMATLG 1864 Query: 363 KDIVTKRI 370 K Sbjct: 1865 NSFFHKLC 1872 >gi|148667418|gb|EDK99834.1| Von Willebrand factor homolog [Mus musculus] Length = 2816 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + + ISR Sbjct: 1590 EHSFSPSQGDRVEAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANMQELERISRPI 1649 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1650 APIFIRDFETLPREAPDLVLQTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSLPE-- 1707 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQ 244 + D A + + + P + Q ++ + + W V +HLQ Sbjct: 1708 ----SSFDKMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVP--WNVVQEKAHLQ 1758 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ +I+ M Sbjct: 1759 SLVDLMQQEGGPSQIGDALAFAVRYVTSQ----IHGARPGASKAVVIIIM-------DTS 1807 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM--YDAFSHIG 362 + A+ V+ +G+ LR A P + V + + +G Sbjct: 1808 LDPVDTAADAARSNRVAVFPVGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLSTMATLG 1867 Query: 363 KDIVTKRI 370 K Sbjct: 1868 NSFFHKLC 1875 >gi|86129842|gb|ABC86573.1| VWF [Mus musculus] Length = 2813 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 42/308 (13%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + N ++I + + + V + + ISR Sbjct: 1587 EHSFSPSQGDRVEAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANMQELERISRPI 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--------ARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ Sbjct: 1647 APIFIRDFETLPREAPDLVLQTCCSKEGLQLPTLPPLPDCSQPLDVVLLLDGSSSLPE-- 1704 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQ 244 + D A + + + P + Q ++ + + W V +HLQ Sbjct: 1705 ----SSFDKMKSFAKAFISKANIGPH---LTQVSVIQYGSINTIDVP--WNVVQEKAHLQ 1755 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ +I+ M Sbjct: 1756 SLVDLMQQEGGPSQIGDALAFAVRYVTSQ----IHGARPGASKAVVIIIM-------DTS 1804 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM--YDAFSHIG 362 + A+ V+ +G+ LR A P + V + + +G Sbjct: 1805 LDPVDTAADAARSNRVAVFPVGVGDRYDEAQLRILAGPGASSNVVKLQQVEDLSTMATLG 1864 Query: 363 KDIVTKRI 370 K Sbjct: 1865 NSFFHKLC 1872 >gi|332262981|ref|XP_003280535.1| PREDICTED: integrin alpha-M-like [Nomascus leucogenys] Length = 997 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 77/210 (36%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ K+ + ++ L++ K + + + Sbjct: 143 GCPQEDSDIAFLIDGSGSINPN---DFQKMKEFVSTVMEQLKKSKTLFSLMQYSEEFWTH 199 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K + + T++ G++ ++F++ Sbjct: 200 FTFKEFQDNP------NPRSLVKPIMQLFGRTHTATGIRKVVRELFNITQ------GARK 247 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG+ S + L AS Sbjct: 248 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFHSMNSRQELNTIAS 305 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 306 KPPRDHVFQVNN----FEALKTIQNQLREK 331 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 44/267 (16%), Positives = 93/267 (34%), Gaps = 33/267 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPIT 159 +N+ D+I ++ I + Y + P T + PW T ++ + + + Sbjct: 269 MINNGDNIPYD-AVKIEEMVNYFDYDYP-QPTDEHPFSISTDVAKTPWNTQTQLVRIGLQ 326 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + ++ ++ ++DVS SME K+ + I + ++ ++ V+ VV Sbjct: 327 GK-EYLNEELPASNLTFLIDVSGSMEDH-----NKLPLLISAFKLLVHQLIEKDKVSIVV 380 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMR 277 +G G ++ I L K G ST G+K AY ++ + + Sbjct: 381 YAGAAG------VVLPPTNGDQK-EKIINALQKLEAGGSTAGGQGIKLAY-KLAEKNFKK 432 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF- 335 N ++ TDG+ N+ D + + G + +G + + Sbjct: 433 NGNNR--------VILATDGDFNVGASSDTAMEKLIEKKRASGVFLSVLGFGMGNYKDSK 484 Query: 336 LRACAS--PNSFYLVENPHSMYDAFSH 360 L A + ++ F Sbjct: 485 LETLADKGNGNHAYIDTMQEAQKVFGD 511 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 46/138 (33%), Gaps = 20/138 (14%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 ++ ST T G Y+YN + + N + N Sbjct: 405 TDGFNSWNSRVNTATGSTYDTLG-YYSYN----GAENERFPDGSQGNGVNYRSLLAAAAN 459 Query: 300 LSTKE--------DQQSLYYCNEAKKRGAIVYAIGIRV------IRSHEFLRACAS-PNS 344 S+ D+ + C AK G V+ IG V + ++ CA+ + Sbjct: 460 NSSSYQTISRAMQDELTRQACTNAKTAGIEVFTIGFSVSGDPIDAQGLALMKECATNEDH 519 Query: 345 FYLVENPHSMYDAFSHIG 362 ++ E+ + AFS IG Sbjct: 520 YFKAEDASQLNAAFSQIG 537 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 71/215 (33%), Gaps = 30/215 (13%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 ++ + L + L+LG + + + L D +++ A+++ K Sbjct: 1 MIFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMTESTTNAQAKDQA-- 58 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 ++ N IV ++ + + K+ ++ Sbjct: 59 ----------------QVVLNAQPRMTTAIVTGATVSEDKR--------TVCATAKVTIQ 94 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + + +++ + ++ +VLD S SM S D +KI + Sbjct: 95 NSFMQMAQLATLTPTVKSCANLAGGADPGTTYEIALVLDNSGSMNSSSDGQ-SKISILKS 153 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 + N+ V + +N V+ +V FS+ + Sbjct: 154 AANSF---VDTMFSKSNNVKFSVVPFSSGVAAVDP 185 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 33/87 (37%), Gaps = 2/87 (2%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L S +Q KI L G TN G + + + + +K Sbjct: 341 TQRILQLTQKKSDVQNKINQLVANGA-TNLHEGFMWGWRTLSPNAPFSGGRAYQAPKNRK 399 Query: 290 IIVFMTDGEN-LSTKEDQQSLYYCNEA 315 I+VFMTDG N +++ + + + Sbjct: 400 IMVFMTDGFNSWNSRVNTATGSTYDTL 426 >gi|73958268|ref|XP_848527.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 2 [Canis familiaris] Length = 1153 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 80/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + T + ++ ++++ K N+ L+ FS + Sbjct: 151 DIAFLIDGSGSI------NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQI 199 Query: 233 FFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + +K + + T++ G++ ++F + N KI Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSS------SGARENALKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---N 343 +V +TDGE D + + EA + G I Y IG+ ++ E L AS + Sbjct: 254 LVVITDGEKYGDPLDYKDVIP--EADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRD 311 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 312 HVFRVNN----FEALKTIQNQLQEK 332 >gi|73958266|ref|XP_856286.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 3 [Canis familiaris] Length = 789 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 80/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + T + ++ ++++ K N+ L+ FS + Sbjct: 151 DIAFLIDGSGSI------NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQI 199 Query: 233 FFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + +K + + T++ G++ ++F + N KI Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSS------SGARENALKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---N 343 +V +TDGE D + + EA + G I Y IG+ ++ E L AS + Sbjct: 254 LVVITDGEKYGDPLDYKDVIP--EADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRD 311 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 312 HVFRVNN----FEALKTIQNQLQEK 332 >gi|73958264|ref|XP_547048.2| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 1 [Canis familiaris] Length = 1165 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 80/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + T + ++ ++++ K N+ L+ FS + Sbjct: 151 DIAFLIDGSGSI------NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQI 199 Query: 233 FFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + +K + + T++ G++ ++F + N KI Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSS------SGARENALKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---N 343 +V +TDGE D + + EA + G I Y IG+ ++ E L AS + Sbjct: 254 LVVITDGEKYGDPLDYKDVIP--EADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRD 311 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 312 HVFRVNN----FEALKTIQNQLQEK 332 >gi|114594050|ref|XP_526573.2| PREDICTED: anthrax toxin receptor 2 isoform 6 [Pan troglodytes] Length = 489 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|114594046|ref|XP_001145091.1| PREDICTED: hypothetical protein isoform 4 [Pan troglodytes] gi|114594048|ref|XP_001145172.1| PREDICTED: anthrax toxin receptor 2 isoform 5 [Pan troglodytes] Length = 488 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|114594044|ref|XP_001144933.1| PREDICTED: hypothetical protein isoform 2 [Pan troglodytes] Length = 508 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 79/232 (34%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 81/251 (32%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P R IV+ + S+ ++ +VLD S S+ + + K Sbjct: 216 GHGPGEQQKRKIVLDPSGSM----------NIYLVLDGSGSIGASDFTGAKKC------- 258 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIE--------EFFLLEWGVSHLQRKIKYLSKFG 254 ++ ++ + + GLVT++ + + +W L K Sbjct: 259 --LVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLK 316 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN 313 TN+ L+ Y+ + + +I+ MTDG N+ + Sbjct: 317 SGTNTKKALQAVYS-MMSWPDDVPPEGWNRTRH--VIILMTDGLHNMGGDPITVIDEIRD 373 Query: 314 ---------EAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + + A AS + V++ ++ D F Sbjct: 374 LLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQ 433 Query: 361 IGKDIVTKRIW 371 + + + + Sbjct: 434 MIDESQSLSLC 444 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 37/251 (14%), Positives = 81/251 (32%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P R IV+ + S+ ++ +VLD S S+ + + K Sbjct: 225 GHGPGEQQKRKIVLDPSGSM----------NIYLVLDGSGSIGASDFTGAKKC------- 267 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIE--------EFFLLEWGVSHLQRKIKYLSKFG 254 ++ ++ + + GLVT++ + + +W L K Sbjct: 268 --LVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADWVTKQLNEINYEDHKLK 325 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN 313 TN+ L+ Y+ + + +I+ MTDG N+ + Sbjct: 326 SGTNTKKALQAVYS-MMSWPDDVPPEGWNRTRH--VIILMTDGLHNMGGDPITVIDEIRD 382 Query: 314 ---------EAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + + A AS + V++ ++ D F Sbjct: 383 LLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFYQ 442 Query: 361 IGKDIVTKRIW 371 + + + + Sbjct: 443 MIDESQSLSLC 453 >gi|239926966|ref|ZP_04683919.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291435315|ref|ZP_06574705.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291338210|gb|EFE65166.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] Length = 527 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 73/220 (33%), Gaps = 28/220 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + + V+D S SM + ++D+A KS+ AM + Sbjct: 143 EDGWSLVRVGLATRSAAPGGERPPAALTFVVDTSGSM-----AEPGRLDLARKSLAAMTD 197 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLK 264 ++ V LVTFS++ E + + + L STN G++ Sbjct: 198 RLRDDDSVA------LVTFSDEAETVLPMTRLGGNRDEVHDAVDRLEP-DRSTNLGAGVE 250 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIV 322 Y ++G+R+ +V ++D + + D ++ + G + Sbjct: 251 AGYE--TAVEGLREGATNR-------VVLVSDALANTGETDADAILERISRSRRAHGITL 301 Query: 323 YAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + +G+ + A V + F Sbjct: 302 FGVGVGSDYGDALMERLADRGDGHTVYVSGEEDAREVFGE 341 >gi|225872598|ref|YP_002754053.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793914|gb|ACO34004.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 313 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 76/213 (35%), Gaps = 42/213 (19%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 QT L +++ +D S S+ D + A + + A L + +V Sbjct: 64 ERQTGLPLSIVLAIDTSGSVRKDLD---EEKRAAREFLRATLRPED---------RVEIV 111 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F+ ++ E + + R + LS+ G +T + Y ++ G Sbjct: 112 NFNTRVHEVVPFTNNLKKIDRGLNRLSE-GPATALYAAIAYGSEELAQRPG--------- 161 Query: 285 ANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAI---GIRVIRSHEF----- 335 +K++V ++DG+N + QQ+L + A + +++++ + + Sbjct: 162 ---RKVLVVISDGDNTVANSSYQQAL---DRAVRAETMIFSVIDLPVINDAGRDVGGEHA 215 Query: 336 ---LRACASPNSFYLVENPHSMYDAFSHIGKDI 365 L +Y + ++ F + + Sbjct: 216 MIALSEATGGEYYYEAD--GNLQGVFKRLSTAL 246 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 65/201 (32%), Gaps = 66/201 (32%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + L + ++ KI L G +TN G + + I +N Sbjct: 379 DHTTQRLLQLTTSQTTIKNKIDQLVANG-NTNLQEGFMWGWRTISPNGPFAAGRPYATSN 437 Query: 287 YKKIIVFMT-----------------------------------DG-------------- 297 +K++VFMT DG Sbjct: 438 NRKVMVFMTDGFNHWGAYPNTVVGSDYEALGYYTYNGEKNLRLPDGSRGDRVDYQNALKA 497 Query: 298 -ENLSTKE--------DQQSLYYCNEAKKRGAIVYAIGIRV------IRSHEFLRACASP 342 N ++ D+ +L C AK G V+ IG + E L++CA+ Sbjct: 498 ARNSNSSYLATARDAQDELTLQACTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKSCATN 557 Query: 343 -NSFYLVENPHSMYDAFSHIG 362 + ++ VEN + + AFS IG Sbjct: 558 VDHYFAVENANQLNAAFSSIG 578 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/228 (11%), Positives = 73/228 (32%), Gaps = 31/228 (13%) Query: 9 RNFFYNYKGGMTILTAI-FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++F + G + ++ + +P++F++ + + ++ L D +++ A+++ Sbjct: 14 KSFGADESGNVGMVFGLGLVPVMFMLGATA-DYTRYATTRSALRQATDVAVLTVASKLTA 72 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 N T+ I Q Sbjct: 73 -------------TTTDAQAKAQAQVILNAQPR-------MSTASITTASIATTKQTFCA 112 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + + + T+ + + + + +A ++ +V+D S SM S Sbjct: 113 TSEVTIQ-----NSFMQMARVTSL-TPSVTSCADLAWGANPNATYEVALVVDNSGSMLSS 166 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 D S+TKI + + ++ + + VQ + F+ + Sbjct: 167 -DGSVTKISALKTAAKSFVDTMFAKAP--DRVQFSVTPFAGAVVAVDP 211 >gi|291395335|ref|XP_002714014.1| PREDICTED: integrin alpha 2 [Oryctolagus cuniculus] Length = 1207 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 77/210 (36%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 196 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 244 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + G TN+ +++A + + + Sbjct: 245 VVFNLNTFKTKDGMITATSQTYQYGGDLTNTFKAIQFARDFAYSA------ASGGRPGAT 298 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 299 KVMVVVTDGESHDGSMLRAVIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 355 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 356 ASIPTERYFFNVSDEAALLEKAGTLGEQIF 385 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 59.4 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 74/221 (33%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 IVLD S SM + S + ++ ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 230 -IEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + +++ ++ K TN+ L+ Y+ + Sbjct: 321 WVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDHKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 40/174 (22%), Positives = 56/174 (32%), Gaps = 45/174 (25%) Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 L L+ + L G STN GL + ++ + M +KKI+V MT Sbjct: 342 LTTDFDGLRDAVDDLVADG-STNIPMGLVWGWHTLAPMAPFPDGVPYLTEKHKKIVVLMT 400 Query: 296 DGENL----------------------------------------STKEDQQSLYYCNEA 315 DGEN + D + L C Sbjct: 401 DGENTILYKDTPNGSDYSGVGHARQGRVLDPAGRPITESSSQRERTAALDDRLLKLCANM 460 Query: 316 KK--RGAIVYAI-GIRVIRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDI 365 K + +YAI S L+ CAS + +Y V+N M AF I I Sbjct: 461 KAPAKDIEIYAIRVEVSSGSSSVLQTCASSADHYYDVQNAADMTMAFQSIAGQI 514 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 89/249 (35%), Gaps = 44/249 (17%) Query: 8 IRNFFYNYKGGMTILTA-IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +R + G + I A +P+ LG I+++ K L +D + + AA Sbjct: 13 LRR-LADDGGNVAITVAFAMVPLAIGTLG-AIDLARGASAKVQLQDALDAAALGAARSSA 70 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 N + G+ L R F EL + F D+ Sbjct: 71 NSPDTL----QAAGERLLRQNLALGGDF--ELVSSSFTFGPDN----------------- 107 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 ++R ++ ++ ++ ++ + + V +L++ +VLD + SM Sbjct: 108 ---KVLARAQVRVEP--YVAGLAGVNNMDIAAATEVVR---AGMQLEIALVLDNTGSM-- 157 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--------EEFFLLEW 238 ++ + + A K+ +E V N ++ LV FS+ + ++ + Sbjct: 158 NQNNKLYHLKTAAKAFVTAMETAAEGNTVPNSIKISLVPFSHTVRVDSDAYRNAAWIDQN 217 Query: 239 GVSHLQRKI 247 G S + +I Sbjct: 218 GSSPINNEI 226 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 77/210 (36%), Gaps = 30/210 (14%) Query: 162 VKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +K + + + + I LD S SM + F + ++A++++ E Sbjct: 77 IKGKPEDEKIISNIQICLDSSGSMRADFGGK-NRYEVAMQAVKEFTE-------YREGDA 128 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKI-----KYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 GL F + + + S + +SK+ TN L+ + Q+ Sbjct: 129 FGLTVFGTEYINWVPVTKDTSAIALATPFLAPDRMSKWFGGTNIAKALRGSQQQLL---- 184 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++I+ ++DG + S + ++ E + + Y I I Sbjct: 185 -------QQEDGDRMIILVSDGVSGSPND---TVDMAQELRNNKIVAYCIYIGSGNGSPE 234 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + A A + + V NP ++ + F I K Sbjct: 235 MNALAAITGGQVFGVNNPKALDETFRFIDK 264 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 63/202 (31%), Gaps = 30/202 (14%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----SGLVTFSNK- 229 + +LDVS SM ++I ++ + + + ++TF+ + Sbjct: 344 IFLLDVSGSMAG------SRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFAGRA 397 Query: 230 -------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ + + L + T L+ Y T Sbjct: 398 NDPVDFAVNDPRPGSADLAGVNTFVDGL-RLQDGTAIYSALEAGYRAAGAAVEADPGYLT 456 Query: 283 EDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 IV MTDGEN S + D +S Y A R + I A Sbjct: 457 S-------IVLMTDGENNSGISAADFRSSYQRLPAAARAVRTFTIAFGEADPAALRDISA 509 Query: 341 -SPNSFYLVENPHSMYDAFSHI 361 + + + S+ DAF I Sbjct: 510 DTGGAVFDAR-TSSLADAFKDI 530 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 67/195 (34%), Gaps = 32/195 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 + +VLD S SM S ++ ++ L I + G+VTF + E Sbjct: 309 VCLVLDKSGSMSSE-----DRLFRMNQAAELFL-----IQIIEKGSLVGMVTFDSVAEIR 358 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + T+ GLK + I Q Sbjct: 359 NNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C E K+ G I++ I + + E L + FY Sbjct: 410 IILLTDGEDNEIHS-------CIEEVKQSGVIIHTIALGPSAAKELETLSDMTGGHRFYA 462 Query: 348 VENPHSMYDAFSHIG 362 ++ + + +AFS I Sbjct: 463 NKDINGLTNAFSRIS 477 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 48/173 (27%) Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDANYKKIIVFMTDG 297 + KI +S G ST G+ +A+ + + K+ + MTDG Sbjct: 478 KSKIDQMSA-GGSTAGQIGIAWAWYALSPDFASLFSGEGQPGAYAPSDTL-KVAILMTDG 535 Query: 298 ENLSTKED----------------------------QQSLYYCNEAKKRGAIVYAIGIRV 329 E + D QS+ C + +G +VY +G + Sbjct: 536 EFNTPFRDGVIALDAGTGSGGLDSHIDLNSSNGDPFAQSVALCQAMQAKGVVVYTVGFDL 595 Query: 330 IRSH----------EFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + +R CA+ F+ ++ + +AF IG+DI RI Sbjct: 596 GSATGREGVVDTALDVMRECATNEQTHFFQADDGTDLKEAFRAIGRDITRLRI 648 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 84/250 (33%), Gaps = 36/250 (14%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F +G + ++ A+ LP++ ++ I++ +K L +D + + AA + Sbjct: 19 RFLRQTQGNVAMMFAMALPVLLMITLGAIDIHQASKVKAQLQDALDAAALAAARSTFTDD 78 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N K+ + + + + FV + + ++++ V+ N Sbjct: 79 VNIN--KVGLAALKANMPSYFGE---ASGDTASFVLLNNRVTGEATVNVKVLVANV---- 129 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFF 188 + Y + + S V S+ +++ + LD++ SM+ Sbjct: 130 -VLPPY-----------GKLLDDFLPVSSRSEVLRASRN---VEVAMALDITGSMDNCSR 174 Query: 189 DSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + T K++ + +++ V + L+ + GV+ I Sbjct: 175 NCPPTSKLEDLQAAAKELVDIVVQDQQTPFYSKVALIPY----------AAGVNVGSSAI 224 Query: 248 KYLSKFGVST 257 +T Sbjct: 225 SARGPLDTTT 234 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/250 (20%), Positives = 85/250 (34%), Gaps = 54/250 (21%) Query: 154 IVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 IV+ + S + ++++I +D+S SM + D S +++ A + L Sbjct: 67 IVLARPQTSDSYSNSTTEGINIVIAMDISGSMLA-RDLSPNRLEAAKDVGIEFI----LS 121 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG-----VSTNSTPGLKYAY 267 +N GLV F+ + + S+ + + T GL A Sbjct: 122 RPNDNF---GLVVFAGESFTQCPI---TSNHASLVNLFKQVDFGIIQDGTAIGLGLATAI 175 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N+I D +G K+I+ +TDG N D + A+ G VY IG+ Sbjct: 176 NRIKDAEGKS-----------KVIILLTDGTNN--TGDIAPISAAQIAQSYGIRVYTIGV 222 Query: 328 RVIRSHEF----------------------LRACAS--PNSFYLVENPHSMYDAFSHIGK 363 E L+ AS ++ N S+ +S I K Sbjct: 223 GTQGIAEVPMLDQFGNIHYTEAEVVIDETTLQQIASTTGGKYFRATNVSSLKQIYSEIDK 282 Query: 364 DIVTKRIWYD 373 TK + Sbjct: 283 MEKTKLKTEN 292 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 65/191 (34%), Gaps = 21/191 (10%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 PW + + + + + + +++ +LDVS SM ++ + + Sbjct: 152 APWNPQHQLVHIGLQGK-TLAIEELPPSNLVFLLDVSGSMN-----QPNRLPLLKEGFKL 205 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +++++ V V +G + + I L G ST G+K Sbjct: 206 LVDQLSEQDTVAIAVYAGAAG----VVLPPTPGNEKQKIIAAIDGLQAQG-STAGGEGIK 260 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY M ++ ++ TDG+ N+ D + + + RG + Sbjct: 261 LAYELATRMLSEGKNNR---------VILATDGDFNVGVSSDAELVRLIESYRDRGIYLT 311 Query: 324 AIGIRVIRSHE 334 +G + + Sbjct: 312 VLGFGMGNYKD 322 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 57/369 (15%), Positives = 120/369 (32%), Gaps = 42/369 (11%) Query: 7 NIRNFFYNYKGGMTILTAIFLPI---IFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 ++RNF Y I + + I F G + ++ F + Sbjct: 43 SVRNFLYILVNSFNISSDVLRHIRQFQFKPGGKKMGLALKFILDHHFQEASGSRASQGVP 102 Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 QI + + G LR G + + + ++ + Sbjct: 103 QIAVVISSGPVEDHVHGP-------------VEALRRAGILLYAIGVRDAVWAELREIAS 149 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHI--VMPITSSVKVNSQTDARLDMMIVLDVS 181 + + ++ L + P V + A D++ ++D S Sbjct: 150 SPQENFTSFVPNFSGLSNLAQKLRQELCDMLAKAAPHVDHVSPACREAALADIVFLVDNS 209 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-- 239 S+ + K+ + S+ L ++ V+ GL +++ I F L Sbjct: 210 TSIG---PQNFQKVKNFLYSVILGL------DISSDRVRVGLAQYNDNIYPAFQLNQHPL 260 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 S + +I+ L TN+ L++ N + + G R +I++ +TD E Sbjct: 261 KSMILEQIQNLPYRTGGTNTGSALEFIRTNYLTEESGSRA-----KDRVPQIVILVTDRE 315 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYD 356 + + + K+ G +VY +G+ V E + + P + EN + + D Sbjct: 316 -----SNDEVQEVADRLKEDGVVVYVVGVNVQDVQELQKIASEPFEKFLFNTENFNILQD 370 Query: 357 AFSHIGKDI 365 I + + Sbjct: 371 FSGSILQTL 379 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 81/235 (34%), Gaps = 32/235 (13%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P Y + + + ++ D++ ++ E F + + Sbjct: 628 MIDPDYRIPKTLEVIACATAV-------PADLIFLI------EEFSRVRQPNFQQVVNFL 674 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYL--SKFGVSTN 258 ++ + + PD V+ GLV +S + F L+ + + + L + T Sbjct: 675 KTIVSSLSIRPD---TVRFGLVFYSEEPRLEFSLDAFQNPAKILEHLDKLTYRERKGRTK 731 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L + N++F + + ++I V +T+G + + ++ Sbjct: 732 TGAALDFLRNEVFIQEKGSWSNHG----VQQIAVVITEG-----FSQDRVSRPASRLRRA 782 Query: 319 GAIVYAIGI-RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 G +YA+G V S + + + P Y V P + S +G + + Sbjct: 783 GVTIYAVGTHNVSESKDLEKIASYPPWKYSV--PLESFLQLSVVGSKLTHQLCSE 835 >gi|297170900|gb|ADI21918.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [uncultured gamma proteobacterium HF0130_26L16] Length = 563 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 84/211 (39%), Gaps = 39/211 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 + +LD S SM + +I +A +++++++E+ +++G+ + ++ Sbjct: 30 LYFILDASGSMWERVEGKP-RIVIAKETLSSLIEKTPAE------IRTGITAYGHRRKFD 82 Query: 230 ---IEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 I+E L+ +Q +I LS G T T ++ +++ +G Sbjct: 83 CSDIQEIVPLKSLDPMTKYQVQERISTLSAMGK-TPITDSIRQTVDRLKTEEGRS----- 136 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFLRACA 340 IV ++DG + +E K G +++ +G + + E +C Sbjct: 137 -------TIVLISDG---LESCGKDPCALTSELKSSGINFVMHVVGFGLTQEQEQKLSCI 186 Query: 341 S---PNSFYLVENPHSMYDAFSHIGKDIVTK 368 S F+ N + DA + + + +V + Sbjct: 187 SAAGEGKFFTAGNAADLLDALTVVKESVVEQ 217 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 79/204 (38%), Gaps = 29/204 (14%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI-PWYTNSRHIVMPITSSVKV 164 D + + +S D G +++ + + + HI P + + Sbjct: 115 DDEQVEPPSSGDQAAASFRSGGTLAVTTHTEYSAVARDSSRDNFAVLVHIKAPGMTDAEA 174 Query: 165 NSQT--DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + A LD++ VLDVS SM K+ + +++ +++ + + Sbjct: 175 AAGDAPRAPLDLVTVLDVSGSM------IGAKLALLKQAMGFVIDNLGPHD------RLS 222 Query: 223 LVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V+FS++ L G + + ++ L G TN GL+ A + R+ Sbjct: 223 VVSFSDRARRVTRLLRMSGDGKAAAKSAVESLVARG-GTNIAEGLRTAARVL----EERR 277 Query: 279 HCNTEDANYKKIIVFMTDGENLST 302 H NT + ++ ++DG++ T Sbjct: 278 HRNTVSS-----VILLSDGQDTYT 296 >gi|156120138|ref|NP_001095285.1| complement C2 [Sus scrofa] gi|148724911|emb|CAN87699.1| complement component 2 [Sus scrofa] Length = 752 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 81/250 (32%), Gaps = 31/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 T ++ K+ Q L++ ++LD S+S+ Sbjct: 215 DVTPALGTSFSHLLGATNPTQTKKTENVGR---KIQIQRSGHLNLYLLLDASQSVSE--- 268 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S + +++ + + ++TF++K + L+ + + Sbjct: 269 ---EDFGVFKRSASLLVDRIFSFEIN---ISVAIITFASKPKIIMSVLKDKSRDVTEVVH 322 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL- 300 L + G TN L Y + + + I+ +TDG++ Sbjct: 323 SLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAIILLTDGKSNM 382 Query: 301 -----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + E +K +YAIG+ + +E +++++ Sbjct: 383 GGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKDGERHAFILKD 442 Query: 351 PHSMYDAFSH 360 ++ F H Sbjct: 443 TEALSQVFEH 452 >gi|38455778|gb|AAR20890.1| complement C2 [Sus scrofa] Length = 734 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 81/250 (32%), Gaps = 31/250 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 T ++ K+ Q L++ ++LD S+S+ Sbjct: 215 DVTPALGTSFSHLLGATNPTQTKKTENVGR---KIQIQRSGHLNLYLLLDASQSVSE--- 268 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIK 248 + +S + +++ + + ++TF++K + L+ + + Sbjct: 269 ---EDFGVFKRSASLLVDRIFSFEIN---ISVAIITFASKPKIIMSVLKDKSRDVTEVVH 322 Query: 249 YLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL- 300 L + G TN L Y + + + I+ +TDG++ Sbjct: 323 SLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLGMNTMAWQEIRHAIILLTDGKSNM 382 Query: 301 -----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVEN 350 ++ + + E +K +YAIG+ + +E +++++ Sbjct: 383 GGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEVDWRELNELGSKKDGERHAFILKD 442 Query: 351 PHSMYDAFSH 360 ++ F H Sbjct: 443 TEALSQVFEH 452 >gi|332815788|ref|XP_003309587.1| PREDICTED: collagen alpha-3(VI) chain [Pan troglodytes] Length = 2570 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 472 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 527 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|240255535|ref|NP_476507.3| collagen alpha-3(VI) chain isoform 4 precursor [Homo sapiens] Length = 2570 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 472 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 527 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|219841772|gb|AAI44596.1| COL6A3 protein [Homo sapiens] gi|223462379|gb|AAI50626.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 472 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 527 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|219521324|gb|AAI71790.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 472 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 527 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|119591516|gb|EAW71110.1| collagen, type VI, alpha 3, isoform CRA_h [Homo sapiens] Length = 2977 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1320 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1374 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1375 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1434 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1435 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1485 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1486 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1540 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1541 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1595 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1596 AFRVGNVQELSE 1607 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1171 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1230 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1231 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1281 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1282 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1339 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1340 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1392 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1393 VFTVREFRELPNIEERIMNSF 1413 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 829 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 878 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 879 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 933 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 934 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 971 >gi|119591509|gb|EAW71103.1| collagen, type VI, alpha 3, isoform CRA_a [Homo sapiens] Length = 2211 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1320 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1374 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1375 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1434 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1435 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1485 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1486 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1540 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1541 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1595 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1596 AFRVGNVQELSE 1607 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1171 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1230 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1231 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1281 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1282 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1339 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1340 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1392 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1393 VFTVREFRELPNIEERIMNSF 1413 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 829 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 878 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 879 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 933 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 934 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 971 >gi|114584073|ref|XP_001153410.1| PREDICTED: alpha 3 type VI collagen isoform 2 [Pan troglodytes] Length = 2977 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1320 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1374 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1375 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1434 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1435 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1485 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1486 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1540 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1541 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1595 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1596 AFRVGNVQELSE 1607 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1171 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1230 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1231 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1281 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1282 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1339 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1340 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1392 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1393 VFTVREFRELPNIEERIMNSF 1413 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 829 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 878 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 879 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 933 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 934 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 971 >gi|62088852|dbj|BAD92873.1| alpha 3 type VI collagen isoform 5 precursor variant [Homo sapiens] Length = 1702 Score = 59.4 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 45 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 99 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 100 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 159 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 160 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 210 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 211 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 265 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 266 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 320 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 321 AFRVGNVQELSE 332 >gi|313238993|emb|CBY13979.1| unnamed protein product [Oikopleura dioica] Length = 676 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/184 (18%), Positives = 64/184 (34%), Gaps = 24/184 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + L C + + + + I + V++ S D+M V+D S S+ S Sbjct: 171 MPISGNNDSQLDICDQLTYCDTTLFENSSRFIRTDVEL-SCQQRDNDIMFVVDSSGSVGS 229 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + L+EVK + + + G+V F+ E + L+ G+ Sbjct: 230 ----------AGFARVRNFLKEVKSY--MGSTSRIGVVRFATNSELIWGLDDGLDGFTNA 277 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + ++ T + GL AY+ + + K IV MTDG + Sbjct: 278 VTNMAWTMGGTYTAKGLDLAYDHMLL---------RGRRSATKTIVLMTDGYTSNQSAYD 328 Query: 307 QSLY 310 + Sbjct: 329 AEIK 332 >gi|149033988|gb|EDL88771.1| similar to Protein KIAA1510 precursor (predicted) [Rattus norvegicus] Length = 1320 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 60/174 (34%), Gaps = 23/174 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ ++D S S+ + +++ + PD VQ GL +S Sbjct: 174 TPADIIFLVDGSWSIGHN------NFQQVKDFLASIITHFAIGPDK---VQVGLTQYSGD 224 Query: 230 IEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L + + +L G +T + L + + + Sbjct: 225 PQTEWDLNSFHTKEQVLAAVHHLHYKGGNTFTGLALTHV------LGQNLKPAAGVRPEA 278 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K+++ +TDG K K + V+ +G++ + E L+ AS Sbjct: 279 AKVLILVTDG-----KSQDDVRTAARVLKDQDIDVFTVGVKNVDEAE-LKLLAS 326 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 28/195 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ V+D S SM +I A S + + ++ LV+F+ Sbjct: 512 PIDLAFVIDESGSMGG------ARIQDAKASAKRFVGGLYEDD------RAALVSFAGGA 559 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + I L+ TN+ GL+ A +++ Sbjct: 560 TLGQSLTTDHGAVNASIDQLNAG-GGTNTGAGLQKAVDELTSNGEGDTQE---------- 608 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRAC-ASPNSFYLV 348 I+ + DG T + A + + IG+ I + E A+ FY V Sbjct: 609 IILLADG---GTGLGPDPVTIAQTADEHRITINTIGMGTGIDAQELTSIADATGGEFYQV 665 Query: 349 ENPHSMYDAFSHIGK 363 + + + F + + Sbjct: 666 SDSSELPEVFDRVEQ 680 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 73/199 (36%), Gaps = 26/199 (13%) Query: 172 LDMMIVLDVSRSM---ESFFDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++++ LD+S+SM + D +I ++D + + + + G+V F Sbjct: 87 INIILALDLSKSMAALDFKLDGAIVNRLDAVKNVVKDFIMKRSGD-------RIGMVVFG 139 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ L + + L G ST + + ++ D++ Sbjct: 140 SEAFTQMPLTRDYDTIAFVLSRLKIGAAGPSTAIGDAMGISLKRLEDVKSKSN------- 192 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 I++ +TDG++ + + A++RG VY IG+ FL Sbjct: 193 ----IVILLTDGKSN--SGEITPGAAADIARERGVKVYTIGVGQRGKAPFLVNDPLFGQR 246 Query: 346 YLVENPHSMYDAFSHIGKD 364 Y+ + ++A I Sbjct: 247 YVYQMVDMDHEALKEIADK 265 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/254 (18%), Positives = 89/254 (35%), Gaps = 43/254 (16%) Query: 136 KIPLKFCTFIPWY-TNSRHIVMPI-----TSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 K+P+ + R + + + V++++ L++ VLD S SM Sbjct: 2 KMPMISLIPMHGAIAQGRSVTLDVLIKIEPPLVELDNNARPPLNLGFVLDKSGSMHGN-- 59 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKI 247 K+D A ++I +E++ + L F ++E L L I Sbjct: 60 ----KLDYAKQAIAYAIEQLLPSD------RLSLTLFDTQVETKIPSTLATDKQRLLETI 109 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKED 305 K L + G ST G + G++ + + + ++ ++DG T D Sbjct: 110 K-LIRSGSSTALHDG--------WVQGGIQVGQYLNNDHLNR-VILLSDGLANVGETNPD 159 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFS---- 359 + K G A+G+ + L A A +F+ + +P + F Sbjct: 160 VIASDVHGLMK-TGISTSALGVGRDYDEDLLEAIARSGDGNFFHIASPEDLPQIFETELQ 218 Query: 360 ----HIGKDIVTKR 369 IG+ + K Sbjct: 219 GLATTIGRSVTLKL 232 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 75/203 (36%), Gaps = 33/203 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD S SM K+ +++ +L +++ + N +V FS++I Sbjct: 306 VVFVLDSSASM------VGAKLKQTKEALFTILHDLRPEDNFN------IVGFSSRIKVW 353 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ + + + I ++S G TN L+ + D H + + + Sbjct: 354 KDQLVPVTPNSIRDGKVYIHHMSPSG-GTNINGALQTGIRLLND---FVAHNDIDARSVS 409 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 +IVF+TDG + Q+ N K+ ++ IGI + L C Sbjct: 410 -LIVFLTDGRPT--VGEIQTPKILNNTKEAARDRVCLFTIGIGDDVDFKLLEKLSLENCG 466 Query: 341 SPNSF-YLVENPHSMYDAFSHIG 362 F + + + IG Sbjct: 467 MTRRFQVEADAAAQLKGFYDEIG 489 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 85/262 (32%), Gaps = 29/262 (11%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 + +F ++ + + S+ + + Y + P Sbjct: 182 SVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMINYFPYDWPGPDNAEQPF 241 Query: 140 KF---CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K PW ++ + + I + T +++ ++DVS SM+ K+ Sbjct: 242 KATVTVMPTPWNHDTELMHVAIKG-YDIAPATTPHANLVFLIDVSGSMDE-----PDKLP 295 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + + ++ +K V+ V +G + S + I L G S Sbjct: 296 LLKSAFRLLVNRLKPDDTVSIVTYAG----NAGTVLTPTRVAEKSKILSAIDRLEA-GGS 350 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEA 315 T G++ AY + ++ N ++ TDG+ N+ D+ E Sbjct: 351 TGGAEGIEAAY-DLAKQGFVKDGVNR--------VMLATDGDFNVGPSSDEDLKRIIEER 401 Query: 316 KKRGAIVYAIGIRVIRSHEFLR 337 +K G + +G ++ L Sbjct: 402 RKDGIFLTVLGFGRGNLNDSLM 423 >gi|13365523|dbj|BAB39135.1| CD11-2 [Cyprinus carpio] Length = 1187 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 81/210 (38%), Gaps = 23/210 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + ++++ + D S SM + DM I +++++ N+ ++ Sbjct: 149 TAAYQECSKREVNLVFLFDGSSSM------KAVEFDMNKNFIKDVMKKLS-----NSSIK 197 Query: 221 SGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V FS +I F ++ + K+ TN+ + Y + + Sbjct: 198 FAAVQFSTEIRTVFDFNDYQNGSAEEKLMKERHMKSLTNTYKAINYVLKNVLNSVSSGA- 256 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D N +K +V +TDG+ S +D L C+ ++ + Y IG+ + + Sbjct: 257 ----DPNAQKALVIITDGDP-SDNDDYNILNICD---EQNILRYIIGVGKVDLTTLTQLA 308 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 A P N+ + ++ + + ++ K I Sbjct: 309 AEPKLNNTFYIQEYNGLKGLLDNLQKKIYN 338 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 84/228 (36%), Gaps = 37/228 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + I ++ S DA + + ++ D+S SM KID + ++I K Sbjct: 120 LDQNIPQQIRHFSSEDAPVSIGVIFDMSGSMS-------NKIDKSREAIVEF---FKTAN 169 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + +V F++K E + +Q K+ L T+ + N++ Sbjct: 170 PDDEFF---VVAFNDKPEVLQDFTNRIEDIQEKLTILQPKDR-TSLLDAIYLGMNKMRQA 225 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVI- 330 + R K ++ ++DG + ++ + + + ++ +YAIGI Sbjct: 226 KYER-----------KALLIISDGGDNHSRYTENEIK--SMVREADVQIYAIGIYDLAPT 272 Query: 331 -----RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L + + + ++N + + D + IG ++ + + Sbjct: 273 TTEEMAGPALLGEISDWTGGRMFPIDNVNELADVATKIGVELRNQYVL 320 >gi|268558414|ref|XP_002637197.1| Hypothetical protein CBG09720 [Caenorhabditis briggsae] Length = 630 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 89/253 (35%), Gaps = 37/253 (14%) Query: 117 DIVVVPQNEGYSISAISRY-----KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + + + + + + S + K P+ + + + S+ S T Sbjct: 393 NEIEELNGKNFKVRSRSVHFAMTEKPPVTTAMNPMKFFTTSRTPITTAKSLIPYSCTA-- 450 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-- 229 D+ ++D+S + D S +D+A +I+++ P V+ GL+++S Sbjct: 451 -DVFFLVDLS---QGTGDKSQQYLDIAASAISSL-------PISQEAVRVGLISYSGPGR 499 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L ++ + + G +T + ++YA + H N Sbjct: 500 THVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADAIRYA----TKIFEGMAHPA--RKNV 553 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACASPN 343 KK++V TDG + D A+ +G + A+ + + Sbjct: 554 KKVLVVFTDGYSQDHPRDA-----ARGARAKGLQLIAVAVKDRLAPPDEEQLAEIGGHAK 608 Query: 344 SFYLVENPHSMYD 356 + ++ N + + Sbjct: 609 NVFISPNGRELRE 621 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 71/217 (32%), Gaps = 31/217 (14%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 K +K + + + + S ++ A LD+++VLDVS SM F ++ Sbjct: 69 KASVKPYYPKEAPLGASTVRVLLDVSSSSSTAGRAALDLVVVLDVSGSMRDFG-----RL 123 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH----LQRKIKYLS 251 D ++ +++++ + + +VTF+ L L + L Sbjct: 124 DKLKSAMRFIIKKLAPMD------RLSVVTFNGGATRECPLRAMSEDAVPVLTDIVDGLV 177 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 G TN GLK + + ++ M+DGE S + + Sbjct: 178 ARG-GTNIEAGLKMGLQVLDGRRYTGARTAG--------VILMSDGEQNS-GDATRVRNP 227 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 N VY + L+ A Y Sbjct: 228 QNYP------VYTLSFGSNADMNLLQKLAGGGGTYNP 258 >gi|198426775|ref|XP_002120099.1| PREDICTED: similar to SD03168p [Ciona intestinalis] Length = 1474 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 82/223 (36%), Gaps = 33/223 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S +++ ++D++ +LD S S+ + ++ + I + + N V Sbjct: 155 SISRIDPFECPKVDILFLLDGSGSIVE------SDFEIMKEWIENITLSFDI--SSNGSV 206 Query: 220 QSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 GL+ FS+ K E + + +K ++ +T + L+ + Sbjct: 207 AVGLMQFSHFSLTKTEFQIGMFTTKEEIMAAMKNVTIKKGNTYTADALRR------SIAV 260 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-------- 327 ++ D N +K+IV +TDGE T + + + +G + A+ I Sbjct: 261 FQKSSRYNDTNTRKVIVLLTDGEATDTASLSST---ADLVRSQGITITAVLITEKVLPSE 317 Query: 328 --RVIRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + + +P+ ++V ++ I + I Sbjct: 318 RSAAVAQMQLIVNGVAGNPSGVFVVGTTANLDSVIRAITQRIQ 360 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K+I +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVIFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + R+ + +VLD S SME I + IN + V+ + Sbjct: 11 DLENNPSTRVPVCLVLDTSGSMEGQP------ISELNEGINCFYDAVRSDETALYAAEIA 64 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +VTF +L+ S L+ + + F T + A + + +G ++ Sbjct: 65 VVTFGG----SAVLKTDFSTLEHQPDSPNFFANGGTPMGEAMNMALDLLEKRKG--EYKA 118 Query: 282 TEDANYKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIR 328 + Y+ IV MTDG+ + + C K R ++ IGI Sbjct: 119 SGVDYYQPWIVLMTDGKPNGDSSEYARAVQRTCEMIKNRKLTIFPIGIG 167 >gi|157817857|ref|NP_001102478.1| procollagen, type VI, alpha 3 [Rattus norvegicus] gi|149037629|gb|EDL92060.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_a [Rattus norvegicus] Length = 2207 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 112/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 1518 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGI--GDRN 1575 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I R+ I P+ + ++P + + S P + Sbjct: 1576 IDRTDLQTITNDPR---LVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEK 1632 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1633 KKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNSD 1683 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + ++ I ++K G N+ G+++ + + + + D Sbjct: 1684 PTDEFFLRDFSTK-RQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 79/203 (38%), Gaps = 25/203 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + + A D++ ++D S + + + + + ++ ++E + + + Sbjct: 25 AQQQSDVKNGAAADILFLVDSS------WSAGKDRFLLVQEFLSDVVESLSVGDND---F 75 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGM 276 LV + FLL + I +S G S + GL+Y ++ + + G Sbjct: 76 HFALVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGS 135 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R ++IV +TDG+ E+ E K V+A+G+ L Sbjct: 136 RA-----ADGVPQVIVVLTDGQ----SEEDGFALPSAELKSADVNVFAVGVEDAD-ERTL 185 Query: 337 RACAS---PNSFYLVENPHSMYD 356 R AS + +EN S++D Sbjct: 186 REIASEPLSMHVFNLENVTSLHD 208 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 628 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 675 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y + Sbjct: 676 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSY----VHANHF 731 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 732 TEAGGSRIREHVPQLLLLLMAG-----PSEDVYLQAANALVRSGVLTFCVGTNQADKAEL 786 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 R +P+ YL+++ ++ + + + T Sbjct: 787 ERIAFNPSLVYLMDDFSALPALPQQLIQPLTT 818 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 83/237 (35%), Gaps = 24/237 (10%) Query: 124 NEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + Y S + ++P + T I T+ + + ++ + D++ ++D Sbjct: 1382 SPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADIVFLIDS 1441 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S +M+ I ++ ++ + + P+ V+ G+V FSN + F L+ Sbjct: 1442 SDAMKPDG------IAHIRDFVSRIVRRLNIGPNK---VRIGVVQFSNDVFPEFYLKTHK 1492 Query: 241 S--HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S + I+ L G S N+ L++ +F ED + +++F+ Sbjct: 1493 SQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL--- 1546 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 K + G + IG R I + P + V + Sbjct: 1547 ---GGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1600 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 81/222 (36%), Gaps = 33/222 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S + +Q A D++ ++D S ++ D+ + +LE + + Sbjct: 226 AGDKESLKDITAQDSA--DIIFLIDGS------QNTGKANFDVIRDFLVNVLERLSV--- 274 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIF 271 N V+ G+V +S++ F L+ S + +K LS G N L + F Sbjct: 275 GNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANIGQALDFVVENHF 334 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R + + +++V ++ G + D A K+G V++ G+ Sbjct: 335 T----RTGGSRVEEGVPQVLVLISAGPSSDEIRDAVV------ALKQG-SVFSFGLGAQA 383 Query: 332 SHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + L+ A+ ++ F G D+ + + Y Sbjct: 384 ASRVELQHIATDDNLVFT------VPEFRSFG-DLQEQLLPY 418 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + P+ V+ LV +S++ Sbjct: 1027 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPNQ---VRVALVQYSDRTRP 1076 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R + Sbjct: 1077 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRI-----EEGVP 1131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 1132 QLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAI 1186 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 PTFREL----GTIQQVISERVIQLNR 1208 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 74/225 (32%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + L+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSGSLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 74/225 (32%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + L+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSGSLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 74/225 (32%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + L+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSGSLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 74/225 (32%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + L+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSGSLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 79/225 (35%), Gaps = 37/225 (16%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P N++ +++ +T ++ + LD++ VLDVS SM+ I K+ A+K + Sbjct: 35 NMAPLEENTQKVLLELTG--GDSTSDRSGLDLVAVLDVSGSMQ---GEKIDKMKTAMKFV 89 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTN 258 L + + +VTF + L Q + I L+ G +TN Sbjct: 90 VKKLSSID---------RLSIVTFMDTATRICPLRQVTDASQPELLGLIDALNP-GGNTN 139 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 T GL+ + D ++ M+DG+ K Sbjct: 140 ITDGLQTGLKVLADRNLSSGRVVG--------VMLMSDGQQNRGGNAADV-------KIG 184 Query: 319 GAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSH 360 A VY G L A A +F +V + + AFS Sbjct: 185 NAPVYTFGFGADYDPTVLNAVARNSMGGTFSVVNDVDKLSMAFSQ 229 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 60/371 (16%), Positives = 121/371 (32%), Gaps = 51/371 (13%) Query: 35 GMIIEVSHIFFMKTVLHSM---IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN 91 + I S + S ID SL + N + D IK Sbjct: 84 ALAIHKSEELHQLKQVQSSERLIDASLAAKQRSAKHVINTHYVAAPIASDAWYGIKQPER 143 Query: 92 MSF----------RNELRDNGFVNDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RY 135 F E+ + F D+D ST ++ P+ I + Y Sbjct: 144 NRFEKQIQNGIMVAGEIPISTFSIDVDTGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTY 203 Query: 136 KIPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSME 185 PL P+ + P ++ +++ +LDVS SM Sbjct: 204 DYPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFLLDVSGSM- 262 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 +S K+ + ++ + +++ V+ VV +G + + L Sbjct: 263 ----ASADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAAG----VVLDGASGDDIQALTY 314 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKE 304 ++ L G STN + G+ AY ++H N ++ TDG+ N+ Sbjct: 315 ALEQLRA-GGSTNGSQGILQAYQL------AQKHFIQGGINR---VILATDGDFNVGVTN 364 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 Q + + K+RG + +G + + + + A + + ++ +A + Sbjct: 365 FDQLISLIEKEKQRGIGLTTLGFGMGNYNDQLMEQLADKGNGHYAY-IDTLNEARKVLVD 423 Query: 364 DIVTKRIWYDK 374 ++ + + K Sbjct: 424 ELSSTLLTIAK 434 >gi|313227391|emb|CBY22538.1| unnamed protein product [Oikopleura dioica] Length = 455 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 66/181 (36%), Gaps = 23/181 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +D+ ++D S S+ + IT + + V + ++ + Sbjct: 259 EDNVAIDLAFIIDSSSSIGEDNFNLITSF---------VGDIVSNFTISETLTHIAVLRY 309 Query: 227 SNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++++ + S + I ++ G T + L YA N + + Sbjct: 310 NSQVSPVLWFDTFNTKSEILAAISAINYSGSGTRTGKALGYAANYVLN------PTYGAR 363 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + + +TDGE+ + A K + V AIGI + E ++ +SP S Sbjct: 364 EEVPTVTLVITDGESQDE------ITTAARALKSKSAVMAIGIGDAKETELVQMASSPLS 417 Query: 345 F 345 F Sbjct: 418 F 418 >gi|296156498|ref|ZP_06839336.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295893097|gb|EFG72877.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 446 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 99/295 (33%), Gaps = 43/295 (14%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++I+ A+ L + +LG+ ++ + +K L + D +++ A + N Sbjct: 9 RRQRGSVSIIVAVSLIALLGILGLAVDSGLGYVIKARLDAATDGAVIAAGEAVTRGSN-- 66 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 N + N + V + + IV +I Sbjct: 67 ----------QTEQTNNAQQAATAFFTANYPAGFLGSSVAAGTPSIVFDA--GTVTIGMT 114 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 ++ +P+ F + + + + TS + LDM V+D + S+ + Sbjct: 115 AQASVPVSFSKVL----GFKVLNVSSTSQAIRKT-----LDMAFVIDTTGSLNTSGVP-- 163 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWG--VSHLQRKIKY 249 + A L + N+ V + ++ F G + + I Sbjct: 164 ---AAVRSNAVAFLNNFDVT---NDRVALMHFAYGTVVDVPFSGNARGFDRTTMTADINK 217 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + F STNS + A NQ+ + + ++IVF +DG S Sbjct: 218 YT-FNGSTNSAEAIWNARNQLNTV--------ISQPSSLRVIVFFSDGAPNSFSS 263 >gi|313226593|emb|CBY21739.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 67/212 (31%), Gaps = 29/212 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + LD++ V+D S S+ + + + + Sbjct: 171 ITQDQCRTNALDIVFVVDESGSIG------TPNFQLIKDFLEHF---ASDSTIAADATRI 221 Query: 222 GLVTFSNKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + FS +F L ++ +IK + +TN+ L A + D R Sbjct: 222 AIRPFSTSNYLYFSLNDFKTKNIINEIKNMPYNSGNTNTADALDAA---LTDYGTDR--- 275 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 K++V +TDG + S + + K R +AIG+ E Sbjct: 276 ----PESVKVMVTITDGASNSF---LSTSAAADRVKNDLRNIQSFAIGVSGANMAELEAI 328 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +++ F I +++ K Sbjct: 329 AITDKHVFMLNGWAD----FEPIKSNLLQKVC 356 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-TKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 Query: 322 VYAIGIRVIRSHE 334 + G+ +E Sbjct: 347 LSTFGVGNSNYNE 359 >gi|242072180|ref|XP_002446026.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] gi|241937209|gb|EES10354.1| hypothetical protein SORBIDRAFT_06g000656 [Sorghum bicolor] Length = 434 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 79/207 (38%), Gaps = 29/207 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEV-----KLIPDVNNV 218 A +D++ VLDVS SM + + T ++++A +++ ++ + + Sbjct: 8 ERAPVDVVAVLDVSGSMAWDYGNGTTVENHRLELAKEAMAKAIQSLGPAAAAVAAGGARR 67 Query: 219 VQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V FSN +++ L G ++ + L G + P + Sbjct: 68 NRLAVVPFSNVVKQVTPLTEMDMEGQQTVKNAVDALKPGGQADYLMP--------LKIAA 119 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--- 331 + E+ + II+F++DG++ ++ + K ++A G+ V Sbjct: 120 KILDERKAEEKDRLAIIIFVSDGQDHYFRDTDDMKETLTQHKLIKYPIHAFGVSVSEQDS 179 Query: 332 ---SHEFLRAC--ASPNSFYLVENPHS 353 + LRA A+ S+ + Sbjct: 180 SGGGAKALRAMADATSGSYTSITQDDD 206 >gi|313239130|emb|CBY14106.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD I+LD S S+ + + K+ + +L + L V+ GL+T++ + Sbjct: 321 LDFFIILDQSSSIG---NENFQKM---KNFVINLLMQSNLGQHG---VRVGLITYNRRPT 371 Query: 232 EFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + +H Q + + G TN+ +K+ F + + Sbjct: 372 LRFHMNEMENHQQAINAVDSIVYEGRGTNTGAAIKWVVENAFRPE------FGDRPEVPN 425 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFY 346 ++ +TDG L + + + VYA+GI + L AS Sbjct: 426 KVLLITDGRARDPP----VLKVQSGRLQEQSTVYALGIGKQIDYVELNRIASDPSERHVL 481 Query: 347 LVENPHSMYDAFSH 360 V+N + AF Sbjct: 482 YVDNFSFLERAFQK 495 >gi|313214907|emb|CBY41128.1| unnamed protein product [Oikopleura dioica] Length = 509 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD I+LD S S+ + + K+ + +L + L V+ GL+T++ + Sbjct: 321 LDFFIILDQSSSIG---NENFQKM---KNFVINLLMQSNLGQHG---VRVGLITYNRRPT 371 Query: 232 EFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + +H Q + + G TN+ +K+ F + + Sbjct: 372 LRFHMNEMENHQQAINAVDSIVYEGRGTNTGAAIKWVVENAFRPE------FGDRPEVPN 425 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFY 346 ++ +TDG L + + + VYA+GI + L AS Sbjct: 426 KVLLITDGRARDPP----VLKVQSGRLQEQSTVYALGIGKQIDYVELNRIASDPSERHVL 481 Query: 347 LVENPHSMYDAFSH 360 V+N + AF Sbjct: 482 YVDNFSFLERAFQK 495 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGIVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DAVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIR---QSEKDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 137 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGIVQYGE 185 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 186 DAVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 238 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 239 --AKKVMIVITDGESHDSPDLEKVIR---QSEKDNVTRYAVAVLGYYNRRGINPETFLNE 293 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 294 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 327 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGIVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DAVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIR---QSEKDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|75812635|ref|YP_320253.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705391|gb|ABA25064.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 592 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 66/205 (32%), Gaps = 28/205 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S +++++D S SM K+ + +E L D + +V+ Sbjct: 44 SSQQTPQAIVLLIDASSSMSD------GKLTEVKTAATKFVERRNLTQD-----KLAVVS 92 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F I+ L L+ I LS+ G T GL A ++ + Sbjct: 93 FGLDIQTATPLTDNADTLESAIASLSEAG-GTPMAQGLDAAIGEL------------QAT 139 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + I+ TDG + A+ + + A+ ++ + A P+ Sbjct: 140 FLSRNILLFTDG---VPDSQALASLSAQSARSQRINLIAVATGDADTNYLAQLTADPSLV 196 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 + N AF + I + + Sbjct: 197 FYA-NSGQFDQAFRNAEAAIYKQLV 220 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 74/221 (33%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 IVLD S SM + S + ++ ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGAGNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 230 -IEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + +++ ++ K TN+ L+ Y+ + Sbjct: 321 WVKVSEPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDIPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSLC 478 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 59/158 (37%), Gaps = 16/158 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A K + + + ++ + +G + Sbjct: 79 IDIMLAMDVSTSMLAE-DLKPNRMEAAKKVAAEFISD-RANDNIGLTIFAGEAFTQCPMT 136 Query: 232 EF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ I T GL A +++ E K+ Sbjct: 137 TDHASLLNLLQGVRTDIASRGLIADGTAVGMGLANAVSRL-----------KESKAKSKV 185 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I+ +TDG N D L AK G VY IG+ Sbjct: 186 IILLTDGSNNM--GDISPLTAAQIAKSLGIRVYTIGVG 221 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 70/216 (32%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +DVS SM + D ++++ + E + + GL+ F + Sbjct: 92 DLMLAIDVSGSMAQEDYELDGRPVSRLAVVRTVASAFVERRAGD------RLGLILFGTR 145 Query: 230 IEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + + + + G T + A ++ E Sbjct: 146 AYLQTPLTFDGATVAAMLRDSVVGLAGRETAIGDAIGLAVKRL-----------REQPEG 194 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHEFLRACA 340 +++++ +TDG+N + D A + G VY IGI LR + Sbjct: 195 QRVLILLTDGDNTAGALDPLEAAE--LAAQAGVRVYTIGIGGGELGVRSLFGMRLLRQAS 252 Query: 341 -------------SPNSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + + Sbjct: 253 DFDPATLERIAEITGGRAFTADSRQQLEAVYDELDR 288 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGIVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DAVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIR---QSEKDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 101/287 (35%), Gaps = 49/287 (17%) Query: 109 DIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPW--YTNSRHIVMPITSSVKVN 165 ++ STS+ ++ + N ++ W +++ + S + + Sbjct: 96 NLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPD 155 Query: 166 SQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 Q A D+++V D S S+ + D + ++ + + P Sbjct: 156 FQLSASFSPAAQPCPSLIDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDVGPT 207 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVS-TNSTPGLKYAYNQIF 271 Q GL+ ++N F L + + + S++G TN+ ++YA + Sbjct: 208 K---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAY 264 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---- 327 + + K++V +TDGE+ + + CN + + I + Sbjct: 265 SA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYL 315 Query: 328 --RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + + ++A AS F+ V + ++ + +G+ I Sbjct: 316 NRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQIF 362 >gi|311271483|ref|XP_003133150.1| PREDICTED: anthrax toxin receptor-like [Sus scrofa] Length = 728 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/245 (14%), Positives = 81/245 (33%), Gaps = 57/245 (23%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+ +LD+S S+ + + + + + VK + ++ +T+S Sbjct: 82 QSTYDLYFILDMSGSINNNW----------MDIYALVEDLVKKFDNPK--IRMSFITYST 129 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFG--------------------------VSTNSTPG 262 + L + +++ G +T S+ Sbjct: 130 QGHILMKLTSDKDLSEFEVEENESVGCYHSGGRKRGRVGCGPLVEELPIREKKTTASSKA 189 Query: 263 ---------LKYAYNQIFD------MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 A+ + D + N+ + +I+ +TDG S Q+ Sbjct: 190 GIQNLKQESFGEAFEGLPDPLYRVEANEQIEQANSGENKVPSMIIALTDGTLESISL-QE 248 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + A+K GA VY IG++ + + S + + V+ + A +I ++T Sbjct: 249 TKQQADRARKLGANVYCIGVKDYETEQLSEIADSSDHVFGVDQG---FKALKNIIDPLMT 305 Query: 368 KRIWY 372 K Sbjct: 306 KTCIE 310 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 58/315 (18%), Positives = 109/315 (34%), Gaps = 50/315 (15%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 + + N +R D I + N + + N +N + + + + Sbjct: 201 QGGSCVTNGKCVIDRVTGLYKDNCVFIPDK-NQREKASIMFNQNINSVVEFCTEKNHNKE 259 Query: 120 V-VPQNEG---YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 QN+ S + + K T + + + QT R+ + Sbjct: 260 APNAQNQRCNLRSTWEVIQESEDFKQTTPMTAQPPAPTFSLL---------QTRQRI-VC 309 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +VLD S SM+S D+ + +++ A + L+ V G+VTF + Sbjct: 310 LVLDKSGSMQS--DNRLNRMNQA--------SRLFLLQTVEQGSWVGMVTFDSTAYVQSE 359 Query: 236 LEWGVSHLQRK--IKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L S R IK L T+ GL+ A+ I + + + IV Sbjct: 360 LTQLNSGADRDLLIKRLPTVASGGTSICSGLQAAFTSI----KKKYPTDGAE------IV 409 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE 349 +TDGE+ ++ C + K GAI++ + + + E L Y + Sbjct: 410 LLTDGEDN-------TISSCFDLVKNSGAIIHTVALGPSAAKELEQLSKMTGGLQTYSSD 462 Query: 350 NPHS--MYDAFSHIG 362 + + DAF+ + Sbjct: 463 QIQNNGLVDAFAALS 477 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 65/170 (38%), Gaps = 26/170 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + ++ MI LD+S SM D ++ A + + + + ++ N+ V GLV Sbjct: 85 PKEEKGIEAMICLDISNSMLCE-DVKPNRLSFAKQVLGKLFDGLQ-----NDKV--GLVV 136 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F+ + +S ++ + +S T ++ A D + + Sbjct: 137 FAGNAYTQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASKSFSDNKEIG----- 191 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ +TDGEN + ++ +A + G V IG+ Sbjct: 192 ------KTIIVLTDGENH----EGNAIEAAQQAHEAGIRVNVIGLGTALG 231 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/199 (17%), Positives = 69/199 (34%), Gaps = 34/199 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 D+++V+D S SME K+ M I L + + GL+TF ++ Sbjct: 371 DILLVVDTSGSMEG------DKMTMVKAGIETFLMRILPED------RLGLITFDSQARL 418 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L LQ ++ + G T L A + + + + + Sbjct: 419 VVPMAPLSENRIDLQIAVQEMRASGR-TALFDALDLARQTLEALP-------PAEDDRIR 470 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHEFLRACAS-PNS 344 IV ++DG D S E + + G ++ + + L A A + Sbjct: 471 AIVLLSDG------ADNASRLTLEEVRRQFDESGITIFPVAYGSDADRQVLDAIAEFSRT 524 Query: 345 FYLVENPHSMYDAFSHIGK 363 +V + + F ++ + Sbjct: 525 IVVVGDSGDIAQIFENLSR 543 >gi|297669807|ref|XP_002813080.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 3 [Pongo abelii] Length = 2575 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTRVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 472 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 526 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 527 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|297669805|ref|XP_002813079.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 2 [Pongo abelii] Length = 2976 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1314 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1368 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1369 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1428 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1429 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1479 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1480 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTRVGLEHLRVNHFVPEAGSR-----LD 1534 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1535 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1589 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1590 AFRVGNVQELSE 1601 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 47/322 (14%), Positives = 112/322 (34%), Gaps = 29/322 (9%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +D S ++ + + + NN G + + ++ RS Sbjct: 312 LDGSALYTGSAL--DFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEISQSAQELKRS 369 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKF----CTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + + + + + + L F P + P+ + Sbjct: 370 SIMAFAIGNKGADQAELKEIAFDSSLVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHA 429 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN- 228 + D++ +LD S ++ T + ++ + + N+ ++ GLV FS+ Sbjct: 430 NKRDIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDV---GNDNIRVGLVQFSDT 480 Query: 229 KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDA 285 + EF L + S + ++ L + G N+ L Y + N + G R H Sbjct: 481 PVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIH-----E 535 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + ++++ +T G+ + L N + G + + +G E + +P+ Sbjct: 536 HVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLV 590 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 YL+++ S+ + + + T Sbjct: 591 YLMDDFSSLPALPQQLIQPLTT 612 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1165 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1224 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1225 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1275 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1276 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1333 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1334 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1386 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1387 VFTVREFRELPNIEERIMNSF 1407 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 873 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 927 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 928 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|297669803|ref|XP_002813078.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 1 [Pongo abelii] Length = 3182 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1520 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1574 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 1575 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1634 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1635 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1685 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1686 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTRVGLEHLRVNHFVPEAGSR-----LD 1740 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1741 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1795 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1796 AFRVGNVQELSE 1807 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 47/322 (14%), Positives = 112/322 (34%), Gaps = 29/322 (9%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +D S ++ + + + NN G + + ++ RS Sbjct: 518 LDGSALYTGSAL--DFVRNNLFTSSAGYRAAEGIPKLLVLITGGKSLDEISQSAQELKRS 575 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKF----CTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + + + + + + L F P + P+ + Sbjct: 576 SIMAFAIGNKGADQAELKEIAFDSSLVFIPAEFRAAPLQGMLPGFLAPLRTLSGTPEVHA 635 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN- 228 + D++ +LD S ++ T + ++ + + N+ ++ GLV FS+ Sbjct: 636 NKRDIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDV---GNDNIRVGLVQFSDT 686 Query: 229 KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDA 285 + EF L + S + ++ L + G N+ L Y + N + G R H Sbjct: 687 PVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVHANHFTEAGGSRIH-----E 741 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + ++++ +T G+ + L N + G + + +G E + +P+ Sbjct: 742 HVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLV 796 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 YL+++ S+ + + + T Sbjct: 797 YLMDDFSSLPALPQQLIQPLTT 818 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + A D++ ++D S ++ + + + +++ + + + Sbjct: 29 QADVKNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFA 79 Query: 223 LVTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQH 279 LV F+ FLL + + I +S G + + GL+Y + G R Sbjct: 80 LVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRA- 138 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++IV +TDG E K V+AIG+ Sbjct: 139 ----GDGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIA 189 Query: 340 ASP--NSFYLVENPHSMYDAFSH 360 + P + +EN S++D + Sbjct: 190 SEPLNMHMFNLENFTSLHDIVGN 212 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 88/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PIGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ + Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQAASRAELQHIA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 1371 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 1430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 1431 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 1481 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 1482 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 1539 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 1540 --VPQHLVLVLGG-----KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRL 1592 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 1593 VFTVREFRELPNIEERIMNSF 1613 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1029 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1078 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + ++ L+ G T N+ L++ N + G R Sbjct: 1079 EFYLNSYMNQQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1134 QLLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 1171 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 63/166 (37%), Gaps = 18/166 (10%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +D+M+ +D+S SM D+ +++ A + + ++ V + GLV FS Sbjct: 85 SRKGVDIMVCVDISNSMMVE-DAQPNRLERAKREVADLIR-VATGD------RLGLVAFS 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +Q + L T L++ + M ++ Sbjct: 137 GVAFTQCPLTLDYQAIQMFLDQL------TVDLLPLRFQGTDLGAAIEMGMTAFDPKSST 190 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K+I+ +TDGE+ ++ L +A G ++ +GI Sbjct: 191 DKVILLITDGEDN----EEAGLKAAEKASDEGIRIFVLGIGDPAGG 232 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 72/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + ++ ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPRI 320 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ +W L + K TN+ L+ Y+ + Sbjct: 321 WVKVSEQDSSNADWVTKQLNKINYEDHKLKSGTNTKKALQAVYS-MMSWPDDIPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSLC 478 >gi|109466064|ref|XP_001075558.1| PREDICTED: integrin, alpha 2 [Rattus norvegicus] Length = 1178 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 99/286 (34%), Gaps = 46/286 (16%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + + +++ + + G +++ L +++ I S V + Sbjct: 94 LQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVSPDF 153 Query: 167 QTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 Q+ D+++V D S S+ + + + ++ + + P Sbjct: 154 QSLTSFSPAVQACPSLVDVVVVCDESNSIYPW--------EAVKNFLEKFVQGLDIGP-- 203 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFD 272 Q L+ ++N F L + + ++G TN+ +++A + + Sbjct: 204 -KKTQVALIQYANDPRVVFNLTTYKNKEDMVQATSETRQYGGDLTNTFKAIQFARDIAYL 262 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----- 327 + + K++V +TDGE+ + Q + CN + + I + Sbjct: 263 PE------SGGRPGATKVMVVVTDGESHDGSKLQTVIQQCN---DDEILRFGIAVLGYLN 313 Query: 328 -RVIRSHEFLR----ACASP--NSFYLVENPHSMYDAFSHIGKDIV 366 + + ++ ++P F+ V + ++ + +G+ I Sbjct: 314 RNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIF 359 >gi|207079949|ref|NP_001128930.1| DKFZP469A1324 protein [Pongo abelii] gi|55726315|emb|CAH89929.1| hypothetical protein [Pongo abelii] Length = 740 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 83/238 (34%), Gaps = 29/238 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L+ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNPYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDITEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQ 307 G TN+ L Y + + + + I+ +TDG++ + + + Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIR 393 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 394 EILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 451 >gi|32475534|ref|NP_868528.1| BatB [Rhodopirellula baltica SH 1] gi|32446076|emb|CAD75905.1| BatB [Rhodopirellula baltica SH 1] Length = 747 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 65/163 (39%), Gaps = 25/163 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 ++ + VLDVSRSM + D S ++ A + I M++E+ + GLV F+ Sbjct: 124 QRGIEAVFVLDVSRSMLAE-DVSPNRLGRAKQQIKDMVDEMPGD-------RVGLVVFAG 175 Query: 229 KIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L V ++ + + S + ++ A + D Sbjct: 176 ETRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDKTTDH-------- 227 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K +V +TDGE+ + ++ ++G ++ IG+ Sbjct: 228 ---KAMVILTDGEDQESDPVSEAKRAYE---EQGIRIFTIGLG 264 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 65/180 (36%), Gaps = 17/180 (9%) Query: 182 RSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-- 236 SM SF ++T ++ + A+L EV++ + + G + ++I ++L Sbjct: 1 GSMVSTGSFQGRTMTGFQISKAFVKALLSEVRVASNATRIA-IGTFSSDHRINFNYILNP 59 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR-QHCNTEDANYKKIIVF 293 K + G TN L+ AYN + H +N K+++ Sbjct: 60 SYANTKCKFNDDFKKIKFDGFMTNIKGSLQDAYNVFRSLDSDPVTHSRRPRSN--KVVIL 117 Query: 294 MTDG-----ENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG N K+ G +Y +G+ L+ A+ S +L Sbjct: 118 LTDGVGNMVGNRVDSAGADGAPEALRLKQTGYVELYTVGVTHATDQNMLKKIATDPSLFL 177 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 81/235 (34%), Gaps = 47/235 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T++ NS+ + +D+M+ +DVS SM + D +++ A + P Sbjct: 71 LARPQTTNSWQNSEIEG-IDIMLAMDVSTSMLAE-DLKPNRLEAAKDVATEFING---RP 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIF 271 + N G+ F+ + L + L +K + T GL A ++ Sbjct: 126 NDN----IGITLFAGESFTQCPLTVDHAVLLNLLKDMKCGFIEDGTAIGMGLANAVTRLK 181 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-- 329 D + K+I+ +TDG N K D L AK G VY IG+ Sbjct: 182 DSKAKS-----------KVIILLTDGVNN--KGDISPLTAAEIAKSFGIRVYTIGVGTNG 228 Query: 330 --------------------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 229 MAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNSKLKEVYEEIDK 283 >gi|301772358|ref|XP_002921601.1| PREDICTED: integrin alpha-2-like [Ailuropoda melanoleuca] Length = 1369 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 76/210 (36%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + + ++ + + P Q GL+ ++NK Sbjct: 361 IDVVVVCDESNSIYPW--------EAVKNFLEKFVQSLDIGP---KKTQVGLIQYANKPR 409 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ + G TN+ ++YA Sbjct: 410 VIFNLNTFKTKAEMIEATSQTYQYGGDLTNTFKAIQYA------KDFAYAAGAGGRLGAA 463 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + C+ + + I + + + ++A Sbjct: 464 KVMVVVTDGESHDGSMLKAVIDQCD---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 520 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 521 ASIPTETFFFNVSDEAALLEKAGTLGEQIF 550 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 100/287 (34%), Gaps = 49/287 (17%) Query: 109 DIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPW--YTNSRHIVMPITSSVKVN 165 ++ STS+ ++ + N ++ W +++ + S + + Sbjct: 96 NLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPD 155 Query: 166 SQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 Q A D+++V D S S+ + D + ++ + + P Sbjct: 156 FQLSASFSPAAQPCPSLIDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPT 207 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVS-TNSTPGLKYAYNQIF 271 Q GL+ ++N F L + + + S+ G TN+ ++YA + Sbjct: 208 K---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQHGGDLTNTFGAIQYARKYAY 264 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---- 327 + + K++V +TDGE+ + + CN + + I + Sbjct: 265 SA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYL 315 Query: 328 --RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + + ++A AS F+ V + ++ + +G+ I Sbjct: 316 NRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQIF 362 >gi|301065602|ref|YP_003787625.1| hypothetical protein LCAZH_0484 [Lactobacillus casei str. Zhang] gi|300438009|gb|ADK17775.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei str. Zhang] Length = 909 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 48/289 (16%), Positives = 97/289 (33%), Gaps = 25/289 (8%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA L ++I + M T L + + AA N Sbjct: 1 MTKMTAKVAITGHLFAVLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES + + Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESKNNGGTDRAG 163 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGV 255 + L ++ + + V GL+ FS+ G +++ + Sbjct: 164 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGKAGNISQ 222 Query: 256 STNSTPGLKYAYN-QIFDMQGMRQHCNTEDANY---KKIIVFMTDGENL 300 L +N + G+RQ + + KK+++ +TDG Sbjct: 223 QQAINSALSPTFNGGTYTQIGLRQGSAMLNTDTSGNKKMMILLTDGVPT 271 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 58/309 (18%), Positives = 113/309 (36%), Gaps = 45/309 (14%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELR-----DNGFVNDIDDIVRSTSLDIVVVPQNE 125 N + ++ R+ +T++ + + L + G ND + I + ++ Sbjct: 138 LNWPQDEGHMAVVRRLSHTYSGNLQEHLPFFRTLEAGIFNDDEHIDLQSDMNDEHNAITG 197 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 I A S + + T + H+ P +S ++ + A LD++ VLDVS SM Sbjct: 198 SVKIKAYSEFPAIEQSVT-KEIFAILIHLRAPKSSH---SASSRAPLDLVTVLDVSGSMA 253 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVS 241 TKI + +++ +++ + + ++ FS+ F L G Sbjct: 254 G------TKIALLKNAMSFVIQTLGPND------RLSVIAFSSTARRLFPLRRMTLAGRQ 301 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + + L G TN GLK I R+ N + I+ ++DG++ Sbjct: 302 QALQAVSSLVASG-GTNIADGLKKGAKVI----EDRRLKNPVCS-----IILLSDGQDTY 351 Query: 302 TKE------DQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENP 351 T D +L + G ++ G + A A S +F ++ Sbjct: 352 TLPSDRNLLDYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEISSGTFSFIDAE 411 Query: 352 HSMYDAFSH 360 S+ D F+ Sbjct: 412 GSIQDGFAQ 420 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 72/233 (30%), Gaps = 49/233 (21%) Query: 167 QTDARLDMMIVLDVSRSME---------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 Q + D+MI +D+S SM + ++ K D + + + VN Sbjct: 89 QEKSARDLMIAVDLSGSMAVEDFTLPIATNELTNRAKNDTDSSATKSSTNDTGKGEKVNR 148 Query: 218 VV----------------QSGLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNS 259 +V + GL+ F + ++ Q + + G ST Sbjct: 149 LVAVKHVLNAFVKSREHDRLGLILFGDAPYLQAPFTDDIATWQALLNESDIGMAGQSTAF 208 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + A + + ++++ +TDG T + A R Sbjct: 209 GDAIGLAIS-----------VFQQSDTQNRVLIVLTDG--NDTASKVPPVEAAKVAAARD 255 Query: 320 AIVYAIGIRVIRS-------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +Y I I + E L+A A + + N + ++ I + Sbjct: 256 IKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQGKSFQALNSEELLKVYAEIDR 308 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 52/336 (15%), Positives = 116/336 (34%), Gaps = 51/336 (15%) Query: 44 FFMKTVLHSMIDRSLV---------HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF 94 + K ++ + + + I N +R D + + + + Sbjct: 176 YLSKGKPQAVRCSAAITGKNQVRRCQGGSCITNGKCVIDRVTGLYKDNCVFVPDP-HQNE 234 Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 + + N +N + + + + P ++ + S +++ + F + Sbjct: 235 KASIMFNQNINSVVEFCTEKNHN-QEAPNDQNQRCNLRSTWEVIQESEDFKQTTPMTAQP 293 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P S +++ + + +VLD S SM + +++ ++ L + Sbjct: 294 PAPTFSLLQI-----GQRIVCLVLDKSGSM-----LNDDRLNRMNQASRLFL-----LQT 338 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSK-FGVSTNSTPGLKYAYNQIF 271 V G+VTF + L+ S R IK+L T+ GL+ A+ I Sbjct: 339 VEQGSWVGMVTFDSAAYVQSELKQLNSGADRDLLIKHLPTVSAGGTSICSGLRTAFTVI- 397 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI 330 +++ IV +TDGE+ ++ C + K+ GAI++ + + Sbjct: 398 ----KKKYPTDGSE-----IVLLTDGEDN-------TISSCFDLVKQSGAIIHTVALGPA 441 Query: 331 RSHEF--LRACASPNSFYLVENPHS--MYDAFSHIG 362 + E L Y + + DAF+ + Sbjct: 442 AAKELEQLSKMTGGLQTYSSDQVQNNGFVDAFAALS 477 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 79/215 (36%), Gaps = 28/215 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 H+ + + S+ L + +VLDVS SM+ + A + A+++ + Sbjct: 191 GPVHVRVALRSTA-QAPAARPHLSVHLVLDVSGSMQG------EPMARARDAARALVDRL 243 Query: 210 KLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 D + LVTFS+ + + + + ++ I + + G TN GL Sbjct: 244 DANDDFS------LVTFSSGADVRIDDGPVGPRRAAIKATIDGIRE-GGGTNIGQGLALG 296 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAI 325 Y Q R K+++ ++DG N ++ +A + G Sbjct: 297 Y-----AQAARPGIPEGAV---KVVLLLSDGRANAGITSSERLSRLALDAFQGGVQTSTF 348 Query: 326 GIRVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 G+ + A AS + +Y + +P + A Sbjct: 349 GLGADYDGALMSAIASDGAGGYYYLRDPDQIAPAL 383 >gi|301606537|ref|XP_002932887.1| PREDICTED: cochlin-like [Xenopus (Silurana) tropicalis] Length = 789 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 74/210 (35%), Gaps = 37/210 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNK 229 +++ ++D S S+ D+ + + + +V +V++V + G + F+ Sbjct: 604 SVNIGFLIDGSSSVG----------DLNFRIMLDFMADVVRAFEVSDVGTKVGAIQFTYD 653 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L + ++ + T + + +A +F + N Sbjct: 654 QRLEFGLNDHSNKDDVLNALRSIRYMSGGTATGDAINFAVRNLFQPTKDGHNKN------ 707 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 ++ +TDG+ + A G ++++G+ + L+ AS Sbjct: 708 --FLIIVTDGQ-----SYDDVRGPASSAHISGVTIFSVGVAWAPLED-LKDMAS-----E 754 Query: 348 VENPHSMYDA----FSHIGKDIVTKRIWYD 373 +N H+ + I D++ + I D Sbjct: 755 PKNSHTFFTREFPGLEQITPDLI-RGICRD 783 >gi|45382993|ref|NP_990865.1| collagen alpha-3(VI) chain precursor [Gallus gallus] gi|1345652|sp|P15989|CO6A3_CHICK RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|211622|gb|AAA03201.1| alpha-3 collagen type VI [Gallus gallus] Length = 3137 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 43/281 (15%), Positives = 95/281 (33%), Gaps = 27/281 (9%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 E++ N V + ++ + + L+F I S Sbjct: 565 EQAAAEMKRNRIVILAVGSRNADVAELQEIAHERDFVFQ---PNDFRLQFMQAILPEVLS 621 Query: 152 --RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 R + + + Q R D++ +LD S + + ++ + Sbjct: 622 PIRTLSGGMVIHETPSVQVTKR-DIIFLLDGS------LNVGNANFPFVRDFVVTLVNYL 674 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGV-SHLQRKIKYLSKFGVST-NSTPGLKYA 266 + + ++ GLV FS+ + F L + S + +++ L G S N+ L Sbjct: 675 DV---GTDKIRVGLVQFSDTPKTEFSLYSYQTKSDIIQRLGQLRPKGGSVLNTGSAL--- 728 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 N + + + +++V +T G + L N+ + G + +A+G Sbjct: 729 -NFVLSNHFTEAGGSRINEQVPQVLVLVTAGRSAVP-----FLQVSNDLARAGVLTFAVG 782 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +R E + +P Y +++ + + K I T Sbjct: 783 VRNADKAELEQIAFNPKMVYFMDDFSDLTTLPQELKKPITT 823 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 99/266 (37%), Gaps = 30/266 (11%) Query: 104 VNDIDDIVRSTSLDIV-----VVPQNEGYSISAISRYKIPLKFCTFIPW-----YTNSRH 153 VN +++ STS+ + V +N+ I+ + ++ T +P Sbjct: 1561 VNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERKVQNILEE 1620 Query: 154 IVMPIT-SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +P T V + + D++ +LD S ++ D + + + V I Sbjct: 1621 LTVPTTEGPVYPGPEGKKQADIVFLLDGSINLGR---------DNFQEVLQFVYSIVDAI 1671 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + +Q GL +++ + + F L+ S + I + G +T A + Sbjct: 1672 YEDGDSIQVGLAQYNSDVTDEFFLKDYSSKPEILDAINKVIYKGGRVANTGA---AIKHL 1728 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +++ + D +I +T G++ S+ + G V+A+G+R I Sbjct: 1729 QAKHFVKEAGSRIDQRVPQIAFIITGGKSSDD-GQGASMEVAQK----GVKVFAVGVRNI 1783 Query: 331 RSHEFLRACASPNSFYLVENPHSMYD 356 E + + + + V + + Sbjct: 1784 DLEEVSKLASESATSFRVSTAQELSE 1809 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 42/245 (17%), Positives = 90/245 (36%), Gaps = 29/245 (11%) Query: 118 IVVVPQNEGYSISAISRYKIP-LKFCTFIPWYTNS----RHIVMPITSSVKVNSQTDARL 172 +V + + Y S ++P L+ P T + R ++ +T+ V+ + Sbjct: 1384 MVQISLSPDYVFQVSSFQELPSLEQKLLAPIETLTADQIRQLLGDVTTIPDVSGEEK--- 1440 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ S + I+ +++++ + P+ V+ G+V FSN + Sbjct: 1441 DVVFLIDSSDSVRSDG------LAHIRDFISRIVQQLDVGPNK---VRIGVVQFSNNVFP 1491 Query: 233 FFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L S + + I+ L + G N+ L Y F ED + Sbjct: 1492 EFYLRTHKSKNAVLQAIRRLRLRGGYPVNAGKALDYVVKNYFIKS---AGSRIEDGVPQH 1548 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++V + D + N +G R + ++ P +V+ Sbjct: 1549 LVVILGD------QSQDDVNRPANVISSTSIQPLGVGARNVDRNQLQVITNDPGRVLVVQ 1602 Query: 350 NPHSM 354 + + Sbjct: 1603 DFTGL 1607 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 73/191 (38%), Gaps = 23/191 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S ++ S + ++E +++ + G+V +S Sbjct: 236 AQESADLIFLIDGSDNIGS------VNFQAIRDFLVNLIESLRV---GAQQIHIGVVQYS 286 Query: 228 NKIEEFFLL-EWGVS-HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ F L + + +K LS + G N+ L+Y +F G + + Sbjct: 287 DQPRTEFALNSYSTKADVLDAVKALSFRGGKEANTGAALEYVVENLFTQAGGSRI----E 342 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I+V ++ GE+ K+ ++IG+ S E L+ A+ S Sbjct: 343 EAVPQILVLISGGESSD-----DIREGLLAVKQASIFSFSIGVLNADSAE-LQQIATDGS 396 Query: 345 F-YLVENPHSM 354 F + + ++ Sbjct: 397 FAFTALDIRNL 407 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 66/187 (35%), Gaps = 22/187 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + + + + VK + N + LV FS Sbjct: 38 DIIFLVDSSWSIGKEHFQLVREF---------LYDVVKALDVGGNDFRFALVQFSGNPHT 88 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L S + I + G + + GL+Y + + + + ++ Sbjct: 89 EFQLNTYPSNQDVLSHIANMPYMGGGSKTGKGLEY----LIENHLTKAAGSRASEGVPQV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLV 348 I+ +TDG+ + K + A+G++ E + P + + Sbjct: 145 IIVLTDGQ-----SQDDVALPSSVLKSAHVNMIAVGVQDAVEGELKEIASRPFDTHLFNL 199 Query: 349 ENPHSMY 355 EN +++ Sbjct: 200 ENFTALH 206 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 72/205 (35%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + F + + ++E + + + V+ +V +SN I+ Sbjct: 1035 DVVFLIDGSDGVRRGFP-------LLKTFVERVVESLDI---GRDKVRVAIVQYSNAIQP 1084 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 FLL+ + L I+ L+ G S N+ L Y +F + + + Sbjct: 1085 EFLLDAYEDKADLVSAIQALTIMGGSPLNTGAALDYLIKNVFTVSSGSRIAEG----VPQ 1140 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++ +T + + K G + + IGI E P+ V Sbjct: 1141 FLILLT-----ADRSQDDVRRPSVVLKTSGTVPFGIGIGNADLTELQTISFLPDFAISVP 1195 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + + + + + + I K Sbjct: 1196 DFSQL----DSVQQAVSNRVIRLTK 1216 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 71/196 (36%), Gaps = 27/196 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++V+D S SM S D ++ +A LI ++ + G+V F++ + Sbjct: 612 ILVIDDSGSMVSN-DRYNQRLTVAQN----------LIDNLPENSKVGVVKFTSSTTKLT 660 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + YL+ ++ + A N F M D N K+++ + Sbjct: 661 TSLTSDKETAK--SYLTTSYFRSSGGTSMYTAINSSFSMFEAT------DDNILKMMIVL 712 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-----SPNSFYLVE 349 +DG T + N G +Y +G+ S F + + +FYL Sbjct: 713 SDGATSYTYLHSSVVTTAN---NNGVKIYTVGLGSSSSSYFTQYLKPLANNTGGAFYLAS 769 Query: 350 NPHSMYDAFSHIGKDI 365 + + D + I K I Sbjct: 770 DASQLEDIYKDINKKI 785 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 77/238 (32%), Gaps = 53/238 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S SM + D +++ A ++ +LE V G+V + Sbjct: 98 VLLVMDLSGSMRAQ-DVKPSRLRAAQQAATTLLE----AQPAG--VSVGVVAMAGTAAVA 150 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANYKK--- 289 + I+ L G T G+ A + ++ + +D + Sbjct: 151 QAPTRAREAVATAIERLQPQG-GTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPR 209 Query: 290 --------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 IV +DGE+ + ++ A + G +Y +G+ Sbjct: 210 ALANPPADTEPVKPGSYTSGAIVLFSDGESNAGPAALRAAQL---AAEHGVRIYTVGVGT 266 Query: 330 IRS---------------HEFLR--ACASPNSFYLVENPHSMYDAFSHI-GKDIVTKR 369 + L+ A A+ ++ +E+ + + + + KR Sbjct: 267 PEGVVLSVDGWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRALNARLAFDKR 324 >gi|327310820|ref|YP_004337717.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] gi|326947299|gb|AEA12405.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] Length = 509 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 72/199 (36%), Gaps = 29/199 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I++D S+SM F K+ AI++ ++E + V V F K++ Sbjct: 42 FAILIDKSKSMADF-----DKLAHAIEAAKGLVESMAPEDIVAVYV------FDEKVKAL 90 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE---------D 284 V ++ + + K T + Y + + + Sbjct: 91 VPPT-PVEKARKMLGKIEKIKPGTYTLL-----YQALLQVIDDLRGIKRGLPLMPRRAVP 144 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACA--S 341 N K IV +TDGE +++ + +A R G + AIGI + + L A A S Sbjct: 145 ENIPKRIVVITDGEPWPYYTEERWYEHLGKAAARYGITISAIGIGDDYNEKILYALANSS 204 Query: 342 PNSFYLVENPHSMYDAFSH 360 ++Y + + ++ Sbjct: 205 GGAWYHISQIRDISQVLAN 223 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 69/216 (31%), Gaps = 39/216 (18%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + + ++ + +G + Sbjct: 88 IDIMLAMDVSTSMLAE-DLKPNRLEAAKNVAAEFI-SGRPNDNIGLTIFAGEAFTQCPMT 145 Query: 232 EF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + +++ I T GL A +++ + K+ Sbjct: 146 TDHTSLLNLLRNVRTDIAARGLISDGTAVGMGLANAVSRL-----------KDSKTKSKV 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------------- 329 ++ +TDG N D + AK VY IG+ Sbjct: 195 VILLTDGSNNM--GDISPMTSAQIAKSLDIRVYTIGVGTNKVAPYPMSVGGGTQYINIPV 252 Query: 330 -IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I S A + +FY N + + I K Sbjct: 253 EIDSKTLSDIAAVTEGNFYRATNNQQLKQIYKDIDK 288 >gi|254787962|ref|YP_003075391.1| PKD domain-containing protein [Teredinibacter turnerae T7901] gi|237687416|gb|ACR14680.1| PKD domain protein [Teredinibacter turnerae T7901] Length = 1083 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 81/241 (33%), Gaps = 58/241 (24%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V +T + LD ++++D S SM + D +++ A +N + D + Sbjct: 403 VDEAPRTYSELDSLLIIDSSGSMRT-TDPQSRRLEAANTYVN-----ISAAED-----KI 451 Query: 222 GLVTF--SNKIEEFFLL--EWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM- 273 G++ F S ++ + F L + G + L+ I + G T L A + + Sbjct: 452 GIIDFDGSARVVKPFTLLGQQGSTERIQLENAINGIDAVG-GTEIAGSLGLACSTLMASW 510 Query: 274 ----------------------QGMRQHCNTEDA---------NYKKIIVFMTDGENLST 302 H N A N KKI + +TDG+ S+ Sbjct: 511 QDDIIEQQHMVNDIKALVEEFPNNTISHSNARRAAIPGIAHAINSKKIAILLTDGDTPSS 570 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH--SMYDAFSH 360 + + G ++Y IG + + +Y+ + + A+ Sbjct: 571 YSQANQCFI-----ENGWMLYTIGFGGANGEKLAPLAEASGGYYIAADSSLLDLNCAYQQ 625 Query: 361 I 361 I Sbjct: 626 I 626 >gi|116625273|ref|YP_827429.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228435|gb|ABJ87144.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 80/223 (35%), Gaps = 39/223 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 SVKV + D + M +++D S SM K++ A ++ V+ + V Sbjct: 53 PQSVKVFRREDVPVSMGLIIDNSGSMRDKRG----KVEAAALAL------VRDSNPDDEV 102 Query: 219 VQSGLVTFSNKIEEFFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +V F+++ + ++ + + G + ++ A D Sbjct: 103 F---IVNFNDEAFLDNPHGKDFTTNIKEMEEALTRIDSRGGT-----AMRDALRMSIDHV 154 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------- 327 + H + KK++V +TDG +++ ++Y +G+ Sbjct: 155 KEKAHKD------KKVLVVVTDG--NDNSSVVSLENLVKASQQSEVLIYGVGLLGDEERR 206 Query: 328 RVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 R+ L+A A+ + ++ + + + +DI + Sbjct: 207 EAQRAQRALKALAEATGGEVFFPKDVNEVDKVAHQVARDIRNQ 249 >gi|149059383|gb|EDM10390.1| integrin, alpha 2 [Rattus norvegicus] Length = 1140 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 99/286 (34%), Gaps = 46/286 (16%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + + +++ + + G +++ L +++ I S V + Sbjct: 94 LQNSASISNVTEIKTNMSLGLTLTRNPGTGGFLTCGPLWAHQCGNQYYATGICSDVSPDF 153 Query: 167 QTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 Q+ D+++V D S S+ + + + ++ + + P Sbjct: 154 QSLTSFSPAVQACPSLVDVVVVCDESNSIYPW--------EAVKNFLEKFVQGLDIGP-- 203 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFD 272 Q L+ ++N F L + + ++G TN+ +++A + + Sbjct: 204 -KKTQVALIQYANDPRVVFNLTTYKNKEDMVQATSETRQYGGDLTNTFKAIQFARDIAYL 262 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----- 327 + + K++V +TDGE+ + Q + CN + + I + Sbjct: 263 PE------SGGRPGATKVMVVVTDGESHDGSKLQTVIQQCN---DDEILRFGIAVLGYLN 313 Query: 328 -RVIRSHEFLR----ACASP--NSFYLVENPHSMYDAFSHIGKDIV 366 + + ++ ++P F+ V + ++ + +G+ I Sbjct: 314 RNALDTKNLIKEIKAIASTPTERYFFNVADEAALLEKAGTLGEHIF 359 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 71/163 (43%), Gaps = 26/163 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSS--ITKIDMAIKSINAMLEEV 209 +++ + + V + A +D++ VLDVS SM E F +++D+ ++ +++ V Sbjct: 86 VLVTVEAPKVVAPEKRAPIDLVAVLDVSGSMNKEEFVRGKHMSSRLDLLKIAMKYIIKLV 145 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKY 265 + + +V+F++ + + L + ++K + L G +T+ P LK Sbjct: 146 RDAD------RLAIVSFNHAVVSEYGLTRNSADSRKKLENLVDKLKASG-NTDFRPALKK 198 Query: 266 AYNQIF--DMQGMRQHCN------TEDANYKK---IIVFMTDG 297 A + +++ + N KK I+ ++DG Sbjct: 199 AVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLLSDG 241 >gi|221130232|ref|XP_002156394.1| PREDICTED: similar to tyrosine kinase receptor [Hydra magnipapillata] Length = 1746 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 41/228 (17%), Positives = 78/228 (34%), Gaps = 32/228 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ST L + N+ SI + + H+ P + Sbjct: 1539 RNSTMLTASTLINNKLVSIDQNPVAFDMWNKVVLSQTFNGTVHLEDPSC---------EG 1589 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ +LD S S+ES + + + S +G+VTFS++ Sbjct: 1590 FFDVGFILDSSGSLESNYSQEVDFLKQLASSFG----------ISKQGSHAGVVTFSSEA 1639 Query: 231 EEFFLL-EWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L ++ + + + + G T L+ A +G R Sbjct: 1640 KLSIQLDKYFTDADFNKAVDDIPYMGGGTRIDLALEKAIELFDTKKGSRN-------EAP 1692 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 K++ +TDG +E + + NE K++ ++AIGI + + L Sbjct: 1693 KLLFLLTDG----VQEPKMEIPVPNEIKQKIIQLFAIGIGSNVNKDEL 1736 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/281 (12%), Positives = 86/281 (30%), Gaps = 31/281 (11%) Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV-PQNEGYSISAISRYKIPL 139 + ++ E N + + D ++ + + + S Y Sbjct: 1062 FLPNTKEDLLQEHQAEESAQNIYPFGVIDEGDIKITNLFRELKDGKKPNFWSQSSYHWNH 1121 Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + ++ P + +D+ +LD S S+E + Sbjct: 1122 TAVPYKINSDTNLNLKSPDCVGI---------VDVGFILDSSGSLEKEYPQE-------K 1165 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVST 257 + + + + P + + +VTFS+ L + + + T Sbjct: 1166 EFLINLASTFGINPSGAH---AAVVTFSSDAVLSIKLNDYFEQAPFNKSVYEIEHMNGWT 1222 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAK 316 L+ + ++ G R+ N +++ +TDG + + + + + Sbjct: 1223 RIDLALRKSLEMFEEINGARK-------NVPRLLFLLTDGKQETNEGGAEDPVNVAQLLR 1275 Query: 317 KRGAIVYAIGIRVI-RSHEFLRACASPNSFYLVENPHSMYD 356 RG + A+GI E S + +L EN + Sbjct: 1276 DRGVEIIAVGIGKGVNRLELNNIAGSSDKVFLAENFDELIK 1316 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 63/205 (30%), Gaps = 33/205 (16%) Query: 173 DMMIVLDVSRSM----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++V+D S SM +++I++ + + + L+ + Sbjct: 98 DLVLVVDTSGSMLVRDYRAEGRPVSRIEVLQGVVTRFVR-------AREGDRFALIPMAE 150 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ L G T G+ A Q+ R+ Sbjct: 151 EAATLVPLTGDRELVASQLARLRAGMLGDDTAIGDGIALALRQLQASGAERRPA------ 204 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI------GIRVIRSHEFLRACA 340 ++ +DGE+ + + A+ G +Y + + + L A Sbjct: 205 ----LILFSDGESNA--GLLRPSEALALARAAGVALYTVEITGGQALAPVEGEPSLADMA 258 Query: 341 --SPNSFYLVENPHSMYDAFSHIGK 363 + + V + + I + Sbjct: 259 ETTGGRHFHVTRSADLEAVIATIDR 283 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 65/168 (38%), Gaps = 20/168 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ I +D+S SM++ D ++++ ++ ++ +K + GLV FS+ Sbjct: 106 DIFIAVDLSLSMKA-TDIPPSRLEKIKYELSNIINTLKSD-------RIGLVIFSSSAFM 157 Query: 233 FFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + L + L+ T+ L+ + Q + + Y Sbjct: 158 HCPLTYDKGALNLFTQILNTNLMPIGNAGTDFYAPLELV---LKKYQEANKSNRKQQNEY 214 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K++V +DGE + ++ K+ V+ +G+ ++ + Sbjct: 215 AKVVVLFSDGEEFGDRY----TAIVDQYKQNNIRVFTVGVGSLQGGKI 258 >gi|108760371|ref|YP_628476.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464251|gb|ABF89436.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 422 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 69/203 (33%), Gaps = 28/203 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + + +VLD S SM + K+ A ++ +++ +K + Sbjct: 39 AETGQRVPVSLALVLDRSGSM------NGQKLADARRAATELVQRLKPED------RLAF 86 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + + + + ++ L + STN + L A N + R H Sbjct: 87 IDYGTDV-RVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANAL------RPHM 139 Query: 281 NTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + + ++DG+ + L + ++ G V A+G+ +R Sbjct: 140 REYRVSR---AILLSDGQPTTGIVSEPGLLDQVRQLRRDGITVSALGVGRDYQETLMRGM 196 Query: 340 ASPNSFY--LVENPHSMYDAFSH 360 A + +++ + + FS Sbjct: 197 AEQGGGFSGFIDDSARLAEVFSR 219 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DAVHEFHLNDYKS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLKKVIR---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 29/223 (13%), Positives = 72/223 (32%), Gaps = 47/223 (21%) Query: 168 TDARLDMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D+MI +D S SME+ + ++ + + + D + GL Sbjct: 95 STPGYDLMIAVDASHSMEALDFTVEGRQVNRMAVVKGVMGRFI-------DARQGDRVGL 147 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F ++ L ++ + + G +T + +++ Sbjct: 148 ILFGSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAIALGVSKL----------- 196 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------- 331 E ++++ + DG+N + A+ G +Y IG+ + Sbjct: 197 RERPEGSRVMIVIADGDNNAGSF--APKEAARLARATGTRIYVIGVGSKQPSIPILEEGS 254 Query: 332 ---------SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + ++ + S IG+ Sbjct: 255 VRYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEISSRIGQ 297 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 44/258 (17%), Positives = 88/258 (34%), Gaps = 40/258 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + + + + A+LD++ +LD S SM+ F + +I A +I A+ Sbjct: 362 LRDDGTYDLTLTVAGKKGTETN-KAKLDVIYILDKSGSMKEDFGGTSKRI-AASNAITAL 419 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST----- 260 + +K + N + +VTFS WG + +T++ Sbjct: 420 TKSLK--QNANIDARFSMVTFSGNKTTGM---WGQGDTKTWDDAEVAVSWTTDAGTIERG 474 Query: 261 --PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---------------STK 303 P N ++ ++ ++ A ++F++DG+ Sbjct: 475 SKPTSNGGTNYQAGIRTAKELLTSKRAGAMTAVIFISDGDPTFYYNPDGYTRGDGNNDGN 534 Query: 304 EDQQSLYYCNEAKKRGAI------VYAIGIRVIRSHEFLR-ACASPN----SFYLVENPH 352 +L C +A K Y +G+ + L C++ + N Sbjct: 535 GGADNLKVCLDAAKNEIANLGVNYFYTVGVGKANDYVNLSDLCSASGVSGAKNFDGTNTD 594 Query: 353 SMYDAFSHIGKDIVTKRI 370 + AFS I DI+T Sbjct: 595 ELTKAFSTIESDILTFLC 612 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 80/200 (40%), Gaps = 19/200 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 S +K LK F+ T ++ K+ + +D++ +DVS+SM + D + Sbjct: 51 SSFKSTLKLLFFLLGITLLTLGLVNPKIGTKLETVKREGVDIVFAVDVSKSMLAE-DIAP 109 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 +++ A + ++ ++ ++ + G++ ++ + + + ++ ++ Sbjct: 110 NRLEKAKRLVSEIINQLASD-------RIGIIAYAGQAFPQLPITTDYGAAKMFLQNMNT 162 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 + T+ + A + N +++ ++DGE + +L Sbjct: 163 NML-TSQGTAINEAIELATTYYDDEEQTN-------RVLFIISDGE---DHSEGTTLKAV 211 Query: 313 NEAKKRGAIVYAIGIRVIRS 332 ++A + G ++ IG+ + Sbjct: 212 DDAIEEGIQIFTIGVGKSKG 231 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 50/246 (20%), Positives = 76/246 (30%), Gaps = 56/246 (22%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 T +M+ LDVS SM D +I A K+ A + K P + + GLVT Sbjct: 80 PITSNSTTIMLALDVSGSM-CSTDVPPNRITAAEKAATAFI---KAQPAGS---RIGLVT 132 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ------GMRQH 279 FS L ++ L+ T G+ + + I D G Sbjct: 133 FSGIAGLLVPPTTDSQKLLDALQNLTTS-RGTAIGQGILTSIDAIADADPSVAPTGSAVS 191 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------- 328 N +IV +TDG N + Q + +A R VY IG Sbjct: 192 GNGTGPYAADVIVVLTDGANTQGVDPQTA---AKQAAARRLRVYTIGFGTTTPAPMVCGS 248 Query: 329 ---------------------------VIRSHEFLRACA-SPNSFYLVENPHSMYDAFSH 360 VI A + ++Y +N + DA Sbjct: 249 SQVGGFGGFGGFGGFGGGGRLGDRSPLVIDEQALRDVAATTGGTYYRAQNAGQLQDALGT 308 Query: 361 IGKDIV 366 + ++I Sbjct: 309 LPRNIT 314 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 74/243 (30%), Gaps = 63/243 (25%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +++ +D SRSM + D + T+I + ++ ++ + V+ G+V+F+ Sbjct: 86 ITLVLAMDTSRSM-AATDVAPTRIGASKQAARDLIVGLPAS------VRLGMVSFAATAT 138 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------------- 275 + I + + T + GL A +F G Sbjct: 139 VVLPPTDNRQDMLDAIDRF-QLQLGTATGSGLIQALAVLFPDDGIDLEAILFSGESLAPG 197 Query: 276 ----MRQHCNTEDANYKK---------------IIVFMTDGENLSTKEDQQSLYYCNEAK 316 DA K+ ++ ++DG + + + + Sbjct: 198 PGGRSLTEAAAADAVRKREQERPAAQPGSYRHGAVILLSDGRRTTGPDPLDAARMAAQ-- 255 Query: 317 KRGAIVYAIGIRVIRSHE---------------FLRACA--SPNSFYLVENPHSMYDAFS 359 RG VY +G + LRA A + ++ + + + Sbjct: 256 -RGVRVYTVGFGTPQGGAAAESGLSYYMQLDEPALRAVAAITNGEYFQAGSAADLSQVYR 314 Query: 360 HIG 362 + Sbjct: 315 QLS 317 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 80/231 (34%), Gaps = 26/231 (11%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 L I F + +G ++ A+ L + L + ++ +++ ++ + + +D + + + Sbjct: 32 LTILRFGKDERGNFAMIAALVLVPLLLAGMVAVDTANLMRVRNNVQASLDAAALAVGKRF 91 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + + + ++ F D + Sbjct: 92 STGESHTVVQDYGARIFYANVTALSADAINFQI---AFPQD----------------KTT 132 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 + A + + F T T + V + +++ +VLD S+SM+ Sbjct: 133 DQQVQATAAFTYKSLFGVVASRLTGDNWDKHQYTLTASVRLKNT--IEVALVLDNSKSMD 190 Query: 186 SFFDSSITK-IDMAIKSINAMLE----EVKLIPDVNNVVQSGLVTFSNKIE 231 S K ID+ + + ++E + LI V VQ LV F+ + Sbjct: 191 ETRSGSSKKRIDLLKDAASQLVETMASQSALITYVEKPVQFSLVPFAGSVN 241 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 64/225 (28%), Gaps = 77/225 (34%) Query: 222 GLVTFSNKIEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G +S L G+ +Q IK + G TN + + + I Sbjct: 412 GGPNYSCTTLPLTPLTDVTTEQGMKTVQTAIKAMVPNG-GTNVPEAMAWGWRTIVQGAPF 470 Query: 277 RQHCNTEDANYKKIIVFMTDG--------------------------------------- 297 + + + K+++ +TDG Sbjct: 471 TEARASTERGNDKVVIVLTDGANTYYKYDGLAGSGPDRAGNLSYYSTHGYTARITKKYSQ 530 Query: 298 ------------ENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EF 335 +N +T + + C+ AK IV + + + ++ + Sbjct: 531 SRLFQESGVSVSQNNTTYTKALNARFAKLCDNAKAANIIVMTVALDLNEANSTEKAQIDL 590 Query: 336 LRACAS----------PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 LR+C+S P + + + F IG ++ R+ Sbjct: 591 LRSCSSNSRVRMEGGKPAKLFWNSTGGELSETFRQIGDELSNLRL 635 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 46/265 (17%), Positives = 94/265 (35%), Gaps = 36/265 (13%) Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 I ++K + + L ++ +D R +LD + + P Sbjct: 263 TISVKLKAGFPLGKVESLFHKVRIDTTNDATREITLDGAAAADRDFVLEWSAVASDAP-- 320 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + V+ + + S + +++ V+D S SM T I+ A Sbjct: 321 QVGLFREHIGKDDYVLAYVTPPALASPKKVQREVIFVIDNSGSMGG------TSIEQAKA 374 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGV 255 S++ L +++ + ++ F + + +FF ++ +R + L G Sbjct: 375 SLDYALSQLQPGD------RFNVIRFDDTLTKFFEDSVDANQENIASARRFVTSLEAQG- 427 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T P L A + G+RQ IVF+TDGE ++Q L A Sbjct: 428 GTEMLPALHAALDDSNQGNGLRQ------------IVFLTDGE----ISNEQQLLDAVAA 471 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACA 340 ++ + ++ +GI + + A Sbjct: 472 RRGRSRIFMVGIGSAPNSYLMNRAA 496 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 101/287 (35%), Gaps = 49/287 (17%) Query: 109 DIVRSTSL-DIVVVPQNEGYSISAISRYKIPLKFCTFIPW--YTNSRHIVMPITSSVKVN 165 ++ STS+ ++ + N ++ W +++ + S + + Sbjct: 96 NLQTSTSIPNVTEMKTNMSLGLTLTRNMGTGGFLTCGPLWAQQCGNQYYTTGVCSDISPD 155 Query: 166 SQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 Q A D+++V D S S+ + D + ++ + + P Sbjct: 156 FQLSASFSPATQPCPSLIDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPT 207 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVS-TNSTPGLKYAYNQIF 271 Q GL+ ++N F L + + + S++G TN+ ++YA + Sbjct: 208 K---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAY 264 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---- 327 + + K++V +TDGE+ + + CN + + I + Sbjct: 265 SA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYL 315 Query: 328 --RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + + ++A AS F+ V + ++ + +G+ I Sbjct: 316 NRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGTLGEQIF 362 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 71/168 (42%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ V+D S ES + K + S++ + ++ + + ++ + FS+ ++ Sbjct: 47 IDVVFVVDSS---ESSKIALFDKQKDFVNSLSDKVFQLTPVGFLKYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A + +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNFIGQGTFSYYAISNA-TMLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + MTDG + D QS+ +A+ G + IG+ + + LR Sbjct: 155 VALLMTDGIDHPKNPDVQSIS--EDARTAGILFITIGLSTVVNEAKLR 200 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 35/80 (43%), Gaps = 4/80 (5%) Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACAS-PNSFYLVENP 351 TD E ++D + C+ AK G ++++IG LR CAS P++++ Sbjct: 451 FTD-EISYGQKDVRLQQICDAAKDSGIVIFSIGFEAPENGRNQLRDCASQPSNYFNATGV 509 Query: 352 HSMYDAFSHIGKDIVTKRIW 371 + AF I + R+ Sbjct: 510 -QITTAFRAIATQLSHLRLT 528 >gi|126331114|ref|XP_001371712.1| PREDICTED: similar to Anthrax toxin receptor 2 precursor (Capillary morphogenesis gene 2 protein) (CMG-2) [Monodelphis domestica] Length = 610 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 72/215 (33%), Gaps = 27/215 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + D+ VLD S S+ + I ++++ V+ + Sbjct: 154 SQAEEQPSCHGVFDLYFVLDKSGSVAQNW-----------IEIYNFVKQLTERF-VSPGM 201 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + +K L + T GL+ A QI + G++ Sbjct: 202 RLSFIVFSSQATIILPLTGDSKKITEGLKDLKEVQPVGETYIHEGLRLANEQIKNAGGLK 261 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I++ +TDG+ + N ++ GA VY +G+ + Sbjct: 262 TSS---------IVIALTDGKLDKLVPKYAAKE-ANISRTLGARVYCVGVLDFDQAQLEN 311 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + + V + A I I+ K Sbjct: 312 IADSKDQVFPVTGG---FQALKGIINSILEKSCTE 343 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 74/221 (33%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 IVLD S SM + S + ++ ++ + + GLVT++ Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATHPKI 320 Query: 230 -IEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + +++ ++ K TN+ L+ Y+ + Sbjct: 321 WVKVSDPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDIPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|149037632|gb|EDL92063.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] gi|149037633|gb|EDL92064.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] Length = 2140 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 46/311 (14%), Positives = 112/311 (36%), Gaps = 28/311 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 1518 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVSRHAQVISSSGIMSLGI--GDRN 1575 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I R+ I P+ + ++P + + S P + Sbjct: 1576 IDRTDLQTITNDPR---LVFTVREFRELPNIEERVMLSFGPSGPTPQPPEVEFPSSRPEK 1632 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1633 KKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNSD 1683 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + ++ I ++K G N+ G+++ + + + + D Sbjct: 1684 PTDEFFLRDFSTK-RQIIDAINKVIYKGGRHANTRVGIEH----LLKNHFVSEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 79/203 (38%), Gaps = 25/203 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + + A D++ ++D S + + + + + ++ ++E + + + Sbjct: 25 AQQQSDVKNGAAADILFLVDSS------WSAGKDRFLLVQEFLSDVVESLSVGDND---F 75 Query: 220 QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGM 276 LV + FLL + I +S G S + GL+Y ++ + + G Sbjct: 76 HFALVRLNGNPHTEFLLNAYHSKQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGS 135 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R ++IV +TDG+ E+ E K V+A+G+ L Sbjct: 136 RA-----ADGVPQVIVVLTDGQ----SEEDGFALPSAELKSADVNVFAVGVEDAD-ERTL 185 Query: 337 RACAS---PNSFYLVENPHSMYD 356 R AS + +EN S++D Sbjct: 186 REIASEPLSMHVFNLENVTSLHD 208 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 77/212 (36%), Gaps = 24/212 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 628 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 675 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y + Sbjct: 676 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGTGLNAGSALSY----VHANHF 731 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 732 TEAGGSRIREHVPQLLLLLMAG-----PSEDVYLQAANALVRSGVLTFCVGTNQADKAEL 786 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 R +P+ YL+++ ++ + + + T Sbjct: 787 ERIAFNPSLVYLMDDFSALPALPQQLIQPLTT 818 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 83/237 (35%), Gaps = 24/237 (10%) Query: 124 NEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + Y S + ++P + T I T+ + + ++ + D++ ++D Sbjct: 1382 SPEYVFSVSTFRELPRLEQRLLTPITTLTSQQIQQILASTRYPPSVAESDAADIVFLIDS 1441 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S +M+ I ++ ++ + + P+ V+ G+V FSN + F L+ Sbjct: 1442 SDAMKPDG------IAHIRDFVSRIVRRLNIGPNK---VRIGVVQFSNDVFPEFYLKTHK 1492 Query: 241 S--HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S + I+ L G S N+ L++ +F ED + +++F+ Sbjct: 1493 SQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL--- 1546 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 K + G + IG R I + P + V + Sbjct: 1547 ---GGKSQDDVSRHAQVISSSGIMSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1600 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 81/222 (36%), Gaps = 33/222 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 S + +Q A D++ ++D S ++ D+ + +LE + + Sbjct: 226 AGDKESLKDITAQDSA--DIIFLIDGS------QNTGKANFDVIRDFLVNVLERLSV--- 274 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIF 271 N V+ G+V +S++ F L+ S + +K LS G N L + F Sbjct: 275 GNQQVRVGVVQYSDEPRTMFSLDSYPSKAAVLDAVKRLSFAGGELANIGQALDFVVENHF 334 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R + + +++V ++ G + D A K+G V++ G+ Sbjct: 335 T----RTGGSRVEEGVPQVLVLISAGPSSDEIRDAVV------ALKQG-SVFSFGLGAQA 383 Query: 332 SHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + L+ A+ ++ F G D+ + + Y Sbjct: 384 ASRVELQHIATDDNLVFT------VPEFRSFG-DLQEQLLPY 418 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 74/206 (35%), Gaps = 28/206 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + P+ V+ LV +S++ Sbjct: 1027 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPNQ---VRVALVQYSDRTRP 1076 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L + N + G R + Sbjct: 1077 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALDFVLRNILTSSTGSRI-----EEGVP 1131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T + + K+ GA+ IGI E P+ + Sbjct: 1132 QLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAI 1186 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 PTFREL----GTIQQVISERVIQLNR 1208 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 82/222 (36%), Gaps = 29/222 (13%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +M +++ A+ ++ +V+D S SM T++ A+++ + + Sbjct: 86 ETFLMFEVRGDGSPARSLAQANLSLVIDRSGSM------KGTRLTNAVQAATTAVSRLND 139 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYN 268 V+ +VTF + + +I + G T + G++ + Sbjct: 140 GDVVS------VVTFDTRTSVVVPPTTVGPETRGRILASVRGISLGGDTCISCGIEEGLS 193 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + A +++V ++DG+ N ++ A+ RG + IG+ Sbjct: 194 LL----------GQTSAGVSRMLV-LSDGDANHGVRDVPGFRAMAQRARDRGVAITTIGV 242 Query: 328 RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 V + + L A A S Y VEN ++ F + + T Sbjct: 243 DVDYNEKILSAIALDSNGRHYFVENDAALARIFEAEAEQLTT 284 >gi|119512059|ref|ZP_01631153.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] gi|119463285|gb|EAW44228.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] Length = 435 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 85/238 (35%), Gaps = 33/238 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N R + + I++ +Q L++ ++LD S SM I I+++ +L+ + Sbjct: 21 NQRQLSISISAIADELAQ-HLPLNLCLILDKSGSMHGQP------IATVIQAVEQLLDRL 73 Query: 210 KLIPDVNNVV--QSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + +V F+ + + + + ++ +I K T GL+ Sbjct: 74 QPSDSETPTSGDRISVVAFAGEAQVIIPNQTLQDTASIKAQIHKKLKASGGTAIAEGLQL 133 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST----------KEDQQSLYYCNEA 315 ++ +TDG S+ ++++ L + ++A Sbjct: 134 GITELMKGTKGAVSQA----------FLLTDGHGESSLKIWKFEIGKDDNKRCLEFAHKA 183 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 K + +G + + L A + +E P + F H+ + I + R+ Sbjct: 184 TKINLTINTLGFGNDWNQDLLEKIADAGGGTLAYIERPEQALEQFRHLLQRIQSVRLT 241 >gi|309361725|emb|CAP28912.2| hypothetical protein CBG_09720 [Caenorhabditis briggsae AF16] Length = 675 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 89/253 (35%), Gaps = 37/253 (14%) Query: 117 DIVVVPQNEGYSISAISRY-----KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + + + + + + S + K P+ + + + S+ S T Sbjct: 438 NEIEELNGKNFKVRSRSVHFAMTEKPPVTTAMNPMKFFTTSRTPITTAKSLIPYSCTA-- 495 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-- 229 D+ ++D+S + D S +D+A +I+++ P V+ GL+++S Sbjct: 496 -DVFFLVDLS---QGTGDKSQQYLDIAASAISSL-------PISQEAVRVGLISYSGPGR 544 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L ++ + + G +T + ++YA + H N Sbjct: 545 THVRVYLDKHNDKEKLIEEMFLMERHGGTTRTADAIRYA----TKIFEGMAHPA--RKNV 598 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACASPN 343 KK++V TDG + D A+ +G + A+ + + Sbjct: 599 KKVLVVFTDGYSQDHPRDA-----ARGARAKGLQLIAVAVKDRLAPPDEEQLAEIGGHAK 653 Query: 344 SFYLVENPHSMYD 356 + ++ N + + Sbjct: 654 NVFISPNGRELRE 666 >gi|221043432|dbj|BAH13393.1| unnamed protein product [Homo sapiens] Length = 367 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 62/162 (38%), Gaps = 17/162 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ ++D S S++ ++ ++++ E + + + F Sbjct: 144 CPHQEMDIVFLIDGSGSIDQN---DFNQMKGFVQAVMGQFEGTDTLFALMQYSNLLKIHF 200 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + Q + + + T + G+ Q+F H N + Sbjct: 201 TFTQFRTSPSQ------QSLVDPIVQLKGLTFTATGILTVVTQLF------HHKNGARKS 248 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 KKI++ +TDG+ + + +A+K G I YAIG+R Sbjct: 249 AKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVR 288 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 76/239 (31%), Gaps = 51/239 (21%) Query: 164 VNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +TD + + ++D S SM+ F I + ++ E+ ++ + + V+ G Sbjct: 111 IEVETDPIIPTVQFLIDFSGSMDQNFGG-IKRSQAVRNAL--FDEDDGVVALLQSQVRFG 167 Query: 223 LVTFSN----------KIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQ 269 +++ ++ + ++ L+ I T + Sbjct: 168 ASLYTSFDGNEAPPCPRLTQVAPAFNNLTALRADIGGPLNDPPNAGDTPTG-------ES 220 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE--------NLSTKEDQQSLYYCNEAK----K 317 I + D IV TDGE + + EA + Sbjct: 221 IDAIAENFPDNGPNDKPL---IVLATDGEPDSCTDPDPNTDPGRAATRRLSEEATQRAFE 277 Query: 318 RGAIVYAIGIRVIRSHEFLRACAS------------PNSFYLVENPHSMYDAFSHIGKD 364 G +Y + + + L+ A+ P + Y+ N + DAFS I + Sbjct: 278 AGIELYVLSVGNDVGADHLQRVANAGVGKALDESNDPATVYIGNNQQELVDAFSEIIRS 336 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 55/341 (16%), Positives = 118/341 (34%), Gaps = 39/341 (11%) Query: 36 MIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDI--LCRIKNTWNMS 93 ++ E S+ +K + +L A N +K + + + + Sbjct: 45 IMFERSNAAAVKEYAGAPAYAALEKTARIAPAPSLPQNTEKYGKIESNPVQAVARNPVST 104 Query: 94 FRNELRDNGFVND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF---I 145 F ++ + N + + + ++ I + Y A P T Sbjct: 105 FSIDVDTGSYANARRFLNDNRLPPAHAVRIEELINYFDYGY-APPSDGKPFAVYTETIDS 163 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW +++ I + I + ++ S +++ ++DVS SM++ K+ + K++ + Sbjct: 164 PWQADAKLIKIAIKAK-EIRSSALPPANLVFLVDVSGSMQA-----QDKLPLVKKTLRIL 217 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 + ++ V L+T+++ + G + + I L G ST L Sbjct: 218 TKRLRAEDKVT------LITYASNEKLVLPPTSGKDKDTILQAINQLEA-GGSTAGEQAL 270 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIV 322 + AY Q ++ N I+ TDG+ D +L E +K G + Sbjct: 271 QMAYAQ------AQKAYIKNGINR---ILLATDGDFNVGITDFNTLKDTVAEKRKAGISL 321 Query: 323 YAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSH 360 +G +E L A A ++ ++N Sbjct: 322 TTLGFGTGNYNEQLMEQLADAGDGNYSYIDNETEAKKVLQR 362 >gi|260813733|ref|XP_002601571.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] gi|229286869|gb|EEN57583.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] Length = 161 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 54/172 (31%), Gaps = 26/172 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++LD S S+ D + ++ + P + G+ +S++ Sbjct: 1 DLFLLLDGSGSVG------TANFDKVKQFAADVVNSFDVSPTA---TRVGVAQYSDRNSL 51 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L I +S T + L++ RQ+ K+ Sbjct: 52 VFNLGDHADKPSTVSAINGISYQRGGTKTGAALEFV----------RQNAAWRGGAVPKV 101 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ +TDG K G VYAIG+ E + S Sbjct: 102 MIVLTDG-----KSGDAVAAPSQSLAADGVAVYAIGVGNFDHAELQQIANSD 148 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 58.6 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 66/217 (30%), Gaps = 27/217 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+ S + + L + ++LD S SM + A + L V + Sbjct: 159 PVKSDTFLANLQQVPLSVSLILDNSGSMSGNP------MTQAKSAAKQFLNYVD----FS 208 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 N Q ++ F++ + + L I + T L + + G Sbjct: 209 NGDQVEIIEFNSDVYIRIPYGSDIKSLNTAIDTMESNSQ-TALYDALYTGLVRAYSQSG- 266 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 K I+ TDGE + + ++ ++ IG+ + E L Sbjct: 267 -----------PKCILAFTDGEENA--SIRSVSEVTELSRATSIPIFIIGVGSLIDEESL 313 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A + ++ + + + + + Sbjct: 314 KEIAEQTGGEYFYSPTAVELEQIYKTVYDQQKEQYVL 350 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 59/190 (31%), Gaps = 15/190 (7%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 V ++S G + + + + H+ SS T Sbjct: 36 VTTSSAIAEDDQNPSGDTSTLQPANSKSIAKIEGGDGDQYALHLTASGDSS-SSTVTTAV 94 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++VLD S SM+ + + A + +L VQ +VTFS++ Sbjct: 95 PADIVLVLDKSGSMK-NSNRDTNAKNAATALASKLLTAANAALPAEQQVQMAVVTFSDRA 153 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + TN LK A N + + +K Sbjct: 154 RTTSQFTTSPGAIGTAVSAWP--NGGTNWEDALKTA-NDLSS----------GRSGVQKH 200 Query: 291 IVFMTDGENL 300 IVF++DG Sbjct: 201 IVFLSDGNPT 210 >gi|332665830|ref|YP_004448618.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332334644|gb|AEE51745.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 630 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 46/282 (16%), Positives = 97/282 (34%), Gaps = 37/282 (13%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPIT 159 F+N+ ++ I + Y K P T + PW R + + + Sbjct: 191 FINNGQR-PPEDAVRIEEMINYFDYEYP-EPTDKHPFSITTEVSDCPWQPKHRLVHIGLQ 248 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + +++ ++DVS SM S+ K+ + S + E+++ V VV Sbjct: 249 GKHTP-VENLPAANLVFLVDVSGSM-----SAANKLPLVQASYKLLAEQLRPQDRVAIVV 302 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +G + + ++ I L G ST G+ AY +++ Sbjct: 303 YAGAAG----LVLESTTGNNKTKIKEAIDKLQA-GGSTAGGEGILLAYKT------AKEN 351 Query: 280 CNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFL 336 N ++ +DG+ N+ D + + E +K G + G+ + ++ Sbjct: 352 FIKGGNNR---VILASDGDFNVGVSSDGELVRIIEEERKSGVYLTILGYGMGNYKDNKMQ 408 Query: 337 RACASP-NSFYLVENPHS----MYDAFS----HIGKDIVTKR 369 + S + ++N + F I KD+ + Sbjct: 409 KLADSGNGNHAYIDNLDEARRVLVSEFGGTMYTIAKDVKLQL 450 >gi|301778755|ref|XP_002924795.1| PREDICTED: integrin alpha-X-like [Ailuropoda melanoleuca] Length = 1160 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 74/204 (36%), Gaps = 28/204 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ TK+ +K++ + L+ FS+ Sbjct: 152 DIVFLIDGSGSISPL---DFTKMLNFVKAVMSQFRRPNTQFS--------LMQFSDDFRV 200 Query: 233 FFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + + + G T++ ++ NQ+F + + KI Sbjct: 201 HFTFKDFTDSSNPLVLLDSVYQLGGYTHTATAIQMVTNQLFSTS------SGARKDASKI 254 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-----RSHEFLRACASP--N 343 ++ +TDG+ + + + A+ G I YA+G+ + E + P Sbjct: 255 LIVITDGQKQGDYLNYEDVIP--MAEAAGIIRYAVGVGLAFRKRHSWKELNDIASKPSNE 312 Query: 344 SFYLVENPHSMYDAFSHIGKDIVT 367 + V+N + D + + + I Sbjct: 313 YIFKVDNFDVLRDIQNQLKEKIFA 336 >gi|194043712|ref|XP_001928130.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Sus scrofa] Length = 2972 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 50/323 (15%), Positives = 112/323 (34%), Gaps = 31/323 (9%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +D + ++ + + E NN + G + + ++ RS Sbjct: 311 LDGAALYTGSAL--EFVRNNMFTSEAGYRAAEGVPRLLVLITGGKSLDDVSQPAQELKRS 368 Query: 114 TSLDIVVVPQNEGYSISAISRYK-----IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + + V + + + IP +F P ++ P+ + Sbjct: 369 SIMAFAVGNKAADQAELEEIAFDPSLVFIPTEF-RASPLQGVLSSLLAPLRTLSGTTEVH 427 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S + T+ + ++ + + D ++ GLV FS+ Sbjct: 428 ANKRDIIFLLDGS------VNVGRTRFPYVRDFVMNLVNSLDVGGDN---IRVGLVQFSD 478 Query: 229 -KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTED 284 + EF L + S L ++ L K G N+ L Y + N + G R Sbjct: 479 TPVTEFSLNTYQTKSDLLAHVRQLQLKGGSGLNTGAALSYVHANHFTEAGGSRIQ----- 533 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +T G + L N + G + + +G E + +P+ Sbjct: 534 EQVPQLLLLLTAG-----PSEDSYLQAANALARSGILTFCVGASQANKAELEQIAFNPSL 588 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 YL+++ S+ + + + T Sbjct: 589 VYLMDDFSSLPALPQQLIQPLTT 611 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 95/255 (37%), Gaps = 25/255 (9%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKV 164 +I R+ I P+ + ++P + + + P + Sbjct: 1367 GDRNIDRTELQTITNDPR---LVFTVREFRELPNIEERVMNSFGPSGVTPAPPGVDILPP 1423 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + D++ +LD S+ DS + + + E V + + + +Q GLV Sbjct: 1424 SRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGLV 1474 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ + F L+ + + I + G N+ GL++ + Q + + + Sbjct: 1475 QYNSDPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGLEH----LRLNQFVPEAGS 1530 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +I +T G K + + +RG V+A+G+R I S E + ++ Sbjct: 1531 RLEQRVPQIAFVITGG-----KSVEDAQEASLALTQRGVKVFAVGVRNIDSEEVGKIASN 1585 Query: 342 PNSFYLVENPHSMYD 356 + + V N + + Sbjct: 1586 SATAFRVGNVQELSE 1600 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 73/213 (34%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM ++D S + F + + K I ++ ++ L PD + Sbjct: 2402 RRAAGSDVDIDMAFIVDSSETTTPF------QFNEMRKYIGYVVRQLDLSPDPKASQHFA 2455 Query: 220 QSGLVTFS----------NKIEEFFLLE-WGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + ++ F L +G + +++ + + Y Sbjct: 2456 RVAVVQHAPWESVGNASVPPVKVDFSLTDYGSKEKLVAFLSSRMTQLQGTRALGSAIYYT 2515 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +F+ + KI+V M GE + ++ +AK +G +G Sbjct: 2516 VENVFES--------APNPRDLKIVVLMLTGEVQKEQLEEAQRAIL-QAKCKGYFFVILG 2566 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + PN + V+ + + Sbjct: 2567 IGRKVNVKELYSFASEPNDVFFKLVDKSSELNE 2599 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 84/254 (33%), Gaps = 27/254 (10%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 + + I + P+ Y S + ++P K T I T + + ++S Sbjct: 1163 ARNADPEELVKISLSPE---YVFSVSTFRELPSLEQKLLTPITTLTAEQIQQLLASTSYP 1219 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1220 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNIGPNK---VRVGV 1270 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHC 280 V FSN + F L+ S I+ L G S N+ L++ F Sbjct: 1271 VQFSNDVFPEFYLKTYRSQASVLDAIRRLRFKGGSPLNTGKALEFVARNFFVKS---AGS 1327 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ED + +++F+ K Y G + +G R I E Sbjct: 1328 RIEDGVPQHLVLFL------GGKSQDDVSRYSQVMGSSGIVRLGVGDRNIDRTELQTITN 1381 Query: 341 SPNSFYLVENPHSM 354 P + V + Sbjct: 1382 DPRLVFTVREFREL 1395 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 57/163 (34%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + PD V+ +V +S++ Sbjct: 822 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD---RVRVAVVQYSDRTRP 871 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L Q + + + G N+ L + I + Sbjct: 872 EFYL-NSYMDQQSVVNAVRQLTLLGGPIPNTGAALDFVLRDILTESAGSRIAEG----IP 926 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 927 QLLIVLT-----ADRSGDDVRGPSVVLKRGGAVPIGIGIGNAD 964 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 72/206 (34%), Gaps = 35/206 (16%) Query: 173 DMMIVLDVS---RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ ++D S S+ + + + +LE + + ++ G+V +S++ Sbjct: 35 DIIFLIDGSNETGSVH---------FAVILDFLVNLLERLSV---GAQQIRVGVVQYSDE 82 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L + + + + G + GL + + D R + + Sbjct: 83 PRTMFSL-NSFATKAQVLDAVKALGFLGGELANVGLALDF--VVDHHFTRAGGSRVEEGV 139 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFY 346 +++V ++ + + K+ V++ G+ S L+ A+ ++F Sbjct: 140 PQVLVLIS-----AAPSSDKIRDAVLALKQAS--VFSFGLGAQAASKAELQHIATNDNFV 192 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 F G D+ + + Y Sbjct: 193 FT------VPEFRSFG-DLQDQLLPY 211 >gi|194043710|ref|XP_001928122.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Sus scrofa] Length = 3178 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 50/323 (15%), Positives = 112/323 (34%), Gaps = 31/323 (9%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +D + ++ + + E NN + G + + ++ RS Sbjct: 517 LDGAALYTGSAL--EFVRNNMFTSEAGYRAAEGVPRLLVLITGGKSLDDVSQPAQELKRS 574 Query: 114 TSLDIVVVPQNEGYSISAISRYK-----IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + + V + + + IP +F P ++ P+ + Sbjct: 575 SIMAFAVGNKAADQAELEEIAFDPSLVFIPTEF-RASPLQGVLSSLLAPLRTLSGTTEVH 633 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S + T+ + ++ + + D ++ GLV FS+ Sbjct: 634 ANKRDIIFLLDGS------VNVGRTRFPYVRDFVMNLVNSLDVGGDN---IRVGLVQFSD 684 Query: 229 -KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTED 284 + EF L + S L ++ L K G N+ L Y + N + G R Sbjct: 685 TPVTEFSLNTYQTKSDLLAHVRQLQLKGGSGLNTGAALSYVHANHFTEAGGSRIQ----- 739 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ +T G + L N + G + + +G E + +P+ Sbjct: 740 EQVPQLLLLLTAG-----PSEDSYLQAANALARSGILTFCVGASQANKAELEQIAFNPSL 794 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 YL+++ S+ + + + T Sbjct: 795 VYLMDDFSSLPALPQQLIQPLTT 817 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/255 (16%), Positives = 95/255 (37%), Gaps = 25/255 (9%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKV 164 +I R+ I P+ + ++P + + + P + Sbjct: 1573 GDRNIDRTELQTITNDPR---LVFTVREFRELPNIEERVMNSFGPSGVTPAPPGVDILPP 1629 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + D++ +LD S+ DS + + + E V + + + +Q GLV Sbjct: 1630 SRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGLV 1680 Query: 225 TFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ + F L+ + + I + G N+ GL++ + Q + + + Sbjct: 1681 QYNSDPTDEFFLKDFSTKQQIIDAINKVVYKGGRHANTKVGLEH----LRLNQFVPEAGS 1736 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +I +T G K + + +RG V+A+G+R I S E + ++ Sbjct: 1737 RLEQRVPQIAFVITGG-----KSVEDAQEASLALTQRGVKVFAVGVRNIDSEEVGKIASN 1791 Query: 342 PNSFYLVENPHSMYD 356 + + V N + + Sbjct: 1792 SATAFRVGNVQELSE 1806 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 73/213 (34%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM ++D S + F + + K I ++ ++ L PD + Sbjct: 2608 RRAAGSDVDIDMAFIVDSSETTTPF------QFNEMRKYIGYVVRQLDLSPDPKASQHFA 2661 Query: 220 QSGLVTFS----------NKIEEFFLLE-WGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + ++ F L +G + +++ + + Y Sbjct: 2662 RVAVVQHAPWESVGNASVPPVKVDFSLTDYGSKEKLVAFLSSRMTQLQGTRALGSAIYYT 2721 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +F+ + KI+V M GE + ++ +AK +G +G Sbjct: 2722 VENVFES--------APNPRDLKIVVLMLTGEVQKEQLEEAQRAIL-QAKCKGYFFVILG 2772 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + PN + V+ + + Sbjct: 2773 IGRKVNVKELYSFASEPNDVFFKLVDKSSELNE 2805 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 43/254 (16%), Positives = 84/254 (33%), Gaps = 27/254 (10%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 + + I + P+ Y S + ++P K T I T + + ++S Sbjct: 1369 ARNADPEELVKISLSPE---YVFSVSTFRELPSLEQKLLTPITTLTAEQIQQLLASTSYP 1425 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1426 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNIGPNK---VRVGV 1476 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHC 280 V FSN + F L+ S I+ L G S N+ L++ F Sbjct: 1477 VQFSNDVFPEFYLKTYRSQASVLDAIRRLRFKGGSPLNTGKALEFVARNFFVKS---AGS 1533 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ED + +++F+ K Y G + +G R I E Sbjct: 1534 RIEDGVPQHLVLFL------GGKSQDDVSRYSQVMGSSGIVRLGVGDRNIDRTELQTITN 1587 Query: 341 SPNSFYLVENPHSM 354 P + V + Sbjct: 1588 DPRLVFTVREFREL 1601 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 33/274 (12%), Positives = 88/274 (32%), Gaps = 45/274 (16%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SS+ Sbjct: 173 AVGVEDADEGALKEIASEPLNMHVFNLENFTSLHDLVGNLVSCVHSSMTPEGAGGTETLK 232 Query: 167 --QTDARLDMMIVLDVS---RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ + + + +LE + + ++ Sbjct: 233 DITAQESADIIFLIDGSNETGSVH---------FAVILDFLVNLLERLSV---GAQQIRV 280 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V +S++ F L + + + + G + GL + + D R Sbjct: 281 GVVQYSDEPRTMFSL-NSFATKAQVLDAVKALGFLGGELANVGLALDF--VVDHHFTRAG 337 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRA 338 + + +++V ++ + + K+ V++ G+ S L+ Sbjct: 338 GSRVEEGVPQVLVLIS-----AAPSSDKIRDAVLALKQAS--VFSFGLGAQAASKAELQH 390 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+ ++F F G D+ + + Y Sbjct: 391 IATNDNFVFT------VPEFRSFG-DLQDQLLPY 417 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 57/163 (34%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + + ++E + + PD V+ +V +S++ Sbjct: 1028 DVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD---RVRVAVVQYSDRTRP 1077 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L Q + + + G N+ L + I + Sbjct: 1078 EFYL-NSYMDQQSVVNAVRQLTLLGGPIPNTGAALDFVLRDILTESAGSRIAEG----IP 1132 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + K+ GA+ IGI Sbjct: 1133 QLLIVLT-----ADRSGDDVRGPSVVLKRGGAVPIGIGIGNAD 1170 >gi|313227043|emb|CBY22190.1| unnamed protein product [Oikopleura dioica] Length = 1109 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 78/247 (31%), Gaps = 24/247 (9%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCT-FIPWYTNSRH---IVMPITSSVKVNSQTDARL 172 I + +S+ P P N + + + + + + Sbjct: 475 KIYSEGEEREIQVSSSLFPTKPTSLRDSPAPGEDNKITENLVELTDPLTSFICENSTKEV 534 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM+ +LD S S+ + + ++ + + P Q + ++N Sbjct: 535 DMIFLLDSSGSVGK------PNFQVMKSWMRRLISGLNIAP---GRTQVSVYLYNNIFRT 585 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + I + G T L+ A +++ Q N + Sbjct: 586 IFNLNEHQNAYDMITAINKMVYSGKGTRIARALQSAMSKVLIPQSGL------RPNSEIY 639 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYL 347 + +TDG+ + +A + A+GI + L A AS ++ +L Sbjct: 640 LYLLTDGKESDVADVNNMANDIKDAFGDRITLTAVGISRSVENAELYAIASAPKKDNVFL 699 Query: 348 VENPHSM 354 +EN + Sbjct: 700 LENYRDL 706 >gi|157831431|pdb|1IDO|A Chain A, I-Domain From Integrin Cr3, Mg2+ Bound Length = 189 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 28/210 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ L++ K + + + + Sbjct: 1 GCPQEDSDIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIH 57 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F+ K + + + +K +++ T++ G++ ++F++ N Sbjct: 58 FTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARK 105 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS 341 N KI+V +TDGE + + EA + G I Y IG R +S + L AS Sbjct: 106 NAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIAS 163 Query: 342 P---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 164 KPPRDHVFQVNN----FEALKTIQNQLREK 189 >gi|4154309|gb|AAD04919.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 88/278 (31%), Gaps = 31/278 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + I Sbjct: 1587 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHANVQELEKIGWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ + + Sbjct: 1647 APILIHDFEMLPREAPDLVLQRCCSGEGLQIPTLSPTPDCSQPLDVVLLLDGSSSIPASY 1706 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 D A + + P + Q ++ + + I+ + + + HL Sbjct: 1707 ------FDEMKSFTKAFISRANIGP---RLTQVSVLQYGSITTIDVPWNVAYEKVHLLSL 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ K +++ +TD Sbjct: 1758 VDLMQQEGGPSQIGDALSFAVRYVTS----EVHGARPGAS-KAVVILVTD------VSVD 1806 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A+ V+ IGI S L + A P + Sbjct: 1807 SVDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKA 1844 >gi|1478046|gb|AAB05549.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 88/278 (31%), Gaps = 31/278 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + I Sbjct: 1587 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHANVQELEKIGWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ + + Sbjct: 1647 APILIHDFEMLPREAPDLVLQRCCSGEGLQIPTLSPTPDCSQPLDVVLLLDGSSSIPASY 1706 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 D A + + P + Q ++ + + I+ + + + HL Sbjct: 1707 ------FDEMKSFTKAFISRANIGP---RLTQVSVLQYGSITTIDVPWNVAYEKVHLLSL 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ K +++ +TD Sbjct: 1758 VDLMQQEGGPSQIGDALSFAVRYVTS----EVHGARPGAS-KAVVILVTD------VSVD 1806 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A+ V+ IGI S L + A P + Sbjct: 1807 SVDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKA 1844 >gi|50950127|ref|NP_001002932.1| von Willebrand factor precursor [Canis lupus familiaris] gi|40786760|gb|AAB93766.2| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 88/278 (31%), Gaps = 31/278 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + I Sbjct: 1587 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHANVQELEKIGWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ + + Sbjct: 1647 APILIHDFEMLPREAPDLVLQRCCSGEGLQIPTLSPTPDCSQPLDVVLLLDGSSSIPASY 1706 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 D A + + P + Q ++ + + I+ + + + HL Sbjct: 1707 ------FDEMKSFTKAFISRANIGP---RLTQVSVLQYGSITTIDVPWNVAYEKVHLLSL 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ K +++ +TD Sbjct: 1758 VDLMQQEGGPSQIGDALSFAVRYVTS----EVHGARPGAS-KAVVILVTD------VSVD 1806 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A+ V+ IGI S L + A P + Sbjct: 1807 SVDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKA 1844 >gi|12644030|sp|Q28295|VWF_CANFA RecName: Full=von Willebrand factor; Short=vWF; Flags: Precursor Length = 2813 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 88/278 (31%), Gaps = 31/278 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + I Sbjct: 1587 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHANVQELEKIGWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD++++LD S S+ + + Sbjct: 1647 APILIHDFEMLPREAPDLVLQRCCSGEGLQIPTLSPTPDCSQPLDVVLLLDGSSSIPASY 1706 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 D A + + P + Q ++ + + I+ + + + HL Sbjct: 1707 ------FDEMKSFTKAFISRANIGP---RLTQVSVLQYGSITTIDVPWNVAYEKVHLLSL 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G + L +A + H A+ K +++ +TD Sbjct: 1758 VDLMQQEGGPSQIGDALSFAVRYVTS----EVHGARPGAS-KAVVILVTD------VSVD 1806 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A+ V+ IGI S L + A P + Sbjct: 1807 SVDAAAEAARSNRVTVFPIGIGDRYSEAQLSSLAGPKA 1844 >gi|326332116|ref|ZP_08198400.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] gi|325950087|gb|EGD42143.1| putative von Willebrand factor type A domain protein [Nocardioidaceae bacterium Broad-1] Length = 338 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 59/168 (35%), Gaps = 15/168 (8%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A LD+++V+D +RSM ++ K + A+ + V+ G +TF Sbjct: 78 ADLDVLVVVDRTRSMVAEDGPGGEARMTQLKKDLKALSAALPS-------VRFGAITFGG 130 Query: 229 KIEEF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + + + L S + +++ Q E Sbjct: 131 EVVRTEMPFTYDTTAFNAWVDGLYAERAFDGSGSMVDAPRDEVISALERDQERFPER--- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++I+VF +DGEN DQQS ++ G +G Sbjct: 188 RRIVVFASDGENTREGVDQQSFSEIDDLSAGGV---VLGYGTEEGGRM 232 >gi|240169099|ref|ZP_04747758.1| hypothetical protein MkanA1_07284 [Mycobacterium kansasii ATCC 12478] Length = 741 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 81/217 (37%), Gaps = 39/217 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V + A D+++VLD S SM + K+ A ++ +++ + + Sbjct: 284 TLVPPAEPSSAPRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDTVD------ 331 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--------LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + ++ F ++I+ + G+ + L G + + P L+ A + Sbjct: 332 RFCVLAFDDRIDTPTDMAPGLVEGSDQNRFAAASWLGSLRSRGGTEMAEP-LRRAVELLA 390 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVI 330 RQ +V +TDG+ +SL A+ G +Y +GI Sbjct: 391 GSDEGRQAS----------VVLVTDGQITGEDHLLRSL-----AQSLGRIRIYCVGIDRA 435 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + FL A LVE+ + +A S + + I Sbjct: 436 VNAGFLDRLARLGRGRAELVESEARLDEAMSRLARTI 472 >gi|163658505|gb|ABY28382.1| complement component factor B/C2 [Branchiostoma belcheri tsingtauense] Length = 752 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 85/230 (36%), Gaps = 37/230 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +S+ S+ A LD++ LD S S+ + AI+ +++ E + + Sbjct: 279 SSRGRNSLGRLSRGSAGLDLVFALDKSSSI------DAVDFNQAIQFTRSIINEFG-VTN 331 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 Q LVTF ++ + LEW + R+++ L G T T L+ N Sbjct: 332 KEGGTQVALVTFGSQAQ----LEWNLGQLDSKRKVFRQLRQLQPEGGGTALTAALQTVLN 387 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + K+ + +TDG N+ + + V+A+G+ Sbjct: 388 DVLPVTRAGA---------KRALFIITDGKSNVGASPGVFARRLRE---EEAFEVFAVGV 435 Query: 328 RVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L + AS + +L+ + + F + I K I Y++ Sbjct: 436 GANIDKNELNSVASQPFTSHVFLIND----FSNFDTLVNTIAEKEIDYEQ 481 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 84/230 (36%), Gaps = 34/230 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 S + P + D+ ++D S S++ ++ + ++ + ++ Sbjct: 126 FGSNLLQKPRRIPAALRECPQQDSDIAFLIDGSGSID---PVDFDRMKKFVSTVMSRFQK 182 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYA 266 K + L+ +S+ F + + ++ + + T++ G++ Sbjct: 183 SKTLF--------ALMQYSDDFRTHFTFNDFKRNSDPELLVRPIGQLFGRTHTATGIRKV 234 Query: 267 YNQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 ++F G R H KI++ +TDGE + + + EA ++G I Y I Sbjct: 235 VRELFHSSSGARNHAL-------KIMIVITDGEKYLDPLEYRDVIP--EADRKGIIRYVI 285 Query: 326 G----IRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTK 368 G +S + L AS + + V N ++A I + K Sbjct: 286 GVGDAFNSKKSRKELDTIASKPPADHVFQVNN----FEALKTIQNQLQEK 331 >gi|328951281|ref|YP_004368616.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451605|gb|AEB12506.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 328 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 62/166 (37%), Gaps = 18/166 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+V+D S+SM + D S ++++ A + L+ V + GLV+FS Sbjct: 88 VMVVVDTSKSMIA-VDQSPSRLEAARAIVRTFLDRVPRG------ARVGLVSFSAYASVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK---- 289 L +++ ++ L T+ + A + + + D + Sbjct: 141 VLPTARHVEVRKALEALEPQEA-TSLGAAILAAVRALPGRERAGEELLGRDPVPPELQEL 199 Query: 290 ---IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ ++DG + S L A+ +Y +G+ R Sbjct: 200 PPATVLLISDGVSTS---GLDPLEAARVARAHQVRIYTVGVGSPRG 242 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/213 (19%), Positives = 77/213 (36%), Gaps = 28/213 (13%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMP-------ITSSVKVNSQTDAR-LDMMIVLDVSRSME 185 Y PL P+ I P + +K +++ +LDVS SM Sbjct: 281 PYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGIKALPLAHPPKANLVFLLDVSGSM- 339 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 S K+ + +S+ +L ++ V+ VV +G + E + + Sbjct: 340 ----GSPDKLPLVKQSMELLLSGLQPTDTVSIVVYAGAAG---TVLEPTPVA-EQQKILA 391 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKE 304 + L+ G ST G++ AY Q+ + R N I+ TDG+ N+ + Sbjct: 392 ALDRLNA-GGSTAGAQGIELAY-QLAEANYQRDAVNR--------IILATDGDFNVGIAD 441 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +Q Y + G + +G ++ L Sbjct: 442 PEQLKGYVERKRANGIELSILGFGSGNYNDALM 474 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 45/254 (17%), Positives = 92/254 (36%), Gaps = 29/254 (11%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G I TA+ L +F ++ +E + K L + + + T N+ + Sbjct: 15 RHQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTA--NQDDKT 72 Query: 73 NRKKLKGGDILCRIKNTWNMS----FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 +L G I I+N +++ R+E DN D ++ + Y Sbjct: 73 YENQLATGYIQAYIRNITSINNIKIERSEGIDNYPTPDGNE-----------EREYFQYR 121 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 ++A + + I IP + + + + D +D++ V D S SM+ Sbjct: 122 VTAKTNH-ISWLSSDIIPSFAPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGN- 179 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEEFFLLEWGVSHLQRK 246 KI +I A+ E+ + D V + V ++ +++E + Sbjct: 180 -----KIRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRS---NTRWCITQ 231 Query: 247 IKYLSKFGVSTNST 260 + Y F S+ Sbjct: 232 LDYRPNFNGGNYSS 245 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 99/275 (36%), Gaps = 29/275 (10%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRSTSLDIVVVPQ 123 + + + + R + +F ++ + N + + S+ + + Sbjct: 106 ETNTETYEAREDNPVRRAREVPVSTFSVDVDTGSYANVRRMLRDGYRPPADSVRVEEMLN 165 Query: 124 NEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y A + ++P K T + PW + +++ I V+ + ++++++D Sbjct: 166 YFDYGHPAPASREVPFKVTTELAPAPWNPARQLLMVGIKG-YDVDKRELPPANLVLLVDT 224 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM+ K+ + ++ ++ +++ V+ V +G + Sbjct: 225 SGSMD-----DPAKLPLLKRAFAQLVPQLRAKDRVSIVAYAG----HAGLVLPPTPGNRH 275 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-N 299 + ++ L G STN GL+ AY RQ N I+ TDG+ N Sbjct: 276 GEILAALEGLHAAG-STNGGEGLRLAY------AMARQGHVEGGVNR---ILLATDGDFN 325 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + L + ++ G + +G ++ Sbjct: 326 VGITDRNALLTLVADQRRSGIALSTLGFGSGNYND 360 >gi|73958270|ref|XP_856370.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 [Canis familiaris] Length = 1036 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 80/205 (39%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + T + ++ ++++ K N+ L+ FS + Sbjct: 151 DIAFLIDGSGSI------NPTDFQRMKEFVSTVMDQFK-----NSKTLFSLMQFSEDFQI 199 Query: 233 FFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + + +K + + T++ G++ ++F + N KI Sbjct: 200 HFTFNEFKKNPKPSFLVKSIKQLLGRTHTATGIRKVVRELFHSS------SGARENALKI 253 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---N 343 +V +TDGE D + + EA + G I Y IG+ ++ E L AS + Sbjct: 254 LVVITDGEKYGDPLDYKDVIP--EADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRD 311 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 312 HVFRVNN----FEALKTIQNQLQEK 332 >gi|309791847|ref|ZP_07686333.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226108|gb|EFO79850.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 542 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 73/195 (37%), Gaps = 26/195 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---K 229 D+++V+D+S SME K++M + + L + + G++TFS+ + Sbjct: 368 DIILVVDISGSMEG------DKLEMTRAGLESFLMRILPDD------RVGMITFSSSATE 415 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L LQ I +S G T ++ A + + + K Sbjct: 416 VVAPAALSENRMQLQMAISEMSATGK-TAVFDAVELARQSLEALPST-------GEDRMK 467 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYLV 348 IV ++DG + +++ L + G ++ + L A A + +V Sbjct: 468 AIVLLSDGADNASRITLADLE--RNFDETGVSIFPVAYGADADRSILDAIAEFSRTIVVV 525 Query: 349 ENPHSMYDAFSHIGK 363 + + F ++ + Sbjct: 526 GDTGDIAQIFENLSR 540 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 73/213 (34%), Gaps = 34/213 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +++VLDVS SM K+ +A+ ++ + G+V F Sbjct: 331 EEPGGVGIVLVLDVSGSMLED-----DKLGLAVTGSLELIRSARPQD------YIGVVVF 379 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++ F +++ + L ++ G T ++ AY + + TE Sbjct: 380 SDRPRWLFRPRPMTEQGRKEAESLLLSTQAGGGTM----IRRAY---LEALEALEQVPTE 432 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRACAS 341 K ++ +TDG D + A++ + I FLR A Sbjct: 433 S----KQVIALTDGLAADVTPD-----LFDAAREASPRIKTNTVAIGADADGRFLRELAQ 483 Query: 342 --PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +++ V P + F + + + Sbjct: 484 AGDGTYWDVPRPEDLPRFFLEEAQRVFRREALE 516 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 58/438 (13%), Positives = 132/438 (30%), Gaps = 88/438 (20%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ + + G +L + + + V G+ ++ S+ +KT ++D +++ A I+ Sbjct: 5 SVSQYLRSRDGNFGLLAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDII 64 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 EG +L ++F + + + V + + V + Sbjct: 65 EEGKTLAEAELSARKYFDA-----ELAFSSGVGLEVSTFTLTQGVDGI-VKLGVSGKTST 118 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVKVNSQTDARLDMMI-VLDV--- 180 + A+ R +IP+ S I + +T+S+ + ++ L+ Sbjct: 119 SLLKAVGREEIPVSVDAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALNALKA 178 Query: 181 -SRSMESFFDSSITKIDMAIKSINAM---------------------------------- 205 S SM F ++++ + N + Sbjct: 179 NSGSMALTFIPFTDRVNVGMGRANLLNPGDQTAVKKGGWGGCVDVRATKKKNKGETEYFM 238 Query: 206 -------LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 + + + + VS + ++ L+K G Sbjct: 239 PDSAPEKGDRFTKFDNGTPAAHKSGYKLACNPQSIIGPTSNVSDVTSQLGKLTKGGTG-R 297 Query: 259 STPGLKYAYNQI-------------FDMQGMRQHCNTEDANYKKIIVFMTDGE------- 298 G + + + D T N +KI V TDG Sbjct: 298 FDLGFAWLWYALSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAYVYE 357 Query: 299 ---------NLSTKED-QQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACAS--PNSF 345 N +K+ + + C + V+ + + ++ + R CAS + Sbjct: 358 YGKTNLAGWNTGSKDHFENVVAICKSMAAQKIEVHVMHVNGNDKAEPYFRECASATGGGY 417 Query: 346 YLVENPHSMYDAFSHIGK 363 Y V + ++ DA + I Sbjct: 418 YKVASKQTLVDALTGITN 435 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 83/227 (36%), Gaps = 26/227 (11%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNS 166 + ++ I + Y A + + P + PW + + + + ++ Sbjct: 250 LPPRDAVRIEEMVNYFPYDYPAPAAGEAPFRATLGVMDSPWAEGRQLVHIGLQGAL-PPV 308 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+++ ++D S SM++ K+ + +S ML E++ ++ V +G Sbjct: 309 EARPPLNLVFLVDTSGSMDA-----PDKLGLLKQSFRLMLSELRPQDEIAIVAYAGSAG- 362 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + + ++ L G STN GL+ AY TED Sbjct: 363 --EVLAPTP-AGERATILAALERL-AAGGSTNGAGGLEQAY--------ATAEAMTEDGE 410 Query: 287 YKKIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIRVIR 331 + I+ TDG+ D +L ++ + G + +G Sbjct: 411 VSR-ILLATDGDFNVGLSDPSALEDFIADK-RDSGTYLSVLGFGRGN 455 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 69/207 (33%), Gaps = 30/207 (14%) Query: 168 TDARLDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+M+ +D+S SM+ F D+S +ID + + E V + GL+ Sbjct: 90 EKTARDLMLAVDLSGSMDAADFVDASGEQIDRLSAAKGVLNEFVA----GREGDRLGLIV 145 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F N + ++ G ST + A + RQ Sbjct: 146 FGNAAYLQAPFTDDHETWLALLDESIVNMAGPSTALGDSIGLA------IAHFRQSKTEN 199 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFL 336 ++++ +TDG T L AK G +Y + + + E L Sbjct: 200 -----RVLIVLTDG--NDTGSRVPPLDAAEVAKVEGVTIYTVAVGDPTTVGEEALDMETL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHI 361 A + ++ + ++ + + I Sbjct: 253 ETVARLTGGDSFVASDLVALRETYQRI 279 >gi|254420639|ref|ZP_05034363.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] gi|196186816|gb|EDX81792.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] Length = 613 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 103/297 (34%), Gaps = 39/297 (13%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM---SFRNELRDNGFVN-----DI 107 ++V ++IM + + +K T + +F ++ + N D Sbjct: 117 NAVVVTGSRIMPGAPAPSDTETYPDATPNPVKRTADQPVSTFSIDVDTAAYSNVRRFIDE 176 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PW----YTNSRHIVMPITS 160 + ++ + + Y + + P T + PW R IV Sbjct: 177 GRSPPADAVRVEELINAFDYGYARPTSLARPFAITTAVVASPWAPRTERGGRQIVHIGLQ 236 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ L++ ++DVS SM S K+D+A +++N ++ ++ ++ Sbjct: 237 GYELPQGEQRPLNLTFLVDVSGSMRS-----PDKLDLAKQAMNLAIDRLRPQDTLS---- 287 Query: 221 SGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + ++ G ++ + L G T G+ AY+Q Sbjct: 288 --VTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASG-GTAGATGMTNAYDQ--------A 336 Query: 279 HCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + +I++F TDG+ N+ ++++ Y E + G + G + Sbjct: 337 QASFARDKVNRILMF-TDGDFNVGVTDNKRLEDYVAEKRGTGVYLSVYGFGRGNYQD 392 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 78/241 (32%), Gaps = 36/241 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + + +P+ ++ + E SH + L ++++ + +T+ N Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE--NNYR 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + ++ ++ S R VN ++I+ T + Sbjct: 75 NDRASNNRNNYLVTSYAQSYLPSERFSQPR--VVNTYNEILGYTEYN------------- 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSM----- 184 A + L T S + + + + A +D++ V D S SM Sbjct: 120 ASLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFG 179 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + ++ ITK+D + + +N G V WG + Sbjct: 180 DIELNNRITKLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRIS 226 Query: 245 R 245 Sbjct: 227 A 227 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 407 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 466 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 467 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 497 >gi|116494017|ref|YP_805751.1| von Willebrand factor domain-containing protein [Lactobacillus casei ATCC 334] gi|116104167|gb|ABJ69309.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Lactobacillus casei ATCC 334] Length = 909 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 101/293 (34%), Gaps = 33/293 (11%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA L ++I + M T L + + A N Sbjct: 1 MTKMTAKVARTGHLFAVLLI----LMSMLTGLVTSGSSVVTATANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES + + Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESKNNGGTDRAG 163 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGV 255 + L ++ + + V GL+ FS+ G +++ + Sbjct: 164 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGK----AG 218 Query: 256 STNSTPGLKYAYNQIFDMQ-----GMRQHCNTEDANY---KKIIVFMTDGENL 300 +T+ + A + F+ G+RQ +A+ KK+++ +TDG Sbjct: 219 NTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|297473448|ref|XP_002686617.1| PREDICTED: collagen, type VI, alpha 3-like isoform 2 [Bos taurus] gi|296488812|gb|DAA30925.1| collagen, type VI, alpha 3-like isoform 2 [Bos taurus] Length = 2956 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 47/332 (14%), Positives = 111/332 (33%), Gaps = 47/332 (14%) Query: 53 MIDRSLVHAATQI---------------MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 +D S ++ + + EG + GG L + R+ Sbjct: 310 ALDGSALYTGSALDFVRNNLFTEAAGYRAAEGVPKLLVLVTGGKSLDAVSQPAQELKRSG 369 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + N + D ++ + + +A PL+ +P Sbjct: 370 ILAFAVGNKVAD-----QAELEEIAFDSSLVFTATEFRPAPLQ--GVLPGLLGPLR---T 419 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +T + +V D++ +LD S ++ T + ++ + + ++ Sbjct: 420 LTGTTEVRVNKR---DIIFLLDGSSNVGE------TNFPYVRDFVMNLVNSLDV---GSD 467 Query: 218 VVQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + S L ++ + G S +T A + + Sbjct: 468 HIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGA---ALSYVHANHF 524 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ +T G+ + L N + G + + +G E Sbjct: 525 TEAGGSRIQDHVPQLLLLLTAGQ-----SEDSYLQAANALARAGILTFCVGTSQADRAEL 579 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +P YL+++ S+ + + + T Sbjct: 580 EEIAFNPGLVYLMDDFSSLPALPQQLIQPLTT 611 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVK 163 +I R+ I P+ + +P + S + P + Sbjct: 1367 GDRNIDRTELQTITSDPR---LVFTVREFRDLP-SIEERMVNSFGSSGVTPAPPGVDTPS 1422 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + D++ +LD S+ DS + + + E V + + + +Q GL Sbjct: 1423 PSRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGL 1473 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHC 280 V +++ + F L+ + + I + G N+ GL++ F + + Sbjct: 1474 VQYNSDPTDEFFLKDFPTKQQIIDAINKVVYKGGRHANTKVGLEHLRRNHFVPEAGSR-- 1531 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D +I +T G K + + +RG V+A+G+R I S E + + Sbjct: 1532 --LDQRVPQIAFVITGG-----KSVEDAQEASMALTQRGVKVFAVGVRNIDSEEVGKIAS 1584 Query: 341 SPNSFYLVENPHSMYD 356 + + + V N + + Sbjct: 1585 NSATAFRVGNVQELSE 1600 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 2402 RRAAGSDVDIDMAFILDSSESTTPF------QFNEMRKYIEYLVRQLDVSPDPKASQHFA 2455 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSHLQRKI--KYLSKFGVSTNSTPGLKYA 266 + +V + ++G +++ + + Y Sbjct: 2456 RVAVVQHAPYEAVDNASVPPVKVEFSLTDYGSKEKLLAFLGSRMTQLQGTRALGRAIDYT 2515 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2516 IENIFES--------APNPRDLKIVVLMLTGEVQKQQLEEAQRAIL-QAKCKGYFFVILG 2566 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + + PN + V+ + + Sbjct: 2567 IGRKVNVKELYSSASEPNDVFFKLVDKSTELNE 2599 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 89/255 (34%), Gaps = 29/255 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + + I + P+ Y S + ++P K T + T + + ++ Sbjct: 1163 ARNVDQEELVKISLSPE---YVFSVNTFRELPSLEQKLLTPLTTLTAGQIQQLLASTRYP 1219 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1220 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGV 1270 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQH 279 + FSN + F L+ S I+ L G S N+ L++ A N G R Sbjct: 1271 LQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR-- 1328 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ED + +++F+ K Y K G +G R I E Sbjct: 1329 --IEDGVPQHLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTIT 1380 Query: 340 ASPNSFYLVENPHSM 354 + P + V + Sbjct: 1381 SDPRLVFTVREFRDL 1395 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 64/193 (33%), Gaps = 21/193 (10%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S ++ + + + +LE + + ++ G+V +S Sbjct: 30 AQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLERLSI---GTQQIRVGVVQYS 80 Query: 228 NKIEEFFLL-EWGVS-HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ F L + + +K L G N L + F R + + Sbjct: 81 DEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVENHFT----RAGGSRAE 136 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +++V ++ G + D K+ + +G + E + N Sbjct: 137 EGVPQVLVLISAGPSSDEIRDGVIA-----LKQASVFSFGLGAQAASKAELQHIATNDNL 191 Query: 345 FYLVENPHSMYDA 357 + V S+ D Sbjct: 192 VFTVPEFRSLGDV 204 >gi|297473446|ref|XP_002686616.1| PREDICTED: collagen, type VI, alpha 3-like isoform 1 [Bos taurus] gi|296488811|gb|DAA30924.1| collagen, type VI, alpha 3-like isoform 1 [Bos taurus] Length = 3162 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 47/332 (14%), Positives = 111/332 (33%), Gaps = 47/332 (14%) Query: 53 MIDRSLVHAATQI---------------MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 +D S ++ + + EG + GG L + R+ Sbjct: 516 ALDGSALYTGSALDFVRNNLFTEAAGYRAAEGVPKLLVLVTGGKSLDAVSQPAQELKRSG 575 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + N + D ++ + + +A PL+ +P Sbjct: 576 ILAFAVGNKVAD-----QAELEEIAFDSSLVFTATEFRPAPLQ--GVLPGLLGPLR---T 625 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +T + +V D++ +LD S ++ T + ++ + + ++ Sbjct: 626 LTGTTEVRVNKR---DIIFLLDGSSNVGE------TNFPYVRDFVMNLVNSLDV---GSD 673 Query: 218 VVQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + S L ++ + G S +T A + + Sbjct: 674 HIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGA---ALSYVHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ +T G+ + L N + G + + +G E Sbjct: 731 TEAGGSRIQDHVPQLLLLLTAGQ-----SEDSYLQAANALARAGILTFCVGTSQADRAEL 785 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +P YL+++ S+ + + + T Sbjct: 786 EEIAFNPGLVYLMDDFSSLPALPQQLIQPLTT 817 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVK 163 +I R+ I P+ + +P + S + P + Sbjct: 1573 GDRNIDRTELQTITSDPR---LVFTVREFRDLP-SIEERMVNSFGSSGVTPAPPGVDTPS 1628 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + D++ +LD S+ DS + + + E V + + + +Q GL Sbjct: 1629 PSRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGL 1679 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHC 280 V +++ + F L+ + + I + G N+ GL++ F + + Sbjct: 1680 VQYNSDPTDEFFLKDFPTKQQIIDAINKVVYKGGRHANTKVGLEHLRRNHFVPEAGSR-- 1737 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D +I +T G K + + +RG V+A+G+R I S E + + Sbjct: 1738 --LDQRVPQIAFVITGG-----KSVEDAQEASMALTQRGVKVFAVGVRNIDSEEVGKIAS 1790 Query: 341 SPNSFYLVENPHSMYD 356 + + + V N + + Sbjct: 1791 NSATAFRVGNVQELSE 1806 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 2608 RRAAGSDVDIDMAFILDSSESTTPF------QFNEMRKYIEYLVRQLDVSPDPKASQHFA 2661 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSHLQRKI--KYLSKFGVSTNSTPGLKYA 266 + +V + ++G +++ + + Y Sbjct: 2662 RVAVVQHAPYEAVDNASVPPVKVEFSLTDYGSKEKLLAFLGSRMTQLQGTRALGRAIDYT 2721 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2722 IENIFES--------APNPRDLKIVVLMLTGEVQKQQLEEAQRAIL-QAKCKGYFFVILG 2772 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + + PN + V+ + + Sbjct: 2773 IGRKVNVKELYSSASEPNDVFFKLVDKSTELNE 2805 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 81/256 (31%), Gaps = 31/256 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-- 172 ++ + + I++ + + ++V + SS+ + Sbjct: 173 AIGVEDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSSMAPERAGGTEIPK 232 Query: 173 --------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE + + ++ G+V Sbjct: 233 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLERLSI---GTQQIRVGVV 283 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN 281 +S++ F L + + +K L G N L + F R + Sbjct: 284 QYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVENHFT----RAGGS 339 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +++V ++ G + D K+ + +G + E + Sbjct: 340 RAEEGVPQVLVLISAGPSSDEIRDGVIA-----LKQASVFSFGLGAQAASKAELQHIATN 394 Query: 342 PNSFYLVENPHSMYDA 357 N + V S+ D Sbjct: 395 DNLVFTVPEFRSLGDV 410 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 89/255 (34%), Gaps = 29/255 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + + I + P+ Y S + ++P K T + T + + ++ Sbjct: 1369 ARNVDQEELVKISLSPE---YVFSVNTFRELPSLEQKLLTPLTTLTAGQIQQLLASTRYP 1425 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1426 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGV 1476 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQH 279 + FSN + F L+ S I+ L G S N+ L++ A N G R Sbjct: 1477 LQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR-- 1534 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ED + +++F+ K Y K G +G R I E Sbjct: 1535 --IEDGVPQHLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTIT 1586 Query: 340 ASPNSFYLVENPHSM 354 + P + V + Sbjct: 1587 SDPRLVFTVREFRDL 1601 >gi|240255542|ref|NP_476505.3| collagen alpha-3(VI) chain isoform 2 precursor [Homo sapiens] gi|193787261|dbj|BAG52467.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 221 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 268 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 269 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 328 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 329 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 378 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 379 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 411 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 622 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 671 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ I + + Sbjct: 672 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEG----VPQ 727 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 728 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 764 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 78/241 (32%), Gaps = 36/241 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + + +P+ ++ + E SH + L I+++ + +T+ N Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTE--NNYR 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + ++ ++ S R VN ++I+ T + Sbjct: 75 NDRASNNRNNYLVTSYAQSYLPSERFSQPR--VVNTYNEILGYTEYN------------- 119 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSM----- 184 A + L T S + + + + A +D++ V D S SM Sbjct: 120 ASLQMNYQLALLNSYLKQTPSPTWDVNENGAARKYLSSIAEPIDVVFVTDFSGSMNLPFG 179 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + ++ ITK+D + + +N G V WG + Sbjct: 180 DIELNNRITKLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRIS 226 Query: 245 R 245 Sbjct: 227 A 227 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 407 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 466 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 467 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 497 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 22/169 (13%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGT 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + N+ G D G+ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEAKIGIAGKNTALG---------DAIGLGLKRLRLRPATS 194 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 195 RVLVLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 241 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 52/327 (15%), Positives = 109/327 (33%), Gaps = 49/327 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + +++ + + G ++ L + + +E + + + Sbjct: 205 QNGGQMTILRPLPRARSSGRLHHMTSLLPDTDRSIFNDDEPLDSLSEANEGSQQGCLRTV 264 Query: 111 VRSTSLDIVVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 T + VP++ + + + K PL + T Sbjct: 265 EIKTYPEFTEVPESTSERNFTVLVHLKAPLAQTLQTSSKLEDGN----------SLGTTR 314 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS+ Sbjct: 315 APVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSS 362 Query: 230 IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L G + + L+ G TN GL+ I D Q C+ Sbjct: 363 ARRLFPLRRMTETGRQQSLQAVYSLTSNG-GTNIAEGLRKGSKVIEDRQAKNPVCS---- 417 Query: 286 NYKKIIVFMTDGENLSTKEDQQSL-----YYCNEA--KKRG---AIVYAIGIRVIRSHEF 335 I+ ++DG++ T + YC+ G V+ G Sbjct: 418 -----IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVS 472 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 473 LHSISQTSGGTFSFIETEAAIQDAFAQ 499 >gi|218192066|gb|EEC74493.1| hypothetical protein OsI_09963 [Oryza sativa Indica Group] Length = 641 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 76/208 (36%), Gaps = 30/208 (14%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + ND ++I + V S+ + + P + + I + Sbjct: 180 PEASIFNDDENIDPQSE--TVDDHNAVTKSVEIKTYSEFPAIQKSE-RRKVFAILIHLKA 236 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S+ S + A LD++ VLDVS SM K+ + ++++ +++ + Sbjct: 237 PKSLDSVS-SRAPLDLVTVLDVSGSMSGI------KLSLLKRAMSFVIQTLGPND----- 284 Query: 219 VQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +V FS+ + F L G + I L G TN LK + Sbjct: 285 -RLSVVAFSSTAQRLFPLRRMTLTGRQQALQAISSLVASG-GTNIADALK---------K 333 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLST 302 G + + N I+ ++DG++ + Sbjct: 334 GAKVVKDRRRKNPVSSIILLSDGQDTHS 361 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 67/212 (31%), Gaps = 44/212 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVV--QSGLVTFSNKI 230 MM+ +D+S SM I + + + +N + ++ D + V + GL+ F++ Sbjct: 90 MMLAVDLSGSM------KIDDMQLNGRQVNRLTMTKSVVYDFIQRRVGDRLGLILFADTA 143 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + + G T + A + + Sbjct: 144 YVQAPLTYDRDTVSTLLSEAVIGLVGEQTAIGDAIGLAVKRFDERDESNN---------- 193 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--------------SHE 334 +++ +TDG+N + + A +G VY IG+ + S E Sbjct: 194 -VLILLTDGQNTAGNITPEQAK--ELAINKGVKVYTIGVGADKMLIQSFFGSREINPSQE 250 Query: 335 FLRAC------ASPNSFYLVENPHSMYDAFSH 360 ++ ++ N + + Sbjct: 251 LDEGMLTDIATSTGGQYFRARNAQELEAIYQQ 282 >gi|308462096|ref|XP_003093334.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] gi|308250345|gb|EFO94297.1| hypothetical protein CRE_03436 [Caenorhabditis remanei] Length = 384 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 79/222 (35%), Gaps = 16/222 (7%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 PL + + + + + ++ LD++ V+D S+ M + + +I + Sbjct: 3 PLLILFLLIGVSTANYTALSYEERPCGTDISNLWLDVVAVVDNSKVMG---NGDLAQIAV 59 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 I I A IP+ + GLVT++ ++ G+ Q + + Sbjct: 60 LITEIFAESRIGTSIPNQPKTTRLGLVTYN----WNATIQAGLDKFQSQQDVFENIFNAL 115 Query: 258 NSTPGLKYAY--NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 NS +Y N + + + NY+K+IV S + + Sbjct: 116 NSVSSTSESYLANGLVAAENVLARGPNRGNNYQKVIVLFA----ASYSSHSNPIAIADRL 171 Query: 316 KKRGAIVYAIGIRV---IRSHEFLRACASPNSFYLVENPHSM 354 K+ G + +G ++ L ASPN + ++ + Sbjct: 172 KQAGITIITMGYNNVGDPNFYQNLAKIASPNKSFTEKSLSQI 213 >gi|153874442|ref|ZP_02002664.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152069095|gb|EDN67337.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 478 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 69/199 (34%), Gaps = 21/199 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+++++D S SM + TK+ + ++ + + N + +V F + Sbjct: 41 PPHDVILLIDTSGSM-----AEGTKLQEVQAAAIQFIQRRHGLTHLANN-KIAVVGFGGR 94 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + +L++ I+ L G + GL+ A NQ+ + ++ Sbjct: 95 AYLVANLTSDLMNLEQPIQKLRAVGGTPMD-RGLQSAMNQLSA----------GSDSEQR 143 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+ TDG+ + + +L K + AI L + Sbjct: 144 SILLFTDGKPDNQRT---TLNASQLVKNANIQIVAIATDDADIG-LLTQVTGDAALVFPT 199 Query: 350 NPHSMYDAFSHIGKDIVTK 368 + + AF + I + Sbjct: 200 SVGNFDQAFQKAEQAIYEQ 218 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 73/204 (35%), Gaps = 28/204 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +S +++ ++LD S SM ++ ++ N +++ + + Sbjct: 31 ATGEDSDRTLPINLGLILDRSGSMR------AQAMETVKEAANYLVDGLGPDD------R 78 Query: 221 SGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++TF++ E + + ++ KI L+ G T G+K + + R Sbjct: 79 LSVITFNHHAEVILPNQSVEDLQGVKNKINRLTASG-GTCIDEGMKLGIKEAALGKENRV 137 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 I +TDGEN ++++ L A + + +G + + L Sbjct: 138 SQ----------IFLLTDGENEH-GDNERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQ 186 Query: 339 CASP--NSFYLVENPHSMYDAFSH 360 A +E+P FS Sbjct: 187 IADSAGGMLCYIEHPEQALTEFSR 210 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/331 (12%), Positives = 107/331 (32%), Gaps = 73/331 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G + I+ A+ L + + +G + + ++ + S +D +L+ A +I Sbjct: 24 LRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKEI-- 81 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 D +K + F ++ ++ + DI+ + + Sbjct: 82 -----------DTDDAVALKQKVSDWFHAQVENSYTLGDINIDTSNHKITATASGTVP-- 128 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 T + N + + + S+VK + L++ IV+D S SM Sbjct: 129 ---------------TTLMRIANIDTVDVSVASAVK--GPATSYLNVYIVIDTSPSMLLA 171 Query: 185 ----------------------------ESFFDSSIT---------KIDMAIKSINAMLE 207 ++++ + D+A ++ +L Sbjct: 172 ATTAGQATMYSGIGCQFACHTGDAHKIGNKTYNNNYEYSTAKNIKLRADVAGDAVKDVLA 231 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ GL + + + E + ++ S S S + Sbjct: 232 LIDTSDSNHQRIKVGLYSLGDTLTEVLAPTLSTDTARNRLTDASYGLTSATSKAATYFDV 291 Query: 268 NQIFDMQGMRQHCNTEDANYK-KIIVFMTDG 297 + Q + + +N K+++ +TDG Sbjct: 292 SLTTLKQKVGTGGDGTASNSPLKLVLLLTDG 322 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 87/251 (34%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 IP R IV+ + S+ ++ +VLD S S+ + + T+ A + Sbjct: 250 GHIPGEQQKRKIVLDPSGSM----------NIYLVLDGSDSIGA---GNFTR---AKNCL 293 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSN--------KIEEFFLLEWGVSHLQRKIKYLSKFG 254 +E+V GLVT++ + + +W L + Sbjct: 294 RDFIEKVASYGVKPKY---GLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLK 350 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-- 312 TN+ L+ Y+ + + G + + +I+ MTDG + ++ Sbjct: 351 SGTNTKKALEAVYS-MMNWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRN 407 Query: 313 --------NEAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + E + A AS + V++ ++ D F Sbjct: 408 FLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQ 467 Query: 361 IGKDIVTKRIW 371 + + T + Sbjct: 468 MLDETRTLGLC 478 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 87/251 (34%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 IP R IV+ + S+ ++ +VLD S S+ + + T+ A + Sbjct: 595 GHIPGEQQKRKIVLDPSGSM----------NIYLVLDGSDSIGA---GNFTR---AKNCL 638 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSN--------KIEEFFLLEWGVSHLQRKIKYLSKFG 254 +E+V GLVT++ + + +W L + Sbjct: 639 RDFIEKVASYGVKPKY---GLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLK 695 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-- 312 TN+ L+ Y+ + + G + + +I+ MTDG + ++ Sbjct: 696 SGTNTKKALEAVYS-MMNWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRN 752 Query: 313 --------NEAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + E + A AS + V++ ++ D F Sbjct: 753 FLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQ 812 Query: 361 IGKDIVTKRIW 371 + + T + Sbjct: 813 MLDETRTLGLC 823 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/251 (16%), Positives = 87/251 (34%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 IP R IV+ + S+ ++ +VLD S S+ + + T+ A + Sbjct: 250 GHIPGEQQKRKIVLDPSGSM----------NIYLVLDGSDSIGA---GNFTR---AKNCL 293 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSN--------KIEEFFLLEWGVSHLQRKIKYLSKFG 254 +E+V GLVT++ + + +W L + Sbjct: 294 RDFIEKVASYGVKPKY---GLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLK 350 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-- 312 TN+ L+ Y+ + + G + + +I+ MTDG + ++ Sbjct: 351 SGTNTKKALEAVYS-MMNWPGETPPADWNRTRH--VIILMTDGLYNMGGDPVSVIHNIRN 407 Query: 313 --------NEAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + E + A AS + V++ ++ D F Sbjct: 408 FLDIGRDHKNPREDYLDVYVFGVGPLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQ 467 Query: 361 IGKDIVTKRIW 371 + + T + Sbjct: 468 MLDETRTLGLC 478 >gi|308466921|ref|XP_003095711.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] gi|308244476|gb|EFO88428.1| hypothetical protein CRE_10578 [Caenorhabditis remanei] Length = 637 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/266 (14%), Positives = 92/266 (34%), Gaps = 35/266 (13%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQN---EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + ID R +N S+ K P+ + + ++ + Sbjct: 387 APASTIDFPPRKHEGGEEFNGRNFKVRSRSVHFAMTEKPPVTTTMNVMKFFSTSRTPLTT 446 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + T D+ ++D+S + D S +D+A +I+++ P Sbjct: 447 AKPLIPFACTA---DVFFLVDLS---QGTGDKSQQYLDIAASAISSL-------PISQEA 493 Query: 219 VQSGLVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V+ GL+++S + + L ++ + + G +T + ++YA + Sbjct: 494 VRVGLISYSGPGRTHVRVYLDKHNEKEKLIEEMFLMERHGGTTRTADAIRYA----TKIF 549 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVI 330 H N KK++V TDG + + D + A+ +G + A+ + Sbjct: 550 EGMAHPA--RRNVKKVLVVFTDGYSQDSPRDAARV-----ARAKGLQLIAVAVKDRLAPP 602 Query: 331 RSHEFLRACASPNSFYLVENPHSMYD 356 + + ++ + + + Sbjct: 603 DEEQLAEIGGHARNVFVSPSGRELRE 628 >gi|116003875|ref|NP_001070294.1| anthrax toxin receptor 2 [Bos taurus] gi|115305014|gb|AAI23758.1| Anthrax toxin receptor 2 [Bos taurus] gi|296486409|gb|DAA28522.1| anthrax toxin receptor 2 [Bos taurus] Length = 488 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 72/226 (31%), Gaps = 34/226 (15%) Query: 154 IVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + D+ VLD S S+ + + I ++++ Sbjct: 22 LALSGPGGLLSAQEQPSCHGAFDLYFVLDKSGSVANNW-----------IEIYNFVQQLT 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 V+ ++ + FS++ L + + L T GLK A Sbjct: 71 ERF-VSPQMRLSFIVFSSQATIILPLTGDRGKISEGLDNLKHVSPVGETYIHEGLKLANE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 QI D II+ +TDG + L ++ ++ GA VY +G Sbjct: 130 QI---------EKARDLKTSSIIIALTDGKLDGLVPSYAEKEAKI---SRSLGARVYCVG 177 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + R S + V+ + A I I+ + Sbjct: 178 VLDFEQAQLERIADSKEQVFPVKGG---FQALRGIINSILAQSCTE 220 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 47/324 (14%), Positives = 109/324 (33%), Gaps = 36/324 (11%) Query: 49 VLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID 108 S+ + ATQ+ + + K + + + + + LR+ D+ Sbjct: 79 QKQSLAQNPIKAKATQLKSAVSKQLTSFQKSDISIQKEEKIQKLDTKTMLREQEIKPDLQ 138 Query: 109 DIVRSTSL-------DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 +++ + + + + + ++ I+ Sbjct: 139 QMIKDAKKPSYDLEKGLTFEIKTLNKHFQFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQ 198 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 VK Q+ +D++ V+D S SM+ KI ++ +L+ + + Sbjct: 199 VKQAEQSRPSIDLVCVIDNSGSMQG------EKIQNVKTTLLQLLDMLNSND------RL 246 Query: 222 GLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L+ F++ L ++Q+ I ++ T+ G+ A+N + R Sbjct: 247 SLILFNSYPTLLCNLRKVDDKNTPNIQKIINSITAEEY-TDINSGMLMAFNIL----QKR 301 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRVIRSHEFL 336 Q N + I ++DG++ E + N+ K +++ G + Sbjct: 302 QFFNPVSS-----IFLLSDGQDNGADEKIKKYINSNQSLKNECFSIHSFGFGSDHDGPLM 356 Query: 337 -RACA-SPNSFYLVENPHSMYDAF 358 R C +FY VE + + + F Sbjct: 357 NRICQLKDGNFYYVEKINQVDEFF 380 >gi|297190882|ref|ZP_06908280.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197722677|gb|EDY66585.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 518 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 33/244 (13%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 + R + ++ + + +V + + + + D ++ VLD S SM Sbjct: 289 VPRIDRLSEPVGNALYFPDDQKVVDRLLADY-GDPERDRAAQVIFVLDFSSSMRG----- 342 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQ------SGLVTFSNKIEEFFLLEW-GVSHLQ 244 +I ++I+ + V+ ++ F ++ E + + G L Sbjct: 343 -ERITALRETIDGLAGGDDS--PSGKFVRFYRGETLTVMRFGGRVLEERNITYDGPRDLD 399 Query: 245 R--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 R + F ST L +AY + R + + IV MTDGEN + Sbjct: 400 RLRGVVASDDFAGSTAIWSSLDHAYRAV-----ARDLVDRPERRVS--IVLMTDGENNAG 452 Query: 303 KEDQQSL----YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVENPHSMYDA 357 + + +A++ Y I + E R A+ + S+ A Sbjct: 453 MDVDAFVRAHARLPEDARR--VRTYTIRYGEADTRELDRGARATGGRMVDATD-RSLLSA 509 Query: 358 FSHI 361 F I Sbjct: 510 FKEI 513 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 65/170 (38%), Gaps = 26/170 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + ++ MI LD+S SM D ++ A + + + + ++ N+ V GLV Sbjct: 85 PKKEKGIEAMICLDISNSMLCE-DVKPNRLSFAKQVLGKLFDGLQ-----NDKV--GLVV 136 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F+ + +S ++ + +S T ++ A D + + Sbjct: 137 FAGNAYTQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASKSFSDNKEIG----- 191 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ +TDGEN + ++ +A + G V IG+ Sbjct: 192 ------KTIIVLTDGENH----EGNAIEAAQQAHEAGIRVNVIGLGTALG 231 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 73/204 (35%), Gaps = 28/204 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +S +++ ++LD S SM ++ ++ N +++ + + Sbjct: 31 ATGEDSDRTLPINLGLILDRSGSMR------AQAMETVKEAANYLVDGLGPDD------R 78 Query: 221 SGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++TF++ E + + ++ KI L+ G T G+K + + R Sbjct: 79 LSVITFNHHAEVILPNQSVEDLQGVKNKINRLTASG-GTCIDEGMKLGIKEAALGKENRV 137 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 I +TDGEN ++++ L A + + +G + + L Sbjct: 138 SQ----------IFLLTDGENEH-GDNERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQ 186 Query: 339 CASP--NSFYLVENPHSMYDAFSH 360 A +E+P FS Sbjct: 187 IADSAGGMLCYIEHPEQALTEFSR 210 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 68/203 (33%), Gaps = 35/203 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ +D S SM + D +I A + + + G+V F++ Sbjct: 147 VDIILAIDTSGSM-AAQDFDPNRITAAKVAAANFIANRLSD-------RIGIVVFASDAM 198 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L + + V T+ + G A + + + K+ Sbjct: 199 LQSPLTLDYESLLDFLADVRIGMVRTDGTAIGDAIAVSSVHLER---------SPARSKV 249 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLR---------A 338 I+ +TDGE+ S + + AK G VY I S + L A Sbjct: 250 IILLTDGESNS-----GVISPLDAAKTAALYGIKVYTIATISKNSRDSLDFKPDDLEQIA 304 Query: 339 CASPNSFYLVENPHSMYDAFSHI 361 + +Y N + ++ I Sbjct: 305 KLTGGKYYRAYNEAELTKIYAEI 327 >gi|301614659|ref|XP_002936803.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 720 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 82/212 (38%), Gaps = 29/212 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---Q 220 D L+M +LD S S + + + K + M++++K + + + Sbjct: 48 SEQDEDCILEMAFLLDSSESAKDYNHNREKK------FVLQMVDKLKEVKPNSGRSFSWR 101 Query: 221 SGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L+ +S+ IE+ F G + + +I ++ G T ++ + QI+ +G + Sbjct: 102 MALLQYSSTVVIEQTFRDWKGPENFKSRIAPIAYIGHGTYTSYAI-TNLTQIYMNEGTHK 160 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHE 334 K+ + +TDG + D + + AK ++ +G+ + ++ Sbjct: 161 SV--------KVAILITDGVDHPRNPDIFAAT--SNAKHHDIKLFTVGMTSVAKETANNA 210 Query: 335 FLRACAS---PNSFYLVENPHSMYDAFSHIGK 363 LR AS + + +P + + + Sbjct: 211 KLRLLASVPATRFVFHMLDPDVVDKILKEVKE 242 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 68/175 (38%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM TK+ ++ +L +++ N +V FSN+I+ Sbjct: 296 VVFVLDSSASM------VGTKLRQTKDALFTILHDLRPQDHFN------IVGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + I ++S G T+ L+ + D H + ED + Sbjct: 344 KDHLVSVTPNSIRDGKVYIHHMSPSG-GTDINGALQRGIQLLND---YVAHNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 ++VF+TDG+ + + N ++ ++ +GI + L + Sbjct: 400 -LVVFLTDGKPT--VGETHTFKILNNTREATRGRVCIFTVGIGADVDFKLLEKLS 451 >gi|31615654|pdb|1NA5|A Chain A, Integrin Alpha M I Domain Length = 197 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 7 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 63 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 64 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 111 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 112 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 169 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 170 FQVNN----FEALKTIQNQLREK 188 >gi|157831557|pdb|1JLM|A Chain A, I-Domain From Integrin Cr3, Mn2+ Bound Length = 192 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 8 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 64 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 65 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 112 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 113 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 170 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 171 FQVNN----FEALKTIQNQLREK 189 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 2 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 61 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 62 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 120 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 121 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 178 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 179 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 219 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 763 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 822 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 823 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 881 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 882 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 939 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 940 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 980 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + L SH+ K G TN+ L Y + + + Sbjct: 146 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 205 Query: 279 HCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDG++ + + + + N+ + +YAIG+ + Sbjct: 206 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 265 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++++ +++ F H Sbjct: 266 DWRELNELGSKKDGERHAFILQDTKALHQVFEH 298 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 246 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 305 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 306 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 364 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 365 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 422 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 423 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 463 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + ++ ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKI 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 321 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 379 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 380 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 438 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 478 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 70/198 (35%), Gaps = 33/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+DVS SME + ++ LE V+ + + G+V FS+ Sbjct: 294 LVLVMDVSGSMEG---------EKLSLAVAGALELVRSAAEED---YLGVVLFSSTHRVL 341 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F +++ + L + G T + A + + R K Sbjct: 342 FPPRPMTEQGKKEAESLLLSVRAGGGTVLGSAFREAVRLLQGVPVER-----------KG 390 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLV 348 I+ ++DG ++ +L A+ G V A+ + FL+ A +Y Sbjct: 391 ILVLSDGLISDPQDPILAL-----AEASGLEVSAMALGADADRAFLKVLAERGGGRYYQA 445 Query: 349 ENPHSMYDAFSHIGKDIV 366 + F G+++ Sbjct: 446 ATAQELPRLFLKEGQEVF 463 >gi|296194626|ref|XP_002745014.1| PREDICTED: integrin alpha-2 [Callithrix jacchus] Length = 1208 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 86/246 (34%), Gaps = 46/246 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKI 195 +++ + S + + Q + D+++V D S S+ + Sbjct: 164 QQCGNQYYTTGVCSDISSDFQLSSSFSPAVQTCPSLIDVVVVCDESNSIYPW-------- 215 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSH-LQRKIKYLSKF 253 D + ++ + + P Q L+ ++N F L + + S++ Sbjct: 216 DAVKNFLEKFVQGLDIGPTK---TQVALIQYANNPRVVFNLNTYRTKEEMTLATSQTSQY 272 Query: 254 GVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN+ ++YA + + + K++V +TDGE+ + + C Sbjct: 273 GGDLTNTFGAIQYARKYAYSA------ASGGRPSATKVMVVVTDGESHDGSMLKAVIEQC 326 Query: 313 NEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSH 360 N + + I + + + ++A AS F+ V + ++ + Sbjct: 327 NH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 383 Query: 361 IGKDIV 366 +G+ I Sbjct: 384 LGEQIF 389 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 99/269 (36%), Gaps = 24/269 (8%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQ 167 + + +L + + Y S+ + P T + + +++ + + Sbjct: 171 LPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNYDMMLLRIGLKGYEQSKA 230 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ +LDVS SM +S K+ + ++ + +++ V+ VV +G Sbjct: 231 ELGASNLVFLLDVSGSM-----ASPDKLPLLQTALKMLTQQLGAQDKVSIVVYAGAAG-- 283 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L ++ LS G STN G++ AY ++H N Sbjct: 284 --VVLDGAAGNDSQTLNYALEQLSA-GGSTNGAQGIQLAYQL------AKKHLVEGGINR 334 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSF 345 ++F TDG+ N+ T + + + K+ G + +G + ++ + A + Sbjct: 335 ---VIFATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADKGNG 391 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S+ +A + + + + K Sbjct: 392 QYAY-IDSLNEARKVLVEQLSATLLTIAK 419 >gi|327542236|gb|EGF28725.1| BatB protein [Rhodopirellula baltica WH47] Length = 700 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 65/163 (39%), Gaps = 25/163 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 ++ + VLDVSRSM + D S ++ A + I M++E+ + GLV F+ Sbjct: 77 QRGIEAVFVLDVSRSMLAE-DVSPNRLGRAKQQIKDMVDEMPGD-------RVGLVVFAG 128 Query: 229 KIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L V ++ + + S + ++ A + D Sbjct: 129 ETRQTLPLTRHVEDFKQTLDSVGIHSVRRGGSRLGDAIRVASDAFLDKTTDH-------- 180 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K +V +TDGE+ + ++ ++G ++ IG+ Sbjct: 181 ---KAMVILTDGEDQESDPVSEAKRAHE---EQGIRIFTIGLG 217 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 88/246 (35%), Gaps = 46/246 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKI 195 +++ + S + + Q A D+++V D S S+ + Sbjct: 137 QQCGNQYYTTGVCSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW-------- 188 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKF 253 D + ++ + + P Q GL+ ++N F L + + + S++ Sbjct: 189 DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQY 245 Query: 254 GVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN+ ++YA + + + K++V +TDGE+ + + C Sbjct: 246 GGDLTNTFGAIQYARKYAYSA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQC 299 Query: 313 NEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSH 360 N + + I + + + ++A AS F+ V + ++ + Sbjct: 300 NH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 361 IGKDIV 366 +G+ I Sbjct: 357 LGEQIF 362 >gi|52545928|emb|CAH56139.1| hypothetical protein [Homo sapiens] Length = 1222 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 42.8 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGLPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ I + + Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEG----VPQ 928 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 929 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +I+ A + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLHPNRIEAAKAVAAEFIAG---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + +++ I T GL A +++ D + Sbjct: 140 TQCPMTTDHASLLNLLQNVRTDIAARGLIQDGTAVGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR------- 337 K+++ +TDG N D + AK G VY IG+ + + Sbjct: 193 ----KVVILLTDGSNNM--GDLSPMTSAQIAKSLGIRVYTIGVGTNKVARYPMPVTGGIQ 246 Query: 338 ----------------ACASPNSFYLVENPHSMYDAFSHIGK 363 A + +FY N + ++ I K Sbjct: 247 YVNIPVEIDTKTLSDIAATTDGNFYRATNNRELKQIYNDIDK 288 >gi|194389238|dbj|BAG65607.1| unnamed protein product [Homo sapiens] Length = 1237 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 76/222 (34%), Gaps = 29/222 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 F R + + +++V ++ G + K+ + +G++ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVA-----LKQASVFSFGLGVQAA 178 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 E L+ A+ ++ F G D+ K + Y Sbjct: 179 SRAE-LQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ I + + Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEG----VPQ 928 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 929 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|148666786|gb|EDK99202.1| anthrax toxin receptor 1 [Mus musculus] Length = 533 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 77/224 (34%), Gaps = 32/224 (14%) Query: 156 MPITSSVKVNSQTDAR-----LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + D+ +L +RS+ ++ I +E++ Sbjct: 20 VLVCAGHGGRREDGGPACYGGFDLYFIL--TRSVLHHWN-----------EIYYFVEQLA 66 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 ++ + FS + L +++ ++ L K G T G + A Sbjct: 67 HRFISPQ-LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASE 125 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 QI+ + +I+ +TDGE L S N ++ GAIVY +G++ Sbjct: 126 QIYYENSQGYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVK 177 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + R S + + V + + A I I+ K Sbjct: 178 DFNETQLARIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 218 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 69/194 (35%), Gaps = 28/194 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L+ +VLD S SM+ K+ ++ ++E + V +V F + + Sbjct: 41 PLNFCLVLDRSGSMQG------AKLAALKDAVKRVIETLTPQDIVA------IVLFDDTV 88 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L ++ + + G + S + +R++ + Sbjct: 89 QTLVPATFATDKATLIAQVDAIEEAGGTAMSGG-------MAAGIVELRKNHDPGRVGA- 140 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACASPNSFY 346 ++ +TDG+ ++ + E + G + A+G+ + L A A+ Sbjct: 141 --MLLLTDGQTWG--DEDRCRALAQELARDGVRITALGLGAEWNEALLDDIAEATGGISD 196 Query: 347 LVENPHSMYDAFSH 360 + +P + F H Sbjct: 197 YIADPAQITTFFQH 210 >gi|240255540|ref|NP_476506.3| collagen alpha-3(VI) chain isoform 3 precursor [Homo sapiens] Length = 1237 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 86/213 (40%), Gaps = 26/213 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V+S D++ +LD S ++ T + ++ + + N+ Sbjct: 422 SGTPEVHSNKR---DIIFLLDGSANVGK------TNFPYVRDFVMNLVNSLDI---GNDN 469 Query: 219 VQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLS-KFGVSTNSTPGLKYAY-NQIFDMQ 274 ++ GLV FS+ + EF L + S + ++ L + G N+ L Y Y N + Sbjct: 470 IRVGLVQFSDTPVTEFSLNTYQTKSDILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAG 529 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + ++++ +T G+ + L N + G + + +G E Sbjct: 530 GSRI-----REHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKAE 579 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +P+ YL+++ S+ + + + T Sbjct: 580 LEQIAFNPSLVYLMDDFSSLPALPQQLIQPLTT 612 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 76/223 (34%), Gaps = 31/223 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + T+ + D++ ++D S ++ + + + +LE++ P Sbjct: 17 LSGFPTTHAQQQQAAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---P 67 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQI 270 ++ G+V FS++ F L+ + Q +K L G N L + Sbjct: 68 IGTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENH 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV- 329 F R + + +++V ++ G + + A V++ G+ Sbjct: 128 FT----RAGGSRVEEGVPQVLVLISAGPSSDEIRYGVVAL-------KQASVFSFGLGAQ 176 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S L+ A+ ++ F G D+ K + Y Sbjct: 177 AASRAELQHIATDDNLVFT------VPEFRSFG-DLQEKLLPY 212 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 823 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 872 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ I + + Sbjct: 873 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEG----VPQ 928 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 929 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 965 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 72/217 (33%), Gaps = 50/217 (23%) Query: 173 DMMIVLDVSRSM---ESFFDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+M+ +D+S SM + D + + + I+ +E K + GL+ F++ Sbjct: 85 DLMLSVDLSGSMQIEDMVLDGKVVDRFSLIQHVISDFIERRKGD-------RIGLILFAD 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + +K G T + A + ++ Sbjct: 138 HAYLQSPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVKRFDKVEQSN--------- 188 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEF------- 335 ++++ +TDG N + +Q + A KRG +Y IG+ + Sbjct: 189 --RVLILLTDGSNNAGAISPEQAT----QIAAKRGITIYTIGVGADVMERRTLFGKERVN 242 Query: 336 ---------LRACA--SPNSFYLVENPHSMYDAFSHI 361 L+ A + ++ N + + I Sbjct: 243 PSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQVI 279 >gi|125559999|gb|EAZ05447.1| hypothetical protein OsI_27661 [Oryza sativa Indica Group] Length = 704 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 87/225 (38%), Gaps = 39/225 (17%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 H P + + + A +D++ VLDVS SME + K+ + +++ +++++ Sbjct: 212 HAKAPSIAVAEATAAARAPVDLVTVLDVSGSMEGY------KLTLLKRAMGFVIDKLGPG 265 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +V+FS + L G + + ++ L G TN GL A Sbjct: 266 D------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAVESL-AAGGGTNILKGLVEA-A 317 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--------KEDQQSLYYCNEAKKRG- 319 ++FD + R + ++ ++DG++ S+ K+ G Sbjct: 318 KVFDGRRYRNAVAS--------VILLSDGQDTYNVNGGWGASNSKNYSVLVPPSFKRSGD 369 Query: 320 --AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 V+ G + A A + +F +EN + DAF+ Sbjct: 370 RRLSVHTFGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQ 414 >gi|327543487|gb|EGF29906.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 464 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 87/231 (37%), Gaps = 23/231 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + L + H+ + +T +++ +++ +VLD S SM S Sbjct: 26 VKLDVRLVHPVMKAGEKQTNHLRIALTGFELKSTEERPPVNVCLVLDHSGSM------SG 79 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 K+ A ++ A ++ + +++V L + + S +++KI+ + Sbjct: 80 QKLARAKEAAEAAIDRL----SDDDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQA 135 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY 311 S+ L ++ +R+ E N ++ ++DG N+ K Q+ Sbjct: 136 G-----SSTALFAGVSK--GAAEVRKFLADEQVNR---VILLSDGLANVGPKSPQELEGL 185 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 K V +G+ + + + A AS + +E+ S+ F+ Sbjct: 186 GRSLMKEAISVSTLGLGSGYNEDLMVALASVGGGNHAFIEDADSLVAVFNQ 236 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 72/188 (38%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 203 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 256 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 257 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 311 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + AN I+ TDG+ NL + + + +++G Sbjct: 312 GLRLAYEQ------AEKGFIKGGANR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 362 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 363 TLSTLGVG 370 >gi|2645090|dbj|BAA23627.1| collagen-like protein [Hydra vulgaris] Length = 354 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 65/194 (33%), Gaps = 20/194 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M+ +LD S S+ D + K+ +KSI N + + G++ F + E Sbjct: 1 MLFLLDSSGSVG---DENFDKMKEFVKSIVL------NFDVDNQLTRIGIIRFDSDAEII 51 Query: 234 FLLEWGVS--HLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L I + G+ T L+ A + G A+ K Sbjct: 52 IQLSDHKTLKDLLNDIDSIRYNEGIQTRIDKALERAMEAFSEKNG-------GRADATKA 104 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +V + DG+N + Q Y IGI + L A+ N ++ Sbjct: 105 LVLLADGQNSFIEGSQDLNEELKPLIDAKVFRYVIGIGRELDLKELEDIATNNIAIYADS 164 Query: 351 PHSM-YDAFSHIGK 363 + + I + Sbjct: 165 FDELKFKVDEQISR 178 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 45/106 (42%), Gaps = 11/106 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ D D+ LD S S F++ + IKS+ + I ++ V+ G Sbjct: 258 EIGEINDIPTDIAFALDASAS---FYEEGFQQEKDFIKSV------IDKIELSSSGVRVG 308 Query: 223 LVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYA 266 ++T+S++ + ++ +++ I + + T GL+ A Sbjct: 309 VLTYSDEAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLEAA 354 >gi|303324778|pdb|2XGG|A Chain A, Structure Of Toxoplasma Gondii Micronemal Protein 2 A_i Domain gi|303324779|pdb|2XGG|B Chain B, Structure Of Toxoplasma Gondii Micronemal Protein 2 A_i Domain Length = 178 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 68/170 (40%), Gaps = 22/170 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + ++ +LD+ ++D S S+ I + + ++ L + + P+ V + Sbjct: 9 RGSAMGTNQLDICFLIDSSGSIG------IQNFRLVKQFLHTFLMVLPIGPEE---VNNA 59 Query: 223 LVTFSNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +VT+S + + + + STN++ GLK +F Sbjct: 60 VVTYSTDVHLQWDLQSPNAVDKQLAAHAVLDMPYKKGSTNTSDGLKACKQILFTGS---- 115 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + K+++ MTDGE+ S D +++ E ++ G IV + + Sbjct: 116 --RPGREHVPKLVIGMTDGESDS---DFRTVRAAKEIRELGGIVTVLAVG 160 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + D A + ++E+ L+ G+VTF + E Sbjct: 310 VCLVLDKSGSMSAE-DRLFQMNQAAELYLIQVIEKGSLV---------GMVTFDSVAEIQ 359 Query: 234 FLLEWGV--SHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + Q+ L + T+ GLK + I Sbjct: 360 NHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C + K+ GAI++ I + + E L F+ Sbjct: 411 IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRFFA 463 Query: 348 VENPHSMYDAF 358 ++ + +AF Sbjct: 464 NKDITGLTNAF 474 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 63/442 (14%), Positives = 130/442 (29%), Gaps = 106/442 (23%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M L + + + +G + I+TA+ + +LGM I+ K+ + S D + + Sbjct: 1 MLVLFIKFKALCGDRRGNIAIITALVSLTLIFILGMGIDYGLAIDRKSQMESYADAAALA 60 Query: 61 AAT-QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 A T ++ G + + + T + ++ I Sbjct: 61 AVTPAMVAAGQSSAITTAQNVFNAQALTMT----------------GVTYNANDVTVSIA 104 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 +A +Y+ + S I T++ + +D ++LD Sbjct: 105 TSGDKR----TATVQYQAQSQAMLPDVMGFGSIKIGGQATATTTIAPN----IDFYLLLD 156 Query: 180 VSRSME--------SFFDSSIT-------------------------------------- 193 S SM + ++ T Sbjct: 157 DSPSMAIAATQSGINTMVANTTAQGGCAFGCHEENPSADKLGNPYGEDNYALARSLGVTL 216 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKI----- 247 +IDM ++ ++ + + + TF + L +S Q + Sbjct: 217 RIDMLRQATQDLMTTAQTTETQKGTTYRMAIYTFDIGLNTIGNLTSDLSQAQTEAGNIQL 276 Query: 248 ------KYLSKFGVSTNSTPGLKYAYNQIFD-MQGMRQHCNTEDANYKKIIVFMTDG-EN 299 +L++ + + A N I M ++++ F+TDG E+ Sbjct: 277 LEVYSNNWLTQNDYNDDEDTNYDTALNGINAIMPNPGNGTGAAGDTPQEVLFFVTDGVED 336 Query: 300 LSTKEDQQ----SLYYCNEAKKRGAIV---YA--------------IGIRVIRSHEFLRA 338 ++Q + C K RG + Y I L+ Sbjct: 337 EDVNGNRQQSLLNTDLCTAIKNRGIRIAVLYTEYLPLPTNSWYNTYIAPFQNSIAPTLQQ 396 Query: 339 CASPNSFYLVENPHSMYDAFSH 360 CASP ++ V++ + A S Sbjct: 397 CASPGLYFEVKSGGDISAAMSA 418 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 82/202 (40%), Gaps = 19/202 (9%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S +K LKFC FI T + K+ + +D++ +DVS+SM + D Sbjct: 49 TKSTFKSILKFCVFILGLTLLIVGLANPKIGTKLETVKREGVDIVFAVDVSKSMLAE-DI 107 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + +++ A + ++ ++ ++ + G++ ++ + + + ++ Sbjct: 108 APSRLAKAKRIVSEIIAQLGSD-------RIGIIAYAGQAYPQLPITTDYGAAKMFLQ-- 158 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G++TN A NQ D+ +D +++ ++DGE + + Sbjct: 159 ---GLNTNMLSSQGTAINQALDLASTYYD---DDEQTNRVLFIISDGE---DHSEGSTEG 209 Query: 311 YCNEAKKRGAIVYAIGIRVIRS 332 +A +G ++ IG+ + Sbjct: 210 AVEKAVDQGIKIFTIGVGTEKG 231 >gi|32475885|ref|NP_868879.1| hypothetical protein RB9427 [Rhodopirellula baltica SH 1] gi|32446428|emb|CAD76256.1| hypothetical protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 484 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 87/231 (37%), Gaps = 23/231 (9%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + L + H+ + +T +++ +++ +VLD S SM S Sbjct: 46 VKLDVRLVHPVMKAGEKQTNHLRIALTGFELKSAEERPPVNVCLVLDHSGSM------SG 99 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 K+ A ++ A ++ + +++V L + + S +++KI+ + Sbjct: 100 QKLARAKEAAEAAIDRL----SDDDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQA 155 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY 311 S+ L ++ +R+ E N ++ ++DG N+ K Q+ Sbjct: 156 G-----SSTALFAGVSK--GAAEVRKFLADEQVNR---VILLSDGLANVGPKSPQELEGL 205 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 K V +G+ + + + A AS + +E+ S+ F+ Sbjct: 206 GRSLMKEAISVSTLGLGSGYNEDLMVALASVGGGNHAFIEDADSLVSVFNQ 256 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 74/204 (36%), Gaps = 30/204 (14%) Query: 173 DMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++ +D+S+SM E F + D + + + D + GL+ F Sbjct: 93 DLLLAIDISQSMETEDFVAPDGRREDRLSGVKAVVADFIDRRQDD----RLGLIVFGTAA 148 Query: 231 EEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L+ + +S G +T + A Q + Sbjct: 149 YPQAPLTQDHATLKLLLGQVSTRMAGPNTAIGDAIGVAIKQFEHAGE-----------HD 197 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFLRACA- 340 ++++ +TDG T + A R +V+ +GI ++ + LRA A Sbjct: 198 QVLILLTDG--NDTGSAVPPDRAASMAAARHIVVHTVGIGDPQAEGEEKVDFDALRAIAA 255 Query: 341 -SPNSFYLVENPHSMYDAFSHIGK 363 + F+ ++ S+ ++ + + Sbjct: 256 KTGGRFFPAQDQASLRQVYAELDR 279 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G I + + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIALSTVVNEAKLR 200 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 88/246 (35%), Gaps = 46/246 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKI 195 +++ + S + + Q A D+++V D S S+ + Sbjct: 137 QQCGNQYYTTGVCSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW-------- 188 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKF 253 D + ++ + + P Q GL+ ++N F L + + + S++ Sbjct: 189 DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQY 245 Query: 254 GVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN+ ++YA + + + K++V +TDGE+ + + C Sbjct: 246 GGDLTNTFGAIQYARKYAYSA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQC 299 Query: 313 NEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSH 360 N + + I + + + ++A AS F+ V + ++ + Sbjct: 300 NH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 361 IGKDIV 366 +G+ I Sbjct: 357 LGEQIF 362 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G I + + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIALSTVVNEAKLR 200 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S ES + K + S++ + ++ + ++ + FS+ ++ Sbjct: 47 IDIVFIVDSS---ESSKIALFDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQ 103 Query: 232 EFFLLE-W-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 W + ++K+K ++ G T S + A ++ +G + K Sbjct: 104 IDPPFSSWKDLQTFKQKVKSMNLIGQGTFSYYAISNA-TRLLKREGRKDGV--------K 154 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +++ MTDG + D QS+ +A+ G I + + + LR Sbjct: 155 VVLLMTDGIDHPKNPDVQSIS--EDARISGISFITIALSTVVNEAKLR 200 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 83/231 (35%), Gaps = 31/231 (13%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 I ++ R + + + + A L++ VLD S SM T + ++ + Sbjct: 16 IDQASSQRQLSVSVAAK---GGGVSAPLNVCFVLDRSGSM------MGTPLQTVKQAASR 66 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++ + +N + ++ F +K E L ++R+I L G T G Sbjct: 67 IVDRL------SNRDRISIIAFDHKAEVLISNELASDPQAIKRRINSLRA-GGGTCIDDG 119 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK Q+ + ++ +TDGEN ++ +++ + A V Sbjct: 120 LKAGIEQLASGKEGYISQ----------LLLLTDGENEH-GDNSRAIKLADVAIGYNLTV 168 Query: 323 YAIGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +G + + L A A S +E+ F + + + + Sbjct: 169 NTLGFGDHWNQDVLEQIADAGGGSLSYIEHAEEAIATFGRLFTRMQSVSLT 219 >gi|296196207|ref|XP_002745750.1| PREDICTED: anthrax toxin receptor 2-like [Callithrix jacchus] Length = 663 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 77/232 (33%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + + + D+ VLD S S+ + + I Sbjct: 191 FPGLWLLTLSGPGVLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 239 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L T G Sbjct: 240 FVQQLAERF-VSPEMRLSFIVFSSQASIILPLTGDRDKISKGLEDLKHVNPVGETYIHEG 298 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 299 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 346 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 347 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 395 >gi|3929911|dbj|BAA34707.1| complement factor B/C2B [Cyprinus carpio] Length = 833 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 84/228 (36%), Gaps = 40/228 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + RL++ I+LD S S+ +A + ++ V+ + ++ G+++++ Sbjct: 336 AEGRLNVFILLDTSASISP------ESFHLAKNA---TIQLVQKLDSYEVTMRFGIISYA 386 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSK-----------FGVSTNSTPGLKYAYNQIFDMQGM 276 ++ +E + + L + + Y+ + TN L Y ++ ++ Sbjct: 387 SEAKEIVSI---TNDLSQDVHYVMRKLHEFSDKSHGNKRGTNLHDALNKVYEELALLREN 443 Query: 277 RQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIG 326 ++ E N +I+ TDG N+ + K+ VY Sbjct: 444 KRSHFNETQN---VIIIATDGYSNMGPSPINILPKIRNLFGYKSSVDHTKEELLDVYVFA 500 Query: 327 IRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + L++ AS +++++ + F+ + D + Sbjct: 501 VGQQVNKQELQSIASIKKDERHVFVLKDYRQLGLVFNQMISDSAVTKC 548 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 79/238 (33%), Gaps = 48/238 (20%) Query: 168 TDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A +D+++VLD S SM +S T++D + E V L P+ + +V+F Sbjct: 76 QQAPVDVVLVLDRSGSMNFVETPNSPTRLDYGKLAAINFAERV-LGPNGIPGSRVSVVSF 134 Query: 227 SNKIE-------------------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 S L + + I ++ FG TN+ G + Sbjct: 135 SGPAYATGVRNNPQRHYGQQNQATTDLDLSSDLRAVTDSINRITAFG-GTNTEAGFEQGR 193 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED-------------QQSLYYCNE 314 + I +G + N K+++ +TDG ++ + Q+++ Sbjct: 194 SVI---EGTTSNQNPNS---NKVVIMLTDGLPTASNGNPYAETTDINHVHIQRAINAGKN 247 Query: 315 AKKRGAI-VYAIGI---RVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDI 365 + V+ IG+ A +Y + + F I + + Sbjct: 248 IYQNDIADVFTIGLTTGMNATEKALADNILTQAQNKGYYPAPSATDLDAIFEEISQRL 305 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 90/260 (34%), Gaps = 49/260 (18%) Query: 119 VVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 VP+N + + + K P + S T A +D++ V Sbjct: 138 TQVPENTSERNFTVLVHLKAP--PAQHLLQ-------SSSDLGDGNGVSTTRAPVDLITV 188 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LDVS SM TK+ + +++ +++ + + ++ FS+ F L Sbjct: 189 LDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSSARRLFPLR 236 Query: 238 WGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +++ + L+ G TN GL+ +G + + N I+ Sbjct: 237 RMTESGRQQSLLAVNSLTANG-GTNIAEGLR---------KGSKVIEERQSKNPVCSIIL 286 Query: 294 MTDGENLSTKEDQQ-----SLYYC------NEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 ++DG++ T + YC + V+ G L + + Sbjct: 287 LSDGQDTYTVSPTAGVHKGATEYCALLPSTTTNGSQQVPVHVFGFGADHDSVSLHSISQT 346 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S +F +E ++ DAF+ Sbjct: 347 SGGTFSFIETEATIQDAFAQ 366 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S + + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNGIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + ++++ YA+ + R I FL Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVAVLGYYNRRGINPETFLNE 316 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 317 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 350 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 58/168 (34%), Gaps = 23/168 (13%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDSSITKID---MAIKSINAMLEEVKLIPDVNNVVQSG 222 + D+M+ +D+S SM F D + ++D + + + + K + G Sbjct: 82 DEPARDLMLAIDLSGSMATRDFVDPAGERMDRLSAVKRVVANFVAKRKGD-------RIG 134 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 LV F + L ++ + + + G D G+ Sbjct: 135 LVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIG---------DAIGLTVKLMA 185 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + +K+++ +TDG T AK+ +V+ +GI Sbjct: 186 DSHAQEKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVVHTVGIGDP 231 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 93/265 (35%), Gaps = 31/265 (11%) Query: 83 LCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRSTSLDIVVVPQNEGYSISAISRYKI 137 + ++ +F ++ + N + + S+ + Y++ A Sbjct: 100 IRQVATDPVSTFSTDVDTASYTNARRFLNQGMRPPADSIRVEEFINYFDYALPAPDTTNT 159 Query: 138 PLKFCTFIPWYTNSRHIVMPITS--SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 P++ T + + S S + + +T L+++ +LDVS SM S K+ Sbjct: 160 PIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLNLVFLLDVSGSMNS-----PDKL 214 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKF 253 + +S N ++ +++ + + ++ + G + + + I L Sbjct: 215 PLMQRSFNLLVSQLRPQD------RVAIAVYAGQSGVVLEPTSGDQKAQINQAINQLRA- 267 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYC 312 G T+ + G+ AY + + N I TDG+ N+ T + Sbjct: 268 GGGTHGSAGIHLAY-DLAQANYLPDGINR--------IFIGTDGDFNVGTTSLTELKALI 318 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ G + +G ++ L Sbjct: 319 ERKREAGVFLSVLGFGTGNYNDALM 343 >gi|302551540|ref|ZP_07303882.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] gi|302469158|gb|EFL32251.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] Length = 518 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 75/200 (37%), Gaps = 28/200 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + V+DVS SM S ++D+A +S++ M + ++ V +VT Sbjct: 162 TGERLPAALTFVIDVSGSM-----SEPGRLDLAQESLSVMTDRLRDDDSVA------IVT 210 Query: 226 FSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++ E + +Q I L STN G++ Y ++G+R+ Sbjct: 211 FSDEAETVLPMTRLDGNRDEIQDVISDL-ATQDSTNLGAGVETGYE--TAVEGLREGATN 267 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +V ++D + D ++ E ++ G ++ +G+ + A Sbjct: 268 R-------VVLVSDALANTGDTDADTILERIAGERREHGITLFGVGVGSDYGDALMERLA 320 Query: 341 S--PNSFYLVENPHSMYDAF 358 V P ++ F Sbjct: 321 DRGDGHTVYVSGPDEAHEVF 340 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/246 (15%), Positives = 88/246 (35%), Gaps = 46/246 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKI 195 +++ + S + + Q A D+++V D S S+ + Sbjct: 137 QQCGNQYYTTGVCSDISPDFQLSASFSPATQPCPSLIDVVVVCDESNSIYPW-------- 188 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKF 253 D + ++ + + P Q GL+ ++N F L + + + S++ Sbjct: 189 DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQY 245 Query: 254 GVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN+ ++YA + + + K++V +TDGE+ + + C Sbjct: 246 GGDLTNTFGAIQYARKYAYSA------ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQC 299 Query: 313 NEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSH 360 N + + I + + + ++A AS F+ V + ++ + Sbjct: 300 NH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 361 IGKDIV 366 +G+ I Sbjct: 357 LGEQIF 362 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 120/370 (32%), Gaps = 51/370 (13%) Query: 36 MIIEVSHIFFMKTVLHSM---IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM 92 + I S + S ID SL + N + D IK Sbjct: 85 LAIYKSEELHQLKQVQSSERLIDASLAAKQRSAKHVINTHYVAAPIASDAWYGIKQPERN 144 Query: 93 SF----------RNELRDNGFVNDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RYK 136 F E+ + F D+D ST ++ P+ I + Y Sbjct: 145 RFEKQIQNGIMVAGEIPISTFSIDVDTGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYD 204 Query: 137 IPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSMES 186 PL P+ + P ++ +++ +LDVS SM Sbjct: 205 YPLPNKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFLLDVSGSM-- 262 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +S K+ + ++ + +++ V+ VV +G + + L Sbjct: 263 ---ASADKLPLLQTALKMLTQQLSAQDKVSIVVYAGAAG----VVLDGASGDDIQALTYA 315 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKED 305 ++ L G STN + G+ AY ++H N ++ TDG+ N+ Sbjct: 316 LEQLRA-GGSTNGSQGILQAYQL------AQKHFIQGGINR---VILATDGDFNVGVTNF 365 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + + K+RG + +G + + + + A + + ++ +A + + Sbjct: 366 DLLISLIEKEKQRGIGLTTLGFGMDNYNDQLMEQLADKGNGHYAY-IDTLNEARKVLVDE 424 Query: 365 IVTKRIWYDK 374 + + + K Sbjct: 425 LSSTLLTIAK 434 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 70/197 (35%), Gaps = 32/197 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LDVS SM+ K+ ++ + + L + + G++ F N Sbjct: 86 IIIALDVSNSMKEK-----NKLKISKEILRDFLLKRDEED------RIGILVFDNLPFRL 134 Query: 234 FLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L R I + T GL A N + KI Sbjct: 135 MPLTSDRGALLRVISIIRPAMVDVGGTAMYDGLVEALNMFMKDRRN------------KI 182 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLV 348 I+ +TDG ++++K + + N+ GA +Y IG+ + L A+ + V Sbjct: 183 IILLTDGGDINSKYTLEDVVRFNQ--DIGAKIYTIGVSSGMNFYVLERLSEATGGKAFFV 240 Query: 349 EN--PHSMYDAFSHIGK 363 ++ F I + Sbjct: 241 TKDYQKALRSVFDEINR 257 >gi|239629502|ref|ZP_04672533.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] gi|239528188|gb|EEQ67189.1| von Willebrand factor domain containing protein [Lactobacillus paracasei subsp. paracasei 8700:2] Length = 909 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 50/293 (17%), Positives = 103/293 (35%), Gaps = 33/293 (11%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA L + ++I + M T L + + AA N Sbjct: 1 MTKMTAKVARTGHLFVVLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES + + Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESKNNGGTDRAG 163 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGV 255 + L ++ + + V GL+ FS+ G +++ + Sbjct: 164 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGK----AG 218 Query: 256 STNSTPGLKYAYNQIFDMQ-----GMRQHCNTEDANY---KKIIVFMTDGENL 300 +T+ + A + F+ G+RQ +A+ KK+++ +TDG Sbjct: 219 NTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/308 (12%), Positives = 90/308 (29%), Gaps = 48/308 (15%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I G IL + +P++F V + + + K L + +++ + Sbjct: 2 IMKLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEE 61 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + K + + N ++ DD Y Sbjct: 62 DEVSTQTGKDYVAHYMHDMSNLVDIEVEKLECSELPECTADD----------NDRPFVEY 111 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITS------SVKVNSQTDARLDMMIVLDVS 181 +S +++K W+ + V S S Q+ +D+ +LD S Sbjct: 112 QVSGRTKHK---------SWFPGNDVTVGFGESFDVTGMSKARKFQSSQPMDITFILDFS 162 Query: 182 RSMESFFD------------------SSITKIDMAIKSINAMLEEVKLIPDVNN--VVQS 221 SM ++ S +++ + + +E+++ + + Sbjct: 163 GSMNYDWEGHAPSYMEEEVPKVPGRYSPPSRLSDLKDVVQMVTDELQVYNNSTTGPKHRV 222 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + ++ + G ++ ++I + G K N+ F ++G Sbjct: 223 AMTGYNRRTVNES--SNGKFVIRDQRITKYNSDGYDAGDKFYPKKTINKQFMVKGAAARV 280 Query: 281 NTEDANYK 288 D + Sbjct: 281 PNGDEKAE 288 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + D A + ++E+ L+ G+VTF + E Sbjct: 310 VCLVLDKSGSMSAE-DRLFQMNQAAELYLIQVIEKGSLV---------GMVTFDSVAEIQ 359 Query: 234 FLLEWGV--SHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + Q+ L + T+ GLK + I Sbjct: 360 NHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C + K+ GAI++ I + + E L F+ Sbjct: 411 IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRFFA 463 Query: 348 VENPHSMYDAF 358 ++ + +AF Sbjct: 464 NKDITGLTNAF 474 >gi|296190554|ref|XP_002806559.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Callithrix jacchus] Length = 3582 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 70/212 (33%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELLFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ S ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTSRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVLFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|47551047|ref|NP_999700.1| complement factor B [Strongylocentrotus purpuratus] gi|3928787|gb|AAC79682.1| factor B SpBf [Strongylocentrotus purpuratus] Length = 833 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 68/193 (35%), Gaps = 14/193 (7%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ D S S+ + ++ ++ ++++ + G V++S++ Sbjct: 388 MDIYFAFDASNSVG------LKNFEIGKTFAKQLVGKLQVNTSPGG-TRVGAVSYSSEAR 440 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F + S + +K + TN L A I + + ++ K+I+ Sbjct: 441 RLFNVNDFTSTVD-VVKAIEANVNYTNKGTNLPAALETI-GVMITETADESGYSSRKRIL 498 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLV 348 +TDG + +S E ++ IGI L ASP + + Sbjct: 499 FIITDGFSNVGGAPSKSAQPLKE--DAALKIHCIGISRNTDKTALAEIASPPVSEHVFYL 556 Query: 349 ENPHSMYDAFSHI 361 + + + A I Sbjct: 557 SDYNELERAVEAI 569 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 76/236 (32%), Gaps = 34/236 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 332 FSYHMPNWFGTTKYVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 385 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 386 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRSIRY 442 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 443 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIITDGQ-----SYDDVRGPA 489 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDI 365 A G ++++G+ + L+ AS + + + S I + I Sbjct: 490 AAAHDAGITIFSVGVAWAPLDD-LKDMASKPKESHAFFTREFTGLEPIVSDIIRGI 544 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 87/231 (37%), Gaps = 22/231 (9%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQ 167 + + +L + + Y+ ++ + P T + + +++ + + Sbjct: 169 LPQKDTLRVEEMLNYFSYNYPQPNKNEAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKA 228 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ +LDVS SM S K+ + ++ + +++ V+ VV +G Sbjct: 229 ELGASNLVFLLDVSGSMASD-----DKLPLLQTALKMLTQQLDEQDKVSIVVYAGAAG-- 281 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L ++ L+ G STN G++ AY ++H N Sbjct: 282 --VVLDGAAGNDIKILTYALEQLTA-GGSTNGAEGIQLAYQL------AQKHFVKGGINR 332 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ T + + KK G + +G + ++ L Sbjct: 333 ---VILATDGDFNVGTTNLDELVDLVEVQKKHGIGLTTLGFGMGNYNDHLM 380 >gi|47207969|emb|CAF93030.1| unnamed protein product [Tetraodon nigroviridis] Length = 586 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/265 (13%), Positives = 88/265 (33%), Gaps = 35/265 (13%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + +S+++ ++ + + + M + D ++D+ +V+ Sbjct: 133 TLSQWTSSFSVTSSDQF-YKINQLCVPAKKPTKKKLTMK---KALIPGNKDCKIDVAMVI 188 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S + + ++ + + +K+ + G++ S+ FLL Sbjct: 189 DSSN------NIGRRRFNLQKNFLIKLAAMMKVGQTGPH---MGVIQASDTPRTEFLLSN 239 Query: 239 GV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 L IK L G +N+ + +A F Q Y +++V + D Sbjct: 240 YTQPKELLFAIKELGYLGGDSNTGKAITHAAETFF------QQERGGRRGYPRVMVVLVD 293 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--------LRACASPNSFYLV 348 G + E L A++ G V+ + + E ++ ++ + Sbjct: 294 GWPSDSLEQAAML-----AREFGINVFLVSVAKPAPEELPMVPDKDFMKKAVCRDNGFFS 348 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYD 373 S + + H+ K + + D Sbjct: 349 YGIPSWFSSSKHV-KALSQRLCSLD 372 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/123 (17%), Positives = 47/123 (38%), Gaps = 18/123 (14%) Query: 222 GLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G V ++ ++E FL + ++ +S T + + ++ +F ++ Sbjct: 448 GAVQYTYDQRLEFGFLDQPNKDEALAALRRISYMSGGTATGEAISFSTQNLFRRTAPGRN 507 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +V +TDG+ EA+ +G V+++G+ + LRA Sbjct: 508 ----------FLVVVTDGQ-----SYDDVRRPAMEAQAQGISVFSVGVAWAPLDD-LRAM 551 Query: 340 ASP 342 +S Sbjct: 552 SSE 554 >gi|87310695|ref|ZP_01092823.1| BatB [Blastopirellula marina DSM 3645] gi|87286676|gb|EAQ78582.1| BatB [Blastopirellula marina DSM 3645] Length = 364 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 25/163 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++M VLDVSRSM + D + ++ A + I ML+ + + GL+ F+ Sbjct: 88 QKGIEVMFVLDVSRSMLAE-DVTPNRLQRAKQQIKDMLKVMAGD-------RVGLIAFAG 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L ++ + + + + A D Sbjct: 140 ETRQSVPLTSHYEDFKQTLDSVGPHTVRSGGSRLGDAIAAASKGFIDKTYDH-------- 191 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K IV TDGE+ +K + + +RG V+ +G+ Sbjct: 192 ---KAIVVFTDGEDQESKPVEVAKSLYA---ERGIRVFTVGLG 228 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/242 (16%), Positives = 76/242 (31%), Gaps = 57/242 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ + + A L D+ V+ G+V F+ Sbjct: 92 IILAMDVSGSMRA-TDVLPNRLVASQNAAKAFLA------DLPRNVRVGVVAFAGTAAVV 144 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF----------DMQGMRQH---- 279 L I + T G+ + ++F + R+H Sbjct: 145 QPPTVSREDLTAAIDKF-QLQRGTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSL 203 Query: 280 --CNTEDANYKKI-------------IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 +D N KK I+ +TDG+ SL A RG VY Sbjct: 204 DQAGKDDGNGKKAFTPVAPGSYTSAAIILLTDGQRT---TGIDSLDAAKVAADRGIRVYT 260 Query: 325 IGIRVIRS---------------HEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +G+ + E L+ A+ ++ + + + + Sbjct: 261 VGVGTVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTLSSRLTV 320 Query: 368 KR 369 ++ Sbjct: 321 EK 322 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 83/229 (36%), Gaps = 24/229 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT---FIPWYTNSRHIVMPITSSVKVNS 166 + S ++ I + Y A +P + T PW +++ + + I ++ + Sbjct: 216 LPPSGAVRIEEMINYFSYDYLATDAGDVPFRATTSVFETPWNADTQLLHIGIQGTL-PDV 274 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L+++ ++D S SM K+ + I + ML E++ +V+ + +G Sbjct: 275 SEHPPLNLVFLIDTSGSMN-----QPDKLPLLISAFRLMLSELRPEDEVSIITYAGSAG- 328 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + + + LS G ST GL+ AY M + Sbjct: 329 --QVLAPTPASDRAT-ILAALNRLSA-GGSTAGQAGLRQAYAIAAAMSEDGEIAR----- 379 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ + Y + G + +G ++ Sbjct: 380 ----VILATDGDFNVGLNDPDTLKDYITTRRDSGTYLSVLGFGRGNLND 424 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 56/138 (40%), Gaps = 26/138 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A LD++ VLDVS SM K+ + +++ ++ + + +V+FS+ Sbjct: 163 RAPLDLVTVLDVSGSMRW------DKLALVKQAMGFVIGSLGPHD------RLSVVSFSS 210 Query: 229 KIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L G S ++ L G TN GL+ A + G R+H N Sbjct: 211 GARRVTRLLRMSHTGKSLATEAVESLRA-GGGTNIAEGLRTAAKVL----GERRHRNAVS 265 Query: 285 ANYKKIIVFMTDGENLST 302 + ++ ++DG + + Sbjct: 266 S-----VILLSDGHDNYS 278 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 72/197 (36%), Gaps = 28/197 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 +++ +D S SM + T+ +A + + + + + L +FS++ ++ Sbjct: 592 ILLAVDTSGSMNARVGGR-TRFQVATTAADRAVGLLNSAD------RVALWSFSSETDQR 644 Query: 233 -----FFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + G + L+ + G +T ++ A+ ++ D D Sbjct: 645 RGKPYSEEIRLGPYDRAAFTRRLTGLRVGGNTALYATVRAAHRRLLDN---------HDP 695 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRVIRSHEFLR--ACAS 341 + +V +TDG+N +++ + R V+ + L A AS Sbjct: 696 DRINAVVVLTDGKNEYPRDNDLDRLLADIELDPNRPVKVFCVAFDRESDLAALDRIAGAS 755 Query: 342 PNSFYLVENPHSMYDAF 358 + +P ++ +AF Sbjct: 756 AGKAFDATDPATIDEAF 772 >gi|119613999|gb|EAW93593.1| hCG19532, isoform CRA_a [Homo sapiens] Length = 302 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 24/165 (14%) Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G++ +S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA- 68 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--- 332 KK+ + +TDG+ +++ ++ A ++ IG+ Sbjct: 69 --------RPGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNF 119 Query: 333 ----HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E P Y ++ ++ D + ++I D Sbjct: 120 EIFHKEMNLIATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 160 >gi|332233322|ref|XP_003265851.1| PREDICTED: anthrax toxin receptor 2 [Nomascus leucogenys] Length = 488 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 31/206 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 42 AFDLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQA 89 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ L T GLK A QI G++ Sbjct: 90 TIILPLTGDRGKISKGLEDLKHVSPVGETYIHEGLKLANEQIQKAGGLKTSS-------- 141 Query: 289 KIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 II+ +TDG + L ++ ++ GA VY +G+ + R S + Sbjct: 142 -IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQLERIADSKEQVF 197 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 V+ + A I I+ + Sbjct: 198 PVKGG---FQALKGIINSILAQSCTE 220 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 43/331 (12%), Positives = 103/331 (31%), Gaps = 73/331 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R + G + I+ A+ L + + +G + + ++ + S +D +L+ A +I Sbjct: 28 FRGLGRDRGGNVAIVVALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKEI-- 85 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 D +K F ++ ++ + DID + + Sbjct: 86 -----------DTDDTDALKEKVADWFHAQVENSYTLGDIDIDTSNHKITATASGTVPTT 134 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--- 184 + N + + + S+VK + L++ IV+D S SM Sbjct: 135 LMKI-----------------ANIDTVDVSVASAVK--GPATSYLNVYIVIDTSPSMLLA 175 Query: 185 -----------------------------------ESFFDSSITKI--DMAIKSINAMLE 207 + + K+ D+A ++ +L+ Sbjct: 176 ATTAGQSAMYSGIGCQFACHTGDAHTVGKTKYANNYEYSAAKTIKLRADVAGDAVRDVLD 235 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ GL + + + E + ++ S S S + Sbjct: 236 MIDDSDSNHQRIKVGLYSLGDTLTEVLTPTLSTDTARNRLADASYGLTSATSKAATYFDV 295 Query: 268 NQIFDMQGMRQHCNTEDANYK-KIIVFMTDG 297 + Q + + + K+++ +TDG Sbjct: 296 SLATLKQKVGTGGDGTSSGSPLKLVLLLTDG 326 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 203 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 256 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 257 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 311 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 312 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 362 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 363 TLSTLGVG 370 >gi|22219356|pdb|1M1U|A Chain A, An Isoleucine-Based Allosteric Switch Controls Affinity And Shape Shifting In Integrin Cd11b A-Domain Length = 195 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 88/215 (40%), Gaps = 30/215 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +++ + Q D+ D+ ++D S S+ ++ + ++ L++ K + + + Sbjct: 4 ALRGSPQEDS--DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSE 58 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + F+ K + + + +K +++ T++ G++ ++F++ Sbjct: 59 EFRIHFTFKEFQNNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------ 106 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFL 336 N N KI+V +TDGE + + EA + G I Y IG R +S + L Sbjct: 107 NGARKNAFKILVVITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQEL 164 Query: 337 RACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 AS + + V N ++A I + K Sbjct: 165 NTIASKPPRDHVFQVNN----FEALKTIQNQLREK 195 >gi|332222722|ref|XP_003260519.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Nomascus leucogenys] Length = 3535 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 58/168 (34%), Gaps = 23/168 (13%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDSSITKID---MAIKSINAMLEEVKLIPDVNNVVQSG 222 + D+M+ +D+S SM F D + ++D + + + + K + G Sbjct: 90 DEPARDLMLAIDLSGSMATRDFVDPAGERMDRLSAVKRVVADFVAKRKGD-------RIG 142 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 LV F + L ++ + + + G D G+ Sbjct: 143 LVVFGDAAYPQAPLTLDHDSVRILLDQMQIGMAGPRTAIG---------DAIGLTVKLMA 193 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + +K+++ +TDG T AK+ +V+ IGI Sbjct: 194 DSHAQEKVLILLTDG--NDTSSAIPPERAAEIAKQHKLVVHTIGIGDP 239 >gi|193783708|dbj|BAG53619.1| unnamed protein product [Homo sapiens] Length = 868 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|55662683|emb|CAH74138.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665761|emb|CAH73557.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 845 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 58 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 108 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 109 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 160 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 161 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 214 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 215 PK-----EEHCYLLHSFEEF-EALARRALHED 240 >gi|55662684|emb|CAH74139.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665762|emb|CAH73558.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55957947|emb|CAI14068.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 3548 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 58 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 108 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 109 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 160 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 161 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 214 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 215 PK-----EEHCYLLHSFEEF-EALARRALHED 240 >gi|148886654|ref|NP_699197.3| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Homo sapiens] gi|296452942|sp|Q4LDE5|SVEP1_HUMAN RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=CCP module-containing protein 22; AltName: Full=Polydom; AltName: Full=Selectin-like osteoblast-derived protein; Short=SEL-OB; AltName: Full=Serologically defined breast cancer antigen NY-BR-38; Flags: Precursor Length = 3571 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|119579467|gb|EAW59063.1| hCG1794476, isoform CRA_b [Homo sapiens] Length = 1196 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|112180424|gb|AAH30816.1| SVEP1 protein [Homo sapiens] Length = 868 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|37222213|gb|AAQ89957.1| selectin-like protein [Homo sapiens] gi|68655017|emb|CAF04067.1| SEL-OB protein [Homo sapiens] Length = 3574 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 80/224 (35%), Gaps = 34/224 (15%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 WY V+P+T+ ++ + + ++IV+DVS+SM + + +++++A + ++ Sbjct: 344 WYRTPVEAVLPVTTDLRTEVEV-PLVALVIVMDVSQSMTA---GNPSRLELAKEGAVGVV 399 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPG 262 + G +TFS++ E F G + I ++ G T P Sbjct: 400 DLAYERD------MLGFITFSDRPEWVFRPRQATLQGKREMTAAILNVAPQG-GTIFEPA 452 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE------NLSTKEDQQSLYYCNEAK 316 + A + + + +H ++ +TDG+ S + Sbjct: 453 YREALDVLMAQEAAVKH-----------VIVLTDGKFADGTGPFSRGPAPDFGRLAALGR 501 Query: 317 KRGAIVYAIGIR-VIRSHEFLRAC-ASPNSFYLVENPHSMYDAF 358 + G I I + A +Y + ++ F Sbjct: 502 RSGITTSTIAIGDGADPQQLTTIARAGGGRYYEALDVSTLPRIF 545 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 77/205 (37%), Gaps = 26/205 (12%) Query: 130 SAISRYKIPLKF-----CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + +IP+ F PW + + + + + ++ +++ ++D S SM Sbjct: 186 TKPIAARIPMPFAVKYELAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSM 244 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + ++ + ++ ++ +++ ++ V +G + + ++ Sbjct: 245 QP-----AERLPLIRSALKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIK 295 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 I L +G ST GL+ AY Q + N I+ TDG+ NL Sbjct: 296 AAIDNLDAYG-STGGEAGLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGIT 345 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIR 328 + + + +++G + +G+ Sbjct: 346 DPKDIEALVKKEREKGITLSTLGVG 370 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 87/214 (40%), Gaps = 39/214 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++E + +Q G+V + Sbjct: 136 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLIEILTKFYIGPGQIQVGIVQYGE 184 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 185 DAVHEFHLNDYKS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 237 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFL-- 336 KK+++ +TDGE+ + + ++ + +++K YA+ + R I FL Sbjct: 238 --AKKVMIVITDGESHDSPDLEKVIR---QSEKDNVTRYAVAVLGYYNRRGINPETFLNE 292 Query: 337 -RACAS---PNSFYLVENPHSMYDAFSHIGKDIV 366 + AS F+ V + ++ D +G I Sbjct: 293 IKYIASDPDDKHFFNVTDEAALKDIVDALGDRIF 326 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 57/155 (36%), Gaps = 26/155 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D++ VLDVS SM +K+ + +++ +++ + + +V+FS Sbjct: 281 DRAPIDLVAVLDVSGSMAG------SKLSLLKRAVCFLIQNLGPSD------RLSIVSFS 328 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G I L+ G TN GLK + + Sbjct: 329 STARRIFPLRRMSDNGREAAGLAINSLTSSG-GTNIVEGLKKGVRVLEERSE-------- 379 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 N I+ ++DG++ ++ + A Sbjct: 380 -QNPVASIILLSDGKDTYNCDNVNRRQTSHCASSN 413 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 203 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 256 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 257 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 311 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 312 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 362 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 363 TLSTLGVG 370 >gi|126463435|ref|YP_001044549.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126105099|gb|ABN77777.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 651 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 79/232 (34%), Gaps = 23/232 (9%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P T PW +R + + + + + L+++ ++D S SM+ K+ + Sbjct: 262 PTLSITRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQ-----DPAKLPL 315 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +S ML ++ V V +G ++ S + + L G ST Sbjct: 316 LKQSFGLMLGRLRPEDQVAIVTYAGSAG---EVLAPTA-ANQRSTILSALDRLDA-GGST 370 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 GL AY +M G + +V TDG+ NL + ++ + Sbjct: 371 AGDEGLALAYRTASEMAGAGEVTR---------VVLATDGDFNLGISDPEELARLVAHER 421 Query: 317 KRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + +G ++A A + S+ +A + + Sbjct: 422 DTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLSG 472 >gi|291411005|ref|XP_002721795.1| PREDICTED: integrin alpha M [Oryctolagus cuniculus] Length = 1155 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 82/228 (35%), Gaps = 28/228 (12%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 S + P + D+ ++D S S++S T + ++ ++E+ Sbjct: 126 FGSNLLQSPQRVPETLRGCPQQESDIAFLIDGSGSIDS------TDFQRMKEFVSTVMEQ 179 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYA 266 + L+ +S + F + + +K + + T++ G+ Sbjct: 180 FTKSNSL-----FALMQYSEEFRTHFTFSDFKRNPNPRALVKPIRQLLGRTHTATGILKV 234 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++F + AN +K++V +TDGE + + + A++ G I Y +G Sbjct: 235 VTELFHSS------SGARANARKVLVVITDGEKFGDTLEYEDVIP--RAEREGVIRYVVG 286 Query: 327 ----IRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 +S + L AS + V N ++ + + + I Sbjct: 287 VGDAFNSEQSRQELNTIASKPSREHVFRVNNFEALNTIRNQLQEKIFA 334 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 72/184 (39%), Gaps = 33/184 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +S + +++D ++V+DVS SM + I +++ ++ + D Sbjct: 28 PQMANASSPSASKVDAVLVVDVSNSMNTSDPGKIGN-----EAMKMFIDMLSTQND---- 78 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRK-----IKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + G+V +++ ++ L S ++ I L++ G T+++ G+K A + D Sbjct: 79 -KVGIVAYTDVVQREKALLNITSEADKQELKTFIDGLNR-GAYTDTSVGVKEALRILQDG 136 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTK--------EDQQSLYYCNEAKKRGAIVYAI 325 + IV + DG N K DQ EAK G +Y I Sbjct: 137 KTAGHAPM---------IVMLADGNNDFNKTTGRTESQSDQDMAQAVAEAKNSGVPIYTI 187 Query: 326 GIRV 329 G+ Sbjct: 188 GLNA 191 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 84/224 (37%), Gaps = 43/224 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT + +++ +D S SM + D T+++ A +S + +++ + +G+V F Sbjct: 83 QTRDGVSVVLAIDDSGSMAAN-DYQPTRLEAAKESASVLIKSLDPKD------YAGVVIF 135 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + K + + T GL A + + N Sbjct: 136 ESGATTAAYLSPDKDRVMEKTAAIEQKNGQTALGDGLALAVDMADSI-----------PN 184 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------- 331 KK++V ++DG + + + + A + V+ +G+ Sbjct: 185 QKKVVVLLSDGVGNAGVISPEDATAF----AAQNKVQVFTVGLGSKSPVLLGTDPTGTPQ 240 Query: 332 ----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 L++ A + ++Y + +++ ++ + K+IV ++ Sbjct: 241 YATLDEAALQSIAEKTGGTYYTSVDEQTLHQIYAGLNKEIVREK 284 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 66/179 (36%), Gaps = 18/179 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 P + + + + D+ + +D+S+SM + D +++ + + + Sbjct: 61 FAGPSIGTSVKEIKEEGK-DIFLAVDLSQSMNA-TDIGPSRLQRIKFELKELTKSFPSD- 117 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GL+ FS++ L + S LQ I G++T P N + Sbjct: 118 ------RIGLIIFSSEAFMQCPLTFDQSVLQLYID-----GLNTGLVPNFGTDLNAPLRI 166 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 R + K ++ ++DGEN + +E K G V+A+GI Sbjct: 167 ALDRFQNDESQEVKSKSVILISDGENFG----DELENIGSELKNLGVKVFALGIGTESG 221 >gi|296205952|ref|XP_002806985.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Callithrix jacchus] Length = 3176 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 70/202 (34%), Gaps = 24/202 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + A D++ ++D S S+ + + + +++ + + + L Sbjct: 30 ADVKNGAAADIIFLVDSSWSIGE------EHFQLVREFLYDVVKSLAVGEND---FHFAL 80 Query: 224 VTFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQ-IFDMQGMRQHC 280 V F+ FLL + + I +S G + + GL+Y + G R Sbjct: 81 VQFNGNPHTEFLLNTYRTKQEVLSHISNMSYTGGTNQTGKGLEYVMQSHLTKAAGSRA-- 138 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++IV +TDG + K V+AIG+ + Sbjct: 139 ---GDGVPQVIVVLTDG-----HSKEGLALPSAGLKSADVNVFAIGVEGADEGALKEIAS 190 Query: 341 SP--NSFYLVENPHSMYDAFSH 360 P + +EN S++D + Sbjct: 191 EPLNMHMFNLENFTSLHDIVEN 212 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 79/214 (36%), Gaps = 28/214 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + +V++ D++ +LD S ++ + ++ + + N+ Sbjct: 628 SGTPEVHANKR---DIIFLLDGSANVGK------ANFFSMRSFVMMLVNSLDV---GNDN 675 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKF----GVSTNSTPGLKYAYNQIFDM 273 ++ GLV FS+ F L + I +L + G N+ L Y + Sbjct: 676 IRVGLVQFSDTPVTEFSL--NTYQTKSDILDHLRQLQLQGGSGLNTGSALSY----VHAN 729 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + ++++ +T G+ + L N + G + + +G Sbjct: 730 HFTEAGGSRIREHVPQLLLLLTAGQ-----SEDSYLQAANALTRAGILTFCVGASQANKA 784 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 E + +P+ YL+++ S+ D + + + T Sbjct: 785 ELEQIAFNPSLVYLMDDFSSLPDLSQQLIQPLTT 818 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 49/312 (15%), Positives = 109/312 (34%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 1522 LEFVARNFFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVIRSSGIVSLGVG 1576 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + +P + + ++ P + Sbjct: 1577 DRNIDRTELQTITNDPRLVFTVRDFRDLPNIEERIMNSFGPSAATPAPPGVDIPSPSLPE 1636 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1637 KKKADIVFLLD--GSINFRRDS-------FQEVLXFVSEIVDTVYEDGDSIQVGLVQYNS 1687 Query: 229 -KIEEFFLLEWGVSH-LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 +EFF ++ + I + G N+ GL++ N G R + Sbjct: 1688 DPTDEFFXRDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LE 1742 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G K + + + G V+A+G+R I S E + ++ + Sbjct: 1743 QRVPQIAFVVTGG-----KSVEDAQQVSLALTQSGVKVFAVGVRNIDSEEVGKTASNSAT 1797 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1798 AFRVGNVQELSE 1809 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 41/272 (15%), Positives = 89/272 (32%), Gaps = 41/272 (15%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------- 166 ++ + + I++ F + ++V + SSV Sbjct: 174 AIGVEGADEGALKEIASEPLNMHMFNLENFTSLHDIVENLVSCVHSSVSPERAGDTETLK 233 Query: 167 --QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE++ P ++ G+V Sbjct: 234 DITAQESADIIFLIDGSN------NTGSVNFAVILDFLVNLLEKL---PVGTQQIRVGVV 284 Query: 225 TFSNKIEEFFLLEWGVSHLQ--RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS++ F L+ S Q +K L G N L + F R + Sbjct: 285 QFSDEPRTMFSLDTYSSKAQVLSAVKALGFAGGELANIGLALDFVVENHFT----RAGGS 340 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA 340 + +++V ++ G + D + A V++ G+ S L+ A Sbjct: 341 RVEEGVPQVLVLISAGPSSDEIGDGVVAL-------KQASVFSFGLGAQAASRAELQHVA 393 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + ++ F G D+ K + Y Sbjct: 394 TDDNLVFT------VPEFRSFG-DLQEKLLPY 418 Score = 43.2 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 24/163 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 1031 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 1080 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAY-NQIFDMQGMRQHCNTEDANYK 288 F L + I+ L+ G T N+ L++ N + G R Sbjct: 1081 EFYLNSYMNQQDIVNAIRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRI-----TEGVP 1135 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ +T + + KK GA+ IGI Sbjct: 1136 QLLIVVT-----ADRSGDDVRNPSVVLKKGGAVPIGIGIGNAD 1173 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 50/349 (14%), Positives = 107/349 (30%), Gaps = 43/349 (12%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQI-MNEGNGNNRKKLKGGDILCRIKN------TWNM 92 + + K + + + R QI + + + + RI+ Sbjct: 1287 LLNAHSSKDEVQNAVRRLRPKGGRQINLGSALEYVSRNIFKRPLGSRIEEGVPQFLVLIS 1346 Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 S +++ + V ++ + + V + E IS Y + +P + Sbjct: 1347 SGKSDDEVDDPVVELKQFGVAPLIVARNVDEEELVKISLSPEYVFLVSSFQELPSL--EQ 1404 Query: 153 HIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 ++ PI S + Q D++ ++D S + S Sbjct: 1405 KLLTPIMSLNSGHIQQLLASTHYPPPAVESDAADIVFLIDSSEGVRSEG------FAHIR 1458 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST 257 + ++ + + P V+ G+V FSN + F L+ S + I+ L G S Sbjct: 1459 DFVIRIVRRLNIGPSK---VRVGVVQFSNDVFPEFYLKTYRSQVPVLEAIRRLRLRGGSP 1515 Query: 258 -NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 N+ L++ F + + + +V + G K + + Sbjct: 1516 LNTGRALEFVARNFFVKSAGSRIEDG----VPQHLVLVLGG-----KSQDDVSRFAQVIR 1566 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 G + +G R I E P + V + + + I Sbjct: 1567 SSGIVSLGVGDRNIDRTELQTITNDPRLVFTVRDFRDLPNIEERIMNSF 1615 >gi|291414463|ref|XP_002723479.1| PREDICTED: collagen, type VI, alpha 1 [Oryctolagus cuniculus] Length = 868 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 59/167 (35%), Gaps = 29/167 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTF 226 +D++ VLD S S+ + ++A I +++ + V +G+V + Sbjct: 453 GPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSRDELVKFEPGQSHAGVVQY 506 Query: 227 SNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S+ + L+ IK L T + L+Y +++ Sbjct: 507 SHNQMQEHVDLRSPSIRNTQDLKEAIKKLQWMAGGTFTGEALQYTRDRLL---------- 556 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 557 -PPTPNTRIALVITDGRSDTQRDTTPLSVLCGP----DIQVVSVGIK 598 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + K + ++ V+ +V +S + Sbjct: 666 SPTDITILLDGSASVGSH---NFDTTKRFAKRLAERFLSAGRTDPAHD-VRVAVVQYSGR 721 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + F +T+ L Y R + Sbjct: 722 GQQQPERAALQFLQNYTVLAGTVDTMGFFNDATDVNDALSYV---------TRFYREASP 772 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 A KK ++ +DG + + EA++ G ++ + + + +R + + Sbjct: 773 ATAKKKVLLFSDGNSQGATAEAIERAV-QEAQRAGIEIFVVVVGRQVNEPHVRVLVTGKT 831 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 44/120 (36%), Gaps = 6/120 (5%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---D 214 + S + D +D+ VLD S S+ +D ++ + D Sbjct: 63 VRGSRALPLGADCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKNFIDNLNDRYYRCD 122 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N V +G + +S+++E L G L+ + + FG T + +K ++ Sbjct: 123 RNLVWNAGALHYSDEVEVISGLTRMPAGRDALKASVDAVKYFGKGTYTDCAIKRGLEELL 182 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 37/186 (19%), Positives = 59/186 (31%), Gaps = 17/186 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + +++ LDVS SM D S ++ A K+ + Sbjct: 63 LGLATLGVGAARPQASVPIAANSTTILLALDVSGSM-CSTDVSPNRLTAAEKAATDFI-- 119 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + P + + GLVTF+ L +K L+ T G+ + + Sbjct: 120 -RAQPGGS---RIGLVTFAGSAGLLVPPTDDTDKLLAALKSLTTS-RGTAIGQGILTSID 174 Query: 269 QIFDMQ------GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 I ++ G T +IV +TDG N EA R V Sbjct: 175 AIAEVDPSVPPTGADVPGGTGGEYAADVIVVLTDGANT---VGVDPRTAAGEAATRRLRV 231 Query: 323 YAIGIR 328 Y IG Sbjct: 232 YTIGFG 237 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 108/307 (35%), Gaps = 39/307 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 Query: 322 VYAIGIR 328 + G+ Sbjct: 347 LSTFGVG 353 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 87/234 (37%), Gaps = 29/234 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 ++S+ +M ++ T+ +++ +VLD S SM+ ++ K+ Sbjct: 13 NIDANQSSSQRQLMLSIAATSEQINTNLPINLCLVLDHSGSMQG------KPLETVKKAA 66 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 +++E + + + ++ F ++ I + ++ ++ KI+ L G T Sbjct: 67 LSLIESLGVND------RLSVIAFDHRAKVILPSQSRQDDLTLIRSKIQQLRA-GGGTAI 119 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G+K + Y I +TDGEN ++Q+ L A + G Sbjct: 120 DEGIKLGIQESSS----------GSKGYVSHIFLLTDGENEH-GDNQRCLKLAAVAAEYG 168 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + G + + L A + S +E P F+ + + + R+ Sbjct: 169 ITLNTFGFGDHWNQDILEKIADIAGGSLSYIERPEQALIEFTRLFNRLQSVRLT 222 >gi|119614000|gb|EAW93594.1| hCG19532, isoform CRA_b [Homo sapiens] Length = 304 Score = 57.9 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 60/165 (36%), Gaps = 24/165 (14%) Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G++ +S+K+E+ L+ S + + + G T + L+ A N +F+ Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAA-NDMFEDA- 68 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--- 332 KK+ + +TDG+ +++ ++ A ++ IG+ Sbjct: 69 --------RPGVKKVALVITDGQ-TDSRDKEKLTEVVKNASDTNVEIFVIGVVKKNDPNF 119 Query: 333 ----HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E P Y ++ ++ D + ++I D Sbjct: 120 EIFHKEMNLIATDPEHVYQFDDFFTLQDTLK----QKLFQKICED 160 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 72/212 (33%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ +K + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDESSSVGH------ANFLSELKFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ S L R+I ++ G T + + A + Sbjct: 132 NVVPRVDYISSRRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHS-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K+I +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|221640506|ref|YP_002526768.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] gi|221161287|gb|ACM02267.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] Length = 651 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 78/232 (33%), Gaps = 23/232 (9%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P T PW +R + + + + + L+++ ++D S SM+ K+ + Sbjct: 262 PTLSVTRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQ-----DPAKLPL 315 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +S ML ++ V V +G ++ S + + L G ST Sbjct: 316 LKQSFGLMLGRLRPEDQVAIVTYAGSAG---EVLAPTA-ANQRSTILSALDRLDA-GGST 370 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 GL AY +M G + +V TDG+ NL + + + Sbjct: 371 AGEEGLALAYRTASEMAGAGEVTR---------VVLATDGDFNLGISDPEDLARLVAHER 421 Query: 317 KRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + +G ++A A + S+ +A + + Sbjct: 422 DTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLSG 472 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 77/237 (32%), Gaps = 38/237 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + +V+ + L+++ ++D S S+ + K + + Sbjct: 83 PKENKVEKLGQSFKRNVRKLRERSHSLELVFLVDESSSVGHANFLNELK------FVKKL 136 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVS 256 L + ++P + +VTFS+K +++ S L R+I ++ G Sbjct: 137 LSDFPVVPSA---TRVAIVTFSSKNNVVPRVDYISSSRAHQHKCSLLNREIPNITYRGGG 193 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T + + A + N K+I +TDG + + Sbjct: 194 TYTKGAFQQAAQILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLR 239 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 G ++ GI E ++P E + +F + + + + D Sbjct: 240 DFGVEIFTFGIWQGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 290 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 52/327 (15%), Positives = 109/327 (33%), Gaps = 49/327 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + +++ + + G ++ L + + +E + + + Sbjct: 169 QNGGQMTILRPLPRARSSGRLHHMTSLLPDTDRSIFNDDEPLDSLSEANEGSQQGCLRTV 228 Query: 111 VRSTSLDIVVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 T + VP++ + + + K PL + T Sbjct: 229 EIKTYPEFTEVPESTSERNFTVLVHLKAPLAQTLQTSSKLEDGN----------SLGTTR 278 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS+ Sbjct: 279 APVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSS 326 Query: 230 IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L G + + L+ G TN GL+ I D Q C+ Sbjct: 327 ARRLFPLRRMTETGRQQSLQAVYSLTSNG-GTNIAEGLRKGSKVIEDRQAKNPVCS---- 381 Query: 286 NYKKIIVFMTDGENLSTKEDQQSL-----YYCNEA--KKRG---AIVYAIGIRVIRSHEF 335 I+ ++DG++ T + YC+ G V+ G Sbjct: 382 -----IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVS 436 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 437 LHSISQTSGGTFSFIETEAAIQDAFAQ 463 >gi|163848230|ref|YP_001636274.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526140|ref|YP_002570611.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669519|gb|ABY35885.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450019|gb|ACM54285.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 419 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 73/209 (34%), Gaps = 31/209 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q + + VLD S SM KI+ +++ +E++ ++ +V Sbjct: 38 TQVRVPIHVCFVLDRSGSM------KGEKIERLRQAVVRAIEQLNEQDIIS------IVI 85 Query: 226 FSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ E + + + + G T P L+ ++ Sbjct: 86 FDHRTEVLVPAQPVRQRTAILDLVHRIRDAG-GTRIAPALEKGIQEL----------QKM 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 ++ +V +TDG+ KE L ++A + G + A+GI + + L A S Sbjct: 135 PQGVRR-LVLLTDGQTEHEKE---CLLRADDAGRLGIPITALGIGKDWNEDLLIEMANRS 190 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P + F H + I Sbjct: 191 RGVADYIAQPGEIVQYFQHTVQRAQQTTI 219 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 75/213 (35%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +D+S SM + D ++++A + + + P+ N GL F Sbjct: 83 KDSEGIDIMLAMDISASMLTN-DVIPNRLEVAKEVASDFI---SGRPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + L + L + + T GL A ++ + Sbjct: 135 AGEAFTQCPLTTDHASLINLLNSVRTDLVVKGLIQDGTAIGMGLINAVGRLKSSKAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------ 333 K+++ +TDG N + AKK VY IG+ +++ Sbjct: 193 ---------KVVILLTDGSNNVGSISPMTA--AEIAKKFNIRVYTIGLGTEQNNGYSDID 241 Query: 334 -EFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + FY ++ + + I K Sbjct: 242 YTTLRQIANVTNGKFYSAQSQTELSQIYKDINK 274 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 82/232 (35%), Gaps = 24/232 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNS 166 + ++ I + YS S + P PW ++ + + I Sbjct: 249 LPPKDAVRIEEMVNYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNKLLHIGIQGYQLPAG 308 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ +LDVS SM+ +K+ + +S+ +L +K V VV +G Sbjct: 309 HI-PQSNLVFLLDVSGSMDE-----PSKLPLVKQSMELLLSTLKPEDTVAIVVYAGAAGT 362 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + S + + L G ST GL AY Q+ + + N Sbjct: 363 VLEPTKVRE----KSKILAALHNLQA-GGSTAGGEGLALAY-QLAEANFNPKGVNR---- 412 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I+ TDG+ N+ D+ + + +G + +G + L Sbjct: 413 ----IILATDGDFNVGQTGDEPLQDFVERKRAKGIYLSVLGFGQGNYQDALM 460 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 77/218 (35%), Gaps = 29/218 (13%) Query: 150 NSRHIVMPIT--SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N +++V+ +T + + +++ IVLD S SM S K++ A ++ L Sbjct: 127 NPQNVVVKVTLDAPAPPSRMERPPVNIAIVLDRSGSM------SGQKLEKAKQAAIEALR 180 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + ++ + + ++ V ++ +I+ + G + + Sbjct: 181 RLGQKDMFS------VIVYDHNVKTIVPAQSARNVEWIESRIRGIGPGGNTALFGGVSQG 234 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A ++ H I+ ++DG N+ + K V Sbjct: 235 ASEVRKNLSNKYVHR----------IILLSDGLANVGPSSPEDLGRLGAALIKESISVTT 284 Query: 325 IGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 IG+ + + + + S + Y VE+ + F+ Sbjct: 285 IGVGTDYNEDLMARLSQNSDGNTYFVESSRDLPKIFAA 322 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 61/420 (14%), Positives = 130/420 (30%), Gaps = 62/420 (14%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI--- 65 R F + G + ++ + L + L++G ++ + +T L ++ D++++ A + Sbjct: 17 RVFAADRSGSIGMMFVVTLVPVLLLVGAAVDFTSYQKARTELDAVADQAVLAAVSAAGMK 76 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST-SLDIVVVPQN 124 M++ + D + N V +T I+ + Sbjct: 77 MSQADAEAAMAKLFTDAAAALPNVSASPRAATAPTTDGVRTASLTYSATIRTGIMRLAGF 136 Query: 125 EGYSISAISRYKIPLK-FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL-DVSR 182 + + P F F NS + + T++ + D+S Sbjct: 137 STVAFGGTATAASPNPIFTDFYLLLDNSPSMGVAATTADIATMVANTSDQCAFACHDMSA 196 Query: 183 SMESFFDSSIT-----KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEE 232 ++ + +ID+ + +++ + + +F + + Sbjct: 197 GGNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAKAIAAGQYRMAIYSFGTSCSGIGLNQ 256 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------MRQHCNTE 283 L +S + L + P Y +Q D G Sbjct: 257 VSALTANLSTSKTDAGAL-----DLMTVPYQNYNNDQCTDFDGIFARLNSAVPNPGSGAS 311 Query: 284 DANYKKIIVFMTDG--ENLSTKEDQQ-----------SLYYCNEAKKRGAIV-------- 322 A+ +K++ F++DG + + +L C K RG V Sbjct: 312 AASPQKVVFFVSDGVADANYPSTCTKPTTNGRCQEPITLANCQALKDRGIRVAVLYTTYL 371 Query: 323 -------YAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT--KRIW 371 Y I S + ACASP+ ++ V + DA + K +V +RI Sbjct: 372 PLPTNGWYNTWIAPFSSQIATNMAACASPDLYWPVSPSEGIADAMKGLFKKVVDSQRRIT 431 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 77/257 (29%), Gaps = 69/257 (26%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---------- 210 +++V + +D+M+ LDVS SM + +T+ + A + + +K Sbjct: 82 AMEVEYTSTDGVDIMLALDVSGSMGTL--DMLTRTEQAKLGVMNAEKILKRGEYWKYSRL 139 Query: 211 --LIPDVNNVV------QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST----- 257 + + + GL F + L L +K T Sbjct: 140 GYAQDVIAEFIGKRHSDRIGLSAFGARSFTQCPLTMDYGSLLEILKASDDLARDTLVNNR 199 Query: 258 -NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 GL A ++ ++++ +TDG + + + AK Sbjct: 200 TAIGDGLMNALARL-----------KMSDAKSRVVILLTDGRDNA--SVVPPVRAAEVAK 246 Query: 317 KRGAIVYAIGIRVIRSH----------------------------EFLRACAS--PNSFY 346 G VY +G+ + L+A AS FY Sbjct: 247 SLGVKVYTVGVGKKSGKILAFQQNPWTGEISWGERDITPEEGIDEDVLKAIASKTGGRFY 306 Query: 347 LVENPHSMYDAFSHIGK 363 EN + +S I + Sbjct: 307 RAENKAELEKIYSEIDE 323 >gi|227533248|ref|ZP_03963297.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227189098|gb|EEI69165.1| yvcC protein [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 596 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 96/289 (33%), Gaps = 31/289 (10%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 +TA L ++I + M T L + + AA N Sbjct: 1 MTAKVARTGHLFAVLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPTNSW 56 Query: 82 ILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 QVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK----- 111 Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + + + + Q +D+++V+D+S SMES + + Sbjct: 112 -----ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESKNNGGTDRAGAVR 163 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + L ++ + + V GL+ FS+ + I +T+ Sbjct: 164 TGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGG---NKTTGPGYIHVGLGKAGNTSQ 219 Query: 260 TPGLKYAYNQIFDMQ-----GMRQHCNTEDANY---KKIIVFMTDGENL 300 + A + F+ G+RQ + + KK+++ +TDG Sbjct: 220 QQAINSALSPTFNGGTYTQIGLRQGSAMLNEDTSGNKKMMILLTDGVPT 268 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 56/155 (36%), Gaps = 26/155 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D++ VLDVS SM +K+ + +++ +++ + + +V+FS Sbjct: 199 DRAPIDLVAVLDVSGSMAG------SKLSLLKRAVCFLIQNLGPSD------RLSIVSFS 246 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G I L G TN GLK + + Sbjct: 247 STARRIFPLRRMSDNGREAAGLAINSLXSSG-GTNIVEGLKKGVRVLEERSE-------- 297 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 N I+ ++DG++ ++ + A Sbjct: 298 -QNPVASIILLSDGKDTYNCDNVNRRQTSHCASSN 331 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 78/216 (36%), Gaps = 28/216 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 +N++ V S+V +S L++ ++LD S SM I ++ ++ Sbjct: 17 NQSNTQRQVAISLSAVSESSDRTLPLNLGLILDHSGSMTG------KPIKTVKEAAIRLV 70 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLK 264 E + + +V F +K + + + + ++I+ L G T G+K Sbjct: 71 ESLGSGD------RLSVVAFDHKAKVIVPNQPIDDIKTVNQQIQRLEPAG-GTCIDEGMK 123 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + R I +TDGEN ++++ L A + + Sbjct: 124 LGIKEVALGKDDRVSQ----------IFLLTDGENEH-GDNERCLKLAQVAAEYNITLNT 172 Query: 325 IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 +G + + L + A + +E P F Sbjct: 173 LGFGNHWNQDVLESIADAVGGTLCYIEQPEQAITEF 208 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 75/244 (30%), Gaps = 33/244 (13%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 N+ G + N N N + + L N Sbjct: 49 TNDNKGVYPTNSWTIPGQNTVINHQGGDASNGWDKNSSWSGDSSDTSKSYLKFGTDTSNP 108 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y I ++ + + +D+++V+D+S SM Sbjct: 109 DYQIRKYAKETSTPGLYDVYLNAKGNE-------------VKNIKPIDIVLVVDMSGSMN 155 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGV--S 241 S + ++ A + + L+ + + V G+V FS+ I L + S Sbjct: 156 SSVNGGNDRVGAARQGVKNFLKTINDA-GIGKYVNVGVVGFSSPGYISSSGTLSENIDAS 214 Query: 242 HLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 Q I ++ F T + G++ + + + ++KK+++ +TD Sbjct: 215 DNQAHITRINNLLANDFKGGTFTQLGIRTGQSML----------AGDSNDHKKMMILLTD 264 Query: 297 GENL 300 G Sbjct: 265 GVPT 268 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 87/249 (34%), Gaps = 39/249 (15%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + S + + ++S+ +Q D+++++DVS SM T + +A ++N Sbjct: 45 LMGNGDGSYSLTVSVSSTDMDTAQQQTESDVVVLMDVSGSMT------TTDMKVAKNAVN 98 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + ++ + D N+ V+ +V FS++ + G L ++T ++ G Sbjct: 99 GLANQL--LNDENDTVRMSIVRFSSEA-KTLEFSNGSEWTHS--PALVAQALNTLTSRG- 152 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN-------EAK 316 ++ D+ K +V M+DG + ++ C A Sbjct: 153 NTNWDGALQNASALVQ---GDSARKSYVVLMSDGYPNTINSCYPAVANCTDTSWSEPNAV 209 Query: 317 KRGAI---------VYAIGIRVIRSHEF--------LRACASPNSFYLVENPHSMYDAFS 359 + +YA+ R S +A P + S+ +AF Sbjct: 210 PKAIEAANTMPNTQIYAVSTRTSASESMKELVDGINAKAPKYPAQIMYGTDQQSLNNAFD 269 Query: 360 HIGKDIVTK 368 I I + Sbjct: 270 TIADAIRKR 278 >gi|42524419|ref|NP_969799.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] gi|39576628|emb|CAE80792.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] Length = 469 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 72/221 (32%), Gaps = 30/221 (13%) Query: 156 MPITSSVKVNSQTDARLDMMIV---LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +P +S +V ++ + +V +D S SM TKI + K + L Sbjct: 49 IPANASGEVKAEDLPSTAIPLVEYVIDSSGSMGQLMGPKKTKIYVLKKLLARYLMSQWTE 108 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + + G + + + +E S ++ +K G+ T LK AY Sbjct: 109 KTSSGLRVIGSRRKKDCKDNYLAIEPAQSKLGAIEGIVKGFEPVGM-TPIGQALKDAYKD 167 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-----KKRGAIVYA 324 + +H K +V TDGE ++ C A K + Sbjct: 168 V-------EHYKGP-----KRVVLFTDGEETCGQD------PCKIAAELSGKDVDLKFFV 209 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 + + + L A ++ + + F + K + Sbjct: 210 VAFGLQNQPDVLDKLACIGDMSQADDEEKLEELFQDLDKQL 250 >gi|308502223|ref|XP_003113296.1| CRE-DIG-1 protein [Caenorhabditis remanei] gi|308265597|gb|EFP09550.1| CRE-DIG-1 protein [Caenorhabditis remanei] Length = 13921 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 46/238 (19%), Positives = 90/238 (37%), Gaps = 32/238 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPIT------SSVKVNSQTDARLDMMIVLDVSR 182 + ++ + P +P ++R P T S + S D D++IVLD S Sbjct: 13151 VLSLIFFSYPHYSLQIVPNVESARTWPTPRTKATTPAGSGRSCSSIDFESDVIIVLDSS- 13209 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 E+F + A+ SI L PDV+ + G V +S+K+ L H Sbjct: 13210 --ENFTPDEFDSMKDAVASIVD--TGFDLAPDVS---KIGFVIYSDKVAVPVAL----GH 13258 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLS 301 + KI+ + K + G+ A ++ + RQ N KI++ +T+G+N Sbjct: 13259 YEDKIELIEKIVDAEKINDGVAIA---LYGLNAARQQFQLHGRENATKIVILITNGKNRG 13315 Query: 302 TKEDQQSLYYCNEAKKR-GAIVYAIGIR-VIRSHEFLRAC---ASPNSFYLVENPHSM 354 + + + G ++A+ + ++ ++P+ V + Sbjct: 13316 N-----AAAAAEDLRDMYGVQLFAVAVGSNPDELATIKRLVGNSNPDHVIEVAQSTEI 13368 >gi|225387166|ref|ZP_03756930.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] gi|225046714|gb|EEG56960.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] Length = 556 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 47/297 (15%), Positives = 107/297 (36%), Gaps = 40/297 (13%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 S D +++ A ++ + + + N +F ++ + N I Sbjct: 54 QSQWDGAVMEAEGPPLSHNTEEYNYIAENAFLA--VANAPLSTFAADVDTASYANLRRKI 111 Query: 111 -----VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--K 163 V + ++ I + Y P T+I + + + K Sbjct: 112 LEGNEVPADAVRIEEMLNYFTYDYP-EPTEDEPFSVTTYIGDCPWNENHKLLQIGLQAEK 170 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + +++ ++DVS SMES K+ + ++ + E ++ V+ + Sbjct: 171 PDLENQKPSNLVFLIDVSGSMES-----ADKLGLVKRAFLLLTENLRPEDTVS------I 219 Query: 224 VTFSNKIEEFFLLEWGVSHLQR-----KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 VT+++ ++ GVS ++ I+ L+ G ST+ + G++ AY + Sbjct: 220 VTYAS---SDTVVLDGVSGEEKAAIMTAIENLTA-GGSTDGSKGIETAYRL------AEE 269 Query: 279 HCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 H + N ++ TDG+ NL + + K+ G + +G + Sbjct: 270 HFQKDGNNR---VILATDGDLNLGLTSEGDLTRLIQKKKESGVFLSVMGFGTGNIKD 323 >gi|198430141|ref|XP_002124276.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 606 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 72/213 (33%), Gaps = 30/213 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++I+LD S S+ I I ++ +++ + + Q + ++N+ Sbjct: 408 PTDLVIILDASSSVNEDNWRRIKSI-----TVQLIIKFMS-----SESTQFAVFRYNNRP 457 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + L I+ + GV T + L +A + I N Sbjct: 458 DVATQILLQNTNDAAALLNAIENIPYGGVGTRTGIALGHAADVILHTD------NGNRPK 511 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLRACA 340 I++ MTDG+ + + + GA V A+GI + E L Sbjct: 512 AADIVLIMTDGQ---VNVNDNTQVPADALHNMGATVMAVGIEPEKFGGRFQLEELLDITQ 568 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + ++ I K V D Sbjct: 569 QNRNHIFLPDGSTVVSTVDEI-KSAVENLACQD 600 >gi|194381994|dbj|BAG64366.1| unnamed protein product [Homo sapiens] Length = 1207 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 50/312 (16%), Positives = 113/312 (36%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + + GG + F +R +G V+ Sbjct: 913 LEFVARNLFVKSAGSRIEDGVPQHLVLVLGGKSQDDVSR-----FAQVVRSSGIVSLGVG 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 ++ + + + ++P + + ++ P + + Sbjct: 968 DRNIDRTELQTITNDPRLVFTVREFRELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPE 1027 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + + E V + + + +Q GLV +++ Sbjct: 1028 KKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEDGDSIQVGLVQYNS 1078 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 + F L+ + + I + G N+ GL++ N G R D Sbjct: 1079 DPTDEFFLKDFSTKRQIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSR-----LD 1133 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G+++ +D +RG V+A+G+R I S E + ++ + Sbjct: 1134 QRVPQIAFVITGGKSVEDAQDVSLALT-----QRGVKVFAVGVRNIDSEEVGKIASNSAT 1188 Query: 345 FYLVENPHSMYD 356 + V N + + Sbjct: 1189 AFRVGNVQELSE 1200 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 86/261 (32%), Gaps = 24/261 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++V + + Y S + ++P K T I T+ + + ++ + Sbjct: 764 RNADQEELVKISLSPEYVFSVSTFRELPSLEQKLLTPITTLTSEQIQKLLASTRYPPPAV 823 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ ++D S + ++ ++ + + P V+ G+V FS Sbjct: 824 ESDAADIVFLIDSSEGVRPDG------FAHIRDFVSRIVRRLNIGPSK---VRVGVVQFS 874 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + F L+ S + I+ L G S N+ L++ +F + + Sbjct: 875 NDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDG-- 932 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +V + G K + + G + +G R I E P Sbjct: 933 --VPQHLVLVLGG-----KSQDDVSRFAQVVRSSGIVSLGVGDRNIDRTELQTITNDPRL 985 Query: 345 FYLVENPHSMYDAFSHIGKDI 365 + V + + I Sbjct: 986 VFTVREFRELPNIEERIMNSF 1006 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S + S F + + + ++E + + D V+ +V +S++ Sbjct: 422 DVVFLLDGSEGVRSGFP-------LLKEFVQRVVESLDVGQD---RVRVAVVQYSDRTRP 471 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + ++ L+ G T N+ L++ I + + Sbjct: 472 EFYLNSYMNKQDVVNAVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRITEG----VPQ 527 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++ +T + + K+ GA+ IGI Sbjct: 528 LLIVLT-----ADRSGDDVRNPSVVVKRGGAVPIGIGIGNAD 564 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 52/327 (15%), Positives = 109/327 (33%), Gaps = 49/327 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + +++ + + G ++ L + + +E + + + Sbjct: 160 QNGGQMTILRPLPRARSSGRLHHMTSLLPDTDRSIFNDDEPLDSLSEANEGSQQGCLRTV 219 Query: 111 VRSTSLDIVVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 T + VP++ + + + K PL + T Sbjct: 220 EIKTYPEFTEVPESTSERNFTVLVHLKAPLAQTLQTSSKLEDGN----------SLGTTR 269 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS+ Sbjct: 270 APVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSS 317 Query: 230 IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L G + + L+ G TN GL+ I D Q C+ Sbjct: 318 ARRLFPLRRMTETGRQQSLQAVYSLTSNG-GTNIAEGLRKGSKVIEDRQAKNPVCS---- 372 Query: 286 NYKKIIVFMTDGENLSTKEDQQSL-----YYCNEA--KKRG---AIVYAIGIRVIRSHEF 335 I+ ++DG++ T + YC+ G V+ G Sbjct: 373 -----IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVS 427 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 428 LHSISQTSGGTFSFIETEAAIQDAFAQ 454 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 65/173 (37%), Gaps = 20/173 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ LD+S+SM + D + +++ + + I+ +++ + + Sbjct: 61 GSKLKTIKREGVDIVFALDISKSMLAE-DIAPNRLEKSKQIISKIIDRLGSD-------R 112 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + + ++ + VS + + A + Sbjct: 113 VGIIVYAGNSYPLLPITTDHAAANMFLQNANPDMVS-SQGTAINEALELAKTYYNNDEQT 171 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + ++ ++DGE+ +++ G +Y IG+ R Sbjct: 172 N-------RFLIILSDGEDH----QEETKQVAQNLANNGVKIYTIGVGTARGG 213 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 1/74 (1%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDAF 358 ++++ C AK G +V+ IG V S + +R+CAS ++Y + + AF Sbjct: 522 NYSEKNDNLDEICTAAKNAGMVVFTIGFEVSGSQHDIMRSCASAPAYYFDVDGLDISAAF 581 Query: 359 SHIGKDIVTKRIWY 372 + I ++I R+ + Sbjct: 582 AAIAREISKLRLVF 595 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 64/198 (32%), Gaps = 24/198 (12%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + +G I ++ L G+ ++ + + +DR+++ AA+ Sbjct: 21 FLRDERGSFIIFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAAS------- 73 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 L + + + L +DID T+ ++ + Sbjct: 74 --------LDQDLDPEEVVLDYFTKAGLGHVISQDDIDVFENQTNGEVADDVAVTTRRVE 125 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFD 189 A +P F Y + + ++ L + +VLDVS SM + Sbjct: 126 ASVSALMPTTFLRLAHMYDLGLY--------TEGGAEEALSLSEISLVLDVSGSMGNSSS 177 Query: 190 SSITKIDMAIKSINAMLE 207 S +KI ++ + Sbjct: 178 SGYSKIYELRRAAKRFVN 195 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 70/225 (31%), Gaps = 34/225 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ +VLD S S+ + K + + + + +VT++ Sbjct: 270 AGSMNIYLVLDASDSIGKNNFTGAKKC---------LSSLIDKVASYGVEPRYAVVTYAT 320 Query: 229 KIEEFFLLE--------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + + L W L++ + TN+ L Y + + Sbjct: 321 EAKAVVKLSDKESSNADWVKQELEKIKYSDHRLKAGTNTKKALTMLYEMMILQE---SQN 377 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGAIVYAIGIRVI 330 + + +IV MTDG + ++ ++ VY GI + Sbjct: 378 DINWNKTRHVIVLMTDGNYNMGGDPVAAIEQIREFLDIGRNRKNPRENYLDVYVFGIGPL 437 Query: 331 RSHEFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E + A AS + V++ + + F + + + Sbjct: 438 VDQEKINALASKKDGEKHVFKVKDMEDLENVFYMMIDESKALSLC 482 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 83/237 (35%), Gaps = 50/237 (21%) Query: 109 DIVRSTSLDIVVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + +T + +P++ S + + K PL +P N P T Sbjct: 234 TVEITTHPEFTEIPESTSERSFTVLIHLKAPLAQSLHVPGDDN-----GPNTG------- 281 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS Sbjct: 282 -RAPVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFS 328 Query: 228 NKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L +++ + L+ G TN GL+ +++ + + + + Sbjct: 329 SSARRLFPLRRMTESGRKQSLLAVNSLTSNG-GTNIAEGLRKG-SKVIEERQAKNPVCS- 385 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+ ++DG++ T + Y + CA Sbjct: 386 -------IILLSDGQDTYTVSPSTGAH----------KPYTVSPTAGAQKASAEYCA 425 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 22/199 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T + D++ +LD S S+ +A + + +++++ + NV G F Sbjct: 495 CTFRQTDVLFILDNSGSVGP------DNFALAQEFVMDLVDQMTISSTAINV---GAFLF 545 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +++++ + +Q I T + L +A + + + + Sbjct: 546 NSQVQMLTAFTDDKTAIQAAISGYSYPPSNTAGTATGAALNFAVDTMLQSGAGYRGGSV- 604 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 ++ +TDG + S GA V ++GI LR A+ + Sbjct: 605 ------LVYVITDGRSQEDSSFVASA--AANLHATGAEVVSVGITSSVDETQLRTIATSD 656 Query: 344 -SFYLVENPHSMYDAFSHI 361 + +++E+ + Sbjct: 657 SNVFVLEDFSDLNGQLGDT 675 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 67/198 (33%), Gaps = 25/198 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++ LD S S+ + D+A + +E + + PDV V G++ Sbjct: 176 CDFNSLDVLFTLDASGSVG------VDNFDIAKNFVADSVELMDVDPDVIRVA--GMMFH 227 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +N + +F + + F T + L Y + Sbjct: 228 ANPLPQFDFDFSFDRDVIA--DAVRSFVYPTDRNWGTATGAALNYIRKYLL------VPS 279 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRAC 339 I+ F+TDG S + + GA V AIGI I + Sbjct: 280 AGNRDPADTIVYFITDG--NSQEALSFVQDAADNIHATGARVVAIGITDAIDQSQLEIIA 337 Query: 340 ASPNSFYLVENPHSMYDA 357 +SP+ +VE+ + + Sbjct: 338 SSPDDVIIVEDFADLDEV 355 >gi|324508324|gb|ADY43516.1| C-type lectin protein 160 [Ascaris suum] Length = 696 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 25/213 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + I +D+ V+D S+S+ + T +D + + ++ I Sbjct: 12 ALFIALKATQQECILGAVDLAFVIDGSQSVG--VTNFQTTLDAIVHA-----TQLTNIGP 64 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFD 272 + + LV F F L + ++ I+ N G+ A +IF+ Sbjct: 65 LEKDSRVALVVFDETAHIEFGLMNTTNADVEAAIRRSKYPNTNGVNIAAGMDAATRKIFN 124 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H + N K+++ +T + S + D ++ + A++ GAI AIGI Sbjct: 125 ------HAYPPNRNAPKLLIIIT---SNSDQSDVRAEN--SLAREVGAIPAAIGIHNGNG 173 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + A P Y + S+ D Sbjct: 174 IGASREQLEIATGDPKRVYTTTDYSSLDDILQK 206 >gi|149920127|ref|ZP_01908600.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] gi|149819070|gb|EDM78507.1| hypothetical protein PPSIR1_33369 [Plesiocystis pacifica SIR-1] Length = 400 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 76/222 (34%), Gaps = 31/222 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---- 228 ++++LD S SM + ++++ L +V L D+++ V+ G TFS Sbjct: 122 SVLMLLDRSGSMMQTGFDADEPDKSRWQALHEALGDVMLADDMDHFVEFGAKTFSTQGWG 181 Query: 229 ------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +I+ LL+ L+ L T + L + D + Sbjct: 182 ECGVSPQIDVPMLLDNSELLLELIPGPLEDVNGGTPTLAALDAGLGMMRDYEAPGA---- 237 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNE----AKK-RGAIVYAIGIRVIRSHEFLR 337 K +V +TDG T + +L A++ G Y +GI + + Sbjct: 238 ------KAVVLITDGSIGCTDDQAATLEQITTELTLAREVDGIATYVVGISPSYNSAKAQ 291 Query: 338 ------ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A + ++ N S++ A + D D Sbjct: 292 LGAMAEAGGGADDYFEAANAESLHAALEQVVADSYANSCHLD 333 >gi|301780866|ref|XP_002925835.1| PREDICTED: integrin alpha-D-like [Ailuropoda melanoleuca] Length = 1040 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 79/214 (36%), Gaps = 32/214 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +D++ ++D S S+ ++ ++++ + L Sbjct: 155 STECPSQEMDIVFLIDGSGSI---AQRDFQRMKGFVRAVMGQFGGTNTLFS--------L 203 Query: 224 VTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + +SN ++ F S Q + + + T + G++ ++F N Sbjct: 204 IQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQLNGLTFTATGIRTVVQELF------HSKN 257 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHE 334 +KI++ +TDG+ + + +A++ G + YAIG+ + Sbjct: 258 GARKTARKILIVITDGQKYKDPLEYSDVIP--QAERAGIVRYAIGVGDAFQKLTARQELN 315 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + S + + V+N + A S+I + + K Sbjct: 316 TIGSKPSQDHVFRVDN----FAALSNIQEQLQEK 345 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 70/210 (33%), Gaps = 39/210 (18%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +D+S SME ++T++D + + + GL+ F+ Sbjct: 89 DLILAIDLSGSMEREDFDLDGRTVTRLDAVKAVAKEFVTS-RAGD------RVGLILFAE 141 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + V+ + R + + G ST GL A ++ Sbjct: 142 FAYTAAPLTFDVAAVSRIVDEATIGISGRSTAIAGGLGLALKRL-----------KRSDA 190 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EF 335 ++++ ++DG T + A++ G V+ I + Sbjct: 191 QSRVVILLSDGS--DTSGNVLPRDSARLAEQLGVTVHTIALGPEDMETAPQTRDAVDTAT 248 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A S + V N + + I + Sbjct: 249 LRDIAELSGGRTFRVRNTDDLRAVTAEIDR 278 >gi|149921110|ref|ZP_01909568.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] gi|149817997|gb|EDM77456.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] Length = 532 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 76/218 (34%), Gaps = 20/218 (9%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 S + V + L++ IV+D S SM+ + + A+ + M Sbjct: 109 LSAAGSNTLDARGNFRDAVARTSAEPLNLAIVIDHSGSMKGQRERN------ALDAAAGM 162 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ V+ +V+++ K + + + + +S V S P Sbjct: 163 ISRLRDGDTVS------VVSYNTKAHTIVPVT--TLDARNRDRVISDLRVGVASRPSGNT 214 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + + ++ ++DGE N +++ EA+ RG + + Sbjct: 215 CVSCGVEAGLQTLQGRRPGIDR---MLLLSDGEANRGVRDEPGIRRLAREARNRGVSISS 271 Query: 325 IGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 IG+ V + + A A + Y E ++ F Sbjct: 272 IGVDVDYNEVLMSAIAREANGRHYFSETGSNLDAIFDQ 309 >gi|297201072|ref|ZP_06918469.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197712143|gb|EDY56177.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 516 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 75/204 (36%), Gaps = 26/204 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++ + T +VLD S SM ++ D S + ++ A L + V Sbjct: 321 TTLDTHGLTGTTARTYLVLDRSASMRPYYKDGSAQALGEQTLALAAHL------DPESKV 374 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS +++ L G++ + KI L G+ A ++ Sbjct: 375 H---VVFFSTELDGTGEL--GLTDHENKIDELHA-GLGRMGRTSYHAAVEEVL------A 422 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF-- 335 H + ++VF TDG + Q+L AK + ++ + + F Sbjct: 423 HHGKNAPDTPALVVFQTDGAPDAKTPATQALT--EAAKTHPNVFFSFVAFGDPENKAFDY 480 Query: 336 LR--ACASPNSFYLVENPHSMYDA 357 LR A+ + F E P + DA Sbjct: 481 LRKLKLANTSHFLAGETPKELTDA 504 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 79/218 (36%), Gaps = 36/218 (16%) Query: 167 QTDARLDMMIVLDVSRSME--SFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 Q D+M+ +D+S+SM+ F D++ I ++ + + ++ K + Sbjct: 87 QQRPVRDLMLAIDLSQSMQTQDFNDANGQRIDRLSAVKEVVQGFIQRRKDD-------RL 139 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GL+ F + L + L + + G +T + A + + H Sbjct: 140 GLIVFGSGAFAQAPLTLDHASLSLLLEDSGIGMAGPNTAIGDAIGLALKLL-----EQAH 194 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH------ 333 +K+++ +TDG T + A RG +++ IGI + Sbjct: 195 ------EPEKVLILLTDG--NDTSSAITPQHAAAMAAARGVVIHTIGIGDPSAEGEAKVD 246 Query: 334 -EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A + ++ E+ ++ ++ + + + Sbjct: 247 LSALEQIARTTGGRYFRAEDRSALDQVYATLDRLTPHQ 284 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 65/222 (29%), Gaps = 27/222 (12%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + +G + + AI L + +G ++ + ++ + + +D + + A Sbjct: 16 RFGQDLRGNIAPIFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAALMVA------- 68 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + N L N D S + Sbjct: 69 ---KDANAASPQMTADQVTAAAQKYFNALYHNT---DAQGASVSAVYTPYNNGTPATVVL 122 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S + P + T+S T L + + LDV+ SM Sbjct: 123 SGSGNVQTDFMKVVGFPQ-------ISFKTNSTATWGNTK--LRVAMALDVTGSM----- 168 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 SS K+ + +++ +K V ++ F+ + Sbjct: 169 SSAGKLVQMKIAAKKLIDTLKASATAEGDVYISIIPFNVMVN 210 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 51/152 (33%), Gaps = 24/152 (15%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN- 299 S L+ KI L G TN ++ A+ + E Y I+ ++DG N Sbjct: 332 STLKGKINNLVANGA-TNQAIAMQMAWMMLQPTAPFPAPAKDEKYKYTDAIILLSDGLNT 390 Query: 300 ----------LSTKEDQQSLYYCNEAKKRGA---------IVYAIGIRVIR--SHEFLRA 338 S++ D + CN K +Y I + L+ Sbjct: 391 QDRWYGNGSDWSSQVDTRQALLCNNIKNDPISKTDPTRRTRIYTIQVNTDGDPESTVLKN 450 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 CA+ F+ + AF+ IG + RI Sbjct: 451 CATDG-FFPTSTASGIASAFAQIGASLSQLRI 481 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 77/210 (36%), Gaps = 30/210 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D+++++D S S++ +++ I I + ++ + Sbjct: 148 CPKQPTDIVLLIDGSGSIQPN---QFSEMKTFISMIMKRFQNTNTQFALSQYARW----- 199 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDA 285 + E FL V + ++ +++ +T + ++ Q +G R + Sbjct: 200 -YREEFTFLDFQRVRNPDELLRPVTQLRGATLTATYIQRVVREQFVTEKGSRPGAS---- 254 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-----RSHEFLRACA 340 K+++ +TDGE + EA++ G I +AIG+ E + + Sbjct: 255 ---KVLIVITDGEKSGDPLQYSDVIP--EAERAGIIRFAIGVGKAFSGGTAKQELISIAS 309 Query: 341 SP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 P + + V+N +DA I + K Sbjct: 310 QPEDDHVFPVDN----FDALKDIQNKLQDK 335 >gi|297460574|ref|XP_597723.5| PREDICTED: collagen, type XX, alpha 1 [Bos taurus] Length = 1364 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 66/173 (38%), Gaps = 23/173 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +DM+ ++D S S+ + + +++E ++ P VQ GL +S Sbjct: 172 PVDMIFLVDGSWSIGH------SHFQQVKDFLASVIEPFEIGP---GKVQVGLTQYSGAP 222 Query: 231 EEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L +G + + L G +T + L + Q + Sbjct: 223 QTEWDLNAFGTKEEVLNAVHNLHYRGGNTFTGLALTHVLEQ------NLKPRAGLRPEAA 276 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K+++ +TDG K + + K G ++A+G++ E LR AS Sbjct: 277 KLVILVTDG-----KSQDDAHTAGHVLKGLGVDIFAVGVKNADETE-LRLLAS 323 >gi|297481786|ref|XP_002692297.1| PREDICTED: Collagen, type XX, alpha 1-like [Bos taurus] gi|296481265|gb|DAA23380.1| Collagen, type XX, alpha 1-like [Bos taurus] Length = 1342 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 66/173 (38%), Gaps = 23/173 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +DM+ ++D S S+ + + +++E ++ P VQ GL +S Sbjct: 172 PVDMIFLVDGSWSIGH------SHFQQVKDFLASVIEPFEIGP---GKVQVGLTQYSGAP 222 Query: 231 EEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L +G + + L G +T + L + Q + Sbjct: 223 QTEWDLNAFGTKEEVLNAVHNLHYRGGNTFTGLALTHVLEQ------NLKPRAGLRPEAA 276 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K+++ +TDG K + + K G ++A+G++ E LR AS Sbjct: 277 KLVILVTDG-----KSQDDAHTAGHVLKGLGVDIFAVGVKNADETE-LRLLAS 323 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 66/173 (38%), Gaps = 20/173 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ LDVS+SM + D + +++ A + I+ +++++ + Sbjct: 79 GSKLKTVKREGVDVVFALDVSKSMLAE-DIAPNRLEKAKQIISKIIDKLGSD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + + ++ + VS + + A + Sbjct: 131 VGVIIYAGNSYPLLPITTDHAAANMFLQNANPDMVS-SQGTAINEALELAKTYYNNDEQT 189 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + +V ++DGE+ +++ G +Y IG+ + Sbjct: 190 N-------RFLVIISDGEDH----QEETKQVAQNLSNDGVKIYTIGVGTEKGG 231 >gi|313213359|emb|CBY37183.1| unnamed protein product [Oikopleura dioica] Length = 435 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 74/209 (35%), Gaps = 25/209 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 N T LD++ V+D S S+ ++ + + ++ + D + Sbjct: 172 TQNECTSKALDIVFVVDESGSVGP------DNFELVKQFLVDYAQDSNIAADA---TRIA 222 Query: 223 LVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + T+S + F L ++ + I L G+ TN+ + N + Sbjct: 223 IRTYSTNSDLDFSLNDYKTKDIISVINNLVWNGLGTNTADAITNGLNDFGN--------- 273 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + KI+V +TDG++ + ++ +A R +AIGI E + Sbjct: 274 -DRSESVKIMVTITDGQSNYNQ--VKAAADLLKADPRNIQSFAIGIDGAHMAELQAIATT 330 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + F I K++ +K Sbjct: 331 DPGHIE---MLKNWSDFGPIKKNLQSKVC 356 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 52/327 (15%), Positives = 109/327 (33%), Gaps = 49/327 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + +++ + + G ++ L + + +E + + + Sbjct: 169 QNGGQMTILRPLPRARSSGRLHHMTSLLPDTDRSIFNDDEPLDSLSEANEGSQQGCLRTV 228 Query: 111 VRSTSLDIVVVPQNEG-YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 T + VP++ + + + K PL + T Sbjct: 229 EIKTYPEFTEVPESTSERNFTVLVHLKAPLAQTLQTSSKLEDGN----------SLGTTR 278 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D++ VLDVS SM TK+ + +++ +++ + + ++ FS+ Sbjct: 279 APVDLITVLDVSGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSS 326 Query: 230 IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L G + + L+ G TN GL+ I D Q C+ Sbjct: 327 ARRLFPLRRMTETGRQQSLQAVYSLTSNG-GTNIAEGLRKGSKVIEDRQAKNPVCS---- 381 Query: 286 NYKKIIVFMTDGENLSTKEDQQSL-----YYCNEA--KKRG---AIVYAIGIRVIRSHEF 335 I+ ++DG++ T + YC+ G V+ G Sbjct: 382 -----IILLSDGQDTYTVSPTAGVHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVS 436 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSH 360 L + + S +F +E ++ DAF+ Sbjct: 437 LHSISQTSGGTFSFIETEAAIQDAFAQ 463 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 45/248 (18%), Positives = 88/248 (35%), Gaps = 40/248 (16%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESF-------------- 187 FI W + P+ + QT ++ D+M+V+D+S SM++ Sbjct: 67 FIVWLLLVTAMTQPV---ILGEPQTRLQIGRDLMVVVDLSGSMDTKDFTLHVKQQTADGI 123 Query: 188 FDSSITKI-DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +SS T+I D I ++A+ + + + GL+ F + ++ R Sbjct: 124 ANSSGTEISDEYISRLDAVKRVLHEFAEQRQGDRLGLILFGDAAYLQAPFTADLASWLRL 183 Query: 247 IK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + ++ G ST+ L A + + N K+++ +TDG T Sbjct: 184 LDESRVAMAGQSTHVGDALGLAIKVMSSDEIKSSQKN-------KVVLLLTDG--NDTDS 234 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR-------ACASPNSFYLVENPHSMY 355 L A K+G V+ I I + + + A + + + + Sbjct: 235 SVPPLEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDMEVIEGVAALTGGKAFKAISTQELN 294 Query: 356 DAFSHIGK 363 + I K Sbjct: 295 KVYQTISK 302 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 56/172 (32%), Gaps = 36/172 (20%) Query: 232 EFFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L+ + + G TN + GL + + + N KI Sbjct: 347 PVVPLTADLDKLRTAAAQMQEWNGSGTNVSEGLSWGMRVLSPAPPYTDGAPWKTPNTSKI 406 Query: 291 IVFMTDGEN--------------------------LSTKEDQQ------SLYYCNEAKKR 318 +V +TDGEN S + D +L C++ K + Sbjct: 407 VVLLTDGENVVYGASAEPEKSDYTSYGYLSSGRFGTSNQTDAARSVDRWTLDVCDKLKAQ 466 Query: 319 GAIVYAIGIRV--IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 +Y I ++ + CA+ P +Y V +P + + F I T Sbjct: 467 QVQIYTITLQSDTAANRTLYGKCATNPADYYAVNDPSKLPNVFQTIAGKFTT 518 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 77/234 (32%), Gaps = 39/234 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F+ + G +L A+ P I +G +VS + K L + +D + + + Sbjct: 3 INKFWRSESGNFALLFALAAPAILAAVGFAADVSSVMRAKVNLQNSLDAATLSS------ 56 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 L + R+ D F ++ + T+ + + Sbjct: 57 -------SHLSDDEAARRLA-----------FDGYFQANVANHPELTNAKLTLSVDKGFN 98 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + + + F + +++HI + + L++++VLD + SM Sbjct: 99 YVKTKAIASADVN-LYFAFLFGDNQHIEVDA-----GGVEATNNLEVVLVLDNTGSMAG- 151 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---FLLEW 238 KI + +L+ + + V + +V F + F W Sbjct: 152 -----AKIKALRDATKVLLDNLDGAKSPDRKVTAAIVPFVTAVNINGDKFDPSW 200 >gi|118763676|gb|AAI28665.1| LOC557935 protein [Danio rerio] Length = 1155 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 37/260 (14%), Positives = 96/260 (36%), Gaps = 37/260 (14%) Query: 125 EGYSISAISRYKIPLKFCTFIP--WYTNSRHIVM-----------PITSSVKVNSQTDAR 171 +S+S PL C+ + ++ + T Sbjct: 89 RFFSMSVSVSPDAPLTSCSPYLPHDCDGNSYLNGICYQFNSGLQAVSNFTAAYQECTKRE 148 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++ + D SRSM+ + D+ I +++++ N+ ++ V F+ Sbjct: 149 VNLVFLFDGSRSMKP------EEFDLNKDFITDIMKKLS-----NSSIKFAAVQFATTFS 197 Query: 232 EFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F ++ + ++K++ TN+ +KY + + + + A Sbjct: 198 TVFDFNDYQLGSAEQKLQKEKHMSALTNTHGAIKYVLENLLNNVSSGANPKAQHA----- 252 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF---YL 347 +V +TDG+ ++++ L C++ + + Y IG+ + + L + AS + Sbjct: 253 LVIITDGDPTD-RDNEHVLEKCDK---QNILRYIIGVGGLANLARLTSLASEPKHNNTFF 308 Query: 348 VENPHSMYDAFSHIGKDIVT 367 +++ + ++ K I Sbjct: 309 IKDYKGLEGLLDNLQKKIYN 328 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 57.9 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 74/220 (33%), Gaps = 24/220 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 K ++T D+M+V+D+S SM E F + + + ++ +L + D + + Sbjct: 85 KSQTRTLIGRDVMVVVDLSGSMAEKDFLTPYGEKHTRLDAVKQVLAQFSQHRDGD---RL 141 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GL+ F + + + G ST+ + A D + Sbjct: 142 GLILFGDAAYLQAPFTADHETWLALLDETQVGMAGQSTHLGDAIGLAIKVFNDQADAAKQ 201 Query: 280 CNT-------EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR- 331 T +K+++ +TDG T + A R ++ I + R Sbjct: 202 AATQQGSAVVTRPVKEKVVIVLTDG--NDTDSLVPPIDAAKVAASRDIRIHMIAMGDPRT 259 Query: 332 -SHEFLR-------ACASPNSFYLVENPHSMYDAFSHIGK 363 + L A + + +P + + IG+ Sbjct: 260 VGEQALDMEVIEQVASLTGGQSFQALSPAELTRVYKKIGE 299 >gi|309271626|ref|XP_003085374.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain [Mus musculus] Length = 3284 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1283 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1331 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1332 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1391 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1392 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1448 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1449 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1500 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1501 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1551 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1552 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1599 >gi|309264114|ref|XP_003086228.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Mus musculus] Length = 2349 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1311 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1365 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1366 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1425 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1426 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1476 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1477 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1532 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1533 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1587 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1588 FRVGSVQELSE 1598 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 421 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 468 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 469 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 524 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 525 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 579 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 580 EHIAFNPSLVYLMDDFRSL 598 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 820 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 869 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 870 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 925 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 926 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 980 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 981 TFREL----GTIQQVISERVIQLNR 1001 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1077 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1125 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1126 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1185 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1186 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1242 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1243 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1294 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1295 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1345 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1346 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1393 >gi|309264110|ref|XP_003086224.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Mus musculus] Length = 3057 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1311 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1365 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1366 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1425 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1426 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1476 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1477 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1532 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1533 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1587 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1588 FRVGSVQELSE 1598 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 421 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 468 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 469 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 524 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 525 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 579 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 580 EHIAFNPSLVYLMDDFRSL 598 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 820 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 869 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 870 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 925 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 926 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 980 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 981 TFREL----GTIQQVISERVIQLNR 1001 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1077 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1125 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1126 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1185 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1186 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1242 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1243 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1294 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1295 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1345 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1346 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1393 >gi|309264106|ref|XP_003086227.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Mus musculus] Length = 3062 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1311 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1365 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1366 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1425 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1426 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1476 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1477 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1532 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1533 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1587 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1588 FRVGSVQELSE 1598 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 421 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 468 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 469 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 524 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 525 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 579 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 580 EHIAFNPSLVYLMDDFRSL 598 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 820 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 869 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 870 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 925 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 926 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 980 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 981 TFREL----GTIQQVISERVIQLNR 1001 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1077 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1125 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1126 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1185 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1186 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1242 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1243 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1294 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1295 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1345 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1346 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1393 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 78/213 (36%), Gaps = 31/213 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLDVS SM TK+ + +++ +++ + Sbjct: 224 VHLKAVTGDQISQYRRAPVDLVTVLDVSGSMGG------TKLALLKRAMGFVIQNLGSSD 277 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 278 ------RLSVIAFSSTARRLFPLTRMSDAGRQQALQAVNSLVANG-GTNIFDGLRKGAKV 330 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + D + + I+ ++DG + T +A V++ G Sbjct: 331 MEDRRERNSVAS---------IILLSDGRDTYTTNHPD---PSYKAMLPQIPVHSFGFGS 378 Query: 330 IRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 379 DHDASVMHSVSEFSGGTFSFIESESVIQDALAQ 411 >gi|309264112|ref|XP_003086226.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Mus musculus] Length = 2656 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 910 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 964 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 965 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1024 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1025 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1075 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1076 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1131 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1132 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1186 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1187 FRVGSVQELSE 1197 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 419 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 468 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 469 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 524 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 525 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 579 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 580 TFREL----GTIQQVISERVIQLNR 600 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 676 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 724 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 725 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 784 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 785 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 841 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 842 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 893 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 894 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 944 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 945 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 992 >gi|309264108|ref|XP_003086225.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Mus musculus] Length = 3263 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1283 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1331 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1332 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1391 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1392 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1448 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1449 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1500 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1501 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1551 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1552 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1599 >gi|148708139|gb|EDL40086.1| mCG12867, isoform CRA_e [Mus musculus] Length = 2555 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1283 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1331 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1332 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1391 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1392 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1448 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1449 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1500 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1501 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1551 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1552 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1599 >gi|148708136|gb|EDL40083.1| mCG12867, isoform CRA_b [Mus musculus] Length = 3261 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1283 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1331 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1332 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1391 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1392 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1448 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1449 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1500 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1501 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1551 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1552 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1599 >gi|148708135|gb|EDL40082.1| mCG12867, isoform CRA_a [Mus musculus] Length = 2656 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 910 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 964 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 965 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1024 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1025 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1075 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1076 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1131 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1132 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1186 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1187 FRVGSVQELSE 1197 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 419 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 468 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 469 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 524 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 525 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 579 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 580 TFREL----GTIQQVISERVIQLNR 600 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 676 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 724 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 725 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 784 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 785 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 841 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 842 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 893 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 894 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 944 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 945 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 992 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 69/207 (33%), Gaps = 37/207 (17%) Query: 173 DMMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+++ +D+S SME D A+K++ + + GLV F+ Sbjct: 91 DLVLAIDLSGSMEQEDFVLDGRTVSRLDAVKAVARDFVRARAGD------RVGLVIFAET 144 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + V + R I + G ST + GL A ++ Sbjct: 145 AYFAAPLTFDVEAVGRLIDQATIGISGRSTAISDGLGLALKRLARSDARS---------- 194 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----------EFL 336 +++V ++DG N + Q + A++ G V+ I + L Sbjct: 195 -RVVVLLSDGVNNA--GAVQPRDAGSLAERLGIRVHTIALGPADLETDPKSRDAVDTATL 251 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHI 361 RA A S + V + I Sbjct: 252 RAIAETSGGETFRVRTTDDLRQVARAI 278 >gi|1616950|gb|AAB16869.1| CD11b [Sus scrofa] Length = 920 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 76/212 (35%), Gaps = 32/212 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ ++D S S+ ++ + ++ ++ K + L+ Sbjct: 10 GCPQQESDIAFLIDGSGSINRL---DFQRMKEFVSTVMGQFQKSKTLF--------ALMQ 58 Query: 226 FSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S F + K ++ + + T++ G++ ++F + Sbjct: 59 YSEDFYTHFTFNDFKRNPSPKLLVRPIRQLLGRTHTATGIRKVVRELF------HSKSGA 112 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRAC 339 N KI+V +TDGE + + EA ++G I Y IG+ +S E L Sbjct: 113 RENALKILVVITDGEKFGDPLGYEDVIP--EADRKGVIRYVIGVGDAFNSWKSREELNTI 170 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIVTK 368 AS + + V N ++A I + K Sbjct: 171 ASKPCGDHVFQVNN----FEAVKTIQNQLQEK 198 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 206 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 259 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 260 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 314 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 315 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 365 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 366 TLSTLGVG 373 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 108/307 (35%), Gaps = 39/307 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--LGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 346 Query: 322 VYAIGIR 328 + G+ Sbjct: 347 LSTFGVG 353 >gi|220897449|emb|CAX15333.1| complement component 2 (within H-2S) [Mus musculus] Length = 623 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 81/238 (34%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 93 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 146 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 147 AELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 203 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 TN+ L Y+ + + + I+ +TDG N+ + Sbjct: 204 ATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 263 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 264 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 321 >gi|148694788|gb|EDL26735.1| complement component 2 (within H-2S), isoform CRA_b [Mus musculus] Length = 753 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 81/238 (34%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 TN+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|148694789|gb|EDL26736.1| complement component 2 (within H-2S), isoform CRA_c [Mus musculus] Length = 809 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 81/238 (34%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 TN+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|148694790|gb|EDL26737.1| complement component 2 (within H-2S), isoform CRA_d [Mus musculus] Length = 755 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 81/238 (34%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 TN+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|268573930|ref|XP_002641942.1| Hypothetical protein CBG16647 [Caenorhabditis briggsae] gi|229553908|sp|A8XP79|CL160_CAEBR RecName: Full=C-type lectin protein 160; Flags: Precursor gi|187026423|emb|CAP34559.1| CBR-CLEC-160 protein [Caenorhabditis briggsae AF16] Length = 635 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 69/200 (34%), Gaps = 29/200 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ V+DVS M + + M IN ++ ++ L P++ VQ GL+ +SN+ E Sbjct: 286 IDIVFVVDVSEGMG------LGGLMMVKAEINTLVGQMSLDPEIQKHVQVGLIKYSNESE 339 Query: 232 EFFLLEWGVSH--------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F + +++ + + N L+ A I M Sbjct: 340 IVFKPSEYTNEDEFTEDLWTDPRLEDVDEKTDEVNLHLALQQAAKMIGSM---------- 389 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA---CA 340 KK++V + + + D + ++ G + + S + A Sbjct: 390 RKGVKKVVVIY--AASYNDEGDDDARQIAANIRESGYEIITVAFVEPESSSLVMKIGELA 447 Query: 341 SPNSFYLVENPHSMYDAFSH 360 SP + + D Sbjct: 448 SPRMNFTSFRDELLVDELED 467 >gi|157951694|ref|NP_038512.2| complement C2 precursor [Mus musculus] gi|3986766|gb|AAC84162.1| C2 [Mus musculus] gi|15029737|gb|AAH11086.1| Complement component 2 (within H-2S) [Mus musculus] gi|74147034|dbj|BAE27452.1| unnamed protein product [Mus musculus] gi|148694787|gb|EDL26734.1| complement component 2 (within H-2S), isoform CRA_a [Mus musculus] gi|220897448|emb|CAX15332.1| complement component 2 (within H-2S) [Mus musculus] Length = 760 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 81/238 (34%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 TN+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGTNTYEVLIRVYSMMQSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|332559488|ref|ZP_08413810.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] gi|332277200|gb|EGJ22515.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] Length = 651 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 79/232 (34%), Gaps = 23/232 (9%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P T PW +R + + + + + L+++ ++D S SM+ K+ + Sbjct: 262 PSLSVTRTPWNPETRLVHVALQGRM-PAIEDRPPLNLVFLIDTSGSMQ-----DPAKLPL 315 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +S ML ++ V V +G ++ S + + L G ST Sbjct: 316 LKQSFGLMLGRLRPEDQVAIVTYAGSAG---EVLAPTA-ANQRSTILSALDRLDA-GGST 370 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 GL AY +M G + +V TDG+ NL + ++ + Sbjct: 371 AGEEGLALAYRTASEMAGAGEVTR---------VVLATDGDFNLGISDPEELARLVAHER 421 Query: 317 KRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + +G ++A A + S+ +A + + Sbjct: 422 DTGVYLSVLGFGRGNLDDATMQALAQNGNGQAAY-IDSLNEAQKVLVDQLSG 472 >gi|313222038|emb|CBY39058.1| unnamed protein product [Oikopleura dioica] Length = 1721 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 79/210 (37%), Gaps = 27/210 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S D+ VLD S S TK I + VK+ + + GL+ Sbjct: 212 SIEHPSADIFFVLDGSYS---------TKRAGWESIIEFVQRFVKINFEHGGDMNYGLLQ 262 Query: 226 FSNKIEE--FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQH 279 +S+ +E F + G + I+ LS+ G ST + ++YA F + Sbjct: 263 YSDFVEPILSFADKEGQREIAEFIEILSQIKYHSGFSTLTGTAMRYAAETEFMSE----- 317 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRA 338 N +KI++ +TDG +D + G V A+G+ E + Sbjct: 318 -RGARKNVRKIMIVVTDGRAKDI-DDNTVKIVGESLRAAGDLTVVAVGVNKAVESELVEI 375 Query: 339 CASPNSFYLVENPHSMYD----AFSHIGKD 364 +SP+ + + +++ F+ I K+ Sbjct: 376 ASSPDFVHNTKKFEDLHNFLSPVFAEICKE 405 >gi|218516298|ref|ZP_03513138.1| hypothetical protein Retl8_22869 [Rhizobium etli 8C-3] Length = 432 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/231 (16%), Positives = 77/231 (33%), Gaps = 24/231 (10%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ + + Y + P K PW ++ + + I + Sbjct: 1 PDPQSVRVEEMINYFPYDWPGPEKADQPFKATVTVMPTPWNHDTELMHVAIKG-YDIAPT 59 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +++ ++DVS SM+ K+ + + ++ +K V+ V +G Sbjct: 60 TAPHANLVFLIDVSGSMDE-----PDKLPLLKSAFRLLVNRLKADDTVSIVTYAGNAGTV 114 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + I L G ST G++ AY + ++ N Sbjct: 115 LEPTRVAE----KPKILSAIDRLEA-GGSTGGAEGIEAAY-DLAKKAFVQDGVNR----- 163 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ D+ E +K G + +G ++ L Sbjct: 164 ---VMLATDGDFNVGPSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLM 211 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 46/305 (15%), Positives = 100/305 (32%), Gaps = 37/305 (12%) Query: 47 KTVLHSMIDRSLVHAATQI---MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF 103 + H + + + + E R + + R+ +F ++ + Sbjct: 61 RVQRHDAMGAAKIAPPAVAPLPLAEPADRERYQAIERHGIQRVAEAPVSTFSIDVDTGSY 120 Query: 104 VN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIV 155 N + + ++ + + Y S + + P T I PW S + Sbjct: 121 SNLRRMLNAGQLPPRDAVRVEELVNYFPYRYS-LPQGDAPFAVDTEIAPTPWNPRSLLLR 179 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + I + Q +++ ++DVS SM S K+ + ++ + +++ Sbjct: 180 VGIQA-ADPAKQALPPANLVFLVDVSGSMNS-----PDKLPLLQNALKLFVAQLRPQD-- 231 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + LVT+++ G + + I L G + G+ AY Sbjct: 232 ----RVALVTYASGTRVVLEPTAGDRKAAITAAIDGLVPGGAT-AGAAGIDLAYR--MAE 284 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 QG +H I+ TDG+ N+ + E +K G + +G Sbjct: 285 QGFVEHGINR-------ILLATDGDFNVGITRFETLKDRVAERRKSGIALSTLGFGGGNY 337 Query: 333 HEFLR 337 ++ L Sbjct: 338 NDQLM 342 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 99/296 (33%), Gaps = 36/296 (12%) Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL-------DIVVVPQNEGYSI 129 K + + + + + L++ D+ +V+ + + Sbjct: 106 QKSDISIEKQEKIQKLDTKTMLQEQEIKPDLQQMVKDAKKPSYDLEKGLTFEIKTLNKHF 165 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + + ++ I+ VK Q+ +D++ V+D S SM Sbjct: 166 QFNNEQDCNIPIMVSVKTQDSTNDILEEQKEQVKQVEQSRPSIDLVCVIDNSGSM----- 220 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQR 245 KI ++ +L+ + + L+ F++ L ++Q Sbjct: 221 -QGEKIQNVKTTLLQLLDMLNSND------RLSLILFNSYPTLLCNLRKVDDENTPNIQS 273 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 I ++ G T+ G+ A+N + RQ N + I ++DG++ E Sbjct: 274 IINSITADG-GTDINSGMLMAFNIL----QKRQFFNPVSS-----IFLLSDGQDNGADEK 323 Query: 306 QQSLYYCNE-AKKRGAIVYAIGIRVIRSHEFL-RACA-SPNSFYLVENPHSMYDAF 358 + N+ K +++ G + R C +FY VE + + + F Sbjct: 324 IKKYINSNQSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFYYVEKINQVDEFF 379 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 86/260 (33%), Gaps = 51/260 (19%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + G S S K I + IV+P D M ++LD S S Sbjct: 26 STGLSGRVRSSVKHRSAMVFAIALLLSLAGIVVPFGQPQAAAEAKDIPPTM-LILDASGS 84 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI------------E 231 M + T++D A ++ V + + G + + K+ + Sbjct: 85 MMARDAGGQTRLDAAKEASKNFSRSV------SEESELGFMVYGTKVGNSPEEREAGCKD 138 Query: 232 EFFLLEWGV---SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LL G + ++ ++ G T P LK A ++ + Sbjct: 139 VTTLLPVGKGNAGKISGEVDKVNASGH-TPMGPALKQAAKELPNEGE------------- 184 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGA--IVYAIGIRV-IRSHEFLRAC--- 339 + IV ++DGE+ C+ AK +G + +G V + + L+ C Sbjct: 185 RSIVLVSDGEDTCAPP-----PVCDVAKDLHKQGIDLTINTVGFLVDPAARKELQ-CIAE 238 Query: 340 ASPNSFYLVENPHSMYDAFS 359 A + ++ S+ ++ Sbjct: 239 AGGGEYLDAQDAESLAESMK 258 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 48/249 (19%), Positives = 93/249 (37%), Gaps = 33/249 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ES 186 +S ISR KF FI + + P + ++ ++ +++I +D+S SM + Sbjct: 51 VSGISRATKGEKFLLFILYVLVLSALAKPNL-IGEPVTKDVSQRELLISVDLSGSMMTKD 109 Query: 187 FFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 F + I +++ + L+E K + GL+ F N + L Sbjct: 110 FVNKEGKAIDRLEAVKMVLRDFLKERKGE-------KIGLILFGNAAFVQAPFTQDLDAL 162 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + + L + G D G+ E ++++ M+DG++ +K Sbjct: 163 EHLLDSLRVGMAGPQTAMG---------DSIGLAVKMFRESNVTDRMLIVMSDGDDTGSK 213 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA--SPNSFYLVENPHSM 354 ++ A K G V+ IGI ++ + L+ A + FY N + Sbjct: 214 VPPKT--SAELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLKEIAAITGGKFYYAWNLDDL 271 Query: 355 YDAFSHIGK 363 D + I K Sbjct: 272 QDIYKQIDK 280 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 64/180 (35%), Gaps = 15/180 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + +R+ + +VLD S SME I+ + + + V+ Sbjct: 7 SIEDLENNPSSRVPVCLVLDTSGSMEG------DSINELNEGVRLFYDAVRSDETALYAA 60 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + +VTF + G S L+ + T + A + + + + Sbjct: 61 EISVVTFGGH----ASCQAGFSTLEHQPDAPQFYADGGTPMGEAMNMALDML--EKRKSE 114 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + + Y+ IV MTDG ++ + + C+ R ++ IGI + L Sbjct: 115 YKASGVDYYQPWIVLMTDGMPNGSQAELSRSIQRTCDMINDRKLTIFPIGIGEDADMDVL 174 >gi|221119984|ref|XP_002166572.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 5008 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 53/345 (15%), Positives = 123/345 (35%), Gaps = 31/345 (8%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGM-IIEVSHIFFMKTVLHSMIDRSLVHA 61 S NL +F+Y G TI +P + + ++ + + ++ D Sbjct: 4174 SENLTTEHFYYLENGNETI----PVPSLNFNETIPASDLFKALYSEKKFDTVTDLLGFSV 4229 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD--IVRSTSLDIV 119 + N+ ++K + K F+ +N ++ S + + Sbjct: 4230 SGATANKDKSKHKKGMSKKYGTHNQKEKNKSKFQINTSNNEISRSRKKRGLLDSLTENNS 4289 Query: 120 VVPQNEGYSISAISR-YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIV 177 QN+G +S K+ L + ++ N ++ ++ D++++ Sbjct: 4290 EGKQNKGQLLSIEKNILKLVLNMTNLLLSQDQIEKVLSEEKLISAGNIKSPSKFHDLILL 4349 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS +++ + + + + + +V ++ V G+V+F + L Sbjct: 4350 VDVSSAVDEL------EFTLIQQFMGDFISKVM----NDSKVSVGVVSFGQFQQMDVQLS 4399 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 +Q I + G + N T LK+ I+ R N +++ + + Sbjct: 4400 KDQQKVQLGISAMKYMGDTGNLTSALKFVDQNIYQTDQKRS-------NVQQLCIIL--- 4449 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACAS 341 Q + ++ K +G V+A+GI + ++ L AS Sbjct: 4450 -GGIPHYSQNADEIADKLKSQGVEVFAVGIGKMFKTDTLLLQIAS 4493 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 87/233 (37%), Gaps = 26/233 (11%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSV 162 + + ++ I + Y+ A P + + PW +++ + + + + Sbjct: 254 TAGQLPPADAVRIEEMINYFPYAYPA--PDGQPFQPTINVFETPWNADTQLVHIGLQGEM 311 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + Q L+++ ++D S SMES K+ + +S ML+ + +V V +G Sbjct: 312 -PSIQDRPALNLVFLIDTSGSMES-----ADKLPLLRQSFRLMLDNLAPEDEVAIVTYAG 365 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + + + + + + L+ G STN GL+ AY Sbjct: 366 STSIALQPTQASE----RTTILAALNALNA-GGSTNGQGGLEQAY--------ALAETMK 412 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHE 334 D + + ++ TDG+ D + L Y + + G + +G + Sbjct: 413 TDGDVSR-VILATDGDFNVGLSDPRGLQAYIEDKRDTGTYLSVLGFGRGNLQD 464 >gi|219847650|ref|YP_002462083.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541909|gb|ACL23647.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 419 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 71/209 (33%), Gaps = 31/209 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q +++ VLD S SM KI+ +++ +E + + +V Sbjct: 38 TQVRMPVNVCFVLDRSGSM------KGEKIERLRQAVVKAIELLDQQDSLA------IVI 85 Query: 226 FSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ E + + + + G T P ++ ++ Sbjct: 86 FDHRTEVLVPAQPVRNRAMILDLVHRIRDAG-GTRIAPAVEKGLQEL--------QKMPP 136 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 + ++ +TDG+ E L ++A + G + A+GI + + L A S Sbjct: 137 GV---RRLILLTDGQTEHENE---CLLRADDAGRLGVPITALGIGKDWNEDLLIEMANRS 190 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + P + + F H + I Sbjct: 191 KGVADYIAQPGEIVNYFQHTVQRAQQTVI 219 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + D A + ++E+ L+ G+VTF + E Sbjct: 310 VCLVLDKSGSMSAE-DRLFQMNQAAELYLIQVIEKGSLV---------GMVTFDSVAEIQ 359 Query: 234 FLLEWGV--SHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + Q+ L + T+ GLK + I Sbjct: 360 NHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C + K+ GAI++ I + + E L F+ Sbjct: 411 IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRFFA 463 Query: 348 VENPHSMYDAF 358 ++ + +AF Sbjct: 464 NKDITGLTNAF 474 >gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] Length = 684 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 49/340 (14%), Positives = 116/340 (34%), Gaps = 54/340 (15%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + + ++E + K + G +IK + +++ + ++ Sbjct: 197 ADLQGNWPSASEITSAPFIDELETQSAKAVGQGMSDAKIKQSVSIALNLGFELDTIMSPY 256 Query: 108 DDIVRS--------TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH-IVMPI 158 +I + SL N + + + + ++ F + N + +VM + Sbjct: 257 HEINQQLIGNNHYQVSLKQGTTFANRDFVLRVKPKNQAAIQAAVFKEHFENDDYALVMLM 316 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S + + +++ V+D S SM ++ A ++ L + N Sbjct: 317 PPSDEFIAAQRLPREVIFVIDTSGSMHG------ESLEQAKSALFFALANLDPQDSFN-- 368 Query: 219 VQSGLVTFSNKI-----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ F++K+ + ++ + + + L G + + A+ Q+ D Sbjct: 369 ----IIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTE-----IGLAFEQVLDN 419 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 A+Y + IVF+TDG + E + ++ IGI Sbjct: 420 SE--------HADYLRQIVFLTDGS--ISNETEVFAQIKGSLGDS--RIFTIGIG----- 462 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 ++PNS+++ F+ IG +R + Sbjct: 463 ------SAPNSYFMTRAATLGRGTFTFIGDVTDVQRTMKN 496 >gi|260793650|ref|XP_002591824.1| hypothetical protein BRAFLDRAFT_125322 [Branchiostoma floridae] gi|229277035|gb|EEN47835.1| hypothetical protein BRAFLDRAFT_125322 [Branchiostoma floridae] Length = 691 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 43/230 (18%), Positives = 84/230 (36%), Gaps = 37/230 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +S+ S+ A LD++ LD S S+++ ++ SI + + Sbjct: 219 SSRGRNSLGRLSRGSAGLDLVFALDKSSSIDA---VDFSRAIQFTTSIINEFG----VTN 271 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 Q LVTF ++ + LEW + R+++ L G T T L+ N Sbjct: 272 REGGTQVALVTFGSQAQ----LEWNLGQLDSKRKVLRQLRQLQPEGGGTALTAALQTVLN 327 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + K+ + +TDG N+ + + V+A+G+ Sbjct: 328 EVLPVA---------RVGAKRALFIITDGKSNVGASPGVFARRLRE---EEAFEVFAVGV 375 Query: 328 RVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L + AS + +L+ + + F + I K I Y++ Sbjct: 376 GANVDKNELNSVASQPFTSHVFLIND----FSNFDTLVNTIAEKEIDYEQ 421 >gi|149043682|gb|EDL97133.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 727 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGL 223 T +D++ VLD S S+ + ++A I +++ + V +G+ Sbjct: 310 CTCGPIDILFVLDSSESIG------LQNFEIAKDFIIKVIDRLSKDELVKFEPGQSHAGV 363 Query: 224 VTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +S+ + + +K L T + L+Y +++ Sbjct: 364 VQYSHNQMQEHVDMRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLL------- 416 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I + +TDG + + ++ C V ++GI+ Sbjct: 417 ----PPTQNNRIALVITDGRSDTQRDTTPLSVLCG----SDIQVVSVGIK 458 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + VV+ +V +S + Sbjct: 525 SPTDITILLDSSASVGSH---NFETTKVFAKRLAERFLSAGREDP-TQVVRVAVVQYSGQ 580 Query: 230 IEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + L + + +T+ L Y R + Sbjct: 581 GQQQPGRASLQFQQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYRENSS 631 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + + EA++ G ++ + + Sbjct: 632 GATKKRVLLFSDGNSQGATAEAIEKAV-QEAQRGGIEIFVMVVGP 675 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/251 (13%), Positives = 75/251 (29%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P R IV+ S+ ++ +VLD S S+ + + + Sbjct: 250 GHSPGELQKRKIVLDPAGSM----------NIYLVLDGSDSIGASNFTGAKRC------- 292 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIE--------EFFLLEWGVSHLQRKIKYLSKFG 254 ++ ++ + + GLVT++ + W L + K Sbjct: 293 --LVNLIEKVASYGVRPRYGLVTYATYPNVLVRVSDPKSSDANWVTEKLNQISYEDHKLK 350 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 TN+ L YN + G + +I+ MTDG + + + + Sbjct: 351 TGTNTKRALVEVYN-MMSWPGDVPPEGWNRTRH--VIILMTDGLHNMGGDPVTVINEIRD 407 Query: 315 A----------KKRGAIVYAIGIR----VIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + + V++ + D F Sbjct: 408 LLNIGKDRKNPREDYLDVYVFGVGPLVEPANINALASKKENEQHVFRVKDMEHLEDVFFQ 467 Query: 361 IGKDIVTKRIW 371 + + + + Sbjct: 468 MIDESQSLGLC 478 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 56/130 (43%), Gaps = 20/130 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM+ +KI + + ++K IP+++ +VT+S + Sbjct: 92 IVLVLDTSGSMKD------SKIKKMKNAAMEFVNKIKKIPNLD----IDIVTYSTSGYTY 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + I + G TN+ GL+ A N I D++ + N K IVF Sbjct: 142 LNNGNTEEDLLKIINSIKADG-GTNTGEGLRKA-NYILDLEKNK--------NADKSIVF 191 Query: 294 MTDGENLSTK 303 M+DG Sbjct: 192 MSDGMPTYYS 201 >gi|157838288|pdb|1BHQ|1 Chain 1, Mac-1 I Domain Cadmium Complex gi|157838289|pdb|1BHQ|2 Chain 2, Mac-1 I Domain Cadmium Complex Length = 189 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 2 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 58 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 59 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 106 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 107 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 164 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 165 FQVNN----FEALKTIQNQLREK 183 >gi|253722212|pdb|1IDN|1 Chain 1, Mac-1 I Domain Metal Free gi|253722213|pdb|1IDN|2 Chain 2, Mac-1 I Domain Metal Free gi|313507148|pdb|1BHO|1 Chain 1, Mac-1 I Domain Magnesium Complex gi|313507149|pdb|1BHO|2 Chain 2, Mac-1 I Domain Magnesium Complex Length = 190 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 81/203 (39%), Gaps = 28/203 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 3 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 59 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 60 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 107 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 108 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 165 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + V N ++A I + K Sbjct: 166 FQVNN----FEALKTIQNQLREK 184 >gi|72093926|ref|XP_787298.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] gi|115954441|ref|XP_001184332.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] Length = 504 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 65/173 (37%), Gaps = 18/173 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM KID ++ +L++V+ I +N V+ V + Sbjct: 338 VVFVIDVSGSMRG------RKIDQTKRAFTTILDDVRPIDRINIVLFESDVRVWRSNQMV 391 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 ++ +R + + G TN GL+ A + + +H N E I+ Sbjct: 392 EATSDNIAAAKRHVNRIRA-GGGTNLYDGLRNAVDLLM------EHGNGEAMPL---IIM 441 Query: 294 MTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +TDG+ S K + + ++++ FL + N Sbjct: 442 LTDGQPTSGSVKSTSEIIQRITNLIDGRLSLFSVSFGNGVDFSFLEKLSLSNQ 494 >gi|57163755|ref|NP_001009219.1| integrin alpha-E [Felis catus] gi|20530606|gb|AAM27173.1|AF420018_1 alpha E integrin [Felis catus] Length = 1160 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 80/214 (37%), Gaps = 24/214 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + A ++ I+LD S S++ A I+ M++ N Sbjct: 179 TQERQEDEAAGTEIAIILDGSGSIDP------PDFQKAKDFISNMMKNFYAKCFECNF-- 230 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV + I+ F L L +++ +++ T + +++ + IF Sbjct: 231 -ALVQYGEVIQTEFDLRDSQDALASLARVQNITQVKNVTKTASAIQHVLDNIFTPS---- 285 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--L 336 + N K+IV +TDG+ + ++ + K +G +AI + + L Sbjct: 286 --HGSRKNAPKVIVVITDGDIFGDPLNLTTVI--SSPKMQGVERFAIRVGNESTKTLKEL 341 Query: 337 RACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 + AS + V N ++ S + ++I+ Sbjct: 342 KLIASDPFERHAFTVTNYSALDGLLSKLQQNIIH 375 >gi|126173282|ref|YP_001049431.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125996487|gb|ABN60562.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 642 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 72/215 (33%), Gaps = 30/215 (13%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRS 183 Y PL P+ + P + +++ +LDVS S Sbjct: 190 AYDYPLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSGS 249 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M +S K+ + ++ + ++ V+ VV +G + + L Sbjct: 250 M-----ASADKLPLLQTALKLLTAQLSAQDKVSIVVYAGAAG----VVLDGVSGNDTQTL 300 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 ++ LS G S N G+ AY ++H N ++ TDG+ N+ Sbjct: 301 TYALEQLSA-GGSINGGQGITQAYQL------AKKHFIPNGINR---VILATDGDFNVGV 350 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + + K G + +G + ++ L Sbjct: 351 TDFDDLIALIEKEKDHGIGLTTLGFGLGNYNDQLM 385 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M S+ + + G ++ A+ LP + G ++++ + +K L +D++ V Sbjct: 1 MISIRQTAKRLRQSNTGNAMMILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVA 60 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 A + G + W+ E N + ++ + Sbjct: 61 GAYSLSYNGTAGD----------------WSARAEQEYDANRSITTGYATANDSTKGVTD 104 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 S++ + + L F + + + ++ Sbjct: 105 YGSFTQNSVTVSATMDVSLPFSSILLSTPTTINVNS 140 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGE------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + +K ++ + + ++ + V F+ F + + I+ +S Sbjct: 383 FRLMLKFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 79/231 (34%), Gaps = 29/231 (12%) Query: 136 KIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 P T I + +++ + K++++ +++ ++DVS SM+ Sbjct: 143 DEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDE-----PN 197 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 K+ + + +++E+ V+ VV +G + + + L Sbjct: 198 KLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDSTPGNEKDKILDALMNLEA- 252 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYC 312 G ST G+K AY + ++ N ++ TDG+ N+ + + + Sbjct: 253 GGSTAGAEGIKLAY-DVAKKNFIKSGNNR--------VILATDGDFNVGISSEAELVRLI 303 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS-----PNSFYLVENPHSMYDAF 358 + + G + +G + S ++ ++N Sbjct: 304 EKKRDEGIFLTVLGFGTGNYKD--SKMESLADKGNGNYAYIDNIAEARKVL 352 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 82/246 (33%), Gaps = 47/246 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + LD+++V+D S SM+ + I + ++ ++ ++N + ++ Sbjct: 409 TGKKQPLDVLVVVDRSASMKEGISQNDIPRDQAVKNALTGAGGLLQKFININAENKLSVI 468 Query: 225 T------FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN--STPGLKYAYNQIFDMQGM 276 ++++ + + W Q I V N S+ L D G Sbjct: 469 GFQGSLNYNSREGKPERISWRSIIYQPSINNNKDADVLKNWESSSALNRDDLSYKDKNGT 528 Query: 277 RQHC----------NTEDANYKKIIVFMTDG-------------------ENLSTKEDQQ 307 H D ++KI+VF++DG + Sbjct: 529 NYHAALVKADEMLNKVADDGHRKIMVFVSDGVPTFYFGSDHYRAGNGTSDASNIKSSQDG 588 Query: 308 SLYYCNEAKKR--GAIVYAIGIR-------VIRSHEFLRACASPNSFYLVENPHSMYDAF 358 + ++ KK+ +Y++G+ S L+ + + +Y + N + Sbjct: 589 TRAAIDDFKKKHPNLSIYSLGVSKDINSDTASSSPVVLKYLSGEDHYYGITNTVELEKIA 648 Query: 359 SHIGKD 364 + I +D Sbjct: 649 NKIVED 654 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 203 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 256 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 257 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDTYG-STGGEA 311 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 312 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 362 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 363 TLSTLGVG 370 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 68/191 (35%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + D A + ++E+ L+ G+VTF + E Sbjct: 310 VCLVLDKSGSMSAE-DRLFQMNQAAELYLIQVIEKGSLV---------GMVTFDSVAEIQ 359 Query: 234 FLLEWGV--SHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + Q+ L + T+ GLK + I Sbjct: 360 NHLTRITDDNVYQKITAKLPQVANGGTSICRGLKAGFQAIIHSDQSTSGSE--------- 410 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C + K+ GAI++ I + + E L F+ Sbjct: 411 IILLTDGEDNEINS-------CFEDVKRSGAIIHTIALGPSAAKELETLSNMTGGYRFFA 463 Query: 348 VENPHSMYDAF 358 ++ + +AF Sbjct: 464 NKDITGLTNAF 474 >gi|292655414|ref|YP_003535311.1| von Willebrand factor type A domain-containing protein [Haloferax volcanii DS2] gi|291372503|gb|ADE04730.1| von Willebrand factor type A domain protein [Haloferax volcanii DS2] Length = 818 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 76/209 (36%), Gaps = 41/209 (19%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+T+ + QT +++ +DVS S ES KS+ + + + Sbjct: 392 LPVTTGEGASQQT----NLVFAIDVSGSAESGMR--------VQKSVA-----LDALDQL 434 Query: 216 NNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + + G+V F+ + + L I+ L G +T+ GL A Q+ D Sbjct: 435 GDENRVGIVGFNYRAYDVAPLRPLGPNRESAADLIRRLES-GGATDIAVGLDGAAQQLGD 493 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +G I+ ++DG + Q + ++ + G V IG + Sbjct: 494 RRGT--------------IILISDGHDRF----QDAATLADQLGRDGVSVITIGTGPNPN 535 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFS 359 LRA AS ++ + + F Sbjct: 536 ERTLRAIARASGGNYLRADETDRLRILFG 564 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 208 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 261 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 262 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 316 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 317 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 367 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 368 TLSTLGVG 375 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 57.5 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 72/217 (33%), Gaps = 46/217 (21%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ LD+S +M + D S T+++ A K + N+ + GLV F + Sbjct: 91 IDIVMALDISGTMMAQ-DFSPTRLEAAKKVAAEFIN-----DRPND--RIGLVIFGGESF 142 Query: 232 EFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L L + + T GL + N++ + + + Sbjct: 143 TQCPLTTDHKVLLNLLTEVKFGMIEDGTAIGLGLANSVNRL-----------KDSKSKSR 191 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----------IRVIRSHEFLR-- 337 +++ +TDG N + L A VY IG I + ++ Sbjct: 192 VVILLTDGSNNA--GQIAPLTAAELAASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVS 249 Query: 338 -----------ACASPNSFYLVENPHSMYDAFSHIGK 363 A + ++ + S+ + I + Sbjct: 250 GDFDERTLTEIAAITKGQYFRATDNTSLSAIYDEIDQ 286 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 108/307 (35%), Gaps = 39/307 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ +F Sbjct: 69 QYSDKQALQGRLQEAPTFARAAKAKATHIAN--PGTARYQQFDDNPVKQVAQNPLATFSL 126 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 127 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 186 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 187 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 240 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 241 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 295 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 296 LELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMIKKQRESGVT 346 Query: 322 VYAIGIR 328 + G+ Sbjct: 347 LSTFGVG 353 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 53/132 (40%), Gaps = 20/132 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ + ++ + + +E ++ + G V ++ ++ Sbjct: 484 DIGFVIDGSSSVG------TSNFRTVLQFVANLSKEFEISDTD---TRVGAVQYTYEQRL 534 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + + I+ + + T++ ++YA Q+F N +K+ Sbjct: 535 EFGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF---------KKSKPNKRKV 585 Query: 291 IVFMTDGENLST 302 ++ +TDG + Sbjct: 586 MIIITDGRSYDD 597 >gi|31615649|pdb|1N9Z|A Chain A, Integrin Alpha M I Domain Mutant Length = 192 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 80/202 (39%), Gaps = 28/202 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ ++ + ++ L++ K + + + + F+ K + Sbjct: 7 DIAFLIDGSGSIIPH---DFRRMKEFVSTVMEQLKKSKTLFSLMQYSEEFRIHFTFKEFQ 63 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K +++ T++ G++ ++F++ N N KI+V Sbjct: 64 NNP------NPRSLVKPITQLLGRTHTATGIRKVVRELFNIT------NGARKNAFKILV 111 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACASP---NSF 345 +TDGE + + EA + G I Y IG R +S + L AS + Sbjct: 112 VITDGEKFGDPLGYEDVIP--EADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHV 169 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + V N ++A I + Sbjct: 170 FQVNN----FEALKTIQNQLRE 187 >gi|73958318|ref|XP_547049.2| PREDICTED: similar to integrin alpha X precursor [Canis familiaris] Length = 1149 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 78/203 (38%), Gaps = 26/203 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ K+ +K++ + + + + + ++ Sbjct: 141 DIVFLIDGSGSIS---PRDFIKMLNFVKAVMSQFQRPSTQFSLMQF------SNNFRVHF 191 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 F + S+ + +S+ G T++ ++ N++F + KI++ Sbjct: 192 TFEVFTYSSNPLALLDSVSQLGGLTHTATAIRIVTNELFSAS------KGARKDASKILI 245 Query: 293 FMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASPNSF-- 345 +TDG+ + + A G I YA+G+ +S + L AS S Sbjct: 246 VITDGQKKGDSLGYEDVIPMAEAA---GIIRYAVGVGTAFQKMQSWKELNDIASKPSHEY 302 Query: 346 -YLVENPHSMYDAFSHIGKDIVT 367 + VEN ++ D + + + I Sbjct: 303 IFKVENFDALRDIQNQLKEKIFA 325 >gi|27228596|ref|NP_758646.1| hypothetical protein pCAR1_p105 [Pseudomonas resinovorans] gi|219857018|ref|YP_002474050.1| hypothetical protein pCAR12_p105 [Pseudomonas sp. CA10] gi|26106184|dbj|BAC41624.1| hypothetical protein [Pseudomonas resinovorans] gi|219688946|dbj|BAH10037.1| hypothetical protein [Pseudomonas putida] Length = 604 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 45/206 (21%), Positives = 84/206 (40%), Gaps = 31/206 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + I+LD S SM+S D + + A+ ++ + LE + L V +G + Sbjct: 420 SRAERQSASIQILLDKSGSMKSAMD----QAEAAVYAVLSALEGLPL-------VTTGAM 468 Query: 225 TF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +F ++ +E L++ S +R IK +S+ G S G A Q + Sbjct: 469 SFPNKANDGVERCALIK---SPKERLIKAVSEGGFGAMSEGGTPLA--QALWPAAVEVLR 523 Query: 281 NTEDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + KKI+ +TDGE + T ++ L C G V +G H L+A Sbjct: 524 AKGE---KKILFVITDGEPNAGTTHAAKEFLQRCEV---SGIEVIGLGFGSANEH-ILKA 576 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKD 364 S + V ++ ++ + ++ Sbjct: 577 LFS--QYRAVGEVANLKNSLFELVRE 600 >gi|301627727|ref|XP_002943021.1| PREDICTED: complement factor B-like, partial [Xenopus (Silurana) tropicalis] Length = 705 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 85/240 (35%), Gaps = 35/240 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S KV D +++ IVLD S+S+ K + A ++ +E+V Sbjct: 218 NVDTTNLEGRSDRKVQILKDGLMNIFIVLDTSKSVGK------DKFNEAKEASILFIEKV 271 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--------VSHLQRKIKYLSKFGVSTNSTP 261 +++++++ L + HL+ + TN+ Sbjct: 272 SSYDIKPQYC---IISYASEAIPVVSLRDQDSKNADAVIEHLENFVYDSHADKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKI---IVFMTDGENL---STKEDQQSL------ 309 L Y Q+ + Q + N ++ KI I+ MTDG+ +E+ + + Sbjct: 329 ALHSIYQQLIE-QELVYKNNNNKESFMKIHNVILLMTDGKFNMGGDPREEMKVIRRFLNV 387 Query: 310 -YYCNEAKKRGAIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ ++ VY G+ ++ + +EN M + F + + Sbjct: 388 GTSKDDLREEYLDVYVFGLGSDIDQPEINDLASKKDKEVHTFHLENVDKMKEFFELMIDE 447 >gi|326674787|ref|XP_003200204.1| PREDICTED: integrin alpha-10-like [Danio rerio] Length = 1170 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 45/282 (15%), Positives = 100/282 (35%), Gaps = 57/282 (20%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFI-PWYTNSRHIVMPITSSVKVNSQTDARL--- 172 ++ +N ++ C + + I +SV + + + + Sbjct: 97 NVSRNLKNSHLGMTLTPTDSDGFLACAPLWSQECGTSLFSTGICASVTSDMEPNDVIAPT 156 Query: 173 --------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++IVLD S S+ +++ ++ +L + + P +Q G++ Sbjct: 157 AQRCTTYMDIVIVLDGSNSIYPWYE--------VQNFLSNILSKFHISP---EQMQVGVL 205 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF-------GVSTNSTPGLKYAYNQIFDMQ-GM 276 + E + EW + Q + G T + ++ A + F G Sbjct: 206 QYG----EISVHEWSLRDYQTTADVVEAAKNISRQEGRETRTAYAIQMACTEAFSPDRGA 261 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHE 334 R+ K+++ +TDGE+ ++ ++L C + R YAI + IR + Sbjct: 262 REGA-------TKVMIVVTDGESHDGEDLPEALIECEK---RNITRYAIAVLGHYIRRQQ 311 Query: 335 -------FLRACASP---NSFYLVENPHSMYDAFSHIGKDIV 366 ++ +S F+ V + ++ D +G I Sbjct: 312 DPETFINEIKYISSDPDEKYFFNVTDEAALNDIVDALGDRIF 353 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 70/225 (31%), Gaps = 38/225 (16%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F G +T+L+ + ++ G+ ++ + L + +DR+++ AT G Sbjct: 19 FIQEEDGTVTVLSFFIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATASTEAGA 78 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + F + + + DI + Sbjct: 79 ----------------RTIVEDYFAKSGQSDYLL-------AQKDGDISTTLNAAKVTAG 115 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A T++ + T++ +V +L+ ++VLDVS SM S Sbjct: 116 AELSLD------TYLMKLAGVPTLSASGTATAEVRI---PKLEAILVLDVSGSMASN--- 163 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +KI + + V + V +V FS + Sbjct: 164 --SKIQNLQTAAKDFVTTVMNSSKPGDTV-MSIVPFSFSVTPPQS 205 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/65 (29%), Positives = 28/65 (43%), Gaps = 3/65 (4%) Query: 310 YYCNEAKKRGAIVYAIGIRVI---RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 C K G +VY+IG V + L ACAS + Y + + AFS I ++ Sbjct: 515 NVCKATKTEGVVVYSIGFEVPVNGTAENQLSACASSPAHYFRASGTDIKSAFSAIAANVK 574 Query: 367 TKRIW 371 R+ Sbjct: 575 QLRLT 579 >gi|281339018|gb|EFB14602.1| hypothetical protein PANDA_010506 [Ailuropoda melanoleuca] Length = 1096 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 76/210 (36%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + + ++ + + P Q GL+ ++NK Sbjct: 152 IDVVVVCDESNSIYPW--------EAVKNFLEKFVQSLDIGP---KKTQVGLIQYANKPR 200 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ + G TN+ ++YA Sbjct: 201 VIFNLNTFKTKAEMIEATSQTYQYGGDLTNTFKAIQYA------KDFAYAAGAGGRLGAA 254 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + C+ + + I + + + ++A Sbjct: 255 KVMVVVTDGESHDGSMLKAVIDQCD---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 311 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 312 ASIPTETFFFNVSDEAALLEKAGTLGEQIF 341 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 206 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIRSA 259 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 260 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 314 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 315 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 365 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 366 TLSTLGVG 373 >gi|307196324|gb|EFN77937.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Harpegnathos saltator] Length = 2255 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 74/234 (31%), Gaps = 39/234 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNS--QTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + S+ ++ + ++ ++++M ++D S S+ + + Sbjct: 33 YEHLDDVDKMLKSKTDILSRRLKIDIDRLRNKTNQIELMFLVDASGSVGA---------E 83 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK------IEEFFLLEWGVSH---LQRKI 247 +N + + + + ++TF + +++ L ++ Sbjct: 84 NFRSELNFVTKLLSDFTVDATAARVAIITFGSPRNVTRNVDQISRHGGNDHKCYLLNKQF 143 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 ++ G T + L A + K++ +TDG + Sbjct: 144 NEIAYSGGGTYTRGALLEALAIL----------EKSREEASKVVFLITDGFSNG----GD 189 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 N K GA V+ GIR E L AS + + D+F+ Sbjct: 190 PRPAANLLKNAGATVFTFGIRTGNVEE-LHDIAS----FPGYTHSYLLDSFAEF 238 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 81/233 (34%), Gaps = 33/233 (14%) Query: 136 KIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 P T I + +++ + K++++ +++ ++DVS SM+ Sbjct: 145 DEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDE-----PN 199 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-- 251 K+ + + +++E+ V+ VV +G L + + + L Sbjct: 200 KLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG-------LVLDSTPGNEKDKILDALMNL 252 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLY 310 + G ST G+K AY + ++ N ++ TDG+ N+ + + + Sbjct: 253 EAGGSTAGAEGIKLAY-DVAKKNFIKSGNNR--------VILATDGDFNVGISSEAELVR 303 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-----PNSFYLVENPHSMYDAF 358 + + G + +G + S ++ ++N Sbjct: 304 LIEKKRDEGIFLTVLGFGTGNYKD--SKMESLADKGNGNYAYIDNIAEARKVL 354 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 77/221 (34%), Gaps = 34/221 (15%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDSSIT---KIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + D+++ +D+S SM F + T +++ + A + + + Sbjct: 90 EKSARDVVLAVDISGSMDQRDFKAADGTPKQRLEAVKDVLRAFIA-------ARDGDRMA 142 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L+ F + + L ++ + N+ G D G+ Sbjct: 143 LIIFGTRAFVQAPFTEDLQSLNGFLEQTAVGMAGPNTALG---------DAIGLGIRTFE 193 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EF 335 +++++ ++DG T + + A +G ++Y IG+ + + Sbjct: 194 SSEVDQRMMIVLSDG--ADTSSRMTPVIAASIAADKGVVIYTIGVGDPDATGEDRVDLDA 251 Query: 336 LRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 L+ A + ++ ++ ++ + + I D R+ + Sbjct: 252 LKDIANKTQGQYFFADDEAALTEVYRQI--DAQNPRVVETQ 290 >gi|258516146|ref|YP_003192368.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] gi|257779851|gb|ACV63745.1| von Willebrand factor type A [Desulfotomaculum acetoxidans DSM 771] Length = 219 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 72/204 (35%), Gaps = 14/204 (6%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + + M+ E+K P ++TF + Sbjct: 11 RLPVYLLLDRSGSMFG------EPIEAVKQGVKYMISELKKEPQAIETAYISVITFGSDA 64 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L + + +I+ T S + N FD +R+ T+ +YK + Sbjct: 65 RQDVQLTELAAFKEPQIEA-----NGTTSLGAALHILNNCFD-NEVRKSTPTQKGDYKPL 118 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + MTDGE E+ K + A+G + + L+ Sbjct: 119 VFIMTDGEPTDDWENAAREIKQKSGKVA--NIVAVGCGPDVNTDTLKKITDIVLLMSSYQ 176 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 P F + + + I + K Sbjct: 177 PEDFKQFFRWVSQSVKQASIKFTK 200 >gi|260800525|ref|XP_002595179.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] gi|229280423|gb|EEN51191.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] Length = 284 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 65/170 (38%), Gaps = 22/170 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D + S+ + ++ I+ + A L N V+ +V ++ ++ Sbjct: 1 DIVFIVDGTGSVGLE---NFERMKTFIRQLFAYL------DIGENAVRVSIVQYAAQVRT 51 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + + Q + + G T + + +A N FD+ A+ KI Sbjct: 52 EFFLDQYYDLQEAQDAVDGIEYMGGFTLTGKAIDFATNLHFDL------RKGARADVTKI 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 V +TDG ++ G + A+G+ + L A A Sbjct: 106 AVVITDGR-----SYDDVNRPARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|126733489|ref|ZP_01749236.1| von Willebrand factor, type A [Roseobacter sp. CCS2] gi|126716355|gb|EBA13219.1| von Willebrand factor, type A [Roseobacter sp. CCS2] Length = 699 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 72/210 (34%), Gaps = 32/210 (15%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARLDMMIVLDVSRS 183 Y P P+ P + + L+++ ++D S S Sbjct: 297 PYDYPTPDAGEAPFRPTVTTFQTPWNADTQLVHIALQGQMPEVAARPPLNLVFLIDTSGS 356 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M+ TK+ + +S ML++++ V V +G ++ + L Sbjct: 357 MD-----DPTKLPLLKQSFRLMLDQLRPEDQVAIVEYAGSAG---QVLVPTSASERTTIL 408 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 Q I+ L G STN GL+ AY+ M+ + ++ TDG+ Sbjct: 409 QA-IQSLGA-GGSTNGQGGLEQAYSVAEAMREDGEVNR---------VILATDGDFNVGL 457 Query: 304 EDQQSLY--YCNEAKKRGAIVYAIGIRVIR 331 + +L ++ ++ G + +G Sbjct: 458 SNPDALKDFIADK-RETGTYLSVLGFGRGN 486 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 65/161 (40%), Gaps = 17/161 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS------SITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 Q + + V D+S SM+ + + +++D+ M+ E++ + +V+ + Sbjct: 193 QGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNLT 252 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + T S K F L+ ++ IK L G TN GL+Y + +QH Sbjct: 253 TFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL-----QKQHA 307 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +TDG + +QQ E K+ G Sbjct: 308 QL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|253582981|ref|ZP_04860199.1| magnesium chelatase [Fusobacterium varium ATCC 27725] gi|251835187|gb|EES63730.1| magnesium chelatase [Fusobacterium varium ATCC 27725] Length = 632 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 75/209 (35%), Gaps = 27/209 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + ++ A R P + N+ I + + ++ V+D S Sbjct: 404 GKIRDFAFDATIRAAAPYQ----KKNKENNLMINIKKEHIRVKVREKRTGASILFVVDSS 459 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGV 240 SM +++ ++ ++L+ + G+V+F +K EE + + Sbjct: 460 GSM-----GVKKRMEAVKGAVMSLLK-----DAYEKRDRVGMVSFRRDKAEELLPITRSI 509 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 Q+K++ L G T G+ AY I MR+ D +IVF++DG+ Sbjct: 510 DLAQKKLEKL-ATGGKTPLAEGIAKAYTII--KNEMRK-----DKEVVPLIVFLSDGKGN 561 Query: 301 STKEDQQ----SLYYCNEAKKRGAIVYAI 325 + + SL + K G I Sbjct: 562 FSASGKDPVKESLEMAEKIKNEGIRAIVI 590 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 79/218 (36%), Gaps = 28/218 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 +NS+ V S+V +S L++ ++LD S SM S + ++ + ++ Sbjct: 17 NQSNSQRQVAISLSAVSESSDRSLPLNLGLILDHSGSM------SGKPMKTVKEAASYLV 70 Query: 207 EEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 E + + +V F ++ + + ++ I L G T+ G+K Sbjct: 71 EGLGPDD------RLSVVAFDHRAKVIVPNQPVDEIDGVKDAIASLKAEG-GTSIDEGMK 123 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Q+ + R I +TDGEN ++++ L A + + Sbjct: 124 LGIKQVALGKEDRVSQ----------IFLLTDGENEH-GDNERCLKLAQVAGEYNITLNT 172 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 +G + + L + A + +E P FS Sbjct: 173 LGFGNHWNQDVLESIADSVGGTLCYIEQPEQALTEFSR 210 >gi|118081930|ref|XP_414992.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 1794 Score = 57.5 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 75/223 (33%), Gaps = 28/223 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P T+ + + D+ ++D S S+ S+ K+ + I + ++ Sbjct: 461 EGPETTLIGQTVCGKFKADIGFLVDESSSIGW---SNFNKVKDFLFRIISYFPKIGP--- 514 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q + +S + F + + +K L G +T + G+ + ++F Sbjct: 515 --EGTQVAVAQYSEEPRAAFHFNQHQDRNGALKAVKELHYAGGNTKTGRGIAFMLKELF- 571 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q + +++ +TDG L A G + A+G+ Sbjct: 572 -----QPSRGMRPEFPHVLMLVTDGR-----SQDDVLPPARAAHALGIRIIAVGVSGADP 621 Query: 333 HEFLRACASPN--SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E N + + V +D F I ++++ I D Sbjct: 622 AELNDILLQQNLQNVFYVST----FDEFPQILRELIE-VICSD 659 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 112/313 (35%), Gaps = 39/313 (12%) Query: 39 EVSHIFFMKTVLHSMID--RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN 96 + S ++ L R+ AT I N G R + + + ++ ++F Sbjct: 75 QYSDKQTLQGRLQEAPTFARAAKANATHIAN--PGTARYQQFDDNPVKQVAQNPLVTFSL 132 Query: 97 ELRDNGFVN-------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT---- 143 ++ + N + + ++ +V P + P+ F Sbjct: 133 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYEL 192 Query: 144 -FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 PW + + I + S+ +++ ++D S SM S ++ + S+ Sbjct: 193 APAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSL 246 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +++E++ ++ V +G ++I + + + I L G STN G Sbjct: 247 KLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAG 301 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAI 321 L+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 302 LEMAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVT 352 Query: 322 VYAIGIRVIRSHE 334 + +G+ +E Sbjct: 353 LSTLGVGDSNYNE 365 >gi|297685094|ref|XP_002820135.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Pongo abelii] Length = 3553 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 66 RLELVFLVDDSSSVG------QINFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 116 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 117 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 168 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 169 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMAST 222 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 223 PK-----EEHCYLLHSFEEF-EALARRALHED 248 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 93/253 (36%), Gaps = 29/253 (11%) Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF---I 145 F ++ + N + + ++ + Y + P T Sbjct: 124 FSIDVDTGSYSNVRRMLNDGYLPPEDAVRLEEFVNYFNYDYQTPDSTEQPFAVNTHVFSA 183 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW +N+ + + I + Q +++ ++DVS SM S K+ + KS+ + Sbjct: 184 PWNSNAYLMEIGIKG-FEPEQQELPPSNLVYLIDVSGSMNSE-----DKLGLVKKSLKLL 237 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +E ++ VV +G + + +++ + LS G STN G++ Sbjct: 238 AQESSDQDRISIVVYAGA----SGVVLEPTKGNDRMAIEQALDRLSA-GGSTNGGAGIEL 292 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY ++ + ++ N ++ TDG+ N+ T +Q + ++ G Sbjct: 293 AY-KLAEQAFIKDGINR--------VILATDGDFNVGTINREQLIDLVERKRESGISFTT 343 Query: 325 IGIRVIRSHEFLR 337 +G +E L Sbjct: 344 LGFGSGNYNEHLM 356 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 71/200 (35%), Gaps = 28/200 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 L++ ++LD S SM I ++ +++ + + +V F ++ Sbjct: 49 PLNLGLILDHSGSMTG------KPIKTVKEAAMRLVDGLGASD------RLSVVAFDHRA 96 Query: 230 -IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + +++ I+ L G T+ G+K ++ + R Sbjct: 97 KVIVPNQPVDDIERVKQAIERLKPEG-GTSIDEGMKLGIKEVALGKDDRVSQ-------- 147 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFY 346 I +TDGEN ++++ L A + V +G + + L + A + Sbjct: 148 --IFLLTDGENEH-GDNERCLKLAQVAAEYNITVNTLGFGNHWNQDVLESIADAVGGTLC 204 Query: 347 LVENPHSMYDAFSHIGKDIV 366 +E P FS + I Sbjct: 205 YIEQPEQALTEFSRLFTRIQ 224 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 47/316 (14%), Positives = 98/316 (31%), Gaps = 52/316 (16%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + KG I A+ L I + +E + + L + + + T G Sbjct: 9 RSLHKQKGVAAIWMALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALA-VTIEDKPG 67 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + + IK+ + R E N + D ++ Y++ Sbjct: 68 AASVMAENYVRSYVRDIKSINVQAERRE-PGNSRNEEAADFIQ--------------YTV 112 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESF 187 +A + + +FIP + ++ I + ++S +D++ V D S SM + Sbjct: 113 NATTTHDSWF-ANSFIPSFDETQDIAGRSLARKYLSSVGGKNIDIVFVSDFSGSMNFDWM 171 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVV----------------------QSGLVT 225 + KID +I A+ + N V + G V Sbjct: 172 DPNGNKKIDDLKTAIRAISNKFICQDVRNEFVEGELKPVCHDQEDGYTADKLKNRVGFVP 231 Query: 226 FSNK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA--YNQIFDMQG 275 F+++ + L++ + + I +S GV + + N + Sbjct: 232 FNSRTREKRGSSVYATSQLKY-KDNYKTDISSVSYKGVDWDYWTRFRSEEIKNCATNSNF 290 Query: 276 MRQHCNTEDANYKKII 291 K+ I Sbjct: 291 CEAPRQERHLEAKRAI 306 >gi|148537043|dbj|BAF63430.1| Ca(2+)-activated chloride channel splicing variant [Rattus norvegicus] Length = 514 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 60/165 (36%), Gaps = 28/165 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIE 231 + +VLD S SM++ ++ ++ L ++ + GLVTF + +I+ Sbjct: 309 ICLVLDKSGSMDTE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSTAQIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + ++ L + T+ GL+ + I Sbjct: 359 NYLIKITNTGDYKKITGNLPQQAVGGTSICRGLEAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 IV +TDGE+ + + K GA+++ I + + E Sbjct: 410 IVLLTDGEDDLISSCFEVV------KHSGAVIHTIALGPKAAREL 448 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 76/223 (34%), Gaps = 40/223 (17%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 MP ++ T+ + +VLDVS SM + ++I +A ++ E V +PD Sbjct: 1 MPASADDTAEPVTEPP-RVQLVLDVSGSMRATDIDGRSRISVAQQA---FGEVVDALPDE 56 Query: 216 NNV-VQSGLVTFSNKIEEFFLLEW---------GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ T+ + ++ L+ + + L G T L+ Sbjct: 57 TQLGIRVLGATYRGEDKQQGCLDTQQIVPVGPVNRERAKAAVATLRPTGF-TPVGLALRE 115 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI---- 321 A + R+ IV +TDGE+ D C A++ A Sbjct: 116 AAKDLGGGTTARR------------IVLITDGEDTCAPPD-----PCQVARELAAQGTTL 158 Query: 322 -VYAIGIRV---IRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 V +G+ +R A A+ ++ + + Sbjct: 159 VVDTLGLAPDEKVRRQLLCIAAATGGTYTAATSAEDLTGRLKQ 201 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 61/176 (34%), Gaps = 26/176 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +Q L+ +V+D S SM K+ +++ +++ ++ Sbjct: 35 DIMAQVRMPLNFSLVIDHSGSM------KGAKLRNVKEAVKMVIDRLEPSD------YIS 82 Query: 223 LVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V F + ++ + ++ I + G +T S ++ + +R+ Sbjct: 83 VVIFDDSAQVIIPSMPANDPVGMKAAIDRIQDAGGTTMSLGMIQ-------SLGELRRWN 135 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + ++ +TDG + + +A G +Y +GI L Sbjct: 136 IPNAVSR---MILLTDGVTYGDTD--RCRQLARDAAAAGISIYPLGIGADWDENLL 186 >gi|290999945|ref|XP_002682540.1| predicted protein [Naegleria gruberi] gi|284096167|gb|EFC49796.1| predicted protein [Naegleria gruberi] Length = 502 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/216 (19%), Positives = 79/216 (36%), Gaps = 28/216 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPD 214 P + + +++ +VLD+S SM+ S +K+ +I ++ D Sbjct: 58 PQEEEAETTNVLKTPVNICLVLDISGSMDEPLKNRSKGSKLTACKSAIRELVTNFLTYKD 117 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 L+T+S+ + F + S I +S G STN L A + + + Sbjct: 118 T-----IHLITYSDSPKTVFTEKNKESVNLNDIDKISTEG-STNIASALHSAVDLLHNSN 171 Query: 275 GMRQHCNTEDANYKKIIVFMTDGE--------NLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 A K+I F +DG+ N+ + L +E K + + G Sbjct: 172 ----------APGTKLIAFFSDGQCNVGETNLNIFGSGLLKKLKDYSEGKDDQIHISSYG 221 Query: 327 IRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 + +L+A A +Y +E+ DAF Sbjct: 222 VGSDYDELWLQAIARTGKGEYYYLEDETYAKDAFER 257 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 70/178 (39%), Gaps = 31/178 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D D+++++D+S SM+ S++T+++ A + + + + Sbjct: 87 EVQTREDFGRDVLMLVDLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD---- 141 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + Q + L + G ST+ + Sbjct: 142 --RFGLILFG---DAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G++ + +++++ +TDG T + + A RG +Y I + Sbjct: 188 GLGIKVFEQNPQPSEQQVMIVLTDG--NDTGSFVEPVDAAKIAAARGIKIYTIAMGDP 243 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 65/161 (40%), Gaps = 17/161 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS------SITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 Q + + V D+S SM+ + + +++D+ M+ E++ + +V+ + Sbjct: 193 QGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNLT 252 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + T S K F L+ ++ IK L G TN GL+Y + +QH Sbjct: 253 TFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL-----QKQHA 307 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +TDG + +QQ E K+ G Sbjct: 308 QL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/161 (22%), Positives = 65/161 (40%), Gaps = 17/161 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS------SITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 Q + + V D+S SM+ + + +++D+ M+ E++ + +V+ + Sbjct: 193 QGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRDKAEIMINELQSVGNVSVNLT 252 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + T S K F L+ ++ IK L G TN GL+Y + +QH Sbjct: 253 TFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPGDGLRYGMVSL-----QKQHA 307 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 K +V +TDG + +QQ E K+ G Sbjct: 308 QL------KYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQ 342 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/254 (12%), Positives = 75/254 (29%), Gaps = 53/254 (20%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S++ TD + + + D+S SM + D + ++ A + + +E + Sbjct: 71 VAVARPSALVTLPVTDQTILLAM--DISGSMRA-TDIAPNRLAAAQAAARSFVEVLPSD- 126 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD- 272 + G+V F+ + I + + T G+ + + Sbjct: 127 -----TRVGVVAFAATAALIQAPTRNHDDVLAAIDRV-QLQRGTAIGSGMVLSLATLLPE 180 Query: 273 --------------------MQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY 311 ++ IV +TDGE D + Sbjct: 181 AGIDLRLLAGDGSPPADKPPPGEPTHAPVPPGSHAYGAIVLLTDGERTTGPPLDFATRL- 239 Query: 312 CNEAKKRGAIVYAIGIRVIRS---------------HEFLRACA--SPNSFYLVENPHSM 354 A G VY +G+ L++ A + ++ ++ ++ Sbjct: 240 ---AADHGVRVYTVGVGTAEGGVVGYEGWSMRVRLDEAALKSIADETRGEYFHAQSAEAL 296 Query: 355 YDAFSHIGKDIVTK 368 + +G + + Sbjct: 297 RTIYRKLGTRLTLQ 310 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 70/211 (33%), Gaps = 38/211 (18%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 1 MLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSDFIAKREGDRIGLILFADHAYLQT 57 Query: 235 LLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + +++ L G T G+ A D +++I+ Sbjct: 58 PLTLDRETVTQQLNQAVLKLIGTQTAIGEGIGLATKIFIDSDA-----------PQRVII 106 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------------------SHE 334 ++DG N + L N AK+ + +Y +G+ + Sbjct: 107 LLSDGSNTA--GVLDPLEAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQDLDEK 164 Query: 335 FLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 L+ AS ++ N + + + I + Sbjct: 165 TLQTIASTTGGQYFRARNQQDLQNIYDTINQ 195 >gi|171910783|ref|ZP_02926253.1| Protein containing von Willebrand factor (vWF) type A domain [Verrucomicrobium spinosum DSM 4136] Length = 917 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/176 (21%), Positives = 66/176 (37%), Gaps = 26/176 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +VN Q A + + ++DVS SM+ K+ + +S+ + E + + Sbjct: 530 RVNQQERAPANFVFLVDVSGSMDE-----PDKLPLVKQSLRMLTERLSTKD------RVA 578 Query: 223 LVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VT++ G S + I L G STN G++ AY Q +QH Sbjct: 579 IVTYAGSTAVILPSTAGTEKSRIIEAIDGLGA-GGSTNGAGGIRLAYEQ------AQQHF 631 Query: 281 NTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 E N ++ TDG+ + + +AK R + +G + Sbjct: 632 QKEGVNR---VILCTDGDFNVGISSPGELQKLIEEKAKSR-VFLSVLGFGAGNLKD 683 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 69/178 (38%), Gaps = 31/178 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++++D+S SM+ S++T+++ A + + + + Sbjct: 87 EVQTREAFGRDVLMLVDLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD---- 141 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + Q + L + G ST+ + Sbjct: 142 --RFGLILFG---DAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G++ + +++++ +TDG T + + A RG +Y I + Sbjct: 188 GLGIKVFEQNPQPSEQQVMIVLTDG--NDTGSFVEPVDAAKIAAARGIKIYTIAMGDP 243 >gi|296473248|gb|DAA15363.1| integrin alpha M [Bos taurus] Length = 1152 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 36/207 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ ++ + ++ + ++ K + L+ +S+ + Sbjct: 150 DIAFLIDGSGSID---PVDFERMKRFVSTVMSQFQKSKTLFS--------LMQYSDDFQT 198 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANYKK 289 F + + + + + T++ G++ ++F G R H K Sbjct: 199 HFTFNDFKRNPVPEFLVGPIRQLFGRTHTATGIRKVVRELFHSSSGARNHAI-------K 251 Query: 290 IIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFLRACAS--- 341 I++ +TDGE E + + K I Y IG R +S + L AS Sbjct: 252 IMIVITDGEKYLDPLEYSDVIPEADRKK---IIRYVIGVGDAFRSRKSRQELDTIASKPP 308 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 309 ADHVFQVNN----FEALKTIQNQLQEK 331 >gi|311251228|ref|XP_003124501.1| PREDICTED: integrin alpha-D-like [Sus scrofa] Length = 230 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 84/214 (39%), Gaps = 32/214 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V +D+ ++D S S++ ++ ++++ + ++ L Sbjct: 6 VTECPKQEIDIAFLIDGSGSIDQN---DFKQMKNFVRAVMDQFMDTSILFS--------L 54 Query: 224 VTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + +SN ++ F +H Q + + + T + G++ ++F N Sbjct: 55 MQYSNLLKTHFTFSQFQTHRSPQSLVDPIVQLKGLTYTATGIQTVVKELF------HSKN 108 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------VIRSHE 334 ++ KKI++ +TDG+ + + + +A+K I YAIG+ + Sbjct: 109 GARSSAKKILLVITDGQKYKDPLEYEDVIP--QAEKANVIRYAIGVGDAFQEHSAKQELS 166 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + S + + V+N + A S I K + K Sbjct: 167 IIGSLPSKDHVFKVDN----FAALSSIQKKLQEK 196 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 82/222 (36%), Gaps = 28/222 (12%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 S+ ++ S++ + L++ ++LD S SM ++ ++ +++++ Sbjct: 20 GSQRLMATSVSAIGETIDSRVPLNLCLILDHSGSM------KGQPVENVKRAAWLLVDKL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + +V F+++ E + H++++I LS G T+ GL+ Sbjct: 74 RDQD------RLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANG-GTSIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +TDGEN ++ + L + A V +G Sbjct: 127 EEL----------AKGRKDTISQAFLLTDGENEH-GDNNRCLKFAQLAADYNLTVNTLGF 175 Query: 328 RVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L A S +E+P D F + + T Sbjct: 176 GNNWNQHILEKISDAGLGSLSHIEHPDQAMDKFDSLLTRMQT 217 >gi|91201645|emb|CAJ74705.1| hypothetical protein kuste3942 [Candidatus Kuenenia stuttgartiensis] Length = 336 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 61/174 (35%), Gaps = 31/174 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L+++ VLDVS SM + D +++ A I ++ + + GLV F+ Sbjct: 87 EKEGLEIVFVLDVSMSMLAE-DVKPNRLECAKMEIANLVRGL-------EDDRVGLVVFA 138 Query: 228 NKIEEFFLLEWGVSHLQ-------RKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + F LL + + + Y+ TN L A + G Sbjct: 139 AR--AFSLLPYPTKDYEMVFLRILNMVNEHYVRFVPYGTNIGNALIAAMETFSNEAG--- 193 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 KKII+ +TDGE + Q +K Y IGI + Sbjct: 194 ---------KKIIILLTDGEEQLLRRSQVVEAIRLLLEKNDISTYIIGIGDPNN 238 >gi|221132796|ref|XP_002166108.1| PREDICTED: similar to fibrillar collagen [Hydra magnipapillata] Length = 2213 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 79/214 (36%), Gaps = 38/214 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+++VLD S + + AMLE + + + NV ++++S+ E Sbjct: 656 LDILLVLDDS------IKTGQENFKKSKDFSKAMLEWLSIDQNNTNV---AVISYSDIAE 706 Query: 232 ---------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCN 281 L+ + LQ KI + G ST+ L A ++F Sbjct: 707 LHISFPVSGSDDPLQ-SLYDLQGKIDSIPYKGGSTSRLDRALSLASTRVFPEG------- 758 Query: 282 TEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRG------AIVYAIGIRVIRSH 333 N KK+I+ TDG + E D S + K+ G + ++ + + Sbjct: 759 KRTRNAKKVIILFTDGSTDVSSERLDVASWPLRKQKKRDGENEVNAIRIMSVTVSNKTNS 818 Query: 334 EFLRACASP---NSFYLVENPHSMYDAFSHIGKD 364 L SP + + + ++++ I ++ Sbjct: 819 NGLANVLSPPFIENTFTAADYDDIFNSIQQIAEE 852 >gi|110629868|gb|ABG80450.1| fibrillar collagen [Hydra vulgaris] Length = 1883 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 79/214 (36%), Gaps = 38/214 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+++VLD S + + AMLE + + + NV ++++S+ E Sbjct: 326 LDILLVLDDS------IKTGQENFKKSKDFSKAMLEWLSIDQNNTNV---AVISYSDIAE 376 Query: 232 ---------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCN 281 L+ + LQ KI + G ST+ L A ++F Sbjct: 377 LHISFPVSGSDDPLQ-SLYDLQGKIDSIPYKGGSTSRLDRALSLASTRVFPEG------- 428 Query: 282 TEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRG------AIVYAIGIRVIRSH 333 N KK+I+ TDG + E D S + K+ G + ++ + + Sbjct: 429 KRTRNAKKVIILFTDGSTDVSSERLDVASWPLRKQKKRDGENEVNAIRIMSVTVSNKTNS 488 Query: 334 EFLRACASP---NSFYLVENPHSMYDAFSHIGKD 364 L SP + + + ++++ I ++ Sbjct: 489 NGLANVLSPPFIENTFTAADYDDIFNSIQQIAEE 522 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 74/213 (34%), Gaps = 40/213 (18%) Query: 99 RDNGFVNDIDDIVRSTSL-----DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + ND ++I + + + + YS + K + + Sbjct: 177 PEASIFNDDENIDPQSETVDDHNAVTNSVEIKTYSEFPAIQKSERRKVFAILIHLKAPKS 236 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + A LD++ VLDVS SM K+ + ++++ +++ + Sbjct: 237 LD---------SVSSRAPLDLVTVLDVSGSMSGI------KLSLLKRAMSFVIQTLGPND 281 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +V FS+ + F L G + I L G TN LK Sbjct: 282 ------RLSVVAFSSTAQRLFPLRRMTLTGRQQALQAISSLVASG-GTNIADALK----- 329 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 +G + + N I+ ++DG++ + Sbjct: 330 ----KGAKVVKDRRRKNPVSSIILLSDGQDTHS 358 >gi|326430083|gb|EGD75653.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 4350 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 74/191 (38%), Gaps = 20/191 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++VLD S S+ + +T A ++ P + +V FS + Sbjct: 2634 DVVLVLDSSASLHQSGWADVTDFASA---------FLEAFPSDTTHARVAVVVFSTRASL 2684 Query: 233 FFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + L+ ++ L +T + L++ + + N Sbjct: 2685 VADFDDYLGDLRGLASVVETLPFENGATATDRALRFVRQNLISSVDAGRRSNVGS----- 2739 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +TDGE + +ED Q +E GA +YA+G + S L P+ + VE Sbjct: 2740 VVITVTDGEPSAFQEDVQQRV--DELVAVGAQLYAVGAGDLVSDSTLNLLG-PDGVFRVE 2796 Query: 350 NPHSMYDAFSH 360 + ++D + Sbjct: 2797 STRWLFDLLAD 2807 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 87/222 (39%), Gaps = 20/222 (9%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 PLK + +P+ T + + S D+++VLD S S+ +++ + Sbjct: 2828 PLKSVSSLPFTTEAITSSAVVPLCASDPSADVLYQDVVLVLDSSASLLEEGWAAVAEFGA 2887 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ---RKIKYLSKFG 254 ++ P + +V FS + + + L+ ++ L Sbjct: 2888 L---------FLEAFPSDTTHARVAVVVFSTRASLVADFDDYLGDLRGLASVVETLPFEN 2938 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 +T + LK+ ++ G+ + N +++ +TDGE + +ED Q +E Sbjct: 2939 GATATDRALKFVRERLVTATGVGRRSNVGS-----VVITVTDGEPSALQEDVQQRV--DE 2991 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYD 356 GA +YA+G + S L P+ + V++ + + D Sbjct: 2992 LVAVGAQLYAVGAGDLVSDSTLNLLG-PDGVHHVDDVNHLQD 3032 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 70/201 (34%), Gaps = 26/201 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 MM+ +D+S SME+ I++ +L E L + + GL+ F + Sbjct: 30 MMVAVDLSGSMEAKDFVDQQGINVRRIDGVKLLLESFLQQRTGD--RVGLIAFGDDAYLQ 87 Query: 234 FLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L ++ + G T + A N +H + + K++ Sbjct: 88 APVTEDFDTLSLLLEQMDVRMAGAGTALGDAIGVAVNHF-------EHSESNN----KVL 136 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------VIRSHEFLRACA-SP 342 + +TDG+ T + + A +RG +Y I I I R + + Sbjct: 137 LLLTDGK--DTTSQFPPVDAAHFAGERGVTIYPIAIGDATNVGEEAIDLEMLARIASYTG 194 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 + + ++ + + + Sbjct: 195 GQVFEALDGDALAAVYQTLNE 215 >gi|297199802|ref|ZP_06917199.1| lipoprotein [Streptomyces sviceus ATCC 29083] gi|197710264|gb|EDY54298.1| lipoprotein [Streptomyces sviceus ATCC 29083] Length = 506 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 38/191 (19%), Positives = 73/191 (38%), Gaps = 26/191 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V T S N+ + V+D+S SM S ++D+A +S++ M E ++ Sbjct: 144 VGLATRSAGENADERPPAALTFVIDISGSM-----SEPGRLDLAQRSLDTMTERLRDDDS 198 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 V LVTFS++ + + + I L STN G++ Y Sbjct: 199 VA------LVTFSDRARKVLPMTRLGGHRDRIHEAIDGLEPT-YSTNLGAGVETGYK--T 249 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRV 329 ++G+R+ +V ++D N + L + A ++ G ++ +G+ Sbjct: 250 AVEGLRKGATNR-------VVLISDALANDGETDPDAILERIDTARREHGITLFGVGVGS 302 Query: 330 IRSHEFLRACA 340 + A Sbjct: 303 DYGDALMERLA 313 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 74/202 (36%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD+S SM K+ +++ +L +++ + ++ FSN+I Sbjct: 262 VVFVLDISASM------VGAKLQQTREALVTILNDLRPQD------RFNIIGFSNRIKMW 309 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + + +LS G T+ L+ A + + ED + Sbjct: 310 KDHLLPVTPDNIRNGKIYMYHLSPTG-GTDINGALQTAIKLL---NNYVAQNDIEDRSVS 365 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEF-----LRACA 340 +I+F+TDG+ + +L + K+ ++ +GI L C Sbjct: 366 -LIIFLTDGKPTF--GETNTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCG 422 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E + + I Sbjct: 423 LTRRVHEEEKAGAQLIGFYDEI 444 >gi|307591436|ref|YP_003900235.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986290|gb|ADN18169.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 441 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 20/173 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S + V+D S SM+ +K + I+S+ ++ +L D + ++ Sbjct: 35 SNSRPSTSFAFVIDTSGSMDEVVTGGKSKKSIVIESLYQLVRSGRLTQDDH----IAIIE 90 Query: 226 FSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ L L+ I L+ F T G+ A + + Sbjct: 91 FHDQASTLIGLTPATQVFQLENAIARLNDFSGGTCMGKGMNEALVLLTNQ---------- 140 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 +++++F TDGE ++ + + +G + A+G+ + + L Sbjct: 141 SMTSRRVLIF-TDGETFDEEDCEI---IAQQFSNQGISITAMGVGDEFNEDLL 189 >gi|90403616|ref|NP_001035046.1| integrin alpha M [Bos taurus] gi|74229861|gb|AAX46797.1| integrin alpha M [Bos taurus] Length = 1152 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 75/207 (36%), Gaps = 36/207 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S++ ++ + ++ + ++ K + L+ +S+ + Sbjct: 150 DIAFLIDGSGSID---PVDFERMKRFVSTVMSQFQKSKTLFS--------LMQYSDDFQT 198 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDANYKK 289 F + + + + + T++ G++ ++F G R H K Sbjct: 199 HFTFNDFKRNPVPEFLVGPIRQLFGRTHTATGIRKVVRELFHSSSGARNHAI-------K 251 Query: 290 IIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGI-----RVIRSHEFLRACASP- 342 I++ +TDGE E ++ + K I Y IG+ E + P Sbjct: 252 IMIVITDGEKYLDPLEYSDAIPEADREK---IIRYVIGVGDAFRGRKSRQELDTIASKPP 308 Query: 343 -NSFYLVENPHSMYDAFSHIGKDIVTK 368 + + V N ++A I + K Sbjct: 309 ADHVFQVNN----FEALKTIQNQLQEK 331 >gi|327270786|ref|XP_003220169.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 975 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 99/294 (33%), Gaps = 42/294 (14%) Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDI-VVVPQNEGYSISAISRYKIPLKFCTFIPWYT 149 + + ++ ++ +I QN+ + + + Y Sbjct: 227 KQTASASIMYMQSLSSAMQFCNKSNHNIKATNMQNKQCNFRST------WEVIMNSSDYA 280 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +S I P ++ QT R+ + +VLDVS SM ++ +I ++ L ++ Sbjct: 281 SSFPINSPPSAPAISLLQTHDRV-VCLVLDVSGSMTTY-----DRIARLKQAAELFLLQI 334 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAY 267 G+VTF++ L S + YL + Sbjct: 335 IETGSW-----VGIVTFNSYATRQIGLRQITSDSVRESLKNYLPTSAGG---GTIICSGV 386 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIG 326 Q F ++ + +T+ IV +TDGE+ S C E + G+I++ I Sbjct: 387 RQGFQVKQKKYQTSTKGCE----IVLLTDGEDNSVSS-------CFAEVQSSGSIIHTIA 435 Query: 327 IRVIRSHEFLRAC-ASPNSFYLVENPHS---MYDAFSHIGK---DIVTKRIWYD 373 + + E + + + + DAFS I DI + I + Sbjct: 436 LGPNAAKELEMLADMTGGLKFSATDSLDSNGLIDAFSRISSESGDISQQSIQLE 489 >gi|308497646|ref|XP_003111010.1| CRE-CUTL-23 protein [Caenorhabditis remanei] gi|308242890|gb|EFO86842.1| CRE-CUTL-23 protein [Caenorhabditis remanei] Length = 801 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 63/168 (37%), Gaps = 22/168 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D LD+ +V+D S S++ F S K + +L + + P V+ L+T+ Sbjct: 257 GSDKPLDLALVVDASESLDHLF-SDQKKF-----LVERVLGNINIHP---EAVRVALITY 307 Query: 227 SNKI----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L + +Q +K + +T + L A++ + + Sbjct: 308 SGQAFVHFKFNSFLYGNNTSVQGFVKNIRSIKGTTATNVALMDAFDLLTS----KDPSTG 363 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 K+ + +TDG + + + G I+ A+ + Sbjct: 364 VREGVPKMALVLTDG-----HSHKSPKAISEKMRAAGIIMIAVSVTPR 406 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 46/139 (33%), Gaps = 17/139 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ +LD S S+ D I+ + +K + + + L+ FS Sbjct: 48 APPLDLIFILDSSGSLR----------DKFQDEIDIIRRILKHVTIGKSATRVMLIQFSG 97 Query: 229 KI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E F L + L T ++A Q+ + D + Sbjct: 98 TQHLEFNFEKFTDREELLAALDVLRHVSGITRIGGAFEFALQQLKSPGSGLR-----DGS 152 Query: 287 YKKIIVFMTDGENLSTKED 305 KI+ ++DG +D Sbjct: 153 VPKIVYLLSDGRTHDYPKD 171 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 64/188 (34%), Gaps = 26/188 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + + + A L +++I + V +++ Sbjct: 309 VCLVLDKSGSM-------AAEDRLFRMNQAAELYLIQIIEKGSLVGMVTFDSYAQIQNNL 361 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + ++ + + T+ GLK + I I+ Sbjct: 362 IKITEDNTYQKITANLPQEASGGTSICNGLKAGFQAISQSNQSTLGSE---------IIL 412 Query: 294 MTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVEN 350 +TDGE+ C E ++ GAI++ I + E L + FY + Sbjct: 413 LTDGEDNQIS-------LCFEEVRQSGAIIHTIALGPSAEKELETLSNMTRGHRFYAHND 465 Query: 351 PHSMYDAF 358 + + DAF Sbjct: 466 INGLIDAF 473 >gi|262198293|ref|YP_003269502.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262081640|gb|ACY17609.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 419 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 67/203 (33%), Gaps = 29/203 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S +++ +V+D S SM ++ AI + ++E++ + ++ Sbjct: 28 ESSARMPVNLALVIDRSSSMRG------PRLASAIVAARQVVEQLDERD------RLSVI 75 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F F ++ ++ GV TN G+K + +R Sbjct: 76 AFDATARTIFGPMSVTDEARQTLEQALAGLRTGVGTNLAAGMKKG------AEAVRSGFV 129 Query: 282 TEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + +V +TDG+ ++ + + RG + +G+ E L A Sbjct: 130 RGALSR---LVLLTDGQPSLGITDNDRLCALAQKEADRGVTITTMGLGQGFDDELLADLA 186 Query: 341 SPNS---FYLVENPHSMYDAFSH 360 YL + + AF Sbjct: 187 HSGRGGFHYLA-SAADIPGAFGR 208 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 43/234 (18%), Positives = 82/234 (35%), Gaps = 44/234 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P K +D ++I LDVS SM + + +A I V+ P Sbjct: 73 LSGPYLVERKQVVVSDPP-TIVIALDVSPSMGAMDIPGKQRFQVARDVIREF---VRSYP 128 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLV F + V + +++ + F + + G+ + + Sbjct: 129 HMA----VGLVLFGKEAMLEVPPTIDVEYFLERLEAVRLFSLGDGTALGMGVGISLL--- 181 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----- 328 H + +A++ + +V +TDG+N + + ++ AK V+ +G+ Sbjct: 182 -----HLSRVNASF-RAVVILTDGKNTTGEILPETA--AEMAKDLDIPVFTVGVGSDLPV 233 Query: 329 -----------VIRS-------HEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 E LR A S F+ P S++ F +IG Sbjct: 234 SLDVIDPSTGTRYAGVLEEGYDEETLRRMAEMSGGQFFSGYTPTSLHRIFQYIG 287 >gi|163760702|ref|ZP_02167782.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] gi|162282024|gb|EDQ32315.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] Length = 668 Score = 57.1 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/315 (13%), Positives = 107/315 (33%), Gaps = 31/315 (9%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST----- 114 A +R + + + + +F ++ + + + Sbjct: 186 DGANPAPGAEAERDRVEGFDSNGVRSVAEYPVSTFSADVDTASYAMVRRALKQGVMPDPR 245 Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 ++ I + Y A + P + T PW N+R + + + V + Sbjct: 246 TVRIEEMVNYFNYDYPAPESVETPFRATVTVTPTPWNANTRLLHIGVKG-YDVKPAARPQ 304 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++++DVS SM+ K+ + + ++++++ V+ V +G + Sbjct: 305 ANLVLLVDVSGSMQE-----TDKLPLLKSAFRLLIQKLEPEDTVSIVTYAGDAG---TVL 356 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 E + + + L G ST G++ AY + R + + Sbjct: 357 EPTPASD-KAKILDALDDLRP-GGSTAGAAGIEEAYR---LAEKARVNGGVNR------V 405 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVE 349 + TDG+ N+ +D E ++ G + G + + ++ A + Sbjct: 406 LLATDGDFNVGASDDDALKSLIEEKRESGVFLSIFGFGQGNYNDQLMQTLAQNGNGVAAY 465 Query: 350 NPHSMYDAFSHIGKD 364 ++ +A + ++ Sbjct: 466 -IDTLAEAEKTLAQE 479 >gi|148708137|gb|EDL40084.1| mCG12867, isoform CRA_c [Mus musculus] Length = 2207 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 51/348 (14%), Positives = 108/348 (31%), Gaps = 65/348 (18%) Query: 41 SHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + F K + + + R +Q+ G + +KN + + + + Sbjct: 1283 LNAHFSKDEVQNAVRRLRPKGGSQV-----------YIGNALEYVLKNIFQRPLGSRIEE 1331 Query: 101 NGFVNDI-------DDIVRSTSLDIV-----------VVPQNEGYSISAISRYKIPLKFC 142 + DD V +++++ Q E IS Y + Sbjct: 1332 GVPQFLVLISSGKSDDEVDDSAVELKQFGVAPLTIARHTDQEELVKISLSPEYVYSVSTF 1391 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLDVSRSMESFFD 189 +P + ++ PIT+ D++ ++D S +++ Sbjct: 1392 RELPRL--EQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADIVFLIDSSDAVKPDG- 1448 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 I ++ ++ + + P V+ G+V FSN + F L+ S + I Sbjct: 1449 -----IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAI 1500 Query: 248 KYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L G S N+ L++ +F ED + +++F+ K Sbjct: 1501 RRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLVLFL------GGKSQD 1551 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 + G + IG R I + P + V + Sbjct: 1552 DVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREFREL 1599 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/311 (13%), Positives = 104/311 (33%), Gaps = 43/311 (13%) Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSIS 130 + + + + +F ++ + N + ++ + + Y Sbjct: 47 QHIDDNAIVQAAQQPISTFSIDVDTGSYSNVRRFLSAGSLPPVDAVRVEELINYFRYDHP 106 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--KVNSQTDARLDMMIVLDVSRSMESFF 188 A P T + ++ ++ + + +++ +DVS SM++ Sbjct: 107 A-PTNDKPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDA-- 163 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRK 246 K+ + S+ ++ +++ + LVT+ + + + Sbjct: 164 ---PDKLPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPGDQQGRIVEA 214 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKED 305 I L + G ST G++ AY + +R N I+ TDG+ N+ + Sbjct: 215 IDSL-QSGGSTAGASGIELAY-KAAQQGYLRGGINR--------ILLATDGDFNVGVTDF 264 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL---RACASPNSFYLVENP--------HSM 354 +E ++ G + +G ++ L A A ++ ++ P H + Sbjct: 265 DALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTHEL 324 Query: 355 YDAFSHIGKDI 365 + I +D+ Sbjct: 325 GATLATIARDV 335 >gi|326444122|ref|ZP_08218856.1| von Willebrand factor, type A [Streptomyces clavuligerus ATCC 27064] Length = 519 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 76/249 (30%), Gaps = 43/249 (17%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A+ R + ++ + +V + +S + ++ +LD S SM Sbjct: 289 AVPRSPALREPIGNALYFPGRQEVVDRLVASYDTARRAGPA-RVLFLLDFSGSMR----- 342 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VS 241 I+++ + + D ++G ++ E +L +G Sbjct: 343 -----GAGIRALRTTFDGLSGADDS----RTGKFARFHRGETLTVLRFGGKVLERRTVTY 393 Query: 242 HLQRKIKYLSK------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 +R ++ L F +T GL AY ED IV MT Sbjct: 394 RGERDLERLRSLVASEGFDTTTAIWSGLDAAYRTA-------AGMLREDPARPLSIVLMT 446 Query: 296 DGENLSTKEDQQSLYYCNEAKK---RGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVENP 351 DG N + + A+ YA+ + E RA A+ Sbjct: 447 DGRNNAGMS-LDAFLRAQRARTGPVASVRTYAVRYGEADADELGRAARATGGRLLDATE- 504 Query: 352 HSMYDAFSH 360 + DAF Sbjct: 505 RPLLDAFKE 513 >gi|297270401|ref|XP_002800091.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Macaca mulatta] Length = 3386 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 69/212 (32%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ + + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDDSSSVGE------VNFRSELMFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEW-GVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ ++ I +S G T + + A + Sbjct: 132 YVVPRVDYISTRRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K++ +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVVFLITDGYSNG----GDPRPIAASLRDSGVEIFTFGIWQGNIRELNDMASA 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|294815777|ref|ZP_06774420.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] gi|294328376|gb|EFG10019.1| von Willebrand factor [Streptomyces clavuligerus ATCC 27064] Length = 568 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 76/249 (30%), Gaps = 43/249 (17%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A+ R + ++ + +V + +S + ++ +LD S SM Sbjct: 338 AVPRSPALREPIGNALYFPGRQEVVDRLVASYDTARRAGPA-RVLFLLDFSGSMR----- 391 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VS 241 I+++ + + D ++G ++ E +L +G Sbjct: 392 -----GAGIRALRTTFDGLSGADDS----RTGKFARFHRGETLTVLRFGGKVLERRTVTY 442 Query: 242 HLQRKIKYLSK------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 +R ++ L F +T GL AY ED IV MT Sbjct: 443 RGERDLERLRSLVASEGFDTTTAIWSGLDAAYRTA-------AGMLREDPARPLSIVLMT 495 Query: 296 DGENLSTKEDQQSLYYCNEAKK---RGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVENP 351 DG N + + A+ YA+ + E RA A+ Sbjct: 496 DGRNNAGMS-LDAFLRAQRARTGPVASVRTYAVRYGEADADELGRAARATGGRLLDATE- 553 Query: 352 HSMYDAFSH 360 + DAF Sbjct: 554 RPLLDAFKE 562 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 61/200 (30%), Gaps = 28/200 (14%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----SGLVTFSNKIE- 231 VLD S SME ++ +++ + + + ++TF++K+ Sbjct: 370 VLDTSGSMEG------PRLAALQQALTGLTGADDSLSGRFARFRAREQVTIITFNDKVTA 423 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 G + L+ Y + G +T L AY T Sbjct: 424 TRQFTVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKADPSALTS-- 481 Query: 286 NYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-P 342 IV MTDGEN + Y RG +A+ + S Sbjct: 482 -----IVLMTDGENNRGLDSAGFLARYNTRPPDVRGVRTFAVDFGDADRAALTQIATSTG 536 Query: 343 NSFYLVENPH-SMYDAFSHI 361 + + P S+ D F I Sbjct: 537 GAVFDATAPGVSLSDVFREI 556 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ A + + ++K + Sbjct: 79 GTKIETVKREGVDIVFAIDVSKSMLAE-DVAPNRLEKAKRIAFETISQLKGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V ++ L S + ++ ++ T ++ A N Sbjct: 131 VGIVAYAASAYPQLALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAIRMASNYF------- 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 E+ +++ +TDGE+ + + EA+++G +Y IGI + Sbjct: 184 ----DENTPTARLLFILTDGEDH----EMGATEIATEAQEKGVHIYTIGIGTEKG 230 >gi|308472959|ref|XP_003098706.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] gi|308268306|gb|EFP12259.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] Length = 399 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 70/185 (37%), Gaps = 14/185 (7%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+++V+D S M + ++++ +I SI + + + GL+T++ + Sbjct: 45 LDVVLVVDNSHGMTNEGLANVSS---SILSIFGNGTRIGTNLTEHRTTRVGLITYNAEAT 101 Query: 232 EFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L +H+ + +S S + GLK AY+ + + + Sbjct: 102 QIADLNVLQSFFNLTNHVNSSLAEVSNSTWSFDK-VGLKAAYDLLQNQSFPPNS----RS 156 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +Y+K+++ E+ + K G + +G E L +SP Sbjct: 157 HYQKVVILFASDSQAQNSEELDPYPMDYQLKDAGVKIVTVGYGNETLLERLSNISSPEYA 216 Query: 346 YLVEN 350 + + Sbjct: 217 FDGYD 221 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSR--HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME- 185 +S IP ++ +++ + + + +D++ VLDVS SM Sbjct: 1 MSVAPVKVSTTPIFPTIPRAQTNKDFQVLLRVEAPPAADLNGHVPIDVVAVLDVSGSMND 60 Query: 186 -----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL---- 235 + T++D+ S+ ++ ++ + +V F++ ++E+ Sbjct: 61 PVAASPESNLQATRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLD 114 Query: 236 -LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G S +KI L G S L + + RQ + I+ + Sbjct: 115 VSGDGRSIAGKKIDRLQARGG---SGSALMLELQEAVKILDERQGNSRNRVG---FILLL 168 Query: 295 TDGENL 300 TDG++ Sbjct: 169 TDGDDT 174 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 75/225 (33%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + RL+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSDRLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + ++ +++ ++D S SM+ ++ + + Sbjct: 214 LAPSPWNAQRTLLKVDVQAR-DMQTRDLPPANLVFLIDTSGSMQP-----AERLPLIQSA 267 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 268 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 322 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 323 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 373 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 374 TLSTLGVG 381 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 75/225 (33%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + RL+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSDRLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 75/225 (33%), Gaps = 38/225 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V+ + RL+++ ++D S S+ + + + + + P V+ Sbjct: 69 FRSRVRRLRELSDRLELVFLVDESSSVGQTN---------FLNELKFVRKLLSDFPVVST 119 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +VTFS+K +++ + L R+I ++ G T + + A Sbjct: 120 ATRVAIVTFSSKNNVVARVDYISTSRAHQHKCALLSREIPAITYRGGGTYTMGAFQQAAQ 179 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N K+I +TDG + + G ++ GI Sbjct: 180 ILRHS----------RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIW 225 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E ++P E + +F + + + + D Sbjct: 226 QGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALARRALHED 264 >gi|148645283|gb|ABR01165.1| complement factor B [Ovis aries] Length = 761 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 79/222 (35%), Gaps = 30/222 (13%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + N + + ++ + + GLVT++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKPKYGLVTYATEPKV 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTE 283 + +W L R K TN+ L YN + D+ +++ N Sbjct: 321 LIKVFDPKSSEADWVTEQLNRINYADHKLKAGTNTKRALLEVYNMMSRDINNLKETWNRT 380 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGAIVYAIGIRVIRSH 333 +I+ MTDG + + ++ ++ +Y G+ + + Sbjct: 381 RH----VIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGVGPLVNQ 436 Query: 334 EFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E + A AS + ++ ++ D F + + T + Sbjct: 437 ENINALASKKDKEQHVFKLQGMENLEDVFVQMLDESRTLGLC 478 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 52/158 (32%), Gaps = 24/158 (15%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN---QIFDMQGMRQHCNTEDANYK 288 L ++L+ +K L+ G + + Y + + G + + Sbjct: 293 PVTPLTGDFAYLKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPVSDSVH 352 Query: 289 KIIVFMTDGENLSTKEDQQSLYY---------------------CNEAKKRGAIVYAIGI 327 K++VFMTDGE + + + C KK G +Y + Sbjct: 353 KVMVFMTDGEMNTKYDPNDKFDWICSQTQSSACNAFATAARQTACTAMKKSGIEIYTLSY 412 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 +R CA+ + + +P ++ + I I Sbjct: 413 SADADVVNIRNCATNTAHFFTASPATIKTVYETIAAAI 450 Score = 42.8 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 70/224 (31%), Gaps = 35/224 (15%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + + G + I+ + + L +G I+ + +T L + D Sbjct: 1 LSRLIDDRDGAVAIIVILVAVPMLLAVGASIDYIRAYNGRTELQAAADS----------- 49 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 K K G I T N F++ + + + V Sbjct: 50 -AVLAAAAKYKSGMPEATIAKTI----------NAFLSANGEFETAVAGKPQVASDESEL 98 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + TF+ N + + + I S + +L++ +VLDVS SM Sbjct: 99 CLDVADAV-----PTTFMKLA-NIQSVPISIRSCAALPGVK--QLEIALVLDVSSSMIEE 150 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + ++ L+ + + + +V FS+++ Sbjct: 151 -----NRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVN 189 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 81/216 (37%), Gaps = 22/216 (10%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 +L V Q+ I+A +K+ PW + + + + + + + Sbjct: 81 ALQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQAR-DMQVRDLPPAN 139 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM+ ++ + ++ ++ +++ ++ V +G + Sbjct: 140 LVFLIDTSGSMQP-----AERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 190 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + ++ I L +G ST GL+ AY Q + N I+ Sbjct: 191 STAGNNTTAIKAAIDNLDAYG-STGGEAGLRLAYEQ------AEKGFIKGGVNR---ILL 240 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 TDG+ NL + + + +++G + +G+ Sbjct: 241 TTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVG 276 >gi|90019771|ref|YP_525598.1| inter-alpha-trypsin inhibitor domain-containing protein [Saccharophagus degradans 2-40] gi|89949371|gb|ABD79386.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 763 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 44/317 (13%), Positives = 105/317 (33%), Gaps = 21/317 (6%) Query: 31 FLVLGMIIEVSH-IFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNT 89 G + + I + + D L+ + G+ + + G D I Sbjct: 239 LAAGGYGWQSFNPIIHTQKPTPQVPDAHLISPPMVLAQGQYGDGQYEQTGKDNRATISIQ 298 Query: 90 WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE----GYSISAISRYKIPLKFCTFI 145 + F ++ + + S++ ++ + + + + + + F Sbjct: 299 LDAGFNVANIESLYHQITINKPPSSAYNVELTNGSTLMDRDFVLQWRATASSAPQAAVFK 358 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 ++++ + T + D++ V+D S SM T I A +S+ Sbjct: 359 ETLAGEDYLLLMLLPPQGQQQHTQSLSRDIVFVVDTSGSM------QGTSIQQAKRSLQF 412 Query: 205 MLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 L + N + + F ++ V + L+ T L Sbjct: 413 ALRGLNPSDTFNIIEFDTSFSRFRSRPVSAT--ASNVQAAVSWVNNLNADN-GTEMYAAL 469 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + A++Q+ + + N ++ +VF+TDG ++Q+L + A ++ Sbjct: 470 EEAFDQLASINPNGTENSKSSNNLQQ-VVFITDG----AVGNEQALLSLIHRRLNNARLF 524 Query: 324 AIGIRVIRSHEFLRACA 340 + I + F+R A Sbjct: 525 TVAIGSAPNSYFMRKAA 541 >gi|326435125|gb|EGD80695.1| hypothetical protein PTSG_01285 [Salpingoeca sp. ATCC 50818] Length = 1006 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 67/193 (34%), Gaps = 29/193 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 +D+ ++DVS S+ + I + ++ + P N V+ L+T+ + Sbjct: 308 AVDVGFIVDVSGSVG------AANFALVRDFIASTVDML---PVGENTVRVALMTYHSSN 358 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLK-YAYNQIFDMQGMRQHCNTED 284 + F + + + I L T + L +A N + G R Sbjct: 359 MPQFDFDDSFDRATVVSAISSLVYDDSRQYGTATGSALNFFADNMLQASAGYRGGPA--- 415 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR-ACASPN 343 I+ TDG + GA V IG+ + L+ +SP+ Sbjct: 416 -----IVYVFTDG-----ASQDDVTPGAAALQATGAQVVTIGVTAAVNEAELQEIASSPS 465 Query: 344 SFYLVENPHSMYD 356 ++V + S+ D Sbjct: 466 DVFIVADFDSLTD 478 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 102/285 (35%), Gaps = 30/285 (10%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRS 113 + + I N + + +K + +F ++ G+ N I + Sbjct: 109 IRGTSNIQTSVTANESYNKREDNQFKLVKASPLSTFSIDVDKAGYSNIRRMINNGIQIPK 168 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCT---FIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++ I + Y K P T PW ++++ + + + + + Sbjct: 169 DAVKIEEMINYFNYDYK-QPTGKHPFSIQTEYAQTPWNSDTKLVKIGLQGK-TIPLENVP 226 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +LDVS SM K+ + + + ++ ++ VV +G S+ + Sbjct: 227 ASNLVFLLDVSGSM-----GQQNKLPLLKSAFKLLTNNLREQDKISIVVYAG----SSGV 277 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ + LS G ST G++ AY +I ++ N Sbjct: 278 VLEPTSGDQKTKIEEALDKLSA-GGSTAGGEGIELAY-KIAKDNFIKNGNNR-------- 327 Query: 291 IVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ D+ E ++ G + A+G + + Sbjct: 328 VILATDGDFNVGLSSDKAMEDLIKEKRESGIFLTALGFGMGNYKD 372 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 39/298 (13%), Positives = 99/298 (33%), Gaps = 35/298 (11%) Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSIS 130 + + + + +F ++ + N + ++ + + Y Sbjct: 62 QHIDDNAIVQAAQQPISTFSIDVDTGSYSNVRRFLSAGSLPPVDAVRVEELINYFRYDHP 121 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSV--KVNSQTDARLDMMIVLDVSRSMESFF 188 A P T + ++ ++ + + +++ +DVS SM++ Sbjct: 122 A-PTNDKPFAVRTELATTPWNKDSLLLRVGIAGRDIATADLPPANLVFQVDVSGSMDA-- 178 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRK 246 K+ + S+ ++ +++ + LVT+ + + + Sbjct: 179 ---PDKLPLLRSSLKLLVRQLRKQD------RITLVTYAGNTAVVLPPTPGDQQGRIVEA 229 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKED 305 I L + G ST G++ AY + +R N I+ TDG+ N+ + Sbjct: 230 IDSL-QSGGSTAGASGIELAY-KAAQQGYLRGGINR--------ILLATDGDFNVGVTDF 279 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL---RACASPNSFYLVENPHSMYDAFSH 360 +E ++ G + +G ++ L A A ++ ++ P +H Sbjct: 280 DALKGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYIDTPLEARKVLTH 337 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 72/213 (33%), Gaps = 38/213 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 RL+++ ++D S S+ ++ + +L + ++P + +VTFS+K Sbjct: 80 GRLELVFLVDESSSVG------QANFLSELRFVRKLLSDFPVVPTA---TRVAIVTFSSK 130 Query: 230 IEEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ L+R+I ++ G T + + A + Sbjct: 131 NNVVPRVDYISHRRAHQHKCALLRREIPAIAYRGGGTYTKGAFQQAAQILRHS------- 183 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N K+I +TDG + + G ++ GI E + Sbjct: 184 ---RENSTKVIFLITDGYSNG----GDPRPVAASLRDFGVEIFTFGIWQGNIRELNDMAS 236 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +P E + +F + + + + D Sbjct: 237 TPK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 81/216 (37%), Gaps = 22/216 (10%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 +L V Q+ I+A +K+ PW + + + + + + + Sbjct: 178 ALQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQAR-DMQVRDLPPAN 236 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM+ ++ + ++ ++ +++ ++ V +G + Sbjct: 237 LVFLIDTSGSMQP-----AERLPLIQSALKLLVNDLRAQDNITIVTYAG----GTHVALA 287 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + ++ I L +G ST GL+ AY Q + N I+ Sbjct: 288 STAGNNTTAIKAAIDNLDAYG-STGGEAGLRLAYEQ------AEKGFIKGGVNR---ILL 337 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 TDG+ NL + + + +++G + +G+ Sbjct: 338 TTDGDFNLGITDPKDIEALVKKEREKGITLSTLGVG 373 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/237 (15%), Positives = 78/237 (32%), Gaps = 52/237 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ A ++ + ++ D+ V+ G+V F+ + Sbjct: 88 IILAMDVSGSMRAA-DVLPNRLVAAQEAAKSFIK------DLPRTVKVGIVAFAGSAQVA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------MRQHCNTED 284 L L I + +T + + + +F G RQ D Sbjct: 141 QLPTTNHDDLVTAIDSF-QLQRATATGNAIVVSLATLFPDAGIDVEQFSAPSRQRGTPID 199 Query: 285 ANYKK---------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK I+ +TDG+ L A RG +Y +G+ Sbjct: 200 QTEKKLKDFTPVAPGSFTSAAIIMLTDGQRT---TGVDPLDAAKAAADRGVRIYTVGVGT 256 Query: 330 IRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + E L+A A + ++ + + + + + ++ Sbjct: 257 VDGETIGFEGWSMRVRLDEETLKAVANKTNAEYFYAGTANDLKKVYETLSSKLTVEK 313 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 77/247 (31%), Gaps = 18/247 (7%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 ++ + + T G + + +Y + LK N + Sbjct: 224 STQVEKGTYCARVTYTPTPTE---QAAHSRPGVTADFVLQYDVSLKDLAGDVQIYNGYFV 280 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + D++ V+DVS SM TKI +++ +L ++ Sbjct: 281 HYFAPRGLPP-----IQKDVIFVIDVSGSMFG------TKIKQTKSAMHVILNDLHRDDS 329 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N + S +V + + + + G + + + A IF+ Sbjct: 330 FNIITFSDVVHVWRPGQSIPATAQNKKSAKDYVNKIEADGWTDINAALMAAA--SIFNQT 387 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLS-TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + T +I+F+TDGE S + L +A ++ + + Sbjct: 388 SHKPEKETSTKKIP-LIIFLTDGEATSGVLATSRILSNAQKAMGGTISLFCLAFGEDADY 446 Query: 334 EFLRACA 340 +R + Sbjct: 447 NLMRRLS 453 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 90/241 (37%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW +S + + + + + +++ ++DVS SMES K+ + S Sbjct: 122 LAPTPWNNDSLLLRVGVAGRA-IATADLPPANLVFLVDVSGSMES-----PDKLPLLQSS 175 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + ++ +++ + LVT+ + + + I + + G ST Sbjct: 176 LKLLVRQLRAKD------RITLVTYAGNTAVVLPPTPGDQQGRIIEAIDTV-QSGGSTAG 228 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY + +R N I+ TDG+ N+ + Q E ++ Sbjct: 229 ASGIELAY-KAAQQGYLRGGINR--------ILLATDGDFNVGVTDFDQLKGMVAEKRRS 279 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP--------HSMYDAFSHIGKDIVT 367 G + +G ++ L A A ++ +++P H + + I +D+ Sbjct: 280 GVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKVLTHELGATLATIARDVKI 339 Query: 368 K 368 + Sbjct: 340 Q 340 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 69/195 (35%), Gaps = 42/195 (21%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSI---------------------TKIDMAIKSIN 203 +++ +L + V+D S SM +KID+ I+++ Sbjct: 40 ATRSRPQLVVAFVVDTSGSMREVVTEPTERTGQSVRVDGKDYEVVRGAKSKIDLVIEALQ 99 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 +L +L P + +V F + ++ + F + L + L+++ T Sbjct: 100 NLLSSPQLQPSD----RLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGTQMGA 155 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 G++ + G R+ ++ +TDG+ + +A+ Sbjct: 156 GMREGMRLLEREAGSRR------------LILLTDGQTFDEPLVETVAAQLAQAR---IP 200 Query: 322 VYAIGIRVIRSHEFL 336 V AIG+ + + L Sbjct: 201 VTAIGVGDEWNDDLL 215 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 73/222 (32%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLRPNRLEAAKNVAAEFI---SGRPNDN----IGLTIFAGESF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + +++ I T GL A +++ D + Sbjct: 140 TQCPMTTDHASLLNLLRNVRTDIAARGLISDGTAVGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+++ +TDG N D + A+ G VY IG+ Sbjct: 193 ----KVVILITDGSNNM--GDISPMTSAQIAQSLGIRVYTIGVGTNKVAPYPMNVGGTTQ 246 Query: 330 -------IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGK 363 I S A + +FY N + ++ I K Sbjct: 247 YVNIPVEIDSKTLSDIAAVTEGNFYRATNNKELKQIYNDIDK 288 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 65/167 (38%), Gaps = 16/167 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++++I LDVS SM + D +++ + +I+ LE + + GLV F+ Sbjct: 85 KRKGIEVVIALDVSNSMLAD-DIQPSRLQKSKYTISNFLERLGND-------RVGLVVFA 136 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + S L+ + +S TN + ++ + + ++ + Sbjct: 137 GQSFVQCPITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALERIEEGAEAEEKNR 196 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 K+I+ +DGE+ D EA + +Y +G+ Sbjct: 197 VR-NKVILIFSDGEDHEAGID----EVLEEAASKNIRIYTVGVGSAE 238 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 65/176 (36%), Gaps = 26/176 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ LDVS+SM + D + +++ + + I ++ ++ + Sbjct: 79 GTKMETIKREGVDIVFALDVSKSMLAE-DIAPNRLEKSKRIITEIVNKLTAD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 GLV ++ + + + ++ ++ T + + A + D Sbjct: 131 VGLVGYAGSAFPQVPITTDYASTKTFLQSMNTDMVSSQGTAISQAIDLAKSYYNDDDQTN 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K+++ +++GE+ D A G +Y IG+ R Sbjct: 191 -----------KVLIILSEGEDH----DSNVESMAETAAAEGIKIYTIGVGTERGD 231 >gi|73954244|ref|XP_546326.2| PREDICTED: similar to Integrin alpha-2 precursor (Platelet membrane glycoprotein Ia) (GPIa) (Collagen receptor) (VLA-2 alpha chain) (CD49b) [Canis familiaris] Length = 1171 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 75/210 (35%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + + + ++ + + P Q GL+ ++NK Sbjct: 163 IDVVVVCDESNSIYPW--------EAVKNFLEKFVQGLDIGP---KKTQVGLIQYANKPR 211 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + G TN+ ++YA Sbjct: 212 VVFNLNTFKTKAEMTEATSQTYQYGGDLTNTFKAIQYA------KDFAYAAAAGGRPGAT 265 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 266 KVMVVVTDGESHDGSMLKAVIDQCN---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAI 322 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 323 ASIPTERYFFNVSDEAALLEKAGTLGEQIF 352 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 68/223 (30%), Gaps = 47/223 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM + D +++ A + + ++++ + GLV+FS+ Sbjct: 88 ILVAIDTSASMTAE-DMPPNRLEAAKAAAISFIDQLPARYN------VGLVSFSSSARVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA-------- 285 + + R I L T Y+ I D+Q + + Sbjct: 141 TSPTTDHALVVRSIDGLGPPDGGTAIGEA---VYSSIDDLQQILEEAAASGPSQEPEESE 197 Query: 286 -----------NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------ 328 +V ++DG N + + + E G I Sbjct: 198 GSEKSEESEEERSPAHLVLLSDGGNSAGRSPVAAAEAARE---AGLPTSTIAYGTEGSSA 254 Query: 329 -VIRSHEF------LRACA--SPNSFYLVENPHSMYDAFSHIG 362 + LR A + FY + + + + IG Sbjct: 255 TLPGGQSVEVREDTLRNLADTTGGRFYRASSADELREVYDDIG 297 >gi|297562484|ref|YP_003681458.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296846932|gb|ADH68952.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 505 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 76/205 (37%), Gaps = 28/205 (13%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +++ IT+ + + +VLD S SM ++D A++++ +++E + Sbjct: 21 SVLVDITAPEREEETERPPATLQVVLDRSGSMGG------GRLDGAVRALLSLVERLAPS 74 Query: 213 PDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + GLV+F++ ++E + ++R I L G T+ + GL + Sbjct: 75 DNF------GLVSFNDQARVEVPCGPLEDKARVRRLISGLHASG-GTDLSSGLLRGVQEA 127 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 R ++ ++DG N + +A G ++G + Sbjct: 128 RRAGADRGG----------TLLLISDGHANQGVTDHDLLRQVAADAYAHGVTTTSLGYGL 177 Query: 330 IRSHEFLRACASP--NSFYLVENPH 352 E L A A S E+P Sbjct: 178 GYDEELLGAVADGGAGSALFAEDPD 202 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 71/199 (35%), Gaps = 21/199 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q ++D+ +++D S SM ++ A ++ ++ +V + T Sbjct: 379 QHKKQVDVALIIDTSGSMRQE-----NRLREAKTALGDFIDIFADQDNVQVTI---FSTN 430 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + ++ + + + L +I L G T + Y I +++ Sbjct: 431 ATELSDLSPIGPKRADLHTRIDGLVADGE-TRLYSTIGEVYTDIQQQTEVQRI------- 482 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRACAS-PN 343 + +V +TDGE+ ++ + L + G ++ I + E L+ A Sbjct: 483 --RALVVLTDGEDTASSLSLEQLNEQIRQDESGTSIKIFTIAYGSDANQEVLQRIAEITG 540 Query: 344 SFYLVENPHSMYDAFSHIG 362 + +P ++ + I Sbjct: 541 AKSYTGDPATIRQVYHEIA 559 >gi|291401551|ref|XP_002717040.1| PREDICTED: anthrax toxin receptor 2 [Oryctolagus cuniculus] Length = 486 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 73/215 (33%), Gaps = 31/215 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + D+ VLD S S+ + + I ++++ V+ ++ Sbjct: 33 AQEQPSCRTAFDLYFVLDKSGSVANNW-----------IEIYDFVQKLTERF-VSPEMRL 80 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS++ L + + ++ L T GLK A QI G++ Sbjct: 81 SFIVFSSQATIILPLTGDRGKITKGLEDLKSVSPVGETYIHEGLKLANEQIQKAGGLKTS 140 Query: 280 CNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 II+ +TDG + L ++ ++ GA VY +G+ + + Sbjct: 141 S---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQLEK 188 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + V+ + A I I+ + Sbjct: 189 IADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|302868694|ref|YP_003837331.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504835|ref|YP_004083722.1| von willebrand factor type a [Micromonospora sp. L5] gi|302571553|gb|ADL47755.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315411454|gb|ADU09571.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 316 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 71/231 (30%), Gaps = 35/231 (15%) Query: 154 IVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 +++ T + T L+ +M+ +DVS SM++ D S +++ A ++ + E+ Sbjct: 66 LLVLATGLARPAIDTREPLERATVMLAIDVSLSMQAD-DVSPNRLEAAQEAAKQFVGELP 124 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + GLV+F+ + + I L +T + + I Sbjct: 125 ESYN------LGLVSFAKSANVLVPPTKDRAAVTTAIDGLV-LAEATATGEAVFTCLEAI 177 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + IV ++DG S + +++ V I Sbjct: 178 RSVPADGAAGIPPAR-----IVLLSDGYRTSGRSVEEAAAAAQA---ANVPVSTIAFGTD 229 Query: 331 RSH--------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 L A + FY + + + +G I Sbjct: 230 SGQVDIGGQLQRVPVDRTALSQLAETTQGFFYEAASASELKQVYQDMGSSI 280 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 61/173 (35%), Gaps = 22/173 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +++ ++DVS SM K+ + ++ + E++ V+ VV +G Sbjct: 206 DIERSERPPANLVFLMDVSGSM-----GRPDKLPLVKTALAGLAGELQPQDKVSIVVYAG 260 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ + L + G ST G++ AY QI + + N Sbjct: 261 AAGL------VLEPTNDTRKIRAALNQL-QAGGSTAGGAGIQLAY-QIAEDNFIEGGVNR 312 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ + + + G + +G +E Sbjct: 313 --------VILATDGDFNVGVSSRDALIEMIEKKRDSGITLTTLGFGTGNYNE 357 >gi|116618630|ref|YP_819001.1| von Willebrand factor domain-containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] gi|116097477|gb|ABJ62628.1| von Willebrand factor (vWF) domain containing protein [Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293] Length = 920 Score = 57.1 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 86/295 (29%), Gaps = 31/295 (10%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 N G + I L + LV ++ + + Q + Sbjct: 4 NIHKIVNGLFLRVFGILLILSTLVANALM-----------IPLTTVSAASDITPQYTSNS 52 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 G + N N N N + L N Y I Sbjct: 53 TGVFPTNSWTIPGQNTVINHQGGDASNGWDKNSSWNGDSSDTSKSYLKFGTDTSNPDYQI 112 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 ++ + + +D+++V+D+S SME + Sbjct: 113 RKYAKETSTPGLYDVYLNAKGNE-------------VKNIKPIDIVLVVDMSGSMEPANN 159 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 S + A + + ++ V GLV FS+ +L+ ++ Sbjct: 160 PSGSNRAQATRDGVKQFLQAIKDAGISQYVNVGLVGFSSPGNYVTG---SNGYLEVGMQS 216 Query: 250 LSKFGVSTNSTPGLKYAYN-QIFDMQGMRQHCNTEDA---NYKKIIVFMTDGENL 300 LS G + L ++ F G+R+ D ++KK+++ +TDG Sbjct: 217 LSTTGQTEQINKTLSPTFSGGTFTQLGIRRGQKMLDDDKNDHKKMMILLTDGVPT 271 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 32/231 (13%), Positives = 79/231 (34%), Gaps = 29/231 (12%) Query: 136 KIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 P T I + +++ + K++++ +++ ++DVS SM+ Sbjct: 131 DEPFSITTEIGQCPWNPENKLMLVGLQTKKLSTEQLPPSNLVFLIDVSGSMDE-----PN 185 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 K+ + + +++E+ V+ VV +G + + + L Sbjct: 186 KLPLLKSAFKLLVDELDEDDRVSIVVYAGAAG----LVLDSTPGNEKDKILDALMNLEA- 240 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYC 312 G ST G+K AY + ++ N ++ TDG+ N+ + + + Sbjct: 241 GGSTAGAEGIKLAY-DVAKKNFIKSGNNR--------VILATDGDFNVGISSEAELVRLI 291 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS-----PNSFYLVENPHSMYDAF 358 + + G + +G + S ++ ++N Sbjct: 292 EKKRDEGIFLTVLGFGTGNYKD--SKMESLADKGNGNYAYIDNIAEARKVL 340 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 75/221 (33%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 IVLD S SM + S + ++ ++ + + GLVT++ Sbjct: 174 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKCLVNLIEKLASYGVKPRYGLVTYATXXXI 233 Query: 230 -IEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + +++ ++ K TN+ L+ Y+ + Sbjct: 234 WVKVSEAVSSNADWVTKQLNEINYEDHKLKSGTNTEEALQAVYS-MMSWPDDVPPEGWNR 292 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG + + + ++ VY G+ + + Sbjct: 293 TRH--VIILMTDGLHNMGGDPITVIDXXXXXXYIGKDRKNPREDYLDVYVFGVGPLVNQV 350 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + V++ ++ D F + + + + Sbjct: 351 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLSLC 391 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 77/197 (39%), Gaps = 34/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM SS +++ ++ L ++ V N G+V F + I Sbjct: 289 VCLVLDKSGSM-----SSSNRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIR 338 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + G + + + L T+ G++ A+ + + + H + + Sbjct: 339 SDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVV---RKLYSHTDGSE------ 389 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 IV +TDGE+ + C +E K+ GAI++ I + + + A+ + Sbjct: 390 IVLLTDGEDN-------TAGACVDEVKQSGAIIHFIALGPSADKAVIEMSTATGGVHFYA 442 Query: 349 ENPHS---MYDAFSHIG 362 + + DAF + Sbjct: 443 TDEAENNGLIDAFGALA 459 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 34/305 (11%), Positives = 104/305 (34%), Gaps = 36/305 (11%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV--RSTSLDIVVVPQNEG 126 + + + +++ + + F + D V + ++++I + + Sbjct: 444 ESLKDVALYTDNVLNNQLEIVLPETVSADSVQTQFSGYMTDYVSMQRSAVNISSLDASAF 503 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ----TDARLDMMIVLDVSR 182 I+A + + P+ + HI + + + +M++ DVS Sbjct: 504 SEITAYVQIETPVDYSIDEL----KSHITVEDCGAQISEYNLEKVEYSSANMLLCCDVSG 559 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM+ I+ + ++ + + ++ + G++ F++ ++ Sbjct: 560 SMQGRP------IEDSRAAV------ISMAESMSGNARLGVILFNSSVQGLTDFTVQPDV 607 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLS 301 ++ + ++ G + N + + +V M+DG EN + Sbjct: 608 IRSTAESMTANGGT-----------NIFDTVVHGLESFPKNGPEVLNTLVVMSDGQENNA 656 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFS 359 ++ AK + +V+ +G+ +L+ A ++ V + S+ + Sbjct: 657 HSAEEIQTAIGQAAKDKSILVHCLGLGSEVDANYLQTIAQSAGGTYQYVTDSSSLAVFYQ 716 Query: 360 HIGKD 364 ++ Sbjct: 717 NLASQ 721 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/184 (20%), Positives = 73/184 (39%), Gaps = 33/184 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + S + +++D ++V+DVS SM + I +++ ++ + D Sbjct: 28 PQTANAASPSASKVDAVLVVDVSNSMNTSDPGKIGN-----EAMKMFIDMLSTQND---- 78 Query: 219 VQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + G+V +++ ++ L E L+ I L++ G T+++ G+K A + D Sbjct: 79 -KVGIVAYTDVVQREKALLNISSEADKQELKTFIDGLNR-GAYTDTSVGVKEAIRILQDG 136 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS--------LYYCNEAKKRGAIVYAI 325 + IV + DG N K ++ EAKK G +Y I Sbjct: 137 KTAGHAPM---------IVMLADGNNDFNKTTGRTESQSAQDMAQAVAEAKKSGVPIYTI 187 Query: 326 GIRV 329 G+ Sbjct: 188 GLNA 191 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 74/213 (34%), Gaps = 31/213 (14%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + ++ Q A +++++D S SM+ K++ A + A ++ + Sbjct: 106 IDVPAARAARGQPRAPAAVVLLVDASGSMQG------PKMENARAAAQAFVDRL----PD 155 Query: 216 NNVVQSGLVTFSNKIE---EFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++V + +F++ + +L + R I L G STN GLK A Sbjct: 156 GDLV--SVASFADTAQARVAPTVLGRSTRPAVARAIAALGPDG-STNLFAGLKLA----- 207 Query: 272 DMQGMRQHC-NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 QH + + +V ++DG+ N+ G + +IG+ Sbjct: 208 -----EQHALAAPSTHAVRRVVLISDGQANIGPSSPDILGALAQRGAAHGVQITSIGVGA 262 Query: 330 IRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 L A A S Y + M Sbjct: 263 DYDERTLNALAVGSSGRLYHLTEAREMSSVLER 295 >gi|47229020|emb|CAG09535.1| unnamed protein product [Tetraodon nigroviridis] Length = 680 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 51/132 (38%), Gaps = 20/132 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKI 230 D+ V+D S S+ ++ + + E ++ + G V ++ ++ Sbjct: 495 DIAFVIDGSSSVG------TGNFRTVLQFVANITREFEISDTD---TRVGAVQYTYEQRL 545 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + L IK +S + T++ + YA Q+F N +KI Sbjct: 546 EFSFSQYGNKADLLGAIKRISYWSGGTSTGAAISYASEQLFS---------KSKPNKRKI 596 Query: 291 IVFMTDGENLST 302 ++ +TDG + Sbjct: 597 MIVITDGRSYDD 608 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 67/198 (33%), Gaps = 22/198 (11%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +P T + + +S++ + + ++D++ ++D S S+ + Sbjct: 257 VPRSPTTSSIKFFLQKQFASSSLTSLRADHPAECKIDLVFLMDGSWSIGK------RRFK 310 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFG 254 + + + + + + P G+V + + L+ + ++ +++ G Sbjct: 311 IQKDFLAEVAQAINVGPTGP---MMGVVQYGDDPVTEISLKSVSSSREARSAVEKITQKG 367 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 ++ L Y Q F + + V + DG E+ L Sbjct: 368 GLSHVGRALSYINKQYFSDANGNRGAAPN------VAVVLVDGWPTDKVEEASRL----- 416 Query: 315 AKKRGAIVYAIGIRVIRS 332 A++ G ++ + I Sbjct: 417 ARESGINIFFVTIEGPDG 434 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 44/282 (15%), Positives = 87/282 (30%), Gaps = 61/282 (21%) Query: 122 PQNEGYSISAISRYKIPLKFCT----FIPWYTNSRHIVMPITSSVKVNSQTD------AR 171 +S +Y+ K +P V+ + + + + T Sbjct: 28 KNAPTMRMSDTKKYQYAPKSLRVRLIHLPMVLRCICFVLIVCAMARPQTHTAWDNKTVEG 87 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +++ A + P+ N GL F+ + Sbjct: 88 IDIMLAMDVSTSMLAE-DLRPNRMEAAKDVATEFI---SGRPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWG-------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I T GL A +++ + Sbjct: 140 TQCPMTTDHASLLRLLQATRTDIAARGLIDDGTAVGMGLANAVSRL-----------KDS 188 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------ 332 + K+++ +TDG N + + AK G VY IG+ + Sbjct: 189 KSKSKVVILLTDGSNNM--GEISPMTAAEIAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQ 246 Query: 333 ---------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L A + +FY N + + + I K Sbjct: 247 YVNIPVEIDTKTLSDIAQTTDGNFYRATNNNELKKIYRDIDK 288 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 73/195 (37%), Gaps = 24/195 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+M +LD S S+ + + A+++I M+ + + ++T + + + Sbjct: 49 FDIMYILDSSSSVSDW------EFQRAVQAIQTMVAK------SKRDNRHAVITIATRAK 96 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F +++ ++ +S+ G TN+ L+ A+ + + Sbjct: 97 TFMNFSSRADAVRK-LRGISRSGGKTNTQDALELAFQMFTTS---KYGSTPGGLAR---V 149 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPN--SFYLV 348 + +TDG + K ++ + K G V+ I I + + L A+ Y V Sbjct: 150 LVVTDGRSNIEK--HRTERKAFKLKANGIEVFVIAIGDYLEGMDELARMANTKYAHMYRV 207 Query: 349 ENPHSMYDAFSHIGK 363 E+ + I + Sbjct: 208 EDVKGLARVVKLIPR 222 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 330 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGE------SN 383 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 384 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 440 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 441 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIITDGQ-----SYDDVRGPA 487 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 488 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 515 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + D+ N A+ +Y+IGI S SF Sbjct: 195 IVVLTDGVVNSDEIDKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|296100188|ref|YP_003617105.1| hypothetical protein pDK1_p020 [Pseudomonas putida] gi|295443554|dbj|BAJ06433.1| hypothetical protein [Pseudomonas putida] Length = 604 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 84/206 (40%), Gaps = 31/206 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + I+LD S SM+S D + + A+ ++ + LE + L V +G + Sbjct: 420 SRAERQSASIQILLDKSGSMKSAMD----QAEAAVYAVLSALEGLPL-------VTTGAM 468 Query: 225 TF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +F ++ +E L++ S +R I+ +S+ G S G A Q + Sbjct: 469 SFPNKANDGVERCALIK---SPKERLIRAVSEGGFGAMSEGGTPLA--QALWPAAVEVLR 523 Query: 281 NTEDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + KKI+ +TDGE + T ++ + C G V +G H L+A Sbjct: 524 AKGE---KKILFVITDGEPNAGTTHAAKEFIQRCEV---SGIEVIGLGFGSANEH-ILKA 576 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKD 364 S + V ++ ++ + ++ Sbjct: 577 LFS--QYRAVGEVANLKNSLFELVRE 600 >gi|182438638|ref|YP_001826357.1| hypothetical protein SGR_4845 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467154|dbj|BAG21674.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 578 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 87/244 (35%), Gaps = 28/244 (11%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P+ G+S++ K ++ + + + + ++ + V+D+S Sbjct: 167 PKGSGFSVNVDGARIGAGKGGGGGTGASDWSLLRVGLATEAAPSTAERPPAALTFVVDIS 226 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--- 238 SM ++D+ KS+ + +E++ V+ LVTFS+ E + Sbjct: 227 GSMAETG-----RLDLVRKSLAVLTDELRDDDSVS------LVTFSDAAETRLPMTRLQG 275 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG- 297 + ++ + + STN G+ Y + ++G R+ +V ++D Sbjct: 276 NRNRIKDAVDEMRP-EQSTNVEAGITRGYEE--SVEGHRKGATNR-------VVLLSDAL 325 Query: 298 ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSM 354 N E L + ++ G ++ +G+ F+ + + V + Sbjct: 326 ANTGDTEADGILERIDSTRREYGITLFGVGVGSDYGDAFMERLTNKGDGNTTYVGDEAQA 385 Query: 355 YDAF 358 F Sbjct: 386 RKVF 389 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 69/215 (32%), Gaps = 46/215 (21%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + D + I+ + + E + + GL+ F++ Sbjct: 85 DLMLSVDLSGSMQIEDMVIDGKVVDRFTLIQHVISDFIERRKGD------RIGLILFADH 138 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + + ++ Sbjct: 139 AYLQSPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGVKRFDKVEQSN---------- 188 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRV--IRSHEF--------- 335 ++++ +TDG N + ++ A +RG +Y IG+ + Sbjct: 189 -RVLILLTDGSNN---AGAITPEQASQIAAQRGITIYTIGVGADVMERRTLFGKERVNPS 244 Query: 336 -------LRACA--SPNSFYLVENPHSMYDAFSHI 361 L+ A + ++ N + + I Sbjct: 245 MDLDESQLQEIAKVTGGQYFRARNTEELEQIYQVI 279 >gi|323701386|ref|ZP_08113060.1| von Willebrand factor type A [Desulfotomaculum nigrificans DSM 574] gi|323533645|gb|EGB23510.1| von Willebrand factor type A [Desulfotomaculum nigrificans DSM 574] Length = 602 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 15/205 (7%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 I++Y +P Y + + S + + LD+++++D S SM Sbjct: 381 QVINQYNLPSGELDEDALYQAAYSNRLFKRSEIIETRTRN--LDIVLLIDSSASMVYPAG 438 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 I+++++A +E ++ + V++ + F+ K + + + Sbjct: 439 EGISRVELARNLAALFVEALEPVDS----VKTWVFGFNLKGAVNLMELYSPHLTNKARIG 494 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 +S +T LKYA ++ K+++ + DG E + Sbjct: 495 MSVAEGTTPEGSALKYAALRLMSE---------GRRFVPKVLIVIADGNPNPGPETKLVK 545 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHE 334 + K G I + E Sbjct: 546 EQVRKMKALGCKTINISVGDRPGEE 570 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 90/249 (36%), Gaps = 37/249 (14%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + + IM + + L + N N + + T ++ Sbjct: 157 LDSNRNIMALHQASEPGVFNDDESLDHQPVPAERNSSNGNAAEN--NPVRTVEIKTYPEV 214 Query: 119 VVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 P+++ Y + + + K + ++I +++S + A +D++ V Sbjct: 215 SAAPRSKSYDNFTVLVHLKAAV--------ANTGQNIQRNMSNSPLNSHNPRAPVDLVTV 266 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD+S SM TK+ + +++ +++ + + ++ FS+ F L Sbjct: 267 LDISGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLR 314 Query: 238 W----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G + + L G TN GL+ +G + + ++ N I+ Sbjct: 315 RMTDAGRQQALQAVNSLVANG-GTNIAEGLR---------KGAKVMEDRKERNPVSSIIL 364 Query: 294 MTDGENLST 302 ++DG++ T Sbjct: 365 LSDGQDTYT 373 >gi|125852492|ref|XP_696796.2| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Danio rerio] Length = 503 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 44/227 (19%), Positives = 84/227 (37%), Gaps = 34/227 (14%) Query: 154 IVMPITSSVKVNSQTDARL-----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + S +S TD+ L D++ +LD S S+ S+ + + ++ +L Sbjct: 12 LTVFLCSGDAQDSVTDSVLNCCEGDVLFLLDSSGSVASY------EFSRMVDFLSELLLP 65 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYA 266 L PD V+ GL+ + F + S LQ ++ + TN+ L A Sbjct: 66 FSLGPD---HVRVGLLQVGTEPHLEFGFDAYSSQQGLQAALERTKQLKGDTNTVEALLMA 122 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 Q+ + + +++V++TDG + + G V + Sbjct: 123 RAQVLRVGVPG----GARPDLPRVLVWLTDG-----VDPGNVQEPMARLRDEGVAVLVVS 173 Query: 327 IRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 ++ LR SP + V+ D S IG+D+ I Sbjct: 174 TGHGN-YQVLREVVSPPTEEHLFFVD-----IDDISIIGEDLRNSII 214 >gi|28210485|ref|NP_781429.1| membrane-associated protein [Clostridium tetani E88] gi|28202922|gb|AAO35366.1| membrane-associated protein [Clostridium tetani E88] Length = 842 Score = 56.7 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 38/219 (17%), Positives = 79/219 (36%), Gaps = 28/219 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P++ +VK +++++D S SM+ + KI++A + ++ ++ Sbjct: 394 LPVSCNVKNKR-KQGDAGIVLLIDCSGSMDDE-SGGVKKIELAKQGAIETIKALESED-- 449 Query: 216 NNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 G++ FS+ I+ F L +++ L G T PGL + Sbjct: 450 ----YIGILGFSDTIDWVVPFQKAENKEKLIKEVGKLKPKG-GTLIIPGLIEGVKTLSSA 504 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +H ++ +TDG+ D Y KK + +G+ Sbjct: 505 KTKVKH-----------MILLTDGQAEKNGFD----KYLENMKKNNMTLSTVGLGEDSDR 549 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E L + + Y + S+ F+ + K I Sbjct: 550 EVLTHLSDFTGGRKYFSNDFKSVPIIFAKETRISQKKYI 588 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 74/202 (36%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD+S SM K+ +++ +L +++ + ++ FSN+I Sbjct: 296 VVFVLDISASM------VGAKLQQTREALVTILNDLRPQD------RFNIIGFSNRIKMW 343 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + + +LS G T+ L+ A + + ED + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTG-GTDINGALQTAIKLL---NNYVAQNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEF-----LRACA 340 +I+F+TDG+ + +L + K+ ++ +GI L C Sbjct: 400 -LIIFLTDGKPTF--GETNTLRILSNTKEATGGQICIFTVGIGNDVDFRLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E + + I Sbjct: 457 LTRRVHEEEKAGAQLIGFYDEI 478 >gi|126272975|ref|XP_001371818.1| PREDICTED: similar to anthrax toxin receptor [Monodelphis domestica] Length = 858 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 73/200 (36%), Gaps = 25/200 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++LD S S++ + I + +E + ++ +++ + FS+K Sbjct: 348 CVILDKSGSVKHHW-----------IEIYSFVESLAEKF-ISPMLRMSFIVFSSKGTTIM 395 Query: 235 LLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L +++ ++ L G T G + A QI+ H N +I+ Sbjct: 396 KLTEDREAIRQGLEVLRYEVPGGDTFMHKGFERANEQIY-------HENYGGLRTASVII 448 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDGE + N A+ GAIVY +G++ + S + + V Sbjct: 449 ALTDGELQKEQFYFAEKEV-NRARTFGAIVYCVGVKDFNETQLSTIADSIDHVFPVTGG- 506 Query: 353 SMYDAFSHIGKDIVTKRIWY 372 + A + I+ K Sbjct: 507 --FHALRGVIDSILKKSCIE 524 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 65/177 (36%), Gaps = 28/177 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +VLD S SM + + + A K++ A L + + ++ F + ++ Sbjct: 266 DVTLVLDRSGSMS---GAPLARAKDAAKAVVARLGDGD---------RVNVMAFDDGVDA 313 Query: 233 FF----LLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F + S I LS G T+ L A + +T + Sbjct: 314 LFLRPVPISAERRSQAVEYIDRLS-DGGGTDLAGALAEA---LDAQHPSESEADTGSRPH 369 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I+F+TDG++ S Q + +A+ V+ IG+ L AS Sbjct: 370 --VILFLTDGQSDSQATLQVARGDAGDAR-----VFTIGVGDGVEKPLLARLASEKR 419 >gi|149922129|ref|ZP_01910569.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] gi|149817066|gb|EDM76548.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] Length = 546 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 81/234 (34%), Gaps = 23/234 (9%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 + ++ +++ + LD+ IVLD S SM K+ Sbjct: 95 QYGYVHVAEDAGQPFEMPAIIRLSADDEAGQGPRPGLDLAIVLDRSGSMGG------DKL 148 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLS 251 A ++ ++ + + L+++ + + L+ G+ L+R++ + Sbjct: 149 RFAKQAGLDLVNRLDEQD------RVTLISYDDTVTPLSNLQRVDDDGIEVLRRQLLDI- 201 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE-DANYKKIIVFMTDGENL--STKEDQQS 308 + G +T P L ++ + TE + + ++ ++DG T+ + Sbjct: 202 QVGGTTALGPALFMGLQRLAAPEPFGPQTRTEARHDRLRHVILLSDGIANVGETRPEVIG 261 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 G V +G+ + + + + A ++ +E+ S+ Sbjct: 262 GRVAEHF-GGGVSVSTLGMGLDYNEDLMTRIADEGGGRYHFIEDAESIPAMLGD 314 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 24/217 (11%), Positives = 78/217 (35%), Gaps = 14/217 (6%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G I+ LP++ +++ ++++ + + + + ++ Sbjct: 20 KKQQGVAGIIFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAEVASLA----LIASPKEG 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + K + +I + + + E+ F + + P + +SA Sbjct: 76 DEKNQEYAQ---KIVDHYIPDNKGEVVARVFNRRCEYKDGCVQRSGELAP-FTDFVVSAK 131 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSS 191 +++ + ++ + + LD+ ++D+S SM + S Sbjct: 132 TKHD---SWISYNDGEMGLTK-DFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSG 187 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 TK D+ +IN ++++++ + + ++ F + Sbjct: 188 KTKYDVVADTINRIVDDLREFKT-DRKSRVAVIGFHH 223 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 80/242 (33%), Gaps = 38/242 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + + +P+ ++ + E SH + L ++++ + +T+ N Sbjct: 12 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE--NNYR 69 Query: 71 GNNRKKLKGGDILCRIKNTW--NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + ++ ++ + F N + + + SL + Y Sbjct: 70 NDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQM-------NYQ 122 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES-- 186 ++ ++ Y LK W N SS+ +D++ V D S SM+ Sbjct: 123 LALLNSY---LKQTPSPTWDVNENGAARKYLSSI------AEPIDVVFVTDFSGSMDLPF 173 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 ++ ITK+D + + +N G V WG + Sbjct: 174 GDIERNNRITKLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRI 220 Query: 244 QR 245 Sbjct: 221 SA 222 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 402 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 461 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 462 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 492 >gi|487830|gb|AAA59180.1| eukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1163 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 189 SLMQFSNKFQTHFTFEEFRRTSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDATKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana] Length = 641 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 75/217 (34%), Gaps = 32/217 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + A LD++ VLDVS SM K+++ +++ +++ + Sbjct: 185 VHLKAEGVSDDARRARAPLDLITVLDVSGSM------DGVKMELMKNAMSFVIQNLGETD 238 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +++FS+ F L G + + L G TN GLK Sbjct: 239 ------RLSVISFSSMARRLFPLRLMSETGKQAAMQAVNSLVADG-GTNIAEGLKIGARV 291 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE----AKKRGAIVYAI 325 I R+ N ++ ++DG++ T + ++ Sbjct: 292 I----EGRRWKNPVSG-----MMLLSDGQDNFTFSHAGVRLRTDYESLLPSSCRIPIHTF 342 Query: 326 GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 G E + + S +F +E + DAF+ Sbjct: 343 GFGSDHDAELMHTISEVSSGTFSFIETETVIQDAFAQ 379 >gi|15223093|ref|NP_172283.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana] gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 641 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 75/217 (34%), Gaps = 32/217 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + A LD++ VLDVS SM K+++ +++ +++ + Sbjct: 185 VHLKAEGVSDDARRARAPLDLITVLDVSGSM------DGVKMELMKNAMSFVIQNLGETD 238 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +++FS+ F L G + + L G TN GLK Sbjct: 239 ------RLSVISFSSMARRLFPLRLMSETGKQAAMQAVNSLVADG-GTNIAEGLKIGARV 291 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE----AKKRGAIVYAI 325 I R+ N ++ ++DG++ T + ++ Sbjct: 292 I----EGRRWKNPVSG-----MMLLSDGQDNFTFSHAGVRLRTDYESLLPSSCRIPIHTF 342 Query: 326 GIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 G E + + S +F +E + DAF+ Sbjct: 343 GFGSDHDAELMHTISEVSSGTFSFIETETVIQDAFAQ 379 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 80/242 (33%), Gaps = 38/242 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + + +P+ ++ + E SH + L ++++ + +T+ N Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE--NNYR 74 Query: 71 GNNRKKLKGGDILCRIKNTW--NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + ++ ++ + F N + + + SL + Y Sbjct: 75 NDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQM-------NYQ 127 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES-- 186 ++ ++ Y LK W N SS+ +D++ V D S SM+ Sbjct: 128 LALLNSY---LKQTPSPTWDVNENGAARKYLSSI------AEPIDVVFVTDFSGSMDLPF 178 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 ++ ITK+D + + +N G V WG + Sbjct: 179 GDIERNNRITKLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRI 225 Query: 244 QR 245 Sbjct: 226 SA 227 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 407 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 466 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 467 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 497 >gi|156383259|ref|XP_001632752.1| predicted protein [Nematostella vectensis] gi|156219812|gb|EDO40689.1| predicted protein [Nematostella vectensis] Length = 157 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 66/172 (38%), Gaps = 21/172 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-- 229 +D+ ++D S+S+E + + +K I + G+VTF+ Sbjct: 3 VDLAFLIDGSKSIEDAGKGNFKRCLDFVKRIAL------SFDISASGTHIGIVTFATDPT 56 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +E F + + + I + T + L+ +F+ ++ AN + Sbjct: 57 VELEFDQSFDNTSIATIIDNIRNPDALTFTGKALETVKKDLFE--------KSQRANVHR 108 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +++ +TDG + + K+ G +YA+G+ E L+ AS Sbjct: 109 MLIVLTDGRSWD-----AVQEPAKQLKESGVTLYAVGVGQDYDLEQLKDIAS 155 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/249 (15%), Positives = 90/249 (36%), Gaps = 37/249 (14%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + + IM + + L + N N + + T ++ Sbjct: 162 LDSNRNIMALHQASEPGVFNDDESLDHQPVPAERNSSNGNAAEN--NPVRTVEIKTYPEV 219 Query: 119 VVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 P+++ Y + + + K + ++I +++S + A +D++ V Sbjct: 220 SAAPRSKSYDNFTVLVHLKAAV--------ANTGQNIQRNMSNSPLNSHNPRAPVDLVTV 271 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD+S SM TK+ + +++ +++ + + ++ FS+ F L Sbjct: 272 LDISGSMAG------TKLALLKRAMGFVIQNLGSSD------RLSVIAFSSTARRLFPLR 319 Query: 238 W----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G + + L G TN GL+ +G + + ++ N I+ Sbjct: 320 RMTDAGRQQALQAVNSLVANG-GTNIAEGLR---------KGAKVMEDRKERNPVSSIIL 369 Query: 294 MTDGENLST 302 ++DG++ T Sbjct: 370 LSDGQDTYT 378 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 68/187 (36%), Gaps = 26/187 (13%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSR--HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++S IP ++ +++ + + + +D++ VLDVS SM Sbjct: 24 AMSVAPVKVSTTPIFPTIPRAQTNKDFQVLLRVEAPPAADLNGHVPIDVVAVLDVSGSMN 83 Query: 186 ------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL--- 235 + T++D+ S+ ++ ++ + +V F++ ++E+ Sbjct: 84 DPVAASPESNLQATRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLL 137 Query: 236 --LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G S +KI L G S L + + RQ + I+ Sbjct: 138 DVSGDGRSIAGKKIDRLQARGG---SGSALMPELQEAVKILDERQGNSRNRVG---FILL 191 Query: 294 MTDGENL 300 +TDG++ Sbjct: 192 LTDGDDT 198 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 63/168 (37%), Gaps = 18/168 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +K ++ +D+++ +D+S SM + + ++ A+ VK P+ + Sbjct: 14 GIKTRDLSNKGVDIVMAIDISGSMLAMDFAPKNRLSAAVSVAKDF---VKRRPND----R 66 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GLV FS L + + + L ++ + G+ A Sbjct: 67 FGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEASATAIGMGLA---------KAVAR 117 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K+I+ +TDG +S + L AK+ G VY IG+ Sbjct: 118 LKNSTAKSKVIILITDG--VSNTGEIDPLTAAGMAKELGIKVYPIGVG 163 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 32/303 (10%), Positives = 88/303 (29%), Gaps = 25/303 (8%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 11 MRRSLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 70 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS--TSLDIVVVPQNE 125 + + + G + + + ++ N V D + T + + Sbjct: 71 AISLSVAE--ADGIAAGHLNFVFFQNKSVQMSTNANVTFSDSLTDPFLTRSAVTTPSSIK 128 Query: 126 GYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 +A ++ +P + + ++ V + V + S Sbjct: 129 YVQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRATGS 188 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 IT + + N + L +N+ L ++ Sbjct: 189 PSYNVGDWITSLSGS-----------STTYGPGNFGWAALDGSTNEPTIASELSGNTCNI 237 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ST A+N F + + ++ ++ G N Sbjct: 238 TS-----PPDLGSTGLKSASLRAWNTRFGIYTNGANGSSGQPDFTGYAYV---GPNYGPP 289 Query: 304 EDQ 306 Sbjct: 290 GTA 292 >gi|7463254|pir||E70121 hypothetical protein BB0173 - Lyme disease spirochete Length = 340 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 68/196 (34%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 101 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASI 153 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 154 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 202 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + Q + +Y+IGI S SF Sbjct: 203 IVVLTDGVVNSDEIKDQVINLAQGL---NVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 259 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 260 EVYDPSMLVEISNKTG 275 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 83/218 (38%), Gaps = 44/218 (20%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 Q A LD++ VLDVS SM K+ + +++ +++ + + +V Sbjct: 203 KPQPRAPLDLVTVLDVSGSMSGH------KLALLKQAMRFVIDNLGPDD------RLSVV 250 Query: 225 TFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +FS++ L G + ++ L G TN GL+ A + RQH Sbjct: 251 SFSSEARRLTRLARMSDAGKALSVNAVESLVARG-GTNIAEGLRTAAKVL----DERQHR 305 Query: 281 NTEDANYKKIIVFMTDGENLST-----------KEDQQSLYYCNEAK-----KRGAIVYA 324 N + +V ++DG++ T + + L + A+ A ++ Sbjct: 306 NAVSS-----VVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADGDWSAPIHT 360 Query: 325 IGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 G + A A+ +F +EN + DAF+ Sbjct: 361 FGFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAFAQ 398 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 66/179 (36%), Gaps = 26/179 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S S+ ++ +I LD+S SM + D S ++++ + + ++ + Sbjct: 76 PQIGSKLATSRQREGIETIIALDISNSMLAE-DVSPSRLEKSKLLVENLMNKFSED---- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GL+ F+ + + + ++ T+ L+ + N Sbjct: 131 ---KIGLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT-- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ K I+ +TDGE+ + + +A+ +G V+ +GI Sbjct: 186 ---------PNSKVGKAIILITDGEDN----EGGAEAMAKQARNKGIKVFILGIGSKEG 231 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 82/268 (30%), Gaps = 79/268 (29%) Query: 167 QTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +D+++V+D+S SME S + + A + L+ +K + + V GLV Sbjct: 333 QDIKPVDIVLVVDMSGSMESSQSNGWNDRAGAARNGVKNFLQTIKDA-GIGDYVNVGLVG 391 Query: 226 FSNKIEEFFLLEW-----GVSHLQRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQG 275 FS+ + G + I + KF T + G+ Sbjct: 392 FSSPGYVTGPNGYLTVPIGKASDTSHINAINDALKPKFTGGTYTQIGI-----------E 440 Query: 276 MRQHCNTEDANYKKIIVFMTDG-------------------------------------- 297 Q +N K+++ +TDG Sbjct: 441 QGQQMLAGSSNENKMMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNTLDEPRNTSKLNS 500 Query: 298 ---------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL---------RAC 339 +L AK G ++ +GI++ + FL Sbjct: 501 SYEVGSWGNRTNINSTWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFLTEQQVRDRASLI 560 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+P + E + + D + K++V Sbjct: 561 ATPGKYKDAETTNDVSDYLNEQAKNVVK 588 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 83/241 (34%), Gaps = 30/241 (12%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 IPLK I+ S Q L++ V+D S SME KI Sbjct: 9 IPLKNAVCSERAVTLDLIIRITPPSPPAMDQPRPSLNLGFVIDRSGSMEGH-----NKIT 63 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFG 254 A +++ ++++ ++ + F ++++ L + +R ++ ++ G Sbjct: 64 YARQAVCYAIDQLSPGDHLS------VTIFDDQVQTLIPSTLVKDKAQFKRLVQGINPGG 117 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CN 313 + L+ ++ I+ ++DG + + + + Sbjct: 118 CTDLHGGWLQGGIQVSQNLSAELNR-----------IILLSDGLANRGETNPDIIATDVH 166 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +RGA +G+ + + L A A ++Y V + + F +++ Sbjct: 167 GLAQRGASTTTLGLGDDYNEDLLEAMARSGDGNYYYVADAEQLPTIFE---RELQGLAAT 223 Query: 372 Y 372 Y Sbjct: 224 Y 224 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 66/179 (36%), Gaps = 26/179 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S S+ ++ +I LD+S SM + D S ++++ + + ++ + Sbjct: 76 PQIGSKLATSRQREGIETIIALDISNSMLAE-DVSPSRLEKSKLLVENLMNKFSED---- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GL+ F+ + + + ++ T+ L+ + N Sbjct: 131 ---KIGLIVFAGDAFVQLPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFT-- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ K I+ +TDGE+ + + +A+ +G V+ +GI Sbjct: 186 ---------PNSKVGKAIILITDGEDN----EGGAEAMAKQARNKGIKVFILGIGSKEG 231 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSR--HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME- 185 +S IP ++ +++ + + + +D++ VLDVS SM Sbjct: 1 MSVAPVKVSTTPIFPTIPRAQTNKDFQVLLRVEAPPAADLNGHVPIDVVAVLDVSGSMND 60 Query: 186 -----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL---- 235 + T++D+ S+ ++ ++ + +V F++ ++E+ Sbjct: 61 PVAASPESNLQATRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLD 114 Query: 236 -LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G S +KI L G S L + + RQ + I+ + Sbjct: 115 VSGDGRSIAGKKIDRLQARGG---SGSALMPELQEAVKILDERQGNSRNRVG---FILLL 168 Query: 295 TDGENL 300 TDG++ Sbjct: 169 TDGDDT 174 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/242 (14%), Positives = 71/242 (29%), Gaps = 62/242 (25%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +++ +D SRSME+ D +I A ++ ++ + V+ G+V+F+ Sbjct: 86 VTLVLAMDTSRSMEAA-DVPPNRISAAQQAARDLVVGLPAS------VRLGIVSFAGTAA 138 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------------- 275 + I+ + T + GL A +F G Sbjct: 139 VVLPPTDNRQDMLDAIERF-QLQRGTATGSGLFQALAVLFPEDGIDLEVILFGSRSDRAG 197 Query: 276 ------------------MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 Q + ++ ++DG + + + + Sbjct: 198 RGTSLDEAAAADAARRREQGQQAAQPGSYRHGAVILLSDGRRTTGPDPLDAARMAAQ--- 254 Query: 318 RGAIVYAIGIR------VIRS---------HEFLRACAS--PNSFYLVENPHSMYDAFSH 360 RG VY +G S LRA AS +Y + + + Sbjct: 255 RGVRVYTVGFGSQQVTSAPESSLSYFMQLDEPALRAVASITGGEYYHAGSAADLSQVYRQ 314 Query: 361 IG 362 + Sbjct: 315 LS 316 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 48/299 (16%), Positives = 105/299 (35%), Gaps = 38/299 (12%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVV---PQNEGYSISAIS---RYKIPLKFCTFIPW 147 E+ + F D+D ST ++ P+ I + Y PL P+ Sbjct: 151 VAGEIPISTFSIDVDTGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDYPLPNKNAAPF 210 Query: 148 YTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + P ++ +++ +LDVS SM +S K+ + Sbjct: 211 SATTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFLLDVSGSM-----ASADKLPL 265 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ + +++ V+ VV +G + + L ++ L G ST Sbjct: 266 LQTALKMLTQQLSAQDKVSIVVYAGAAG----VVLDGASGDDIQALTYALEQLRA-GGST 320 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 N + G+ AY ++H N ++ TDG+ N+ Q + + K Sbjct: 321 NGSQGILQAYQL------AQKHFIQGGINR---VILATDGDFNVGVTNFDQLISLIEKEK 371 Query: 317 KRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +RG + +G + + + + A + + ++ +A + ++ + + K Sbjct: 372 QRGIGLTTLGFGMDNYNDQLMEQLADKGNGHYAY-IDTLNEARKVLVDELSSTLLTIAK 429 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGE------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ +S Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|327266506|ref|XP_003218046.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 764 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 85/235 (36%), Gaps = 35/235 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + D L++ IV+D SRS+ K D ++ N ++ ++ I + + G Sbjct: 242 RIKIEKDGSLNIYIVIDSSRSI---------KKDQFQEAQNMSIKLIEKISSYDISPKYG 292 Query: 223 LVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++TF+ ++ E W + L+ K TN GL Y + + Sbjct: 293 VITFATEVKELVSTMDERSSDAAWVIEKLEGIKHSDHKQKPGTNIYKGLFAVYTMMISQE 352 Query: 275 GM--RQHCNTEDANYK--KIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGA 320 R+ N + K +IV ++DG+ + + ++ Sbjct: 353 AAERRRGLNPPPVSEKTRHVIVLLSDGDYNMGGDPTSVIKQIRGFLHIGRNHTNPREDYL 412 Query: 321 IVYAIGIRVIRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 VY + + E + AS + +++ + + AF + + T + Sbjct: 413 DVYIFAVGGTVNMENVNKIASQKSGERHAFKLKDYNDLQVAFDEMIDESETLSMC 467 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 83/218 (38%), Gaps = 44/218 (20%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 Q A LD++ VLDVS SM K+ + +++ +++ + + +V Sbjct: 187 KPQPRAPLDLVTVLDVSGSMSGH------KLALLKQAMRFVIDNLGPDD------RLSVV 234 Query: 225 TFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +FS++ L G + ++ L G TN GL+ A + RQH Sbjct: 235 SFSSEARRLTRLARMSDAGKALSVNAVESLVARG-GTNIAEGLRTAAKVL----DERQHR 289 Query: 281 NTEDANYKKIIVFMTDGENLST-----------KEDQQSLYYCNEAK-----KRGAIVYA 324 N + +V ++DG++ T + + L + A+ A ++ Sbjct: 290 NAVSS-----VVLLSDGQDTYTMMRRRGPSGVHAGNYEELVPPSFARTGADGDWSAPIHT 344 Query: 325 IGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 G + A A+ +F +EN + DAF+ Sbjct: 345 FGFGNDHDAAAMHVIAEATGGTFSFIENEAVIQDAFAQ 382 >gi|148688391|gb|EDL20338.1| mCG126758 [Mus musculus] Length = 223 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 76/224 (33%), Gaps = 30/224 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + S+ + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLAVGGPGSLLQAQEQPSCKKAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 + ++ V+ ++ + FS++ L + + ++ L T G Sbjct: 65 FVHQLTERF-VSPEMRLSFIVFSSQATIILPLTGDRYKIGKGLEDLKAVKPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 LK A QI + G++ II+ +TDG+ + +++ GA V Sbjct: 124 LKLANEQIQNAGGLKASS---------IIIALTDGKLDGLVP-SYAENEAKKSRSLGASV 173 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 Y +G+ + R S + + V+ + A I + Sbjct: 174 YCVGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSVS 214 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 30/166 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 323 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 372 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 373 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 423 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF 335 IV +TDGE+ + C EA R GAI++ I + + E Sbjct: 424 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAAREL 462 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 72/212 (33%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ +K + +L + ++P + +VTFS+K Sbjct: 81 RLELVFLVDESSSVG------QANFLSELKFVRKLLSDFPVVPTA---TRVAIVTFSSKN 131 Query: 231 EEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ S L ++I ++ G T + + A + Sbjct: 132 NVVPRVDYISSRRAHQHKCALLSQEIPAITYRGGGTYTKGAFQQAAQILRHS-------- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K+I +TDG + + G ++ GI E ++ Sbjct: 184 --RENSTKVIFLITDGYSNG----GDPRPVAASLRDFGVEIFTFGIWQGNIRELNDMAST 237 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 238 PK-----EEHCYLLHSFEEF-EALARRALHED 263 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 56.7 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGE------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ +S Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 764 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 42/307 (13%), Positives = 100/307 (32%), Gaps = 33/307 (10%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVR 112 D V A++I + + + +I + + + D Sbjct: 223 AADTDQVPDASRITPDSLPEGMRNGNDVFVSVKIDAAMPIQQIVPVTHELDIQQTSDTHA 282 Query: 113 STSLDIVVVPQNEGYSIS-AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + +L ++ + I ++ L T ++++ + + Sbjct: 283 AITLKNQSTIADKDFIIEYRLAGDDSTLASLTHRESDAEDGYVMLALQPKWSIEPTEITP 342 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++VLD S SM +I+++ + + L + N+ + ++ FSN+ Sbjct: 343 REVILVLDTSGSMN---GPAISQLRLFADHVLDHL-------NPNDEFR--VIAFSNRTT 390 Query: 232 EFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + ++ ++ L G TN P LK A +++ Sbjct: 391 AFQPDAIAATDANIQSAKQFVRGLRASG-GTNLLPALKLAL-----------GGEADESA 438 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 + +V MTD D L Y + + + A V+ I + + A + Sbjct: 439 RPRYMVLMTD---ALVGNDHSILRYLRQPEFQDARVFPIAFGAAPNDYLISRAAEMGRGF 495 Query: 347 LVENPHS 353 ++ + Sbjct: 496 SMQVTNQ 502 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 60/149 (40%), Gaps = 24/149 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 +D+++V+D S SM + +I K ++ L +++ + + V G V +S+ Sbjct: 354 PIDVVLVVDWSGSMNE-----MGRIAEVKKGVDRFLNQIE-GSGIQDSVYMGYVGYSSDG 407 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L S ++ I+ ++ T + GL+ A + + G Sbjct: 408 SNYQNKTCQLG-KFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLSTQNG--------- 457 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 458 --HKKVIVLLTDGVPTYSYHVSKVHTQAD 484 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 56.7 bits (135), Expect = 7e-06, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 80/212 (37%), Gaps = 43/212 (20%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLD+S SM TK+ + +++ +++ + + ++ FS+ Sbjct: 262 RAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSVIAFSS 309 Query: 229 KIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F L G H + + L G TN GL+ +G + + + Sbjct: 310 TARRLFPLRRMSDTGRQHALQAVNALVANG-GTNIAEGLR---------KGAKVMEDRRE 359 Query: 285 ANYKKIIVFMTDGENLSTKEDQ--------------QSLYYCNEAKKRGAIVYAIGIRVI 330 N I+ ++DG++ T S++ + A + V+A G Sbjct: 360 KNPVASIILLSDGQDTYTVSGNGGNQPQPNYQLLLPVSIHGGDNAGFQ-IPVHAFGFGAD 418 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E + DAF+ Sbjct: 419 HDASSMHSISEISGGTFSFIETEAVIQDAFAQ 450 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 77/230 (33%), Gaps = 49/230 (21%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNV 218 S N T + +VLD S SM T++D A K+ N I + + Sbjct: 52 GSTSSNDATGGDSKVAVVLDASDSMAEKDTGDGGTRMDAAKKAAN------DTIDTLADS 105 Query: 219 VQSGLVTFSNK--------------IEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGL 263 Q+ ++ + ++ I L L+ KI L G T + Sbjct: 106 AQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDLEDKINGLEPKGY-TPIGNAI 164 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGA 320 K A ++ K+ I+ ++DG + C+ A+ G Sbjct: 165 KKAAEELGSSG-------------KRNIILVSDGIDTCAPP-----PVCDVAEDIAGDGI 206 Query: 321 --IVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDI 365 ++ +G +V + C S ++ ++ ++ +A + + + Sbjct: 207 DLAIHTVGFKVDDKAQKELECISEVSGGTYTSADDTEALTEALTDAAQRV 256 >gi|326670654|ref|XP_002663415.2| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 3218 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 53/310 (17%), Positives = 104/310 (33%), Gaps = 29/310 (9%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 686 AALQYVRDSVFTASAGSRKQEGIPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 740 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V Sbjct: 741 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEP- 798 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 AR D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 799 RMARRDVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 848 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 849 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 906 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ T+ + K+ G +++AIG R S E R P+ Sbjct: 907 --VPQLLILLS-----GTRSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRVSNEPSY 959 Query: 345 FYLVENPHSM 354 V + Sbjct: 960 ALSVREFSDL 969 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 74/196 (37%), Gaps = 28/196 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + R D++ ++D S S+ + + I +++++ + PD Sbjct: 1396 TPAVPRDPLNLGRKDIIFLIDGSDSVG------QSGVAHIRDFILKVVDQLDVRPDQ--- 1446 Query: 219 VQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGV-STNSTPGLKYAY-NQIFDMQ 274 V+ LV + + + F L + IK L G + +KY N++ Sbjct: 1447 VRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMGGRGADLAEAIKYVIRNELQASA 1506 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRS 332 G+R A + +V +T G + S S+ +G+ V IGI S Sbjct: 1507 GVRL------AQASQHLVVLTGGRSTSDVSTYGSIL-------KGSRVNCIGIGAENADS 1553 Query: 333 HEFLRACASPNSFYLV 348 + ++ S + V Sbjct: 1554 RQLIQIATSSDDVLQV 1569 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S D + + + M+E + + + + + +V +S E Sbjct: 5 DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYSRDPEA 54 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L I+ L G N+ L+Y + +F + + Sbjct: 55 HFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG----VPQ 110 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ ++ + + K+ G +++AIG R S E R P+ V Sbjct: 111 LLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSYALSVS 165 Query: 350 NPHSM----YDAFSHIGKDIVT 367 + + F++I K V Sbjct: 166 DFTDLPSVQQQLFTNINKVFVA 187 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 72/186 (38%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D + + + F +I M++ V + + + V+ +V +S+ + Sbjct: 1210 DVVFLIDGTSKIRNEFP-----------AIRDMVQRVVEKLDVGLDNVRVSVVQYSDDPK 1258 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 FLL + +++ I+ L G + N+ L+Y I+ R + + Sbjct: 1259 LEFLLNEHSTKEEVRQAIRRLRNKGGNELNTGKALEYVSKTIY----QRSAGSRVEEGVP 1314 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++ +T G K + N+ K A+G R + E SP Y + Sbjct: 1315 QFLILVTGG-----KSNDDVSGPANQLKLSRVAPLAVGARDADAEELRLISFSPELAYTI 1369 Query: 349 ENPHSM 354 + + Sbjct: 1370 RDFQQL 1375 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/272 (13%), Positives = 81/272 (29%), Gaps = 26/272 (9%) Query: 96 NELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 + VN I + S ++ + + + S +P FI + S + Sbjct: 1533 GSILKGSRVNCIGIGAENADSRQLIQIATSSDDVLQVPSFPNLPNIQNKFIARLSGSIVV 1592 Query: 155 VMPIT-SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 PI D++ ++D S ++ + + +L + L Sbjct: 1593 EPPIEIDETTPGLPQAKAADIVFLVDGSINLGRNN---------FKEVMEFILNLIDLFY 1643 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQI 270 + +Q GL ++ + + F L + I G + +++ + Sbjct: 1644 TERDRLQIGLAHYATDVTDVFYLNTYNNKDDIINAITRAEYKGGREIRTGSAIRH-VQKT 1702 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ + +I++ +T G S K G VYA+G+ I Sbjct: 1703 HFVKEKGSRKDEG---IPQILMVVTGGR-----SRDDSKSAALGLKASGVRVYAVGVGDI 1754 Query: 331 RSHEFLRACASPNSFYL-VENPHSMYDAFSHI 361 L S + + + I Sbjct: 1755 EDE--LNNLGSEATTVARASTFQELSELNEQI 1784 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 34/272 (12%), Positives = 83/272 (30%), Gaps = 24/272 (8%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 + I + ++ V Y++S + + N+ + + + Sbjct: 934 GVLIFAIGSRSSDSQ-ELQRVSNEPSYALSVREFSDL-SNVQQKLLSNINTALVEVTSLT 991 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + D++ ++D S + F I ++ + + L N ++ Sbjct: 992 TTVTVESEGPKKDIIFLIDGSEDVGREFPI----IQEFVRRVVSNLNV------GENKIR 1041 Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMR 277 G+V + + L + IK L + G N + + ++ Sbjct: 1042 IGVVQYGDIPNADIYLNSHRTKEGVLNGIKELRQLRGTKRNLGQAIDFVRREVLASG--- 1098 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ + +V ++ G K K+ G +IG R + + E Sbjct: 1099 -RGGRKEEGVPQFVVVVSGG-----KATDNIRQSATALKQSGIFPLSIGTRDVGTQELQV 1152 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 P YLV++ +Y + + Sbjct: 1153 TSYVPRFAYLVDDLPGLYTIQETLINTLTELS 1184 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 70/188 (37%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + + + + + +++ ++D S SM+ ++ + + Sbjct: 206 LAPSPWNAQRTLLKVDVQAR-DMQVRDLPPANLVFLIDTSGSMQP-----AERLPLIQSA 259 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ +++ ++ V +G + + ++ I L +G ST Sbjct: 260 LKLLVNDLRAQDNITIVTYAG----GTHVALASTAGNNTTAIKAAIDNLDAYG-STGGEA 314 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ NL + + + +++G Sbjct: 315 GLRLAYEQ------AEKGFIKGGVNR---ILLTTDGDFNLGITDPKDIEALVKKEREKGI 365 Query: 321 IVYAIGIR 328 + +G+ Sbjct: 366 TLSTLGVG 373 >gi|71898007|ref|ZP_00680212.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] gi|71732251|gb|EAO34306.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] Length = 941 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 56/184 (30%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM+ + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQMAPSVAVYFAIDLSGSMDYAGSNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F + + L G++ L+ + F T P A Sbjct: 200 VDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 259 ---------------ASSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGITI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 73/206 (35%), Gaps = 30/206 (14%) Query: 168 TDARLDMMIVLDVSRSME--SFFDSSITKID--MAIKSINAMLEEVKLIPDVNNVVQSGL 223 D+++++D+S SM+ F D+S +D A+K++ + ++G+ Sbjct: 90 DQPTRDLLLLVDLSGSMDTKDFTDASGRTVDRLTAVKAVLDDFLSRRKGD------RAGV 143 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V F + F L + L + L ++ G+ + D G+ Sbjct: 144 VVFG---DAPFALVPFTTDLDLCREMLR------DTVVGMAGPRTALGDAIGLGIALFDR 194 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFL 336 K I+ +TDG T AK +G +++ + I + L Sbjct: 195 STVKAKTIIALTDG--NDTASQVPPTEAAGVAKDKGIVIHTVAIGDPSTVGEDKLDETAL 252 Query: 337 R--ACASPNSFYLVENPHSMYDAFSH 360 + A A+ F+ + + + Sbjct: 253 KDVASATGGGFFRALDRDELARIYGR 278 >gi|32476015|ref|NP_869009.1| inter-alpha-trypsin inhibitor domain-containing protein [Rhodopirellula baltica SH 1] gi|32446559|emb|CAD76394.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 764 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/307 (13%), Positives = 100/307 (32%), Gaps = 33/307 (10%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVR 112 D V A++I + + + +I + + + D Sbjct: 223 AADTDQVPDASRITPDALPEGMRNGNDVFVSVKIDAAMPIQQIVPVTHELDIQQTSDTHA 282 Query: 113 STSLDIVVVPQNEGYSIS-AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + +L ++ + I ++ L T ++++ + + Sbjct: 283 AITLKNQSTVADKDFIIEYRLAGDDSTLASLTHRESDAEDGYVMLALQPKWSIEPTEITP 342 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++VLD S SM +I+++ + + L + N+ + ++ FSN+ Sbjct: 343 REVILVLDTSGSMN---GPAISQLRLFADHVLDHL-------NPNDEFR--VIAFSNRTT 390 Query: 232 EFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + ++ ++ L G TN P LK A +++ Sbjct: 391 AFQPNAVSATDANIQSAKQFVRGLRASG-GTNLLPALKLAL-----------GGEADESA 438 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 + ++ MTD D L Y + + + A V+ I + + A + Sbjct: 439 RPRYMILMTD---ALVGNDHSILRYLRQPEFQDARVFPIAFGAAPNDYLISRAAEMGRGF 495 Query: 347 LVENPHS 353 ++ + Sbjct: 496 SMQVTNQ 502 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/269 (15%), Positives = 98/269 (36%), Gaps = 24/269 (8%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQ 167 + + +L + + Y S+ + P T + + +++ + + Sbjct: 171 LPQKDTLRVEEMLNYFSYDYPLPSKNEAPFSVTTELAPSPYNDDMMLLRIGLKGYEQSKA 230 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ +LDVS SM +S K+ + ++ + +++ V+ VV +G Sbjct: 231 ELGASNLVFLLDVSGSM-----ASPDKLPLLQTALKMLTQQLGAQDKVSIVVYAGAAG-- 283 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L ++ LS G STN G++ AY ++H N Sbjct: 284 --VVLDGAAGNDSQTLNYALEQLSA-GGSTNGAQGIQLAYQL------AKKHLVEGGINR 334 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSF 345 ++ TDG+ N+ T + + + K+ G + +G + ++ + A + Sbjct: 335 ---VILATDGDFNVGTTNLDELIDLVSAQKQLGIGLTTLGFGMGDYNDHLMEQLADKGNG 391 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 S+ +A + + + + K Sbjct: 392 QYAY-IDSLNEARKVLVEQLSATLLTIAK 419 >gi|291569722|dbj|BAI91994.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 213 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 32/132 (24%), Positives = 59/132 (44%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +VLD S SM I+ + I A++ E++ P ++TF + Sbjct: 5 RLPVYLVLDCSGSM------CGEPIEAVNQGIKALVAELQSEPYAIETAYLSVITFESTA 58 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ F L L + + G +T+ LK Q FD + +++ +T+ ++K + Sbjct: 59 QQVFPLT----ELMKFQPPVLSAGGTTSLGDALKL-LTQCFD-KEVKKASDTQKGDWKPL 112 Query: 291 IVFMTDGENLST 302 + MTDG T Sbjct: 113 VFLMTDGMPTDT 124 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 59/163 (36%), Gaps = 20/163 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+++ +D+S SM + + + ++ ++ + + + L+ F++ Sbjct: 79 QSSRDLILAVDLSDSMRTQDMLDEGEQKDRLSAVRQQIDRLIAARPGD---RIALIVFAD 135 Query: 229 KIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + L + L G +T + A RQH E + Sbjct: 136 HAYLLSPLTQETNALLGLTRELDFELVGRTTALGEAILLA----------RQHKAPERST 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++ +TDG N + D L +A G +Y +G+ Sbjct: 186 A---LLLVTDGRNTAGSADP--LREAKQAAAAGIRLYTLGVGA 223 >gi|73970245|ref|XP_855427.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 555 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 58/156 (37%), Gaps = 13/156 (8%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGM 276 ++ + FS + L +++ ++ L K G T G + A QI+ Sbjct: 69 LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQ 128 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +I+ +TDGE L S N ++ GAIVY +G++ + Sbjct: 129 GYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLA 180 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + + V + + A I I+ K Sbjct: 181 RIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 213 >gi|325108274|ref|YP_004269342.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968542|gb|ADY59320.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 1584 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 83/277 (29%), Gaps = 49/277 (17%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 Q G+ ++ +PL S I ++ V V + +M+VLD S+ Sbjct: 1013 QFRGHEVATSLSLSVPL-------GQRLSAAIADETSAQVTVIDSQAKPVSVMVVLDCSQ 1065 Query: 183 SMESFFD-----SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI----EEF 233 SM +TK D AI + N +L + +P+ + GLV + +++ Sbjct: 1066 SMNQTIPWEASQQDLTKFDAAITATNQILNALSRLPNA----RVGLVLYGHRVGWNTASP 1121 Query: 234 FLLEWGVSHLQRKIKYLSKFGVST------NSTPGLKYAYNQI-------------FDMQ 274 + + L + + + A ++ ++ Sbjct: 1122 SQILRQTGYANPIPDSLMPYEDTEIVLPFGRFARAERDAVEKLLKTVKPWGETPLYLSIR 1181 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-----GAIVYAIGIRV 329 N++ K+ IV +TDG N + + + GI Sbjct: 1182 NALNEMNSKSLGGKQQIVVITDGVNKQLNPSASAYVSLSSLLNENFGQTSVNILGFGIDP 1241 Query: 330 IRSHEFLR-----ACASPNSFYLVENPHSMYDAFSHI 361 S R A + + + + + Sbjct: 1242 NESQTAARQFEQLASRTGGEYVEINDAGRLLRKVDDY 1278 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 81/222 (36%), Gaps = 29/222 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I++ + +T L++ ++LD S SM + +++ ++ +++ + Sbjct: 21 SQRQLAISISAGAEPQDRT-VPLNLCLILDHSGSM------NGRPLEIVKQAAIRLVDRL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 K + +V F ++ + + ++++I L+ G T GL+ Sbjct: 74 KTGD------RLSVVAFDHRAKVLVPNQVIDNPEQIKKQINRLAADG-GTAIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +TDGEN ++ + L + A + +G Sbjct: 127 EELAKGKKETISQA----------FLLTDGENEH-GDNNRCLKFAQLAAGYNLTLNTLGF 175 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + + L A S ++ D F + I T Sbjct: 176 GDNWNQDVLEKIADAGLGSLSYIQKAEQAVDEFGRLFSRIQT 217 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 77/197 (39%), Gaps = 34/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM SS +++ ++ L ++ V N G+V F + I Sbjct: 253 VCLVLDKSGSM-----SSSNRLNRMNQAAKYFLMQI-----VENGSWVGMVHFDSTASIR 302 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + G + + + L T+ G++ A+ + + + H + + Sbjct: 303 SDLIQITGSNERDKLLGSLPTTASGGTSICSGIRRAFEVV---RKLYSHTDGSE------ 353 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 IV +TDGE+ + C +E K+ GAI++ I + + + A+ + Sbjct: 354 IVLLTDGEDN-------TAGACVDEVKQSGAIIHFIALGPSADKAVIEMSTATGGVHFYA 406 Query: 349 ENPHS---MYDAFSHIG 362 + + DAF + Sbjct: 407 TDEAENNGLIDAFGALA 423 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 65/153 (42%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +D++ VLD S SM S TK I+++N M +++ P ++ + Sbjct: 343 IGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLD--I 400 Query: 220 QSGLVTF--------SNKIEEF--FLLEWGVSHLQRKIK-YLSKFG-VSTNSTPGLKYAY 267 + G+V F +++ F L ++ L L++ T T GLK Y Sbjct: 401 RIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 460 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 461 ETLYKDNG------GENRNPEKILIVVGDGTPT 487 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 70/200 (35%), Gaps = 35/200 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +V+DVS SM+ ++ + +L V+L+P+ + ++G+ TF + Sbjct: 50 DVRLVIDVSGSMKRNDPNN------LRQPAVDLL--VQLLPEGS---RAGVWTFGKWVNM 98 Query: 233 FFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q K ++ G+ TN L+ A A ++ Sbjct: 99 LVPHRDVTDPWRATAQAKASEINSVGLFTNIGEALEKA----------TFEGADGGAEFR 148 Query: 289 KIIVFMTDG--------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-- 338 K I+ +TDG E + + + K+ G V+ I + L Sbjct: 149 KSIILLTDGMVDIDKSPEQNKREWRRIADEVIPRLKEAGVTVHTIALSANADTNLLNKIS 208 Query: 339 CASPNSFYLVENPHSMYDAF 358 A+ + + + F Sbjct: 209 LATGGMAEVAHSADDLMRIF 228 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ A + + ++K + Sbjct: 79 GTKIETVKREGVDIVFAIDVSKSMLAE-DVAPNRLEKAKRIAFETISQLKGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V ++ L S + ++ ++ T ++ A N Sbjct: 131 VGIVAYAASAYPQLALTTDHSAAKMFLQDMNTDMLSSQGTAIQEAIRMASNYF------- 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 E+ +++ +TDGE+ + + EA+++G +Y IGI + Sbjct: 184 ----DENTPTARLLFILTDGEDH----EMGATEIATEAQEKGVHIYTIGIGTEKG 230 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 30/166 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF 335 IV +TDGE+ + C EA R GAI++ I + + E Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAAREL 448 >gi|149920875|ref|ZP_01909337.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] gi|149818274|gb|EDM77727.1| hypothetical protein PPSIR1_38721 [Plesiocystis pacifica SIR-1] Length = 367 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 74/236 (31%), Gaps = 41/236 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++V+V++ +++++D S SM F + ++ + V + + + Sbjct: 76 CANVEVDTSPTTP-TVVLLVDQSGSMWDDFGGQP-RWVALENTLFDPVNGV--VKPLEDQ 131 Query: 219 VQSGLVTFSN---------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 V+ GL +S+ + F +G + T + +K Sbjct: 132 VRFGLALYSSMNGSFGGECPLITEFAPSFGNHASLAATFASAMPLDDTPTGDSIKAVAET 191 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL-----STKEDQQ-SLYYCNEAKKRGAIVY 323 + KIIV TDGE +E Q SL A G + Sbjct: 192 LAAFPEDG----------PKIIVLATDGEPDTCAVPDPQEGQPLSLEATQAAFDDGIRTF 241 Query: 324 AIGIRVIRSHEFLRACASPN------------SFYLVENPHSMYDAFSHIGKDIVT 367 I + + L+ A+ FY NP + AF + + Sbjct: 242 VISVGNQVTDAHLQELANAGVGLPTQGAVENAPFYKTLNPAELVSAFEAVINGFIG 297 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 24/164 (14%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ IVLD+S SM + D A+ ++ A + + LV F ++ Sbjct: 92 DLAIVLDLSGSMVRDDFDLDGRQVTRREAVATVGADFARRRGGD------RVALVVFGSE 145 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + V + R+I+ G +T+ + GL A ++ + + Sbjct: 146 AYFAAPFSFDVEAIARQIESAQIGVSGRATSISDGLGIALKRMENSEAAS---------- 195 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ ++DG N + + + A + G V+ I + Sbjct: 196 -RVVILLSDGVNNAGATNPRG--VAELAAQMGVRVHTIALGPKD 236 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 44/196 (22%), Positives = 81/196 (41%), Gaps = 27/196 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L + +VLD S SM ID A ++ A+++ + D + L TFS+ Sbjct: 500 PHLSVHLVLDTSGSMAG------APIDSARRAAQALVDRLAPADDFS------LTTFSSD 547 Query: 230 ---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E + + ++R I+ L + G TN GL Y Q R + Sbjct: 548 AEVVIEDGPVGPRRAAIRRAIEGLRE-GGGTNIGAGLSLGY-----AQASRPGI---PED 598 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS- 344 ++++ ++DG S + L + +A +RG A+G+ + + A AS + Sbjct: 599 AVRVVLLVSDGRATSGLTHSERLAWLALDAFQRGIQTSALGLGDDFDGQLMSAIASDGAG 658 Query: 345 -FYLVENPHSMYDAFS 359 +Y + +P + A S Sbjct: 659 GYYYLRHPEQIAPALS 674 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 11/79 (13%), Positives = 27/79 (34%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 + L++R + G +L AI I + I+ + +F + S +D + + + Sbjct: 5 TFRLHLRRLVRDRDGNFAVLGAIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSV 64 Query: 63 TQIMNEGNGNNRKKLKGGD 81 ++ Sbjct: 65 RAYGEGATEDDAYDAAQKF 83 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 65/153 (42%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +D++ VLD S SM S TK I+++N M +++ P ++ + Sbjct: 370 IGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLD--I 427 Query: 220 QSGLVTF--------SNKIEEF--FLLEWGVSHLQRKIK-YLSKFG-VSTNSTPGLKYAY 267 + G+V F +++ F L ++ L L++ T T GLK Y Sbjct: 428 RIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 487 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 488 ETLYKDNG------GENRNPEKILIVVGDGTPT 514 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 77/225 (34%), Gaps = 28/225 (12%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 NS+ + S++ L++ ++LD S SM + ++ K+ N ++ Sbjct: 17 NQLNSQRQLGISISAIAETQDRHVPLNLCLILDHSGSM------NGRSLETVKKAANRLV 70 Query: 207 EEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + + +V F ++ + ++ +I L+ G T GL+ Sbjct: 71 DRLNPSD------RLSVVVFDHRAKVLVPSQSVEDPEKIKNQINRLAADG-GTAIDEGLR 123 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + +TDGEN ++ + L + A + Sbjct: 124 LGIEELAKGKKDTVSQA----------FLLTDGENEH-GDNNRCLKFAQLAASYNLTLNT 172 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 +G + + L A + ++ P D F+ + I T Sbjct: 173 LGFGDNWNQDVLEKIADAGLGTLSYIQKPEEAVDEFNRLFSRIQT 217 >gi|319954909|ref|YP_004166176.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319423569|gb|ADV50678.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 703 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 47/313 (15%), Positives = 102/313 (32%), Gaps = 45/313 (14%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 +I N+ + + +T+++ N + + + +D V + + Sbjct: 235 KITNDEKYAQLTENPFKRLALSPLSTFSIDVDKAGYSN--IRRLINAGSQIPIDAVKIEE 292 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS----------QTDARLD 173 Y Y P + P+ N P ++ K+ + + Sbjct: 293 MINY-----FDYDYP-QPLGEHPFSINLEAAKTPWNAATKIVRIGLQGKEYLNEDLPASN 346 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++DVS SM + K+ + + ++ +++ V+ VV +G Sbjct: 347 LTFLIDVSGSMN-----NQNKLPLLKSAFKLLVHQLREKDRVSIVVYAGAAG-------V 394 Query: 234 FLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L G + ++ IK L G ST G++ AY + N + + Sbjct: 395 VLEPTGGNDKEKIIKALDNLSAGGSTAGGEGIELAY----ALAEKNFKPNKNNR-----V 445 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACAS--PNSFYL 347 + TDG+ N+ D+ E +K G + +G + + L A + Sbjct: 446 IMATDGDFNVGASSDKDMETLIEEKRKSGIFLSVLGFGMGNYKDSKLEKLADKGNGNHAY 505 Query: 348 VENPHSMYDAFSH 360 ++ F Sbjct: 506 IDTMQEAQKIFGE 518 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 76/197 (38%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM S+ ++ ++ L ++ V G+VTF + Sbjct: 307 VCLVLDKSGSM-----SNGDRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAHVQ 356 Query: 234 FLLE--WGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G + K L T+ GL+ A+ I R+ +T+ + Sbjct: 357 SELVQINGATERDALTKSLPTVASGGTSICSGLRSAFAVI------RKKFSTDGSE---- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GA+++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAQNNGLIDAFGALS 476 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 80/242 (33%), Gaps = 38/242 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N +G + + +P+ ++ + E SH + L ++++ + +T+ N Sbjct: 17 FIKNRQGAILLSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE--NNYR 74 Query: 71 GNNRKKLKGGDILCRIKNTW--NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + ++ ++ + F N + + + SL + Y Sbjct: 75 NDRASNNRNNYLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQM-------NYQ 127 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES-- 186 ++ ++ Y LK W N SS+ +D++ V D S SM+ Sbjct: 128 LALLNSY---LKQTPSPTWDVNENGAARKYLSSI------AEPIDVVFVTDFSGSMDLPF 178 Query: 187 ---FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 ++ ITK+D + + +N G V WG + Sbjct: 179 GDIERNNRITKLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRI 225 Query: 244 QR 245 Sbjct: 226 SA 227 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 407 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 466 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 467 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 497 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 85/250 (34%), Gaps = 30/250 (12%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVL 178 Q S++A ++ +F + + + V + + + +D++ V+ Sbjct: 12 ATAQAPNSSVAARLTFESTSEFVAYSRGQRHKINSVFHVQAPEYQGEEAARGAVDIVAVI 71 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM S +K+D+A ++ +++ + GLV + + + F L Sbjct: 72 DVSGSM------SGSKLDLAKATLEFLIKNLSQTDH------MGLVVYHSDVSVAFPLTR 119 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G + L + S +F M Q A+ ++ M Sbjct: 120 MDAEGKRTATAALSTLRAQRCTNLSGG--------LFKGIEMMQGRERSAASVSS-VLLM 170 Query: 295 TDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASPNS--FYLVEN 350 TDG N + +Y G E L+ + + +Y +EN Sbjct: 171 TDGIANEGVRGPNLITATRQLMGDNPSYSLYTFGYGSNHEEELLKDLSEVGNGMYYYIEN 230 Query: 351 PHSMYDAFSH 360 ++ ++F Sbjct: 231 NDTIPESFGD 240 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 75/202 (37%), Gaps = 23/202 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + D++ VLD S S+ N +L LI + Sbjct: 89 RRSVGDNIFYDVVFVLDSSASVG------------VKDYKNGILALQTLITRAKEDTRYA 136 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +TFS + F + ++ + ++ TN+ L Q++ + Sbjct: 137 GITFSTEANITFYFTDPLDAMK-GLGGITYAPGMTNTQAALDICRTQLWLNKKSGFR--- 192 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACA 340 ++K+I++ +TDG++ E ++LY + K G ++ + + E + + Sbjct: 193 -RLSFKRILI-VTDGQSNINME--RTLYNAFQLKNMGIEIFVVAVGKYLRGIAEIVGLAS 248 Query: 341 SPNSF-YLVENPHSMYDAFSHI 361 S ++ Y V N + + I Sbjct: 249 STDAHLYRVRNLRGLLEVVHLI 270 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 46/307 (14%), Positives = 99/307 (32%), Gaps = 51/307 (16%) Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY-SISAI 132 + D ++ + G N + T ++ ++ Y + + + Sbjct: 174 EPSVFNDDESLDLQPAFAERSSGNKIAAGH-NAGKIVEIKTYPEVSAASRSNSYDNFTVL 232 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 K P + S +++ A +D++ VLD+S SM Sbjct: 233 VHLKAGATVARENPR--------GNLASLPQLSQTPRAPVDLVTVLDISGSMAG------ 278 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIK 248 TK+ + +++ +++ + + ++ FS+ F L G H + + Sbjct: 279 TKLALLKRAMGFVIQNLGSND------RLSVIAFSSTARRLFSLRRMSDAGRQHALQAVN 332 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 L G TN GL+ +G + + N I+ ++DG++ T Sbjct: 333 SLVANG-GTNIAEGLR---------KGAKVMEERREKNPVASIILLSDGQDTYTVSGSSG 382 Query: 309 LYYCNEAK-----------KRG--AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHS 353 + G V+A G + + + S +F +E Sbjct: 383 NQPQPNYRLLLPLSIHGGDNAGFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAV 442 Query: 354 MYDAFSH 360 + DAF+ Sbjct: 443 IQDAFAQ 449 >gi|149028000|gb|EDL83451.1| complement factor B, isoform CRA_d [Rattus norvegicus] Length = 543 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 81/236 (34%), Gaps = 34/236 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ S++ +++ +VLD S S+ + S+ T A + + ++E+V Sbjct: 34 VSCSLEGVEIKGGSMNIYMVLDGSDSIGA---SNFT---GAKRCLANLIEKVAS---YGV 84 Query: 218 VVQSGLVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + GLVT++ E +W L + K TN+ L+ Y+ Sbjct: 85 KPRYGLVTYATVPKVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYS- 143 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRG 319 + G + +I+ MTDG + + + + ++ Sbjct: 144 MMSWPGDAPPEGWNRTRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDY 201 Query: 320 AIVYAIGIR----VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 VY G+ + + + + V++ + + F + + + + Sbjct: 202 LDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSLGLC 257 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 72/190 (37%), Gaps = 21/190 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW T + + + + V +++ ++DVS SM S K+ + ++ + Sbjct: 248 PWNTAHQLVRIALKGK-DVAKDNLPPSNLVFLIDVSGSM-----SDAKKLPLVKQAFKLL 301 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + +++ + V VV +G + + + + L G ST G++ Sbjct: 302 VNQLRPVDRVAIVVYAGAAG----LVLPSTSGDHKTAILDALDKLEA-GGSTAGGEGVQL 356 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY + ++ N ++ TDG+ N+ D + + +++G + Sbjct: 357 AYKTATEY-LLKSGNNR--------VIIATDGDFNVGPSSDGELQRIIEKKREKGIFLSV 407 Query: 325 IGIRVIRSHE 334 +G + + Sbjct: 408 LGFGMGNYKD 417 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 61/166 (36%), Gaps = 30/166 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 + +VLD S SM+ ++ ++ L ++ + GLVTF + I+ Sbjct: 309 VCLVLDKSGSMDKE-----DRLIRMNQAAELYLTQIVEKESM-----VGLVTFDSAAHIQ 358 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + S Q+ L + T+ GL+ + I Sbjct: 359 NYLIKITSSSDYQKITANLPQQASGGTSICHGLQAGFQAITSSDQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF 335 IV +TDGE+ + C EA R GAI++ I + + E Sbjct: 410 IVLLTDGEDNGIRS-------CFEAVSRSGAIIHTIALGPSAAREL 448 >gi|95147674|ref|NP_001035616.1| complement factor B precursor [Bos taurus] gi|146345391|sp|P81187|CFAB_BOVIN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=EC-VMFB; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|86438491|gb|AAI12505.1| Complement factor B [Bos taurus] gi|296474252|gb|DAA16367.1| complement factor B precursor [Bos taurus] Length = 761 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 79/221 (35%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + N + + ++ + + GLVT++ + + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKPKYGLVTYATEPKV 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L + K TN+ L YN + + + Q T + Sbjct: 321 LIRVSDPKSSEADWVTDQLNQINYADHKLKAGTNTKRALLEVYNMM--SREVNQFKETWN 378 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYC----------NEAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG + + ++ ++ +Y G+ + + E Sbjct: 379 RT-RHVIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDIYVFGVGPLVNQE 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS + ++ ++ D F + + T + Sbjct: 438 NINALASKKDKEKHVFKLQGMENLEDVFVQMLDESRTLGLC 478 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 72/226 (31%), Gaps = 54/226 (23%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ ++++V+D S SM+ T++++A ++ L+E + GL Sbjct: 404 AELRKPANVLLVIDTSGSMQESVPGTGSTRLELAKEAAITSLDEFSDSD------RVGLW 457 Query: 225 TFSNKIEEFFLLEW-----------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 FS +E+ +W L +I L G T A Sbjct: 458 MFSTDLEDNGQ-DWRELVPLGPLGASVNGTPRREELAERISNLPPGGG----TGLYDTAL 512 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-------DQQSLYYCNEAKKRGA 320 + + +VF+TDG+N D + + G Sbjct: 513 AAHTLVAEHSRPDAINA------VVFLTDGKNEDLNGISLEKLLDSITPEPGQQ----GV 562 Query: 321 IVYAIGIRVIRSHEFLRAC-----ASPNSFYLVENPHSMYDAFSHI 361 ++ I L+ A+ + Y +P S+ + F + Sbjct: 563 RIFTISYG---EDADLKTMTQIAEATNAAAYDASDPQSIDEVFEAV 605 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 25/155 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P N + M + + V + +++V+D S SM+ F ++ A + NA Sbjct: 180 PNIANKWTVKMLVAARDSVKTSK-----IVLVIDTSGSMKDFG-----RMKGAKNAANAF 229 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 ++ V D + Q G+V F++ + + L I LS G T + G+K Sbjct: 230 VDNVL---DGSQSTQIGIVRFASNVSIVSDFTSNKAKLHSAINALSAEG-GTFTQAGVKQ 285 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 A + KK +V ++DG Sbjct: 286 ARTMLAGSGAD-----------KKYMVVLSDGVPT 309 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 72/207 (34%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 373 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 426 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ ++E F ++ I+ + Sbjct: 427 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRMEFSFTDYNTKENVLAVIRNIRYMS 485 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F +R+ N +V +TDG+ Sbjct: 486 GGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPAAA 532 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 533 AHDAGITIFSVGVAWAPLDD-LKDMAS 558 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 65/153 (42%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +D++ VLD S SM S TK I+++N M +++ P ++ + Sbjct: 212 IGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLD--I 269 Query: 220 QSGLVTF--------SNKIEEF--FLLEWGVSHLQRKIK-YLSKFG-VSTNSTPGLKYAY 267 + G+V F +++ F L ++ L L++ T T GLK Y Sbjct: 270 RIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 329 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 330 ETLYKDNG------GENRNPEKILIVVGDGTPT 356 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/261 (16%), Positives = 91/261 (34%), Gaps = 41/261 (15%) Query: 118 IVVVPQNEG--YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT------- 168 +V ++ Y++ + F + I + ++ + Sbjct: 95 VVNSKNHKKQIYNLEKGLVLDVKTLHKHFQFNKNQDQTIPVMVSVKTLDQTNDMEVESNP 154 Query: 169 ---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D S SM S +KI+ +I +LE + + L+T Sbjct: 155 LEGRPNLDLICVIDNSGSM-----SGCSKIENVKNTILQLLEMLNEND------RLSLIT 203 Query: 226 FSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F+ K ++ L+ LQ K + G T+ T GL+ A+ + + Sbjct: 204 FNTKAKQLCGLKKVNNQNKESLQTITKSIKADG-GTDITSGLEIAFQIL------QSRKQ 256 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + I ++DG++ ++L + ++ +++ G ++ Sbjct: 257 KNSVSS---IFLLSDGQDDGADIKIKNLLKTTYQQLQEESFTIHSFGFGNDHDGPLMQKI 313 Query: 340 AS--PNSFYLVENPHSMYDAF 358 A SFY VE + + F Sbjct: 314 AQIKDGSFYFVEKNDQVDEFF 334 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 69/172 (40%), Gaps = 19/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ A + ++ ++ E+ + Sbjct: 79 GTKLETVKREGVDIVFAVDVSKSMLAE-DIAPNRLEKAKRLVSEIINELASD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + + + ++ ++ +S + + A + Sbjct: 131 VGIIAYAAQAFPQLPITTDYGAAKMFLQSMNTDMLS-SQGTAIHEAIELAATYFDDEEQT 189 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N +I+ ++DGE + Q + +A +G ++ IG+ R Sbjct: 190 N-------RILFLVSDGE---DHAEDQVMDAIEQATDQGIRIFTIGVGSARG 231 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 65/153 (42%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +D++ VLD S SM S TK I+++N M +++ P ++ + Sbjct: 213 IGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLD--I 270 Query: 220 QSGLVTF--------SNKIEEF--FLLEWGVSHLQRKIK-YLSKFG-VSTNSTPGLKYAY 267 + G+V F +++ F L ++ L L++ T T GLK Y Sbjct: 271 RIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 330 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 331 ETLYKDNG------GENRNPEKILIVVGDGTPT 357 >gi|326789198|ref|YP_004307019.1| hypothetical protein Clole_0061 [Clostridium lentocellum DSM 5427] gi|326539962|gb|ADZ81821.1| Protein of unknown function DUF3520 [Clostridium lentocellum DSM 5427] Length = 670 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 42/287 (14%), Positives = 96/287 (33%), Gaps = 24/287 (8%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDI 110 + A + N + N+ + +T+++ N ++ Sbjct: 46 ESATADIEETAPIVFNSESYNSFTENPFIKTTDETFSTFSIDVDIASYSNVRRFITNQEL 105 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD- 169 ++ + YS IP + + + + + Q D Sbjct: 106 PPVDAIRTEELINYFNYSYP-EPTDNIPFSLSQEMMPCPWNESSQLLLIGLQGKHLQPDE 164 Query: 170 -ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ +LDVS SM S K+ + KS N + +K ++ VV +G + Sbjct: 165 VPPSNLVFLLDVSGSM-----SDTNKLPLLKKSFNILTSNLKESDCISIVVYAGA----S 215 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + S + ++ L G ST G+ AY +H + N Sbjct: 216 GVVLDGVAGNDESLINEALESLEA-GGSTAGAEGIAMAYEL------AEKHFIKDGNNR- 267 Query: 289 KIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ + + + +++G + +G+ + + Sbjct: 268 --VILATDGDFNVGPNSESDLIRIIEKKREKGIFLSVLGLGMGNYKD 312 >gi|170591600|ref|XP_001900558.1| Immunoglobulin I-set domain containing protein [Brugia malayi] gi|158592170|gb|EDP30772.1| Immunoglobulin I-set domain containing protein [Brugia malayi] Length = 6163 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 104/296 (35%), Gaps = 48/296 (16%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT---- 149 RN+ + + F+ D+ + +V N I+ + + Sbjct: 5341 LRNDTKVHIFLVDVGNQGDKAQNLAIVGESNPHRIIAIDGWHAMRPDILNPFVNELCKLL 5400 Query: 150 -------------NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 +R M I + +++ ++ D + D++ VLD S ++ S + Sbjct: 5401 PQKQDKASRDSTWPTRQTEMRIATPIRICNRVDFQADIIFVLDSSDNVTSKEYVN----- 5455 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFG 254 + I+ +++ + ++V+ G + +S+K L + + +Q I + G Sbjct: 5456 -LKEDISMLID--DIFDLSPDIVRIGFIEYSDKASVPVPLGYYDNKVQLLADISNSEQLG 5512 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + GL A Q ++H +I++ +T G N ++ Sbjct: 5513 GTPVIVRGLHAAKEQF------KRH---GRNGVSRILLLVTSGANRGN-----VATAADD 5558 Query: 315 AKKR-GAIVYAIGIRVIRSHEFLRACASPNSF-----YLVENPHSMYDA-FSHIGK 363 +KR V+A+ + R + + + + + + + + HIG+ Sbjct: 5559 LRKRLKVSVFALVVNTSRGAQMMLNRLTGDEHTQRRVISISSASKLQERELLHIGQ 5614 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 53/164 (32%), Gaps = 23/164 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM+++LD S S+ + + I L N V+ G++ + +K+E Sbjct: 5219 DMLLILDSSSSVR------VVDYRIMKDFIKKFLT--NHFNLQRNYVRVGVMKYGDKVEI 5270 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L +I + + N L A + + +I Sbjct: 5271 PISLGDYDSQTELLSRISETRRMRGNANLGQALLDA---------SGEFLIFGSKDIPRI 5321 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ ++G + ++ L + ++ + + Sbjct: 5322 IIIFSNGRSRGELKENALLLRNDT----KVHIFLVDVGNQGDKA 5361 >gi|119890597|ref|XP_001256059.1| PREDICTED: alpha 3 type VI collagen, partial [Bos taurus] Length = 1632 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 47/332 (14%), Positives = 111/332 (33%), Gaps = 47/332 (14%) Query: 53 MIDRSLVHAATQI---------------MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 +D S ++ + + EG + GG L + R+ Sbjct: 516 ALDGSALYTGSALDFVRNNLFTEAAGYRAAEGVPKLLVLVTGGKSLDAVSQPAQELKRSG 575 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + N + D ++ + + +A PL+ +P Sbjct: 576 ILAFAVGNKVAD-----QAELEEIAFDSSLVFTATEFRPAPLQ--GVLPGLLGPLR---T 625 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +T + +V D++ +LD S ++ T + ++ + + ++ Sbjct: 626 LTGTTEVRVNKR---DIIFLLDGSSNVGE------TNFPYVRDFVMNLVNSLDV---GSD 673 Query: 218 VVQSGLVTFSN-KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ + EF L + S L ++ + G S +T A + + Sbjct: 674 HIRVGLVQFSDTPVTEFSLNTYPTKSELLAHLRQMQLQGGSVLNTGA---ALSYVHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ +T G+ + L N + G + + +G E Sbjct: 731 TEAGGSRIQDHVPQLLLLLTAGQ-----SEDSYLQAANALARAGILTFCVGTSQADRAEL 785 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 +P YL+++ S+ + + + T Sbjct: 786 EEIAFNPGLVYLMDDFSSLPALPQQLIQPLTT 817 Score = 42.8 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 81/256 (31%), Gaps = 31/256 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-- 172 ++ + + I++ + + ++V + SS+ + Sbjct: 173 AIGVEDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSSMAPERAGGTEIPK 232 Query: 173 --------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE + + ++ G+V Sbjct: 233 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLERLSI---GTQQIRVGVV 283 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN 281 +S++ F L + + +K L G N L + F R + Sbjct: 284 QYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVENHFT----RAGGS 339 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +++V ++ G + D K+ + +G + E + Sbjct: 340 RAEEGVPQVLVLISAGPSSDEIRDGVIA-----LKQASVFSFGLGAQAASKAELQHIATN 394 Query: 342 PNSFYLVENPHSMYDA 357 N + V S+ D Sbjct: 395 DNLVFTVPEFRSLGDV 410 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 89/255 (34%), Gaps = 29/255 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + + I + P+ Y S + ++P K T + T + + ++ Sbjct: 1369 ARNVDQEELVKISLSPE---YVFSVNTFRELPSLEQKLLTPLTTLTAGQIQQLLASTRYP 1425 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1426 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGV 1476 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQH 279 + FSN + F L+ S I+ L G S N+ L++ A N G R Sbjct: 1477 LQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR-- 1534 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ED + +++F+ K Y K G +G R I E Sbjct: 1535 --IEDGVPQHLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTIT 1586 Query: 340 ASPNSFYLVENPHSM 354 + P + V + Sbjct: 1587 SDPRLVFTVREFRDL 1601 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 76/197 (38%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM S+ ++ ++ L ++ V G+VTF + Sbjct: 307 VCLVLDKSGSM-----SNGDRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAHVQ 356 Query: 234 FLLE--WGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G + K L T+ GL+ A+ I R+ +T+ + Sbjct: 357 SELVQINGATERDALTKSLPTVASGGTSICSGLRSAFAVI------RKKFSTDGSE---- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GA+++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAQNNGLIDAFGALS 476 >gi|282900974|ref|ZP_06308907.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] gi|281194065|gb|EFA69029.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] Length = 487 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 65/203 (32%), Gaps = 28/203 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T +++++D S SM K+ + + ++ L V Q +V F Sbjct: 46 TTNPQAIVLLIDTSSSMSD------GKLTEVKTAASQFMQRRNL-----EVDQIAVVNFG 94 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ L ++ L I L + G ST G+ A Q+ + Sbjct: 95 SEVATPAPLTNDINILNNAINQLLENG-STPMGEGIDTAQGQL------------QATTL 141 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K I+ TDG + + G + A+ ++ + + + Sbjct: 142 NKNIILFTDGIPDDPNF---AYNSALSVRNAGIKLIAVATGGADTNYLTQITGDRSLVFY 198 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 N AFS I + I Sbjct: 199 A-NSGQFDQAFSQAEAVIYKQLI 220 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 64/162 (39%), Gaps = 22/162 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+++N + E + P+++ + Sbjct: 213 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 270 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 271 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 330 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ++ G E+ N +KI++ + DG + Q+ Sbjct: 331 ETLYADNG------GENRNPEKILIVVGDGTPTFSYAPIQTR 366 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 60/149 (40%), Gaps = 24/149 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 +D+++V+D S SM + +I K ++ L +++ + + V G V +S+ Sbjct: 350 PIDVVLVVDWSGSMNE-----MGRIAEVKKGVDRFLNQIE-GSGIQDSVYMGYVGYSSDG 403 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L S ++ I+ ++ T + GL+ A + + G Sbjct: 404 SNYQNKTCQLG-KFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLSTQNG--------- 453 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 454 --HKKVIVLLTDGVPTYSYHVSKVHTQAD 480 >gi|239987768|ref|ZP_04708432.1| hypothetical protein SrosN1_10718 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 28/212 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + V+D+S SM ++D+ KS+ + +E++ Sbjct: 148 LRVGLATKTAPPTSERPPAALTFVVDISGSMAETG-----RLDLVRKSLTILADELRDDD 202 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++ LVTFS++ E + + ++ + + STN G+K Y + Sbjct: 203 SLS------LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQP-AQSTNVEAGIKLGYEE- 254 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIR 328 ++G R+ +V ++D N E + L + A++ G ++ +G+ Sbjct: 255 -SVEGHREGATNR-------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVG 306 Query: 329 VIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 F+ + + V + F Sbjct: 307 SDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 338 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 75/224 (33%), Gaps = 50/224 (22%) Query: 168 TDARLDMMIVLDVSRSM---ESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D+++ +DVS SM + + T ++++ + + + P+ + GL Sbjct: 83 KSSGIDILLAVDVSGSMQAMDFTLNGKRTNRLEVVKDVMAKFISQ---RPNDS----IGL 135 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F+ + + L ++ LS T + N++ Sbjct: 136 VAFAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDGTAIGSAIGTGVNRL----------- 184 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------VI 330 E + +II+ +TDG N + K L AK VY IG Sbjct: 185 REKKSPSQIIILLTDGINNAGK--VPPLIAAEAAKSFKVKVYTIGAGTRGEAPIPITDAF 242 Query: 331 RSHEFLRACA-------------SPNSFYLVENPHSMYDAFSHI 361 + +RA + ++ + S+ ++ I Sbjct: 243 GRRQLVRARVDIDDKTLSKVAQITGARYFRATDTESLEKVYAEI 286 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 53/267 (19%), Positives = 93/267 (34%), Gaps = 56/267 (20%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLDVSRSMES 186 A YK L FI +V+ P TS+ N+Q + +D+M+ +DVS SM + Sbjct: 44 APKSYKNHLIHAPFILRLFTFVMVVIILARPQTSNSWKNTQVEG-IDIMLAVDVSTSMLA 102 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG------- 239 D +++ A + + + P+ N GL F+ + + Sbjct: 103 E-DLKPNRLEAAKQVASEFIAG---RPNDN----IGLTIFAAEAFTQCPMTTDHAVLLNL 154 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 ++ + T G+ A +++ D + K+I+ +TDG N Sbjct: 155 FHGIKTDMAQRGMIQDGTAVGMGIANAVSRLKDSKAKS-----------KVIILLTDGTN 203 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------VIRSHEFL--------RACA- 340 + D L AK G VY IG+ V ++L + A Sbjct: 204 NA--GDISPLTAAEIAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAA 261 Query: 341 ----SPNSFYLVENPHSMYDAFSHIGK 363 + FY + + D + I K Sbjct: 262 IAGKTDGEFYRATDNKKLEDVYKDIDK 288 >gi|313247257|emb|CBY15545.1| unnamed protein product [Oikopleura dioica] Length = 409 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 39/165 (23%), Positives = 66/165 (40%), Gaps = 19/165 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LD++I+LD SRS++ + +S+ M + I D N + L+ + Sbjct: 78 CESRPLDLVILLDSSRSIDE------KSWLLQKESVERMASVLFPIDDFN--TRISLIRY 129 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L S+ ++ K+ L + TN GL+ A+ + R+ Sbjct: 130 GYSAFLAHRLSEEQSYPMIKEKLFRLEHTYEDQTNVHFGLRKAFTEFSTCPRDRR----- 184 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N KK IV +TDGE K + L AKK ++ + I Sbjct: 185 NQNSKKAIVIITDGEFTEPKIAYEDLKI---AKKNEVEIFILAIG 226 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 72/206 (34%), Gaps = 32/206 (15%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITS---SVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 R K + + S +V+ + S ++ +++ +++D SRSM + D Sbjct: 39 RNKRRISIIKILLLTLGSVLVVISLLSPQKEIEDEEIEVKGMNIYVLIDTSRSMLTE-DV 97 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 +++ + + +++ +K + G + FS+ L + Q I + Sbjct: 98 YPNRLEAGKRVLTNLIQSLKGD-------RVGFIPFSDSAYIQMPLTDDYNITQNYINAI 150 Query: 251 SK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 G T L+ A + N K ++ ++DG + K Sbjct: 151 DTTLISGGGTELYQALELA---------EKSFKEIGSEN--KTVIVISDGGDFDKKS--- 196 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSH 333 + K+ VY+IG+ + Sbjct: 197 ----LDFVKENKIDVYSIGVGTKEGN 218 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 43/273 (15%), Positives = 102/273 (37%), Gaps = 40/273 (14%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI----- 158 ++ + Y + + F + + I + + Sbjct: 164 SQEMQQEDNIQEVVKSKQNNKNSYDLEKGLSLDVKTFQKHFQFNNSQDQTIPIMVSVKTL 223 Query: 159 --TSSVKVNSQ---TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 TS +++ S LD++ V+D S SM KI+ +I +++ + Sbjct: 224 EQTSDMEIQSNLLEGRPNLDLICVIDNSGSM------DGEKIENVKNTILQLIDML---- 273 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 N+ + ++TF++ ++ L + +LQ+ K + G TN T GL+ A++ Sbjct: 274 --NDHDRLSIITFNSYAKQLCGLRKVNKDNKENLQKITKSIQADG-GTNITSGLQTAFSI 330 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGI 327 + R+ N+ + + ++DG++ ++ ++L + ++ +++ G Sbjct: 331 L----QNRKQRNSVSSVF-----LLSDGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGF 381 Query: 328 RVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 ++ A SFY VE + + F Sbjct: 382 GNDHDGPLMQRIAQIKDGSFYYVERNDQVDEFF 414 >gi|311262926|ref|XP_003129419.1| PREDICTED: anthrax toxin receptor 2-like [Sus scrofa] Length = 241 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 73/222 (32%), Gaps = 34/222 (15%) Query: 154 IVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + ++ + D+ VLD S S+ + + I + ++ Sbjct: 22 LALSGPGALVSAQEQPSCSGAFDLYFVLDKSGSVANNW-----------IEIYNFVHQLT 70 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 V+ ++ + FS++ L + + L + T GLK A Sbjct: 71 ERF-VSPQMRLSFIVFSSQATIILPLTGDRGKISEGLDNLKRVSPVGETYIHEGLKLANE 129 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 QI G++ II+ +TDG + L ++ ++ GA VY +G Sbjct: 130 QIEKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGARVYCVG 177 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + R S + V + A I ++ Sbjct: 178 VLDFEQAQLERIADSKEQVFPVTGG---FQALKGIINSSSSR 216 >gi|291444745|ref|ZP_06584135.1| lipoprotein [Streptomyces roseosporus NRRL 15998] gi|291347692|gb|EFE74596.1| lipoprotein [Streptomyces roseosporus NRRL 15998] Length = 531 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 28/212 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + V+D+S SM ++D+ KS+ + +E++ Sbjct: 164 LRVGLATKTAPPTSERPPAALTFVVDISGSMAETG-----RLDLVRKSLTILADELRDDD 218 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++ LVTFS++ E + + ++ + + STN G+K Y + Sbjct: 219 SLS------LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQP-AQSTNVEAGIKLGYEE- 270 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIR 328 ++G R+ +V ++D N E + L + A++ G ++ +G+ Sbjct: 271 -SVEGHREGATNR-------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVG 322 Query: 329 VIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 F+ + + V + F Sbjct: 323 SDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 354 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 72/193 (37%), Gaps = 25/193 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++I+ D S + T + N ++ V+ +P + V+ GL+ +S+ Sbjct: 31 PVLDIIILFDTSG-------GNDTVFEQQK---NWTIKIVRDLPIHEDAVRVGLIQYSDA 80 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L + + ++ L+ G T + L A ++F+ + Sbjct: 81 AKTEFNLSRYSERNDIITHLETLTFMPGEDTRTGVALDKADEEMFN------YIGGARLK 134 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSF 345 ++I+ TDG ++ +++G +Y I + I E L ++ Sbjct: 135 ATRLIILFTDGLSMDKPT-----KSAKTLRRKGVKIYTISVNSIGFVPEMLGIVGDADNV 189 Query: 346 YLVENPHSMYDAF 358 + + + + Sbjct: 190 FGPTDEDRIEERL 202 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 71/198 (35%), Gaps = 24/198 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S +D++ V+D S S+ +D+ K + ++++V+ + + GL+ Sbjct: 690 SSSVQCPMDILFVVDSSGSIARTYDTQ--KDTHFQDYLTQLIKKVEP----SRSHRVGLI 743 Query: 225 TFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 F+ + + + I+ + +T L+ + + R+H Sbjct: 744 QFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGTTYIGAALEL---SLILLDSRRKHT 800 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 T ++ ++DG + Q L + +YAI + + + ++L Sbjct: 801 ET-------TVILISDGFSQDDSTQQAKLLR----QLPNVKMYAISLNKLTNTKYLTDIV 849 Query: 341 SPNSFYLVENPHSMYDAF 358 + + ++ F Sbjct: 850 GDRKNLFINDESHWFEEF 867 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 72/199 (36%), Gaps = 33/199 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD S SM TK+ ++ +L++++ N ++ FSN+I Sbjct: 308 VVFVLDSSASM------VGTKLRQTKDALFTILQDLRPEDHFN------IIGFSNRIKVW 355 Query: 231 --EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + + + I +S G TN L+ + + D + D Sbjct: 356 QHDQLVPVTPNNIRDAKVYIHNMSPSG-GTNINGALQISTKILND------YIAQNDIEA 408 Query: 288 KKI--IVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRAC 339 + + I+F+TDG E + + EA + ++ IGI ++ L C Sbjct: 409 RSVSLIIFLTDGRPTFGEIEPAKIINNTKEAIRNKFCLFTIGIGNDVDYKLLERLALENC 468 Query: 340 ASPNSFYLVENPHSMYDAF 358 E+ F Sbjct: 469 GMMRRVREEEDAAEQLKGF 487 >gi|260575971|ref|ZP_05843966.1| von Willebrand factor type A [Rhodobacter sp. SW2] gi|259021897|gb|EEW25198.1| von Willebrand factor type A [Rhodobacter sp. SW2] Length = 670 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 80/226 (35%), Gaps = 24/226 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNS 166 + + ++ + + Y+ +A + P + PW +R + + + + Sbjct: 251 LPEAEAVRVEEMVNYFPYAYAAPVAGEAPFRTAVTVMQTPWNPGTRLVRIGLQGRL-PAL 309 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L+++ ++D S SME K+ + +S+ ML E++ V V +G Sbjct: 310 DDRPPLNLVFLIDTSGSME-----DANKLPLLKQSLRLMLAELRPEDQVAIVAYAGSAG- 363 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +I E + + L G ST GL AY M G + Sbjct: 364 --EILPPTKAE-NADEILAALDRLGA-GGSTAGAEGLALAYQVARKMAGAGEVSR----- 414 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ TDG+ N+ + + Y + + G + +G Sbjct: 415 ----VLLATDGDFNVGIDDPEGLTKYIAKQRDTGVYLSVLGFGRGN 456 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/307 (12%), Positives = 92/307 (29%), Gaps = 48/307 (15%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 G IL + +P++F V + + + K L + +++ + + Sbjct: 1 MKLKKQQSGHAAILFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSAYGEED 60 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 K L + + ++ DD Y Sbjct: 61 EVSTQTGKDYVAHYLHDMSSLVDIKVEKLECSELPECTADD----------NDRPFVEYQ 110 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMP-----ITSSVKV-NSQTDARLDMMIVLDVSR 182 +S +++ I W+ + V +T S K Q+ +D+ +LD S Sbjct: 111 VSGRTKH---------ISWFPGNDVTVGFGESFDVTGSSKARKFQSSQPMDITFILDFSG 161 Query: 183 SMESFFD------------------SSITKIDMAIKSINAMLEEVKLIPDVNN--VVQSG 222 SM ++ S +++ + + +E+++ + + Sbjct: 162 SMNYDWEGHAPSYMEEEIPKVPGRYSPPSRLSDLKYVVQMVTDELQVYNNSTAGPKHRVA 221 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + ++ + G ++ ++I + G T K N+ F ++G Sbjct: 222 MTGYNRRTVNES--SNGKFVIRDQRITKYNSDGYDAGDTFYPKKTINKQFMVKGAAARVP 279 Query: 282 TEDANYK 288 D + Sbjct: 280 NGDEKAE 286 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 69/207 (33%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 387 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 440 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ + E F ++ + + Sbjct: 441 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRTEFSFTDYNTKENVLAVLANIRYMS 499 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F D+ K +V +TDG+ Sbjct: 500 GGTATGDAIAFTVRNVFGP--------IRDSPNKNFLVIVTDGQ-----SYDDVRGPAAA 546 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + LR AS Sbjct: 547 AHDAGITIFSVGVAWAPLDD-LRDMAS 572 >gi|126631586|gb|AAI34052.1| Col6a2 protein [Danio rerio] Length = 310 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 61/203 (30%), Gaps = 17/203 (8%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + S +K +D+ +D S ++ + + ++ E Sbjct: 15 HAQGQTPAGSSCSPIKSTEC---PIDLYFAIDTSETIALQENPPGSLVESIKDFTIRFAE 71 Query: 208 EVKLIP---DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +++ + VN G + FS E F + ++ + G T+ + Sbjct: 72 KLQNVNYRGSVNITWAIGGLHFSQLQEFFSTITT-KEKFISNLRPIRYLGRGTHIDCAIT 130 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Q+ + K+ V +TDG + A+ ++A Sbjct: 131 NMTKQLV--------RFPSRPDAKRFAVVITDGHVTANPCGGI-KVAAERARDENIRIFA 181 Query: 325 IGIRVIRSHEFLRACA-SPNSFY 346 + LR A SP Y Sbjct: 182 VASSRNLEETGLREIANSPAGVY 204 >gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] Length = 802 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/171 (21%), Positives = 67/171 (39%), Gaps = 32/171 (18%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +T D++ VLD S SME K A +++ +L + + L Sbjct: 289 QAPETAIPKDVIFVLDRSGSMEG------VKFQQAKQALEYVLSRLNPQD------RFNL 336 Query: 224 VTFSNKIEEFFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++FSN++E F GV + Q+ + LS G TN L A + Sbjct: 337 LSFSNQVEVFAPEMEGVEAIPQAQKWVAGLSAAG-GTNIHRALLDAIQFV---------- 385 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRV 329 + ++F+TDG D++ + + A++ RG ++ G+ Sbjct: 386 ---RSQRPTYLIFLTDGLPTVGITDREQI-LDDFARQAPRGLRLFVFGVGY 432 >gi|221104611|ref|XP_002170515.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Hydra magnipapillata] Length = 1137 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 77/211 (36%), Gaps = 22/211 (10%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + +S ++++ I+LD S S+ S T + + + +E + P Sbjct: 283 VCNSEACEPVCAKQMEVAILLDASTSVTS------TNWKKTVSFVQSFTKEFVMGPTG-- 334 Query: 218 VVQSGLVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V+ ++ F+N+ I W RK+ + T + +K A +IF Sbjct: 335 -VRFAVIDFANEAQIQINILDPKYWSQEAFSRKVGSIEYSRGRTKTDLAIKLAREKIFCD 393 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRS 332 + K+++ +TDG++ ++ + + +G+ I Sbjct: 394 KCNL------RRTVPKLLIVLTDGQSTDPDLTEKEANLIKT--QSKVSIITMGVGDKIDK 445 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +E ++ + +L+ + D + I K Sbjct: 446 NELTSMASNSDYVFLLNGYKYINDKINQIIK 476 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 69/191 (36%), Gaps = 21/191 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++++ I+LD S S+ ++ I + ++ K+ P V+ L+ F Sbjct: 714 ECTKQIEIGILLDASTSVT------LSNWKKTIDFVQDFSKQFKMGPTG---VRFALIDF 764 Query: 227 SNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S+ + W K+ + T + L+ A +F + +H Sbjct: 765 SDDAILQISISDPRFWDQETFGEKVSSIEYSQGKTRTDLALEVARKHVFCNECGLRHNT- 823 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++++ +TDG++ K Q + + ++G+ E L++ A+ Sbjct: 824 -----PRLLIVLTDGQSTFPKLTQFEAQLIKA--ENNLTIISVGVSDQVDIEELKSLATD 876 Query: 343 NSFYLVENPHS 353 + N Sbjct: 877 RDHVFLLNEQK 887 >gi|326666584|ref|XP_687953.4| PREDICTED: collagen alpha-1(VII) chain [Danio rerio] Length = 2001 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 64/177 (36%), Gaps = 20/177 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T D++ ++D S S+ T + I ++ + ++ G+ +S Sbjct: 34 TAMPADIVFLVDDSWSVGP------TSFQQIKEFIADIIRAFQGNVFGQEGIRFGVTVYS 87 Query: 228 NKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + R ++ + G S+ + L++ +F +R+ Sbjct: 88 DLPRMRIALTDYSTLDEVLRAVEDVPYEGGSSRTGLALEFLEESVFSPSIIRESA----- 142 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KI V +T+G++ +D G ++A+G+R E + P Sbjct: 143 --PKIAVLITNGQSDDQVDD-----PAKAVADSGISLFAVGVRNADQSELKKIVTEP 192 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/300 (12%), Positives = 87/300 (29%), Gaps = 30/300 (10%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 G + K ++ + + + + A Sbjct: 865 EGPVTTITKRILGEPRLVPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASAT 924 Query: 139 LKFCTFIPWYTN-SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + + + T + ++ + + D++ ++D S S+ + Sbjct: 925 GQTASALIRMTTVNDQSILSAETPPPGPVCGRVKADIVFLVDESWSIGTNNFGK------ 978 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + + I Q +V +S+ +IE F + + R ++ + G Sbjct: 979 LKDFLFRTVTYFPSIGPKG--TQIAVVHYSDQPRIEFNFNTHKDRNSVLRALREVRYGGG 1036 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T + G+ Y ++F Q ++V +TDG + A Sbjct: 1037 NTKTGRGISYVLREMF------QESLGMRQEAPHVLVLLTDG-----RAQDDVEPPSRIA 1085 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSF---YLVENPHSMYDAFSHIGKDIVTKRIWY 372 G V IGI E R ASP ++ + + + I ++ + Sbjct: 1086 HALGVSVLVIGIAHADMEEV-RTIASPTTYKNIFYASDFDDL----PSIEREFIGSICSE 1140 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/253 (15%), Positives = 84/253 (33%), Gaps = 16/253 (6%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G I+TA+ + + GM ++ +H ++T L++ D + V I + Sbjct: 8 FISDRSGNFGIMTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSG 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + T + ++ ++ +D+ +S Sbjct: 65 AVAAAMAMNSNGTVSLGKTDARNIFMSQMSG----ELAEVQVDLGIDVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFD 189 + +P F + + I+ + QT A +D I+LD + SM Sbjct: 121 FTAT--VPTTFMQILGRD------SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATP 172 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 ++K++ A + K I + G+ + + + + +R Sbjct: 173 DDVSKLEAKAGCAFACHQMDKTINNYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSD 232 Query: 250 LSKFGVSTNSTPG 262 + GV T T Sbjct: 233 QFRMGVYTFGTKA 245 >gi|94732541|emb|CAK05117.1| novel protein similar to vertebrate collagen family [Danio rerio] Length = 1721 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 64/177 (36%), Gaps = 20/177 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T D++ ++D S S+ T + I ++ + ++ G+ +S Sbjct: 3 TAMPADIVFLVDDSWSVGP------TSFQQIKEFIADIIRAFQGNVFGQEGIRFGVTVYS 56 Query: 228 NKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + + R ++ + G S+ + L++ +F +R+ Sbjct: 57 DLPRMRIALTDYSTLDEVLRAVEDVPYEGGSSRTGLALEFLEESVFSPSIIRESA----- 111 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KI V +T+G++ +D G ++A+G+R E + P Sbjct: 112 --PKIAVLITNGQSDDQVDD-----PAKAVADSGISLFAVGVRNADQSELKKIVTEP 161 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 38/300 (12%), Positives = 87/300 (29%), Gaps = 30/300 (10%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 G + K ++ + + + + A Sbjct: 754 EGPVTTITKRILGEPRLVPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASAT 813 Query: 139 LKFCTFIPWYTN-SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + + + T + ++ + + D++ ++D S S+ + Sbjct: 814 GQTASALIRMTTVNDQSILSAETPPPGPVCGRVKADIVFLVDESWSIGTNNFGK------ 867 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + + I Q +V +S+ +IE F + + R ++ + G Sbjct: 868 LKDFLFRTVTYFPSIGPKG--TQIAVVHYSDQPRIEFNFNTHKDRNSVLRALREVRYGGG 925 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +T + G+ Y ++F Q ++V +TDG + A Sbjct: 926 NTKTGRGISYVLREMF------QESLGMRQEAPHVLVLLTDG-----RAQDDVEPPSRIA 974 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSF---YLVENPHSMYDAFSHIGKDIVTKRIWY 372 G V IGI E R ASP ++ + + + I ++ + Sbjct: 975 HALGVSVLVIGIAHADMEEV-RTIASPTTYKNIFYASDFDDL----PSIEREFIGSICSE 1029 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 64/162 (39%), Gaps = 22/162 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+++N + E + P+++ + Sbjct: 232 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 289 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 290 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 349 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ++ G E+ N +KI++ + DG + Q+ Sbjct: 350 ETLYADNG------GENRNPEKILIVVGDGTPTFSYAPIQTR 385 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 52/267 (19%), Positives = 93/267 (34%), Gaps = 56/267 (20%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDMMIVLDVSRSMES 186 A YK L FI +++ P TS+ N+Q + +D+M+ +DVS SM + Sbjct: 44 APKSYKNHLIHAPFILRLFTFVMVIIILARPQTSNSWKNTQVEG-IDIMLAVDVSTSMLA 102 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG------- 239 D +++ A + + + P+ N GL F+ + + Sbjct: 103 E-DLKPNRLEAAKQVASEFIAG---RPNDN----IGLTIFAAEAFTQCPMTTDHAVLLNL 154 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 ++ + T G+ A +++ D + K+I+ +TDG N Sbjct: 155 FHGIKTDMAQRGMIQDGTAVGMGIANAVSRLKDSKAKS-----------KVIILLTDGTN 203 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------VIRSHEFL--------RACA- 340 + D L AK G VY IG+ V ++L + A Sbjct: 204 NA--GDISPLTAAEIAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAA 261 Query: 341 ----SPNSFYLVENPHSMYDAFSHIGK 363 + FY + + D + I K Sbjct: 262 IAGKTDGEFYRATDNKKLEDVYKDIDK 288 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 73/221 (33%), Gaps = 28/221 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 I S + D+ +V+D+S SM+ + + ML ++ Sbjct: 13 IPLSYAQAPEAAKPSDVRLVIDISGSMKKNDPQN------LRRPALDMLVQLLPKGSKAG 66 Query: 218 VVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G + +WG S +IK +++F TN L+ A Sbjct: 67 IWTFGQYVNMLVPHKPVDAQWGRTASAASSEIKSIAQF---TNIGAALEKA-------AY 116 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR--------GAIVYAIGI 327 + D +Y+ ++ +TDG ++++ + G ++ I + Sbjct: 117 DHKQQMKADQDYQTHVILLTDGMVDIDRDNRLNKKERQRILNNVLPMYQQSGITLHTIAL 176 Query: 328 RVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + L A A+ + +N + + F + V Sbjct: 177 SDNADKKLLNKLALATDGKVSVAKNAEELMNVFLRVFNQAV 217 >gi|260801245|ref|XP_002595506.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] gi|229280753|gb|EEN51518.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] Length = 1641 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 77/206 (37%), Gaps = 35/206 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++++D S+S+ ++ K +L+ N + G+ + + + Sbjct: 587 IILLVDGSKSVTYLNFPNVLKF---------ILKLAAGFEIGPNAAKLGVYQYGSDVRTE 637 Query: 234 FLL-EWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + ++ I+Y++++G T + L+ Y + K Sbjct: 638 FPIGQYNTREDVLNAVLNIQYMNQWG--TFTGKALEEVYKTF-----------PAGDDAK 684 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 K+++ +TDG+ +++ + K GA++ A+G+ R E +S + + Sbjct: 685 KVVIIITDGK---AMDEEVLRKASQDVKADGAMICAVGVGGFRLKELSLLASSQDLVFTA 741 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + M +D V + + Sbjct: 742 TDFDKMDAI-----RDTVLDAVCEGQ 762 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 44/123 (35%), Gaps = 22/123 (17%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G T + A ++ E +I++ +TDGEN + + L Sbjct: 32 GRDTAIGDAIGLAVKKL-----------RERPEGSRILILLTDGENNA--GALKPLQAAE 78 Query: 314 EAKKRGAIVYAIGIRVIRS-------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKD 364 AK+ +Y IG+ L+ A + +++ N ++ + + HI K Sbjct: 79 LAKQYDIRIYTIGVGGKGGMFSRGLNETELKKIAQLTNGAYFPATNLGALNNVYEHIDKT 138 Query: 365 IVT 367 + Sbjct: 139 LQK 141 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 65/191 (34%), Gaps = 32/191 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 + +VLD S SM S ++ ++ L I + G+VTF + E Sbjct: 309 VCLVLDKSGSMSSE-----DRLFRMNQAAELFL-----IQIIEKGSLVGMVTFDSVAEIR 358 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + T+ GLK + I Q Sbjct: 359 NNLTKITDDNVYENITANLPQEANGGTSICRGLKAGFQAIIQSQQSTSGSE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ +TDGE+ C E K+ G I++ I + + E L + FY Sbjct: 410 IILLTDGEDNEIHS-------CIEEVKQSGVIIHTIALGPSAAKELETLSDMTGGHRFYA 462 Query: 348 VENPHSMYDAF 358 ++ + + +AF Sbjct: 463 NKDINGLTNAF 473 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 72/195 (36%), Gaps = 26/195 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + + LD++ V+D S SM S +ID A ++ A+ D + G+V Sbjct: 629 DREEEVCLDIVYVIDTSGSM------SGDRIDAAKRAAIALAHFSVKAGD-----RVGIV 677 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F+ K E + V + K+ L G +T+ ++ G Sbjct: 678 GFNTKAEIVVDITSDVEEIITKVMSLKP-GGATDIGDAIRV---------GTELFRRCGR 727 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYY--CNEAKKRGAIVYAIGIRVI-RSHEFLRACA- 340 + ++ +TDG + D ++ A + G + IGI++ + A Sbjct: 728 PDRDWHMILLTDGVPTKGEPDPETKALSEATAASRMGVTISTIGIKLPEEGIRLIEHIAG 787 Query: 341 -SPNSFYLVENPHSM 354 S + + +P + Sbjct: 788 ISGGRSHHITDPEEL 802 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 61/153 (39%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+++N + E + P+++ + Sbjct: 390 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 447 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 448 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 507 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 508 ETLYADNG------GENRNPEKILIVVGDGTPT 534 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 78/210 (37%), Gaps = 25/210 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV------NN 217 V + + D+++V+D S SM +++ +A ++ +L+ + V + Sbjct: 203 VEAASPQPKDVILVVDYSGSMGG------SRLPIAKEAAKTVLDTLNPRDRVAFLAFESG 256 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY--AYNQIFDMQG 275 V + + + K E+ F + I L KF ++ G Y A+N FD+ Sbjct: 257 VRRVKVTSGDAKDEKCFESSLAKAS-PVNIDILKKFLDGEYASGGTMYAIAFNAAFDI-L 314 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ-QSLYYCNEAKKRGAIVYAIGIR---VIR 331 + + +I+FMTDG ++ N+ A + G+ Sbjct: 315 DKYYKEKNTTRRP-VILFMTDGAPNDDPGTILNTVKTRNQGLSTKADILTFGMGGGISPA 373 Query: 332 SHEFLRACAS----PNSFYLVENPHSMYDA 357 + L++ A + + V ++ D Sbjct: 374 GVDLLQSLAEQTLDGGARFEVSLTTALRDV 403 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 70/201 (34%), Gaps = 25/201 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++++ + + ++DVS SM+ K+D+ S+ ++ ++ I V VV +G Sbjct: 233 EIDTDNLPATNFVFLIDVSGSMDWDG-----KLDLVKSSMKLLVNNLRPIDRVAIVVYAG 287 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S + + L+ G ST G+ AY +I + N Sbjct: 288 AAG----QVLPSTPGSEKSKILESLNGLTA-GGSTAGGEGIVLAY-KIAKENLIEGGNNR 341 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLR 337 I+ TDG+ N+ + +K G + +G + + L Sbjct: 342 --------IILCTDGDFNVGVSSNDGLEKLIENERKSGVFLSILGYGMGNYKDDKMQTLA 393 Query: 338 ACASPNSFYLVENPHSMYDAF 358 + N Y ++N Sbjct: 394 QAGNGNHAY-IDNMQEANKVL 413 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 61/153 (39%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+++N + E + P+++ + Sbjct: 370 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 427 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 428 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 487 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 488 ETLYADNG------GENRNPEKILIVVGDGTPT 514 >gi|170744040|ref|YP_001772695.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168198314|gb|ACA20261.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 654 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 45/309 (14%), Positives = 108/309 (34%), Gaps = 35/309 (11%) Query: 52 SMIDRSLVHAATQIMNEGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + R+ A + +E G +R +GG + R +S + G V D + Sbjct: 147 AAMARAAGETA-PVPSEPVGRDRFANAPEGGFRITREAPVSTVSLGVDTASYGIVRDALN 205 Query: 110 ---IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT---FIPWYTNSRHIVMPITSSVK 163 + ++ + Y+ A + P + PW + + + I Sbjct: 206 RNHLPPPAAVRTEELINYFPYAYPAPASPDAPFRVTASVFPSPWAEGRKLLHIGIRGYA- 264 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V +++ ++D S SM ++ ++ + +S+ +L + + L Sbjct: 265 VAPAERPPANLVFLVDTSGSM-----AAPNRLPLVKQSLAMLLTTLDARD------RVAL 313 Query: 224 VTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V ++ ++ + I+ L G ST G++ AY +H + Sbjct: 314 VAYAGEVGTVLEPTPAGEAGRILAAIETLQAHG-STAGGEGIRQAY------ALAARHFD 366 Query: 282 TEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRAC 339 + N ++ TDG+ N+ + + +++G + +G + + ++A Sbjct: 367 PKAVNR---VILATDGDFNVGITGRDELTGFVARERRKGIFLSVLGFGMGNLNDALMQAL 423 Query: 340 ASPNSFYLV 348 A + Sbjct: 424 AKDGNGVAA 432 >gi|148708138|gb|EDL40085.1| mCG12867, isoform CRA_d [Mus musculus] Length = 2281 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 42/311 (13%), Positives = 107/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 1517 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 1571 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 1572 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1631 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + E V + + + ++ GLV +++ Sbjct: 1632 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASEIVDTVYEDGDSIRVGLVQYNS 1682 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1683 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDE 1738 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1739 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1793 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1794 FRVGSVQELSE 1804 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 79/213 (37%), Gaps = 30/213 (14%) Query: 154 IVMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + ++ ++Q + D++ ++D S + + + + + ++ ++E Sbjct: 13 FSLLLSGIATTHAQQHGDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVE 66 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + + LV + FLL + I +S G S + GL+Y Sbjct: 67 SLAVGDND---FHFALVRLNGNPHTEFLLNTYHSKQEVLSHIVNMSYIGGSNQTGKGLEY 123 Query: 266 A-YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 ++ + + G R ++I+ +TDG+ ED +L K V+A Sbjct: 124 VIHSHLTEASGSRA-----ADGVPQVIIVLTDGQ----SEDGFALPSAE-LKSADVNVFA 173 Query: 325 IGIRVIRSHEFLRACASP--NSFYLVENPHSMY 355 +G+ + P + +EN S++ Sbjct: 174 VGVEGADERALGEVASEPLSMHVFNLENVTSLH 206 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 85/243 (34%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P K T I T+ + + ++ + D+ Sbjct: 1375 LVKISLSPEYVYSVSTFRELPRLEQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADI 1434 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +++ I ++ ++ + + P V+ G+V FSN + F Sbjct: 1435 VFLIDSSDAVKPDG------IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEF 1485 Query: 235 LLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 1486 YLKTHKSQSSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 1542 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 1543 LFL------GGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREF 1596 Query: 352 HSM 354 + Sbjct: 1597 REL 1599 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 82/209 (39%), Gaps = 39/209 (18%) Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 VS SM++ D T+++ + +S +L+ + +G++TF + L Sbjct: 96 VSGSMQA-TDYQPTRLESSKRSAEILLKSLDPKD------YAGIITFESGATSAAYLSPD 148 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + RK++ + +T GL + M N KK+++ ++DG N Sbjct: 149 KDRVIRKLQAIEPKEGATAIGDGLALGIDMAESM-----------PNRKKVVILLSDGVN 197 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----------------VIRSHEFLR--ACA 340 + + A+++G V+ IG+ L+ A + Sbjct: 198 NAGVIHPE--QAAGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAAS 255 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + ++ + ++ + +S++ K+IV ++ Sbjct: 256 TNGQYFKSVDDRTLSEIYSNLNKEIVREK 284 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 87/223 (39%), Gaps = 27/223 (12%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N +VM + SV+V+ Q ++++V+D S SM S I A +++ L + Sbjct: 322 NMYSLVMLMPPSVEVSEQHLIARELILVIDTSGSM------SGQSITQAKQALQFALAGL 375 Query: 210 KLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLK 264 + I N ++ F++ + L ++ R I+ L G + + L+ Sbjct: 376 RDIDSFN------IIEFNSDVTMLSATPLSANSRNIGKANRFIQSLDADGGTEMRS-ALQ 428 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + D + + ++FMTDG + E Q + ++ ++ Sbjct: 429 TA---LVDSVQQDSDQTDAHSEMLRQVIFMTDGAVGNEHELYQLIN--DQLGDS--RLFT 481 Query: 325 IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDI 365 +GI + +F+R A+ +F + N + + I Sbjct: 482 VGIGSAPNSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNKI 524 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 66/190 (34%), Gaps = 23/190 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++LD S SM + D+A K + ++ + L F++ Sbjct: 82 VALLLDASYSMREGG-----RFDIARKVLLDFIDRRPKD-------RIALEVFADYAYLA 129 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L+ + L V T Y +F G R E+ ++++ Sbjct: 130 APMSYEKKGLKTILAALEPGVVGGRDT----ALYEALFL--GARLF-KKEEGRSNRVMIL 182 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENP 351 +TDG + T + E K+ VY +G+ L A + FY P Sbjct: 183 LTDG--IDTVGNIPLEAAIRELKRAHIRVYTVGVGDDFRRGVLEKIARSTGGRFYDARYP 240 Query: 352 HSMYDAFSHI 361 ++ + + I Sbjct: 241 EALANIYRRI 250 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 80/214 (37%), Gaps = 43/214 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LDVS SM + + +A + I V+ P + GLV F + Sbjct: 92 IVIALDVSPSMGAMDIPGRQRFQVAREVIRGF---VRSYPHMA----VGLVLFGKEALLE 144 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 V + +++ + F + + G+ + H + +A++ + +V Sbjct: 145 VPPTIDVEYFLERLEAVRLFSLGDGTALGM--------GVGTSLLHLSRVNASF-RAVVI 195 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---------------------- 331 +TDG+N + + ++ A++ G ++ +G+ R Sbjct: 196 LTDGKNTTGEILPETA--AEMARELGIPLFTVGVGSDRPVSLDVIDPSTGTRYAGVLEEG 253 Query: 332 -SHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 E LR A S F+ P S++ F +IG Sbjct: 254 YDEETLRRIAEISGGQFFSGYTPTSLHRIFQYIG 287 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 56/173 (32%), Gaps = 6/173 (3%) Query: 9 RNFF----YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 RN KG +L ++ L I+ + I+++H KT L + +D + + A Sbjct: 3 RNMHIAPKRTQKGITLVLISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVV 62 Query: 65 IMNEGNGNNRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP 122 + GN + + + + + F + F D V + S Sbjct: 63 VDENGNVSAAETAAKATLSSISASDGNAELVFTDSNTAVTFSTDRATFVSAASFTPPASG 122 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + + Y A++ + N + S+ + + + M Sbjct: 123 EYDIYVRVAVTEIGLTQYLSDVFGINKNVSASAVAGRSAAIAYTCNISPIAMC 175 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 77/236 (32%), Gaps = 44/236 (18%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + P + + + + +VLDVS SM + ++I +A ++ Sbjct: 11 LVGLLAMSVMTGPAPALADGEAPVEPP-KVELVLDVSGSMRATDIDGRSRISVAQQA--- 66 Query: 205 MLEEVKLIPDVNNV-VQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSK 252 E V +PD + ++ T+ + +I ++ + + L Sbjct: 67 FNEVVDALPDETQLGIRVLGATYPGENKERGCQDTQQIVPVGPVD--RVQAKAAVATLRP 124 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G T L+ A + R+ IV +TDGE+ D C Sbjct: 125 TGF-TPVGLALRSAAQDLGTGSTARR------------IVLITDGEDTCAPPD-----PC 166 Query: 313 NEAKK-----RGAIVYAIGIRV---IRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 A++ +V +G+ +R A A+ ++ ++ + Sbjct: 167 EVARELAAQGTKLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAAQSADELTGRIKQ 222 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 71/205 (34%), Gaps = 39/205 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEE 232 + +VLDVS SM + ++I +A ++ E V +PD + ++ T+ K ++ Sbjct: 42 VELVLDVSGSMRARDIDGRSRISVAQQA---FNEVVDALPDETQLGIRVLGATYRGKDKK 98 Query: 233 FFLLEW---------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L+ + + + L G T L+ A + R+ Sbjct: 99 QGCLDTQQIVPVGPVDRTQAKAAVAGLRPTGF-TPVGLALRSAAQDLGTGSTARR----- 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI-----VYAIGIRVIR--SHEFL 336 IV +TDGE+ D C A++ A V +G+ + L Sbjct: 153 -------IVLITDGEDTCAPPD-----PCEVARELAAQGTRLVVDTLGLAPDEKVRKQLL 200 Query: 337 RAC-ASPNSFYLVENPHSMYDAFSH 360 A+ ++ ++ + Sbjct: 201 CIAGATGGTYTAAQSADELTGRIKQ 225 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 71/205 (34%), Gaps = 39/205 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEE 232 + +VLDVS SM + ++I +A ++ E V +PD + ++ T+ K ++ Sbjct: 42 VELVLDVSGSMRARDIDGRSRISVAQQA---FNEVVDALPDETQLGIRVLGATYRGKDKK 98 Query: 233 FFLLEW---------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L+ + + + L G T L+ A + R+ Sbjct: 99 QGCLDTQQIVPVGPVDRTQAKAAVAGLRPTGF-TPVGLALRSAAQDLGTGSTARR----- 152 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI-----VYAIGIRVIR--SHEFL 336 IV +TDGE+ D C A++ A V +G+ + L Sbjct: 153 -------IVLITDGEDTCAPPD-----PCEVARELAAQGTRLVVDTLGLAPDEKVRKQLL 200 Query: 337 RAC-ASPNSFYLVENPHSMYDAFSH 360 A+ ++ ++ + Sbjct: 201 CIAGATGGTYTAAQSADELTGRIKQ 225 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 67/180 (37%), Gaps = 21/180 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + + +++D S SM + ++K+ +A ++ LE ++ + +++ Sbjct: 397 KRGKNEVPAISINLIIDKSGSMSAE-GGGVSKLTLAKEAAMKALENLREVDEIS------ 449 Query: 223 LVTFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ F + +E L+ G +++ + T+ P L+ YN +H Sbjct: 450 VIAFDDTYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNMQMQSSAKIKHT- 508 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +TDG++ ++ +L + + + + L AS Sbjct: 509 ----------ILLTDGQDGYGLDNYATLL--QNFIDNNITLSTVAVGEGANAGLLNQLAS 556 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 68/175 (38%), Gaps = 27/175 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +D+S+SM + D +++ + ++ ++ ++ + Sbjct: 79 GTKMETVKRQGIDIVFAVDISKSMLAE-DIKPNRLEKTKQLVSQIINQLGND-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V ++ + S + ++ ++ T +K A + Sbjct: 131 VGIVGYAGSAYPILPMTTDYSIAKMYLQSMNTNMVSSQGTAFNDAIKLAVDYF------- 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + K+I+ ++DGE+ + + AK++G + IG+ + Sbjct: 184 -----DVKDTSKLIILVSDGEDHGEGASEAI----DLAKEKGVRILTIGVGTEKG 229 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 42/230 (18%), Positives = 81/230 (35%), Gaps = 46/230 (20%) Query: 173 DMMIVLDVSRSMESFFDSSIT------KIDMAIKSINAMLEEV-------KLIPDVNNVV 219 D+++V+DVS SM+ D T +I +A S + ++ + +VV Sbjct: 402 DVVLVIDVSGSMDWDVDGKQTTDNTKKRITIAKDSAKQFVNQLFANNEDGSKSNNRVSVV 461 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++N I + I +S T+ + A + ++ Sbjct: 462 IFSSSGYTNGILCSLKNVDNKQTVIDAIDGISNNPTGGTDYDNAMTMAEQVLETVK---- 517 Query: 279 HCNTEDANYKKIIVFMTDG--ENLST--------KEDQQSLYYCNEAKKR-GAIVYAIGI 327 D K ++FM+DG EN + ++ +E K GA +Y + Sbjct: 518 -----DTTRNKAVLFMSDGAPENGYNGKTGYDIYPDAFKAHEKSSEIKNNYGATIYTVSF 572 Query: 328 RVIRSH----------EFLR-ACASPNS-FYLVENPHSMYDAFSHIGKDI 365 + S + LR AS + + + + +AF++I I Sbjct: 573 GLKGSQYKELTEDRCRQILRDYMASNENCYKNANSKEDLENAFTNIATAI 622 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 73/216 (33%), Gaps = 46/216 (21%) Query: 174 MMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +M+ +D+S SME S+ ++ M ++ +E + GL+ F++ Sbjct: 88 IMLAVDLSGSMEIADMQLEGRSVNRLTMVKHVLSDFIER-------REGDRLGLILFADT 140 Query: 230 IEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + +++ + L G T + + + D + Sbjct: 141 AYLQTPMTYDRNTVKQMLNESVLGLVGERTAIGDAIALSVKRFRDDEKSN---------- 190 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------SHE------ 334 +++V +TDG+N + + A+ +Y I + Sbjct: 191 -RVLVLLTDGQNTA--GNLPPEQALELAQAYDVTIYPIAVGAEEVVVDSFFGQRRVNPSR 247 Query: 335 -----FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +++ A + ++ + + + + + + K Sbjct: 248 DLDVPLMQSIAKQTGGKYFRARSTNELEEIYQRLDK 283 >gi|47224105|emb|CAG13025.1| unnamed protein product [Tetraodon nigroviridis] Length = 983 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 81/218 (37%), Gaps = 35/218 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S+ MLE + VN +V F Sbjct: 72 GAASPKDMLILVDASGSVSGLT------LKLIRTSVTEMLETLSDDDYVN------VVYF 119 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + ++++ + L+ ++ ++ G+ TN T GL++A+ Q+ R + Sbjct: 120 NTQVKKTACFDHLVQANVRNKKLLKDAVQNITAKGI-TNYTKGLEFAFEQLSVTNVSRAN 178 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFL 336 CN KII+ TDG + + + ++ + + Sbjct: 179 CN-------KIIMLFTDGGEERAQAILEKYNA-----DKKVRIFTFSVGQHNYDKGPIQW 226 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 AC++ FY + + ++ + + DK Sbjct: 227 MACSNKGYFYEIPSIGAIRINTQEYLDVLGRPMVLADK 264 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 60/166 (36%), Gaps = 26/166 (15%) Query: 178 LDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +DVS SM+ + ++++ + + + LE K + GL+ F + Sbjct: 2 VDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGTQAFVQ 54 Query: 234 FLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + ++ + + G +T + A ++ + + Sbjct: 55 APLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPATSRAL 103 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 V +TDG N + + D + A + G +Y IGI + L+ Sbjct: 104 VLVTDGANNAGQIDPITA--ARLAAEEGVKIYPIGIGSDPDKDALQ 147 >gi|291569213|dbj|BAI91485.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 412 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 73/200 (36%), Gaps = 26/200 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P +++ + S++ + +++ ++LD S SM + ++ ++ + Sbjct: 16 PNQESNQRQLSISVSAIPDPFEGQVPMNLCLILDHSGSM------NGQPLETVKQAAKEL 69 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ + + + +V F ++ + + +++KI L G T GL Sbjct: 70 IDRLNVGD------RISVVAFDHRAKVLVPNQDIADPDGIKKKIDGLRCSG-GTAIDEGL 122 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K ++ + R +TDGEN ++++ L A + + Sbjct: 123 KLGIEELGKGKQDRISQG----------FLLTDGENEH-GDNKRCLKLAKLATEYKLTIN 171 Query: 324 AIGIRVIRSHEFLRACASPN 343 ++G + + L A Sbjct: 172 SLGFGNDWNQDILEKIADAG 191 >gi|301622626|ref|XP_002940637.1| PREDICTED: integrin alpha-L-like [Xenopus (Silurana) tropicalis] Length = 1031 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 61/164 (37%), Gaps = 22/164 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A +D+ ++D S SM ++++ + + +++ ++ V + F Sbjct: 156 CQKAEVDLCFMVDGSSSMGEI------EVNIVKEFMKNVIKSLE----NETSVHFAAIQF 205 Query: 227 SNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S + F + TN+ ++Y ++IF Sbjct: 206 STHPKTEFTFADFQKDRNPDVLLANYRLLKGFTNTYKAIQYTLDRIFT------EKYGSR 259 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + KK++V + DGE+ + +++ ++A+ Y IG+ Sbjct: 260 PSAKKVLVILADGESTDD-DTTKAIEKADKAR---VSRYIIGVG 299 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 37/299 (12%), Positives = 94/299 (31%), Gaps = 72/299 (24%) Query: 8 IRNFFYNYKGG-MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 + + N G + I+ A LP + ++G +++S + KT L D ++ + Sbjct: 12 LMRLYRNQAGNTLAIVAAAMLP-LAGMVGGALDISRGYLAKTRLQQACDAGVLAGRKVMG 70 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + G ++ + R + N + + +T ++ + + Sbjct: 71 SSGVLSDSVR---------------DEVRKYVSFNYPSGYLGSTLATTDINPTLGSND-- 113 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 I+ IP T + ++ + + + + + +D+++VLD + SM Sbjct: 114 -QIALSLTTAIP----TAVMRLFGRNNMSITASCTAR---NDYSNIDIVLVLDTTGSMAC 165 Query: 187 ------------------------------------FFDSSITKIDMAIKSINAMLEEVK 210 +++++ ++ + ++ Sbjct: 166 KPERNDSDCSTWAGSRYVTQWVAGLGRDATFVPEEMNSGVNVSRMQGLRTALANLQSQMA 225 Query: 211 LI--------PDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGVSTNST 260 I V+ +V FS + F G + R + ++ G NS Sbjct: 226 TIETQFNMTEESKRKRVRWAIVPFSQMVNAGFSQGSAGTTLYSRHSDWFNRTGKYYNSG 284 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 34/117 (29%), Gaps = 32/117 (27%) Query: 286 NYKKIIVFMTDGENLSTKEDQQSLY-------------------------YCNEAKKRGA 320 + ++FMTDG + + C K Sbjct: 422 PVSRYVIFMTDGYMSIGSSNYAAYAQEDYWRRVAAAGASKNDNHYARMLMTCTAIKNMDT 481 Query: 321 IVYAIGIRVIRS-HEFLRACAS------PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 +Y I + L C+S P Y ++ + F IG++I + R+ Sbjct: 482 KIYTISFGAGSTLDSNLINCSSSTNTTNPEFAYKADSSSDLNRVFRDIGENIGSLRL 538 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 26/164 (15%) Query: 174 MMIVLDVSRSM--ESFF--DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ LD+S SM E F ++++D + + + + GLV F ++ Sbjct: 87 IVLALDLSGSMVKEDFVLDGKPLSRLDAVRRVASRFVA-------ARRGDRIGLVIFGDR 139 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + V + I+ G ST + GL A ++ Sbjct: 140 AYVAQPPTFDVGSVAHAIEAAQIGISGRSTAISDGLGLATRRLLQSDATS---------- 189 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 K++V ++DG + T Q+ A G V+ I + Sbjct: 190 -KVVVLLSDG--VDTSGKVQAGDAARLAASHGIRVHTIALGPED 230 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/153 (22%), Positives = 62/153 (40%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + +D++ VLD S SM S TK I+++N M +++ P ++ + Sbjct: 126 IGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNAALIEAVNEMSKDLLSDPSLD--I 183 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 184 RIGMVNFYHNSTAINNHEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 243 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 244 ETLYKDNG------GENRNPEKILIVVGDGTPT 270 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 27/209 (12%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 328 FSYHMPNWFGTTKYVKPLVQKLCAHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 381 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 382 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 438 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F MR N +V +TDG+ Sbjct: 439 MSGGTATGDAISFTVRNVFGP--MRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 485 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 486 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 513 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 69/173 (39%), Gaps = 20/173 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM++ D + +++ + + ++ +L + + Sbjct: 79 GTKMETVKREGVDIVFAIDVSKSMDAE-DIAPNRLEKSKQLVSQILSSLGSD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + S + ++ L+ +S + + A Q Sbjct: 131 VGIIAYAGGAYPQLPITTDFSAAKMFLQALNTDMIS-SQGTAISDAIELATTYYDDDQQT 189 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N +++ ++DGE+ + +A ++G ++ IG+ + Sbjct: 190 N-------RVLFIISDGEDH----EGNVEDIAEQAAEKGIRIFTIGVGTEKGG 231 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 51/290 (17%), Positives = 93/290 (32%), Gaps = 46/290 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N KGG+ I I LP ++ ++ + + + K L +++ + + E N Sbjct: 25 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALA----LTAENN 80 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N + I I + +N VN + D + Q Y I Sbjct: 81 AKNDTR-NNELISAYINFYLGHRHQLTQYNNITVNYQQN------PDRLYHTQLSQYHID 133 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A F + + + N I + +D++ V D S SME F + Sbjct: 134 ANIEQPTLFPFTSLLIDHDN------FIIGGSAAAIKDVPAMDVVFVTDFSGSMEGDFHN 187 Query: 191 S-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 ++K+D + + I N + F WG Sbjct: 188 PDDPEVLSKLDELKR---IFFKIADDIYTANK----------DSTISFSPFSWGTKSADN 234 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 K L + N+I+ + NT +A+ +K ++ +T Sbjct: 235 KKCSLHF----------MPKEKNKIYPIPSNEIERNT-EAHQEKYMIAIT 273 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 1/74 (1%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDA 357 T+ D + C A+ +G ++Y + L+ CAS S + N + A Sbjct: 1 MNGTEADARLSDICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDVNGTDISSA 60 Query: 358 FSHIGKDIVTKRIW 371 FS I DI ++ Sbjct: 61 FSAIASDIRALKLT 74 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 68/205 (33%), Gaps = 28/205 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q A D+MI +D+S SME+ S K + ++ +L ++ + + GL+ Sbjct: 76 QQKAGRDLMIAVDLSGSMETEDFSQADGKPADRLTAVKTVLRQLA---NERAGDRLGLIV 132 Query: 226 FSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F + ++ F + L + G ST + A ++F Sbjct: 133 FGSSAYLQSPFTEDHRTWLLLLNETRIRMAGPSTALGDAVGLAI-KLFKDAETEH----- 186 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFL 336 ++++ +TDG T + A +Y I + + + L Sbjct: 187 -----RVLLLLTDG--NDTGSLVPPVDAARVAATEDIRIYPIAVGDPTAVGEEAIDLDTL 239 Query: 337 RACA--SPNSFYLVENPHSMYDAFS 359 A + + + + F Sbjct: 240 ARMAEVTGGQAFEALSSEDLIAVFK 264 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/153 (22%), Positives = 61/153 (39%), Gaps = 22/153 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+++N + E + P+++ + Sbjct: 213 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISENLLSDPNMD--I 270 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 271 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 330 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ G E+ N +KI++ + DG Sbjct: 331 ETLYADNG------GENRNPEKILIVVGDGTPT 357 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 71/229 (31%), Gaps = 46/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + T +D++I D+S SM + D +++ ++ Sbjct: 80 SAFSWQNSTTEGIDIVIATDISGSMLAE-DLKPNRLEAGKNIAIDFIKGRPED------- 131 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + GLV FS + L L +S T GL A N++ D + Sbjct: 132 RIGLVIFSGESFTQCPLTIDHDVLINLFSDISNGMVEDGTAIGMGLATAVNRLKDSEAKS 191 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------ 331 K+++ +TDG N + + AK+ VY IG+ Sbjct: 192 -----------KVVILLTDGSNTTGSIPPLTA--AEIAKQMKVRVYTIGVGTKGYAPYPV 238 Query: 332 ---------------SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + ++ N + + + I K Sbjct: 239 KTPFGTQYQQVPVTIDEGVLSKIAGITGGKYFRATNNEKLKEIYQQIDK 287 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 69/177 (38%), Gaps = 31/177 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 10 LRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINN 69 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G+ + K +++N++ + + + Sbjct: 70 TGDTDALKLKVTDWFHAQVENSYTLGE-------------------------IDIDTTNH 104 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 +I+A + +P F N + + + S+VK + L++ IV+D S SM Sbjct: 105 NITATASGTVPTTFMKIA----NIDTVPVSVASAVK--GPATSYLNVYIVIDTSPSM 155 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 71/187 (37%), Gaps = 31/187 (16%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM---- 184 + + +P +++ + + + + +D++ VLDVS SM Sbjct: 6 VKVSTTTMLPTIPRG---HTNKDFRVLLRVEAPPMADLKGHVPIDVVAVLDVSGSMGDPA 62 Query: 185 ----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---- 236 + + +++D+ +++ ++ ++ + +V F+++ + + Sbjct: 63 MASSDFEKNKPPSRLDVLKEAMKFIIRKLDDGD------RLSIVAFNDRPVKEYSTGLLN 116 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G ++K+ +L G T P L+ A + G ++ I+ + Sbjct: 117 ISGNGRRIAEKKVDWLEARG-GTALMPALEEAIRVLDCRPGDSRNSVG-------FILLL 168 Query: 295 TDGENLS 301 TDG++ S Sbjct: 169 TDGDDTS 175 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 50/320 (15%), Positives = 114/320 (35%), Gaps = 48/320 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVH-------AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 + + K L + + + AA+QI N G R + + + ++ + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIAN--PGTARYQQFDDNPVKQVAQNPLAT 132 Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQN--EGYSISAISRYKIPLKF----- 141 F ++ + N + + ++ + + + I+ S K P+ Sbjct: 133 FSLDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNKEPVPASKPIP 192 Query: 142 ------CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 PW + + I + S+ +++ ++D S SM S ++ Sbjct: 193 FAMRYELAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERL 246 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + S+ +++E++ ++ V +G ++I + + + I L G Sbjct: 247 PLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDADG- 301 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNE 314 STN GL+ AY Q ++ N I+ TDG+ +D +S+ + Sbjct: 302 STNGGAGLELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKK 352 Query: 315 AKKRGAIVYAIGIRVIRSHE 334 ++ G + G+ +E Sbjct: 353 QRESGVSLSTFGVGDSNYNE 372 >gi|297698645|ref|XP_002826428.1| PREDICTED: integrin alpha-X-like, partial [Pongo abelii] Length = 836 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 79/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 D++ ++D S S+ S + + A++ + + Q Sbjct: 140 ASRQECPRQEQDIVFLIDGSGSISS------DNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S + + + T++ ++ +++F Sbjct: 189 SLMQFSNKFQTHFTFEKFRSSSNPLSLLDSVHQLRGLTHTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEF 335 + KI++ +TDG+ + D + + +A G I YAIG+ I S + Sbjct: 244 -YGARRDAAKILIVITDGKKEGDRLDYKDVIPMADA--AGIIRYAIGVGSAFEYINSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ ++A I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FEALKDIQNQLKEK 332 >gi|54124354|gb|AAV29939.1| anthrax toxin receptor [Rattus norvegicus] Length = 245 Score = 55.9 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 58/156 (37%), Gaps = 13/156 (8%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGM 276 ++ + FS + L +++ ++ L K G T G + A QI+ Sbjct: 17 LRMSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQ 76 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +I+ +TDGE L S N ++ GAIVY +G++ + Sbjct: 77 GYRTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLA 128 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + + V + + A I I+ K Sbjct: 129 RIADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 161 >gi|239941305|ref|ZP_04693242.1| hypothetical protein SrosN15_09946 [Streptomyces roseosporus NRRL 15998] Length = 516 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 80/212 (37%), Gaps = 28/212 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + V+D+S SM ++D+ KS+ + +E++ Sbjct: 149 LRVGLATKTAPPTSERPPAALTFVVDISGSMAETG-----RLDLVRKSLTILADELRDDD 203 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++ LVTFS++ E + + ++ + + STN G+K Y + Sbjct: 204 SLS------LVTFSDEAETRLPMTRVKDNRNRIKDVVSEMQP-AQSTNVEAGIKLGYEE- 255 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIR 328 ++G R+ +V ++D N E + L + A++ G ++ +G+ Sbjct: 256 -SVEGHREGATNR-------VVLLSDALANTGETEAEGILKKIDSARREYGITLFGVGVG 307 Query: 329 VIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 F+ + + V + F Sbjct: 308 SDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 339 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 71/209 (33%), Gaps = 32/209 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 D++I LD+S+SM F D S I ++ + ++ + + + GL Sbjct: 91 QPARDLLIALDLSQSMDTRDFGDPSGALIPRVQAVRQVVSGFVAR-RPGD------RIGL 143 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + +Q I L ++ G D G+ Sbjct: 144 IVFGDAPYPLAPFTLDHQLVQTLITGLLPGMAGPSTALG---------DAIGLGIKMFEH 194 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHE-----FL 336 +K+++ +TDG T AK+R +V+ IGI E L Sbjct: 195 SEAPEKVLIVLTDG--NDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEKVDLGVL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGK 363 + A + ++ + + ++ + + Sbjct: 253 QRLAAQTGGRYFFGADQAGLETIYATLDR 281 >gi|297465782|ref|XP_609132.4| PREDICTED: collagen, type VI, alpha 3-like isoform 1, partial [Bos taurus] Length = 1803 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVK 163 +I R+ I P+ + +P + S + P + Sbjct: 145 GDRNIDRTELQTITSDPR---LVFTVREFRDLP-SIEERMVNSFGSSGVTPAPPGVDTPS 200 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + D++ +LD S+ DS + + + E V + + + +Q GL Sbjct: 201 PSRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGL 251 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHC 280 V +++ + F L+ + + I + G N+ GL++ F + + Sbjct: 252 VQYNSDPTDEFFLKDFPTKQQIIDAINKVVYKGGRHANTKVGLEHLRRNHFVPEAGSR-- 309 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D +I +T G K + + +RG V+A+G+R I S E + + Sbjct: 310 --LDQRVPQIAFVITGG-----KSVEDAQEASMALTQRGVKVFAVGVRNIDSEEVGKIAS 362 Query: 341 SPNSFYLVENPHSMYD 356 + + + V N + + Sbjct: 363 NSATAFRVGNVQELSE 378 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 1180 RRAAGSDVDIDMAFILDSSESTTPF------QFNEMRKYIEYLVRQLDVSPDPKASQHFA 1233 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSHLQRKI--KYLSKFGVSTNSTPGLKYA 266 + +V + ++G +++ + + Y Sbjct: 1234 RVAVVQHAPYEAVDNASVPPVKVEFSLTDYGSKEKLLAFLGSRMTQLQGTRALGRAIDYT 1293 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 1294 IENIFES--------APNPRDLKIVVLMLTGEVQKQQLEEAQRAIL-QAKCKGYFFVILG 1344 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + + PN + V+ + + Sbjct: 1345 IGRKVNVKELYSSASEPNDVFFKLVDKSTELNE 1377 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 66/186 (35%), Gaps = 23/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S ++ I + ++ + + P+ V+ G++ FSN + Sbjct: 7 DIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGVLQFSNDVFP 57 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQHCNTEDANYK 288 F L+ S I+ L G S N+ L++ A N G R ED + Sbjct: 58 EFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR----IEDGVPQ 113 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++F+ K Y K G +G R I E + P + V Sbjct: 114 HLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTITSDPRLVFTV 167 Query: 349 ENPHSM 354 + Sbjct: 168 REFRDL 173 >gi|297473452|ref|XP_002686619.1| PREDICTED: collagen, type VI, alpha 3-like isoform 4 [Bos taurus] gi|296488814|gb|DAA30927.1| collagen, type VI, alpha 3-like isoform 4 [Bos taurus] Length = 2555 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVK 163 +I R+ I P+ + +P + S + P + Sbjct: 966 GDRNIDRTELQTITSDPR---LVFTVREFRDLP-SIEERMVNSFGSSGVTPAPPGVDTPS 1021 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + D++ +LD S+ DS + + + E V + + + +Q GL Sbjct: 1022 PSRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGL 1072 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHC 280 V +++ + F L+ + + I + G N+ GL++ F + + Sbjct: 1073 VQYNSDPTDEFFLKDFPTKQQIIDAINKVVYKGGRHANTKVGLEHLRRNHFVPEAGSR-- 1130 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D +I +T G K + + +RG V+A+G+R I S E + + Sbjct: 1131 --LDQRVPQIAFVITGG-----KSVEDAQEASMALTQRGVKVFAVGVRNIDSEEVGKIAS 1183 Query: 341 SPNSFYLVENPHSMYD 356 + + + V N + + Sbjct: 1184 NSATAFRVGNVQELSE 1199 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 62/181 (34%), Gaps = 21/181 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + +T + I +++ +++ D ++Q + +++ + Sbjct: 37 DIVFLVDGSSKLG------LTNFNAIRDFIAKVIQRLEIRQD---LIQVAVAQYADTVRP 87 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F + ++ + S + L + N +F + K Sbjct: 88 EFYFNTYPSKREVINAVRKMKALDGSALYTGSALDFVRNNLFT-EAAGYRAAEG---VPK 143 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++V +T G K E K+ G + +A+G +V E + + Sbjct: 144 LLVLVTGG-----KSLDAVSQPAQELKRSGILAFAVGNKVADQAELEEIAFDSSLVFTAT 198 Query: 350 N 350 Sbjct: 199 E 199 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 2001 RRAAGSDVDIDMAFILDSSESTTPF------QFNEMRKYIEYLVRQLDVSPDPKASQHFA 2054 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSHLQRKI--KYLSKFGVSTNSTPGLKYA 266 + +V + ++G +++ + + Y Sbjct: 2055 RVAVVQHAPYEAVDNASVPPVKVEFSLTDYGSKEKLLAFLGSRMTQLQGTRALGRAIDYT 2114 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2115 IENIFES--------APNPRDLKIVVLMLTGEVQKQQLEEAQRAIL-QAKCKGYFFVILG 2165 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + + PN + V+ + + Sbjct: 2166 IGRKVNVKELYSSASEPNDVFFKLVDKSTELNE 2198 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 89/255 (34%), Gaps = 29/255 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + + I + P+ Y S + ++P K T + T + + ++ Sbjct: 762 ARNVDQEELVKISLSPE---YVFSVNTFRELPSLEQKLLTPLTTLTAGQIQQLLASTRYP 818 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 819 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGV 869 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQH 279 + FSN + F L+ S I+ L G S N+ L++ A N G R Sbjct: 870 LQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR-- 927 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ED + +++F+ K Y K G +G R I E Sbjct: 928 --IEDGVPQHLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTIT 979 Query: 340 ASPNSFYLVENPHSM 354 + P + V + Sbjct: 980 SDPRLVFTVREFRDL 994 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 53/150 (35%), Gaps = 3/150 (2%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G I A+F+ + +G+ +++ +F +KT L + +D + AA ++ G + + Sbjct: 12 GAFLITFALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAAREL--NGQSDAITR 69 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + + N N+ N + + + Y+ S Sbjct: 70 AQNAGMAAGNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSDGKLARYAECQYSMSS 129 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I L + +T P T +V+ + Sbjct: 130 IKLWLLQAMGAFTG-DSATWPNTGTVEARA 158 >gi|297473450|ref|XP_002686618.1| PREDICTED: collagen, type VI, alpha 3-like isoform 3 [Bos taurus] gi|296488813|gb|DAA30926.1| collagen, type VI, alpha 3-like isoform 3 [Bos taurus] Length = 2962 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 92/256 (35%), Gaps = 27/256 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVK 163 +I R+ I P+ + +P + S + P + Sbjct: 1373 GDRNIDRTELQTITSDPR---LVFTVREFRDLP-SIEERMVNSFGSSGVTPAPPGVDTPS 1428 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + D++ +LD S+ DS + + + E V + + + +Q GL Sbjct: 1429 PSRPEKKKADIVFLLD--GSINFRRDS-------FQEVLRFVSEIVDTVYEGGDSIQVGL 1479 Query: 224 VTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHC 280 V +++ + F L+ + + I + G N+ GL++ F + + Sbjct: 1480 VQYNSDPTDEFFLKDFPTKQQIIDAINKVVYKGGRHANTKVGLEHLRRNHFVPEAGSR-- 1537 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D +I +T G K + + +RG V+A+G+R I S E + + Sbjct: 1538 --LDQRVPQIAFVITGG-----KSVEDAQEASMALTQRGVKVFAVGVRNIDSEEVGKIAS 1590 Query: 341 SPNSFYLVENPHSMYD 356 + + + V N + + Sbjct: 1591 NSATAFRVGNVQELSE 1606 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 88/259 (33%), Gaps = 26/259 (10%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA---ISRYKIPLKFCTFIPWYTNSRHI 154 L+ + ++ ++ + N+ + S + + +I RHI Sbjct: 368 LKQASVFSFGLGAQAASKAELQHIATNDNLVFTVPEFRSLGDVQEQLLPYIVGVAQ-RHI 426 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+ + V + + R D++ ++D S + +T + I +++ +++ D Sbjct: 427 VLQPPTIVTQVIEVNKR-DIVFLVDGSSKLG------LTNFNAIRDFIAKVIQRLEIRQD 479 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIF 271 ++Q + +++ + F + ++ + S + L + N +F Sbjct: 480 ---LIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDGSALYTGSALDFVRNNLF 536 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + K++V +T G K E K+ G + +A+G +V Sbjct: 537 T-EAAGYRAAEG---VPKLLVLVTGG-----KSLDAVSQPAQELKRSGILAFAVGNKVAD 587 Query: 332 SHEFLRACASPNSFYLVEN 350 E + + Sbjct: 588 QAELEEIAFDSSLVFTATE 606 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +DM +LD S S F + + K I ++ ++ + PD + Sbjct: 2408 RRAAGSDVDIDMAFILDSSESTTPF------QFNEMRKYIEYLVRQLDVSPDPKASQHFA 2461 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSHLQRKI--KYLSKFGVSTNSTPGLKYA 266 + +V + ++G +++ + + Y Sbjct: 2462 RVAVVQHAPYEAVDNASVPPVKVEFSLTDYGSKEKLLAFLGSRMTQLQGTRALGRAIDYT 2521 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M GE + ++ +AK +G +G Sbjct: 2522 IENIFES--------APNPRDLKIVVLMLTGEVQKQQLEEAQRAIL-QAKCKGYFFVILG 2572 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + + PN + V+ + + Sbjct: 2573 IGRKVNVKELYSSASEPNDVFFKLVDKSTELNE 2605 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 31/256 (12%), Positives = 81/256 (31%), Gaps = 31/256 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-- 172 ++ + + I++ + + ++V + SS+ + Sbjct: 173 AIGVEDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSSMAPERAGGTEIPK 232 Query: 173 --------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D++ ++D S ++ + + + +LE + + ++ G+V Sbjct: 233 DITAQDSADIIFLIDGSN------NTGSVNFAVILDFLVNLLERLSI---GTQQIRVGVV 283 Query: 225 TFSNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCN 281 +S++ F L + + +K L G N L + F R + Sbjct: 284 QYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVENHFT----RAGGS 339 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + +++V ++ G + D K+ + +G + E + Sbjct: 340 RAEEGVPQVLVLISAGPSSDEIRDGVIA-----LKQASVFSFGLGAQAASKAELQHIATN 394 Query: 342 PNSFYLVENPHSMYDA 357 N + V S+ D Sbjct: 395 DNLVFTVPEFRSLGDV 410 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 44/255 (17%), Positives = 89/255 (34%), Gaps = 29/255 (11%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL---KFCTFIPWYTNSRHIVMPITSSVK 163 ++ + + I + P+ Y S + ++P K T + T + + ++ Sbjct: 1169 ARNVDQEELVKISLSPE---YVFSVNTFRELPSLEQKLLTPLTTLTAGQIQQLLASTRYP 1225 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S ++ I + ++ + + P+ V+ G+ Sbjct: 1226 PPAVESDAADIVFLIDSSDGVKPDG------IAHIRDFVIRIVRRLNVGPNK---VRIGV 1276 Query: 224 VTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVST-NSTPGLKY-AYNQIFDMQGMRQH 279 + FSN + F L+ S I+ L G S N+ L++ A N G R Sbjct: 1277 LQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARNYFVKSAGSR-- 1334 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ED + +++F+ K Y K G +G R I E Sbjct: 1335 --IEDGVPQHLVLFL------GGKSQDDISRYSQVIKSAGIASLGVGDRNIDRTELQTIT 1386 Query: 340 ASPNSFYLVENPHSM 354 + P + V + Sbjct: 1387 SDPRLVFTVREFRDL 1401 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 65/181 (35%), Gaps = 24/181 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P + + +DM I +D S SM + D ++++ + + ++ + Sbjct: 83 SGPRWGLARGPMSSRG-IDMAIAIDASLSMLAQ-DERPSRLERVKQEV----RRLRAMSP 136 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIF 271 + + L+ F+ + L L+ + L ++ + ++ + Sbjct: 137 AD---RVALIAFAGRSYILTPLTGDDGALELFLDNLDPGVVGQAGSSLSRAIRQGSELLL 193 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 G + + +V ++DGE+ + ED +S +EA +G V +G Sbjct: 194 ASDG----------SADRALVLLSDGESFDSAEDIESA--ASEAGSKGISVVTVGFGTRD 241 Query: 332 S 332 Sbjct: 242 G 242 >gi|332295408|ref|YP_004437331.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] gi|332178511|gb|AEE14200.1| von Willebrand factor type A [Thermodesulfobium narugense DSM 14796] Length = 438 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/194 (18%), Positives = 78/194 (40%), Gaps = 22/194 (11%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + T+ + + + ++ V + +L + VLD S SM + +KI++ I+S+ Sbjct: 13 LLADTSGQKLFVLLSIEVSKEVEERGKLFVSFVLDTSGSMSETVN-DKSKIEIVIESLKK 71 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPG 262 +LE L D +VTF ++++ + + + V TN G Sbjct: 72 ILESNILKDDDE----ISIVTFDDEVKIVLPFTAATEKEKIFSSFEQIRTGTVGTNLGAG 127 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 +K + + + D G+++ +V +TDG ++ L +E V Sbjct: 128 MKVSLDLLKDKAGIKK------------MVVLTDGNVFDLDLVEKVL---DELVFSNISV 172 Query: 323 YAIGIRVIRSHEFL 336 ++G+ + + L Sbjct: 173 ISVGVGDEWNEDLL 186 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 75/247 (30%), Gaps = 39/247 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKI 195 +P +SS K+ ++ ++D S S+ Sbjct: 4 LPFNKIQSAANEACLTTSADFASSSQKLAKTPGKEAQRLVFLVDESSSVG------QANF 57 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---------LQRK 246 +K + +L + ++P + +VTFS+K +++ L R+ Sbjct: 58 LSELKFVRKLLSDFPVVPTA---TRVAIVTFSSKNNVVPRVDYISHRRAHQHKCALLSRE 114 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I ++ G T + + A + N K+I +TDG + Sbjct: 115 IPAITYRGGGTYTKGAFQQAAQILRHS----------RENSTKVIFLITDGYSNG----G 160 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + G ++ GI E ++P E + +F + + Sbjct: 161 DPRPVAASLRDFGVEIFTFGIWQGNIRELNDMASTPK-----EEHCYLLHSFEEF-EALA 214 Query: 367 TKRIWYD 373 + + D Sbjct: 215 RRALHED 221 >gi|156402481|ref|XP_001639619.1| predicted protein [Nematostella vectensis] gi|156226748|gb|EDO47556.1| predicted protein [Nematostella vectensis] Length = 154 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 63/159 (39%), Gaps = 21/159 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ ++D S S+E + + K+ + + +++E ++ G++ + +K E Sbjct: 3 VDLGFIIDSSGSIEFYGKGNFKKV---LDFVGGIVKEFEV---SKQGTHVGIIRYDHKAE 56 Query: 232 --EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + F + KI ++ G T + L A + ++ + Sbjct: 57 ILKPFGQVTDKQGVLDKISKITFTGGGTKTGQALTLAMDGLYQID--------NRKEVPD 108 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +++ +TDG + N+ K G V+A+GI Sbjct: 109 VLIVLTDG-----ASKDSVVDPANKLKNSGVTVFAVGIG 142 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/217 (14%), Positives = 75/217 (34%), Gaps = 33/217 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSS-----ITKIDMAIKSINAMLEEVKLIPDVNN 217 + ++ D+M+ +D+S SM SS ++++D ++ + Sbjct: 89 EPQTREKLGRDVMVAVDLSGSMSEMDFSSSDGQAVSRLDAVKSVLHEFVAT-------RE 141 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GL+ F + + ++ G ST+ + A ++F+ Sbjct: 142 GDRLGLILFGDAAYLQTPFTADHDVWLALLDQTEVAMAGQSTHLGDAIGLAI-KVFEQSE 200 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SH 333 + +K++V +TDG + + + + AK G ++ I + Sbjct: 201 S-------SKDKEKVVVVLTDGNDTGSFVEPKDAAIVAAAK--GVRIHVIAMGDPATIGE 251 Query: 334 EFLRACA-------SPNSFYLVENPHSMYDAFSHIGK 363 + L S + + ++ A+ IG+ Sbjct: 252 QALDMATIDNIASQSGGQAFQALDQEALQQAYRTIGE 288 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 1/74 (1%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSFYLVENPHSMYDAF 358 ++ D C AK RG ++ I + E L CA+ ++ Y S+ F Sbjct: 503 TGSQSDTLMSANCTAAKDRGITIFTIAFEAPSNAETQLNNCATSDNHYYDAQGTSITSVF 562 Query: 359 SHIGKDIVTKRIWY 372 S I I ++ Sbjct: 563 SSIATTIQKLKLTL 576 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 76/234 (32%), Gaps = 40/234 (17%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 FS+ N+ + KG MT + I+ G+ I+ + + +D +++ A Sbjct: 22 FSMRKVFSNWRKSEKGSMTAFGIFIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSA 81 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 A+ + G + + D+ + S+++ Sbjct: 82 ASLLQTLG-------------------------AEAVVTDYMSKANIDVDYNLSVNVSEG 116 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 +A + + T N + + +TS + + L++ +VLDVS Sbjct: 117 INFRAVDATATATLE------TLFLGLLNIDSLGITVTSGAE---ERIPNLEISLVLDVS 167 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 SM S +++ + + + V ++ FS+ + Sbjct: 168 GSMGSN-----SRLTNLKTAATQFVSTIISGGSG-GTVAMSIIPFSSSVTPSQS 215 >gi|4139906|pdb|1AOX|A Chain A, I Domain From Integrin Alpha2-Beta1 gi|4139907|pdb|1AOX|B Chain B, I Domain From Integrin Alpha2-Beta1 Length = 203 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 8 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 56 Query: 232 EFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + + G TN+ ++YA + + + Sbjct: 57 VVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSA------ASGGRRSAT 110 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 111 KVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAI 167 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 168 ASIPTERYFFNVSDEAALLEKAGTLGEQIF 197 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 68/175 (38%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ A + + ++K + Sbjct: 79 GTKIETVKREGVDIVFAIDVSKSMLAE-DVAPNRLEKAKRIAFETINQLKGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V ++ L S + ++ ++ T ++ A N D Sbjct: 131 VGIVAYAASAYPQLALTTDHSAAKMFLQSMNTNMLSSQGTAIQEAIRMATNYFDDKSTTS 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +++ ++DGE+ + + EA+++G +Y IG+ + Sbjct: 191 -----------RLLFIISDGEDH----EMGATEIAAEAQEKGIHIYTIGVGTEKG 230 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 107/294 (36%), Gaps = 37/294 (12%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-------DID 108 R+ AT I N G R + + + ++ ++F ++ + N + Sbjct: 87 RAAKANATHIAN--PGTARYQQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLNQGLL 144 Query: 109 DIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT-----FIPWYTNSRHIVMPITSS 161 + ++ VV P + P+ F PW + + I + Sbjct: 145 PPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 204 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 S+ +++ ++D S SM S ++ + S+ +++E++ ++ V + Sbjct: 205 -DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYA 258 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G ++I + + + I L G STN GL+ AY Q ++ N Sbjct: 259 G----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAGLEMAYQQA-AKGFIKGGIN 312 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ +D +S+ + ++ G + +G+ +E Sbjct: 313 R--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNE 358 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 44/247 (17%), Positives = 80/247 (32%), Gaps = 35/247 (14%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAI 199 +P I++ I + + +++ +DVS SM + D + ++ A Sbjct: 54 RHVPAALFLLTIIVLILALARPAMDVQVPRERATILVAIDVSPSM-AATDVAPDRLTSAK 112 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 +S + +E + + GLV FS+ + I L+ T Sbjct: 113 ESAQSFIESLPPRFN------VGLVAFSSVATVVASPTQDHQAVADSIANLT-ISSGTAI 165 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G+ + I D IV ++DGEN S + + A G Sbjct: 166 GEGVFASLQAIRSFDE-----KATDDPPPAAIVLLSDGENTSGRPVAAAADEARAA---G 217 Query: 320 AIVYAIGIR--------------VIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 V I E L+ A + FY E+ + D ++ IG Sbjct: 218 VPVSTIAFGTGVSIIEIEGHYVPANIDKETLKELAMTTGGRFYEAESTGELKDVYADIGS 277 Query: 364 DIVTKRI 370 + T+ + Sbjct: 278 SLGTETV 284 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 87/222 (39%), Gaps = 28/222 (12%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +S+ ++ S++ + L++ ++LD S SM ++ ++ +++++ Sbjct: 20 SSQRLIAISVSAIGETIDSRVPLNLCLILDHSGSM------KGQPVENVKRAAWLLVDKL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + +V F+++ E + H++++I LS G T+ GL+ Sbjct: 74 RDQD------RLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANG-GTSIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +T + +TDGEN ++ + L + A V +G Sbjct: 127 EEL-----AKGRRDTISQAF-----LLTDGENEH-GDNNRCLKFAQLAADYNLTVNTLGF 175 Query: 328 RVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L A S +E+P D F+ + + T Sbjct: 176 GNNWNQHILEKISDAGLGSLSHIEHPDQAVDKFNSLLMRMQT 217 >gi|194675927|ref|XP_001788181.1| PREDICTED: integrin, alpha E [Bos taurus] gi|297486662|ref|XP_002695836.1| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Bos taurus] gi|296476814|gb|DAA18929.1| integrin, alpha E [Bos taurus] Length = 1163 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 75/203 (36%), Gaps = 28/203 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + IVLD S S++ K I + +K + +V + I+ Sbjct: 185 IAIVLDGSGSID---PPDFQKAKDFI------YDMMKNFYTKCFECRFAVVQYGEVIQTE 235 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L ++ + + S+ G T + +++ + IF A K+ Sbjct: 236 LDLLD-SQDVRASLDRVKNISQVGKITKTASAMQHVLDNIFTPNQ------GSRAKASKV 288 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---N 343 +V +TDGE + ++ N K G +AIG+ +++ L+ AS + Sbjct: 289 MVVLTDGEIFQDPLNLTTVI--NSPKMHGVERFAIGVGEAFNKSKAYHELKLIASDPDED 346 Query: 344 SFYLVENPHSMYDAFSHIGKDIV 366 + V N ++ + + + I+ Sbjct: 347 HAFKVTNYMALDGLLNKLQESII 369 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F +++G + +L + + + ++ G ++ + VL D + + A++I + Sbjct: 8 FVSDHRGNVVMLFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVASRITAATS 67 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + K N + + +T + + + + I Sbjct: 68 TADAIKQA---------------------QNVLRSASQRLAAATISNATISADRKTFCI- 105 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + T I + + S ++ + ++ + LD S SM Sbjct: 106 -----DAQVSIPTMIMKIARIDSMAPAVMSCAEIGGGSTN-YEIALALDNSGSMNESAGG 159 Query: 191 SITKIDMAIKSINAMLEEV 209 TKI + + + Sbjct: 160 -ATKIQSLKTAATNFVNSM 177 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 62/209 (29%), Gaps = 66/209 (31%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + L S + KI L+ G TN G+ +A+ I A Sbjct: 368 NAATQTALQLTPTQSTITAKIAQLTAAG-DTNLHEGVMWAWRSISPNPPFSAGSAYNTAG 426 Query: 287 YKKIIVFMT-------------------------------------------------DG 297 +KI+V MT DG Sbjct: 427 VRKILVLMTDGYNNWTSNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLDG 486 Query: 298 ENLSTKE---------DQQSLYYCNEAKKRGAIVYAIGIRV------IRSHEFLRACAS- 341 S + D+ + C AK +G +Y+I V L++CA+ Sbjct: 487 AANSWTDYKSVSRQAQDELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCATN 546 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + L + + AFS I ++ R+ Sbjct: 547 ADHYLLATDSTQIDRAFSQIAMNLSKLRL 575 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 76/227 (33%), Gaps = 44/227 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P+ + + + + +VLDVS SM + ++I +A ++ E V +P Sbjct: 29 MTGPVPALADWETPVEPP-KVELVLDVSGSMRATDIDGRSRISVAQQA---FNEVVDALP 84 Query: 214 DVNNV-VQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 D + ++ T+ + +I ++ + + L G T Sbjct: 85 DETELGIRVLGATYPGDDKEQGCQDTQQIVPVGPVD--RVQAKAAVATLRPTGY-TPVGL 141 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---- 317 L+ A + R+ IV +TDGE+ D C A++ Sbjct: 142 ALRSAAEDLGTGSTARR------------IVLITDGEDTCAPPD-----PCEVARELAAQ 184 Query: 318 -RGAIVYAIGIRVIR--SHEFLRAC-ASPNSFYLVENPHSMYDAFSH 360 +V +G+ + L A+ ++ ++ + Sbjct: 185 GTKLVVDTLGLAPDEKVRQQLLCIAGATGGTYTAAQSADELTGRIKQ 231 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 78/217 (35%), Gaps = 14/217 (6%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G I+ LPI+ +++ ++++ + + + + ++ Sbjct: 20 KKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLA----LIASPKEG 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + K + +I + + + E+ F + + P + +SA Sbjct: 76 DEKNQEYAQ---KIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAP-FTDFVVSAK 131 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSS 191 +++ + + T + + LD+ ++D+S SM + S Sbjct: 132 TKHDSWISYNEGEMGLTK----DFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSG 187 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 TK D+ +IN ++++++ + + ++ F + Sbjct: 188 KTKYDVVADTINRIVDDLREFKT-DRKSRVAVIGFHH 223 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 68/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D +++ A ++ + GL F+ Sbjct: 91 VILAIDVSLSMRA-TDVPPSRLAAAQAGAKTF------ADNLTPGINLGLEAFAGTASML 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + +L + T + + A I + G+ IV Sbjct: 144 VSPITDHTATDNALDHL-QLAERTATGEAIFTALQAIDTLAGVLGG---GSTPPPARIVL 199 Query: 294 MTDGENLST---KEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFL 336 +DG+ + + + AK++G + I V E L Sbjct: 200 ESDGKQTVPADLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESL 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 R A S SF+ + + ++ ++ + I Sbjct: 260 RRIAELSGGSFFTATSADELQASYQNLQQQI 290 >gi|209527269|ref|ZP_03275780.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492336|gb|EDZ92680.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 414 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 73/200 (36%), Gaps = 26/200 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P +++ + S+V + +++ ++LD S SM + ++ ++ + Sbjct: 16 PNQESNQRQLSISVSAVPDPFEGQVPMNLCLILDHSGSM------NGQPLETVKQAAKEL 69 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL 263 ++ + + + +V F ++ + + +++KI L G T G+ Sbjct: 70 IDRLNVGD------RISVVAFDHRAKVLVPNQDLTDPDGIKKKIDGLRCSG-GTAIDEGI 122 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K ++ + R +TDGEN ++++ L A + + Sbjct: 123 KLGIEELGKGKQDRISQG----------FLLTDGENEH-GDNKRCLKLAKLATEYKLTIN 171 Query: 324 AIGIRVIRSHEFLRACASPN 343 ++G + + L A Sbjct: 172 SLGFGDDWNQDILEKIADAG 191 >gi|299139640|ref|ZP_07032813.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598264|gb|EFI54429.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 488 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 77/231 (33%), Gaps = 25/231 (10%) Query: 157 PITSSVKVNSQTDARLDMM-IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P +S +++ +++D+S SM T+ +I L ++ D Sbjct: 88 PFYASAGSGPDAGQHTNVVLMIVDISGSMNQPMAGGSTRFQSLKSAIAQFLAGMQEGSD- 146 Query: 216 NNVVQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQ 269 + +V F + + + ++ L G +T + + Sbjct: 147 ----RVAIVPFESHNVVSTIRSAVFTTHRADALAQLNALPAPGPKNNTALYQAVFSGVDS 202 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN------EAKKRGAIVY 323 + QH A + ++ MTDG+N D L + A+ + + + Sbjct: 203 MKGELASLQHEGATLAELQPHLIVMTDGKNEVAPGDDPQLLNGDLGLQQAVAQVQTSNLD 262 Query: 324 AIGIRVIRSHEF----LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 IGI ++ L+ F+ + + + A H+ + V+ I Sbjct: 263 TIGIGFGDKNDIDAGALQKLT--KRFFYASDANQLLAAL-HVSRSAVSHSI 310 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 48/273 (17%), Positives = 100/273 (36%), Gaps = 43/273 (15%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 D++V S + ++ + Y +F + R + P+ +++ Sbjct: 91 FSDDELVPDVSQGQPSSSRPHAITVKTLPEYPAVSASESFSKFGVLVRVLAPPLDNTLPH 150 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + A +D++ VLDVS SM K+ + +++N +++ + + +V Sbjct: 151 H---RAPIDIVNVLDVSGSM-------AGKLILLKRAVNFIIQNLGPSD------RLSIV 194 Query: 225 TFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 TFS+ L G + LS G TN GL+ + + RQH Sbjct: 195 TFSSSARRILPLRTMSGSGREDAISVVNSLSATG-GTNIVAGLRKGVRVL---EERRQHN 250 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-----------NEAKKRGAIVYAIGIRV 329 + I+ ++DG + + L Y E+++ ++ G + Sbjct: 251 SVAS------IILLSDGCDTQSHSTHNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGL 304 Query: 330 IRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + A + S +F +E+ + DAF+ Sbjct: 305 DHDSAAMHAISDVSGGTFSFIESIDILQDAFAR 337 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 75/205 (36%), Gaps = 35/205 (17%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 N I T ++ ++ Y + + + K P P + S + Sbjct: 210 NSQKSIEIKTYPEVPSASRSCAYDNFTVLVHLKAPATVTMQNPRINQA--------SLPQ 261 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ A +D++ VLD+S SM TK+ + +++ +++ + + + Sbjct: 262 LSQSPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFVIQNLGSND------RLSV 309 Query: 224 VTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS+ F L G + + L G TN GL+ +G + Sbjct: 310 IAFSSTARRLFPLRRMSDTGRQQALQAVNSLVAHG-GTNIAEGLR---------KGAKVM 359 Query: 280 CNTEDANYKKIIVFMTDGENLSTKE 304 + + N I+ ++DG++ T Sbjct: 360 EDRREKNPVASIILLSDGQDTYTVS 384 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 74/201 (36%), Gaps = 26/201 (12%) Query: 136 KIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 K PLK T + + +++ + K++ +++ ++D S SM+ Sbjct: 212 KTPLKATTTLSSCPWNPDNYLLRIGLQAKKIDFTKAPPSNIVFLIDTSGSMDE-----PN 266 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLS 251 K+ + S +L+ ++ + +V ++++ I + + I L Sbjct: 267 KMPLLKASFKLLLDNLRPED------RIAIVVYASQTGIALPSTPAKEKEKISKVIDDLV 320 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLY 310 G ST GL+ AY + + + + N I+ TDG+ N+ + Sbjct: 321 ASG-STAGGAGLQTAYE-VAEKNFLPKGNNR--------IILATDGDFNVGISSRDELQR 370 Query: 311 YCNEAKKRGAIVYAIGIRVIR 331 E + G + +G + Sbjct: 371 LVEEKRNNGIYISVLGYGMGN 391 >gi|260820654|ref|XP_002605649.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] gi|229290984|gb|EEN61659.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] Length = 168 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 51/137 (37%), Gaps = 15/137 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+ V+D S S+ + + + K + + V + + G+V F+ Sbjct: 2 TPMDLAFVIDGSASVGPL------QFEKSKKFVR---DMVDGFDIGSAQTRVGVVQFAWM 52 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ F L + + L+ I + G T L + ++F E + Sbjct: 53 VQAEFNLGDYLDGTDLRNAIARIRYMDGPGTEIGKALVFTKRRLFS---ELYGARPETQD 109 Query: 287 YKKIIVFMTDGENLSTK 303 +I++ +TDG + Sbjct: 110 VPRIVILITDGRSSPES 126 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 56/335 (16%), Positives = 108/335 (32%), Gaps = 57/335 (17%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 A +N + I + + EL ++ D + +DI Sbjct: 255 QTAVAKVNSNGWGESVSVLSRQIKTPDEPANPIRISVELNSGFSISPEDIVSEHHPIDIT 314 Query: 120 -VVPQNEGYSISAISRY---------------KIP--LKFCTFIPWYTNSRHIVMPITSS 161 Q Y I + IP F I Y + ++ P Sbjct: 315 PKDQQAGHYQIELTKEHIANQDFSLRWKPTVGDIPSAAHFSETIGNYRYAMVMLTPPRQD 374 Query: 162 VKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + T + ++ +LD S SM I A ++++ L ++ VN Sbjct: 375 DADDKSTKTPVSAREVVFLLDTSGSMAG------ESIVQAKRAVDFALTQLHPEDSVN-- 426 Query: 219 VQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ F++ + + L + + + LS G T P L A + Sbjct: 427 ----VIEFNDAPQALWNLAMPATANNIQRARNWVASLSANG-GTEMAPALSMA---LHKT 478 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +Q+ N + +VF+TDG + ED N+ ++ IGI Sbjct: 479 NLEQQNINEGSPVQLRQVVFITDGS--VSNEDALMSLIENQLADS--RLFTIGIG----- 529 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 ++PNS+++ + + F++IG + Sbjct: 530 ------SAPNSYFMTQAAQAGRGTFTYIGDINQVQ 558 >gi|218661390|ref|ZP_03517320.1| von Willebrand factor type A [Rhizobium etli IE4771] Length = 370 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 60/165 (36%), Gaps = 20/165 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++DVS SM+ + K+ + S ++ +K V V +G + Sbjct: 4 LVFLIDVSGSMDE-----LDKLPLLKSSFRLLVNRLKADDTVAIVTYAGNAGTVLEPTRV 58 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 S + I L G ST G++ AY + ++ N ++ Sbjct: 59 SE----KSKILSAIDRLEA-GGSTGGAEGIEAAY-DLAQKAFVKDGVNR--------VML 104 Query: 294 MTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 TDG+ N+ D+ E +K G + +G ++ L Sbjct: 105 ATDGDFNVGPSSDEDLKRIIEEKRKEGIFLTVLGFGRGNLNDSLM 149 >gi|260818868|ref|XP_002604604.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] gi|229289932|gb|EEN60615.1| hypothetical protein BRAFLDRAFT_126772 [Branchiostoma floridae] Length = 2219 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 74/208 (35%), Gaps = 41/208 (19%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++D++ VLD S S+ + + D ++ + +L E + + + ++T+S Sbjct: 70 GQVDLIFVLDRSASIGWYNGA----WDSMLQFVENLLYEFSV---NSVHTRVSIITYSTT 122 Query: 230 IEEFFL-LEWGVS-------HLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + L G + L I+ T +T L+ A + Sbjct: 123 VSVDVDYLSDGSASSRLTKCKLNDDIEKRLRNKALHGWTATTTALERAKQVLLTS----- 177 Query: 279 HCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCN--------EAKKRGAIVYAIGIRV 329 KK I +TDG + + N A+ R ++A+GI Sbjct: 178 -----RPTAKKAIFLLTDGRSNIGQPPSIPAREISNLRWSGWDMAAQGRQVEIFALGIED 232 Query: 330 IRSHEFLRACASP---NSFYLVENPHSM 354 E LR+ A P + ++L++N Sbjct: 233 AVEAE-LRSIAYPSQQDHYFLLDNFQDF 259 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 69/204 (33%), Gaps = 32/204 (15%) Query: 169 DARLDMMIVLDVSRSM--ESFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 D++I LD+S+SM F D + I ++ + ++ + + + GL Sbjct: 91 QPARDLLIALDLSQSMDTRDFRDPAGALIPRVQAVRQVVSGFVAR-RPGD------RIGL 143 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + +Q I L ++ G D G+ Sbjct: 144 IVFGDAPYPLAPFTLDHRLVQTLIADLLPGMAGPSTALG---------DAIGLGIKMFEH 194 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-------EFL 336 +K+++ +TDG T AK+R +V+ IGI + + L Sbjct: 195 SEAPEKVLIVLTDG--NDTASRMPPERAGGIAKERKVVVHTIGIGDPNASGEEKVDLDVL 252 Query: 337 RACA--SPNSFYLVENPHSMYDAF 358 + A + ++ + + + Sbjct: 253 QKLAAQTGGRYFFGADQAGLETIY 276 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/217 (11%), Positives = 78/217 (35%), Gaps = 14/217 (6%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G I+ LPI+ +++ ++++ + + + + ++ Sbjct: 4 KKQQGVAGIIFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLA----LIASPKEG 59 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + K + +I + + + E+ F + + P + +SA Sbjct: 60 DEKNQEYAQ---KIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAP-FTDFVVSAK 115 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSS 191 +++ + ++ + + LD+ ++D+S SM + S Sbjct: 116 TKHD---SWISYNDGEMGLTK-DFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSG 171 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 TK D+ +IN ++++++ + + ++ F + Sbjct: 172 KTKYDVVADTINRIVDDLREFKT-DRKSRVAVIGFHH 207 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 78/201 (38%), Gaps = 38/201 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SM S TK+ +++ +LE++ G++ F ++I+ Sbjct: 435 VVFVIDMSGSM------SGTKMQQTREAMLKILEDLDPEDHF------GIILFDHRIQF- 481 Query: 234 FLLEWGVSHLQR----------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 W S + +K + +G + + P LK A + + + R+ Sbjct: 482 ----WNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLK-AVDML---KEDRKAKRLP 533 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASP 342 + + +I+ +TDG+ S + + +A G ++++G + FL + Sbjct: 534 EKSID-MIILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLDVMSRE 592 Query: 343 NS-----FYLVENPHSMYDAF 358 N+ Y + F Sbjct: 593 NNGLARRIYEGSDAALQLQGF 613 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 81/232 (34%), Gaps = 49/232 (21%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + D NV + F Sbjct: 244 GASSPKDMVILVDVSGSVSGLT------LKLIKASVTEMLDTL-SDDDYVNVAR-----F 291 Query: 227 SNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + K E + ++ + G +T+ G +A+NQ+ + R Sbjct: 292 NEKAEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKG-TTDYKSGFHFAFNQLLNTNVPRA 350 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LR 337 +CN KII+ TDG ++ + + N+ V+ + L+ Sbjct: 351 NCN-------KIIMLFTDGGEDRAQDIFEQYNWPNKT----VRVFTFSVGQHNYDVTPLQ 399 Query: 338 --ACASPNSFYLVE-------NPHSMYDAF-------SHIGKDIVTKRIWYD 373 AC++ ++ + N D I K + ++ D Sbjct: 400 WIACSNKGYYFEIRSICAIRINTQEYLDVLGRPMVLAGRIAKQVQWTNVYQD 451 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 82/232 (35%), Gaps = 33/232 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 P + + ++D +VLDVS SM + +++ A ++ N +L+ Sbjct: 22 FAFPAGAVADETTADAPKVD--LVLDVSGSMRARDIDGGSRMAAAKQAFNEVLD--ATPE 77 Query: 214 DVNNVVQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +V +++ + + ++ L+ + + + LS G T P Sbjct: 78 EVRLGIRTLGANYPGDDRKTGCKDTAQLYPVSTLD--RTEAKTAVATLSPTGW-TPIGPA 134 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L A + + G H IV +TDGE+ D + AK G + Sbjct: 135 LLKAADDL--DGGTGSHR----------IVLITDGEDTCAPLDPCEVAREIAAKGVGLTI 182 Query: 323 YAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +G+ +C A+ ++ VE+ + D + + K + Sbjct: 183 DTLGLVPNSKLSKQLSCIAEATGGTYTSVEHKEDLTDKVNQLVDRAADKVVT 234 >gi|3236370|gb|AAC23667.1| type VI collagen alpha 3 subunit [Mus musculus] Length = 2657 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/311 (13%), Positives = 106/311 (34%), Gaps = 27/311 (8%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + +G V+ Sbjct: 910 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQ-----VISSSGIVSLGIG 964 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 D+ + + + ++P + + + P + Sbjct: 965 DRNIDRTDLQTITNDPRLVFTVREFRELPNIEERVMLSFGPSGATPQPPGVDLPSPSRPE 1024 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S+ DS + + V + + + ++ GLV +++ Sbjct: 1025 KKKADIVFLLD--GSINFRRDS-------FQEVLRFASVIVDTVYEDGDSIRVGLVQYNS 1075 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L + + I + G N+ G+++ + + + + D Sbjct: 1076 DPTDEFFLRDFSTKRQIIDAINKVVYKGGRHANTRVGIEH----LLRNHFVPEAGSRLDV 1131 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +I +T G+++ +D ++G V+A+G+R I S E + ++ + Sbjct: 1132 RVPQIAFVITGGKSVEDAQDVSLALT-----QKGVKVFAVGVRNIDSEEVGKIASNSATA 1186 Query: 346 YLVENPHSMYD 356 + V + + + Sbjct: 1187 FRVGSVQELSE 1197 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 71/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 419 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 468 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G +T N+ L++ I + + Sbjct: 469 EFYLNSHMDQQGVISAIRRLTLLGGTTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 524 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 525 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 579 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 580 TFREL----GTIQRVISERVIQLNR 600 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 85/243 (34%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P K T I T+ + + ++ + D+ Sbjct: 768 LVKISLSPEYVYSVSTFRELPRLEQKLLTPITTLTSQQIHQILASTRYPPSVVESDAADI 827 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +++ I ++ ++ + + P V+ G+V FSN + F Sbjct: 828 VFLIDSSDAVKPDG------IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEF 878 Query: 235 LLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 879 YLKTHKSQSSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 935 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 936 LFL------GGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREF 989 Query: 352 HSM 354 + Sbjct: 990 REL 992 >gi|192435|gb|AAA37381.1| complement component C2 [Mus musculus] Length = 753 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 80/238 (33%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VTVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 N+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGANTYEVLIRVYSMMQTQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|116591|sp|P21180|CO2_MOUSE RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|192434|gb|AAA37380.1| complement component C2 [Mus musculus] gi|192437|gb|AAA63294.1| complement component C2 [Mus musculus] Length = 760 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 80/238 (33%), Gaps = 28/238 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T + TN ++ + K+ Q L++ ++LD S+S+ D+ KS Sbjct: 230 LTNLLGATNPTQNLLTKSLGRKIIIQRSGHLNLYLLLDASQSVTE------KDFDIFKKS 283 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------F 253 M+E + V ++TF+++ + +L + I L Sbjct: 284 AELMVERIFSFEVN---VTVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHEN 340 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY- 311 N+ L Y+ + + + I+ +TDG N+ + Sbjct: 341 ATGANTYEVLIRVYSMMQTQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIR 400 Query: 312 ----CNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +YAIG+ + +E +++++ ++ F H Sbjct: 401 ELLSIEQNRDDYLDIYAIGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 458 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 72/207 (34%), Gaps = 22/207 (10%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q + ++ +DVS SM F D + + D ++K+I+ L+ + GL Sbjct: 92 QKRSLTNIQFAVDVSGSMLAPFGDGN--RYDASMKAIDTFLD-FRKGDAF------GLTF 142 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTED 284 F + + L V+ ++ ++ V + A N +R+ Sbjct: 143 FGDAFVHWVPLTTDVTAIRCSPPFMRPETVPPPFGGTAIAKALNGCKT--ELRR-----R 195 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACA-SP 342 K+IV +TDG + + + G V+ I + E + C + Sbjct: 196 DEGDKMIVLITDGFSYDLTGNDE--EIARTLSAEGVAVFCIIVGGFEPQAEIVNICRLTG 253 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKR 369 + ++P ++ F I + Sbjct: 254 GEAFRADDPDALPAVFKKIDTMKQAQL 280 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 75/204 (36%), Gaps = 30/204 (14%) Query: 173 DMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 D+++++D+S SM F + +ID ++ +L+E + + + GL+ F + Sbjct: 95 DLLLIVDLSGSMDARDFTNPEGERIDRL-SAVKGVLDEFLTRREGD---RVGLIVFGSAA 150 Query: 230 -IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ F + + + + G T L +F+ + + Sbjct: 151 FVQVPFTQDLNACRILLEETTVRMAGPRTVFGDALGLGIT-LFERSEVEE---------- 199 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACAS 341 ++I+ +TDG T A V+ +G+ + E L A AS Sbjct: 200 RVIIALTDG--NDTGSRVPPAEAAKIANDNTVKVHVVGVGDPTTTGEDVLDEEALNAVAS 257 Query: 342 --PNSFYLVENPHSMYDAFSHIGK 363 ++ N + D ++ + + Sbjct: 258 TTGGRYFHANNREELEDIYTELDR 281 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 I+LD S SM + S + + ++ + + GLVT++ Sbjct: 236 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKPRYGLVTYATVPKV 295 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 296 LVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYS-MMSWPGDAPPEGWNR 354 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIR----VI 330 + +I+ MTDG + + + + ++ VY G+ + Sbjct: 355 TRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPV 412 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + V++ + + F + + + + Sbjct: 413 NINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSLGLC 453 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 I+LD S SM + S + + ++ + + GLVT++ Sbjct: 260 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKPRYGLVTYATVPKV 319 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 320 LVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYS-MMSWPGDAPPEGWNR 378 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIR----VI 330 + +I+ MTDG + + + + ++ VY G+ + Sbjct: 379 TRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPV 436 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + V++ + + F + + + + Sbjct: 437 NINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSLGLC 477 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 I+LD S SM + S + + ++ + + GLVT++ Sbjct: 613 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKPRYGLVTYATVPKV 672 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 673 LVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYS-MMSWPGDAPPEGWNR 731 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIR----VI 330 + +I+ MTDG + + + + ++ VY G+ + Sbjct: 732 TRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPV 789 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + V++ + + F + + + + Sbjct: 790 NINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSLGLC 830 >gi|55378019|ref|YP_135869.1| hypothetical protein rrnAC1219 [Haloarcula marismortui ATCC 43049] gi|55230744|gb|AAV46163.1| unknown [Haloarcula marismortui ATCC 43049] Length = 788 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 82/215 (38%), Gaps = 37/215 (17%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V+V + T +++I++DVS S ES K++ + ++ + + Q Sbjct: 365 VRVGNATGGESNIVILVDVSGSAESGLS--------VQKAVA-----LDVLDQLGDENQV 411 Query: 222 GLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V F+ ++ E L + KI+ L G +T+ GL+ A + D +G Sbjct: 412 GVVAFNQNAYRVSEMQALGQNRAETADKIRRLES-GGATDIAVGLQGADELLDDREGT-- 468 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 I+ ++DG++ N+ + G V ++G+ +R Sbjct: 469 ------------IILLSDGQDRLGPP----AAVANQLGREGTRVVSVGVGKRVGVATMRQ 512 Query: 339 CASP--NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 AS S++ + + F + + + Sbjct: 513 IASESGGSYFAADETERLRLLFGGSSRRYQGENLT 547 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 68/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 I+LD S SM + S + + ++ + + GLVT++ Sbjct: 258 IILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYGVKPRYGLVTYATVPKV 317 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 318 LVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQAVYS-MMSWPGDAPPEGWNR 376 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIR----VI 330 + +I+ MTDG + + + + ++ VY G+ + Sbjct: 377 TRH--VIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPV 434 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + V++ + + F + + + + Sbjct: 435 NINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSLGLC 475 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 73/215 (33%), Gaps = 50/215 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVV--QSGLVTFSNKI 230 MM+ +D+S SM I +++ + +N + ++ D + V + GL+ F++ Sbjct: 90 MMLAVDLSGSM------KIDDMELNGRQVNRLTMTKSVLYDFIQRRVGDRLGLILFADTA 143 Query: 231 EEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + + + G T + A + + + Sbjct: 144 YLQAPLTYDRDTVSTLLSESVIGLVGEQTAIGDAIGLAVKRFDEKEESNN---------- 193 Query: 289 KIIVFMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-------------- 331 +++ +TDG+N + T E + L + VY IG+ + Sbjct: 194 -VLILLTDGQNTAGFITPEQAKELAV-----NKKVKVYTIGVGADKMLIQSFFGSRQVNP 247 Query: 332 ----SHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + L AS ++ + + + ++ Sbjct: 248 SQELDEDMLSDLASSTGGQYFRARDVNELEAIYAK 282 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 54/336 (16%), Positives = 107/336 (31%), Gaps = 51/336 (15%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 AT+ ++ G + G I + F + F+ D + + + Sbjct: 182 AEATRCSSDITGKYIFRNDKGQIRTCNTERRSQLFEAGCQ---FIPDKTQTTPVSIMYMQ 238 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM----PITSSVKVNSQTDARLDM- 174 + + + K P + Y ++ ++M + SS + L + Sbjct: 239 NLSSVTQFCDESSHNIKAP-NMQNKMCDYRSTWEVIMDSADFVNSSPRSAPPPHPTLSLL 297 Query: 175 -------MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +VLDVS SM F +I ++ L ++ G+V F+ Sbjct: 298 QTRDRVVCLVLDVSGSMRDF-----NRIHRLKQAAELFLLQIIETGSWA-----GIVVFN 347 Query: 228 NKIEEFFLLEWGVSHLQRKI--KYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +K L+ S R+ +L T+ G++ + + C Sbjct: 348 SKASTKAFLQQITSDSVRQTLSDHLPTVADGGTSICSGIREGFQVFLQKYSSTEGCE--- 404 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPN 343 IV +TDGE+ S + + G+ ++ I + + E + Sbjct: 405 ------IVLLTDGEDSSVSSCFAEVE------RSGSTIHTIALGPSAAKELEMLANMTGG 452 Query: 344 SFYLVENPHS---MYDAFSHIGK---DIVTKRIWYD 373 + + + DAFS I DI + I + Sbjct: 453 LTFSATDSLDSNGLIDAFSGISSGSGDISQQSIQLE 488 >gi|271964702|ref|YP_003338898.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] gi|270507877|gb|ACZ86155.1| von Willebrand factor type A [Streptosporangium roseum DSM 43021] Length = 514 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 63/199 (31%), Gaps = 30/199 (15%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLE-------EVKLIPDVNNVVQSGLVTFSN 228 VLD S SME +I+ +++ + NV+ + Sbjct: 329 FVLDTSGSMEG------ERIEALRQALVTLTGADTSASGTFSRFRSRENVIMIPFGGSAG 382 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L E +I+ + + T GL+ AY Q D R H + Sbjct: 383 LPQPFILPERDPQPALAQIRAYAERLRAAGGTAIYDGLRAAYGQAGDAG--RDHYTS--- 437 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRACA-SP 342 IV MTDGEN + Y + R + + + E R + Sbjct: 438 -----IVLMTDGENTDGSSYEDFEAYYRSLPEARRQVRTFVVLFGESDADEMERIATLTR 492 Query: 343 NSFYLVENPHSMYDAFSHI 361 + + S+ AF I Sbjct: 493 GAVFDAR-TGSLASAFKEI 510 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 82/223 (36%), Gaps = 29/223 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H+++ + N + L + + +D S SM K++ I + A++ Sbjct: 18 EVQENHLLLRFRTPANPNVEERKPLVIGLAIDKSWSM------KGEKMEAVIDASCALVN 71 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK-YLSKFGVSTNSTPGLKYA 266 + V+ +V +S ++ + + K + STN + G A Sbjct: 72 WLTRHDAVS------IVAYSADVQLIQPVTHLTEKVSVTDKIRNIQVATSTNLSGGWLSA 125 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAI 325 + + + K ++ +TDG S +D+++L + G I Sbjct: 126 LKSLNQSKIPNAY---------KRVLLLTDGNPTSGIKDKEALVTIAADHLSMGISTTTI 176 Query: 326 GIRVIRSHEFLR--ACASPNSFYLVENPHS----MYDAFSHIG 362 G+ + E L A A +FY ++NP + ++ F IG Sbjct: 177 GVGNDFNEEMLVEIAKAGGGNFYYIDNPENASDIFFEEFGDIG 219 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 78/201 (38%), Gaps = 38/201 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SM S TK+ +++ +LE++ G++ F ++I+ Sbjct: 271 VVFVIDMSGSM------SGTKMQQTREAMLKILEDLDPEDHF------GIILFDHRIQF- 317 Query: 234 FLLEWGVSHLQR----------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 W S + +K + +G + + P LK A + + + R+ Sbjct: 318 ----WNTSLSKATKENIDEAMVYVKAIQSYGGTDINAPVLK-AVDML---KEDRKAKRLP 369 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASP 342 + + +I+ +TDG+ S + + +A G ++++G + FL + Sbjct: 370 EKSID-MIILLTDGDPNSGESRIPVIQENVKAAIGGQMSLFSLGFGNDVKYPFLDVMSRE 428 Query: 343 NS-----FYLVENPHSMYDAF 358 N+ Y + F Sbjct: 429 NNGLARRIYEGSDAALQLQGF 449 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 90/251 (35%), Gaps = 28/251 (11%) Query: 92 MSFRNELRDNGFVND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP 146 +F ++ + N ++ + ++ + Y+ + + P++ T + Sbjct: 161 STFSIDVDAASYSNMRRFLNKGELPPTNAIRTEELINYFSYNYARPTGND-PVRITTEVG 219 Query: 147 WYTNSRH--IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V + ++ ++ +++ ++DVS SM ++D+ S+ Sbjct: 220 TCPWNEQHRLVRIGLKAKEIPTENLPASNLVFLIDVSGSMY-----GPERLDLVKSSLKL 274 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 ++ ++ V V+ SG + ++ I L G ST G+ Sbjct: 275 LVNNLREKDKVAIVIYSGAAG----EKLASTPGSDKQKIREAIDELEASG-STAGGEGIM 329 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY +I + N I+ TDG+ N+ D++ + +K G + Sbjct: 330 LAY-KIAQKNFISGGNNR--------IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLT 380 Query: 324 AIGIRVIRSHE 334 +G + + Sbjct: 381 VLGYGMGNYKD 391 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 78/227 (34%), Gaps = 28/227 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME------SFFDSS 191 IP + + + + +D++ VLDVS SM + + Sbjct: 44 TTPIFPEIPQGQARKDFQVLVRVEAPARPEARIPIDVVAVLDVSGSMNNPAAAPTERTRT 103 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQR-KIK 248 +++D+ + M+ +++ + +V FS++ E L + + +R I+ Sbjct: 104 TSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIR 157 Query: 249 YLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L + T P + A + QG IV +TDG ++ Sbjct: 158 SLDQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDGAEDASGSFT 210 Query: 307 QSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVE 349 S + R V+A G+ E L A S ++ V+ Sbjct: 211 LSERRREVIRGALRKYPVHAFGLGTAHGPEVLLYLAQESRGTYSFVD 257 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 61/173 (35%), Gaps = 25/173 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + ++ I+ + + ++ + V F+ Sbjct: 342 SVNVAFLIDGSSSVGD------SNFRLMLEFISNVAKSFEITDIGA---KIAAVQFTYDQ 392 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F ++ + ++ + T + + + +F +R N Sbjct: 393 RTEFSFTDYITKENVLAALRGIRYMSGGTATGEAISHTTRNVFGP--VRDGGNKN----- 445 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +V +TDG+ A++ G +Y+IGI + L+ AS Sbjct: 446 -FLVILTDGQ-----SYDDVRGPAVAAQQAGITIYSIGIAWAPLDD-LKDMAS 491 >gi|198429151|ref|XP_002122318.1| PREDICTED: similar to novel EGF domain containing protein [Ciona intestinalis] Length = 7911 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 83/210 (39%), Gaps = 25/210 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +A+LD++ VLD S S+ + + + +++ K+ PD VQ GL + Sbjct: 6992 KANAKLDLVFVLDSSGSV------QRVNFRLVLDFVKSVVSGFKIGPDG---VQVGLARY 7042 Query: 227 SNKIEEFFLL-EWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +++ ++ + L E+ L R I + ST + L+ +F R Sbjct: 7043 NSRWKKLWDLNEYSTKDRLLRAISGVDYISGSTRTGVALRSTALTMFGSSQGR------R 7096 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K + V +TDG++ + + + +++ A+GI+ + + + P Sbjct: 7097 KDVKAVTVVVTDGKSWD-----KVEEPALLLRNKSSVI-ALGIKNAVKAQLVSIASYPPE 7150 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 Y E + A + + D+ Sbjct: 7151 TYAQEVIS--FAALDKFVSTLFSIICGTDE 7178 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 83/205 (40%), Gaps = 25/205 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++++LD S S++S ++ + I ++ + K+ D + G+ T+S ++ Sbjct: 6765 IDLVVLLDTSSSIKS------KNFELIREFIANLINQFKIGKDG---LLVGVATYSRSVQ 6815 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + L R ++ + G T++ + I D++ + + Sbjct: 6816 NLWEMNKYSDKDSLLRAVRGIPYNGGGTSTGAAI----TNITDIKYTE--LAGRRKSAQA 6869 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 + + +TDG + +D Y + K +V A+G++ + P+S + + Sbjct: 6870 VTLVLTDG---VSSDDVSGPAYILQQKS---VVIALGVKGANLKQLNEIATEPDSIFAI- 6922 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 +D I + IV + D+ Sbjct: 6923 -MLQNFDELKGITRTIVEAMCYVDE 6946 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 67/201 (33%), Gaps = 49/201 (24%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q A D++ VLD S S++ + ++ + +++ + P VQ G+V + Sbjct: 7223 QKTAITDLVFVLDASGSVKRY------NFELVKDFVANIVDRFTIGPKA---VQVGMVRY 7273 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + G T + L+ F + Sbjct: 7274 T--------------------------GGWTYTAKALQGVARYQFTSS------AGSRKS 7301 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K I++ +TDG++ + S +R A V A+GI+ E + P+S Y Sbjct: 7302 VKPIVIVITDGKSNDQVRNAASTL------QRFATVIALGIKGADKSELGIIASDPDSKY 7355 Query: 347 LVE--NPHSMYDAFSHIGKDI 365 + + + I + Sbjct: 7356 SLNIRSFSELTRIIDMIISAM 7376 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 71/201 (35%), Gaps = 25/201 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D +D++ V+D S + + ++ + + V+ N VQ LV +S Sbjct: 2184 SDTAIDLVFVVDSSTYVRR---------QNFKEMLHLVSDIVRRFQIGPNDVQVALVRYS 2234 Query: 228 NKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F L +++ I ++ L+ ++I + + Sbjct: 2235 TSADIAFSLSAHSDETNVISSITAVTNNPGYRRIGQALQTVSSRILSAE------TGQRQ 2288 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 ++V + G D E K + ++ A+GIR + E L +S +F Sbjct: 2289 GVPAVVVVIATG-----HSDDLIKTASAELKSKATVI-ALGIRSVEG-EMLDEISSGQNF 2341 Query: 346 -YLVENPHSMYDAFSHIGKDI 365 + +++ + + I Sbjct: 2342 SIYENDYIQLHEVGGSVAQLI 2362 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 81/216 (37%), Gaps = 20/216 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ +LD+ I LDVS S++ + A I ++E++ N Sbjct: 241 KIQVHKGGKLDIYIALDVSDSIDE------EDFERAKDVIKTLIEKISYYEVSPNYEILL 294 Query: 223 LVTFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 T +++I + G + +K++ + S + AYN+I++ + Sbjct: 295 FATDTDRIISMREFKNGQGKDLLKIIQKLQDYAYDKKGQRSGTNIAQAYNKIYESMTIEL 354 Query: 279 HCNTEDANYKKIIVFM-TDGE-NLSTKEDQQSLYYCNEAKKRGA----IVYAIGIR---- 328 N ED + IV M TDG+ N+ + ++ +Y G+ Sbjct: 355 MTNKEDFKATQHIVIMFTDGQANMGGSPKPLVDKIKSLVRQNSVEEKLELYVFGLGNDVH 414 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ A+ F+ +++ + + F ++ + Sbjct: 415 AEDINDLKTDRANEKFFFKLKSLDDLKETFDNMIDE 450 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 73/217 (33%), Gaps = 35/217 (16%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 WY + ++P+ + + + M +V+D S SM + A K+ +L Sbjct: 450 WYRTTIERILPVRMDNERK-KDMPSVAMALVMDRSGSMT---GLPLEMAKAAAKATAGVL 505 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 LI ++ F + + S + +I + G T L Sbjct: 506 SSDDLIE---------VIAFDSAPTRYVKMQPARNRSRIAGEIARIQP-GGGTEIFSALD 555 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY + Q ++H ++ +TDG+ + D S V Sbjct: 556 AAYQDMTVTQARKKH-----------VILLTDGKASTGGIRDLVSAMIAES-----ITVT 599 Query: 324 AIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 +G+ + L+ A F+ V +P+++ F Sbjct: 600 TVGLGNDLDEQLLKMIADVGGGRFHAVPDPNNLPRIF 636 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 69/207 (33%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 353 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 406 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ + E F ++ + + Sbjct: 407 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRTEFSFTDYNTKENVLAVLANIRYMS 465 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F D+ K +V +TDG+ Sbjct: 466 GGTATGDAIAFTVRNVFGP--------IRDSPNKNFLVIVTDGQ-----SYDDVRGPAAA 512 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + LR AS Sbjct: 513 AHDAGITIFSVGVAWAPLDD-LRDMAS 538 >gi|17231851|ref|NP_488399.1| hypothetical protein alr4359 [Nostoc sp. PCC 7120] gi|17133495|dbj|BAB76058.1| alr4359 [Nostoc sp. PCC 7120] Length = 418 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 82/222 (36%), Gaps = 29/222 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I++ + +T L++ ++LD S SM + +++ ++ +++ + Sbjct: 21 SQRQLAISISAGAEPQDRT-VPLNLCLILDHSGSM------NGRPLEIVKQAAIRLVDRL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 K + +V F ++ + + ++++I L+ G T GL+ Sbjct: 74 KTGD------RLSVVAFDHRAKVLVPNQVIDNPEQIKKQISRLAADG-GTAIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +TDGEN ++ + L + A + +G Sbjct: 127 EELAKGKKETISQA----------FLLTDGENEH-GDNSRCLKFAQLAAGYSLTLNTLGF 175 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + + L A S ++ P D F + I T Sbjct: 176 GDNWNQDILEKIADAGLGSLSYIQKPEQAVDEFGRLFSRIQT 217 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 68/202 (33%), Gaps = 33/202 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ VLD S SM K+ ++ +L +++ N ++ FSN+I+ Sbjct: 296 VVFVLDSSASM------VGAKLRQTKDALFTILHDLRPQDHFN------IIGFSNRIKVW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V + I ++S G T+ L+ + D R + Sbjct: 344 KDHLVSVTPNNVRDGKVYIHHMSPSG-GTDINGALQTGIALLHDSV-ARHDLEDRSVSL- 400 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACA 340 IVF+TDG+ + + N ++ ++ +GI L C Sbjct: 401 --IVFLTDGKPT--VGETHTPKILNNTREAARGRVCIFTVGIGDDVDFRLLEKLSLENCG 456 Query: 341 SPNSFYLVENP-HSMYDAFSHI 361 + E+ + + I Sbjct: 457 FTRHVHEDEDAGAQLIGFYDEI 478 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 91/264 (34%), Gaps = 37/264 (14%) Query: 103 FVNDIDDIVR--STSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 FV ID + + P + S+ S + + + F P + Sbjct: 240 FVQSIDSVTEFCTAKTHNTEAPNLQNKMCSSRSTWDVIMDSDDFQNASPMKGTDPPPHPT 299 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + S+ + +VLD S SM S + + +++ A E+ LI + Sbjct: 300 FSLLKSKQRV---VCLVLDKSGSMSS--EDRLLRMNQAA--------ELYLIQIIEKGSL 346 Query: 221 SGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMR 277 G+VTF + L G + ++ L + T+ GL+ + I Sbjct: 347 VGMVTFESLATILNYLTDIIGDNAYEKITANLPREASGGTSICSGLRAGFQAIIHSNQST 406 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF- 335 I+ +TDGE+ C E K+ GA+++ I + + E Sbjct: 407 SGSE---------IILLTDGEDDQISS-------CFEEVKQSGAVIHTIALGPSAARELE 450 Query: 336 -LRACASPNSFYLVENPHSMYDAF 358 L FY ++ + + DAF Sbjct: 451 TLSNMTGGYRFYANKDINGLTDAF 474 >gi|317127857|ref|YP_004094139.1| von Willebrand factor A [Bacillus cellulosilyticus DSM 2522] gi|315472805|gb|ADU29408.1| von Willebrand factor type A [Bacillus cellulosilyticus DSM 2522] Length = 282 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 81/206 (39%), Gaps = 27/206 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + KV + +++ ++LD S SM S + ++ N ++ ++ ++ Sbjct: 31 TGAKVKHTERSPINLSLLLDRSGSM------SGEPLRYCKEACNFVINQLTDKDILS--- 81 Query: 220 QSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +V F +++E + L+ I+ + G++ S ++ + + Q ++ Sbjct: 82 ---VVVFDDQVETIIEPQKVTHKDLLKEYIQRIETRGITNLSGGLIQGCQHVL--KQEVK 136 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + N ++ ++DG+ N + + + ++ + G ++ +G+ E L Sbjct: 137 NYVNR--------VILLSDGQANAGITDKEALVKLADDYQSAGLVISTLGVSEHFDEELL 188 Query: 337 RACASP--NSFYLVENPHSMYDAFSH 360 A +F+ + ++ F Sbjct: 189 EGVADSGRGNFHFINEVENIPSIFEQ 214 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 73/213 (34%), Gaps = 37/213 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D+M+ +D+S SM + D + ++ +A + + P+ N GL F Sbjct: 83 KETEGIDIMLAMDISASMLTN-DVTPNRLIVAKNVASDFING---RPNDN----IGLTIF 134 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + L + L + + T GL A ++ Sbjct: 135 AGEAFTQCPLTIDHATLINLLNNVRADLVVKGLIQDGTAIGMGLANAVGRLKASNAKS-- 192 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE----- 334 KI++ +TDG N + AKK VY IG+ +S Sbjct: 193 ---------KIVILLTDGSNNVGSISPMTAATI--AKKFNIRVYTIGLGTEQSGNYNDID 241 Query: 335 --FLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A + FY ++ + ++ I K Sbjct: 242 YTTLKQIALTTNGEFYRAQSQTELLQIYNDINK 274 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 69/176 (39%), Gaps = 26/176 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D S +++D + + ++ ++ + + Sbjct: 77 GTKLETVKREGVDIVFAVDVSKSMLAE-DISPSRLDKSKRIVSEIINNLGSD-------R 128 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ + + S + + L+ T ++ A D++ Sbjct: 129 IGIIAYAGRAVPQLPITTDFSAAKMFLNNLNTNMLSSQGTAIDDAIRLAKTYYDDVEQTN 188 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++V ++DGE+ + Q EA K G Y IG+ + Sbjct: 189 -----------RVLVIISDGEDHTGGAGQ----LAEEATKEGIKTYTIGVGTTKGG 229 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 71/193 (36%), Gaps = 25/193 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++I+ D S + T + N ++ V+ +P + V+ GLV +S Sbjct: 31 PVLDIIILFDTSG-------GNDTVFEQQK---NWTIKIVRDLPIHEDAVRVGLVQYSES 80 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L + + ++ L+ V T + L A +IFD G Sbjct: 81 AKTEFNLSKYSERNDIIAHMETLTFMQVEDTRTGVALNKADEEIFDFNG------GARLK 134 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSF 345 ++I+ TDG ++ +++G +Y I + I E L ++ Sbjct: 135 ATRLIIIFTDGLSMDKPS-----KAAKALRRKGVKIYTISVNSIGFIPEMLGIVGDADNV 189 Query: 346 YLVENPHSMYDAF 358 + + + + Sbjct: 190 FGPNDEERIEERL 202 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 72/208 (34%), Gaps = 37/208 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++S +D++ V+D S S+ +D+ + ++++V+ + + G Sbjct: 721 TLSSAVQCPMDILFVVDSSGSITHTYDTQ-------KDYLTQLIKKVEP----SRSHRVG 769 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRK----IKYLSKFG--------VSTNSTPGLKYAYNQI 270 L+ F+ + + + I+ + +T L+ + Sbjct: 770 LIQFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGVVLHFQSGTTYIGAALEL---SL 826 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + R+H T ++ ++DG + Q L + +YAI + + Sbjct: 827 ILLDSRRKHTET-------TVILISDGFSQDDSTQQAKLLR----QLPNVKMYAISLNKL 875 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAF 358 + ++L + N + ++ F Sbjct: 876 TNTKYLTDIVGDRKNLFINNESTWFEEF 903 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 77/199 (38%), Gaps = 35/199 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM SS +++ ++ L + + G+VTF + + Sbjct: 307 VCLVLDKSGSM-----SSDNRLNRLNQAGKLFL-----LQTIEQGSWVGMVTFDSAAQVR 356 Query: 234 FLLEWGVSHLQRK--IKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S R+ K L T+ GL+ A++ I +++ Sbjct: 357 SELRQIKSGTDREALTKSLPTVPSGGTSICSGLRVAFSVI-----KKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C +E ++ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TIKVCFDEVRQSGAIIHTVALGPSAALELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIGKD 364 + + + DAF + + Sbjct: 460 SDQVQNNGLIDAFGALSSE 478 >gi|258620794|ref|ZP_05715829.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258586992|gb|EEW11706.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 419 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 81/223 (36%), Gaps = 21/223 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIM 66 +G I+ + P + ++L +++S F L + + + A+ + Sbjct: 1 MKSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKED 60 Query: 67 NEGNGNNRKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +E N + +KL ++ I + + + +G V + T + +++ Sbjct: 61 DENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHK 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 + I + + P +T + S +D+ ++D+S SM Sbjct: 121 SW---------ISYENISLKPEFT--------VNGSSVTRKFLPQPVDVYFIVDMSASMR 163 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + + + ++ID I ++ ++K + L+ + N Sbjct: 164 ATWQNGKSQIDEVKNVITRVVNDLKDFDTEVK-SRVALLGYHN 205 >gi|258624772|ref|ZP_05719703.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] gi|258582934|gb|EEW07752.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] Length = 419 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 81/223 (36%), Gaps = 21/223 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIM 66 +G I+ + P + ++L +++S F L + + + A+ + Sbjct: 1 MKSIKKQQGVAGIIYIMMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKED 60 Query: 67 NEGNGNNRKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +E N + +KL ++ I + + + +G V + T + +++ Sbjct: 61 DENNVSYARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHK 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 + I + + P +T + S +D+ ++D+S SM Sbjct: 121 SW---------ISYENISLKPEFT--------VNGSSVTRKFLPQPVDVYFIVDMSASMR 163 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + + + ++ID I ++ ++K + L+ + N Sbjct: 164 ATWQNGKSQIDEVKNVITRVVNDLKDFDTEVK-SRVALLGYHN 205 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 46/133 (34%), Gaps = 32/133 (24%) Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG---------ENLSTKEDQQSLYY--------- 311 + D + + + ED +K IV M+DG + + + L Sbjct: 532 LVDSRFLGRPIAWEDEETEKFIVVMSDGAVTDQYRPVDPFAPLNGETELQTQGSGSYTTF 591 Query: 312 -------------CNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDA 357 C A+ G V+A+ + + LR CAS +S + + DA Sbjct: 592 STRGNNLDNLHTQCQLARDLGVTVFAVAFETTDADADELRLCASSDSHFFHVQGTEIIDA 651 Query: 358 FSHIGKDIVTKRI 370 F I + I R+ Sbjct: 652 FDTIARQINNLRL 664 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 92/235 (39%), Gaps = 41/235 (17%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNN 73 + +G MTI + L +I ++ G +++ + ++ L + +DR+++ AA Sbjct: 55 DEEGTMTIFSTFMLVLILVITGASVDIMYQEAIRARLQATLDRAVLAAA----------- 103 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 L + ++ N +V + T D++ P +++A + Sbjct: 104 --------DLDQQQDPVA-------VVNDYVTKAGLVEHLT--DVIATPGLYDRTVAADA 146 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 + F W T +P+ ++ + A +++ +V+D+S SM Sbjct: 147 GLTLDTYFLRMSGWQT------LPVIAASTAE-ERIANVEISLVMDISGSMRWN-----N 194 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 +I A + + +V L D V L+ F+ ++ ++ ++ KI+ Sbjct: 195 RITNARNAAKDFVTKV-LTEDSAGVTTLNLIPFAGQVNPGDVMFDYFRGVRPKIQ 248 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 51/290 (17%), Positives = 93/290 (32%), Gaps = 46/290 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N KGG+ I I LP ++ ++ + + + K L +++ + + E N Sbjct: 38 FLENKKGGIIIPFFISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALA----LTAENN 93 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N + I I + +N VN + D + Q Y I Sbjct: 94 AKNDTR-NNELISAYINFYLGHRHQLTQYNNITVNYQQN------PDRLYHTQLSQYHID 146 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A F + + + N I + +D++ V D S SME F + Sbjct: 147 ANIEQPTLFPFTSLLIDHDN------FIIGGSAAAIKDVPAMDVVFVTDFSGSMEGDFHN 200 Query: 191 S-----ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 ++K+D + + I N + F WG Sbjct: 201 PDDPEVLSKLDELKR---IFFKIADDIYTANK----------DSTISFSPFSWGTKSADN 247 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 K L + N+I+ + NT +A+ +K ++ +T Sbjct: 248 KKCSLHF----------MPKEKNKIYPIPSNEIERNT-EAHQEKYMIAIT 286 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 50/320 (15%), Positives = 114/320 (35%), Gaps = 48/320 (15%) Query: 41 SHIFFMKTVLHSMIDRSLVH-------AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 + + K L + + + AA+QI N G R + + + ++ + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIAN--PGTARYQQFDDNPVKQVAQNPLAT 132 Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQN--EGYSISAISRYKIPLKF----- 141 F ++ + N + + ++ + + + I+ S K P+ Sbjct: 133 FSLDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNYFPSDWVINDKSNNKEPVPASKPIP 192 Query: 142 ------CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 PW + + I + S+ +++ ++D S SM S ++ Sbjct: 193 FAMRCELAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERL 246 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + S+ +++E++ ++ V +G ++I + + + I L G Sbjct: 247 PLIQSSLKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG- 301 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNE 314 STN GL+ AY Q ++ N I+ TDG+ +D +S+ + Sbjct: 302 STNGGAGLELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKK 352 Query: 315 AKKRGAIVYAIGIRVIRSHE 334 ++ G + G+ +E Sbjct: 353 QRESGVSLSTFGVGDSNYNE 372 >gi|108758240|ref|YP_629592.1| putative lipoprotein [Myxococcus xanthus DK 1622] gi|108462120|gb|ABF87305.1| putative lipoprotein [Myxococcus xanthus DK 1622] Length = 659 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 72/206 (34%), Gaps = 41/206 (19%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID--------------- 196 + + + S + + +++V+D S SM D ++D Sbjct: 39 DRLTLQGRVCTRPPSPSGFPVKVVVVIDESGSM-CVSDPPGAQLDNGFCQRREILDIIPE 97 Query: 197 -----MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-------VSHLQ 244 ++++ ++++ + + VQ + F + + +++ Sbjct: 98 GVTEPARVRALKRLVQQFREVNAQGGNVQVSVAPFETNVRNVWPPTTTGDRFARPDNNID 157 Query: 245 RKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IVFMTDG----- 297 I+ L S+ G T+ L YAY+ I N E + +VF+TDG Sbjct: 158 SYIEGLQSQLGKGTDYQGALSYAYSLISSDINAVAQSNPELLPRTRYVVVFLTDGTPYPR 217 Query: 298 ----ENLSTKEDQQS--LYYCNEAKK 317 +NLS D + L + + + Sbjct: 218 CSATDNLSVYADPDNPDLTWADSLRD 243 >gi|297663822|ref|XP_002810364.1| PREDICTED: integrin alpha-10-like [Pongo abelii] Length = 1177 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAILVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 66/176 (37%), Gaps = 26/176 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + +D++ +DVS+SM++ D + ++++ + + + +L + + Sbjct: 79 GTTMETVKREGVDIVFAIDVSKSMDAE-DIAPSRLEKSKQIVRQILGGLGSD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ + + + ++ L+ T + A D Sbjct: 131 VGIIAYAGSAFPQLPITTDYAAAKMFLQALNTDMISSQGTAIGDAIDLATTYYDDDNQTN 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++ ++DGE+ + +EA ++G +Y IG+ + Sbjct: 191 -----------RVLFIISDGEDH----EGNVESIADEAAEKGIRIYTIGVGTEKGG 231 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 47/322 (14%), Positives = 94/322 (29%), Gaps = 27/322 (8%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 S + + E N N + I ++ + N + I+ + Sbjct: 255 AKASAFPTSEAALAEHNRVNPQNKMVAVIPAEGTPSFEYTLINVSTNPAKAQGIEALRAF 314 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 + S +P + S +T+++ V Sbjct: 315 LRSEEAAKTLASFGLRSTAVPVTLPTPPGSIGEIKPGSTPNAAQVTAAIDVWQAATTSFQ 374 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V DVS SM + T++ + ++ L + + GL FS+ Sbjct: 375 LLSVFDVSGSMNEKV-GNTTRVRITQEAAGIALNALP------RSTKLGLWVFSSDKGGG 427 Query: 231 ---EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E L L ++ + S + ++D + Sbjct: 428 RDYKELVPL-----GLLSDDAQRARMAAAAASLSKEVDGWTGLYDTIWAAYSKVKASYDP 482 Query: 288 KKI--IVFMTDGENLSTKEDQQSLYYCNEAKK-----RGAIVYAIGIRVIRSHEFLRACA 340 +++ +V +TDG+N + K R + IGI E LR + Sbjct: 483 QRVNAVVILTDGKNEDPGGGLSLEQLLAKIKDATDPKRPIAITTIGIGPGVDAESLRKIS 542 Query: 341 SPNS--FYLVENPHSMYDAFSH 360 + +Y ENP M + Sbjct: 543 RSSYSDYYGAENPADMTTVLAK 564 >gi|332845803|ref|XP_510951.3| PREDICTED: integrin alpha-X [Pan troglodytes] Length = 1209 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +Q+F Sbjct: 189 SLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHQLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|329941682|ref|ZP_08290947.1| von Willebrand factor type A /toxic cation resistance protein [Streptomyces griseoaurantiacus M045] gi|329299399|gb|EGG43299.1| von Willebrand factor type A /toxic cation resistance protein [Streptomyces griseoaurantiacus M045] Length = 248 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 65/162 (40%), Gaps = 27/162 (16%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNN 217 SSV+ + D R + +VLD S SM ++ D ++ + + S++A L++ +P Sbjct: 33 ESSVRGHGLEDVRAAVYLVLDRSGSMRPYYRDGTMQHLAERVLSLSAHLDDDGTVP---- 88 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQ 274 +V FS ++ L G + I L + TN + + Sbjct: 89 -----VVFFSTDVDGCTDLTLGRH--RGHIDKLHENLGHMGRTNYHCAMDAVID------ 135 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 H ++ ++VF TDG S + ++ Y C A+ Sbjct: 136 ----HYLESGSDAPALVVFQTDGGPTSRQAAER--YLCKAAR 171 >gi|326779287|ref|ZP_08238552.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] gi|326659620|gb|EGE44466.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] Length = 575 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 37/244 (15%), Positives = 88/244 (36%), Gaps = 28/244 (11%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P+ G+S++ K ++ + + + + ++ + V+D+S Sbjct: 164 PKGSGFSVNVDGARIGAGKGGGGGTGASDWSLLRVGLATEAAPSTAERPPAALTFVVDIS 223 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--- 238 SM ++D+ KS+ + +E++ V+ LVTFS+ E + Sbjct: 224 GSMAETG-----RLDLVRKSLAVLTDELRDDDSVS------LVTFSDAAETRLPMTRLQG 272 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG- 297 + ++ ++ + STN G+ Y + ++G R+ +V ++D Sbjct: 273 NRNRIKDVVEEMRP-EQSTNVEAGITRGYEE--SVEGHRKGATNR-------VVLLSDAL 322 Query: 298 ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSM 354 N E L + ++ G ++ +G+ F+ + + V + Sbjct: 323 ANTGDTEADGILERIDSTRREYGITLFGVGVGSDYGDAFMERLTNKGDGNTTYVGDEAQA 382 Query: 355 YDAF 358 F Sbjct: 383 RKVF 386 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 73/190 (38%), Gaps = 21/190 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW + + +++ I ++ Q +++ ++D S SM S+ K+ + + + + Sbjct: 164 PWNRDHKLMMVGIQGK-ELQQQETPPSNLVFLIDSSGSMNSY-----DKLPLVQSAFSML 217 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 E++ ++ V +G S+ + + ++ ++ G STN G+K Sbjct: 218 AEQLDKNDRISIVTYAG----SSAVLLDGEKGSNTDEILEQLYSITASG-STNGEGGIKT 272 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY +H N ++ TDG+ N+ +++ + G + Sbjct: 273 AYEL------AEEHFIKGGNNR---VILATDGDLNVGASSEEELTRLIETKRDNGIYLSV 323 Query: 325 IGIRVIRSHE 334 +G + Sbjct: 324 LGFGEGNYKD 333 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 25/76 (32%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 L++R + +G +L AI I + I+ + +F + S +D + + + Sbjct: 8 LHMRRLMRDREGNFAVLGAIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVRAY 67 Query: 66 MNEGNGNNRKKLKGGD 81 Sbjct: 68 GEGATEEEAYDAAQKF 83 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 62/164 (37%), Gaps = 24/164 (14%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ IVLD+S SM + D A+ ++ A + + LV F ++ Sbjct: 92 DLAIVLDLSGSMVRDDFDLDGRQVTRRDAVATVGADFARRRGGD------RVALVVFGSE 145 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + V + R+I+ G +T+ + GL A ++ + + Sbjct: 146 AYFAAPFSFDVEAIARQIEGAQIGISGRATSISDGLGLALKRMENSEAAS---------- 195 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++++ ++DG N + + + A + G V+ I + Sbjct: 196 -RVVILLSDGVNNAGATNPRG--VAELAAQMGVRVHTIALGPKD 236 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 107/294 (36%), Gaps = 37/294 (12%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-------DID 108 R+ AT I N G R + + + ++ ++F ++ + N + Sbjct: 94 RAAKANATHIAN--PGTARYQQFDDNPVKQVAQNPLVTFSLDVDTGSYANVRRFLNQGLL 151 Query: 109 DIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT-----FIPWYTNSRHIVMPITSS 161 + ++ VV P + P+ F PW + + I + Sbjct: 152 PPPDAVRVEEVVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQRTLLKVDILAK 211 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 S+ +++ ++D S SM S ++ + S+ +++E++ ++ V + Sbjct: 212 -DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELREQDNIAIVTYA 265 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G ++I + + + I L G STN GL+ AY Q ++ N Sbjct: 266 G----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAGLEMAYQQA-AKGFIKGGIN 319 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ +D +S+ + ++ G + +G+ +E Sbjct: 320 R--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDSNYNE 365 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 52/316 (16%), Positives = 113/316 (35%), Gaps = 44/316 (13%) Query: 41 SHIFFMKTVLHSMIDRSLVH-------AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 + + K L + + + AA+QI N G R K + + ++ + Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIAN--PGTARYKQFDDNPVKQVAQNPLAT 132 Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP-LKFCTFI-- 145 F ++ + N + + ++ + + Y + IP K F Sbjct: 133 FSLDVDTGSYANVRRFLNHGLLPPPDAVRVEEIVNYFPYDWDIKDKQSIPATKPIPFAMR 192 Query: 146 ------PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 PW + + I + S+ +++ ++D S SM S ++ + Sbjct: 193 YELAPAPWNEQLTLLKIDILAK-DHKSEELPASNLVFLIDTSGSMISD-----ERLPLIQ 246 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 S+ +++E++ ++ V +G ++I + + + I L G STN Sbjct: 247 SSLKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNG 301 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKR 318 GL+ AY Q ++ N I+ TDG+ +D +S+ + ++ Sbjct: 302 GAGLELAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRES 352 Query: 319 GAIVYAIGIRVIRSHE 334 G + G+ +E Sbjct: 353 GVTLSTFGVGNSNYNE 368 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 74/222 (33%), Gaps = 51/222 (22%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+M+ +DVS SM + D +I+ A + + P+ N GL F+ + Sbjct: 88 IDIMLAVDVSTSMLAE-DLKPNRIEAAKQVAAEFING---RPNDN----IGLTIFAGEAF 139 Query: 232 EFFLLEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L L + +S T GL A +++ D + Sbjct: 140 TQCPLTVDHGVLLNLFQSVSCDMVQRGMIEDGTALGMGLANAVSRLKDSKAKS------- 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------- 329 K+++ +TDG N + D L AK+ G VY IG+ Sbjct: 193 ----KVVILLTDGVNN--RGDISPLTAAEIAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQ 246 Query: 330 -------IRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGK 363 I + + +++ + + + + I K Sbjct: 247 YVQMPVEIDEQTMSQIAGTTNGNYFRATSNTKLKEVYREIDK 288 >gi|28198673|ref|NP_778987.1| hypothetical protein PD0767 [Xylella fastidiosa Temecula1] gi|182681364|ref|YP_001829524.1| von Willebrand factor type A [Xylella fastidiosa M23] gi|28056764|gb|AAO28636.1| conserved hypothetical protein [Xylella fastidiosa Temecula1] gi|182631474|gb|ACB92250.1| von Willebrand factor type A [Xylella fastidiosa M23] gi|307579810|gb|ADN63779.1| von Willebrand factor type A [Xylella fastidiosa subsp. fastidiosa GB514] Length = 941 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 59/184 (32%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM+ + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQMPPSVAVYFAIDLSGSMDYVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIK-----YLSKFGVSTNSTPGLKYAY 267 V LV F + + L G++ L+ + Y + F T P A Sbjct: 200 VDIMLVGFGDAPDHRQTLLRRNCTAQGIAELKSWVATRQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 259 ---------------ASSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGITI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|324508230|gb|ADY43477.1| C-type lectin protein 160 [Ascaris suum] Length = 599 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 25/213 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + I +D+ V+D S+S+ + T +D + + ++ I Sbjct: 12 ALFIALKATQQECILGAVDLAFVIDGSQSVG--VTNFQTTLDAIVHA-----TQLTNIGP 64 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFD 272 + + LV F F L + ++ I+ N G+ A +IF+ Sbjct: 65 LEKDSRVALVVFDETAHIEFGLMNTTNADVEAAIRRSKYPNTNGVNIAAGMDAATRKIFN 124 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H + N K+++ +T + S + D ++ + A++ GAI AIGI Sbjct: 125 ------HAYPPNRNAPKLLIIIT---SNSDQSDVRAEN--SLAREVGAIPAAIGIHNGNG 173 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + A P Y + S+ D Sbjct: 174 IGASREQLEIATGDPKRVYTTTDYSSLDDILQK 206 >gi|191637338|ref|YP_001986504.1| hypothetical protein LCABL_05200 [Lactobacillus casei BL23] gi|190711640|emb|CAQ65646.1| Putative uncharacterized protein yvcC [Lactobacillus casei BL23] Length = 909 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 101/293 (34%), Gaps = 33/293 (11%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA L ++I + M T L + + AA N Sbjct: 1 MTKMTAKVARTGHLFAVLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES + + Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESKNNGGTDRAG 163 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGV 255 + L ++ + + V L+ FS+ G +++ + Sbjct: 164 AVRTGVKNFLTSIQNA-GLGDYVNVSLIGFSSPGYIGGGNKTTGPGYIRVGLGK----AG 218 Query: 256 STNSTPGLKYAYNQIFDMQ-----GMRQHCNTEDANY---KKIIVFMTDGENL 300 +T+ + A + F+ G+RQ +A+ KK+++ +TDG Sbjct: 219 NTSQQQAINSALSPTFNGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPT 271 >gi|47214620|emb|CAG01461.1| unnamed protein product [Tetraodon nigroviridis] Length = 2255 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 60/147 (40%), Gaps = 16/147 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 A+ D++ V+D S S+ S TK+ I ++ ++ +I +Q V Sbjct: 1228 AVCKGAKADLVFVIDGSWSIGE---DSFTKV---IHFVSGIIGAFDVIGPSG--MQVSFV 1279 Query: 225 TFSNKIEEFFLL-EWGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +S+ + F L + + + + G +T + LK+ Y + F ++ N Sbjct: 1280 QYSDNAKTEFKLNSYREKGVAMAALHLIRYQGGNTKTGMALKHTYEKAFSLE------NG 1333 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSL 309 N K++V +TDG + + + Sbjct: 1334 MRRNVPKMVVVITDGRSQDEVKKSATK 1360 >gi|148694797|gb|EDL26744.1| complement factor B, isoform CRA_g [Mus musculus] Length = 541 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 75/236 (31%), Gaps = 34/236 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ S++ +++ +VLD S S+ S + + + ++ + Sbjct: 32 VSCSLEGVEIKGGSMNIYLVLDGSDSIGSSNFTGAKRC---------LTNLIEKVASYGV 82 Query: 218 VVQSGLVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + GL+T++ E +W L + K TN+ L+ Y+ Sbjct: 83 RPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS- 141 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRG 319 + G + +I+ MTDG N+ ++ Sbjct: 142 MMSWAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 199 Query: 320 AIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 VY G+ + + + + + V++ + + F + + + + Sbjct: 200 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 255 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKEDVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|12957162|dbj|BAB32650.1| complement factor B/C2-A3 [Cyprinus carpio] Length = 754 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 77/221 (34%), Gaps = 27/221 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD------VN 216 K+ +LD+ I LDVS S++ K IK++ + ++ P+ Sbjct: 241 KIRVGKGGKLDIYIALDVSDSIDEE---DFEKAKGVIKTLIEKISYYEVSPNYEILIFAT 297 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +V + + + + + + +++K + + AY I + + Sbjct: 298 DVARI-----VSMRDFKSAQKNNLLEILKRLKDYEYNSKGDRTGTNIAQAYRSILESMQI 352 Query: 277 RQHCNTEDANYKKIIVFM-TDGE-NLSTKEDQQSLYYCNEAKKRG-------AIVYAIGI 327 Q N E+ + IV M TDG+ N+ + KK +Y G+ Sbjct: 353 EQMTNKEEFKTTQHIVIMFTDGQANMGGNPRPWVDQIKDLVKKNSPSEEEENLDLYVFGM 412 Query: 328 R----VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ + F+ ++N + + F + + Sbjct: 413 GDDVNAEDINDLKTDRGNEKFFFKLKNLEDLQETFDSMIDE 453 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 72/204 (35%), Gaps = 15/204 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +L ++++LD+S SM S KID + M+E V+ ++TF Sbjct: 12 EAKKLPVVLLLDISGSM------SGAKIDSLYDATIDMIETFSAAQAKEQVIDVAIITFG 65 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E + I S G+ T L+ A + I D T Y Sbjct: 66 THVELHTKYTPVKDLQAKGICKFSASGL-TPMGTALRMAKDMIEDKD------VTPSRIY 118 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFY 346 + +V ++DG + + N+ + +A+ I L R PN+ Sbjct: 119 RPAVVLVSDGAPNDDWKSPMDKFI-NDGRSAKCQRFAVAIGNDADRSILERFTQDPNAVL 177 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 E+ + + F I I T + Sbjct: 178 FAEDAKDISEQFKTISMSISTMAV 201 >gi|307180277|gb|EFN68310.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Camponotus floridanus] Length = 2214 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 67/216 (31%), Gaps = 34/216 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + +++++ ++D S S+ S +N + + + + Sbjct: 86 IDRLRNETNQVELVFLVDASDSIGS---------KNFRSELNFVTKLLSDFTVDETTTRV 136 Query: 222 GLVTFSNK---------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +VTF + I + L ++ ++ G T + L A + Sbjct: 137 AVVTFGGRGNVYRNIDQISRHGPNDHKCYLLNKQFGNITYSGGGTYTRGALLEALTIL-- 194 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K++ +TDG + + K GA V+ GIR Sbjct: 195 --------EKSREKANKVVFLITDGFSNG----GDPRPAADLLKNTGATVFTFGIRTGNV 242 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 E L ASP + S + F + + + + Sbjct: 243 EE-LHDIASPPGYTHSYLLDSFAE-FEALARRALHR 276 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 83/218 (38%), Gaps = 37/218 (16%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++ + A +D++ VLDVS SM S TK+ + +++ ++ + Sbjct: 345 EGNMVKDPGCRAPIDLVTVLDVSGSM------SGTKLALLKRAMAFVISNLSPED----- 393 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQG 275 + +V FS+ + F L+ QR + + TN GL+ + Sbjct: 394 -RLSVVVFSSTAKRVFSLKRMTPDGQRAANRVVERLLCTGGTNIAEGLRKGAKVL----E 448 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLST-KEDQQSLYYCNEAKKRG----------AIVYA 324 R+ N + I+ ++DG++ + L+ +E ++ V+A Sbjct: 449 DRRQRNPVAS-----IMLLSDGQDTYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHA 503 Query: 325 IGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 G V + A + S +F ++ + DAF+ Sbjct: 504 FGFGVDHDAATMHAISEVSGGTFSFIQAESLVQDAFAQ 541 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 17/149 (11%), Positives = 53/149 (35%), Gaps = 6/149 (4%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 S +++ +G + I+ + + ++ +G+ +++ ++ K+ L + D + AA Sbjct: 6 SRGMSLPRIHERQRGAVAIIVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSCALAAA 65 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV-RSTSLDIVVV 121 + G + L + F+ L + + + + + + Sbjct: 66 RDL----TGATPLTVSEAAGLTAAARNLVL-FQGNLEQQPNITSAESVTYSDSLANPFLD 120 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTN 150 + Y+++ I K + W+ Sbjct: 121 KNSVTYALNTIKYVKCDVSRGNIANWFAQ 149 >gi|312068806|ref|XP_003137386.1| CUTiclin-Like family member [Loa loa] gi|307767445|gb|EFO26679.1| CUTiclin-Like family member [Loa loa] Length = 691 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 45/136 (33%), Gaps = 16/136 (11%) Query: 221 SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+T+S + F + + + L +T++ L AY + D Sbjct: 4 LALITYSGQAYVHFKFNDPQIGNNTAVIGHLNALKSIKGTTSTHIALHQAYKLLMDTD-- 61 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSH 333 N KK+I+ TDG + + +D K ++AI + Sbjct: 62 --SGNGAREGVKKMIIIFTDGHSQQSPQDMALR-----LKNESVEIFAITLTPAPYADEG 114 Query: 334 EFLRACASPNSFYLVE 349 E L + + + + Sbjct: 115 ELLSITQNTDHIFTPD 130 >gi|194385372|dbj|BAG65063.1| unnamed protein product [Homo sapiens] Length = 1095 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 99 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 153 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 154 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 207 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 208 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 261 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 262 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 318 Query: 364 DIVT 367 I Sbjct: 319 RIFG 322 >gi|119591830|gb|EAW71424.1| integrin, alpha 10, isoform CRA_a [Homo sapiens] Length = 1177 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|114558438|ref|XP_514418.2| PREDICTED: integrin alpha-10 isoform 2 [Pan troglodytes] Length = 1167 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|3420888|gb|AAC31952.1| integrin subunit alpha 10 precursor [Homo sapiens] Length = 1167 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|38569398|ref|NP_003628.2| integrin alpha-10 precursor [Homo sapiens] gi|115502407|sp|O75578|ITA10_HUMAN RecName: Full=Integrin alpha-10; Flags: Precursor gi|6650628|gb|AAF21944.1|AF112345_1 integrin alpha 10 subunit [Homo sapiens] gi|119591831|gb|EAW71425.1| integrin, alpha 10, isoform CRA_b [Homo sapiens] gi|182887771|gb|AAI60008.1| Integrin, alpha 10 [synthetic construct] gi|189055316|dbj|BAG36921.1| unnamed protein product [Homo sapiens] Length = 1167 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|55670411|pdb|1V7P|C Chain C, Structure Of Ems16-Alpha2-I Domain Complex Length = 200 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 78/210 (37%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 7 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 55 Query: 232 EFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + + G TN+ ++YA + + + Sbjct: 56 VVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSA------ASGGRRSAT 109 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 110 KVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAI 166 Query: 340 AS---PNSFYLVENPHSMYDAFSHIGKDIV 366 AS F+ V + ++ + +G+ I Sbjct: 167 ASIPTERYFFNVSDEAALLEKAGTLGEQIF 196 >gi|322711218|gb|EFZ02792.1| U-box domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 734 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 72/216 (33%), Gaps = 29/216 (13%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVK 210 + D+++VLDVS SME + +T +D+ + ++E + Sbjct: 37 VPPRAPAAQIDHVPCDIVLVLDVSTSMEDNAPVPGETERTGLTVLDLTKHAALTIIETL- 95 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 N+ + G+V+F+ L +RKIK L G STN G++ Sbjct: 96 -----NDRDRLGIVSFATNSTIVQTLTHMDISNKDEARRKIKALDPNG-STNLWHGIRDG 149 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + ++ +TDG Q + + A ++ G Sbjct: 150 IQIFEQSAENGNI---------RAMMVLTDGMPNHMCPQQGYIPKLKTLPRLPAAIHTFG 200 Query: 327 IRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L++ A ++ + + + F H Sbjct: 201 FGYGLRSGLLKSLAEYGHGNYAFIPDAGMIGTVFVH 236 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 61/149 (40%), Gaps = 20/149 (13%) Query: 174 MMIVLDVSRSMESFF------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +VLD S SM + S+ ++I + A++++ + ++ G++ +S Sbjct: 199 ITLVLDESGSMANNMSGGRVTSSNPSRISILRNRAKALIDQFSGLGNI----YVGIIPYS 254 Query: 228 NKIE----EFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGL---KYAYNQIFDMQGMRQ 278 + + F+L G + ++ KI L+ G+ TN+ + YA Q D Sbjct: 255 DDAYISGTKSFVLANGTNVNTIKNKIDSLTAQGM-TNTGDAMRVSYYATKQFKDSPNSID 313 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + D ++ ++DG+ + Sbjct: 314 NTLPTDTKVIPYMILLSDGDPTVFSATTR 342 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 61/163 (37%), Gaps = 18/163 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 LD+M+V+D+S SM + + D + ++A+ + + + GLV F Sbjct: 81 QGRGLDIMLVVDLSESMAAM---DMRLADRTVTRLDAVADAAARFAANHPGDRIGLVAFG 137 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ + L + L+ G T GL A ++ D G Sbjct: 138 SRAYAVMPPSADRAALTGALARLAVGAAGKRTAMGDGLGLAVKRLSDAPG---------- 187 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ V DG + + + A +RG VY++G+ Sbjct: 188 -LSRLAVVFGDGRSNA--GEVSPEDAAKAASERGVTVYSVGVG 227 >gi|216264497|ref|ZP_03436489.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221217546|ref|ZP_03589016.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224532807|ref|ZP_03673422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224534090|ref|ZP_03674673.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225548563|ref|ZP_03769611.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] gi|225549807|ref|ZP_03770771.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|215980970|gb|EEC21777.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221192609|gb|EEE18826.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224512196|gb|EEF82582.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224512789|gb|EEF83157.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225369615|gb|EEG99064.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|225370826|gb|EEH00261.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] Length = 333 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + + N A+ +Y+IGI S SF Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|301133566|gb|ADK63405.1| C3HC4 type zinc finger protein [Brassica rapa] Length = 677 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 79/220 (35%), Gaps = 35/220 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + +S + A +D++ VLDVS SM TK+ + +++ +++ + Sbjct: 227 INLKAPPVSSSSSSSRAPVDLVTVLDVSGSMAG------TKLALLKRAMGFVIQNLGPFD 280 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +++FS+ F L G + + G TN GL Sbjct: 281 ------RLSVISFSSTSRRSFPLRLMTETGKQEALQAVNSFVSNG-GTNIAEGLTKGAKV 333 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLS-----TKEDQQSLYYCNEAKK--RGAIV 322 + D + IV ++DG++ T + + Y K V Sbjct: 334 LID------RRFKNSVSS---IVLLSDGQDTYTMTSPTGSNTKGADYKTLLPKEVNRIPV 384 Query: 323 YAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 +A G + + A S +F +E+ + DAF+ Sbjct: 385 HAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDAFAQ 424 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 55.2 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 70/175 (40%), Gaps = 30/175 (17%) Query: 141 FCTFIPWYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-------SFFDSSI 192 F T TN +++ + + + + LD++ VLDVS SM + Sbjct: 38 FPTIPRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQG 97 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL-----LEWGVSHLQRK 246 +++D+ S+ ++ ++ + +V F++ ++E+ G S +K Sbjct: 98 SRLDVLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 151 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQG-MRQHCNTEDANYKKIIVFMTDGENL 300 I L G T P L+ A + + QG R H I+ +TDG++ Sbjct: 152 IDRLQARG-GTALMPALEEAVKILDERQGSSRNHVG--------FILLLTDGDDT 197 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 76/205 (37%), Gaps = 25/205 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDS----SITKIDMAIKSINAMLEEVKLIPDVNNV 218 K+ + +D+++ LD S SM ++ + +++++ + + + Sbjct: 75 KIKAVKSNAVDIVLALDTSDSMSTYGFNEKKYKQSRLNVVKEVVQNFINSRVKD------ 128 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + GLV F L + + + ++ + ++ + D Sbjct: 129 -RIGLVVFGTTAGIASPLSFDKEAQKNIVGNINVGVLGKSTA---------LIDGIVSSI 178 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + KII+ ++DGE+ ++K + AKK +Y I I S + ++ Sbjct: 179 QLLKNSKSKSKIIILLSDGEDSASKIPLEFAL--KLAKKYNIKIYTITIDKSYS-DMMKV 235 Query: 339 CASPN--SFYLVENPHSMYDAFSHI 361 A+ N + V+N + + I Sbjct: 236 IANKNGAKNFEVQNKEDLVKVYKSI 260 >gi|73967405|ref|XP_537778.2| PREDICTED: similar to integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Canis familiaris] Length = 1178 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 80/202 (39%), Gaps = 26/202 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I+LD S S++ A I+ M+ + LV + I+ Sbjct: 207 IAIILDGSGSIDP------PDFQRAKDFISNMMRN---FYEKCFECSFALVQYGEVIQTE 257 Query: 234 FLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L + K++ +++ G T + +++ + IF + N K++ Sbjct: 258 FDLRDSQDVMTSLAKVQNITQVGNVTKTASAMQHVLDNIFTPN------HGSRKNASKVM 311 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACAS--PNSF 345 V +TDG+ + ++ N K +G +AIG+ +++ L+ AS + + Sbjct: 312 VVLTDGDIFGDPLNLTTVI--NSPKMQGVERFAIGVGNAFEKNKTYHELKLIASDPDDRY 369 Query: 346 -YLVENPHSMYDAFSHIGKDIV 366 + V N ++ S + + I+ Sbjct: 370 AFKVTNYTALDGLLSKLQQTII 391 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/206 (19%), Positives = 76/206 (36%), Gaps = 15/206 (7%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T L ++++LDVS SM S KI+ + N M++ ++ ++ Sbjct: 9 KPATAKHLPVVLLLDVSGSM------SGEKIENLYDATNEMIKVFSDAVSKEKIIDIAII 62 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TF +E V R + G+ T L+ A + I D + T Sbjct: 63 TFGENVELHTPYTSVVDFKSRGLNPFLASGM-TPLGTALRMAKDMIEDKE------TTPS 115 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PN 343 Y+ +V ++DG + N + +A+ I ++ L++ A Sbjct: 116 NIYRPAVVLVSDGVPTDEWRGPLD-NFKNNGRSSKCQRFAVAIGNDADNQMLKSFAECNE 174 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKR 369 +F++ EN + D F I + K Sbjct: 175 NFFIAENVSDIVDKFKQISMSVSVKA 200 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 51/158 (32%), Gaps = 24/158 (15%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN---QIFDMQGMRQHCNTEDANYK 288 L ++L+ +K L+ G + + Y + + G + Sbjct: 157 PVTPLTGDFAYLKSVVKNLTSEGSTRLDAGVVAGWYTLSPKWQGVWGDETSPAEVSDSVH 216 Query: 289 KIIVFMTDGENLSTKEDQQSLYY---------------------CNEAKKRGAIVYAIGI 327 K++VFMTDGE + + + C KK G +Y + Sbjct: 217 KVMVFMTDGEMNTKYDPNDKFDWICSQTQSSACNAFATAAMQTACTAMKKSGIEIYTLSY 276 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 +R CA+ + + +P ++ + I I Sbjct: 277 SADADVVNIRNCATNTAHFFTASPATIKTVYETIAAAI 314 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/305 (15%), Positives = 106/305 (34%), Gaps = 37/305 (12%) Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N + ++ R+ N+++ + +L DI + + +N + S Sbjct: 151 NWPQDDGHMAVIRRLSNSYSGNLLEQLPVFRTPEADIFNDDEHIDVHSETAEENNEVTGS 210 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + ++ ++ + + A LD++ VLDVS SM Sbjct: 211 VEIKTYAEVQAIQQSVTQKVFSILIHLKAPKSLESVSSRAPLDLVTVLDVSGSM------ 264 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKY 249 K+ + K++ +++ + + ++ FS+ F L V+ + I+ Sbjct: 265 KGAKLALLKKAMCFVIQTLGPND------RLSVIAFSSTARRLFPLRQMNVNGRMQAIQA 318 Query: 250 LSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQ 306 ++ G TN + GLK I R+ N + I+ ++DG++ S Sbjct: 319 VNSLVDGGGTNISDGLKKGAKVI----EHRRLKNPVCS-----IILLSDGQDTYSVPTFD 369 Query: 307 QSLYYCNEA---------KKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMY 355 L + A ++ G + + A A S +F ++ S+ Sbjct: 370 DELQTNHSALVPPSILPGTGNHVQIHTFGFGMDHDSAAMHAIAETSSGTFSFIDAEGSIQ 429 Query: 356 DAFSH 360 + F+ Sbjct: 430 NGFAQ 434 >gi|187919333|ref|YP_001888364.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] gi|187717771|gb|ACD18994.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] Length = 451 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 46/299 (15%), Positives = 96/299 (32%), Gaps = 47/299 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F +G ++I A+ L + +LG+ ++ + +K L + D +++ A + N Sbjct: 11 FRRRQRGSVSIFVAVSLIALLGILGLAVDSGFGYMIKARLDAATDGAVIAAGEAVTRGNN 70 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + F ++ +V +I Sbjct: 71 QTEQTNNAQQAATAFFAANYPAGFLGS--------------SVSAGTPSIVFNAGTVTIG 116 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 ++ +P+ F + + + +S + LDM+ V+D ++S+ + Sbjct: 117 MTAQASVPVTFSKVL----GFNVLNVSSSSQAIRKT-----LDMVFVIDNTKSLNTSGVP 167 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK---- 246 + A L + D + L+ F+ + R Sbjct: 168 -----AAVRSNAVAFLNNFDVTND-----RVALMHFAYGTVVDVPFKGNTRGFDRATMTT 217 Query: 247 -IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 I + F STNS L A NQ+ + + ++IVF +DG S Sbjct: 218 DINKYT-FDGSTNSPEALWNARNQLNTV--------ITQPSSLRVIVFFSDGAPNSFSS 267 >gi|48425689|pdb|1SHU|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 182 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 70/202 (34%), Gaps = 27/202 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 6 AFDLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQA 53 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ L + T GLK A QI G++ Sbjct: 54 TIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSS-------- 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 II+ +TDG+ ++ GA VY +G+ + R S + V Sbjct: 106 -IIIALTDGKLDGLVPSYAEKE-AKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPV 163 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 + + A I I+ + Sbjct: 164 KGG---FQALKGIINSILAQSC 182 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 54/341 (15%), Positives = 107/341 (31%), Gaps = 52/341 (15%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 MK +L ++ + L + + + R K K + L + + Sbjct: 170 MKRLLRNIKNGVLKECCKLLPSSLEYHARFKSKANSKSFCEAKAKSAGNVRVLGEGIYSA 229 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 ++ ++ L EG PLK C + S+ K Sbjct: 230 MSRNLKQNQRLKWQFFGSKEGL---CTIYPAAPLKECHAYDNRLRPWYTSAAYPSTKK-- 284 Query: 166 SQTDARLDMMIVLDVSRSMESFFD---SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++IVLD S SM S + T++D+A +++ +L + + G Sbjct: 285 --------LVIVLDTSSSMASRVELGTKRRTRLDVAKAALSTILSTLLPQD------KVG 330 Query: 223 LVTFSNKIEEFFLL--------------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 +V F++K+ + V++L+ I G T K A+ Sbjct: 331 VVLFNSKVTLAGSSGVDECYSTRLAPAGRFNVNYLKDFINRSRP-GGGTQYQNAFKAAFT 389 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-----RGAIVY 323 + + + ++F+TDG + + K+ + Sbjct: 390 LLKSAKSGDGGGE------QSFLLFLTDGGPKDDALEVE-RLIAQNKKEMEESRERVTIM 442 Query: 324 AIGIRVIRS-HEFLRACAS--PNSFYLVENPHSMYDAFSHI 361 IG+ +FL + + + V+N MY A Sbjct: 443 TIGLGKDEHMKDFLGRLSKNVGSKYSQVDNEAHMYSAIHDY 483 >gi|126282074|ref|XP_001368568.1| PREDICTED: similar to Coch-5B2 gene product [Monodelphis domestica] Length = 549 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 67/205 (32%), Gaps = 27/205 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + ++ ++ + + ++ + V F+ Sbjct: 364 SVNIAFLIDGSSSVGE------SNFRLMLEFVSNIAKIFEISDIGA---KIAAVQFTYDQ 414 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + I+ + T + + + +F +R N Sbjct: 415 RPEFSFTDYTTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGP--IRDGPNKN----- 467 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +V +TDG+ A K G VY+IG+ + L+ AS Sbjct: 468 -FLVIVTDGQ-----SYDDVRAPAAAAHKAGITVYSIGVAWAPLDD-LKDMASEPKETHA 520 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYD 373 + I DI+ + I D Sbjct: 521 FFTRE-FSGLEQIATDII-RGICRD 543 >gi|255550407|ref|XP_002516254.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] gi|223544740|gb|EEF46256.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] Length = 755 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 80/202 (39%), Gaps = 46/202 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D+S SME ++ +++ L ++ N ++ F+ + F Sbjct: 327 IVFIVDISGSMEG------KPLEGMKNAMSGALAKLNPKDSFN------IIAFNGETYLF 374 Query: 234 FLLE--WGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L ++R +++++ G TN + L A + + QG Sbjct: 375 SSLMELATEKTVERAVEWMNLNFIAGGGTNISVPLNQAMEMVSNTQGSLP---------- 424 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKK----RGAI---VYAIGIRVIRSHEFLRACAS 341 +I +TDG + + C+ KK +GAI +Y GI +H FLR A+ Sbjct: 425 -VIFLVTDG------AVEDERHICDSMKKYVRGKGAICPRIYTFGIGTYCNHYFLRMLAT 477 Query: 342 P-----NSFYLVENPHSMYDAF 358 ++ Y V++ + + F Sbjct: 478 VCRGQYDAAYDVDSVQARMEIF 499 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 60/149 (40%), Gaps = 24/149 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF---- 226 +D+++V+D S SM + +I K ++ L +++ + + V G V + Sbjct: 335 PIDVVLVVDWSGSMNE-----MGRITEVKKGVDRFLNQIE-GSGIQDSVYMGYVGYSSDG 388 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +N + L S ++ I+ ++ T + GL+ A + + G Sbjct: 389 NNYQNKTCQLG-KFSEVKETIRTMTPETAAGGTFTQRGLRQAGDMLSTQNG--------- 438 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 439 --HKKVIVLLTDGVPTYSYHVSKVHTQAD 465 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 70/176 (39%), Gaps = 20/176 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW +++ + + I + + L+++ ++D S SMES K+ + +S M Sbjct: 229 PWNADTQLVHIGIQGEM-PAIEDRPPLNLVFLIDTSGSMES-----ADKLPLLRQSFRLM 282 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 L+ ++ +V V +G + + + + + + + L+ G STN GL+ Sbjct: 283 LDNLRPEDEVAIVTYAGSTSIALEPTQASE----RATIIAALNALNA-GGSTNGQGGLEQ 337 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 AY + + ++ TDG+ D + L E K+ A Sbjct: 338 AY--------ALAETMKTAGDVSR-VILATDGDFNVGLSDPRGLQAYIEDKRDDAQ 384 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 109/325 (33%), Gaps = 43/325 (13%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + N + G + ++ +G + Sbjct: 47 QQPATTPATTPQSASPTQPNSTSPTSPTGSTAAPQSSTPTLQPRSDQDAQDGTTFTLHRS 106 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 V L V+ ++ + + + + +H+ ++ QT+ Sbjct: 107 VNEVDLIFTVMGKDGHFV------SNLQQQNFGLLDDGRPPQHV-------LRFAQQTNL 153 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + I+LD S S+ + + ++ L ++++ + G F +I Sbjct: 154 PLRVGIMLDTSSSIRQ-------RFEFEQQAATDFL--LQVLHPADRAFVEG---FDVQI 201 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L I+ L G + + +Q+ +Q +DA +K Sbjct: 202 NIAQDFTNRIDMLDTGIRRLRPGGGTALFDSLYRTCKDQMLTLQ--------QDAEVRKA 253 Query: 291 IVFMTDGENLSTKE-DQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRAC--ASPN 343 IV ++DG++ ++ + +++ C A+ IVY I V + + L+ A+ Sbjct: 254 IVLVSDGDDDYSRVLETEAIKMCQRAET---IVYTISTNVGPSRDKGDDVLQQISDATGG 310 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + + + F +I +++ ++ Sbjct: 311 QSFYPQRIDDVAIGFRNIEEELRSQ 335 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 75/211 (35%), Gaps = 33/211 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q+ D+++ +D+S+SM+ +T A+ ++ + ++ + G++ F Sbjct: 82 QSFPSRDLLLAVDISQSMQ---IKDMTINGEAVDRLSMVKSYLQSFIKQRQGDRIGIILF 138 Query: 227 SN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + F +W + L + G T + A + H Sbjct: 139 ADHAYLMVPFTQDWQAAGLLLDEVNIGLAGKFTAIGEAITLAVKKTL-------HEPKPI 191 Query: 285 ANYKKIIVFMTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 N K ++ ++DG+ N D +L AK G +Y IGI + Sbjct: 192 QN--KTLILLSDGKDSINTIQPTDAAAL-----AKASGLKIYTIGIGSDSTDAEAESDLD 244 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L A + ++ + + + + I Sbjct: 245 ETTLEEIANMTGGQYFRARSEQDLSEIYQQI 275 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 58/135 (42%), Gaps = 26/135 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ VLDVS SM TK+++ +++ +++ + + +V+FS+ Sbjct: 176 LDLVTVLDVSGSM------VGTKLELLKQAMGFVIDNLGPRD------RLCVVSFSSGAN 223 Query: 232 EFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L G S +R ++ L G TN L+ A I R H N + Sbjct: 224 RLMRLARMSDAGKSLARRAVQSL-AAGGGTNIGEALRRAAKVI----DERMHRNAVAS-- 276 Query: 288 KKIIVFMTDGENLST 302 +V ++DG++ T Sbjct: 277 ---VVLLSDGQDTYT 288 >gi|113476849|ref|YP_722910.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167897|gb|ABG52437.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 441 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 23/192 (11%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 S + V+D S SM + KID+ ++++ ++ +++ P + LV Sbjct: 35 SASRPSTTFTFVIDTSGSMYDDSEVGRPKIDIVVEALERLVTDIQADPRD----RIALVQ 90 Query: 226 FSNKIEEFFLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F + L + Q I L F T G++ + N + D Sbjct: 91 FDDSASVLLPLTAATDTVTLQNAISKLRSFSGGTRMALGIEKSLNLLKDSV--------- 141 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 + ++ ++F TDG+ T ++ + + G + A+G+ + L + + Sbjct: 142 -LSSRRTLIF-TDGQ---TIDEIDCRELAVQFAQAGIPITALGVGDYN-EDLLVYLSDHT 195 Query: 342 PNSFYLVENPHS 353 + V S Sbjct: 196 GGRVFNVVEQAS 207 >gi|260800517|ref|XP_002595175.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] gi|229280419|gb|EEN51187.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] Length = 286 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 64/170 (37%), Gaps = 22/170 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D + S+ + ++ I+ + + L N V+ +V ++ + Sbjct: 1 DIVFIVDGTGSVGLE---NFERMKTFIRQLFSYL------DIGENAVRVSIVQYAAQART 51 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + + Q + + G T + + +A N FD+ A+ KI Sbjct: 52 EFFLDQYYDLQEAQDAVDDIEYMGGYTLTGKAIDFATNLHFDL------RKGARADVTKI 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 V +TDG ++ G + A+G+ + L A A Sbjct: 106 AVVITDGR-----SYDDVNRPARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ARL + +VLD S SM + IT+ + ++ ++ + +VTF Sbjct: 15 NPTARLPVCLVLDTSASMT---GAPITE---LQEGVSTFFAQLLADDVAEYSAEVAVVTF 68 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + ++ + L+ G T ++ A + + ++ Sbjct: 69 GGNVDMAVDFA---AVTRQTVPSLTA-GGMTPMGEAVETALELLHTRKE--EYKRAGVDY 122 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 Y+ +V MTDG + S + +++ V+AIGI Sbjct: 123 YQPWLVIMTDGAPTDNIS-KASRLVDDLVREKKLAVFAIGIG 163 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 78/211 (36%), Gaps = 34/211 (16%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 +T D + D+ I+ D S S+E F + + + +++ +P Sbjct: 357 SVTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQTQL-----------QIANKLQQMPIRP 405 Query: 217 NVVQSGLVTFS----NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N+ + +V F+ ++ F+ L++ I+ + +T + LK Sbjct: 406 NLTRVAIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRM------ 459 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ N K ++ TDG + + K++G VY +GI ++ Sbjct: 460 ---ALLFEASKRDNCKMKLLVFTDG-----YSAEDTAEGIEALKRQGITVYTVGISTDKN 511 Query: 333 HEF----LRACA-SPNSFYLVENPHSMYDAF 358 L+ A SP+ ++ + ++ F Sbjct: 512 AGLNVSELKGMATSPSHYFDSSDFDNLLKHF 542 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 67/179 (37%), Gaps = 26/179 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S ++ ++ +I LD+S SM + D + ++++ + + ++ Sbjct: 76 PQIGSKLATNKHREGIETIIALDISNSMLAE-DVAPSRLEKSKLLVENLMSRFSED---- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GL+ F+ + + + + ++ T+ L+ A N Sbjct: 131 ---KIGLIVFAGEAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLAANSFT-- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ K I+ +TDGE+ + + +A+ +G V+ +GI + Sbjct: 186 ---------PNSKAGKAIILITDGEDN----EGGAEAMAKQARSKGIKVFILGIGSRQG 231 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/272 (13%), Positives = 87/272 (31%), Gaps = 22/272 (8%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I+ F + G ++ + I ++ + +E + I K L ++++++ + + + Sbjct: 9 IKRFIQDESGVYAVIGGLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSAENNS 68 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI-----VVVP 122 N+ K + R+ FV + + Sbjct: 69 GRKSNDYKLSNTDAENGHFNPNSKIGERDLEISKSFVTTYLPQTDPNKIKLQPVCTTTDK 128 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA--------RLDM 174 +N ++ W+ I + V + S + A +D+ Sbjct: 129 KNRQGHTASTETICTVAGTIEHKSWFPLKVGSTEVIPTEVNIASNSKAIKKNTISIPIDL 188 Query: 175 MIVLDVSRSMESFFDSS------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 M+ D+S SM ++ +KID+ + + + N+ + + F+ Sbjct: 189 MVAADLSGSMRYDLENRYEPKDGTSKIDILKAVLTELSSNSLFSQESNDNNRIAVSPFAL 248 Query: 229 KIEEFF---LLEWGVSHLQRKIKYLSKFGVST 257 E L + + + R I Y G+ T Sbjct: 249 GAEYSTTECTLPFALKNNNRTINYTKSLGIPT 280 >gi|260800519|ref|XP_002595176.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] gi|229280420|gb|EEN51188.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] Length = 153 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 22/170 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D + S+ + + I + + + + V+ +V ++ + Sbjct: 1 DIVFIVDGTGSVG------LENFERMKTFIGQLFAYLDIGENA---VRVSIVQYAAQART 51 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + + Q + + G T + + +A N FD+ A+ KI Sbjct: 52 EFFLDQYYDLQEAQDAVDGIEYMGGYTLTGKAIDFATNLHFDL------RKGARADVTKI 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 V +TDG ++ G + A+G+ + L A A Sbjct: 106 AVVITDGR-----SYDDVNRPARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 313 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 366 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 367 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 423 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 424 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 470 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 471 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 498 >gi|297735865|emb|CBI18619.3| unnamed protein product [Vitis vinifera] Length = 656 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 58/154 (37%), Gaps = 26/154 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + T+S +N A +D++ VLDV M K+ M +++ ++ + Sbjct: 267 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTG------AKLQMMKRAMRLVISSLSSTD- 319 Query: 215 VNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + L G +R I+ L G T++ LK A + Sbjct: 320 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIA-GQGTSAGEALKKASKVL 373 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 R+ N + I+ ++DG+N Sbjct: 374 ----EDRRERNPVAS-----IMLLSDGQNERVSS 398 >gi|225468694|ref|XP_002269894.1| PREDICTED: hypothetical protein, partial [Vitis vinifera] Length = 585 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 58/154 (37%), Gaps = 26/154 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + T+S +N A +D++ VLDV M K+ M +++ ++ + Sbjct: 168 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTG------AKLQMMKRAMRLVISSLSSTD- 220 Query: 215 VNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + L G +R I+ L G T++ LK A + Sbjct: 221 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIA-GQGTSAGEALKKASKVL 274 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 R+ N + I+ ++DG+N Sbjct: 275 ----EDRRERNPVAS-----IMLLSDGQNERVSS 299 >gi|225438159|ref|XP_002262605.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 830 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 58/154 (37%), Gaps = 26/154 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + T+S +N A +D++ VLDV M K+ M +++ ++ + Sbjct: 413 TINTTTSSLLNPARRAPIDLVTVLDVGGGMTG------AKLQMMKRAMRLVISSLSSTD- 465 Query: 215 VNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + L G +R I+ L G T++ LK A + Sbjct: 466 -----RLSIVAFSASSKRLMPLKRMTTTGRRSARRIIESLIA-GQGTSAGEALKKASKVL 519 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 R+ N + I+ ++DG+N Sbjct: 520 ----EDRRERNPVAS-----IMLLSDGQNERVSS 544 >gi|125532269|gb|EAY78834.1| hypothetical protein OsI_33939 [Oryza sativa Indica Group] gi|125575070|gb|EAZ16354.1| hypothetical protein OsJ_31816 [Oryza sativa Japonica Group] Length = 654 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 95/259 (36%), Gaps = 43/259 (16%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCR------IKNTWNMSFRNELRDNGFVNDID------- 108 T +N ++R + + ++ R + W + L F +D Sbjct: 62 GTARVNPHPLDDRHQHQRMAVVRRLSRGDSVTRQWQLPIFRTLDGGIFDDDEQLDLHPAE 121 Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN-SRHIVMPITSSVKVNSQ 167 D+V + +D +V + S+ + P + + H+ P S Sbjct: 122 DVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAP---SSPATVT 178 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + A +D++ VLDVS SM TK+ + ++++ +++ + + +VTFS Sbjct: 179 SRAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD------RLSVVTFS 226 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G +++ L G TN L+ A + D + C+ Sbjct: 227 SSARRLFPLRKMTESGRQRALQRVSSLVADG-GTNIADALRKAARVMEDRRERNPVCS-- 283 Query: 284 DANYKKIIVFMTDGENLST 302 IV ++DG + T Sbjct: 284 -------IVLLSDGRDTYT 295 >gi|115482404|ref|NP_001064795.1| Os10g0464500 [Oryza sativa Japonica Group] gi|110289213|gb|AAP54178.2| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113639404|dbj|BAF26709.1| Os10g0464500 [Oryza sativa Japonica Group] Length = 719 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 95/259 (36%), Gaps = 43/259 (16%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCR------IKNTWNMSFRNELRDNGFVNDID------- 108 T +N ++R + + ++ R + W + L F +D Sbjct: 139 GTARVNPHPLDDRHQHQRMAVVRRLSRGDSVTRQWQLPIFRTLDGGIFDDDEQLDLHPAE 198 Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN-SRHIVMPITSSVKVNSQ 167 D+V + +D +V + S+ + P + + H+ P S Sbjct: 199 DVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAP---SSPATVT 255 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + A +D++ VLDVS SM TK+ + ++++ +++ + + +VTFS Sbjct: 256 SRAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD------RLSVVTFS 303 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G +++ L G TN L+ A + D + C+ Sbjct: 304 SSARRLFPLRKMTESGRQRALQRVSSLVADG-GTNIADALRKAARVMEDRRERNPVCS-- 360 Query: 284 DANYKKIIVFMTDGENLST 302 IV ++DG + T Sbjct: 361 -------IVLLSDGRDTYT 372 >gi|22758319|gb|AAN05523.1| unknown protein [Oryza sativa Japonica Group] Length = 731 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 95/259 (36%), Gaps = 43/259 (16%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCR------IKNTWNMSFRNELRDNGFVNDID------- 108 T +N ++R + + ++ R + W + L F +D Sbjct: 139 GTARVNPHPLDDRHQHQRMAVVRRLSRGDSVTRQWQLPIFRTLDGGIFDDDEQLDLHPAE 198 Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN-SRHIVMPITSSVKVNSQ 167 D+V + +D +V + S+ + P + + H+ P S Sbjct: 199 DVVGTQDVDSIVADEMAPASVGITTYAAFPAMEESVMVEEFAVLIHLKAP---SSPATVT 255 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + A +D++ VLDVS SM TK+ + ++++ +++ + + +VTFS Sbjct: 256 SRAPIDLVTVLDVSWSMAG------TKLALLKRAMSFVIQALGPGD------RLSVVTFS 303 Query: 228 NKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L G +++ L G TN L+ A + D + C+ Sbjct: 304 SSARRLFPLRKMTESGRQRALQRVSSLVADG-GTNIADALRKAARVMEDRRERNPVCS-- 360 Query: 284 DANYKKIIVFMTDGENLST 302 IV ++DG + T Sbjct: 361 -------IVLLSDGRDTYT 372 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 47/290 (16%), Positives = 102/290 (35%), Gaps = 37/290 (12%) Query: 85 RIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 R+ + +F ++ + N + ++ I + Y P Sbjct: 81 RVTDQAVSTFSIDVDTGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYP-QPNGVHPF 139 Query: 140 KFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T PW N++ I + I + + Q +++ ++DVS SM S+ K+ Sbjct: 140 SVTTETVDSPWKENAKLIRIGIQAK-DLALQQLPPANLVFLVDVSGSM-----SAADKLP 193 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFG 254 + +++ + E+++ V ++T+++ + G + I L G Sbjct: 194 LVKQTLRILTEQLRAQDKVT------IITYASGEKLVLEPTSGEQKEKILAVINGLRA-G 246 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCN 313 +T ++ AY Q + ++ N I+ TDG+ D +L Sbjct: 247 GATAGEQAIQLAYKQA-EKAFVKNGINR--------ILLATDGDFNVGITDFSTLKGMVA 297 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSH 360 E +K G + +G +E L A A ++ ++N + Sbjct: 298 EKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNEAKKVVQR 347 >gi|170729849|ref|YP_001775282.1| hypothetical protein Xfasm12_0653 [Xylella fastidiosa M12] gi|167964642|gb|ACA11652.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 941 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 55/184 (29%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQMAPSVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F + + L G++ L+ + F T P A Sbjct: 200 VDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 259 ---------------APSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGINI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|71274892|ref|ZP_00651180.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71902167|ref|ZP_00684189.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] gi|71164624|gb|EAO14338.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71728071|gb|EAO30276.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] Length = 941 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 55/184 (29%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQMAPSVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F + + L G++ L+ + F T P A Sbjct: 200 VDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 259 ---------------APSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGINI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 56/178 (31%), Gaps = 42/178 (23%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM------QGMRQHCNTEDANY 287 L + L+ +I L+ G +T GL + + + ++ + + Sbjct: 448 TPLSTDKTALKAQINGLT-VGGATAGQIGLAWGWYMVAPNFGYLWPNASQRPAAYKARDL 506 Query: 288 KKIIVFMTDG-----------------------------ENLSTKEDQQSLYYCNEAKK- 317 K+++ MTDG + + Q+ C+ K Sbjct: 507 MKVVILMTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSSFDQAAELCDSIKAS 566 Query: 318 -RGAIVYAIGIRVIRSH---EFLRACASP-NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 +Y +G V FL CAS + Y + +F I ++I RI Sbjct: 567 ANDITLYTVGFTVGNDQTARNFLTNCASSTDKAYFPATGSELKASFQAIAQEISNLRI 624 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/326 (13%), Positives = 99/326 (30%), Gaps = 57/326 (17%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I A+ + ++L +++V + + + +D + + AA Sbjct: 20 RRLRRDDRGAIAIQFALLALPLSILLFGLLDVGRLSLQRRQMQDALDAATLMAARS---- 75 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + L + M+ + F ++ Sbjct: 76 -TATSSADLDTTGDAAFLAEIAGMNLGLTASSSTFSAGTNN------------------- 115 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I L+ W + + + S V S+ L++ +VLD++ SM Sbjct: 116 -RVIGTATATLRPIIANLWQSGNFTVTAS--SEVVRASKN---LEIALVLDITGSM---- 165 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----------------KIEE 232 + T+I + +++ + + LV +S ++ Sbjct: 166 -GNGTRIADLKVAAADLVDVLVRDTQTPFYSKMALVPYSAGVNVGATYADAVRGPVPVKT 224 Query: 233 FFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKY-AYNQIFDMQGMRQHCNT----EDAN 286 W + V T S GL Y I ++GM + ++ Sbjct: 225 ITGAAWASGSARSITGITRANPAVVTASGHGLSTGDYVYITGVRGMTSVNDKIYRVTRSD 284 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYC 312 K+ + T+ + S + ++ C Sbjct: 285 PDKVSLNSTNTSSASNYTNGGTIQKC 310 >gi|47219204|emb|CAG11222.1| unnamed protein product [Tetraodon nigroviridis] Length = 4421 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 96/262 (36%), Gaps = 30/262 (11%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL--KFCTFIPWYTNSRHIVM 156 + FV I S ++ + Y+++ +P + + + + M Sbjct: 341 QQGVFVIGIGT-RNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLKDTAM 399 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T +V+ + D++ +LD S D + + + ++E +++ + Sbjct: 400 PPTVTVERQPRGK---DVVFLLDGS-------DGTRSGFPAMRDFVQRLVETLRVDDKKD 449 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFD 272 + +V +S F L + + + L K G + N+ L+Y N +F Sbjct: 450 ---RVSVVQYSRDAAVHFYLNTYTTK-REILDALRGLRHKGGRALNTGEALQYLRNNVFT 505 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++++ +T G + + + ++ K G +++AIG R + Sbjct: 506 ASAGSRRTEG----VPQVLILLTGGRSSDSVD-----SPASDLKPLGVLIFAIGSRGSDN 556 Query: 333 HEFLRACASPNSFYLVENPHSM 354 E R SP S +V + Sbjct: 557 REIQRISHSPTSALVVPEFTDL 578 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 97/262 (37%), Gaps = 30/262 (11%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL--KFCTFIPWYTNSRHIVM 156 + FV I S ++ + Y+++ +P + + + + M Sbjct: 2256 QQGVFVIGIGT-RNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLKDTAM 2314 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T +V+ + D++ +LD S D + + + ++E +++ + Sbjct: 2315 PPTVTVERQPRGK---DVVFLLDGS-------DGTRSGFPAMRDFVQRVVETLRVDDKKD 2364 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFD 272 + +V +S F L + + + L K G + N+ L+Y N +F Sbjct: 2365 ---RVSVVQYSRDAAVHFYLNTYTTK-REILDALRGLRHKGGRALNTGEALQYLRNNVFT 2420 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++++ +T G + + + ++ K+ G +++AIG R + Sbjct: 2421 ASAGSRRTEG----VPQVLILLTGGRSSDSVD-----SPASDLKQLGVLIFAIGSRGSDN 2471 Query: 333 HEFLRACASPNSFYLVENPHSM 354 E R SP S +V + Sbjct: 2472 REIQRISHSPTSALVVPEFTDL 2493 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/219 (15%), Positives = 82/219 (37%), Gaps = 27/219 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + + + P +S + + Q D++ +LD S D + + Sbjct: 1745 QMLILLNGGRSYDSVDTPASSLKQQDQQEK---DVVFLLDGS-------DGTRSGFPAMR 1794 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGV 255 + ++E +++ + + +V +S F L + + + L K G Sbjct: 1795 DFVQRVVETLRVDDKKD---RVSVVQYSRDAAVHFYLNTYTTK-REILDALRGLRHKGGR 1850 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + N+ L+Y N +F + ++++ +T G + + + ++ Sbjct: 1851 ALNTGEALQYLRNNVFTASAGSRRTEG----VPQVLILLTGGRSSDSVD-----SPASDL 1901 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 K+ G +++AIG R + E R SP S +V + Sbjct: 1902 KQLGVLIFAIGSRGSDNREIQRISHSPTSALVVPEFTDL 1940 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 97/262 (37%), Gaps = 30/262 (11%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL--KFCTFIPWYTNSRHIVM 156 + FV I S ++ + Y+++ +P + + + + M Sbjct: 2809 QQGVFVIGIGT-RNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLKDTAM 2867 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T +V+ + D++ +LD S D + + + ++E +++ + Sbjct: 2868 PPTVTVERQPRGK---DVVFLLDGS-------DGTRSGFPAMRDFVQRVVETLRVDDKKD 2917 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFD 272 + +V +S F L + + + L K G + N+ L+Y N +F Sbjct: 2918 ---RVSVVQYSRDAAVHFYLNTYTTK-REILDALRGLRHKGGRALNTGEALQYLRNNVFT 2973 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++++ +T G + + + ++ K+ G +++AIG R + Sbjct: 2974 ASAGSRRTE----RVPQLLILLTGGRSSDSVD-----SPASDLKQLGVLIFAIGSRGSDN 3024 Query: 333 HEFLRACASPNSFYLVENPHSM 354 E R SP S +V + Sbjct: 3025 REIQRISHSPTSALVVPEFTDL 3046 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/260 (15%), Positives = 90/260 (34%), Gaps = 32/260 (12%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + FV I S ++ + Y+++ +P ++ + Sbjct: 1377 QQGVFVIGIGT-RNSDRTELQKISFEPSYTLAVTEFTDLPSI-------QEQLSSVMSTV 1428 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T D++ +LD S D + + + ++E +++ + Sbjct: 1429 LLKDTAMPPTVTGKDVVFLLDGS-------DGTRSGFPAMRDFVQRVVETLRVDDKKD-- 1479 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQ 274 + +V +S F L + + + L K G + N+ L+Y N +F Sbjct: 1480 -RVSVVQYSRDAAVHFYLNTYTTK-REILDALRGLRHKGGRALNTGEALQYLRNNVFTAS 1537 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + ++++ +T G + + + ++ K+ G +++AIG R + E Sbjct: 1538 AGSRRTEG----VPQVLILLTGGRSSDSVD-----SPASDLKQLGVLIFAIGSRGSDNRE 1588 Query: 335 FLRACASPNSFYLVENPHSM 354 R SP S +V + Sbjct: 1589 IQRISHSPTSALVVPEFTDL 1608 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 72/186 (38%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S D + + + ++E +++ + + +V +S Sbjct: 1 DVVFLLDGS-------DGTRSGFPAMRDFVQRVVETLRVDDKKD---RVSVVQYSRDAAV 50 Query: 233 FFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + L K G + N+ L+Y N +F + Sbjct: 51 HFYLNTYTTK-REILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSRRTEG----VP 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++++ +T G + + + ++ K+ G +++AIG R + E R SP S +V Sbjct: 106 QVLILLTGGRSSDSVD-----SPASDLKQLGVLIFAIGSRGSDNREIQRISHSPTSALVV 160 Query: 349 ENPHSM 354 + Sbjct: 161 PEFTDL 166 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 64/177 (36%), Gaps = 30/177 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++ ++ MI LD+S SM E S +TK M ++ L+ NN Sbjct: 79 GTRISHDKRNGIETMIALDISNSMLAEDVAPSRLTKSKMLVE---------NLVDHFNND 129 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQG 275 Q GLV F+ + + + ++ + T+ + Sbjct: 130 -QIGLVVFAGQSYVQLPITSDYVSAKMFLQDIQPSLIQTQGTDIAGAINTCM-------- 180 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H T + K I+ +TDGE+ + ++ A RG V+ +GI + Sbjct: 181 ---HAFTPNDKVGKAIIVITDGEDH----EGGAIEAAKAAHDRGINVFILGIGDSKG 230 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 319 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 372 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 373 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 429 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 430 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 476 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 477 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 504 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 80/232 (34%), Gaps = 48/232 (20%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + D NV + F Sbjct: 208 GASSPKDMVILVDVSGSVSGLT------LKLIKASVTEMLDTL-SDDDYVNVAR-----F 255 Query: 227 SNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + K E + ++ + G +T+ G +A+NQ+ + Sbjct: 256 NEKAEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKG-TTDYKSGFHFAFNQLLNKT---- 310 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LR 337 N AN KII+ TDG ++ + + N+ V+ + L+ Sbjct: 311 --NVPRANCNKIIMLFTDGGEDRAQDIFEQYNWPNKT----VRVFTFSVGQHNYDVTPLQ 364 Query: 338 --ACASPNSFYLVE-------NPHSMYDAF-------SHIGKDIVTKRIWYD 373 AC++ ++ + N D I K + ++ D Sbjct: 365 WIACSNKGYYFEIRSICAIRINTQEYLDVLGRPMVLAGRIAKQVQWTNVYQD 416 >gi|41054731|ref|NP_032422.2| integrin alpha-2 precursor [Mus musculus] gi|40781666|gb|AAH65139.1| Integrin alpha 2 [Mus musculus] gi|148686423|gb|EDL18370.1| integrin alpha 2 [Mus musculus] Length = 1178 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/319 (12%), Positives = 108/319 (33%), Gaps = 37/319 (11%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ- 123 ++ + G C + + L+++ ++++ +I + SL + + Sbjct: 61 LVGSPWSGFPENRMGDVYKCPVDLPTATCEKLNLQNSASISNVTEIKTNMSLGLTLTRNP 120 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSR 182 G ++ + + + +S Q L D+++V D S Sbjct: 121 GTGGFLTCGPLWAHQCGNQYYATGICSDVSPDFQFLTSFSPAVQACPSLVDVVVVCDESN 180 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--V 240 S+ + + + + + + P Q L+ ++N+ F L Sbjct: 181 SIYPW--------EAVKNFLVKFVTGLDIGP---KKTQVALIQYANEPRIIFNLNDFETK 229 Query: 241 SHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + + G TN+ +++A + + + K++V +TDGE+ Sbjct: 230 EDMVQATSETRQHGGDLTNTFRAIEFARDYAYS------QTSGGRPGATKVMVVVTDGES 283 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----ACASP--NSFYL 347 + + + CN + + I + + + ++ ++P F+ Sbjct: 284 HDGSKLKTVIQQCN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFN 340 Query: 348 VENPHSMYDAFSHIGKDIV 366 V + ++ + +G+ I Sbjct: 341 VADEAALLEKAGTLGEQIF 359 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 486 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 487 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 514 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 77/197 (39%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 215 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVYEMLDTLSDDDYVN------VASF 262 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + + ++ + G +T+ G +YA++Q+ + R + Sbjct: 263 NEKAKPVSCFKHLVQANIRNKKVFKEDVQGMVAKG-TTDYKAGFEYAFDQLQNSNITRAN 321 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N K +++G A Sbjct: 322 CN-------KMIMMFTDGGEDRVQDVFEKYKWPN--KTVRVFTFSVGQHNYDVTPLQWMA 372 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 373 CANKGYYFEIPSIGAIR 389 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 79/232 (34%), Gaps = 38/232 (16%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES------FFDSS 191 IP + + + + +D++ VLDVS SM + Sbjct: 44 TTPIFPEIPQGQARKDFQVLVRVEAPARPEARIPIDVVAVLDVSGSMNDPAAAPTERTRT 103 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQR-KIK 248 +++D+ + M+ +++ + +V FS++ E L + + +R I+ Sbjct: 104 TSRLDLLKTAAKFMVAKLEDGD------RLSIVAFSDRPVRELSSGLLYMTADGRRNAIR 157 Query: 249 YLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L + T P + A + QG IV +TDG ED Sbjct: 158 SLDQLEARGGTALVPAFEEAVKVLDGRQGDGGDRLG-------FIVLLTDG-----AEDA 205 Query: 307 QSLYYCNEAKKRGAI-------VYAIGIRVIRSHEFLRACA--SPNSFYLVE 349 + +E ++ V+A G+ E L A S ++ V+ Sbjct: 206 SGSFTLSERRREVIRGALGRYPVHAFGLGTAHGPEVLLYLAQESRGTYSFVD 257 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 60/172 (34%), Gaps = 21/172 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + ++ I+ + + ++ + + T+ + Sbjct: 575 SVNIGFLIDGSSSVGE------SNFRLMLEFISNVAKAFEISDIGSKIATVQF-TYDQRT 627 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + I+ + T + + + +F + N Sbjct: 628 EFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGANKN--------F 679 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +V +TDG+ A+K G V+++G+ + L+ AS Sbjct: 680 LVILTDGQ-----SYDDVRGPAVAAQKAGITVFSVGVAWAPLDD-LKDMASE 725 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 72/210 (34%), Gaps = 22/210 (10%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 S S + L+ P K D + D+ ++D S Sbjct: 330 SFSVTRTNNLVLEAVGRSVSTARPATGKRPKKPLEKKAGNKDCKADIAFLIDGS------ 383 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQR 245 ++ + ++ + + + + + +V G+V S + F L+ + Sbjct: 384 YNIGQRRFNLQKNFVGKVAVMLGIGTEGPHV---GVVQASEHPKIEFYLKNFTAAKEVLF 440 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 IK L G ++N+ LK+A + F M+ N KIIV DG E+ Sbjct: 441 AIKELGFRGGNSNTGKALKHAAQKFFSME------NGARKGIPKIIVVFLDGWPSDDLEE 494 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + A++ G V+ + + + E Sbjct: 495 AGIV-----AREFGVNVFIVSVAKPTTEEL 519 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 80/213 (37%), Gaps = 18/213 (8%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ SQ L ++++ D S SM + + +I ML +K + + Sbjct: 1 MEFTSQPKKVLPLILLADTSSSMREW-------MRELNTAIRDMLGTLKEQESLKAEIHI 53 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHC 280 +TF N G ++L + +S + + G+ + + + Sbjct: 54 SFITFGN----------GGANLHTALTPVSNIEFNDFTEGGMTPLGGALRIAKEMVENRE 103 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +Y II+ ++DG + + + N+ + + + ++GI ++ L+ + Sbjct: 104 IIPSKSYAPIILLLSDGAPNDNGWENEMYRFINDGRSKKCMRMSLGIGRDYDYDVLKGFS 163 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S Y ++ ++ D F + I K + D Sbjct: 164 SNGEVYEAKDSMNIIDFFKFMTMTIKEKTLSKD 196 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 80/229 (34%), Gaps = 46/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + +D+++ +D+S SM + D +++ + +A + P+ N Sbjct: 77 STNSWQNSSTEGIDIVLAMDISTSMMAQ-DLKPNRLEASKDVASAFING---RPNDN--- 129 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + L + L K + T GL A ++I Sbjct: 130 -IGLVVFAAESFTQCPLTTDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI------- 181 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----- 332 + K+I+ +TDG N + + + AK G VY IG+ Sbjct: 182 ----KDSQAKSKVIILLTDGVNN--QGEIAPVTAAEIAKTFGVRVYTIGVGTQGKAPYPF 235 Query: 333 ----------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + S+ + +S I K Sbjct: 236 QTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK 284 >gi|198421146|ref|XP_002121270.1| PREDICTED: similar to cubilin [Ciona intestinalis] Length = 728 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 66/188 (35%), Gaps = 32/188 (17%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ D+M +LD S S+ S + +A + + + L Q Sbjct: 525 TTAKVCQNSHTDLMFLLDSSGSVTS------SDFQLAANFVKDFITGIDLTS-----FQV 573 Query: 222 GLVTFSNK-------IEEFFLLEWGVSHL------QRKIKYLSKFGVSTNSTPGLKYAYN 268 G++ +S+ + + E + + + + G +T + + A Sbjct: 574 GVMQYSHYLLNRELDDQPYITTEINIGEYTEADPFKTAMDTIQPHGYTTYTAHAVLKA-- 631 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + R KIIV +TDG + + + + EAK G +YAIG+ Sbjct: 632 --IRVDFPRSTRFNNSCT-SKIIVLITDGSSSDSSMLRDAAL---EAKNLGVDIYAIGVG 685 Query: 329 VIRSHEFL 336 + E + Sbjct: 686 DANTQELV 693 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/260 (13%), Positives = 83/260 (31%), Gaps = 45/260 (17%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + ++ FF N G ++ I + ++ + ++ S + K I+++ + + Sbjct: 8 IKLKQFFQNEDGAYAVIMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLA----L 63 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMS-FRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 M E N K D+L + + F + + + + + Sbjct: 64 MAENNDFRTTNQKHADVLRQTVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTYF 123 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS------------------ 166 + ++ +Y P + I I+ + S Sbjct: 124 KD-NLKISDKYDYQCNNLQG-PNGEQLKSIACEISGKFERPSWLYLGKNNGLSFAETTTI 181 Query: 167 -----------QTDARLDMMIVLDVSRSMESFFDS---SITKIDMAIKSINAMLEEVKLI 212 +D+M+V D+S SM S KID+ + ++A+ +E+ Sbjct: 182 NANKIYIQKNLDEIIPIDLMLVADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQ 241 Query: 213 PD------VNNVVQSGLVTF 226 + ++ + G +F Sbjct: 242 NNTEEGKVISQYNRIGFTSF 261 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 46/145 (31%), Gaps = 24/145 (16%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE- 298 + L + + G + +S+ L A + + AN ++II+ ++DGE Sbjct: 378 SADLVSRFNRVPALGATLSSSGLLIGANLLMNTNPDENAQPSKLGANTQRIILVLSDGED 437 Query: 299 --NLSTKEDQQSLYY-----CNEAKKR------------GAIVYAIGIRVIRS---HEFL 336 N ++ + C + K + + + S Sbjct: 438 QINNASSSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFVAFGYGPSGTQKAAW 497 Query: 337 RACASPNSFYLVENPHSMYDAFSHI 361 C +Y+ N + ++F I Sbjct: 498 EKCV-GKYYYVANNKEELLESFRKI 521 >gi|281338501|gb|EFB14085.1| hypothetical protein PANDA_006133 [Ailuropoda melanoleuca] Length = 984 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 117 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 164 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 165 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 223 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 224 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 270 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 271 WMACENKGYYYEIPSIGAIR 290 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 68/188 (36%), Gaps = 16/188 (8%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + G I+TA+ + + GM ++ +H ++T L++ D + V I + + Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVG---SIAEKSS 64 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 G+ + T + ++ ++ +D+ +S Sbjct: 65 AVAAAMAMNGNGTISLGKTDARNIFMSQVSG----ELAEVHVDLGIDVTKTANKLNSQVS 120 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFD 189 + +P F + I+ + QT A +D I+LD + SM Sbjct: 121 FTAT--VPTTFMQIFGRD------SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATP 172 Query: 190 SSITKIDM 197 S ++K++ Sbjct: 173 SDVSKLEA 180 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 75/217 (34%), Gaps = 39/217 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D + VLD S SM I+ + IN ++ + + G V +++ + Sbjct: 47 DAVFVLDTSYSMRDTDPEGIS-----AEVINMFMDLSDA-----DRTRVGFVAYNHNVVA 96 Query: 233 FFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L S +Q+ I+ L++ G T+ GL+ + Sbjct: 97 SKPLTSIAVAAQKSQIQQDIRTLNRSGY-TDLGLGLRRGSELL---------AAGASQGR 146 Query: 288 KKIIVFMTDGEN---------LSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEF 335 + ++ ++DGE + A+ +G VY IG+ + E Sbjct: 147 QPFMILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIGLNHDGTVNRQEL 206 Query: 336 LRACA-SPNSFYLVENPHSMYDAFSHI-GKDIVTKRI 370 R + + + ++ + + + + I I +K + Sbjct: 207 ERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLV 243 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 220 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 267 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 268 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 326 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 327 CN-------KVIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 377 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 378 CANKGYYFEIPSIGAIR 394 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 218 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 265 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 266 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 324 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 325 CN-------KVIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 375 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 376 CANKGYYFEIPSIGAIR 392 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 76/216 (35%), Gaps = 37/216 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V + A D+++VLD S SM + K+ A ++ +++ + Sbjct: 296 TLVPPAEPSSAPRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDAGD------ 343 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--------HLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + ++ F ++IE + G+ + L G + + P L A + Sbjct: 344 RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQP-LTNAVEMLA 402 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D RQ +V +TDG+ +SL +Y +G+ Sbjct: 403 DSGEDRQAS----------VVLVTDGQISGEDHLLRSLAPVV----GRTRIYCVGVDRAV 448 Query: 332 SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + FL A LVE+ + + + + + I Sbjct: 449 NAGFLERLAGLGSGRAELVESEDRLDEVMARLARTI 484 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 76/220 (34%), Gaps = 45/220 (20%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + + SS + + + + ++ VK + N+ GLV+F + E L +S ++ Sbjct: 165 KCYRQSSYNEKNRLKHAQESAIKFVKKFENDKNIS-IGLVSFDTRAIEQKELTSSLSEVK 223 Query: 245 RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------ 297 I L + +TN GLK A + + + K ++ M+DG Sbjct: 224 SSINNLKVAYNGATNIEAGLKSAQKIL----------KKGNEDADKYVILMSDGFPTAFD 273 Query: 298 -------ENLSTKEDQQ------------------SLYYCNEAKKRGAIVYAIGIRVIRS 332 EN + E Q S+ + KK G + IG + Sbjct: 274 YAGEKFEENFNEHEVQDNTFINFGYNDYRGYAMKHSINQADSLKKVGINSFIIGFSDGAN 333 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E L A+ + N ++ A++ I + I Sbjct: 334 SEKLNKIAKAAGGEYEEARNTDALNGAYNKIETKVKAPLI 373 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 57/140 (40%), Gaps = 22/140 (15%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-------EFFLL 236 + F + ++ID K N +++ K + +V +S+K + + FL Sbjct: 715 VYEFNEQDRSRIDSVKKVANDFVDKFKDDENTE----IAIVRYSSKADVVLDNSNKVFLS 770 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 ++++I L V+TN G++ +Y+ + + D + +K ++ MTD Sbjct: 771 SKDNETIKKRINSLKA-DVATNIGDGIRKSYSIL----------DKCDKDSEKYMILMTD 819 Query: 297 GENLSTKEDQQSLYYCNEAK 316 G + ++ N K Sbjct: 820 GVPTAYTCYANTIKTLNNCK 839 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 69/207 (33%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 331 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 384 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ + E F ++ + + Sbjct: 385 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRTEFSFTDYNTKENVLAVLANIRYMS 443 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F D+ K +V +TDG+ Sbjct: 444 GGTATGDAIAFTVRNVFGP--------IRDSPNKNFLVIVTDGQ-----SYDDVRGPAAA 490 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + LR AS Sbjct: 491 AHDAGITIFSVGVAWAPLDD-LRDMAS 516 >gi|52695907|pdb|1TZN|AA Chain a, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695909|pdb|1TZN|BB Chain b, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695911|pdb|1TZN|CC Chain c, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695913|pdb|1TZN|DD Chain d, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695915|pdb|1TZN|EE Chain e, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695917|pdb|1TZN|FF Chain f, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695919|pdb|1TZN|GG Chain g, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695921|pdb|1TZN|HH Chain h, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695923|pdb|1TZN|II Chain i, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695925|pdb|1TZN|JJ Chain j, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695927|pdb|1TZN|KK Chain k, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695929|pdb|1TZN|LL Chain l, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695931|pdb|1TZN|MM Chain m, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor gi|52695933|pdb|1TZN|OO Chain o, Crystal Structure Of The Anthrax Toxin Protective Antigen Heptameric Prepore Bound To The Vwa Domain Of Cmg2, An Anthrax Toxin Receptor Length = 181 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 70/202 (34%), Gaps = 27/202 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 5 AFDLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQA 52 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ L + T GLK A QI G++ Sbjct: 53 TIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSS-------- 104 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 II+ +TDG+ ++ GA VY +G+ + R S + V Sbjct: 105 -IIIALTDGKLDGLVPSYAEKE-AKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPV 162 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 + + A I I+ + Sbjct: 163 KGG---FQALKGIINSILAQSC 181 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 69/207 (33%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 331 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 384 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ + E F ++ + + Sbjct: 385 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRTEFSFTDYNTKENVLAVLANIRYMS 443 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F D+ K +V +TDG+ Sbjct: 444 GGTATGDAIAFTVRNVFGP--------IRDSPNKNFLVIVTDGQ-----SYDDVRGPAAA 490 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + LR AS Sbjct: 491 AHDAGITIFSVGVAWAPLDD-LRDMAS 516 >gi|325108192|ref|YP_004269260.1| hypothetical protein Plabr_1627 [Planctomyces brasiliensis DSM 5305] gi|324968460|gb|ADY59238.1| protein of unknown function DUF1355 [Planctomyces brasiliensis DSM 5305] Length = 938 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 45/297 (15%), Positives = 96/297 (32%), Gaps = 27/297 (9%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 GN + L+ G I + N + D + I + V + SL + + Q Y Sbjct: 297 GNLAQALEAGRIATDVANIDEFPLTQDSLD-PYRAVILENVPARSLGRIKMEQLAQYVED 355 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFD 189 + +F + + + S+++ + RL + I LD S SM + Sbjct: 356 LGGGLMLTGGQRSFGVGGYFNSPLDDVLPVSMEMREEHRKNRLALAIALDRSGSMTAPVS 415 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKI 247 TK+D+A + + +V ++ + L + +++ Sbjct: 416 GGKTKMDLANLGTAECIRLLSPSDEVA------VIAVDSTPHTIVPLTNVSNPDDIAQQV 469 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + G L A N++ +H I+ +D + + Sbjct: 470 LGIQSMGGGIFVYEALVAAGNELMKSDLATKH-----------IILFSDAADSEEPGAYR 518 Query: 308 SLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 SL + + G V IG+ + L+ A+ + ++ + F+ Sbjct: 519 SLI--KDYENAGITVSVIGLGTTADVDAKLLQEIATLGSGNIMFTQDVAELPRLFTE 573 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 53/177 (29%), Gaps = 41/177 (23%) Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDANY 287 L L+ +I LS G ST G + + + ++ + Sbjct: 354 TPLSSDRVTLKGQINALS-IGGSTAGQIGFAWGWYMVSPNFGYLWPNATQRPAPYNSKDL 412 Query: 288 KKIIVFMTDGENLST----------------------------KEDQQSLYYCNEAKKRG 319 K++V MTDG + Q+ C+ K Sbjct: 413 VKVVVLMTDGAFNTPYCKGVIAKDAGSGSGAVDDHINCVATNGDAFTQTRKLCDAMKDPS 472 Query: 320 AI--VYAIGIRV---IRSHEFLRACASPNSF-YLVENPHSMYDAFSHIGKDIVTKRI 370 ++ +G V + L+ CA+ Y + AF I ++I + RI Sbjct: 473 LKLTIFTVGFDVGGDANAVNMLKYCATDAQHVYFPATGSELKTAFKSIAQEISSLRI 529 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 42/305 (13%), Positives = 93/305 (30%), Gaps = 58/305 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R + +G + I A+ + +++ +I++ I + + +D + + AA Sbjct: 8 FRRLGRDDRGAIAIQFALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLMAARS--- 64 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 +L+ + ++ + F + + T+ V Sbjct: 65 --TAVTDAELESVGDPAFLAEIAGLNLGLSASNASFKAGAGNHIIGTATATVKP------ 116 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 I + + TS V +S+ L++ +VLD++ SM Sbjct: 117 ----------------IIANLWTTDDFNLTATSDVVRSSKN---LEVAVVLDITGSM--- 154 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE---------- 237 S ++I + +++ V + +V +S + + Sbjct: 155 ---SGSRITDLKTGASDLVDIVVKDQQAPFYSKVAIVPYSVGVNVGTYADAVRGAVIART 211 Query: 238 -WGVSHLQRKIKYLSKFG--------VSTNSTPGLKYA--YNQIFDMQGMRQHCNTEDAN 286 GVS + + G +S S P + YN I N +N Sbjct: 212 ITGVSKTNAAVVASAAHGFIVGDKVTISGVSGPTMLNGNTYN-ITAASADSFTINANTSN 270 Query: 287 YKKII 291 K + Sbjct: 271 APKYV 275 >gi|195941904|ref|ZP_03087286.1| hypothetical protein Bbur8_03396 [Borrelia burgdorferi 80a] gi|312149118|gb|ADQ29189.1| von Willebrand factor type A domain protein [Borrelia burgdorferi N40] Length = 333 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + + N A+ +Y+IGI S SF Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|108756796|ref|YP_635538.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108460676|gb|ABF85861.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 700 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/257 (14%), Positives = 86/257 (33%), Gaps = 27/257 (10%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++ + Y + ++ F H+V + +V+ Sbjct: 270 GSLPNEQAVRVEEFVNTFDYGYAHQGSAPFSVQVEGFPSPVRKGYHVVHVGVKAREVSRP 329 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ V+DVS SM ++ + ++++ ++ E+ Q +V + Sbjct: 330 QRKPSHLVFVIDVSGSMNLE-----NRLGLVKRALHLLVNELDERD------QVSIVVYG 378 Query: 228 NKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + V ++ I L G STN+ GL+ Y+ H Sbjct: 379 STARLVLEPTSAVHAHIIRAAIDSLHTEG-STNAQAGLEMGYSL------AASHLVEGGI 431 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLRACAS--- 341 N ++ +DG + D S++ A+ +G + +G + ++ L S Sbjct: 432 NR---VILCSDGVANTGLTDANSIWERIRARAAKGITLSTVGFGMGNYNDVLMERLSQVG 488 Query: 342 PNSFYLVENPHSMYDAF 358 ++ V+ + F Sbjct: 489 EGNYAYVDRIEEAHRIF 505 >gi|309358136|emb|CAP34492.2| CBR-DIG-1 protein [Caenorhabditis briggsae AF16] Length = 13580 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/226 (21%), Positives = 87/226 (38%), Gaps = 32/226 (14%) Query: 141 FCTFIPWYTNSRHIVMPIT------SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 C +P ++R P T S + S D D++IVLD S E+F Sbjct: 12821 ICKIVPNVESARTWPTPRTKATTLAGSRRSCSTIDYESDVIIVLDSS---ENFTPDEFDS 12877 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + A+ SI L PDV+ + G V +S+K+ L H + KI+ L K Sbjct: 12878 MKDAVASIVD--TGFDLAPDVS---KIGFVIYSDKVAVPVAL----GHYEDKIELLEKIV 12928 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + G+ A ++ + RQ N KI++ +T+G+N + Sbjct: 12929 DAEKINDGVAIA---LYGLNAARQQFQLHGRENATKIVLLITNGKNRGN-----AAAAAE 12980 Query: 314 EAKKR-GAIVYAIGIR-VIRSHEFLRAC---ASPNSFYLVENPHSM 354 + + G ++A+ + ++ A+P++ V + Sbjct: 12981 DLRDMYGVQLFAVAVGSNPDELATIKRLVGNANPDNAIEVAQSTEI 13026 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 70/212 (33%), Gaps = 35/212 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++V+D S SM+ D ++ A I M + + +V+F + + Sbjct: 115 DLVLVIDNSGSMQQ-TDPDNERLTAAKSLIGQM----------DGDKRVAIVSFESTAQL 163 Query: 233 FFLLE-----WGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + KI + T L +I + Sbjct: 164 VQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEIGLALDETIKEIETQGNAEKGS------ 217 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACAS--P 342 +++ ++DG + + + Y + + IG+++ S L+ AS Sbjct: 218 ---LVIMLSDGFSELDTQTALAPYIARQ-----IPINTIGLKLAESEGIALLQNIASLTG 269 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 ++ V N + AF I I + + ++ Sbjct: 270 GTYSNVANAQGLTQAFGKIYNKIGDRTLVTER 301 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/251 (14%), Positives = 79/251 (31%), Gaps = 44/251 (17%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P R I++ + S+ ++ +VLD S S+ + + K Sbjct: 752 GHSPGEQQKRRIILDPSGSM----------NIYLVLDGSDSIGAGNFTGAKKC------- 794 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFG 254 ++ ++ + + LVT++ +E +W L K Sbjct: 795 --LVNLIEKVASYGVKPRYALVTYATYPRIWVKVSDQESSNADWVTKKLSEINYEDHKLK 852 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN 313 TN+ L+ Y+ + + +I+ MTDG N+ + Sbjct: 853 SGTNTKRALQAVYS-MMSWPEDIPPEGWNRTRH--VIILMTDGLHNMGGDPITVIDEIRD 909 Query: 314 ---------EAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSH 360 ++ VY G+ + + A AS + V++ ++ D F Sbjct: 910 LLYIGKDRKNPREDYLDVYVFGVGPLVDQVNINALASKKDNEQHVFKVKDMENLEDVFFQ 969 Query: 361 IGKDIVTKRIW 371 + + + + Sbjct: 970 MIDESQSLSLC 980 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 20/153 (13%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + L SH+ + G TN+ L Y + + + Sbjct: 146 YSYDFPEDVAPALGTSFSHMLGATNPTQRTKDHENGTGTNTYAALNSVYLMMNNQMQLLG 205 Query: 279 HCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDG++ + + + + N+ + +YAIG+ + Sbjct: 206 MKTMAWQEIRHAIILLTDGKSNMGGSPKTAVDQIREILNINQKRNDYLDIYAIGVGKLDV 265 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++++ +++ F H Sbjct: 266 DWRELNELGSKKDGERHAFILQDTKALHQVFEH 298 >gi|218249312|ref|YP_002374701.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223889245|ref|ZP_03623833.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226320920|ref|ZP_03796471.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|226321491|ref|ZP_03797017.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|218164500|gb|ACK74561.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223885278|gb|EEF56380.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226232680|gb|EEH31433.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|226233692|gb|EEH32422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|312147800|gb|ADQ30459.1| von Willebrand factor type A domain protein [Borrelia burgdorferi JD1] Length = 333 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 71/196 (36%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRSFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + + N A+ +Y+IGI S SF Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/229 (17%), Positives = 80/229 (34%), Gaps = 46/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + +D+++ +D+S SM + D +++ + +A + P+ N Sbjct: 77 STNSWQNSSTEGIDIVLAMDISTSMMAQ-DLKPNRLEASKDVASAFING---RPNDN--- 129 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + L + L K + T GL A ++I Sbjct: 130 -IGLVVFAAESFTQCPLTTDHTVLLNLFKDVQPGIIQDGTAIGLGLANAVSRI------- 181 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----- 332 + K+I+ +TDG N + + + AK G VY IG+ Sbjct: 182 ----KDSQAKSKVIILLTDGVNN--QGEIAPVTAAEIAKTFGVRVYTIGVGTQGKAPYPF 235 Query: 333 ----------------HEFLR--ACASPNSFYLVENPHSMYDAFSHIGK 363 L+ A + ++ + S+ + +S I K Sbjct: 236 QTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFRATDNASLKEIYSEIDK 284 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + ++ ++ L + V G+VTF + Sbjct: 307 VCLVLDKSGSM-----AIGDRLKRLTQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQ 356 Query: 234 FLLEW--GVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G + K L T+ GL+ A+ I R+ T+ + Sbjct: 357 SALRQIKGGTDRDALTKSLPTVASGGTSICSGLRSAFTVI------RKKYKTDGSE---- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISSCFNEVKQSGAIIHTVALGPSAAAELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAQNNGLIDAFGALS 476 >gi|47228197|emb|CAG07592.1| unnamed protein product [Tetraodon nigroviridis] Length = 139 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 67/158 (42%), Gaps = 21/158 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++ D+ +LD S S+ + + D ++ ++ K +VQ + + Sbjct: 3 SQADVAFLLDGSGSV------NPSDFDRMKTFVSTLVRSFKGRDTKFAIVQFATIF---R 53 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 IE F + S+L+R I + +F TN+ + + N +F G N K Sbjct: 54 IESSFTDQ---SNLERNINNIVQFKAGTNTASAINFVVNNVFVASG------GSRENVGK 104 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +++ +TDG++ + + + + + + +AIG+ Sbjct: 105 VLIVITDGKSNNKSKLPGAARAADAKR---IVRFAIGV 139 >gi|291245103|ref|XP_002742431.1| PREDICTED: complement component factor B/C2-like [Saccoglossus kowalevskii] Length = 782 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 68/203 (33%), Gaps = 30/203 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ D S S++ + + ++ E + + + ++F++ Sbjct: 293 LDLVFAFDCSSSIDP------VDFNRGMIFSRHIVREFGVSYEPGG-TRVAAISFASVAT 345 Query: 232 EFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F L +++ + G T +K A+ +F R Sbjct: 346 LEFNLGDDIVNTTEKAIEQLQVIQASGGGT----AMKEAFEIMFSEVVPR-----LRPES 396 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASP-- 342 KK + +TDG++ S + Y + R ++AIGI L+ AS Sbjct: 397 KKAMFIITDGKSNSGSP----ISYAQRLRDRNEHAFEIFAIGIGNGVDRNELKKIASEPF 452 Query: 343 -NSFYLVENPHSMYDAFSHIGKD 364 + +L+ + I + Sbjct: 453 TSHVFLIRQYEDLTTLTDIISEK 475 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 72/210 (34%), Gaps = 35/210 (16%) Query: 173 DMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+M+V+D+S SM+ I+++D + + ++ + GL+ F Sbjct: 102 DIMVVVDLSGSMQKKDFVNQQGHKISRLDAVKEVLTDFIKT-------RQGDRLGLILFG 154 Query: 228 NKIEEFFLLEWGVSHLQRKIK-----YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F+ + + ++ G ST+ + + + T Sbjct: 155 ---DAAFVQTPFTADHDVWLDLLMQTRVNMAGKSTHLGDAIGLTIKRFNEATK----NQT 207 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR--- 337 + +K+ + ++DG T + AK +Y I I + + L Sbjct: 208 SEKTREKVAIILSDG--NDTGSYVPPIDAAMVAKVNAVRIYMIAIGDPKSVGEQSLDMQT 265 Query: 338 ----ACASPNSFYLVENPHSMYDAFSHIGK 363 A S Y N + +A++ I K Sbjct: 266 INKIASVSGGQAYQALNQQELLNAYAKIDK 295 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D +I+ A I+ ++E + + Sbjct: 79 GTKLETVKREGVDIVFAIDVSKSMLAE-DVKPNRIEKAKHIISQLIEVLHGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 + ++ + L S + ++ ++ T ++ A N + Sbjct: 131 VAFIPYAAQAYPQLPLTSDYSSAKIFLEGINTNMLSSQGTAIGEAIQMAINYFEESSQTS 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 KI++ ++DGE+ D EAK +G ++ IG+ + Sbjct: 191 -----------KILIILSDGEDHQQGVDT----VIQEAKDKGIRLFTIGLGTAQG 230 >gi|291295701|ref|YP_003507099.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470660|gb|ADD28079.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 298 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 74/195 (37%), Gaps = 24/195 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ ++ SM D + ++ + A+++++ ++ G+VTFS Sbjct: 86 VILAIENGWSMRQ-TDIAPNRMVATQMAAKALVDKLP------RHIKVGVVTFSGYGTLL 138 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +++ I L G + T GL A + + + +IV Sbjct: 139 LPPTTDRKAIRQAIDNLD-LGGGFSFTYGLLAALEAL--------PQTPPEGSRPGVIVL 189 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---EFLRACA--SPNSFYLV 348 + G L ++A +RG V+AIG+ + E L+ A + +Y + Sbjct: 190 FSHG---HDVSGNDPLKIADQALERGIQVHAIGVGTHGHNFDEEMLKKVADRTGGRYYPI 246 Query: 349 ENPHSMYDAFSHIGK 363 + + A + +G+ Sbjct: 247 FSASDLSKAHADLGR 261 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 73/205 (35%), Gaps = 36/205 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D++ V+DVS SM K+ M ++ ++ +K + LVTF Sbjct: 54 ERPAIDLVAVIDVSGSMAG------QKLKMVQSTLEFLMRNLKDTD------RFALVTFD 101 Query: 228 NKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ F L + + ++ L G TN + GL + + ++ Sbjct: 102 SDVKTVFDLRPMTTAHKEACLADVQKLRA-GSCTNLSGGLFRGVELMQQRGATKGAVSS- 159 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAK-----KRGAIVYAIGIRVIRSHEFLR 337 I+ MTDG N ++ C + +Y G + LR Sbjct: 160 -------ILLMTDGIANEGVRDKDD---MCRALRGLMGPAPDYTIYTFGYGKDHNENMLR 209 Query: 338 ACASPNS--FYLVENPHSMYDAFSH 360 + + +Y +E+ + ++F Sbjct: 210 QLSETGNGMYYFIESNDIIPESFGD 234 >gi|162448748|ref|YP_001611115.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] gi|161159330|emb|CAN90635.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 44/212 (20%), Positives = 82/212 (38%), Gaps = 27/212 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + S V + + L ++I +D S SME I + M++ ++ Sbjct: 113 IQIGMNSPVDLGALERPPLHLVIAVDTSGSMEG------DPIAYVRAGLVEMIDALQPTD 166 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + LV +S+ E G L + L+ G STN GL AY Sbjct: 167 ------RISLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARG-STNLYEGLFTAY---- 215 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 QH + N ++F++DG Q+ + ++G + AIG+ Sbjct: 216 --ALAEQHLDPAWQNR---VIFLSDGVATAGLTSPQRLVSLAAGYAEKGIGLTAIGVGAE 270 Query: 331 RSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + +R + +FY +E+P ++ + F+ Sbjct: 271 FDVDAMRGISEVGAGNFYFLEDPKAVEEVFAE 302 >gi|73981989|ref|XP_852918.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Canis familiaris] Length = 1147 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 303 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 350 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 351 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 409 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 410 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 456 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 457 WMACENKGYYYEIPSIGAIR 476 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 101 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASI 153 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 154 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 202 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + + N A+ +Y+IGI S SF Sbjct: 203 IVVLTDGVVNSDEIXKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSFK 260 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 261 EVYDPSMLVEISNKTG 276 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 52/464 (11%), Positives = 135/464 (29%), Gaps = 137/464 (29%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + + + + + + ++ ++ G ++ ++ + +D +++ A Q+ Sbjct: 16 FGRDARASILPMVGVLVALMVVIGGAGLDYGRAIMLRASISHALDAAVLAVARQLSVSIM 75 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 ++ IK+ + + + + D+ ++ + I S Sbjct: 76 TDSELDKA-------IKDAFAANMASAGLSGATLGDLTYVLDPDAGTI-----------S 117 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A + +P F ++ + ++ + + +++ +V+DV+ SM + Sbjct: 118 ATATALVPTYF--IHVGGLGPENVAIAASADA---TYSRFDVELAMVVDVTGSMR----N 168 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-------------------- 230 S+ + A +S+ +L + V+ LV +S + Sbjct: 169 SMASLRTAAQSVVDIL-IPDGTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCV 227 Query: 231 ------EEFFLLEWGVSHLQ----------------------------RKIKYLSKFGVS 256 E++ + + I L G + Sbjct: 228 TERMGNEKYTDATYNYNGTSSEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKDNGRT 287 Query: 257 TNSTP------GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 T L ++ ++ + D++ K + MTDG+ + + Sbjct: 288 AGQTGIAWGWYALSPKWSNLW--PNDSVPGSYTDSDILKFALIMTDGDFNEYYDKATAQS 345 Query: 311 YCN------------------------------------------EAKKRGAIVYAIGIR 328 C K+ G VY+I Sbjct: 346 NCKWQFNWSTFKWEQVCDSSYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFG 405 Query: 329 ---VIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + ++ CAS +F++ + + AF+ I I Sbjct: 406 SNANSAGAKVMKDCASSTKETFFMATSDSELIAAFAKIANKIQN 449 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 348 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 401 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 402 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 458 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 459 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 505 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 506 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 533 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/232 (17%), Positives = 80/232 (34%), Gaps = 48/232 (20%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + D NV + F Sbjct: 194 GASSPKDMVILVDVSGSVSGLT------LKLIKASVTEMLDTL-SDDDYVNVAR-----F 241 Query: 227 SNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + K E + ++ + G +T+ G +A+NQ+ + Sbjct: 242 NEKAEAVVPCFDHLVQANVRNKKIFKEAVQQMQAKG-TTDYKSGFHFAFNQLLNKT---- 296 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LR 337 N AN KII+ TDG ++ + + N+ V+ + L+ Sbjct: 297 --NVPRANCNKIIMLFTDGGEDRAQDIFEQYNWPNKT----VRVFTFSVGQHNYDVTPLQ 350 Query: 338 --ACASPNSFYLVE-------NPHSMYDAF-------SHIGKDIVTKRIWYD 373 AC++ ++ + N D I K + ++ D Sbjct: 351 WIACSNKGYYFEIRSICAIRINTQEYLDVLGRPMVLAGRIAKQVQWTNVYQD 402 >gi|3183039|sp|Q62469|ITA2_MOUSE RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|473099|emb|CAA82877.1| VLA-2 homologue [Mus musculus] Length = 1178 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 39/319 (12%), Positives = 108/319 (33%), Gaps = 37/319 (11%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ- 123 ++ + G C + + L+++ ++++ +I + SL + + Sbjct: 61 LVGSPWSGFPENRMGDVYKCPVDLPTATCEKLNLQNSASISNVTEIKTNMSLGLTLTRNP 120 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSR 182 G ++ + + + +S Q L D+++V D S Sbjct: 121 GTGGFLTCGPLWAHQCGNQYYATGICSDVSPDFQFLTSFSPAVQACPSLVDVVVVCDESN 180 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--V 240 S+ + + + + + + P Q L+ ++N+ F L Sbjct: 181 SIYPW--------EAVKNFLVKFVTGLDIGP---KKTQVALIQYANEPRIIFNLNDFETK 229 Query: 241 SHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + + G TN+ +++A + + + K++V +TDGE+ Sbjct: 230 EDMVQATSETRQHGGDLTNTFRAIEFARDYAYS------QTSGGRPGATKVMVVVTDGES 283 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----ACASP--NSFYL 347 + + + CN + + I + + + ++ ++P F+ Sbjct: 284 HDGSKLKTVIQQCN---DDEILRFGIAVLGYLNRNALDTKNLIKEIKAIASTPTERYFFN 340 Query: 348 VENPHSMYDAFSHIGKDIV 366 V + ++ + +G+ I Sbjct: 341 VADEAALLEKAGTLGEQIF 359 >gi|256374530|ref|YP_003098190.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918833|gb|ACU34344.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 550 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 56/171 (32%), Gaps = 22/171 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D++ LD + SM ++ T I+ + ++ + GLVTF + Sbjct: 49 GPMDVVFALDDTGSMGGALNNIKTSINAVVG---------DVVSASGGDYRLGLVTFKDS 99 Query: 230 IEEFFLLEWG-----VSHLQRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQGMRQHCNT 282 I L G ++ + G S L+ A + + Sbjct: 100 INVVTGLAAGNAGTVTGYVTNVLAASGGGGEPEASDEALRTAVSLRPAAGIPQNADFTGP 159 Query: 283 EDANYKKIIVFMTDGEN------LSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N +K +V +TD + + + N A G + A+ + Sbjct: 160 WRSNARKFVVLVTDARPGGFDDAFTAADQASATAVANSALAAGVKLSAVYV 210 >gi|260832994|ref|XP_002611442.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] gi|229296813|gb|EEN67452.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] Length = 1121 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 41/262 (15%), Positives = 79/262 (30%), Gaps = 36/262 (13%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT----DARLDMM 175 PQN G S + C S +V + + D+ D++ Sbjct: 884 ASPQNGGMSCTNRLENGRCTFTCNHGYHLQGSSEVVCGADGQWRGTPPSCAACDSAADVI 943 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFF 234 +DVS S+ FD I ++ + + + G++ F+ + Sbjct: 944 FSIDVSGSVSGQFD-------TVRNFIRGVVNCLTIGGSHA---RVGVIKFAGSNANRQI 993 Query: 235 LLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + L +I+ L + S + F G +KI + Sbjct: 994 SLTDYNNKNDLLVRIRNLDRSLGSAVGSVAGLSVMRNEFSTSG--------RPTARKIGI 1045 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVE 349 +TDG T + + G ++++G+ R E L AS + + Sbjct: 1046 VLTDGR--DTSSSDAVIPDAETLRNEGTTIFSVGVANAR-RETLENMASRPVNENVFTAT 1102 Query: 350 NPHSMYDAFSHIGKDIVTKRIW 371 + + I + K Sbjct: 1103 -----FASLQTIVDSLRPKIWC 1119 >gi|187735658|ref|YP_001877770.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187425710|gb|ACD04989.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 859 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/191 (17%), Positives = 64/191 (33%), Gaps = 20/191 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M IVLD S SM TK+D+ + + + L+ D + + + + + I Sbjct: 391 MSIVLDRSGSMSCSVPGGKTKMDL---ANAGTCQTISLLSDQDLISVHAVDSEPHPIVTL 447 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + ++ G GLK + ++ +H ++ Sbjct: 448 SSLGPNRKKMISSVSRIASMGGGIFIGAGLKAGWQELQRSVAGTRH-----------LLL 496 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACAS--PNSFYLVE 349 D ++ D + E K G V I + +S + LR A + + Sbjct: 497 FADADDSEEPADYR--ETLKEMVKEGVTVSVIALGTEKSADAGLLREIAELGRGRIFFCD 554 Query: 350 NPHSMYDAFSH 360 P + F+ Sbjct: 555 RPGDIPSIFAQ 565 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 180 FSYHMPNWFGTTKYVKPLVQKLCSHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 233 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 234 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 290 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 291 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 337 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 338 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 365 >gi|194217500|ref|XP_001502733.2| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Equus caballus] Length = 1163 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 82/217 (37%), Gaps = 26/217 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + A ++ I+LD S S++ A I+ M+ + Sbjct: 175 PGDEEEEEEEAAGTEIAIILDGSGSIDP------PDFQRAKDFISNMMRNF---YEKCFE 225 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 LV + I+ F L+ + +++ +++ G T + +++ + IF Sbjct: 226 CSFALVQYGEVIQTEFDLQDSQDVMASLARVQNITQVGNVTKTASAMQHVLDHIFTPSHS 285 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRS 332 KI+V +TDG+ + ++ N K +G +AIG + ++ Sbjct: 286 S------RKKASKIMVVLTDGDIFGDPLNLTTVI--NSPKMQGVERFAIGVGEAFQKAKT 337 Query: 333 HEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIV 366 + L+ AS + V N ++ S + ++I+ Sbjct: 338 DKELKLIASDPDETHAFKVTNYAALDGLLSKLQQNII 374 >gi|119512060|ref|ZP_01631154.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463286|gb|EAW44229.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 418 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 82/222 (36%), Gaps = 29/222 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I++ ++ + L++ ++LD S SM ++ ++ +++++ Sbjct: 21 SQRQLAVSISAIAEIQDRN-IPLNLCLILDHSGSMHGLP------LETVKQAAIGLVDKL 73 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 K + +V F ++ + ++++I L+ G T GL+ Sbjct: 74 KPGD------RLSVVAFDHRATVLVPNQTITNPGQIKKQINSLTADG-GTAIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +TDGEN ++Q+ L + A + +G Sbjct: 127 EELAKGKKETVSQA----------FLLTDGENEH-GDNQRCLKFAQLATGYNLTLNTLGF 175 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + + L A S ++ P D F+ + + T Sbjct: 176 GDKWNQDVLEKIADAGLGSLSHIQKPEQAADEFNRLFSRVQT 217 >gi|309789848|ref|ZP_07684427.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308228152|gb|EFO81801.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 420 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 78/209 (37%), Gaps = 30/209 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q +++ VLD S SM KID ++I+ ++ + + LV Sbjct: 38 AQVRTPVNVSFVLDRSGSM------KGDKIDRVRQAISLAVDRLDAQDIAS------LVI 85 Query: 226 FSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F ++ E ++ ++ + G T P ++ +I + Sbjct: 86 FDHRNEVLIPAAPVTDRRMIKDRVSRIRDAG-GTKIAPAVEKGLREI----------EKD 134 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 + + +V +TDG+ T+ + + L ++A + G + A+G+ + + L A S Sbjct: 135 RSGAIRRLVLLTDGQ---TENEDECLRRADDAGRIGVPITALGVGQDWNEDLLIEMANRS 191 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + + F + + I Sbjct: 192 GGTADYIARADEITEYFQNTVQRAQNSAI 220 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 79/228 (34%), Gaps = 27/228 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + R +M+ +D S SME+ D + ++D A ++ + ++++ Sbjct: 68 LTVALAGPTAQAKEPRNRAVVMLAVDTSLSMEA-TDVAPNRLDAAKEAAQSFVDDLTPG- 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V G+V+F+ + ++ I L+ T + + + I Sbjct: 126 -----VNLGIVSFAGIATVLVSPTTDRTVAKQAIDGLT-LDERTATGEAIISSLQTIELF 179 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 T+ IV MTDG+ + +Q + +A G V I Sbjct: 180 SKTLPPDGTDTGPPPARIVLMTDGKRTVGRTEQDAAQRAADA---GVPVSVIAFGTDNGS 236 Query: 334 --------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 E ++ A S F+ + + ++ +G+ I Sbjct: 237 ITVNDEVIPVPLDTEAMQQIAQISGGDFHQAASAEELKSIYAQLGEQI 284 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 55.2 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 70/175 (40%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD+S SM K+ +++ +L +++ + ++ FSN+I Sbjct: 262 VVFVLDISASM------VGAKLQQTREALVTILNDLRPQD------RFNIIGFSNRIKMW 309 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + + +LS G T+ L+ A + + ED + Sbjct: 310 KDHLLPVTPDNIRNGKIYMYHLSPTG-GTDINGALQAAIKLL---NNYVAQNDIEDRSVS 365 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 +I+F+TDG+ + +L + K+ ++ +GI + L + Sbjct: 366 -LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLS 417 >gi|156382057|ref|XP_001632371.1| predicted protein [Nematostella vectensis] gi|156219426|gb|EDO40308.1| predicted protein [Nematostella vectensis] Length = 297 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 76/206 (36%), Gaps = 25/206 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ ++D S S+ S + D + ++ ++ + P Q +V + + + Sbjct: 31 MDLVFLVDGSGSIGS------SNFDRLKEFVSTVIGGFVISPQG---TQISVVVYHSSAK 81 Query: 232 E--FFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + ++R I ++ G T + GL A+ + G R T Sbjct: 82 THLSFGDAQDLISVRRIISSIAYPSGPQTYTDRGLVEAHQRFAKENGARSSRTT------ 135 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++V + DG++ + S K G ++ A+G E L+ AS + Sbjct: 136 RVVVVINDGKSNGESLEASSKPL----KDEGIVIMALGFGSSVRVEELQTMASSQGDMHL 191 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWYDK 374 + A + IV ++ Sbjct: 192 YSTAI---ALKNDANSIVNSICMVNR 214 >gi|295841331|dbj|BAJ07080.1| von Willebrand factor, type A [uncultured bacterium] Length = 334 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 67/173 (38%), Gaps = 25/173 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+D S SM S K+ A ++++A LE++K + + GL+ F ++ Sbjct: 159 VYLVVDTSGSM------SGEKLAQAQEALSAFLEQIK-----GDRERVGLIEFETSVKPV 207 Query: 234 FLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L+ + L ++ + G T + A+ ++ D Sbjct: 208 VQLDELGNNRAALDLAVQRMEAAG-DTALLDAVYEAHQRL---------RKLGDEERINA 257 Query: 291 IVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +V MTDG EN S ++ + E +V+AI LRA P Sbjct: 258 VVVMTDGQENNSWISLRKLVPQLAEDWPVPVVVFAIAYGDDADIATLRAITEP 310 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 85/243 (34%), Gaps = 46/243 (18%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T + P+ S V D+ +++DVS SM+ ++ + + + Sbjct: 26 TGAETTSAPLPSKVL-------PADIRMIIDVSGSMKKTDPHNLRRPAVDL--------M 70 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGL 263 V+L+PD + ++G+ TF + W + ++ + T+ L Sbjct: 71 VRLLPDGS---KAGIWTFGQSVNLLVPYRLVDESWRQQAAKSA-SAINSVALHTHIGAAL 126 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--------YYCNEA 315 + A + D +++ +V +TDG E ++ + Sbjct: 127 EKAAQDVVA----------GDDGFRRNLVLLTDGVVDIDPEAVVNIQERKRILTELLPQL 176 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIV--TKRIW 371 K G +V+ I + E ++ A + F + ++ + AF I V + Sbjct: 177 KAAGYVVHTIALSQDADQELMKKLALTTDGVFAVAQSADELMQAFLTIFDQAVPAERVPL 236 Query: 372 YDK 374 DK Sbjct: 237 DDK 239 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 69/178 (38%), Gaps = 31/178 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++++D+S SM+ S++T+++ A + + + + Sbjct: 87 EVQTREAFGRDVLMLVDLSGSMDEADFTTADGSTLTRLNAAKNVLKTFIAK-RSGD---- 141 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-----GVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + Q + L + G ST+ + Sbjct: 142 --RFGLILFG---DAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAIGL------- 189 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G++ + +++++ +TDG T + + A RG +Y I + Sbjct: 190 --GIKVFEQNPQPSEQQVMIVLTDG--NDTGSFVEPVDAAKIAAARGIKIYIIAMGDP 243 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 75/212 (35%), Gaps = 28/212 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + V+D+S SM + ++D+ +S+ + +E++ Sbjct: 154 VRVGLATRAADRTGERPPAALTFVVDISGSM-----AEPGRLDLVKESLGLLADELRDDD 208 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQI 270 LVTFS++ E + V + +++ + +T+ST G++ Y+ Sbjct: 209 S------IALVTFSDEAETRLPMT-RVGEARGRVREVVNSLATTSSTNVEAGVRTGYDVA 261 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIR 328 + + +V ++D N E L E +K G ++ +G+ Sbjct: 262 ---------VDGHRKDATNRVVLLSDALANTGATEAGAILERIEEERKAYGITLFGVGVG 312 Query: 329 VIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 F+ A V F Sbjct: 313 SDYGDAFMERLADRGDGQTTYVSTSAQARKVF 344 >gi|326670666|ref|XP_003199265.1| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 1455 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 22/192 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIE 231 D++ +LD S D S + + I M+ ++ + D+ V+ +V +S + + Sbjct: 823 DIVFLLDGS-------DDSRNTLLTIREFIRRMVLDLDIDQDI---VRVAVVQYSEDPLI 872 Query: 232 EFFLLEWGVSH-LQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + + I L+ G + N+ L+Y + +F +H + Sbjct: 873 HFLLNTYNSKKAVLYAINGLTAKGGRNINTGAALQYVRDHVFTTASGSRH----HLGVPQ 928 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ MT G ++ D + K G + AIGI+ E SP + + Sbjct: 929 VLIVMTGGRSIDQVAD-----PAEDLKNFGVLSIAIGIKNALESELQTIAFSPRFIFNLP 983 Query: 350 NPHSMYDAFSHI 361 + I Sbjct: 984 VSGELLHIQPDI 995 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 60/172 (34%), Gaps = 21/172 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + ++ I+ + + ++ + + T+ + Sbjct: 362 SVNIGFLIDGSSSVGE------SNFRLMLEFISNVAKAFEISDIGSKIATVQF-TYDQRT 414 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + I+ + T + + + +F + N Sbjct: 415 EFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFGPVKDGANKN--------F 466 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +V +TDG+ A+K G V+++G+ + L+ AS Sbjct: 467 LVILTDGQ-----SYDDVRGPAVAAQKAGITVFSVGVAWAPLDD-LKDMASE 512 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 73/211 (34%), Gaps = 23/211 (10%) Query: 128 SISAIS-RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S S + L+ P + K D + D+ ++D S Sbjct: 116 SFSVTPGTNNLALEAVGRSVATARPATGKRPKKTLEKKAGNKDCKADIAFLIDGS----- 170 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQ 244 ++ + ++ + + + + + +V G+V S + F L+ + Sbjct: 171 -YNIGQRRFNLQKNFVGKVAVMLGIGTEGPHV---GVVQASEHPKIEFYLKNFTAAKEVL 226 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 IK L G ++N+ LK+A + F M+ N KIIV DG E Sbjct: 227 FAIKELGFRGGNSNTGKALKHAAQKFFSME------NGARKGIPKIIVVFLDGWPSDDLE 280 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + A++ G V+ + + + E Sbjct: 281 EAGIV-----AREFGVNVFIVSVAKPTTEEL 306 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 65/187 (34%), Gaps = 31/187 (16%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRS--------------LVH 60 +G + + TA+ + ++ ++ ++V +++ M+T L +D + Sbjct: 22 RRGNIIVFTAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQA 81 Query: 61 AATQ-IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 AT+ ++ G++ + ++ +I + F + + ST Sbjct: 82 KATEYLVRNPVGSSMTFVNEEEVPAKIAQFVAEH------GDDFEVEAGEWNASTRSFET 135 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 S+S + P TF + +S + D+M+VLD Sbjct: 136 TNTLPSTLSVSM----EYPTMP-TFFGKILGKDSFSIRASSVAMYQPR-----DIMVVLD 185 Query: 180 VSRSMES 186 S SM Sbjct: 186 FSGSMND 192 >gi|242016552|ref|XP_002428850.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513586|gb|EEB16112.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1945 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 77/203 (37%), Gaps = 35/203 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +T ++++I++D S S+ +K I ++ +V++ P + G++ + Sbjct: 28 KTHKEVELIILVDGSYSVGP------KNFLSEMKYIQKVISDVEVGPKA---FRLGVIIY 78 Query: 227 SNKIEEFFLLEWGV-----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S + + + + S L +I + G +TN+ L+ A + Sbjct: 79 STQAVDHLSVTDELDLEKCSLLHVQIPKIKYPGKNTNTKEALEKAEKILTRA-------- 130 Query: 282 TEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 KII +TDG N+ L K + I+Y GI + E L+ + Sbjct: 131 --RKGAVKIIFLITDGFSNMGNP-----LPMAQILKDQDTIIYTFGIINGNARELLKISS 183 Query: 341 SPNSFYLVENPHSMYDAFSHIGK 363 P +E ++ +F K Sbjct: 184 QP-----IEEHAYLFTSFDEFEK 201 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 31/168 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLD+S SM TK+ + +++ +++ + Sbjct: 224 VHLKAVTGDQISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD 277 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 278 ------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANG-GTNIVDGLRKGAKV 330 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE-----NLSTKEDQQSLYYC 312 + D + I+ ++DG N + +L C Sbjct: 331 MEDRLERNSVAS---------IILLSDGRDTYTTNHPDPSYKDALAQC 369 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 31/168 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLD+S SM TK+ + +++ +++ + Sbjct: 225 VHLKAVTGDQISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD 278 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 279 ------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANG-GTNIVDGLRKGAKV 331 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE-----NLSTKEDQQSLYYC 312 + D + I+ ++DG N + +L C Sbjct: 332 MEDRLERNSVAS---------IILLSDGRDTYTTNHPDPSYKDALAQC 370 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 85/222 (38%), Gaps = 29/222 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I S+V L++ ++LD S SM + ++ K+++ +++++ Sbjct: 21 SQRQLAVSI-SAVGETLDRRVPLNLCLILDHSGSM------NGRALETVKKAVSLLVDQL 73 Query: 210 KLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + +V F ++ I + + ++++I L+ G T GL+ Sbjct: 74 SSED------RLSIVVFDHRAKILVPNQIISDRNQIKQQINRLTADG-GTAIDEGLRLGI 126 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++ + +TDGEN ++ + L + A + +G Sbjct: 127 EELAKGKKDTISQA----------FLLTDGENEH-GDNNRCLKFAQLAASYNLTLNTLGF 175 Query: 328 RVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + + L A + +E+P+ D FS + + T Sbjct: 176 GDNWNQDILEKIADAGLGNLSHIEHPNQAVDKFSRLFSRMQT 217 >gi|71899173|ref|ZP_00681336.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] gi|71731031|gb|EAO33099.1| conserved hypothetical protein [Xylella fastidiosa Ann-1] Length = 941 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 55/184 (29%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQIAPSVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F + + L G++ L+ + F T P A Sbjct: 200 VDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 259 ---------------APSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGINI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|116623354|ref|YP_825510.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226516|gb|ABJ85225.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 328 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 43/316 (13%), Positives = 102/316 (32%), Gaps = 46/316 (14%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ K+++ + ++ F L + D ++ P Sbjct: 3 KVAGRPAMPGSKRIRKVLRISSAIGILSLGFSGGLSAAILLQD----QPRVAITPREKPA 58 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNS---------RHIVMPITSSVKVNSQTDARLDM 174 + ++ K+ L T + R + V S+ + + + Sbjct: 59 SANPNMRVDV--KMILVPITVMDPADKPVENLKPDVFRVFEDDVEQHVVSLSKEEGPVSV 116 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++D S SM++ D KS+ A+ + ++ + LV FS+K Sbjct: 117 GFIIDASSSMKNRMD----------KSVAAIQQFLRTNMAGDEYF---LVRFSDKPTMVT 163 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + ++ + G + + D + + N ++ + + Sbjct: 164 TFTHNPAEIASELTLIKPLG------------WTALHDAIYLGTQQMRKAKNSRRALFVL 211 Query: 295 TD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIG-IRVIRSHEFLRACASPNSFYLVENPH 352 TD G+N S + + + + VYAIG + E L A + + ++ Sbjct: 212 TDGGDNNSRYTEAEVKNFVV---ESDVRVYAIGLFERPKFLERLAALTGGEALW-AKSLK 267 Query: 353 SMYDAFSHIGKDIVTK 368 + DA I ++ + Sbjct: 268 DLPDAIERISREFRNQ 283 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 66/176 (37%), Gaps = 13/176 (7%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 N + +G + ++TA+ LP++ G+ I++ +++ +KT + S +D ++ ++ N+G Sbjct: 4 NLGRHEEGAIAVITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLPNQG 63 Query: 70 ------------NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 NG + + N N N + ++ Sbjct: 64 LAMTTANSFITSNGFDPNDATITYTQDTVNNPAGSPEINCSMTNQVPTFFLGLFGYPNIS 123 Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 I V + + A + + F + ++ I + ++ ++ + ++ Sbjct: 124 ITVSAKGILQTGGAGGPFNYAI-FSDQNLPISGNQKITGSVHTNHQLTISGNTKIS 178 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLD+S SM TK+ + +++ +++ + Sbjct: 225 VHLKAVTGDQISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD 278 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 279 ------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANG-GTNIVDGLRKGAKV 331 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGI 327 + D + I+ ++DG + T D + V++ G Sbjct: 332 MEDRLERNSVAS---------IILLSDGRDTYTTNHPDPSYKVMLPQ-----ISVHSFGF 377 Query: 328 RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 378 GSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQ 412 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLD+S SM TK+ + +++ +++ + Sbjct: 199 VHLKAVTGDQISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD 252 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 253 ------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANG-GTNIVDGLRKGAKV 305 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGI 327 + D + I+ ++DG + T D + V++ G Sbjct: 306 MEDRLERNSVAS---------IILLSDGRDTYTTNHPDPSYKVMLPQ-----ISVHSFGF 351 Query: 328 RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 352 GSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQ 386 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + +++ A +D++ VLD+S SM TK+ + +++ +++ + Sbjct: 224 VHLKAVTGDQISQYRRAPIDLVTVLDISGSMGG------TKLALLKRAMGFVIQNLGSSD 277 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ FS+ F L G + + L G TN GL+ Sbjct: 278 ------RLSVIAFSSTARRLFPLTRMSDAGRQLALQAVNSLVANG-GTNIVDGLRKGAKV 330 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGI 327 + D + I+ ++DG + T D + V++ G Sbjct: 331 MEDRLERNSVAS---------IILLSDGRDTYTTNHPDPSYKVMLPQ-----ISVHSFGF 376 Query: 328 RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + + + S +F +E+ + DA + Sbjct: 377 GSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQ 411 >gi|126722583|ref|NP_001075745.1| voltage-dependent calcium channel subunit alpha-2/delta-1 preproprotein [Oryctolagus cuniculus] gi|116409|sp|P13806|CA2D1_RABIT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|164763|gb|AAA81562.1| dihydropryridine-sensitive calcium channel alpha-2 subunit [Oryctolagus cuniculus] Length = 1106 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 249 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 296 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 297 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 355 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 356 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 402 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 403 WMACENKGYYYEIPSIGAIR 422 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 65/172 (37%), Gaps = 22/172 (12%) Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 +++ +++E+ D + G+ F++ L + + + + TN Sbjct: 661 VEATRNVIDELDPSAD-----RVGVYDFASSGRALHPLSDDLESAKESV--VGTAYGGTN 713 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 GL+ A N + + ++I++ ++DG+N +T D++ + + Sbjct: 714 MAAGLEAALND---------YATRGTDDRERIVILLSDGKNSNTANDERMDELADRSDDL 764 Query: 319 GAIVYAIGIRVIRSHEFLR------ACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ +G+ + A + ++Y +P + D F I + Sbjct: 765 DYTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDE 816 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/250 (16%), Positives = 87/250 (34%), Gaps = 21/250 (8%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G I TA+ L +F ++ +E + K L + + + T N+ + Sbjct: 5 RQQRGAAGIYTALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTA--NQDDKT 62 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 +L + I+N ++ R G D + + + Y ++A Sbjct: 63 YENQLATNYVQTYIRNIAIINDIKVERSEGI-----DYYPTPDGNEER--EYFQYRVTAK 115 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + L IP ++ + + + D +D++ V D S SM+ Sbjct: 116 TDHTSWLSS-DIIPSFSPTETVANRALARNYPIYLGDKDIDIVFVSDFSGSMKGN----- 169 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 KI +I A+ E+ + D V + V ++ +++E ++ Y Sbjct: 170 -KIRALKDAIQAIANEILVPRDGEVEVTNRIAFVPYNMRVQEKRS---NTRWCLTQLDYR 225 Query: 251 SKFGVSTNST 260 F S+ Sbjct: 226 PNFNGGNYSS 235 >gi|161086896|ref|NP_001104313.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform a [Mus musculus] gi|46576352|sp|O08532|CA2D1_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|1905817|gb|AAB50138.1| voltage-gated calcium channel alpha2/delta subunit, alpha2a isoform [Mus musculus] gi|148671294|gb|EDL03241.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Mus musculus] Length = 1103 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|116622522|ref|YP_824678.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225684|gb|ABJ84393.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 324 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 80/231 (34%), Gaps = 41/231 (17%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + ++ S D + + +V D+S SM K+ + + K Sbjct: 78 LEDRVAQTITHFSMDDEPVAVGLVFDISGSMGP-------KLQKSRMAAAEF---FKTAN 127 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + LV F+++ + L V +Q ++ + G T + A +++ Sbjct: 128 PDDEFF---LVEFNDQPKMVVPLTRDVEQIQNQLTWAQSKGR-TALLDAIFLAMSEL--- 180 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +K ++ ++DG + S++ + N ++ ++YAIG+ Sbjct: 181 --------KKSTKNRKALLIISDGGDNSSRYTES--EVRNLVRENDVLIYAIGVYEFAGG 230 Query: 334 E-----------FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L + + + + + D + IG ++ + + Sbjct: 231 RMRTPEEAGGPGLLTELSEQTGGRHLPA-DANELPDIAAKIGVELRNRYVL 280 >gi|307324381|ref|ZP_07603589.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890112|gb|EFN21090.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 249 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 61/160 (38%), Gaps = 27/160 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + R + +VLD S SM ++ D S+ + ++ A L++ ++P Sbjct: 38 SLAKHGADGQRAAVYLVLDRSGSMRPYYRDGSVQHLAEQTLALAANLDDDGIVP------ 91 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 +V FS +IE + + Q +I L TN ++ + Sbjct: 92 ---VVFFSTEIEGTAEIS--LDAYQDRINPLHDSMGHMGRTNYHVAMQAVID-------- 138 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 H + A+ +VF TDG S + L C AK Sbjct: 139 --HYQSSGADDPAFVVFQTDGSPTSKAAAEHVL--CTAAK 174 >gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 730 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 89/252 (35%), Gaps = 37/252 (14%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 ++ + ++ +V D V + + + + S S ++ + + F+ Sbjct: 207 NYIKKGDESSWVFSASDYVPRSDVSLALTVTGGTLSSSIMNYWDEADRRGYFLL------ 260 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 +V P + D++ +LD+S SM S KI+ A ++ +L+ + Sbjct: 261 TLVPPR------EPERIIPKDIVFILDISGSM------SGQKIEKAKLALLQVLQMLHEG 308 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQI 270 + ++TF+N++ S + + G TN L + Sbjct: 309 D------RFSIITFNNEVNNLTERLLPFSDRTEWYPAVKQIMAGGMTNIHDALLEGIEVL 362 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE--AKKRGAIVYAIGIR 328 ++ K+++F+TDG D ++ + AK R ++ G+ Sbjct: 363 GTQSTDDRY---------KVVLFLTDGAPTEGITDIGTIIRDSTKLAKVRDVHLFVFGVG 413 Query: 329 VIRSHEFLRACA 340 + E L A Sbjct: 414 YDVNAELLDELA 425 >gi|162451432|ref|YP_001613799.1| hypothetical protein sce3160 [Sorangium cellulosum 'So ce 56'] gi|161162014|emb|CAN93319.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 404 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 70/194 (36%), Gaps = 23/194 (11%) Query: 174 MMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +MIV+D S SM F+D ++ ++++ ++ + V+ G +TF+ Sbjct: 110 VMIVVDRSGSMYGSGFWDP-------LKTAVLSVVDRLQ------DRVRFGFLTFTGTAN 156 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNST--PGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L G + ST PG + + + K Sbjct: 157 QQCPLLAGADGIALNHHAAIAAAYDEASTVPPGKLETPTAMTFNETVVPELLAFPEPGPK 216 Query: 290 IIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 I+F+TDGE + + +A ++G + G+ + L+ A+ Sbjct: 217 YILFVTDGEPDRCDDVRAECARDDVVGAVQDAYEQGIGTFVFGLGSGALAQHLQDVANAG 276 Query: 344 SFYLVENPHSMYDA 357 + VE P S DA Sbjct: 277 AGQPVERPRSGTDA 290 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 75/172 (43%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++++ ++ +I +D+S SM + D + ++++ + ++ +++ K+ D + Sbjct: 79 GTKMDTRKRQGIEAIIAMDISNSMMAE-DVTPSRLEKSKMLVSNIVD--KMTDD-----K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ ++ + + + ++ ++ ++T T +K A + Sbjct: 131 IGLIVYAGEAYTQLPITSDYVSAKMFLETINPSMITTQGTD-IKQAIDLAMKS------- 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + + K I +TDGE+ + + A ++G VY +G+ + Sbjct: 183 FTPNQDVSKAIFVITDGEDN----EGGVVEMAKAAAEKGIKVYVLGVGSPQG 230 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 71/212 (33%), Gaps = 38/212 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL+++ ++D S S+ +K + +L + ++P + +VTFS+K Sbjct: 83 RLELVFLVDESSSVG------QANFLSELKFVRKLLSDFPVVPTA---TRVAIVTFSSKN 133 Query: 231 EEFFLLEWGVSH---------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++ S L R+I ++ G T + + A + Sbjct: 134 NVVPRVDYISSRRAHQHKCALLSREIPAITYRGGGTYTKGAFQQAAQILRHS-------- 185 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 N K+I +TDG + + G ++ GI E + Sbjct: 186 --RENSTKVIFLITDGYSNG----GDPRPIAASLRDFGVEIFTFGIWQGNIRELNDMASI 239 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P E + +F + + + + D Sbjct: 240 PK-----EEHCYLLHSFEEF-EALARRALHED 265 >gi|229550752|ref|ZP_04439477.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] gi|229315887|gb|EEN81860.1| probable yvcC protein [Lactobacillus rhamnosus LMS2-1] Length = 898 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 97/292 (33%), Gaps = 21/292 (7%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA L ++I + M T L + + AA N Sbjct: 1 MTKMTAKVARTGHLFAVLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN D D + S P Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVSGWDNNTTWDGDATNTTNSYLKFGDPNKPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES S + Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESN-SSGTNRAG 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + L ++ + N V GL+ FS+ + L + + ++ Sbjct: 163 AVRTGVKNFLTSIQNA-GLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKAGNASQQQAIN 221 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 +P + ++ N + + KK+++ +TDG + E S Sbjct: 222 GALSPRFQGGTYTQIGLRQGSAMLNADASGNKKMMILLTDGVPTFSNEVINS 273 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 14/152 (9%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQI-------FDMQGM 276 F + ++ +S + L +TN+ G+ + + + + Sbjct: 304 FDDADCGVSPIQAHLSTRAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQWQGRWGQGAA 363 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 A+ +KI+V +DGE++ + D++ L C E K++G VY + Sbjct: 364 ELPRPYGQADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAF--EGDA 421 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 F+ CAS S ++ + + I Sbjct: 422 RFVAQCASDRSHAYKATSGNIRTVLTRLASAI 453 >gi|89053332|ref|YP_508783.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88862881|gb|ABD53758.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 686 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 69/190 (36%), Gaps = 21/190 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW +++ + + I + V + L+++ ++D S SM K+ + I+S M Sbjct: 302 PWNPDTQLVHIGIQGDLPV-VEDRPPLNLVFLIDTSGSMN-----DPAKLPLLIQSFRLM 355 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 L + +V V +G + + + + + L G STN GL+ Sbjct: 356 LNRLSPEDEVAIVTYAGSAGVALEPTAASD----TATINAALTTLQA-GGSTNGVGGLEE 410 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYA 324 AY D + ++ TDG+ D +L Y E + G + Sbjct: 411 AYR--------LAGEMMVDGEVSR-VLLATDGDFNVGLSDAGALEDYIAEQRDTGIYLSV 461 Query: 325 IGIRVIRSHE 334 +G + Sbjct: 462 LGFGRGNLQD 471 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/271 (12%), Positives = 91/271 (33%), Gaps = 31/271 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 +G IL LP + + + +++S + L + + + ++ Sbjct: 1 MKSIKKNRGVAGILFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLA----LIAS 56 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + K +K + + + + ++ F + + P + Sbjct: 57 PKEDEDKNVKYA---RYLVDRYILDNSEDVDVAVFTRKCEYKDGCVQASGELAP-FSDFV 112 Query: 129 ISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 +SA ++Y I + P +T ++ +D+ + D S SM + Sbjct: 113 VSATAKYTSWISYEDVDLEPEFT--------VSGRAVTRKYLPQSVDVYFIGDFSGSMGN 164 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + + K+D+ ++I ++++++ + + L+ ++ H+++ Sbjct: 165 SWKNGKMKLDVVKETIKRVVDDIEKF-NTEEKSRVALLGYN------------PFHVKQT 211 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 K + + KYAYN G Sbjct: 212 DKTVRVNAYGYYGSWRKKYAYNYARSSPGTT 242 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 86/226 (38%), Gaps = 33/226 (14%) Query: 141 FCTFIPWYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-------SFFDSSI 192 F T TN +++ + + + + LD++ VLDVS SM + Sbjct: 38 FPTIPRGQTNKDFQVLLRVEAPPAADLNSHVPLDVVAVLDVSGSMNDPVAAASPKSNLQG 97 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL-----LEWGVSHLQRK 246 +++D+ S+ ++ ++ + +V F++ ++E+ G S +K Sbjct: 98 SRLDVLKASMKFVIRKLADGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKK 151 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 I L G T P L+ A + + QG ++ I+ +TDG++ + Sbjct: 152 IDRLQARG-GTALMPALEEAVKILDERQGSSRNRVG-------FILLLTDGDDTTGFRWT 203 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRAC-ASPNSFYLVEN 350 + + AK V+ G+ L S ++ V++ Sbjct: 204 RDAIHGAVAK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD 246 >gi|198424351|ref|XP_002122740.1| PREDICTED: similar to collagen, type XXVIII [Ciona intestinalis] Length = 870 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 80/211 (37%), Gaps = 21/211 (9%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P + N + +D+ ++D S S+ D + I ++ Sbjct: 640 LPGDCAAPPNRCQEYPVDITFLVDGSDSI------DREDFDRVKQWILLTVDAFNPGE-R 692 Query: 216 NNVVQSGLVTFSNK-IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N + +V FS + + E + + +++ + + T + L++ +++ + Sbjct: 693 RNPLHVDVVQFSERSMIEVDSDVASSAQISDQVQGIIQMRSGTKTYSALEFVNREVWPL- 751 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KI++ MTDG+ ED+ EA + ++ A+G+ Sbjct: 752 --------LRPGAFKILITMTDGD---ASEDRNVDAI-EEANNKYNLMAAVGVGSKVISS 799 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 L +S + + V+N ++ S I ++I Sbjct: 800 ELEDFSSTGNVFNVDNFSALESIISDIVENI 830 >gi|218778177|ref|YP_002429495.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759561|gb|ACL02027.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 558 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 77/227 (33%), Gaps = 25/227 (11%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKV--NSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 P + +R ++ + + +++ +LDVS SM S K+ Sbjct: 159 PFSITMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPSNLVFLLDVSGSMNSE-----NKL 213 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + +S+ +++E+ V+ V +G + + + L G Sbjct: 214 PLVKRSMEMLVKELGAGDRVSIVTYAGSAG----LVLPSTSARNKRKIITALDRLEA-GG 268 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNE 314 ST G++ AY ++ + ++ TDG+ N+ + + E Sbjct: 269 STAGGEGIELAYRVAWENLIPEGNNR---------VILCTDGDFNVGVSSTPELVRMIEE 319 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAF 358 ++ G + G + + S +FY +++ + F Sbjct: 320 KRRAGIYLTICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVF 366 >gi|320352629|ref|YP_004193968.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121131|gb|ADW16677.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 577 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 44/285 (15%), Positives = 100/285 (35%), Gaps = 48/285 (16%) Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV------------- 121 K+ + + K WN N +++NGF+N D + + S+D+ Sbjct: 78 KRAAVSEAMPIGKPAWNRESYNAIQENGFINAARDPLSTFSIDVDTASYTNVRRFIQGGH 137 Query: 122 -PQNEGYSISAIS---RYKIPLKFCTFIPWYTNSRHIVMPI----------TSSVKVNSQ 167 P I + Y P + P+ + P ++ + + Sbjct: 138 LPPVGAVRIEEMINYFTYAYP-RPIGKAPFALGAEVGPSPFHRDYLLARIGLAAKDLAKE 196 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ ++DVS SM+ K+ + +++ ++ ++ V VV +G Sbjct: 197 HLPPSNLVFLIDVSGSMQ-----DGNKLPLLKQALPLVVRQLGARDRVALVVYAGA---- 247 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + + L G ST+++ G++ AY R+ N Sbjct: 248 DSVVLPPTPGDRQQEILAALDQLQA-GGSTHASSGIRTAYEL------ARKSFIKGGNNR 300 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ +DG+ N+ + E +K G + +G+ + Sbjct: 301 ---VILASDGDFNVGVTSRDELTRLIEEERKDGIYLTVLGLGMGN 342 >gi|297279792|ref|XP_002801791.1| PREDICTED: integrin alpha-10-like isoform 1 [Macaca mulatta] gi|297279794|ref|XP_002801792.1| PREDICTED: integrin alpha-10-like isoform 2 [Macaca mulatta] Length = 1167 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKTCEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|1098898|gb|AAC46961.1| CTRP [Plasmodium falciparum] Length = 2098 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 87/266 (32%), Gaps = 24/266 (9%) Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV-PQNEGYSISAISRYKIPLKFCTFIP 146 + + F ++ DNG V ST P N S P P Sbjct: 745 KSVSEVFVKKICDNGVVLPPGSPSESTPGSPSESTPGNPSESTPGSPSESTPGNPSESTP 804 Query: 147 WYTNSRHIVMPITSSVKVNSQT--DARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSIN 203 + P S+ ++ D+ +++D S S+ S ++ + + Sbjct: 805 GSPSESTPGSPSESTPCSGTECLCHNTYDLTLIIDESASIGYSNWE---------KEVVP 855 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNS 259 + + V G++ FS+KI EF + ++L R+I L K+ Sbjct: 856 FTIGLASNLEISEKKVNMGILLFSDKIREFIKYGQKESYDKNNLVRRIHDLKKY----YK 911 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + G Y + + N K+ + +TDG N T + + + KK Sbjct: 912 SGGFSYIVEALKYGLYSYAKSTSSRLNVPKVNILLTDGNNTDTSDFILT-EVSSLYKKEN 970 Query: 320 AIVYAIGIRVIRSH--EFLRACASPN 343 + IGI H L C + Sbjct: 971 VKLLLIGIGGPTIHKLRLLGGCDKSD 996 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIE 231 D+ ++LD S S+ S + ++ +K + N V G++ FS+K Sbjct: 85 DLTLILDESASIGS---------KNWKNHVIPFTDKIIKDLTISKNEVHVGILLFSSKNR 135 Query: 232 EFFLLEWGVSHLQRKIKYLSKF-------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + + ++ + G T LKY ++ +H N Sbjct: 136 DYVTYGDELRYQKDELLKKVEKLKKDYYCGGGTKILGALKY------SLENYTKHKNI-R 188 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K+ + TDG N ++ ++Q L ++ + +G+ ++ + Sbjct: 189 YDAPKVTILFTDG-NENSASNKQLLEMGLTYRRERVKLLVLGVAAAEDNKLKLIAGCEEN 247 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 + + ++ + I K + K Sbjct: 248 TNCPYSMKAEWETINDITKRLTNKIC 273 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 70/177 (39%), Gaps = 27/177 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D+ ++LD SRS+ ++ K + E+V + + + G++ F+ ++ Sbjct: 305 DLTLILDESRSIT---------LNKWKKDVVPFAEKVLNNLNIDKDKIHVGIMRFAKSMK 355 Query: 232 EFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +++ + L + +G +T+ L+Y ++ +H N Sbjct: 356 TDIGYEQETRYMKNDLIKLVRELKDKYGYGGATHLVDALQY------SLKTFTRHPN-NR 408 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRAC 339 + K+ + TDG N ++K+++ +K + +G+ + L C Sbjct: 409 VDAPKVTILFTDG-NETSKKEKDIRDVGLLYRKENVKLIVVGVNLATEKSLKLLAGC 464 >gi|325954651|ref|YP_004238311.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437269|gb|ADX67733.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 336 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 68/180 (37%), Gaps = 26/180 (14%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 M V+ + +D++ +LDVS SM++ D + +++ A + I+ L + Sbjct: 73 MDPLGGVEEREVSREGIDIVYLLDVSTSMDAQ-DVAPSRLMKASRIISQSLNSLGGD--- 128 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFD 272 ++ LV F+ L + + + LS T+ + + A + + Sbjct: 129 ----RAALVIFAADGYTISPLTNDYAAIDSYLGSLSTNLISNQGTDFSAAFREAVSVL-- 182 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N K++V ++DGE + QS+ N V +IGI + Sbjct: 183 ---------KGAPNTSKLVVLLSDGE-DHESGENQSIKLAN---DNQIHVVSIGIGTDKG 229 >gi|309792307|ref|ZP_07686777.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225622|gb|EFO79380.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 423 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 78/213 (36%), Gaps = 34/213 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++++ +++ + + L++ +V+D S SM ++ + N +++ + Sbjct: 26 YVLLEVSAHGVPTNISKLPLNLCLVIDRSSSMRG------ERLQQVKDAANRIVDMLTDD 79 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTP---GLKYAY 267 LVTF+++ E + L+R I + G + +T L+ Sbjct: 80 D------YLSLVTFNDRAEVVIPAQRPQHRPDLKRMISGIEAAGGTEMATGMALALQETQ 133 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R I+ +TDG ++ + + A+ RG + A+GI Sbjct: 134 RPLMGRGVSR-------------ILLLTDGRTYG--DEGRCVEIARRAQGRGVGLTALGI 178 Query: 328 RVIRSHEFLRACASPNS--FYLVENPHSMYDAF 358 + + L ++ + + + + F Sbjct: 179 GGEWNEDLLETMSARENSRTQYITSAAEITQIF 211 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 70/175 (40%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++ ++ MI +D+S SM + D + T++D + + + ++ + Sbjct: 79 GTKISKDKREGIEAMIAMDISNSMLAQ-DVAPTRLDRSKRLVEDLVNRFTND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMR 277 G+V F+ + + + +S T+ + + Sbjct: 131 IGIVVFAGDAFVQLPITSDYISAKMFLNNISPELIGSQGTDIGKAI-----------ELS 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +H +E AN+ K I+ +TDGEN ++ + A+K+G V+ +GI + Sbjct: 180 EHSFSEKANFGKAIIIITDGENHEKGAEEMARE----AQKQGIRVFILGIGSPQG 230 >gi|283807143|pdb|3K6S|A Chain A, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807145|pdb|3K6S|C Chain C, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807147|pdb|3K6S|E Chain E, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807149|pdb|3K6S|G Chain G, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807156|pdb|3K71|A Chain A, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807158|pdb|3K71|C Chain C, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807160|pdb|3K71|E Chain E, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807162|pdb|3K71|G Chain G, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807164|pdb|3K72|A Chain A, Structure Of Integrin Alphax Beta2 gi|283807166|pdb|3K72|C Chain C, Structure Of Integrin Alphax Beta2 Length = 1095 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 121 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 169 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 170 SLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 224 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 225 -YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 281 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 282 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 313 >gi|115374040|ref|ZP_01461329.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|310819324|ref|YP_003951682.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] gi|115368930|gb|EAU67876.1| thrombospondin type 3 repeat family [Stigmatella aurantiaca DW4/3-1] gi|309392396|gb|ADO69855.1| MtsD protein involved in cell-cell cohesion [Stigmatella aurantiaca DW4/3-1] Length = 655 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 36/192 (18%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-------ESFFDSSITKID--------- 196 + + ++ ++ + + +++V+D S SM D ++D Sbjct: 41 TLAGRVCTA--PSNPSGFPVKVVLVIDQSGSMCVSDPPGSQGVDGFCEQVDDILLPPGVL 98 Query: 197 --MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR-------KI 247 ++++ ++ + + P+ VQ +V F ++ + + R I Sbjct: 99 EPARVRALKRLVNQFRQQPN----VQISIVPFETNVKNVWPPATTGNRFARPDASLDTYI 154 Query: 248 KYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IVFMTDGENL---ST 302 + L ++ G T+ + YAY+ I N E + +VF+TDG S Sbjct: 155 RGLQNQLGKGTDYQGAVGYAYSLIASDINAVSASNPEVLPRTRYVVVFLTDGTPYPRCSA 214 Query: 303 KEDQQSLYYCNE 314 ++ + + Sbjct: 215 NDNLSAYATPDS 226 >gi|25149045|ref|NP_741200.1| DIsplaced Gonad family member (dig-1) [Caenorhabditis elegans] gi|74961937|sp|Q09165|DIG1_CAEEL RecName: Full=Mesocentin; Flags: Precursor gi|22331940|gb|AAM78593.1| mesocentin [Caenorhabditis elegans] gi|26251538|gb|AAA50715.2| Displaced gonad protein 1, isoform a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 13100 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 28/196 (14%) Query: 141 FCTFIPWYTNSRHIVMPIT------SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 C +P SR P T + + S D D++IVLD S E+F Sbjct: 12341 ICKIVPNVETSRTWPTPRTKATTTSGTGRSCSSIDYESDVIIVLDSS---ENFTPDEFVS 12397 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + A+ SI L PDV+ + G V +S+K+ L H + KI+ L K Sbjct: 12398 MKDAVASIVD--TGFDLAPDVS---KIGFVIYSDKVAVPVAL----GHYEDKIELLEKIT 12448 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + G+ A ++ + RQ N K+++ +T+G+N + Sbjct: 12449 DAEKINDGVAIA---LYGLNAARQQFQLHGRENATKVVILITNGKNRGN-----AAAAAE 12500 Query: 314 EAKKR-GAIVYAIGIR 328 + + G ++A+ + Sbjct: 12501 DLRDMYGVQLFAVAVG 12516 >gi|34452173|ref|NP_000878.2| integrin alpha-X precursor [Homo sapiens] gi|146345441|sp|P20702|ITAX_HUMAN RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leu M5; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor Length = 1163 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 189 SLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|23398603|gb|AAH38237.1| ITGAX protein [Homo sapiens] gi|123983342|gb|ABM83412.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] gi|123998043|gb|ABM86623.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] Length = 1169 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 189 SLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|60654537|gb|AAX29959.1| integrin alpha X [synthetic construct] Length = 1170 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/216 (18%), Positives = 78/216 (36%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 189 SLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQNQLKEK 332 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 75/242 (30%), Gaps = 62/242 (25%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +++ +DVSRSME+ D + T+I A ++ ++ + V+ G+V+F+ Sbjct: 86 ITLVLAMDVSRSMEA-TDVAPTRISAAQQAARDLIVGLPAS------VRLGIVSFAGTAT 138 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------------- 275 + I + T + GL A +F G Sbjct: 139 VVLRPTSNRQDMLDAIDRF-QLQRGTATGSGLIQALAVLFPDDGIDLEAILFADEPVFST 197 Query: 276 ----MRQHCNTEDANYKK---------------IIVFMTDGENLSTKEDQQSLYYCNEAK 316 DA K+ ++ ++DG + + + Sbjct: 198 RRAVPLDEAAAADAVRKREQATQSAQPGSYRHGAVILLSDGRRTVGPDPVDAARMAAQ-- 255 Query: 317 KRGAIVYAIGIRVIRSHE--------------FLRACA--SPNSFYLVENPHSMYDAFSH 360 RG VY +G + + LRA A + ++ + + + Sbjct: 256 -RGVRVYTVGFGTLGGNAPETSLSYYMQLDEPALRAVATITGAEYFQAGSAADLSQVYRQ 314 Query: 361 IG 362 + Sbjct: 315 LS 316 >gi|4809045|gb|AAD30063.1| Itgae protein [Mus sp.] Length = 895 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 150 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 203 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 204 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 260 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 261 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 312 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 313 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 368 >gi|307154093|ref|YP_003889477.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984321|gb|ADN16202.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 240 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 86/237 (36%), Gaps = 23/237 (9%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 L + +++ SS + L++ ++LD S SM + + Sbjct: 5 YSLSQSLIAVNSAATVDLLISFISSASSKDSSRRPLNLSLILDRSGSMAG------SALK 58 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A + ++E + ++ VV F I ++ + ++ +I ++ G + Sbjct: 59 YAKMAAQKLVEYLTPEDILSVVVYD---DFVETIVAPQPVKD-QAAIKNQINRINARGCT 114 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEA 315 S + ++ + + E N ++ +TDG N+ + + E Sbjct: 115 NLSGG-------WLTGCDHVKANLSAERLNR---VLLLTDGLANVGNSDPKILTKTATEK 164 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 ++G I +G + + L A+ +FY +++P F + +V+ + Sbjct: 165 AEQGIITTTLGFGTYFNEDLLINMANGGKGNFYFIQSPQDAAQVFEIEIESLVSDAV 221 >gi|149758640|ref|XP_001499491.1| PREDICTED: similar to integrin, alpha 10 isoform 1 [Equus caballus] Length = 1167 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + + Sbjct: 133 GSSVFSSGICARVDASFQLQGSLAPTAQRCPTYMDVVIVLDGSNSIYPWSE--------V 184 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 + ++ + + P +Q GLV + + L + ++ + G Sbjct: 185 RTFLRRLVGRLFIDP---EQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPTALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|149758642|ref|XP_001499504.1| PREDICTED: similar to integrin, alpha 10 isoform 2 [Equus caballus] Length = 1177 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + + Sbjct: 133 GSSVFSSGICARVDASFQLQGSLAPTAQRCPTYMDVVIVLDGSNSIYPWSE--------V 184 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 + ++ + + P +Q GLV + + L + ++ + G Sbjct: 185 RTFLRRLVGRLFIDP---EQIQVGLVQYGESPVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPTALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 82/258 (31%), Gaps = 20/258 (7%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +RN + +G IL A+ +P +F + + + + K + + + + + Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDP 60 Query: 68 EG--NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + NG+ + I+ N + + I + + Sbjct: 61 DQPDNGSYTPSTRNRQIVVDYVNAYISDIDAVTDIKVAKRRCELISGCVAGLYKGDARYL 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLDV 180 + I +R W+ + I + +D+M D Sbjct: 121 EHEIDVTTR---------QNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDVMFAADF 171 Query: 181 SRSM--ESFFDSSITKIDM--AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 S SM S+ ID+ I++I+ L++ +P+ + G+ FS F Sbjct: 172 SGSMLDTWSGSSNPKYIDLIEIIRNISVELQKFNDLPENRDKSTMGISAFSTFTNSFTSD 231 Query: 237 EWGVSHLQRKIKYLSKFG 254 L + + +K G Sbjct: 232 TGIQCSLSQGVNSKNKPG 249 >gi|297279796|ref|XP_002801793.1| PREDICTED: integrin alpha-10-like isoform 3 [Macaca mulatta] Length = 1036 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 83/236 (35%), Gaps = 38/236 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + S + +D +D++IVLD S S+ + +++ ++ + L Sbjct: 10 GHQKLNLDTVSYNILGVDSDGCPTYMDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL 64 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGL 263 I +Q GLV + + L + ++ + G T + + Sbjct: 65 ----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAI 118 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + F + +++V +TDGE+ +E +L C + Y Sbjct: 119 MVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKTCEAGR---VTRY 169 Query: 324 AIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 I + R FLR + P+ F+ V + ++ D +G I Sbjct: 170 GIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFG 225 >gi|262527579|sp|P54289|CA2D1_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor Length = 1103 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 79/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ N+ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|170571356|ref|XP_001891697.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158603658|gb|EDP39502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 319 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 97/279 (34%), Gaps = 25/279 (8%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + ++ +G + K+ + +I NTW + D N++ +I +++ Sbjct: 61 IPTLIMVVTDGRSADDPKIPAQQLQ-QIPNTWVFAAATGDPDAVDKNELLEITG--NINH 117 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 +++ + + + T + ++ N + D+++V+ Sbjct: 118 IILQSGRKLAADITRKLLRQAQEKCRTTTTTTTTTTTTTTATTTTTNPISGCEQDVVLVM 177 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLL 236 D+S + + I + I + + L+TFS+ K F L Sbjct: 178 DLSTTTNPVYRKYIEMAEELINRLL----------IGRRFSRIALITFSSVGKTRTQFNL 227 Query: 237 EW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + I+ L G +T G++ Q +QH + KK ++ Sbjct: 228 DRYFNGKDIVTAIRRLESSGGTTAVGEGIRLGIEQ-----KDKQHGGRPNEIAKKAMLVF 282 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 TDG + + + AK G +Y + + +S+ Sbjct: 283 TDGWSNKGPDVE---EMSRNAKGAGFTLYTVVYEIQKSN 318 >gi|161086902|ref|NP_001104316.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform d [Mus musculus] gi|1905823|gb|AAB50141.1| voltage-gated calcium channel alpha2/delta subunit, alpha2d isoform [Mus musculus] gi|148671298|gb|EDL03245.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_e [Mus musculus] Length = 1079 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|310799477|gb|EFQ34370.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 698 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 72/204 (35%), Gaps = 19/204 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQSG 222 D+++V+DVS SM + A ++L+ VK ++ +N+ + G Sbjct: 83 DHVPCDIVLVIDVSGSMGCNAPVPANPGEKAENYGLSVLDLVKHAARTVLETLNDGDRLG 142 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +VTF++K + L + ++ I + +TN GL M Sbjct: 143 IVTFASKAKVLQKLTPMDAKNKALAEKIINGMRP-DDATNLWHGLLEGIKLFNTCGEMNM 201 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 ++ +TDG +Q + ++ A ++ G L++ Sbjct: 202 GRVPA-------MMVLTDGMPNHMCPNQGYVPKLRGMEQLSASIHTFGFGYSLRSGLLKS 254 Query: 339 CAS--PNSFYLVENPHSMYDAFSH 360 A ++ + + + F H Sbjct: 255 IAEIGGGNYSFIPDAGMIGTVFVH 278 >gi|225551939|ref|ZP_03772879.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] gi|225370937|gb|EEH00367.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] Length = 332 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 92 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRGFISQ-------RENDNIGLVAFAKDASI 144 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 145 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 193 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----NSFY 346 IV +TDG S + + N A+ +Y++GI S SF Sbjct: 194 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSVGIGSSEEFSVEFKLRSGKFYQGSFK 251 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 252 EVYDPSMLVEISNKTG 267 >gi|162449478|ref|YP_001611845.1| hypothetical protein sce1208 [Sorangium cellulosum 'So ce 56'] gi|161160060|emb|CAN91365.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 607 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 69/217 (31%), Gaps = 29/217 (13%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 +P + R + + IT Q + + +V+D S SM S K+ +A+++ Sbjct: 11 LPAEPSERLLRVEITVPRPEGGQARKPVHLSLVIDRSGSM------SGEKLRLALEAARQ 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNST 260 + ++ + +VTF +++E + + + G +T+ Sbjct: 65 AIRTLQPGD------RFSVVTFDHQVEVPIPSTDATPGARLRAEAALDTVIARG-NTDLG 117 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRG 319 G ++ ++ +TDG+ N + + R Sbjct: 118 GGWLRG---CAEVGAHLPEDAIGR------VLLLTDGQANHGITSPDELTSRARSQRLRR 168 Query: 320 AIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSM 354 IG+ + L + +FY + Sbjct: 169 VTTSTIGLGEGFNEFLLGRLSEEGGGNFYFAARADEL 205 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 70/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 373 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 426 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 427 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 483 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 484 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 530 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 531 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 558 >gi|161086904|ref|NP_001104317.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 3 [Rattus norvegicus] gi|17864880|gb|AAL47093.1|AF400662_1 L-type calcium channel alpha2/delta subunit [Rattus norvegicus] Length = 1079 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 65/169 (38%), Gaps = 38/169 (22%) Query: 188 FDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLLEW----- 238 ++ITK+ ++N +E+ I D+ + LV F++ + ++ Sbjct: 221 LGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNY 280 Query: 239 ---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S L + L G T + GL A + F + G R+ Sbjct: 281 NRSQIVTELKSYTTKNISDLTSTVNSLIAAGA-TRADFGLNQA-QRAFQLGGTREGA--- 335 Query: 284 DANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI 327 +K++VF TDG+ S + ++ E K A++Y+IG+ Sbjct: 336 ----QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGV 380 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 65/169 (38%), Gaps = 38/169 (22%) Query: 188 FDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLLEW----- 238 ++ITK+ ++N +E+ I D+ + LV F++ + ++ Sbjct: 221 LGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNY 280 Query: 239 ---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S L + L G T + GL A + F + G R+ Sbjct: 281 NRSQIVTELKSYTTKNISDLTSTVNSLIAAGA-TRADFGLNQA-QRAFQLGGTREGA--- 335 Query: 284 DANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI 327 +K++VF TDG+ S + ++ E K A++Y+IG+ Sbjct: 336 ----QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGV 380 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/122 (20%), Positives = 39/122 (31%), Gaps = 13/122 (10%) Query: 253 FGVSTNSTPGLK--YAYNQIFDMQGMRQHCNT-EDANYKKII-VFMTDGENLSTKEDQQS 308 G T + GL+ Y Q + M A+ + +TDG+ + Q Sbjct: 444 SGSGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTADETLYFNILITDGQYNGYSTNAQV 503 Query: 309 LYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACA-----SPNSFYLVENPHSMYDAFS 359 E G Y IG + L+ A S N++Y N + A + Sbjct: 504 QGELEEMYNSGITTYVIGFGDGVDTAAAQAQLQNMAQWGSGSQNNYYDANNQTELEQALT 563 Query: 360 HI 361 I Sbjct: 564 TI 565 >gi|6753234|ref|NP_033914.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform e [Mus musculus] gi|1905825|gb|AAB50142.1| voltage-gated calcium channel alpha2/delta subunit, alpha2e isoform [Mus musculus] gi|148671296|gb|EDL03243.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Mus musculus] Length = 1084 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 64/162 (39%), Gaps = 22/162 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + Q + +D++ VLD S SM +K I+S+N + E + P+++ + Sbjct: 390 IGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIESVNEISENLLSDPNMD--I 447 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFG---------VSTNSTPGLKYAY 267 + G+V F S I + + L I L+ T T GLK Y Sbjct: 448 RIGMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGTPLTLGLKNGY 507 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ++ G E+ N +KI++ + DG + Q+ Sbjct: 508 ETLYADNG------GENRNPEKILIVVGDGTPTFSYAPIQTR 543 >gi|161086906|ref|NP_001104318.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Rattus norvegicus] gi|27450704|gb|AAO14652.1|AF486276_1 calcium channel alpha-2 delta-1 subunit isoform e [Rattus norvegicus] Length = 1084 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|121583393|ref|YP_973824.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596647|gb|ABM40082.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 350 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 65/204 (31%), Gaps = 18/204 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM +++ K++ ++ A+++ ++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESM---VGANLKKME---GAVAAIVKSLRTDPQALETVFFSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L V + L G TN L +I D I Sbjct: 57 RTIAPL---VEIVSFYPPKLP-LGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRP--I 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I +TDG + + AKK I AIG+ LR + E+ Sbjct: 111 IYLVTDGRPTDNPSRAIERWNSHYAKKATLI--AIGLGRSVDFTALRRLT--ENVISFED 166 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 F + + K Sbjct: 167 IKE--SDFKKFINWVTASVVVQSK 188 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/248 (15%), Positives = 88/248 (35%), Gaps = 61/248 (24%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + + + LD+++V+D S SM ++ ++D + ++ +++ V Sbjct: 113 LAVAMNTPIGPDDIEPRPLDLVVVVDTSGSM-----ATDARMDYVRQGLHLLVDAVDEDD 167 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL------------------------------------- 236 + LV++ + E L Sbjct: 168 ------RLALVSYQSFAEVHAELPALPVEETPEEPTEPTDPVGEPTDPPADPDEDPVDER 221 Query: 237 -EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 W S + + L G TN GL+ + + + + + ++ ++ Sbjct: 222 EAW-RSEMHALVDTLQP-GGGTNIYEGLERGF-------EIAKEARVNHPDRAQRVILLS 272 Query: 296 DGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPH 352 DG D S+ +EA + G + +G+ + E +R A +FY VE+P Sbjct: 273 DGLATEGITDSASIIALSEAFIEGGMGLTTVGVGASFNVELMRGLAERGAGNFYFVEDPE 332 Query: 353 SMYDAFSH 360 ++ + F+ Sbjct: 333 AVREVFTE 340 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 78/221 (35%), Gaps = 29/221 (13%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + R + + I S + + L++ ++LD S SME ++ ++ ++E+ Sbjct: 20 LSQRQVALSI-SVMANQLERTVPLNLCLILDHSGSMEGRP------LETVKQAAVQLVEK 72 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +K + +V F ++ + + + ++ KI L G T GLK Sbjct: 73 LKEGD------RLSVVAFDHQAQVIVPNQMINDSASIKGKINKLRASG-GTAIDKGLKLG 125 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++ N +TDGEN ++ L A + ++G Sbjct: 126 IEEL----------NKGRKESISQAFILTDGENEH-GDNDLCLKLAKLATDYNITLNSLG 174 Query: 327 IRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDI 365 + + L A + ++ P + FS + I Sbjct: 175 FGDDWNQDVLEKIADAGGGNLSYIQQPEQAIEEFSKLFNRI 215 >gi|229816811|ref|ZP_04447093.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] gi|229785827|gb|EEP21941.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] Length = 1185 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 65/210 (30%), Gaps = 18/210 (8%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN---SR 152 + NG + + + ++ ++ P + + Sbjct: 411 YTVSANGSPMTQQGSGNNAYAETGSFTVGKTSHVTIVNSNVKPSNNKSIVKTDGGDGDQY 470 Query: 153 HIVMPITS-SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + + S T D+++V+D S SM + A +L Sbjct: 471 TLNLTASGDSTSSTVTTATPADIVLVMDKSGSMNEN-NRDANAQKAAKDLAKKLLTGTNS 529 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 VQ +VTFS + VS + ++ TN LK A Sbjct: 530 KLPPEQQVQMAVVTFSTEASLKQKFTTNVSEINNAVRGNP--DGGTNWEAALKQA----N 583 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 DMQG R+ KK I+F++DG Sbjct: 584 DMQGGRRG-------VKKHIIFLSDGNPTY 606 >gi|109732283|gb|AAI15771.1| Integrin, alpha 10 [Mus musculus] Length = 1166 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F +++V +TDGE+ +E +L C Sbjct: 242 ETRTAQAIMVACTEGFSQS------RGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 61/165 (36%), Gaps = 16/165 (9%) Query: 168 TDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 DM+I LD+S SM+ + +D + + +K + GL+ F Sbjct: 82 QQQSRDMIISLDLSGSMQEVDMPLNGQTVDRLTLLKDLLKTFIKQRQGD----RLGLILF 137 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ L + + +Q+ + T + G A E+ N Sbjct: 138 ADHAYLQTPLTFDLKTIQQMVDESEIGLAGTRTAIGESIAM---------AIKRFVENKN 188 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +++++ ++DG N S + + +A K +Y IG+ + Sbjct: 189 EQRVLILVSDGANNSGS--IEPIQAAKQAAKNNITIYTIGMGAEQ 231 >gi|124486692|ref|NP_001074522.1| integrin alpha-10 [Mus musculus] Length = 1167 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGENPVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F +++V +TDGE+ +E +L C Sbjct: 242 ETRTAQAIMVACTEGFSQS------RGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|260800521|ref|XP_002595177.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|260800523|ref|XP_002595178.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] gi|229280421|gb|EEN51189.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|229280422|gb|EEN51190.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] Length = 153 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 64/170 (37%), Gaps = 22/170 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D + S+ + ++ I+ + A L N V+ +V ++ + Sbjct: 1 DIVFIVDGTGSVGLE---NFERMKTFIRQLFAYL------DIGENAVRVSIVQYAAQART 51 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + + Q + + G T + + +A N FD+ A+ KI Sbjct: 52 EFFLDQYYDLQEAQDAVDGIEYMGGYTLTGKAIDFATNLHFDL------RKGARADVTKI 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 V +TDG ++ G + A+G+ + L A A Sbjct: 106 AVVITDGR-----SYDDVNRPARRMRQAGIVTIAVGVGNNLDRDQLTAIA 150 >gi|159900699|ref|YP_001546946.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893738|gb|ABX06818.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 828 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 65/194 (33%), Gaps = 30/194 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++++LD S SM +S K +A + A + + G++ F Sbjct: 381 ERPTVTLLLILDRSASM--LGESGKDKFSLAKAAAIAATDSLGADDT------IGVLAFD 432 Query: 228 NKIE---EFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + F + GV S +Q I LS G T+ L+ + G +H Sbjct: 433 DTNDWTVTFTKVGQGVQLSEIQNNIAGLSA-GGGTDIYAALEVGMGGLAQQTGKVRHA-- 489 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 V +TDG + + + + +G + I I L + A Sbjct: 490 ---------VLLTDGRSGGESSYESLIAP---LRAQGITLSTIAIGGDADTVLLESLAKL 537 Query: 342 -PNSFYLVENPHSM 354 ++ P + Sbjct: 538 GAGRYHFASRPDDL 551 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 65/192 (33%), Gaps = 24/192 (12%) Query: 8 IRNFFYNY---KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 +RN + +G + I+TA+ L ++ G+ ++ H++ KT L + D + A+ + Sbjct: 1 MRNRHADRDSQRGVVAIITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYE 60 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ- 123 + T R + + D+ STSL +P Sbjct: 61 LTGSPISPENFTRAENA-----GKTVGTENRVDFQGGAIAAADIDVTFSTSLAGSWLPAG 115 Query: 124 ----NEGYSISAISRYKIPLKFCTFIPWYTN----SRHIVMPITSSVKVNSQTDARLDMM 175 N Y I+R PW+ V I ++ SQ + + M Sbjct: 116 GATGNSKYVRCTITRN-------GIAPWFMQVMGFGDQTVSAIATATLAPSQNNCAIPMG 168 Query: 176 IVLDVSRSMESF 187 + S S F Sbjct: 169 LCTHPSSSAPHF 180 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 37/211 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD+S SM F +I ++ L ++ G+V F+++ Sbjct: 303 LCLVLDISGSMNGF-----DRIYRLRQAGEQFLLQILETGSWA-----GIVVFNSQALTK 352 Query: 234 FLLEWGVSH-LQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L+ +++ + + G TN G++ + + C Sbjct: 353 TYLKQITGDSVRQTLSAYLPTAAGGGTNICSGIREGFQVFLKKYPSTEGCE--------- 403 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLV 348 IV +TDGE+ C E ++ G+I++ I + + E + + Sbjct: 404 IVLLTDGEDAGVSS-------CFAEVQRSGSIIHTIALGPSAAKELEMLADMTGGLKFSA 456 Query: 349 ENPHS---MYDAFSHIGK---DIVTKRIWYD 373 + + DAFS I DI + I + Sbjct: 457 TDSLDSNGLIDAFSGISSGSGDISQQSIQLE 487 >gi|257064432|ref|YP_003144104.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] gi|256792085|gb|ACV22755.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] Length = 629 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 56/355 (15%), Positives = 122/355 (34%), Gaps = 67/355 (18%) Query: 25 IFLPI--IFLVLGMIIEVS-HIFFMKTVLHSMIDRSL----VHAATQIMNEGNGNNRKKL 77 I +P+ L++ + + V ++ + ++ + + G + Sbjct: 86 IAVPVAACLLLVAIGVGVGTNLLGSNAEMPVAETKAASEDTMAGSANSYAPDGGLAYETD 145 Query: 78 KGGDILCRIKN-----TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ-------NE 125 + + + +N + +NGFV+ + + + S D+ N+ Sbjct: 146 EAYETFDTLDEGAPMEDFNTEEYAAIEENGFVSTVTRPLSTCSADVDTASYCNLRRMIND 205 Query: 126 GYSISAIS-------------RYKIPLKFCTFI----------PWYTNSRHIVMPITSSV 162 GYS+ I Y + PW ++ +VM T+S Sbjct: 206 GYSLDEIPDGAVRIEEMLNYFHYDSGEPEGNDLFAVRAESARCPWNDQTQLLVMTFTASD 265 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + + +++ ++D+S SM+ K+D+ S +LE + + Sbjct: 266 KAQTASKGS-NLVFLIDISGSMDE-----PDKLDLLKDSFGTLLENLGPND------RVS 313 Query: 223 LVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VT++ + G + R + L G STN GL+ AY + + + Sbjct: 314 IVTYAAGEDVLLEGASGDDTRKIMRALNRLEADG-STNGEAGLEMAYE-VAERNYIEGGV 371 Query: 281 NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N IV +DG+ N+ + + E ++ G + +G + Sbjct: 372 NR--------IVMASDGDLNVGITSESDLYDFVEEKRETGVYLSVLGFGSGNYKD 418 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 28/209 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q + D+MI +D+S SME+ + + + + ++A+ + +K + + GL+ F Sbjct: 88 QQKSGRDLMIAVDLSGSMEARDFTLPS--GVTVDRLDAVKQVLKELAANRESDRLGLIVF 145 Query: 227 SNKIEEFFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 Q+ + + G ST + + D Sbjct: 146 GAAAYLQTPFTDDHQVWQQLLDETEIGMAGPSTVFGDAIGLSIKLFSDSDSDN------- 198 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFL------ 336 ++++ +TDG T + A G +Y I I + L Sbjct: 199 ----RVLIMLTDG--NDTGSTVPPVDAAKVAAANGVRIYTIAIGDPATVGEDALDMDTIT 252 Query: 337 --RACASPNSFYLVENPHSMYDAFSHIGK 363 A + + M A+ IG+ Sbjct: 253 RVSKIA-DGRTFRALDQDEMRQAYITIGE 280 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 65/169 (38%), Gaps = 38/169 (22%) Query: 188 FDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLLEW----- 238 ++ITK+ ++N +E+ I D+ + LV F++ + ++ Sbjct: 223 LGTNITKLQALQNAVNNFVEQTAKMNDSIDDIKLQHRVSLVKFASDESDNIGNDFINNNY 282 Query: 239 ---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +S L + L G T + GL A + F + G R+ Sbjct: 283 NRSQIVTELKSYTTKNISDLTSTVNSLIAAGA-TRADFGLNQA-QRAFQLGGTREGA--- 337 Query: 284 DANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYAIGI 327 +K++VF TDG+ S + ++ E K A++Y+IG+ Sbjct: 338 ----QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKELKDANALIYSIGV 382 >gi|311268149|ref|XP_003131915.1| PREDICTED: integrin alpha-E-like, partial [Sus scrofa] Length = 1032 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 80/202 (39%), Gaps = 26/202 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I+LD S S++ A I+ M+ I + LV + + I+ Sbjct: 206 IAIILDGSGSIDP------PDFQRAKDFISNMMRN---IYEKCFECSFALVQYGSVIQTE 256 Query: 234 FLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ +++ +++ G T + +++ + IF H + A K++ Sbjct: 257 LDLQDSQDVAASLDRVQNITQVGSVTKTASAMQHVLDNIFTP----SHGSK--AKASKVM 310 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NS 344 V +TDG+ + ++ N K +G +AIG+ ++ L+ AS + Sbjct: 311 VVLTDGDIFDDPLNLTTVI--NSPKMQGVERFAIGVGGAFNKSNTYNELKLIASDPDEDH 368 Query: 345 FYLVENPHSMYDAFSHIGKDIV 366 + V N ++ S + + I+ Sbjct: 369 AFKVTNYMALDGLLSKLQQRII 390 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/162 (20%), Positives = 60/162 (37%), Gaps = 23/162 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++++ D+S SM + D ++ + I+A LE + + GLV FS + + Sbjct: 90 DLVLLQDISLSMLAE-DVKPNRLVRSRHEISAFLESLTGD-------RVGLVAFSGEAQV 141 Query: 233 FFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L +Q ++ L+ TN ++ + G QH + Sbjct: 142 MVPLTLDYGTVQMVLRELNPGWLMPGTNLESAIRKGMTLFKNSGGASQHS---------V 192 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ M+DGE L + G +Y IGI Sbjct: 193 MILMSDGEELEAAAVNAAKEAAEF----GIKIYTIGIGSREG 230 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 773 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 832 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 833 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 891 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 892 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 949 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 950 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 990 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 467 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 526 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 527 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 585 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 586 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 643 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 644 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 684 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 319 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 320 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 378 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 379 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 436 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 437 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 477 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 319 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 320 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 378 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 379 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 436 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 437 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 477 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 257 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 316 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 317 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 375 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 376 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 433 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 434 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 474 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 228 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 287 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 288 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 346 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 347 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 404 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 405 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 445 >gi|12805443|gb|AAH02194.1| Col6a1 protein [Mus musculus] Length = 406 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 54/165 (32%), Gaps = 19/165 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + + K + + V+ +V +S + Sbjct: 204 SPADITILLDSSASVGSH---NFETTKVFAKRLAERFLSAGRADP-SQDVRVAVVQYSGQ 259 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + +T+ L Y R + Sbjct: 260 GQQQPGRAALQFLQNYTVLASSVDSMDFINDATDVNDALSYV---------TRFYREASS 310 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 KK ++ +DG + + EA++ G ++ + + Sbjct: 311 GATKKRVLLFSDGNSQGATAEAIEKAV-QEAQRAGIEIFVVVVGP 354 >gi|161086900|ref|NP_001104315.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform c [Mus musculus] gi|1905821|gb|AAB50140.1| voltage-gated calcium channel alpha2/delta subunit, alpha2c isoform [Mus musculus] gi|109732367|gb|AAI15872.1| Cacna2d1 protein [Mus musculus] gi|148671297|gb|EDL03244.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Mus musculus] Length = 1086 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|12655866|gb|AAK00631.1| complement factor B [Halocynthia roretzi] Length = 1084 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 73/207 (35%), Gaps = 24/207 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ + DVS+S+ + + I+ A + + + + V++ ++ +++ + Sbjct: 531 VIFIFDVSKSVTKKYQDFSSGIEFAKR-------LIDRLKNFGGVLKYSIIAYASSNKTQ 583 Query: 234 FLLEW-GVSHLQRKIKYLSKFGV-----------STNSTPGLKYAYNQIFDMQGMRQHCN 281 + ++++ IK L T S A + DM +H Sbjct: 584 LEITDRFSTNVKEVIKRLDNLDSQVKEAVSELIEETRSGTATAKALKSLRDMMLFMEHDI 643 Query: 282 TEDANYKKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLR 337 D K VF+ TDG + K + + Y+I + + E L Sbjct: 644 RNDQTNDKCHVFLFTDGMHNEGKNPVEVRKEMQKIFGSNIEFYSISAQEDPSPEAFEELI 703 Query: 338 ACAS-PNSFYLVENPHSMYDAFSHIGK 363 AS P ++ +E+ H + + Sbjct: 704 GLASEPENYIYIEDIHLLSSYLDKVTD 730 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 67/221 (30%), Gaps = 28/221 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 258 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 317 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +W L + K TN+ L+ Y+ + G Sbjct: 318 LVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 376 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----I 330 + +I+ MTDG N+ ++ VY G+ + Sbjct: 377 TRH--VIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSV 434 Query: 331 RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + V++ + + F + + + + Sbjct: 435 NINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSLC 475 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 303 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 356 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 357 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 413 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R N +V +TDG+ Sbjct: 414 MSGGTATGDAISFTVRNVFGP--VRDSPNKN------FLVIVTDGQ-----SYDDVRGPA 460 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 461 AAAHDAGITIFSVGVAWAPLDD-LKDMAS 488 >gi|256829450|ref|YP_003158178.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] gi|256578626|gb|ACU89762.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] Length = 702 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 86/218 (39%), Gaps = 37/218 (16%) Query: 139 LKFCTFIPWYTNSRHIVMP-------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 L+ +P + + P + + D + D++ V+D SRSM+ F Sbjct: 250 LQLAAAVPNSRGADTLASPDYLEQAQVGRAAGGAVMRDVKTDVVFVIDTSRSMQPF---- 305 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL-------Q 244 IDM +++ M +++ + + + GLV F + +E LE+ +L + Sbjct: 306 ---IDMTREAVAGMTKKISA--ETADRYRFGLVVFRDSLEAAPQLEYVTRNLTPELVPGE 360 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------- 297 + + L K G +T + YA + + + ++F+ D Sbjct: 361 QLVDLLEKEGGATAVGS-VDYAEEAFAGVDEALRS--KWREGALRFVIFIGDASSHPKGH 417 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +N S K++ ++A+ ++AI ++ R+ E Sbjct: 418 PQNTSGKDETDLRREYDDAQ---VHLFAIHLQDPRAAE 452 >gi|148680762|gb|EDL12709.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 381 >gi|301764709|ref|XP_002917776.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Ailuropoda melanoleuca] Length = 1091 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 69/207 (33%), Gaps = 27/207 (13%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 331 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 384 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + ++ ++ + + ++ + T+ + E F ++ + + Sbjct: 385 FRLMLEFVSNIAKTFEISDIGAKIAAVQF-TYDQRTEFSFTDYNTKENVLAVLANIRYMS 443 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + + + +F D+ K +V +TDG+ Sbjct: 444 GGTATGDAISFTVRNVFGP--------IRDSPNKNFLVIVTDGQ-----SYDDVRGPAAA 490 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS 341 A G ++++G+ + L+ AS Sbjct: 491 AHDAGITIFSVGVAWAPLDD-LKDMAS 516 >gi|31542335|ref|NP_037051.2| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 1 [Rattus norvegicus] gi|11055592|gb|AAG28164.1|AF286488_1 voltage-gated calcium channel alpha2/delta-1 subunit [Rattus norvegicus] Length = 1091 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|6090615|gb|AAF03259.1| dihydropyridine receptor alpha 2 subunit [Homo sapiens] Length = 1110 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 79/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ N+ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|161086898|ref|NP_001104314.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform b [Mus musculus] gi|1905819|gb|AAB50139.1| voltage-gated calcium channel alpha2/delta subunit, alpha2b isoform [Mus musculus] Length = 1091 Score = 54.8 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|257387423|ref|YP_003177196.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257169730|gb|ACV47489.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 788 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 77/203 (37%), Gaps = 37/203 (18%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V+V + T ++++++DVS S ES K++ + ++ + + + Sbjct: 365 VQVGNATGGESNIVVLVDVSSSAESGLS--------IQKAVA-----LDVLDQLGDENRV 411 Query: 222 GLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V F++ L + KI+ L G +T+ GL+ A + D +G Sbjct: 412 GVVAFNHNAYRVSELRTLGQNRAETAEKIRQLES-GGATDIAVGLQGADELLGDREGT-- 468 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 I+ ++DG++ N+ + G V ++G+ +R Sbjct: 469 ------------IILLSDGQDRLGPP----AAVANQLGREGTRVVSVGVGKRVGVPTMRQ 512 Query: 339 CA--SPNSFYLVENPHSMYDAFS 359 A S S++ + + F Sbjct: 513 IAGESGGSYFAADETERLRLLFG 535 >gi|218186188|gb|EEC68615.1| hypothetical protein OsI_36984 [Oryza sativa Indica Group] Length = 585 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 51/145 (35%), Gaps = 20/145 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ VLDVS SM T++D+ ++ + ++ + +V F Sbjct: 42 ERFPIDLVAVLDVSGSMTKATSMHGWTRLDLVKGAMKMVTNKLGAGD------RLAIVPF 95 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + K+ G + K+ L G T P LK+A + Sbjct: 96 NGKVVAAGATRLMEMTTKGRADANAKVNQLKA-GGDTKFLPALKHA------SGLLDSRP 148 Query: 281 NTEDANYKKIIVFMTDGENLSTKED 305 + I ++DG++ +D Sbjct: 149 AGDKQYRPGFIFLLSDGQDNGVLDD 173 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 48/290 (16%), Positives = 102/290 (35%), Gaps = 37/290 (12%) Query: 85 RIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 R+ + +F ++ + N + ++ I + Y P Sbjct: 81 RVTDQAVSTFSIDVDTGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYP-QPNGVHPF 139 Query: 140 KFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 T PW N++ I + I + + Q +++ ++DVS SM S+ K+ Sbjct: 140 SVTTETVDSPWKENAKLIRIGIQAK-DLALQQLPPANLVFLVDVSGSM-----SAADKLP 193 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFG 254 + +++ + E+++ V ++T+++ + G + I L G Sbjct: 194 LVKQTLRILTEQLRAQDKVT------IITYASGEKLVLEPTSGEQKEKILAVINGLRARG 247 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCN 313 ST ++ AY Q + ++ N I+ TDG+ D +L Sbjct: 248 -STAGEQAIQLAYKQA-EKAFVKNGINR--------ILLATDGDFNVGITDFNTLKGMVA 297 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAFSH 360 E +K G + +G +E L A A ++ ++N + Sbjct: 298 EKRKSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNEAKKVVQR 347 >gi|284029817|ref|YP_003379748.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809110|gb|ADB30949.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 550 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 48/249 (19%), Positives = 83/249 (33%), Gaps = 44/249 (17%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM----MIVLDVSRSMESFFDSS 191 + + P ++V+ Q+ L + + V+DVS SM S Sbjct: 313 DHLVSPLGEGRGEGEVTQLTKPTVAAVEKILQSWTTLSLSAHSLAVIDVSGSMAEKVAGS 372 Query: 192 I-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--------EWGVSH 242 T++ + I++ + +K+ PD GL FS KI + H Sbjct: 373 AKTRMQLTIEAADN---GLKMFPDSAE---LGLWVFSTKIGPDSADFRQLVPIGKLTPGH 426 Query: 243 LQRKIKYLSK----FGVST---NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 QR I L G T ++ A ++ + ++ T Sbjct: 427 RQRMIGQLKAQSARVGGGTGLYDTAIAAVRAVRSSYNSGAVN------------TVLLFT 474 Query: 296 DGENLSTK--EDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVE 349 DG+N ++L N K R + A+GI + L+ A+ Y+ Sbjct: 475 DGKNDDPGSLSLDRTLQILNGLKDPARPVRIIALGIGPDADADELKRLAQATGGQAYVAR 534 Query: 350 NPHSMYDAF 358 NP + D F Sbjct: 535 NPTDLKDVF 543 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/215 (19%), Positives = 77/215 (35%), Gaps = 41/215 (19%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + ++D +VLDVS SM + T++ A ++ N +L V Sbjct: 29 SAGEPTGEDAPKVD--LVLDVSGSMRTRDIDGGTRMAAAKQAFNEVL------DATPEEV 80 Query: 220 QSGLVTF---------------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + G+ T + ++ L+ + + + LS G T P L Sbjct: 81 RLGIRTLGADYPGDDRKTGCKDTAQLYPVGPLD--RTEAKTAVATLSPTGW-TPIGPALL 137 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + + G K IV ++DGE+ D + AK G + Sbjct: 138 KAADDLDGGDGS------------KRIVLISDGEDTCAPLDPCEVAREIAAKGIGLTIDT 185 Query: 325 IGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYD 356 +G+ +C A+ ++ VE+ + D Sbjct: 186 LGLVPNTKMRQQLSCIAEATGGTYTSVEHTDELTD 220 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 59/360 (16%), Positives = 116/360 (32%), Gaps = 45/360 (12%) Query: 27 LPIIFLVLGMIIEVSHIFFMKTVL--HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILC 84 P + I+ S+ +K L ++S + + + E + + Sbjct: 186 APKSLKLAIDAIDASNYPTIKVKLAVEDGSEQSDLSSGQVAIKENTVAQKTAEVNANTAD 245 Query: 85 RIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIV-VVPQNEGYSISAISRY--KIPLK 140 + ++ + N NG +D ++ L P + I +S + P Sbjct: 246 KTYEIVYDTTVSNTNPPNGEQRVVDLVIGDNKLSESYKSPSQKKLHIDDVSYNTDEYPKV 305 Query: 141 FCTFIPWYTNSRHIV--MPITSSVKVN--------------SQTDARLDMMIVLDVSRSM 184 F + N++ + P+ ++ V ++ + +V+DVS SM Sbjct: 306 NVYFSLYDENNQLVEDMNPVKTAFTVKEGDKETKNASFSKLTEKPQAISTNLVIDVSDSM 365 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EFFLLEWGVSHL 243 D+ +TK+ + L N+VV GL++FS+ + + Sbjct: 366 SE--DNKLTKV---KDAATQFLSHASFA--SNDVV--GLMSFSDASNIRQSDFTTEIESI 416 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--S 301 + I + G + L A NQ K +V TDG+N Sbjct: 417 KSSIAGMQTSGCT-----ALYEALNQAVSNTAY------NSVEGSKYVVVFTDGKNTICD 465 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 N A + G +YAIG+ + + + L + + +S I Sbjct: 466 GTNWVSPSTVINNALQWGVPIYAIGVEEDADLQQIAEQTNGQYHVLGNDFTDLNAIYSDI 525 >gi|301109920|ref|XP_002904040.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096166|gb|EEY54218.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2146 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 77/208 (37%), Gaps = 22/208 (10%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ + ++ + VLD S SM + + + + + + + + V+ + Sbjct: 1894 SHAASGGKMHHVFVLDCSGSMNGQPWNDL--MAAWKEYVYNRIADGATLDLVS------V 1945 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 VTF N + + + +I+Y G TN GL+ A + + Sbjct: 1946 VTFDNSAQIVYEARSITTVTNARIQY---RGGGTNYAAGLRSANEVLSRVNFDM------ 1996 Query: 284 DANYKKIIVFMTDGENLST-KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 +K IVF +DG + ++ + + ++ G +A+G I + R Sbjct: 1997 ---FKPAIVFFSDGHPCDPLQGEELATHIRGCYERNGLQAFAVGFGSINLNMLERVAEKL 2053 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKR 369 +++ V + + F I + T+ Sbjct: 2054 GGTYHHVLTGNELKATFFSISASLSTRA 2081 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/222 (18%), Positives = 82/222 (36%), Gaps = 42/222 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P+T++ + A + +VLDVS SM + T++ A ++ N +L+ Sbjct: 22 IAFPVTAAAGEPTGGTAP-QVDLVLDVSGSMRTADIDGGTRMAAAKRAFNEVLD--ATPE 78 Query: 214 DVNNVVQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +V +++ + + ++ L+ + + ++ LS G T P Sbjct: 79 EVQLGIRTLGANYPGDNQKTGCKDTAQLYPVSTLD--RTEAKTQVATLSPTGW-TPIGPA 135 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI- 321 L + D K IV ++DGE+ D C A++ GA Sbjct: 136 LLKSAGDFTDSASS------------KRIVLISDGEDTCAPLD-----PCEVAREIGAKG 178 Query: 322 ----VYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYD 356 + +G+ +C A+ ++ VE+ + D Sbjct: 179 IGLTIDTLGLVPNTKMRKQLSCIAEATGGTYTSVEHADELTD 220 >gi|25742632|ref|NP_113956.1| integrin, alpha E [Rattus norvegicus] gi|3236342|gb|AAC23662.1| integrin alpha E1 [Rattus norvegicus] gi|187466179|emb|CAQ51889.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 381 >gi|238793630|ref|ZP_04637253.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] gi|238727045|gb|EEQ18576.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] Length = 480 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/238 (17%), Positives = 79/238 (33%), Gaps = 38/238 (15%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 N +G + IFLP+I ++ E++H KT L ++++ + + N Sbjct: 17 LIKNEQGAILFPFIIFLPLIIGLIFFSFELAHFLQKKTKLSDAMEQATLA-----LTVEN 71 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N+ I +++ ++ L F +I+ N Sbjct: 72 NNSTPSAAQITKNAEIVSSYAQAY---LPAETFSTPTINII----------YNNGRIEYG 118 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL---DMMIVLDVSRSMESF 187 A KF + I TN I+ + A + D++ V D S SM+ + Sbjct: 119 AEINMSYSAKFLSNI-QVTNLSTIINATDRGAARKNIISAPIEKTDVVFVADYSNSMDEY 177 Query: 188 F---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 F ++ KI + N + + V N+ G + WG + Sbjct: 178 FYHDENEPKKIVALREIFNRLNDNVLK---NKNIHTIGFI----------PFSWGTKN 222 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/216 (15%), Positives = 73/216 (33%), Gaps = 26/216 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + S D ++++ LD S SME + TK++ A ++ + + + Sbjct: 24 VVGTAAPASAADNDGELLMALDASGSMEESDGAGNTKMETARDAV------IDVAEAMPG 77 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GL + G K I + ++T +T LK + Sbjct: 78 HAKVGLRAYG-------PASTGSGCKASKELVPIDKIDADAITTAATE-LKPEGDTPIAY 129 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-- 329 + + +A K I+ ++DGE + + +G V+ IG +V Sbjct: 130 SLEKAAGDFTEAKGPKTILLVSDGE---ETCGGDPVKVAEKIASQGVDLRVHVIGFQVDD 186 Query: 330 IRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKD 364 + A S+Y ++ ++ + Sbjct: 187 ATRKQLTEIAKAGKGSYYDAQDGPALASRLKRASES 222 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 70/175 (40%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD+S SM K+ +++ +L +++ + ++ FSN+I Sbjct: 296 VVFVLDISASM------VGAKLQQTREALVTILNDLRPQD------RFNIIGFSNRIKMW 343 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + + +LS G T+ L+ A + + ED + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTG-GTDINGALQAAIKLL---NNYVAQNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 +I+F+TDG+ + +L + K+ ++ +GI + L + Sbjct: 400 -LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLS 451 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 50/393 (12%), Positives = 109/393 (27%), Gaps = 62/393 (15%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +G + IL A + + + G + +S++ + L + D + A +++ + + Sbjct: 18 RRGAIAILAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQSTTSG 77 Query: 75 KKLKGGDILCRI--KNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + +++ + + + + G Sbjct: 78 RAAARDIASRFEVGGKALSLNDNDIQFGRSTRQSNGSYSFAINGTPTNAARVFGRKTKTS 137 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF---- 188 + + L F F + P S+ D++IVLD S SM Sbjct: 138 AAGPVELPFGGF---------VGAPEYSTELNAVAMRLDYDIVIVLDRSGSMGWDLSGVE 188 Query: 189 --------------------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D + ++ + S+N L + + GLVT++ Sbjct: 189 FEYPEAVRQRPLVENYFSPPDPTGSRWAILSASVNDFLTILNQRQVAA---RVGLVTYAG 245 Query: 229 K----IEEFFLL---EWGVSHLQRKIKYLSKFG-----VSTNSTPGLKYAYNQIFDMQGM 276 L S L+ G T+ G+ A + Sbjct: 246 DYTFGKYSSVKLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITAAQTMLTTSSQA 305 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEF 335 R II+ +DG + + SL ++ ++++ + Sbjct: 306 RLKTGQP------IIIVFSDG-MFNQGTEPVSLAASAYSQSSTI-IHSVTFGATAQGRAT 357 Query: 336 LR---ACASPNSFYLVENPHSMYDAFSHIGKDI 365 + A A + ++F I I Sbjct: 358 MNSVTATAGKGLSLHANTAAELAESFRSIANAI 390 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 76/209 (36%), Gaps = 34/209 (16%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 S ++ N ++S P + +P +N++ + + + +D+ Sbjct: 22 STNLASASNNVTVNLSVT-----PSQSVVILPTTSNAKASLNLMLTPTGNPQTERDPIDL 76 Query: 175 MIVLDVSRSMESFF--DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + V D S SM+ +SS+ +ID A ++ L + + G V FS+ Sbjct: 77 VFVFDKSGSMDFKVASNSSVKRIDSAKSAMTNALMFFDGQNTSD---RFGFVPFSSNANT 133 Query: 233 FF-----LLEWG-VSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 WG S+ K++ + TN T L A +++FD Sbjct: 134 DVVSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVA-SKLFDSSSK---- 188 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K I+F+TDG + D++ Sbjct: 189 -------DKNIIFLTDGTPTFSFSDEKVY 210 >gi|166366827|ref|YP_001659100.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089200|dbj|BAG03908.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 724 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 54/150 (36%), Gaps = 23/150 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + +T + + +++++D S SM K+ A + A +E P Sbjct: 35 EVFLTVTERPPIVEQNPQSVVMLIDTSGSMN-----DDNKLQEAKNAAKAFIERQD--PS 87 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 VN + +V F ++++ L ++ L + I LS G T GL A Q+ Sbjct: 88 VN---RFAVVGFGSQVQIGTGLTSDLATLNQAIDNLS-DGGGTRMDLGLATAIEQLESSS 143 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 R I+ TDG+ Sbjct: 144 SDRH------------ILLFTDGQPAPAPS 161 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 49/322 (15%), Positives = 97/322 (30%), Gaps = 47/322 (14%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + D + +F + + + + + + Sbjct: 57 LIDTSGSMNDDNKLQEAKNAAKAFIERQDPSVNRFAVVGFGSQVQIGTGLTSDLATLNQA 116 Query: 131 AI-------SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 +R + L ++S ++ T + + ++D+MIVLDV+ S Sbjct: 117 IDNLSDGGGTRMDLGLATAIEQLESSSSDRHILLFTDGQPAPAPSPEQIDIMIVLDVTSS 176 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI------------E 231 M +I + I +E+K Q GL+ F +++ Sbjct: 177 MNE-------EIAGVQQGIQNFAQELKK---RKLDAQIGLIAFGDRLFGEEPQILNFDGS 226 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F S I+ ++ +S L A Q F ++ K+I Sbjct: 227 VFTPDSDRFSEQIGSIQQVNGNDPPESSLDALALAARQPF------------RSHATKVI 274 Query: 292 VFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASPNSFYLVE 349 + +TD L + SL N K+ + I+ F + ++P +L+ Sbjct: 275 LLITDAPPLIPDREIPSLEATTNVLKENQIAQLHLVIQPDDRATFEQLQLSAPGETFLLG 334 Query: 350 NPHSMYDAFSH----IGKDIVT 367 + F IGK I Sbjct: 335 ETAAGRQGFESVLPTIGKAIAE 356 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/208 (17%), Positives = 67/208 (32%), Gaps = 28/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A + ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLA-TDVDPDRLSAAKSAARDFVDGLPREFN------VGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L I+ L G++ + A + T +IIV Sbjct: 141 VPPSTDREALHDGIRRL-AEGITGVQGTAIGEAISTSLGAVKALDEQATTQPPPARIIV- 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG N S + EA V+ I E L+A Sbjct: 199 LSDGANTS---GMDPMEAAAEAVAFEVPVHTISFGTPGGFVDRGGRPIQVPVDGETLQAV 255 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + +F+ + ++ + IG + Sbjct: 256 AEQTGGAFHQADTSDELHAVYDDIGSSV 283 >gi|327265811|ref|XP_003217701.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Anolis carolinensis] Length = 914 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 79/205 (38%), Gaps = 21/205 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIEE 232 ++ V+D S SM KI I+++ +LE++ N VV SG ++ + + + + Sbjct: 270 IIFVIDKSGSM------IGKKIQQTIEALQKILEDLNPEDHFNLVVFSGEISEWQSSLLK 323 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V ++ ++ + G T+ L A N + + + +IV Sbjct: 324 ATEE--NVELAKQYVRTIMAQG-GTDINGALLTAINSL---DRATSAELLPEQSIS-MIV 376 Query: 293 FMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS------F 345 +TDG+ + + S+ +A +Y +G S+ FL A N + Sbjct: 377 LLTDGQPTVGETNVNSIQTNIKKANDGNYFLYCLGFGFDVSYTFLEKLALENRGIARRIY 436 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 + + D + + I+ + + Sbjct: 437 EDSDAALQLQDFYQEVATPILKEIV 461 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 33/211 (15%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + ++D +VLDVS SM + T++ A ++ N +L+ +V + Sbjct: 29 SAGEPTGEDAPKVD--LVLDVSGSMRTRDIDGGTRMAAAKQAFNEVLD--ATPEEVQLGI 84 Query: 220 QSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 ++ + + ++ L+ + + + LS G T P L A + Sbjct: 85 RTLGADYPGDDRKTGCKDTAQLYPVGPLD--RTEAKTAVATLSPTGW-TPIGPALLKAAD 141 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + G K IV ++DGE+ D + AK G + +G+ Sbjct: 142 DLDGGDGS------------KRIVLISDGEDTCAPLDPCEVAREIAAKGIGLTIDTLGLV 189 Query: 329 VIRSHEFLRAC---ASPNSFYLVENPHSMYD 356 +C A+ ++ VE+ + D Sbjct: 190 PNTKMRRQLSCIAEATGGTYTSVEHTDELTD 220 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 76/200 (38%), Gaps = 29/200 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD++ V+D S SM F +KI+ +I +LE + + L+TF Sbjct: 206 EGRPNLDLICVIDNSGSMNDF-----SKIENVKNTILQLLEMLNEND------RLSLITF 254 Query: 227 SNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + K ++ L+ LQ K + G T+ G++ A+ + + Sbjct: 255 NTKAKQLCGLKNVNNQNKKSLQTITKSIKADG-GTDIIRGIEIAFQIL------QSRKQK 307 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + I ++DG++ ++L + ++ +++ G ++ A Sbjct: 308 NSVSS---IFLLSDGQDNLADAGIKNLLKTTYKQLQEESFTIHSFGFGNDHDGPLMQKIA 364 Query: 341 S--PNSFYLVENPHSMYDAF 358 SFY VE + + F Sbjct: 365 QIKDGSFYFVEKNDQVDEFF 384 >gi|194209576|ref|XP_001915257.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Equus caballus] Length = 1016 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 221 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 268 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 269 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 327 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 328 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 374 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 375 WMACENKGYYYEIPSIGAIR 394 >gi|74150598|dbj|BAE32321.1| unnamed protein product [Mus musculus] Length = 1166 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 162 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 215 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 216 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 272 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 273 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 324 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 325 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 380 >gi|301606953|ref|XP_002933089.1| PREDICTED: integrin alpha-2-like [Xenopus (Silurana) tropicalis] Length = 1081 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 74/210 (35%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKI 230 +D+ IV D S S+ ++ + ++ + + P Q L+ + ++ Sbjct: 185 IDIAIVCDGSNSIYPWY--------AVRNFLEKFVQGLDIGPSS---TQVSLIQYGNDPQ 233 Query: 231 EEFFLLEWGVSH-LQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + I + + G + TN+ + F + N Sbjct: 234 VMYKMDTYRTKEEMAKAITGIDQMGGTQTNTFKAID------FTRKFAFSKEYGGRPNAN 287 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR------VIRSHEFLRACAS- 341 K++V +TDGE+ + + C + I + I + I S ++ S Sbjct: 288 KVMVVITDGESHDNALLSKVIGDCEK---DNIIRFGIAVLGYYNRYNIDSKNLIKEIESI 344 Query: 342 -----PNSFYLVENPHSMYDAFSHIGKDIV 366 F+ V + ++ + +G+ I Sbjct: 345 ASVPKEKFFFNVSDEVALLEKAGTLGERIF 374 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 20/198 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNNVVQSGLVTFS 227 + V D S SM D+ T + + A+ E + +++V + + Sbjct: 330 FHFIFVNDKSGSMGGS-DARPTSSKYSNDRLGALFESCEKFLEVRDGSSDLV--SCIMYD 386 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + F + L + G T+ T ++ + + +Y Sbjct: 387 HSAYNCFTTNPLSTSLVSTMSSYVA-GGGTSFTNAMQ-------SVSSLISSTYPNHQSY 438 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSF 345 K +++FM+DGE D+ I++ I + + LR A+ F Sbjct: 439 KIVVLFMSDGE---DSADEAVSITGQLVSSHDIILHTIQLGGSSDNTGLRQMAATGRGQF 495 Query: 346 YLVENPHSMYDAFSHIGK 363 + S+ + I Sbjct: 496 KRANDSASLAGIYQEIAN 513 >gi|126658524|ref|ZP_01729672.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] gi|126620266|gb|EAZ90987.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] Length = 610 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 85/208 (40%), Gaps = 28/208 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-- 228 L++ +VLD S SM + S+ AI++ +++ + ++ ++ + + Sbjct: 39 PLNLSVVLDRSGSMAGYALSN------AIQATEKLVDFLSPDDLLS------VIIYDDVA 86 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ ++ KIK + G + S + ++ H +T+ N Sbjct: 87 EVIVPHQAVTNKQEIKAKIKKIRARGCTNLSGG-------WLLGCSQVKSHLSTDKLNR- 138 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSF 345 ++ +TDG N+ ++ + L E ++G + +G + + L A+ +F Sbjct: 139 --VLLLTDGLANIGERKPEILLKTAAEKAQQGIVTTTLGFGSNFNEDLLIGMADAAGGNF 196 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIWYD 373 Y +++P D F HI + + + + Sbjct: 197 YFIQSPDDSADVF-HIEMESLLSLVTQN 223 >gi|88702489|gb|ABD49099.1| integrin alpha E [Mus musculus] Length = 1167 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 381 >gi|71274669|ref|ZP_00650957.1| conserved hypothetical protein [Xylella fastidiosa Dixon] gi|71164401|gb|EAO14115.1| conserved hypothetical protein [Xylella fastidiosa Dixon] Length = 828 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 55/184 (29%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM + +++D ++NA L+++ Sbjct: 27 PEKAAIGMQMAPSVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 86 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F + + L G++ L+ + F T P A Sbjct: 87 VDIMLAGFGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA- 145 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ FMTDGE + + Y I I Sbjct: 146 ---------------APSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGINI 190 Query: 328 RVIR 331 + Sbjct: 191 DLAN 194 >gi|332237897|ref|XP_003268143.1| PREDICTED: integrin alpha-10 isoform 1 [Nomascus leucogenys] Length = 1175 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + Q L D++IVLD S S+ + +++ Sbjct: 141 GSSVFSSGICARVDASFQPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 195 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 196 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGR 249 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ ++ +L C Sbjct: 250 ETKTAQAIMVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEQLPAALKACEAG 303 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 304 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 360 Query: 364 DIVT 367 I Sbjct: 361 RIFG 364 >gi|319956033|ref|YP_004167296.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418437|gb|ADV45547.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 335 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 75/197 (38%), Gaps = 26/197 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++ LD+S SM S D +++ A + L+ + Q GL+ F+ Sbjct: 84 QGITLLTALDISGSMRSK-DRYPNRLEFAKVKLKQFLDALPGD-------QVGLMAFARN 135 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L++ + +++ ++ +T + A + R + +K Sbjct: 136 AFVLAPFTGDTATLKQIVDGVNEDYINMAATDFVSLA------EEAARLLAKKK----EK 185 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V TDG D Q+L EA K+RG +YA+ + + L P + Sbjct: 186 ILVVFTDG------GDPQALKGFKEALKERGITLYAVLVGTKKGAPVLDRRGRPMTKRDG 239 Query: 349 ENP-HSMYDAFSHIGKD 364 + +A I ++ Sbjct: 240 TIAITQLNEALGKIARE 256 >gi|226823202|ref|NP_032425.2| integrin alpha-E isoform 1 [Mus musculus] gi|56206400|emb|CAI24788.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIVH 381 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 63/207 (30%), Gaps = 35/207 (16%) Query: 165 NSQTDARLDMMIVLDVSRSME----------SFFDSSITKIDMAIKSINAMLEEVKLIPD 214 N Q + ++ ++D S SM+ + KID+A +++ + Sbjct: 401 NRQQRPDIALVFIIDKSGSMDACHCNGGDMAAREGGGTRKIDIAKEAVAQAAAVLGKDD- 459 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G+VTF + L+ S + + + G TN G+ AY Q+ Sbjct: 460 -----KLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQ-TNVVSGMNAAYEQLRQ 513 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +H + +TDG +T K G + + Sbjct: 514 SDAKIKHA-----------ILLTDGWGHATDIGS----IAENMNKDGITLSVVAAGNGSD 558 Query: 333 HEFLRACA-SPNSFYLVENPHSMYDAF 358 + R +Y + F Sbjct: 559 NALQRYAELGGGRYYPARVMEEVPQIF 585 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 70/175 (40%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ VLD+S SM K+ +++ +L +++ + ++ FSN+I Sbjct: 296 VVFVLDISASM------VGAKLQQTREALVTILNDLRPQD------RFNIIGFSNRIKMW 343 Query: 231 -EEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + + + +LS G T+ L+ A + + ED + Sbjct: 344 KDHLLPVTPDNIRNGKIYMYHLSPTG-GTDINGALQAAIKLL---NNYVAQNDIEDRSVS 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 +I+F+TDG+ + +L + K+ ++ +GI + L + Sbjct: 400 -LIIFLTDGKPTF--GETNTLKILSNTKEATRGQICIFTVGIGDDVDFKLLEKLS 451 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 71/209 (33%), Gaps = 36/209 (17%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKI 195 + + P +++ + +++ ++++LD S SM I + Sbjct: 248 LGAMVLSHPPTANEDGTLLLLASPTIETTPNKTPPAKTVVLILDRSGSMS---GKKIEQA 304 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYL 250 A+K + L + L L+ + + +E F + R I+ + Sbjct: 305 RAAMKFVVENLNQDDLFN---------LILYDDTVEMFKPELLRCNAENRAEALRFIEGV 355 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G ST+ GL+ I D + ++F+TDG S + ++ L Sbjct: 356 RP-GGSTDIDQGLRAGLKLI------------ADESRPNYVIFLTDGLPTSGETNE--LK 400 Query: 311 YCNEAKKR---GAIVYAIGIRVIRSHEFL 336 A+ A ++ G+ + L Sbjct: 401 IAEAARAANPLKAKLFVFGVGYDVNARLL 429 >gi|258507439|ref|YP_003170190.1| pilus specific protein [Lactobacillus rhamnosus GG] gi|257147366|emb|CAR86339.1| Pilus specific protein, ancillary protein involved in mucus-adhesion, contains von Willebrand factor (VWF) domain [Lactobacillus rhamnosus GG] gi|259648793|dbj|BAI40955.1| putative cell surface protein [Lactobacillus rhamnosus GG] Length = 895 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 75/242 (30%), Gaps = 16/242 (6%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + NG + N + +N N ++ L P N Sbjct: 45 TDANGTYPTNSWQVTGQQNVINQRGGDQVSGWDNNTIWNGDATDTTNSYLKF-GDPNNPD 103 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 Y I ++ ++ Q +D+++V+D+S SMES Sbjct: 104 YQIRKYAKETNTPGLYDVYLNVKGNKQ-------------QNVKPVDIVLVVDMSGSMES 150 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + + L ++ + N V GL+ FS+ + L + Sbjct: 151 NRWG-TNRAGAVRTGVKNFLTSIQNA-GLGNYVNVGLIGFSSPGYIGGKSGYISVKLGKA 208 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + ++ +P + ++ N + + KK+++ +TDG + E Sbjct: 209 GNASQQQAINGALSPRFQGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPTFSNEVI 268 Query: 307 QS 308 S Sbjct: 269 NS 270 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 96/279 (34%), Gaps = 33/279 (11%) Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI--- 145 F ++ + N + ++ S ++ I + Y+ P T + Sbjct: 246 FSIDVDKASYSNVRRMINNGQVIASDAVKIEEMINYFNYNYP-QPTDDHPFSINTEVTDT 304 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW+ ++ + + + + + ++ ++DVS SM S K+ + + + Sbjct: 305 PWHNKTQLVRIGLQGKSYAD-KDLPASNLTFLIDVSGSMSSH-----NKLPLLKSAFKLL 358 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + +++ V+ VV +G + + + L + G ST G+K Sbjct: 359 VNQLREKDKVSIVVYAGAAG----VVLEPTSGNNKEKIISALNNL-QSGGSTAGGAGIKL 413 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY ++ + ++ N ++ TDG+ N+ D E +K G + Sbjct: 414 AY-KLAEKNFKKKGNNR--------VILATDGDFNVGASSDNDMKTLIEEKRKSGVFLSV 464 Query: 325 IGIRVIRSHEF-LRACAS--PNSFYLVENPHSMYDAFSH 360 +G + L A + ++N F Sbjct: 465 LGFGYGNYKDSKLETLADKGNGNHAYIDNMQEAQKVFGK 503 >gi|255537858|ref|XP_002509994.1| protein binding protein, putative [Ricinus communis] gi|223550695|gb|EEF52181.1| protein binding protein, putative [Ricinus communis] Length = 767 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 66/176 (37%), Gaps = 32/176 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM K+ M +++ ++ + + +V FS+ Sbjct: 355 RAPIDLVTVLDVSGSMTG------AKLQMLKRAMRLVISSLGSAD------RLSIVAFSS 402 Query: 229 KIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L +H +R I L G T+ L+ A + R+ N Sbjct: 403 VPKRLLPLRRMTAHGQRAARRIIDRLV-CGQGTSVGDALRKATKVL----EDRRERNPVA 457 Query: 285 ANYKKIIVFMTDGEN----LSTKEDQQSLYYCNEAKKRGAIV--YAIGIRVIRSHE 334 + I+ ++DG++ S+ + + + N + + ++ G + Sbjct: 458 S-----IMLLSDGQDERVQTSSVNQRHTSGHINSTRFAHIEIPVHSFGFGQSGGYS 508 >gi|115623666|ref|XP_789748.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960627|ref|XP_001186460.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 846 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 64/173 (36%), Gaps = 18/173 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM K+D ++ +L++V+ I +N V+ V + Sbjct: 339 VVFVIDVSGSMRG------RKMDQTKRAFTTILDDVRPIDRINIVLFESNVRVWRSNQMV 392 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 ++ + + +S G TN GL A + + +H N E I+ Sbjct: 393 EATGDNIAAAKNHVNDISA-GGGTNLYDGLTNAVDLLM------EHGNGEAMPL---IIM 442 Query: 294 MTDGENLSTK--EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +TDG+ S + + ++++G FL + N Sbjct: 443 LTDGQPTSGSVTSTSEIIKRITNLIDGRLSLFSVGFGNGVDFSFLEKLSLSNQ 495 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 65/166 (39%), Gaps = 25/166 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ + +D+S SM + D ++ID A + +++ + G++ F++ Sbjct: 79 DIFLAIDLSESMNA-TDVVPSRIDRAKNELQGLIDRFSAD-------RIGIILFNSNAYL 130 Query: 233 FFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + +++ I L ST+ +P L+ ++ + N K Sbjct: 131 LTPLTFDTENIRNTIGNLKTHMIDKGSTDFSPMLEMINEKL----------SVGTQNRGK 180 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + +TDGE ++Q K+ ++ +G+ + + Sbjct: 181 VAIVVTDGETHYQIDEQ----LAKRLKQNNIHLFWLGVGTLGGGKI 222 >gi|301789441|ref|XP_002930137.1| PREDICTED: von Willebrand factor-like [Ailuropoda melanoleuca] Length = 2813 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/307 (14%), Positives = 96/307 (31%), Gaps = 35/307 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + IS Sbjct: 1587 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHTNVQELERISWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKV--------NSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD+ ++LD S S F Sbjct: 1647 APILIQDFETLPREAPDLVLQRCCSGEGLQPPTLAPTPDCSQPLDVALLLDGSSS---FP 1703 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRK 246 S ++ K+ + + P + Q ++ + + + + +HL Sbjct: 1704 ASYFEEMKSFAKA---FISRANIGP---QLTQVSVLQYGSTTTAAVPWNVAYEKAHLLSH 1757 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G ++ L YA + H A+ K +++ +TD Sbjct: 1758 VDLMQREGGLSHIGDALDYAVRYVTS----EVHGARPGAS-KAVVILVTD------VSAD 1806 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN---SFYLVENPHSMYDAFSHIGK 363 + A V+ +GI LR A PN + ++ + A + +G Sbjct: 1807 TVDAAADAATSNRVTVFPVGIGDRYDEAQLRRLAGPNAGSNVLRLQRIEDL-SAVATLGN 1865 Query: 364 DIVTKRI 370 K Sbjct: 1866 SFFHKLC 1872 >gi|326670660|ref|XP_003199262.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1404 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 110/327 (33%), Gaps = 33/327 (10%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 492 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 546 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 547 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 605 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 606 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 654 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 655 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 712 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++AIG R S E R P+ Sbjct: 713 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSY 765 Query: 345 FYLVENPHSM----YDAFSHIGKDIVT 367 V + + F++I K V Sbjct: 766 ALSVSDFTDLPSVQQQLFTNINKVFVA 792 >gi|326670658|ref|XP_003199261.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1823 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 110/327 (33%), Gaps = 33/327 (10%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 686 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 740 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 741 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 799 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 800 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 848 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 849 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 906 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++AIG R S E R P+ Sbjct: 907 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSY 959 Query: 345 FYLVENPHSM----YDAFSHIGKDIVT 367 V + + F++I K V Sbjct: 960 ALSVSDFTDLPSVQQQLFTNINKVFVA 986 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 50/310 (16%), Positives = 103/310 (33%), Gaps = 29/310 (9%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 1485 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 1539 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 1540 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 1598 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 1599 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 1647 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 1648 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 1705 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++ IG R S E R P+ Sbjct: 1706 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFVIGSRSSDSQELQRISQEPSY 1758 Query: 345 FYLVENPHSM 354 V + + Sbjct: 1759 ALSVSDFTDL 1768 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S D + + + M+E + + + + + +V +S E Sbjct: 5 DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYSRDPEA 54 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L I+ L G N+ L+Y + +F + + Sbjct: 55 HFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG----VPQ 110 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ ++ + + K+ G +++AIG R S E R P+ V Sbjct: 111 LLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSYALSVS 165 Query: 350 NPHSM----YDAFSHIGKDIVT 367 + + F++I K V Sbjct: 166 DFTDLPSVQQQLFTNINKVFVA 187 >gi|281343742|gb|EFB19326.1| hypothetical protein PANDA_020489 [Ailuropoda melanoleuca] Length = 2801 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/307 (14%), Positives = 96/307 (31%), Gaps = 35/307 (11%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + + + + + N ++I + + + V + + IS Sbjct: 1575 EHSFSVSQGDREQVPNLVYMVTGNPASDEIKRMPGDIQVVPIGVGPHTNVQELERISWPN 1634 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKV--------NSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + LD+ ++LD S S F Sbjct: 1635 APILIQDFETLPREAPDLVLQRCCSGEGLQPPTLAPTPDCSQPLDVALLLDGSSS---FP 1691 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRK 246 S ++ K+ + + P + Q ++ + + + + +HL Sbjct: 1692 ASYFEEMKSFAKA---FISRANIGP---QLTQVSVLQYGSTTTAAVPWNVAYEKAHLLSH 1745 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + + G ++ L YA + H A+ K +++ +TD Sbjct: 1746 VDLMQREGGLSHIGDALDYAVRYVTS----EVHGARPGAS-KAVVILVTD------VSAD 1794 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN---SFYLVENPHSMYDAFSHIGK 363 + A V+ +GI LR A PN + ++ + A + +G Sbjct: 1795 TVDAAADAATSNRVTVFPVGIGDRYDEAQLRRLAGPNAGSNVLRLQRIEDL-SAVATLGN 1853 Query: 364 DIVTKRI 370 K Sbjct: 1854 SFFHKLC 1860 >gi|332669282|ref|YP_004452290.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] gi|332338320|gb|AEE44903.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] Length = 538 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/272 (15%), Positives = 90/272 (33%), Gaps = 27/272 (9%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + ++ + D+ +T+ N + + I + Y Sbjct: 78 QEEYDDSPEQPFQDVTTSPLSTFASDVDTASYSNLRRQLRQGVEPE-GVRIEELVNYFDY 136 Query: 128 SISA-ISRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 A + P T PW + + M + V T +++ +LDVS S Sbjct: 137 DYPAPEPDAEDPFTVTTQVADAPWAPGHQ-LAMIGVQATDVKPTTRGN-NVVFLLDVSGS 194 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M+ K+ + S ++E++ V+ V +G S+++ + + Sbjct: 195 MDE-----PNKLPLLADSFALLVEQLDEDDTVSIVTYAG----SDQVLADSVPGDRRGEI 245 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 ++ L G ST GL+ AY + + N ++ TDG+ N+ Sbjct: 246 VDILRELRA-GGSTGGARGLETAYE-LAAKNFVEGGNNR--------VILATDGDFNVGP 295 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +Q E + G + +G + + Sbjct: 296 STPEQLTELIEEHARTGVYISVLGFGMGNLKD 327 >gi|326670662|ref|XP_003199263.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1024 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 110/327 (33%), Gaps = 33/327 (10%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 686 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 740 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 741 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 799 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 800 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 848 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 849 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 906 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++AIG R S E R P+ Sbjct: 907 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSY 959 Query: 345 FYLVENPHSM----YDAFSHIGKDIVT 367 V + + F++I K V Sbjct: 960 ALSVSDFTDLPSVQQQLFTNINKVFVA 986 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 71/202 (35%), Gaps = 26/202 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S D + + + M+E + + + + + +V +S E Sbjct: 5 DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYSRDPEA 54 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L I+ L G N+ L+Y + +F + + Sbjct: 55 HFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG----VPQ 110 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ ++ + + K+ G +++AIG R S E R P+ V Sbjct: 111 LLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSYALSVS 165 Query: 350 NPHSM----YDAFSHIGKDIVT 367 + + F++I K V Sbjct: 166 DFTDLPSVQQQLFTNINKVFVA 187 >gi|304441881|gb|ADM34176.1| martilin [Aplysia californica] Length = 348 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 72/205 (35%), Gaps = 46/205 (22%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++ IVLD S S+ + + L++ Sbjct: 2 CEQEPIELGIVLDSSVSI------DRKDFKKGKEFLQDFLQQ------------------ 37 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + G L I + + G T++ G++Y Q R Sbjct: 38 -------FEIGGGKDELIAAIGNIPHRLGRYTSTGEGIEYMATAQLASQFTRSWA----- 85 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 +++ + +TDG +++E ++ +A++ G ++AIG+ ++ E + + Sbjct: 86 --ERVGLVITDG---NSQESAKTKEAARQARESGITMFAIGVGNVKDQELVNIAGDASRV 140 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 + V++ + +I + + + Sbjct: 141 FKVDSYDEL----ENIKQTLAHQTC 161 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 77/205 (37%), Gaps = 30/205 (14%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 ++ L F F+ ++ ++ T +D++I LDVS+SM + D Sbjct: 57 TFRFLLIFIAFLFGAIGLANLQ----KGSRMEKITRKGVDVVIALDVSKSMLA-GDVKPD 111 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 ++ A + I+ + +++ + GLV F+ L S + + +S Sbjct: 112 RLTRAKQLISKLADKLDND-------RVGLVVFAGNAYLQMPLTIDYSAAKMYLTTVSPD 164 Query: 254 G---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 T ++ A N F+ + + K ++ ++DGE+ D+ ++ Sbjct: 165 MIPTQGTAIGQAIQVA-NDAFNKKERKH----------KSLIIISDGEDH----DEAAIS 209 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEF 335 A + G ++ IGI Sbjct: 210 KARAAFEDGVVINTIGIGSPTGSPL 234 >gi|192359982|ref|YP_001981471.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686147|gb|ACE83825.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 2103 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 51/328 (15%), Positives = 120/328 (36%), Gaps = 42/328 (12%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMI-DRSLVH-AATQIMNEGNGNNRKKLKGGDILCR--- 85 + G I++ ++ K + ++I D +++ + ++++ ++ R Sbjct: 719 IISAGDDIDLGNVLMRKPGIDALIPDVAVLAIDRSALVSDVASFTASGTIDVTLINRGNA 778 Query: 86 -IKNTWNMSFRNELRDNGFVNDIDDIVR------STSLDIVVVPQ-NEGYSISAISRY-K 136 ++ + + +L +NG + D ++ S++L + V N ++S + Sbjct: 779 SVEIPFIIYAFEDLNNNGVYDAADTLLGQVSPVISSALPLTVDGTLNTSIAVSGNLSFRD 838 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKI 195 P+ ++S+ + S +D+ + +D S S+ Sbjct: 839 APISILLDATNVLVELSKANNLSSTAGLCSNQQKPNVDLALCMDSSGSV---------SA 889 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLS 251 + + ++ + + V+ + F++ ++ + E V + KI+ + Sbjct: 890 ANFQLQLEGTAQAIENVVPRDGTVRVSALQFASAATVELNPTIIEEDNVQAIADKIRAIR 949 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 K G T+ + A I N A+ +II TDG++ ++ S Sbjct: 950 KTGGGTSIHACIDSATTLI---------ANALPASSMQIIDVSTDGQSTQSQAVAAS--- 997 Query: 312 CNEAKKRGAIVY-AIGIRVIRSHEFLRA 338 N A+ G V AIG+ S L A Sbjct: 998 -NRARAAGIDVLNAIGVGTGISTALLNA 1024 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 68/195 (34%), Gaps = 30/195 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L++ +VLD S SM ++ + +++++ LV F+++ Sbjct: 44 PLNLCLVLDRSSSMRG------ERLMQVKDAAARIVDQLGQDD------YFSLVVFNDRA 91 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + L+ I + G T G+ A ++ R + Sbjct: 92 DVVIPAQRAIKKADLKAAIAQIEAAG-GTEMAQGMALALQEV-----QRPFLTRGISR-- 143 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS---F 345 I+ +TDG ++ + + + RG + A+GI + + L + + Sbjct: 144 --IILLTDGRTYG--DESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENSRAQ 199 Query: 346 YLVENPHSMYDAFSH 360 Y+ + F+ Sbjct: 200 YIA-TAQDVVKVFAD 213 >gi|54112390|ref|NP_000713.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Homo sapiens] gi|109658756|gb|AAI17469.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|109659118|gb|AAI17471.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|119597396|gb|EAW76990.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] Length = 1091 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 79/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ N+ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFNKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/183 (18%), Positives = 70/183 (38%), Gaps = 25/183 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P + K +D+++ LD S+SM + D ++++ A + +L+E+K Sbjct: 77 LAQPQCGT-KSELTKRRGIDVVVALDASKSMLA-RDIQPSRLERAKLELTTLLDELKGD- 133 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 + GLV F+ L S ++ ++ + TN L+ + + Sbjct: 134 ------RVGLVVFAGDAFIQSPLTSDYSAVKLFLRAVDPEVMPQGGTNVGAALRLSRQVL 187 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 N + + ++++V +TDGE+L K G V A+G+ Sbjct: 188 ---------ENADRGSKERVVVLLTDGEDL----VGDVAEATEALKDSGVQVLAVGVGSE 234 Query: 331 RSH 333 Sbjct: 235 SGE 237 >gi|167045536|gb|ABZ10188.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_APKG10H12] Length = 356 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 55/165 (33%), Gaps = 25/165 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + IVLD S SM K+ A ++++ L L+ + + L F Sbjct: 91 NERVPVSLGIVLDTSESM------DGQKMAAAQRALDRFL--FDLLGPDDEIF---LYRF 139 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E L R I+ + G + ++D + Sbjct: 140 DYTPELLQDWTVDRIRLSRAIRDIRPRGNTA------------LYDAVAESVPRVAGGQH 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +KK ++ ++DG + ++ A++YAIGI Sbjct: 188 FKKALLIISDG--NDNNSETDVRELRELIRESEALIYAIGIDGPS 230 >gi|126306639|ref|XP_001365694.1| PREDICTED: similar to Von Willebrand factor A domain containing 1 [Monodelphis domestica] Length = 593 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 69/186 (37%), Gaps = 25/186 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ + S + + + +++ + L P VQ+ +V ++ Sbjct: 14 DLLFLLDSSASVSYYEFSRV------KEFVGQLVQPLPLGPGS---VQTSMVHVGSEPTV 64 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S Q I+ + TN+ L A Q+F + + +K+ Sbjct: 65 EFPFDRHHSGAAAQEAIQAAKQLMGDTNTGLALALAKKQLFTTE------AGARSGVRKV 118 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLV 348 +V++TDG++ K G V+ + + L A AS P Sbjct: 119 LVWVTDGDSSD-----DVQAPMQVLKDLGVTVFIVSTGRGNFLD-LSAAASQPPEKHLRF 172 Query: 349 ENPHSM 354 + + Sbjct: 173 VDVDDL 178 >gi|3024063|sp|P97280|ITIH3_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|1694692|dbj|BAA13940.1| inter-alpha-trypsin inhibitor heavy chain 3 [Mesocricetus auratus] Length = 886 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 78/192 (40%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D+S SM KI ++ +L+++K +N ++ +G+ T+ + + + Sbjct: 282 IVFVIDISGSMAG------RKIQQTRVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQ 335 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ +S G+ TN GL + D + QH E + II+ Sbjct: 336 ATP--ANLEEARTFVRSISDQGM-TNINDGLLRGIRMLTDARE--QHTVPERSTS--III 388 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ +A + +Y +G ++ FL A N Y Sbjct: 389 MLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIY 448 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 449 EDSDANLQLQGF 460 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 78/197 (39%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ ML+ + VN + +F Sbjct: 252 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVHEMLDTLSDDDYVN------VASF 299 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + + ++ + G +T+ G +YA++Q+ + R + Sbjct: 300 NEKAKPVSCFKHLVQANIRNKKVFKEDVQGMVAKG-TTDYKAGFEYAFDQLQNSNITRAN 358 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N K +++G A Sbjct: 359 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--KTVRVFTFSVGQHNYDVTPLQWMA 409 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 410 CANKGYYFEIPSIGAIR 426 >gi|282900568|ref|ZP_06308510.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194368|gb|EFA69323.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 426 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 81/232 (34%), Gaps = 39/232 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + R + + I++ T L++ ++LD S SM + M I ++ +++++ Sbjct: 21 SQRQLEVSISAIAD-ELDTSLPLNLCLILDKSGSMHG------ESMSMVINAVEQLIDQL 73 Query: 210 KLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + +V F S ++ + ++ ++ K G T GL Sbjct: 74 QSGD------RIAIVAFAGSGEVIIPNQIIKDPKTIKSQLHNKLKAGGGTIIGEGLSLGI 127 Query: 268 NQIF-DMQGMRQHCNTEDANYKKIIVFMTDGENL----------STKEDQQSLYYCNEAK 316 ++ +G H +TDG ++Q+ L +A Sbjct: 128 TELLKGSKGACSHAF-----------LLTDGYGDNGFKIWRLQIGPNDNQRCLELAQKAA 176 Query: 317 KRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIV 366 K + + G + + L A + +E P + + F+ I K I Sbjct: 177 KLNLTINSFGFGDEWNQDLLEKIADAGGGTLAYIETPQNAIEQFNRIFKRIQ 228 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 55/134 (41%), Gaps = 20/134 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A LD+++V+D S SM +I K ++ ++ + +N+ G V +S+ Sbjct: 335 APLDLVLVVDWSGSMNDN-----NRIGEVQKGVDRFVDTLAESGITDNIH-MGYVGYSSD 388 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + G ++ IK ++ T + L+ A N + G Sbjct: 389 GYKNDSVAMGPFDSVKNAIKTITPSSTTGGTFTQKALRDAGNMLATPNG----------- 437 Query: 287 YKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 438 HKKVIVLLTDGVPT 451 >gi|260820612|ref|XP_002605628.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] gi|229290963|gb|EEN61638.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] Length = 168 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 52/137 (37%), Gaps = 15/137 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+ V+D S S+ + + + K + M++ + + G+V F+ Sbjct: 2 TPVDLAFVIDGSASVGPL------QFEKSKKFVRDMVDGFNI---GAAQTRVGVVQFAWM 52 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ F L + + L+ I + G T L + ++F E + Sbjct: 53 VQAEFNLGDYLDGTDLRNAIARIRYMDGPGTEIGKALVFTKRRLFS---ELYGARPETQD 109 Query: 287 YKKIIVFMTDGENLSTK 303 +I++ +TDG + Sbjct: 110 VPRIVILITDGRSSPES 126 >gi|156741691|ref|YP_001431820.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233019|gb|ABU57802.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 698 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 82/236 (34%), Gaps = 37/236 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA-----------IKSIN 203 P T+ + D+++++D S SM+ ++ ++ A I Sbjct: 33 TSPATAQQTGQALDSGNSDVVLIIDNSGSMKQNDPQNL-RLAAANLFIDLSDPRDKIGIV 91 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTP 261 + + ++ N+V+ G ++ + + L+ + L G T+ Sbjct: 92 VLSDRMRTRSLTKNLVRIG-----SRQD--------IDELKGLVDALRNETKGQETHMGT 138 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA- 320 L AY+ + G + N ++ +V ++DG + ++ ++R Sbjct: 139 ALDLAYDLLDATPGSNRGANQ-----RQFVVLLSDGLPTGVGQRERVDQAVQRFRERRYW 193 Query: 321 IVYAIGIRVIRSHEFL-RACASP--NSFYLVENPHSMYDAF-SHIGKDIVTKRIWY 372 +++I + +L +SP + + + D + + + I Y Sbjct: 194 KIFSIALGDEADPAYLDEKVSSPSGGQVVVARHAGELLDRYLDVYARAGDDRYINY 249 >gi|268558482|ref|XP_002637231.1| Hypothetical protein CBG18899 [Caenorhabditis briggsae] Length = 615 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 75/192 (39%), Gaps = 21/192 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+M ++D S S+ I D+ I +L++V + P + + +V ++ Sbjct: 408 PPIDLMFLVDTSSSIG------INNFDIQKNFICEILKDVDVAPGRS---RISMVQYAQD 458 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA----YNQIFDMQGMRQHCNTE 283 F + + ++R + LS G +T + L +A Y++ + ++H Sbjct: 459 PSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKRHQFLP 518 Query: 284 DANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + ++ ++DG D++S+ + ++A+ R + + Sbjct: 519 TPKHDRLQVLCLVSDGY-SDDSADKESVNLHDRLH---VKIFAVVTRSFNKDKLVPITRF 574 Query: 342 PNSFYLVENPHS 353 S + V S Sbjct: 575 DGSVFTVHQRES 586 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 78/231 (33%), Gaps = 29/231 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ + + R +M+V+DVS SM S D + +I A ++ +E Sbjct: 73 VIISLAGPISETKVARNRATVMLVVDVSLSM-SATDVAPDRITAAKEAGQEFVE------ 125 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ + + GLVTFS + + R L + + G A Sbjct: 126 NLPDDLNIGLVTFSGRARTAVSPTTNHDTVNR---ALQAAELDQATATGDAIAAALDAIN 182 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIGIR-- 328 Q + IV ++DG+ +E + EA K G V I Sbjct: 183 QFSDSVQGGGEGAPPATIVLLSDGKQTVPQELDDPRGAYTAAAEAAKAGVPVNTISFGTA 242 Query: 329 ------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 V + LR A + F+ + + DA+ + DI Sbjct: 243 QGAITVQGELIPVPNDDDSLREIARRTKGEFFSAGSLEQLRDAYGSLEDDI 293 >gi|301785912|ref|XP_002928373.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Ailuropoda melanoleuca] Length = 1188 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 86/215 (40%), Gaps = 26/215 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + + A ++ I+LD S S++ A I+ M+ + Sbjct: 200 TQEDEEEEAAGTEIAIILDGSGSIDP------PDFQRAKDFISNMMRN---FYEKCFQCS 250 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV + I+ F L + +++ +++ G T + +++ + IF Sbjct: 251 FALVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPS---- 306 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHE 334 + N K++V +TDG+ + ++ + K +G +AIG+ +++ Sbjct: 307 --HGSRKNASKVMVVLTDGDIFEDPLNLTTVI--SSPKMQGVERFAIGVGKAFENNKTYN 362 Query: 335 FLRACAS--PNSF-YLVENPHSMYDAFSHIGKDIV 366 L+ AS + + + V N ++ S + ++I+ Sbjct: 363 ELKLIASDPDDRYAFKVTNYTALDGLLSKLQQNII 397 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 80/210 (38%), Gaps = 27/210 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + V + + +++ +D+S SM+ + + +A + A++ ++ L Sbjct: 72 IEQQVVKVVNPAAVKPKSISLVLTIDISESMQKQY------MPLAKNAAAAIVNKLPL-- 123 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 ++ + + +F++ L + I+ L G + YN+ F Sbjct: 124 ---DISECAVTSFNDVSFINTDFTRDRFKLLQSIQTLVPAGGT---------DYNKGFIK 171 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + ++K+++F+TDG D +AK GA VY I + + Sbjct: 172 SNAGGLDILKKGLHEKVLIFLTDG-----YGDVNPTEIIQQAKSIGAKVYVITLGMSAPE 226 Query: 334 EFLRAC-ASPNSFYL-VENPHSMYDAFSHI 361 E R A+ S+Y V + + + I Sbjct: 227 ELKRIVTATNGSYYENVISEQEINAVYMSI 256 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 66/221 (29%), Gaps = 45/221 (20%) Query: 168 TDARLDMMIVLDVSRSMES----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 M++ +D+S SM + ++ A I L+ + + GL Sbjct: 92 AQQARQMLLAVDLSGSMSEVDMMLGMQPVDRLTAAKAVIADFLQR-RSGD------RVGL 144 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F + L +S ++ +++ G T + A ++ Sbjct: 145 LVFGQRAYMLTPLTLDLSAVREQLRDTVAGLAGRETALGDAIGLAVKRLRTQPEG----- 199 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 ++++V +TDG N Q L A VY I Sbjct: 200 ------QRVLVLLTDGVNT--TGVLQPLKAAELAAAEQVRVYTIAFGGDGGGFSLFGVQV 251 Query: 333 --------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 LR A + F+ + + + ++ + + Sbjct: 252 PVQGDEVDEATLRKVAEITGGRFFRAHDANQLAGIYAELER 292 >gi|71900686|ref|ZP_00682810.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] gi|71729565|gb|EAO31672.1| von Willebrand factor, type A [Xylella fastidiosa Ann-1] Length = 414 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 59/184 (32%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM+ + +++D ++NA L+++ Sbjct: 140 PEKAAIGMQMPPSVAVYFAIDLSGSMDDVGGNGRSRLDNMKTALNAALDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIK-----YLSKFGVSTNSTPGLKYAY 267 V LV F + + L G++ L+ + Y + F T P A Sbjct: 200 VDIMLVGFGDAPDHRQTLLRRNCTAQGIAELKSWVATRQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +N ++ F+TDGE + + Y I I Sbjct: 259 ---------------ASSNAVRVAFFITDGEPDPPSATLAQAARADVDQVAHLRCYGITI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAN 307 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/70 (25%), Positives = 31/70 (44%), Gaps = 4/70 (5%) Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRACAS-PNSFYLVENPH 352 G Q+L C K++ ++Y + + +++CAS P+ FYL E+ Sbjct: 400 GATKRDAPATQALALCEAMKEQDVVIYTVYFETTGAKFGKDLMKSCASDPDKFYLAEDRD 459 Query: 353 SMYDAFSHIG 362 + AFS I Sbjct: 460 GLKAAFSAIA 469 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 87/259 (33%), Gaps = 37/259 (14%) Query: 35 GMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF 94 G I+++ ++ + + +D S + A ++ + KL +I ++ + + Sbjct: 22 GSGIDLTSALNARSKMANALDASALKLAGKL-------SVAKLSDDEIQAGLEKMFTANL 74 Query: 95 RN-ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 +L+ + + L+ V + + K Sbjct: 75 SRFDLKASAL----------SELEFEVDWTKGILDVWSDVSVKTHF----IGLGGLGPEK 120 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + +TS V SQ L++ +VLDV+ SM D I+ + A + + L Sbjct: 121 LDVGVTSRVSFASQ---ALELALVLDVTGSM----DGDISSLKEASQLLFEALVPENAGR 173 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + ++ +V +S + W V++ Q N AY ++ Sbjct: 174 H-DQRIRVSIVPYSQGVNLGAK-AWKVTNRQSDSSNCVATRGGPN---AFTDAY---YNY 225 Query: 274 QGMRQHCNTEDANYKKIIV 292 +G R + ++ Sbjct: 226 RGARSNFFVAPGALDYFVI 244 >gi|301623011|ref|XP_002940816.1| PREDICTED: integrin alpha-M-like [Xenopus (Silurana) tropicalis] Length = 907 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 79/222 (35%), Gaps = 28/222 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V +N LD++ ++D S S+ + + ++ ++E+ K Sbjct: 214 STSRCQRVPINECPVRSLDIVFLIDGSGSIYP------DQFQEMLTFVSKVIEDFKGTDT 267 Query: 215 VNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++ +S+ + F ++ +++ +T + ++ +Q+F Sbjct: 268 L-----FAIMQYSSWFVLHFDFNKFLATKDHNSLVRNINQLKGATWTATAIQKVLSQLFI 322 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +K+++ +TDGE D EA+++G + +AIG+ S Sbjct: 323 PS------RGARDGSQKLLIVITDGEKYGDSLDYS--IPIAEAERKGIVRFAIGVGRAFS 374 Query: 333 H-----EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 E + + S Y V + ++ + I Sbjct: 375 EDTAYSELITIASQPSDKYVYRVGDFSALSKFRKDLQDKIFA 416 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLVENPHSMYDA 357 N +DQ + C+ K +G IVY++G + + L CAS + + + DA Sbjct: 476 NYRVAKDQHTKTICDITKDQGVIVYSVGFEAPSAGIKVLEDCASSPAHFFDVEGLEISDA 535 Query: 358 FSHIGKDIVTKRIW 371 FS I I R+ Sbjct: 536 FSSIATSIRQLRLT 549 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 79/231 (34%), Gaps = 43/231 (18%) Query: 7 NIRNFFYNYKGGMT-ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +R+F G + + FL + V G+ +++ + +T L +DR+++ AA Sbjct: 22 KVRSFLREEDGVLAKPMIGTFL-AMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAA--- 77 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 L + + L D + S++ + E Sbjct: 78 ------------DLDQSLDADAVVLDYLTKAGLEQYYSDPDDQKGLGYKSVEATIDTDFE 125 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y + + ++ + S + + +++ +VLD+S SM Sbjct: 126 AY-----------------LLKFAGGDNMSLYANSRAE---EIIGSVEISMVLDISGSMN 165 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 S ++ + + + ++ DV+N+ ++ ++ ++ L Sbjct: 166 SG-----NRLVNLQAAAKSFVTQITSNTDVSNLS-ISIIPYATQVNAGEKL 210 >gi|47218505|emb|CAF97239.1| unnamed protein product [Tetraodon nigroviridis] Length = 1060 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 61/174 (35%), Gaps = 20/174 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D++ ++D S S+ I+A++ K V+ G+ F + Sbjct: 460 QADIVFLVDESWSVGQN------SFSHVKDFISAIITSFKDSVVGTEGVRFGVTVFGDVP 513 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L S + R I+ L G S L + +F R H Sbjct: 514 KMRIALTDYSSQEEVLRAIRDLPYEGRSRRIGDALTFLVQHVFSPVIRRDHG-------P 566 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 KI V +T+G + D + + G ++A+G+ + E R + P Sbjct: 567 KIAVLITNGRSDDPV-DAAARLVAD----SGISLFAVGVGGADASELRRMVSEP 615 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 68/208 (32%), Gaps = 9/208 (4%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 KG +L ++ L I+ V I+++H KT L + +D + + A + + Sbjct: 20 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVD 79 Query: 73 NRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + +SF + F +D+ V + S + ++ Sbjct: 80 QAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVA 139 Query: 131 AI--SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + + S + S+ + + M D + ++E + Sbjct: 140 VTDMGISQYLSAVFGIVKNVSAS---AVAGRSAAIAYTCNLTPIAMCG--DPNGTVEDAW 194 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVN 216 D + +++ E+K+ N Sbjct: 195 GYRPPGYDPNVDMDPSLVHELKVGDQNN 222 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 68/208 (32%), Gaps = 9/208 (4%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 KG +L ++ L I+ V I+++H KT L + +D + + A + + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVD 66 Query: 73 NRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + +SF + F +D+ V + S + ++ Sbjct: 67 QAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVA 126 Query: 131 AI--SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + + S + S+ + + M D + ++E + Sbjct: 127 VTDMGISQYLSAVFGIVKNVSAS---AVAGRSAAIAYTCNLTPIAMCG--DPNGTVEDAW 181 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVN 216 D + +++ E+K+ N Sbjct: 182 GYRPPGYDPNVDMDPSLVHELKVGDQNN 209 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 62/175 (35%), Gaps = 29/175 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + ++ L++ ++D SRSM + D +++ A +++ +L+ +K + Sbjct: 72 LLDEDTVEVKGLNIYALIDTSRSMMAE-DVYPNRLEAAKRTLENLLQGLKGD-------R 123 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G + FS+ L S + I L G T L+ A ++ Sbjct: 124 IGFIPFSDSAYIQMPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ ++DG + K + K V++IGI Sbjct: 184 -----------KTIIILSDGGDFDEKSLKFV-------KDNKMNVFSIGIGTEEG 220 >gi|220941746|emb|CAX15447.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 2026 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 74/196 (37%), Gaps = 28/196 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V + R D++ ++D S S+ + + I +++++ + PD Sbjct: 266 TPAVPRDPLNLGRKDIIFLIDGSDSVG------QSGVAHIRDFILKVVDQLDVRPDQ--- 316 Query: 219 VQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGV-STNSTPGLKYAY-NQIFDMQ 274 V+ LV + + + F L + IK L G + +KY N++ Sbjct: 317 VRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMGGRGADLAEAIKYVIRNELQASA 376 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRS 332 G+R A + +V +T G + S S+ +G+ V IGI S Sbjct: 377 GVRL------AQASQHLVVLTGGRSTSDVSTYGSIL-------KGSRVNCIGIGAENADS 423 Query: 333 HEFLRACASPNSFYLV 348 + ++ S + V Sbjct: 424 RQLIQIATSSDDVLQV 439 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 72/186 (38%), Gaps = 24/186 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D + + + F +I M++ V + + + V+ +V +S+ + Sbjct: 80 DVVFLIDGTSKIRNEFP-----------AIRDMVQRVVEKLDVGLDNVRVSVVQYSDDPK 128 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 FLL + +++ I+ L G + N+ L+Y I+ R + + Sbjct: 129 LEFLLNEHSTKEEVRQAIRRLRNKGGNELNTGKALEYVSKTIY----QRSAGSRVEEGVP 184 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++ +T G K + N+ K A+G R + E SP Y + Sbjct: 185 QFLILVTGG-----KSNDDVSGPANQLKLSRVAPLAVGARDADAEELRLISFSPELAYTI 239 Query: 349 ENPHSM 354 + + Sbjct: 240 RDFQQL 245 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/272 (13%), Positives = 81/272 (29%), Gaps = 26/272 (9%) Query: 96 NELRDNGFVNDID-DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 + VN I + S ++ + + + S +P FI + S + Sbjct: 403 GSILKGSRVNCIGIGAENADSRQLIQIATSSDDVLQVPSFPNLPNIQNKFIARLSGSIVV 462 Query: 155 VMPIT-SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 PI D++ ++D S ++ + + +L + L Sbjct: 463 EPPIEIDETTPGLPQAKAADIVFLVDGSINLGRNN---------FKEVMEFILNLIDLFY 513 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQI 270 + +Q GL ++ + + F L + I G + +++ + Sbjct: 514 TERDRLQIGLAHYATDVTDVFYLNTYNNKDDIINAITRAEYKGGREIRTGSAIRH-VQKT 572 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ + +I++ +T G S K G VYA+G+ I Sbjct: 573 HFVKEKGSRKDEG---IPQILMVVTGGR-----SRDDSKSAALGLKASGVRVYAVGVGDI 624 Query: 331 RSHEFLRACASPNSFYL-VENPHSMYDAFSHI 361 L S + + + I Sbjct: 625 EDE--LNNLGSEATTVARASTFQELSELNEQI 654 >gi|187466178|emb|CAQ51888.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 381 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 53/139 (38%), Gaps = 22/139 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++++VLD S SM + K+ K+ + ++ + ++ +V F + Sbjct: 78 PPKEIVLVLDSSGSMADNY-----KLTNLKKAATDFITKMSTVKNLK----IAIVDFDTQ 128 Query: 230 IEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L V+ L+R I L+ G TN+ GL+ A + + + Sbjct: 129 ATIINKLTDVSSSTNVTALKRSINNLTA-GGGTNTGEGLRQAAYLLSNSSEQNPLASKN- 186 Query: 285 ANYKKIIVFMTDGENLSTK 303 I+FM+DGE Sbjct: 187 ------IIFMSDGEPTYYN 199 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 65/213 (30%), Gaps = 37/213 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D + + +V D S SM K+ + + + LV F Sbjct: 85 DDEPVAVGLVFDTSGSMGE-------KLQRSRMAAREFFHISN---PEDEFF---LVEFD 131 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L ++ + + G + + D + H Sbjct: 132 SSPRLVVPLTSDTGTIEDHLTFSRSHGSTA------------LLDAIFLALHEMKHSKKN 179 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI----RVIR---SHEFLRACA 340 KK ++ ++DG + ++ ++ + K+ ++Y+IG+ L + Sbjct: 180 KKALLIISDGGDNHSRYSEK--EVSSVVKESDVLIYSIGVFGGGGSPEEAGGPGLLSKVS 237 Query: 341 --SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + + D IG ++ + I Sbjct: 238 EQTGGRLFEA-SAVELPDIAKKIGIELRNRYIL 269 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 76/221 (34%), Gaps = 49/221 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------- 226 + +VLDVS SM + +++ A ++ N +L+ V V+ G+ T Sbjct: 41 VELVLDVSGSMRARDIDGKSRMAAAKQAFNEVLDAVP------EEVRLGIRTLGADYPGD 94 Query: 227 --------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + ++ L+ + + + L+ G T P L A + R+ Sbjct: 95 DRKRGCKDTRQLYPVGPLD--RTEAKAAVATLAPTGW-TPIGPALLGAAEDLEGGDATRR 151 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAI--VYAIGIRVIRSH 333 IV +TDGE+ D C A++ +G + +G+ Sbjct: 152 ------------IVLITDGEDTCAPLD-----PCEVAREIAAKGIHLVIDTLGLVPDAKT 194 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 C A+ ++ V++ + D S + + Sbjct: 195 RTQLRCIAEATGGTYTSVQHTDELSDRVSQLVDRAADPVVT 235 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/229 (18%), Positives = 86/229 (37%), Gaps = 36/229 (15%) Query: 141 FCTFIPWYTNSR-HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----------SFFD 189 F T TN +++ + + N + +D++ VLDVS SM + Sbjct: 38 FPTIPRGQTNKDFQVLLHVEAPPAANLKGHVPIDVVAVLDVSGSMNDPVAAAAAASPESN 97 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL-----LEWGVSHL 243 +++D+ S+ ++ ++ + +V F++ ++E+ G S Sbjct: 98 LQASRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIA 151 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 +KI L G T P L+ A + + QG N+ I+ +TDG++ + Sbjct: 152 GKKIDRLQARG-GTALMPALEEAVKILDERQG-------GSRNHVGFILLLTDGDDTTGF 203 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRAC-ASPNSFYLVEN 350 + + AK V+ G+ L S ++ V++ Sbjct: 204 RWTRDAIHGAVAK---YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD 249 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/182 (23%), Positives = 64/182 (35%), Gaps = 22/182 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+++V+D S SM D I M V DV V+ GLVTFSN+ Sbjct: 187 LDIVVVVDSSLSMTK--DGLIEVAADLATVFQWM--NVSSGTDVGQFVRVGLVTFSNQAF 242 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ S+ L +++ + G S N L+ A + + Q R + T Sbjct: 243 VNGNLDDFTSYNSLVKRLFQMPYLGGSELNIESALQSASDIL---QSSRYYARTA----- 294 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---LRACASPNSF 345 I+ T N+ K+ G + + R L ASP Sbjct: 295 --ILLYT--SAYGEGGFTDPKAIANQIKESGTKIITVAFRQQPEGSLVEKLSHLASPGFS 350 Query: 346 YL 347 + Sbjct: 351 FA 352 >gi|317403331|gb|EFV83845.1| hemolysin-type calcium-binding region [Achromobacter xylosoxidans C54] Length = 1141 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 59/156 (37%), Gaps = 8/156 (5%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEE 208 + +++ + ++ IV+D S SM + +T++ + I ++ + Sbjct: 609 SGDDVILGDKGGTVTTVEPGKNYNIAIVVDTSGSMSEASGTKGLTRMQLTIDALKNLANT 668 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 +K + NV G + ++ L V L + I+ LS G TN A Sbjct: 669 LKGHDGIVNVALIGFESTASTKYTINGLNASNVGDLIKAIEKLSASG-GTNYEGAFDEAV 727 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 +Q ++ ++ + F+TDG+ + Sbjct: 728 KWF-----NKQPTSSNGQAFENVTYFLTDGDPTFSN 758 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 66/193 (34%), Gaps = 25/193 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + +D++ +LD S S+E + I +++ + V+ ++ Sbjct: 64 MKEECGVDLLFLLDTSGSLEQIYTQHINWTTQLTEALLT----------DKDDVRIAMIQ 113 Query: 226 FSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ +E F + + I ++ T + L A ++F + Sbjct: 114 YAETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRTGKALLAAKGELFSEE------KGA 167 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR-SHEFLRACAS 341 N KIIV TDG ++ + + + ++ +Y + + E R Sbjct: 168 RKNASKIIVLFTDGLSVDDP-----IKHAQQLREIEKIKIYVVYVGSDGFEQEMNRIAGG 222 Query: 342 PNSFYLVENPHSM 354 ++ + + Sbjct: 223 RSNVFGSNEFTRL 235 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 36/297 (12%), Positives = 89/297 (29%), Gaps = 31/297 (10%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 + + + I + +N + N + + T + + Sbjct: 350 VSADEQFVRAITIPQQSIFNKDRNKSDKEIENSVTLPTLNKKLRKEQPRTEVITSSTTHS 409 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV--NSQTDARLDMMIVLDVSR 182 P T P + S + T S SQ +++ ++D S Sbjct: 410 FTEGREISQSVDHPSNKTTLSPRHHISTARKLGKTGSAGNIEKSQGQCPREILFIVDSSG 469 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SNKIEEFFLLEW 238 S++ +D + + ++L E+++ + L+ F ++E F Sbjct: 470 SVQRIYDQQ-------KEYLLSLLNELRIGEGNQ---RVALIQFAGSSHQRVEWTFDTYK 519 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 + + + + F +T L+ + + T ++ ++DG Sbjct: 520 DIKDIAEALNQVRHFTGTTYIGRALENSIGVL----------ETRRQGIPTTVILVSDG- 568 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVENPHSM 354 +D S + YA+ + + + E+L P+ Y+ + Sbjct: 569 ---FSQDDASKPAEEIRRMSNVDFYAVSMSELNNFEYLTKLTDDPSKVYVRPRSEDL 622 >gi|1705853|sp|P54290|CA2D1_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|203955|gb|AAA41088.1| dihydropyridine-sesitive L-type calcium channel alpha-2 subunit [Rattus norvegicus] Length = 1091 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 246 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 293 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 294 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFTFAFEQLLNYNVSRAN 352 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 353 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 399 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 400 WMACENKGYYYEIPSIGAIR 419 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 31/80 (38%), Gaps = 7/80 (8%) Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLRACAS-PNSFYLVEN 350 N+ + + C+ K G +Y I + L+ CAS P ++Y Sbjct: 490 SNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPT 549 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 SM AFS +G + RI Sbjct: 550 AASMKQAFSDLGGQLSALRI 569 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 74/171 (43%), Gaps = 17/171 (9%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 KG ++I+ A+ + + M +E++ I+ ++ L + +D + + AA ++ NN Sbjct: 6 RKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSA---VNNV 62 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 G W +F + + NG ++ S V+ PQN +I+ + Sbjct: 63 GTCTGSCASDTTAIFWA-NFSSAHQANGLGPF-----QAVSTGPVITPQNAS-TITIQAN 115 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++PL F + I + S +++ +++ +VLD + S+E Sbjct: 116 VQLPLLFTKIL----GVSQIAL---SEHAQAVRSNMGMELALVLDNTDSLE 159 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 66/216 (30%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+M+ +D+S SM + + I+ + + E + + GL+ F++ Sbjct: 84 DLMLAVDLSGSMQIEDMVLNGQTVDRFTMIQDVVSDFIERRKGD------KLGLILFADH 137 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + ++ G T + + + Sbjct: 138 AYLQAPLTQDRRSVAQFLQEAQIGLVGKQTAIGEAIALGVKRFDMVDKSN---------- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----------------- 330 +I+V +TDG N S + A KRG +YAIG+ Sbjct: 188 -RILVLLTDGSNNSGSISPE--QAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSM 244 Query: 331 --RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGK 363 + + + ++ + + + I K Sbjct: 245 DLDEAQLISLAKTTGGLYFRARSSQDLQLIYQEIDK 280 >gi|7441761|pir||JC5576 inter-alpha-trypsin inhibitor heavy chain 3 - golden hamster Length = 889 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 78/192 (40%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D+S SM KI ++ +L+++K +N ++ +G+ T+ + + + Sbjct: 285 IVFVIDISGSMAG------RKIQQTRVALLKILDDMKQDDYLNFILFSTGVTTWKDSLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ +S G+ TN GL + D + QH E + II+ Sbjct: 339 ATP--ANLEEARTFVRSISDQGM-TNINDGLLRGIRMLTDARE--QHTVPERSTS--III 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ +A + +Y +G ++ FL A N Y Sbjct: 392 MLTDGDANTGESRPEKIQENVRKAIEGRFPLYNLGFGNNLNYNFLETMALENHGVARRIY 451 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 452 EDSDANLQLQGF 463 >gi|297279798|ref|XP_002801794.1| PREDICTED: integrin alpha-10-like isoform 4 [Macaca mulatta] gi|297279800|ref|XP_002801795.1| PREDICTED: integrin alpha-10-like isoform 5 [Macaca mulatta] Length = 1024 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 23 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL----FIDPEQ--IQVGLVQYGESPV 71 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F + Sbjct: 72 HEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQS------HGGRPEAA 125 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 126 RLLVVVTDGESHDGEELPAALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 182 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 183 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 213 >gi|296127202|ref|YP_003634454.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019018|gb|ADG72255.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 542 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/234 (13%), Positives = 85/234 (36%), Gaps = 33/234 (14%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 ++ + +++ ++I +S +++ + + L + N+ Sbjct: 309 VKKDAPISNDEEIKILSSANVIKKEEPKKNHYY-DGEAMETLINASKAFNTPNNYADDTD 367 Query: 158 ITSSVKVNSQTD--ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + N + +D++IVLD + SM + + + I ++ ++ D Sbjct: 368 ALTKSIKNVISKYEGSIDLVIVLDTTESMHPY-------LKAVKRDIRGVVRDL---FDN 417 Query: 216 NNVVQSGLVTFSNKIEEFF----LLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYN 268 N + G + + + + +F L ++ + R++ Y G + P ++ A Sbjct: 418 NKGSRIGFLLYRDVKDTYFTKRIELSDNINTINREVNYFYAAGGGDKAEPMYEAIQEALE 477 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + + KK+++ +TD +L AK++G I+ Sbjct: 478 KFDYINE------------KKLLIVVTDAPAKVIGRADLALNT-KTAKEKGIII 518 >gi|326670656|ref|XP_003199260.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1401 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 55/327 (16%), Positives = 110/327 (33%), Gaps = 33/327 (10%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 492 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 546 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 547 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 605 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 606 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 654 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 655 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 712 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++AIG R S E R P+ Sbjct: 713 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFAIGSRSSDSQELQRISQEPSY 765 Query: 345 FYLVENPHSM----YDAFSHIGKDIVT 367 V + + F++I K V Sbjct: 766 ALSVSDFTDLPSVQQQLFTNINKVFVA 792 >gi|254414094|ref|ZP_05027862.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] gi|196179230|gb|EDX74226.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] Length = 601 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 76/197 (38%), Gaps = 27/197 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L++ +V+D S SM KI A ++ ++++ + + + Sbjct: 39 KRPELNLGLVIDRSGSMAG------KKIAYARQAACYAVQQLLGSD------RVSVTIYD 86 Query: 228 NKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + +E L +++ R+I+ + ++ L + + + + Q+ N E Sbjct: 87 DIVETLIPSTLATEKNYITRQIERIHPRNMT-----ALHDGW--VEGGKQVSQYLNPEGL 139 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 N ++ ++DG + + ++ ++G +G+ + + L A+ Sbjct: 140 NR---VILLSDGLANKGQTNADAIASDVYGLAQQGVSTTTMGVGDDYNEDLLEVMANSGD 196 Query: 343 NSFYLVENPHSMYDAFS 359 ++Y ++ P + + F Sbjct: 197 GNYYYIDTPEQLPEIFQ 213 >gi|332018134|gb|EGI58743.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Acromyrmex echinatior] Length = 2218 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 36/272 (13%), Positives = 81/272 (29%), Gaps = 45/272 (16%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 D G +N +++ ++ + + + K + I+ + Sbjct: 28 DGGSINSVENGRNDNNVS------STFLNFD-QKTNENQRKHLDDADRMLSKMDILSRLL 80 Query: 160 SS-VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + +++M+ ++D S S+ + + +N + + + Sbjct: 81 KMHIDQLRNKTDQVEMVFLVDASGSIGA---------ENFRSELNFVTKLLSDFTVDALA 131 Query: 219 VQSGLVTFSNK---------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + LVTF + I + L ++ ++ G T + L A Sbjct: 132 ARIALVTFGGRGSVYRNIDQISRHGPNDHKCYLLNKQFSNITYSGGGTYTRGALLEALAI 191 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + K++ +TDG + + K GAIV+ GIR Sbjct: 192 L----------EKSREAANKVVFLITDGFSNG----GDPRPAAHLLKNTGAIVFTFGIRT 237 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 E + P + D+F+ Sbjct: 238 GNVEELHDIASHPEY-----THSYLLDSFAEF 264 >gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella aurantiaca DW4/3-1] Length = 749 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 73/204 (35%), Gaps = 28/204 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + I +V++ A + V+D S SM+ +++ +A ++ Sbjct: 253 KQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQG------SRMQIAKDALKYC 306 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG--VSTNSTP 261 + + N +V FS +E F ++Q+ + ++ + T Sbjct: 307 VTRLNPQDTFN------VVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGGTAIDE 360 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGA 320 L N ++ +++F+TDG+ + D+ ++ + + +K Sbjct: 361 ALVRGLQD-----------NDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKT 409 Query: 321 IVYAIGIRVIRSHEFLRACASPNS 344 ++ G+ + L +S + Sbjct: 410 RLFTFGVGEDLNARLLDRLSSDGA 433 >gi|47523446|ref|NP_999348.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Sus scrofa] gi|3341749|gb|AAC36289.1| voltage-dependent calcium channel alpha-2 delta subunit precursor [Sus scrofa] Length = 1091 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|253582503|ref|ZP_04859725.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] gi|251835648|gb|EES64187.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] Length = 376 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 69/196 (35%), Gaps = 23/196 (11%) Query: 140 KFCTFIPWYTNSRHIVMPITS-----SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 KF + + + IV S +V+ + ++++ VLD + SM + TK Sbjct: 3 KFLSLLMGFVFLGTIVFSTESKPENTAVEQTKAKEKDVEIVFVLDTTGSMGGLIQGAKTK 62 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----IEEFFLLEWGVSHLQRKIKYL 250 I + + + ++ V+ GLV + ++ + + L + + + Sbjct: 63 IWSIVNEV--------MQTHKDSKVKIGLVAYRDRGDVYVTKVTQLSENLDEIYSVLMGY 114 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ-QSL 309 G + K + + +Q T N +II + D D + Sbjct: 115 KAQGGGDDPEDVRKALHESLEVIQWS-----TPRENLSQIIFLVGDAPPHDDYNDSPDTS 169 Query: 310 YYCNEAKKRGAIVYAI 325 +AK RG I+ I Sbjct: 170 DTAKKAKSRGIIINTI 185 >gi|187956243|gb|AAI50691.1| Integrin alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIVH 381 >gi|27370456|ref|NP_766532.1| integrin alpha-E isoform 2 [Mus musculus] gi|26334103|dbj|BAC30769.1| unnamed protein product [Mus musculus] gi|56206399|emb|CAI24787.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 85/236 (36%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+E + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIEP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDAFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIVH 381 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 83/282 (29%), Gaps = 72/282 (25%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + + + +++VLD S SM SS T D A ++ E + Sbjct: 87 ITLEAVTGQDIKTIGGKSGSIVLVLDNSGSMGW--GSSPTPADYARDALKEFANEF--LK 142 Query: 214 DVNNVVQSGLVTFSN-------------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 + N+ + GLVT+S+ K + + + + + L T+ Sbjct: 143 NGNSGNKLGLVTYSSGSGVPIYDAYDEIKYVQGYSMTENSDIFGQVVDGLQTPSGETDVQ 202 Query: 261 PGLKYAYNQIF------------------------DMQGMRQHCNTEDANYKKIIVFMT- 295 G+K A + + G N + I MT Sbjct: 203 MGIKTARDILAADTSGNPQFILVFSDGATNRSARPTSAGELGGANISPCTFGDKIYDMTF 262 Query: 296 --DG------------------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 DG ++ AK +G +Y++ ++ Sbjct: 263 KFDGFDYGAQGSAVYNDGVYTTNGNVNAHTVAAVSEALLAKDQGIDIYSVFYHNPALNDL 322 Query: 336 --------LRACASPNSFYLVE--NPHSMYDAFSHIGKDIVT 367 ++ AS + + N + + F+ I K I Sbjct: 323 EYGAGVFVMKNSASSGQYTEISPGNAGAFAEIFTEIEKQIQE 364 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 59/180 (32%), Gaps = 15/180 (8%) Query: 12 FYNYKGGMTILTAI-FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + G + I+TA+ L + G+ I++ K L + +D + + A ++ G Sbjct: 11 LRDECGSVAIITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNGP 70 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N + + K I N NG + +DI + + S+ Sbjct: 71 ANVKSEAKEAAENNSIDN------------NGLILGDNDIKVGNWTEPNFFSKTPYNSVK 118 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + F + + + T+ + S + + + D + +M + Sbjct: 119 IMVNNHSINSFFASALNF--QQTVSAEATAVIGPLSGKRHLIPIAVTEDEADTMSEAGEG 176 >gi|296128023|ref|YP_003635273.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296019838|gb|ADG73074.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 500 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 40/276 (14%), Positives = 95/276 (34%), Gaps = 33/276 (11%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDI--VVVPQNEGYSIS--AISRYKIPLKFCTFIPWYT 149 F ++ + D + + S+D V + Y + + + Sbjct: 77 FALDVDTGAYTRFRDAVRQGFSVDPFGVRTEEFVNYFAQDYEPPAEGLGVSIDATALPFR 136 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +V SS ++ + A D+++V+D S SM+ K++ ++ ++ + Sbjct: 137 PDHRLVRVGISSAPASAVSRADADLVLVVDCSGSMDEAG-----KMETTKYALRTLVSSL 191 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + +V +S + + + + I L+ STN+ GL Y Sbjct: 192 RRTD------RVAMVCYSTEADVYLEPTPVAEREGVLAAIDRLAPR-DSTNAAAGLALGY 244 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAI 325 + M+ + +V ++DG + D + + ++AK G + ++ Sbjct: 245 DLAMSMRTEGRLTR---------VVLVSDGVANVGETDPEGILARISSQAK-AGISLISV 294 Query: 326 GIRVIRSHE-FLRACAS--PNSFYLVENPHSMYDAF 358 G+ + ++ L A V+ F Sbjct: 295 GVGITTYNDHLLEQLADQGDGWHVYVDGEAEAERVF 330 >gi|320106177|ref|YP_004181767.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924698|gb|ADV81773.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 370 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 78/223 (34%), Gaps = 37/223 (16%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++K S DA + + IV D+S SM ++K A K+++ + + Sbjct: 133 PQTIKTFSTDDAPVSIGIVFDLSGSM-------MSKFGRARKALSEFMRTSNPQDEFF-- 183 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V F+++ V + ++ L T + AY + ++ + Sbjct: 184 ----VVGFNDRPAVIVDYTSNVDDVDARMVMLRP----ERRTALIDAAYLGLNKLKDAKY 235 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-----RVIRSH 333 K ++ ++DG + ++ + L ++ +Y+IGI Sbjct: 236 ER--------KALLIISDGGDNRSRYVESELRRA--VRESDTQIYSIGIFDVYAATPEEK 285 Query: 334 E----FLRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + C + + V + + D + I ++ + + Sbjct: 286 SGPTLLMDICEMTGGRMFRVTDADELGDIAARISAELRNEYVL 328 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/230 (14%), Positives = 66/230 (28%), Gaps = 54/230 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S SM + D ++I A ++ +L+ V G+V + Sbjct: 98 VILVIDLSGSMRAQ-DVRPSRIRAAQQAARVLLD----AQPAG--VSVGVVAMAGTAALA 150 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM----------QGMRQHCNTE 283 + I+ L G T GL A + G Sbjct: 151 QAPSHSKDDVATAIEGLKPQG-GTALGNGLLIALTTLLPQTTNDAERLMNGGDVAQPGKP 209 Query: 284 DANYK----------------KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IV +DGE+ S Q+ G VY +G+ Sbjct: 210 GKAAPGELDNGEPVRPGSYASGAIVLFSDGESNSGPGAVQAAQLAAT---YGVRVYTVGV 266 Query: 328 RVIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSH 360 + L+ A + ++ +E+ ++ + Sbjct: 267 GTTEGVVLSADGWSARVRLDEKVLKQVADTTGAEYFRLEDTAALKKVYRA 316 >gi|332798630|ref|YP_004460129.1| pilin isopeptide linkage domain-containing protein [Tepidanaerobacter sp. Re1] gi|332696365|gb|AEE90822.1| pilin isopeptide linkage domain protein [Tepidanaerobacter sp. Re1] Length = 925 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 74/180 (41%), Gaps = 20/180 (11%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 N I + +T+ S+T+ D++ VLDVS SM++ + T+ ++ ++ + + Sbjct: 207 NGLNDYRIYLDVTTEA---SETETDRDIIFVLDVSNSMDTAL-GNTTRFNVLKNTVKSAV 262 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGL 263 + P + ++TF + + E + L + LS G TN + Sbjct: 263 NSLVQNPSN----RISIITFGTRAQIVTTRETDRTKLINCVNSLSLPGGTAGGTNYYESM 318 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 +A + + +++K+I F++DGE ++ ++ Y A+ Y Sbjct: 319 LHAAQIVNGSI---------NGSHEKVIFFVSDGEPTASLPAANAMGYAAYAEVATIYAY 369 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 ++ +D++ +LD S S+E + I +++ + V ++ Sbjct: 65 AKEGCGVDLLFLLDTSGSLEQIYTKHINWTTQLTEALLT----------DKDQVHIAMIQ 114 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ F L+ + I ++ T + L A ++F Sbjct: 115 YAETPTIEFSLDTYRNPRDITNHIMTINFHSGGTRTGKALLAAKVELFS------EKKGA 168 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIR 328 N KIIV TDG ++ + + + ++ +Y + + Sbjct: 169 RKNASKIIVLFTDGLSVDDP-----VKHAQQLREVEKVKIYVVYVG 209 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 80/244 (32%), Gaps = 39/244 (15%) Query: 118 IVVVPQNEGYSIS----AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT----- 168 + V+ + Y + + T P + + I S ++ + Sbjct: 407 VEVITSSTNYPFTEGRETSRPVDLTSSKTTSAPRHVTVQRISTARKSGKTGSAGSIEKSR 466 Query: 169 -DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 +++ ++D S S++ +D + + ++L E+++ + L+ F Sbjct: 467 GQCPREILFIVDSSGSVQRIYDQQ-------KEYLLSLLNELQIGEGDQ---RVALIQFA 516 Query: 227 ---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++E F + + + + F +T L+ + + + T Sbjct: 517 GSSHQRVEWTFDTYKDIRDIAEALSQVRHFTGTTYIGRALENSVDVL----------ETR 566 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SP 342 ++ ++DG +D S + YA+ + + + E+L P Sbjct: 567 RQGIPTTVILVSDG----FSQDDASKPAEEIRRMPNVDFYAVSMSQLNNFEYLTKLTDDP 622 Query: 343 NSFY 346 + Y Sbjct: 623 SKVY 626 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 54.0 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 31/80 (38%), Gaps = 7/80 (8%) Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLRACAS-PNSFYLVEN 350 N+ + + C+ K G +Y I + L+ CAS P ++Y Sbjct: 487 SNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNYYDAPT 546 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 SM AFS +G + RI Sbjct: 547 AASMKQAFSDLGGQLSALRI 566 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 74/171 (43%), Gaps = 17/171 (9%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 KG ++I+ A+ + + M +E++ I+ ++ L + +D + + AA ++ NN Sbjct: 3 RKGSVSIVMAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSA---VNNV 59 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 G W +F + + NG ++ S V+ PQN +I+ + Sbjct: 60 GTCTGSCASDTTAIFWA-NFSSAHQANGLGPF-----QAVSTGPVITPQNAS-TITIQAN 112 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++PL F + I + S +++ +++ +VLD + S+E Sbjct: 113 VQLPLLFTKIL----GVSQIAL---SEHAQAVRSNMGMELALVLDNTDSLE 156 >gi|305680612|ref|ZP_07403420.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305660143|gb|EFM49642.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 236 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 53/136 (38%), Gaps = 4/136 (2%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + ++DVS SM T +D A + + ++E + ++ ++ GL+ F ++ Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFCDEAR 68 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + I L G TN +N++ + + Sbjct: 69 VVIPLS-EIDAFSENIPQLVAKG-GTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPT--V 124 Query: 292 VFMTDGENLSTKEDQQ 307 F+TDGE++ E++ Sbjct: 125 FFITDGEDIGDVEERA 140 >gi|209521403|ref|ZP_03270114.1| von Willebrand factor type A [Burkholderia sp. H160] gi|209498150|gb|EDZ98294.1| von Willebrand factor type A [Burkholderia sp. H160] Length = 275 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/263 (11%), Positives = 78/263 (29%), Gaps = 32/263 (12%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 S V P+ +++ + P F + + + Sbjct: 15 TATACAASLQAVAVSQPEPGTVTVTVRAPGAAP-SADAFTLQLPGADTTPTRVPAQSVEA 73 Query: 166 SQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +++ +D S SM S ++ +L + ++ Sbjct: 74 ANELSPDLATAVLLCVDRSGSMHSAVP-------AIKAALKDVLARPRPD------LRIA 120 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQH 279 L++F + + + + + T L + + ++ Sbjct: 121 LMSFGSDTPAPTPFYSESAPVIEAVDAIRAETGRDGKTRLYDALNIGMSMLANVP----- 175 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K ++ +TDG++ ++ L + + + A G ++ L Sbjct: 176 -----LRGPKRLIVITDGKDEGSQTRFDVLSVLLQGRGQPMDAIAFGQSAQKTSSGLATL 230 Query: 340 A--SPNSFYLVENPHSMYDAFSH 360 A S +F L NP S+ +A + Sbjct: 231 ANKSSGAFVLATNPSSLVEALRN 253 >gi|33331711|gb|AAQ11020.1| mesocentin [Caenorhabditis briggsae] Length = 13133 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 83/212 (39%), Gaps = 30/212 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T + + S ++ ++ D++IVLD S E+F + A+ SI Sbjct: 12392 TKATTLAGSRRSCSTIDYES----DVIIVLDSS---ENFTPDEFDSMKDAVASIVD--TG 12442 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 L PDV+ + G V +S+K+ L H + KI+ L K + G+ A Sbjct: 12443 FDLAPDVS---KIGFVIYSDKVAVPVAL----GHYEDKIELLEKIVDAEKINDGVAIA-- 12493 Query: 269 QIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIG 326 ++ + RQ N KI++ +T+G+N + + + G ++A+ Sbjct: 12494 -LYGLNAARQQFQLHGRENATKIVLLITNGKNRGN-----AAAAAEDLRDMYGVQLFAVA 12547 Query: 327 IR-VIRSHEFLRAC---ASPNSFYLVENPHSM 354 + ++ A+P++ V + Sbjct: 12548 VGSNPDELATIKRLVGNANPDNAIEVAQSTEI 12579 >gi|291299992|ref|YP_003511270.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290569212|gb|ADD42177.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 316 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 77/247 (31%), Gaps = 35/247 (14%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAI 199 +P +V+ T + T + +++ LD+S SM++ D S + Sbjct: 55 RHLPAAVLLSALVVMSTGMARPAVDTQEPTERATVVLTLDLSLSMKAK-DVSPDRFSAMK 113 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 K+ ++E+ + GLVTF+ + ++ IK + +T Sbjct: 114 KASLEFVDELPKNYN------LGLVTFAKSASVAVSPTKDRNQVKSAIKSMK-LDRATAI 166 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G+ A I + + I+ ++DG S + + A Sbjct: 167 GEGIFSALQAIQSV-----PPDGASEPAPARILLLSDGYRTSGRLVEDGAKAAKAA---K 218 Query: 320 AIVYAIGIR--------------VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 V I V E L A + FY + + + +G Sbjct: 219 VPVSTIAFGTDTGTVEIEGETQEVPVDRETLSQTAETTGGKFYEAASVDDLKGVYEDMGS 278 Query: 364 DIVTKRI 370 I + + Sbjct: 279 SIGHRTV 285 >gi|115687249|ref|XP_792282.2| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 2422 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 72/190 (37%), Gaps = 39/190 (20%) Query: 168 TDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 ++ +++++ ++D S S+ E+FF+ +K + +L + + PD + ++T Sbjct: 196 SNGQVELVFLVDSSASVGIENFFNE--------LKFVKKLLADFTVAPDA---TRVAIIT 244 Query: 226 FSNK------IEEFFLLEWGVSH-----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 FS+K + + G H L + ++ G T + + A + Sbjct: 245 FSSKHRVELNVNQLDNSIAGKHHHKCALLNEDLPKITYVGGGTYTKGAFELAKKVLQGA- 303 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 AN + + +TDG + + K G V++ GIR E Sbjct: 304 ---------RANSTQAVFLLTDGLSNGPNP----VPVAVSLKDDGVEVFSFGIRDGYIPE 350 Query: 335 FLRACASPNS 344 L+ AS Sbjct: 351 LLQ-MASEKK 359 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 109/304 (35%), Gaps = 40/304 (13%) Query: 49 VLHSM---IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 L + + AA+QI N G R + + + ++ +F ++ + N Sbjct: 83 RLQAAPKYQHAAREKAASQIAN--PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 140 Query: 106 -------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT-----FIPWYTNS 151 + + ++ +V P + P+ F PW Sbjct: 141 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 200 Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + I + S+ +++ ++D S SM S ++ + S+ +++E++ Sbjct: 201 TLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELRE 254 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ V +G ++I + + + I L G STN GL+ AY Q Sbjct: 255 QDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAGLEMAYQQA- 308 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVI 330 ++ N I+ TDG+ +D +S+ + ++ G + +G+ Sbjct: 309 AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDS 360 Query: 331 RSHE 334 +E Sbjct: 361 NYNE 364 >gi|262117974|dbj|BAI48012.1| collagen type VI alpha 3 subunit [Mesocricetus auratus] Length = 2675 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 51/312 (16%), Positives = 109/312 (34%), Gaps = 29/312 (9%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 L + V +A + +G + GG + + + + G + Sbjct: 910 LEFVARNLFVKSAGSRIEDGVPQHLVLFLGGKSQDDVARHAQVISSSGIMSLGI--GDRN 967 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQT 168 I R+ I P+ + ++P + + + P Sbjct: 968 IDRADLQTITNDPK---MVFTVREFRELPNIEERVMLSFGPSGSTPRPPGVDIFTPPQPE 1024 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ +LD S + ++ ++ E V + + + ++ GLV +++ Sbjct: 1025 KKKADIVFLLDGS------INFQRESFGEVLRFVS---EIVDTVYEEGDSIRVGLVQYNS 1075 Query: 229 -KIEEFFLLEWGVSH-LQRKIKYLSKFGV-STNSTPGLKYA-YNQIFDMQGMRQHCNTED 284 +EFFL ++ + I + G N+ G+++ N G R D Sbjct: 1076 DPTDEFFLKDYSTKRQIIDAINKVVYKGGRHANTKVGIEHLRLNHFVPGAGSR-----LD 1130 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +I +T G++ ED Q K G V+AIG+R I S E + ++ + Sbjct: 1131 ERVPQIAFVITGGKS---VEDAQGASLALTQK--GVKVFAIGVRNIDSEEVGKIASNSAT 1185 Query: 345 FYLVENPHSMYD 356 + V + + + Sbjct: 1186 AFRVGSVQELSE 1197 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 84/243 (34%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P K T I T+ + + ++ + D+ Sbjct: 768 LVKISLSPEYVFSVSTFRELPRLEQKLLTPITTLTSQQIQQILASTRYPPSVTESDAADI 827 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S ++ + ++ ++ + + P V+ G+V FSN + F Sbjct: 828 VFLIDSSEAV------KADGLAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEF 878 Query: 235 LLEWGVSH--LQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 879 YLKTHKSQSLVLDAIRRLRFKGGSPLNTGKALEFVARNLFVKS---AGSRIEDGVPQHLV 935 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 936 LFL------GGKSQDDVARHAQVISSSGIMSLGIGDRNIDRADLQTITNDPKMVFTVREF 989 Query: 352 HSM 354 + Sbjct: 990 REL 992 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 73/213 (34%), Gaps = 34/213 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVV 219 + + +D +D+ +LD S + F + + K I ++ ++ L PD + Sbjct: 1999 RRAAGSDVDIDLAFILDSSEATTLF------QFNEMKKYIGYVVRQLDLSPDPKASQHFA 2052 Query: 220 QSGLVT---FSNKIEEFFLL--------EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYA 266 + +V + + ++G L + +++ + ++Y Sbjct: 2053 RVAVVQQATYESVDNASVPPVKVEFSLTDYGAKEKLLDFLSRRMTQLQGTMALGNAIEYT 2112 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF+ + KI+V M G+ + ++ +AK +G +G Sbjct: 2113 IENIFES--------APNPRDLKIVVLMLTGDMQRQQLEEAQRAIL-QAKCKGYFFVVLG 2163 Query: 327 IRVI-RSHEFLRACASPNSFYL--VENPHSMYD 356 I E + PN + V+ + + Sbjct: 2164 IGRKVNVKEVYSFASEPNDVFFKLVDKSTELNE 2196 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 66/183 (36%), Gaps = 28/183 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 ++ + + + D++ ++D S + S F + + + ++E + + PD Sbjct: 401 IVSLLKAAQSAGPVSGEKDVVFLIDGSEGVRSGFP-------LLKEFVQRVVESLDVGPD 453 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKF----GVSTNSTPGLKYAYN 268 V+ LV +S++ F L SH+ ++ I + + G N+ L + Sbjct: 454 ---RVRVALVQYSDRTRPEFYLN---SHMDQQGVINAIRRLTLLGGPIPNTGAALDFVLR 507 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I + ++++ +T + + K+ GA+ IGI Sbjct: 508 NILTSSTGSRIAEG----VPQLLIVLT-----AERSGDDVRGPSVVLKQGGAVPIGIGIG 558 Query: 329 VIR 331 Sbjct: 559 NAD 561 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 109/304 (35%), Gaps = 40/304 (13%) Query: 49 VLHSM---IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 L + + AA+QI N G R + + + ++ +F ++ + N Sbjct: 83 RLQAAPKYQHAAREKAASQIAN--PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 140 Query: 106 -------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT-----FIPWYTNS 151 + + ++ +V P + P+ F PW Sbjct: 141 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 200 Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + I + S+ +++ ++D S SM S ++ + S+ +++E++ Sbjct: 201 TLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELRE 254 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ V +G ++I + + + I L G STN GL+ AY Q Sbjct: 255 QDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAGLEMAYQQA- 308 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVI 330 ++ N I+ TDG+ +D +S+ + ++ G + +G+ Sbjct: 309 AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDS 360 Query: 331 RSHE 334 +E Sbjct: 361 NYNE 364 >gi|156400924|ref|XP_001639042.1| predicted protein [Nematostella vectensis] gi|156226167|gb|EDO46979.1| predicted protein [Nematostella vectensis] Length = 1450 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 47/337 (13%), Positives = 104/337 (30%), Gaps = 68/337 (20%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL-------RD 100 + + +D V T +N N K R++ W + EL + Sbjct: 46 QDMQAYLDSLDVDVVT--INSTTINELKTKLSEKFQVRVQAAWRLKTAVELSYFLSAPKH 103 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR-------- 152 N +D + + + + S P + + Sbjct: 104 NPECCSVDSGSLTYDTRFRQQVRTTQFCVRKASGATNPTQLTNEVLKAMEDNMKESPYLK 163 Query: 153 ----------HIVMPITSSVKVNSQTD------------ARLDMMIVLDVSRSM-ESFFD 189 + P+ S + S + ++++V+D S SM E Sbjct: 164 WQYFGSVEGLTTIYPMQSQAECGSYDNRARPWYVDAAAPKPKNVVLVVDSSGSMAEKHTA 223 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EW--------- 238 + T + MAI + A+L+ + + G+V+ + W Sbjct: 224 NGKTWLQMAIDAAKAVLDTLNPRD------KVGVVSLATDANTPGSNDTTWCYANTLAEA 277 Query: 239 ---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 +++++ + + G T P L A+ + + + + ++I+F+T Sbjct: 278 NSVNINNMKIFLDGMRSAGF-TMYIPALTKAFALLLNSKPESP------DDCDQVIIFLT 330 Query: 296 DGENLSTKEDQ-QSLYYCNEAKKRGAIVYAIGIRVIR 331 D + KE +++ N+ ++ A GI Sbjct: 331 DAKPTELKESVMRTIVESNKLLDNRVVILAYGIGAED 367 >gi|29830947|ref|NP_825581.1| hypothetical protein SAV_4404 [Streptomyces avermitilis MA-4680] gi|29608060|dbj|BAC72116.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 582 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 42/205 (20%), Positives = 79/205 (38%), Gaps = 27/205 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++++ N T R + +VLD S SM ++ D S + ++ A L V Sbjct: 386 TTLEKNDLTGTRAKVYLVLDRSASMRPYYKDGSAQALGEQTLALAAHL------DPEATV 439 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +V FS +++ L ++ + KI L + G ++ A + Sbjct: 440 H---VVFFSTELDGTGELT--LTEHENKIDELHTALGRMGRTS--YHAAVEAVLA----- 487 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF- 335 H + D ++VF TDG + Q+L + AK + ++ + + F Sbjct: 488 HHEKSADPKAPALVVFQTDGAPDAKTPATQALT--DAAKNHPTVFFSFVAFGEHDNKAFD 545 Query: 336 -LRACASPNS-FYLVE-NPHSMYDA 357 LR + N+ F+ P + DA Sbjct: 546 YLRKLKTGNTAFFHAGPTPRELTDA 570 >gi|42407700|dbj|BAD08848.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408122|dbj|BAD09262.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|125602049|gb|EAZ41374.1| hypothetical protein OsJ_25891 [Oryza sativa Japonica Group] Length = 704 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 84/226 (37%), Gaps = 41/226 (18%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 H P + + + A +D++ VLDVS SME + K+ + +++ +++++ Sbjct: 212 HAKAPSIAVAEATAAARAPVDLVTVLDVSGSMEGY------KLTLLKRAMGFVIDKLGPG 265 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +V+FS + L G + + ++ L G TN GL A Sbjct: 266 D------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAMESL-AAGGGTNILKGLVEA-A 317 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--------KEDQQSLYYCNEAKKRG- 319 ++FD + R + ++ ++DG++ S+ K+ G Sbjct: 318 KVFDGRRYRNAVAS--------VILLSDGQDTYNVNGGWGASNSKNYSVLVPPSFKRSGD 369 Query: 320 --AIVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSH 360 V+ G + +F +EN + DAF+ Sbjct: 370 RRLSVHTFGFGT-DHDAVAMNAIAEETGGTFSFIENQAVVQDAFAQ 414 >gi|27376088|ref|NP_767617.1| hypothetical protein bll0977 [Bradyrhizobium japonicum USDA 110] gi|27349227|dbj|BAC46242.1| bll0977 [Bradyrhizobium japonicum USDA 110] Length = 754 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 50/288 (17%), Positives = 98/288 (34%), Gaps = 47/288 (16%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 I R+K + + + + D+ R +L VP + + ++ + Sbjct: 264 ITVRLKAGFALGEVKSHHHSVKIESPDNATRIVTLADGAVPADRDFELTWKPAAQKAPSV 323 Query: 142 CTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F ++ +++ +T S + +Q +++ V+D S SM T I A Sbjct: 324 GLFREHVGDADYLLAFVTPPSAEQATQKPLPREVVFVIDNSGSMGG------TSIVQAKA 377 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGV 255 S+ L ++ + ++ F + ++ F V + L G Sbjct: 378 SLLYALGRLQPAD------RFNVIRFDDTMDVLFPASVPADAAHVGEATSFVSALQARG- 430 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T P ++ A D + +VF+TDG + +QQ Sbjct: 431 GTEMVPAMRAALTD-----------KIGDTGMVRQVVFLTDG---AIGNEQQLFETITAM 476 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM-YDAFSHIG 362 + R V+ +GI S + YL+ + AF+HIG Sbjct: 477 RGRS-RVFMVGIG------------SAPNTYLMTRASELGRGAFTHIG 511 >gi|120554526|ref|YP_958877.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324375|gb|ABM18690.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 715 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 80/229 (34%), Gaps = 37/229 (16%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 I W ++ + + + D+ IV+D+S SM+ ++ + + + + Sbjct: 13 LLLISWQAGAQDPALSLPGNA----------DVRIVVDISGSMKETDPQNLRRPAVRLLA 62 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + V LV + + W + ++R + ++ + TN Sbjct: 63 RTLPEGASAGLWTFGQYVNM-LVPYGVVDQ-----SWRDTAIERS-EQINSVALHTNLGL 115 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY--------YCN 313 ++ A N + ++ ++DG+ D S Sbjct: 116 AMEKAANDWLSGGTLENT----------HLIVLSDGKVDVPGGDDASQAEEKRIVDSLLP 165 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 K +GA ++ +G+ FLR A + SF L ++ ++ AF++ Sbjct: 166 ALKDKGATIHTVGLSEKADIRFLRNLARETGGSFQLAQSAEALNLAFAN 214 >gi|302532683|ref|ZP_07285025.1| predicted protein [Streptomyces sp. C] gi|302441578|gb|EFL13394.1| predicted protein [Streptomyces sp. C] Length = 248 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 61/159 (38%), Gaps = 25/159 (15%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P T++ + + +VLDVS SM + +++ A ++ N +L+ V +V Sbjct: 105 LPSTAAAADGPAKEPP-KVELVLDVSGSMRANDIDGQSRMAAAKQAFNEVLDAVP--DEV 161 Query: 216 NNVVQSGLVTF---------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +++ T+ + + + + + + + L+ G T P L+ A Sbjct: 162 RLGIRTLGATYPGDDRALGCKDTKQLYPVGTVNRTEAKTAVATLAPTGW-TPIGPALQAA 220 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + R+ IV +TDGE+ D Sbjct: 221 AQDLEGGNATRR------------IVLITDGEDTCAPLD 247 >gi|281346139|gb|EFB21723.1| hypothetical protein PANDA_018300 [Ailuropoda melanoleuca] Length = 1151 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 86/215 (40%), Gaps = 26/215 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + + A ++ I+LD S S++ A I+ M+ + Sbjct: 166 TQEDEEEEAAGTEIAIILDGSGSIDP------PDFQRAKDFISNMMRN---FYEKCFQCS 216 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV + I+ F L + +++ +++ G T + +++ + IF Sbjct: 217 FALVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPS---- 272 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHE 334 + N K++V +TDG+ + ++ + K +G +AIG+ +++ Sbjct: 273 --HGSRKNASKVMVVLTDGDIFEDPLNLTTVI--SSPKMQGVERFAIGVGKAFENNKTYN 328 Query: 335 FLRACAS--PNSF-YLVENPHSMYDAFSHIGKDIV 366 L+ AS + + + V N ++ S + ++I+ Sbjct: 329 ELKLIASDPDDRYAFKVTNYTALDGLLSKLQQNII 363 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 66/212 (31%), Gaps = 25/212 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + V SQ + +V D S SM S + K + ++ + Sbjct: 197 LRVGVQGKVVSRSQRKPAHLVFLV-DTSGSMHSQDKLPLAK-EAMKVAVRNL-------- 246 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + N+ V S + V + I L + G T G++ AY Sbjct: 247 NENDTVAIVTYAGSTQDVLPPTPATEVQRIHTAID-LLQSGGGTAMGSGMELAY------ 299 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-NLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R N ++ +TDG+ N+ + L + G + IG + Sbjct: 300 ---RHAVKKASGNAISRVIVLTDGDANIGPNLSAESMLSGIEKYVAEGVTLSTIGFGMGN 356 Query: 332 -SHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + + A + + V++ F Sbjct: 357 YRDDLMERLADKGNGNCFYVDSYQEAKKVFEA 388 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 333 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 380 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 381 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 439 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 440 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 490 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 491 CANKGYYFEIPSIGAIR 507 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 220 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 267 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 268 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 326 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 327 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 377 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 378 CANKGYYFEIPSIGAIR 394 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 263 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 264 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 322 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 323 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 373 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 374 CANKGYYFEIPSIGAIR 390 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 263 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 264 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 322 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 323 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 373 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 374 CANKGYYFEIPSIGAIR 390 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 552 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 599 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 600 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 658 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 659 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 709 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 710 CANKGYYFEIPSIGAIR 726 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 255 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 302 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 303 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 361 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 362 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 412 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 413 CANKGYYFEIPSIGAIR 429 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 291 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 338 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 339 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 397 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 398 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 448 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 449 CANKGYYFEIPSIGAIR 465 >gi|167522505|ref|XP_001745590.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775939|gb|EDQ89561.1| predicted protein [Monosiga brevicollis MX1] Length = 1927 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 75/231 (32%), Gaps = 35/231 (15%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 T M TS+ + D++IV+D S +M SI A + A+L Sbjct: 1110 GTTTPFDTTMASTSTTLILPCAVTA-DIVIVIDASGAM------SINDFTAAKTTALAIL 1162 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTP 261 + L + GL+ FS + + L E L ++ G +TN Sbjct: 1163 RRLALAQPD---ISVGLIFFSQQAQVALPLLDINDETEFQLLLLVLQAGQYQGQATNLGS 1219 Query: 262 GLKYAYNQI-FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L A + + G RQ + +DG D Q + G Sbjct: 1220 ALSSAADLLETSNNGARQF------------ILFSDG-----SSDDQGTLVAQNIRATGI 1262 Query: 321 IVYAIG-IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + + L + ++ +L + A + + +IV + Sbjct: 1263 QILTVANVMNANVYNLLLIAGAADNIFLTVQSDA-TQADAAVVSNIVNQLC 1312 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 50/144 (34%), Gaps = 20/144 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V + D++ VLD S S+ S + + VK++ +N +Q Sbjct: 22 VSSQTCLPTSFDLLFVLDGSGSITSEQFDDVLRFAS---------RTVKVLNISDNDIQV 72 Query: 222 GLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + F++ F +++ I+ + T++ L+ N + Sbjct: 73 AALEFADYGNLMFDFAEYASTEEQAIRQHIRSIPYMYGPTDTGRALEDVVNNVLV----- 127 Query: 278 QHCNTEDANYKKIIVFMTDGENLS 301 + + ++V ++DG+ Sbjct: 128 --PSAGFRDGSCVVVVVSDGQTQD 149 Score = 36.3 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 17/130 (13%) Query: 173 DMMIVLDVSRS--MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ +V+D S + + FF A+KS+ L +++ + + FS+ Sbjct: 1398 DLYVVVDASGTITVSDFF---------AMKSLLTSLAKIRFADTACLDQRISITVFSDDA 1448 Query: 231 EEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L G S ++ I L+ +N L ++ I Q Sbjct: 1449 ATVLSLTNGTSLVRVLDGINGLNYLARRSNLGAALDLVFDAIVASDSYVQGA----VTRP 1504 Query: 289 KIIVFMTDGE 298 ++ + DG Sbjct: 1505 TAVLVLGDGR 1514 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 263 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 264 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 322 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 323 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 373 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 374 CANKGYYFEIPSIGAIR 390 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora acidiphila DSM 44928] Length = 1033 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 82/215 (38%), Gaps = 35/215 (16%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + D+ ++LD S SM + K+ A ++ +++ + + ++T Sbjct: 311 TGATRPRDVALILDRSGSMGGW------KMTAARRAAARIVDTLTAED------RFAVLT 358 Query: 226 FSNKIEEFFLLEWGVSHLQ-----RKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQ 278 F +++E L G+S R +++L+ T P L+ A + D R Sbjct: 359 FDDQMETPDGLPTGLSEATDRHRFRAVQHLATVDARGGTEMEPPLRRAATLLSDDNPDR- 417 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 ++++ +TDG+ ++ L K V+ +GI + FL+ Sbjct: 418 ---------DRVLILITDGQ----VGNEDRLLTTLSPKLTHIRVHTVGIDTAVNAAFLQR 464 Query: 339 CAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++ LVE+ + DA I I T + Sbjct: 465 LSTLGGGHCELVESEDRLDDAMDAIHHRIATPLVT 499 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 74/193 (38%), Gaps = 25/193 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++I+ D S + T + N ++ V+ +P + V+ G+V +S++ Sbjct: 31 PVLDIIILFDTSG-------GNDTVFEQQK---NWTIKIVRDLPVHEDAVRVGIVQYSDE 80 Query: 230 IEEFFLLEWGV--SHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + F L + + ++ L G T + L A ++IFD G Sbjct: 81 AKTEFNLSRYSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFDYDG------GARLK 134 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSF 345 ++I+ TDG ++ +++G +Y I + I E L ++ Sbjct: 135 ATRLIIVFTDGLSMDKPT-----LAAKALRRKGVKIYTISVNSIGFVPEMLGIVGDADNV 189 Query: 346 YLVENPHSMYDAF 358 + + + + + Sbjct: 190 FGPTDENRIEERL 202 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 70/198 (35%), Gaps = 29/198 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ +D++ V+D S S+ +D+ + ++++V+ + + GL+ Sbjct: 678 SASVQCPMDILFVVDSSGSIARTYDTQ-------KDYLTQLIKKVEP----SRSHRVGLI 726 Query: 225 TFSNKIEEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 F+ + + + I+ + +T L+ + + R+H Sbjct: 727 QFAGPHIQKMEWSFDTHSKNSQLLSAIRSVRHLTGTTYIGAALEL---SLILLDSRRKHT 783 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 T ++ ++DG + Q L + +YAI + + + ++L Sbjct: 784 ET-------TVILISDGFSQDDSTQQAKLLR----QLPNVKMYAISLNKLTNTKYLTDIV 832 Query: 341 SPNSFYLVENPHSMYDAF 358 + + ++ F Sbjct: 833 GDRKNLFINDESQWFEEF 850 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 53/176 (30%), Gaps = 31/176 (17%) Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GMRQHCNT 282 +S F L + + L+ + L+ +T G+ + I Sbjct: 466 GYSCPAAAFKLTDISRTDLETYVDGLTPR-SNTYHDFGMIWGARFISPNGIFAASNATAP 524 Query: 283 EDANYKKIIVFMTDG---------------------ENLSTKEDQQSLYY------CNEA 315 + IVFMTDG N + + + C A Sbjct: 525 NGDAISRHIVFMTDGLLVPNQEIYSMYGIEWWDRRITNDGSGGQARDRHATRFQVACRAA 584 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 ++ V+ I + + CA+P + + ++ F I ++I R+ Sbjct: 585 RQENISVWVIAFGTTLTQNLI-DCATPGRAFQANDTAALETRFEQIAQEIAALRLT 639 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 76/237 (32%), Gaps = 39/237 (16%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + +++A L + ++G ++ S + +T L + + Sbjct: 1 MAKDVTANTIVISAASLVPLMAMVGGGVDASRYYMAETRLQAA-------------CDAG 47 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 ++ D R F +E +G ++D+ RS + Q+ + Sbjct: 48 ALAARRSMADDNFSRADRITGEKFFDENYPDG-TFGLEDLERSFTAT-----QSGQVNGE 101 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A T I + +T VN D++ V+DV+ SM D+ Sbjct: 102 ASGT------LPTAIMAPFGYDEFSLSVTCEADVNISNT---DVLFVVDVTGSMNCAPDN 152 Query: 191 -----------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 KI ++ + V+ + V+ G+V +++ + L Sbjct: 153 PGGGSCGNTEDPGAKIKGLRSAVLKFYDTVETSTSPSAQVRYGMVPYASNVNVGAAL 209 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 3/75 (4%) Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACASPNSFYLVENPHSMYD 356 +++D + C K G +V++IG + + + L+ CAS + Y ++ D Sbjct: 475 DGSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGININD 534 Query: 357 AFSHIGKDIVTKRIW 371 AFS I ++V R+ Sbjct: 535 AFSAIASNVVNLRLT 549 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 68/198 (34%), Gaps = 37/198 (18%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 + +F I+ G+ +++ + + S +D +++ +A G D Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASAGA------------PYGSD 48 Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 I++ + + N D + +IVV + + +A L Sbjct: 49 HRAIIEDYFRV-----------ANMTDYLAAEKEGEIVVTVNSASVTANADMTMDTYLMK 97 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + I T+ + +L++++VLDVS SM S +K+ K+ Sbjct: 98 LSGIKEL---------RTTGGSTAVRKVPKLEVVLVLDVSGSMGSN-----SKLVNLKKA 143 Query: 202 INAMLEEVKLIPDVNNVV 219 + + + N V Sbjct: 144 AKEFVTSLLNGSEPGNTV 161 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 61/174 (35%), Gaps = 23/174 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 K+ + +D++ LDVS+SM E + + K I I L Sbjct: 79 GTKLETVKREGVDIVFALDVSKSMLAEDIAPNRLEKGKRLISEIINHLGS---------- 128 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G++ ++ + + S + ++ ++ +S + + A Sbjct: 129 DRIGIIAYAAQAYPQLPITTDYSAAKMFLQSMNTDMLS-SQGTAINEAIELASTYYDDET 187 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 N +++ ++DGE + + +A + G ++ IG+ + Sbjct: 188 QTN-------RVLFIISDGE---DHAEGTTEDAVEKATEEGIRIFTIGVGKEKG 231 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 64/189 (33%), Gaps = 34/189 (17%) Query: 172 LDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++ +D S+SM E + S +++ + ++E + G+V F+ Sbjct: 105 LDIVLAVDYSKSMLAEDVYPSRSERLEA---ELTRFIDE-----SGRRGDRVGVVIFAGA 156 Query: 230 IEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN- 286 F + +S ++ T L + + + ++ + Sbjct: 157 ARSFPVTSDMGVLSLFLAHADPRTENPGGTAIGKALDKSIDLLVAVRRDDSGARADQVEG 216 Query: 287 -----------------YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++IV +TDGE+ + + A++ G +Y +GI Sbjct: 217 EGEDESGAPEAAPALSEADQVIVLLTDGEDTVGRPE----EVAARAEQLGIRIYTVGIGS 272 Query: 330 IRSHEFLRA 338 +R Sbjct: 273 DSGEPIMRY 281 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 88/226 (38%), Gaps = 40/226 (17%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ + Q + ++++V+D S SM I A ++ L +K Sbjct: 357 NYSLVMVLPPKVEASGQLNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRGLK 410 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + + ++ ++ + L G + S L+ Sbjct: 411 AQDSFN------IIEFNSDVSLLSPVPLPATAENLAIARQFVNRLQADGGTEMS-LALEA 463 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A + R + N + ++FMTDG ++++L+ + ++ + Sbjct: 464 A------LPKQRPSRAASENNVLQQVIFMTDGS----VGNEEALFELIRHQIGDNRLFTV 513 Query: 326 GIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK-DIVTKRI 370 GI ++PNS ++ F++IG D V ++I Sbjct: 514 GIG-----------SAPNSHFMQRAAELGRGTFTYIGDVDEVEQKI 548 >gi|297292637|ref|XP_001090691.2| PREDICTED: anthrax toxin receptor 2 [Macaca mulatta] Length = 488 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 73/232 (31%), Gaps = 34/232 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLALSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK +I + + +TDG + L ++ ++ GA Sbjct: 124 LKLVIEKIIYSGDEKXXXXX---------IALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 VY +G+ + R S + V+ + A I I+ + Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 220 >gi|225022482|ref|ZP_03711674.1| hypothetical protein CORMATOL_02522 [Corynebacterium matruchotii ATCC 33806] gi|224944721|gb|EEG25930.1| hypothetical protein CORMATOL_02522 [Corynebacterium matruchotii ATCC 33806] Length = 236 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 53/136 (38%), Gaps = 4/136 (2%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + ++DVS SM T +D A + + ++E + ++ ++ GL+ F ++ Sbjct: 9 LPVFFLIDVSYSMLEEKPGGGTLLDAANQLVPGIVEACEKYSVLDQRLRLGLIEFYDEAR 68 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + I L G TN +N++ + + Sbjct: 69 VVIPLS-EIDAFSENIPQLVAKG-GTNFAAAFWAVFNEMGVAVESLRKPEIGIHRPT--V 124 Query: 292 VFMTDGENLSTKEDQQ 307 F+TDGE++ E++ Sbjct: 125 FFITDGEDIGDVEERA 140 >gi|330469087|ref|YP_004406830.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812058|gb|AEB46230.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 316 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 65/208 (31%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM++ D T+++ A ++ + E+ + GLV+F+ Sbjct: 89 IMLAIDVSLSMQAD-DVPPTRLEGAQEAAKQFVRELPETYN------VGLVSFAKSANVL 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L +T + + I + IV Sbjct: 142 VPPTKDRPAVTNAIDGLV-LAEATATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVL 195 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHEF---------LRAC 339 ++DG + + +++ V I V + L A Sbjct: 196 LSDGFRTAGRSVEEAAAAAQA---ANVPVSTIAFGTDAGHVAIGGQLQRVPVDRMALAAL 252 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + FY + + + +G I Sbjct: 253 AETTEGYFYEAASVSELKQVYQDMGSSI 280 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 63/163 (38%), Gaps = 26/163 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+MI+LDVS SM S D S +++ A ++I+ +++ + + G++ F+ + Sbjct: 91 DLMILLDVSNSMLSQ-DLSPNRLENAKRAISQLIDNLHDD-------RIGIIVFAGQAYV 142 Query: 233 FFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + S + + ++ T + GM K Sbjct: 143 QLPITTDYSAAKLFLNTINTNMVPTQGTAIGAAIDLGMQSFDFKNGMS-----------K 191 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ +TDGEN + ++ N A+ + V IG+ Sbjct: 192 AMIVITDGENH----EDDAVSAANHARDKDVTVNVIGVGSEEG 230 >gi|116625274|ref|YP_827430.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228436|gb|ABJ87145.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 320 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 71/215 (33%), Gaps = 40/215 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D + + +V+D S SM + + K++ A ++ + VN F Sbjct: 84 SEDVPVSLGLVIDNSGSMR----NKLQKVEAAALALVKASNRDDEVFIVN---------F 130 Query: 227 SNKIEEFFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ + L++ +K + G + ++ A D H Sbjct: 131 NDTAYLDNPKDKDFTNDIGELEQALKRIDARGGT-----AMRDAIQMSID------HLKK 179 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI-------RVIRSHEF 335 + KK++V +TDG A + ++Y +G+ R+ Sbjct: 180 GHRD-KKVLVVITDG--NDNSSVINMERIMKNAHQSDVLIYGVGLLTEEEHREAARAKRA 236 Query: 336 LRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A+ + ++ + S + DI ++ Sbjct: 237 LNDLAEATGGKTFFPKDLEEVDAIASQVAHDIRSQ 271 >gi|148655604|ref|YP_001275809.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567714|gb|ABQ89859.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 425 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 66/198 (33%), Gaps = 36/198 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L++ +VLD S SM ++ ++ +++++ LV F+++ Sbjct: 44 PLNLCLVLDRSSSMRG------ERLMQVKEAAARIVDQLGPDD------YFSLVVFNDRA 91 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + S L+ I + G + + L+ R Sbjct: 92 DVVIPAQRAIKKSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISR-------- 143 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS- 344 ++ +TDG ++ + + + RG + A+GI + + L + + Sbjct: 144 -----LILLTDGRTYG--DESRCVEIARRGQSRGIGLTALGIGTEWNEDLLETMTASENS 196 Query: 345 --FYLVENPHSMYDAFSH 360 Y+ + F+ Sbjct: 197 RAQYIA-TAQDVVKVFAD 213 >gi|186683831|ref|YP_001867027.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186466283|gb|ACC82084.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 615 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 79/195 (40%), Gaps = 31/195 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L++ +V+D S SM +K+ +++++++ ++ +V + ++++ Sbjct: 39 LNLSLVIDRSGSMAGAALHHA------LKAAESVVDQLEPDDILS------VVVYDDEVD 86 Query: 232 EFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + L+ I+ + G++ S LK +H + +K Sbjct: 87 SVVPPQAVTNKATLKDSIRKVRAGGITNLSGGWLK-----------GCEHV-KTRLDPQK 134 Query: 290 I--IVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASPNS 344 I ++ +TDG N+ ++ + + + G +G + + L A A+ + Sbjct: 135 INRVLLLTDGHANMGIQDPKVLTATSGQKAEEGITTTTLGFAQGFNEDLLIGMARAARGN 194 Query: 345 FYLVENPHSMYDAFS 359 FY +++ + FS Sbjct: 195 FYFIQSIDEATEVFS 209 >gi|31789427|gb|AAP58542.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 329 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/181 (21%), Positives = 69/181 (38%), Gaps = 30/181 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +V P + Q D++I+LD S SM + D S ++ A I L+ Sbjct: 72 LVQPQAALATSTPQYQRE-DLVIMLDRSASMRAH-DVSPSRFARATAEIRDFLQH----- 124 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR------KIKYLSKFGVSTNSTPGLKYAY 267 N+ + GLV F+ +L + L I+ + + TN L+ A Sbjct: 125 KPENIDRVGLVGFAG---TSLILSYLTRDLDTVAFYLDWIESDPRTLLGTNIGAALRNAL 181 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + +D +KI V ++DGE+ D+ + + +G + +IGI Sbjct: 182 DV----------AKKDDRRARKIFVLLSDGEDYG---DEVARQLAVY-RGQGYRINSIGI 227 Query: 328 R 328 Sbjct: 228 G 228 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 109/304 (35%), Gaps = 40/304 (13%) Query: 49 VLHSM---IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 L + + AA+QI N G R + + + ++ +F ++ + N Sbjct: 83 RLQAAPKYQHAAREKAASQIAN--PGTARYQQFDDNPVKQVAQNPLATFSLDVDTGSYAN 140 Query: 106 -------DIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCT-----FIPWYTNS 151 + + ++ +V P + P+ F PW Sbjct: 141 VRRFLNQGLLPPPDAVRVEEIVNYFPSDWDIKDKQSIPASKPIPFAMRYELAPAPWNEQR 200 Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + I + S+ +++ ++D S SM S ++ + S+ +++E++ Sbjct: 201 TLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSSLKLLVKELRE 254 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ V +G ++I + + + I L G STN GL+ AY Q Sbjct: 255 QDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGAGLEMAYQQA- 308 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVI 330 ++ N I+ TDG+ +D +S+ + ++ G + +G+ Sbjct: 309 AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGVTLSTLGVGDS 360 Query: 331 RSHE 334 +E Sbjct: 361 NYNE 364 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 57/342 (16%), Positives = 114/342 (33%), Gaps = 51/342 (14%) Query: 48 TVLHSMIDRSLVHAATQIMNEGN----------GNNRKKLKGGDILCRIKNTWNMSFRNE 97 LH++ID S + + ++ + ++ + + + + + Sbjct: 405 EHLHNVIDNSAKISDSSLLKDAIDEVVTQYNQISDDSYNAELNAAVNDMVDKQSSQVVPV 464 Query: 98 LRD--NGFVNDIDDIV---RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 + NG N ++ I + + SI A + Sbjct: 465 SEETINGSFNSYVATTLKYDRINIHISRIDTSAYPSIQAYININGTKDSKEELADQFTKE 524 Query: 153 HIVM-----PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + IT + + + IV+D S SME ++ + E Sbjct: 525 DFTVIDTQYEITDFTLNSGAESEAVSIGIVMDKSGSMEGAAIANAKQAAT---------E 575 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 V+ I +V++ N+ L L+ I +S G TN + GL A Sbjct: 576 AVEHITSEK----MMIVSYDNEAYLEQSLTSRSGTLKNSIAAIS-DGGGTNISAGLNLAL 630 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + +G R ++ M+DG++ ++ED Q+ + A K G VY +G Sbjct: 631 DNLEAEKGSRA------------VILMSDGQDGGSEEDMQAAT--DRAAKLGISVYTVGF 676 Query: 328 RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 +++A A + F + D + ++ K IV Sbjct: 677 GECD-DAYMQAIAEVTGGKFVKASASTELSDIYLYLQKYIVN 717 >gi|118348690|ref|XP_001007820.1| U-box domain containing protein [Tetrahymena thermophila] gi|89289587|gb|EAR87575.1| U-box domain containing protein [Tetrahymena thermophila SB210] Length = 790 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 62/197 (31%), Gaps = 24/197 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK----------IDMAIKSINAMLEEV 209 S Q + D+ V+DVS SM +K +D+ S+ ++ + Sbjct: 124 SIKTPEGQQRSACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDLVKHSVKTIINNL 183 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + LV F + L G +H ++++ L STN G+ Sbjct: 184 DERD------RLSLVAFHTNAYKITDLTPMNENGRNHAIKELEKLIPL-DSTNIWDGIYQ 236 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVY 323 A + Q + + +I++F TDG+ ++ + Sbjct: 237 ALEVVKAGQQQSIQKGEQRVAFSQILLF-TDGQPNVIPPRGHLPMLKKYKEENDVNCSIS 295 Query: 324 AIGIRVIRSHEFLRACA 340 G E L A Sbjct: 296 TFGFGYNLDSELLDQLA 312 >gi|47575877|ref|NP_757376.2| complement C2 [Rattus norvegicus] gi|46237595|emb|CAE83973.1| complement component 2 [Rattus norvegicus] gi|47477793|gb|AAH70923.1| Complement component 2 [Rattus norvegicus] Length = 758 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 78/218 (35%), Gaps = 28/218 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ Q L++ ++LD S+S+ ++ +S M++ + V Sbjct: 250 KIQIQRSGHLNLYLLLDASQSVSE------KDFNIFKESAFLMVDRIFSFEIK---VSVA 300 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQ 274 ++TF+++ + L ++ + L TN+ L Y + + Sbjct: 301 IITFASRPKIIMSVLNERSQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQM 360 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + I+ +TDG++ ++ + + + + +YAIG+ Sbjct: 361 DRLGMETSAWQEIRHAIILLTDGKSNMGGSPKPAVDNIREILGISRNRNDYLDIYAIGVG 420 Query: 329 VIRS-----HEFLRACASPNSFYLVENPHSMYDAFSHI 361 + +E +++++ ++ F HI Sbjct: 421 KLDVDWKELNELGSKKDGERHAFILQDAKAVQQVFEHI 458 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 73/216 (33%), Gaps = 32/216 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + R + +LDVS SM + +TK+ A ++ L+ + Sbjct: 420 TGLQDSFPEVRKRARALFLLDVSESM--VQEPGLTKLQRAKDAVLKALDHFTAEDE---- 473 Query: 219 VQSGLVTFS---------NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 GL FS + + L K+ L ++TP + Sbjct: 474 --IGLAAFSQVGDGPLTPGVVSPVAPFKTNKEDLIAKLNELKAV----DATPLFEAV--S 525 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ---SLYYCNEAKKRGAIVYAIG 326 F +++ + N+ IV ++DG+N +T S ++ +V+ + Sbjct: 526 RFAGDQAKEYKD----NFINAIVLLSDGKNDTTHPGDLGGLSEQLGHQNHSTPVLVFTLA 581 Query: 327 IRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 LR A AS +Y +P+ + + Sbjct: 582 YGPDADVPTLREIARASGAHYYDATDPNRLEEVLGE 617 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 62/191 (32%), Gaps = 32/191 (16%) Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY- 249 + ++ + + L+ + GLV F + + +R + Sbjct: 116 QVDRLSAVKEVLGEFLQR-------REGDRVGLVVFGDAAYLQAPFSTDLQLSRRLLDEC 168 Query: 250 -LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + G T + N +F K I+ +TDG TK S Sbjct: 169 EVGMAGPRTAFGDAIGLGVN-LFSESEA----------PAKTIIALTDG--NDTKSQVPS 215 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRS-------HEFLR--ACASPNSFYLVENPHSMYDAFS 359 + A +R ++ + I + + LR A + S++ + S+ + Sbjct: 216 VEAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGSYFFAADRASLAGIYD 275 Query: 360 HIGKDIVTKRI 370 + +I T++I Sbjct: 276 QL-DEIETRKI 285 >gi|149544262|ref|XP_001518417.1| PREDICTED: similar to Integrin, alpha 10, partial [Ornithorhynchus anatinus] Length = 537 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 23/167 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++VLD S S+ + + +L ++ + P +Q GLV + + Sbjct: 44 MDIVMVLDGSNSIYPW--------AQVQTFLRRLLGKLFIDP---EQIQVGLVQYGERPV 92 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L S ++ + G T + ++ A + F Sbjct: 93 HEWKLGDFRSKNEVVRAARSLKRREGRETRTAQAIRSACTEGFS------PTRGGRPEAT 146 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +++V +TDGE+ ++ +L C Y I I E Sbjct: 147 RLLVVVTDGESHDGEDLPGALRACETL---NVTRYGIAIGSYFGSEL 190 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 91/260 (35%), Gaps = 43/260 (16%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + +++ + + +Y + + +S + P V QT Sbjct: 248 KSNHNINATNMQN-------KLCKYSSTWEVIMDSTDFKDSSPLSAPPPDPVISLLQTQD 300 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+ + +VLDVS SM ++ ++ L ++ + + G+VTF++ Sbjct: 301 RV-LCLVLDVSGSMYG------VRVARLKQAAEIFLLQI-----IEDGSWVGIVTFNSAA 348 Query: 231 EEFFLLEWGVSHL--QRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L+ S + YL G T G++ + + C Sbjct: 349 TIKTGLQQITSDSIRRSLTGYLPVTAGGGTRICNGVEAGFKVFKQKYASEKGCE------ 402 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSF 345 IV +TDGE ++ YC +E K+ G+I++ I + + Sbjct: 403 ---IVLLTDGE-------DSTISYCLDEVKRSGSIIHTIALGRSADPGLEELADMTGGLK 452 Query: 346 YLVE---NPHSMYDAFSHIG 362 + + +S+ DAF+ I Sbjct: 453 FSATDSLDSNSLIDAFTGIS 472 >gi|149046637|gb|EDL99462.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Rattus norvegicus] Length = 939 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 69/211 (32%), Gaps = 30/211 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D +++ A ++ + GL F+ Sbjct: 91 VILAIDVSLSMRA-TDVPPSRLAAAQAGAKTF------ADNLTPGINLGLEAFAGTASML 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + +L + T + + A I + G+ T IV Sbjct: 144 VSPITDHTATDNALDHL-QLAERTATGEAIFTALQAIDTLAGVVGGGGTPPPAR---IVL 199 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--------------VIRSHEFL 336 +DG+ + + + AK++G + I V E L Sbjct: 200 ESDGKQTVPTDLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPVDDESL 259 Query: 337 RACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 R A S SF+ + + ++ ++ + I Sbjct: 260 RRIAELSGGSFFTATSADELQASYQNLQQQI 290 >gi|266631670|emb|CBH29316.3| integrin, alpha X (complement component 3 receptor 4 subunit [Equus caballus] Length = 1160 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 74/203 (36%), Gaps = 26/203 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + K+ +K++ + + + LV F+ K Sbjct: 152 DIVFLIDGSGSI---YFKDFAKMLSFVKAVMSQFQRPSTQFSLMQFSNKFLVHFTFKDFM 208 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 G+ + +S+ T++ ++ ++F + KI++ Sbjct: 209 DSSDPLGL------LNSVSQLRGLTHTASAIQVVIKELFSAT------RGARKDASKILI 256 Query: 293 FMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NS 344 +TDG+ + A G I YA+G+ ++ + L AS Sbjct: 257 VITDGQKQGDYLGYDDVIPMAEAA---GIIRYAVGVGSAFQSTQAWQELNDIASKPSHEH 313 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 + V++ ++ D + + + I Sbjct: 314 IFKVDDFDALRDIQNQLKEKIFA 336 >gi|166795266|ref|NP_001107649.1| integrin alpha-X [Equus caballus] gi|164507177|gb|ABY59790.1| integrin alpha X [Equus caballus] Length = 1160 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 74/203 (36%), Gaps = 26/203 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ + K+ +K++ + + + LV F+ K Sbjct: 152 DIVFLIDGSGSI---YFKDFAKMLSFVKAVMSQFQRPSTQFSLMQFSNKFLVHFTFKDFM 208 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 G+ + +S+ T++ ++ ++F + KI++ Sbjct: 209 DSSDPLGL------LNSVSQLRGLTHTASAIQVVIKELFSAT------RGARKDASKILI 256 Query: 293 FMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NS 344 +TDG+ + A G I YA+G+ ++ + L AS Sbjct: 257 VITDGQKQGDYLGYDDVIPMAEAA---GIIRYAVGVGSAFQSTQAWQELNDIASKPSHEH 313 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 + V++ ++ D + + + I Sbjct: 314 IFKVDDFDALRDIQNQLKEKIFA 336 >gi|159900724|ref|YP_001546971.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893763|gb|ABX06843.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 415 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 75/211 (35%), Gaps = 28/211 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 Q L+ +VLD S SM KI +++ ++ ++ I V+ + Sbjct: 36 PTVQAAPPLNFCLVLDRSGSMAG------DKIQHLREAVREIVANLRPIDAVS------I 83 Query: 224 VTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F + +E L + LQ I+ + + G + S GL+ ++ Q + Sbjct: 84 VLFDDTLEVLVPARLADDLPALQNAIESIGEQGGTAMS-LGLQAGLAELQKFQAADRVGR 142 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--AC 339 ++ +TDG+ ++ + G + A+G+ + L A Sbjct: 143 ---------VLLLTDGQTWG--DEDTCRDLAKQIGDLGVSITALGLGTEWNEALLDDLAT 191 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 AS + +P + F + T + Sbjct: 192 ASNGESDYIADPSQISKYFQQTLQSAQTTTV 222 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 68/193 (35%), Gaps = 26/193 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM F +++ ++ L ++ I + + V G+V F + Sbjct: 306 VCLVLDKSGSMNGF-----NRLNRMNQAAKHFL--LQTIENGSWV---GMVHFDSTAYIK 355 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L +S +R L + N + F + G IV Sbjct: 356 SNLIQIISSKERN-NLLESLPTTANGGTSICAGIKSAFQVIGEIYPQIDGSE-----IVL 409 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLVENPH 352 +TDGE D + E K+ GAI++ I + + + + + + + Sbjct: 410 LTDGE------DNTAKNCIGEVKQSGAIIHLIALGPSADQAVIEMSTITGGNHFFASDEA 463 Query: 353 S---MYDAFSHIG 362 + DAF + Sbjct: 464 QNNGLIDAFGALA 476 >gi|198435384|ref|XP_002128544.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 761 Score = 54.0 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 58/168 (34%), Gaps = 27/168 (16%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A++D+ +VLD S S+ + ++ K+++ M + + N + V ++ Sbjct: 561 AQVDLFLVLDSSSSVGKENWN----------KTVHFMKDIIDNFVISPNDMLVAAVRYNK 610 Query: 229 KIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 I+ + G + I + G T + L Y + G Sbjct: 611 FIDTDSEIMIGQYTDLESTTNAIVNMPYDGSGTLTGNALYYVLRNMLTAPG-------NR 663 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + +++ +TDG + G V A+GI + Sbjct: 664 PDVQDLVLVVTDG-----ISKDDVATPATMLRATGTNVVALGIVNQKG 706 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 68/219 (31%), Gaps = 54/219 (24%) Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNV--VQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 S T++D+A + N M++++ L + N V+ LV+F + +L + Sbjct: 159 SQKTRLDVAKSATNTMIDQL-LANNATNPGSVRISLVSFDTFASDATAWSTSSENLHSIV 217 Query: 248 KYLSK--------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 TN L+ A A+ +K ++F++DG Sbjct: 218 NGYKTPQSSHLGGHRGGTNWEDALQKA------------DGTQPRADAQKHVIFVSDGNP 265 Query: 300 -------LSTKEDQ----------------------QSLYYCNEAKKRGAIVYAIGI--R 328 +DQ + + GA Y +G Sbjct: 266 TFRISSINGNPDDQYNDVHGHGDDDYYHSHPNYNYDAAKDDAKKIVDGGAAFYTVGTFGD 325 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 R + +++Y ++ ++ AF +I I Sbjct: 326 AARMQNLATEAGASDNYYKADDEAALKAAFKNIVASITH 364 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 77/209 (36%), Gaps = 31/209 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ ++D S S+ + I I + + PD + +VT+S Sbjct: 51 QHGESEIVFLVDSSGSIG------ASNFHFEINFIREISTIFSMSPDEA---RVSVVTYS 101 Query: 228 NKIEEFFLLEWGVSHLQR-------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + + +++ S + + ++ + T++ L+ A + Q R Sbjct: 102 DSSKIVRQIDYIGSSVGKNKCTFLGELSLIRYEAGWTDTKGALEEADRVL---QHARSGA 158 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N +++V +TDG+ + + + +G + AIG+ + E A Sbjct: 159 N-------RLVVLLTDGQ----STEGDPVGIATRIRNKGIRIVAIGVGNVNMDELTSI-A 206 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + ++++ + D + I D+ K Sbjct: 207 TAQYVFILDRLSYVVDLATRIKNDVKEKS 235 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 45/270 (16%), Positives = 82/270 (30%), Gaps = 54/270 (20%) Query: 131 AISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNSQTDARLD--------MMIVLD 179 A + Y+ P PW +P ++ ++ A + VLD Sbjct: 329 ATTTYRRPATPGVRLDPAPWPAPELPAQLPFPATSEIADHLLAAYQDEYRRPSHAIFVLD 388 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEV--------------KLIPDVNNVVQSGLVT 225 VS SM ++++ ++ + + + L+T Sbjct: 389 VSPSMRG------ERLELLRSTLRELAGTGTSTGRGTGTDDTLEQRFARFRERERVTLIT 442 Query: 226 FSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 FS + + ++ + L+ G T L+ AY + D Sbjct: 443 FSGTVHDTLEFTVNDPQPGSADLTAISAAADGLT-LGSGTAIYSALEAAYRYVADSAAAP 501 Query: 278 QHCNTEDANYKKIIVFMTDGENL-STKEDQ---QSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + IV MTDGEN T D L + A+ + + R Sbjct: 502 ADGVAPLTS----IVLMTDGENNQGTTADAFHSSYLALPDAARS--VRTFTVVFGDARVD 555 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 E +R A + + + S+ +AF I Sbjct: 556 E-MRTIADWTGGAMFDAR-TSSLSEAFREI 583 >gi|260837284|ref|XP_002613635.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] gi|229299021|gb|EEN69644.1| hypothetical protein BRAFLDRAFT_93676 [Branchiostoma floridae] Length = 2411 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 59/166 (35%), Gaps = 20/166 (12%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++IV+DVS SM E ++++A ++ +L+ + G+V+FS + E Sbjct: 141 VVIVIDVSGSMREPPGPEEQNRLNLAKQAALTVLDTLTPRDWG------GVVSFSARAET 194 Query: 233 FF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +Q I T G + A++ + + + Sbjct: 195 PEGCLGDSLGEANPTNIGIMQDFINQRVP-ETITMYGVGFRKAFDMFAEARNKKPEQF-- 251 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + II+F++DG + E R ++ G+ Sbjct: 252 -EDCYNIIIFLSDGSPTDKAFALDEITKGQELMDRSVYIFTYGLGA 296 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 58/178 (32%), Gaps = 30/178 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 K+ + +D++ +DVS+SM E + I K + I L Sbjct: 79 GTKLETVKREGVDVVFAIDVSKSMLAEDIAPNRIEKSKQLVTQIINNLGS---------- 128 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQG 275 + G++ ++ + S + + ++ T ++ A D + Sbjct: 129 DRIGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQ 188 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++ ++DGE+ +S +A G ++ IG+ Sbjct: 189 TN-----------RVLFIISDGEDH----VGESSNIAEQANDEGIRIFTIGVGKSEGG 231 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 59/164 (35%), Gaps = 20/164 (12%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K IP N I + + +K D+++VLD S SM+ + ID Sbjct: 65 FKTAKPIPNSINRWEISIDVFGRLKREPS-----DIVLVLDTSGSMDP--QKNPQGIDRI 117 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSKFGVST 257 K+ + V I + + + LV++ K+ F + + L +IK L G T Sbjct: 118 SKAKREAIHFVNEIFERDASARVALVSYGTKVSSNSFHTKQESNLLINEIKSLKAEG-GT 176 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + L A + K IV ++DG+ Sbjct: 177 FTQGALYEAKMLLNQSSA-----------PNKTIVLLSDGQPTY 209 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 6/168 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS-LDIVVVPQNEGYSISA 131 G + F +L+ N V D + + P + Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCM 134 Query: 132 ISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 S+ I ++ +P T + V + +QT + + I Sbjct: 135 TSQTGIVNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFI 182 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 27/70 (38%), Gaps = 2/70 (2%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASPNSFYL-VENPHSMYDAFSHIG 362 + + C AK +G V+ +G V + CAS + + V + AF I Sbjct: 467 NARLEAICTAAKNQGVQVFTVGFEVEDDEAIIMEDCASSRAHFFRVSGGGDLTTAFESIA 526 Query: 363 KDIVTKRIWY 372 + I R+ Sbjct: 527 RQITELRLTE 536 Score = 39.4 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 46/331 (13%), Positives = 95/331 (28%), Gaps = 45/331 (13%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT--------- 63 + +GGM I ++ L GM ++ + L + +D +++ AA Sbjct: 13 RDERGGMIIFGLFVFLLLLLAGGMAVDFMRTETARGRLQATLDGAVLAAADLDQDKDPVE 72 Query: 64 --------------QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 I + +++ + + + + + Sbjct: 73 VVRDYVAKAGLDPFLIDVDVTEIAGQRIVTASAKSDVTMHFMKMVGIDFLPAPARSTASE 132 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 V + + +V+ F + + I++ + + Sbjct: 133 AVSNLDVSLVLDMSGSMEGDKLDQLQAAAKNFVGIVYDTMGAEKILLNVVPYATQVAAPA 192 Query: 170 ARLDM--MIVLDVSRS-MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 LDM + + S S SF + T+ + + ++ VTF Sbjct: 193 GLLDMLGAFLREHSYSNCVSFSAADFTETSILEAAALPQGGHFDPFYTW-GPLRYDDVTF 251 Query: 227 SNKIEEFFL---LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI----------FDM 273 + L ++ I L G +T+ G+K+ I F Sbjct: 252 VCNPDPSTEVLTLASTQREIEDYIDGLVAEG-NTSIDVGMKWGAALIDPDLGSTLNEFAN 310 Query: 274 QGMRQHCNT----EDANYKKIIVFMTDGENL 300 N D + K+IV MTDG+N Sbjct: 311 GPSAAGINPVALWGDRSTDKVIVLMTDGKNT 341 >gi|326506938|dbj|BAJ91510.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 378 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 35/185 (18%), Positives = 66/185 (35%), Gaps = 26/185 (14%) Query: 107 IDDIVRSTSLDIVVVPQNEGYS-ISAISRYKIPLKFCTFI---PWYTNSRHIVM-PITSS 161 + + + + + YS SAI + F I N I P+ ++ Sbjct: 197 LTTAAGNCDVRSSRTVEIKTYSEFSAIPQSSSQDDFAVLIHLKAPCANPEQITSRPVNAT 256 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + A +D++ +LDVS SM TK+ + +++ +++ + + Sbjct: 257 SVGYPTSRAPVDLVTLLDVSGSMAG------TKLALLKRAMGFVIQHLGPSD------RL 304 Query: 222 GLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ FS+ + F L G + + L G TN LK A I R Sbjct: 305 SVIAFSSTVRRLFHLRRMSHSGRQQALQAVNSLGA-GGGTNIADALKKAAKVI----EDR 359 Query: 278 QHCNT 282 + N Sbjct: 360 SYKNP 364 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 69/175 (39%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ A + + ++K + Sbjct: 79 GTKIETVKREGVDIVFAIDVSKSMLAE-DVAPNRLEKAKRIAFETISQLKGD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G+V ++ L S + ++ ++ T ++ A N D Sbjct: 131 VGIVAYAASAYPQLALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAIRMASNYFDDK---- 186 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +++ ++DGE+ + + +EA+++G +Y IGI + Sbjct: 187 -------TPTARLLFILSDGEDH----EMGATEIASEAQEKGVHIYTIGIGTEKG 230 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 91/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 70 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 122 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + ++ +V + N I + + + +++ G++ Sbjct: 123 AVNMLNTTDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 176 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 177 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 226 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 227 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 286 Query: 371 WYDK 374 DK Sbjct: 287 TGDK 290 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 61/172 (35%), Gaps = 34/172 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q + D+++ +D+S SM + + + ++ +++K + + G++ F Sbjct: 72 QYEGSRDLLLAVDLSDSMRTPDMLDNGEQQARLTAVR---QQIKALIAKRAGDRVGIIVF 128 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---------GVSTNSTPGLKYAYNQIFDMQGMR 277 ++ L ++I L G +T + A R Sbjct: 129 ADHAYLLSPLT-------QEIPALLTLSDELDFDLVGRTTALGEAILLA----------R 171 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 QH + ++ +TDG N + D L A +G +Y +G+ Sbjct: 172 QHGDPGRPTA---LLLVTDGRNTAGNADP--LQEAKLAAAQGIRIYTLGVGA 218 >gi|281202341|gb|EFA76546.1| Ubiquitin-conjugating enzyme [Polysphondylium pallidum PN500] Length = 561 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 104/311 (33%), Gaps = 25/311 (8%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND--------IDDIVRSTSLDIV 119 E ++ N + F+ + ++ +ND I +T+ Sbjct: 29 EEIIKRAAITSIPADQLQLLNDSSELFKEDTLEDLGINDNVELILKQITSAAAATTTTTP 88 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + S ++ + T N+ +PI +V V S+ ++D+ IVLD Sbjct: 89 TDNNTSSITNIVNSTRRLSIGDITDSNNNNNNTEPFIPIKPTVSVFSEKIKQIDI-IVLD 147 Query: 180 VSRSMESF--------FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 VS SM++ + +T+I++A + GLV F +I+ Sbjct: 148 VSGSMKAAAYAGSKVPGELEMTRIEVAQ---ALFQTFIDKYVQQEIPACVGLVCFGERID 204 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F ++ + T +K A I + + + Sbjct: 205 LTFQPTRNFDSFSTELGDVDANQAKTRLYEAIKLAAETIVSYKNKHPADILLSDDLNCRV 264 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVE 349 +TDG++ S + + Y K+ ++ AI E L A+ S ++++ Sbjct: 265 FALTDGQDNSGSDPYKVFTY---LKEHNIVLDAIPCGNGADKEALGTFTKATGGSCFIID 321 Query: 350 NPHSMYDAFSH 360 + + + F Sbjct: 322 SSQAGVELFER 332 >gi|332206577|ref|XP_003252372.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 [Nomascus leucogenys] Length = 1107 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 251 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 298 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 299 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 357 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 358 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 404 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 405 WMACENKGYYYEIPSIGAIR 424 >gi|309356755|emb|CAP36239.2| hypothetical protein CBG_18899 [Caenorhabditis briggsae AF16] Length = 643 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 75/192 (39%), Gaps = 21/192 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+M ++D S S+ I D+ I +L++V + P + + +V ++ Sbjct: 432 PPIDLMFLVDTSSSIG------INNFDIQKNFICEILKDVDVAPGRS---RISMVQYAQD 482 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA----YNQIFDMQGMRQHCNTE 283 F + + ++R + LS G +T + L +A Y++ + ++H Sbjct: 483 PSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKRHQFLP 542 Query: 284 DANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + ++ ++DG D++S+ + ++A+ R + + Sbjct: 543 TPKHDRLQVLCLVSDGY-SDDSADKESVNLHDRLH---VKIFAVVTRSFNKDKLVPITRF 598 Query: 342 PNSFYLVENPHS 353 S + V S Sbjct: 599 DGSVFTVHQRES 610 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 53.6 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 71/216 (32%), Gaps = 44/216 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ---SGLVTFSNK 229 D+MI +D+S SM I + + ++++ ++ D + GL+ F++ Sbjct: 84 DLMIAVDLSGSM------QIEDMVLNGQAVDRFTMIQDVVSDFIERRKGDKLGLILFADH 137 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K G T + + + Sbjct: 138 AYLQAPLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALGVKRFDMVDKSN---------- 187 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--------------IRSH 333 +I+V +TDG N S + A KRG +YAIG+ S Sbjct: 188 -RILVLLTDGSNNSGSISPE--QAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSM 244 Query: 334 EF----LRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + L + A + ++ + + + I K Sbjct: 245 DLDEAQLSSLAKITGGLYFRARSSQDLQQIYQEIDK 280 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 50/429 (11%), Positives = 117/429 (27%), Gaps = 93/429 (21%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + +G +IL A+ +P++F V + + + K + + AA + + Sbjct: 1 MHHKQQGHASILFAMLIPLLFGVFALGSDGARAIQSKARIEDASEA----AALALSARDD 56 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + I+ + + + DD+ + YS+ Sbjct: 57 EHAMSDENKTIVQAYIEEYLPVEDSDVTILGIERLECDDMPECRQGSGRGEARYTQYSVR 116 Query: 131 AISRYKIPLKFCTFIPWYTNSR-HIVMP----ITSSVKVNSQTDARLDMMIVLDVSRSME 185 + PW+ + +P K +D++ D S SM Sbjct: 117 VSA---------DQTPWFGGGSPEVEVPEVWRSQGGAKARKYQSNAVDIVFAADFSGSMA 167 Query: 186 SFF-----DSSITKIDMAIKSINAML------------------------EEVKLIPDVN 216 S + ID+ K + E+ + ++ Sbjct: 168 SPWTGGSQPKYRDLIDILEKVTVELAPYNFDSQRYNSSVGVSGFNALTYRNELCAVNNLE 227 Query: 217 NVVQSGLVTFSNKIE-------------------EFFLLEWGVSHLQRKIKYLSKFGVST 257 G+V +S + L S R + + G + Sbjct: 228 KQGLLGVVDYSRTVARMWETKSCRPPSISNSAGFHDVPLTDDYSTFNRTVDRFTARGGTA 287 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQS-LYYCNEA 315 + Y + + H + I++ ++DG +N + C + Sbjct: 288 S--------YQAVMSGARLLDHGSNNRQ----ILIVISDGQDNNLNHTNGLVNAGMCRDI 335 Query: 316 ------------KKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + A + IG + + C ++ + EN +++ + Sbjct: 336 ISRLEGRPSANGRDVSARLAFIGFDFEPSMNPAMVRCVGEDNVFKAENTDELFEQIMFLI 395 Query: 363 KDIVTKRIW 371 ++ V Sbjct: 396 REEVGHLAT 404 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 51/329 (15%), Positives = 115/329 (34%), Gaps = 48/329 (14%) Query: 58 LVHAATQIMNEGNGNNRKKL-------KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + + +G +R GG+ + + T + F + F+ + D Sbjct: 572 ALKESADAASRPSGKDRYTKVMVTLVRPGGEPVTDYQGTVKIKFDGVEKTASFITNTSDP 631 Query: 111 VRSTSLDIVVVPQNEGYS-----ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +T V + + A I ++ T + + T+ Sbjct: 632 LNNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVTKDIFTNPNFSK 691 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + +M V+D S SM++ ++ M+E + + NN+V + Sbjct: 692 NSCSLATEMAYVVDYSSSMKAVDPTNY--------RGKKMIEFINQLKAKNNIV----IE 739 Query: 226 FSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + K +L G + L++ + SK +T+ G+ A + + Sbjct: 740 TNTK---ATILGEGTTDAVLKKDLYKASKDKGATDIFAGIDIALTKF-----------SN 785 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACAS 341 D K IV ++DG+ + + N+AKK+G +Y + + + L ++ Sbjct: 786 DTKTAKAIVVVSDGK----TSKSKMIKAINDAKKQGVKIYTVSMGKKSQINDATLMQLST 841 Query: 342 P--NSFYLVENPHSMYDAFSHIGKDIVTK 368 ++Y + ++ F + I+ K Sbjct: 842 ETGGAYYHALDNLQLHQVFQKLIDAILCK 870 >gi|196233777|ref|ZP_03132616.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196222139|gb|EDY16670.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 883 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 73/215 (33%), Gaps = 26/215 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ +++ ++DVS SM K+ + K + ++E++ + Sbjct: 412 EIPKDERPPSNLVFLIDVSGSMN-----MPNKLPLLQKCFSLLVEQLGPKD------RVS 460 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +VT+++ + +Q I L G T+ + G+ AY +Q Sbjct: 461 IVTYASGTKLVLEPTQDKEAMQTAIDGLHA-GGGTHGSSGIDLAYRM------AQQSFIP 513 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRAC 339 N ++ TDG+ N+ + L K G + +G + + ++ Sbjct: 514 GGTNR---VILATDGDWNIGITNQSELLSMITRKAKSGVFLTVLGFGLDNLKDSMLVKLA 570 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 N Y + +A + + + K Sbjct: 571 DHGNGHYAYIDTEQ--EARKVFVDQLSSTLVTIAK 603 >gi|73981421|ref|XP_850355.1| PREDICTED: similar to integrin, alpha 10 precursor [Canis familiaris] Length = 1165 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I VQ GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--VQVGLVQYGESPVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAILVACTEGFSLS------RGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPTSFLREIRMIANDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 74/194 (38%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW T++R + + I V + +++ ++DVS SM+ K+ + + Sbjct: 309 VMPTPWNTHTRLMHVAIKG-FDVKPTEQPKANLVFLIDVSGSMDE-----PDKLPLLKSA 362 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ ++K ++ V +G + + + + I L+ G ST Sbjct: 363 FRLLVSKLKADDTISIVTYAGDAGTVLEPTKASQKD----KILSAIDNLTP-GGSTAGEA 417 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 G+K AY ++ ++ N ++ TDG+ N+ +D + +K G Sbjct: 418 GIKEAY-RLAQKSFVKDGVNR--------VMLATDGDFNVGQSDDDDLKRLIEKERKTGV 468 Query: 321 IVYAIGIRVIRSHE 334 + G ++ Sbjct: 469 FLSVFGFGRGNLND 482 >gi|167534724|ref|XP_001749037.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772461|gb|EDQ86112.1| predicted protein [Monosiga brevicollis MX1] Length = 5417 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 87/244 (35%), Gaps = 19/244 (7%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSME 185 I+ + L + +++ ++++ V+ D+ +D+ ++D S S++ Sbjct: 424 INVTASLDDELAALASVCADAGFDYVLYNSSAAIYVSVMEDSCDRPMDLAFLIDASGSID 483 Query: 186 --SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--S 241 + + T I + ++ + + + + TFS+ F L+ + Sbjct: 484 KPEYGGRNGTFSQRVIPFVKNVVRNFNV---SSEQTRVSVTTFSSGYNISFFLDEHSDLN 540 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + + ++ G T + GL+ Q+F ++++ ++DG + Sbjct: 541 SVLAAVDTIAYTGGGTYTAAGLEAIRTQVFT---ESNGMRPASEGVPRVLIVVSDG---A 594 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAF 358 + + V A+G+ + LR A+ ++ Y +++ + Sbjct: 595 ASAGYEPASEAALLHEDDVNVIAVGVGKAMNLNQLRDMATGSGDSNLYTIKSFDKISAIV 654 Query: 359 SHIG 362 + Sbjct: 655 DEMS 658 >gi|149030577|gb|EDL85614.1| integrin, alpha 10 (predicted) [Rattus norvegicus] Length = 746 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 166 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F Sbjct: 215 HEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSES------RGGRPEAA 268 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 269 RLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAI 325 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 326 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 356 >gi|164565428|ref|NP_001101169.2| integrin, alpha 10 [Rattus norvegicus] Length = 1167 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 75/211 (35%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 166 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGRL----FIDPEQ--IQVGLVQYGENPV 214 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F Sbjct: 215 HEWSLGDFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSES------RGGRPEAA 268 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 269 RLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAI 325 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 326 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 356 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/249 (16%), Positives = 92/249 (36%), Gaps = 23/249 (9%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + + R LK F T + + S+ + S+ M++LD + S+ S Sbjct: 46 FVVDKNGRLITGLKPSNFKIVNTVTNFYELIDVSTSNLISKPGG-YSAMLLLDQTGSI-S 103 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-VSHLQR 245 D +I+ + +N + + +GL +F++ L G + ++ Sbjct: 104 TTDPYNLRIEASKIFLNNLGTD----------DYTGLTSFTSSYTSVVKLHSGFTNKTEQ 153 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 K L ++ + L + +Q + K +IVF TDGEN T Sbjct: 154 MKKSLDTLALNVSGGTPLYTS-----TIQSVTYTAQKGPTANKAVIVF-TDGENNVTTNT 207 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 + +A ++ ++ +G+ + L A + +F+ ++ + F +G Sbjct: 208 LEDAT--AKAIQQKIPLFTVGLSTDVNVNVLAQMANETGGAFFYAKDAGQLISTFGTLGN 265 Query: 364 DIVTKRIWY 372 + + + Y Sbjct: 266 LLHGQGLVY 274 >gi|198412594|ref|XP_002121401.1| PREDICTED: similar to sp4 protein, partial [Ciona intestinalis] Length = 406 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 19/169 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+M VLD M+S + I ++S + + L ++ GL ++ Sbjct: 221 GTIDLMFVLDG---MQSVTSQNFIGIKNWMQS---FIPRLNLQD---GKIRFGLDRYTGT 271 Query: 230 IEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L + + I L ST +T + N F+ G R + Sbjct: 272 VVTDIELGESQNLPQINNVISSLPFTTQSTKTTLSITSTINNNFNSVGSRFPAS------ 325 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++++V +TDG T D +L A+ + ++ + + L Sbjct: 326 RRLVVVVTDGLAQETMSDVNTLV--QLAQTNKVEIISVAVGPFANLATL 372 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 2/69 (2%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 D++ C A++ G ++YAIG+ V S + L+ CAS + Y + + AF I Sbjct: 336 DRRLRNICAAAQRAGIVIYAIGMDVDSQNSLDLLKECASTEAHYFDVDGLEIQTAFDMIA 395 Query: 363 KDIVTKRIW 371 I R+ Sbjct: 396 ASISMLRLT 404 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 85/244 (34%), Gaps = 32/244 (13%) Query: 135 YKIPLKFCTFIPWYTNSRHIVMP----------ITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y P+ P+ TN + P KV +++ ++D S SM Sbjct: 200 YNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQNLVFLIDTSGSM 259 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + K+ + +S +L ++ +V V +G S + E + L+ Sbjct: 260 -----ADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGS---SGVLLEPTKVADKTRILE 311 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 + I L+ G ST GLK AY M G + I+ TDG+ N+ Sbjct: 312 K-INALTS-GGSTAGHEGLKGAYALAETMTGDGEQTR---------IILATDGDFNVGLS 360 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + Y E ++ G + +G + E ++ A ++ +A + Sbjct: 361 DPDSLKRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQNGQGVAAY-IDTLSEARKVLV 419 Query: 363 KDIV 366 +V Sbjct: 420 DQVV 423 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 64/174 (36%), Gaps = 22/174 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK++ K++N +L +++ N+ ++FS+ I Sbjct: 284 VVFVIDVSGSMFG------TKMEQTKKAMNVILSDLRA----NDYFNI--ISFSDTINVW 331 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQI-FDMQGMRQHCNTEDANY 287 ++ + ++ YL + T+ L A + + Q + + Sbjct: 332 KAGGSIQATIQNVHSAKDYLHRMEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPL 391 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+TDGE L +A ++ + LR + Sbjct: 392 ---IIFLTDGEPTAGVTTPSVILSNVRQAVGHRVSLFTLAFGDDADFTLLRRLS 442 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 51/164 (31%), Gaps = 2/164 (1%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + + S + N +D T + + Sbjct: 72 ISLSVAEADGIAAGHLNFVFFQKKSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYVQ 131 Query: 129 ISAISRYKIP--LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 +A ++ +P + + ++ V Sbjct: 132 CTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTT 175 >gi|302542925|ref|ZP_07295267.1| putative tellurium resistance protein [Streptomyces hygroscopicus ATCC 53653] gi|302460543|gb|EFL23636.1| putative tellurium resistance protein [Streptomyces himastatinicus ATCC 53653] Length = 257 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 60/160 (37%), Gaps = 27/160 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ + + R + +VLD S SM ++ D S+ + ++ L++ ++P Sbjct: 46 SLTKHHVSGQRAAVYLVLDRSGSMRPYYRDGSVQHLAEQTLALAVNLDDDGVVP------ 99 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 +V FS +I+ + + + +I L TN ++ + Sbjct: 100 ---VVFFSTEIDGTAEIS--LEAYRDRINPLHDSMGHMGRTNYHVAMQAVID-------- 146 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 H + A +VF TDG S + L C AK Sbjct: 147 --HYQSCGAEDPAFVVFQTDGSPTSKAAAEHVL--CTAAK 182 >gi|145485516|ref|XP_001428766.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124395854|emb|CAK61368.1| unnamed protein product [Paramecium tetraurelia] Length = 947 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 60/350 (17%), Positives = 122/350 (34%), Gaps = 42/350 (12%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN-----MSF 94 +SHI K +L+ S + +I N +++ + ++ +++ + Sbjct: 622 ISHILQCKVLLNLEDSNSALA---EISNAEQLSDKYENSYDRSDAQVNDSFPIPPGILKQ 678 Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHI 154 R F+ D I ++ + + ++ Y LK Sbjct: 679 RILYEKGLFIKRYDSIKKAAFIFTECLETSKFYDPEIRINCLKQLKEIFQSQNLLYKVPK 738 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + ++ D++ V+D S SME+ K ++AI I + + D Sbjct: 739 IEQLLELNEIKKNN----DIVFVIDHSGSMENI------KKELAINGILKIFDNYLQDQD 788 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQ 269 + + F+ IE F L ++L+ I+ + T + +AY+ Sbjct: 789 -----RISYMRFNQNIEVIFDLTSKSENTAYLRSAIERSKNIRAEGMTAMLSAVLHAYSI 843 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 H + ++ IV + DGE NLS ++ + ++ + IV IG+ Sbjct: 844 ---------HEKAVKKDNQQWIVVLCDGEDNLSNITYERMKKFTSKRPQISLIVIGIGLS 894 Query: 329 VIRS--HEFLRACASPNSFYLVEN--PHSMYDAFSHIGKDIVTKRIWYDK 374 + E C +L+E+ + AF I I YD+ Sbjct: 895 LKPDCLDELYDLCRLSQKGFLIESVYSEDLDIAFQSISNLIFGTSSIYDE 944 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 17/166 (10%), Positives = 55/166 (33%), Gaps = 6/166 (3%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRSLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS--TSLDIVVVPQNEG 126 + + + G + + ++ N V D + T + + Sbjct: 72 ISLSVAE--ADGIAAGHLNFVFFQKTSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKY 129 Query: 127 YSISAISRYKIP--LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 +A ++ +P + + ++ V Sbjct: 130 VQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTT 175 >gi|291059650|gb|ADD72385.1| putative von Willebrand factor type A domain protein [Treponema pallidum subsp. pallidum str. Chicago] Length = 650 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 76/187 (40%), Gaps = 33/187 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG------- 222 A++++ V+D S SM + KI A +++ + +K + ++++ +G Sbjct: 428 AQIEVSFVVDNSGSM------NKEKIASAREALAVSMLSLKDFGEYSDMLAAGRRERTTI 481 Query: 223 ---LVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + F + + S ++ + +FG TN LK+ I Sbjct: 482 HSEVYYFGSSFIKVKSFGKSKSKDFNSAQLIKASVNLDGRFG-GTNDAEVLKH----ILA 536 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV--YAIGIRVI 330 R+ + D ++ K+++ +TDG + E ++++ E ++RG ++ + IG+ Sbjct: 537 DVERRRARVSSDTSFVKVVLVITDGCSSYPHESRRTI---EELRRRGVMIFGFQIGLMSP 593 Query: 331 RSHEFLR 337 Sbjct: 594 EETALFH 600 >gi|15639238|ref|NP_218686.1| hypothetical protein TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189025479|ref|YP_001933251.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] gi|14285852|sp|O83274|Y246_TREPA RecName: Full=Uncharacterized protein TP_0246 gi|3322523|gb|AAC65240.1| predicted coding region TP0246 [Treponema pallidum subsp. pallidum str. Nichols] gi|189018054|gb|ACD70672.1| hypothetical protein TPASS_0246 [Treponema pallidum subsp. pallidum SS14] Length = 597 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 76/187 (40%), Gaps = 33/187 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG------- 222 A++++ V+D S SM + KI A +++ + +K + ++++ +G Sbjct: 375 AQIEVSFVVDNSGSM------NKEKIASAREALAVSMLSLKDFGEYSDMLAAGRRERTTI 428 Query: 223 ---LVTFSNKIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + F + + S ++ + +FG TN LK+ I Sbjct: 429 HSEVYYFGSSFIKVKSFGKSKSKDFNSAQLIKASVNLDGRFG-GTNDAEVLKH----ILA 483 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV--YAIGIRVI 330 R+ + D ++ K+++ +TDG + E ++++ E ++RG ++ + IG+ Sbjct: 484 DVERRRARVSSDTSFVKVVLVITDGCSSYPHESRRTI---EELRRRGVMIFGFQIGLMSP 540 Query: 331 RSHEFLR 337 Sbjct: 541 EETALFH 547 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 83/231 (35%), Gaps = 34/231 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI------------------TK 194 ++ + + S +K +L++++VLDVS SM S FDS K Sbjct: 99 YMAVGLDSGMKAADFARKQLNLVVVLDVSGSMGSPFDSYYYDQTVQPTAGVPDEGETKKK 158 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYL 250 ID+A + + ++ ++ ++ +V FS+ L V L+ +I Sbjct: 159 IDVAKEVLAGIVGLLRPDDSLS------VVLFSDAACVPKPLGPVRCADVDKLKEQISAD 212 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSL 309 TN G+ Q+ G + + +VF+TD + N +Q L Sbjct: 213 VVEMGGTNFQAGIDAGGAQLT---GCAACMEANASLVENRVVFLTDAQPNAGDDSEQGLL 269 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 G IG+ + + + + + ++++ V P Sbjct: 270 ARIKALSADGIYTTIIGVGLDFNTQLVESIGKVRGSNYFSVHTPGEFRRRL 320 >gi|198430849|ref|XP_002120173.1| PREDICTED: similar to cGMP-dependent protein kinase 1, alpha isozyme (CGK 1 alpha) (cGKI-alpha) [Ciona intestinalis] Length = 1896 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 73/192 (38%), Gaps = 22/192 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ VLD S S+ S++ A++ + +L + D NV G+V F + Sbjct: 839 VDIIFVLDESGSV------SLSSYKQALRWVVELLTSFREDVDKGNVH-VGVVAFHSWAG 891 Query: 232 EFFLLEWG-VSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S+LQ +I LS + TN P + + +R+ + Sbjct: 892 TRIALGAFEFSNLQARIIALSNGRNYGGTNIAPAID---------ETLREFNRNGRTGIQ 942 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 K ++ MTDG + + A+ +G + A+G+ + L + + Sbjct: 943 KQMILMTDGYSSYPNAISPA---AQRARAQGVVTVAVGVGGSSYAQLLNIAGNQTRVFYA 999 Query: 349 ENPHSMYDAFSH 360 N + + + Sbjct: 1000 TNFNRLGEVVES 1011 >gi|13365521|dbj|BAB39134.1| CD11-1 [Cyprinus carpio] Length = 1196 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 81/210 (38%), Gaps = 23/210 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + ++++ + D S SM +M I +++++ N+ ++ Sbjct: 149 TAAYQECSKREVNLVFLFDGSSSM------KTVDFEMNKNFIKDIMKKLS-----NSSIK 197 Query: 221 SGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V FS + F ++ + K+ + TN+ + Y + + Sbjct: 198 FAAVQFSTDVRTVFDFNDYQSGSAEEKLMKETHMKSLTNTHKAIDYILKNLLNSMLSGA- 256 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D+ +K +V +TDG+ S +D L C+ ++ + Y IG+ + E + Sbjct: 257 ----DSKAQKALVIITDGDP-SDNDDYNVLKKCD---EQNILRYIIGVGKVDLIELTQLA 308 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 + P N+ + +++ + ++ K I Sbjct: 309 SEPKRNNTFYIKDYSGLKGLLDNLQKKIYN 338 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 64/216 (29%), Gaps = 31/216 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S V T A+ ++++D S SM + + ++ L Sbjct: 72 GRPSPVTSKPATRAQRTTVLLIDTSGSMG------RSGMATVRTAVKDFLASAPKD---- 121 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V+ G+V+F N + +Q + L G + + + ++ G Sbjct: 122 --VRIGVVSFGNTAGPEIAPTTARAAVQAVVDDLRADGNTALFSGVTQAV--RMLGSTGD 177 Query: 277 RQHCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIV--YAIGIRVIRS 332 R IV ++DG+N + A + V + G + Sbjct: 178 RS------------IVLLSDGKNTVGDRASGLAAAGKALTASQVRVEVVRFTTGENDPEA 225 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A S + + AF K + ++ Sbjct: 226 LAAFAK-AGGGSVVQATDAEGVRTAFQTAAKVLESQ 260 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 79/216 (36%), Gaps = 36/216 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++D L + +++D S S +D + +E V L P+ + V L+ Sbjct: 68 SQESDLPLTLGLMVDTSMSQRRV-------LDAERGASYRFIETV-LRPNKDQVF---LM 116 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYL---------SKFGVSTNSTPGLKYAYNQIFDMQG 275 F +I L + L + Y+ ++ G T + A ++ + Sbjct: 117 QFDFRIFMRQPLTNSLRQLSDSLPYVDTPTFNQLRAQSGGGTLLYDAVVTASQEVMLNRT 176 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHE 334 R K ++ +TDGE+ D A++ ++Y+I Sbjct: 177 GR-----------KALILLTDGEDYG--SDASVGDAIEAAQRADTLIYSILFADQGDGRR 223 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L+ + + SF+ V + F+ I +++ ++ Sbjct: 224 PLQRMSKETGGSFFEVSKKQDIDQIFTAIQEELRSQ 259 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 79/197 (40%), Gaps = 34/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM S +++ ++ L ++ I + + V G+V F + Sbjct: 306 VCLVLDKSGSM-----SGFNRLNRMNQAAKHFL--LQTIENGSWV---GMVHFDSTANIK 355 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L +S +R ++ L T+ G++ A+ I + + + + Sbjct: 356 SNLIQIISSKERNNLLESLPKAANGGTSICAGMRSAFQVI---REVYPQIDGSE------ 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLV 348 IV +TDGE+ S K+ C +E + GAI++ I + + A + + + Sbjct: 407 IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIALGPSADQAVIEMSAMTGGNHFFA 459 Query: 349 ENPHS---MYDAFSHIG 362 + + DAF + Sbjct: 460 SDEAQNNGLIDAFGALA 476 >gi|149036652|gb|EDL91270.1| anthrax toxin receptor 1 [Rattus norvegicus] Length = 457 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 56/154 (36%), Gaps = 13/154 (8%) Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 + FS + L +++ ++ L K G T G + A QI+ Sbjct: 1 MSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGY 60 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + +I+ +TDGE L S N ++ GAIVY +G++ + R Sbjct: 61 RTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARI 112 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + + V + + A I I+ K Sbjct: 113 ADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 143 >gi|169856457|ref|XP_001834887.1| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] gi|116504062|gb|EAU86957.1| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] Length = 406 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 60/194 (30%), Gaps = 35/194 (18%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA--IKSINAMLEEVKLIP 213 M + K LD++ + D + S + S++ I S + L Sbjct: 22 MTVEPGPKDAPAPKRPLDIVFLQDATGSQGPYIQSAVKAIHSICQKISQSPALGGGDSGS 81 Query: 214 DVNNVVQSGLVTFS-------NKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGL 263 ++ GL+ F + + + F V +++ + L G ST L Sbjct: 82 APTESIRFGLIAFRDHPPQDRSYVTKNFGFTSNVDEVKKHLSGLIASGGGDGPEASTAAL 141 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD-----------GENLSTKEDQQSLYYC 312 A N + + KI+V +TD G + + L Sbjct: 142 AEALNMEWKENAV------------KIVVLITDAPPHGLGEVGDGFPNGSPDQNDPLSIA 189 Query: 313 NEAKKRGAIVYAIG 326 + + G +Y I Sbjct: 190 RQMAEHGISLYIIA 203 >gi|197102272|ref|NP_001124862.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Pongo abelii] gi|55726163|emb|CAH89855.1| hypothetical protein [Pongo abelii] Length = 1079 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 73/175 (41%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++++ ++ +I +D+S SM + D + ++++ + ++ +++ K+ D + Sbjct: 79 GTKMDTRKRQGIEAIIAMDISNSMMAE-DVTPSRLEKSKMLVSNIVD--KMTDD-----K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMR 277 GL+ ++ + + + ++ ++ ++T T + A Sbjct: 131 IGLIVYAGEAYTQLPITSDYVSAKIFLETINPSMITTQGTDIKQAIDLAMKSFTS----- 185 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + K I +TDGE+ + ++ A ++G VY +G+ + Sbjct: 186 ------NQDVSKAIFVITDGEDN----EGGAVEMAKAAAEKGIKVYVLGVGSPQG 230 >gi|149046636|gb|EDL99461.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Rattus norvegicus] Length = 920 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|156382085|ref|XP_001632385.1| predicted protein [Nematostella vectensis] gi|156219440|gb|EDO40322.1| predicted protein [Nematostella vectensis] Length = 1221 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 72/198 (36%), Gaps = 21/198 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--K 229 +D+ + +D S M D+ + K + ++ V D N ++ G+ T+ + Sbjct: 69 MDVALAVDTSDGMS---DADLAKTKSLVTTL------VNQFSDSENSIRFGITTYGQEAR 119 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + L+ IK + K GV N +F ++G + Sbjct: 120 TLANFKQNFDEAKLRTAIKGIQKTGVQARRHDLAAMAVKNDLFSLEG------GMRQGHP 173 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF-YL 347 + ++F + G N T +D + G + AIG+ L AS N F + Sbjct: 174 RFVIFFSAGANTGTADDLK--KASKPLTDLGVNMIAIGVNSNADQASLAELASENRFIFS 231 Query: 348 VENPHSMYDAFSHIGKDI 365 +P + + I + Sbjct: 232 ANSPAELDALWPSIEAQM 249 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 64/190 (33%), Gaps = 20/190 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D +D+ + VS S+ F ++I K+ + L+ N +V +GL+ + Sbjct: 977 ECDRPVDVYFGIPVSGSVAPQF-ANIKKM---------FTSLLSLLKVNNRLVHTGLIRY 1026 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ + L + + I + G S N + A + F + G Sbjct: 1027 SDTADVVLNLNRVYNRDAVGAVINQIPISGSSLNLARAFEVAADHGFTIYG------GVR 1080 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 KI V G S + Q+ + K G V +G+ ++ S Sbjct: 1081 QTVPKIFVVYIPGPVSSPPAEVQAAVT--KLKGLGVRVVLLGLDSNIDKSLYSTVSTQPS 1138 Query: 345 FYLVENPHSM 354 V S Sbjct: 1139 RKFVLTADSF 1148 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 91/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 58 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 110 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + ++ +V + N I + + + +++ G++ Sbjct: 111 AVNMLNTTDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 164 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 165 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 214 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 215 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 274 Query: 371 WYDK 374 DK Sbjct: 275 TGDK 278 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 76/198 (38%), Gaps = 20/198 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S+ M + ++ + I ++ + + + GLVT+++ + Sbjct: 38 LDVIAVVDNSQGMTT---DGLSSVAANIATVFSSGTRIGTNATEPRTTRVGLVTYNSVAK 94 Query: 232 EFFLLEWG--VSHLQRKI-KYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDAN 286 L ++ + + YLS +T+S L+ A N +F Q N+ + Sbjct: 95 VNADLNTFQSINDVYNGVFNYLSAVTDATDSYLATGLQAA-NALFAS----QSFNSTRNH 149 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPN 343 YKK+++ + + + +E K G + + + + L+ A+P Sbjct: 150 YKKVVIVY--ASEYKSYGELDPVKVADEMKGSGVYIVTVAYDQGGNGQLLKDLAGIATPG 207 Query: 344 SFYLVENPHSMYDAFSHI 361 Y N + I Sbjct: 208 --YSFSNTDDSDNVIGEI 223 >gi|301778383|ref|XP_002924606.1| PREDICTED: integrin alpha-10-like [Ailuropoda melanoleuca] Length = 1175 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKDEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAILVACTEGFSLS------RGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPTSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|281353330|gb|EFB28914.1| hypothetical protein PANDA_013982 [Ailuropoda melanoleuca] Length = 1128 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 82/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 116 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 170 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 171 LRRLVGRL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKDEVVRAARNLSRREGR 224 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + +++V +TDGE+ +E +L C Sbjct: 225 ETKTAQAILVACTEGFSLS------RGGRPEAARLLVVVTDGESHDGEELPAALKACEAG 278 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 279 R---VTRYGIAVLGHYLRRQRDPTSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 335 Query: 364 DIVT 367 I Sbjct: 336 RIFG 339 >gi|114614242|ref|XP_001160235.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 1 [Pan troglodytes] Length = 1110 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 66/185 (35%), Gaps = 26/185 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P + +++ ++ +I LD+S SM + D +++D + I +L Sbjct: 73 IARPQIGNRISSTKNGKGIETVIALDISNSMLAQ-DVIPSRLDKSKLLIEDLLRSFDND- 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 + GL+ F+ + + + ++ T+ + A Sbjct: 131 ------KVGLIVFAGDAFVQLPITSDFISAKMFLNDINPSLIGTQGTDIGKAINLAM--- 181 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 H + + K I+ +TDGE+ + + +A++ G +Y +GI Sbjct: 182 --------HSFSPTSKAGKAIIIITDGEDN----EGGAEAMAKKAQEAGFHIYILGIGST 229 Query: 331 RSHEF 335 E Sbjct: 230 SGAEI 234 >gi|296209823|ref|XP_002807089.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Callithrix jacchus] Length = 1094 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 65/205 (31%), Gaps = 40/205 (19%) Query: 177 VLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 V D S SM+ T++ +A+ + +A IP + GL FS ++ Sbjct: 422 VYDTSGSMDLPVANSGGKTRLQIAVGAADA------AIPLFAKDSRLGLWQFSTNLDGTK 475 Query: 235 LLE-------------WGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G L + L G + L A+ + + Sbjct: 476 PYRELVPVGLMNDEVGTGTREEALVAAVNGLKAKGGTGLYATAL-AAFESL----SAQYQ 530 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-----KRGAIVYAIGIRVIRSHE 334 + + +V +TDG+N K K + IG + Sbjct: 531 PDKPNQ-----VVLLTDGQNDDPTSSMTLTQLIATLKAEYNPKAPVHIITIGYGADADMD 585 Query: 335 FLR--ACASPNSFYLVENPHSMYDA 357 LR + A+ + Y ++P+S++ Sbjct: 586 ALRQISAATGSKTYPAQDPNSIFQV 610 >gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] Length = 584 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 76/200 (38%), Gaps = 32/200 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM+ +K+ + +++ +++ + + +V+FSN Sbjct: 147 PLDLVTVLDVSGSMQG------SKLALLKQAMGFVIDNLGPAD------RLSIVSFSNDA 194 Query: 231 EEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L G + + ++ L G STN + GL A + D R Sbjct: 195 SREIRLTRMSGDGKASAKEAVESLVADG-STNISRGLLVASEVLADR---RYRNAVTS-- 248 Query: 287 YKKIIVFMTDGENL----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 ++ ++DG++ +A R ++ G + A A Sbjct: 249 ----VILLSDGQDNQSGVGRNHQNLVPPLFRDADSRPGSIHTFGFGSDHDAAAMHAIAEV 304 Query: 341 SPNSFYLVENPHSMYDAFSH 360 + +F VEN + D+F+ Sbjct: 305 ARGTFSFVENLAVIQDSFAQ 324 >gi|326501022|dbj|BAJ98742.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 284 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 70/199 (35%), Gaps = 25/199 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + ++D S S+ + A + A + + +++ + F + + Sbjct: 62 SISFLVDESGSIG------ASAFQYAKSFLYAYVNQ-----TYDDLSIMSIHFFDSTFDP 110 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + I+ + G T + + + I + KI+V Sbjct: 111 YIYYGNNRATILNMIQSKAYRGAGTATGNAINNSVALIKNKNFPNG--------VPKILV 162 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENP 351 +TDG + N A+K G +++ +GI + + + ++ S ++ + + Sbjct: 163 ILTDG-----GSYDSVIEAANNARKNGIMLFCVGIGSNVNTAQLIQIAGSTSNIVYISSY 217 Query: 352 HSMYDAFSHIGKDIVTKRI 370 S+ + + I + I Sbjct: 218 SSLTNLVNLIENYFCKQII 236 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 59/173 (34%), Gaps = 25/173 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + + ++ ++ + + ++ + V F+ Sbjct: 297 SVNIAFLIDGSSSVGD------SNFRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQ 347 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + I+ + T + + + +F +R+ N Sbjct: 348 RTEFSFTDYSTKENVLAAIRNIRYMSGGTATGDAISFTVRNVFGP--IRESPNKN----- 400 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +V +TDG+ A G V+++G+ + L+ AS Sbjct: 401 -FLVIVTDGQ-----SYDDVQGPAAAAHDAGITVFSVGVAWAPLDD-LKDMAS 446 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 79/197 (40%), Gaps = 34/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM S +++ ++ L ++ I + + V G+V F + Sbjct: 307 VCLVLDKSGSM-----SGFNRLNRMNQAAKHFL--LQTIENGSWV---GMVHFDSTANIK 356 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L +S +R ++ L T+ G++ A+ I + + + + Sbjct: 357 SNLIQIISSKERNNLLESLPKAANGGTSICAGMRSAFQVI---REVYPQIDGSE------ 407 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLV 348 IV +TDGE+ S K+ C +E + GAI++ I + + A + + + Sbjct: 408 IVLLTDGEDNSAKD-------CIDEVTQSGAIIHLIALGPSADQAVIEMSAMTGGNHFFA 460 Query: 349 ENPHS---MYDAFSHIG 362 + + DAF + Sbjct: 461 SDEAQNNGLIDAFGALA 477 >gi|329744564|ref|NP_001192916.1| voltage-dependent calcium channel subunit alpha-2/delta-1 [Bos taurus] Length = 1091 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|327402260|ref|YP_004343098.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327317768|gb|AEA42260.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 473 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 83/198 (41%), Gaps = 27/198 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++++ V+D+S SM + K+++ S+N ++ +++ D G+VT++N Sbjct: 290 DVNVIFVIDISSSM-----KNGEKMNLMKYSLNQLVSKLRPNDD------MGMVTYANTA 338 Query: 231 EEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F G + L+ I L G+ T G+K Y ++ + Sbjct: 339 DVFQAPTSGSNKESLKSSITSLKPSGM-TAGGKGIKLGYKEVMKNYDPAKANM------- 390 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYL 347 ++ +TDG +D Q + K+G + +GI L+ A+ + Sbjct: 391 --VIIITDGAFNKDSDDYQ--KTVQKYAKKGVVFSVLGIETRERDAKLLQEAAAFGNGRY 446 Query: 348 VENPHSMYDAFSHIGKDI 365 V + + DA S++ ++I Sbjct: 447 V-SIQKLVDAHSNLTEEI 463 >gi|326670664|ref|XP_003199264.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 2265 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 54/327 (16%), Positives = 109/327 (33%), Gaps = 33/327 (10%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 531 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 585 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 S+ ++ + + Y+ S +P F N V P+T +V + Sbjct: 586 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAAEPR 644 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R D++ +LD S D + + + M+E + + + + + +V +S Sbjct: 645 MARR-DVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVSENRD---RVSVVQYS 693 Query: 228 NKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L I+ L G N+ L+Y + +F + Sbjct: 694 RDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDNVFTASSGSRRVEG-- 751 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++++ ++ + + K+ G +++ IG R S E R P+ Sbjct: 752 --VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFVIGSRSSDSQELQRISQEPSY 804 Query: 345 FYLVENPHSM----YDAFSHIGKDIVT 367 V + + F++I K V Sbjct: 805 ALSVSDFTDLPSVQQQLFTNINKVFVA 831 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 50/342 (14%), Positives = 107/342 (31%), Gaps = 47/342 (13%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 L + D +A EG L GG + + L+++G + Sbjct: 1330 AALQYVRDSVFTASAGSRKQEGVPQVLILLSGGRSNDNVDTP-----ASALKESGVLILG 1384 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIP------LKFCTFIPWYTNSRHIVMPITSS 161 S+ ++ + + Y+ S +P + + +S Sbjct: 1385 IGTRNSSR-EVQRITNDPAYAQSLNDFSDLPGVQQQFYSSLNNVLVQVKPMTPTVAGMTS 1443 Query: 162 VKVNSQT---------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + AR D++ +LD S D + + + M+E + + Sbjct: 1444 AHMTLCSISLPIAEPRMARRDVVFLLDGS-------DGTRSSFSAMRDFVERMVERLNVS 1496 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQ 269 + + + +V +S E F L I+ L G N+ L+Y + Sbjct: 1497 ENRD---RVSVVQYSRDPEAHFYLNTYSRKEDALDTIRGLRHKGGRPLNTGAALQYLRDN 1553 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +F + ++++ ++ + + K+ G +++ IG R Sbjct: 1554 VFTASSGSRRVEG----VPQLLILLS-----GARSFDNVDTPASSLKELGVLIFVIGSRS 1604 Query: 330 IRSHEFLRACASPNSFYLVENPHSM----YDAFSHIGKDIVT 367 S E R P+ V + + F++I K V Sbjct: 1605 SDSQELQRISQEPSYALSVSDFTDLPSVQQQLFTNINKVFVA 1646 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 75/217 (34%), Gaps = 41/217 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------- 226 + +VLDVS SM + ++I A ++ N +L+ V +V +++ + Sbjct: 41 VELVLDVSGSMRTRDMGGESRISAAKQAFNDVLDAVP--EEVQLGIRTLGADYPGDDRKV 98 Query: 227 ----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ L+ + + + L+ G T P L A + + R+ Sbjct: 99 GCKDTKQLYPVGPLD--RTEAKTAVATLAPTGW-TPIGPALLGAADDLDGGDSTRR---- 151 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAI--VYAIGIRVIRSHEFLR 337 IV +TDGE+ D C A+ RG + +G+ Sbjct: 152 --------IVLITDGEDTCGPLD-----PCEVARDIAARGIHLVIDTLGLVPDAKIRQQL 198 Query: 338 AC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 C A+ ++ V+N + + + Sbjct: 199 TCIAEATGGTYTAVQNTDELSGRVKQLVDRAAEPVVT 235 >gi|149419345|ref|XP_001517573.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 139 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 57/148 (38%), Gaps = 13/148 (8%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGM 276 ++ + FS + L +++ ++ L K G T G + A QI+ Sbjct: 3 LRMSFIVFSTRGSTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIY----- 57 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 H N + +I+ +TDGE L + N ++ GA VY +G++ + Sbjct: 58 --HENWQGYRTASVIIALTDGE-LHENLFFHAEQEANRSRDFGATVYCVGVKDFNETQLA 114 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKD 364 R S + + V + ++A I Sbjct: 115 RIADSKDHVFPVNDG---FEALQGIIDS 139 >gi|329663456|ref|NP_001192519.1| integrin alpha-10 [Bos taurus] gi|297472754|ref|XP_002686129.1| PREDICTED: integrin, alpha 10-like [Bos taurus] gi|296489504|gb|DAA31617.1| integrin, alpha 10-like [Bos taurus] Length = 1167 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 79/244 (32%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGESSVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + + +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAI------MMACTEGFSQSRGGRPEAARLLVVVTDGESHDGEELPTALQACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFL---RACASP---NSFYLVENPHSMYDAFSHIGK 363 + Y I + R FL RA AS F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDPDEKFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 75/194 (38%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + I + S+ +++ ++D S SM S ++ + S Sbjct: 191 LAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSS 244 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + +++E++ ++ V +G ++I + + + I L G STN Sbjct: 245 LKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGA 299 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGA 320 GL+ AY Q + N I+ TDG+ +D +S+ + ++ G Sbjct: 300 GLEMAYQQ------AAKGFVKGGVNR---ILLATDGDFNVGIDDPKSIESMVKKQRESGV 350 Query: 321 IVYAIGIRVIRSHE 334 + +G+ +E Sbjct: 351 TLSTLGVGDSNYNE 364 >gi|281209350|gb|EFA83518.1| hypothetical protein PPL_02583 [Polysphondylium pallidum PN500] Length = 461 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 44/206 (21%), Positives = 71/206 (34%), Gaps = 18/206 (8%) Query: 167 QTDARLDMMIVLDVSRSMES-FFDSSIT----KIDMAIKSINAMLEEVKLIPDVNNVVQS 221 Q ++DM IVLD+S SM S F S+T ++ + + + Sbjct: 145 QEIKQIDM-IVLDLSGSMRSAAFKGSLTPGELEMKRIEIAQALFQTMIDKYVQLEIAAIV 203 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 GLV F +IE F ++ + T +K A I + Sbjct: 204 GLVCFGERIEVTFPPTRNFDSFSTELGEVVANQSKTRLYEAIKLAGETIVKYRENPTSLA 263 Query: 282 TE--DANYKKII--VF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 A K+I VF +TDG++ S + Y K I+ AI I L Sbjct: 264 DGFVLAPSDKLICRVFALTDGQDNSNACPYEVYKY---LKSANIILDAIPIG--EGGNTL 318 Query: 337 RAC--ASPNSFYLVENPHSMYDAFSH 360 + A+ S + + + + F Sbjct: 319 GSFTKATGGSCFTFNSSKAGVELFER 344 >gi|226510867|gb|ACO59960.1| matrilin-1 [Salmo salar] Length = 108 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 44/109 (40%), Gaps = 13/109 (11%) Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV +S+ +++ F L L+ +K ++ T + L+Y + D Sbjct: 11 VGLVQYSSSVKQEFPLGRYNNKKDLKDAVKKMAYMERGTMTGQALRY----LTDSSFAPA 66 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 K+ + TDG + D +AK++G ++A+G+ Sbjct: 67 GGARPGVA--KVGIVFTDGRSQDYIGDA-----AKKAKEQGFKMFAVGV 108 >gi|149046634|gb|EDL99459.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Rattus norvegicus] Length = 927 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 66/179 (36%), Gaps = 26/179 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S ++ ++ +I LD+S SM + D + ++++ + + ++ + Sbjct: 76 PQVGSRIATNKEREGIETIIALDISNSMLAE-DVAPSRLEKSKLLVENLMNKFSED---- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GL+ F+ + + + ++ T+ L+ + N Sbjct: 131 ---KIGLIVFAGDAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFT-- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ K I+ +TDGE+ + + +A+ +G V+ +G+ Sbjct: 186 ---------PNSKVGKAIILITDGEDN----EGGAEEMAKQAQSKGIRVFILGVGSTEG 231 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 81/229 (35%), Gaps = 39/229 (17%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + + + D + VLD S SM D+ I + S+ L + + Sbjct: 35 AASLGTASIEGYDAVFVLDTSYSMR---DTDPEGIAAEVISMFMDLSDAD-------RTR 84 Query: 221 SGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 G V +++ + L S +Q++I+ L++ G T+ GL+ +G Sbjct: 85 VGFVAYNHHVVASKPLTSIGVAAQKSQIQQEIRMLNRSGY-TDLGLGLR---------KG 134 Query: 276 MRQHCNTEDANYKKIIVFMTDGEN---------LSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + ++ ++DGE + A+ +G VY IG Sbjct: 135 SELLAAGASQGRQPFMILLSDGETDFGVSSGSRSKGDSNNDVSSVIKSAQTKGYPVYTIG 194 Query: 327 I---RVIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHI-GKDIVTKRI 370 + + E R + + + ++ + + + + I I +K + Sbjct: 195 LNHDGTVNRQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLV 243 >gi|297288988|ref|XP_002803438.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Macaca mulatta] Length = 997 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 45/337 (13%), Positives = 104/337 (30%), Gaps = 70/337 (20%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 NE + N+ K +I + + F + + + + G Sbjct: 159 NEISKNDDKTASLAEIKQNNSTSATVKFNPNIE-----------RQKQLATGLGTKEENG 207 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + + +Y + + + S V+ + ++ VLD S SM+ Sbjct: 208 LAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ-----VIFVLDTSGSMDG 262 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--------------------F 226 +I +++N++L E+K N V S +V + Sbjct: 263 N------RIKQLKEAMNSILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLY 316 Query: 227 SN-------KIEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + K + + ++ ++ L+ +G T+ L+ + + Sbjct: 317 DSPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYG-GTDIKSALEVGLKLVKKNK 375 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIR 331 ++ + IIVF+TDGE + + + + G A ++++ Sbjct: 376 ENKEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGA 429 Query: 332 SHEF-----LRACASPNSFYLVENPH-SMYDAFSHIG 362 EF L+ Y + + + + I Sbjct: 430 DREFLQKISLKNLGFARHIYEAADASLQLQEFYKQIS 466 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 14/152 (9%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQI-------FDMQGM 276 F + ++ +S + L +TN+ G+ + + + + Sbjct: 304 FDDPDCGVSPIQAHLSTRAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQWQGRWQQGAA 363 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 A+ +KI+V +DGE++ + D++ L C E K++G VY + Sbjct: 364 ELPRPYGQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAF--EGDA 421 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 F+ CAS S ++ + + I Sbjct: 422 RFVAQCASERSLAYKATSGNIRTVLTRLASAI 453 >gi|147905660|ref|NP_001090738.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Xenopus (Silurana) tropicalis] gi|120537294|gb|AAI29013.1| LOC100036724 protein [Xenopus (Silurana) tropicalis] Length = 1076 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETL----SDDDFVNVAAFNN 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFNHLVQANVRNKKKLKEAVNNITAKG-TTDYKTGFKFAFDQLLNHNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG KE ++ K + V+ + + A Sbjct: 357 -------KIIMLFTDGGEDKAKETFEAYN-----KDKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CQNKGFYYEIPSIGAIR 421 >gi|162449101|ref|YP_001611468.1| hypothetical protein sce0831 [Sorangium cellulosum 'So ce 56'] gi|161159683|emb|CAN90988.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 377 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 61/197 (30%), Gaps = 16/197 (8%) Query: 171 RLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLVTF 226 LDM+I+LD S SM S + + + + V L N ++ Sbjct: 91 PLDMLILLDRSGSMLGSKWVGVTNALSAFVTDAASAGMNVGLTYFPRSSNGQSDCNHTSY 150 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L L I+ S G T P L+ + + + Sbjct: 151 DELAVAIGELPVNTPELTASIQSTSP-GGGTPMRPALQGVLTNATAYKDA-------NPS 202 Query: 287 YKKIIVFMTDGENLSTKEDQ--QSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPN 343 +K I+V TDG+ + + A + G Y + ++ + A+ Sbjct: 203 HKVIVVLATDGDPSGCSGNTVASTAEMAQRALRYNGVQTYVVAVQGSTLTNLDQIAAAGG 262 Query: 344 SFYLVENPHSMYDAFSH 360 + + + AFS Sbjct: 263 TT-RAFDVTADITAFSA 278 >gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7] gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-7] Length = 755 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 86/228 (37%), Gaps = 40/228 (17%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ +VM + V+ + Q + ++++V+D S SM I A ++ L Sbjct: 350 TDNYSLVMVLPPKVEASEQLNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRG 403 Query: 209 VKLIPDVNNVVQSGLVTFSNKIE--EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ N ++ F++ + L ++ ++ + L G + + Sbjct: 404 LRPQDSFN------IIEFNSDVSLLSPTPLPATATNLAMARQFVNRLQADGGTE-----M 452 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A N Q + + + ++FMTDG + + + N+ ++ Sbjct: 453 AQALNAALPRQAFN--TASGEDKSLRQVIFMTDGSVGNESALFELIR--NQIGDN--RLF 506 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK-DIVTKRI 370 +GI ++PNS ++ F++IG D V ++I Sbjct: 507 TVGIG-----------SAPNSHFMQRAAELGRGTFTYIGDVDEVEQKI 543 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/237 (14%), Positives = 77/237 (32%), Gaps = 52/237 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +DVS SM + D ++ A ++ + ++ D+ V+ G+V F+ + Sbjct: 88 IILAMDVSGSMRAA-DVLPNRLVAAQEAAKSFIK------DLPRHVKVGIVAFAGSAQVA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG---------MRQHCNTED 284 L L I + +T + + + +F G RQ D Sbjct: 141 QLPTTNHDDLITAIDSF-QLQRATATGNAIVVSLATLFPDAGIDVSQFSAPSRQRGTPID 199 Query: 285 ANYKKI---------------IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 K+ ++ +TDG+ L A RG +Y +G+ Sbjct: 200 QAEKQAKEFTPVAPGSYTSAAVIMLTDGQRT---TGVDPLDAAKAAADRGVRIYTVGVGT 256 Query: 330 IRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + E L+A A + ++ + + + + ++ Sbjct: 257 VDGETIGFEGWSMRVRLDEETLKAVANKTQAEYFYAGTAADLKKVYETLSSRLTVEK 313 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 53/164 (32%), Gaps = 1/164 (0%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 99 VRRGLHRQRGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTS 158 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + G + L D+ S V P N Y Sbjct: 159 -AISLSVAEANGIAAGHMNFAFFQQKPVQMLTDSNVTFSDSLTNPFLSKTAVATPANVKY 217 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 S I F + ++ S+ + + + Sbjct: 218 VKCVTSLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQ 261 >gi|332809815|ref|XP_001161858.2| PREDICTED: integrin alpha-10 isoform 1 [Pan troglodytes] Length = 1036 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 83/236 (35%), Gaps = 38/236 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + S + +D +D++IVLD S S+ + +++ ++ + L Sbjct: 10 GHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL 64 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGL 263 I +Q GLV + + L + ++ + G T + + Sbjct: 65 ----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAI 118 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + F + +++V +TDGE+ +E +L C + Y Sbjct: 119 MVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRY 169 Query: 324 AIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 I + R FLR + P+ F+ V + ++ D +G I Sbjct: 170 GIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFG 225 >gi|254416823|ref|ZP_05030572.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176369|gb|EDX71384.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 538 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 77/218 (35%), Gaps = 26/218 (11%) Query: 148 YTNSRHIVMPIT---SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + IT S + + L++ +VLD S SM + AI++ Sbjct: 14 ADSPSTVDLLITFQGSESSQQTSSRRPLNLSLVLDRSGSMAG------APLRYAIQAAQN 67 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +++ + ++ V + + ++ L + L+ KI + G + S Sbjct: 68 LIDYL----TADDFVSVVIYDDTAEVIIPPQLVGDQAALKAKIGKIRARGCTNLSGG--- 120 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVY 323 + + + N + ++ +TDG N K+ Q E + + Sbjct: 121 ------WLLGCSQVQANQSPERINR-VLLLTDGLANYGIKDPQVLTKTALEKAEADIVTT 173 Query: 324 AIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFS 359 +G + + L A+ +FY +++P F Sbjct: 174 TLGFGNYFNEDLLINMANAARGNFYFIQSPDDASQVFE 211 >gi|194385334|dbj|BAG65044.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 83/236 (35%), Gaps = 38/236 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + S + +D +D++IVLD S S+ + +++ ++ + L Sbjct: 10 GHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL 64 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGL 263 I +Q GLV + + L + ++ + G T + + Sbjct: 65 ----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAI 118 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + F + +++V +TDGE+ +E +L C + Y Sbjct: 119 MVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEELPAALKACEAGR---VTRY 169 Query: 324 AIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 I + R FLR + P+ F+ V + ++ D +G I Sbjct: 170 GIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFG 225 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 73/198 (36%), Gaps = 23/198 (11%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 L F + + P + + +Q +D+M+V D+S SM + Sbjct: 58 LAFLRIMTVALLVVALAGPRFAHDRTETQASG-VDIMLVFDLSWSMMVLDMGGHDETGTR 116 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQR-KIKYLSKF 253 +A+LE+ + + GL+ FS L W V +L R I + + Sbjct: 117 FGIASAVLEDFVNKRPND---RIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGIIREL 173 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G T A ++ + + +II+ +TDG+N + + + + Sbjct: 174 G--TAIGDATAAATKRLQMSKDSKS----------RIIILLTDGDNN--QGEIEPVPAAQ 219 Query: 314 EAKKRGAIVYAIGIRVIR 331 A GA +Y IG+ + Sbjct: 220 LAAAIGAKIYTIGLGIEE 237 >gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] Length = 540 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 73/204 (35%), Gaps = 28/204 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + I +V++ A + V+D S SM+ +++ +A ++ Sbjct: 44 KQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQG------SRMQIAKDALKYC 97 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG--VSTNSTP 261 + + N +V FS +E F ++Q+ + ++ + T Sbjct: 98 VTRLNPQDTFN------VVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEAIGGTAIDE 151 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGA 320 L N ++ +++F+TDG+ + D+ ++ + + +K Sbjct: 152 ALVRGLQD-----------NDGKSSAPHLLMFITDGQPTIGETDEGAIAQHAKDGRKAKT 200 Query: 321 IVYAIGIRVIRSHEFLRACASPNS 344 ++ G+ + L +S + Sbjct: 201 RLFTFGVGEDLNARLLDRLSSDGA 224 >gi|7022738|dbj|BAA91707.1| unnamed protein product [Homo sapiens] Length = 218 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/154 (20%), Positives = 56/154 (36%), Gaps = 13/154 (8%) Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 + FS + L +++ ++ L K G T G + A QI+ Sbjct: 1 MSFIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGY 60 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + +I+ +TDGE L S N ++ GAIVY +G++ + R Sbjct: 61 RTAS-------VIIALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARI 112 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 S + + V + + A I I+ K Sbjct: 113 ADSKDHVFPVNDG---FQALQGIIHSILKKSCIE 143 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 45/337 (13%), Positives = 104/337 (30%), Gaps = 70/337 (20%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 NE + N+ K +I + + F + + + + G Sbjct: 218 NEISKNDDKTASLAEIKQNNSTSATVKFNPNIE-----------RQKQLATGLGTKEENG 266 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + + +Y + + + S V+ + ++ VLD S SM+ Sbjct: 267 LAGQFVVQYDVERDPKGGEVLLKDGYFVHFFAPSEVEALPKQ-----VIFVLDTSGSMDG 321 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--------------------F 226 +I +++N++L E+K N V S +V + Sbjct: 322 N------RIKQLKEAMNSILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPWPLY 375 Query: 227 SN-------KIEEFFLLEW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + K + + ++ ++ L+ +G T+ L+ + + Sbjct: 376 DSPEAPQKNKTNQVLPPAYKATDENKEKAKKVVEKLNAYG-GTDIKSALEVGLKLVKKNK 434 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIR 331 ++ + IIVF+TDGE + + + + G A ++++ Sbjct: 435 ENKEDAHQP------IIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFGDGA 488 Query: 332 SHEF-----LRACASPNSFYLVENPH-SMYDAFSHIG 362 EF L+ Y + + + + I Sbjct: 489 DREFLQKISLKNLGFARHIYEAADASLQLQEFYKQIS 525 >gi|114614244|ref|XP_001160279.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Pan troglodytes] gi|114614246|ref|XP_519175.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 3 [Pan troglodytes] Length = 1091 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVGQHNYDRGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 83/223 (37%), Gaps = 45/223 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIEE 232 ++ +LDVS SM + KI +++ +L E+ ++ S L ++ + Sbjct: 281 VIFILDVSGSM------NGQKITQVKGAMSQILSEIDSEDFFTLILFSSLAQIWTINATQ 334 Query: 233 FFLLEW-------------------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 W + + ++ I+ L +ST ++ A Sbjct: 335 NTSNYWDDRGRNLNNFETMGENHFIFSANEQNIQYAKKFIQALEP-----DSTTNMEDAL 389 Query: 268 NQIFDMQGMRQHCNTEDANYKK-IIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAI 325 N+ + + + + A K IIVF+TDGE + Q+L Y ++ +Y++ Sbjct: 390 NKALSIAKLGKMRFKDSAKTPKPIIVFLTDGEMNEGITNPQALMKYVSDINVDNYPIYSL 449 Query: 326 GIRVIRSHEFLRACASPNS-----FYLVENPH-SMYDAFSHIG 362 G EFL+ + N+ Y + +++ + I Sbjct: 450 GFGKGADIEFLKKLSLNNTGFARVIYEASDASLQLHNFYKEIS 492 >gi|237748397|ref|ZP_04578877.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379759|gb|EEO29850.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 736 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/240 (14%), Positives = 77/240 (32%), Gaps = 24/240 (10%) Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW 147 N E ++ + + V + + L+ Sbjct: 63 NEKTGEVSYEQNSEVVNTAFGTQGQAHDITVSVTDSDGDVVSQTV-----HLETVGMQVG 117 Query: 148 YTNSRHIVMPITSSVKVNSQTDARLD---------MMIVLDVSRSMESFFDSSITKIDMA 198 +S I S + +D + +VLDVS SM + D+ T++ + Sbjct: 118 GVHSDTITGG--SGADIIIGDKGHIDGVQNTVTYNLSLVLDVSNSMYAGLDNGNTRLASS 175 Query: 199 IKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLE---WGVSHLQRKIKYL-SKF 253 ++++N +++ + + V LV F + E L V+ L+ + L + Sbjct: 176 VEALNNLIDSIAAQTAQGGITVNIQLVGFHSSAFENSWLSVTQDNVNELKSYVNELANHI 235 Query: 254 GVSTNSTPGLKYAYNQI--FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 S + A+ ++ + Q K + F++DG + + + Y Sbjct: 236 RDGGASNTNYQAAFEEVCQWFAQQSAVPPVQGTDVVNK-VFFISDGVPNAITVNGAVINY 294 >gi|88801114|ref|ZP_01116660.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] gi|88776143|gb|EAR07372.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] Length = 553 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 64/206 (31%), Gaps = 24/206 (11%) Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 +R + + + + L + Q + +Y + + +++ Sbjct: 304 GVRHDSPLYATGEADENERLVLEAFAQFIADNRQVARQYGFNQNPSYQPAYELDDGSVIL 363 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 K + M V DVS SM +I +++ V Sbjct: 364 SAQRIWKDKKSGGRPIAAMFVADVSGSM------DGDRIRALKIALDESANFVSSRNS-- 415 Query: 217 NVVQSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 GLVTF++++ + S ++ +S G T + + A +++ + Sbjct: 416 ----IGLVTFNDRVNVDLPIREFDLQQKSQFLGAVERMSA-GGGTATNDAILVAAHELLN 470 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE 298 +K I ++DGE Sbjct: 471 F-------AKTHPEHKLTIFVLSDGE 489 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 61/171 (35%), Gaps = 29/171 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ L++ ++D SRSM + D +++ A +++ +L+ +K + G + Sbjct: 73 DTVEVKGLNIYALIDTSRSMMTE-DVYPNRLEAAKRTLENLLQGLKGD-------RIGFI 124 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ L S + I L G T L+ A ++ Sbjct: 125 PFSDSAYIQMPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN---- 180 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ ++DG + K + K V++IGI Sbjct: 181 -------KTIIVLSDGGDFDDKSLKFV-------KDNKMNVFSIGIGTDEG 217 >gi|149916952|ref|ZP_01905453.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] gi|149822230|gb|EDM81621.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] Length = 416 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/199 (17%), Positives = 69/199 (34%), Gaps = 27/199 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M++V+D S SM I+ A + +++ + LV F ++ E Sbjct: 1 MVLVVDTSASM------KGDAIEGAKAAAMELVDGLAEGDSFA------LVVFHSRAEVL 48 Query: 234 FL----LEWGVSHLQRKIKYLSKFGVSTNSTP------GLKYAYNQIFDMQGMRQHCNTE 283 E + + KI+ + +G + + L+ A N + Sbjct: 49 MPSTVINEDSRAAARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAP 108 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--S 341 D + +V + DG ++ + RG + A+G + L + A + Sbjct: 109 DPTVLERVVLLGDGVPNDASTIPSTVG---QLAARGTQITALGYGIEYDETLLASLAEQT 165 Query: 342 PNSFYLVENPHSMYDAFSH 360 SF V++P ++ F Sbjct: 166 HGSFRFVDDPEAVASLFRD 184 >gi|170726477|ref|YP_001760503.1| cell wall anchor domain-containing protein [Shewanella woodyi ATCC 51908] gi|169811824|gb|ACA86408.1| LPXTG-motif cell wall anchor domain protein [Shewanella woodyi ATCC 51908] Length = 739 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 83/257 (32%), Gaps = 43/257 (16%) Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 W + F S+ + N + + +P Sbjct: 294 NWRPQLDTKPVAAVFSQQGKTHSLSSKAQVEPTDSNASTKADSNKAVEDDYALLMLLPPS 353 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +++V+D S SM S I A +++N L Sbjct: 354 DQKQDVSISRE--------------LILVIDTSGSM------SGASIAQAKRALNYALAG 393 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGL 263 +K N ++ F++ + + + ++ L G + + Sbjct: 394 LKAKDTFN------VIEFNSNVGSLSPYSLPATAKNIGLANQYVRSLKANGGTE-----M 442 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + A N D + +E + ++FMTDG D+QSL++ + K + ++ Sbjct: 443 QLALNAALDKGTETEALGSERL---RQVLFMTDGS----VGDEQSLFHLIKQKIGESRLF 495 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F+R A Sbjct: 496 TLGIGSAPNSHFMRRAA 512 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 64/175 (36%), Gaps = 28/175 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +V+D S SM K+D A ++ + V + LV + + Sbjct: 95 PVNVALVIDRSGSMRG------QKLDDAKRAAREFITRVSEED------RVALVHYGTDV 142 Query: 231 EEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + + + + G STN + GL+ A Q+ N + Sbjct: 143 -TVFPSTLATPETREQMLTFVNAIEDEG-STNISGGLEAAAQQL--------QKNADQFR 192 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + I+ ++DG+ ++Q + +G V A+G+ + ++ A Sbjct: 193 VSR-IILLSDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIA 246 >gi|39997259|ref|NP_953210.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] gi|39984149|gb|AAR35537.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] Length = 575 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 72/191 (37%), Gaps = 26/191 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++LD S SM + + + K+ +A ++ E + IP V++ + TF Sbjct: 407 VCMLLDSSGSMGNT--TILNKMGIASRACFVAAEALFSIPG----VRTAIATFKGHDNHV 460 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 F + ++ T L +A+ ++ + R KI + Sbjct: 461 FPMV-NFGEKPDHSRFNITGSGGTRLGHALWWAWGELSLRRETR-----------KICIA 508 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 +DG+ Q ++ ++ G V IGI+ ++L P++ +++N Sbjct: 509 FSDGDTGDGPVTQAAIK---RMREEGIEVIGIGIQDNSIKQYL-----PDNHRIIKNLDQ 560 Query: 354 MYDAFSHIGKD 364 A + ++ Sbjct: 561 FTPALLELLRE 571 >gi|33600172|ref|NP_887732.1| putative hemolysin [Bordetella bronchiseptica RB50] gi|33567770|emb|CAE31684.1| putative hemolysin [Bordetella bronchiseptica RB50] Length = 3346 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 74/192 (38%), Gaps = 11/192 (5%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDS---SITKIDMAIKSINAMLEEVKLIPDVN 216 +K N ++ +VLD+S SM + S T++ A ++ A+LE + D Sbjct: 2715 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHDGE 2774 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + L+TF+ G++ ++ + L G+ +ST A+++ Sbjct: 2775 --INVSLITFNGSSSALKKSITGLTPENVDEMVDIL--MGLKASSTTPYGAAFDRTTQWF 2830 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + ++E YK + F+TDGE + + + A IGI S Sbjct: 2831 EGQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISDVH--GIGIGSGVSTS 2888 Query: 335 FLRACASPNSFY 346 L + +Y Sbjct: 2889 TLNKYDNTGGYY 2900 >gi|300812790|ref|ZP_07093186.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] gi|300496224|gb|EFK31350.1| von Willebrand factor type A domain protein [Lactobacillus delbrueckii subsp. bulgaricus PB2003/044-T3-4] Length = 893 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 51/296 (17%), Positives = 104/296 (35%), Gaps = 34/296 (11%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 MT +TA + L ++I + M T L + + AA N Sbjct: 1 MTKMTAKVVRTGHLFAFLLI----LMSMLTGLVTSGSSVVTAAANIRPTYQTDANGTYPT 56 Query: 79 GGDILCRIKNTWNMSFRNELR--DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + +N N +++ DN + + D + S P N Y I ++ Sbjct: 57 NSWQVTGQQNVINQRGGDQVLGWDNNTIWNGDATDTTNSYLKFGDPNNPDYQIRKYAK-- 114 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + + + Q +D+++V+D+S SMES + D Sbjct: 115 --------ETNTPGLYDVYLNVKGNTQ---QNVKPVDIVLVVDMSGSMESNSRG-TNRAD 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-EWGVSHLQRKIKYLSKFGV 255 + L ++ + + V GL+ FS+ G +++ + + Sbjct: 163 AVRTGVKNFLTSIQNA-GLGDYVNVGLIGFSSPGYIGGGNKTTGPGYIRVGLGKV----G 217 Query: 256 STNSTPGLKYAYNQIFDMQ-----GMRQHCNTEDANY---KKIIVFMTDGENLSTK 303 +T + A + +F G+RQ +A+ KK+++ +TDG + Sbjct: 218 NTKQQQAINDALSPMFQGGTYTQIGLRQGSAMLNADTSGNKKMMILLTDGVPTFSN 273 >gi|48425688|pdb|1SHT|X Chain X, Crystal Structure Of The Von Willebrand Factor A Domain Of Human Capillary Morphogenesis Protein 2: An Anthrax Toxin Receptor Length = 181 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 6 AFDLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQA 53 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ L + T GLK A QI G++ Sbjct: 54 TIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSS-------- 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 II+ +TDG+ ++ GA VY +G+ + R S + V Sbjct: 106 -IIIALTDGKLDGLVPSYAEKE-AKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPV 163 Query: 349 ENPHSMYDAFSHIGKDIVTKR 369 + + A I I+ + Sbjct: 164 KGG---FQALKGIINSILAQS 181 >gi|46127789|ref|XP_388448.1| hypothetical protein FG08272.1 [Gibberella zeae PH-1] Length = 774 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 72/201 (35%), Gaps = 24/201 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQSGLVTFSN 228 D+++V+DVS SM + A S+ + + +I +N + +VTF++ Sbjct: 86 PCDIVLVIDVSGSMGQPAPVPGEDQESAGLSVLDLTKHAARTIIESMNENDRLSIVTFAS 145 Query: 229 KIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 K + L + + + +K + TN G+ A Q N Sbjct: 146 KAKVLQPLLPMNQDNKTRAIKNVKSMEPRDA-TNLWQGMLEAIKQF---NTDESSPNVPA 201 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEFLRACAS 341 I+ +TDG ++ + + + G A ++ G + L++ A Sbjct: 202 ------IMILTDGMPNHMNP---AVGFVPKIRNMGPLPASIHTFGFGYSLKSDLLKSIAE 252 Query: 342 --PNSFYLVENPHSMYDAFSH 360 ++ + + + F H Sbjct: 253 IGNGNYAFIPDAGMIGTVFVH 273 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 78/219 (35%), Gaps = 47/219 (21%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ VLDVS SM + S ++++ A SI +M E N+ + GLV + Sbjct: 92 SIVFVLDVSPSMAAKDIGSQSRLEAAKNSIVSMAEM-------NSGSELGLVVMAENAAV 144 Query: 233 FFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++K +S + G T GL A H + KK Sbjct: 145 LVSPTMDRKFFLNRLKTVSVGELGDGTAIGTGLSSAI----------YHLEK-SKSPKKS 193 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----------IRSHEFLR--- 337 IV +TDGEN S + A + +Y +G+ +S++ Sbjct: 194 IVLITDGENNSGAVHPHTA--ARLAVNKDISLYILGVGTRGVVPIDYVDPKSNKIYSGYL 251 Query: 338 ----------ACASPNS--FYLVENPHSMYDAFSHIGKD 364 AS + F+ +E+ ++ A S I K Sbjct: 252 ESKFDTSSIARIASEGNGKFFEIESISALSQAISSISKS 290 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 64/175 (36%), Gaps = 28/175 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +V+D S SM K+D A ++ + V + LV + + Sbjct: 45 PVNVALVIDRSGSMRG------QKLDDAKRAAREFITRVSEED------RVALVHYGTDV 92 Query: 231 EEFFLLEWGVSHLQRK----IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + + + + G STN + GL+ A Q+ N + Sbjct: 93 -TVFPSTLATPETREQMLTFVNAIEDEG-STNISGGLEAAAQQL--------QKNADQFR 142 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + I+ ++DG+ ++Q + +G V A+G+ + ++ A Sbjct: 143 VSR-IILLSDGQPTAGLTREEQLTALARNLRSQGMAVSALGVGEDFNENLMQGIA 196 >gi|301604824|ref|XP_002932067.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2881 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 65/191 (34%), Gaps = 19/191 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S ++ D + + + V + D + +Q L +++ + + Sbjct: 1415 DIVFLVDGSINLGR---------DNFKEVLQFVSGIVDAVFDEEDAIQIALAQYNSDVTD 1465 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L+ + + G S A + D +++ + + +I Sbjct: 1466 EFFLKDFTDRDQIMDAVTKAEYKGGRVASLGA---AIRHLQDKHFVKEAGSRVSSGVPQI 1522 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +T G K +G V+AIG+ I E + + S + V N Sbjct: 1523 AFVITGG-----KSVDDGQSAAMALSNKGVKVFAIGVGAIDGDEVAKIASDAPSAFRVPN 1577 Query: 351 PHSMYDAFSHI 361 + + I Sbjct: 1578 VQELSELNEQI 1588 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 62/198 (31%), Gaps = 22/198 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D D++ ++D S ++ + + ++ + + + +Q G+V +S Sbjct: 26 QDDSADLVFLIDGSNNVG------RSDFSYIRDFLVNYVQ--RRLEVGRSRMQVGVVQYS 77 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ F L + + ++ L G N L Y +F + Sbjct: 78 DDVKTEFSLAQNPTK-GQLVEALKNLRFIGGEEANLGAALDYVVENVFTTAAGSRR---- 132 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + + ++ +T G N K I + IG+ E + + Sbjct: 133 EEGVPQSLMIITAG-----PSSDDFREATNALKLNSIITFGIGVERADIAELQQIATDES 187 Query: 344 SFYLVENPHSMYDAFSHI 361 + + +I Sbjct: 188 FVFSTPQMQDLSTLERNI 205 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/236 (14%), Positives = 78/236 (33%), Gaps = 19/236 (8%) Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 R I + + +VV +N +++ + Sbjct: 1137 GEGARQVKESAVVPFLIGKNIDEEEGNKIVVARNFFLPVTSFRELPALEQTLVNSFTTLT 1196 Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + I + V+ D++ ++D S + + I +++ + Sbjct: 1197 TEQITRLFETHPDVDPDKK---DVVFLIDGSS------KTGQDGMAHIRDFILEVVQNFQ 1247 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYN 268 + P+ V+ GLV FSN+ F L+ + L I+ L G + + A Sbjct: 1248 IGPNK---VRVGLVQFSNEPVSEFFLKTNLQKQALLNNIRRLRLKGG---APLNIGKAVE 1301 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + Q ++ + + + +V +T G++ +S + AK + V + Sbjct: 1302 YVAKNQFVKSAGSRIEEGVPQYLVLLTGGKSEDDVS--RSARLLHNAKVQSVAVAS 1355 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 40/281 (14%), Positives = 94/281 (33%), Gaps = 25/281 (8%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + KG + A+ L +F + +E + + L + + Sbjct: 15 RSQKGIAAVWFALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALA------------ 62 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE-GYSISA 131 D + + N+ + DI + + YS+ A Sbjct: 63 -----ITIDDKSNQADALATMYINDYVRDISHVDIQTVRTYEEPTEDNDNTEKIQYSVQA 117 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 ++ + IP + + ++ D +D+++V D S SM + +D Sbjct: 118 VTTHNSWF-ASNSIPSFETQEKLAGQAVAAKYPFYLGDKIIDLVLVTDFSGSMNNSWDGE 176 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEEFFLLE-WGVSHLQRKIK 248 I KID+ ++ + + + + + V + ++ F+ +++E + S L+ K Sbjct: 177 I-KIDLLKDAVKQISNRILVPREGESEVLNRIAIIPFNLRVQEKINDNLYSTSQLRYKGN 235 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 Y VS+ + + Y + + + + N K Sbjct: 236 YRKS--VSSVKYEQVNWDYWSPYSEEAVEECANKRTDCPNK 274 >gi|332809817|ref|XP_003308326.1| PREDICTED: integrin alpha-10 [Pan troglodytes] Length = 1024 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 23 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL----FIDPEQ--IQVGLVQYGESPV 71 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F + Sbjct: 72 HEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQS------HGGRPEAA 125 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 126 RLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 182 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 183 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 213 >gi|187954851|gb|AAI40832.1| ITGA10 protein [Homo sapiens] gi|219519694|gb|AAI44638.1| ITGA10 protein [Homo sapiens] Length = 1024 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 23 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL----FIDPEQ--IQVGLVQYGESPV 71 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F + Sbjct: 72 HEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQS------HGGRPEAA 125 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 126 RLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 182 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 183 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 213 >gi|7385003|gb|AAF61638.1|AF172723_1 integrin alpha 10 subunit [Homo sapiens] Length = 517 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 76/211 (36%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 5 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL----FIDPEQ--IQVGLVQYGESPV 53 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F + Sbjct: 54 HEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQS------HGGRPEAA 107 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ +E +L C + Y I + R FLR Sbjct: 108 RLLVVVTDGESHDGEELPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 164 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 165 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 195 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 61/171 (35%), Gaps = 29/171 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ L++ ++D SRSM + D +++ A +++ +L+ +K + G + Sbjct: 76 DTVEVKGLNIYALIDTSRSMMTE-DVYPNRLEAAKRTLENLLQGLKGD-------RIGFI 127 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS+ L S + I L G T L+ A ++ Sbjct: 128 PFSDSAYIQMPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEINSDN---- 183 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ ++DG + K + K V++IGI Sbjct: 184 -------KTIIVLSDGGDFDDKSLKFV-------KDNKMNVFSIGIGTDEG 220 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 87/256 (33%), Gaps = 42/256 (16%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 I +Y + + + S +K + ++ VLD S SM S Sbjct: 267 IVQYDVERDPQGGEVLVRDGYFVHFFSPSELKPLPKH-----IVFVLDHSGSM------S 315 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSG--LVTFSNKIEEFFLLEWGVSH--LQRKI 247 KID I+++ +L ++K + N+V+ G + + +F L + L+ + Sbjct: 316 GRKIDQLIEAMQNILTDLKETD-LFNIVRFGDLAMVWDVSQNQFTQLPNFNEYGNLEPHL 374 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN-------------TEDANYKKI---I 291 + ++ + ++ A I D + T++ K I Sbjct: 375 REINLPRAVNGTEENIEAAKKIIEDKSRLGMTNMMYGLEVGLFLIKRTQEETPDKYQPMI 434 Query: 292 VFMTDGENLSTKED----QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-----P 342 VF+TDG + ++ N KK+ A ++++ FLR +S Sbjct: 435 VFLTDGHPNAGMSGRDEITNTVTSLNSGKKK-ASIFSLSFGDFADKRFLRKISSKNSGFS 493 Query: 343 NSFYLVENPHSMYDAF 358 Y + F Sbjct: 494 RHIYESSDASLQLQDF 509 >gi|2497428|sp|Q60677|ITAE_MOUSE RecName: Full=Integrin alpha-E; AltName: Full=Integrin alpha M290; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor gi|535477|gb|AAC52142.1| alpha M290 integrin [Mus musculus] Length = 1167 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 84/236 (35%), Gaps = 26/236 (11%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 +C T +V+ + + ++ IVLD S S+ + A Sbjct: 163 YCRSKGGSTGEEKKSARRRRTVEEEDEEEDGTEIAIVLDGSGSIGP------SDFQKAKN 216 Query: 201 SINAMLEEV--KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 I+ M+ K +VQ G V + E ++ K++ + + T Sbjct: 217 FISTMMRNFYEKCFECNFALVQYGAV---IQTEFDLQESRDINASLAKVQSIVQVKEVTK 273 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +++ + IF K++V +TDG+ + ++ N K + Sbjct: 274 TASAMQHVLDNIFIPS------RGSRKKALKVMVVLTDGDIFGDPLNLTTVI--NSPKMQ 325 Query: 319 GAIVYAIGIR----VIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 G + +AIG+ ++ L+ AS + V N ++ S + + IV Sbjct: 326 GVVRFAIGVGDRFKNNNTYRELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIVH 381 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 92/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 107 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 159 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + + ++ +V + N I + + + +++ G++ Sbjct: 160 AVNMLNITDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 213 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 214 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 263 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 264 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 323 Query: 371 WYDK 374 DK Sbjct: 324 TGDK 327 >gi|320007113|gb|ADW01963.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 661 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 64/180 (35%), Gaps = 37/180 (20%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSM-------------ESF----FDSSITKIDM 197 V P T+ + +D++++ D S SM F + I++++ Sbjct: 165 VGPETALTLFAPGERSGVDIVVLADCSGSMSIEDIPAAPSDGGRWFNRGRTSAGISRMNA 224 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL---------QRKIK 248 ++ AM++ V + LV F + E F WG+ + + + Sbjct: 225 LKDALGAMIDARMRYDGVG--TRFALVRFDHDHEPMFPSRWGMEEVSDAHSVQRLREAVS 282 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 L+ T+ L A + ++++V ++DG + + D++S Sbjct: 283 LLNYRQSGTDIGKALHKAGELLHRYGVPGN---------ERLVVLVSDGAHYAPIPDERS 333 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 87/238 (36%), Gaps = 36/238 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM---- 184 + + P +++ + + + + +D++ VLDVS SM Sbjct: 6 VKVSTTTMFPTIPRD---HTNKDFRVLLRVEAPPMADLKGHVPIDVVEVLDVSGSMGDPA 62 Query: 185 ----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---- 236 + + +++D+ ++ ++ +++ + +V F+++ + + Sbjct: 63 MASSDFKKNKPPSRLDVLKDAMKFIIRKLEDGD------RLSIVAFNDRPVKEYSTGLLD 116 Query: 237 --EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G ++K+ +L G T P L+ A + G ++ I+ + Sbjct: 117 ISGNGRRIAEKKVDWLEGRG-GTALMPALEEAIRVLDCRPGDSRNRVG-------FILLL 168 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVEN 350 TDG+ T + S N A V+ G+ S E L A S ++ V++ Sbjct: 169 TDGD--DTSGFRWSRDVINGA-VGKYPVHTFGLGAAHSSEALLYIAQESRGTYSFVDD 223 >gi|221506674|gb|EEE32291.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii VEG] Length = 931 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 47/147 (31%), Gaps = 15/147 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+P T + + R+D + VLD S S+ I + Sbjct: 237 VVPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNI--------- 287 Query: 215 VNNVVQSGLVTFSNKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + W L +I L + TN+ L+ AY Sbjct: 288 AEDQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIF 347 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG 297 R + + ++++ TDG Sbjct: 348 VTSMNNRDEHD--SEHVHRVLLLATDG 372 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 53/137 (38%), Gaps = 13/137 (9%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D ++VLD S S+ + + K+ + + + GL Sbjct: 12 GQACKLKNVDAVVVLDSSMSVGAEHWQELLKLTK---------QFGDTLDSSAGHSRLGL 62 Query: 224 VTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V++S+ + L+ G + ++++ S +T + L AY +F Sbjct: 63 VSYSDSVTVLRKLQKIPSGTAQFEKELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSE 121 Query: 281 NTEDANYKKIIVFMTDG 297 ED +++++ TDG Sbjct: 122 GAEDQEKRRLLLLATDG 138 >gi|237831727|ref|XP_002365161.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] gi|211962825|gb|EEA98020.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] Length = 929 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 47/147 (31%), Gaps = 15/147 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+P T + + R+D + VLD S S+ I + Sbjct: 237 VVPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNI--------- 287 Query: 215 VNNVVQSGLVTFSNKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + W L +I L + TN+ L+ AY Sbjct: 288 AEDQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIF 347 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG 297 R + + ++++ TDG Sbjct: 348 VTSMNNRDEHD--SEHVHRVLLLATDG 372 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 53/137 (38%), Gaps = 13/137 (9%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D ++VLD S S+ + + K+ + + + GL Sbjct: 12 GQACKLKNVDAVVVLDSSMSVGAEHWQELLKLTK---------QFGDTLDSSAGHSRLGL 62 Query: 224 VTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V++S+ + L+ G + ++++ S +T + L AY +F Sbjct: 63 VSYSDSVTVLRKLQKIPSGTAQFEKELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSE 121 Query: 281 NTEDANYKKIIVFMTDG 297 ED +++++ TDG Sbjct: 122 GAEDQEKRRLLLLATDG 138 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 69/388 (17%), Positives = 115/388 (29%), Gaps = 94/388 (24%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 NE G N K + + N + + + +D N Sbjct: 1177 NEQIGGNEKYIYKVEETGSQVNGYTVETTQTVSGGDVQSDGKSTKIGEKDSATFTITNTY 1236 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 I S + K T + W + I + T+S K D+++VLD S SM Sbjct: 1237 TPIDINSVIEYN-KTATLLDWNQRTYKIDL--TASSKTTQSMKTPYDIVLVLDQSGSMSQ 1293 Query: 187 FF--------------------------------------------------DSSITKID 196 F D + T + Sbjct: 1294 KFVEYNKINGSSMFWRKTYYIKTQNGIYQQLSWSWDNTWSYTDSYSGKTVTVDPNTTDVY 1353 Query: 197 MAIKSINAMLEEVKL--------IPDVNNVVQSGLVTFSNK---------IEEFFLLEWG 239 +A KS ++ +K + + N+ + G+VTFSN + Sbjct: 1354 VAQKSNQTKIDALKSAATTFVNNVANKNSDCRVGIVTFSNDGYIKPITNNSYTLAKVGTS 1413 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN-YKKIIVFMTDG- 297 + I L G T GL A N+IF E + KK++VF+TDG Sbjct: 1414 KGDIINTIDGLKT-GGDTYPAKGLDKA-NEIFSENSSNSWETVEQTDGRKKMVVFLTDGV 1471 Query: 298 ---ENLSTKEDQQSLYYCNEAK---KRGAIVYAIG-IRVIRSHEFL-------------R 337 N + ++ + N AK +G YA+G S + Sbjct: 1472 PAPANTNNFDENLAGAGTNSAKILHDQGVATYALGIFGAANSDGTMDNASVQRIDKYMQS 1531 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDI 365 +S + ++ ++ F I +I Sbjct: 1532 IASSHEKYMTADSVDNLSSLFESITNNI 1559 >gi|2159|emb|CAA78125.1| VLA-2 [Sus scrofa] Length = 191 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 64/186 (34%), Gaps = 27/186 (14%) Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSK 252 D + ++ + + P Q GL+ ++N F L + ++ Sbjct: 2 DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPRVVFNLNTFKTKAEMVEATSHTTQY 58 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN+ ++YA + + K++V +TDGE+ + + C Sbjct: 59 GGDLTNTFKAIQYARDSAYSA------AAGGRPGATKVMVVVTDGESHDGSMLKAVIDQC 112 Query: 313 NEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMYDAFSH 360 N + + I + + + ++A AS F+ V + + + Sbjct: 113 N---NDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEADLLEKAGT 169 Query: 361 IGKDIV 366 +G+ I Sbjct: 170 LGEQIF 175 >gi|297696959|ref|XP_002825643.1| PREDICTED: integrin alpha-11-like [Pongo abelii] Length = 1145 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 73/177 (41%), Gaps = 30/177 (16%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++IVLD S S+ + + + ++ +K +Q G+V + Sbjct: 160 QTYMDIVIVLDGSNSIYPWVEVQ-----------HFLINILKKFYIGPGQIQVGVVQYGE 208 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L S ++ ++ S + G T + G+++A ++ F G + Sbjct: 209 DVVHEFHLNDYRS-VKDVVEAASHIEQRGGTETRTAFGIEFARSEAFQKGGRKG------ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 KK+++ +TDGE+ + + ++ + ++++ YA+ E C S Sbjct: 262 --AKKVMIVITDGESHDSPDLEKVIQ---QSERDNVTRYAVA---GLGMELTSLCHS 310 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 44/266 (16%), Positives = 90/266 (33%), Gaps = 50/266 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKL 211 + + +T + ++ +D++ V D S SM+ F T + A+K + + Sbjct: 55 LDIEVTPKGQATNEERKPIDVVFVHDTSGSMKDSFGGVKKATSAENALKESLRFFNQNQQ 114 Query: 212 IPDVNNVVQSGL-VTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAY 267 D V V++ N ++ G+S + + + + V S Y Sbjct: 115 SKDKYFFVPFDSDVSYKNYGDKRIQPAEGLSDILPMAEHLDFSEAYWVKKYSWYYGYYWS 174 Query: 268 NQIFDMQ----------GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ--SLYYCNEA 315 +IFD + + K+ I+F+TDGE S D + +LY A Sbjct: 175 QEIFDFSVGGTNYTQSLEYALSKFSGMRDSKRYIIFLTDGEPTSLNHDNKQYTLYTNGTA 234 Query: 316 KK---------------------------RGAIVYAIGIRVIR--SHEFLRACA--SPNS 344 + +Y+I +++ L + + Sbjct: 235 RAGNVNANYNDVQKFIHEKAVASAEKLGVNDVKMYSIAFAEPGEVNYQLLENMSNKTGGR 294 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 NP+S+ + F+ I K+ + + Sbjct: 295 AIQA-NPNSLSNVFTDISKEFNSPSV 319 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 73/200 (36%), Gaps = 34/200 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + D NV + F Sbjct: 216 GASSPKDMVILVDVSGSVSGLT------LKLIKASVMEMLDTL-SDDDYVNVAR-----F 263 Query: 227 SNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + K E + ++ + G +T+ G +A+NQ+ + Sbjct: 264 NEKAEAVVPCFKHLVQANVRNKKIFKDAVQQMQAKG-TTDYKSGFHFAFNQLLNKT---- 318 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LR 337 N AN KII+ TDG ++ + N+ V+ + L+ Sbjct: 319 --NVPRANCNKIIMLFTDGGEDRAQDVFMQYNWPNKT----VRVFTFSVGQHNYDVTPLQ 372 Query: 338 --ACASPNSFYLVENPHSMY 355 AC + ++ + + ++ Sbjct: 373 WIACTNKGYYFEIRSICAIR 392 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 59/156 (37%), Gaps = 27/156 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSIT-------KIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + V D S SM + + T ++ + M+ ++K I +V+ Sbjct: 192 EGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILKDKATIMMRDLKDIGNVS--- 248 Query: 220 QSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 LV+FS ++F L+ G + ++ I L + G TN GL+Y + + Sbjct: 249 -VNLVSFSILGSYVQKDFSELDKGTTTIEASINAL-QTGGVTNPGDGLRYGMMSLQNHSA 306 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 K +V +TDG + D + Sbjct: 307 QL-----------KYVVLLTDGIPNAYTVDTNDTSW 331 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 76/194 (39%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + I + S+ +++ ++D S SM S ++ + S Sbjct: 191 LAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSS 244 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + +++E++ ++ V +G ++I + + + I L G STN Sbjct: 245 LKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGA 299 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGA 320 GL+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 300 GLEMAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGV 350 Query: 321 IVYAIGIRVIRSHE 334 + +G+ +E Sbjct: 351 TLSTLGVGDSNYNE 364 >gi|327265755|ref|XP_003217673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Anolis carolinensis] Length = 1078 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 76/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 219 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVYDMLDTLSDDDYVN------VASF 266 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + K++ + G +T+ G ++A+ Q+ + R + Sbjct: 267 NQKAQAVSCFTHLVQANIRNKKVFKEKVEVMEARG-TTDYKAGFEFAFEQLQNSNISRAN 325 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N K +++G A Sbjct: 326 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--KTVRVFTFSVGQHNYDVTPLQWMA 376 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 377 CANKGYYFEIPSIGAIR 393 >gi|326479246|gb|EGE03256.1| von Willebrand domain-containing protein [Trichophyton equinum CBS 127.97] Length = 1002 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/295 (14%), Positives = 97/295 (32%), Gaps = 28/295 (9%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + + +I+N + G ++ R K + +G + I Sbjct: 224 QNGPSPGTLSISVEILNADAIDRLDCHTHGIVVERDKKEQVFQSWADFIADGPSASLPSI 283 Query: 111 VRSTSLDI-VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 L + ++ IS S + + + H I + Q Sbjct: 284 ANVRLLTGYTCLDRDFVLYISTTSSADLCTPRASLEVHPSIEGHSATMIDIPAEFMLQDQ 343 Query: 170 ARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVT 225 A +D ++ ++D S SM S I+ + + ++S+ + + + + Sbjct: 344 APVDDKEIIFLVDQSGSMSSKLPGLISAMQIYLRSLPFSIPFNICSFGSSFQYLWKESME 403 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 +S+ ++ Q + S T+ P ++ A Q Sbjct: 404 YSDIT---------LNEAQYYVSQFSANLGGTDLLPAMERAVMQ--------------QQ 440 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ ++ +TDGE +E + + + K+ +++GI SHE + A Sbjct: 441 HHFLDVIVLTDGEVWRLEETLRFVRLTHNVSKKAVRFFSLGIGDAVSHELIEGIA 495 >gi|326474287|gb|EGD98296.1| hypothetical protein TESG_05675 [Trichophyton tonsurans CBS 112818] Length = 1002 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 42/295 (14%), Positives = 97/295 (32%), Gaps = 28/295 (9%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + + + +I+N + G ++ R K + +G + I Sbjct: 224 QNGPSPGTLSISVEILNADAIDRLDCHTHGIVVERDKKEQVFQSWADFIADGPSASLPSI 283 Query: 111 VRSTSLDI-VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 L + ++ IS S + + + H I + Q Sbjct: 284 ANVRLLTGYTCLDRDFVLYISTTSSADLCTPRASLEVHPSIEGHSATMIDIPAEFMLQDQ 343 Query: 170 ARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVT 225 A +D ++ ++D S SM S I+ + + ++S+ + + + + Sbjct: 344 APVDDKEIIFLVDQSGSMSSKLPGLISAMQIYLRSLPFSIPFNICSFGSSFQYLWKESME 403 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 +S+ ++ Q + S T+ P ++ A Q Sbjct: 404 YSDIT---------LNEAQYYVSQFSANLGGTDLLPAMERAVMQ--------------QQ 440 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ ++ +TDGE +E + + + K+ +++GI SHE + A Sbjct: 441 HHFLDVIVLTDGEVWRLEETLRFVRLTHNVSKKAVRFFSLGIGDAVSHELIEGIA 495 >gi|291456048|ref|ZP_06595438.1| putative von Willebrand factor type A domain protein [Bifidobacterium breve DSM 20213] gi|291382457|gb|EFE89975.1| putative von Willebrand factor type A domain protein [Bifidobacterium breve DSM 20213] Length = 1238 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 90/276 (32%), Gaps = 38/276 (13%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN-------ELRDNGFVNDIDDIVR 112 A ++++N + + + + WN + D D R Sbjct: 375 SATSKVVNMLSDTDTSEQVKFTLSASENTNWNDYNVSFTCDGDCGEYDATPQIDTATGNR 434 Query: 113 STSLDIVVVPQ-NEGYSISAISRYKIPLKFCTF--------IPWYTNSRHIVMPITSSVK 163 ++ + + + ++ + P + +S + + + + Sbjct: 435 YVTVSVASEKSAHGKWVVTKKVKLDTPKIQKSISKDAVDSAATGQKDSYTLSLNVKGNTV 494 Query: 164 VNS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI--NAMLEEVKLIPDVNNV 218 + A++D+ VLD S SM + T++ +I N L V Sbjct: 495 SGTTTSGEKAKIDVAFVLDTSGSMNDKV-GNSTRLKNMQNAITDNGGLSSVLFNSPDKID 553 Query: 219 VQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 Q ++TF++ ++ +L + L + L+ G + ++ Sbjct: 554 AQVHVITFASGLGLDGTSVLST-KADLDEVVNGLTANGATH-------------WEKGLE 599 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 R + K +VF+TDG+ + + ++Y C Sbjct: 600 RVSNISTRPGATKYVVFLTDGDPGNKGWKETNVYSC 635 >gi|326911040|ref|XP_003201870.1| PREDICTED: collagen alpha-1(XIV) chain-like [Meleagris gallopavo] Length = 438 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 66/177 (37%), Gaps = 16/177 (9%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYL 250 + I++M + + ++ + + K L V+ + I+ + Sbjct: 67 ESSQLLKDFISSMARSFENVVMGKGGIRLAVALYGEKPRMCIELTDYVTIEEMLVAIQEI 126 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 S G S L +A H +T + K++V +T G++ ED+ Sbjct: 127 SIKGSSLKVGSALAFA-------AHAMSHPDTLREDAAKVVVLITSGKSSDLVEDK---- 175 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDI 365 + G V+A+GI+ HE + + P V++ H +++A + + + Sbjct: 176 -AQVLQDAGVTVFAVGIKDADKHELNKIASEPTAEHVIYVDDFHLLHNAAQKLSRRL 231 >gi|254458848|ref|ZP_05072272.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084614|gb|EDZ61902.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 629 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 71/190 (37%), Gaps = 23/190 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++I LD+S SM D ++++A K ++ + + G++ F+ Sbjct: 87 DILIALDISDSMLCE-DIYPNRLELAKKKALELINKATKD-------RVGVIAFAKNSYL 138 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + + L +++ + N + + + T + KK ++ Sbjct: 139 VSPISFDTKTVSFLLSKL-------DTSSITQKGTNILTMLGTVE---KTNTSTDKKYLL 188 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 ++DG + + + + AK++G IV+ +GI S + Sbjct: 189 ILSDGGDETDFSAEI-----DFAKEKGIIVFVLGIGTEVGASIKNKDGSLIKYNDKVVIS 243 Query: 353 SMYDAFSHIG 362 + ++ S + Sbjct: 244 KLNESISELA 253 >gi|308500716|ref|XP_003112543.1| hypothetical protein CRE_30864 [Caenorhabditis remanei] gi|308267111|gb|EFP11064.1| hypothetical protein CRE_30864 [Caenorhabditis remanei] Length = 661 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 75/192 (39%), Gaps = 21/192 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D+M ++D S S+ I D+ I +L++V + P + + +V ++ Sbjct: 453 PPIDLMFLVDTSSSIG------INNFDIQKNFICEILKDVDIAPGRS---RISMVQYAQD 503 Query: 230 IEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA----YNQIFDMQGMRQHCNTE 283 F + + ++R + LS G +T + L +A Y++ + ++H Sbjct: 504 PSVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKRHQFLP 563 Query: 284 DANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + ++ ++DG D++S+ + ++A+ R + + Sbjct: 564 TPKHDRLQVLCLVSDGY-SDDSADKESVNLHDRLH---VKIFAVVTRSFNKDKLVPITRF 619 Query: 342 PNSFYLVENPHS 353 S + V S Sbjct: 620 DGSVFTVHQRES 631 >gi|239930093|ref|ZP_04687046.1| hypothetical protein SghaA1_17826 [Streptomyces ghanaensis ATCC 14672] Length = 578 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + N T R + +VLD S SM ++ D S + ++ A L+ + Sbjct: 384 EKNGLTGTRAKVYLVLDRSASMRPYYKDGSAQALAEQTLALAAHLDPEATVK-------- 435 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHC 280 +V FS +++ L ++ + KI L S + + +Y+ Sbjct: 436 -VVFFSTELDGTGELT--LTDHENKIDTLHA------SLGRMGRTSYHAAVAEVLALHDK 486 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF--LR 337 + + ++VF TDG + Q+L AK A+ ++ + + F LR Sbjct: 487 SAAGPDTPALVVFQTDGAPDAKTPATQALT--EAAKTHPAVFFSFVAFGDPENKAFDYLR 544 Query: 338 ACASPNS-FYLV-ENPHSMYDA 357 +PN+ +L E P + D Sbjct: 545 KLKAPNTAHFLAGETPKELTDR 566 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 70/209 (33%), Gaps = 37/209 (17%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V+V T +++ +DVS S S + +A+ ++ + + Sbjct: 376 VQVGEGTPGSARVILAIDVSGSTGS---GMQIQKAVALNALGQLGDSTS----------V 422 Query: 222 GLVTFSNKIEEFF---LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+V F+ + E L +++I+ L G TN GL+ A + +G Sbjct: 423 GVVGFNRQAYEVVGLEQLTENRDTTRQRIRQLRA-GGGTNIANGLRGAEEMLDGQRGT-- 479 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 ++ ++DG ++ +RG V +G + L Sbjct: 480 ------------VILISDG----VDARSRATVVAESLGRRGVRVITVGAGQRVNEPLLEQ 523 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDI 365 A S +++ + F G+ Sbjct: 524 IADISGGTYFQANETDRLRILFGGSGRQF 552 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 78/272 (28%), Gaps = 39/272 (14%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPL------KFCTFIPWYTNSRHIVMPITSSVKV 164 + + + +S + S Sbjct: 56 PPNAGQKSTDDQPSTTFKVSVKLVNVYTTVVDQHGAPVGNLTKDDFSITEDGNPEKIALF 115 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +++ L ++I +D S S + + I + + + Sbjct: 116 QRESELPLSIVIAIDASGSTKKDLKLETDSAKRFARDILRPQD------------RLSVY 163 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 FS +EE + + R I + G +T + A + G Sbjct: 164 AFSETVEEIVPFTSDLRRIDRGISEIIA-GSATAMYDTIFLASKALMKHDG--------- 213 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIR----SHEFLR 337 +K++V +TDG + S+ +Q+ +++ IV + R H ++ Sbjct: 214 ---RKVMVLITDGGDTFSSTSYEQAARAATQSETLLYSIIVVPVANSAGRDTGGEHALIQ 270 Query: 338 ACA-SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + Y + S+ AF I ++ T+ Sbjct: 271 ISQDTGGKHYYATDMGSLDVAFKQISDELRTQ 302 >gi|11414924|dbj|BAB18554.1| voltage dependent calcium channel alpha2a/delta subunit [Rana catesbeiana] Length = 1102 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVTEMLETL----SDDDFVNVAAFNS 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFHHLVQANVRNKKVLKEAVNNITAKG-TTDYKQGFKFAFDQLRNTNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG ED+ + + K + V+ + + A Sbjct: 357 -------KIIMLFTDG-----GEDKATETFKLYNKNKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CENKGYYYEIPSIGAIR 421 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 65/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + D+MI+LDVS SM + D + +++ A ++I+ +++ + + Sbjct: 79 GTKIEEAKRSGSDLMILLDVSNSMLA-GDLAPNRLENAKRAISQLIDNLHND-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + S + + ++ T + + G Sbjct: 131 IGIIIFAGEAYVQLPITTDYSAAKLFLNNITTDIVPTQGTAIGAAIDMGMKSFNFVNGTS 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K ++ MTDGEN + ++ A + ++ IG+ Sbjct: 191 -----------KAMILMTDGENH----EDDAVSAAKRASAKDVAIHVIGVGSEEG 230 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 285 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 332 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 333 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 391 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 392 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 442 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 443 CANKGYYFEIPSIGAIR 459 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 28/219 (12%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + +++ ++ + + GLVT++ + Sbjct: 8 IVLDPSGSMNIYLVLDGSDSIGASNFTGAKKSLVNLIEKVASYGVKPRYGLVTYATYPKI 67 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L K TN+ L+ Y+ + Sbjct: 68 WVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNR 126 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRSHE 334 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 127 TRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQV 184 Query: 335 FLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + A AS + V++ ++ D F + + + Sbjct: 185 NINALASKKDNEQHVFKVKDMENLEDVFYQMIDESQSLS 223 >gi|219127465|ref|XP_002183955.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404678|gb|EEC44624.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 582 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/149 (18%), Positives = 56/149 (37%), Gaps = 25/149 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++++VLD S SM + ++ + ++ ++E + + + +V FS Sbjct: 195 SSGPKNVVLVLDTSGSM-----TDGNRLSLLKQAAKQVIETLTVGD------RVAIVEFS 243 Query: 228 NK-----IEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + FL L I + G TN A+ + D H Sbjct: 244 SQAKLFAQDNKFLFTATQKNKELLATHIDSFTAAGA-TNFLDAFTAAFAVLNDSIDQEYH 302 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQS 308 A I+F+TDGE + Q++ Sbjct: 303 VGCTTA-----ILFLTDGEMTQPENVQEA 326 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 63/199 (31%), Gaps = 20/199 (10%) Query: 175 MIVLDVSRSMESFFDSSITK-IDMAIKSINAMLEEVKLIPDVNNVVQS----GLVTFSNK 229 + VLDVS SM+ + + + + + + + + V GLV + Sbjct: 342 IFVLDVSGSMKGARLAQMKEALKLLSGAEASAASQRYAAFQARERVLLIPFSGLVGQPAR 401 Query: 230 IEEFFL-LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ L+ + + L G + Y+ + Q + D Sbjct: 402 VQFAAGDLQAASAQVLAYADSLVADGGTA--------IYDALTLAQQQARQELRADPERF 453 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEFLRACA-SPNS 344 IV +TDG N + ++ A+ G V+ I +S E A + Sbjct: 454 VSIVLLTDGANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGEMQALAALTGGR 513 Query: 345 FYLVENPHS--MYDAFSHI 361 + N + F I Sbjct: 514 AFDARNTGKSGLPLVFKEI 532 >gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 722 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 43/264 (16%), Positives = 91/264 (34%), Gaps = 42/264 (15%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 DI D+ S V + ++ K + T I + I ++ Sbjct: 173 RDIQDLKVSVDGTKDVKDAHNATFVTNNPPAKDAIVIETHIKDEDKNVAISSDGYIAITT 232 Query: 165 NSQTDARLD----MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +D ++D S SM ++I+ AIK + ++ + + + Sbjct: 233 YPYFEGPIDSNSEFYFIVDCSGSMSC------SRINNAIKCMRLFIQSLPVG------CR 280 Query: 221 SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ F + E V++ + +S TN L++ + + +G Sbjct: 281 FSILRFGSHFETVLPPCDYTDENVANAMNLLDNISANMGGTNILAPLQH-VSDLQASEG- 338 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGA-IVYAIGIRVIRSHE 334 + K I F+TDGE S C A K R +++IG+ Sbjct: 339 ----------FVKQIFFLTDGE------VDNSDIICATALKNRSTNRIFSIGLGSGADPG 382 Query: 335 FLRACA--SPNSFYLVENPHSMYD 356 ++ A S ++ ++ + +M + Sbjct: 383 LIKGMARKSGGNYAIIGDNDNMNE 406 >gi|300869050|ref|ZP_07113652.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300332961|emb|CBN58846.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 411 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 69/194 (35%), Gaps = 28/194 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L++ ++LD S SM +D ++ +++ +K + ++ F +K Sbjct: 41 PLNLCLILDHSGSMGGRP------LDTVKQAAGRLVDRLKPGD------RLSVIAFDHKA 88 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++++I L G T GLK ++ + + Sbjct: 89 KVIVPNQFIDDPGSIKKQIDKLRSSG-GTAIDEGLKLGIEEM--------GKGKSETVSQ 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFY 346 +TDGEN ++ + L A + ++G + + L A+ + Sbjct: 140 AF--LLTDGENEH-GDNNRCLKLAKLAADYNMTLNSLGFGDDWNQDILEKIADAAGGTLA 196 Query: 347 LVENPHSMYDAFSH 360 ++ P D FS Sbjct: 197 YIQRPEQAIDEFSR 210 >gi|260796397|ref|XP_002593191.1| hypothetical protein BRAFLDRAFT_72730 [Branchiostoma floridae] gi|229278415|gb|EEN49202.1| hypothetical protein BRAFLDRAFT_72730 [Branchiostoma floridae] Length = 994 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 64/217 (29%), Gaps = 40/217 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M++VLD S SM I ++ + L G+VTFS + Sbjct: 332 MVLVLDTSGSMRG------DPIRRLNQAATHFIRSTVLDDSW-----LGIVTFSTTANTY 380 Query: 234 FLLEWGVSHLQRK--IKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S R I + G +T L + I Sbjct: 381 HPLLQITSAADRTSLINRVPSTVGGTTCIGCALLEGVKVLEAQGDPSGG----------I 430 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRACASPNSFYL 347 + M+DG+ D ++ AK G I+ + + + + FY Sbjct: 431 LFLMSDGQENEAP-DIATVTPQVLAK--GIIIDTLAYKRSADPQIESLALLTGGKSYFYS 487 Query: 348 VE--NPHSMYDAF--------SHIGKDIVTKRIWYDK 374 E + ++ DAF G + + + K Sbjct: 488 GEQGDSTALNDAFTASVLSRDDRTGGEATIQLVSETK 524 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 75/219 (34%), Gaps = 31/219 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTD---ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 T R + + +++ A L++ ++LD S SM + ++ ++ Sbjct: 20 TQQRQLSIAVSAIHDTGLNNGTRNAPLNLCLILDHSGSMTGRP------LTTVKEAAQSL 73 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ + + +V F + K+ L ++ I+ L G T G+ Sbjct: 74 IDRLNPGD------RIAVVAFDHHAKVLVPNQLVEDPEQIKALIQRLEPKG-GTAIDDGM 126 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K ++ + + +TDGEN ++Q+ + A + Sbjct: 127 KLGIEELA--------VGKQGTISQAF--LLTDGENEH-GDNQRCQQFAELAAGYNITLN 175 Query: 324 AIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 +G + + L A S +E P + D F+ Sbjct: 176 TLGFGSHWNEDVLEGIADSGGGSLSFIEKPENAVDVFNS 214 >gi|163848202|ref|YP_001636246.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526110|ref|YP_002570581.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669491|gb|ABY35857.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449989|gb|ACM54255.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 415 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 78/235 (33%), Gaps = 28/235 (11%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 IP T + + + + L++ VLD S SM+ K+D + Sbjct: 15 PIPSSTTPQVGYLLVEAVAPATLAPSLPLNLCFVLDRSGSMQG------AKLDNLKAATR 68 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTP 261 ++E+++ +V F + ++ S L + +S+ G + S Sbjct: 69 RVIEQLQPTDIAA------IVIFDDTVQTLIPATPVGDRSALLAAVDSISEAGGTAMS-L 121 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 G++ +++H ++ + ++ +TDG+ + + G Sbjct: 122 GMQAG------QVELQKHLGSDRLSR---MLLLTDGQTWGDEP--LCRDLARSLGQAGVQ 170 Query: 322 VYAIGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + A G+ + + L A AS + P + F + + Sbjct: 171 IVAFGLGTEWNEQLLDDIAAASNGYSDYIAAPEQIGTFFQQAVHEAQATVATETR 225 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 109/291 (37%), Gaps = 30/291 (10%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 F KG I+ FLPI+ + + + + + + + + + Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 72 NN--RKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + +++ I I + ++S + +G V +++ +V +++ + Sbjct: 64 DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSW- 122 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME--S 186 IS +I ++ + + +R + +D+ +LD S+SM Sbjct: 123 ---ISHNEIGVEPKFKVSGDSITRKYL-------------PQPVDIYFILDTSQSMSNPW 166 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + + + T++ + +I +++E++ + + L+T++ + G ++ Sbjct: 167 YGERNKTQMQVVKDTITRVVKELENFKTGPDKKSRVALLTYNAY---NAKFDKGAGRVKL 223 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 Y S+F + S + +++FD + Q + K + +TD Sbjct: 224 Y-DYASEFSHTEAS---FESIVDKMFDKSVVEQKPHYASDYNKSQDIPLTD 270 >gi|291438431|ref|ZP_06577821.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291341326|gb|EFE68282.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 487 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 77/202 (38%), Gaps = 26/202 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + N T R + +VLD S SM ++ D S + ++ A L+ + Sbjct: 293 EKNGLTGTRAKVYLVLDRSASMRPYYKDGSAQALAEQTLALAAHLDPEATVK-------- 344 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHC 280 +V FS +++ L ++ + KI L S + + +Y+ Sbjct: 345 -VVFFSTELDGTGELT--LTDHENKIDTLHA------SLGRMGRTSYHAAVAEVLALHDK 395 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF--LR 337 + + ++VF TDG + Q+L AK A+ ++ + + F LR Sbjct: 396 SAAGPDTPALVVFQTDGAPDAKTPATQALT--EAAKTHPAVFFSFVAFGDPENKAFDYLR 453 Query: 338 ACASPNS-FYLV-ENPHSMYDA 357 +PN+ +L E P + D Sbjct: 454 KLKAPNTAHFLAGETPKELTDR 475 >gi|223670960|dbj|BAH22727.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 52.8 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 76/226 (33%), Gaps = 28/226 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + S + + A LD++ V D S S+ D+ I A I Sbjct: 226 MHFINKLELLTTDSRVRSGLSSGA---AGLDLVFVFDSSASVGE--DNFRKGIQFARTII 280 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTN 258 + + P + ++ FS+ + F L+ R+++ L G T Sbjct: 281 DEF--GISATPSG---TRVAVIVFSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGGGTA 335 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L+ + + +R N KK + +TDG++ + Sbjct: 336 TKLALQAVIDTVT--PELRN-------NSKKALFLITDGKSNKGGSPDRPAKVLRAGFN- 385 Query: 319 GAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHI 361 ++AIG+ + L++ AS Y +++ ++ I Sbjct: 386 -FEIFAIGVSDSVDKDELKSIASEPFRTHVYQIKDYATLVKLKELI 430 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 92/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 70 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 122 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + + ++ +V + N I + + + +++ G++ Sbjct: 123 AVNMLNITDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 176 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 177 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 226 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 227 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 286 Query: 371 WYDK 374 DK Sbjct: 287 TGDK 290 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 79/219 (36%), Gaps = 39/219 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 T D+++VLD S SM + K+ A ++ +++ + + Sbjct: 284 DPAPATANAKDVVLVLDRSGSMTGW------KMVAARRAAARIVDTLTGGD------RFA 331 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +++F +E+ L G+S + L G + P L+ A + + Sbjct: 332 VLSFDGVVEQPEGLGEGLSEASDRNRYRAVEHLARLEARGGTEMLAP-LEQAVALLSESG 390 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 R ++V +TDG+ ++ + A+ G V+ +GI + Sbjct: 391 RDR------------VLVLVTDGQ----VGNEDQILERIGARLAGVRVHTVGIDRAVNAG 434 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 FL A LVE+ + +A HI + I + Sbjct: 435 FLGRLAGLGSGRCELVESEDRLDEAMEHIHRRIGAPLVT 473 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 69/224 (30%), Gaps = 48/224 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S SM + D ++I A ++ +L+ + V +G+V + Sbjct: 98 VVVVIDLSGSMRAQ-DIQPSRIRAAQQAAKVLLDTLPAG------VSAGVVAMAGTAAVA 150 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF--DMQGMRQHCNTEDANYKK-- 289 I L G T GL A + + N + +K Sbjct: 151 QAPSRSKDAAATAIDRLKPQG-GTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPD 209 Query: 290 ----------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 IV +DGE+ + Q+ G VY +G+ Sbjct: 210 ASHSGEAVTPGSYPSGAIVLFSDGESNTGPAATQAAQLAAA---HGVRVYTVGVGTTDGV 266 Query: 333 --------------HEFLR--ACASPNSFYLVENPHSMYDAFSH 360 + L+ A A+ ++ + + + + Sbjct: 267 VLSVDGWSARVRLDEKVLKEVANATGAEYFPLADAAQLKRVYRA 310 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 51/261 (19%), Positives = 104/261 (39%), Gaps = 42/261 (16%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--------- 168 +V QN+ S + +K ++NS+ +PI SVK QT Sbjct: 1538 VVKSKQNKKNSYDLEKGLSLDVKTLQKHFQFSNSKDQTIPIMISVKTLEQTSDMEIESNL 1597 Query: 169 ---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D S SM S KI+ +I +++ + N+ + ++T Sbjct: 1598 LEGRPNLDLICVIDNSGSM------SGQKIENVKNTILQLIDML------NDNDRLSIIT 1645 Query: 226 FSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F++ ++ L + +LQ+ K + G TN T GL+ A++ + + + Sbjct: 1646 FNSHAQQLCGLRKVNKDNKENLQKITKSIYANG-GTNITSGLQTAFSILQSRKQRNSVSS 1704 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 I ++DG++ ++ ++L + ++ +++ G ++ Sbjct: 1705 ---------IFLLSDGQDNNSDSRIRNLLQTTYQQLQEECFTIHSFGFGNDHDGPLMQRI 1755 Query: 340 AS--PNSFYLVENPHSMYDAF 358 A SFY VE + + F Sbjct: 1756 AQIKDGSFYYVERNDQVDEFF 1776 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 57/456 (12%), Positives = 130/456 (28%), Gaps = 112/456 (24%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F + +G + ++ A+ L + +G ++ S ++T L + D + V A ++ Sbjct: 7 IARFHRDRRGNIAVIFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVSR--- 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + G + T + + N + S + + + Sbjct: 64 ----TSPAYIAAGSMSTDGAITSGADDALRIFNGNLANLTGYTLDSVAATVSKSGE---- 115 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-- 185 + S+ + T + + S+ + +D ++LD S SM Sbjct: 116 --AVTSKVTFSAQISTMFMKAVAVSSMTVGGVSTATASMPK--YIDFYLLLDNSPSMGVG 171 Query: 186 --------------------------------------SFFDSSITKIDMAIKSINAMLE 207 + T+ID+ + +++ Sbjct: 172 ATPTDVAAMVAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMD 231 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEF-----FLLEWGVSHLQR---KIKYLSKFGVSTNS 259 N + + F + F L +S + I ++ G + N+ Sbjct: 232 TAAATATYTNQFRMAIYDFGASAQTAGLRNLFALSASLSSAKTAAGAIDLMTVKGQNDNN 291 Query: 260 -----TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSL---- 309 + A +++ G + K + F++DG + + Sbjct: 292 DQDTQYTAILPAIDKLIAAPGTGA-----AGSPLKYLFFVSDGVADEYNPACLKPKTGNR 346 Query: 310 -------YYCNEAKKRGAIV---YAIGIRVIRSH-----------------------EFL 336 C K RG V Y + + + + + Sbjct: 347 CQSPINPALCKTLKDRGVRVAVLYTTYLDLPSNDWYKKWIAPFNEGPYGPSPNSEIAKNM 406 Query: 337 RACASPNSFYLVENPHSMYDAFSH-IGKDIVTKRIW 371 ACASP ++ V + +A + + + RI Sbjct: 407 EACASPGFYFEVSPTQGIAEAMNALFKRAVADARIT 442 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 109/291 (37%), Gaps = 30/291 (10%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 F KG I+ FLPI+ + + + + + + + + + Sbjct: 9 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 68 Query: 72 NN--RKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + +++ I I + ++S + +G V +++ +V +++ + Sbjct: 69 DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSW- 127 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME--S 186 IS +I ++ + + +R + +D+ +LD S+SM Sbjct: 128 ---ISHNEIGVEPKFKVSGDSITRKYL-------------PQPVDIYFILDTSQSMSNPW 171 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + + + T++ + +I +++E++ + + L+T++ + G ++ Sbjct: 172 YGERNKTQMQVVKDTITRVVKELENFKTGPDKKSRVALLTYNAY---NAKFDKGAGRVKL 228 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 Y S+F + S + +++FD + Q + K + +TD Sbjct: 229 Y-DYASEFSHTEAS---FESIVDKMFDKSVVEQKPHYASDYNKSQDIPLTD 275 >gi|291229678|ref|XP_002734799.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1003 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 41/194 (21%), Positives = 78/194 (40%), Gaps = 30/194 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEE 232 ++V+D S SME +ID ++++ + + + D V G+VTFS I+ Sbjct: 288 VLVMDTSGSMEENG-----RIDKLHQAVSNYI--LNTLDDGEEV---GVVTFSTTATIQS 337 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +L S + ++ + G T+ GL A++ + + + Sbjct: 338 HLVLINNESRTELLSRVPSMQSVGRWTSIGSGLLKAFDVL----EEGERNAAGG-----V 388 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPNSFYLV 348 IV ++DGE + ++G V +IGI + E L A +FY Sbjct: 389 IVVISDGEENRDPLIADIIPMVL---EKGVTVDSIGIGTDASTNLEVLPAATDGMTFYYS 445 Query: 349 ENPHS--MYDAFSH 360 E+ +S + +A + Sbjct: 446 EDSNSNGLNEALAA 459 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 76/194 (39%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW + + I + S+ +++ ++D S SM S ++ + S Sbjct: 191 LAPAPWNEQRTLLKVDILAK-DRKSEELPASNLVFLIDTSGSMISD-----ERLPLIQSS 244 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + +++E++ ++ V +G ++I + + + I L G STN Sbjct: 245 LKLLVKELREQDNIAIVTYAG----DSRIALPSISGSHKAEINAAIDSLDAEG-STNGGA 299 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGA 320 GL+ AY Q ++ N I+ TDG+ +D +S+ + ++ G Sbjct: 300 GLEMAYQQA-AKGFIKGGINR--------ILLATDGDFNVGIDDPKSIESMVKKQRESGV 350 Query: 321 IVYAIGIRVIRSHE 334 + +G+ +E Sbjct: 351 TLSTLGVGDSNYNE 364 >gi|118617118|ref|YP_905450.1| hypothetical protein MUL_1447 [Mycobacterium ulcerans Agy99] gi|118569228|gb|ABL03979.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length = 733 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/216 (17%), Positives = 75/216 (34%), Gaps = 37/216 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V + A D+++VLD S SM + K+ A ++ +++ + Sbjct: 257 TLVPPAEPSSAPRDVVVVLDRSGSMGGW------KMVAARRAAGRIVDMLDAGD------ 304 Query: 220 QSGLVTFSNKIEEFFLLEWGVS--------HLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + ++ F ++IE + G+ + L G + + P L A + Sbjct: 305 RFCVLAFDDRIETPPAMPDGLVPASDRNRFAASSWLGSLRSRGGTVMAQP-LTNAVEMLA 363 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 D RQ +V + DG+ +SL +Y +G+ Sbjct: 364 DSGEDRQAS----------VVLVADGQISGEDHLLRSLAPAV----GRTRIYCVGVDRAV 409 Query: 332 SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + FL A LVE+ + + + + + I Sbjct: 410 NAGFLERLAGLGSGRAELVESEDRLDEVMARLARTI 445 >gi|90021002|ref|YP_526829.1| hypothetical protein Sde_1355 [Saccharophagus degradans 2-40] gi|89950602|gb|ABD80617.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 787 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 70/208 (33%), Gaps = 27/208 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ ++ D+ +V+DVS SM+ ++ + +L ++ V Sbjct: 37 ESKLADRSGKPADVRLVIDVSGSMKRNDPAN------LRQPAVDLLMQLLPEGSKAGVWT 90 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G + +W + K ++ G+ TN L+ A + Sbjct: 91 FGKWVNMLVPHQVVDEQWRSLG-RAKASEINSVGLYTNIGEALEKAAYDL---------- 139 Query: 281 NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY-------CNEAKKRGAIVYAIGIRVIRS 332 + Y K I+ +TDG ++ + D+ + + + K G ++ + + Sbjct: 140 DAASDEYAKHIILLTDGMVDIDKQPDKNTQEWRRIVDEVLPKLKAAGYTIHTVALSDNAD 199 Query: 333 HEFLRACA--SPNSFYLVENPHSMYDAF 358 + L+ + + + + F Sbjct: 200 NNLLKKLSLQTDGIASVAHTADDLMKIF 227 >gi|311254427|ref|XP_003125838.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Sus scrofa] Length = 1177 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 43/244 (17%), Positives = 83/244 (34%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRLQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGESSVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + + + +++V +TDGE+ +E ++L C Sbjct: 242 ETKTAQAI------MMACTEGFSQSHGGRPEAARLLVVVTDGESHDGEELPEALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFL---RACAS-PNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FL RA AS P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLQEIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|304310230|ref|YP_003809828.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] gi|301795963|emb|CBL44164.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] Length = 371 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/245 (13%), Positives = 85/245 (34%), Gaps = 32/245 (13%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 +V + ++ L F S + V + + V+D Sbjct: 68 IVGNANPVQVFVPTQNNSGLDISDFQLREDGSSQ------TLVSAEPNSSLPFVTVFVMD 121 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-FLLEW 238 S S+ + ++++ +++ A K ++++ ++ F++++E L Sbjct: 122 YSPSVRNN-ETALKRMEEAAKGFVSLMQPKDKA---------AVIKFNDRVEVMGAGLTS 171 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 L I + + Y++++D + + I+ ++DG+ Sbjct: 172 NHDTLNAAIDSIPP-----------QRGYSKLYDAISKAIEVSNCNPKLVCSIIVLSDGD 220 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYD 356 ++ + L+ N+ + G V+ IG + L+ A S ++Y E+ Sbjct: 221 DVGSALPLADLH--NQLYQAGTAVFPIGYGDNINVNKLQELATNSGGAYYTSEDSSQFSA 278 Query: 357 AFSHI 361 + I Sbjct: 279 VYERI 283 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 67/176 (38%), Gaps = 18/176 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + S S+ + + I M+E+V D N V+ +T+S Sbjct: 3 CFFISRSGSVNNNW-----------MDIYNMVEDVVKKFD-NPKVRISFITYSTDGHTLM 50 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + ++ + L S + ++ + + N + +I+ + Sbjct: 51 KITSDKNEIRENLAKLQNVVPSGATH--MQEGLRK---ANEQIEQENAGEKKAPIVILAL 105 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 TDG L +++ E+++ GA VY IG++ R + L SP+ + V+N Sbjct: 106 TDGT-LLPFPFEETKMEAEESRRLGATVYCIGVKDYRKDQLLDIADSPDHMFGVDN 160 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 47/310 (15%), Positives = 96/310 (30%), Gaps = 62/310 (20%) Query: 15 YKGGMTILTAIFLPIIFL-VLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNN 73 G + +L + LP I + ++G+ I+ S ++ L + +D + + A + Sbjct: 11 KSGSVMVLITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLLGTPAVPET 70 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 +FR + + IV P I + Sbjct: 71 LA-----------AEFLASNFRTDGSAGTW------GAHDLHSTIVYTP-GITKIIDIDA 112 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD--MMIVLDVSRSME-SFFDS 190 ++PL F + TS+ + R D +M+V+D S +M+ S Sbjct: 113 TAQVPLLFLRI-----------LGKTSATVRARGSGTRTDSRVMLVIDRSGTMDVSDGTG 161 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE------------- 237 T+I+ A + + + + + GLV F + Sbjct: 162 LPTRIENAKTVAQTLF--IPAFTEGAD--EIGLVAFDGSAYVAYPPSQPGWDPTTTSSSR 217 Query: 238 ----------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ ++ + G T + L AY ++ + D Sbjct: 218 GGPDMYFKDPNNPNNMINQVNAIDA-GSYTGTAEALWMAYIELQKAHLKDLAQDGVDLRM 276 Query: 288 KKIIVFMTDG 297 I+ +TDG Sbjct: 277 NS-ILLLTDG 285 >gi|326671057|ref|XP_003199352.1| PREDICTED: integrin alpha-2-like [Danio rerio] Length = 1181 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 45/312 (14%), Positives = 104/312 (33%), Gaps = 50/312 (16%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI---SRYKIPLKFC 142 +N ++ ++ G V ++ S + + N S+ + C Sbjct: 68 TQNRKGEIYKCDMNKPGSVCQPLNLQNSVDVSSISNGNNINMSLGLTLTPTTKNNGFMTC 127 Query: 143 TFIPWYT-NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDS 190 + S++ + + V + D+ IVLD S S+ + Sbjct: 128 GPLWAQLCGSQYFYPGVCADVSPQFTLQSAFSPAVQTCGSLMDIAIVLDGSNSIYPW--- 184 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSH--LQRKI 247 + + + +LE + + P Q ++ + + +F+L + ++ Sbjct: 185 -----EPIVDFLVKLLENLNIGP---QSTQVSVMQYGVDTTFQFYLNSYKTKESMIKAAS 236 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 K G+ TN+ + +A F Q N K++V +TDGE+ + Sbjct: 237 NMQQKQGLETNTFKAIDFARTNAFLPQ------NGGRPGATKVMVVVTDGESHDANIRNK 290 Query: 308 SLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRACAS---PNSFYLVENPHSMY 355 + C+ + + I + I + + +++ AS F+ V ++ Sbjct: 291 VIAECD---SQNITRFGIAVLGYYIRNDIDTSKLIAEIKSIASKPTEKFFFNVSEEAALI 347 Query: 356 DAFSHIGKDIVT 367 + +G I Sbjct: 348 EIVGTLGDRIFN 359 >gi|293341619|ref|XP_001069109.2| PREDICTED: similar to Antxr2 protein [Rattus norvegicus] Length = 463 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 72/228 (31%), Gaps = 50/228 (21%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + + S+ + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLTVGGPGSLLQAQEQPSCKKAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + ++ V+ + + G+ L+ + G T GLK Sbjct: 65 FVHQL-----TERFVR-------------YKISKGLEDLK----AVQPVGE-TYIHEGLK 101 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A QI + G++ II+ +TDG+ +++ GA VY Sbjct: 102 LANEQIQNAGGLKTSS---------IIIALTDGKLDGLVPSYAEKE-AKKSRSLGASVYC 151 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +G+ + R S + + V+ + A I I+ + Sbjct: 152 VGVLDFEQAQLERIADSKDQVFPVKGG---FQALKGIINSILAQSCTE 196 >gi|149919202|ref|ZP_01907685.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149819916|gb|EDM79338.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 877 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 42/215 (19%), Positives = 82/215 (38%), Gaps = 38/215 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--- 226 + +M+ V+D S SM S + +A +++ L ++ + N +++F Sbjct: 371 GQREMIFVIDRSGSM------SGVPLALAKQTLREALSHLRPVDTFN------VISFESS 418 Query: 227 SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + E + + H +R I L G T + + A + G+ +H Sbjct: 419 TAMLYEAAVPANEQNLVHAERFIDGLQA-GGGTMMSGAVDAA---LSPEIGLGRH----- 469 Query: 285 ANYKKIIVFMTDG--ENLSTKEDQQSLYY--CNEAKKRGAIVYAIGIRVIRSHEFLRAC- 339 + + F+TDG N Q S ++A +R A V+ +GI + E L + Sbjct: 470 ----RYVFFVTDGFISNEDEIARQASALVRAADKAGQR-ARVFGMGIGSSPNRELLASLS 524 Query: 340 -ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A + V N +A + +V + D Sbjct: 525 KAGKGRYLAVGNREHPREAVEAYTR-MVDSAVLTD 558 >gi|271963054|ref|YP_003337250.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] gi|270506229|gb|ACZ84507.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] Length = 594 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 78/230 (33%), Gaps = 42/230 (18%) Query: 152 RHIVMPITSSVK----VNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAML 206 + +P + + S+ R +++IV+D S SME + +++++A K+ L Sbjct: 383 TTLNLPTPTVLDKVLGSWSELRKRANVLIVVDKSGSMEEEAAGTGESRLELAKKAAINAL 442 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + + GL FS + + R + L + P L+ Sbjct: 443 PQFRGDD------KVGLWAFSTRQDGD-----------RDYRELVPIDSVSKIGPALRDE 485 Query: 267 YN-------------QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + + ++ MR + N +VF+TDG+N T Sbjct: 486 LDGLTAGGGTGLYDTTLAAVERMRGARDAGAINA---VVFLTDGKNEKTGGSDLDNLLGK 542 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHI 361 ++ IG L+ A+ + Y ++ F+ + Sbjct: 543 L--NPDVRLFTIGYGEGADQGVLKRIAEATDGAAYDSSRADTIDQVFTSV 590 >gi|189524901|ref|XP_693183.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Danio rerio] Length = 963 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 70/201 (34%), Gaps = 31/201 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ V+D S SM K+ +++ +L E+ GLVTF + Sbjct: 332 QMPKNVVFVIDRSGSM------MGEKMKQTQEALTTILSELHEDD------YFGLVTFDD 379 Query: 229 KIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 IE + V+ + ++ ++ + T+ G+ YA + + + + Sbjct: 380 VIESWRPSLSKATPENVTEAKEYVQTINARSM-TDINKGILYAVDMLTSEKSASFPNMS- 437 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +I+ +TDG+ S ++D + A ++ +G + L A Sbjct: 438 ------MIILLTDGQPSSGEQDLSKIQENVRNAINGSMSLFCLGFGYDLDYILLDTLAKQ 491 Query: 343 N-----SFYLVENPHSMYDAF 358 N Y + F Sbjct: 492 NDGLARRVYEASDAALQLQGF 512 >gi|169234588|ref|NP_001038425.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Danio rerio] gi|169154233|emb|CAH68946.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 1 (CACNA2D1) [Danio rerio] Length = 1069 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 78/199 (39%), Gaps = 35/199 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S++ MLE + VN +V+F Sbjct: 245 GAASPKDMLILVDASGSVSGLT------LKLIRTSVSEMLETLSDDDYVN------IVSF 292 Query: 227 SNKIEEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +N + E L+ ++ ++ G +T+ G K A+NQ+ M R + Sbjct: 293 NNSAKSVACFENLVQANVRNKKTLKEAVQKITANG-TTDYKIGFKEAFNQLASMNVSRAN 351 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---L 336 CN KII+ TDG ED+ S + + ++ + + Sbjct: 352 CN-------KIIMLFTDG-----GEDKASEIFDEYNSDKRVRIFTFSVGQHNYDKAPIQY 399 Query: 337 RACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 400 MACHNKGYYYEIPSIGAIR 418 >gi|11414930|dbj|BAB18557.1| voltage dependent calcium channel alpha2d/delta subunit [Rana catesbeiana] Length = 1078 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVTEMLETL----SDDDFVNVAAFNS 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFHHLVQANVRNKKVLKEAVNNITAKG-TTDYKQGFKFAFDQLRNTNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG ED+ + + K + V+ + + A Sbjct: 357 -------KIIMLFTDG-----GEDKATETFKLYNKNKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CENKGYYYEIPSIGAIR 421 >gi|33595651|ref|NP_883294.1| putative hemolysin [Bordetella parapertussis 12822] gi|33565730|emb|CAE36274.1| putative hemolysin [Bordetella parapertussis] Length = 2215 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 73/192 (38%), Gaps = 11/192 (5%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDS---SITKIDMAIKSINAMLEEVKLIPDVN 216 +K N ++ +VLD+S SM + S T++ A ++ A+LE + D Sbjct: 1620 GGIKQNVTAGTSYNIALVLDLSDSMNDKWGSGSNKPTRLQTAKDALKALLENQLAVHDGE 1679 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + L+TF+ G++ ++ + L G+ +S A+++ Sbjct: 1680 --INVSLITFNGSSSALKKSITGLTPENVDEMVDIL--MGLKASSATPYGAAFDRTTQWF 1735 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + ++E YK + F+TDGE + + + A IGI S Sbjct: 1736 EGQPTVDSEGKPYKNLTFFLTDGEPSTEWSYNRDNEFAELAAISDVH--GIGIGSGVSTS 1793 Query: 335 FLRACASPNSFY 346 L + +Y Sbjct: 1794 TLNKYDNTGGYY 1805 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 64/176 (36%), Gaps = 26/176 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +I+ + + + ++ + + Sbjct: 79 GTKLETVKREGVDVVFAIDVSKSMLAE-DIAPNRIEKSKQLVTQIINSLGSD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ + S + + ++ T ++ A D + Sbjct: 131 IGIIAYAGSAYPQLPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDEEQTN 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++ ++DGE+ ++ +A K G ++ IG+ Sbjct: 191 -----------RVLFIISDGEDH----VGEASSLAEQANKEGIRIFTIGVGKTEGG 231 >gi|221486991|gb|EEE25237.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii GT1] Length = 1109 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 47/147 (31%), Gaps = 15/147 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+P T + + R+D + VLD S S+ I + Sbjct: 237 VVPETPEISTTTCHKGRVDAVAVLDGSGSISRADWKKTRDIAKLFSGALNI--------- 287 Query: 215 VNNVVQSGLVTFSNKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +V FS + W L +I L + TN+ L+ AY Sbjct: 288 AEDQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIF 347 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG 297 R + + ++++ TDG Sbjct: 348 VTSMNNRDEHD--SKHVHRVLLLATDG 372 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 52/137 (37%), Gaps = 13/137 (9%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +D ++VLD S S+ + + K+ + + + GL Sbjct: 12 GQACKLKNVDAVVVLDSSMSVGAEHWQELLKLTK---------QFGDTLDSSAGHSRLGL 62 Query: 224 VTFSNKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V++S+ + L+ G + + ++ S +T + L AY +F Sbjct: 63 VSYSDSVTVLRKLQKIPSGTAQFEEELGAASFMNGNTFTPKALDSAYE-LFKETIHEDSE 121 Query: 281 NTEDANYKKIIVFMTDG 297 ED +++++ TDG Sbjct: 122 GAEDQEKRRLLLLATDG 138 >gi|196007112|ref|XP_002113422.1| hypothetical protein TRIADDRAFT_57573 [Trichoplax adhaerens] gi|190583826|gb|EDV23896.1| hypothetical protein TRIADDRAFT_57573 [Trichoplax adhaerens] Length = 949 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 66/202 (32%), Gaps = 25/202 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + +VLD S SM S + + ++ ++ + + + G+VTFS Sbjct: 317 PLRIAMVLDKSGSM------SGRNMQLLSQAAINVIAQSRNFDG-----KLGIVTFSTNA 365 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L G S + I L S + I + + Sbjct: 366 TVTCPLTAGESDQEKNKLINCLPSEAEGETSIG------SGILKGIELLRKSKDGRKPSG 419 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 ++ M+DG+ + + ++ +I + S L S++ Sbjct: 420 GHLIVMSDGQENYRPYIEN---IMTNITENEIVITSISLGQHASENLEDLSKLTGGLSYF 476 Query: 347 LVENPH-SMYDAFSHIGKDIVT 367 N ++ +AF+ I ++V Sbjct: 477 ASTNSTLTLINAFTTIVSNLVG 498 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 73/175 (41%), Gaps = 25/175 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM S I A ++ L ++ N +V F++ ++++ Sbjct: 401 LILVIDTSGSM------SGDAIIQAKTALKYALAGLRPTDKFN------IVQFNSDVDKW 448 Query: 234 FLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQ-GMRQHCNTEDA 285 + + ++ Q I L G + S + A N + D + G N + Sbjct: 449 SGMAMSATPYNLAQAQNYINRLEANGGTEMS-IAINAALNIETVTDKETGTELDNNDLGS 507 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N + ++F+TDG ++ L+ EA+ + ++ IGI + F++ A Sbjct: 508 NLLRQVLFITDG----AVSNESMLFELIEAQLGDSRLFTIGIGSAPNAHFMQRAA 558 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 72/199 (36%), Gaps = 33/199 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVT 225 + ++ I+LD S SME ID ++++ + + V +G L++ Sbjct: 194 KDKQGVNYEILLDKSGSMED-------SIDTMKRTMSDFVRNLNY-----KVGDTGELIS 241 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + + L I ++ +G+ T L + G Sbjct: 242 FDSYLMYMATYTNDKDRLLTGIDNMTPYGM-TALYDALYTGITNASNHPGFNC------- 293 Query: 286 NYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 ++ TDG +N ST + + AK++G VY IG S L+ A + Sbjct: 294 -----VIAFTDGQDNESTHTADEVISL---AKEKGIPVYLIGTSEADS-STLQNIANETN 344 Query: 343 NSFYLVENPHSMYDAFSHI 361 F+ + + M + + I Sbjct: 345 GYFWDMNSISDMQEVMNRI 363 >gi|11414932|dbj|BAB18558.1| voltage dependent calcium channel alpha2e/delta subunit [Rana catesbeiana] Length = 1083 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVTEMLETL----SDDDFVNVAAFNS 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFHHLVQANVRNKKVLKEAVNNITAKG-TTDYKQGFKFAFDQLRNTNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG ED+ + + K + V+ + + A Sbjct: 357 -------KIIMLFTDG-----GEDKATETFKLYNKNKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CENKGYYYEIPSIGAIR 421 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 29/130 (22%), Positives = 50/130 (38%), Gaps = 16/130 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++V+D S SM+ D++ + A +L + VQ +VTFS K Sbjct: 490 PADIVLVMDKSGSMKGELDNNAK--EAANALAKKLLTDKNSTLPSEQQVQMAVVTFSTKA 547 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 V + ++ TN LK A + + +N KK Sbjct: 548 TIEQNFTTDVLKINNAVEGDP--DGGTNWEAALKQA------------NILSGRSNVKKH 593 Query: 291 IVFMTDGENL 300 I+F++DG+ Sbjct: 594 IIFLSDGDPT 603 >gi|291407509|ref|XP_002720066.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Oryctolagus cuniculus] Length = 1320 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 61/175 (34%), Gaps = 24/175 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK+ K++N +L +++ N+ ++FS+ + Sbjct: 284 VVFVIDVSGSMFG------TKMQQTKKAMNVILSDLQA----NDYFNI--ISFSDTVSVW 331 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDAN 286 V + + ++ G T+ L A + + Q + Sbjct: 332 RAGGSIQATSQNVHSAKNYLDHMEAAGW-TDINAALLEAASVLNHSNQEPGRSPGVGRTP 390 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+TDGE L +A ++++ LR + Sbjct: 391 L---IIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFPLLRRLS 442 >gi|71984286|ref|NP_498819.2| C-type LECtin family member (clec-160) [Caenorhabditis elegans] gi|47117847|sp|P34393|CL160_CAEEL RecName: Full=C-type lectin domain-containing protein 160; Flags: Precursor gi|28894818|gb|AAK84522.2|L11247_8 C-type lectin protein 160, confirmed by transcript evidence [Caenorhabditis elegans] Length = 639 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 25/184 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ V+DVS M + M IN ++ ++ L P++ VQ GL+ +S+K E Sbjct: 288 IDIIFVIDVSEGMGQGG------LMMVKAEINTLVGQMSLDPNIQKHVQVGLIKYSDKAE 341 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA------YNQIFDMQGMRQHCNTEDA 285 F + +F S P L+ N +Q + + Sbjct: 342 VVFKPSDYTNE--------DEFTEDLWSDPRLEDVDEKSDEVNLHLGLQQAAKMTASMRN 393 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA---CASP 342 +K+IV + + + + + ++ G + + S + ASP Sbjct: 394 GVRKVIVVY--AASYNDEGNDDARQIAANIRETGYAIITVAFVEPESSNLVMKIGEIASP 451 Query: 343 NSFY 346 + Sbjct: 452 RMNF 455 >gi|311268952|ref|XP_003132278.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Sus scrofa] Length = 1146 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 75/201 (37%), Gaps = 38/201 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-----IRVIRSHE 334 CN K+I+ TDG ++ + + N + +++G + Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTGRVFTFSVGQHNYDVTPPAXX- 444 Query: 335 FLRACASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 445 ---GCATKGYYFEIPSIGAIR 462 >gi|311268950|ref|XP_003132277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Sus scrofa] Length = 1153 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 75/201 (37%), Gaps = 38/201 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG-----IRVIRSHE 334 CN K+I+ TDG ++ + + N + +++G + Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTGRVFTFSVGQHNYDVTPPAXX- 444 Query: 335 FLRACASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 445 ---GCATKGYYFEIPSIGAIR 462 >gi|221131834|ref|XP_002154795.1| PREDICTED: similar to saxiphilin [Hydra magnipapillata] Length = 1094 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/241 (15%), Positives = 76/241 (31%), Gaps = 33/241 (13%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 L+ F P T + +T+ + N LD++IV+D S S++ ++ I Sbjct: 13 YTLQVFNF-PLATITESSSSFVTTPLCPN-----PLDIVIVIDSSGSVQKEWNDIIDHAQ 66 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYL--SK 252 + N + G+V FS + + E + ++ L Sbjct: 67 YFASTFNV----------SEQHTRIGIVDFSAVANVYKTVDNENTEEQVYNALESLRARP 116 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T L+ + + N +KI+V TDG+ + K+++ Sbjct: 117 QNGETWLNLALQR-------TIELFGSATPQRENVRKIMVLYTDGKMTN-KDEESLRDLI 168 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + Y + + L AS + + GK + + + Sbjct: 169 KSHRLVSVESYIVQVNNDSHESTLHEVASSKLHIFKLSGEPV-----ESGKSFLNDAVCF 223 Query: 373 D 373 + Sbjct: 224 N 224 >gi|28871878|ref|NP_794497.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28855131|gb|AAO58192.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 224 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 68/180 (37%), Gaps = 18/180 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 AR+ + +VLDVS SM I +N + ++ + +VTF Sbjct: 15 NPTARVPICLVLDVSGSMAG------EPIRELQAGVNMFYQAIREDEVAQYAAEISIVTF 68 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + ++ ++ + L G +T+ G+ A + + +G Q Sbjct: 69 GSEAKRTV---DFMAIERQDVPALIAEG-TTSMGQGVNLALDLLEVRKGDYQRA--GVDY 122 Query: 287 YKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 Y+ +V MTDGE T+ ++ C K V+ I I + + L SP Sbjct: 123 YQPWMVVMTDGEPTDDITRASERIREMCESKK---LTVFPIAIGTAANLDIL-GMLSPGR 178 >gi|316969306|gb|EFV53424.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 412 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 76/195 (38%), Gaps = 24/195 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + DA++D+++VLD S S+E F++ K++ + V+L+P + + Sbjct: 36 GQNCRDAKVDLVLVLDSSGSVERTFENY--------KAVARHI--VQLLPIGYDRTLMSI 85 Query: 224 VTFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + +S + F + L ++ + G T + ++ G+ Q Sbjct: 86 LQYSKDAHVLLPFSADQRPEQLNEIVEQIQFLGSITATAEAVQM---------GLAQFGC 136 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACA 340 ++ K+ + +TDG + + D + N + GA V + I E Sbjct: 137 GTRSDASKVFILITDGNSNNKWPD--VVNAANALQSSGATVAVVAFGDSIYWPEIDLYAG 194 Query: 341 SPNSFYLVENPHSMY 355 S + Y +N +Y Sbjct: 195 SSTNAYTEQNVEHLY 209 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 61/187 (32%), Gaps = 24/187 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + D++IV+D S+S+E F+ + L+ V+ + Sbjct: 215 LTGRVCEEQPTEHCSQQA-DIVIVVDSSQSVEEQFEQYKAEA----------LQFVRSLD 263 Query: 214 DVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N +V + + F + ++ L G +T++ ++ A +Q+ Sbjct: 264 VGINSTLCSIVQYGRHATVILPFSEHQSKEIIIASLQNLKHLGGTTHTADAIQLALHQL- 322 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 KK+ + MTDG S L + + + Sbjct: 323 --------KLNGRRGSKKLFLLMTDG--NSADSWDTVLLTSDHLHASVDQLTIVAFGSNL 372 Query: 332 SHEFLRA 338 + L+A Sbjct: 373 GYNELQA 379 >gi|291398101|ref|XP_002715428.1| PREDICTED: integrin, alpha 10 [Oryctolagus cuniculus] Length = 1169 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 81/244 (33%), Gaps = 46/244 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 133 GSSVFSSGICARVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 187 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + ++ + G Sbjct: 188 LRRLVGRL----FIDPEQ--IQVGLVQYGESAVHEWSLGDFRTKEEVVRAARNLSRREGR 241 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F +++V +TDGE+ +E +L C Sbjct: 242 ETKTAQAIMVACTEGFSQS------RGGRPEAARLLVVVTDGESHDGEELPTALKACEAG 295 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FL+ + P+ F+ V + ++ D +G Sbjct: 296 R---VTRYGIAVLGHYLRRQRDPSSFLKEIRAIASDPDERFFFNVTDEAALTDIVDALGD 352 Query: 364 DIVT 367 I Sbjct: 353 RIFG 356 >gi|268325023|emb|CBH38611.1| hypothetical protein, containing PKD domain [uncultured archaeon] Length = 1152 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 64/215 (29%), Gaps = 20/215 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI------PDVNNVVQ 220 T LD++ V+D + SM + + I A++ + ++ V+ Sbjct: 304 GTGLSLDLIFVIDTTGSMGDDIANVKASASTIVNEIEAIIPDYQVAVVDYRDFPVDPYGG 363 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQH 279 G F++ + + I+ L+ + Y+ + + Sbjct: 364 DGDYPFND----VLPFSTDKAAIISAIQGLTLGWGGDWE----ESVYSALMHSIDAGSLG 415 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + K I+ M D + S+ E +Y I I Sbjct: 416 GWRGEDQALKAIILMGDAPPHDPEPFTGYILTSVAIAAEL-ADPVHIYTIQIGGPVGKFA 474 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A + + EN + DA ++I + I Sbjct: 475 ELASQTGGEVFTAENAEEVVDAILEAIEEITKRPI 509 >gi|296131396|ref|YP_003638646.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] gi|296023211|gb|ADG76447.1| Monophenol monooxygenase [Cellulomonas flavigena DSM 20109] Length = 971 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 39/131 (29%), Gaps = 14/131 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM TK + +++ + ++ + G+ F + Sbjct: 456 SLALVLDRSGSMADVAAGGATKSTLLKRAVGVVHSLMQPTDE------IGIARFGTTADV 509 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + L + T P A + +G K ++ Sbjct: 510 VLPMTAASAGLGTVL-------TGTALDPAGATALGRGL-QEGSGLINGPGATKPNKAVI 561 Query: 293 FMTDGENLSTK 303 MTDG Sbjct: 562 VMTDGNENIPP 572 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 41/314 (13%), Positives = 106/314 (33%), Gaps = 42/314 (13%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGY 127 N + + ++ + + F ++ + N + + ++ Y Sbjct: 115 NYARYRDNPVVAAQEQPVST-FGADVDTGSYTNVRRLLNEGRLPPPDAVRAEEFINYFDY 173 Query: 128 SISAISRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + + P T PW + + + I +V Q +++ ++D S SM Sbjct: 174 GYATPDSRQQPFSIITEVSAAPWNPQRQLLKIGIQG-YRVAPQDIPAANLVFLVDTSGSM 232 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVSH 242 + K+ + ++ ++ +++ + +VT++ + + + Sbjct: 233 -----AERDKLPLIKGALKQLVAQLRPQD------RVAIVTYAGQASMTLDSTPGDQKAR 281 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLS 301 + I L G STN GL AY Q ++ N I+ +DG+ N+ Sbjct: 282 INAAIDELRAAG-STNGGAGLDLAYAQA-AKGFVKGGVNR--------ILLASDGDFNVG 331 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA----CASPNSFYLVENPHSMYDA 357 + + ++ G + +G+ ++ L + + YL S+ +A Sbjct: 332 ATDLEDLKDKIARQRQGGIALTTLGVGGGNFNDALAMQLADAGNGSYHYL----DSLREA 387 Query: 358 FSHIGKDIVTKRIW 371 + + + + Sbjct: 388 RKVLAAQMSSTLLT 401 >gi|224067090|ref|XP_002302350.1| predicted protein [Populus trichocarpa] gi|222844076|gb|EEE81623.1| predicted protein [Populus trichocarpa] Length = 705 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 68/193 (35%), Gaps = 35/193 (18%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + +++ ++ A +D++ VLDVS SM K+ M +++ + Sbjct: 305 PPLPSLTTRNSSNSTASLLDPSRRAPIDLITVLDVSASMTG------AKLQMLKRAMRLV 358 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTP 261 + + + +V FS+ + L+ G +R I L G ++ Sbjct: 359 ISSLGSAD------RLSIVAFSSSPKRLLPLKRMTPNGQRSARRIIDRLV-CGQGSSVGE 411 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L+ A + R+ N + I+ ++DG++ + + Sbjct: 412 ALRKATKVL----EDRRERNPVAS-----IMLLSDGQDERSSTRFAHIE---------IP 453 Query: 322 VYAIGIRVIRSHE 334 V++ G + Sbjct: 454 VHSFGFGQSGGNS 466 >gi|124504959|ref|XP_001351221.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] gi|4493942|emb|CAB38978.1| CSP and TRAP-related protein (CTRP) [Plasmodium falciparum 3D7] Length = 2114 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 76/234 (32%), Gaps = 23/234 (9%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--DARLDMMI 176 P N S P P + P S+ ++ D+ + Sbjct: 793 ESTPGNPSESTPGSPSESTPGNPSESTPGSPSESTPGSPSESTPCSGTECLCHNTYDLTL 852 Query: 177 VLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S S+ S ++ + + + + V G++ FS+KI EF Sbjct: 853 IIDESASIGYSNWE---------KEVVPFTIGLASNLEISEKKVNMGILLFSDKIREFIK 903 Query: 236 L----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + ++L R+I L K+ + G Y + + N K+ Sbjct: 904 YGQKESYDKNNLVRRIHDLKKY----YKSGGFSYIVEALKYGLYSYAKSTSSRLNVPKVN 959 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACASPN 343 + +TDG N T + + + KK + IGI H L C + Sbjct: 960 ILLTDGNNTDTSDFILT-EVSSLYKKENVKLLLIGIGGPTIHKLRLLGGCDKSD 1012 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 70/177 (39%), Gaps = 27/177 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D+ ++LD SRS+ +D K + E+V + + + G++ F+ ++ Sbjct: 305 DLTLILDESRSIT---------LDKWKKDVVPFAEKVLNNLNIDKDKIHVGIMRFAKSMK 355 Query: 232 EFFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E +++ + L + +G +T+ L+Y ++ +H N Sbjct: 356 TDIGYEQETRYMKNDLIKLVRELKDKYGYGGATHLVDALQY------SLKTFTRHPN-NR 408 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRAC 339 + K+ + TDG N ++K+++ +K + +G+ + L C Sbjct: 409 VDAPKVTILFTDG-NETSKKEKDIRDVGLLYRKENVKLIVVGVNLATEKSLKLLAGC 464 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 75/206 (36%), Gaps = 25/206 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIE 231 D+ ++LD S S+ S + ++ +K + N V G++ FS+K Sbjct: 85 DLTLILDESASIGS---------KNWKNHVIPFTDKIIKDLTISKNEVHVGILLFSSKNR 135 Query: 232 EFFLLEWGVSHLQRKIKYLSKF-------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + + + ++ + G T LKY ++ +H N Sbjct: 136 DYVTYGDELRYQKDELLKKVEKLKKDYYCGGGTKILGALKY------SLENYTKHKNI-R 188 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K+ + TDG N ++ ++Q L ++ + +G+ ++ + Sbjct: 189 YDAPKVTILFTDG-NENSASNKQLLEMGLTYRRERVKLLVLGVAAAEDNKLKLIAGCEEN 247 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 + + ++ + I K + K Sbjct: 248 TNCPYSMKAEWETINDITKRLTNKIC 273 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 65/180 (36%), Gaps = 32/180 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL---VTFSNK 229 D+ +VLD S S+ + + I LE +K I V G+ ++ Sbjct: 569 DITLVLDESASISDLIWRN--------EVIPFSLEIIKRINISYKNVHMGVLLFSEYTRD 620 Query: 230 IEEFFLLE-WGVSHLQRKIKYLS---KFGVSTNSTPGLKYA---YNQIFDMQGMRQHCNT 282 I F+ + LQ KI L + G T L+YA Y+++ + Sbjct: 621 IVRFYDNARYEKGTLQTKINDLKRDYRSGKKTYIIQALQYALTYYSKL-----------S 669 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACA 340 K+ + TDG N S + ++ +K + +G+ ++ L CA Sbjct: 670 NRKEAPKVTMLFTDG-NDSYESEKGLQDIALLYRKENVKLLVVGVSTASENKLKMLVGCA 728 >gi|17560426|ref|NP_504602.1| hypothetical protein F31F7.2 [Caenorhabditis elegans] gi|15617822|gb|AAB52325.2| Hypothetical protein F31F7.2 [Caenorhabditis elegans] Length = 689 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 74/191 (38%), Gaps = 21/191 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+M ++D S S+ I D+ I +L++V + P + + ++ +S Sbjct: 482 PIDLMFLVDTSSSIG------INNFDIQKNFICEILKDVDIAPGRS---RIAMIQYSQDP 532 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA----YNQIFDMQGMRQHCNTED 284 F + + ++R + LS G +T + L +A Y++ + ++H Sbjct: 533 SVVFGFDQYYSYESVRRGVMRLSYTGGATMLSKALAFAGGIMYHEQNLKKTTKKHQYLPT 592 Query: 285 ANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + + ++ ++DG D++S+ + ++A+ R + Sbjct: 593 PKHDRLQVLCLVSDGY-SDDNADKESVNLHDHLH---VKIFAVVTRSFNKDKLAPITRFD 648 Query: 343 NSFYLVENPHS 353 S + V S Sbjct: 649 GSVFTVHQRES 659 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 82/250 (32%), Gaps = 37/250 (14%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + +++ +Y + +P ++++ + + T A LD+ +V Sbjct: 5 PTPAQRAGSVAVTVTPQYDL-------LPSNARELNLMVRLEGTGD-APATRAPLDLALV 56 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI----EEF 233 +D S SM S K+ + +LE ++ + LV++S+ + Sbjct: 57 IDRSGSM------SGDKLSDVKTAALELLETLQPEDTIT------LVSYSSDVSMHLMRT 104 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + G +R + L G T PGL A + D ++ Sbjct: 105 RADDAGQREARRALLALQARG-GTALGPGLFRALEALEGAS---------DRTRMSHLML 154 Query: 294 MTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVEN 350 +DG N A G V +G+ V + + + A ++ +++ Sbjct: 155 FSDGIANAGEVRPSVLGARAAGAFGAGVSVSTMGVGVDYNEDLMTRLADQGGGRYHFIQD 214 Query: 351 PHSMYDAFSH 360 ++ Sbjct: 215 SEAIASILDD 224 >gi|118575253|ref|YP_874996.1| hypothetical protein CENSYa_0043 [Cenarchaeum symbiosum A] gi|118193774|gb|ABK76692.1| conserved hypothetical protein [Cenarchaeum symbiosum A] Length = 311 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 72/222 (32%), Gaps = 42/222 (18%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + Q + +++VLD S SM + D + T++D A + ++ + + G+ Sbjct: 77 PSGQAEGGAGIVLVLDGSGSMAAD-DYAPTRLDAAKAAAAQLVGRLAPGD------RVGV 129 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + L + +I + + +T GL Sbjct: 130 ILFGSSAITISYLTSDRAEAAGRIGEIVQGDGATALGDGLALGVEMAAAGPE-------- 181 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-------------- 328 K I+ ++DG N ++L + V+ IG+ Sbjct: 182 ----KSTIILLSDGVHNSGRTVPGEALELAIQG---NIRVHTIGMGSDEPVRVGDDIFGE 234 Query: 329 ---VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDI 365 + LR A + +Y + ++ F + +DI Sbjct: 235 PRYAELDEDTLREIADRTGGMYYTSVDNPTLDGIFEALSEDI 276 >gi|386831|gb|AAA51620.1|AAA51620 integrin alpha subunit precursor [Homo sapiens] Length = 1163 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 77/216 (35%), Gaps = 32/216 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V D++ ++D S S+ S + + A++ + + Q Sbjct: 140 VSRQECPRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQF 188 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ FSNK + E S+ + + + T + ++ +++F Sbjct: 189 SLMQFSNKFQTHLTFEEFRRTSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS----- 243 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEF 335 + KI++ +TDG+ D + + +A G I YAIG + S + Sbjct: 244 -YGARRDATKILIVITDGKKEGDTLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKE 300 Query: 336 LRACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 L A S + VE+ +DA I + K Sbjct: 301 LNDIASKPSQEHIFKVED----FDALKDIQTQLREK 332 >gi|325695689|gb|EGD37588.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK150] Length = 460 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 62/168 (36%), Gaps = 32/168 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSS-------ITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + V D S SM +++ + + AM++++K I +V+ Sbjct: 193 EGQMNVSISFVFDKSGSMAWDMLGKEVEETNRPSRMKILKEKSIAMMKDLKSIGNVS--- 249 Query: 220 QSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 LV FS E+F L+ G + ++ I L + G TN GL+Y + Sbjct: 250 -VNLVAFSTLGSYVQEDFSELDKGTTTIETSINKLDE-GGYTNPGDGLRYGMVSLQKNPA 307 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K +V +TDG + + + + + VY Sbjct: 308 QL-----------KYVVLLTDGVPNAF-----TAKTNDVSSRNRIKVY 339 >gi|11414928|dbj|BAB18556.1| voltage dependent calcium channel alpha2c/delta subunit [Rana catesbeiana] Length = 1085 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVTEMLETL----SDDDFVNVAAFNS 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFHHLVQANVRNKKVLKEAVNNITAKG-TTDYKQGFKFAFDQLRNTNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG ED+ + + K + V+ + + A Sbjct: 357 -------KIIMLFTDG-----GEDKATETFKLYNKNKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CENKGYYYEIPSIGAIR 421 >gi|15418999|gb|AAK77222.1| capillary morphogenesis protein-2 [Homo sapiens] gi|119626253|gb|EAX05848.1| anthrax toxin receptor 2, isoform CRA_a [Homo sapiens] Length = 386 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 76/224 (33%), Gaps = 34/224 (15%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 VY +G+ + R S + V+ + A I Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINS 212 >gi|329849361|ref|ZP_08264207.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841272|gb|EGF90842.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 505 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 43/305 (14%), Positives = 102/305 (33%), Gaps = 34/305 (11%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVP 122 E + R K + R+ + +F ++ + N + + ++ + Sbjct: 42 ERPNDERYDGKAVSPIVRVADRPVSTFAVDVDTGAYANVRRLLNDGQRPPADAVRTEELL 101 Query: 123 QNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 Y + K P T + PW NSR + + + + V +++ ++D Sbjct: 102 NYFRYDYPLPADKKQPFSITTEVTTTPWNPNSRLLRVGLRA-YDVPRSERPAANLVFLID 160 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 VS SM+ K+ + ++ + ++++ V+ VV +G Sbjct: 161 VSGSMDEK-----DKLPLVQHALRLVADDMRPRDRVSIVVYAGAAG------IVLEPTAN 209 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE- 298 + ++R + L G ST G+ A + + ++ TDG+ Sbjct: 210 PAQVRRALGQLKA-GGSTAGGEGIALA-----YATARAAYIDGGINR----VILATDGDF 259 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMY 355 N+ + + + K G + A+G +E L + ++ +++ Sbjct: 260 NVGISDPEAIKDLVRKNKDDGITLTALGFGTGNYNEALMEGIADVGNGNYAYIDSASEAR 319 Query: 356 DAFSH 360 Sbjct: 320 KVLDD 324 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 39/227 (17%), Positives = 84/227 (37%), Gaps = 31/227 (13%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F+ + I +P T+ S T +++ ++LDVS SM + +++ A ++ N Sbjct: 12 FVLLAVLTAGIAVPTTAVADEISATAPKVN--LLLDVSGSMRAKDIDGQSRMSAAKQAFN 69 Query: 204 AMLEEVKL----------IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 +L+ + ++G + ++ L+ + + + L+ Sbjct: 70 EVLDATPKEVQLGIRTLGADYPGDDRKTGCKD-TAQLYPVGPLD--RTEAKTAVATLAPT 126 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G T P L A + + +G K IV ++DGE+ D + Sbjct: 127 GW-TPIGPALLKAADDLEGGEGT------------KRIVLISDGEDTCAPLDPCEVAREI 173 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDA 357 AK G + +G+ +C A+ ++ +E+ + D Sbjct: 174 AAKGIGLTIDTLGLVPNAKLRVQLSCIAEATGGTYTSIEHRDELTDR 220 >gi|86148746|ref|ZP_01067019.1| VCBS [Vibrio sp. MED222] gi|85833461|gb|EAQ51646.1| VCBS [Vibrio sp. MED222] Length = 2142 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 73/184 (39%), Gaps = 19/184 (10%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAML 206 + I +P+T K ++ + +V+DVS SM + T++ + +S+ ML Sbjct: 1499 QPVATAIDVPVTPETKSDTN------IQLVIDVSGSMGYDSGVAGKTRLAILKESLAKML 1552 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKY 265 ++ + DV VQ T + K+ W S +I L +T+ L+ Sbjct: 1553 QQYDTLGDVK--VQIVTFTGNAKLIHDGSKSWFSVSEAITEINKLKPKN-NTDYDDALRK 1609 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQ-QSLYYCNEAKKRGAIVY 323 A +D + + + +Y F+TDG N + D ++ + G Sbjct: 1610 A-RTSWDHDEDSKLPDANNVSY-----FITDGIPNQDDRIDYWEAKTWTKHLDDNGITSQ 1663 Query: 324 AIGI 327 AIGI Sbjct: 1664 AIGI 1667 >gi|114327015|ref|YP_744172.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] gi|114315189|gb|ABI61249.1| hypothetical protein GbCGDNIH1_0351 [Granulibacter bethesdensis CGDNIH1] Length = 329 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 88/251 (35%), Gaps = 52/251 (20%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQT------DARLDMMIVLDVSRSMESFFDSSI------ 192 + + I+ + ++ + +M+++D S SM+ FD+ Sbjct: 50 VLRALGAMAIIAVVLGLAGLHREKLTIQKVGQGAQIMLLIDRSISMDQTFDNQTPNAAKE 109 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 +K D AI+ + + + G+V FS L + ++ + + + Sbjct: 110 SKTDAAIRLVKDFFRQ-------RPHDRFGVVAFSTSPILAMPLTEHRAAVEASLDAMRR 162 Query: 253 FGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 ++ TN GL A+ Q+ + + ++++F++DG + E + L+ Sbjct: 163 PAIARTNIGRGLSLAFAQLQNS----------SPDAARVVLFVSDGAGVIDGELEPRLH- 211 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACAS-------------------PNSFYLVENPH 352 EA K G +Y + L A P + +NP+ Sbjct: 212 -AEAVKLGVHIYYL-YLRTEGDPGLHAAGETADADAPAALDAWFSGLGVPYKAFEADNPN 269 Query: 353 SMYDAFSHIGK 363 ++ A I + Sbjct: 270 ALKSAVDTINR 280 >gi|11414926|dbj|BAB18555.1| voltage dependent calcium channel alpha2b/delta subunit [Rana catesbeiana] Length = 1090 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 74/197 (37%), Gaps = 31/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ MLE + ++ V Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVTEMLETL----SDDDFVNVAAFNS 297 Query: 227 SNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + F L+ + ++ G +T+ G K+A++Q+ + R +CN Sbjct: 298 NAHDVSCFHHLVQANVRNKKVLKEAVNNITAKG-TTDYKQGFKFAFDQLRNTNVSRANCN 356 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRA 338 KII+ TDG ED+ + + K + V+ + + A Sbjct: 357 -------KIIMLFTDG-----GEDKATETFKLYNKNKTVRVFTFSVGQHNYDKGPIQWMA 404 Query: 339 CASPNSFYLVENPHSMY 355 C + +Y + + ++ Sbjct: 405 CENKGYYYEIPSIGAIR 421 >gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 426 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 70/210 (33%), Gaps = 36/210 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L++ ++LD S SM I I+++ +++ +K+ + +V FS + Sbjct: 41 PLNLCLILDRSGSMHGQP------IKTVIQAVEGLIDRLKVGD------RISVVAFSGSV 88 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + ++ +IK T GL+ ++ Sbjct: 89 EVIIPNQVIEDPESIKSQIKSKLSASGGTAIAEGLELGITELMKGTRGAVSQA------- 141 Query: 289 KIIVFMTDGENLST----------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +TDG S+ ++++ L +A K + G + + L Sbjct: 142 ---FLLTDGHGESSLRIWKWDIGRDDNKRCLKLAQKAAKLNLTINTFGFGNSWNQDLLEK 198 Query: 339 CAS--PNSFYLVENPHSMYDAFSHIGKDIV 366 A + +E+P + FS + I Sbjct: 199 IADVGGGTLAHIEHPEQAVEQFSRLFGRIQ 228 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 85/222 (38%), Gaps = 23/222 (10%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 F KG I+ FLPI+ + + + + + + + + + Sbjct: 4 FTKQKGVAGIIFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLALIASPGKDNKD 63 Query: 72 NN--RKKLKGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + +++ I I + ++S + +G V +++ +V +++ + Sbjct: 64 DQDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSW- 122 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME--S 186 IS +I ++ + + +R + +D+ +LD S+SM Sbjct: 123 ---ISHNEIGVEPKFKVSGDSITRKYL-------------PQPVDIYFILDTSQSMSNPW 166 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPD-VNNVVQSGLVTFS 227 + + + T++ + +I +++E++ + + L+T++ Sbjct: 167 YGERNKTQMQVVKDTITRVVKELENFKTGPDKKSRVALLTYN 208 >gi|113475854|ref|YP_721915.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166902|gb|ABG51442.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 460 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/183 (19%), Positives = 68/183 (37%), Gaps = 17/183 (9%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F S+ + P D++I+LD+S SM+ S + Sbjct: 84 FSVETISEFGSKKTIKPTVILPTDTRAKTTPADIIIMLDMSGSMKFRDSSPGRRRIKFKG 143 Query: 201 SINAMLEEVKLIPDV-NNVVQSGLVTF----------SNKIEEFFLLEWGVSHLQRKIKY 249 +INA+ + + D N V+ GL F + ++ F L+ KI+ Sbjct: 144 AINAIYKFIDAANDKPNLTVRIGLAPFGKGGNQFKVSNKSLDANFYPS-NSEKLKEKIEE 202 Query: 250 L--SKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYKK--IIVFMTDGENLSTKE 304 L + STN L+ A + + N D + K +++ ++DG + ++ Sbjct: 203 LANQELSASTNLYQPLETAVKYLINSVNSTSDSNNKTDDSQSKQLVVIVLSDGFHNHDRD 262 Query: 305 DQQ 307 ++ Sbjct: 263 TEE 265 >gi|301767170|ref|XP_002919035.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Ailuropoda melanoleuca] Length = 891 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 79/193 (40%), Gaps = 20/193 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIE 231 ++ V+DVS SM K++ ++ +L+++K +N ++ SG VT + + + Sbjct: 284 SVVFVIDVSGSMHG------RKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLV 337 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + +K + G+ TN GL A N + + +H E + I+ Sbjct: 338 QATPE--NIQEARTFVKNIHDQGM-TNINDGLMRAINMLNKARE--EHRVPERSTS--IV 390 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----F 345 + +TDG+ N+ ++ A +Y +G ++ FL + A N Sbjct: 391 IMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRI 450 Query: 346 YLVENPHSMYDAF 358 Y + + F Sbjct: 451 YEDSDANLQLQGF 463 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 75/171 (43%), Gaps = 16/171 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + L ++I+LD SRSM + D ++ID + + + A+L+ + Q G++ Sbjct: 81 ETLKRSGLQVLILLDGSRSM-AAQDVRPSRIDASKRMVLALLDRL-------EGNQVGML 132 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F + F L ++ + ++ ++ G+S T ++ +G+R Sbjct: 133 MFGSSSYVQFPLTSDLAAARSLVEPINPRGLSLGGTD-----VEEVIT-EGLRSFPI--G 184 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + ++ +TDG + + D +++ EA K G ++ IG+ + Sbjct: 185 QIEGRTMILITDGGDSDEQSDGEAVAAAREAAKMGLTIHTIGMATEAGGQI 235 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 74/219 (33%), Gaps = 49/219 (22%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 TS+ T + +VLDVS SM + +++ A ++ N +L+ V Sbjct: 25 TSATADEPVTKESPKVELVLDVSGSMRAKDIDGKSRMSAAKQAFNEVLDAVP------EE 78 Query: 219 VQSGLVTF---------------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 V+ G+ T + ++ L+ + + + L+ G T P L Sbjct: 79 VRLGIRTLGADYPGQDRKQGCKDTRQLYPVGPLD--RTEAKTAVASLAPTGW-TPIGPAL 135 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGA 320 A + R+ IV +TDGE+ D C A++ +G Sbjct: 136 LGAAKDLEGGDATRR------------IVLITDGEDTCAPLD-----PCQVAREIAAKGI 178 Query: 321 I--VYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSM 354 + +G+ C A+ ++ V + Sbjct: 179 HLVIDTLGLVPDAKTRQQLTCIAEATGGTYTSVHRTEDL 217 >gi|222612967|gb|EEE51099.1| hypothetical protein OsJ_31818 [Oryza sativa Japonica Group] Length = 600 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 59/139 (42%), Gaps = 26/139 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A LD++ VLDVS SM K+ + +++ +++ + + +++FS Sbjct: 154 QRAPLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPGD------RLCVISFS 201 Query: 228 NKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + L G +H +R + LS G TN L+ A ++ D + R + Sbjct: 202 SGASRLMRLSRMTDAGKAHAKRAVGSLSARG-GTNIGAALRKA-AKVLDDRLYRNAVES- 258 Query: 284 DANYKKIIVFMTDGENLST 302 ++ ++DG++ T Sbjct: 259 -------VILLSDGQDTYT 270 >gi|296454916|ref|YP_003662060.1| putative von Willebrand factor type A domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296184348|gb|ADH01230.1| putative von Willebrand factor type A domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 1242 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 91/276 (32%), Gaps = 38/276 (13%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN-------ELRDNGFVNDIDDIVR 112 A ++++N + + + + WN + D D R Sbjct: 379 SATSKVVNMLSDTDTSEQVKFTLSASENTNWNDYNVSFTCDGDCGEYDATPQIDTATGNR 438 Query: 113 STSLDIVVVPQ-NEGYSISAISRYKIPLKFCTF--------IPWYTNSRHIVMPITSSVK 163 ++ + + + ++ + P + +S + + + + Sbjct: 439 YVTVSVASEKSAHGKWVVTKKVKLDTPKIQKSISKDEVDSAATGQKDSYTLSLNVKGNTV 498 Query: 164 VNS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI--NAMLEEVKLIPDVNNV 218 + A++D+ VLD S SM + T++ +I N L V Sbjct: 499 SGTTTSGEKAKIDVAFVLDTSGSMNDKV-GNSTRLKNMQNAITDNGGLSSVLFNSPDKID 557 Query: 219 VQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 Q+ ++TF++ ++ +L + L + L+ G + ++ Sbjct: 558 AQAHVITFASGLGLDGTSVLST-KADLDEVVNGLTANGATH-------------WEKGLE 603 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 R + K +VF+TDG+ + + ++Y C Sbjct: 604 RVSNISTRPGATKYVVFLTDGDPGNKGWKETNVYSC 639 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 44/118 (37%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 +G + I+ AI +PI+ L L I++ H+ +++ L + D + + A + N Sbjct: 11 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 70 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + +K G + T N + G + + V V +E Sbjct: 71 SLATGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQANAVQVQLSESV 128 >gi|157273368|gb|ABV27267.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 324 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 77/226 (34%), Gaps = 41/226 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + ++ S+ +A + + V+D S SM I + ++ + Sbjct: 82 VEQQIEYFSRDEAPVSLGFVVDTSGSMRPRRAKVIEAVKFLARA----------AKPGDE 131 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 LV F NK E + ++ + + +G T ++ Sbjct: 132 FF---LVDFKNKAELAEEFTPRPADIEEAVDNIV-WGGGTALLDAIQL----------SA 177 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---------- 327 ++ + E N +K IV +DG++ + D++ ++ VY +G Sbjct: 178 EYADKEGKNRRKAIVVFSDGDDRDSYYDRR--QLIKLLQEYQVQVYIVGFPDDDDDGGLF 235 Query: 328 ---RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 R+ + ++ A + + ++ + + I D+ T+ Sbjct: 236 GRSTRKRAVQLIKDIANETGGRAFFPKSVDELPEIVRTINADLRTQ 281 >gi|156364713|ref|XP_001626490.1| predicted protein [Nematostella vectensis] gi|156213368|gb|EDO34390.1| predicted protein [Nematostella vectensis] Length = 1043 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 70/205 (34%), Gaps = 28/205 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D+ I+LD S ++ S + K + + + + + TFS Sbjct: 89 AAAIDLAILLDASEAISPQEWSKLLKFTA---------DLMDYYGISEDGTRISVATFST 139 Query: 229 KIE------EFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ EF +E + ++R I G LK A +F+ Sbjct: 140 DVDIVLSFNEFSGVEMNAASVKRGILGAKQSRGPGLRIDKALKAADKDLFN------RRF 193 Query: 282 TEDANYKKIIVFMTDG---ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + KK+ + +T G ++ ++ A+ G +YA+G+ LR Sbjct: 194 GMREDQKKVCLLVTSGAQTKDQGPSTQLGTVTALLSAR--GVDIYAVGVGDGVDSSELRN 251 Query: 339 CASPNSF-YLVENPHSMYDAFSHIG 362 AS F Y + + G Sbjct: 252 IASTEDFIYTASSFEEINKVLEPFG 276 Score = 40.1 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 65/190 (34%), Gaps = 17/190 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ ++D S S+ S K D + + ++ + N + Sbjct: 474 PIDVAFIMDNSESVSS------EKYDFVKRFVKDVIMSYADAENSANFAIGQYAKYFQTG 527 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVST--NSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F + L I L + S N GL+ A N+ F ++ N Sbjct: 528 TKRFRNFRSMEELDEVINSLRQMSSSAERNVGAGLRGAANEFFQVK------NGMRQGLP 581 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-PNSFYL 347 + ++ + N S + N K+ + A+G + FLR AS P+ + Sbjct: 582 RFLIVLA-SANPSASSEAIESAVVNLDKENVRRI-AVGFTEDATPGFLRMLASDPSLMFR 639 Query: 348 VENPHSMYDA 357 V+ P + Sbjct: 640 VDEPKKLDKV 649 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 70/197 (35%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + +++ A ++ L+ V G+V F + Sbjct: 307 VCLVLDKSGSMT--VGGRLKRLNQA--------GKLFLLQTVEQGAWVGMVAFDSAAYVK 356 Query: 234 FLLE--WGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + L T+ GL+ A+ I +++ Sbjct: 357 SELVQINSAAERDALARSLPTAASGGTSICSGLRSAFTVI-----KKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C E K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISACFPEVKQNGAIIHTVALGPSAAKELEELSQMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAENNGLIDAFGALS 476 >gi|329744607|ref|NP_001193278.1| inter-alpha-trypsin inhibitor heavy chain H3 [Sus scrofa] Length = 889 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 79/192 (41%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 ++ V+DVS SM K++ ++ +L+++K +N V+ SG + T+ + + + Sbjct: 285 VVFVIDVSGSMYG------RKMEQTRDALLKILDDIKEDDYLNFVLFSGDVTTWKDSLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ + G+ TN GL + + + +H E + II+ Sbjct: 339 ATPE--NIQKAREFVRNIRDQGM-TNINDGLLTGISMLNKARE--EHKVPERSTS--III 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N+ + ++ A +Y +G ++ FL + A N Y Sbjct: 392 MLTDGDANMGVSKPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIY 451 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 452 EDSDANLQLQGF 463 >gi|308068884|ref|YP_003870489.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858163|gb|ADM69951.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 432 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 74/211 (35%), Gaps = 33/211 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 D+++V+D S SM+ D ++ A I M + + +V+F + Sbjct: 118 DLVLVIDNSGSMQQ-TDPDNERLTAAKSLIGQM----------DGDKRVAIVSFDSTAQL 166 Query: 230 IEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ F + + KI + ++ A ++ + + Sbjct: 167 VQPFTPIRTDAEKQAVYSKIDSMQTIMSG---GTEIRLALDETIKEIETQGNAEKGSL-- 221 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACA--SPN 343 ++ ++DG + + + Y + V IG+++ S L+ A + Sbjct: 222 ---VIMLSDGFSELDTQTALAPYIARQ-----IPVNTIGLKLAESDGIALLQNIADLTGG 273 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 ++ V N + AF I I + + ++ Sbjct: 274 TYSNVANAQGLTQAFGKIYNKIGDRTLVTER 304 >gi|293597548|ref|NP_001170793.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Gallus gallus] Length = 1082 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 76/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S+ MLE + VN +V+F Sbjct: 248 GAASPKDMLILVDASGSVSGLT------LKLIRTSVIEMLETLSDDDFVN------VVSF 295 Query: 227 SNKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +N + L+ + +S G+ T+ G YA+ Q+ + R + Sbjct: 296 NNNAQNVSCFNHLVQANVRNKKKLKEAVDKISAKGI-TDYKKGFSYAFEQLLNHSVSRAN 354 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ ++ + + V+ + + Sbjct: 355 CN-------KIIMLFTDG------GEERAQEIFHKYNEDKKVRVFTFSVGQHNYDKGPIQ 401 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 402 WMACENKGYYYEIPSIGAIR 421 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 76/237 (32%), Gaps = 41/237 (17%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + I + ++ + L IVLD S SM + D T++D A Sbjct: 59 PFVLMMGILGLAIIGLANPQIPTLSVENGINLS--IVLDGSESM-AATDYEPTRLDAAKN 115 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 +IN ++ +K+ P N G+V F + L I + + +T Sbjct: 116 AINNLI--LKMGPQHN----VGVVLFESGATTVSYLTPDKEKSVNAISSIEQGLGATAIG 169 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 GL + + + K +++ ++DG + AK Sbjct: 170 DGLALGVDMASSI-----------PDKKGVVILLSDG--VHNSGLVTPEEATEYAKINNV 216 Query: 321 IVYAIGIRVIR-----------------SHEFLRACA--SPNSFYLVENPHSMYDAF 358 ++ IG+ I E L A + ++Y + ++ + F Sbjct: 217 QIHTIGLGSIEPVFLRDDIYGEPQYAELDEETLVIIAQQTSGNYYKSLDEQTLNEIF 273 >gi|309789715|ref|ZP_07684295.1| magnesium chelatase [Oscillochloris trichoides DG6] gi|308228201|gb|EFO81849.1| magnesium chelatase [Oscillochloris trichoides DG6] Length = 603 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 62/188 (32%), Gaps = 26/188 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM + ++ ++ ++L Q GLV+F Sbjct: 421 VCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYARV 470 Query: 234 -FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L V QR+++ + G T G+ AY + + +D ++V Sbjct: 471 LLPLTNSVELAQRRLQTMPT-GGKTPLARGMLTAY-------ELLERARRQDHEVVPLMV 522 Query: 293 FMTDGENL----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSF 345 +TDG+ + Q++ + R I L S + Sbjct: 523 LLTDGQANVAIGNAPPQQEAYAIADLIAARDIRAIVIDTEHPNFERGLSRRLSEHLKGRY 582 Query: 346 YLVENPHS 353 Y +E+ H Sbjct: 583 YRLEDLHD 590 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 44/118 (37%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 +G + I+ AI +PI+ L L I++ H+ +++ L + D + + A + N Sbjct: 17 RAGRGERGDIAIIAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQ 76 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + +K G + T N + G + + V V +E Sbjct: 77 SLATGNAVKNGLQTSSTQITVTPGNWNPQIETGPSYFSAAVPYGHQANAVQVQLSESV 134 >gi|327273523|ref|XP_003221530.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Anolis carolinensis] Length = 1091 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 72/200 (36%), Gaps = 35/200 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S+ MLE + VN +V+F Sbjct: 253 GAASPKDMLILVDASGSVSGLT------LKLIRTSVIEMLETLSDDDFVN------VVSF 300 Query: 227 SNKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + L+ + + G+ T+ G YA+ Q+ + Sbjct: 301 NENAQNVSCFNHLVQANVRNKKKLKEAVYKIQAKGI-TDYKKGFSYAFEQLLNHNHSVFR 359 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIR---VIRSHEF 335 N KII+ TDG ++++ ++ ++ V+ + + Sbjct: 360 ANCN-----KIIMLFTDG------GEEKAQEIFDKYNVEKKVRVFTFSVGQHNYDKGPIQ 408 Query: 336 LRACASPNSFYLVENPHSMY 355 ACA+ +Y + + ++ Sbjct: 409 WMACANKGYYYEIPSIGAIR 428 >gi|260789950|ref|XP_002590007.1| hypothetical protein BRAFLDRAFT_224774 [Branchiostoma floridae] gi|229275194|gb|EEN46018.1| hypothetical protein BRAFLDRAFT_224774 [Branchiostoma floridae] Length = 142 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 61/161 (37%), Gaps = 24/161 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +LD S ESF S I+S+ N + G+VT+SN Sbjct: 1 VDLIFLLDSS---ESFRTSGFEDAKTFIQSVVNYFTL------GENDTRVGVVTYSNADA 51 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ ++ + +I+ L T + GL + N F N N Sbjct: 52 QITRVKLNENYTRVELLTEIRNLPYDRGHTFTGLGLDHVRNNSFL------EVNGRRNNT 105 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +++ +TD E + + ++ G V+ +G+ Sbjct: 106 PDVLIVLTDDE-----SEDDVILPAQLTRQMGIKVFVVGVG 141 >gi|198414926|ref|XP_002130077.1| PREDICTED: similar to Collagen alpha-1(XIV) chain precursor (Undulin) [Ciona intestinalis] Length = 215 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 66/227 (29%), Gaps = 31/227 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + K ++D + LD S S + + +++ L P Sbjct: 9 LATIFVGLAKAQCPNFGKMDFVFALDSSTS------TGAENWAKVQNLVTSLISHFSLAP 62 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV-----SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + F+ + + + L +G + L++A + Sbjct: 63 ---EKARFSVFRFNRIPHRRSEIVLNSFIDDKPGMITAVYNLPYYGSGSYIGRALQHAKS 119 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N A YK I+V +TDG + L N + + YAIG++ Sbjct: 120 VSLSES------NGNRAEYKDIVVTVTDG-----RTYDDMLTPSNSLRDSNVLTYAIGVQ 168 Query: 329 VIRSH-----EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L S + +L N A+ I + Sbjct: 169 PTNGRGTRNDKLLEIAGSEQNLFL-TNVIGFNGAYDGIIAALERDLC 214 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 63/195 (32%), Gaps = 32/195 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF- 233 M+V D S SM + T+ID A + + E + V G + + E Sbjct: 1 MVVFDSSGSMITNDAGGQTRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAG 60 Query: 234 --------FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + L G T L+ A ++ Sbjct: 61 CQDITVVTPPEAGNSEKMIAHMDGLQPRGF-TPIGESLRKAAAEL-------------PK 106 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRAC--- 339 ++ I+ ++DG T E K++G ++ +G V + + L+ C Sbjct: 107 EGQRSIILVSDGVATCTPP--PVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQ-CIAD 163 Query: 340 ASPNSFYLVENPHSM 354 A+ ++ + S+ Sbjct: 164 ATGGTYANASDADSL 178 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 62/168 (36%), Gaps = 31/168 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSIT-------KIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + V D S SM + + T ++ + M+ ++K I +V+ Sbjct: 192 EGQMNVAISFVFDKSGSMSWDLNGNNTNYWGPKSRMSILQDKATIMMRDLKDIGNVS--- 248 Query: 220 QSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 LV+FS ++F L+ G + ++ I L + G TN GL+Y + + Sbjct: 249 -VNLVSFSILGSYVQKDFSELDKGTTTIEASINAL-QTGGVTNPGDGLRYGMMSLQNHSA 306 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K +V +TDG + D + + R Y Sbjct: 307 QL-----------KYVVLLTDGIPNAYTVDTNDTSW----RNRNVHPY 339 >gi|332237899|ref|XP_003268144.1| PREDICTED: integrin alpha-10 isoform 2 [Nomascus leucogenys] Length = 1036 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 83/236 (35%), Gaps = 38/236 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + S + +D +D++IVLD S S+ + +++ ++ + L Sbjct: 10 GHQKLNLDTVSYNMLGVDSDGCPTYMDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL 64 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGL 263 I +Q GLV + + L + ++ + G T + + Sbjct: 65 ----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAI 118 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + F + +++V +TDGE+ ++ +L C + Y Sbjct: 119 MVACTEGFSQS------HGGRPEAARLLVVVTDGESHDGEQLPAALKACEAGR---VTRY 169 Query: 324 AIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 I + R FLR + P+ F+ V + ++ D +G I Sbjct: 170 GIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGDRIFG 225 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 82/250 (32%), Gaps = 31/250 (12%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 I W +++ P + + + +VLD S SM S + Sbjct: 74 IAWMDPHFYLLKPDRGERARDHTPHEGIAIYLVLDQSGSMAQKIPLSSDEKGRISVPKID 133 Query: 205 MLEEVKLIPDVNNVVQ---------SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS---- 251 + E+ + + GL+ F+ + L + ++ S Sbjct: 134 FIVEITKDFVKGDPTKGLKGLHNDLVGLIGFARTAQVLSPLTLDHQAIIDQLNKFSIVKH 193 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK--IIVFMTDG---ENLSTKEDQ 306 + T+ + N I + + K I++ +TDG N +EDQ Sbjct: 194 QDEDGTSIGYAIFKTANLIASTKHFAEELKEASPYTIKNSIMLIVTDGFQDPNPLDREDQ 253 Query: 307 -QSLYY---CNEAKKRGAIVYAIGIRV-------IRSHEFLRACA--SPNSFYLVENPHS 353 +S+ AK++G VY I + ++ + FYL+++ Sbjct: 254 YRSIELEDAAKYAKEQGVRVYIINVEPRIASEEFGSERRVMQKVTEITGGKFYLLDHIEE 313 Query: 354 MYDAFSHIGK 363 + + ++ I K Sbjct: 314 LKNIYADIDK 323 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 92/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 35 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 87 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + + ++ +V + N I + + + +++ G++ Sbjct: 88 AVNMLNITDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 141 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 142 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 191 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 192 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 251 Query: 371 WYDK 374 DK Sbjct: 252 TGDK 255 >gi|302036308|ref|YP_003796630.1| hypothetical protein NIDE0941 [Candidatus Nitrospira defluvii] gi|300604372|emb|CBK40704.1| conserved exported protein of unknown function, contains von Willebrand factor, type A and vault protein inter-alpha-trypsin domain [Candidatus Nitrospira defluvii] Length = 712 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 79/224 (35%), Gaps = 34/224 (15%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 S ++M + + + D+ ++D S SM I+ A S+ A L + Sbjct: 333 TSYAMLMLVPPTQHRETTARVPRDITFIIDRSGSMAG------ASIEQAKGSLTAALSRL 386 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS--KFGVSTNSTPGLKY 265 + ++ F++ + F + +Q+ I+Y T P L+ Sbjct: 387 TTQD------RFNIIQFNHTVRSLFPIPQPVTTKSMQQAIRYTEHLAADGGTEILPALRQ 440 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A D ++Q I+ +TDG+ +++ L+ + ++ I Sbjct: 441 ALKSPQDSARLQQ------------IILITDGQ----VGNEEELFELLHQRVGSRRLFTI 484 Query: 326 GIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVT 367 GI + +R A +F + N + + D + + + Sbjct: 485 GIGSTPNSHLMRKAAETGRGTFTYIGNVNEVKDKLDGLFRKLEH 528 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 81/262 (30%), Gaps = 32/262 (12%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 + + +S++ + I + T + + + + +S + Sbjct: 29 VQCSLAHMENTSRFSLTPLLTALIAVLGTTALAVVSGLLPVASAEEETNAPSSSSSTMAP 88 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M+V D S SM + T+ID A + + E + V G + + E Sbjct: 89 TMVVFDSSGSMITNDAGGQTRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAA 148 Query: 234 ---------FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L G T L+ A ++ Sbjct: 149 GCQDITVVTPPEAGNSEKMIAHMDGLQPRGF-TPIGESLRKAAAEL-------------P 194 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRAC-- 339 ++ I+ ++DG T E K++G ++ +G V + + L+ C Sbjct: 195 KEGQRSIILVSDGVATCTPP--PVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQ-CIA 251 Query: 340 -ASPNSFYLVENPHSMYDAFSH 360 A+ ++ + S+ + Sbjct: 252 DATGGTYANASDADSLAKELNR 273 >gi|224171915|ref|XP_002198669.1| PREDICTED: similar to collagen, type XX, alpha 1, partial [Taeniopygia guttata] Length = 152 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 13/122 (10%) Query: 223 LVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L +S+ + L + + ++ L G +T + L + Q + Sbjct: 1 LSQYSSDPRTEWELSTYSTREQVLEAVRNLRYKGGNTFTGLALTHVLEQ------NLKPD 54 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +K+++ +TDG K + K G ++AIG++ E + + Sbjct: 55 AGARLEAEKLVILLTDG-----KSQDDANLAAQTLKNLGIEIFAIGVKNADEAELRQVAS 109 Query: 341 SP 342 P Sbjct: 110 EP 111 >gi|114594052|ref|XP_001144775.1| PREDICTED: anthrax toxin receptor 2 isoform 1 [Pan troglodytes] Length = 386 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 76/224 (33%), Gaps = 34/224 (15%) Query: 148 YTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + +V+ + + D+ VLD S S+ + + I Sbjct: 16 FPGLWLLVLSGPGGLLRAQEQPSCRRAFDLYFVLDKSGSVANNW-----------IEIYN 64 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 ++++ V+ ++ + FS++ L + + ++ L + T G Sbjct: 65 FVQQLAERF-VSPEMRLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEG 123 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGA 320 LK A QI G++ II+ +TDG + L ++ ++ GA Sbjct: 124 LKLANEQIQKAGGLKTSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGA 171 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 VY +G+ + R S + V+ + A I Sbjct: 172 SVYCVGVLDFEQAQLERIADSKEQVFPVKGG---FQALKGIINS 212 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/272 (15%), Positives = 90/272 (33%), Gaps = 36/272 (13%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS-AISRYKIPLKFCTFIPWYTNSRHI 154 E + + + + T + ++ I +K + + +S Sbjct: 226 KEQTTSASIMYMQALPSVTQFCDMSNHNVRAPNLQNKICNFKSTWEVIMDSTDFAHSSSR 285 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P Q R+ + +VLDVS SM +I ++ + + + Sbjct: 286 STPPPDPTISLLQMQERI-VCLVLDVSGSMSGN-----NRIARLKQAAETFI-----LQN 334 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH-LQRKIKYLSKF--GVSTNSTPGLKYAYNQIF 271 + + G+VTF+N L+ VS +++ + TN G++ + Sbjct: 335 IEDGSWVGIVTFNNAATIQTGLQQVVSDTVRKTLNGYLPISANGGTNICAGVQKGFQVFS 394 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI 330 + C IV +TDGE+ C E ++ G++++ I + Sbjct: 395 SKYASTEGCE---------IVLLTDGEDSGLSS-------CFAEVQRSGSVIHTIALGPS 438 Query: 331 RSHEFLRAC-ASPNSFYLVE---NPHSMYDAF 358 + E + + + S+ DAF Sbjct: 439 AAKELEMLADMTGGLKFSATDSVDSSSLEDAF 470 >gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 747 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 73/187 (39%), Gaps = 29/187 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + + D+++V+D S SM S I A K++ L + N Sbjct: 329 EIGASSVIARDLILVIDTSGSM------SGEAIVQAKKAMGYALAGLGARDSFN------ 376 Query: 223 LVTFSNKI-----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ F++ + + + + I+ L G T P L A + + Sbjct: 377 VIAFNSDVHALSAQSLAATAKNIGRANQFIRTLKADG-GTEMGPALTRA---LDNGNHST 432 Query: 278 QHCNTEDANYK----KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 H + ED + K ++FMTDG +++SL+ E K + ++ IGI + Sbjct: 433 SHQDEEDFDSDGVRLKQVLFMTDG----AVANERSLFNLIEDKIGHSRLFTIGIGAAPNS 488 Query: 334 EFLRACA 340 F+ A Sbjct: 489 HFMERAA 495 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 67/203 (33%), Gaps = 35/203 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVV------QSGLVTFS 227 +VLD S SME ++D+ S+ +++ +PD V Q L+ +S Sbjct: 334 ALVLDTSGSMEG------ERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYS 387 Query: 228 NKIEEFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ ++ L +++ L G T + + A++++ G Sbjct: 388 SEPQQPTRARVDKDKPATTKELADRVERLVADG-DTATFEAVLNAFDEVDTSGGDIG--- 443 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRAC 339 +V MTDGE + Q + + V+ I E Sbjct: 444 --------TVVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILYGEANIQEMEELA 495 Query: 340 A-SPNSFYLVENPHSMYDAFSHI 361 + + N + AF I Sbjct: 496 QLTGGKTFDALN-GDLAAAFEEI 517 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 92/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A ++ Sbjct: 35 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREAAIL 87 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + + ++ +V + N I + + + +++ G++ Sbjct: 88 AVNMLNITDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 141 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 142 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 191 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 192 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 251 Query: 371 WYDK 374 DK Sbjct: 252 TGDK 255 >gi|149187053|ref|ZP_01865360.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21] gi|148829342|gb|EDL47786.1| hypothetical protein ED21_31369 [Erythrobacter sp. SD-21] Length = 697 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/229 (16%), Positives = 77/229 (33%), Gaps = 39/229 (17%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 +++ +T + +++ V+D S SM S + A KS+ L + Sbjct: 294 GKEYVMATVTPPAAAKVEKLPPRELIFVIDNSGSM------SGESMRAASKSLVYALSTL 347 Query: 210 KLIPDVNNVVQSGLVTFSNKI-----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + ++ F + + + ++ +R + L G T+ P L+ Sbjct: 348 RPED------RFNIIRFDHSMTMLHPDAVAADRTNLAKARRYAESLRGQG-GTDMLPALR 400 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A D + I+F+TDG + +E + + R V+ Sbjct: 401 AALRD-----------RDPDGKRLRQIIFLTDGNLSNEREMMSEISIA-LGRSR---VFM 445 Query: 325 IGIRVIRSHEFLRACASPN----SFY--LVENPHSMYDAFSHIGKDIVT 367 +GI + +R A + E M + + K +VT Sbjct: 446 VGIGSAPNSHLMRRMAEAGRGTFTHVGQDAEAVSEMRRMLNRLAKPVVT 494 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/279 (12%), Positives = 89/279 (31%), Gaps = 32/279 (11%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + V + + ++ + ++ + +F P + Sbjct: 295 SDATADQKAAASVFYDFLLTPESQ-NLAMQQGFRPANVDVALASPLTAQF-GVDPNQPRN 352 Query: 152 RHIVMP---ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 P I ++ + ++M+V+D S SM K+D A + L Sbjct: 353 SLATPPADVIVAAKNAWANNRKPANIMLVVDSSGSMR-----DDDKMDQAKLGVEVFLNR 407 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + + G++ FS+ L +++LQ + + L G +T+ + Sbjct: 408 LPSKDN------VGMIGFSSSPAVLVPLATRSENMANLQMQTQGLVPDG-NTSLYDAIDL 460 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A ++ +++ + IV ++DG T + ++ I Sbjct: 461 ARQELENLKQPDRINA---------IVVLSDG--ADTASQLSIDQMLGNFGESSIQIFPI 509 Query: 326 GIRVIRSHEFLRACASPNSFYLVE-NPHSMYDAFSHIGK 363 L+ A + LV+ + + F ++ + Sbjct: 510 AYGADAETSILQQIADFSRTELVQGSTGDIDKIFENLSR 548 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 76/219 (34%), Gaps = 45/219 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------- 226 + +VLDVS SM + +++ A ++ N +L+ V +V +++ + Sbjct: 41 VELVLDVSGSMRTRDIDGQSRMSAAKQAFNDVLDAVP--EEVQLGIRTLGANYPGDDRKV 98 Query: 227 ----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ L+ + + + L+ G T P L A + + R+ Sbjct: 99 GCKDTKQLYPVGPLD--RTEAKTAVATLAPTGW-TPIGPALLGAADDLDGGDATRR---- 151 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAI--VYAIGI--RVIRSHEF 335 IV +TDGE+ D C A+ RG + +G+ + Sbjct: 152 --------IVLITDGEDTCGPLD-----PCEVARDIAARGIHLVIDTLGLVPNAKIRQQL 198 Query: 336 LRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 C A+ ++ V++ + + I Sbjct: 199 --TCIAEATGGTYTAVQHADELSGRVKQLVDRAAEPTIT 235 >gi|90020471|ref|YP_526298.1| arginine biosynthesis bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase [Saccharophagus degradans 2-40] gi|89950071|gb|ABD80086.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 708 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/288 (13%), Positives = 96/288 (33%), Gaps = 29/288 (10%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRS 113 V T + GN++ + + + + +F ++ + + + Sbjct: 224 VDGDTIVAPAPQGNDKFEHVEENSVKSVAEAPVSTFSIDVDTASYSFVRRQLNSGYLPEK 283 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++ + Y+ S P K PW + + + + + Sbjct: 284 DAIRAEELINYFDYNYPLPSDSTAPFKPNITVIDSPWAKGKKLVHIGLKG-YDIAPDQKP 342 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R +++ +LDVS SM S K+ + +S+ +L + V VV +G + Sbjct: 343 RTNLVFLLDVSGSMNS-----QDKLPLVKQSMEMLLSTLNPDDTVAIVVYAGAAG---TV 394 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E + + ++ L G + AY + + ++ N Sbjct: 395 LEPTPAKD-KQKILSAMQRLQAGGSTAGGAGI-ALAY-DLAEANFDKKAVNR-------- 443 Query: 291 IVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ + ++L + +++G + +G ++ L Sbjct: 444 VILATDGDFNVGSTNNETLQGFVERKREKGIFLSVLGFGQGNYNDHLM 491 >gi|15837128|ref|NP_297816.1| hypothetical protein XF0526 [Xylella fastidiosa 9a5c] gi|9105381|gb|AAF83336.1|AE003900_15 hypothetical protein XF_0526 [Xylella fastidiosa 9a5c] Length = 941 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 56/184 (30%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM+ + ++++ ++NA+L+++ Sbjct: 140 PEKAAIGMQMAPCVAVYFAIDLSGSMDYVGGNGRSRLENMKTALNAVLDQLGQTIASGAA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V LV F + + L G++ L+ + F T P A Sbjct: 200 VDILLVGFGDAPDHRQTLLRRNCTAQGIAELKSWVSARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N ++ F+TDG + + Y I I Sbjct: 259 ---------------APPNAVRVAFFVTDGVPDPPSATNAQAARADVDQVAHLRCYGITI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAD 307 >gi|179762|gb|AAA51903.1| calcium channel alpha-2b subunit [Homo sapiens] Length = 1091 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 79/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ N+ K + V+ + R Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFNKYNKDKKVRVFRFSVGQHNYERGPIQ 400 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 401 WMACENKGYYYEIPSIGAIR 420 >gi|238060066|ref|ZP_04604775.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881877|gb|EEP70705.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 316 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 64/208 (30%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM++ D + +++ A ++ + E+ + GLV+F+ Sbjct: 89 VMLAIDVSLSMQAD-DVAPNRLEAAQEAAKQFVAELPQTYN------LGLVSFAKSANVL 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L +T + + I + IV Sbjct: 142 VPPTKDRDAVTTAIDGLV-LAEATATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVL 195 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----EFLR----------- 337 ++DG S + +++ A V I + R Sbjct: 196 LSDGFRTSGRSVEEAAAAAQAA---NVPVSTIAFGTDAGQVDIGGQLQRVPVDRMALAEL 252 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDI 365 A + FY + + + +G I Sbjct: 253 AETTEGYFYEAASVSELKQVYQDMGSSI 280 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 58/437 (13%), Positives = 122/437 (27%), Gaps = 100/437 (22%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +RN + +G IL A+ +P +F + + + + K + + + + + Sbjct: 1 MRNLRKHQQGHAAILFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDP 60 Query: 68 EG--NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + NG+ + I+ N + + I + + Sbjct: 61 DQPDNGSYTPSTRNRQIVVDYVNAYISDVDAVTDIKVAKRRCELIPECVAGLYDGDMRYL 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLDV 180 + I +R W+ + I + +D M D Sbjct: 121 EHEIDVTTR---------QNSWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDAMFAADF 171 Query: 181 SRSM--ESFFDSSITKIDM--AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 S SM S+ +D+ I++I+A L++ +P+ N G+ FS F Sbjct: 172 SGSMLDTWSGSSNPKYVDLIEIIRNISAELQKFNDLPENRNKSTMGISAFSTFTNSFTSD 231 Query: 237 E--------------------------------WGVS----------------HLQRKIK 248 W +L Sbjct: 232 TGIQCSLSQGVNGRNGPATWFRPVKAANTVANIWNPKTEDYCKSGAYAGFHDVNLTSNFN 291 Query: 249 YLS------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 YL+ G T S L + R ++++ ++DG + T Sbjct: 292 YLNGQVGSFYAGGGTASYQALIRGAQLLRKGNNSR-----------RLLIVLSDGMDNDT 340 Query: 303 K--EDQQSLYYCNEAKKR------------GAIVYAIGIR-VIRSHEFLRACASPNSFYL 347 + + S C + + A + IG +++ L+ C + Y Sbjct: 341 QLADGLVSAGMCRDIQNGLESDRTPDRRPIAAKMAVIGFDYNPFANKALKDCVGEKNVYK 400 Query: 348 VENPHSMYDAFSHIGKD 364 E+ + D + + Sbjct: 401 AEDADEVEDIILELINE 417 >gi|223670958|dbj|BAH22726.1| complement factor B precursor [Nematostella vectensis] Length = 708 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 76/226 (33%), Gaps = 28/226 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + S + + A LD++ V D S S+ D+ I A I Sbjct: 226 MHFINKLELLTTDSRVRSGLSSGA---AGLDLVFVFDSSASVGE--DNFRKGIQFARTII 280 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTN 258 + + P + ++ FS+ + F L+ R+++ L G T Sbjct: 281 DEF--GISATPSG---TRVAVIVFSDAAQVIFNLKSNRIVDKEEAVRRLENLQFQGGGTA 335 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + L+ + + +R N KK + +TDG++ + Sbjct: 336 TKLALQAVIDTV--NPELRN-------NSKKALFLITDGKSNKGGSPDRPAKVLRAGFN- 385 Query: 319 GAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHI 361 ++AIG+ + L++ AS Y +++ ++ I Sbjct: 386 -FEIFAIGVSDSVDKDELKSIASEPFRTHVYQIKDYATLVKLKELI 430 >gi|332237901|ref|XP_003268145.1| PREDICTED: integrin alpha-10 isoform 3 [Nomascus leucogenys] Length = 1032 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 76/211 (36%), Gaps = 35/211 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++IVLD S S+ + +++ ++ + L I +Q GLV + Sbjct: 31 MDVVIVLDGSNSIYPW-----SEVQTFLRRLVGKL----FIDPEQ--IQVGLVQYGESPV 79 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + ++ + G T + + A + F + Sbjct: 80 HEWSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQS------HGGRPEAA 133 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEFLR----A 338 +++V +TDGE+ ++ +L C + Y I + R FLR Sbjct: 134 RLLVVVTDGESHDGEQLPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 190 Query: 339 CASPNS--FYLVENPHSMYDAFSHIGKDIVT 367 + P+ F+ V + ++ D +G I Sbjct: 191 ASDPDERFFFNVTDEAALTDIVDALGDRIFG 221 >gi|224090451|ref|XP_002195054.1| PREDICTED: similar to Integrin alpha-2 [Taeniopygia guttata] Length = 1178 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 74/210 (35%), Gaps = 35/210 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + + Q GL+ ++N Sbjct: 168 IDVVVVCDESNSIYPW--------DAVRAFLKKFVQGLDIGLNK---TQVGLIQYANDPR 216 Query: 232 EFFLL-EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ K K G TN+ + A F + + Sbjct: 217 VVFNLNTYQTKDEVVKAMEKTFQKGGDLTNTFKAIDNARQYAFSAE------SGGRPTAT 270 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI------RSHEFLRACA-- 340 K++V +TDGE+ + + CN + + I + + ++ Sbjct: 271 KVMVVVTDGESHDGSNLKTVIGKCN---EDNITRFGIAVLGYLIRHELDTKNLIKEIKGI 327 Query: 341 ----SPNSFYLVENPHSMYDAFSHIGKDIV 366 + F+ V + ++ + +G+ I Sbjct: 328 ASHPTEKYFFNVSSEAALLEEAGTLGERIF 357 >gi|297538282|ref|YP_003674051.1| von Willebrand factor type A [Methylotenera sp. 301] gi|297257629|gb|ADI29474.1| von Willebrand factor type A [Methylotenera sp. 301] Length = 328 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 43/271 (15%), Positives = 84/271 (30%), Gaps = 40/271 (14%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA------RLDMM 175 + YS + + + ++ I +S + Sbjct: 24 SNSRTYSWVDMLPSDPLSNLIGLLLKILATFSLISIIIGLAGPHSLEQKVERIGIGAQIG 83 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +V+D S SM+ F + + A + + G++TFSN Sbjct: 84 MVIDRSASMDDPFSGGTAEGRVGETKSVAAARLMTEFVNSRQNDMIGVITFSNSAMYVLP 143 Query: 236 LEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L +Q IK + + TN GL A + ++ + I+ + Sbjct: 144 LTESREAIQAAIKATAGNSLFQTNIGGGLTSAVSLFENVPDSGS----------RAIILL 193 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---------------------H 333 +DG + QQ L ++ +Y I +R + Sbjct: 194 SDGGGRLGGDVQQKLR--EWLQRYNITLYWIVLRQPGGISIFNEYKEIDGEPLPQEVELY 251 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ + SP S Y E+P S+ +A + I + Sbjct: 252 QYFKTLRSPFSAYEAEDPKSLANAIADINEK 282 >gi|170730092|ref|YP_001775525.1| hypothetical protein Xfasm12_0914 [Xylella fastidiosa M12] gi|167964885|gb|ACA11895.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 795 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 54/177 (30%), Gaps = 27/177 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q + + +D+S SM + +++D ++NA L+++ V L Sbjct: 1 MQMAPSVAVYFAIDLSGSMHYVGGNGRSRLDNMKTALNAALDQLGQSIASGTAVDIMLAG 60 Query: 226 FSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQ 274 F + + L G++ L+ + F T P A Sbjct: 61 FGDAPDHRQTLLRRNCTAQGIAELKSWVAARQALYGTYFPAGTMDMPSFYAA-------- 112 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +N ++ FMTDGE + + Y I I + Sbjct: 113 --------APSNAVRVAFFMTDGEPDPPSATLAQAARADVDQVAHLRCYGINIDLAN 161 >gi|256833662|ref|YP_003162389.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] gi|256687193|gb|ACV10086.1| von Willebrand factor type A [Jonesia denitrificans DSM 20603] Length = 332 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 50/134 (37%), Gaps = 11/134 (8%) Query: 172 LDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD+ V+D + SM +D + ++ + ++ + + +++F + Sbjct: 75 LDVYFVVDRTGSMAAEDYDGNKPRLTGVRNDMTTLMADFSGA-------RFSIISFDSTA 127 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L I L++ ++ L ++ + T + + ++I Sbjct: 128 SRQMPLTTDTRALSGWITNLNQEITYYSAGSSLDRVREEL---SMALRQGATRNPDNQRI 184 Query: 291 IVFMTDGENLSTKE 304 + TDGEN + E Sbjct: 185 VYLFTDGENTTDTE 198 >gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3] gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. ANA-3] Length = 751 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 38/228 (16%), Positives = 86/228 (37%), Gaps = 40/228 (17%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T++ +VM + V+ + Q + ++++V+D S SM I A ++ L Sbjct: 350 TDNYSLVMVLPPKVEASEQLNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRG 403 Query: 209 VKLIPDVNNVVQSGLVTFSNKIE--EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ N ++ F++ + L ++ ++ + L G + + Sbjct: 404 LRPQDSFN------IIEFNSDVSLLSPTPLPATASNLAMARQFVNRLQADGGTE-----M 452 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A N Q + + + ++FMTDG + + + N+ ++ Sbjct: 453 AQALNAALPRQAFNA--ASAEDKSLRQVIFMTDGSVGNESALFELIR--NQIGDN--RLF 506 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK-DIVTKRI 370 +GI ++PNS ++ F++IG D V ++I Sbjct: 507 TVGIG-----------SAPNSHFMQRAAELGRGTFTYIGDVDEVEQKI 543 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 67/203 (33%), Gaps = 35/203 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVV------QSGLVTFS 227 +VLD S SME ++D+ S+ +++ +PD V Q L+ +S Sbjct: 313 ALVLDTSGSMEG------ERMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYS 366 Query: 228 NKIEEFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ ++ L +++ L G T + + A++++ G Sbjct: 367 SEPQQPTRARVDKDKPATTKELADRVERLVADG-DTATFEAVLNAFDEVDTSGGDIG--- 422 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRAC 339 +V MTDGE + Q + + V+ I E Sbjct: 423 --------TVVLMTDGEVTRGRTFAQFKDAYAQLPEDKKEIPVFVILYGEANIQEMEELA 474 Query: 340 A-SPNSFYLVENPHSMYDAFSHI 361 + + N + AF I Sbjct: 475 QLTGGKTFDALN-GDLAAAFEEI 496 >gi|223933146|ref|ZP_03625138.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|330833239|ref|YP_004402064.1| LPXTG-motif cell wall anchor domain-containing protein [Streptococcus suis ST3] gi|223898207|gb|EEF64576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|329307462|gb|AEB81878.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis ST3] Length = 997 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 44/250 (17%), Positives = 78/250 (31%), Gaps = 52/250 (20%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI--KSIN------AMLEEVKLIPDVNN 217 + T D +IV+D S SM +S T+ K++N +L ++ N Sbjct: 463 TGTKEPFDTLIVVDRSTSMTDPMNSVDTQARYLAVYKALNGTAGRQGLLSKLVGFHPENQ 522 Query: 218 VVQSGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V G + S + + WG S + TN T GL+ A Sbjct: 523 VAIVGFQGYPGYPSGDQDSTVIANWGRSTSVALSNIQPPYNNGTNYTAGLRTA------- 575 Query: 274 QGMRQHCNTEDANYKKIIVFMTDG------------------ENLSTKEDQQSLYYCNE- 314 + ++ KK+++F++DG N +L Y N Sbjct: 576 ---GVVLDQNQSSRKKVMIFISDGVPTFAFVNGVRYGNGTISGNNPYYTRDWTLNYFNSW 632 Query: 315 -AKKRGAIVYAIGIRVIRSHE--------FLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 K +Y +GI + L +S + V + ++ I Sbjct: 633 IGKYPKLPIYTLGISSEFGNSDNLSANPYVLNHMSSQTGGFYSHVADSQALERTLQKIVD 692 Query: 364 DIVTKRIWYD 373 D + + Sbjct: 693 DTKLSLVSIN 702 >gi|88811039|ref|ZP_01126295.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] gi|88791578|gb|EAR22689.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] Length = 930 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/236 (17%), Positives = 76/236 (32%), Gaps = 44/236 (18%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSR---HIVMPITS-----------SVKVNS 166 PQ +++A S +P YT++ +T ++ N+ Sbjct: 341 TPQGASVTVTADSN--LPTAAARIWLSYTSTNAGDTASGSVTVRCVQTGQSWTININANT 398 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 R + +V+D S SM ITK+ ++ N + + GLV F Sbjct: 399 IARPRSAVSLVIDRSGSMNDDAGDGITKVQKLREAANVFINIMLPGDG------IGLVRF 452 Query: 227 SNKIEEFFLLEW--------GVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ + + G + I + G T+ G+ N + D Q Sbjct: 453 NDTAQRLMEITDVGASPGGAGRTDALNHIAGSDIDPSGA-TSIGDGIVNGRNMLNDAQAA 511 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 D + +V +TDG + A A YA+G+ + + Sbjct: 512 PM----PDYDVT-AMVVLTDGMWNRPPS------LADVAGSINANTYAVGLGIPSN 556 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 84/230 (36%), Gaps = 25/230 (10%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F N G + + I P + +I+EVS K L I+++ + A + N Sbjct: 13 LTQFKKNEHGAILVSFIIIFPFFIALTFIILEVSIFLQKKAKLSDAIEQATL--ALTVEN 70 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 +G N ++ K +++ N + L GF + I +I +T GY Sbjct: 71 DGIPNAAQQTKNRELVLSYANAY-------LPSEGFSDPIINIDDNT--------NYLGY 115 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + + Y + + + ++ + + D++ V D S SM Sbjct: 116 NAAVTMTYPVEFLGRSPLTNSISNIQTTDNGEAIKNKTIEVSEPTDVVFVADYSGSMLLS 175 Query: 188 FDSSIT-----KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 F ++ +I+ + + +K + +NV G + F + + Sbjct: 176 FSDDVSIKNGERINALRSAFRILHNTIK---NNSNVNTIGFIPFGSGTKR 222 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ ++ ++ L ++ V G+VTF + + Sbjct: 307 VCLVLDKSGSM-----ATGDRLKRLNQAGKLFLLQI-----VEQGSWVGMVTFDSAAQVQ 356 Query: 234 FLLEWGVSHLQRK--IKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S +R K L T+ GL+ A+ + + Sbjct: 357 SELIQINSGTERDALTKSLPTVATGGTSICSGLRSAF-AVIKKKYPTDGAE--------- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GA+++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISSCFNEVKQSGAVIHTVALGPSAAKELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAQNNGLIDAFGALS 476 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 64/187 (34%), Gaps = 35/187 (18%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---------EFFLLEWGVSHL 243 TK+ ++ +L +++ + ++ FSN+I+ + G Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVWKDHLISVTPDSIRDG---- 53 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + I ++S G T+ L+ A + H D + +IVF+TDG+ Sbjct: 54 KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPT--V 106 Query: 304 EDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACASPNSFYLVENP-HSM 354 + +L N ++ ++ IGI L C + E+ + Sbjct: 107 GETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQL 166 Query: 355 YDAFSHI 361 + I Sbjct: 167 IGFYDEI 173 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 64/187 (34%), Gaps = 35/187 (18%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---------EFFLLEWGVSHL 243 TK+ ++ +L +++ + ++ FSN+I+ + G Sbjct: 4 TKLRQTKDALFTILHDLRPQD------RFSIIGFSNRIKVWKDHLISVTPDSIRDG---- 53 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + I ++S G T+ L+ A + H D + +IVF+TDG+ Sbjct: 54 KVYIHHMSPTG-GTDINGALQRAIRLL---NKYVAHSGIGDRSVS-LIVFLTDGKPT--V 106 Query: 304 EDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEF-----LRACASPNSFYLVENP-HSM 354 + +L N ++ ++ IGI L C + E+ + Sbjct: 107 GETHTLKILNNTREAARGQVCIFTIGIGNDVDFRLLEKPSLENCGLTRRVHEEEDAGSQL 166 Query: 355 YDAFSHI 361 + I Sbjct: 167 IGFYDEI 173 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/163 (22%), Positives = 59/163 (36%), Gaps = 23/163 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I +T K + +D++ V D S SM + K A ++ A + K Sbjct: 59 SIDFHLTPKGKATNANRDPIDVVFVFDKSGSMNDS-GKNPQKFQSAKDAMTAAVNFFKEN 117 Query: 213 PDVNNVVQSGLVTFSNKIEE----FFLLEWG---VSHLQRKIKYLSKFGVSTNSTPGLKY 265 N+ + G V F + +E F E ++ + LS G TN T L Sbjct: 118 AGPND--RFGFVPFDDDVETGKVVNFAPENNMASLNLINSNSNSLSALG-GTNYTQSLDA 174 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 A + K ++FMTDGE +K +++ Sbjct: 175 ALGMFGNSTNN------------KYVLFMTDGEPTFSKVIERT 205 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 74/224 (33%), Gaps = 33/224 (14%) Query: 153 HIVMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 IV+P + + + M++LD S SM++ T++ A + V Sbjct: 19 TIVVPPAQEARAEDEEGKNIPPTMLILDASGSMKTPDAGGQTRMAAAKDAAQLFSVAVPS 78 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW----------GVSHLQRKIKYLSKFGVSTNSTP 261 ++ +V G ++ E + V+ + ++ + G T P Sbjct: 79 DAELGFMVY-GTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIPAEVGKVEASGH-TPMGP 136 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA- 320 L+ A ++ + ++ IV ++DGE+ + KK G Sbjct: 137 ALRQAAEEL-------------PKDGERSIVLVSDGEDTCAPP--PVCEVAKDLKKEGID 181 Query: 321 -IVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSH 360 + +G V C A + ++ S+ D+ Sbjct: 182 LTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTVSLADSMKR 225 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 87/241 (36%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW+ + + + I + S +++ ++DVS SM S K+ + S Sbjct: 161 LAPAPWHPANVLLRIGIKGK-DMASAALPPANLVFLVDVSGSMNS-----PDKLLLLKSS 214 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNS 259 + ++ +++ + LVT+++ G + + I L G ST Sbjct: 215 LKLLVNKLRPQD------RITLVTYASGTRVALPPTPGSDKTAISAAIDQLVA-GGSTAG 267 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G+ AY +Q N ++ TDG+ N+ + +Q E +K Sbjct: 268 ASGIALAYQ------AAQQSFIAGGINR---VLLATDGDFNVGVTDFRQLKSMVEEKRKS 318 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVEN--------PHSMYDAFSHIGKDIVT 367 G + +G +E L A A ++ ++N + + I +D+ Sbjct: 319 GVSLSTLGFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEISSTLATIARDVKI 378 Query: 368 K 368 + Sbjct: 379 Q 379 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 67/178 (37%), Gaps = 26/178 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ + + + ++ + + Sbjct: 78 GTKLETVKREGVDVVFAIDVSKSMLAE-DVAPNRLEKSQQLVTQIINSLASD-------R 129 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ + + + ++ ++ T ++ A D + Sbjct: 130 VGIIAYAGSAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAINEAIQLAKTYYNDDEQTN 189 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +++ ++DGE+ + S+ EA + G ++ IG+ + Sbjct: 190 -----------RVLFIISDGEDH----EGDSVNIAEEASEEGIRIFTIGVGTTKGGRI 232 >gi|324504675|gb|ADY42017.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 898 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 67/191 (35%), Gaps = 25/191 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ +LD S S+E + + + + V +P + V+ + ++ Sbjct: 1 MDVIFLLDTSGSIEQIY----------QEHVKWTVSLVDALPVDRDGVRIAAIQYAGFPL 50 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L +++ + + T + L+ A +++F + ++ K Sbjct: 51 TEFALGTYLNADDIRQHLSQIKFQSGVTRTGYALRKADSELFRQE------RGARSDAIK 104 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR-SHEFLRACASPNSFYL 347 IIV TDG ++ L +E + + +Y + + E R + + Sbjct: 105 IIVLFTDGLSIDDP-----LKPAHELRDIKRVKIYVVSVGSDGFEPEMNRIAGDKRNVFG 159 Query: 348 VENPHSMYDAF 358 + D Sbjct: 160 PNELSRLRDTL 170 >gi|309792255|ref|ZP_07686727.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225796|gb|EFO79552.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 391 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 77/216 (35%), Gaps = 28/216 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + ++ L+ +VLD S SM+ K+ + ++E + V Sbjct: 4 LLVEATPVAAPPVPVPLNFCLVLDRSGSMQG------AKLQSMKAATRKVIETLTDQDVV 57 Query: 216 NNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + +V F + ++ L + L I +S+ G + S G++ Sbjct: 58 S------IVIFDDTVQTLVPATLATDRTALLAAIDTISEAGGTAMS-LGMQAG------Q 104 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++H + ++ ++ +TDG+ +++ + + A+G+ + Sbjct: 105 VELQKHSGPDRLSH---MLLLTDGQTWG--DEETCRNIARALGQADVRITALGLGAEWNE 159 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L A S + + + + + F + Sbjct: 160 QLLDDLAEFSDGTSDYIADANQIGTFFQRAIRSAQG 195 >gi|156390493|ref|XP_001635305.1| predicted protein [Nematostella vectensis] gi|156222397|gb|EDO43242.1| predicted protein [Nematostella vectensis] Length = 229 Score = 52.5 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 55/165 (33%), Gaps = 21/165 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNK 229 LD+ ++D + SM + ++ ++ + I V+ LV + Sbjct: 16 LDLAFIVDCTGSMGEY----------IRQAQKHVISISETISRTAYNVRLALVEYRDHPP 65 Query: 230 IEEFFL-----LEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ F+ V ++ + +S G + + A + + Sbjct: 66 QDKSFVTRVHDFTSDVKEMKVWVDKMSASGGGDCPESVADAIFKACKLGYREDATKMCVL 125 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA + F DG ++ L C+ ++G +Y IG Sbjct: 126 IADAPPHG-LGFAHDGFPNGCPDNHDPLASCHVMAEKGITLYTIG 169 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D+++ +D S+SM + D +++ + I + +++ + GL+ F+ Sbjct: 86 KRKGIDILMAVDTSKSMLAE-DVRPNRLERSKFGIMDFVSKLEGD-------RVGLLPFA 137 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + ++ L T+ + Y H Sbjct: 138 GTAFLMCPLTLDYDAFRNSLEALDTNIIPQGGTDIASAI---YEAEAAFNNDANH----- 189 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 KI+V ++DGE+L + ++L AK+R +Y +G+ Sbjct: 190 ----KILVLVSDGEDL----EGEALSAAQAAKERDLTIYTVGVGTPSGE 230 >gi|262198733|ref|YP_003269942.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082080|gb|ACY18049.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 684 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 44/235 (18%), Positives = 84/235 (35%), Gaps = 58/235 (24%) Query: 172 LDMMIVLDVSRSMESFFDSSITK--IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-- 227 LD +IVLD S S+ ++ + +D+ + NA L + +V+++ Sbjct: 72 LDAVIVLDASSSVRNYNNPPDANGAVDLIAGAGNAFLGAFADTNS-----RVAVVSYNAD 126 Query: 228 -------NKIEEFFLLEWGVSHLQ--------------RKIKYLSKFGVSTNSTPGLKYA 266 + L G + ++ G TN GL YA Sbjct: 127 PRLQLDLTAVTTDSLAAGGAHGIAMGDPGGPQGPMSPTTGYSEHARNGSGTNWEAGLVYA 186 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQ--------QSLYYC 312 N + N A+ K+++ +TDG T D+ ++ Sbjct: 187 QNVL---------ENNGRADVPKLVIHVTDGRPTRHLTPDGTVTDEGGMAVHVAEAAEVA 237 Query: 313 NEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 ++ K G ++A+G+ + E L+A + P+ F + DAF + D++ Sbjct: 238 DQLKASGVHIFAVGVGRAPQFSEELQATSGPDVF----DQTQPGDAFDVVNDDVI 288 >gi|194289206|ref|YP_002005113.1| lipoprotein, von willebrand factor type a domain [Cupriavidus taiwanensis LMG 19424] gi|193223041|emb|CAQ69046.1| putative lipoprotein, Von Willebrand factor type A domain [Cupriavidus taiwanensis LMG 19424] Length = 570 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 87/241 (36%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW+ ++ + + I + S +++ ++DVS SM + K+ + S Sbjct: 165 LAPAPWHPSNVLLRIGIKGK-DMASGALPAANLVFLVDVSGSMNT-----PDKLPLLKSS 218 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNS 259 + ++ +++ + LVT+++ G + I L G ST Sbjct: 219 LKLLVNQLRAQD------RITLVTYASGTRVALPPTPGSDKGAIVAAIDQLVA-GGSTAG 271 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G+ AY +Q N ++ TDG+ N+ + +Q + +K Sbjct: 272 ASGIALAYQ------AAQQSYIAGGINR---VLLATDGDFNVGVTDFRQLKSMVEDKRKS 322 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVEN--------PHSMYDAFSHIGKDIVT 367 G + +G +E L A A ++ ++N + + I +D+ Sbjct: 323 GVSLSTLGFGTGNYNEQLMEQLADAGDGAYSYIDNLMEGNKVLVSEISSTLATIARDVKI 382 Query: 368 K 368 + Sbjct: 383 Q 383 >gi|198433657|ref|XP_002122417.1| PREDICTED: similar to PK-120 [Ciona intestinalis] Length = 864 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 73/204 (35%), Gaps = 18/204 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM KI +++ +L+++ I N + S + E Sbjct: 300 VVFVIDVSGSMSGH------KIVQTKEALRTILDDLNEIDQFNIITFSSTTNVWHPNEMV 353 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + ++ ++ + G TN + + R T +++ Sbjct: 354 DVNPTNIRNAKKHVRSMYARG-GTNFNAAALDGIQLLETISSNR----TNTLEEASMMIL 408 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPNS------FY 346 +TDG+ +++ + G ++ +G HEFL AS N + Sbjct: 409 LTDGQPTVGVTGNEAIRRNIRERVNGRYSIFCLGFGQHLDHEFLDQIASENKGLSRKIYN 468 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 + + D + + ++ I Sbjct: 469 DADAALQLKDFYDEVASPLLAHVI 492 >gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 698 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 47/257 (18%), Positives = 91/257 (35%), Gaps = 47/257 (18%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 Y + + +I L T + +V +T + + T+ D VLDVS SM+ Sbjct: 266 YRLDDTTPARIELIPYKADRNATGTMMLV--VTPAADLQPITEGT-DWTFVLDVSGSMDG 322 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV-----S 241 KI ++ L ++ + N+ + +TF+ + L G Sbjct: 323 H------KIATLADGVSQTLGKL----NSNDRFRI--ITFN---QSAADLTRGFVTATPE 367 Query: 242 HLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + I + G STN GL+ A ++ D + IV +TDG Sbjct: 368 AVGQWINRVKTIAAGGSTNLFAGLETACRRLDDDRTTS-------------IVLVTDGVA 414 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFS 359 + +Q+ + + ++ I + L A + + ++ Sbjct: 415 NVGRTEQR--EFLQLLTEYDVRLFTFVIGNSANRPLLDRLAKDSGGFAMQISDV-----D 467 Query: 360 HI-GKDIVTK-RIWYDK 374 I G+ I K ++ Y+K Sbjct: 468 DIQGRLIQAKAKVLYEK 484 >gi|327467286|gb|EGF12786.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK330] Length = 470 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 68/163 (41%), Gaps = 20/163 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + + V D S SME T +I + + M++E+K I +V+ V Sbjct: 193 EGQVNVAISFVFDSSGSMERDMKGRNTNVVKERRISILREKAIEMVKELKEIGNVS-VNL 251 Query: 221 SGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 SG + +++ F L+ G +++ I L GV TN GL+Y + +QH Sbjct: 252 SGFSHYGFYVQKDFSQLDKGTEQIEKSINSLPTRGV-TNPGDGLRYGMVSL-----QQQH 305 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 K +V +TDG + D + Y N+ RG + Sbjct: 306 VQL------KYVVLLTDGIPNAYIVDPSANYPGNKVWIRGGEI 342 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 17/166 (10%), Positives = 55/166 (33%), Gaps = 6/166 (3%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRGLHRQRGAVAIIVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS--TSLDIVVVPQNEG 126 + + + G + + ++ N V D + T + + Sbjct: 72 ISLSVAE--ADGIAAGHLNFVFFQKTSVQMSTNANVTFSDSLTNPFLTKSAVTKPASIKY 129 Query: 127 YSISAISRYKIP--LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 +A ++ +P + + ++ V Sbjct: 130 VQCTATLSNIAHWFIEVLNVLPGTKLANAAEVSASAIATVGGGQTT 175 >gi|281338026|gb|EFB13610.1| hypothetical protein PANDA_007565 [Ailuropoda melanoleuca] Length = 854 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 79/193 (40%), Gaps = 20/193 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIE 231 ++ V+DVS SM K++ ++ +L+++K +N ++ SG VT + + + Sbjct: 249 SVVFVIDVSGSMHG------RKMEQTKDALLKILDDMKEEDYLNIILFSGDVTIWRDSLV 302 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + +K + G+ TN GL A N + + +H E + I+ Sbjct: 303 QATPE--NIQEARTFVKNIHDQGM-TNINDGLMRAINMLNKARE--EHRVPERSTS--IV 355 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----F 345 + +TDG+ N+ ++ A +Y +G ++ FL + A N Sbjct: 356 IMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRI 415 Query: 346 YLVENPHSMYDAF 358 Y + + F Sbjct: 416 YEDSDANLQLQGF 428 >gi|320006793|gb|ADW01643.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 248 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/159 (22%), Positives = 66/159 (41%), Gaps = 23/159 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 SSV+ + R + +VLD S SM ++ D ++ + + S++A L++ ++P Sbjct: 34 SSVRAHGLEGVRAAVYLVLDRSGSMRPYYRDGTMQHLAEQVLSLSAHLDDDGVVP----- 88 Query: 219 VQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +V FS ++ L G L K+ TN +A +++ D Sbjct: 89 ----VVFFSTDVDGCTDLTLGRHRGLMDKLHANLGHMGRTN----YHWAMDEVID----- 135 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 H + ++VF TDG S ++ Y C A+ Sbjct: 136 -HYLASGSEAPALVVFQTDGGPTSRLAAER--YLCKAAR 171 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 68/186 (36%), Gaps = 27/186 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T +++M + + K N + + V DVS SM+ +++ K ++L Sbjct: 353 QTTDGNLLMSMQNLWKKNKNNSQPIVGVFVTDVSGSMDGEPMNNLKK---------SLLN 403 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ I + N Q GLV++S+ + + S+ IK L+ G T + G Sbjct: 404 SLQYINEEN---QIGLVSYSDDVTINVPIDTMNSTQKSYFTSAIKGLTPSG-GTATYDGT 459 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A I D +I ++DG+ E ++ K G V Sbjct: 460 LVAVKMILDKMKENPGARP-------VIFVLSDGQTNGGYEFERVEPI---IKALGITVN 509 Query: 324 AIGIRV 329 IG Sbjct: 510 TIGYNA 515 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 46/300 (15%), Positives = 104/300 (34%), Gaps = 30/300 (10%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + ++ +G + K+ + +I NTW + + ++ I S++ Sbjct: 122 IPTLIMVVTDGRSADDPKIPAQQLQ-QIPNTWVFAAATGNPEAVDKKELLQITG--SINH 178 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 V++ + + + + T + T++ N + D+++V+ Sbjct: 179 VILQSGQELAADITRKLLRQAQEKCRTTTTTTTTTTTATTTTTTTTNPISGCEQDVVLVM 238 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----NKIEEFF 234 D+S + + I + + S+ + L+TFS ++ + Sbjct: 239 DLSTTTNPIYRKYIEMAEELVNSLV----------IGRRFSRIALITFSSVGKSRTQFNL 288 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + I+ L G +T G++ + Q +QH KKI++ Sbjct: 289 DRYFDGKDIVTAIRRLESSGGTTAIGEGIR-----LGTEQKDKQHGGRPVEIAKKIMLVF 343 Query: 295 TDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 TDG N ++ + AK G +Y I V + + A + + Y +E Sbjct: 344 TDGWSNKGPDVEEMTR----NAKGAGFTLYTI---VYEGNGRVDANSPGLNLYTIETMVD 396 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 79/218 (36%), Gaps = 32/218 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ + DA + +VLDVS SM + +++ A ++ N +L+ Sbjct: 13 LAFPAGAAAGEPTGQDAP-KVDLVLDVSGSMRARDIDGQSRMAAAKQAFNEVLD--ATPE 69 Query: 214 DVNNVVQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +V +++ + + ++ L+ + + + L+ G T P Sbjct: 70 EVELGIRTLGADYPGDDRKTGCKDTAQLYPVGPLD--RTEAKTAVATLTPTGW-TPIGPA 126 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L A + G K IV ++DGE+ D + AK G + Sbjct: 127 LLKAAGDLDGGNGS------------KRIVLISDGEDTCAPLDPCEVAREIAAKGIGLTI 174 Query: 323 YAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDA 357 +G+ +C A+ ++ VE+ + D Sbjct: 175 DTLGLVPTAKLSRQLSCIAEATGGTYTSVEHQDELTDR 212 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 26/60 (43%), Gaps = 2/60 (3%) Query: 308 SLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDI 365 + C KK ++Y + + + + CAS +FY N ++ AFS I DI Sbjct: 413 ASNLCKNMKKEDIVIYTVFFGSDYKGKKIMEECASNSETFYHATNQSALIQAFSSIANDI 472 Score = 36.3 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 29/232 (12%), Positives = 73/232 (31%), Gaps = 26/232 (11%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + G + +L A + ++ + GM I+ F + + +N Sbjct: 16 FKKFNRSEDGVVAVLVAFLMVLLIVFAGMAIDFGLGFNTR----------------RAVN 59 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + + S ++ + N + V T + P+ + Sbjct: 60 QALDAAVLAVANKLSTTELSSNTVDSLIDQYFEENLKNSVGGDVVHTKPVVTYDPKGDTV 119 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + +A + K + + + +TSS ++V V+ SM Sbjct: 120 AATATATVKTSFLPVLKLLNSESGDFGELTVTSSSTARFPKTKVEVAVVVD-VTGSMSG- 177 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLE 237 I + ML+ + + + + V+ V ++ ++ L Sbjct: 178 ------SIGSLKTASRDMLDTLLPDDNTRLQSRVRISYVPYNVGVKLDKTLA 223 >gi|126309704|ref|XP_001376377.1| PREDICTED: similar to complement C2 [Monodelphis domestica] Length = 822 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 71/213 (33%), Gaps = 28/213 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q L++ ++LD S+S+ S + +S M++ + V G+V F Sbjct: 313 QRSGHLNLYLLLDSSQSV------STEDFAIFKESAQLMVDRIFSFDVN---VSVGIVIF 363 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL--------SKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + + + S + +I G TN + Y + + + Sbjct: 364 AKSPKVILSVTHKDSRDEMEIAKKLEDLKYGDPDIGTGTNINKAMMQIYEMMNNEMAIFG 423 Query: 279 HCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN-----EAKKRGAIVYAIGIRVIRS 332 + + +I+ +TDG N+ + + + +YAIG+ + Sbjct: 424 GRQKDWEKIRHVIILLTDGKSNMGGSPTEAVKKIKEVLNIRQERTDYLDIYAIGVGKLDV 483 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++ + ++ F + Sbjct: 484 DWKELNELGSKKDGERHAFILPDSKALLQVFEN 516 >gi|146304257|ref|YP_001191573.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702507|gb|ABP95649.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 383 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 44/192 (22%), Positives = 71/192 (36%), Gaps = 31/192 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 T +++LD S SM+ KI+ A K +E +K IP N V V Sbjct: 31 KISTATGFHYIVLLDTSGSMDGL------KIESAKKGA---IELLKRIPQGNKV---SFV 78 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TFS+++ L +I LS G + T L A+N + + G Sbjct: 79 TFSSRVNIVREFVD-PEDLTAEISSLSAGGQTAFFT-ALLTAFN-LHNKHG--------- 126 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 ++ +TDG + +D Y A G + G+ + L++ A S Sbjct: 127 --IPSYVILLTDG---NPTDDTNVETYKRIAIPNGVQTISFGLGDDYNETILKSLADRSG 181 Query: 343 NSFYLVENPHSM 354 FY V + + Sbjct: 182 GVFYHVNDAMEI 193 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/174 (24%), Positives = 75/174 (43%), Gaps = 16/174 (9%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++IVLDVS SM + ++++D+A ++I+A+L++ + D V+ LVTFS+ + Sbjct: 2054 LLIVLDVSGSMADDSGVPGLSRLDLAKQAISALLDKYDDLGD----VKVQLVTFSSSATD 2109 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + V+ + + LS G TN + A + N Sbjct: 2110 QTSVWVDVATAKSLLSSLSADG-GTNYDAAVATAKTAFVTSGQLTGAQNIGY-------- 2160 Query: 293 FMTDGENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 F +DG+ S E + +A G YAIG+ S+++L A S Sbjct: 2161 FFSDGKPNSGLETGTADEAAWKAFLDANGIKNYAIGLGDGVSNDYLDPLAYDGS 2214 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 61/157 (38%), Gaps = 29/157 (18%) Query: 173 DMMIVLDVSRSMESF-------FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D++++LD S SM ++ +I +S + + + + ++ G++ Sbjct: 507 DIVLILDTSGSMNFNFYNDSIPYNEKDKRIYSLKQSAKQFINKF----NNKDNIRIGIIP 562 Query: 226 F-------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + +N + + + + I + G TN G++ A + + G Sbjct: 563 YSYYSGYANNIKQLTEINDNNKKSYENYIDNIKVEGA-TNQGDGIREAGKMLLNTDG--- 618 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 N KK ++ +TDGE + ++ +L + A Sbjct: 619 -------NSKKYVILITDGEATAITIEKPNLIINDSA 648 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 48/145 (33%), Gaps = 5/145 (3%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ + G +L A+ P++F G+ ++++ + K + +D+ + A N Sbjct: 7 MKRLQADTSGNALMLVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAGAWSRGN 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G K + + T ++ L + D +++ + + Sbjct: 67 GDMGLEYKTRAKQEFYINLSET-----KDYLLTHSIELQTFDGTPDSAVYMHATVSAQLP 121 Query: 128 SISAISRYKIPLKFCTFIPWYTNSR 152 + + + W T + Sbjct: 122 FTKVMINEGMTIAVQARATWETQKQ 146 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 67/191 (35%), Gaps = 10/191 (5%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN--VVQSGLVTFSNK 229 LD+++V+D S M S D+A I+ ++ + + + GL+T++ Sbjct: 43 LDVVLVVDNSEEMGSQRL-----FDVAANIIDVFGANTRIGSNSSEPITTRVGLITYNFN 97 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S+ TNST + + RQ+ NT +YKK Sbjct: 98 ATLNANLSQFQSYDDLSNGVFHSLSNVTNSTDSFIGTGLAMAEQLLRRQNFNTTRDHYKK 157 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNSFYLV 348 +I+ ED+ + + K G + +G + L ASP + Sbjct: 158 VIIVY--ASAFQRNEDETPEWIADRLKGSGVKIITVGYGNSHGLIKSLSNIASPGLSFNS 215 Query: 349 ENPHSMYDAFS 359 ++ + Sbjct: 216 SGDGNLINQIQ 226 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 83/261 (31%), Gaps = 28/261 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 + + ++ ++ +S P + +S +VM + K ++ + Sbjct: 349 NEESLASSQAHNDQVVSEANHPAEAQASDKEAKDSYALVMLMPPQDKARV--RLPRELTL 406 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM I A +I L + N ++ F + + + Sbjct: 407 VIDTSGSMTG------DSIAQAKSAILNALAGLGSQDTFN------VIAFDSSVRSLSPV 454 Query: 237 -----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + ++ L G T P L A +Q K + Sbjct: 455 ALSATAANLGKANLFVQSLEADG-GTEMAPALLRALSQPESGVSSISSAVKP--ERLKQV 511 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVE 349 VF+TDG ++ SL+ A ++ +GI + F+ A ++ V Sbjct: 512 VFITDG----AVGNEASLFALIAANIGRQRLFTVGIGAAPNGYFMERAARAGRGTYTYVG 567 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 + + + I + +I Sbjct: 568 KISEVDAKIGELLEKIESPQI 588 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/209 (18%), Positives = 78/209 (37%), Gaps = 19/209 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + T L ++++LDVS SM S KID K++ ML+ + + Sbjct: 6 KFTTPTAKPLPVVLLLDVSSSM------SGDKIDNLNKAVENMLDTFAQEEKMETEILVS 59 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++TF K++ K + G+ N + A M ++ T Sbjct: 60 VITFGGKVDLHVPFT--------KASQVQWHGLQVNGDTPMGTALKMAKAMIEDKE--TT 109 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRAC-- 339 Y+ IV ++DG+ +Q++ + +E + A+ I L+ Sbjct: 110 PSRAYRPTIVLVSDGQPTDGNIWKQAMADFISEGRSSKCDRMAMAIGHDADETVLKRFIE 169 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + + + EN +++ F + + + Sbjct: 170 GTAHDLFYAENAGQLHEFFQRVTMSVTMR 198 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 63/200 (31%), Gaps = 22/200 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + A D++I +D S SME D + + LI ++ + +TF Sbjct: 108 KEKAFKDIVIAIDTSGSMEQS-DPNGERFKA----------TSSLIDNLEGNRRIAFMTF 156 Query: 227 -SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + I +F +E + +K + + + + Sbjct: 157 DDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDG---QTGVRDMI--NEAYELIQNNSK 211 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACA--S 341 N+ ++ ++DG N + +Y IG+ + ++L A + Sbjct: 212 NHSGSLIMISDGAPSDDSASNIPALVSNYVQNN-IPIYTIGMMYGDNSAEQYLIDIANLT 270 Query: 342 PNSFYLVENPHSMYDAFSHI 361 Y + + AF I Sbjct: 271 GGQHYSTSDTTMIAGAFGQI 290 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 42/293 (14%), Positives = 99/293 (33%), Gaps = 41/293 (13%) Query: 57 SLVHAATQIMNEGNGNNRKKLK-----GGDILCRIKNTWNMSFRNELRDNGFVN-----D 106 S + I++ ++ + + + +F ++ + N Sbjct: 59 SANATSKMILSSPPPPQQRNAETYKEIKENSFVAVAQQPVTTFSADVDRAAYANVRRIIG 118 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAI---SRYKIPLKF-CTFIPWYTNSRHIVMPI-TSS 161 I ++ I + Y A S + + PW N H+++ I + Sbjct: 119 YGQIPPKDAVRIEEMVNYFDYDYPAPEEGSASPLRVSPELAPAPW--NPNHLLLRIGLQA 176 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 K++ +++ ++DVS SM+ K+ + S +L +++ + Sbjct: 177 KKIDLAKAPPSNIVFLIDVSGSMDEE-----NKLPLLQSSFKMLLGQLRPDD------KV 225 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +VT++N + V ++ IK L T+ G++ AY Q + Sbjct: 226 AIVTYANGTKVALP-STSVKDKEKIIKVLDNLYASGGTSGGKGIQLAYEQ-----AQKSF 279 Query: 280 CNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + I+ TDG+ N+ + + ++ G + +G + Sbjct: 280 IKNGNNR----IILATDGDFNIGINNTTDLEKFIEKQRESGIYMSVLGFGMGN 328 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 289 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 336 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 337 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 395 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 396 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 446 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 447 CTNKGYYFEIPSIGAIR 463 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 327 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 374 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 375 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 433 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 434 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 484 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 485 CTNKGYYFEIPSIGAIR 501 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 216 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 263 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 264 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 322 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 323 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 373 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 374 CTNKGYYFEIPSIGAIR 390 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 288 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 335 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 336 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 394 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 395 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 445 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 446 CTNKGYYFEIPSIGAIR 462 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 236 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 283 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 284 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNSNITRAN 342 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 343 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 393 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 394 CTNKGYYFEIPSIGAIR 410 >gi|254167839|ref|ZP_04874688.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623130|gb|EDY35696.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 1953 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 61/167 (36%), Gaps = 22/167 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSI-----------TKIDMAIKSINAMLEEVKLIPDV 215 +D++ V+D S SM S + T+ID+AI++ ++E+ V Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAAIDAVKELGPQDRV 1303 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH-LQRKIKYLS--KFGVSTNSTPGLKYAYNQIFD 272 +G ++ EE+ + + LQ I L + G T L +A + Sbjct: 1304 AVFTFNG----NSHPEEYMGFTYVTADNLQTIISDLKDIQAGGGTPLYDTLSWAVYYMDK 1359 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 N + + + I+ +TDG + + + A G Sbjct: 1360 YSTD----NPDREDATRGILVLTDGLSNYDTYGTSNGARYDYAPGTG 1402 >gi|325673437|ref|ZP_08153128.1| type II secretion system protein [Rhodococcus equi ATCC 33707] gi|325555458|gb|EGD25129.1| type II secretion system protein [Rhodococcus equi ATCC 33707] Length = 623 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/254 (13%), Positives = 83/254 (32%), Gaps = 33/254 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 + V + +S ++ I + + + ++V Q + D+ Sbjct: 32 QVSAVDTTRFPDIEVSILAPPGIEGQAIDPGTFALT----EGGVPREIEVRQQPGSEQDI 87 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++ +DVS M S +D ++ + + + G+V S+ + Sbjct: 88 VLAIDVSGGM------SGPALDDVKRAASDFVRQAPTGAH------IGIVAISSTPQVLS 135 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L L R+I L G S A + + + + + + I++ + Sbjct: 136 ELTTDSEDLLRRIDGLKAGGNSA-------IADSVVTAAEMLERGEAANN-----ILLLL 183 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACA--SPNSFYLVENP 351 TDG T + + A +YA+ + ++ L+ A S + + Sbjct: 184 TDG--ADTSSAHSMSELPSVLSRSRASLYAVQMSTPETNSALLQQVARESRGQYASAGDT 241 Query: 352 HSMYDAFSHIGKDI 365 ++ + + + Sbjct: 242 AALGAIYQSAARAL 255 >gi|317122731|ref|YP_004102734.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592711|gb|ADU52007.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 1122 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 68/194 (35%), Gaps = 33/194 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + + + +V+D S SME K++MA+++ + + + + G++ Sbjct: 448 SRKNLPTVALALVIDRSGSMEG------VKLEMAVEAARRVAQLLTPAD------RLGVI 495 Query: 225 TFSNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F + + ++ + G T+ GL+ A + + D++ +H Sbjct: 496 LFDTQAYVTRPIEPVDSAGQVEAAFPSV--AGGGTSLGVGLEAALHLMKDVRADVRH--- 550 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++ +TDG E ++G V A+ I L A Sbjct: 551 --------VIALTDG----VSEPFDVAGTARAFHEQGITVSAVAIGADADTTTLGWLAQE 598 Query: 343 --NSFYLVENPHSM 354 Y+ +P + Sbjct: 599 GGGQLYVAADPGQI 612 >gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica] gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica] Length = 1393 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 70/205 (34%), Gaps = 31/205 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 S + A +++ V+D S SM TK+D + +M+ + + + Sbjct: 779 AQSSSSAPYNVVFVMDKSGSM------IGTKLDQTKDAFRSMISSLD------RNAKFSI 826 Query: 224 VTFSN-----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V F+ + + + V + I +S G TN L A Sbjct: 827 VGFNYATTAWRNKLVRATNYNVEEARSFISRISA-GGGTNMHAALLDAIELCNSESSSTV 885 Query: 279 HCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHE 334 C I+FMTDG E+ + L +++++G + IG S+ Sbjct: 886 PCM---------IMFMTDGTATVGVTEESRILADVTKSRQQGKANIALNVIGFGAGISYS 936 Query: 335 FLRACASPNSFYLVENPHSMYDAFS 359 FL + NS + AF Sbjct: 937 FLSRLSVLNSGIARQIFEDTNAAFQ 961 >gi|291000628|ref|XP_002682881.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284096509|gb|EFC50137.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 207 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 61/155 (39%), Gaps = 26/155 (16%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 T + + I + + V + + LD++ VLD S SM KI++ KS+ M+ Sbjct: 21 NQTQNMFGMASIKAPIYVEKENRSSLDIIAVLDKSGSMSD-------KIELVKKSLLFMI 73 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPG 262 ++++ + G+V F + L G + + G +TN + Sbjct: 74 DQMQARD------RLGIVEFDANVSTTLKLTSMDNGGKKQAMNCVNNIK-LGTTTNISGA 126 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + A++ + + G + + I+ TDG Sbjct: 127 IIEAFDILANRGG--------NISPTTSILLFTDG 153 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 71/209 (33%), Gaps = 26/209 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +VM + VK D D+ V+D S SM + + + ++ Sbjct: 296 LVMLMPPQVKSQDLQDFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVA 355 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 N+ + F +E + + +K+L+ G T P L A + Sbjct: 356 FNNDTTRL----FETSVEGT---TRNKQYARDFVKHLNA-GGGTEMAPALNAALKRTTTK 407 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++ K +VF+TDG ++ +L+ + + A ++ +GI + Sbjct: 408 ------------DFIKQVVFITDG----AVGNEAALFSQIKNELGDARLFTVGIGSAPNS 451 Query: 334 EFLRACAS--PNSFYLVENPHSMYDAFSH 360 F+ A S+ V N + Sbjct: 452 YFMTRAAQFGLGSYVFVRNTADIKQQMDS 480 >gi|156382093|ref|XP_001632389.1| predicted protein [Nematostella vectensis] gi|156219444|gb|EDO40326.1| predicted protein [Nematostella vectensis] Length = 801 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 67/175 (38%), Gaps = 24/175 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-- 229 +D+ +V+D S ++ D K+ + V I D N + GLV + K Sbjct: 115 MDVALVIDSSAAVS---DEDFEKVKKFA------IRIVHGIADAENSIHFGLVQYGEKAT 165 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ ++ S L+ IK L+K T L+ +F ++G Sbjct: 166 TIVDFNKIMPN--SALRETIKALTKNNDPTRRVDLTLETVKKDLFSLEG------GMRQG 217 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + + +VF+T G + +D + G + A+GI FL + A+ Sbjct: 218 HPRFVVFITAGGTDAASKDLAVAS--KPLRDLGVNIIALGINSGVDKAFLNSLAT 270 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 49/334 (14%), Positives = 104/334 (31%), Gaps = 38/334 (11%) Query: 47 KTVLHSMIDRSLVHAATQI-MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 + ++ ID ++ + R ++ + Sbjct: 434 RQKVNQAIDALKFSGGAGGGVSNALEKAVSDIYTAAGGSRDSANKSLIIIGSGKVENPAG 493 Query: 106 DID--DIVRSTSLDIVVVPQNEGY---SISAISRYKIPLKFC---TFIPWYTNSRHIVMP 157 + + +R +DI +P EG ++ I+ K+ +++ + R + Sbjct: 494 ATNLANQLRQAQVDIFAMPVGEGKDLAALKTIASKKVEKNIFVTSSYLALKAHLRALTEA 553 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + D LD++ + + S+E KI N +K N Sbjct: 554 VCEGAEKLRPCDLPLDIVFGIPTASSLE----GDFKKIK------NTFTSLLKFFKVNNR 603 Query: 218 VVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-Q 274 G+V +S+ L+ + L + I + G N +K A + F + Sbjct: 604 KAHLGIVEYSDDARMTKPLDQEYDRKELIKVIDEIQPSGNGFNVAAAMKRASDHAFTIFG 663 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G+RQ K ++F+ ST + + K G + + + + + Sbjct: 664 GVRQ------TVPKTFVLFV---PRDSTADMAEVNKQAQALKAMGVHIVLFAVDKVTNLD 714 Query: 335 FLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 + AS P + + N F I I Sbjct: 715 KWKTIASQPARRFFIGNT------FDDIAGQIYD 742 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSSITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|290957869|ref|YP_003489051.1| hypothetical protein SCAB_34031 [Streptomyces scabiei 87.22] gi|260647395|emb|CBG70500.1| putative membrane protein [Streptomyces scabiei 87.22] Length = 534 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 65/199 (32%), Gaps = 38/199 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-- 232 + VLD S SME ++D ++ + + + +V L+ F + ++ Sbjct: 358 VYVLDTSGSMEG------DRLDRLKTALTELTGDFRDREEVT------LMPFGSDVKSVR 405 Query: 233 -----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G+ ++ + LS G T L+ AY +H D + Sbjct: 406 THVVRPADPKAGLDGIRADTRKLSAAGE-TAIYTSLRRAY----------EHLGAVDRDT 454 Query: 288 KKIIVFMTDGENLSTKE----DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SP 342 IV MTDGEN D + A+ V+ I E + Sbjct: 455 FTSIVLMTDGENTEGASPADFDDFYGRLPDAARH--IPVFPILFGDSDRDELEHIAEVTG 512 Query: 343 NSFYLVENPHSMYDAFSHI 361 + S+ AF I Sbjct: 513 GRLFDA-TRGSLDGAFEEI 530 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 64/208 (30%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D + + + A + ++++ + G+V F+ Sbjct: 88 VMVAIDVSLSMMAK-DVAPNRFEAAKAAAKKFIQDLPARFN------VGVVAFAGSANVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L+ T + + I IV Sbjct: 141 ATPSGDRAAAISSIDTLT-LAKRTAIGEAVFTSLQAIRSFD-----AQAGQDPPPAHIVL 194 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------------HEFLRAC 339 ++DG+N + + +++ + A+ V I L Sbjct: 195 LSDGDNTTGRSVPEAI---DAARAADVPVSTIAFGTPYGTVEIDGETTPVEVNKVTLAGL 251 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDI 365 A + Y + + +++IG + Sbjct: 252 AQGTNGKAYEAADNDQLSQVYANIGTSL 279 >gi|255513821|gb|EET90086.1| von Willebrand factor type A [Candidatus Micrarchaeum acidiphilum ARMAN-2] Length = 705 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 70/202 (34%), Gaps = 32/202 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DA ++ ++LD+S SM KI+ A + + ++ + + + Sbjct: 522 KDAGAEIWMLLDISGSMGG------QKINAAKRILGSIHDSLDG------------SKYV 563 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F G + K L T + + YA + + + + Sbjct: 564 HLRMFGFYGSDGTHVFEFDRKMLMNLAAMGDTPTDIAIYYAMDLM----------KKDKS 613 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 N+ K + +TDG+ + +E + +L A K V+ I I + + + + Sbjct: 614 NFDKTLFIITDGDPNNGQETKNALNSLKNAMKN-VNVFTIFISREAARAV-EIFSPSDWY 671 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + V + + K IV Sbjct: 672 FNVSSMDEVEKVLEKGIKGIVD 693 >gi|149773085|emb|CAO01892.1| collagen type VI alpha 5 [Mus musculus] Length = 1171 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 80/240 (33%), Gaps = 26/240 (10%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 I + + T P T + + ++ +D+ +LD Sbjct: 900 DDTGTFQVIPVPPVGDY--EPLEKLRRCTLCYDKCFPNTCAEEPFFPENSYMDVAFLLDN 957 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----------NKI 230 S+++ S + + S+ + + L+++S K Sbjct: 958 SKNIAS---DDFQAVKALVSSVIDSFHITSNPSASESGDRVALLSYSPSESSRRKGRVKT 1014 Query: 231 EEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F S ++ I L + L++A +F T + K Sbjct: 1015 EFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAMEGLFLG--------TPNPRKHK 1066 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +I+ ++ GEN KE +++ AK +G +V+ I + + E + P +L++ Sbjct: 1067 VIIVISAGENHEEKEFVKTVAL--RAKCQGYVVFVISLGSTQRDEMEELASYPLDHHLIQ 1124 >gi|332846919|ref|XP_003315346.1| PREDICTED: integrin alpha-E [Pan troglodytes] Length = 1241 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + IVLD S S++ A I+ M+ + + N LV + I+ Sbjct: 265 IAIVLDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 314 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 315 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 368 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG + ++ N K +G +AIG+ Sbjct: 369 MVVLTDGGIFEDPLNLTTVI--NSPKMQGVERFAIGVG 404 >gi|149377596|ref|ZP_01895335.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] gi|149358138|gb|EDM46621.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] Length = 718 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 73/218 (33%), Gaps = 36/218 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 A +R P W + + + N +++ V+D S SM Sbjct: 315 WAPARGMEPAAAVFQEHWQGKDYLLALVVPGL---NGNRSLPRELVFVIDTSGSMAG--- 368 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-----IEEFFLLEWGVSHLQ 244 I A +++ L + + ++ F+++ +E ++ + Sbjct: 369 ---ESIRQARQALLRGLGTLDADD------RFNVIQFNSQTHSLFMESVPASGNNIARAR 419 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 R +K L+ G T P L A + G + + +VF+TDG Sbjct: 420 RYVKGLNADG-GTEMAPALDAA----LETNGDGGEASRARV---RQVVFITDG----AVG 467 Query: 305 DQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +L + ++ +GI + F+R A Sbjct: 468 NESALFGKIRDGLGSS--RLFTVGIGSAPNMHFMREAA 503 >gi|125532271|gb|EAY78836.1| hypothetical protein OsI_33941 [Oryza sativa Indica Group] Length = 645 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 58/136 (42%), Gaps = 26/136 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K+ + +++ +++ + + +++FS+ Sbjct: 173 PLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPAD------RLCVISFSSGA 220 Query: 231 EEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L G +H +R + LS G TN L+ A ++ D + R + Sbjct: 221 SRLMRLSRMTDAGKAHAKRAVGSLSARG-GTNIGAALRKA-AKVLDDRLYRNAVES---- 274 Query: 287 YKKIIVFMTDGENLST 302 ++ ++DG++ T Sbjct: 275 ----VILLSDGQDTYT 286 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 40/85 (47%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G + I+ A+ L + + +G + + ++ + S +D +L+ A QI N Sbjct: 16 LRGLRRDRTGNVGIIVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINN 75 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNM 92 G+ + K +++N++ + Sbjct: 76 TGDTDALKLKVTDWFHAQVENSYTL 100 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 61/184 (33%), Gaps = 34/184 (18%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR F + GG+ +LT + L + +V GM ++ +T L S+ DR+++ AA Sbjct: 58 IRQFRNDEDGGLIVLTLLLLISMLVVGGMAVDFMRFESERTKLQSVADRAVLAAA----- 112 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 N N ++ + S + I Sbjct: 113 --NLNQEREAADVITDFFTAEGFGGSIVG------------------TPSIQKNRNGSTI 152 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + +I TF + P ++ + +++ +VLD+S SM S Sbjct: 153 RLESIVDVD------TFYLRLVGIDTLSAPANATAIEGTGN---VEVSLVLDISGSMGSR 203 Query: 188 FDSS 191 Sbjct: 204 MTGD 207 >gi|311742726|ref|ZP_07716535.1| possible von Willebrand factor, type A [Aeromicrobium marinum DSM 15272] gi|311314354|gb|EFQ84262.1| possible von Willebrand factor, type A [Aeromicrobium marinum DSM 15272] Length = 435 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 37/224 (16%), Positives = 78/224 (34%), Gaps = 34/224 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++++ + + + +++VLD S SM S ++D A +++ ++E + Sbjct: 26 YVLVDVEAPALPTREDALPTRLVVVLDRSGSM------SGARLDHAKRALRQVVEALSPS 79 Query: 213 PDVNNVVQSGLVTFSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 GLVTF +E + + R + + G ST+ GL Q Sbjct: 80 DSF------GLVTFDQHVEVAVNAGPVTDPAAVLRAVDAVRP-GGSTDLAGGLIEGLRQA 132 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGIR 328 + G + ++ ++DG D +L A+ RG +G+ Sbjct: 133 TLLDGDAR------------VLLISDGHANQGVVDPDALQT-FTARHLDRGVTTSTLGMG 179 Query: 329 VIRSHEFL---RACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + L + + E + A + D+ +R Sbjct: 180 LGYDETLLGAVSRGGTGEQHF-AEEADTAVGAITAECGDLAAQR 222 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 63/201 (31%), Gaps = 7/201 (3%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G ++ I P++F V + +E + + ++ + + A I ++ N Sbjct: 12 GVAVVIFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVAANISSDITENKTLA 71 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 D E + + V + Y ++ S ++ Sbjct: 72 KNYVDGFVPDGTISLADINIERKSCDEIYGSQCGVAG--VYDEEGLVFTQYKVTLSSEFE 129 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 F P + ++ + + +D+ V D S SM+ ++ I K Sbjct: 130 SWYPEDDFAPGFEE----IVELGGTAVARKYQGFTIDVAFVADFSGSMQQTWNREI-KYK 184 Query: 197 MAIKSINAMLEEVKLIPDVNN 217 + I+ + +++ D Sbjct: 185 GVVNVISDITRKLETFNDHTE 205 >gi|148689166|gb|EDL21113.1| mCG4444 [Mus musculus] Length = 860 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 79/222 (35%), Gaps = 24/222 (10%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + + T P T + + ++ +D+ +LD S+++ S + Sbjct: 180 YEPLEKLRRCTLCYDKCFPNTCAEEPFFPENSYMDVAFLLDNSKNIAS---DDFQAVKAL 236 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFS----------NKIEEFFLLEWGVSHLQRKI- 247 + S+ + + L+++S K E F S ++ I Sbjct: 237 VSSVIDSFHITSNPSASESGDRVALLSYSPSESSRRKGRVKTEFAFTTYDNQSIMKNYIY 296 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L + L++A +F T + K+I+ ++ GEN KE + Sbjct: 297 TSLQQLNGDATIGLALQWAMEGLFLG--------TPNPRKHKVIIVISAGENHEEKEFVK 348 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++ AK +G +V+ I + + E + P +L++ Sbjct: 349 TVAL--RAKCQGYVVFVISLGSTQRDEMEELASYPLDHHLIQ 388 >gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] Length = 947 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 77/198 (38%), Gaps = 30/198 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ K + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPTVGKCELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 473 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 474 IAQRIYGNQDTSSQLKKF 491 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 52.1 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 74/224 (33%), Gaps = 33/224 (14%) Query: 153 HIVMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 IV+P + + + M++LD S SM++ T++ A + V Sbjct: 19 TIVVPPAQEARAEDEEGKNIPPTMLILDASGSMKTPDAGGQTRMAAAKDAAQLFSVAVPS 78 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW----------GVSHLQRKIKYLSKFGVSTNSTP 261 ++ +V G ++ E + V+ + ++ + G T P Sbjct: 79 DAELGFMVY-GTEVGNSPEERDAGCKDVKTLLPVEKGNVTKIPAEVGKVEASGH-TPMGP 136 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA- 320 L+ A ++ + ++ IV ++DGE+ + KK G Sbjct: 137 ALRQAAEEL-------------PKDGERSIVLVSDGEDTCAPP--PVCEVAKDLKKEGID 181 Query: 321 -IVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSH 360 + +G V C A + ++ S+ D+ Sbjct: 182 LTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTVSLADSMKR 225 >gi|124006669|ref|ZP_01691501.1| domain of unknown function protein [Microscilla marina ATCC 23134] gi|123987824|gb|EAY27515.1| domain of unknown function protein [Microscilla marina ATCC 23134] Length = 3238 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 67/212 (31%), Gaps = 25/212 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++++ VLD+S SM+ + + I + +A++ + +S L+ F Sbjct: 2206 PPIEVIYVLDMSGSMKWEYPKTDDAGKTISRFRAAQDALIYANSALAQQGMSSRSALIVF 2265 Query: 227 SNKIEEFFL-LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ + L ++ L T + G+ A + + Sbjct: 2266 NDTTAQVMSGFTNNFQQLNSIVENLGAPNGGTPMSKGMLSAKELLKTRSADK-------- 2317 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA----- 340 K ++V +TDG + + + K ++ +V R E + Sbjct: 2318 --KPVVVLITDGVPTYDLRN----FPYDHLKVSAVDIFDPVAQVFRIKEVVAKMGGLNTW 2371 Query: 341 --SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S Y + + A I + I Sbjct: 2372 ISSSYGLYNGQVLSEVMGAIDEIKAETPDALI 2403 >gi|310114429|ref|XP_003119946.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 481 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 48/129 (37%), Gaps = 16/129 (12%) Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + R I + + T + L + +G + ++ ++ +TDG Sbjct: 124 DIYRAIFDVQQMRDGTRTGKALNFTLPFFDSSKG-------GRPSVQQYLIVITDG---- 172 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + + + I++AIG+ + + L ++ Y N ++A ++ Sbjct: 173 -VAQDNVIIPAKALRDKNTIIFAIGVGEAKKSQLLEITNDEDNVYHDVN----FEALQNL 227 Query: 362 GKDIVTKRI 370 K+I++K Sbjct: 228 EKEILSKVC 236 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 32/95 (33%), Gaps = 7/95 (7%) Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +T N + ++ +TDG K + G +YAIGIR E Sbjct: 5 FADTGRINVARYLIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGIREANIDELKEI 59 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + V + D + +DI + + Sbjct: 60 A--KDKIFFVYEFDLLKDIQKEVVQDICSSEACKN 92 >gi|310119165|ref|XP_003118918.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] gi|310126588|ref|XP_003120448.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 535 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 47/129 (36%), Gaps = 16/129 (12%) Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + R I + + T + L + +G + ++ ++ +TDG Sbjct: 124 DIYRAIFDVQQMRDGTRTGKALNFTLPFFDSSKG-------GRPSVQQYLIVITDG---- 172 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + + + I++AIG+ + + L + Y N ++A ++ Sbjct: 173 -VAQDNVIIPAKALRDKNTIIFAIGVGEAKKSQLLEITNDEDKVYHDVN----FEALQNL 227 Query: 362 GKDIVTKRI 370 K+I++K Sbjct: 228 EKEILSKVC 236 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 18/95 (18%), Positives = 32/95 (33%), Gaps = 7/95 (7%) Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +T N + ++ +TDG K + G +YAIGIR E Sbjct: 5 FADTGRINVARYVIVITDG-----KSSDSVAEAAEGLRANGVNIYAIGIREANIDELKEI 59 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + V + D + +DI + + Sbjct: 60 A--KDKIFFVYEFDLLKDIQKEVVQDICSSEACKN 92 >gi|51597757|ref|YP_071948.1| hypothetical protein YPTB3465 [Yersinia pseudotuberculosis IP 32953] gi|153947845|ref|YP_001399498.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162419629|ref|YP_001605639.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|170022818|ref|YP_001719323.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896948|ref|YP_001874060.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229837270|ref|ZP_04457433.1| hypothetical protein YPS_1184 [Yersinia pestis Pestoides A] gi|51591039|emb|CAH22703.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP 32953] gi|152959340|gb|ABS46801.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162352444|gb|ABX86392.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|169749352|gb|ACA66870.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699974|gb|ACC90603.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229705393|gb|EEO91403.1| hypothetical protein YPS_1184 [Yersinia pestis Pestoides A] Length = 346 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 14/167 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ K++ + ++ ++K P ++ F+ Sbjct: 3 RLPIFFVLDCSESM---IGENLKKMN---DGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L V + L G T+ L+ QI D + Sbjct: 57 KTIVPL---VEVVSFYPPRLP-IGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +TDG + + + + A+K + AIG+ LR Sbjct: 111 VYLLTDGRPTDDTTAEITRWKTHYARK--VNLIAIGLGPSADLNILR 155 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 75/235 (31%), Gaps = 53/235 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D + ++++ A + A + D+ V+ G+V F+ + Sbjct: 88 IMLAIDVSGSMRA-TDVAPSRMEAAQAAARA------FLADLPRHVKVGIVAFAGSAQIA 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK----- 288 L L I + T G+ + IF G+ + + Sbjct: 141 QLPTTNREDLNAAIDRF-QLQRGTAIGNGIVLSLAAIFPEVGIDLGQFSYGRPGQPRGPS 199 Query: 289 --------------------KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I+ +TDG+ L A +RG VY +G+ Sbjct: 200 LDMPPAGPPPTPVAPGSYGSAAIILLTDGQRT---TGIDPLEAAKLASERGVRVYTVGVG 256 Query: 329 VIRS---------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIV 366 + E L+ A + ++ + ++ + + + Sbjct: 257 TVEGVTVGFEGWSMHARLDEESLKHIAQQTRAEYFHAASAEALTQVYETLSSRLT 311 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 57/147 (38%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 369 HKKVIVLLTDGVPTFSYTVSRVQTEAD 395 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 57/147 (38%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 369 HKKVIVLLTDGVPTFSYTVSRVQTEAD 395 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 79/208 (37%), Gaps = 29/208 (13%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +++ + LD++ ++D S SM S +KI+ +I +LE + Sbjct: 143 TQKLYISTSSRPNLDLVCIIDNSESM-----SGCSKIENVKNTILQLLEMLNEND----- 192 Query: 219 VQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + L+TF++ ++ L+ LQ + +G TN T GL+ A+ + Sbjct: 193 -RLSLITFNSYAKQLCGLKKVSNLNKETLQAITNSIKAYG-GTNITSGLEIAFQIL---- 246 Query: 275 GMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + I ++DG + TK + ++ +++ Sbjct: 247 --QSRKKKNSVSS---IFLLSDGQDDGADTKIKNLLKITYQQLQEESFTIHSFSFGSDHD 301 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAF 358 ++ A SFY VE + + F Sbjct: 302 CPLMQKIAQIKDGSFYFVEKNDQVDEFF 329 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 53/165 (32%), Gaps = 10/165 (6%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI-----TKIDM 197 T + + I++ V + ++ +V+D S SM D S ++I + Sbjct: 3053 TPVLNGGSGNDIILGDKGGTVVTVEPGKNYNIALVVDTSGSMAYKLDGSTNGSGQSRIAL 3112 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++ + ++ + NV G T + L V L I LS G Sbjct: 3113 VKDALTNLANQLVGHDGIVNVTLIGFATTAGTPVTLQNLTSANVQTLLTAITNLSATG-G 3171 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 TN A + + N F+TDG+ Sbjct: 3172 TNYEAAFNSAVSWFNSQTAAGKSVAAGYENVT---FFLTDGDPTY 3213 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 44/256 (17%), Positives = 82/256 (32%), Gaps = 48/256 (18%) Query: 154 IVMPITSSVKVNSQTDARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + P SS K L V + + D + TK ++ SI + Sbjct: 706 LPNPGNSSAKFYIDNSGTLKCRFQFVQNSCSPVYMKADEASTKSEVLQNSIARFAATLGY 765 Query: 212 IPDVNNV--VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK----------------- 252 + + + + + TFSN E LL W + +++ Sbjct: 766 LSPGSQIAMTRFSVDTFSNA--ECALLNW-TNDTGEVTAAMNQEYGNPLAEGGRANQTLD 822 Query: 253 --------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 ST++ G++ + + N N + ++ TDG++ S Sbjct: 823 GLRVYNYGITGSTHTYRGIESYIENMTNGASGGYVPNAPQGNNSRYLIIFTDGKDNSGNL 882 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIR--------VIRSHEFLRACASP-----NSFYLV--E 349 Q+S+ + K G + + ++ V S FL+ AS FY Sbjct: 883 -QKSMDDTDALKNNGYTIITVLMQSAGMTSEDVEHSTTFLKRLASSNASGEKYFYTAMYN 941 Query: 350 NPHSMYDAFSHIGKDI 365 +P + F I +I Sbjct: 942 DPEGLVKVFQDIAHEI 957 >gi|297202051|ref|ZP_06919448.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197714313|gb|EDY58347.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 518 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 66/197 (33%), Gaps = 35/197 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--- 231 + VLD S SME ++D ++ + + + +V L+ F ++++ Sbjct: 343 VYVLDTSGSMEG------DRLDRLKTALADLTGDFREREEVT------LMPFGSQVKSVR 390 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G+ ++ LS G T L+ AY+ H + Sbjct: 391 THVVKPSDPRAGLDAIRDDTSALSADG-DTAIYTSLEKAYD----------HLGAGRDAF 439 Query: 288 KKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNS 344 IV MTDGEN + D + Y K R V+ I E + Sbjct: 440 TS-IVLMTDGENTAGAKARDFDAFYARLGRKARDTPVFPILFGDSDRSELAHIADLTGGR 498 Query: 345 FYLVENPHSMYDAFSHI 361 + S+ AF I Sbjct: 499 LFDARQ-GSLDGAFEEI 514 >gi|149920662|ref|ZP_01909127.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] gi|149818449|gb|EDM77898.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] Length = 540 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 46/225 (20%), Positives = 88/225 (39%), Gaps = 30/225 (13%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + N +V+ + + + L++ I +D+S+SME ID + Sbjct: 128 MGNMINGGNCTVVVVGMNTPIDPAELDRPPLNLTIAVDLSKSMEG------EPIDRVRQG 181 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNST 260 + M E+++ + LV F ++ + S L I L +G STN Sbjct: 182 LLQMREQLEPED------RVTLVGFGDEAQVIVENADKDSVELATAIAALVPWG-STNLY 234 Query: 261 PGLKYAYNQ--IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 GL+ A+ Q ++ +G + ++ ++DG + + + EA Sbjct: 235 AGLRTAFEQTDLYAQEGWQNR-----------VLLVSDGVPTTGIVNSDKIEGLAEAWSG 283 Query: 319 -GAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 G + +GI E +R + SFY VE+P ++ + FS Sbjct: 284 MGYGLTTVGIGNDFDIELMRNLSELGSGSFYYVEDPDAVIEVFSE 328 >gi|32473964|ref|NP_866958.1| hypothetical protein RB5901 [Rhodopirellula baltica SH 1] gi|32444501|emb|CAD74500.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 700 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/306 (14%), Positives = 97/306 (31%), Gaps = 33/306 (10%) Query: 49 VLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK---NTWNMSFRNELRDNGFVN 105 L + ID V + + +++ + E F Sbjct: 75 QLQNPIDDEAVVMIAPPPEYEETVDLIEEIVVSDQPQVEIGSDALAEFDMAEASAATFA- 133 Query: 106 DIDDIVRSTSLDIVVVPQ---NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 +I ++V L+ + N+ +S + ++ + + V IT V Sbjct: 134 EIANMVSPVDLEPTDLGDIMVNKMFSQPVAPQDRLTDQKGRVGQGTAGASGAVDQITFEV 193 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-------LIPDV 215 ++ L ++ + D S S+ + D + + + E++ Sbjct: 194 MQAAEERPTL-IVWLFDQSGSLTRQRQDIRDRFDRIYEELGMLREQLDAKTAGEDPSDPS 252 Query: 216 NNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVST-NSTPGLKYA---YNQI 270 V + ++ F K++ F + +++ + + T ++ A YN + Sbjct: 253 EARVLTSIIGFGEKVQLFTEEPTADLDLIKQTVADIPVDNSGTERVFTAIESAAKQYNSL 312 Query: 271 FDMQGMRQHCNTEDANYKKIIVFM--TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 G R K+ ++F+ TD + S+ C +K G VY +G+ Sbjct: 313 RRSNGPRG--------PKRNVMFVVVTDERGDDAHLLESSIGSC---RKWGIPVYVVGVP 361 Query: 329 VIRSHE 334 E Sbjct: 362 APFGRE 367 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 69/182 (37%), Gaps = 26/182 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P + N+ + +++++ LD+S SM + D +++D + + ++ + Sbjct: 73 VARPQIGNKIANNTSREGIEVIMALDISNSMLA-TDVVPSRLDKSKLMVEGLMNKFTKN- 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 + GL+ F+ + + + ++ T+ + A Sbjct: 131 ------KLGLIVFAGDAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKAINLAM--- 181 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 H T + K IV +TDGE+ + + +A+++G V+ +GI Sbjct: 182 --------HSFTPNTQTGKAIVVITDGEDN----EGGAEAMAKQAQEKGIKVFILGIGST 229 Query: 331 RS 332 + Sbjct: 230 QG 231 >gi|326798073|ref|YP_004315892.1| von Willebrand factor type A [Sphingobacterium sp. 21] gi|326548837|gb|ADZ77222.1| von Willebrand factor type A [Sphingobacterium sp. 21] Length = 622 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 42/273 (15%), Positives = 104/273 (38%), Gaps = 30/273 (10%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPIT 159 F+N+ + ++ + + Y++ A + P+ + PW + R + + + Sbjct: 181 FINNGQ-LPPKDAIRLEEMINYFQYNLPA-PQNNEPVAIHTELSQAPWNPHHRLLRIALK 238 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + +++ +++ ++DVS SM+ ++ + S+ ++++++ V V Sbjct: 239 AKA-IDAAKLPPANLVFLIDVSGSMD-----GPNRLPLVKSSLKMLVDQLRKEDRVAIVT 292 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +G + +I+ + ++ I L G + GLK AY+ R+H Sbjct: 293 YAG----TARIKLAPVWANEKMRIKNAIDELDAGGST-AGGAGLKMAYDL------AREH 341 Query: 280 CNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----E 334 + N I+ +DG+ N+ ++ + ++ G + +G ++ E Sbjct: 342 FKKDGNNR---IILASDGDFNVGPSSNEDMETLIEKERQSGVSLSVLGFGMVNLKDSKME 398 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 L N Y+ + S G T Sbjct: 399 LLANKGHGNYAYIDNLMEAKKAMISEFGATFFT 431 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 SYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|198424353|ref|XP_002120419.1| PREDICTED: similar to mCG120740 [Ciona intestinalis] Length = 1650 Score = 52.1 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 67/166 (40%), Gaps = 25/166 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD+S SME++ ++ + ++++ ++ V +N V G+V F+++ Sbjct: 375 IVLVLDISTSMENYG-----RMGLMRQAVSNFIDTV----PMNTWV--GIVVFASRANTL 423 Query: 234 FLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S+ R I + ++ V T+ G+ + + Sbjct: 424 ARLTEITSYDARNILKTRLVNTTVVGTSIGSGIMKGLEVLETSGPRSLRGSGGS------ 477 Query: 291 IVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 I+ +TDG E+ + K + ++ G V I + + + Sbjct: 478 IIILTDGLEHNNPKIND----TIERVREFGVRVSTIALGSNVAKDL 519 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + V + +F Sbjct: 215 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVMEMLDTLSDDDYVT------VASF 262 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 K + ++ ++ + G +T+ G +YA++Q+ + R + Sbjct: 263 HEKADPVSCFRQLVQANVRNKKVIKEAVQEMVARG-TTDYKAGFEYAFSQLQNTSITRAN 321 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N K +++G A Sbjct: 322 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--KTVRVFTFSVGQHNYDVTPLQWMA 372 Query: 339 CASPNSFYLVENPHSMY 355 CA+ ++ + + ++ Sbjct: 373 CANKGYYFEIPSIGAIR 389 >gi|170591963|ref|XP_001900739.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158591891|gb|EDP30494.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 447 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 14/119 (11%) Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 ++ I+ + +TN+ LK+A R N N K+ V +TDG+ Sbjct: 1 MEWAIQRIKYLSGATNTGAALKFALE--------RGFQNARGGNIPKVAVVVTDGQ---- 48 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGI-RVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + + +VYA+G+ ++ H+ + +P VE+ + + Sbjct: 49 -SQDSVAESAQQLRDAHVMVYAVGVTNLVNVHQLHQIAGNPARVLTVESFDELSKTLAD 106 >gi|294667358|ref|ZP_06732577.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292602910|gb|EFF46342.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 1223 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 60/174 (34%), Gaps = 20/174 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++ + A + LD+S SM+ + T++ +INA+L++V Sbjct: 140 PEKAEIAMKAPASASLYFALDLSGSMDEVASNGQTRLTNMKTAINAVLDQVGEAVARGAA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + +V F + G++ L+ + S + + + F Sbjct: 200 IDCMIVGFGTYPSSRQSILRRNLTAGGIAELKSWVSGRSSSFTTYFTAGVM--DMPDFFG 257 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +++ F+TDGE + S EA GAIV ++ Sbjct: 258 GGEAFA---------RRLAFFITDGEPVDAGSGMTSAQIAAEA---GAIVSSVA 299 >gi|110331845|gb|ABG67028.1| inter-alpha (globulin) inhibitor H3 [Bos taurus] Length = 889 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 20/193 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIE 231 ++ V+DVS SM K++ ++ +LE+VK +N ++ SG + T+ + + Sbjct: 282 SVVFVIDVSGSMHG------RKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLV 335 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + + G+ TN L + + + +H E + II Sbjct: 336 PATPE--NIQEASKFVMDIQDRGM-TNINDALLRGISMLNKARE--EHTVPERSTS--II 388 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----F 345 + +TDG+ N+ ++ A +Y +G ++ FL A N Sbjct: 389 IMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRI 448 Query: 346 YLVENPHSMYDAF 358 Y + + F Sbjct: 449 YEDSDANLQLQGF 461 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 62/177 (35%), Gaps = 26/177 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + LD+ +++D S+SM + D ++ A I ++++ ++ Sbjct: 71 SFIGLAKINKEGLDIYVLIDTSKSMLAE-DIKPDRLSRAKNIIESIIDNLEGD------- 122 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 + G + FS+ L + + + G TN L A N + Sbjct: 123 RIGFIPFSSAAYIQMPLTDDYDLARMYLDVIDTDMIAGGGTNVGTALNLAENSFEETSSA 182 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++++ ++DGE + + S+ V+ IGI + Sbjct: 183 -----------DRVVIILSDGE----EHNSNSVDILKSFNDEHLKVFTIGIGTAKGG 224 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 44/293 (15%), Positives = 91/293 (31%), Gaps = 30/293 (10%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 ++ N N + + + +G ++ + ++ + + Sbjct: 2624 VVQTPNSNGTYTVTLDVAENELTGRVSAELVAPHDYDGSLDFDLSVSATSEEKVGADTET 2683 Query: 125 EGYS--ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 S ++ + I++ K N ++ +V+D S Sbjct: 2684 TTVSEPVTLSGSDLVVGNNVDNTLEGHGGNDILIGDQGGYKTNVTPGVNYNIALVVDASG 2743 Query: 183 SMESFF------------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 SM + S++T++DM +++ ++E + N L+ F + I Sbjct: 2744 SMGDYVYNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLVTHDGSIN---IKLIGFDDNI 2800 Query: 231 EEFFLL------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F V+ L KI+ G T+ G + A N Sbjct: 2801 DVTFEALDITNSSDVVAELLSKIENNLPVGGGTDYGVGFEEANNWYASSSISSNG----- 2855 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 Y+ + F+TDGE S + Y A V A+G+ L+ Sbjct: 2856 --YENMTFFLTDGEPNSGTLNNGLTEYNELVSTHNAKVMAVGMGNDIDDSVLK 2906 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 91/244 (37%), Gaps = 37/244 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + ++ + +T ++S + +++ +V+D S SM S +I+ A + Sbjct: 70 LANSEDKNYLKISLTG-FNLDSTRRSPINLALVIDRSTSM------SGERIEKAREEAIL 122 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + + ++ +V + N I + + + +++ G++ Sbjct: 123 AVNMLNITDTLS------VVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAG 176 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 + + +H N E N I+ ++DG+ N + A K+G Sbjct: 177 -------VSMGIGQVDKHLNREQVNR---IILISDGQANTGPTSISELSDLARMAAKKGI 226 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH--------IGKDIVTKRI 370 + IG+ + + + A A S + V N + AF+ + +DIV + Sbjct: 227 AITTIGLGQDYNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQDIVVQIK 286 Query: 371 WYDK 374 DK Sbjct: 287 TGDK 290 >gi|156120445|ref|NP_001095368.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] gi|160332333|sp|P56652|ITIH3_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|154425846|gb|AAI51420.1| ITIH3 protein [Bos taurus] gi|296474789|gb|DAA16904.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Bos taurus] Length = 891 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 74/193 (38%), Gaps = 20/193 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIE 231 ++ V+DVS SM K++ ++ +LE+VK +N ++ SG + T+ + + Sbjct: 284 SVVFVIDVSGSMHG------RKMEQTKDALLKILEDVKQDDYLNFILFSGDVTTWKDSLV 337 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + + G+ TN L + + + +H E + II Sbjct: 338 PATPE--NIQEASKFVMDIQDRGM-TNINDALLRGISMLNKARE--EHTVPERSTS--II 390 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----F 345 + +TDG+ N+ ++ A +Y +G ++ FL A N Sbjct: 391 IMLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGLARRI 450 Query: 346 YLVENPHSMYDAF 358 Y + + F Sbjct: 451 YEDSDANLQLQGF 463 >gi|149919601|ref|ZP_01908080.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] gi|149819544|gb|EDM78972.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] Length = 349 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 74/221 (33%), Gaps = 34/221 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 + +++D S SM+ F I++ ++ +V ++ + ++ GL + N + Sbjct: 79 ITLLVDRSGSMDEDFGG-ISRWQAVGDTLLD--PDVGVVAPLQGDIRFGLSLYDNPGDMC 135 Query: 232 ---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E L + + T + L + + + + Sbjct: 136 PRVESTPLALNALADMTALYQSAAPEGDTPTGSALTSVADVV----------AQDPDPGE 185 Query: 289 KIIVFMTDGEN------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 K+IV TDGE + +++ A +G + + S + L+ A Sbjct: 186 KVIVLATDGEPDTCAQPNPDEGQPEAVAAAQAAYAQGVRTVIVSVGSGISADHLQDMANA 245 Query: 341 --------SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 S +Y + S+ DAFS I + I D Sbjct: 246 GAGVQPGGSDAVYYQALDQASLIDAFSEIIAGVRECTIDLD 286 >gi|124008260|ref|ZP_01692956.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123986209|gb|EAY26038.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 552 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 53/312 (16%), Positives = 112/312 (35%), Gaps = 34/312 (10%) Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRSTSLDIVVVPQNEGYS 128 KK + +K +F ++ + + I ++S+ + Y Sbjct: 86 EKKPANENTFLSVKTAPLSTFSIDVDNASYSRARKSINNGQLPSTSSVRLEEFINYFNYQ 145 Query: 129 ISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + P T + + H+V +++S+ +++ ++DVS SM Sbjct: 146 YK-QPEGQHPFSVNTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSNLVFLIDVSGSM-- 202 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 S+ K+ + K+ ++ + V VV +G + + + Sbjct: 203 ---SAPDKLPLLRKAFKMLVNNLGEEDRVAIVVYAG----NAGLVLPATQGTDKQKIMEA 255 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKED 305 + L + G ST G+K AY +I +++ N I+ TDG+ NL D Sbjct: 256 LDKL-QSGGSTAGGAGIKLAY-KIAKQNFIKEGNNR--------IILATDGDFNLGASSD 305 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSHIG 362 Q E +K G + +G+ + + + ++Y ++N + +A+ G Sbjct: 306 QAMQNLIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYLDN---LNEAYKVFG 362 Query: 363 KDIVTKRIWYDK 374 KD+ K Sbjct: 363 KDLKGTLFTIAK 374 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 83/214 (38%), Gaps = 36/214 (16%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ +Q +++ +LD S SM I A ++++ L +++ +VN Sbjct: 291 STGGAVAQQMPSREVVFLLDTSGSMAG------ESIVQAKRAVDFALTQLRPEDNVN--- 341 Query: 220 QSGLVTFSNKIE-----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ F++ + + + + L G T P L A N+ Sbjct: 342 ---IIQFNDAPQALWKRAMPATAKHIQRARNWVASLHADG-GTEMAPALTLALNK---PS 394 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 R + ++ + +VF+TDG ++ +L E+K ++ IGI Sbjct: 395 LHRDDSDLLGSHKLRQVVFITDGS----VSNEDALMSLIESKLADNRLFTIGIG------ 444 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 ++PNS+++ + + F++IG + Sbjct: 445 -----SAPNSYFMTQAAQAGRGTFTYIGDIQQVQ 473 >gi|148250139|gb|ABQ53165.1| MtsD [Myxococcus fulvus HW-1] Length = 659 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 58/159 (36%), Gaps = 30/159 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKID--------------------MAIKSINAMLE 207 + + +++V+D S SM D ++D ++++ +++ Sbjct: 55 SGFPVKVVVVIDESGSM-CVSDPPGAQLDSGFCQRREILDIIPEGVTEPARVRALKRLVQ 113 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-------VSHLQRKIKYL-SKFGVSTNS 259 + + + VQ + F + + +++ I+ L S+ G T+ Sbjct: 114 QFREVNAQGGNVQVSVAPFETNVRNVWPPATTGDRFARPDNNIDSYIEGLQSQLGKGTDY 173 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IVFMTDG 297 L YAY+ I N E + +VF+TDG Sbjct: 174 QGALSYAYSFISLDINAVAQSNPELLPRTRYVVVFLTDG 212 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 69/204 (33%), Gaps = 30/204 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK+ K+++ +L +++ N ++TFS+ + Sbjct: 248 VVFVIDVSGSMFG------TKLQQTKKAMDKILSDLQTSDSFN------IITFSDTVNIW 295 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDAN 286 + + + + + G T+ L A + + Q + Sbjct: 296 KAEGSIQATVQNIHNAKNYVSRMEANGW-TDINAALLAAASVLNHSNQEPGKGRGVGQIP 354 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS- 344 I+F+TDGE L +A ++++ LR + N Sbjct: 355 L---IMFLTDGEPTAGETTPSVILSNVRQALAHRVSLFSLAFGDDADFSLLRRLSLENQG 411 Query: 345 -----FYLVENPHSMYDAFSHIGK 363 + + + ++ I + Sbjct: 412 EARRIYEDADAALQLEGLYAEISR 435 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 56/165 (33%), Gaps = 36/165 (21%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG--MRQHCNTEDANYKK 289 L+WG+ L I+ + + A ++ D+ + + D + K Sbjct: 327 TPIALKWGLLLLDPAIQPMLREA-------ARYRALSEELDIDARFSNRPASFTDPDTMK 379 Query: 290 IIVFMTDG------------------ENLSTKEDQQSL--------YYCNEAKKRGAIVY 323 +V MTDG N S D S+ C AK++ IV+ Sbjct: 380 FLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDLYSVGDAERFAAALCTAAKQKNVIVF 439 Query: 324 AIGIRVIR-SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 IG V + + + CAS + N + DAF I I Sbjct: 440 TIGFDVNDTAAKQMSNCASGAERFYRVNALDIQDAFKSIATAIQK 484 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 75/226 (33%), Gaps = 41/226 (18%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F G + ++TA+ L + ++ G I++ +++VL +DR ++ AA+ Sbjct: 15 FLRAKAGSIPVMTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAASLTQTRPP 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + T + + + D + + + S Sbjct: 75 ----------------RETIESFLKAAVTKGSYALD---------VKADELSNAKRVEAS 109 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF--- 187 A + T + + + + + +++ ++LD+S SM Sbjct: 110 ATAVTD------TAFLRLIGIDKLTVEAHAEAE---EKRKNIEISLLLDMSGSMRFDKSG 160 Query: 188 ---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 S +I+ + + ++ V L + +V ++ ++ Sbjct: 161 SYPGPSGAMRINYLRPAAKSFMDMV-LADGAEDYTTVSIVPYAGQV 205 >gi|158337332|ref|YP_001518507.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307573|gb|ABW29190.1| von Willebrand factor, type A domain protein [Acaryochloris marina MBIC11017] Length = 686 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/251 (15%), Positives = 81/251 (32%), Gaps = 32/251 (12%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A + P T + + + + + +++ ++DVS SM Sbjct: 269 ASPKGDQPFSVSTEVATAPWNNQHKLVHIGLKGKELEKEQPSNLVFLIDVSGSM-----K 323 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIK 248 K+ + KS+ ++ ++K + LV ++ + G + I Sbjct: 324 RPNKLALVKKSLCLLVHQLKPED------RVSLVVYAGRAGIVLPSTPGTQKATIMNAID 377 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ 307 L G + G+K AY+ +H N ++ TDG+ N+ D + Sbjct: 378 RLEAGGST-AGAAGIKMAYDM------AERHFLKNGNNR---VILATDGDFNVGQSSDAE 427 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 + + RG + +G E L + N Y ++ +A + Sbjct: 428 LERLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGNYAY----IDTLLEAQKVLVN 483 Query: 364 DIVTKRIWYDK 374 D+ K Sbjct: 484 DLRGTLFTIAK 494 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 65/171 (38%), Gaps = 40/171 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--------LIPDVNNVVQSGLVT 225 + +VLD S SM +++ +++ +++ L ++ ++ D ++++ L+ Sbjct: 307 VCLVLDKSGSM-----AAMDRLNRMNQAVKLFLLQITEKGSWVGIVLFDERAIIRNPLIQ 361 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ + +L+ G T+ G++ A+ I Sbjct: 362 IISEDDRNYLMTRLPE----------AAGGGTSICSGVQAAFQAIKQKFQTTDGSE---- 407 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF 335 IV +TDGE ++ C E K+ GA ++ + + + E Sbjct: 408 -----IVLLTDGE-------DVTVSSCFEEVKQSGATIHTVALGTSAAQEL 446 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 83/267 (31%), Gaps = 46/267 (17%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ---TDARLDMMIVLDVSRSM 184 +I S L F + + + +++ + VK+ ++ T+ +M V+D+S SM Sbjct: 43 TIIKTSSQMRGLSFLSKSLVFLSLIALIVAASEPVKLKTEYVYTETANSIMFVIDISPSM 102 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + + T+I A I ++ + GL ++ Sbjct: 103 AAKDINEKTRIQAAKDIITDFVQT----YPADAF---GLTALASTAALVIPPTIQHEQFF 155 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 ++ L + + G+ A H + I+ +TDGE+ Sbjct: 156 ARLNSLQIGELGEGTALGMGLAV--------AAAHFAKNTVKTQS-IILLTDGESN--TG 204 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----------------------LRACAS 341 + K + Y IGI LR A Sbjct: 205 EIHPNLAAELIKSKKIGFYIIGIGKDGYANLEYVDPSTGEKREGTLQTIFNERELRELAH 264 Query: 342 PNS--FYLVENPHSMYDAFSHIGKDIV 366 + + ++ S+ + F +I ++I Sbjct: 265 RGNGIYVSAKSFASLQEIFKNISQNIS 291 >gi|288573236|ref|ZP_06391593.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] gi|288568977|gb|EFC90534.1| von Willebrand factor type A [Dethiosulfovibrio peptidovorans DSM 11002] Length = 225 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 63/176 (35%), Gaps = 23/176 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 R+ + +VLDVS SM I+ + + + +K + +++F Sbjct: 16 NPTPRVPVSLVLDVSGSM------LGAPIEELNRGVELFFKSLKDDDVARYSAEVSVISF 69 Query: 227 SNKIEE---FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 SN++ + F LE + I L G T + A + + + + Sbjct: 70 SNEVTQEVDFGPLE------KCDIPELKAIGK-TRMGGAVSLALESLEKRKEL--YRTLG 120 Query: 284 DANYKKIIVFMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 Y+ +V MTDG+ ++ ++ K V+ I I + L+ Sbjct: 121 VDYYQPWMVIMTDGKPNDDWQLAAAKTSALVDKGK---LTVFPIAIGDNACTDTLK 173 >gi|186896947|ref|YP_001874059.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|186699973|gb|ACC90602.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] Length = 233 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 67/189 (35%), Gaps = 14/189 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ S + + RL + +++D S SM I I AM+ ++ P Sbjct: 7 LILTPLSLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDP 60 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V ++T+ N+ E+ L + + T + L+ I + Sbjct: 61 YALESVHLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCV 115 Query: 274 QGMRQHCNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + Q + + + +VF MTDG +++ K+ + A + Sbjct: 116 ERDIQRSDGDQKGDWRPLVFLMTDGTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAK 173 Query: 333 HEFLRACAS 341 HE L+ S Sbjct: 174 HEHLKQLTS 182 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 70/176 (39%), Gaps = 26/176 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + ++D + + + ++ + + Sbjct: 79 GTKLETVRSQGVDIVFAVDVSKSMLAE-DIAPNRLDKSKQLVTQIINSLASD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ ++ K + + + ++ ++ T + ++ A D + Sbjct: 131 VGIIAYAGKAFPQLPITTDYASAKMFLQNMNTDMLSSQGTAISEAIELAKTYYDDEEQTN 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++++ ++DGE+ +++ EA + G + +G+ ++ Sbjct: 191 -----------RVLIIISDGEDHG----GEAVDIAEEANEEGIRILTVGVGDVKGG 231 >gi|264679151|ref|YP_003279058.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] gi|262209664|gb|ACY33762.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] Length = 1405 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 87/283 (30%), Gaps = 22/283 (7%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF-RNELRDNGFVNDIDDIVRSTSL 116 + +G GN L + + N ++ + L G+ + + Sbjct: 575 ASAGLKLLTLDGPGNADASLTLDRLADLVNNPITLTTSKGTLTLTGYDATTGKVSYTYQT 634 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN---SRHIVMPITSSVKVNSQTDARLD 173 + ++ + + KF + + + + Sbjct: 635 SGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLGVLITDTAPSLKPIAESSALSSHGT-N 693 Query: 174 MMIVLDVSRSMESFFDSS------ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +M+ LD S SM + ++++D+ S+N +L++ V+ ++ F+ Sbjct: 694 IMLTLDTSGSMAWSSGVNNSNGWSLSRLDVLKSSVNGLLDK----YGEAGDVRVLILEFN 749 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ + + L G TN L A + + N N Sbjct: 750 SSATQKGSGWMSLAEAKTFVNGLYADG-GTNYQDALTKAMAAWNNSGTGKLEGN----NV 804 Query: 288 KKIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIR 328 + I F TDGE S + S + Y IG+ Sbjct: 805 QNISYFFTDGEPDSNRSVSSSQQTTWEKFLADNHINSYGIGLG 847 >gi|149923979|ref|ZP_01912364.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149815157|gb|EDM74708.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 785 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 69/193 (35%), Gaps = 20/193 (10%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I + T + +Q +++ +LDVS SM SS K+ + ++E++ Sbjct: 336 QIGVQATRELPAQAQELRTRNLVFLLDVSGSM-----SSRGKLPLIKHGFTQLVEQLGAE 390 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V+ VV +G + + + L G TN + G+ AY + Sbjct: 391 DHVSIVVYAGAAG----VVLPPTSGDQKETILGALDRLEA-GGGTNGSAGIVEAYE-LAQ 444 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + N ++ TDG+ N+ + + + ++ G + +G+ Sbjct: 445 ANFVDGGVNR--------VILGTDGDFNVGLSDHDALVELIEQKRESGVFLSVLGVGGHY 496 Query: 332 SHEFLRACASPNS 344 E + A + Sbjct: 497 DDELMEQLADHGN 509 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 77/204 (37%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + ++ ++ L + V G+VTF + Sbjct: 309 VCLVLDKSGSM-----TIGNRLKRLNQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQ 358 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L S +R L+K +T S + F + + + + I+ Sbjct: 359 SELVQINSATER--DTLTKSLPTTASGGTSICSGLRSAFTVIKKKYPTDGAE------II 410 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE 349 +TDGE+ ++ C +E K+ GAI++ + + + E + Y + Sbjct: 411 LLTDGEDN-------TISACFDEVKQSGAIIHTVALGPSAAQELEQMSKMTGGLQTYASD 463 Query: 350 NPHS--MYDAFSHI--GKDIVTKR 369 + + DAF+ + G V++R Sbjct: 464 QVQNNGLVDAFAALSSGNKAVSQR 487 >gi|198417752|ref|XP_002129197.1| PREDICTED: similar to Clca1 protein [Ciona intestinalis] Length = 1034 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 47/314 (14%), Positives = 101/314 (32%), Gaps = 38/314 (12%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSF--RNELRDNGFVNDIDDIVRSTSLDIVVV 121 +I+ + + + L C+ W+ + + +VN+I+ D + Sbjct: 204 EIVQDVCNYDPQTLLPNSTDCKFILAWDQDLDLKASIMSYQYVNEINGFCDDNDNDPLNR 263 Query: 122 PQNEGYSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL---DMMIV 177 E + K + T +T R++ P +S + ++V Sbjct: 264 HNREAPNEHNDKCNKRSVWDVITSSVDFTGGRNLANPNVASTIPTFRVVKPFPYRSFVLV 323 Query: 178 LDVSRSMESFFDSSITKI-DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 LDVS SM + +TK+ + ++ ++ G+ FS + L Sbjct: 324 LDVSGSM---WGGRLTKMRQIMNTFVDDFVQRGD---------YVGITIFSTIARKLSPL 371 Query: 237 E--WGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 S I+ L + ST G+ A + + I+ Sbjct: 372 TRIRDQSDRASLIRRLPRSVRGSTCIGCGINSAVQIM----------EQHSPDLCGDIIV 421 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFY--LVEN 350 TDGE ++ K+ V A+ + +R A+ +++ ++ Sbjct: 422 FTDGEENVEPSVADVH---DKVVKKKCRVSAVFFTTTANQALVRLVDATSGTWFYGDTDD 478 Query: 351 PHSMYDAFSHIGKD 364 + A++ + K Sbjct: 479 ITPLIGAYNQLSKS 492 >gi|26331510|dbj|BAC29485.1| unnamed protein product [Mus musculus] Length = 323 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/268 (12%), Positives = 90/268 (33%), Gaps = 25/268 (9%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ- 123 ++ + G C + + L+++ ++++ +I + SL + + Sbjct: 61 LVGSPWSGFPENRMGDVYKCPVDLPTATCEKLNLQNSASISNVTEIKTNMSLGLTLTRNP 120 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSR 182 G ++ + + + +S Q L D+++V D S Sbjct: 121 GTGGFLTCGPLWAHQCGNQYYATGICSDVSPDFQFLTSFSPAVQACPSLVDVVVVCDESN 180 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--V 240 S+ + + + + + + P Q L+ ++N+ F L Sbjct: 181 SIYPW--------EAVKNFLVKFVTGLDIGP---KKTQVALIQYANEPRIIFNLNDFETK 229 Query: 241 SHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + + + G TN+ +++A + + + K++V +TDGE+ Sbjct: 230 EDMVQATSETRQHGGDLTNTFRAIEFARDYAYS------QTSGGRPGATKVMVVVTDGES 283 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + CN + + I + Sbjct: 284 HDGSKLKTVIQQCN---DDEILRFGIAV 308 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 51/281 (18%), Positives = 97/281 (34%), Gaps = 42/281 (14%) Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI---PLKFCTFI 145 + RN + +D+ ++ + +P + ++ A+ Y I L Sbjct: 285 PFESYLRNVYNVEKRSSVPEDLSPYYAVVVQDMPASRIGNVDALQEYVIDGNGLLVVGGP 344 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL--DVSRSMESFFDSSITKIDMAIKSIN 203 + N + + S + V + IVL DVS S Sbjct: 345 NSFENGNYESSSLASMLPVTTGDGRGQSTNIVLSIDVSGS----------SKGGMRVQKA 394 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---FLLEWGVSHLQRKIKYLSKFGVSTNST 260 L +K + D N V G+V F+++ L L +I+ L G +T+ Sbjct: 395 VSLSALKQLGDENEV---GIVGFNHRTYSVAERQPLGPNREALADRIRRLQA-GGATDIA 450 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 GL+ A + D G ++ ++DG + ++S+ Y + + G Sbjct: 451 GGLRGAGKMLGDDPGT--------------VILISDGHDR----VEESISYAKQLRSEGK 492 Query: 321 IVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFS 359 + AIG + + LR A S S++ + + F Sbjct: 493 RIIAIGAGKNPNEKNLRTIARASGGSYFRATETNRLNILFG 533 >gi|67968521|dbj|BAE00622.1| unnamed protein product [Macaca fascicularis] Length = 480 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 8/109 (7%) Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 ++ + + N +I+ MTDGE L Q +L +A+K GA V Sbjct: 1 MQAGFRKAIQQIETFNSGNK----VPSMIIAMTDGE-LVAHAFQDTLREAQKARKLGANV 55 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 Y +G+ + + SP + VEN + A + +K Sbjct: 56 YTVGVADYKLDQITAIADSPEHVFAVENG---FKAMRDTVDALTSKVCL 101 >gi|327285304|ref|XP_003227374.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Anolis carolinensis] Length = 3587 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 65/193 (33%), Gaps = 42/193 (21%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V+ + ARL+++ ++D S S+ ++ + +L + ++P + Sbjct: 66 VRRLREGSARLELVFLVDESSSVG------QANFRSELRFVTKLLSDFPVVPTA---TRV 116 Query: 222 GLVTFSN------KIEEFFLLEWGVSH-------------LQRKIKYLSKFGVSTNSTPG 262 +VTFS+ +++ G + L R+I + G T + Sbjct: 117 AIVTFSSKNHVLSRVDYISPPSPGRTQPQQQRQQQHKCALLGREIPGIGYRGGGTYTKGA 176 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + A + AN K+I +TDG + ++ G + Sbjct: 177 FQQAAQILLHS----------RANATKVIFLITDGYSNG----GDPRPIAASLREFGVEI 222 Query: 323 YAIGIRVIRSHEF 335 + GI E Sbjct: 223 FTFGIWQGNIREL 235 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 77/204 (37%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + ++ ++ L + V G+VTF + Sbjct: 307 VCLVLDKSGSM-----TIGNRLKRLNQAGKLFL-----LQTVEQGSWVGMVTFDSAAYVQ 356 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L S +R L+K +T S + F + + + + I+ Sbjct: 357 SELVQINSATER--DTLTKSLPTTASGGTSICSGLRSAFTVIKKKYPTDGAE------II 408 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE 349 +TDGE+ ++ C +E K+ GAI++ + + + E + Y + Sbjct: 409 LLTDGEDN-------TISACFDEVKQSGAIIHTVALGPSAAQELEQMSKMTGGLQTYASD 461 Query: 350 NPHS--MYDAFSHI--GKDIVTKR 369 + + DAF+ + G V++R Sbjct: 462 QVQNNGLVDAFAALSSGNKAVSQR 485 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 58/146 (39%), Gaps = 10/146 (6%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G +L I + I + G+ +++ H + KT L + +D + + A +++N+ Sbjct: 12 RGVTMVLFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGA-KVLND------- 63 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 G T+NM EL D G V ++ V T N Y + ++ + Sbjct: 64 MHDVGQATAAALTTFNMHLEGELADAGLVPTVE--VSETLSPFAPGGINPRYLRARVNDF 121 Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSS 161 + + +P N++ + + Sbjct: 122 PMQVWLAQVLPGIGNTQSVGGSAVAG 147 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 65/179 (36%), Gaps = 26/179 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S ++ ++ +I LD+S SM + D + +++ + + ++ + Sbjct: 76 PQVGSKIATNKEREGIETIIALDISNSMLAE-DVAPSRLGKSKLIVENLMNKFSED---- 130 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDM 273 + GL+ F+ + + + ++ T+ L+ + N Sbjct: 131 ---KIGLIVFAGDAFVQLPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFT-- 185 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ K I+ +TDGE+ + + +A+ +G V+ +G+ Sbjct: 186 ---------PNSKVGKAIILITDGEDN----EGGAEEMAKQAQSKGIRVFILGVGSTEG 231 >gi|269217609|ref|ZP_06161463.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] gi|269212544|gb|EEZ78884.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] Length = 327 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 69/193 (35%), Gaps = 13/193 (6%) Query: 174 MMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + V+D + SM +D +++ I + V IP+ + +++F++ Sbjct: 71 VFFVVDTTGSMNAEDYDGKKPRLEGVRADIRQI---VSRIPNA----RYSIISFNSGATR 123 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + V+ ++ ++ + ++ + + + + + + + + + V Sbjct: 124 ELPITTDVAAVESWVQTVKPEQSGYSNGSSVYRPVKVL--KRELERSRRSHPQSVQLVYV 181 Query: 293 FMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 F +DGE K Q++ N G + Y S Y +E+P Sbjct: 182 F-SDGEPRGDKGQQKTYDQIRNLVDDGGVLGYGTAAGGPMKETTFSGTGSGTGSY-IEDP 239 Query: 352 HSMYDAFSHIGKD 364 + A S I + Sbjct: 240 QTRQTAISKIDEK 252 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/194 (13%), Positives = 68/194 (35%), Gaps = 32/194 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAI----KSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 I +D S SM + S + ++ +E + + + + ++TF + Sbjct: 568 AICVDCSGSMSTNDKSFKDDNGVLTCYRNIAVQNYVESMFVFDNAS------IITFESSA 621 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E + L K + ++ G + NS + A +++ KK Sbjct: 622 SEECEMTNNKRTLSGKASFYNRGGTNANS--AIDIAIDEL------------NHVYGKKN 667 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 I+ ++DG+ + ++ + YC K G ++ + + + + L+ A + + Sbjct: 668 IILLSDGDVNVSDDN---IKYC---KNNGIRIHTVALGSGANSQLLKQYANDTGGTPLTA 721 Query: 349 ENPHSMYDAFSHIG 362 + + Sbjct: 722 TTAEGLTKIYESYA 735 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/282 (13%), Positives = 86/282 (30%), Gaps = 21/282 (7%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF-RNELRDNGFVNDIDDIVRSTSL 116 + +G N L + + N ++ + L G+ + + Sbjct: 624 ASAGLKSLTLDGPDNADATLTLDRLADLVNNPVTLTTSKGTLTLTGYDATTGKVSYTYQT 683 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN---SRHIVMPITSSVKVNSQTDARLD 173 + ++ + + KF + + + + Sbjct: 684 SGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLGVLITDTAPSLKPIAESSALSSHGT-N 742 Query: 174 MMIVLDVSRSMES-----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +M+ LD S SM + ++++ + S+N +L++ V+ +V F+ Sbjct: 743 IMLTLDTSGSMNYGSGVYNGWTQLSRLAVLKSSVNNLLDK----YGEAGDVRVMIVEFNT 798 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + ++ + + L +G TN L A N + + E N + Sbjct: 799 SASQKGGGWMSLAEAKALVSGL-GYGGGTNYQTALDTAMNAWNNSGTGKL----EGGNVQ 853 Query: 289 KIIVFMTDGENLSTKEDQQS--LYYCNEAKKRGAIVYAIGIR 328 I F TDGE S + + + Y IG+ Sbjct: 854 NISYFFTDGEPDSNRSVNSAQQATWEKFLADNHINSYGIGLG 895 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/201 (19%), Positives = 75/201 (37%), Gaps = 32/201 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM S+ ++D+ ++ L ++ + N G+V F++ Sbjct: 309 VCLVLDVSGSMASY-----DRLDLMNRAAKHFLSQI-----IENRSWVGMVHFNHLANIK 358 Query: 234 FLLEWGVSHLQRK--IKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S+++R ++ L T+ G+K A+ + Sbjct: 359 SELIQMNSNIERNQLLQTLPTSADGGTSICSGIKAAFQVFKNGGYETDGTE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 I+ ++DGE D + +E K GAIV+ I + + Y Sbjct: 410 ILLLSDGE------DSTAKDCIDEVKDSGAIVHFIALGPSADLAVTNMSILTGGKHMYAS 463 Query: 349 ENPHS--MYDAFSHIGKDIVT 367 + + + DAF + + V Sbjct: 464 DEAQNNGLIDAFVALASENVD 484 >gi|25244377|gb|AAN72414.1| complement component C2 [Rattus norvegicus] Length = 758 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 77/218 (35%), Gaps = 28/218 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ L++ ++LD S+S+ ++ +S M++ + V Sbjct: 250 KIQILRSGHLNLYLLLDASQSVSE------KDFNIFKESAFLMVDRIFSFEIK---VSVA 300 Query: 223 LVTFSNKIEEFFL-LEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQ 274 ++TF+++ + L ++ + L TN+ L Y + + Sbjct: 301 IITFASRPKIIMSVLNERSQNVMEVMDSLDSVCYKDHENATGTNTYEALNSVYLMMNNQM 360 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + I+ +TDG++ ++ + + + + +YAIG+ Sbjct: 361 DRLGMETSAWQEIRHAIILLTDGKSNMGGSPKPAVDNIREILGISRNRNDYLDIYAIGVG 420 Query: 329 VIRS-----HEFLRACASPNSFYLVENPHSMYDAFSHI 361 + +E +++++ ++ F HI Sbjct: 421 KLDVDWKELNELGSKKDGERHAFILQDAKAVQQVFEHI 458 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 74/197 (37%), Gaps = 30/197 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + VN + +F Sbjct: 334 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTLSDDDYVN------VASF 381 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + K + + ++ + G +T G +YA++Q+ + R + Sbjct: 382 NEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAKG-TTGYKAGFEYAFDQLQNPNITRAN 440 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRA 338 CN K+I+ TDG ++ + + N + +++G A Sbjct: 441 CN-------KMIMMFTDGGEDRVQDVFEKYNWPN--RTVRVFTFSVGQHNYDVTPLQWMA 491 Query: 339 CASPNSFYLVENPHSMY 355 C + ++ + + ++ Sbjct: 492 CTNKGYYFEIPSIGAIR 508 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 88/244 (36%), Gaps = 27/244 (11%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +I+ K L + + ++ + +T ++S+ + +++ +V+D S SM Sbjct: 29 AITPAVTVKSELSAPVMLANSEDKNYLKISLTG-FNLDSKRRSPINLALVIDRSTSMTG- 86 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQR 245 +I+ A ++ + + ++ +V + N E L Sbjct: 87 -----DRIEKAREAAILAVNMLDASDTLS------VVAYDNNAEVIIPATKVNNKPALIA 135 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKE 304 KI+ T G+ Q+ D + N I+ ++DG+ N Sbjct: 136 KIQQHIHPMGMTALFAGVSKGIGQV-DKNLNPEQVNR--------IILLSDGQANTGPTS 186 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG 362 + A K+G + IG+ + + + A A S + V N + AF+ Sbjct: 187 ISELSDLARMAAKKGIAITTIGLGEDYNEDLMTAIAGYSDGNHSFVANSADLESAFTKEF 246 Query: 363 KDIV 366 +D++ Sbjct: 247 QDVM 250 >gi|119623953|gb|EAX03548.1| complement component 2, isoform CRA_a [Homo sapiens] Length = 437 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 58/167 (34%), Gaps = 18/167 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + TN K+ Q L++ ++LD S+S+ + +S Sbjct: 224 FSHMLGATNPTQKTKESLGR-KIQIQRSGHLNLYLLLDCSQSVSEN------DFLIFKES 276 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KF 253 + M++ + V ++TF+++ + L + I L + Sbjct: 277 ASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHEN 333 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 G TN+ L Y + + + + I+ +TDG++ Sbjct: 334 GTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSN 380 >gi|302533890|ref|ZP_07286232.1| toxic cation resistance protein [Streptomyces sp. C] gi|302442785|gb|EFL14601.1| toxic cation resistance protein [Streptomyces sp. C] Length = 242 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 58/172 (33%), Gaps = 25/172 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 ++ +S+ + R + +VLD S SM ++D + + + Sbjct: 15 APGLVNLYKSAGASLSKHGMEGGRASVYLVLDYSGSMRPYYDDG--SVQAFADRVLGLSA 72 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLK 264 + + +V FS ++ + ++H + ++ ++ T+ + Sbjct: 73 HLDTDG------RVPVVFFSTDVDAVEEVT--LAHHEGQVTAIAARLGHMGKTSYHLAMD 124 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 + D ++VF TDG +S ++ Y C A+ Sbjct: 125 AVIDHYLDSGCAD----------PALVVFQTDGGPVSKLAAER--YLCKAAR 164 >gi|198419582|ref|XP_002123026.1| PREDICTED: similar to CLCA family member 1, chloride channel regulator [Ciona intestinalis] Length = 1001 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 76/202 (37%), Gaps = 39/202 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++VLDVS SM ++ M +S + + + +++ + G+V F + + Sbjct: 236 FVVVLDVSGSMRG------KRLLMMRQSTSEFISSL-----LSDGDKIGIVQFHSFAQTL 284 Query: 234 FLLEWGVSHLQRKIKYL-------SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + H+ + ++ G ST G++ A ++ Sbjct: 285 LPIR----HVNSQTDRFDICSRFPNRTGGSTCIGCGIQAAMQEMERDD---------PTE 331 Query: 287 YKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNS 344 IV +TDG EN S ++ + A RG + AI + ++ ++ + Sbjct: 332 PCGHIVVLTDGMENRSP----YTVDVSSRAVNRGCTIDAIFLTTTQNTALVQLVNRTSGR 387 Query: 345 FYLVE--NPHSMYDAFSHIGKD 364 ++ + + + AF+ I + Sbjct: 388 WFFAQDRDLRRLTGAFAVIADE 409 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 73/209 (34%), Gaps = 27/209 (12%) Query: 161 SVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + S+ LD ++ ++D S S+ + K+ + + + A+ V Sbjct: 46 TAGSRSKRQMPLDQYNVVYMVDSSNSVR-KRGFNREKVALRLLTEKALPGTVYST----- 99 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +TFS + E F +++ + L TN+ L A ++ Sbjct: 100 ------ITFSTEAEISFRYANRSQAIEK-LNDLPYMACKTNTQLALNLA--EMIFFNNTL 150 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEF 335 + I+ TDG++ +Q +LY KKRG +Y + + + HE Sbjct: 151 GPLRPGR----RRILIFTDGQSNVK--EQMTLYRAFRLKKRGVEIYVVAVGKYLYGMHEI 204 Query: 336 LRACASPNSF-YLVENPHSMYDAFSHIGK 363 + S + Y V + I + Sbjct: 205 IGLATSSSHHLYRVRSMKDFVKIVQLIPR 233 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 25/199 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+++ +D+S SM + + + ++ + + + + LV F++ Sbjct: 79 QSSRDLILAVDLSDSMRTQDMLDEGEQRDRLSAVRQQISRLIETRPGD---RIALVVFAD 135 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L L + L G +T + A RQH + + Sbjct: 136 HAYLLSPLTQETKALLTLTRELDFDLVGRTTALGEAILLA----------RQHADPKRPT 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSH-EFLRACASPNS 344 ++ +TDG N + D L A G +Y +G+ + E L+ S S Sbjct: 186 A---LLLVTDGRNTAGSADP--LSEARRAAASGMTLYTLGVGADPDTFVEALQPAQSDPS 240 Query: 345 FYLVENPHSMYDAFSHIGK 363 E ++ + +G+ Sbjct: 241 ---AELDEALLQQLAKVGQ 256 >gi|312139258|ref|YP_004006594.1| type ii secretion system integral membrane subunit [Rhodococcus equi 103S] gi|311888597|emb|CBH47909.1| putative type II secretion system integral membrane subunit [Rhodococcus equi 103S] Length = 622 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/254 (13%), Positives = 83/254 (32%), Gaps = 33/254 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 + V + +S ++ I + + + ++V Q + D+ Sbjct: 31 QVSAVDTTRFPDIEVSILAPPGIEGQAIDPGTFALT----EGGVPREIEVRQQPGSEQDI 86 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++ +DVS M S +D ++ + + + G+V S+ + Sbjct: 87 VLAIDVSGGM------SGPALDDVKRAASDFVRQAPAGAH------IGIVAISSTPQVLS 134 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L L R+I L G S A + + + + + + I++ + Sbjct: 135 ELTTDSEDLLRRIDGLKAGGNSA-------IADSVVTAAEMLERGEAANN-----ILLLL 182 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACA--SPNSFYLVENP 351 TDG T + + A +YA+ + ++ L+ A S + + Sbjct: 183 TDG--ADTSSAHSMSELPSVLSRSRASLYAVQMSTPETNSALLQQVARESRGQYASAGDT 240 Query: 352 HSMYDAFSHIGKDI 365 ++ + + + Sbjct: 241 AALGAIYQSAARAL 254 >gi|198436525|ref|XP_002124218.1| PREDICTED: similar to laminin, beta 2 [Ciona intestinalis] Length = 671 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 28/168 (16%) Query: 219 VQSGLVTFSNKIEEFFLLEW--------------GVSHLQRKIKYLSKFGVSTNSTPGLK 264 +Q G+V +S+ L R + + G T + + Sbjct: 73 IQIGVVQYSHW-YHNRPLNNQRYIKTEIELGEHMNKDDFDRAVDRIKYQGFKTYTAHAIN 131 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 F R + K+ ++ +TDG+ + + + + RG I+YA Sbjct: 132 KTLEFDFLGPKNRY------PDAKRALILLTDGKATDSSYLPDVIQHAE--RDRGVIIYA 183 Query: 325 IGIRVIRSHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +G+ S E L N + +EN + I + + Sbjct: 184 VGVGDFDSSELQLITHHHENREFELENFEDL----DSIVNSLQFQLAT 227 >gi|156743748|ref|YP_001433877.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235076|gb|ABU59859.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 936 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/239 (16%), Positives = 81/239 (33%), Gaps = 29/239 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA- 198 + W +P+T + Q + +++V+D+S SM + D I K+ +A Sbjct: 379 QSFGPGGWRGTPVEAALPVTMDI-PERQRQPPVSIVVVIDISGSMAATEDG-IPKLSLAL 436 Query: 199 --IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ-RKIKYLSKFGV 255 + I A+L + + + + G++ L + ++ + G Sbjct: 437 EGARRIAALLRDEDELTVIPFDDRPGVI--------VGPLPGSRRDVAIEQLNQVRLGGS 488 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 N L+ A + I+ +TDG T + + +L Sbjct: 489 GINIHDALRVA-----------ARYTRASERPVRHIITITDG--NDTTQQEGALDIVRSL 535 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 G + ++ I F+R A+ +L E + D + + I+T I Sbjct: 536 HDEGVTLTSVAIGQGDHVPFIRDMAAVGGGRTFLTERAADVPDLLTGETQTIMTPYIVE 594 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 53/374 (14%), Positives = 107/374 (28%), Gaps = 49/374 (13%) Query: 24 AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDIL 83 A+ +P++ I+ + + D + + + + + Sbjct: 20 AMLIPMVIAAASTIVIGYQVQLSNRAMQ-AADAASIACEFKGEYDQALTQSYLDYYQPKI 78 Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDD--IVRSTSLDIVVVPQNEGYSISAISRYKIPL-- 139 +++ + + ++ I + TS + + Y + + L Sbjct: 79 DKVRGQIRTNSGCNMSLGYSLSTIFTSLTLSDTSFVVSSTANEKAYVTEDVVSDPLELVI 138 Query: 140 ------------------------KFCTFIPWYTNSRHIVMPI-TSSVKVNSQTDARLD- 173 + HI + I S V+ L+ Sbjct: 139 VLDISTSMYGAINDLKAILKRGIVSLKEQQNNAQSEDHIKVSIIPFSTGVSVNNAPWLND 198 Query: 174 -MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +D + E F ++ T +A I VKL +F+ Sbjct: 199 ARTFCVDGTTESEDKFYAART---VANLDITHDQISVKLSQPNKWRESCSAASFT----- 250 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD--MQGMRQHCNTEDANYKKI 290 L + + + L G T S GL + Q+ + N ++ Sbjct: 251 -LPLTADLDQVTNTVDSLRTEG-GTASYQGLIWGLRQLTPNWQKAWEVGPNRNVDKVERK 308 Query: 291 IVFMTDGENLSTK--EDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRACASPNSFY 346 +V MTDG N + +D + C+ AK G + +G V S +F R P + Sbjct: 309 LVLMTDG-NDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGSRLEQFTRCAVDPKGVF 367 Query: 347 LVENPHSMYDAFSH 360 + + FS Sbjct: 368 SASDTQDLDHYFSQ 381 >gi|304411390|ref|ZP_07393004.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|307305288|ref|ZP_07585036.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] gi|304350245|gb|EFM14649.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|306911591|gb|EFN42016.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] Length = 771 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 70/197 (35%), Gaps = 32/197 (16%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ ++Q ++++V+D S SM I A ++ L+ +K Sbjct: 370 NYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLK 423 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + +F +S ++ + L G T L Sbjct: 424 PEDSFN------IIEFNSSLSQFSATPLPATSSNLSRARQFVSRLQADG-GTEMALALDA 476 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + + + ++FMTDG N D ++ Sbjct: 477 A------LPKSLGSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES------RLF 524 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 525 TVGIGSAPNSHFMQRAA 541 >gi|294630425|ref|ZP_06708985.1| toxic cation resistance protein [Streptomyces sp. e14] gi|292833758|gb|EFF92107.1| toxic cation resistance protein [Streptomyces sp. e14] Length = 244 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/244 (17%), Positives = 77/244 (31%), Gaps = 29/244 (11%) Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 T D P++ + K ++K N T AR Sbjct: 1 TPTPEDSTPTPEDSTATPEDSPTPKPATPLARVKSRAPLLTTAYKAAAPAIKKNDLTGAR 60 Query: 172 LDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +VLD S SM ++ D S + ++ A L V + FS ++ Sbjct: 61 AKTYLVLDRSASMRPYYKDGSAQSLAEQTLALAAHL------DPEATVH---VTFFSTEV 111 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + + L + G +T ++ + H + Sbjct: 112 DGTATLTLADHENKIDEVHAGLGRMGRTTYHA-----------AIEEVLAHHDAHHPTTP 160 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF--LRACASPN-S 344 ++VF TDG + Q+L + AK + ++ + + F LR + N S Sbjct: 161 ALVVFQTDGAPDAKTPATQAL--ADAAKTHPHVFFSFVAFGEHDNKAFDYLRKLKTGNTS 218 Query: 345 FYLV 348 F+ Sbjct: 219 FFHA 222 >gi|125975601|ref|YP_001039511.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715826|gb|ABN54318.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 1300 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/282 (14%), Positives = 88/282 (31%), Gaps = 39/282 (13%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 F + ++ ++ T+ V Y + I R+ + N Sbjct: 3 FYYDEENDTIKFLETEVDEETNTIKTTVDHFSIYGVIDIVRFAQSWGIKDILDKLLNPGG 62 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +P + + + D++ V+D + SM S + N + + Sbjct: 63 ETIPPVAEI-------GQADIVFVIDTTGSMGSVIN----------NVKNNITNFANTLM 105 Query: 214 DVNNVVQSGLVTF----SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + N V+ GL+ + + ++ L W VS + + G Sbjct: 106 ENNVDVRLGLIDYKDLEEDGMDSTKNLGWFDNVSDFIASVNNMRATGGGDAPE------- 158 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTD---GENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + ++ R+ N K I+ TD E+ ++ Q + K+ +V A Sbjct: 159 STVDALEEARRMDFRPGVN--KFIMLFTDVSYKESTRFEDVQSMKTVIEKLKEDKIVVSA 216 Query: 325 IGIRVIRSHEFL-RACASPNSFYLVENPHSMYDAFSHIGKDI 365 I V +E L R + + A + +I Sbjct: 217 I---VPSGYESLYRNLYTETGGVYANITQAFSSALQSLISNI 255 >gi|219684622|ref|ZP_03539565.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219685812|ref|ZP_03540621.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] gi|219671984|gb|EED29038.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219672645|gb|EED29675.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] Length = 333 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRRFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF----Y 346 IV +TDG S + + N A+ +Y+IGI S + Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSLK 252 Query: 347 LVENPHSMYDAFSHIG 362 + +P + + G Sbjct: 253 EIYDPSMLVEISHKTG 268 >gi|299533511|ref|ZP_07046889.1| isocitrate dehydrogenase [Comamonas testosteroni S44] gi|298718494|gb|EFI59473.1| isocitrate dehydrogenase [Comamonas testosteroni S44] Length = 688 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 61/163 (37%), Gaps = 21/163 (12%) Query: 174 MMIVLDVSRSMESFF-------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++++LD+S SM +++ + ++++ ML+ + DV V+ VTF Sbjct: 275 LLLMLDLSGSMNWGQENNKNPAPGEKSRLQIMKEAVSLMLDSYAALGDVK--VRI--VTF 330 Query: 227 SNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 N + W ++ + + L+ G T + L+ A + N Sbjct: 331 QNTAAQARQTTWIDIATAKSIVNALTATG-GTPYSKALETAMQAFNSAGKIDGGKN---- 385 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 I F+TDGE ++ E + + + ++ Sbjct: 386 ----IAYFLTDGEPTASYEIDAAREALWKTFVDANEINSMAFG 424 >gi|294789255|ref|ZP_06754493.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] gi|294482680|gb|EFG30369.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] Length = 346 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 63/172 (36%), Gaps = 14/172 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+DVS SM + + + IN ++ E++ P V ++ F+ + Sbjct: 3 RLPIFLVIDVSESM------VGSPLRHMQEGINQLINELRKDPYALETVYLSVIAFAGVV 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L G T+ L + ++I + D YK I Sbjct: 57 KTLAPLM----ELYAFYPPRLPVGAGTSLGAALNHVMDEITQKVILSSPTQKGD--YKPI 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + M+DG + + + + ++ + ++GI L ++ Sbjct: 111 VYLMSDGSATDNPDAAIARWKRDFMQRA--TLVSVGIGPFADLSRLSEISTS 160 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|149773089|emb|CAO01894.1| collagen type VI alpha 6 [Mus musculus] Length = 226 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 29/205 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + + I+ M+ + P N + L +S+ + Sbjct: 26 DVVFLVDSSDHLG------LKSFPLVKTFIHKMISSL---PIEANKYRVALAQYSDALHN 76 Query: 233 FFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + +L K G S L+ A+ F + + Sbjct: 77 EFQLGTFKNR-NPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSAPTNGRDKK----QFP 131 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+V + S + + ++ G + ++G++ L+A A+ + + Sbjct: 132 PILVVL-----ASAESEDDVEEAAKALREDGVKIISVGVQKASEEN-LKAMATSQFHFNL 185 Query: 349 ENPHSMYDAFS----HIGKDIVTKR 369 + F+ I KD+ R Sbjct: 186 RTARDL-SVFAPNMTEIIKDVTQYR 209 >gi|301617755|ref|XP_002938301.1| PREDICTED: integrin alpha-E-like [Xenopus (Silurana) tropicalis] Length = 1148 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 74/208 (35%), Gaps = 25/208 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGLVT 225 +D ++ IVLD S S+ A I+ M+ K + VVQ G + Sbjct: 182 SDTGTEIAIVLDGSGSISE------EDFQKAKDFISNMIGMFWEKCLQCEFAVVQYGAI- 234 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + E L + +K++ + + G T + L + +F + + Sbjct: 235 --IQTEFDLLESRKGRFILQKVQDIKQVGNVTKTASALDHVLESVFTEE------HGSSE 286 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE----FLRACAS 341 KII+ +TDG+ D N +K + + IG+ E L+ AS Sbjct: 287 TATKIILVLTDGDIFMDPMDIN--DVMNNSKMKKIERFVIGVGEAFQKEKALKTLKTIAS 344 Query: 342 --PNSFYLVENPHSMYDAFSHIGKDIVT 367 V++ + + + + I+ Sbjct: 345 QGEEHLLTVDDYSKLEGLLTSLQQKIIG 372 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 58/148 (39%), Gaps = 20/148 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSN 228 LD+++V+D S SM +I K +N ++ + NN+ + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTLADSGITNNININMGYVGYSS 320 Query: 229 KIEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ G ++ IK ++ T + L+ A + + G Sbjct: 321 DGYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG---------- 370 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + Sbjct: 371 -HKKVIVLLTDGVPTFSYTVSRVQTEAD 397 >gi|329902233|ref|ZP_08273073.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] gi|327548825|gb|EGF33456.1| Putative MxaC-like protein [Oxalobacteraceae bacterium IMCC9480] Length = 338 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 80/265 (30%), Gaps = 56/265 (21%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD--- 189 +R +I L + + P T+ K S + ++++LD S SM+S Sbjct: 53 ARLEIALGMLAIVATVLGPAGLASPATTIEKTGSGAE----ILVLLDRSASMDSALQEKG 108 Query: 190 ---------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 + K +A ++ P GL+ FS + Sbjct: 109 AKTPLTDKYAEPKKRKIARTALAGF---AAGRPHDA----IGLMMFSENQFQVMPFNMRP 161 Query: 241 SHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 +Q I+ + +T+ + A D +II+ ++DG Sbjct: 162 DMIQAAIQAGGVGSGLGNTDVGSAMLAALRTFDDRPDSGS----------RIIMLVSDGG 211 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-------------------LRAC 339 + L + K+ +Y I +R R+ Sbjct: 212 AQIAPAVR--LQIADGLKRNRIALYWIYLRSYNQPALADSDSAEFDGVVEVAMHRYFRSL 269 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKD 364 ++P + +N SM A +G+ Sbjct: 270 STPYRAFEADNQASMQRAIEAVGRQ 294 >gi|222635792|gb|EEE65924.1| hypothetical protein OsJ_21784 [Oryza sativa Japonica Group] Length = 578 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 69/187 (36%), Gaps = 28/187 (14%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSR--HIVMPITSSVKVNSQTDARLDMMIVLDVSRSME- 185 +S IP ++ +++ + + + +D++ VLDVS SM Sbjct: 1 MSVAPVKVSTTPIFPTIPRAQTNKDFQVLLRVEAPPAADLNGHVPIDVVAVLDVSGSMND 60 Query: 186 -----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEEFFL---- 235 + T++D+ S+ ++ ++ + +V F++ ++E+ Sbjct: 61 PVAASPESNLQATRLDVLKASMKFIIRKLDDGD------RLSIVAFNDGPVKEYSSGLLD 114 Query: 236 -LEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 G S +KI L V+ P L+ A + RQ + I+ Sbjct: 115 VSGDGRSIAGKKIDRLQAVVAVALRLCPELQEAVKIL----DERQGNSRNRVG---FILL 167 Query: 294 MTDGENL 300 +TDG++ Sbjct: 168 LTDGDDT 174 >gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica OS195] gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS195] gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein [Shewanella baltica OS678] Length = 771 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 70/197 (35%), Gaps = 32/197 (16%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ ++Q ++++V+D S SM I A ++ L+ +K Sbjct: 370 NYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLK 423 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + +F +S ++ + L G T L Sbjct: 424 PEDSFN------IIEFNSSLSQFSATSLPATSSNLSRARQFVSRLQADG-GTEMALALDA 476 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + + + ++FMTDG N D ++ Sbjct: 477 A------LPKSLGSASPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES------RLF 524 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 525 TVGIGSAPNSHFMQRAA 541 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 36/106 (33%), Gaps = 6/106 (5%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 G ++++ AI + + + + +++ I+ + L + D S + A + + G + Sbjct: 11 EGGAISVIVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCARDATGVDC 70 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 + + + D+D R S+ Sbjct: 71 ST------TSALASGLANRNALDGMSKVHTIDLDKTTRKVSVTTSA 110 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/131 (21%), Positives = 48/131 (36%), Gaps = 6/131 (4%) Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD--MQGMRQHCNTEDANYKKIIV 292 L ++ + + L G T S GL + Q+ + N ++ +V Sbjct: 252 PLTADLNQVTNAVDSLRTEG-GTASYQGLIWGLRQLTPNWQKAWEVGPNRNFDKVERKLV 310 Query: 293 FMTDGENLSTKEDQ-QSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRACASPNSFYLVE 349 MTDG + + D+ + C+ AK G + +G V R +F R N + Sbjct: 311 LMTDGADYGSHFDELINAGLCDRAKDYGVALNFVGFGVYGARLEQFTRCAGDANGVFSAS 370 Query: 350 NPHSMYDAFSH 360 N + FS Sbjct: 371 NTQELDSYFSQ 381 >gi|162452306|ref|YP_001614673.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] gi|161162888|emb|CAN94193.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] Length = 408 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 74/246 (30%), Gaps = 58/246 (23%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ ++IVLD S SM ++ S + M ++ L+ ++ + + GL F + Sbjct: 95 QVHLVIVLDRSDSMTKDWEGS-DRWTMMKGALGTALDVIR------DRMSVGLQLFPSDE 147 Query: 231 EEFFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L V+ + + T+ A + + G Sbjct: 148 HCGMPAGEDLSVAVAPGATSVPAIKTLLDGTDPGGATPTA-DALARALGYLTVGAGGALE 206 Query: 287 YKKIIVFMTDGENLSTK------------------------------------EDQQSLY 310 K ++ TDG + +D +++ Sbjct: 207 GDKYVLLATDGGPNCNQDPAMTCEAATCTTNMDGDCPSGVPNCCVAGLTDVCLDDVRTVQ 266 Query: 311 YCNEAKKRGAIVYAIGI-RVIRSHEFLRACA---------SPNSFYLVENPHSMYDAFSH 360 + + G + +GI L A + ++ V + S+ D + Sbjct: 267 RVKDLRAAGIKTFVVGIPGATAYAHVLDQLAVEGDTATSETSPRYFEVVDAASLGDTLTG 326 Query: 361 IGKDIV 366 I +D+V Sbjct: 327 ITRDLV 332 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 89/242 (36%), Gaps = 42/242 (17%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARL---DMMIVLDVSRSMESFFDSSITKIDM 197 F + + MP + +VNS T + L + VLD+S SM I+ Sbjct: 299 FTQHVEGQGYGLLLTMPPQVNHQVNSTTSSALFHQSVTFVLDISGSMYG------ESIEQ 352 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI----EEFFLLEWGV-SHLQRKIKYLSK 252 A +++ L++++ N +VTF+++ E+ + + R + L Sbjct: 353 AKQALRYGLQQLQPEDSFN------IVTFNHEAMLYSEQLLPVTSSTITRALRFVDGLDA 406 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLY 310 G + + LK A++ + H + IVF+TDG N S D Sbjct: 407 DGGTEMAA-ALKAAFSI-------KTHDQLNSTRWLNQIVFITDGSVGNESALFDLIEQQ 458 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYD----AFSHIGKD 364 + ++ +GI + F+ A ++ + + + FS I + Sbjct: 459 LVDR------RLFTVGIGSAPNSYFMTRAAMKGKGTYTYIGDVKEVNTKMRLLFSKISQP 512 Query: 365 IV 366 ++ Sbjct: 513 VM 514 >gi|45332244|gb|AAS58046.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 655 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 59/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +LD IV+D S S+ + ++ + I + ++ V L N ++ L T Sbjct: 39 QCKKQLDFSIVVDESASISDDQWEGQM------IPFLRNLIHTVDL---DNTDIRLSLTT 89 Query: 226 FSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQ 278 +S + F L+ S + + L + T + L Y I Sbjct: 90 YSTPTRQIFTFLDAAASSTRLALTKLDWMNGTKARYGMTYTGRALNYVRKAILPY----- 144 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ----GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 77/226 (34%), Gaps = 31/226 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + ++LDVS SM + +++ A ++ N +L+ +V + Sbjct: 20 GAAAGEPTGQSAPKVNLLLDVSGSMRAKDIDGQSRMAAAKQAFNEVLD--ATPEEVELGI 77 Query: 220 QSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 ++ + + ++ L + + + L+ G T P L A + Sbjct: 78 RTLGANYPGDDRKEGCKDTAQLYPVGPL--NRTEAKTAVATLAPTGW-TPIGPALLKAAD 134 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + G K IV ++DGE+ D + A+ G + +G+ Sbjct: 135 DLDGGDGS------------KRIVLISDGEDTCAPLDPCEVAREIAARGIGLTIDTLGLV 182 Query: 329 VIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +C A+ ++ VE+ + D + + + Sbjct: 183 PNAKLSRQLSCIAEATGGTYTSVEHQDELTDRVNELVDRAAEPVVT 228 >gi|281416613|ref|ZP_06247633.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281408015|gb|EFB38273.1| von Willebrand factor type A [Clostridium thermocellum JW20] Length = 1363 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 42/282 (14%), Positives = 88/282 (31%), Gaps = 39/282 (13%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 F + ++ ++ T+ V Y + I R+ + N Sbjct: 66 FYYDEENDTIKFLETEVDEETNTIKTTVDHFSIYGVIDIVRFAQSWGIKDILDKLLNPGG 125 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +P + + + D++ V+D + SM S + N + + Sbjct: 126 ETIPPVAEI-------GQADIVFVIDTTGSMGSVIN----------NVKNNITNFANTLM 168 Query: 214 DVNNVVQSGLVTF----SNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + N V+ GL+ + + ++ L W VS + + G Sbjct: 169 ENNVDVRLGLIDYKDLEEDGMDSTKNLGWFDNVSDFIASVNNMRATGGGDAPE------- 221 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTD---GENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + ++ R+ N K I+ TD E+ ++ Q + K+ +V A Sbjct: 222 STVDALEEARRMDFRPGVN--KFIMLFTDVSYKESTRFEDVQSMKTVIEKLKEDKIVVSA 279 Query: 325 IGIRVIRSHEFL-RACASPNSFYLVENPHSMYDAFSHIGKDI 365 I V +E L R + + A + +I Sbjct: 280 I---VPSGYESLYRNLYTETGGVYANITQAFSSALQSLISNI 318 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 22/173 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +++ ++DVS SM S K+ + +++ + E+ V+ VV +G Sbjct: 210 DIERDERPPANLVFLMDVSGSM-----GSPDKLPLVQTALSGLAGELGEQDRVSIVVYAG 264 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ + LS G + G++ AYN I + + N Sbjct: 265 AAGL------VLEPTNDTAKIRAALMSLSAGGST-AGGAGIQLAYN-IAEDNFIEGGVNR 316 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ + + + + RG + +G +E Sbjct: 317 --------VILATDGDFNVGVSDRDALVEMVEKNRDRGITLTTLGFGTGNFNE 361 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 41/203 (20%), Positives = 86/203 (42%), Gaps = 24/203 (11%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFD 189 +S+Y+ +KFC + T ++ K++ + ++++I LD+S SM + Sbjct: 50 VSKYRPTIKFCLLLSAITILIFMIARPQVGSKISHEKREGIEVLIALDISNSMMAQDVIP 109 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 S + K + I+ L+ N + GLV F+ + + ++ Sbjct: 110 SRLEKSKLLIE---------DLVDHFTND-KVGLVVFAGDAFVQLPITNDYVSAKMFLQN 159 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ++ ++T T L A + + Q C T+ + + I+ +TDGE+ + +L Sbjct: 160 INPSLITTQGTD-LARAIS-------LSQSCFTQREHIGRAIIVITDGEDH----EGGAL 207 Query: 310 YYCNEAKKRGAIVYAIGIRVIRS 332 EA K+G V+ +G+ + Sbjct: 208 EAAREAYKKGINVFILGVGTSKG 230 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|78776856|ref|YP_393171.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497396|gb|ABB43936.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 595 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 76/193 (39%), Gaps = 24/193 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F+ + + P+ K + + D+MI LD+S SM + D +++ A Sbjct: 55 NALFFLIGFLIIIALAQPVIKEAKTLIKAKSA-DIMIALDISDSMLAE-DVYPKRLESAK 112 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + A L+E K + G+V F+ L + + ++ L ++ Sbjct: 113 RKALAFLKEAK-------DERVGVVAFAKDSYLVSPLSFDKHSVSFLLEQLDTTSITEQG 165 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + + G + ++ KK+++ ++DG + S ++ +L AKK G Sbjct: 166 SD--------FLSVIGAVEKIQKDEK--KKVLLILSDGGDKSDFSEEIAL-----AKKSG 210 Query: 320 AIVYAIGIRVIRS 332 ++ +GI + Sbjct: 211 ITIFILGIATKQG 223 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 311 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 364 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 365 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 413 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 414 HKKVIVLLTDGVPTFSYKVSRVQTEAD 440 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|328872160|gb|EGG20527.1| hypothetical protein DFA_00388 [Dictyostelium fasciculatum] Length = 2097 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/367 (10%), Positives = 121/367 (32%), Gaps = 64/367 (17%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 F + ++ + D + A + + + + +N + I ++ Sbjct: 1611 FGGAKLAKYITDLIETIAHAQGFSD---LSKAMETLLKQSLDNPSLTEFVKIQKFVELVE 1667 Query: 91 NMSF----------RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 +S + +L + N++D + S++ N + ++ L Sbjct: 1668 QVSVDQSLYTSNAQKKQLAQDTASNEVDAVGGSSTAVTTTHVSNSLTNREKTQLNEMELN 1727 Query: 141 FCTFIPWYTNSRHIVMPI--------------TSSVKVNSQTDARLDMMIVLDVSRSMES 186 ++ + T + + LD++ ++D + SM Sbjct: 1728 ILFGYVKDLSTLKACSLVNKLWRKVTLYPTHWTRFIGNIGGDERDLDLVFLVDNTGSMSG 1787 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS----NKIEEFFLLEWGVSH 242 + KI +++++ I +VN V+ G+V ++ + + + F ++ Sbjct: 1788 EIEQCKDKIK-------EIVDDISAIGNVN--VRVGMVFYNDHPVSNVCQVFDFTDNIAK 1838 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 ++ ++ ++ +G + + + ++ + N +++V + D Sbjct: 1839 MKTQLSSVTVYG-GDDEPEAMADGFYEVNKLSFT--------PNSTRVLVLIGDANPHGF 1889 Query: 303 KEDQQSLY---YCNE-----AKK----RGAIVYAIGIRVIR-SHEFLRACA--SPNSFYL 347 + CN A+K + +Y++ R +++ A + S + Sbjct: 1890 GGNGDHYPGGCPCNHDIIELARKLVIEKRVTIYSVLCRPANHTYQVFSALSDLSEGRLFT 1949 Query: 348 VENPHSM 354 + N + Sbjct: 1950 LSNAAEL 1956 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|149502255|ref|XP_001506498.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 183 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 65/194 (33%), Gaps = 25/194 (12%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S S+ + T+I ++S+ ++ +++ + FS++ L Sbjct: 1 SGSVRHHW----TEIYSFVESLAEKF--------ISPMLRMSFIVFSSRGTTVMKLTEDR 48 Query: 241 SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 ++R + L G T G A N+ +I+ +TDGE Sbjct: 49 EAIRRGLNILRSEVPGGDTFMHEGFIRA-------NEQISFENSGGLRTASVIIALTDGE 101 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAF 358 + + A+ GAIVY +G++ + S + + V + A Sbjct: 102 LQRDQFYYAEKE-ADRARSLGAIVYCVGVKDFNETQLSTIADSIDHVFPVTGG---FHAL 157 Query: 359 SHIGKDIVTKRIWY 372 + I+ K Sbjct: 158 RGVIDSILKKSCIE 171 >gi|324508797|gb|ADY43711.1| Vitrin [Ascaris suum] Length = 658 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 47/302 (15%), Positives = 108/302 (35%), Gaps = 46/302 (15%) Query: 36 MIIEVSHIFFMKTVLHSM--IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 + +S +F +L DR V + + + + + + +S Sbjct: 337 YAVTMSDRYFKDELLIYTGNADRVFVRMKGKEKDFATSVDLNRCGPEGDIKKNDEIQQLS 396 Query: 94 FRNELRD-----NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 R+ L++ + +N+ + R I P + +I F Sbjct: 397 VRDVLKELNESSSHTLNEDIGVERKPDAGIDNDPDINNLIAETSTPEEITTNFTR----- 451 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 I +PI S+ ++D+M ++D S S+E+ F + + +++ Sbjct: 452 -----IPLPIDSNC--------QVDLMFIIDRSESVENEFQKQL-------QFAVDLVKR 491 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + V+ ++F +K + F + S + + + G ST++ G+ A Sbjct: 492 MSTSDFASR-VRVAAISFYSKAKLEFSFDEFKEQSKVLEALLQIEHIGGSTSAVSGVNLA 550 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++I +N + ++V ++DG S + L + + A VYA+ Sbjct: 551 VDEI---------KKAGRSNARHMVVLISDG--NSQDTWDKVLEAADRLRAIDADVYAVT 599 Query: 327 IR 328 + Sbjct: 600 VS 601 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 76/191 (39%), Gaps = 25/191 (13%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 ++ + +S+ K+P I + +R + +V+S DA +D++++LD S Sbjct: 161 EDAAHYLSSCPGKKLPESSLEAINQHAIAREVD-------EVSSCNDA-IDLILLLDKSA 212 Query: 183 SMESFFDSSITKIDMAIKSI-----NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 SM FDS+ I+ +KS + + + + N V G +S++ + F LE Sbjct: 213 SMVDDFDSAKKFIEELVKSAPPGDYKSRIRVALVTFNDNAHVDFGFEKYSSRDDISFALE 272 Query: 238 W---------GVSHLQRKIKYLSKFGVSTNST---PGLKYAYNQIFDMQGMRQHCNTEDA 285 VS L ++ + + + + + + + +Q N Sbjct: 273 RAENDGGETSAVSGLNAAMEEMKANRRNASRLVRRSCIGWWIQRCMGVGEYKQQNNYTMK 332 Query: 286 NYKKIIVFMTD 296 N + V M+D Sbjct: 333 NAEVYAVTMSD 343 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|322790292|gb|EFZ15291.1| hypothetical protein SINV_15500 [Solenopsis invicta] Length = 2314 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/246 (14%), Positives = 76/246 (30%), Gaps = 37/246 (15%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSMESFFDSSI 192 K+ + + S N +++++ ++D S S+ + Sbjct: 84 KMNIDQLRNKTDQEALEPLDSSDRGSFDFNPNELTILYQVELVFLVDASGSVGA------ 137 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---------IEEFFLLEWGVSHL 243 D ++ + + + + L+TF + I + L Sbjct: 138 ---DNFRSELSFVTKLLSDFTVDTMAARIALITFGGRGNVYRNVDQISRHGPNDHKCYLL 194 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ + ++ G T + L A + K++ +TDG + Sbjct: 195 NKQFRNITYSGGGTYTRGALLEALAIL----------EKSREAASKVVFLITDGFSNG-- 242 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 + K GAIV+A GIR E L ASP + S + F + + Sbjct: 243 --GDPRPAAHLLKNTGAIVFAFGIRTGNVEE-LHDIASPPWYAHSYFLDSFAE-FEALAR 298 Query: 364 DIVTKR 369 + + Sbjct: 299 RALHRA 304 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 35/139 (25%), Positives = 59/139 (42%), Gaps = 26/139 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 T A LD++ VLDVSRSM S K+ + +++ ++E ++ + +V FS Sbjct: 230 TRAPLDLVTVLDVSRSM------SGPKLALLKRAMRFVIENLEPSD------RLSVVAFS 277 Query: 228 NKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F L +G Q+ + L G TN GL+ A + D Q C+ Sbjct: 278 SSACRLFPLRKMTAFGQQQSQQAVDSLVADG-GTNIAEGLRKAARVVEDRQARNPVCS-- 334 Query: 284 DANYKKIIVFMTDGENLST 302 I+ ++DG + Sbjct: 335 -------IILLSDGVDSHN 346 >gi|126340390|ref|XP_001364302.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Monodelphis domestica] Length = 1092 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 77/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 248 GAASPKDMLILVDVSGSVSGLT------LRLIRTSVSEMLETLSDDDFVN------VASF 295 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 296 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 354 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHE---F 335 CN KII+ TDG ++++ K + V+ + Sbjct: 355 CN-------KIIMLFTDG------GEERAQEIFAKYNKDKKVRVFTFSVGQHNYDRDPVK 401 Query: 336 LRACASPNSFYLVENPHSMY 355 +AC + +Y + + ++ Sbjct: 402 WKACRNCGYYYEIPSIGAIR 421 >gi|118593261|ref|ZP_01550646.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] gi|118434152|gb|EAV40808.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] Length = 772 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/183 (20%), Positives = 69/183 (37%), Gaps = 34/183 (18%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +++ VLD S SM S I+ + + A ++ ++ ++ + + FSN Sbjct: 362 GQRELVFVLDTSGSM------SGQPIEASKTFMTAAIKALRP----DDYFRI--LHFSND 409 Query: 230 IEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +F E + + LS G + + A N FD + Sbjct: 410 TSQFAGQAVLATERNKQKALKFVADLSAGGGTE-----INQAVNAAFDQ--------AQP 456 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 N +I+VF+TDG D+ ++ + A +YA G+ + L A A+ Sbjct: 457 DNTTRIVVFLTDG----YIGDEATVIKSIANRIGKARIYAFGVGNSVNRFLLDAMATEGR 512 Query: 345 FYL 347 Y Sbjct: 513 GYA 515 >gi|183219595|ref|YP_001837591.1| hypothetical protein LEPBI_I0170 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167778017|gb|ABZ96315.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 368 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 31/197 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + +LD S SM +D T++ +A + + +L + GLV + N+I Sbjct: 37 VFILDASGSMSEKWDGK-TRMAVAKEKLIQVLGGLPKDAS------VGLVAYGNRIAGCQ 89 Query: 235 LLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 G S + +K+ + G ST L+ + Q + Sbjct: 90 SARLYHPIQKGGASIVSQKLTTIVPAG-STPIAQTLQVVGEYLLSDQLETE--------- 139 Query: 288 KKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 I+F++DG E+ KK + I I + E L+ + + Sbjct: 140 ---IIFISDGVESCEGDPKSVLYNLRQSGKKFRLQILGIDID-PKGEEDLKRLSILGDGN 195 Query: 345 FYLVENPHSMYDAFSHI 361 ++ ++ P +F I Sbjct: 196 YFPLKTPEDYDRSFQRI 212 >gi|189909734|ref|YP_001961289.1| hypothetical protein LBF_0165 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167774410|gb|ABZ92711.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] Length = 373 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 31/197 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + +LD S SM +D T++ +A + + +L + GLV + N+I Sbjct: 42 VFILDASGSMSEKWDGK-TRMAVAKEKLIQVLGGLPKDAS------VGLVAYGNRIAGCQ 94 Query: 235 LLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 G S + +K+ + G ST L+ + Q + Sbjct: 95 SARLYHPIQKGGASIVSQKLTTIVPAG-STPIAQTLQVVGEYLLSDQLETE--------- 144 Query: 288 KKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 I+F++DG E+ KK + I I + E L+ + + Sbjct: 145 ---IIFISDGVESCEGDPKSVLYNLRQSGKKFRLQILGIDID-PKGEEDLKRLSILGDGN 200 Query: 345 FYLVENPHSMYDAFSHI 361 ++ ++ P +F I Sbjct: 201 YFPLKTPEDYDRSFQRI 217 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 68/195 (34%), Gaps = 25/195 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + + + ++ D+S SM K+ +A + +++ ++ ++ ++ Sbjct: 401 NREKYPPVSVAVIFDISGSMSEVVGGRQ-KVTLASEGAARVVQLLRDFDEIT------VL 453 Query: 225 TFSNKIEEFFLLEWGVSHLQRK---IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F + ++ + G + I G N L A N + Sbjct: 454 PFDSAVQNQYGPVAGSEREVAQGEIIARGVTGGGGINVHDSLVAAGNVL----------- 502 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA--C 339 + I+ + DG +++ + ++ +E ++ G I I FL Sbjct: 503 KGRNAPIRHIILLADGS--DSQQQENAVRLTDEHRRLGITTSTIAIGNGGDVGFLNNVAV 560 Query: 340 ASPNSFYLVENPHSM 354 A +LVE+ S+ Sbjct: 561 AGGGRHFLVEDALSL 575 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 83/225 (36%), Gaps = 40/225 (17%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +VM + V+ + Q + ++++V+D S SM I A ++ L ++ Sbjct: 357 YSLVMVLPPKVEASEQPNLPRELILVIDTSGSMAG------DSIIQAKNALRYALRGLRP 410 Query: 212 IPDVNNVVQSGLVTFSNKIE--EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 N ++ F++ + L ++ ++ + L G + + A Sbjct: 411 QDSFN------IIEFNSDVSLLSSTPLPATATNLAMARQFVNRLQADGGTE-----MAQA 459 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 N Q + + + ++FMTDG + + + N+ ++ +G Sbjct: 460 LNSALPRQAFN--TASGEDKSLRQVIFMTDGSVGNESALFELIR--NQIGDN--RLFTVG 513 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK-DIVTKRI 370 I ++PNS ++ F++IG D V ++I Sbjct: 514 IG-----------SAPNSHFMQRAAELGRGTFTYIGDVDEVEQKI 547 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|260459671|ref|ZP_05807925.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259034473|gb|EEW35730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 718 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 72/194 (37%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW T+++ + + I V + +++ ++DVS SM+ K+ + + Sbjct: 323 VMPTPWNTHTKLMHVAIKG-FDVKPTEQPKANLVFLIDVSGSMDE-----PDKLPLLKSA 376 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ ++K ++ V +G + + + I L G ST Sbjct: 377 FRLLVSKLKADDTISIVTYAGDAGTVLMPTKIAEKD----KILNAIDNLQP-GGSTAGEA 431 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 G+K AY ++ ++ N ++ TDG+ N+ +D + +K G Sbjct: 432 GIKEAY-KLAQQSFIKDGVNR--------VMLATDGDFNVGQTDDDDLKRLIEQERKTGV 482 Query: 321 IVYAIGIRVIRSHE 334 + G ++ Sbjct: 483 FLSVFGFGRGNLND 496 >gi|170762586|gb|ACB32196.1| MxaC [uncultured bacterium 16A2] Length = 292 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/191 (20%), Positives = 64/191 (33%), Gaps = 35/191 (18%) Query: 174 MMIVLDVSRSMESFFDSSI------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ ++D S SM F +K A + + +E + N+ V G+ FS Sbjct: 85 IVFLIDRSGSMNETFAGRTPGGGEESKASAAKRLLQGFVERGR-----NDFV--GVAAFS 137 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + I + + G+ TN GL A + R Sbjct: 138 TSPMLVMPLTDHRAATKAAIDAVDRPGLDYTNIGRGLAMALSMFKADASDRS-------- 189 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 + I+ ++DG + Q L E KK +Y + FLR SP + Sbjct: 190 --RAILLVSDGAGVIDPRIQDDLR--AEFKKTNVHLYWL---------FLRTAGSPGIYE 236 Query: 347 LVENPHSMYDA 357 L E A Sbjct: 237 LPEAGSDTPQA 247 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|15838706|ref|NP_299394.1| hypothetical protein XF2115 [Xylella fastidiosa 9a5c] gi|9107243|gb|AAF84914.1|AE004026_15 hypothetical protein XF_2115 [Xylella fastidiosa 9a5c] Length = 941 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 56/184 (30%), Gaps = 27/184 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + Q + + +D+S SM+ + +++D ++NA+L+++ Sbjct: 140 PEKAAIGMQMAPCVAVYFAIDLSGSMDYVGGNGRSRLDNMKTALNAVLDQLGQSIASGTA 199 Query: 219 VQSGLVTFSNKIEEFFLL------EWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAY 267 V L F N ++ L G++ L+ + F T P A Sbjct: 200 VDIMLAGFGNAPDQRQTLLRRNCTAQGIAELKSWVSARQALYGTYFPAGTMDMPSFYAA- 258 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N ++ F+TDG + + Y I I Sbjct: 259 ---------------APPNAVRVAFFVTDGVPDPPSATNAQAARADVDQVAHLRCYGITI 303 Query: 328 RVIR 331 + Sbjct: 304 DLAD 307 >gi|327270794|ref|XP_003220173.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 917 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 89/242 (36%), Gaps = 39/242 (16%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + YK + + + +SR + P + +T R+ + +VLD S M Sbjct: 273 KMCSYKSTWEVISGSADFASSRPLGAPPPDPIIHLMKTQYRI-VCLVLDASAQM-----G 326 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 ++ I++ L ++ + G+V F++K L+ S ++R + L Sbjct: 327 KDNRLSRLIQAAKLFLLQI-----IEEGSWVGIVAFNSKGNIQAGLQKVFSDIER--ESL 379 Query: 251 SKFGVSTNSTP-----GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + +T + G+ A+ + C IV +T+GE Sbjct: 380 TSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE---------IVLLTNGEGSD---- 426 Query: 306 QQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE--NPHSMYDAFSH 360 L C ++ + + I++ I S+E L +FY + + + + DAF Sbjct: 427 ---LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTFYATDSLDSNGLIDAFGG 483 Query: 361 IG 362 I Sbjct: 484 IS 485 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|51597756|ref|YP_071947.1| TerY-like tellurite resistance protein. [Yersinia pseudotuberculosis IP 32953] gi|170022819|ref|YP_001719324.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|51591038|emb|CAH22702.1| Putative TerY-like tellurite resistance protein [Yersinia pseudotuberculosis IP 32953] gi|169749353|gb|ACA66871.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] Length = 233 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 66/189 (34%), Gaps = 14/189 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ S + + RL + +++D S SM I I AM+ ++ P Sbjct: 7 LILTPLSLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDP 60 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V ++T+ N+ E+ L + + T + L+ I + Sbjct: 61 YALESVHLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCV 115 Query: 274 QGMRQHCNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q + + + +VF MTDG +++ K+ + A + Sbjct: 116 DRDIQRSDGDQKGDWRPLVFLMTDGTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAK 173 Query: 333 HEFLRACAS 341 HE L+ S Sbjct: 174 HEHLKQLTS 182 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|219848228|ref|YP_002462661.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542487|gb|ACL24225.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 1017 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 91/235 (38%), Gaps = 41/235 (17%) Query: 155 VMPITSSVKVNSQTD-ARLDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKL 211 ++ I + +Q+ LD+++++D S SME + S + L L Sbjct: 16 IISICTVEPSIAQSGVEPLDLVLIIDHSGSMENPKYGRSDPHSMR--------FLAARML 67 Query: 212 IPDVNNVVQSGLVTFSNKIEEFF----LLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKY 265 I +N+ + GL+ FS+ E++ L++ G L+ I + G T L+ Sbjct: 68 IDLLNDEDRVGLILFSDNAEDYSDGLQLVQTGRGRLKENIAKMESQSTGDFTRYKDALEL 127 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A + + R+ ++F+TDG K+++ + R V+ + Sbjct: 128 AGELLGETPANRRAA----------VIFLTDGAPTDMKQEEDYSTALDLFIDRNVPVFLL 177 Query: 326 GIRVIR------SHEFLRACASPNSFY--------LVENPHSMYDAFSHIGKDIV 366 ++ ++ L+ + + +++P S+ AF+ + D+ Sbjct: 178 MLKPKEFDNNAVRNDTLQRISKTLQIFRDNKQTVIEIDDPASIARAFAKVITDLQ 232 >gi|17229274|ref|NP_485822.1| hypothetical protein all1782 [Nostoc sp. PCC 7120] gi|17130872|dbj|BAB73481.1| all1782 [Nostoc sp. PCC 7120] Length = 615 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/238 (14%), Positives = 85/238 (35%), Gaps = 37/238 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + P+ F I S + N L++ +V+D S SM Sbjct: 2 VKTSYEFDQPILPAGFSLKANILLRFRAEIPESPRRN------LNLSLVIDRSGSMAGAA 55 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRK 246 +K+ +++++++ ++ +V + + ++ + L++ Sbjct: 56 LHHA------LKAAESVVDQLEPKDILS------VVVYDDAVDTVVPPQPVTDKPALKKS 103 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI--IVFMTDGE-NLSTK 303 I+ + G++ S LK + +KI ++ +TDG N+ + Sbjct: 104 IRQVRAGGITNLSGGWLK------------GCEYVKHQLDPQKINRVLLLTDGHANMGIQ 151 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASPNSFYLVENPHSMYDAFS 359 + + + + G +G + + L A A+ +FY +++ + FS Sbjct: 152 DPKILTATSTQKAEEGITTTTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/184 (16%), Positives = 66/184 (35%), Gaps = 30/184 (16%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P + ++ ++ +I LD+S SM + S + K + I+ + ++ + Sbjct: 76 PQIGNRISTTKHSFGIETVIALDISNSMLAQDVVPSRLDKSKLLIEDL------FRIFDN 129 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIF 271 GL+ F+ + + + ++ T+ + A Sbjct: 130 DK----VGLIVFAGDAFVQLPITSDFISAKMFLDNINPSLIGTQGTDIGQAINLAM---- 181 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 H + + K IV +TDGE+ + ++ ++A+K G VY +G+ Sbjct: 182 -------HSFSPTSKSGKAIVVITDGEDN----EGRAEEMASKAQKAGIQVYILGVGSTS 230 Query: 332 SHEF 335 E Sbjct: 231 GAEI 234 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 62/166 (37%), Gaps = 15/166 (9%) Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLK 264 + VK P++ ++ +T+S + L + + + L +TN G K Sbjct: 79 DVVKKFPNLK--MRVSFITYSTQGHTLMELTSDRNKIHNSLSRLKNIKPTGATNMHEGFK 136 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + N N +I+ +T G L+ K Q++ +A++ GA VY Sbjct: 137 KA-------NEQIEQENAGGNNAASLIIALTTG-PLTPKALQETKSEAEKAREMGAKVYC 188 Query: 325 IGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 +G++ R + + Y V N + + + +V Sbjct: 189 VGVKDYRKDQLDAIVGRKDQMYGVGNG---FKSLQDMANLLVVNSC 231 >gi|20089761|ref|NP_615836.1| magnesium-chelatase subunit [Methanosarcina acetivorans C2A] gi|19914699|gb|AAM04316.1| magnesium-chelatase subunit [Methanosarcina acetivorans C2A] Length = 704 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 82/267 (30%), Gaps = 32/267 (11%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV- 121 +Q + + + + G+I + FR E R+ I Sbjct: 403 SQNAEQPDASTETTFEIGEIY--QVKQLSPEFRKEQRNGSGRRSKTLTRSKKGRYIKSSI 460 Query: 122 --PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + A R P + + + H + ++ V+D Sbjct: 461 PHEKTTDLAFDATLRAAAPFQLTREKKGNSVAIHESDFREKIREKKIGNL----VLFVVD 516 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEW 238 S SM + ++ + ++ +ML + + GL+ F + E Sbjct: 517 ASGSM-----GARQRMVASKGAVLSML-----MDAYQKRDKVGLIAFKGESAELLLPPTS 566 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 V Q+ ++ + G T + GL Y I +R+ +T +V ++DG Sbjct: 567 SVELAQKYLQEMPT-GGKTPLSRGLAKGYEVI--KNELRRDPDTCP-----FMVLISDGR 618 Query: 299 ----NLSTKEDQQSLYYCNEAKKRGAI 321 Q++ + + RG Sbjct: 619 ANVSMNGEPPLQETKTIASLFRDRGIQ 645 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|291392713|ref|XP_002712879.1| PREDICTED: von Willebrand factor [Oryctolagus cuniculus] Length = 2829 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 60/185 (32%), Gaps = 22/185 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN--V 218 ++ LD+ ++LD S S + D A + + L P + V Sbjct: 1682 TLSPAPDCGQPLDVALLLDGSS------GSPASYFDEMKSFAKAFISKANLGPHLTQVSV 1735 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +Q G +T ++ + ++L + + + G + L +A + Sbjct: 1736 LQYGSIT---TVDVPWTAAQEKANLLSLVDPMQREGGPSQIGDALGFAVRYLTSQ----V 1788 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H A+ K +++ +TD A+ V+ I I LR Sbjct: 1789 HGVRPAAS-KVVVILVTD------VSKDSVDAAATAARSNRVAVFPIVIGDRYDEAQLRT 1841 Query: 339 CASPN 343 A P Sbjct: 1842 LAGPE 1846 >gi|262371960|ref|ZP_06065239.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] gi|262311985|gb|EEY93070.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] Length = 537 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 111/317 (35%), Gaps = 45/317 (14%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID--DIVRSTSLDIV 119 A + + + + + + R+ +F ++ + N + R +D + Sbjct: 59 APERVKLEANTEKYQKNEVNPVHRVAERAVSTFSIDVDTGSYTNTRRFLNDGRLPPVDAI 118 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV----------NSQTD 169 V + Y Y+ P + P+ + + P + K+ + + Sbjct: 119 RVEEMINY-----FDYQYP-QPVGIHPFSVTTETVDSPWKENAKLIKIGIQAKDLSVKQL 172 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ ++DVS SM+ K+ + +++ + E+++ V ++T+++ Sbjct: 173 PAANLVFLVDVSGSMD-----DPDKLPLVKQTLRILTEQLRPQDKVT------IITYASG 221 Query: 230 IEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G + R I L G +T+ ++ AY Q + N Sbjct: 222 EKLVLEPTSGDQKDKILRVIDELRA-GGATSGEQAIQLAYKQ------AEKAFIKNGINR 274 Query: 288 KKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPN 343 I+ TDG+ D +L E +K G + A+G +E L A A Sbjct: 275 ---ILLATDGDFNVGITDFSTLKGMVAEKRKSGVSLTALGYGTGNYNEELMEQIADAGDG 331 Query: 344 SFYLVENPHSMYDAFSH 360 ++ ++N + Sbjct: 332 NYSYIDNKNEAKKVVQR 348 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLVAQLVDKMQND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A +G+ Sbjct: 131 VGMIVFAGDAFTQLPITNDYISAKMFLESISPSLISKQGTAIGAAISLATRSFTPQEGIG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + A ++G V +G+ + Sbjct: 191 -----------RAIIVITDGENH----EGGVAEAAKTATEKGIQVNVLGVGMPDG 230 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 266 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 319 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 320 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 368 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 369 HKKVIVLLTDGVPTFSYKVSRVQTEAD 395 >gi|113953124|ref|YP_730426.1| structural toxin protein RtxA [Synechococcus sp. CC9311] gi|113880475|gb|ABI45433.1| structural toxin protein RtxA [Synechococcus sp. CC9311] Length = 2154 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 53/266 (19%), Positives = 97/266 (36%), Gaps = 26/266 (9%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 D + I + + +N + I T N + D FV+ ++ + Sbjct: 918 ADPNAADYVFTITLDPSQSNYTVKEYATIDGTTDTTLNYDVKLTDEDLDFVDGDFEVTWA 977 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMP-----ITSSVKVNSQ 167 S ++ V +N S+ + YK+ + T S I + V S Sbjct: 978 PSPAVLKVGENISDVASSTTPYKVDYDLVSKSDIIPTGSGTIEGTNGEDILIGDVGGGSL 1037 Query: 168 TDARLDMMIVLDVSRSM----ESFFDSSITKIDMAIKSINAMLEE---------VKLIPD 214 + +++ +VLDVSRSM +F ++S+T+ + +L E V+++ Sbjct: 1038 VNQSINLSLVLDVSRSMILSNINFNNASVTRFSALQTATKDLLSEIAQSGATAKVQIVKY 1097 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 G F++ ++ L + + I L G TN GL A N I Sbjct: 1098 STEGSDVGYYNFTSGDDQTVLNQAF-----QDIDDLQA-GGGTNYEAGLVTALNWISGGI 1151 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL 300 N + K ++F++DGE Sbjct: 1152 TSNTPLNVNQTDKDK-VIFISDGEPS 1176 >gi|130491439|ref|NP_001076115.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] gi|6579183|dbj|BAA88322.1| inter-alpha-trypsin inhibitor heavy chain H1 [Oryctolagus cuniculus] Length = 906 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 32/199 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ V+D+S SME K+ +++ +L +++ + LV F ++++ Sbjct: 287 SLVFVIDISGSMEG------QKVKQTKEALLKILGDIRPEDYFD------LVLFGSRVQS 334 Query: 233 -----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E + + ++ S +TN GL + + QG + A Sbjct: 335 WRGSLVPASEANLQAARDFVQRFS-LAGATNLNGGLLRGIEILNNAQGNLPAVSKHAA-- 391 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS 344 I++ +TDGE D+ + +Y++G FL++ + N+ Sbjct: 392 --ILIMLTDGEPTEGVTDRP--QILKNIRSAIGGRFPLYSLGFGHDLDFNFLKSLSMENN 447 Query: 345 -----FYLVENPHSMYDAF 358 Y + F Sbjct: 448 GWAQRIYEDHDAAQQLQGF 466 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 66/217 (30%), Gaps = 26/217 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + I V + +++ +++D S S+ + + + + A + Sbjct: 46 CGAGSELTFQIEGETSVEANK---VNVAVIIDSSGSIFDIDGAFLLEKEFAKNVAASFAA 102 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKY 265 + N G ++++ + S + S TN GL Sbjct: 103 K-------NLFTNGGTASYASFSDAASDGGTFGSEAEFNEFVDNASWIEGDTNIEAGLSK 155 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + R ++ +TDG+ + Q + A+ G IVYA+ Sbjct: 156 GRELLANGTSTRTS----------FLILITDGDWNRGGDPQIE---ADAARDEGTIVYAV 202 Query: 326 GIRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHI 361 G+ S L + + N + + I Sbjct: 203 GVGPDVSEATLLSIGGDLTNVFDASNFTELDNTLDEI 239 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 59/162 (36%), Gaps = 6/162 (3%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M+IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLT--GAINLSVPEA 58 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS-LDIVVVPQNEGYSISAISRYKI 137 G + F +L+ N V D + + P + Y S+ I Sbjct: 59 AGITAGHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTGI 118 Query: 138 P---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 ++ +P T + V + +QT + + I Sbjct: 119 VNWFIQALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFI 160 >gi|308472935|ref|XP_003098694.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] gi|308268294|gb|EFP12247.1| hypothetical protein CRE_04222 [Caenorhabditis remanei] Length = 405 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 62/184 (33%), Gaps = 8/184 (4%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SR M + ++ + +I S+ + + GLVT+++ Sbjct: 44 LDVIAVVDNSRGMT---NKGLSYVASSIISVFGKNTRIGSSSAEPRTTRLGLVTYNSVAT 100 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L S + N+T RQ + +Y+K+I Sbjct: 101 QNADLNQYQSIEDAYYGIYGALSTTVNTTESYLTTGLNAAVELFSRQSFRSNRQHYRKVI 160 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSFYLV 348 + + + + + N K G + I S L+ ASP + Sbjct: 161 IVY--ASEYNGRGEFDPVPIANRLKASGVNIITIAYEQPGSAGLLQGLSQIASPGFSFSG 218 Query: 349 ENPH 352 +N Sbjct: 219 DNIA 222 >gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS278] gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand factor type A (VWA) domain; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 755 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 53/301 (17%), Positives = 105/301 (34%), Gaps = 47/301 (15%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + N+ I R++ + + V D R +L VVP + + Sbjct: 254 DPAKNDPVNPTK--ITVRLQAGFALGEVKSHHHQVTVESTDAETRVITLADGVVPADRDF 311 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSMES 186 ++ + F ++ +++ +T +V +Q D++ V+D S SM Sbjct: 312 ELTWKPASENMPSVGLFHEQVGDADYLLAFVTPPAVATATQRPQPRDVIFVIDNSGSMGG 371 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVS 241 T I A S+ L ++ + ++ F + + F V Sbjct: 372 ------TSIRQAKASLLYALGRLQPND------RFNVIRFDDTMTVLFPSSVPADAEHVG 419 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + R + L G T P ++ A + D++ + +VF+TDG + Sbjct: 420 NATRFVSSLDARG-GTEMVPAMRAALTD-----------DGSDSDRMRQVVFLTDG---A 464 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 DQQ + R ++ +GI ++PN++ + AF+HI Sbjct: 465 IGNDQQLFETITAMRGRS-RIFMVGIG-----------SAPNTYLMSRAAELGRGAFTHI 512 Query: 362 G 362 G Sbjct: 513 G 513 >gi|260841558|ref|XP_002613979.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] gi|229299369|gb|EEN69988.1| hypothetical protein BRAFLDRAFT_67440 [Branchiostoma floridae] Length = 1796 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 56/293 (19%), Positives = 109/293 (37%), Gaps = 39/293 (13%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR-DNGFVNDIDDIVRSTSLDIVV 120 A+ N+ N + +L + +F+ E+ F++ + +++ + IV Sbjct: 30 ASTFQNQVNKYQDSRADIVFMLDSSGSVGENNFQTEITFVENFLSQLTISPQASRVAIVS 89 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 I I P C+F+ + T N++ RL ++ Sbjct: 90 FDDAARTHIDYIKN---PKNKCSFL-RELKTVKYTGGST-----NAEDAFRLAQELLRPK 140 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----IEEFFLL 236 S F ++ K I + L ++ + P + + +V+F N I+ Sbjct: 141 SA----FANNQPVK-QTEITFVENFLSQLTISPQAS---RVAVVSFDNHARTHIDYINSP 192 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + S L+ +K + G STN+ G + A + Q T K+++V++TD Sbjct: 193 KNKCSFLRE-LKAVKYTGGSTNAEDGFRLAQELL-----RPQSAFTNYQPVKQVVVYLTD 246 Query: 297 GENLSTKEDQQSLYYCNEAKKR-GAIVYAIGI----RVIRSHEFLRA-CASPN 343 G K D+ + N K A +Y+IG+ + + A C+SP Sbjct: 247 G-----KPDKDPVGRANNLKSVYNAEIYSIGVDPYSKRYETDGVTSADCSSPG 294 >gi|182413803|ref|YP_001818869.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841017|gb|ACB75269.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 792 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 64/175 (36%), Gaps = 24/175 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 A +++ +LDVS SM+ K+ + +S+ +L ++ + Sbjct: 409 DAAVSGRAAANLVFLLDVSGSMD-----QPNKLRLVQESMRLLLGRLQPED------RVA 457 Query: 223 LVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VT+ ++ + + I L G STN GL+ AY I + Sbjct: 458 IVTYAGNSGLALPSTPVARQREILDAIDELRA-GGSTNGAMGLQLAY-DIAKANFVANGV 515 Query: 281 NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ TDG+ N+ + + + E K G + +G + + Sbjct: 516 NR--------VILCTDGDFNVGVTSEGELVRLIEEKAKSGVFLTVLGFGMGNLKD 562 >gi|7770149|gb|AAF69610.1|AF119917_18 PRO1851 [Homo sapiens] Length = 644 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 5 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 58 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 59 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 111 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 112 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 160 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 65/219 (29%), Gaps = 47/219 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M V+D S SM + + T+++ A + I + E+ + GL + Sbjct: 92 LMFVIDTSPSMAAQDMGTETRLEAAKRIIKSFAEKYEGDS-------LGLTALGSSAAVL 144 Query: 234 FLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 ++ L + G T GL A + + H I Sbjct: 145 IPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASAVLHLTQYSTLPSH-----------I 193 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------------------- 328 + TDG+N + + K + Y IG+ Sbjct: 194 ILFTDGDNN--TGEIHPRAAADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQKKEISGTLN 251 Query: 329 -VIRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDI 365 V E + ++ ++P + D F+ + I Sbjct: 252 TVFNETELQKIAGYGNGRYFSAKSPELLTDIFNRFIQKI 290 >gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella baltica OS223] gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS223] Length = 772 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 70/197 (35%), Gaps = 32/197 (16%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ ++Q ++++V+D S SM I A ++ L+ +K Sbjct: 371 NYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLK 424 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + +F +S ++ + L G T L Sbjct: 425 PEDSFN------IIEFNSSLSQFSATSLPATSSNLSRARQFVSRLQADG-GTEMALALDA 477 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + + + ++FMTDG N D ++ Sbjct: 478 A------LPKSLGSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES------RLF 525 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 526 TVGIGSAPNSHFMQRAA 542 >gi|291008392|ref|ZP_06566365.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 538 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 69/219 (31%), Gaps = 28/219 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + ++ Q+ ++ VLD S SM S +I + Sbjct: 332 TIDRLAAEYSRPQSRDPSHVIFVLDFSASM------SGPRIAALRSAFAGFSGADPSA-- 383 Query: 215 VNNVVQ------SGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 V V+ ++ F + E F + G + L+ Y++ + Sbjct: 384 VGKFVRFYKGEMVTIMRFGGHVLDERDFTIT-GQADLKAVQDYIAADRFDQTTG-----V 437 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY 323 ++ + + I+ MTDGEN + Q L ++A+ + Y Sbjct: 438 WSALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQDFLRN-HQARDPAAKAVHTY 496 Query: 324 AIGIRVIRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHI 361 + E +A A+ N S+ +AF I Sbjct: 497 TVRFGEANPGELDQAARATGGRMVDA-NATSLSEAFKEI 534 >gi|297727663|ref|NP_001176195.1| Os10g0464900 [Oryza sativa Japonica Group] gi|22758314|gb|AAN05518.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432565|gb|AAP54180.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|255679473|dbj|BAH94923.1| Os10g0464900 [Oryza sativa Japonica Group] Length = 646 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 58/136 (42%), Gaps = 26/136 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K+ + +++ +++ + + +++FS+ Sbjct: 173 PLDLVTVLDVSGSMVGN------KLALLKQAMGFVIDNLGPGD------RLCVISFSSGA 220 Query: 231 EEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L G +H +R + LS G TN L+ A ++ D + R + Sbjct: 221 SRLMRLSRMTDAGKAHAKRAVGSLSARG-GTNIGAALRKA-AKVLDDRLYRNAVES---- 274 Query: 287 YKKIIVFMTDGENLST 302 ++ ++DG++ T Sbjct: 275 ----VILLSDGQDTYT 286 >gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] Length = 666 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 89/279 (31%), Gaps = 43/279 (15%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 N ++ DI + + + + ++ F + ++ N+ + Sbjct: 225 ANASDLSLTINMDIGLELVDINAKYHQVNIENSAFGQYSIVLKPQQAI-------NKDFV 277 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + K + F N + M + S + +M+ V+D S SM Sbjct: 278 LEFKPLQKAQAQAALFTQQAKNGEQFALAMLMPPSDQFTQSERLPREMVFVVDTSGSMHG 337 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEFFLL---EWGVS 241 ++ A K++ L + N ++ F+N++ L ++ + Sbjct: 338 ------QSMEQAKKALFYALSLLDSDDSFN------IIGFNNQVTAMSDTPLVASDFNLR 385 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 +R I L G T L N +RQ +VF+TDG Sbjct: 386 RARRFIYNLQADG-GTEIQGALDAVLNGAQFEGFVRQ------------VVFLTDGS--V 430 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + ED+ ++ +GI + F+R A Sbjct: 431 SNEDELFKSIARTLGDS--RLFTVGIGSAPNRFFMRRAA 467 >gi|224092976|ref|XP_002187320.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Taeniopygia guttata] Length = 1092 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 76/200 (38%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S+ MLE + VN +V+F Sbjct: 248 GAASPKDMLILVDASGSVSGLT------LKLIRTSVIEMLETLSDDDFVN------VVSF 295 Query: 227 SNKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +N + L+ + +S G+ T+ G YA+ Q+ + R + Sbjct: 296 NNNAQNVSCFNHLVQANVRNKKKLKEAVYKISAKGI-TDYKKGFSYAFEQLLNHSVSRAN 354 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ ++ + + V+ + + Sbjct: 355 CN-------KIIMLFTDG------GEERAQEIFHKYNEDKKVRVFTFSVGQHNYDKGPIQ 401 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 402 WMACENKGYYYEIPSIGAIR 421 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 74/206 (35%), Gaps = 31/206 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D ++ VLD S S+ + A I+ + +V + + Sbjct: 179 DPGTEIAFVLDGSGSI------QYDDFEKAKDFIHTTMSNCNFA----------IVQYGS 222 Query: 229 KIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 I L + K+K + + T + + + IF + N N Sbjct: 223 SIRTELSLLDNEDGARSLQKVKQIKQIYNLTKTASAINHVLTDIFIPE------NGSKNN 276 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRAC---AS 341 +KII+ ++DG+ L + + K G Y+IG+ ++ + ++ A Sbjct: 277 SEKIIIVLSDGKILGDPMTLDEVLNKTQMK--GVTRYSIGVGDGILKNKDAIKEMTQIAD 334 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVT 367 P +Y V + ++ D S + + I+ Sbjct: 335 PGKYYSVSSYGALNDILSSLERGIIG 360 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/212 (19%), Positives = 72/212 (33%), Gaps = 37/212 (17%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I +P + + S+ + +VLD S SM + D A + ++E+ L+ Sbjct: 291 INLPRPTFSLLKSKQRV---VCLVLDKSGSMNAE-DHLFRMNQAAELYLIQIIEKGSLV- 345 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 GLVTF + + L + + T+ GLK + Sbjct: 346 --------GLVTFDSFAKIQSKLIKIIDDNTYQKITA-NLPQEADGGTSICRGLKAGFQA 396 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIR 328 I I+ +TDGE+ C E K+ G +++ I + Sbjct: 397 ISQSNQSTFGSE---------IILLTDGEDYQIS-------LCFGEVKQSGTVIHTIALG 440 Query: 329 VIRSHEF--LRACASPNSFYLVENPHSMYDAF 358 E L + FY +N + + DAF Sbjct: 441 PSADEELETLSNMTGGHRFYAHKNINGLIDAF 472 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ ++DVS SM ++D+ S+ ++ ++ V V +G Sbjct: 240 EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ ++ I L G ST G+ AY +I + N Sbjct: 295 ----NAGVKLEATPGSNKQKIREAIDELEASG-STAGGEGIMLAY-KIAQKNFISGGNNR 348 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ N+ D++ + +K G + +G + + Sbjct: 349 --------IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKD 393 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 16/167 (9%), Positives = 53/167 (31%), Gaps = 7/167 (4%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G + I+ + + ++ +G+ ++ H++ KT L + D + A+ ++ + Sbjct: 19 RGAVAIVVGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSIAPAS 78 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNG---FVNDIDDIVRSTSLDIVVV----PQNEGYS 128 + + + F+N + D + ++ V P ++ Sbjct: 79 FARAEAAGQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSSKYVR 138 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + +P +P + + + + + Sbjct: 139 CTITRSNIMPWFMQVLMPSLDTLTVSSLATATLAPAQNNCGIPMAIC 185 >gi|324499478|gb|ADY39777.1| Mesocentin [Ascaris suum] Length = 4390 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 67/189 (35%), Gaps = 23/189 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 T W + + T+ ++ ++ D + D+M VLD S + S + + + Sbjct: 3611 TEHTWPSRKTTLESERTTPARICTRIDYQADVMFVLDSS------DNFSPEQYGHLKEGL 3664 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 + +++E +VVQ G V +S+K L + NS Sbjct: 3665 STLIDE--TFDLSPDVVQVGFVEYSDKASVPVALG-------HYEDKVQLLTDIANSEQL 3715 Query: 263 LKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GA 320 A + + RQ N ++++ +T+G N + + ++R Sbjct: 3716 FGEAI-VLKGLNAARQQFQLHGRKNVPRVLLLITNGVNRGNAANA-----AEDLRERYNV 3769 Query: 321 IVYAIGIRV 329 ++ + + Sbjct: 3770 ELFILAVNA 3778 >gi|218438801|ref|YP_002377130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218171529|gb|ACK70262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 573 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 56/177 (31%), Gaps = 27/177 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ + +M V+D S SM+ ++++ L + N V G Sbjct: 389 KLQKDAGKTVYLMTVIDTSGSMD----------GAPLEAVKKGLRIASKEINPGNYV--G 436 Query: 223 LVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LVT+ ++ E L I L G + G+ +++ + Sbjct: 437 LVTYGDRAAEVVPLGLFDELQHKRFLAAIDNLRADGATAM-YDGMMIGLSKLMEQ----- 490 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + + ++ +TDG+ + G VY I + E Sbjct: 491 --KKNNPDGRFYLLLLTDGQANMGVTFDEVKEVIEY---SGVRVYPIAYGDVNQEEL 542 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 81/224 (36%), Gaps = 40/224 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +TD L + +++DVS S + D + A + +L + L L+ F Sbjct: 68 ETDLPLTIGMLVDVSGSQRNLIDIERS---AASQFFREVLRKKDLAF---------LIMF 115 Query: 227 SNKIE-------EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + E LL G++H + + + V T P Y+ ++ + Sbjct: 116 GEETELLQDYTGSPRLLTEGLNHLEVSSGVSGIHPGPVPTMGGPRGTVLYDAVYLAANEK 175 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------ 331 K+IV +TDG + ++ + A+K ++Y+I R Sbjct: 176 LKGEVGR----KVIVVITDGVDQGSRMSRN--QAIEAAQKSDCVIYSIDYSDPRAYGPFN 229 Query: 332 -----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 LR + + Y V+ H++ F + +++ ++ Sbjct: 230 MVGGGGEGELRKMSDETGGRVYKVDRRHTLDQVFKELQEEMRSQ 273 >gi|225620067|ref|YP_002721324.1| hypothetical protein BHWA1_01140 [Brachyspira hyodysenteriae WA1] gi|225214886|gb|ACN83620.1| hypothetical protein BHWA1_01140 [Brachyspira hyodysenteriae WA1] Length = 557 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/226 (14%), Positives = 85/226 (37%), Gaps = 24/226 (10%) Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD-IVRSTSLDIVVVPQNEGYSISAISR 134 + I+ + + NE++ N +++ ++ ++ + N Y++ A++ Sbjct: 302 ETSTNTIIVKKDTAPPVKESNEIKILSSANIVENPMIEKPEVEKNMAENNFNYNLEAMNN 361 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 K Y N+ + S+ + + +D++IVLD + SM + Sbjct: 362 LNNASKSFNTPNNYVNNTDELSDKFRSIIERYKNEGAIDLVIVLDTTESMHPY------- 414 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL----LEWGVSHLQRKIKYL 250 + + I M+ E+ D + + G + + + + + ++ + R++ Y Sbjct: 415 LKTIKRDIRGMVTEL---FDNHKYSRVGFLLYRDVKDTYLTKKIDFSDNINFINREVNYF 471 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 G + P Y + + + N K++++ +TD Sbjct: 472 YAAGGGDKAEPM----YEALQEALETFDYINQ-----KRLVIVLTD 508 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 10/58 (17%), Positives = 28/58 (48%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + F + G MT++ I LP++ ++++ +++ + L + D + + A + Sbjct: 5 LARFRADADGTMTMIVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAVQVL 62 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 41/234 (17%), Positives = 87/234 (37%), Gaps = 43/234 (18%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 W +V + + ++ +D+ I++D+S SM+ ++ + + + Sbjct: 24 LWLALVITLVAAFSGMAQAQAEQADSPQLPGAVDVRIIVDISGSMKQNDPQNLRRPAVRL 83 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFG 254 + +L+PD +G+ TF + W +++ ++ Sbjct: 84 LA--------RLLPDGAT---AGVWTFGQYVNMLVPHREVSDAWRDMAIEQS-DAINSVA 131 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + TN ++ A + F + IV +TDG+ ++ + N Sbjct: 132 MRTNLGAAIETASDGYFTGGVLSN---------THFIV-LTDGKVDISRNPSANKAEANR 181 Query: 315 --------AKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAF 358 K++GA +A+ + EFLR AS SF++ EN + + AF Sbjct: 182 ILDTLVPPLKQQGARFHAVALSAEADTEFLRKLASDSNGSFHVAENANDLSRAF 235 >gi|71028596|ref|XP_763941.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350895|gb|EAN31658.1| thrombospondin-related protein [Theileria parva] Length = 552 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 46/308 (14%), Positives = 106/308 (34%), Gaps = 25/308 (8%) Query: 29 IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKN 88 ++ + + + S+ F + + S++ + ++ + + + + D + Sbjct: 76 LLLTGMTRLKDGSNSFNKNSRISSVLPMETLDRLSEAITRSS-EHPVTFEALDGGSVVVT 134 Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 + +F +L + + T + N + + LK Sbjct: 135 NNSDTFSIKLYPSLP------GLNLTPGMLPTNKPNSHINFTGNHNEHALLKHA-LHDLS 187 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS-MESFFDSSITKIDMAIKSINAMLE 207 T++ + K +S LD+ I++D S S M+ + + I + ++++ Sbjct: 188 TSNYDRGLYPDGIKKPSSYCHRELDLTILVDESSSIMKEEW-------EKLIPFLKSLVK 240 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + P N V +VTFS I S + L NS P + + Sbjct: 241 SISISP---NYVHLSVVTFSTSIRWLISFL-NPSGKDENL-ALRVIDELKNSKPVFGFTF 295 Query: 268 N-QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 Q + + N K I+ +TDG + Q+ ++ G + +G Sbjct: 296 TGQALNFITEAVYQFGARQNAPKAIILITDGSSTQPNVTSQASAM---LREAGVTILVVG 352 Query: 327 IRVIRSHE 334 + + R +E Sbjct: 353 VGMARDYE 360 >gi|217977683|ref|YP_002361830.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503059|gb|ACK50468.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 650 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 77/216 (35%), Gaps = 32/216 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + L + ++LD SRS ES I +A +++ A+ + D Q G Sbjct: 452 RAARNEERDLAVAVLLDASRSTESSAHGEPV-IAVAREALIALARGLDACGD-----QFG 505 Query: 223 LVTFSN-KIEEFFLLEWGV------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 L FS+ + + F+ + +I L G T +++ Sbjct: 506 LYAFSSLRRDRVFVDVCKTFDEPLGGAVDARINALKP-GFYTRLGAAIRHV--------- 555 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRV 329 E N +K+++ +TDG+ + + EA+++G V+AI I Sbjct: 556 --SFRLGERPNARKLLLVITDGKPNDLDHYEGRFGVEDTRRAVQEARRQGQTVFAITIDA 613 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 R F L P + +A I + I Sbjct: 614 EARAYLPRLFGRSG-FALAPTPDRLTEALPSIYRQI 648 >gi|134101426|ref|YP_001107087.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] gi|133914049|emb|CAM04162.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 501 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 69/219 (31%), Gaps = 28/219 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + ++ Q+ ++ VLD S SM S +I + Sbjct: 295 TIDRLAAEYSRPQSRDPSHVIFVLDFSASM------SGPRIAALRSAFAGFSGADPSA-- 346 Query: 215 VNNVVQ------SGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 V V+ ++ F + E F + G + L+ Y++ + Sbjct: 347 VGKFVRFYKGEMVTIMRFGGHVLDERDFTIT-GQADLKAVQDYIAADRFDQTTG-----V 400 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY 323 ++ + + I+ MTDGEN + Q L ++A+ + Y Sbjct: 401 WSALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQDFLRN-HQARDPAAKAVHTY 459 Query: 324 AIGIRVIRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHI 361 + E +A A+ N S+ +AF I Sbjct: 460 TVRFGEANPGELDQAARATGGRMVDA-NATSLSEAFKEI 497 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 71/214 (33%), Gaps = 31/214 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + V SQ + +V D S SM S K+ +A ++I ++ + Sbjct: 227 LRVGVQGKVVSRSQRKPAHLVFLV-DTSGSMHSE-----DKLPLAREAIKVAVKNLNEND 280 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V V +G + + + + L+ G T G++ AY Sbjct: 281 TVAIVTYAG----NTRDVLPPTPATDAKSIHAALDSLTA-GGGTAMGSGMELAY------ 329 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-----NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 R + +V +TDG+ N+S S++ + G + +G Sbjct: 330 ---RHAVKKASGSVVSRVVVLTDGDANIGRNVSANAMLDSIH---KYTAEGVTLTTVGFG 383 Query: 329 VIR-SHEFLRACAS--PNSFYLVENPHSMYDAFS 359 + + + A + + V++ F Sbjct: 384 MGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFE 417 >gi|47218379|emb|CAG01900.1| unnamed protein product [Tetraodon nigroviridis] Length = 683 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 78/199 (39%), Gaps = 35/199 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S++ MLE + VN +V F Sbjct: 223 GAASPKDMLILVDASGSVSGLT------LKLIQISVSKMLETLSDDDYVN------VVYF 270 Query: 227 SNKIEEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++K + E L++ ++ ++ G +TN + G + A+ Q+ M R + Sbjct: 271 NDKAKYASCFENLVQANVRNKRMLKKAVQNITAKG-TTNYSGGFELAFEQLAQMNVSRAN 329 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFL 336 CN KII+ TDG +E + + ++ + + Sbjct: 330 CN-------KIIMLFTDGGEEKAEEIFKKYNP-----NQEVRIFTFSVGQHNYDKGPIQW 377 Query: 337 RACASPNSFYLVENPHSMY 355 ACA+ +Y + + ++ Sbjct: 378 MACANKGYYYEIPSIGAIR 396 >gi|145493674|ref|XP_001432832.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399947|emb|CAK65435.1| unnamed protein product [Paramecium tetraurelia] Length = 618 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 61/164 (37%), Gaps = 19/164 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + V++ + +I+LD S SM D ++ N ++ + D N Sbjct: 422 FVENQVEIQAAQQPSFHYIILLDDSGSMSG---------DRFNQAQNGLISSLSSAKD-N 471 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ ++ F++ +++ ++Q + G T+ + AY +I ++ Sbjct: 472 QNIRVTIIIFND--NARCVVDSQTINMQTIKNAVVCNGGGTSFQSAFQLAYQKIAAVKNF 529 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 Q K +I F TDG + + +A++ Sbjct: 530 EQFN-------KHVIFFYTDGGDSYPTQALNQFANLPQAQRMKI 566 >gi|157412071|ref|YP_001481411.1| TerY2 [Escherichia coli APEC O1] gi|226807625|ref|YP_002791319.1| TerY2 [Enterobacter cloacae] gi|226809935|ref|YP_002791629.1| TerY2 [Enterobacter cloacae] gi|99867096|gb|ABF67741.1| TerY2 [Escherichia coli APEC O1] gi|226425850|gb|ACO53943.1| TerY2 [Enterobacter cloacae] gi|226426161|gb|ACO54253.1| TerY2 [Enterobacter cloacae] Length = 213 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 55/168 (32%), Gaps = 14/168 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D S SM I I AML ++ P V ++T+ N+ Sbjct: 4 RLPVYLVIDTSGSMRG------ESIHSVNVGIQAMLSALRQDPYALESVHISIITYDNEA 57 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L + + T + L+ + D + Sbjct: 58 REFIPLT--PLEDFQFSDIVVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--L 113 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLR 337 + MTDG +++ + RG + A + HE L+ Sbjct: 114 VFLMTDGTPSDALAYGEAVK---AIRGRGFGSIIACAVGPKAGHEHLK 158 >gi|38347890|ref|NP_941139.1| putative tellurium resistance protein [Serratia marcescens] gi|156933958|ref|YP_001437874.1| hypothetical protein ESA_01784 [Cronobacter sakazakii ATCC BAA-894] gi|190410232|ref|YP_001965733.1| terY2 [Klebsiella pneumoniae] gi|237728579|ref|ZP_04559060.1| TerY2 [Citrobacter sp. 30_2] gi|38259367|emb|CAE51592.1| putative tellurium resistance protein [Serratia marcescens] gi|146151025|gb|ABQ02791.1| terY2 [Klebsiella pneumoniae] gi|156532212|gb|ABU77038.1| hypothetical protein ESA_01784 [Cronobacter sakazakii ATCC BAA-894] gi|226910057|gb|EEH95975.1| TerY2 [Citrobacter sp. 30_2] Length = 212 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 55/168 (32%), Gaps = 14/168 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D S SM I I AML ++ P V ++T+ N+ Sbjct: 3 RLPVYLVIDTSGSMRG------ESIHSVNVGIQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L + + T + L+ + D + Sbjct: 57 REFIPLT--PLEDFQFSDIVVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLR 337 + MTDG +++ + RG + A + HE L+ Sbjct: 113 VFLMTDGTPSDALAYGEAVK---AIRGRGFGSIIACAVGPKAGHEHLK 157 >gi|22127459|ref|NP_670882.1| hypothetical protein y3585 [Yersinia pestis KIM 10] gi|108809186|ref|YP_653102.1| hypothetical protein YPA_3195 [Yersinia pestis Antiqua] gi|108810627|ref|YP_646394.1| hypothetical protein YPN_0462 [Yersinia pestis Nepal516] gi|150260370|ref|ZP_01917098.1| hypothetical protein YPE_2671 [Yersinia pestis CA88-4125] gi|218927791|ref|YP_002345666.1| hypothetical protein YPO0594 [Yersinia pestis CO92] gi|229840484|ref|ZP_04460643.1| hypothetical protein YPH_2826 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842972|ref|ZP_04463123.1| hypothetical protein YPF_1320 [Yersinia pestis biovar Orientalis str. India 195] gi|229900820|ref|ZP_04515944.1| hypothetical protein YP516_0479 [Yersinia pestis Nepal516] gi|270487812|ref|ZP_06204886.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294502679|ref|YP_003566741.1| hypothetical protein YPZ3_0569 [Yersinia pestis Z176003] gi|21960553|gb|AAM87133.1|AE013961_5 hypothetical [Yersinia pestis KIM 10] gi|108774275|gb|ABG16794.1| hypothetical protein YPN_0462 [Yersinia pestis Nepal516] gi|108781099|gb|ABG15157.1| hypothetical protein YPA_3195 [Yersinia pestis Antiqua] gi|115346402|emb|CAL19274.1| conserved hypothetical protein [Yersinia pestis CO92] gi|149289778|gb|EDM39855.1| hypothetical protein YPE_2671 [Yersinia pestis CA88-4125] gi|229682159|gb|EEO78251.1| hypothetical protein YP516_0479 [Yersinia pestis Nepal516] gi|229690038|gb|EEO82096.1| hypothetical protein YPF_1320 [Yersinia pestis biovar Orientalis str. India 195] gi|229696850|gb|EEO86897.1| hypothetical protein YPH_2826 [Yersinia pestis biovar Orientalis str. PEXU2] gi|270336316|gb|EFA47093.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294353138|gb|ADE63479.1| hypothetical protein YPZ3_0569 [Yersinia pestis Z176003] gi|320016846|gb|ADW00418.1| hypothetical protein YPC_3986 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 327 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 61/199 (30%), Gaps = 14/199 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ K++ + ++ ++K P ++ F+ Sbjct: 3 RLPIFFVLDCSESM---IGENLKKMN---DGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L V + L G T+ L+ QI D + Sbjct: 57 KTIVPL---VEVVSFYPPRLP-IGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + + A+K + AIG+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEITRWKTHYARK--VNLIAIGLGPSADLNILRQLTENVLLFNDTQ 168 Query: 351 PHSMYDAFSHIGKDIVTKR 369 I + Sbjct: 169 EGDFTQFIKWITASVSAHS 187 >gi|262173885|ref|ZP_06041562.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] gi|261891243|gb|EEY37230.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] Length = 403 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 77/207 (37%), Gaps = 21/207 (10%) Query: 25 IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIMNEGNGNNRKKLKGGDI 82 + P + ++L +++S F L + + + A+ + +E N + +KL + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKLVDRYV 60 Query: 83 LCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + I + + + +G V + T + +++ + I + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSW---------ISYEN 111 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + P +T + S +D+ ++D+S SM + + + ++ID Sbjct: 112 ISLKPEFT--------VNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNV 163 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSN 228 I ++ ++K + L+ + N Sbjct: 164 ITRVVNDLKDFDTEVK-SRVALLGYHN 189 >gi|149176271|ref|ZP_01854886.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] gi|148844873|gb|EDL59221.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] Length = 715 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 68/199 (34%), Gaps = 28/199 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + + + D+S SM S + + K+ A LE+V L G++ ++ Sbjct: 517 PHVTVYLAFDLSGSM------SGEPLAESQKAALAFLEQVDLT-----HCSMGVIAVADS 565 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + S +++ +K LS V NS A ++ ++G Sbjct: 566 TQTVLDACQNASKIEKAVKSLSIGMVGCGNSAQPFDTAMKKLKKVEG------------P 613 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++ + DG + ++ V AIG FLR AS + Sbjct: 614 RFVITLADG---VWADQPHAVNRAKSLHSAEIDVIAIGFGDAD-KNFLRDIASCDEGSFF 669 Query: 349 ENPHSMYDAFSHIGKDIVT 367 + + FS I + I Sbjct: 670 TSLSGLSATFSSIAQVITK 688 >gi|71001856|ref|XP_755609.1| von Willebrand domain protein [Aspergillus fumigatus Af293] gi|66853247|gb|EAL93571.1| von Willebrand domain protein [Aspergillus fumigatus Af293] Length = 946 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 64/190 (33%), Gaps = 39/190 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++ V+D S SM + KID ++ L+ + + V + +F ++ Sbjct: 295 IIFVIDRSGSM-------MDKIDTLKSALRVFLKSLPVG------VCFNICSFGSRHSFL 341 Query: 230 -IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F + + + T ++ + + Sbjct: 342 WKQSLFYTAESLQEALSFVDGVRANMGGTEMQEAVEATVRSRMKDKELE----------- 390 Query: 289 KIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 ++ +TDG+ +QQ+L + A GA +++GI SH + A + + Sbjct: 391 --VLILTDGQIW----NQQTLFGFIRETAADNGARFFSLGIGNGASHSLVEGIARAGNGF 444 Query: 347 L--VENPHSM 354 V N + Sbjct: 445 SQMVVNYEEL 454 >gi|302917449|ref|XP_003052439.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] gi|256733379|gb|EEU46726.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] Length = 764 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 75/220 (34%), Gaps = 38/220 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSM---------ESFFDSSITKIDMAIKSINAMLEEV 209 + D+++V+DVS SM E+ + ++ +D+ + ++E + Sbjct: 72 PPTAPSAEIPHVPCDIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLTKHAARTIIETM 131 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + G+VTF++K + L + + + + L Sbjct: 132 NESD------RLGIVTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWHGLL-- 183 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG---AIV 322 +G++ N + +N I+ +TDG + + + + G A + Sbjct: 184 --------EGIKLFKNVKSSNVP-AIMVLTDGMPNHMNP---AAGFVPKLRAMGQLPASI 231 Query: 323 YAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + G L++ A ++ + + + F H Sbjct: 232 HTFGFGYHLRSGLLKSIAEIGGGNYAFIPDAGMIGTVFVH 271 >gi|159129667|gb|EDP54781.1| von Willebrand domain protein [Aspergillus fumigatus A1163] Length = 946 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 64/190 (33%), Gaps = 39/190 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++ V+D S SM + KID ++ L+ + + V + +F ++ Sbjct: 295 IIFVIDRSGSM-------MDKIDTLKSALRVFLKSLPVG------VCFNICSFGSRHSFL 341 Query: 230 -IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F + + + T ++ + + Sbjct: 342 WKQSLFYTAESLQEALSFVDGVRANMGGTEMQEAVEATVRSRMKDKELE----------- 390 Query: 289 KIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 ++ +TDG+ +QQ+L + A GA +++GI SH + A + + Sbjct: 391 --VLILTDGQIW----NQQTLFGFIRETAADNGARFFSLGIGNGASHSLVEGIARAGNGF 444 Query: 347 L--VENPHSM 354 V N + Sbjct: 445 SQMVVNYEEL 454 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 78/242 (32%), Gaps = 48/242 (19%) Query: 155 VMPITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 T+ + A++ ++ +V D S SM TKI A +++ +++ + Sbjct: 109 PTDTTTIFRPAEGEAAQVTTNIQLVFDASGSMAQRIGGE-TKIQAARRAMERIIDTLPDN 167 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEW---------------GVSHLQRKIKYLSKFGVST 257 PD+N G F ++ + + + L+++ + G T Sbjct: 168 PDLN----VGFRVFGHEGDSSEAQKARSCQSTALLVPMQGVNKALLRQQAQAWQPTGW-T 222 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + L+ A + +R +I+ +TDGE + C AK Sbjct: 223 PISLALQRAGEDFQAGENVRN-----------VIIMVTDGEETCGGD------PCAVAKA 265 Query: 318 -----RGAIVYAIGIR-VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + +G + LR A S + +N ++ + + + Sbjct: 266 LAESQAEVRIDVVGFGTTPDVAKTLRCIAENSGGVYTDAQNGDALVQTLEELIAATLKRS 325 Query: 370 IW 371 Sbjct: 326 TL 327 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 75/242 (30%), Gaps = 61/242 (25%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +++ +DVSRSM + D + T+I A ++ ++ + V+ G+V+F+ Sbjct: 86 VTLVLAMDVSRSM-AASDVAPTRIGAAQQAARDLIVGLPPG------VRLGIVSFAATAI 138 Query: 232 EFFLLEWGVSHLQRKIKYLS-KFGVSTNST----------------PGLKYAYNQIFDMQ 274 + I + G +T S G+ + + Sbjct: 139 VVLPPTDNRLRMLDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLEGILFGGESLAPGT 198 Query: 275 GMRQ--HCNTEDANYKK---------------IIVFMTDGENLSTKEDQQSLYYCNEAKK 317 G R DA K+ ++ ++DG + + + + Sbjct: 199 GGRSLTEAAAADAVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTGPDPLDAARMAAQ--- 255 Query: 318 RGAIVYAIGIRVI-----RSHEF----------LRACA--SPNSFYLVENPHSMYDAFSH 360 RG VY +G + LR A + ++ + + + Sbjct: 256 RGLRVYTVGFGAVQDAGTEGSSLSYEMQVDEPTLRQIATLTDGEYFQAGSAADLTRVYRQ 315 Query: 361 IG 362 + Sbjct: 316 LS 317 >gi|332366663|gb|EGJ44406.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1059] Length = 450 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 57/321 (17%), Positives = 92/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A T+I + K GD++ KN + + N V Sbjct: 52 EAVSSPIEAAGTEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETS 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 112 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM S D T I+ E V+ I + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGSTIDIVRTNIN----------EFVRNITKEGITARFGLATF 221 Query: 227 SNKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQ 269 S+++ F + + K L+ G T TP A NQ Sbjct: 222 SDEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 279 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 280 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 334 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I ++ A+ +EN Sbjct: 335 IEG--IYKSFATEGRVLDIEN 353 >gi|162138242|gb|ABX82825.1| complement factor B [Sus scrofa] Length = 765 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 72/208 (34%), Gaps = 28/208 (13%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + N + + ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKYGLVTYATDPKV 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L + K TN+ L YN + N Sbjct: 321 LIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYN-MMSWGVNNFPDNWNR 379 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIRVIRSHE 334 + +IV +TDG + + ++ + ++ +Y G+ + + E Sbjct: 380 TRH--VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQE 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAF 358 + A AS + +++ ++ D F Sbjct: 438 NINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|260789946|ref|XP_002590005.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] gi|229275192|gb|EEN46016.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] Length = 159 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 61/162 (37%), Gaps = 24/162 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ ++D S ESF + ++++ N + G+VTFS++ Sbjct: 1 PVDLVFLMDSS---ESFRTYGFEEEKTFVQTVVNYFTL------GENDTRVGVVTFSDRD 51 Query: 231 EEFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + + +I+ + T + GL + N F R++ + Sbjct: 52 RQNTRVTLNEHYTRVELLTEIRDIPYDRGHTYTGLGLDHVRNNSFLEVNGRRNNTLD--- 108 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ +TD E + + ++ G V+ +G+ Sbjct: 109 ---FLIVITDDE-----SEDDIVRPAQLIRQMGITVFVVGVG 142 >gi|156120152|ref|NP_001095294.1| complement factor B [Sus scrofa] gi|148724909|emb|CAN87697.1| B-factor, properdin [Sus scrofa] Length = 765 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 72/208 (34%), Gaps = 28/208 (13%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + N + + ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKYGLVTYATDPKV 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L + K TN+ L YN + N Sbjct: 321 LIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYN-MMSWGVNNFPDNWNR 379 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIRVIRSHE 334 + +IV +TDG + + ++ + ++ +Y G+ + + E Sbjct: 380 TRH--VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQE 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAF 358 + A AS + +++ ++ D F Sbjct: 438 NINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|148724910|emb|CAN87698.1| B-factor, properdin [Sus scrofa] Length = 549 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 72/208 (34%), Gaps = 28/208 (13%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE- 231 IVLD S SM + S + N + + ++ + + GLVT++ + Sbjct: 261 IVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGVKPKYGLVTYATDPKV 320 Query: 232 -------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +W L + K TN+ L YN + N Sbjct: 321 LIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYN-MMSWGVNNFPDNWNR 379 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYAIGIRVIRSHE 334 + +IV +TDG + + ++ + ++ +Y G+ + + E Sbjct: 380 TRH--VIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNPREDYLDIYVFGVGPLVNQE 437 Query: 335 FLRACAS----PNSFYLVENPHSMYDAF 358 + A AS + +++ ++ D F Sbjct: 438 NINALASKKDKEQHVFKLKDVDNLEDVF 465 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 49/341 (14%), Positives = 109/341 (31%), Gaps = 65/341 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV-------- 59 ++ F N G TI+TA+ + L + ++ + I K L D++ + Sbjct: 15 VKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAEDNQ 74 Query: 60 --------HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 Q +++ + + K + N L +D + Sbjct: 75 YRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRSDDSNGQ 134 Query: 112 RSTSLDIVVVP---------------QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 +++S + P SI+ + + + KF W + +V Sbjct: 135 KNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKF-----WLPWGQTLVS 189 Query: 157 ------------PITSSVKVNSQTDARLDMMIVLDVSRSMES--------FFDSSITKID 196 + Q +D+M+V D+SRSM +ID Sbjct: 190 SSQLHDGRVGINSGETYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRID 249 Query: 197 MAIKSINAMLEEV---KLIPDVNNVVQSGLVTFS------NKIEEFFLLEWGVSHLQRKI 247 + ++ + + + + D++ + G V+F+ ++ + L + + Q +I Sbjct: 250 ALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQAEI 309 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 G + L + + + + Q N E +Y Sbjct: 310 SNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYP 350 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 48/152 (31%), Gaps = 23/152 (15%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGM-RQHCNTEDANYK 288 + W L + T T G+ N + D N + N + Sbjct: 369 KATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTNTR 428 Query: 289 KIIVFMTDGENLSTKEDQQSLYY----CNEAKKR--------------GAIVYAIGIRVI 330 ++++ ++DGE+ ++ + C + K++ A+G Sbjct: 429 RVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFNPP 488 Query: 331 RSHEFL-RACASPNSFYLVENPHSMYDAFSHI 361 + + C +Y V + + DAF I Sbjct: 489 QDQLIAWKKCV-GKQYYPVNSKQGLLDAFKQI 519 >gi|21228105|ref|NP_634027.1| magnesium-chelatase subunit [Methanosarcina mazei Go1] gi|20906546|gb|AAM31699.1| Magnesium-chelatase subunit [Methanosarcina mazei Go1] Length = 692 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 + + A R P + + N I++ K + ++ V+D S Sbjct: 452 KTTDLAFDATLRAAAPYQ----LTREKNGNSIIIHDPDLRKKIREKKIGNLVLFVVDASG 507 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVS 241 SM + ++ + ++ +ML + + GL+ F + E + Sbjct: 508 SM-----GARQRMVASKGAVLSML-----MDAYQKRDRVGLIAFKGDSAELLLPPTSSIE 557 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--- 298 Q+ ++ + G T + GL Y I +R+ +T +V ++DG Sbjct: 558 LAQKYLQEMPT-GGKTPISRGLVKGYEII--KSELRRDPDTCP-----FMVLISDGRANV 609 Query: 299 -NLSTKEDQQSLYYCNEAKKRGAI 321 + + + K+ G Sbjct: 610 SMNGEPPLHEIITIASRLKEEGIQ 633 >gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca] Length = 885 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 77/198 (38%), Gaps = 30/198 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q ++ F++ + Sbjct: 248 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAED------QFSVIDFNHNVRTW 295 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + ++ I+ + G TN L A IF + + + Sbjct: 296 RNDLVSATKTQIVDAKKYIEKIQPSG-GTNINEALLRA---IFILNEANNLGMLDPESVS 351 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 +I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 352 -LIILVSDGDPTVGNCELKLSKIQKNVKQNIRDNIALFSLGIGFDVDYDFLKRLSNENRG 410 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 411 IAQRIYGNQDTSSQLKKF 428 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 58/200 (29%), Gaps = 31/200 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL----IPDVNNVVQSGLVTFSNKI 230 + VLDVS SME +++ ++ + + + L+ FS + Sbjct: 350 IFVLDVSGSMEG------KRLEALKAALGNLSGADTSLGWRFAAFADRERVTLIPFSGDV 403 Query: 231 EEFFLLEWGVSH------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E + + + G TN L AY Q Sbjct: 404 EAVRSFQVNKASRAADLQAIAAAGGALQAGGGTNIYGALSEAYRQAAAAPAGSYTS---- 459 Query: 285 ANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-S 341 +V MTDGE + + + Y A R + + E A + Sbjct: 460 ------VVLMTDGEGTAGPSLNEFRDFYAALPAGARSVKTFTVLFGDSDVQEMNEVAALT 513 Query: 342 PNSFYLVENPHSMYDAFSHI 361 + + ++ AF I Sbjct: 514 GGRTFDGQ--QNLAAAFKEI 531 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 71/188 (37%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGLVT 225 + D++IVLDVS SM + ++D+A ++ ++ + +V ++ V Sbjct: 110 SSGPKDVVIVLDVSGSMSQYG-----RLDLAKEAAETVINTLGADSFVNVVTFSETARVL 164 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +N E + L ++ L V TN + ++ + + Sbjct: 165 LTNSTTLVRATEDNLGELVSLVQNLEFDLANVGTNFGAAFETTFDIL----EASRTSEET 220 Query: 284 DANYKKIIVFMTDGE-NLSTKEDQQSLY--YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +N + IVF+TDG N+ D+ + C+ +Y V + +A A Sbjct: 221 SSNCQTAIVFLTDGNTNVGLSTDEVTSKQIACDTG-----GIYE---HVEDGGDLSQAMA 272 Query: 341 SPNSFYLV 348 +Y + Sbjct: 273 FFYRYYSI 280 >gi|183600995|ref|ZP_02962488.1| hypothetical protein PROSTU_04610 [Providencia stuartii ATCC 25827] gi|188019327|gb|EDU57367.1| hypothetical protein PROSTU_04610 [Providencia stuartii ATCC 25827] Length = 212 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 66/171 (38%), Gaps = 12/171 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I I AML ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHAVNVGIQAMLNALRQDPYALESVHIAIITYDNEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L + + + T + L+ + + +R+ +T+ +++ + Sbjct: 57 REFIPLTALENF--QFTDIVVPSSGGTFTGAALECLIQCV--DRDIRRTDDTQKGDWRPL 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + MTDG + +++ K+ + A + H+ L+ S Sbjct: 113 VFLMTDGTPSDSYAYTEAIK--EIKKRSFGSIIACAVGPKAKHDHLKQLTS 161 >gi|51598434|ref|YP_072622.1| hypothetical protein BG0172 [Borrelia garinii PBi] gi|51573005|gb|AAU07030.1| hypothetical protein BG0172 [Borrelia garinii PBi] Length = 333 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIRRFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF----Y 346 IV +TDG S + + N A+ +Y+IGI S + Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGSLK 252 Query: 347 LVENPHSMYDAFSHIG 362 + +P + + G Sbjct: 253 EIYDPSMLVEISHKTG 268 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 33/162 (20%), Positives = 68/162 (41%), Gaps = 26/162 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LD+S+SM + DS ++++ A ++ ++EE N + G+V FS Sbjct: 89 IIIALDISKSMLAE-DSKPSRLEKAKMIVSKIVEE-------NPGEKMGIVVFSGTAMWQ 140 Query: 234 FLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + L+ ++ + T + + A + ++ ++ Sbjct: 141 CPLTFDLHALKMFLQSVETTNLPLGGTRISSAIMLA-----------SKAASCESAGSRV 189 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ ++DGEN D + N AKK G + +IGI Sbjct: 190 MILISDGENH----DSKIKEAVNAAKKAGLRIISIGIGKKEG 227 >gi|162454787|ref|YP_001617154.1| hypothetical protein sce6505 [Sorangium cellulosum 'So ce 56'] gi|161165369|emb|CAN96674.1| putative membrane protein [Sorangium cellulosum 'So ce 56'] Length = 384 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 67/163 (41%), Gaps = 22/163 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++VLD S+SM + D ++I A + ++++++ + G V F+ + Sbjct: 89 VDVVVVLDYSKSMYA-RDVEPSRIFRAKVEVARLIKDLEGA-------RFGAVAFAGEPM 140 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + + L T L A N++ + + A +K Sbjct: 141 G-FPLTADGAAIAQFFRQLDPNDMPIGGTAIARALDQA-NELLK-------RDPKSAEHK 191 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +II+ +TDGE + S+ A+ V IG R Sbjct: 192 RIILLVTDGE--DLEGYPLSVAQAIGAQGTTIHVVQIGGRTPE 232 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/218 (16%), Positives = 74/218 (33%), Gaps = 35/218 (16%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F G +TIL+ ++ V G+ I+ +T L + +D +++ A + Sbjct: 9 RFVTRDDGAITILSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVLAGAGAPADAT 68 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + ++ + ++N ID DIV + S Sbjct: 69 AADVKLIVEDYFDAADLSQ--------------YLNTIDP-----ETDIVASLNAKSVSA 109 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S L+ TF+ + + T+ + R+++++ LDVS SM Sbjct: 110 SV------ELEMDTFLMRLSG---VDTLTTAGGATAAIAAPRMEIVLALDVSGSMAG--- 157 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ + + +V D +V +S Sbjct: 158 ---ERLTKMKSAAKQFVTDVMSASD-QGTTTISIVPYS 191 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 5/79 (6%) Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR---SHEFLRACA-SPNSFYLVENPH 352 G +D + C K +G +VY I + + + L CA SP+ + + Sbjct: 519 GSIAPETKDDRMADICGATKSKGIVVYTIAFEMGEFDSAADRLENCASSPSQHFNATTLN 578 Query: 353 SMYDAFSHIGKDIVTKRIW 371 + AF I ++ R+ Sbjct: 579 -ISQAFGSIAANVQKLRLT 596 >gi|327540682|gb|EGF27254.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 887 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 49/302 (16%), Positives = 111/302 (36%), Gaps = 31/302 (10%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E N+ + D + ++ + + ++ + R S+ I + Y Sbjct: 399 EPIQENKFRRVADDDDALSTFSIDVDTASYAKVRSYLQRGQ-LPRPDSVRIEELINYFDY 457 Query: 128 SISAISRYKIPLKFCTFI-----PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 + + P+ F + + PW N+R + + I + ++ + R +++ ++D S Sbjct: 458 QYT-PPSAEDPVPFSSAMAVASCPWNENNRLVRVGIQAK-DIDRKKRPRCNLVFLIDTSG 515 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM K+ + I+ + +L+++ V VV +G S+ + Sbjct: 516 SM-----KRPNKLPLVIEGMKVLLDQLNKKDRVAIVVYAG----SSGLVLDSTPVKQKKK 566 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLS 301 + R + LS G STN GL+ AY R++ + N ++ +DG+ N+ Sbjct: 567 IIRALSALSA-GGSTNGGAGLQLAYQT------ARENFIEDGVNR---VILCSDGDFNVG 616 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAF 358 Q + K G + +G + ++ + S ++ V+ Sbjct: 617 MTGTDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAGNYAFVDTIAEAKKVL 676 Query: 359 SH 360 + Sbjct: 677 AD 678 >gi|325679262|ref|ZP_08158849.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109048|gb|EGC03277.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 290 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/137 (23%), Positives = 53/137 (38%), Gaps = 18/137 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM F S + ++ +L D + TFS+ + Sbjct: 38 VAVVLDVSGSMAEAFRSGM-----VQATLERLLPLAMAFDDDGT---MEVWTFSHGFQRH 89 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L G + K LS +G TN +P ++ N E A ++F Sbjct: 90 KPLTRGNFYNYIKDNGLS-YGGGTNYSPVIRDVGNY---------FIKEEPACLPNYVIF 139 Query: 294 MTDGENLSTKEDQQSLY 310 +TDG+N +E + + Sbjct: 140 ITDGDNFDERETDRVIK 156 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 98/256 (38%), Gaps = 36/256 (14%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS------RYKIPLKFCTFIPW 147 E + F D+D ST+ ++ Q + + Y P+ + P+ Sbjct: 131 VAGETPVSTFSIDVDTGSYSTTRRLINQGQLPTKNTVRVEELVNYFSYDYPVPTNSEQPF 190 Query: 148 YTNSRHIVMPITSSV----------KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 N+ P + V + +++++LDVS SM SS K+ + Sbjct: 191 SVNTELAPSPYNADTQLLRIGLKGFDVAPDKLSASNLVLLLDVSGSM-----SSADKLPL 245 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ + +++ V+ VV +G + + + + ++ + L+ G T Sbjct: 246 LKQAMLMLSQQLSAQDKVSIVVYAGA----SGVVLDGVAGNDFTAIKTALSQLNAQG-GT 300 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 N + G++ AY ++H +N ++ TDG+ NL + QQ + + Sbjct: 301 NGSQGIQLAYQL------AQKHFIENGSNR---VILATDGDFNLGMTDHQQLVDFVASRS 351 Query: 317 KRGAIVYAIGIRVIRS 332 K+G + +G + Sbjct: 352 KKGIGLSTLGFGLGSG 367 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 67/201 (33%), Gaps = 26/201 (12%) Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKI 247 + + + + I +++ + D + G+V +S+ L G + L + + Sbjct: 14 NPLEDFEKVRQWIGKLVDTFDIEEDGGG-TRVGVVIYSDAPRMEISLGNGLGKTDLIKAV 72 Query: 248 -------------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L +T + ++YA F +I++ + Sbjct: 73 LVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFS---ETSGARALSEGINRIMIVL 129 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPH 352 TDG + A++ G ++YA+G+ E + P + V Sbjct: 130 TDGRAQDNVAGPAVI-----AQEDGIVIYAVGVGHAIKDELDEIASKPTHRHKFSVSEYG 184 Query: 353 SMYDAFSHIGKDIVTKRIWYD 373 ++ S++ + I I + Sbjct: 185 AIESIRSNLRRTICIHSICPN 205 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 67/201 (33%), Gaps = 26/201 (12%) Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKI 247 + + + + I +++ + D + G+V +S+ L G + L + + Sbjct: 14 NPLEDFEKVRQWIGKLVDTFDIEEDGGG-TRVGVVIYSDAPRMEISLGNGLGKTDLIKAV 72 Query: 248 -------------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L +T + ++YA F +I++ + Sbjct: 73 LVIYLNLIDFLFAQSLMYERGNTLTGESIRYASEVAFS---ETSGARALSEGINRIMIVL 129 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP--NSFYLVENPH 352 TDG + A++ G ++YA+G+ E + P + V Sbjct: 130 TDGRAQDNVAGPAVI-----AQEDGIVIYAVGVGHAIKDELDEIASKPTHRHKFSVSEYG 184 Query: 353 SMYDAFSHIGKDIVTKRIWYD 373 ++ S++ + I I + Sbjct: 185 AIESIRSNLRRTICIHSICPN 205 >gi|170724979|ref|YP_001759005.1| hypothetical protein Swoo_0614 [Shewanella woodyi ATCC 51908] gi|169810326|gb|ACA84910.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908] Length = 503 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 17/220 (7%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G I L +F ++ + +E + K L + + + A + N + Sbjct: 10 KRQRGAAAIYLVFLLIPLFGMVFLALEGTRYIQKKNRLGDATEAASL--AVSMANRDDKG 67 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 +L I ++N +S R + + Y ++A Sbjct: 68 YETQLAKDYISSYMRNIKEISQVKVERKEDIDHYPMADGS------FEDREYTQYRVTAK 121 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + L IP + + + + D +D++ V D S SM Sbjct: 122 TEHTSWL-HSDLIPSFKETETLANRALARAYPEYLGDRDVDIVFVSDFSGSM------KG 174 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKI 230 ++I+ +I + E+ + D + + LV ++ ++ Sbjct: 175 SRINSLKDAITEISNEILVPRDGETEIRNRIALVPYNMRV 214 >gi|47213219|emb|CAF89740.1| unnamed protein product [Tetraodon nigroviridis] Length = 1349 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 76/218 (34%), Gaps = 26/218 (11%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDM 174 L + P++ + ++ + + + T ++ Q + +D+ Sbjct: 88 LTLTSDPRDSSFVVACGPLWSHECGSSLYSTGICSRVSRTFRPTGTIAPALQRCETFMDI 147 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +IVLD S S+ + + N ++ + Q G+V + + + + F Sbjct: 148 IIVLDGSNSIYPWSEVQ-----------NFLINILHKFFMGPGQTQVGVVQYGSSVVQEF 196 Query: 235 LLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + + S + G T + G++ A +Q F G +K++ Sbjct: 197 RLGEHRTVEEVVVAARSIRQRGGEETRTALGMEVARSQGFGRGG--------RPGARKVM 248 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + +TDGE+ Q + YAI + Sbjct: 249 IVITDGESHD--SAQLPQAVARSNRDN-ITTYAIAVSA 283 >gi|47191295|emb|CAF87304.1| unnamed protein product [Tetraodon nigroviridis] Length = 202 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 69/185 (37%), Gaps = 22/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + + T I I ++++++ + PD V+ +V +S+ ++ Sbjct: 8 DIVFLVDGSDT------TGETGIAYIRDFIISVVQQLDVQPD---RVRVAVVQYSDNVQR 58 Query: 233 FFLLE--WGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + +K L G S++ KY ++ + ++ + Sbjct: 59 EFALNSHNNKQAVISAVKRLRLMGGRSSDLGEAFKYV-----TENELKPSSGSRPSDASQ 113 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +V +T G+ Q + Y K +G+ + + + +P V Sbjct: 114 HLVVITGGQ-----SPQDASLYGPWLKSSRVSCIGVGVGGTNTRQLTQIATTPEDVLQVP 168 Query: 350 NPHSM 354 S+ Sbjct: 169 TFLSL 173 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/154 (23%), Positives = 60/154 (38%), Gaps = 24/154 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITK-------IDMAIKSINAMLEEVKLIPDVNNVV 219 Q + + V D S SM TK +D+ K M++++ I +++ V Sbjct: 193 QGQMNIAVSFVFDTSGSMNWDLQGRETKKSGNESRMDILRKKSVIMIKDLAEIGNIS-VN 251 Query: 220 QSGLVTFSNKIEEFFL-LEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQG 275 G T + I++ F L+ G + + I L+ GV TN GL+Y + Sbjct: 252 LVGFSTSAKYIQQNFSNLDNGTNTIIATINKRENLNPDGV-TNPGDGLRYGMISLQSQPA 310 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K IV +TDG + D ++L Sbjct: 311 QL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|115376470|ref|ZP_01463705.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310819725|ref|YP_003952083.1| hypothetical protein STAUR_2452 [Stigmatella aurantiaca DW4/3-1] gi|115366538|gb|EAU65538.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309392797|gb|ADO70256.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 218 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 20/208 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +Q L ++++ DVS SM KI ++ M+E + D+ V ++ Sbjct: 8 TTQAARPLPVVVLADVSGSM-----GVDGKIQALNLAVREMIEAFQDESDLRAEVHVSVI 62 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TF + L G + + N + A++ M R Sbjct: 63 TFGGQSRVHLPL--GRARDAAWTD------LGANGGTPMGAAFDLARTMVEDR--NAVPS 112 Query: 285 ANYKKIIVFMTDGENLST-KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASP 342 Y+ IV ++DG+ K+ +SL + A A+ I H L+A A P Sbjct: 113 RAYRPTIVLVSDGQPTDEWKQPLESLL--KNERGGKAFRMALAIGADADHAVLQAFLADP 170 Query: 343 N-SFYLVENPHSMYDAFSHIGKDIVTKR 369 Y + + F + + + Sbjct: 171 EARVYRADEARQIRKFFQLVTMSVSARS 198 >gi|257470753|ref|ZP_05634843.1| von Willebrand factor type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] gi|317064958|ref|ZP_07929443.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313690634|gb|EFS27469.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 376 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 63/174 (36%), Gaps = 18/174 (10%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P ++V+ + ++++ VLD + SM + TKI + + + + Sbjct: 25 PENTAVEQTRAIEKDVEIVFVLDTTGSMGGLIQGAKTKIWSIVNEV--------MQNHKD 76 Query: 217 NVVQSGLVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + V+ GLV + ++ + + L + + + G + K + + Sbjct: 77 SKVKIGLVAYRDRGDVYVTKVTQLNENLDEIYSVLMDYKAQGGGDDPEDVRKALHESLEI 136 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ-QSLYYCNEAKKRGAIVYAI 325 +Q N +II + D D ++ +AK +G I+ I Sbjct: 137 IQWS-----APRENLSQIIFLVGDAPPHDDYNDSPDTVVTAKKAKSKGIIINTI 185 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 73/205 (35%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSM--ESFFDS---SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++V+D+S+SM E F D+ ++ + + PD + GL+ F Sbjct: 93 DILLVVDISQSMDSEDFRDAQGRPASRWQAVQAVVGDFI---DKRPDD----RLGLIVFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + L+ ++ + N+ G D G+ Sbjct: 146 AGAYPQAPLTRDHAALRLLLQRTAVGMAGPNTALG---------DAIGLGIRMLDHAGER 196 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA 340 KI++ +TDG T A + +V+ IGI + + LR A Sbjct: 197 DKILILLTDG--NDTASAVPPARAAELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIA 254 Query: 341 --SPNSFYLVENPHSMYDAFSHIGK 363 + F+ + S+ + ++ + + Sbjct: 255 RIAGGRFFRARDQASLQEVYATLDR 279 >gi|134299282|ref|YP_001112778.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134051982|gb|ABO49953.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 599 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/235 (12%), Positives = 75/235 (31%), Gaps = 13/235 (5%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 FR L D G + + + ++ + ++ + + L Sbjct: 343 FRRPLMDEGAIERYEQARSAVQPFVIRLRNRLSWANTRQVINQFQLPSGELDEDSLYQAS 402 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I ++ + +LD+ +++D S SM I+++++A +E ++ + Sbjct: 403 YSNRIFRQTEIINTRTRKLDICLLVDTSASMVYPAGKEISRVELARNLAALFVEALEPVD 462 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + V F+ K + + + ++ +T +KYA Sbjct: 463 SIKTWV----FGFNLKGAVNMYELYSPALPNKARIGVTGAEGTTPEGTAVKYA------- 511 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +K+++ + DG E + K G +I + Sbjct: 512 --ALRMMAEGRPFVQKVLIVIADGNPNPGPETKLVKEQVKRLKALGCKTISILLG 564 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 42/264 (15%), Positives = 84/264 (31%), Gaps = 38/264 (14%) Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 S D V P YS S P+ + +++K+ +A Sbjct: 269 DGVSGDTTVKPSRVQYSFSK------PVTTSGDYLNAVTPGTHLAAAQANLKILWLAEAT 322 Query: 172 LDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + IVLD S SM + D T + A + +++ + V G+V F + Sbjct: 323 MVYQIVLDRSGSMGTNPDKPDDPTPLSYAKTAACNLVDSLPK------NVYVGIVQFDDS 376 Query: 230 IEEFFLLE----------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + + + + I L+ G ST YA +Q + Sbjct: 377 TSQVYPITLIASNDAAAAATRAAAKAAINGLTS-GGSTAIYDAASYALSQFVAQKTAL-- 433 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFL 336 A+ + +TDGE+ S+ + E + + + +G S Sbjct: 434 ----SADLLGVTYLLTDGEDNSSS--KSVGEVIGEYQAQKVPLITVGYGAGGQAGSFALT 487 Query: 337 RAC-ASPNSFYL-VENPHSMYDAF 358 + + ++ + ++ F Sbjct: 488 QLADGTGGQYFASPVDQAALQQVF 511 >gi|332216213|ref|XP_003257239.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Nomascus leucogenys] Length = 930 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEAIQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPDGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ + + +S+ EA ++ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|239908812|ref|YP_002955554.1| hypothetical protein DMR_41770 [Desulfovibrio magneticus RS-1] gi|239798679|dbj|BAH77668.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 439 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 69/212 (32%), Gaps = 27/212 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + ++ + ++ RL++ + +D S SM + + + Sbjct: 27 VRIQAPNTPEGETKERTRLNLALAIDRSGSMAGRPLEEAKRCASFV------------VD 74 Query: 214 DVNNVVQSGLVTFSNKIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + N + L+ + + IE + + R I+ + G + LK A Sbjct: 75 KLKNTDRVSLIAYDSSIETRVPSVKVEDKAIFHRAIEGIDDGGCTNLHGGWLKGA----- 129 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + + + + I+ ++DG+ N ++ + C E G G+ Sbjct: 130 --EQISPYIDPSTISR---IILLSDGQANEGLTDEAEIFKQCRELADAGVTTSTYGLGSN 184 Query: 331 RSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + + A + Y + D F Sbjct: 185 FNETLMIGMAKNGQGNSYYGRTADDLMDPFQE 216 >gi|188527810|ref|YP_001910497.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] gi|188144050|gb|ACD48467.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] Length = 217 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 70/202 (34%), Gaps = 10/202 (4%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + ++LD S SM + T+I+ I M+E +K + ++T Sbjct: 9 TMEERFIPVFLLLDTSGSMNESL-GNCTRIEALNLCIQKMIETLKQEAKKELFSKMAIIT 67 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F E +L ++ T + A + I D Sbjct: 68 FG---ENGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKDLIEDKDTF------PTK 118 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 YK + ++DGE K + + ++ + ++ ++I I ++ + + Sbjct: 119 FYKLYSILVSDGEPNDDKWQKALSNFHHDGRSAKSVCWSIFIGDRNTNPQVNKDFGKDGV 178 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + ++ + F + + I Sbjct: 179 FYADDVEKLVGLFEIMTQTISK 200 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 51/183 (27%), Gaps = 46/183 (25%) Query: 234 FLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + L I + T S G+ + + + + K ++ Sbjct: 280 TPLTNVKATLTPAINAMEAWSRGGTLSDVGMAWGLRVLSPEPPFTEGLPWGTPKWSKAVI 339 Query: 293 FMTDGENL-----------------------------------------STKEDQQSLYY 311 MTDG+N T + + Sbjct: 340 LMTDGDNQFYKLTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSV 399 Query: 312 CNEAKKRGAIVYAIGIR---VIRSHEFLRACASP-NSFYLVENPHSMYDAFSHIGKDIVT 367 CN K + IVY + + + +ACA+ + ++ + + AF I + Sbjct: 400 CNAMKAKNIIVYTVTFTSGINQATKDIYKACATDASKYFDSPSQDELKSAFRAIATSLSN 459 Query: 368 KRI 370 R+ Sbjct: 460 LRV 462 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 29/225 (12%), Positives = 74/225 (32%), Gaps = 38/225 (16%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ KG M I+ AI L I +G+ ++V+ + +K+ + + +D + + + Sbjct: 5 SLSRLMTCRKGNMAIILAIGLLPIITTIGLGVDVARAYAVKSRMSAALDAAALAVGSSSG 64 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + + D + S ++ + + Sbjct: 65 TDAQLSAVAQK--------------------FFDANYPTGALGAHPSVAVKVTGDVISAS 104 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + + + + S+V A L++ +VLD + SM + Sbjct: 105 AVAEVDTVFM----------KVVGLNDVPVHADSTVNRQI---AGLELAMVLDNTGSMTT 151 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + I + N + + + V+ ++ LV +S + Sbjct: 152 NNN-----IQAVRDAANQLTDILFGTATVHPYLKIALVPYSAAVN 191 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 30/204 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK+ K+++ +L +++ N ++TFS+ + Sbjct: 284 VVFVIDVSGSMFG------TKLQQTKKAMDTILSDLQASDSFN------IITFSDTVNIW 331 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDAN 286 + + + + G T+ L A + + Q + Sbjct: 332 KAEGSIQATVQNIHSAKNYVSRMEADGW-TDINAALLAAASVLNHSNQEPGKGRGVGQIP 390 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS- 344 I+F+TDGE L +A ++++ LR + N Sbjct: 391 L---IMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQG 447 Query: 345 -----FYLVENPHSMYDAFSHIGK 363 + + + ++ I + Sbjct: 448 EARRIYEDADAALQLEGLYAEISR 471 >gi|144897297|emb|CAM74161.1| ppkA-related protein [Magnetospirillum gryphiswaldense MSR-1] Length = 663 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 79/209 (37%), Gaps = 50/209 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D + SM+ + ID ++ + E++ + + V+ GLV F + ++ Sbjct: 232 VVFVIDSTMSMDPY-------IDRTRNAVQRITEKIDKA-GLADRVKFGLVAFRSNVQAS 283 Query: 234 FLLEWGV------------SHLQRKIKYLSKFGVST-----NSTPGLKYAYNQI--FDMQ 274 LE+ Q+++ L VST +S GL A NQ+ D Sbjct: 284 PGLEYTSRLYADPSQVKDGQDFQKRVAGLKGASVSTARFSEDSYAGLMTAVNQVPWTDFG 343 Query: 275 GMRQHCNTEDANYKKIIVFMTD------GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 G + +V +TD G+ LS + EA RG +Y + ++ Sbjct: 344 G-------------RYVVLITDAGALRGGDPLS-STKLDAEQVRLEALHRGLAIYTLHLK 389 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDA 357 + A S Y V + H + A Sbjct: 390 TPAGKD---NHAEAESQYKVLSGHPLLSA 415 >gi|306829469|ref|ZP_07462659.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] gi|304428555|gb|EFM31645.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] Length = 861 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 58/322 (18%), Positives = 106/322 (32%), Gaps = 89/322 (27%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME---- 185 S I I K T I + + + ITS + +Q++ LD+++V D+S SM Sbjct: 37 STIEPQTILHKIITPISGQEDKYELSLDITSKLGTETQSE-PLDVVLVADLSGSMNKRDV 95 Query: 186 -SFFDSSITKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE-EFFLLEWGV- 240 S +IT++D ++ + LI + N+ + +V F KI+ +F W Sbjct: 96 PSSTGRTITRLDALKNTLKGTRDRQGLIDTILSNSNNRLSMVGFGGKIDNKFAEQSWNSY 155 Query: 241 ---------------------------------------SHLQRKIKYLS-KFGVSTNST 260 S + + + G S + Sbjct: 156 YRKWEWGYRYWPYEERTAFYDGVSPWDDADTILNWNNDASGSKTAVSKMRIAGGQSIGTE 215 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-------------ENLSTKEDQQ 307 G+ N ++ Q ++ N KK+++ ++DG N + ++ + Sbjct: 216 SGIGTGTNISAGIRIANQLIDSARPNAKKVVIVLSDGFANMYYNDSGYTVYNYNNQDGSE 275 Query: 308 S---------------LYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACAS--PN 343 + L Y K G Y+I R + ++R S PN Sbjct: 276 TAPEWFWNNLDVSINNLAYSLAPKLDGF--YSIKFRYSNNVDSITSLQYYIRYHNSSIPN 333 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 N + D+F I I Sbjct: 334 EILSANNEDQLRDSFKDITDKI 355 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 76/213 (35%), Gaps = 26/213 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQS 221 ++ VLD S S++ A I ++ V K D N Sbjct: 148 GGRINQGPGTEIAFVLDGSGSIQD------DDFQKAKDFIYNVMSNVWKTCFDCN----F 197 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V + + I L + K+K + + T + + + IF + Sbjct: 198 AIVQYGSLIRTELSLLDNEDRVGSLLKVKQIKQIYNLTKTASAINHVLTDIFIPE----- 252 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHE 334 N N KII+ ++DG+ L N+ + +G Y+IG+ + + E Sbjct: 253 -NGSKDNSAKIIIVLSDGKILGDPMTLD--EVLNKPQMKGVTRYSIGVGDGILKNLDATE 309 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + A P +Y V + ++ D S + + I+ Sbjct: 310 EMMQIADPGKYYNVSSYRALNDIVSSLERGIIG 342 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 75/209 (35%), Gaps = 31/209 (14%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D ++ VLD S S++ + + I +V + +V Sbjct: 342 GTKDPGTEIAFVLDGSGSIQD----------------DDFQKAKDFIYNVMSNCNFAIVQ 385 Query: 226 FSNKIEEFFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + I LL K+K + + T + + + IF + ++ Sbjct: 386 YGSLIRTELLLLDNEDRAGSLLKVKQIKQIYNLTKTASAINHVLTDIFIPEKGSKNNTA- 444 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-----VIRSHEFLRA 338 KII+ ++DGE L N+ + +G Y+IG+ + + + Sbjct: 445 -----KIIIVLSDGEILEDPMTLD--EVLNKPQMKGVTRYSIGVGDGILKKPNAVKEMMQ 497 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A P +Y V + ++ D S + ++I+ Sbjct: 498 IADPGKYYSVSSYGALNDILSSLEREIIG 526 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 59/175 (33%), Gaps = 24/175 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 K+ + +D++ LDVS+SM E + + K I I L Sbjct: 58 GTKLETVKREGVDVVFALDVSKSMLAEDIAPNRLEKSKQLITQIVNELGG---------- 107 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G++ ++ + + + + + VS + + A + Sbjct: 108 DRVGIIGYAGSAFPQVPITTDFASTKLFLNSMDTDMVS-SQGTAISEAVQMSTTYFDDEE 166 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N +++ ++DGE+ + A + G VYAIG+ + Sbjct: 167 QKN-------RVLFVISDGEDHEGNFESAI----ENATENGIKVYAIGVGTEKGG 210 >gi|197099226|ref|NP_001126843.1| inter-alpha-trypsin inhibitor heavy chain H4 [Pongo abelii] gi|55732844|emb|CAH93116.1| hypothetical protein [Pongo abelii] Length = 896 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 62/169 (36%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGKKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPDGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ + + +S+ EA ++ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|291232650|ref|XP_002736273.1| PREDICTED: mind bomb 2-like [Saccoglossus kowalevskii] Length = 847 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 64/196 (32%), Gaps = 14/196 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD ++ LD S SM + ++ A +++ + V +V F +K Sbjct: 441 PLDTVLCLDTSGSM---AGRGLRELKKACTEFLLGIQQTATQTGLRENV--AVVEFGSKT 495 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + I L G +T GL A ++ G+ + + Sbjct: 496 RIVRNLTDNYRLTKNAIDSLQA-GGTTPMFEGLMEAMKEVIQNGGVLTLPGGKKMTPR-- 552 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEA-KKRG----AIVYAIGIRVIRSHEFLRACAS-PNS 344 I+ MTDG + ++ K G + +G E L A N Sbjct: 553 IILMTDGRPDDKDQVLKAAMSFGPLWKAVGLPFPIPIACVGCGPDVDKELLAVIAKVTNG 612 Query: 345 FYLVENPHSMYDAFSH 360 ++V + + D F Sbjct: 613 MFVVGDISQLGDFFRR 628 >gi|221107805|ref|XP_002169300.1| PREDICTED: similar to high choriolytic enzyme [Hydra magnipapillata] Length = 1810 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 74/212 (34%), Gaps = 22/212 (10%) Query: 155 VMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + I++ V S LD++ ++D S +M+ F K+ +K I + Sbjct: 132 NLFISNQVNSKSNKNLCGGLLDVVFLIDSSTNMKDNF----AKVKRFMKEIAKSFGLSRE 187 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFG-VSTNSTPGLKYAYN 268 V+ ++ F ++ + L S + + ST ++ A Sbjct: 188 GSHVS------IILFGSEAKVSLKLSDNQDESTFDSACDSIDENAFPSTKVPIKIEKALQ 241 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY---CNEAKKRGAIVYAI 325 + + NT K ++ +T G+ +K ++ E +++G + ++ Sbjct: 242 --LAQKELFVQANTGREFIPKTVILITAGKQNPSKYGERVEKPFPFALELREKGVSIISV 299 Query: 326 GIRV-IRSHEFLRACASPNSFYLVENPHSMYD 356 + I E + +LV N + + Sbjct: 300 AVGSEIEKSELTGITGNIEKVFLVNNFQDLLE 331 >gi|260787567|ref|XP_002588824.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] gi|229273994|gb|EEN44835.1| hypothetical protein BRAFLDRAFT_89745 [Branchiostoma floridae] Length = 1344 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 68/185 (36%), Gaps = 25/185 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ + D+ D++ +DVS S+ FD+ T I+ + S+ + Sbjct: 1047 GTLPSCAACDSAADIIFSIDVSGSVSGHFDTVSTFINGVVNSLT----------IGRSHA 1096 Query: 220 QSGLVTFS-NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + GL+ F+ ++ E + L +L I + S+ A + + Sbjct: 1097 RVGLIKFAGSRAETYISLTDFDNKYNLVSAISSV----FDHYSSGEFSIAGLSVMGHEFS 1152 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +KI + +TDG+ + + G ++ +G+ +R E L Sbjct: 1153 TN----GRSAARKIGIVLTDGQ---DSGSGYVIPDATALRNDGTTIFCVGVADVR-RETL 1204 Query: 337 RACAS 341 AS Sbjct: 1205 DNMAS 1209 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 67/170 (39%), Gaps = 23/170 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI-TKIDMAIKSINAMLEEVKLI 212 + + IT + + + D+++ +D+S SM + T +D A + A L + K Sbjct: 54 VKVDITGANPITRNS----DVVLAIDISGSMGNTISGDFKTSMDYAKDAALAFLNQAKAN 109 Query: 213 PDVNNVVQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 P + +V +S KI +L GV+ + +I L TNST Y Sbjct: 110 PQN----RIAIVAYSTTASLKIGLTYLNATGVTQITNQINALQA----TNSTNI----YA 157 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 I + + D + + I+ +TDG T + CN +K Sbjct: 158 GIVRSETELETNGRFDCSTARAIILLTDGVTNVTGTSGNT--NCNVSKTS 205 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 68/188 (36%), Gaps = 21/188 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW ++ + + I + + +++ ++DVS SM+ K+ + + Sbjct: 243 VMPTPWNAQTKLMHVAIKG-FDIKPTEQPKANLVFLIDVSGSMDE-----PDKLPLLKSA 296 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ +++ ++ V +G + + I L+ G ST Sbjct: 297 FRLLVSKLRADDTISIVTYAGEAGTVLMPTRAAEKD----KILNAIDNLTP-GGSTAGEA 351 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 G+K AY ++ ++ N ++ TDG+ N+ +D + +K G Sbjct: 352 GIKEAY-KLAQQSFVKDGVNR--------VMLATDGDFNVGQSDDDDLKRLIEQERKSGV 402 Query: 321 IVYAIGIR 328 + G Sbjct: 403 FLSVFGFG 410 >gi|94968893|ref|YP_590941.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550943|gb|ABF40867.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 628 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 69/212 (32%), Gaps = 32/212 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L + +V+D S S+ + + L+ V P+ V V F Sbjct: 393 EAQLPLRIGLVIDTSASI-------AGRFKFEQDAAGEFLQRVLTGPEDLGFV----VGF 441 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 SN I + I+ + G + L A N F + + H E Sbjct: 442 SNSILMAQDFTHDSKQIAHSIQAFAPSGGT-----ALWDAVN--FAAEKLASH--PERQP 492 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI--------GIRVIRSHEFLRA 338 KI++ ++DGE+ + A+ VYAI L+ Sbjct: 493 VAKILIVISDGEDN--SSATTAKQAIQRAQSEEVAVYAINTLEITQRSEEPPVGVRALKT 550 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + + + + + + + + + I ++ Sbjct: 551 LAEMTGGAAFTPGSVRWLNSSLNDLQQVIRSR 582 >gi|13399532|pdb|1DZI|A Chain A, Integrin Alpha2 I Domain COLLAGEN COMPLEX Length = 185 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 74/200 (37%), Gaps = 35/200 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++V D S S+ + D + ++ + + P Q GL+ ++N Sbjct: 3 IDVVVVCDESNSIYPW--------DAVKNFLEKFVQGLDIGPTK---TQVGLIQYANNPR 51 Query: 232 EFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + + + + G TN+ ++YA + + + Sbjct: 52 VVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSA------ASGGRRSAT 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RVIRSHEF---LRAC 339 K++V +TDGE+ + + CN + + I + + + ++A Sbjct: 106 KVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNALDTKNLIKEIKAI 162 Query: 340 AS---PNSFYLVENPHSMYD 356 AS F+ V + ++ + Sbjct: 163 ASIPTERYFFNVSDEAALLE 182 >gi|170575265|ref|XP_001893167.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158600964|gb|EDP37999.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 448 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 57/145 (39%), Gaps = 23/145 (15%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 PI+S + A D + V+D + S+ FF+ T I I I +P+ + Sbjct: 245 PISSLSGRLDCSLAPFDTLFVVDSTSSVRQFFEDHRTYIIEIINLI---------LPEFD 295 Query: 217 NVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 N + G++ +S+ + L + + + +K L F T + LK A + D Sbjct: 296 NDTRIGIIEYSSSLRRQVKLRFIAHKNRTEIVETVKKLPFFAGITATGAALKLALEILQD 355 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDG 297 +N +V +TDG Sbjct: 356 R----------RSNVLTNVVVLTDG 370 >gi|148686422|gb|EDL18369.1| mCG2843 [Mus musculus] Length = 1122 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 58/179 (32%), Gaps = 24/179 (13%) Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYL-SKFGVSTNSTPGLKYAY 267 VN+ G+V + + F L + + + G+ T + G+ A Sbjct: 140 TFQVVNSFAPVGIVQYGANVTHEFNLNKYSSTEEVLVAANKIGRRGGLQTMTALGIDTAR 199 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + F KK++V +TDGE+ +Q + C ++I I Sbjct: 200 KEAFT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---DENIQRFSIAI 250 Query: 328 -RVIRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 +++ AS F+ V + ++ +G+ I D+ Sbjct: 251 LGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERIFALEATADQ 309 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 66/187 (35%), Gaps = 23/187 (12%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW ++ + + + + + ++++ ++D S SM+ K+ + +S M Sbjct: 328 PWREGTQIVQIGLQGML---PEARPPMNLVFLIDTSGSMQ-----DANKLPLLKQSFRLM 379 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 L ++ V+ V +G ++ E + L + L G ST GL+ Sbjct: 380 LGQLGEEDMVSIVTYAGSAG---RVLEPTKASDRQTILDA-LDRLEA-GGSTAGQAGLQQ 434 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY +M ++ TDG+ N+ + Y + G + Sbjct: 435 AYATATEMARDGAVSR---------VILATDGDFNVGISDPDDMKDYIETQRGTGTYLSV 485 Query: 325 IGIRVIR 331 +G Sbjct: 486 LGFGRGN 492 >gi|308062304|gb|ADO04192.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Cuz20] Length = 217 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 70/202 (34%), Gaps = 10/202 (4%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + ++LD S SM + T+I+ I M+E +K + ++T Sbjct: 9 TMEERFIPVFLLLDTSGSMNESL-GNCTRIEALNLCIQKMIEILKQEAKKELFSKMAIIT 67 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F E +L ++ T + A + I D Sbjct: 68 FG---ENGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKDLIEDKDTF------PTK 118 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 YK + ++DGE K + + ++ + ++ ++I I ++ + + Sbjct: 119 FYKLYSILVSDGEPNDDKWQKALSNFHHDGRSAKSVCWSIFIGDRNTNPQVNKDFGKDGV 178 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + +N + F + + I Sbjct: 179 FYADNVEKLVGLFEIMTQTISK 200 >gi|294628791|ref|ZP_06707351.1| toxic cation resistance protein [Streptomyces sp. e14] gi|292832124|gb|EFF90473.1| toxic cation resistance protein [Streptomyces sp. e14] Length = 241 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 62/179 (34%), Gaps = 26/179 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +S+ + R + +V+D S SM +F + + + + + + Sbjct: 25 ASLAEHGLAGQRAAVYLVVDHSGSMRPYFRNG--SVQALADRVLGLAAHLD------DDG 76 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 Q +V FS ++ + + Q +++ ++ TN + + D Sbjct: 77 QVPVVFFSTDVDAVTEIA--LDDHQGRVERIAAGLGHMGRTNYHLAMDAVIDHYLDSGAT 134 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++VF TDG S ++ Y C AK + IG RS +F Sbjct: 135 D----------PALVVFQTDGAPTSRPAAER--YLCKAAKLPLFWQF-IGFGNPRSRQF 180 >gi|261822920|ref|YP_003261026.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606933|gb|ACX89419.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 346 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 59/167 (35%), Gaps = 14/167 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ K++ +++I + ++K P ++ F+ Sbjct: 3 RLPVFFVLDCSESM---IGDNLKKMNDGLQAI---VSDLKKDPHALETAWISVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L V + L G T+ L+ QI + D + Sbjct: 57 QTIVPL---VEVVSFYPPRLP-LGGGTSLGAALRELTKQIDEQVRKTTQERKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + +TDG + + + + A + AIG+ LR Sbjct: 111 VYLLTDGRPTDDTASEITRWKQHYA--NKVNLIAIGLGASADLNTLR 155 >gi|297699669|ref|XP_002826908.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Pongo abelii] Length = 1273 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 295 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 344 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 345 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 398 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG D ++ N K +G +AIG+ Sbjct: 399 MVVLTDGGIFEDPLDLTTVI--NSPKMQGVERFAIGVG 434 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 70/198 (35%), Gaps = 30/198 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD+S SM S ++ I K+ + + D + G+V F+++ Sbjct: 302 VLLLDISGSMSS--NNRIEKLGQVAS-----IYILLTADDDDE---LGMVVFNDQPSTRS 351 Query: 235 LLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + I G +T GL A + + + Sbjct: 352 QMVTISESTRLDLLELIPTRDDIGDATGIGSGLSEAIDVLENGGNDAAGGC--------- 402 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 I+ ++DGE + +G V+ I + V SH L SFY Sbjct: 403 IILVSDGEENRSPYIDDVQS---TIVDKGVCVHTIALGVDASHNMEQLPLATDGKSFYYS 459 Query: 349 ENP--HSMYDAFSHIGKD 364 ENP +++ +AF I K Sbjct: 460 ENPYSNALNEAFITIAKQ 477 >gi|290958909|ref|YP_003490091.1| hypothetical protein SCAB_44831 [Streptomyces scabiei 87.22] gi|260648435|emb|CBG71546.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 708 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 68/190 (35%), Gaps = 25/190 (13%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM ++ D S + ++ A L+ K +V FS +++ Sbjct: 526 VYLVLDRSASMRGYYKDGSAQALGEQTLALAAHLDPEKTTVP--------VVFFSTELDG 577 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + K+ L G+ A ++ QH A ++V Sbjct: 578 TGELT--LDAFDNKVDDLHA-GLGRMGRTSYHVAVEEVLA-----QHR-KAAAGAPALVV 628 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEF--LRAC--ASPNSFYL 347 F TDG + Q+L + A+ G + + F LR A+ + F Sbjct: 629 FQTDGAPDAKTPANQAL--ADAAENHPGVHFAFVAFGDPENKAFDYLRKLKTATTSHFLA 686 Query: 348 VENPHSMYDA 357 E P + DA Sbjct: 687 GETPRELTDA 696 >gi|289596620|ref|YP_003483316.1| von Willebrand factor type A [Aciduliprofundum boonei T469] gi|289534407|gb|ADD08754.1| von Willebrand factor type A [Aciduliprofundum boonei T469] Length = 2166 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 57/153 (37%), Gaps = 26/153 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSI-----------TKIDMAIKSINAMLEEVKLIPDV 215 +D++ V+D S SM S + T+ID+AI++ ++E+ Sbjct: 1246 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAAIDAVKELGPQD-- 1303 Query: 216 NNVVQSGLVTF--SNKIEEFFLLEWGVSH-LQRKIKYLS--KFGVSTNSTPGLKYAYNQI 270 + + TF ++ EE+ + + L I L + T L +A + Sbjct: 1304 ----RVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYM 1359 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 N + + + I+ +TDG + S Sbjct: 1360 DTQSAD----NPDREDATRGILVLTDGLSNSDN 1388 >gi|254167891|ref|ZP_04874740.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] gi|197623182|gb|EDY35748.1| von Willebrand factor type A domain protein [Aciduliprofundum boonei T469] Length = 2164 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 57/153 (37%), Gaps = 26/153 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSI-----------TKIDMAIKSINAMLEEVKLIPDV 215 +D++ V+D S SM S + T+ID+AI++ ++E+ Sbjct: 1244 NKRKPIDIIFVIDTSGSMNSVVPGATVGDVNGDGRSNTRIDVAIQAAIDAVKELGPQD-- 1301 Query: 216 NNVVQSGLVTF--SNKIEEFFLLEWGVSH-LQRKIKYLS--KFGVSTNSTPGLKYAYNQI 270 + + TF ++ EE+ + + L I L + T L +A + Sbjct: 1302 ----RVAVFTFDGNSHPEEYMGFTYVTADNLPTIISDLKDIQADGGTPLYDTLSWAVYYM 1357 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 N + + + I+ +TDG + S Sbjct: 1358 DTQSAD----NPDREDATRGILVLTDGLSNSDN 1386 >gi|311897983|dbj|BAJ30391.1| hypothetical protein KSE_46100 [Kitasatospora setae KM-6054] Length = 455 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 69/210 (32%), Gaps = 39/210 (18%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQS-GLVTFSNKIEEF 233 ++LD S SM + +++++A +SI +++ + P+ +++ G + +E Sbjct: 55 LILDGSGSMRTIDIQGKSRMEVAQQSIAEVIDAL---PNETEFGIRTLGATYPGSDQKEG 111 Query: 234 FLLE--------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + + G T L+ A + R+ Sbjct: 112 CKDTQQLYRVGKTNKVEAKTAVATVRPTGW-TPIGIALRAAAQDLGTGPTTRR------- 163 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY--AIGIRVIRSHEFLRAC- 339 IV +TDGE+ D C+ A++ +G + +G+ C Sbjct: 164 -----IVLITDGEDTCAPPD-----PCDVARELASQGIHLVVDTLGLAHDDKTRQQLICI 213 Query: 340 --ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+ +F V + + Sbjct: 214 ANATGGTFTDVRTQEQLTKRVKQLVNRAQD 243 >gi|111114995|ref|YP_709613.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|216263812|ref|ZP_03435806.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] gi|110890269|gb|ABH01437.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|215979856|gb|EEC20678.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] Length = 333 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF----Y 346 IV +TDG S + + N A+ +Y+IGI S + Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVRIYSIGIGSSEEFSVEFKLRSGKFYQGSLK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|332216215|ref|XP_003257240.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Nomascus leucogenys] Length = 900 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEAIQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPDGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ + + +S+ EA ++ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQKNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 53/162 (32%), Gaps = 25/162 (15%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ F L + LQ ++ + G +T + A + + Sbjct: 30 RLGLIVFGTAAYPQAPLTLDHAALQLLLRHTAVGMAGPNTAIGDAIGLAIRMLDAVDE-- 87 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---- 333 K+++ +TDG T A + ++ IG+ ++ Sbjct: 88 ---------PDKVLILLTDG--NDTGSAVPPQRAATLAAQHHIRIHTIGMGDPQARGDDK 136 Query: 334 ---EFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + L A+ F+ + S+ ++ + I +R+ Sbjct: 137 VDFDLLEHIAQATGGRFFQANDRESLQQVYATL-DQITPRRV 177 >gi|58176651|pdb|1RRK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B gi|58176654|pdb|1RS0|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With Di-Isopropyl-Phosphate (Dip) gi|58176655|pdb|1RTK|A Chain A, Crystal Structure Analysis Of The Bb Segment Of Factor B Complexed With 4-Guanidinobenzoic Acid Length = 497 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 73/223 (32%), Gaps = 34/223 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +VLD S S+ + + K+ ++ ++ + + GLVT++ Sbjct: 1 SMNIYLVLDGSDSIGASNFTGAKKV---------LVNLIEKVASYGVKPRYGLVTYATYP 51 Query: 231 E--------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + +W L K TN+ L+ Y+ + Sbjct: 52 KIWVKVSEADSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYS-MMSWPDDVPPEGW 110 Query: 283 EDANYKKIIVFMTDG-ENLSTKEDQQSLYYCN---------EAKKRGAIVYAIGIRVIRS 332 + +I+ MTDG N+ + ++ VY G+ + + Sbjct: 111 NRTRH--VIILMTDGLHNMGGDPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVN 168 Query: 333 HEFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + A AS V++ + D F + + + + Sbjct: 169 QVNINALASKKDNEQHVCKVKDMECLEDVFYQMIDESQSLSLC 211 >gi|239617365|ref|YP_002940687.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239506196|gb|ACR79683.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 676 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 48/278 (17%), Positives = 91/278 (32%), Gaps = 34/278 (12%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 VN + L I+ + +N I + K+ I + T Sbjct: 285 SQVSVNYMIPERSDVKLSILEINKNSFPDIDVKVIIESTDKYILENINVFE-DGIPVGTT 343 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V+ + +D++ VLD + SM + + ++E ++ + + Sbjct: 344 EPRLVSEKLVNFVDVVFVLDTTGSMTQELNGMVE----------NLIEFSNILENYGVLA 393 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGM 276 + GLVTF ++I L ++R ++ + G S L A N F Sbjct: 394 RVGLVTFGDEIRLTADLTPSFEKIRRLLQSQTADGGGDVPEISLDALNEALNMNFLDNS- 452 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQ--SLYYCNEAKK---RGAIVYAIGIRVIR 331 +KI++ +TD + + S KK GA + I + Sbjct: 453 -----------QKILILITDASPHIEGDGTKFSSTTIEETRKKILASGATL--ILVVPSN 499 Query: 332 SHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTK 368 EF+R P + + S + + K I + Sbjct: 500 KEEFVRLSEDIPGQLLDIHSAKSFGELIKFVAKQITRQ 537 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 66/195 (33%), Gaps = 31/195 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 I S + +D++ V+D S +M I K+ + V+L+ + Sbjct: 56 TAEIISIKTEAQEKKKPVDIVFVVDNSGTMYDKVQIVIEKLG----------DLVRLLHE 105 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWG---VSHLQRKIKYLS-----KFGVSTNSTPGLKYA 266 + ++ F ++ + F++ G S ++ I+ L G L A Sbjct: 106 NGYDARFAVLGFGTEVNKEFVVTGGSRFTSSPEKTIERLKETIQYPGGKDECQIHALWIA 165 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAI 325 N F + KI++ +TD + K ++ + KR V+ + Sbjct: 166 SNYDF------------RKDASKILILLTDEDTTQNKLNEVAKPKLVENIIKRNLTVFTL 213 Query: 326 GIRVIRSHEFLRACA 340 +E L + Sbjct: 214 RYDPDPVYEELAKLS 228 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 27/136 (19%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-----VQSGLVTFSNK---------- 229 ++S TKI K+ ++ + ++ G+V+++N Sbjct: 185 HKAYESYTTKIHELKKAAKNFIDSLTSTKTDGQTPNVKNLKIGIVSYNNSGYINEGLVQV 244 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + ++ L+ I+ L G TN+ GL+ A + + K Sbjct: 245 TDSDRKNNGNINELKDTIENLRADG-GTNTGDGLRKAAYLLNEENEAN-----------K 292 Query: 290 IIVFMTDGENLSTKED 305 ++FM DGE D Sbjct: 293 TVIFMGDGEPTYYSSD 308 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 59/454 (12%), Positives = 130/454 (28%), Gaps = 115/454 (25%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ-- 64 +I F + +G + ++ A+ L + +G ++ S +++ L + D + V A ++ Sbjct: 6 SIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTS 65 Query: 65 ---IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + G D RI NG +N + ++S V Sbjct: 66 PAYVAAGAMSGDGVISSGADDALRIF-------------NGNLNGLTGYTLASSSATVTK 112 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + S+ + T + + TS+ + +D ++LD S Sbjct: 113 ASDV-----VTSQVTFSAQISTMFMKVVGMSAMTVGGTSTATASMPK--YIDFYLLLDNS 165 Query: 182 RSME----------------------------------------SFFDSSITKIDMAIKS 201 SM + T+ID+ + Sbjct: 166 PSMGVGATPTDVSAMIAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSA 225 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF--------FLLEWGVSHLQRKIKYLSKF 253 +++ +N + + F + I ++ Sbjct: 226 TQQLMDTASATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVK 285 Query: 254 GVSTNSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSL- 309 G + N+ +Y IF + + + +K + F++DG + + Sbjct: 286 GQNDNNDQ--DTSYTAIFPAINNEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKT 343 Query: 310 ----------YYCNEAKKRGAIV---YAIGIRVIRSH----------------------- 333 C K RG + Y + + + Sbjct: 344 GNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIA 403 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ACASP ++ V + +A + + K V Sbjct: 404 QNMEACASPGFYFEVSPTQGIAEAMNALFKKAVA 437 >gi|187477305|ref|YP_785329.1| hypothetical protein BAV0798 [Bordetella avium 197N] gi|115421891|emb|CAJ48410.1| putative exported protein [Bordetella avium 197N] Length = 657 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 74/190 (38%), Gaps = 23/190 (12%) Query: 159 TSSVKVNSQTDARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 TSS + S ++ V+D S SM+ + I+ +++ + +++ + Sbjct: 214 TSSAEQRSNQIKEFSAAVVFVVDSSISMDPY-------IERTREAVRRVYSQIE-QEKLG 265 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA--YNQIFDMQ 274 + V+ GLV F + ++ LE+ VS + + + LK A + +FD Sbjct: 266 DRVKFGLVAFRSNVKAVPGLEY-VSRVYADPNTVKDGADFLSKVSDLKQASVSSSLFDED 324 Query: 275 ---GMRQHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 G+ + N D + IV +T DG N + + EA K G +Y Sbjct: 325 AYAGVTEAINKVDWRNFGARYIVLITDAGAIDGGNKLSSTGLDASQVRLEAGKPGVAIYT 384 Query: 325 IGIRVIRSHE 334 + ++ Sbjct: 385 LHLKTAAGKN 394 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 74/216 (34%), Gaps = 39/216 (18%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + + + + +VLDVS SM + +++ A ++ N +L Sbjct: 26 SGAAADETADQDAPKVDLVLDVSGSMRAKDIDGGSRMAAAKQAFNEVL------DATPEE 79 Query: 219 VQSGLVTF---------------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 V+ G+ T + ++ L+ + + + LS G T P L Sbjct: 80 VRLGIRTLGADYPGDNRKEGCKDTAQLYPVGPLD--RTEAKTAVATLSPTGW-TPIGPAL 136 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + G K IV ++DGE+ D + AK G + Sbjct: 137 LKAADDLDGGNGS------------KRIVLISDGEDTCAPLDPCEVAREIAAKGIGLTID 184 Query: 324 AIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYD 356 +G+ +C A+ ++ VE+ + D Sbjct: 185 TLGLVPDVKLNRQLSCIAEATGGTYTSVEHRDQLTD 220 >gi|327270788|ref|XP_003220170.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 930 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/242 (19%), Positives = 90/242 (37%), Gaps = 39/242 (16%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + YK + + + +SR + P + +T R+ + +VLD S M Sbjct: 270 KMCSYKSTWEVISGSADFASSRPLGAPPPDPIIHLMKTRERI-VCLVLDASAQM-----G 323 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 ++ I++ L ++ + G+VTF++K L+ S +R + L Sbjct: 324 KDNRLSRLIRAAKLFLLQI-----IEEGSWVGIVTFNSKANIQAGLQKVFSDKER--ESL 376 Query: 251 SKFGVSTNSTP-----GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + +T S G+ A+ + C IV +T+GE L Sbjct: 377 TSHLPTTASGDSDICEGVTTAFQVFSRKLTSTEGCE---------IVLLTNGEGLD---- 423 Query: 306 QQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE--NPHSMYDAFSH 360 L C ++ + + I++ I S+E L +FY + + + + DAF Sbjct: 424 ---LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTFYATDSLDSNGLIDAFGG 480 Query: 361 IG 362 I Sbjct: 481 IS 482 >gi|324499488|gb|ADY39781.1| Mesocentin [Ascaris suum] Length = 3894 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 67/189 (35%), Gaps = 23/189 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 T W + + T+ ++ ++ D + D+M VLD S + S + + + Sbjct: 3115 TEHTWPSRKTTLESQRTTPARICTRIDYQADVMFVLDSS------DNFSPEQYGHLKEGL 3168 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 + +++E +VVQ G V +S+K L + NS Sbjct: 3169 STLIDE--TFDLSPDVVQVGFVEYSDKASVPVALG-------HYEDKVQLLTDIANSEQL 3219 Query: 263 LKYAYNQIFDMQGMRQHCN-TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GA 320 A + + RQ N ++++ +T+G N + + ++R Sbjct: 3220 FGEAI-VLKGLNAARQQFQLHGRKNVPRVLLLITNGVNRGNAANA-----AEDLRERYNV 3273 Query: 321 IVYAIGIRV 329 ++ + + Sbjct: 3274 ELFILAVNA 3282 >gi|318059204|ref|ZP_07977927.1| hypothetical protein SSA3_14746 [Streptomyces sp. SA3_actG] gi|318081092|ref|ZP_07988424.1| hypothetical protein SSA3_31430 [Streptomyces sp. SA3_actF] Length = 184 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/141 (20%), Positives = 51/141 (36%), Gaps = 17/141 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M +VLD S SM ++ D + +++ + + D VV+ LV FS I+ Sbjct: 1 MYLVLDRSGSMRPYYK------DGSAQALGEQVLALAAHLDTEAVVR--LVFFSTAIDAM 52 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + + L + T N ++ +R E A ++VF Sbjct: 53 GTLT--LDAYEGVVDGLHEGAGRMGRT-------NYALAIEEVRALHAKEAAGEPGLVVF 103 Query: 294 MTDGENLSTKEDQQSLYYCNE 314 TDG ++L Sbjct: 104 QTDGPPDVRTAATRALKAAET 124 >gi|261822921|ref|YP_003261027.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606934|gb|ACX89420.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 53/168 (31%), Gaps = 12/168 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I I AML ++ P V ++T+ N Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHAVNVGIQAMLSALRQDPYALESVHISIITYDNDA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L + + T + L+ + D + Sbjct: 57 REFIPLT--PLEDFQFTDIVVPSAGGTFTGAALECLIQSVDRDIKRSDGDQKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + MTDG +++ K+ + A + HE L+ Sbjct: 113 VFLMTDGSPSDAYAYDEAVT--EVKKRAFGSIIACAVGPKAKHEHLKK 158 >gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS155] gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS155] Length = 771 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 68/197 (34%), Gaps = 32/197 (16%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ ++Q ++++V+D S SM I A ++ L+ +K Sbjct: 370 NYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLK 423 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + +F +S ++ + L G T L Sbjct: 424 PEDSFN------IIEFNSSLSQFSATPLPATSSNLSRARQFVSRLQADG-GTEMALALDA 476 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + ++FMTDG N D ++ Sbjct: 477 ALPKSLGSAPSDA------VQPLRQVIFMTDGSVGNEQALFDLIRYQIGES------RLF 524 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 525 TVGIGSAPNSHFMQRAA 541 >gi|315613111|ref|ZP_07888021.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] gi|315314673|gb|EFU62715.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] Length = 863 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 97/309 (31%), Gaps = 84/309 (27%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITK 194 K T I + + + ITS + +QTD LD+++V D+S SM +SF +I++ Sbjct: 47 KTITPISGQDDKYELSLDITSKLGTETQTD-PLDVVLVADLSGSMQNQDVQSFDGRTISR 105 Query: 195 IDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE-------------EFFLLEWG 239 ID ++ LI + N+ + +V F KI+ F W Sbjct: 106 IDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWP 165 Query: 240 VSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQG----------------- 275 + + +S G S N+ YN Sbjct: 166 YERMTKYYDGVSPWDDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAG 225 Query: 276 ---MRQHCNTEDANYKKIIVFMTDG----------------ENLSTKEDQQSLYYCNEAK 316 Q + +N KK+++ ++DG N + ++ + Sbjct: 226 LTLANQLMGSARSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLN 285 Query: 317 KRGAIV-----------YAIGIRVIRSHEFL---------RACASPNSFYLVENPHSMYD 356 + Y+I R + + + + PN + + + D Sbjct: 286 NNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYMRQHNASIPNEIFSANDEDQLRD 345 Query: 357 AFSHIGKDI 365 +F I I Sbjct: 346 SFKDITDKI 354 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 63/196 (32%), Gaps = 9/196 (4%) Query: 25 IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILC 84 + L I+ V I+++H KT L + +D + + A + + + + Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIATLSS 60 Query: 85 RIKN--TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI--SRYKIPLK 140 +SF + F +D+ V + S + ++ + Sbjct: 61 IASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQYLSA 120 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + + S + S+ + + M D + ++E + D + Sbjct: 121 VFGIVKNVSAS---AVAGRSAAIAYTCNLTPIAMCG--DPNGTVEDAWGYRPPGYDPNVD 175 Query: 201 SINAMLEEVKLIPDVN 216 +++ E+K+ N Sbjct: 176 MDPSLVHELKVGDQNN 191 >gi|150170950|emb|CAO03559.1| collagen type XXI alpha 1 [Homo sapiens] Length = 136 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 45/112 (40%), Gaps = 11/112 (9%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S A D++ +LD S S+ ++ K + + + + P +Q G+V Sbjct: 29 SSCRTAPTDLVFILDGSYSVGP------ENFEIVKKWLVNITKNFDIGP---KFIQVGVV 79 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +S+ L HL ++ + G +T + +++A + +F Sbjct: 80 QYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFALDYLFAKS 131 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 57/134 (42%), Gaps = 24/134 (17%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++ V+D S SM+ KI++ KSI +L ++ + LV F+++ Sbjct: 94 QPLDLIFVIDTSGSMQG------KKIELVKKSILQVLHIIQGDD------RISLVGFNSQ 141 Query: 230 IEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++KI+ + G T G++ A++ I + + + Sbjct: 142 AKVLLELTQLTKNSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKERTNSKNLAS----- 196 Query: 287 YKKIIVFMTDGENL 300 I ++DG++ Sbjct: 197 ----IFLLSDGQDN 206 >gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Nomascus leucogenys] Length = 1313 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 64/175 (36%), Gaps = 24/175 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TKI+ K++N +L ++K N+ ++FS+ + Sbjct: 284 VVFVIDVSGSMFG------TKIEQTKKAMNVILSDLKA----NDYFNI--ISFSDTVNVW 331 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQI-FDMQGMRQHCNTEDANY 287 ++ + ++ YL T+ L A + + Q + + Sbjct: 332 KAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPL 391 Query: 288 KKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+ DGE + + C R ++++ LR + Sbjct: 392 ---IIFLMDGEPTAGVTTPSVILSNVCQALGHR-VSLFSLAFGDDADFTLLRRLS 442 >gi|223934362|ref|ZP_03626283.1| hypothetical protein Cflav_PD5900 [bacterium Ellin514] gi|223896825|gb|EEF63265.1| hypothetical protein Cflav_PD5900 [bacterium Ellin514] Length = 576 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 74/203 (36%), Gaps = 31/203 (15%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 + LD SRSM+ + +T+++ A +L + ++P + G+V + F Sbjct: 86 VALDTSRSMKQADEEKMTRLEAAE----NLLMDADIVP------RDGVVK--DPGVRLFE 133 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + + + + L+ G ST + N + +G K ++ T Sbjct: 134 FGEDATAVTKSMMDLAPAGNSTRFHRSINTMVNSLGADEGA------------KALILFT 181 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSM 354 DG + ++ A+ R +YA+ + + + S + V+ + Sbjct: 182 DGHDFELVNPSKTALI---ARARQVPIYAVALGKQGKVKDTSIHITSYQPYTYVKQKARI 238 Query: 355 YDAFSHIG---KDIVTKRIWYDK 374 IG +DI + + +K Sbjct: 239 SGMLRMIGCDYEDIQVELLRQNK 261 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 77/199 (38%), Gaps = 29/199 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+++V+D S SME KI + +++ ++ + + + +V F+ Sbjct: 219 RQTVDLVVVIDKSGSMEG------EKIQLVKETLVKIINLMSSMD------RICIVCFNE 266 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ + L + G TN + G+ +A I + + Sbjct: 267 SGDRPLTFTRVTDENKQTLLNLIQQIYAGGGTNISEGINHALKAI------QNRKFKNNV 320 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLRACAS--P 342 I+ ++DG+ TK + Y ++ + K + IG + LR + Sbjct: 321 TS---ILLLSDGQ--DTKAYTRVKAYIDKYQIKDAFNIETIGFGEDHDPKLLRTLSDLRN 375 Query: 343 NSFYLVENPHSMYDAFSHI 361 +F +++ + + AF +I Sbjct: 376 GTFNFMQDVNYLDTAFINI 394 >gi|302552806|ref|ZP_07305148.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] gi|302470424|gb|EFL33517.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] Length = 271 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 74/200 (37%), Gaps = 25/200 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 K ++ T R + +VLD S SM ++ D S + ++ A L+ +P Sbjct: 78 KKHNLTGTRAKVYLVLDRSASMRPYYKDGSAQALAEQTLALAAHLDPEATVP-------- 129 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +V FS +++ + ++ + KI L G+ A + H Sbjct: 130 -VVFFSTELDGTGEIT--LTDHENKIDTLHA-GLGRMGRTSYHAAVEAVLA-----HHTK 180 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF--LRA 338 +++F TDG + Q+L A + ++ + + F LR Sbjct: 181 HATPATPALVIFQTDGAPDAKTPATQALT--EAAATHPTVFFSFVAFGDPENKAFDYLRK 238 Query: 339 CASPN-SFYLV-ENPHSMYD 356 +PN S +L E P + D Sbjct: 239 LKTPNTSHFLAGETPRELTD 258 >gi|302520599|ref|ZP_07272941.1| toxic cation resistance protein [Streptomyces sp. SPB78] gi|302429494|gb|EFL01310.1| toxic cation resistance protein [Streptomyces sp. SPB78] Length = 345 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 57/155 (36%), Gaps = 17/155 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++++ T R + +VLD S SM ++ D + +++ + + D VV Sbjct: 148 AALRKAGLTGTRARVYLVLDRSGSMRPYYK------DGSAQALGEQVLALAAHLDTEAVV 201 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + LV FS I+ L + + I L + T N ++ +R Sbjct: 202 R--LVFFSTAIDAMGTLT--LDAYEGVIDGLHEGAGRMGRT-------NYALAIEEVRAL 250 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 E A ++VF TDG ++L Sbjct: 251 HAKEAAGEPGLVVFQTDGPPDVRTAATRALKAAET 285 >gi|254779584|ref|YP_003057690.1| hypothetical protein HELPY_0994 [Helicobacter pylori B38] gi|254001496|emb|CAX29512.1| Conserved hypothetical protein [Helicobacter pylori B38] Length = 214 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 78/212 (36%), Gaps = 16/212 (7%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + ++LD S SM S + T+I + I M+E +K + +VT Sbjct: 9 TMEERFIPVFLLLDTSGSM-SHPLGNSTRIGVLNLCIQKMIETLKQEAKKELFSKMAIVT 67 Query: 226 FSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F + F IK ++ +ST+ L A+ D+ + Sbjct: 68 FGENGAVLHTPF----------DDIKNVNFEPLSTSGGTPLDQAFRLAKDLIEDKD--TF 115 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 YK + ++DGE + K + + ++ + ++ ++I I ++ + Sbjct: 116 PTKFYKPYSILVSDGEPNNDKWQEPLSSFHHDGRSAKSVCWSIFIGDREANPQVNKDFGK 175 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + ++ + F + + I K Sbjct: 176 DGVFYADDVEKLVKLFEIMTQTISKGSASIKK 207 >gi|57958|emb|CAA79153.1| collagen alpha 2 chain type VI [Mus musculus] Length = 371 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 46/122 (37%), Gaps = 18/122 (14%) Query: 213 PDVNNV-VQSGLVTFSNK-IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + G+V +S++ E L+ +S + +K L T + LK+A Sbjct: 3 DPKSETGTRVGVVQYSHEGTFEAIRLDDERVNSLSSFKEAVKNLEWIAGGTWTPSALKFA 62 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 YNQ+ + + V +TDG + +D C+ R V AIG Sbjct: 63 YNQLI--------KESRRQKTRVFAVVITDGRHDPRDDDLNLRALCD----RDVTVTAIG 110 Query: 327 IR 328 I Sbjct: 111 IG 112 >gi|167521285|ref|XP_001744981.1| hypothetical protein [Monosiga brevicollis MX1] gi|163776595|gb|EDQ90214.1| predicted protein [Monosiga brevicollis MX1] Length = 2728 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 64/160 (40%), Gaps = 21/160 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +++ D+++V+DVS SM + ++ A + + E + + + Sbjct: 2514 SSHAEYAEAADVLMVVDVSGSMTDY-------MEQARAFVRTIAREGFHLDSTASQHRMA 2566 Query: 223 LVTFSNKI-----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L TF E F +W + LQ+++ ++ G TN P LK + D++ Sbjct: 2567 LFTFGTTATALGGEPLFTSDW--AQLQQRVAQIAVNGA-TNYLPALKLVEQSLRDLKASD 2623 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++I++F TDG N Q+ A++ Sbjct: 2624 PARYNAS---RRIVLFQTDGSNSDRN---QTRAITATARR 2657 >gi|331003698|ref|ZP_08327192.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412081|gb|EGG91476.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] Length = 528 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 56/340 (16%), Positives = 115/340 (33%), Gaps = 39/340 (11%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + R+ F+ GM I AI P++F S + + +AA + Sbjct: 1 MKKRSSFFKVSAGMLI-FAISAPVLFTGCRAPHYNSGAVYDGAK-------TAENAAPYV 52 Query: 66 MNEGNGNNRKKLKGG---DILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLD 117 E N N + + + +F ++ + N + ++ + S+ Sbjct: 53 SYEPNVNYNTEEYNAIKESSFHAVATSPLSTFAADVDTASYANIRRFINSGELPPADSVR 112 Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMM 175 I + Y + P T I + ++M + KV +++ Sbjct: 113 IEEMLNYFRYDYP-QPKDGEPFSVSTEISACPWNPDTKLMMIGMQAKKVEESEKKPSNLV 171 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++DVS SM+ K+ + + + EE+K ++ V +G + + E Sbjct: 172 FLIDVSGSMDE-----PDKLPLVKNAFLLLCEELKENDTISIVTYAG---YDQVVLEGAS 223 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + I+ L G ST + G+K AY +I N ++ T Sbjct: 224 GSD-SKEIMSAIEDLEAAG-STAGSDGIKTAY-KIAKKYFKSDGNNR--------VILAT 272 Query: 296 DGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 DG+ N+ + + + K+ G + +G + Sbjct: 273 DGDLNVGITSEGKLTRLIKKEKESGVFLSVLGFGTENIKD 312 >gi|219517748|gb|AAI36393.1| ITIH4 protein [Homo sapiens] Length = 935 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 49/121 (40%), Gaps = 1/121 (0%) Query: 4 LNLNIRNFFYNY-KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 + +N N + G + I+ I L + + + ++ H+ K L +++D + +HAA Sbjct: 1 MGINSNNTLRSKQNGNVLIVFTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAA 60 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP 122 T++ + I I + + + + + N +D + +++ +P Sbjct: 61 TELDEGATHEQARAAVVALIQLNIAHNDHHELASAIDFSIVNNGLDQMTAQLNVEFSQLP 120 Query: 123 Q 123 Sbjct: 121 D 121 >gi|119585666|gb|EAW65262.1| inter-alpha (globulin) inhibitor H3, isoform CRA_a [Homo sapiens] Length = 670 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 49/341 (14%), Positives = 109/341 (31%), Gaps = 65/341 (19%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLV-------- 59 ++ F N G TI+TA+ + L + ++ + I K L D++ + Sbjct: 15 VKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAEDNQ 74 Query: 60 --------HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 Q +++ + + K + N L +D + Sbjct: 75 YRKNKDHSDVTRQRVSQQDIDRESKDFSNAKVQAQWKKRNQELVQGLVKLYLRSDDSNGQ 134 Query: 112 RSTSLDIVVVP---------------QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 +++S + P SI+ + + + KF W + +V Sbjct: 135 KNSSPVTIKEPFLAECLEEKTQPRNKNGTAKSIACVVQGSVQRKF-----WLPWGQTLVS 189 Query: 157 ------------PITSSVKVNSQTDARLDMMIVLDVSRSMES--------FFDSSITKID 196 + Q +D+M+V D+SRSM +ID Sbjct: 190 SSQLHDGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMNWAIVSHRDVEVPPPNRRID 249 Query: 197 MAIKSINAMLEEV---KLIPDVNNVVQSGLVTFS------NKIEEFFLLEWGVSHLQRKI 247 + ++ + + + + D++ + G V+F+ ++ + L + + Q +I Sbjct: 250 ALREVVSNIQDILLPKAIRDDISPYNRIGFVSFAAGARQKDETDNCVLPYYSKQNKQAEI 309 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 G + L + + + + Q N E +Y Sbjct: 310 SNYFNSGQISQGFEELSRSMDIEKTINQITQFKNGEKKSYP 350 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 48/152 (31%), Gaps = 23/152 (15%) Query: 232 EFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGM-RQHCNTEDANYK 288 + W L + T T G+ N + D N + N + Sbjct: 369 KATTQAWFSKSKPGVADALKEIEPLGGTAVTSGIFIGTNLMTDTNKDPEAAPNKLNTNTR 428 Query: 289 KIIVFMTDGENLSTKEDQQSLYY----CNEAKKR--------------GAIVYAIGIRVI 330 ++++ ++DGE+ ++ + C + K++ A+G Sbjct: 429 RVLLILSDGEDNRPSKNTLVTFMNSGMCEKIKEKINSLQDSNYPQVEARIAFVALGFNPP 488 Query: 331 RSHEFL-RACASPNSFYLVENPHSMYDAFSHI 361 + + C +Y V + + DAF I Sbjct: 489 QDQLIAWKKCV-GKQYYPVNSKQGLLDAFKQI 519 >gi|296211156|ref|XP_002752289.1| PREDICTED: von Willebrand factor [Callithrix jacchus] Length = 2813 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/278 (14%), Positives = 87/278 (31%), Gaps = 35/278 (12%) Query: 78 KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYK 136 + ++ + + + N ++I + + + V + + I Sbjct: 1587 EHSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPHANVQELETIGWPN 1646 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKV--------NSQTDARLDMMIVLDVSRSMESFF 188 P+ F + +V+ S + + LD++++LD S S Sbjct: 1647 APILIQDFETLPREAPDLVLQRCCSGEGLQMPKLAPDPDCSQPLDVVLLLDGSSS----- 1701 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG----VSHLQ 244 S + D A + + KL P + Q ++ + + W +HL Sbjct: 1702 -SPASYFDEMKSFAKAFISKAKLGPH---LTQVSVLQYGSITTIDVP--WNAAMEKAHLL 1755 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + + G + L +A + H A+ K +++ +TD Sbjct: 1756 SLVDVMQREGGPSQIGNALDFAVRYLTS----EVHGARPGAS-KAVVILVTD------VS 1804 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1805 VDSVGAAADAARANRVTVFPIGIGDRYDAAQLRTLAGP 1842 >gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 698 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/263 (14%), Positives = 93/263 (35%), Gaps = 42/263 (15%) Query: 106 DIDDIVRSTSLDI-VVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSRHIVMPITSS 161 +I D+ S + + P N + + I + S + +T+ Sbjct: 174 EIQDVKVSVQGNKNLNDPHNATFVTNEAPTKDAIIIEAQIKDEDKNVAVSSDGYIAVTTY 233 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ + V+D S SM +I+ A+K + ++ + + + Sbjct: 234 PFFEGSIESNSEFYFVVDCSGSMAG------KRIENAVKCMRLFIQSLPVG------CRF 281 Query: 222 GLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 ++ F ++ + V+ + + + T+ L++ +++ +G Sbjct: 282 AILKFGSQFQTVLEPCDYTDENVARAMKLLDTIKADMGGTDILSPLQH-VSELKAKEG-- 338 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR--GAIVYAIGIRVIRSHEF 335 + K I F+TDGE + C A+K G +++IG+ Sbjct: 339 ---------FVKQIFFLTDGEVHNPD------MICATAQKNRSGNRIFSIGLGSGADPGL 383 Query: 336 LRACA--SPNSFYLVENPHSMYD 356 ++ A S ++ ++ + +M + Sbjct: 384 IKGMARKSGGNYSIIGDDDNMNE 406 >gi|224531962|ref|ZP_03672594.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] gi|224511427|gb|EEF81833.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] Length = 333 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEAPKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF----Y 346 I+ +TDG S + + N A+ +Y+IGI S + Sbjct: 195 IIVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSFEEFGVEFKLRSGKFYQGSLK 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|119585669|gb|EAW65265.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_b [Homo sapiens] Length = 914 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP--------DVNNVVQSGLVT 225 ++ VLDVS SM K+ +++ +L E+ D V Sbjct: 357 VIFVLDVSGSM------VGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQ 410 Query: 226 FSNKIEEFFLLEWGVSH----------------LQRKIKYLSKFGVS--TNSTPGLKYAY 267 ++ + + W ++ ++ ++Y KF + S+ ++ A Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDAL 470 Query: 268 NQIFDMQGMRQHCNTEDANYKK-IIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAI 325 N+ + + + + AN K IIVF+TDGE E Q+ + Y + + +Y++ Sbjct: 471 NKAHLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSL 530 Query: 326 GIRVIRSHEFLRACASPNS-----FYLVENPH-SMYDAFSHIG 362 G +FL+ + N+ Y + + + + I Sbjct: 531 GFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEIS 573 >gi|294676192|ref|YP_003576807.1| von Willebrand factor type A domain-containing protein [Rhodobacter capsulatus SB 1003] gi|294475012|gb|ADE84400.1| von Willebrand factor type A domain protein [Rhodobacter capsulatus SB 1003] Length = 637 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/349 (13%), Positives = 109/349 (31%), Gaps = 47/349 (13%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 K + + + + Q+M E ++ D+ R+ + S + Sbjct: 99 KQNITAAFTNAA-GRSRQVMLEDRATLEALMQDEDVRARLADLVAQSDSGHV---DPAQK 154 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + + + ++I + + Y + +S +I I S Sbjct: 155 VVSVEPAEFINI----REKLYLMPILSFEEILHPLTYEPDLLLELASIPQRAPSPPAAAP 210 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ V+D +RSM + +S+ ++ +++ ++ VQ G++ F Sbjct: 211 PSPFDAGVVFVIDTTRSMRPYINST-------RAAVARIVQSLRATDLGAR-VQFGVIGF 262 Query: 227 SN-----------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQ 274 + + L + + V+T STPG + D Sbjct: 263 RDNPASDPAGIGYRTRTLAPLARRADQAEVLAALDAAAEVATVSTPGFNEDSMAGVEDAV 322 Query: 275 GMRQHCNTEDANYK-KIIVFMTD-GENLSTKEDQQS----LYYCNEAKKRGAIVYAIGIR 328 G ++ + ++ +TD G + D +S +A+ G V + +R Sbjct: 323 GTIDWTGGGADPFEGRYVILITDAGPDAQDDPDARSQIGPAELRADAEAHGIAVLTLHLR 382 Query: 329 VIRSHEF-----------LRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 L + +Y +EN F+ + +V Sbjct: 383 TPAGGTAQHGYAEAAYRQLSGFGGRDFYYPIENGDE--AVFAATVERMV 429 >gi|221042206|dbj|BAH12780.1| unnamed protein product [Homo sapiens] Length = 888 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 263 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 316 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 317 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 369 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 370 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 418 >gi|224534421|ref|ZP_03674999.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] gi|224514523|gb|EEF84839.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] Length = 333 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++IVLD+S SM + SS +++ + + I + + GLV F+ Sbjct: 93 DIVIVLDISPSMGAVEFSSKNRLEFSKELIKRFISQ-------RENDNIGLVAFAKDASI 145 Query: 233 FFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + +K+ Y+ G + G+ A + + K+ Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL-----------KHSEALKRS 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF----Y 346 IV +TDG S + + N A+ +Y+IGI S + Sbjct: 195 IVVLTDGVVNSDEIYKD--QVINLAQGLNVKIYSIGIGSSEEFGVEFKLRSGKFYQGSLR 252 Query: 347 LVENPHSMYDAFSHIG 362 V +P + + + G Sbjct: 253 EVYDPSMLVEISNKTG 268 >gi|149053317|gb|EDM05134.1| rCG33209 [Rattus norvegicus] Length = 1169 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 77/203 (37%), Gaps = 26/203 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + IVLD S S+E + A I+ M+ N LV + I+ Sbjct: 198 IAIVLDGSGSIEP------SDFQKAKDFISTMMRNFYEKCFECNF---ALVQYGGVIQTE 248 Query: 234 FLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L ++ K++ + + T + +++ + IF K++ Sbjct: 249 FDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPS------RGSRKKALKVM 302 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NS 344 V +TDG+ + ++ + +K +G + +AIG+ ++ L+ AS Sbjct: 303 VVLTDGDIFRDPLNLTTVI--SSSKMQGVVRFAIGVGNAFENNNTYRELKLIASDPKAAH 360 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 + V N ++ S + + I+ Sbjct: 361 TFKVTNYSALDGLLSKLQQRIIH 383 >gi|4096840|gb|AAD05198.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Homo sapiens] Length = 930 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|47203515|emb|CAG02000.1| unnamed protein product [Tetraodon nigroviridis] Length = 349 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 76/201 (37%), Gaps = 26/201 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S D S + I M ++++ D V+ ++ +S+ + Sbjct: 1 DVVFLLDGS-------DDSRNAFPAIREFIKRMAKDLETHDDA---VRMAVMQYSDDVMV 50 Query: 233 FFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L+ + + I L K G + + L++ +++F +H Sbjct: 51 HFNLK-SHNSKKALINALRNLRHKGGRNRKTGAALQFVRDRVFTTLFGSRHLEG----VP 105 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF-YL 347 +I+ +T G K K+ G +AIGI+ + E L+ ASP+ F Y Sbjct: 106 QILFLLTLG-----KAGDDVSKAALSLKQFGVQTFAIGIKKAKLQE-LQQIASPSRFLYN 159 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 + + + + + Sbjct: 160 LPVFGELLSIQPQLAALVQQR 180 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 59/167 (35%), Gaps = 22/167 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + D++ ++D S S + ++ V + ++ V+ +V +S Sbjct: 188 ENQQSDIVFLMDGS----EDTRSDFPAMKALVE------RTVDALNIGDDKVRVSVVQYS 237 Query: 228 NKIEEFFLL-EWGVS-HLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTED 284 + F L + + ++ L+ G N+ L Y N F + + Sbjct: 238 RDPQTHFNLNSYSTKQDVLAAVQQLNHKGGRPLNTGAALNYVRNSAFADSSGSRKQDG-- 295 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +I++F+T G + + K+ G + + +G R Sbjct: 296 --VPQILIFLTGG-----RSQDNVMSVAAALKQDGVVPFCVGTRNAD 335 >gi|31542984|ref|NP_002209.2| inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 precursor [Homo sapiens] gi|229463048|sp|Q14624|ITIH4_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Plasma kallikrein sensitive glycoprotein 120; Short=Gp120; Short=PK-120; Contains: RecName: Full=70 kDa inter-alpha-trypsin inhibitor heavy chain H4; Contains: RecName: Full=35 kDa inter-alpha-trypsin inhibitor heavy chain H4; Flags: Precursor gi|1402590|dbj|BAA07536.1| PK-120 precursor [Homo sapiens] Length = 930 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|293365338|ref|ZP_06612055.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307703880|ref|ZP_07640821.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] gi|291316788|gb|EFE57224.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307622715|gb|EFO01711.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] Length = 863 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 97/309 (31%), Gaps = 84/309 (27%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITK 194 K T I + + + ITS + +QTD LD+++V D+S SM +SF +I++ Sbjct: 47 KTITPISGQDDKYELSLDITSKLGTETQTD-PLDVVLVADLSGSMQNQDVQSFDGRTISR 105 Query: 195 IDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE-------------EFFLLEWG 239 ID ++ LI + N+ + +V F KI+ F W Sbjct: 106 IDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWP 165 Query: 240 VSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQG----------------- 275 + + +S G S N+ YN Sbjct: 166 YERMTKYYDGVSPWDDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAG 225 Query: 276 ---MRQHCNTEDANYKKIIVFMTDG----------------ENLSTKEDQQSLYYCNEAK 316 Q + +N KK+++ ++DG N + ++ + Sbjct: 226 LTLANQLMGSARSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLN 285 Query: 317 KRGAIV-----------YAIGIRVIRSHEFL---------RACASPNSFYLVENPHSMYD 356 + Y+I R + + + + PN + + + D Sbjct: 286 NNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYMRQHNASIPNEIFSANDEDQLRD 345 Query: 357 AFSHIGKDI 365 +F I I Sbjct: 346 SFKDITDKI 354 >gi|119585668|gb|EAW65264.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein), isoform CRA_a [Homo sapiens] Length = 930 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|1483187|dbj|BAA07602.1| inter-alpha-trypsin inhibitor family heavy chain-related protein (IHRP) [Homo sapiens] Length = 930 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|296474801|gb|DAA16916.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] Length = 916 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 75/218 (34%), Gaps = 33/218 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S + +++ V+D S SM KI +++ +L+++ + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSM------MGRKIKQTREALIKILDDLSPHDQFD------L 313 Query: 224 VTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++FS++ + L V+ + + G TN + A + Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQG-GTNINDAMLMAVQLLEKANQEEL 372 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAI-VYAIGIRVIRSHEF 335 +I+ +TDG+ + L +K G ++ +G S+ F Sbjct: 373 LPEGSIT----LIILLTDGDPT--VGETNPLNIQKNVRKAINGQHSLFCLGFGFDVSYAF 426 Query: 336 LRACASPNS------FYLVENPHSMYDAFSHIGKDIVT 367 L A N + ++ + D + + ++T Sbjct: 427 LEKMALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 58/147 (39%), Gaps = 20/147 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I K +N ++ + + N + G V +S+ Sbjct: 317 TPLDLVLVVDWSGSMNEN-----NRIGEVQKGVNRFVDTL-ADSGITNNINMGYVGYSSD 370 Query: 230 IEEFFLLEWG-VSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ G ++ IK ++ T + L+ A + + G Sbjct: 371 GYNNNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNG----------- 419 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCN 313 +KK+IV +TDG + + + + Sbjct: 420 HKKVIVLLTDGVPTFSYKVSRVQTEAD 446 >gi|207029558|ref|NP_001125590.1| inter-alpha-trypsin inhibitor heavy chain H3 [Pongo abelii] Length = 879 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRVPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|326433447|gb|EGD79017.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 1763 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 57/139 (41%), Gaps = 10/139 (7%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S +++ F + L + L +T+ + GL+ + IF +R + Sbjct: 360 SYTVDQLFSMTQDHDDLAATLDSLVFPDGATHMSAGLEQIRDTIF---QLRNGMREYEQA 416 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 ++++ +TDG++ + + + RG I+YAIG+ E L A AS Sbjct: 417 IPRVLIVLTDGKSN---PGFEPHEVAEQLRNRGIIIYAIGVGDYYLPE-LEAMASEPMDR 472 Query: 344 SFYLVENPHSMYDAFSHIG 362 Y + +P S++ + Sbjct: 473 HVYELADPSSLFTIVDRLS 491 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 53/153 (34%), Gaps = 43/153 (28%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-------------- 299 T + GLK+ + + + + +D +K+I+ +TDG+ Sbjct: 284 ASGTVISEGLKWGRHVLTPEAPFTEGSSAKD--IRKVIIVLTDGDTEDGKCGGSYAINYT 341 Query: 300 --------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLR 337 K + L +AK+ G V+AI S + ++ Sbjct: 342 PNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMK 401 Query: 338 ACASP-----NSFYLVENPHSMYDAFSHIGKDI 365 + AS + +Y + + + D F IG+ + Sbjct: 402 SIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQL 434 >gi|108758937|ref|YP_629042.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108462817|gb|ABF88002.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 860 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 71/207 (34%), Gaps = 38/207 (18%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 T + + + + + +++ V+DVS SM + A Sbjct: 249 TPLVTHRKGEGGPGTFALTVVPDLLALASAPPKQEVVFVVDVSGSMAG------ESLPQA 302 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKF 253 ++ L ++ + ++ F N+ + F + + R + L+ Sbjct: 303 QAALRLCLRHLREGD------RFNVIAFENRFQSFQPEPVPFTQRTLEEADRWVAALNAD 356 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G + P ++ A A +IV +TDG+ + E +++ Sbjct: 357 GGTELLAP-MRAAVQ----------------AAPDGVIVLLTDGQVGNEAEILRAVL--- 396 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACA 340 EA+K A VY+ GI S LR A Sbjct: 397 EARKT-ARVYSFGIGTNVSDVLLRDMA 422 >gi|4104232|gb|AAD01978.1| collagen alpha3(VI) [Mus musculus] Length = 1703 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 75/197 (38%), Gaps = 24/197 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + A D++ ++D S + + + + + ++ ++E + + + L Sbjct: 29 GDVKNGAAADIVFLVDSS------WSAGKDRFLLVQEFLSDVVESLAVGDND---FHFAL 79 Query: 224 VTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHC 280 V + FLL + I +S G S + GL+Y ++ + + G R Sbjct: 80 VRLNGNPHTEFLLNTYHSKQEVLSHIANMSYIGGSNQTGKGLEYVIHSHLTEASGSRA-- 137 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++IV +TDG+ ED +L K V+A+G+ + Sbjct: 138 ---ADGVPQVIVVLTDGQ----SEDGFALPSAE-LKSADVNVFAVGVEGADERALGEVAS 189 Query: 341 SP--NSFYLVENPHSMY 355 P + +EN S++ Sbjct: 190 EPLSMHVFNLENVTSLH 206 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 24/199 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + +V+ D++ +LD S ++ + + ++ + + D Sbjct: 627 TGTTEVHVNKR---DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN--- 674 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQG 275 ++ GLV FS+ F L+ S L ++ L K G N+ L Y I Sbjct: 675 IRVGLVQFSDTPVTEFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHF 730 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + ++++ + G + L N + G + + +G E Sbjct: 731 TEAGGSRTREHVPQLLLLLMAG-----PSEDAYLQAANALVRSGVLTFCVGTNRADKAEL 785 Query: 336 LRACASPNSFYLVENPHSM 354 +P+ YL+++ S+ Sbjct: 786 EHIAFNPSLVYLMDDFRSL 804 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 70/205 (34%), Gaps = 26/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S + S F + + ++E + + PD V+ LV +S++ Sbjct: 1026 DVVFLIDGSEGVRSGFP-------LLKDFVQRVVESLDVGPD---RVRVALVQYSDRTRP 1075 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L + I+ L+ G T N+ L++ I + + Sbjct: 1076 EFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIAEG----VPQ 1131 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ +T + K+ GA+ IGI E P+ + Sbjct: 1132 LLIVLT-----AEPSGDDVRGPSVVLKQGGAVPIGIGIGNADISEMQTISFIPDFAVAIP 1186 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 + I + I + I ++ Sbjct: 1187 TFREL----GTIQQVISERVIQLNR 1207 Score = 42.8 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 39/243 (16%), Positives = 84/243 (34%), Gaps = 24/243 (9%) Query: 118 IVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +V + + Y S + ++P K I T+ + + ++ + D+ Sbjct: 1375 LVKISLSPEYVYSVSTFRELPRLEQKLLKPITTLTSQQIHQILASTRYPPSVVESDAADI 1434 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S +++ I ++ ++ + + P V+ G+V FSN + F Sbjct: 1435 VFLIDSSDAVKPDG------IAHIRDFVSRIVRRLNIGPSK---VRIGVVQFSNDVFPEF 1485 Query: 235 LLEW--GVSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ S + I+ L G S N+ L++ +F ED + ++ Sbjct: 1486 YLKTHKSQSSVLEAIRRLRFKGGSPLNTGRALEFVARNLFVKS---AGSRIEDGVPQHLV 1542 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+ K + G + IG R I + P + V Sbjct: 1543 LFL------GGKSQDDVARHAQVISSSGIVSLGIGDRNIDRTDLQTITNDPRLVFTVREF 1596 Query: 352 HSM 354 + Sbjct: 1597 REL 1599 >gi|74011918|ref|XP_848765.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Canis familiaris] Length = 826 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 62/168 (36%), Gaps = 13/168 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S KI +++ +L+++K N + SG VT + Sbjct: 270 VIFVIDKSGSM------SGRKIQQTREALIKILDDLKPNDQFNLISFSGDVTHWKPLLVP 323 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 E V +R + G TN + A ++ ++ + + I+ Sbjct: 324 ASPE-NVDQAKRYAANIEAHG-GTNINDAMLTAV-RLLQSANQKELLSDGSVSL---IIL 377 Query: 294 MTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ + +A ++ +G S+ FL A Sbjct: 378 LTDGDPTVGETSPARIQKNVQKAIDGQYSLFCLGFGFDVSYVFLEKLA 425 >gi|328953619|ref|YP_004370953.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] gi|328453943|gb|AEB09772.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] Length = 333 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 40/128 (31%), Gaps = 19/128 (14%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM--- 66 + G + I A+ L I+ + + ++ H+ +K L D + AT ++ Sbjct: 11 KLVRDASGAVAISFALTLAILCGFVALAFDIGHLVMVKAELQRTADAGALAGATGLVPYT 70 Query: 67 ----------------NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 + NN L + + + G+V + + Sbjct: 71 GPVTSQTPDWLQGQSKAQTLINNTANLADNQVFTTTDSAVTYGYWLLNPPEGYVQSLPTV 130 Query: 111 VRSTSLDI 118 +TS + Sbjct: 131 RPTTSAYL 138 >gi|222101620|gb|ACM44015.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 27/169 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q +LD IV+D S S+ + I + ++ V L N ++ L T+ Sbjct: 39 QCKKQLDFSIVVDESASISNDQWGGQ-----MIPFLRNLIHTVDL---DNTDIRLSLTTY 90 Query: 227 SNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQH 279 S + F L+ S + + L + T + L Y I Sbjct: 91 STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRALNYVRKAILPY------ 144 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ---GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|260828859|ref|XP_002609380.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] gi|229294736|gb|EEN65390.1| hypothetical protein BRAFLDRAFT_124610 [Branchiostoma floridae] Length = 321 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 57/151 (37%), Gaps = 20/151 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +DM+ V D S S+ + A ++I ++E P + + V+FS+ + Sbjct: 169 IDMLFVFDKSGSVREVNFND------AKENIKDLIENF-PAPVGPSDTRVAAVSFSDVDK 221 Query: 232 EFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ S ++ + ++ G T + L A + +F Q N Sbjct: 222 TRVEFDFNASGDRDSVKTSLGNIAYEGGWTATATALSLARDDVF------QSVAGSRPNS 275 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 KI+ +TDG++ + ++ K Sbjct: 276 AKILFLITDGKSNR---GGAPIPVADQLKGN 303 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 42/111 (37%), Gaps = 7/111 (6%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +NIR + G + ++ I L + V + ++ H+ K L + +D S ++AA ++ Sbjct: 16 MNIRPY-RKQGGAILVMFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKEL 74 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 + L N+ ++ N ++ ST + Sbjct: 75 ------QDGASLYEAREAATTLLLQNLQYQENGELNSSIDLSAPDYNSTQV 119 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 68/204 (33%), Gaps = 30/204 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK+ K+++ +L +++ N ++TFS+ + Sbjct: 250 VVFVIDVSGSMFG------TKLQQTKKAMDTILSDLQASDSFN------IITFSDTVNIW 297 Query: 232 ----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDAN 286 + + + + G T+ L A + + Q + Sbjct: 298 KAEGSIQATVQNIHSAKNYVSRMEADGW-TDINAALLAAASVLNHSNQEPGKGRGVGQIP 356 Query: 287 YKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS- 344 I+F+TDGE L +A ++++ LR + N Sbjct: 357 L---IMFLTDGEPTAGETTPSVILSNIRQALAHRVSLFSLAFGDDADFSLLRRLSLENQG 413 Query: 345 -----FYLVENPHSMYDAFSHIGK 363 + + + ++ I + Sbjct: 414 EARRIYEDADAALQLEGLYAEISR 437 >gi|74001592|ref|XP_548729.2| PREDICTED: similar to collagen, type VI, alpha 1 precursor [Canis familiaris] Length = 206 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 58/180 (32%), Gaps = 19/180 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+ I+LD S S+ S + K + ++ V+ +V +S Sbjct: 4 SPADITILLDGSASVGSH---NFDTTKRFAKRLAERFLTAGRTDPAHD-VRVAVVQYSGP 59 Query: 230 IEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ + L + + F +T+ L Y R + Sbjct: 60 GQQRPGRGALRFLQNYTVLASTVDGMDFFNDATDVNDALGYV---------TRFYREAST 110 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + KK ++ +DG + EA++ ++ + + + +R + + Sbjct: 111 RDAKKRLLLFSDGNSQGATAAAIEKAV-QEAQRADIEIFVVVVGRQVNEPHVRVLVTGKT 169 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 21/76 (27%), Gaps = 12/76 (15%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 +TAI P++ GM + V + K L + L G Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQEAA------------DSAALATATALANGT 48 Query: 82 ILCRIKNTWNMSFRNE 97 I + +F Sbjct: 49 IQTSQAEAFARNFVAG 64 >gi|194221225|ref|XP_001492399.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Equus caballus] Length = 883 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 77/192 (40%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 ++ V+DVS SM KI ++ +LE+VK +N ++ SG + T+ + + + Sbjct: 285 VVFVIDVSGSMYG------RKIQQTKDALLKILEDVKEDDYLNFILFSGDVTTWKDNLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + + G+ TN L + + R+ + + II+ Sbjct: 339 ATPE--NIQQAREFVMNIHSQGM-TNINDALLRGISML---NKAREENAVPERSTS-III 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N+ + ++ + A + +Y +G ++ FL + A N Y Sbjct: 392 MLTDGDANVGESKPEKIQENVHNAIRGKFPLYNLGFGNNLNYNFLESMALENHGLARRIY 451 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 452 EDSDANLQLQGF 463 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 82/213 (38%), Gaps = 31/213 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I I VK NS +A +D++ V+D S SME KI +S+ +L+ + Sbjct: 210 ITKDIEQYVKNNSSIEAGIDLLCVIDKSGSMEG------KKIASVQQSLVQLLDFLSEKD 263 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + L+TF + L + ++ ++ I + G TN G + A+NQ Sbjct: 264 ------RLCLITFDGSAQRLTPLKTLTQDNKNYFKKAIYSIRASGQ-TNIAKGTEIAFNQ 316 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I +Q I ++DG++ E + + + +++ G Sbjct: 317 I------QQRKMKNQVTS---IFLLSDGQDQGAAEY---IQRQKDVVEDIVTIHSFGYGS 364 Query: 330 IRSHEFL-RAC-ASPNSFYLVENPHSMYDAFSH 360 + + C SFY +E+ + + F+ Sbjct: 365 DHDAALMSKICKVGQGSFYYIEDVKLLDEFFAD 397 >gi|260297|gb|AAB24261.1| type VI collagen alpha 3 chain [Homo sapiens] Length = 205 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 69/198 (34%), Gaps = 22/198 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + A D++ ++D S ++ + + + +++ + + + LV F Sbjct: 2 KNGAAADIIFLVDSSWTIGE------EHFQLVREFLYDVVKSLAVGEND---FHFALVQF 52 Query: 227 SNKIEEFFLL-EWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + FLL + + I +S G + + GL+Y I + + Sbjct: 53 NGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTNQTGKGLEY----IMAKHLTKAAGSLAG 108 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP-- 342 ++IV +TDG E K V+AIG+ + P Sbjct: 109 DGVPQVIVVLTDG-----HSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIASEPLN 163 Query: 343 NSFYLVENPHSMYDAFSH 360 + +EN S++D + Sbjct: 164 MHMFNLENFTSLHDIVGN 181 >gi|187950343|gb|AAI36394.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Homo sapiens] Length = 930 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|156408321|ref|XP_001641805.1| predicted protein [Nematostella vectensis] gi|156228945|gb|EDO49742.1| predicted protein [Nematostella vectensis] Length = 981 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 51/291 (17%), Positives = 106/291 (36%), Gaps = 54/291 (18%) Query: 56 RSLVHAATQIMNEGNGNNR----------KKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 +L++A T + N G N R + + NT +D+G + Sbjct: 93 NALLNAKTAVTNTGASNTRIIECCAMPAIARANYSYRFRAMLNTSVACKTRNPKDSGIIQ 152 Query: 106 DIDDIVRSTSLDIVVVPQNE-GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + +V + ++V Y ++ Y L+F S P S V Sbjct: 153 GENSLVNTFRNNLVDSSVISWQYFGTSTGNY---LQFPASGKVCNGSSSFD-PRFQSWYV 208 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + T R ++++V+D S SM S+ ++ +A ++ +L+ + + G+V Sbjct: 209 EAVTRMRTNIVVVIDRSSSM-----STAGRMALARQAAVTVLDTLGPND------KVGVV 257 Query: 225 TFSNKIEEFF---------LLEWGVSHLQRKIKYLSKFG---------------VSTNST 260 FS+ I + L ++ ++ ++ L+ G +T Sbjct: 258 AFSHFIIKPPGCFGGNVAEALPKNINRIKAWVEALTPRGKVSLQKTNLRYVSFPGATKYV 317 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKK----IIVFMTDGENLSTKEDQQ 307 P L+ A+ + ++ + A K+ +I+F+TDG+ D Sbjct: 318 PALEAAFEMLGGDFNIKILHHPLIALIKRSAENMILFLTDGDPFDRNPDVS 368 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 65/171 (38%), Gaps = 23/171 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ +LDVS SM SS K+ + ++ ++ ++ V+ VV +G Sbjct: 212 ERPAANLVFLLDVSGSM-----SSADKLPLVKTAMKTLVGQLTPKDRVSIVVYAGAAGLV 266 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + L + G ST GL+ AY + + Sbjct: 267 LEPTSDS------REIMAALDQL-QAGGSTAGGAGLELAYKVAEASKVDGINR------- 312 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ ++ + L Y + +K G + +G +E L Sbjct: 313 ---VILATDGDFNVGLSDNDKLLEYVEDKRKNGIAMSVLGFGRGNINEALM 360 >gi|119026487|ref|YP_910332.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] gi|118766071|dbj|BAF40250.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] Length = 431 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 49/130 (37%), Gaps = 14/130 (10%) Query: 150 NSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFFD--------SSITKIDMAIK 200 + + + S +T ++++++VLD S SM D S+ ++ + Sbjct: 303 GKYTLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCMDGSQRRCNKSNPKRLTALKE 362 Query: 201 SINAMLEEV----KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + + ++ I D N+ V+ + F L + L+ + LS G Sbjct: 363 AATSFIDATETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSANGA- 421 Query: 257 TNSTPGLKYA 266 T + G+ A Sbjct: 422 TPADKGMAAA 431 >gi|148657485|ref|YP_001277690.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569595|gb|ABQ91740.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 459 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 46/237 (19%), Positives = 94/237 (39%), Gaps = 34/237 (14%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------QTDARLDMMIVLDVSRSMES 186 ++ L P R++++ + S KV Q L ++ VLDVS SM Sbjct: 46 TAQLELDQVYAPPGQNVDRYLLLTLCSPAKVPPEHALPREQHRPPLHLVAVLDVSGSM-- 103 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 S TK+ A +++ L ++ + LVTFS++++ E S+ QRK Sbjct: 104 ----SGTKLASAKEALRQALHFLQDGDVFS------LVTFSDQVQTHLKAE---SYAQRK 150 Query: 247 IKYLSKFGVSTNSTP--GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + ++ L Q D+ ++ T +++ ++DG+ + Sbjct: 151 RDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQATT-------LVLLLSDGQANVGET 203 Query: 305 DQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAF 358 D + + +A++ G IV +G+ + + + A+ FY ++ + A Sbjct: 204 DLEKIGLRAQKARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYHIQEGSQIPAAL 260 >gi|75832116|ref|NP_001015590.2| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Bos taurus] gi|122140331|sp|Q3T052|ITIH4_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; Flags: Precursor gi|74267794|gb|AAI02562.1| Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 75/218 (34%), Gaps = 33/218 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S + +++ V+D S SM KI +++ +L+++ + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSM------MGRKIKQTREALIKILDDLSPHDQFD------L 313 Query: 224 VTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++FS++ + L V+ + + G TN + A + Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQG-GTNINDAMLMAVQLLEKANQEEL 372 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAI-VYAIGIRVIRSHEF 335 +I+ +TDG+ + L +K G ++ +G S+ F Sbjct: 373 LPEGSIT----LIILLTDGDPT--VGETNPLNIQKNVRKAINGQHSLFCLGFGFDVSYAF 426 Query: 336 LRACASPNS------FYLVENPHSMYDAFSHIGKDIVT 367 L A N + ++ + D + + ++T Sbjct: 427 LEKMALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|75070765|sp|Q5RB37|ITIH3_PONAB RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|55728562|emb|CAH91023.1| hypothetical protein [Pongo abelii] Length = 876 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 282 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKKEDYLNFILFSGDVSTWKEHLVQ 335 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 336 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRVPERSTS--IVI 388 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 389 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 444 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 63/173 (36%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ +LDVS SM + K+ + + + +++++ V VV +G Sbjct: 244 EIANAERPASNLVFLLDVSGSMN-----NARKLPLLKQGMKLLVDQLGENDKVAIVVYAG 298 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + S + + L G STN G++ AY ++ Sbjct: 299 AAG----MVLNSTNGDDKSTIMEALDRLQA-GGSTNGGQGIELAYQ------AATENFIK 347 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ TDG+ N+ + + K G + +G ++ Sbjct: 348 GGVNR---VILCTDGDFNVGVTSTSDLVTMAADKAKSGVFLSVMGFGTGNHND 397 >gi|330818825|ref|YP_004351042.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] gi|327374367|gb|AEA65719.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] Length = 660 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 46/137 (33%), Gaps = 19/137 (13%) Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +++++ + + G + R+ ++ G ST + + A + + R Sbjct: 526 VISYAAGYQIIKSFDDGWEEVSRRSHAVASVGRSTATGMAMTVALSNLILRDEDR----- 580 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-----IRSHEFLR 337 +++V +TDG + AK+ G V I I + L Sbjct: 581 ------RMMVLLTDGAAGDP---VMTAASYQAAKEAGVEVVTIFIGRDIQAIALTRSILN 631 Query: 338 ACASPNSFYLVENPHSM 354 A F V +P + Sbjct: 632 ATGFGQHFSNVNSPDEL 648 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 69/220 (31%), Gaps = 35/220 (15%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + + + ++ ++D+S SM + T+++ A +I A L+ Sbjct: 441 VAAVAEAFPEVRKDAQVLFLVDLSASMGYDNTAGQTRLEGAQHAITAALDHFTAGD---- 496 Query: 218 VVQSGLVTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + GL F+ + L + L + L A Sbjct: 497 --RVGLAGFTTTDGTITPGLVAPVADIADNRGALVEGVNSLEPVAHT-----PLYQAV-- 547 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-------KRGAIV 322 +Q D + IV ++DG N D +++ + + +V Sbjct: 548 ---ADFAQQQAAMWDPDRINAIVLLSDGVN--ATGDIETIGQDDMIHVLHGLHAETPVLV 602 Query: 323 YAIGIRVIRSHEFLRA--CASPNSFYLVENPHSMYDAFSH 360 + +G E L+A A+ +Y +P + Sbjct: 603 FTLGYSPDADVETLQAISSATGAHYYDATDPTEVEAVLGD 642 >gi|72008858|ref|XP_787086.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115969503|ref|XP_001184155.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 429 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/250 (14%), Positives = 76/250 (30%), Gaps = 25/250 (10%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 D +G + Y + N + + V S+ ++ Sbjct: 113 DEQSDFSEKGIDGDVVLTYNLMDAPTGSHTQVQNDYFVHFFSPIGLDVISKQ-----IVF 167 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+DVS SM TK+ +++ ML+ + N + S V + + Sbjct: 168 VIDVSASMYG------TKLSQTKEALKTMLDNLNPTDYFNIITFSDGVQYWRENNRLAPA 221 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + ++ + Y+ + + L A + ++ N + +++ +TD Sbjct: 222 Q--RRYMDDAMAYVDSLRDDSETN--LNEAIVKAGELLDSEARYNRPGDSVYSMMILLTD 277 Query: 297 GENLSTKEDQQSLYYCNEAKK--RGAIVYAI-GIRVIRSHEFLRACA-----SPNSFYLV 348 G DQQ + A++ G I G + + L A + Y Sbjct: 278 GRPSVGTTDQQ--EILDNAREVIAGKHSLNILGFGRLVDFDLLVKLAYENNGTAKMIYEG 335 Query: 349 ENPHSMYDAF 358 F Sbjct: 336 TTAAEQLREF 345 >gi|333025630|ref|ZP_08453694.1| hypothetical protein STTU_3134 [Streptomyces sp. Tu6071] gi|332745482|gb|EGJ75923.1| hypothetical protein STTU_3134 [Streptomyces sp. Tu6071] Length = 518 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 58/155 (37%), Gaps = 17/155 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++++ T R + +VLD S SM ++ D + +++ + + D VV Sbjct: 321 AALRKAGLTGTRARVYLVLDRSGSMRPYYK------DGSAQALGEQVLALAAHLDTEAVV 374 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + LV FS I+ L + + + L + T N ++ +R Sbjct: 375 R--LVFFSTAIDAMGTLT--LDAYEGVVDGLHEGAGRMGRT-------NYALAIEEVRAL 423 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 E A ++VF TDG + Q+L Sbjct: 424 HAKEAAGEPGLVVFQTDGPPDARTAATQALKAAET 458 >gi|311696337|gb|ADP99210.1| protein containing a von Willebrand factor type A (vWA) domain [marine bacterium HP15] Length = 704 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 66/210 (31%), Gaps = 34/210 (16%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 + P W + M + + D++ V+D S SM I Sbjct: 310 QEPSAAVFHQSWEGEDYLMAMLVPGE---SGAMALPRDLVFVIDTSGSMAG------ESI 360 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYL 250 A ++ A L + + ++ F+++ F ++ ++ + L Sbjct: 361 RQARDALQAGLGTLTPRD------RFNVIQFNSQTHSLFMQPEVATGNNLARARQYVDRL 414 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G T P L A + TED + ++F+TDG E Sbjct: 415 RADG-GTEMAPALSRA---------LEGGGETEDGARVRQVIFITDG--AVGNEAALFRQ 462 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + + ++ + I + F+R A Sbjct: 463 IRQQLGNQ--RLFTVAIGSAPNRHFMREAA 490 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 81/235 (34%), Gaps = 36/235 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + P S K + D++IVLD+S SM + SS +++ A + I + + Sbjct: 82 TLAGPSISKKKTTYLSSGA-DIVIVLDISPSMGAIEFSSKNRLEFAKELIKYFVYQ---- 136 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQI 270 GLV F+ + L +K+ Y+ G + G+ A + + Sbjct: 137 ---RENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL 193 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 K+ ++ +TDG S + + N A+ +Y+IGI Sbjct: 194 -----------KHSEAPKRSVIVLTDGVVNSDEVYKD--QVINLAQGLNVKIYSIGIGSS 240 Query: 331 RSHEFLRACASPNSFY-----LVENPHSMYDA-------FSHIGKDIVTKRIWYD 373 S FY + +P +++ F +G D K D Sbjct: 241 EELSVGFKLRS-GRFYQGTLREIYDPSMLFEISGKTGGLFYSVGDDFSFKLAIQD 294 >gi|156405886|ref|XP_001640962.1| predicted protein [Nematostella vectensis] gi|156228099|gb|EDO48899.1| predicted protein [Nematostella vectensis] Length = 336 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 66/197 (33%), Gaps = 12/197 (6%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +D+ ++D S S+ + + + I + V + S + Sbjct: 119 ASVDLGFLIDSSASIGY---QNFKSVRKMVDRIINVFTISPKQTHVAIITISDRPAHVLR 175 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + ++R + L T L+ A+ ++F G + +K Sbjct: 176 FNTLQGADLNSASVRRVVDNLRFTRDKTRIDLALRMAHKEMFSKLG------GGRKHAQK 229 Query: 290 IIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPNSFY 346 I+V TDG + + ++ KK G + +G+R I L S + Sbjct: 230 ILVVFTDGIQTWRPDKMEKLSKASEPLKKMGVKIIPVGVRGEEINVGSLLDMSLDTYSVF 289 Query: 347 LVENPHSMYDAFSHIGK 363 +E + A + K Sbjct: 290 NMEFFPELLQALKKMSK 306 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 42/328 (12%), Positives = 110/328 (33%), Gaps = 47/328 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 S D+S T+I+ +G K+ + + ++ N+ + Sbjct: 1321 QSDQDQSNNRDITKIL-DGLLTVEKQQTDENTHDKQPKIEEKDQSEGQQEEFEQNETHSL 1379 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT-- 168 + + +++ + + + K ++ I+S+VK S Sbjct: 1380 RKISQKKVLI----KSIQRKVKTNKEKVQKALNEEDKENQTKSQQHRISSNVKNISGQFS 1435 Query: 169 -------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +D++ V+D S SM + +D+ +++ +++ ++ + Sbjct: 1436 LGQLQPMRFPIDLICVIDTSGSM------NGQPLDLLKETLLFLVDLLQTGD------RI 1483 Query: 222 GLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+ FS + L + + ++ +I L G TN G++ A++ + Sbjct: 1484 CLIQFSTNAQRLTPLLSIESKDNIKSIKNEINRLVAKG-GTNICQGMQLAFDVL------ 1536 Query: 277 RQHCNTEDANYKKIIVFMTDGEN----LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +Q + ++DG N ++ + L + + + G Sbjct: 1537 KQRRYKNPITS---VFLLSDGLNDGAENKIRDLLKQLNFYQNYNEENFTIQTFGFGKDHD 1593 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAF 358 + + +FY + + H + + F Sbjct: 1594 PNLMDKISQLMDGNFYYIGDIHRIDECF 1621 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 60/171 (35%), Gaps = 19/171 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 TDA +M +LD+S SM + S T+I A K I + + ++ GL S Sbjct: 86 TDAGSSIMFLLDISPSMAAKDMSGETRIAAAKKIIRKFVAK----YPGDSF---GLTALS 138 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ LS + + G+ A + + + Sbjct: 139 SSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLAVSSAYMTRTKLNSS------- 191 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 IV +TDGEN + + + ++ + Y IGI + L Sbjct: 192 --YIVLLTDGENNTGEINPKTA--AKVLVNKNIGFYVIGIGS-SGYTTLEY 237 >gi|196007110|ref|XP_002113421.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] gi|190583825|gb|EDV23895.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] Length = 1343 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 68/203 (33%), Gaps = 27/203 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + +++VLD S SM + + I++ ++ ++ I G++ FS Sbjct: 296 PVRIVMVLDKSGSMRG------SNLQQLIQAATNVILQLGQIDGS-----IGIIIFSTSA 344 Query: 231 EEFFLL--EWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + I L T+ G+ + G ++ Sbjct: 345 TVTCPLMAVNNDQDKNKLIGCLPPEASGGTSIGSGILKGIELLLGSVGEQKPSGGH---- 400 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---EFLRACASPNS 344 ++ M+DG+ + + + + +V +I S + ++ + Sbjct: 401 ---LIVMSDGQ---ENANPRIKDVMSNITENDVVVTSISFGQSASKVLEDLAKSTGGSSY 454 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 F ++ +AF+ I ++V Sbjct: 455 FASTNGTLTLMNAFTAIISNLVG 477 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 23/110 (20%), Positives = 39/110 (35%), Gaps = 15/110 (13%) Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 KID K+ + + + + + GLV++ N +E L + + I Sbjct: 184 KIDELQKAAKNFVNKFE----IKANTKIGLVSYGNNGKEVHSLTNELDRINSSIDSGLSI 239 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 G TN G++ A ++ K IV MTDG + Sbjct: 240 GGGTNVGDGIRMA-----------NGILNNGSDADKYIVLMTDGMPTAAT 278 >gi|121602946|ref|YP_980275.1| vault protein inter-alpha-trypsin subunit [Polaromonas naphthalenivorans CJ2] gi|120591915|gb|ABM35354.1| Vault protein inter-alpha-trypsin domain protein [Polaromonas naphthalenivorans CJ2] Length = 701 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 52/135 (38%), Gaps = 20/135 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +V +Q + D + V+D+S SM F +D A + ++ +++ N ++ SG Sbjct: 317 QVAAQAISPRDYIFVVDISGSMHGFP------LDTAKTLMRELIGKLRPSDTFNVLLFSG 370 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F + + + R I + G T P LK Y Sbjct: 371 SNRFLSPASVPAT-QANIEQAVRTIDEM-GGGGGTELIPALKRVY------------AEP 416 Query: 283 EDANYKKIIVFMTDG 297 + A+ + +V +TDG Sbjct: 417 KAADVSRTVVVVTDG 431 >gi|282863104|ref|ZP_06272164.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282562086|gb|EFB67628.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 248 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 71/175 (40%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 V+ + R + +VLD S SM ++ D S+ + + S++A L++ ++P Sbjct: 36 VRRHGLEGVRAAVYLVLDRSGSMRPYYRDGSMQHLAEQVLSLSAHLDDDGIVP------- 88 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V FS ++ L G +R++ L T N + M + H Sbjct: 89 --VVFFSTDVDGSTDLTLGGH--RRRVDKLHANLGHMGRT-------NYHWAMDEVIDHY 137 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ ++VF TDG S ++ Y C A+ + +G S+EF Sbjct: 138 LESGSSAPALVVFQTDGGPTSRFAAER--YLCKAARLPLFWQF-VGFGDPDSNEF 189 >gi|75906479|ref|YP_320775.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75700204|gb|ABA19880.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 615 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 84/236 (35%), Gaps = 33/236 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + + P+ F I S + N L++ +V+D S SM Sbjct: 2 VKTSYEFDQPILPAGFSLKANILLRFRAEIPESPRRN------LNLSLVIDRSGSMAGAA 55 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 +K+ +++++++ ++ VV V + + + L++ I+ Sbjct: 56 LHHA------LKAAESVVDQLEPKDILSVVVYDDAV---DTVVSPQPVTD-KPALKKSIR 105 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI--IVFMTDGE-NLSTKED 305 + G++ S LK + +KI ++ +TDG N+ ++ Sbjct: 106 QVRAGGITNLSGGWLK------------GCEYVKHQLDPQKINRVLLLTDGHANMGIQDP 153 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASPNSFYLVENPHSMYDAFS 359 + + + G +G + + L A A+ +FY +++ + FS Sbjct: 154 KILTATSAQKAEEGITTTTLGFAQGFNEDLLIGMARAANGNFYFIQSIDEAAEVFS 209 >gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium nodulans ORS 2060] Length = 725 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 67/196 (34%), Gaps = 32/196 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + ++ + + + + D+ V+D S SM + A S+ L+ + Sbjct: 314 GTDEYLLAVVTPPEGQNLARRPRDVTFVIDNSGSMAG------ASMRQAKASLLMALDRL 367 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + ++ F N +++ F E ++ + + L G + P Sbjct: 368 APAD------RFNVIRFDNTMDQLFPEAVPADERHLAVARSFVAALEARGGTEMLAP--- 418 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + E + + IVF+TDG +++ ++ A + + ++ Sbjct: 419 --------LTAALADPTPERTDRVRQIVFLTDG----AIGNEEQIFSAIAAGRGRSRLFM 466 Query: 325 IGIRVIRSHEFLRACA 340 IGI + + A Sbjct: 467 IGIGSAPNAHLMTYAA 482 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 66/203 (32%), Gaps = 19/203 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L ++++LDVS SM S KI ++ ML+ + + ++TF Sbjct: 15 KAKPLPVVLLLDVSGSM------SGEKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFG 68 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ L + + G T L+ A I D Y Sbjct: 69 SQVALHQPLASASDIHWQDLSA----GGMTPLGTALQMAKAMIEDKD------VIPSRAY 118 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSF 345 + +V ++DG E + + + + A+ I L + N Sbjct: 119 RPTVVLVSDGGPNDAWEKPLNAFISD-GRSAKCDRLAMAIGADADEAVLGKFIEGTSNRL 177 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + EN + D F + + + Sbjct: 178 FYAENAKQLRDFFKFVTMSVTIR 200 >gi|3236344|gb|AAC23663.1| integrin alpha E2 [Rattus norvegicus] Length = 1167 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 77/203 (37%), Gaps = 26/203 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + IVLD S S+E + A I+ M+ N LV + I+ Sbjct: 196 IAIVLDGSGSIEP------SDFQKAKDFISTMMRNFYEKCFECNF---ALVQYGGVIQTE 246 Query: 234 FLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L ++ K++ + + T + +++ + IF K++ Sbjct: 247 FDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPS------RGSRKKALKVM 300 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI----RSHEFLRACASP---NS 344 V +TDG+ + ++ + +K +G + +AIG+ ++ L+ AS Sbjct: 301 VVLTDGDIFRDPLNLTTVI--SSSKMQGVVRFAIGVGNAFENNNTYRELKLIASDPKAAH 358 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 + V N ++ S + + I+ Sbjct: 359 TFKVTNYSALDGLLSKLQQRIIH 381 >gi|53804868|ref|YP_113288.1| MxaC protein [Methylococcus capsulatus str. Bath] gi|53758629|gb|AAU92920.1| MxaC protein [Methylococcus capsulatus str. Bath] Length = 327 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 66/182 (36%), Gaps = 25/182 (13%) Query: 174 MMIVLDVSRSMESFFDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 M+++LD SRSM+ F T + + + L + N++V G+ FS Sbjct: 85 MVLLLDRSRSMDDSFAGRTPTGGEESKSAAAERLLSGFVSSGRNDLV--GVAAFSTSPLF 142 Query: 233 FFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + + + G T+ + GL A + D +I+ Sbjct: 143 VLPLTDNKAAVLAAVHAMKLPGLAQTHVSKGLAMALSYFGDDSTAGS----------RIV 192 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 + ++DG + + L K++G +Y I FLR S F +NP Sbjct: 193 LLVSDG--AAEVDPDSELKLRRWFKEKGVRLYWI---------FLRTAGSHGIFETPDNP 241 Query: 352 HS 353 Sbjct: 242 EE 243 >gi|82913466|ref|XP_728655.1| methanol oxidation protein [Plasmodium yoelii yoelii str. 17XNL] gi|23485116|gb|EAA20220.1| methanol oxidation protein [Plasmodium yoelii yoelii] Length = 720 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/182 (20%), Positives = 66/182 (36%), Gaps = 25/182 (13%) Query: 174 MMIVLDVSRSMESFFDSSI-TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 M+++LD SRSM+ F T + + + L + N++V G+ FS Sbjct: 160 MVLLLDRSRSMDDSFAGRTPTGGEESKSAAAERLLSGFVSSGRNDLV--GVAAFSTSPLF 217 Query: 233 FFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + + + G T+ + GL A + D +I+ Sbjct: 218 VLPLTDNKAAVLAAVHAMKLPGLAQTHVSKGLAMALSYFGDDSTAGS----------RIV 267 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 + ++DG + + L K++G +Y I FLR S F +NP Sbjct: 268 LLVSDG--AAEVDPDSELKLRRWFKEKGVRLYWI---------FLRTAGSHGIFETPDNP 316 Query: 352 HS 353 Sbjct: 317 EE 318 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 70/207 (33%), Gaps = 36/207 (17%) Query: 173 DMMIVLDVSRSMES-----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S+SM+S +++ D + + + D + GL+ F Sbjct: 93 DILLAVDISQSMDSVDFNDAQGQPLSRWDAVKAVVADFIAQ--RADD-----RLGLIVFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + L+ + + G +T + + + Sbjct: 146 TGAYPQAPLTRDHASLKLLLDEAAVGMAGPNTAVGDAIGLGIRMLDAAEEQ--------- 196 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRA 338 K+++ +TDG T A + V+ IGI + + LR Sbjct: 197 --DKVLILLTDG--NDTGSAVPPARAAALAAQHHVTVHTIGIGDPAATGEDRVDFDILRE 252 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGK 363 A + F+ + ++ + ++ + + Sbjct: 253 VARTAGGQFFPARDLATLREVYATLDR 279 >gi|148728188|ref|NP_002199.3| integrin alpha-E precursor [Homo sapiens] gi|226694184|sp|P38570|ITAE_HUMAN RecName: Full=Integrin alpha-E; AltName: Full=HML-1 antigen; AltName: Full=Integrin alpha-IEL; AltName: Full=Mucosal lymphocyte 1 antigen; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor Length = 1179 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 203 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 252 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 253 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 306 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG + ++ N K +G +AIG+ Sbjct: 307 MVVLTDGGIFEDPLNLTTVI--NSPKMQGVERFAIGVG 342 >gi|119610886|gb|EAW90480.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] Length = 1196 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 220 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 269 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 270 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 323 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG + ++ N K +G +AIG+ Sbjct: 324 MVVLTDGGIFEDPLNLTTVI--NSPKMQGVERFAIGVG 359 >gi|4406708|gb|AAB59359.2| integrin alpha E precursor [Homo sapiens] gi|109659254|gb|AAI17208.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|109730475|gb|AAI13437.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|313883908|gb|ADR83440.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1 [synthetic construct] Length = 1179 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 203 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 252 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 253 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 306 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG + ++ N K +G +AIG+ Sbjct: 307 MVVLTDGGIFEDPLNLTTVI--NSPKMQGVERFAIGVG 342 >gi|7239181|gb|AAF43107.1| HUMINAE [Homo sapiens] Length = 1127 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 151 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 200 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + +++ +++ G T + +++ + IF + + K+ Sbjct: 201 EFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRKAS------KV 254 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG + ++ N K +G +AIG+ Sbjct: 255 MVVLTDGGIFEDPLNLTTVI--NSPKMQGVERFAIGVG 290 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 83/223 (37%), Gaps = 40/223 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP--------DVNNVVQSGLVT 225 ++ VLDVS SM K+ +++ +L E+ D V Sbjct: 357 VIFVLDVSGSM------VGQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWTINATQ 410 Query: 226 FSNKIEEFFLLEWGVSH----------------LQRKIKYLSKFGVS--TNSTPGLKYAY 267 ++ + + W ++ ++ ++Y KF + S+ ++ A Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDAL 470 Query: 268 NQIFDMQGMRQHCNTEDANYKK-IIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAI 325 N+ + + + + AN K IIVF+TDGE E Q+ + Y + + +Y++ Sbjct: 471 NKAHLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQELIKYVSNTNEEKYPIYSL 530 Query: 326 GIRVIRSHEFLRACASPNS-----FYLVENPH-SMYDAFSHIG 362 G +FL+ + N+ Y + + + + I Sbjct: 531 GFGEGADIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEIS 573 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 38/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAYVQ 356 Query: 234 FLLE--WGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELVQINSGSDRDTLAKRLPTAAAGGTSICTGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GA+++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAVIHTVALGPSAAQELEQLSKMTGGFQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQAQNNGLIDAFGALS 476 >gi|269126104|ref|YP_003299474.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311062|gb|ACY97436.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 583 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 66/203 (32%), Gaps = 31/203 (15%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---- 228 ++ ++DVS SM E + T++ A +++ +++ + + GL FS Sbjct: 393 VLFLVDVSGSMAEPLPGTGRTRMQQAQRALRQAVDDFVAVD------RVGLWEFSTDLGG 446 Query: 229 -----KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + + L+ +I L G T A+ + ++ + Sbjct: 447 GRDYRPLVPIRPMSTPGHRERLREQIAALRPRG-DTGLYDSTLAAFRHVRAVRQDGAINS 505 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLRAC- 339 +V +TDG N E G ++ I L+ Sbjct: 506 ---------VVVLTDGRNDDPGGGLSLEELLKELDGADGVRIFTIAYGAGADGGALKKIS 556 Query: 340 -ASPNSFYLVENPHSMYDAFSHI 361 A+ + Y +P ++ + + Sbjct: 557 EATDAAAYDSRDPATLDKVLTQV 579 >gi|222101614|gb|ACM44012.1| thrombospondin-related anonymous protein [Babesia bovis] gi|222101618|gb|ACM44014.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +LD IV+D S S+ ++ + I + ++ V L N ++ L T Sbjct: 39 QCKKQLDFSIVVDESASISNDQWEGQM------IPFLRNLIHTVDL---DNTDIRLSLTT 89 Query: 226 FSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQ 278 +S + F L+ S + + L + T + L Y I Sbjct: 90 YSTPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRALNYVRKAILPY----- 144 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ----GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 63/172 (36%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++ ++ LD+S SM + D + ++++ + K I+ ++E V +V Sbjct: 79 GSKMETVKRQGVETVVALDISNSMLAQ-DVTPSRLEKSKKLISRLVETFNN-DKVAMIVF 136 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +G I ++ I T+ + A Sbjct: 137 AGEAFTQLPITSDYI---SAKMFLETINPSLISTQGTDIAGAINLAMKSFT--------- 184 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + IV +TDGEN + ++ EA K+G V+ +G+ Sbjct: 185 --PNEGVGRAIVLITDGENH----EGGAVEAAQEAAKKGVRVFVLGVGSPDG 230 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 45/265 (16%), Positives = 88/265 (33%), Gaps = 40/265 (15%) Query: 104 VNDIDDIVRSTSLDIVVVPQN--EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 ND + +++ P G + + K + P + +++ + S Sbjct: 3 YNDDEQTAPTSTATEPARPTVGITGQLVKQVRLNKYHNDVASMAPH---DQELLLELRGS 59 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +S A LD++ V+DVS SM +ID ++ ++ ++ + + Sbjct: 60 --SSSTDRAGLDLVAVIDVSGSM------DGDRIDKVKTALQFVIRKLSDLD------RL 105 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +VTF L + + Q ++K L K TN GL+ + + + Sbjct: 106 CIVTFCTNATRLCPLRFVTAAAQAELKALVDGLKAYGDTNMKGGLETGMSVV-----DGR 160 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 A ++ M+DG + + K VY L A Sbjct: 161 SLAAGRAVS---VMLMSDGYQNHGGDARDV-----HLKN--VPVYTFSFGASHDSNLLEA 210 Query: 339 CAS---PNSFYLVENPHSMYDAFSH 360 A +F V + ++ FS Sbjct: 211 IARKSLGGTFNYVADSANLTGPFSQ 235 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 83/248 (33%), Gaps = 19/248 (7%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV--------V 120 N+ K + + R+ FV ++ + Sbjct: 70 RKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTDEKNMHLTPLCKTINNN 129 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDARLDMM 175 + S ++ ++ P + + + + S + + +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVNVASKSKALKKNTFNIPIDLM 189 Query: 176 IVLDVSRSM------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +V D+S SM E + +KI + + + + ++ + N + G+ F+ Sbjct: 190 VVADLSGSMKDGIKGEKLEGGTNSKIYILREVLKELADKSLFTQESNEYNRIGITAFAMG 249 Query: 230 IEEFFLLE 237 E + Sbjct: 250 AEHPKENK 257 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 69/214 (32%), Gaps = 31/214 (14%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P N+ ++ + T+ P + + ++ + V+ + ++ V+D S Sbjct: 244 PTNDFRLFFDTAKTDLSASVLTYRPDKSEDGYFLLLASPPVEEVGDVKTKKTVIFVVDRS 303 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL----LE 237 SM S KI+ A ++ +L + N ++ + + +E F L+ Sbjct: 304 GSM------SGEKIEQAKEAAKFVLNNLNEGDLFN------IIAYDSDVESFEPELQKLD 351 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 G STN L A + D + ++F+TDG Sbjct: 352 DKTREKALGFVDNLYAGGSTNIDGALAKAMGMLKDDK------------RPSYMLFLTDG 399 Query: 298 ENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVI 330 +Q + AK K I V Sbjct: 400 LPTH--GEQNEAKIVDNAKQKNDVRARVISFGVG 431 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 71/211 (33%), Gaps = 43/211 (20%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 +T D + D+ I+ D S S+E F + + N Sbjct: 397 SVTEPTDKLPVNDCQYDVGIIFDSSGSLEKNFQTQL--------------------QIAN 436 Query: 217 NVVQSGLVTFS----NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + Q +V F+ ++ F+ L++ I+ + +T + LK +F+ Sbjct: 437 KLFQVAIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKR-MALLFE 495 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ TDG + + K++G VY +GI ++ Sbjct: 496 ASKRDNCKMK--------LLVFTDG-----YSAEDTAEGIEALKRQGITVYTVGISTDKN 542 Query: 333 HEF----LRACA-SPNSFYLVENPHSMYDAF 358 L+ A SP+ ++ + ++ F Sbjct: 543 AGLNVSELKGMATSPSHYFDSSDFDNLLKHF 573 >gi|262050538|ref|NP_001159921.1| inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 precursor [Homo sapiens] Length = 900 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|149921342|ref|ZP_01909796.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] gi|149817775|gb|EDM77239.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] Length = 739 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 9/127 (7%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G T + GL+ ++ + D Q D + + + + +TDG+ + D Q Sbjct: 438 GSGTFTHRGLELVFDNLLDYQANPPALYPADEDTQYVNILITDGQYSTYSTDAQVQNALE 497 Query: 314 EAKKRGAIVYAIGIRVI----RSHEFLRACAS-----PNSFYLVENPHSMYDAFSHIGKD 364 G+ Y IG + L A+ + ++ + A I +D Sbjct: 498 ALLDAGSKTYVIGFGDGLNTTEAQLQLMNMATWGSGGTEMPFDADSQAELELALGAIIED 557 Query: 365 IVTKRIW 371 I Sbjct: 558 IEFDPCC 564 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/250 (13%), Positives = 73/250 (29%), Gaps = 56/250 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +I+LD S SM F T+ ++A+ ++ ++ N V L+ + + E Sbjct: 38 FLIILDYSSSMSEPFGMGQTRWEVAVDAVTTLMTTNGGFFQKNMHV--ALMRYGHDPEPD 95 Query: 234 FL------------------LEW-------------GVSHLQRKIKYLSK---------- 252 + W + + + I Sbjct: 96 NPGTPIPGDLSGLIDGQSLDVGWYDPDAADKTYFDCNGAEIIQSINATPPPLCPGGPGNC 155 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--------ENLSTKE 304 G+ T + + + I + I+ +TDG LS Sbjct: 156 SGIGTWTKGAMDLGADLISQSKADHPEDTVPGDERFYGIMVVTDGAWTPAQGFPTLSPPS 215 Query: 305 DQQSLYYCNEAKKRGAIVYAI----GIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ ++ Y + + + + E A + S ENP + A Sbjct: 216 ENPAITAAQLFDQQDIPTYVVAVADAVNLPFADELAEAGGTDESI-AAENPAELTQAIGA 274 Query: 361 IGKDIVTKRI 370 + +DI + + Sbjct: 275 VVQDIADQVV 284 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 56/153 (36%), Gaps = 25/153 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S KI +++ +L+++ N L+TFS++ ++ Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFN------LITFSSEATQW 322 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 V+ + + G TN + A Q+ D + T + Sbjct: 323 SPSLVPASAENVNKARSFAAAIHALG-GTNINDAVLMAV-QLLDRSNREERLPTRSVSL- 379 Query: 289 KIIVFMTDGENL---STKEDQQSLYYCNEAKKR 318 I+ +TDG+ + ++ ++ Sbjct: 380 --IILLTDGDPTVGEGPASNSKTRCTGENVREA 410 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 63/169 (37%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ + + +S+ EA G ++ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQNNVREAVSGGYSLFCLGFGFDVSYAFLEKLA 430 >gi|196232430|ref|ZP_03131283.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196223502|gb|EDY18019.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 879 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 67/201 (33%), Gaps = 20/201 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + M++VLD S SM + TKI +A + + ++ G+V Sbjct: 417 DTPTVAMLVVLDRSGSMTAAVAGQ-TKISLADQGAVFAMNALQPKD------YFGVVAVD 469 Query: 228 NKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 K L ++KI ++ G + A+ Q+ D+ +H Sbjct: 470 TKPHTVVPLAPISAKGAAEQKILSITAGGGGIYIYTSMVEAFQQLRDIPARVKHLLLFSD 529 Query: 286 NYKKIIVF---MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRACA 340 M+DG + SL + +G+ + + FLR A Sbjct: 530 AADAEEKAAGEMSDG----IRTGGNSLDLASAMLAAKITTSVVGLGTEQDKDTPFLRQLA 585 Query: 341 S--PNSFYLVENPHSMYDAFS 359 FYL ++ ++ FS Sbjct: 586 ERGSGRFYLTDDATTLPQIFS 606 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 44/211 (20%), Positives = 79/211 (37%), Gaps = 38/211 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM + +++ ++ L ++ + +G+VTF + Sbjct: 307 LILVLDKSGSM-----AGGDRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + QR I L G T+ GL+ A+ I + Sbjct: 357 SALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE---------- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIR--SHEFLRACASPNSFYL 347 IV +TDGE ++ C +E K+ GAI++ + + E L Sbjct: 407 IVLLTDGE-------DSTISSCFDEVKQSGAIIHTVALGPSADPGLEELAKMTGGMKTSA 459 Query: 348 VENPHS--MYDAFSHIGKD---IVTKRIWYD 373 +N + + DAFS + + I + I D Sbjct: 460 TDNAQNNGLIDAFSALSSENGAITQRSIQLD 490 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/117 (22%), Positives = 43/117 (36%), Gaps = 15/117 (12%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 +KID K+ + + + + GLV++ NK E L + + I Y Sbjct: 182 SKIDELQKAAKNFVNKFETKSNTK----IGLVSYGNKGEVVHSLTNELDRINSSIDYGLS 237 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 +TN G++ A N + + +N K IV MTDG + Sbjct: 238 VYGATNIGDGIRKA-NGLLN----------NGSNADKYIVLMTDGIPTAATCYSNVK 283 >gi|146302762|ref|YP_001197353.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146157180|gb|ABQ08034.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 709 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 39/238 (16%), Positives = 93/238 (39%), Gaps = 30/238 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF---IPWYTNSRHIVMPIT 159 F+N ++ + ++ + + Y+ + + P T PW + ++ + + + Sbjct: 277 FLNSGQEVPKD-AVRVEEMVNFFKYNYP-QPKNEHPFSINTEYSDSPWNSQNKILKIGLQ 334 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + +++ ++DVS SME + K+ + +S+ ++ E++ V+ VV Sbjct: 335 GK-NIATNDLPSSNLVFLIDVSGSME-----DMNKLPLLKQSMKILVNELRPTDKVSIVV 388 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 +G G + + IK L + ST G++ AY +I ++ Sbjct: 389 YAGAAG------MVLPPTSG-NEKKTIIKALDQLEAGGSTAGGAGIELAY-KIATENFIK 440 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ TDG+ N+ + + E +K G + +G + + Sbjct: 441 GGNNR--------VILATDGDFNVGSSSNSDMEKLIEEKRKTGVFLTCLGYGMGNYKD 490 >gi|291221810|ref|XP_002730913.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 858 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 83/233 (35%), Gaps = 34/233 (14%) Query: 145 IPWYTNSRHIVMP---ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + ++ +P I ++ N R +++VLD S SM +I+ + Sbjct: 327 LQHEDFKNNVNLPRLMIDTTPTFNIVKTGRRRVVLVLDTSGSMAGN------RIERLHRD 380 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNS 259 + + +I D + V G++ FS+ ++ + +I + TN Sbjct: 381 TTHFI--LNVIDDDSFV---GIIQFSSDATVLSEMKKIDYASRPQIAASVPYSANGGTNF 435 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G++ A ++ + Q + + ++ +TDG+ T + +E G Sbjct: 436 GAGIRAALQELKESQLSLKGAS---------LLIITDGQFSYTSDVT------DEVYASG 480 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 V I LR + + S+Y V + + + + I + + Sbjct: 481 VRVDTIAYTQAAEDS-LRVLSDRTGGSYYYVSDDETSTELLDSLTSTITDRPV 532 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 69/201 (34%), Gaps = 26/201 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF---SNKI 230 ++ V+DVS SM TK+ K+++ +L ++ N V S V+ S I Sbjct: 320 VVFVIDVSGSMFG------TKMKQTKKAMHVILNDLHHDDYFNIVTFSDAVSVWKASGSI 373 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + + + G T+ L A +F+ + Sbjct: 374 QATPP---NIKSAKVYVNKMEADGW-TDINAALLVA-ASVFNQSTGETGRGKGLKKIP-L 427 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS--- 344 I+F+TDGE +T + + AK+ ++ + + +R + N Sbjct: 428 IIFLTDGE--ATAGVTVASRILSNAKQSLKGNISLFGLAFGDDADYHLMRRLSLENRGVA 485 Query: 345 ---FYLVENPHSMYDAFSHIG 362 + + + + I Sbjct: 486 RRIYEDADATLQLKGFYDEIA 506 >gi|59857769|gb|AAX08719.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Bos taurus] Length = 916 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 75/216 (34%), Gaps = 29/216 (13%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S + +++ V+D S SM KI +++ +L+++ + L Sbjct: 266 PDSLSTIPKNVIFVIDKSGSM------MGRKIKQTREALIKILDDLSPHDQFD------L 313 Query: 224 VTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++FS++ + L V+ + + G TN + A + Sbjct: 314 ISFSSEATTWKPLLVPASTENVNEAKSYATGIQAQG-GTNINDAMLMAVQLLEKANQEEL 372 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI-VYAIGIRVIRSHEFLR 337 +I+ +TDG+ + + ++ G ++ +G S+ FL Sbjct: 373 LPEGSIT----LIILLTDGDPTVGETNPSNIQKNVRKAINGQHSLFCLGFGFDVSYAFLE 428 Query: 338 ACASPNS------FYLVENPHSMYDAFSHIGKDIVT 367 A N + ++ + D + + ++T Sbjct: 429 KMALENGGLARRIYEDSDSALQLQDFYQEVANPLMT 464 >gi|256084536|ref|XP_002578484.1| vacuolar protein sorting 26 vps26 [Schistosoma mansoni] gi|238663859|emb|CAZ34722.1| vacuolar protein sorting 26, vps26, putative [Schistosoma mansoni] Length = 1295 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 89/269 (33%), Gaps = 40/269 (14%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + +++ + +N+ V + + L +V+ + S+S Y P + Sbjct: 211 DVKSTHDSFTWNCKNDSNH-SLVKLSSEFIPDHDLQMVLSLSDPLTSLS-SLEYGDPNQS 268 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + V S D R + + ++D S SME I A S Sbjct: 269 SILAMNCLMVQLL----PDIPNVVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTS 318 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKY---LSKFGVS 256 + L+ + + + ++ F + F ++ L + Y L+ Sbjct: 319 LLLFLKSLPMS------CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGG 372 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEA 315 T + LK A H + K I+F+TDG+ + + ++A Sbjct: 373 TEAYNALKAAL-----------HSTPSGEGWFKQIIFLTDGDVGNADEVIGLVRMNVDKA 421 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + V+ IG+ S + A + Sbjct: 422 R-----VFTIGLGQGVSTALIGGVARAGN 445 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/171 (12%), Positives = 53/171 (30%), Gaps = 6/171 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + I+ I + I + + ++ H+ K L + +D + + A I E + Sbjct: 12 RRQRGAVIIMITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEYDYL 71 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ------NEG 126 ++ + + + + D++ +++ P G Sbjct: 72 RARQEGLVTFTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDPFVPVLTPG 131 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 ++ +PL + R + + Q L M+V Sbjct: 132 AQYVRVTVSDVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMV 182 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 78/207 (37%), Gaps = 37/207 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM + ++++ A ++ N +++ V +V +++ T+ + + Sbjct: 41 VELVLDVSGSMRARDVDGMSRMAAAKQAFNEVIDAVP--DEVRLGIRTLGATYPGQDRKT 98 Query: 234 FLLEW---------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L+ + + + L G T L+ A + R+ Sbjct: 99 GCLDSKQLYPVGRVDRTEAKTAVATLRPTGW-TPIGLALRGASKDLAGGDATRR------ 151 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI-----VYAIGIRVIRSHEFLRAC 339 IV +TDGE+ + D C+ A++ A V +G+ + +C Sbjct: 152 ------IVLITDGEDSCGQPD-----PCDVARELAAQGTHLVVDTLGLTLDSKVRDQLSC 200 Query: 340 ---ASPNSFYLVENPHSMYDAFSHIGK 363 A+ ++ V++ + + + Sbjct: 201 IAEATGGTYTAVQHTDQLSTRIKQLVR 227 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 64/163 (39%), Gaps = 5/163 (3%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +G + + + L ++ + G++I++S +F KT L S +D + AA ++ +G + Sbjct: 9 ERGAVLPMVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQEL--DGASDAL 66 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 + + N + N V + T V + Y S+ Sbjct: 67 TRATNAGVTAGNANRVVYQASSASLANADV-TFSTALNGTYSAAGVASSSSSYVKCKHSK 125 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I + F+ + + V + + +V++Q+ + + ++ Sbjct: 126 SGITARLIKFVGGA--NSYAVAALAVATRVHAQSTCPIPVALI 166 >gi|149639247|ref|XP_001506420.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Ornithorhynchus anatinus] Length = 1113 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 78/200 (39%), Gaps = 37/200 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 270 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 317 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 318 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 376 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR---VIRSHEF 335 CN KII+ TDG ++++ + + + V+ + R Sbjct: 377 CN-------KIIMLFTDG------GEERAQEIFKKYNQDKKVRVFTFSVGQHNYDRGPIQ 423 Query: 336 LRACASPNSFYLVENPHSMY 355 AC + +Y + + ++ Sbjct: 424 WMACKNKGYYYEIPSIGAIR 443 >gi|329923737|ref|ZP_08279132.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328941103|gb|EGG37405.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 654 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 67/206 (32%), Gaps = 30/206 (14%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 ++ A R P + + + + + ++ V+D S SM Sbjct: 423 TDVALDATIRAAAPYQAIRTKKDEVA---FAIELDDLRQKKRENRVGATLLFVVDASGSM 479 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ +I ++L+ + G++ F +N+ E + + Sbjct: 480 -----AARKRMTAVKGAILSLLQ-----DAYEKRDRIGMIAFRNNEAELILPVTRSIEAA 529 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++++ + G T GL AY + R++ +T +++ +TDG + Sbjct: 530 SKQLRSIPT-GGKTPLADGLAQAYKVLHS--EKRRNNDTLP-----VMIIVTDGRANESS 581 Query: 304 EDQQS--------LYYCNEAKKRGAI 321 L + G Sbjct: 582 IGLTVYADIWNECLEAAKLIRAAGIR 607 >gi|67468139|ref|XP_650129.1| elongation factor-2 kinase [Entamoeba histolytica HM-1:IMSS] gi|56466695|gb|EAL44742.1| elongation factor-2 kinase, putative [Entamoeba histolytica HM-1:IMSS] Length = 402 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 78/226 (34%), Gaps = 33/226 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SN 228 +LD+++++D + SM + S+ + I+ + +I ++ +V + Sbjct: 4 QLDLVLLVDTTGSMGQYLHSAQSNINKIVNTITN---------SEKIDLRFAIVEYKDHQ 54 Query: 229 KIEEFFLLE-----WGVSHLQRKIKYLSKFGVS----TNSTPGLKYAYNQIFDMQGMRQH 279 ++ F L+ + +Q I LS +G + T A N + + Sbjct: 55 PNQQQFALKKYDWMNDIKDIQNAINQLSAYGGGMDGPESVTCAFDCAVNLGYRGYAAKVI 114 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-----HE 334 DA + DG +A K +Y++ IR + Sbjct: 115 IWIADAPPHGFNIQY-DGYPNGCPCGIDFQEVVLKAIKNDIQIYSVACEPIRPIYRHFRD 173 Query: 335 FLRACA--SPNSFYLVENPHSMYDA-----FSHIGKDIVTKRIWYD 373 +RA A + F + + + D +G + V ++I D Sbjct: 174 LMRAVAMMTGGQFIALNSADCLADVIIGGSLEEVGMNFVIQKIEND 219 >gi|222101622|gb|ACM44016.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +LD IV+D S S+ ++ + I + ++ V L N ++ L T Sbjct: 39 QCKKQLDFSIVVDESASISNDQWEGQM------IPFLRNLIHTVDL---DNTDIRLSLTT 89 Query: 226 FSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQ 278 +S + F L+ S + + L + T + L Y I Sbjct: 90 YSTPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRALNYVRKAILPY----- 144 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ----GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|170592037|ref|XP_001900776.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591928|gb|EDP30531.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 817 Score = 50.2 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 64/201 (31%), Gaps = 23/201 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D +LD++ +LD S S+ + + + V LI Q ++ +S+ Sbjct: 38 DRQLDLLFILDGSGSVSG---------STFATQMAMLNKIVDLIEIGPKNTQIAVMQYSS 88 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++E F L+ ++ + +T + L A + N +D Sbjct: 89 YTRVEFNFSANPNKESLRASLQKIRHISGTTKTGKALDKALHVFCHDSNFGTRLNQDDVA 148 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSF 345 ++ +TDG + ++ G + +GI I E + N Sbjct: 149 QVAVV--VTDG-----HSHDDPIPAAMRLRQAGVEILTLGIGAHINMGELVEITGDQNLA 201 Query: 346 Y----LVENPHSMYDAFSHIG 362 + + F I Sbjct: 202 FQNLTSQASLDQFVHQFKKIA 222 >gi|71280576|ref|YP_269044.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146316|gb|AAZ26789.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 618 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 78/227 (34%), Gaps = 23/227 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S+ + + L +++ D+S S +S D+ ID+ S+ E ++ Sbjct: 406 SKTAATAEKGLFQSYRGNNRDLSCLLLADLSMSTDSHLDNDNRVIDVVQDSLLLFGEALQ 465 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAY 267 + +N G + F +L+ ++ + G T ++ A Sbjct: 466 SV--GDNFAMYGFSSVKRSNIRFTMLKNFNEKYNDHVRGRIQAITPGFYTRMGAAIRQAT 523 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAI 321 I + + K+++ +TDG+ + + NEAK+ G Sbjct: 524 KVISEQKTA-----------DKLLLILTDGKPNDIDHYEGRFGIEDTHQAINEAKRLGIK 572 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + I I V + E+L + F + P + + + ++ Sbjct: 573 PFCITIDV-DAQEYLPYLFGNDGFTQILRPAQLPLRLPQLYHQLTSQ 618 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 89/254 (35%), Gaps = 32/254 (12%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 + K P+ + + + P ++ + DA + ++LDVS SM + Sbjct: 1 MITRKRPVAGVCVLLAALTAGLVAFPAGAAADEPTGRDAP-KVNLLLDVSGSMRARDIDG 59 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-----------SNKIEEFFLLEWGV 240 +++ A ++ N +L+ +V +++ + + ++ L+ Sbjct: 60 QSRMAAAKQAFNEVLDATPK--EVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPLD--R 115 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + + L+ G T P L A + + G K IV ++DGE+ Sbjct: 116 TEAKAAVATLTPTGW-TPIGPSLLKAADDLEGGNGS------------KRIVLISDGEDT 162 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDA 357 D + AK G + +G+ +C A+ ++ VE+ + D Sbjct: 163 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTDR 222 Query: 358 FSHIGKDIVTKRIW 371 + + + Sbjct: 223 VNELVDRAAEPVVT 236 >gi|114587344|ref|XP_001172675.1| PREDICTED: similar to PK-120 precursor isoform 2 [Pan troglodytes] Length = 914 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 88/266 (33%), Gaps = 39/266 (14%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + Sbjct: 187 LETESTFMTNQLVDALTTWQNKTKAHIRFKP-------TVSQQQKSPEQQETVLDGNLII 239 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 240 RYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 293 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 294 REALIKILDDLSPRDQFNLIVFSTEASQWRPSLVPAS--AENVNKARSFAVGIQALG-GT 350 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N + A Q+ D + + I+ +TDG+ + + +S+ N ++ Sbjct: 351 NINDAMLMAV-QLLDSSNQEEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ--NNVRE 404 Query: 318 RGAIVYA---IGIRVIRSHEFLRACA 340 + Y+ +G S+ FL A Sbjct: 405 AVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|116625802|ref|YP_827958.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228964|gb|ABJ87673.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 326 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 60/160 (37%), Gaps = 27/160 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DA + +V D S SM+ + + ++ +K + LV FS Sbjct: 94 DDAPASVGLVFDTSDSMQPRMNKAHEAVEAL----------LKNANPADEFF---LVQFS 140 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + + R+ + + G ST + A ++ +R Sbjct: 141 DRARLVAGMTKDSEEISRRAASM-RIGGSTALLDAVAMAMEEMKSAHYLR---------- 189 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 K++V ++DG++ S++ L ++ +YAIGI Sbjct: 190 -KVMVIISDGDDNSSRCPVNDLK--RIVREGDVTIYAIGI 226 >gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS185] gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS185] Length = 772 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 69/197 (35%), Gaps = 32/197 (16%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + +VM + V+ ++Q ++++V+D S SM I A ++ L+ +K Sbjct: 371 NYSLVMVLPPKVEKSTQPSLPRELILVIDTSGSMAG------DSIVQAKNALLYALKGLK 424 Query: 211 LIPDVNNVVQSGLVTFSNKIE--EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 N ++ F++ + L +S ++ + L G T L Sbjct: 425 PEDSFN------IIEFNSSLSLLSATPLPATSSNLSRARQFVSRLQADG-GTEMALALDA 477 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVY 323 A + + + + ++FMTDG N D ++ Sbjct: 478 A------LPKSLGSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES------RLF 525 Query: 324 AIGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 526 TVGIGSAPNSHFMQRAA 542 >gi|295698034|ref|YP_003602691.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060146|gb|ADF64883.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 212 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 54/168 (32%), Gaps = 14/168 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D S SM I I AML ++ P V ++T+ N Sbjct: 3 RLPVYLVIDTSGSMRG------ESIHSVNVGIQAMLSALRQDPYALESVHISIITYDNGA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L + + T + L+ + D + Sbjct: 57 REFIPLT--PLEDFQFSDIVVPSAGGTFTGAALECLMQCVERDVRRSDGDTKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLR 337 + MTDG +++ + RG + A + HE L+ Sbjct: 113 VFLMTDGTPSDALAYGEAVK---AIRGRGFGSIIACAVGPKAGHEHLK 157 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 81/234 (34%), Gaps = 34/234 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + P S K+ + D++IVLD+S SM + SS +++ A + I + + Sbjct: 82 TLAGPSISRKKMTYLSSGA-DIVIVLDISPSMGAIEFSSKNRLEFAKELIEYFVYQ---- 136 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQI 270 GLV F+ + L +K+ Y+ G + G+ A + + Sbjct: 137 ---RENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL 193 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 KK ++ +TDG S + + N A+ +Y++GI Sbjct: 194 -----------KHSEAPKKSVIVLTDGVVNSDEVYKD--QVINLAQGLNVKIYSVGIGSD 240 Query: 331 RSHEFLRACASPNSF----YLVENPHSMYDA-------FSHIGKDIVTKRIWYD 373 S + V +P +++ F +G D K D Sbjct: 241 EELNVGFKLRSGKFYQGVLKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQD 294 >gi|210135186|ref|YP_002301625.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] gi|210133154|gb|ACJ08145.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] Length = 214 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 76/210 (36%), Gaps = 12/210 (5%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + ++LD S SM + T+I + I M+E +K + +VT Sbjct: 9 TMEERFIPVFLLLDTSGSMSHSL-GNGTRIGVLNLCIQKMIETLKQEAKKELFSKMAIVT 67 Query: 226 FS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F N + + IK ++ +S + L A+ D+ + Sbjct: 68 FGENGVNLHTPFDD--------IKNVNFEPLSASGGTPLDQAFKLAKDLIEDKD--TFPT 117 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 YK + ++DGE + K + + ++ + ++ ++I I + + + Sbjct: 118 KFYKPYSILVSDGEPNNDKWQEPLFNFHHDGRSAKSVCWSIFIGDREVNPQVNKDFGKDG 177 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + ++ + F + + I K Sbjct: 178 VFYADDVEKLVGLFEIMTQTISKGSASIKK 207 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 65/193 (33%), Gaps = 25/193 (12%) Query: 175 MIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEE 232 +++D S SM + TK + A ++ + GL F I E Sbjct: 82 YVIMDSSGSMNEVRCSGNRTKSEAAKTALAQFARITPKDAN------MGLAVFDAYGIAE 135 Query: 233 FFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G+ + + I ++ G T L Y ++ + R+ + + Sbjct: 136 RVPL--GLENRDKFIAAVNATAPGNGTPLHDALLLGYRRL--EETARRQAGYGEYH---- 187 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYLV 348 +V +TDG+ + + ++ +++ IG + H + P + Sbjct: 188 LVVITDGQ-AYPQNQDPTPVVAYILRQSPVVIHTIGFCIGTDHSLNQ----PGRTVYRAA 242 Query: 349 ENPHSMYDAFSHI 361 +NP + + Sbjct: 243 DNPRELQQGLEEV 255 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 72/175 (41%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++ + ++ +I LD+S SM + D +++D + I ++ +N+ + Sbjct: 79 GSKISHEKRNGIEAIIALDISNSMLAQ-DVQPSRLDKSKLMIENLINSF-----IND--K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + + + + ++ T+ ++ + + +G+ Sbjct: 131 IGLVVFAGEAYVQLPITSDYVSAKMFLSDITPNLISAQGTDIARAIRVSLSSFTQQKGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ +TDGE+ + +L EAK++G V+ +G+ + Sbjct: 191 -----------KAIILITDGEDN----EGGALEAVKEAKEKGVNVFILGVGDSKG 230 >gi|126334038|ref|XP_001370553.1| PREDICTED: similar to Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (Leu M5) (CD11c antigen) [Monodelphis domestica] Length = 1224 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 75/212 (35%), Gaps = 28/212 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 D+ ++ + S + + AM+ + + Q L+ Sbjct: 200 QKCPKQDTDIAFLI------DGSGSISSSDFQKMKNFVKAMISQFEKPS-----TQFSLM 248 Query: 225 TFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F++ KI F +R + +++ T + G+K N++F Q Sbjct: 249 QFASNFKIHFTFEKFKNSHDPRRLVDEITQLSGVTKTASGIKKVINELF------QKTRG 302 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRA 338 KI++ +TDGE + A+K G I YAIG+ S + L Sbjct: 303 ARQYATKILIVITDGEKYDDPLEYS--QVIPTAEKAGIIRYAIGVGEAFERPSSRQELEE 360 Query: 339 CASP---NSFYLVENPHSMYDAFSHIGKDIVT 367 AS + + V+N ++ + + + + I Sbjct: 361 IASEPSKDHIFWVDNFGALSNIQNQLKEKIFA 392 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 53/153 (34%), Gaps = 43/153 (28%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-------------- 299 T + GLK+ + + + + +D +K+I+ +TDG+ Sbjct: 264 ASGTVISEGLKWGRHVLTPEAPFTEGSSAKD--IRKVIIVLTDGDTEDGKCGGSYAINYT 321 Query: 300 --------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLR 337 K + L +AK+ G V+AI S + ++ Sbjct: 322 PNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKAKEAGIEVFAIRFGDSDSVDVSLMK 381 Query: 338 ACASP-----NSFYLVENPHSMYDAFSHIGKDI 365 + AS + +Y + + + D F IG+ + Sbjct: 382 SIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQL 414 >gi|8393899|ref|NP_059047.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Rattus norvegicus] gi|3024056|sp|Q63416|ITIH3_RAT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|602886|emb|CAA58233.1| pre-alpha-inhibitor, heavy chain 3 [Rattus norvegicus] Length = 887 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 72/176 (40%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 + V+DVS SM S KI +++ +L+++K +N ++ +G+ T+ + + + Sbjct: 285 IAFVIDVSGSM------SGRKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVK 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + + TN GL + + H E + I+V Sbjct: 339 ATP--ANLEEARAFVKNI-RDRSMTNINDGLLRGIEMLNKARED--HLVPERSTS--ILV 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N ++ A + +Y +G ++ FL + A N + Sbjct: 392 MLTDGDANTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFA 447 >gi|313239055|emb|CBY14037.1| unnamed protein product [Oikopleura dioica] Length = 1055 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 65/202 (32%), Gaps = 24/202 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ V + M S F + I +IN + ++ ++ F++ Sbjct: 777 GPRDVLFVAHYTTYMGSTFADISAFFENIISTIN--------VEPSDSSIRFAFSFFNHA 828 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDAN 286 EFF +W + + + N LK A + + G + +T Sbjct: 829 YIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGT- 887 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 +V +T + + ++ K++ V +G+ + L AS Sbjct: 888 ----VVLLT-----NAASTDEVNEMADQLKEKVDRVIVVGLGYAFGQDELAGIASSPTKE 938 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + Y E + I +I Sbjct: 939 NLYTAEESSDLAGLVKTIADEI 960 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 72/198 (36%), Gaps = 35/198 (17%) Query: 175 MIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 M+++D S SM + KID+ + + +L+ + GLV+F ++ Sbjct: 32 MLIVDGSGSMWGRLAPDNKPKIDVVREKLATILQTPSS-------TRVGLVSFGHRRRGD 84 Query: 230 ---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +E + + L + L+ G + L+ A + I + + Sbjct: 85 CNDVELIASPDSERAALLGPLAKLNPRGPGPVTA-ALEIAADAIGTSRPAQ--------- 134 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRAC---ASP 342 I+ + DG + ++ + + AK G V IGI V + AC A+ Sbjct: 135 ----IIIVGDGADNCQQDSCAAAN--DFAKSAPGVAVQVIGIGVPATERPRIACVAQATG 188 Query: 343 NSFYLVENPHSMYDAFSH 360 +Y V + + A Sbjct: 189 GRYYDVTDAAGLNAALDE 206 >gi|149034207|gb|EDL88977.1| inter-alpha trypsin inhibitor, heavy chain 3 [Rattus norvegicus] Length = 886 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 72/176 (40%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 + V+DVS SM S KI +++ +L+++K +N ++ +G+ T+ + + + Sbjct: 285 IAFVIDVSGSM------SGRKIQQTREALLKILDDMKEEDYLNFILFSTGVTTWKDHLVK 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + + TN GL + + H E + I+V Sbjct: 339 ATP--ANLEEARAFVKNI-RDRSMTNINDGLLRGIEMLNKARED--HLVPERSTS--ILV 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N ++ A + +Y +G ++ FL + A N + Sbjct: 392 MLTDGDANTGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLESLALENHGFA 447 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L ++ + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQI-----IELRSWVGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLTKRLPTAASGGTSICSGLRLAFTVI-----KKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAARELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|260834079|ref|XP_002612039.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] gi|229297412|gb|EEN68048.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] Length = 794 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 72/194 (37%), Gaps = 34/194 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTF 226 T +D++ ++ RS S D + L+ V +V+ + ++ + Sbjct: 191 TTQAVDLVFLIARPRSSVSNAD------------VKTFLKSVVAALNVSQTAARVAVIEY 238 Query: 227 SNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTN--STPGLKYAYNQIFDMQGMRQHCNT 282 ++ + F L +++ + I + +G ++ + +KY + + G Sbjct: 239 TSVMHSHFDLPTHLTNAQVTSAINSIPAWGSTSYRKTGSAIKYLTDYLSWRDG------- 291 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 K++V +TD + AK G I+ ++G+ S L A+ Sbjct: 292 ----IPKVLVVITD-----ATSNDYVSGPAQSAKNAGLILSSVGVGTSISSTELNTIATN 342 Query: 343 NSF-YLVENPHSMY 355 +S+ Y V + + Sbjct: 343 SSYRYTVSSYTDIV 356 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 80/213 (37%), Gaps = 24/213 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + ++++LDVS SM+ ID K++ +ML K + ++ ++TF Sbjct: 11 EEPKSIPVVLLLDVSYSMQG------ENIDTLNKAVESMLNSFKKAETMETFIKLSIITF 64 Query: 227 SNK--IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ ++ L ++ + ++ + + + A+ M + D Sbjct: 65 GSENGVDLHTPLT--------EVSKIDFKPLTVSGSTPMGAAFKMGKAMIEDKDIFKGRD 116 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACAS 341 IV ++DGE Q + + + + A+ I + F+ C Sbjct: 117 YRPT--IVLLSDGEPNDD-WRQPLDDFVSTGRTKKCDRMALAIGAADKTVLNMFIEGC-- 171 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 NS + E+ ++ D F I + + +K Sbjct: 172 ENSLFYAEDAENIIDEFKKITMSVTQRTKSVNK 204 >gi|114587340|ref|XP_516522.2| PREDICTED: similar to PK-120 precursor isoform 5 [Pan troglodytes] gi|114587342|ref|XP_001172703.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 4 [Pan troglodytes] Length = 930 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 88/266 (33%), Gaps = 39/266 (14%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + Sbjct: 187 LETESTFMTNQLVDALTTWQNKTKAHIRFKP-------TVSQQQKSPEQQETVLDGNLII 239 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 240 RYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 293 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 294 REALIKILDDLSPRDQFNLIVFSTEASQWRPSLVPAS--AENVNKARSFAVGIQALG-GT 350 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N + A Q+ D + + I+ +TDG+ + + +S+ N ++ Sbjct: 351 NINDAMLMAV-QLLDSSNQEEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ--NNVRE 404 Query: 318 RGAIVYA---IGIRVIRSHEFLRACA 340 + Y+ +G S+ FL A Sbjct: 405 AVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|297671393|ref|XP_002813835.1| PREDICTED: collagen alpha-1(VII) chain-like [Pongo abelii] Length = 2889 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 63/201 (31%), Gaps = 38/201 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ +I ++ + V+ V +S+ Sbjct: 38 DIVFLLDGSSSIG---RSNFREIRSFLEGLVLPFSGAASAQG----VRFATVQYSDDPRG 90 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +W G + + A + K+ + Sbjct: 91 ----QWASDASDSF------EGTGLSINVLMIRA---------ILSSVTQARDCVPKVCI 131 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVEN 350 +TDG K K +G ++A+GI+ E R + + + F+ V + Sbjct: 132 LITDG-----KSQDLVDTAAQRLKGQGIKLFAVGIKNADPEELKRVASQPTSDFFFFVND 186 Query: 351 PHSMYDAFSHIGKDIVTKRIW 371 + +V++R+ Sbjct: 187 FSILRTLL-----PLVSRRVC 202 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 60/174 (34%), Gaps = 20/174 (11%) Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ---RKI 247 + + + + + ++ + + VQ GL+++S++ F L G L ++I Sbjct: 1051 NAHRAEATRRVLERLVSALGPLGP--QAVQVGLLSYSHRPSPLFPL-NGSHDLGIILQRI 1107 Query: 248 KYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + N + AY + + + ++V + D Sbjct: 1108 RDMPYMDPSGNNLGTAVVTAYRYVLAPDAPGR-----RQHVPGVMVLLVD-----EPLRG 1157 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDA 357 EA+ G V +G+ + R S +F+ V++ S+ A Sbjct: 1158 DIFSPIREAQAAGLNVVMLGMAGADPEQLRRLAPGMDSVQTFFAVDDGPSLDQA 1211 >gi|156084610|ref|XP_001609788.1| thrombospondin-related anonymous protein [Babesia bovis] gi|154797040|gb|EDO06220.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 660 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +LD IV+D S S+ ++ + I + ++ V L N ++ L T Sbjct: 39 QCKKQLDFSIVVDESASISNDQWEGQM------IPFLRNLIHTVDL---DNTDIRLSLTT 89 Query: 226 FSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQ 278 +S + F L+ S + + L + T + L Y I Sbjct: 90 YSTPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRALNYVRKAILPY----- 144 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ----GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|315231887|ref|YP_004072323.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] gi|315184915|gb|ADT85100.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] Length = 1614 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 27/167 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ ++D S SM+S+ I+ S +++ I N V+ LVTF+N Sbjct: 90 PIDVVFIIDRSDSMDSY-------IEAIKNSAYQFSYDLERIGGEN--VRFALVTFANYD 140 Query: 231 EE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L VS + + G + S G+ A + F+ N + Sbjct: 141 DARIDLPLTNNVSEFVEALNSIYTAGGTEWSFGGILKALDLEFN------------PNAQ 188 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+ + +TD ++ S + N G ++ A+ + Sbjct: 189 KVFIVVTDEDDQSPYSVE---EVANNLTAEGVLL-ALVYNHRDGQKL 231 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 65/196 (33%), Gaps = 27/196 (13%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPD 214 PIT V + + + +LD S+S+ ++ + + S+N + Sbjct: 460 PITPPVVIQDCSAN---VCFLLDGSKSLTNVDGWNDVVASARSIMYSLNDPAAVFDVFWF 516 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N+V + G T + S + + + T + + + Sbjct: 517 SNDVEKIGHAT-------GAEVAANNSFVSMVVNT-TPDAHGTWMAQAITTCQGVLLE-- 566 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ED + IV +TDG+ + + + AK RG + +G I + Sbjct: 567 --------EDTQASRTIVLITDGKPTDPQP---TFEAADAAKARGIKMVVVGAGEID-YA 614 Query: 335 FLRACASPNSFYLVEN 350 L+A AS F Sbjct: 615 TLKALASGPQFVFANT 630 >gi|12230718|sp|Q28833|VWF_PIG RecName: Full=von Willebrand factor; Short=vWF; Flags: Precursor gi|2981067|gb|AAC06229.1| von Willebrand factor precursor [Sus scrofa] Length = 2482 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 69/207 (33%), Gaps = 28/207 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L ++++LD S S+ + + D A + + + P + Q ++ + + Sbjct: 1357 QPLGVVLLLDGSSSLPASY------FDEMKSFTKAFISKANIGP---QLTQVSVLQYGS- 1406 Query: 230 IEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L W +HL+ + + + G + L +A + H +A Sbjct: 1407 -ITTIDLPWNMPLEKAHLRGLVDLMQREGGPSQIGDALGFAVRYVMSQ----VHGARPEA 1461 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + K +++ +TD A+ V+ IGI LR A P + Sbjct: 1462 S-KAVVIVVTD------TSTDSVDAAAAAARSNRVAVFPIGIGDRYDEAQLRTLAGPGAS 1514 Query: 346 YLVENPHSMYDA--FSHIGKDIVTKRI 370 V + D +G + K Sbjct: 1515 SNVVKLQRIEDLPTLVTLGNSFLHKLC 1541 >gi|316969873|gb|EFV53908.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 532 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 73/207 (35%), Gaps = 25/207 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ V D S+S++ F+ + + M++ + P+ V GL+ +S + Sbjct: 40 PVDVVFVFDSSQSIKKTFEHQ-------KEVVIGMVKRMSFRPNAGLV---GLIEYSRHV 89 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ + K + L G T + +K Q+ Sbjct: 90 STLLPLQSPQSADQVTSKFQSLKHIGAYTLTADAVKIGVEQL----------RYGRPGVD 139 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYL 347 K+ V +TDG + + D + + GA V I I E + + + Sbjct: 140 KVFVLVTDGHSFNKWTDL--VKVADSLHDTGAKVIIIAIAEDLYMPELHEYAGAHGTIFT 197 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWYDK 374 +N + + F+ + D + +K Sbjct: 198 RQNVSEVENYFAALVHDKCETEVVAEK 224 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 56/162 (34%), Gaps = 26/162 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M ++D+S SM + D + ++ +A + + ++ + + + GL F+ + Sbjct: 92 VMFLVDISNSMNAQ-DVAPDRLSLAKNIV---ISSMQKMTND----RVGLAVFAGEAFSV 143 Query: 234 FLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + L T+ ++ A ++ + Sbjct: 144 MPLTTDYLAAESFVSGLETSVVSTQGTDFYKAMQVAVSKFKAVSKGSGR----------- 192 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 IV ++DGE+ E A+ G V +G+ Sbjct: 193 IVLISDGEDNEGNEAAAIKE----AQSNGIQVITVGVGTEEG 230 >gi|297158891|gb|ADI08603.1| toxic cation resistance protein [Streptomyces bingchenggensis BCW-1] Length = 743 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 44/259 (16%), Positives = 92/259 (35%), Gaps = 46/259 (17%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + A + + ++ R + +VLD S SM S+ Sbjct: 497 NAKATDEGDAAIPLSDVETRAPGLVSLYKSAADALAQKGLAGRRAAVYLVLDRSGSMRSY 556 Query: 188 F-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + D ++ + +++A L++ +P +V FS I+ ++ +++ + Sbjct: 557 YKDGTVQHLAEQALALSANLDDDGTVP---------VVFFSTDIDGTADVD--LANYSGR 605 Query: 247 IKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I+ L TN +K + H A ++F TDG S Sbjct: 606 IEELHSSLGHMGRTNYHHAIKAVID----------HYQASGATDPAFVIFQTDGAPTSKA 655 Query: 304 EDQQSLYYCNEAKKRGAIVY--AIGIRVI--RSHEFLRA---CASPNS-------FYLV- 348 +++L C A+ ++ IG + +FLR A P+ F+ Sbjct: 656 AAEKAL--CEAAR---LPIFWQFIGFGDPEAKGFDFLRKLDVLAVPDKRVVDNAGFFHAG 710 Query: 349 ENPHSMYDAFSHIGKDIVT 367 ++P ++ +A G+ +V Sbjct: 711 QDPRALSNA-ELYGQLMVE 728 >gi|262196568|ref|YP_003267777.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] gi|262079915|gb|ACY15884.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] Length = 564 Score = 50.2 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 65/171 (38%), Gaps = 13/171 (7%) Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 LE ++ +P VV +V + +++ + V+ +R I+ L +ST L Sbjct: 98 LEFLQKLPRDARVV---VVGYDDEVHASRRVGD-VARARRDIEALEINPLSTE--LQLIE 151 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A N+ D R E +K+I ++DG + + A + ++ I Sbjct: 152 AVNRARDTL-ARLEPEREGVPMRKLIAVVSDGRDADPSP-ENYRRVAKRAARNDIRIHTI 209 Query: 326 GIRVIRSHEFLRACA-----SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G R+ L A S +F LV + F + ++I + + Sbjct: 210 GFPADRNRYPLYGLAEMSKQSEGTFRLVLTESAFGSHFGQLAREINEQYVL 260 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 49/130 (37%), Gaps = 16/130 (12%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ- 64 +NI + F + KG + AI L +I + ++V I F K L + +D + + A + Sbjct: 1 MNIPSLFKSKKGTTFVFFAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQEL 60 Query: 65 --------------IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 I +G N + + +K T + + N F N+ D+ Sbjct: 61 VTNVSNTNNVVNNYIAKNNSGLNESTIIVNESERSVKVTSSKTVENNFAK-VFGNNSQDV 119 Query: 111 VRSTSLDIVV 120 + + + Sbjct: 120 IATAKAKVEN 129 >gi|156382210|ref|XP_001632447.1| predicted protein [Nematostella vectensis] gi|156219503|gb|EDO40384.1| predicted protein [Nematostella vectensis] Length = 8745 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 40/221 (18%), Positives = 76/221 (34%), Gaps = 14/221 (6%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + I +VK SQ LD+ + S + D++ K+ IK++ + Sbjct: 4787 TDKLYKQIGEAVKEESQPKPSLDIGFAISASSA---DSDANFAKMKNIIKAVLDIYGTEG 4843 Query: 211 LIPDVNNVVQSG--LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + Q+ +V S E L + L+R I ++K T+ L Sbjct: 4844 VHYTIVTFGQTAKTVVQNSKGAELRAGLPTLATELKRLIDRITKNPEGTSIVSALNRIKQ 4903 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + R N KK ++ MTD + + + G V +G+ Sbjct: 4904 SFKEATSSR-------PNTKKTLIIMTD--SNHEDGVDDVIIAMEGVEDDGIQVLYVGVD 4954 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E L+ A ++ + + + D I + I R Sbjct: 4955 DSVDLEELKIVAPDDNHVISADGRTPGDVAEQIARRINATR 4995 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 54/141 (38%), Gaps = 13/141 (9%) Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY---NQIFD 272 ++ GL+++ + + G+++ + ++ T S L+ A ++F+ Sbjct: 1652 AERMRYGLMSYGDDVTTTIFFTDGITNPNSLLPFIDIL-QKTPSGSSLEKALVGGKKLFE 1710 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G R + K +V +TD ++ ++ + AK G V I + Sbjct: 1711 HGGTRPGVD-------KFLVVITDRKSTGDAAQTENAK-ADLAK-AGVNVLVIAVGSEVD 1761 Query: 333 HEFLRACASPNSFYLVENPHS 353 H+ L A+ S LV P Sbjct: 1762 HKELNETATTPSHKLVTAPGE 1782 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 74/205 (36%), Gaps = 28/205 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--I 230 DM+ L + S ++ + +I ++++ ++ G++ F I Sbjct: 5728 DMLFAL-SAGS-----GDPVSNLARMKNTIKSIIDRYGSTS-----IRYGVIVFGGVPVI 5776 Query: 231 EEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + F E+ S ++R + +++ ++ L+ A + + + Sbjct: 5777 KMGFEKEYRSDSDIKRYLDSINREKQGSSLHRSLEEA---------RKVFVQQGRPDAQH 5827 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 ++V MTDG++ S ++ ++ N + + + LR S + + +V Sbjct: 5828 VLVIMTDGQSESLPDEIETAR--NPLDQMNVTIIPVAFGDESDPAQLRVATSDDRYLIVS 5885 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYDK 374 +G+ I+ + K Sbjct: 5886 KTTVDPS---ELGQQIMDRLSAESK 5907 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 67/196 (34%), Gaps = 25/196 (12%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 P T + R + I + ++ T ++D+ L+ ++ + Sbjct: 2366 YPEDIVTVAKPFYPPRLVDTLIRKAQQIRPST-PQIDLAFALNSDG------PRAVETFE 2418 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKF 253 I +++ + ++ L+T+ N E + + L+ + + Sbjct: 2419 YMKDVIKSVINKY-----GAYKLRYALITYGNSAAERISFDDQISNTTALKHMLNQTERS 2473 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +N + A Q+F +R N +K++V MTD + +T + + Sbjct: 2474 RGGSNIGVAI-AALKQLFASDAVRS-------NARKVLVIMTD--SQATGDLNAAKAAAK 2523 Query: 314 EAKKRGAIVYAIGIRV 329 E + G V + + Sbjct: 2524 ELENDGIKVITLALGP 2539 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 59/159 (37%), Gaps = 17/159 (10%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM V++ + + S+ K+ +K+I ++ + V + V + + Sbjct: 6458 PEVDMAFVIN---ARSTNSASNFEKMKDTVKAIIEEYGSRQIKYSLIQVGDTPSVELNFR 6514 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 G L+R I + + + L+ A QG R KK Sbjct: 6515 NTFP-----GDDLLKRYISDILQDSGGSALDLALEEARKVFDPAQGARPKA-------KK 6562 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++V +TD ++ ST D +++ + G V + + Sbjct: 6563 VLVIITDKKSDSTVSDIKAVT--RSLQDDGVRVIIVALG 6599 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 43/149 (28%), Gaps = 17/149 (11%) Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV-------SHLQRKIK 248 + + ++ + + G + +S W S L+ I Sbjct: 3517 AATADADVNFMHMKDVLLSIIDKYSMGKIRYSVLSYGASQEVWTRLNDDFPDSQLKSVIN 3576 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 +S+ + + N Q A KK++V TD + S +D Sbjct: 3577 SISRPFGEPSLHKVMDRVRNLFTSGQ--------PRALAKKVLVLFTDKRSSSNPDD--V 3626 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + G V AIG+ S L Sbjct: 3627 VNSAKALADAGVHVIAIGLGSEASRPELS 3655 >gi|114587338|ref|XP_516521.2| PREDICTED: inter-alpha (globulin) inhibitor H3 isoform 2 [Pan troglodytes] Length = 865 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 260 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 313 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 314 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 366 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 367 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 422 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 9/99 (9%), Positives = 28/99 (28%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 F + + ++ AI L + +G ++ + + + + +D +++ + + Sbjct: 3 PRFLRDRSANVAVIFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAGVLEPTDA 62 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 + N G N Sbjct: 63 AKIARASAAFTANFQPSWSAATASFTVNGGELGGSANST 101 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 88/241 (36%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW T++ + + + V + +++ ++DVS SM + K+ + S Sbjct: 199 LAPTPWNTDTLLLRIGVAGR-DVPTAALPPANLVFLVDVSGSM-----GAPDKLPLLQSS 252 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNS 259 + ++ +++ + LVT++ G + I L + G T Sbjct: 253 LKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSL-QSGGGTAG 305 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY + +R N I+ TDG+ N+ + Q E ++ Sbjct: 306 ASGIELAY-KAAQQAYLRGGINR--------ILLATDGDFNVGVTDFDQLKGMVAEKRRS 356 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP--------HSMYDAFSHIGKDIVT 367 G + +G ++ L A A ++ +++ H + + I +D+ Sbjct: 357 GVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKI 416 Query: 368 K 368 + Sbjct: 417 Q 417 >gi|167525226|ref|XP_001746948.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774728|gb|EDQ88355.1| predicted protein [Monosiga brevicollis MX1] Length = 2718 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 59/157 (37%), Gaps = 15/157 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +++ D+++V+DVS SM + ++ A + + E + + + Sbjct: 2338 SSHAEYAEAADVLMVVDVSGSMTDY-------MEQARAFVRTIAREGFHLDSAASQHRMA 2390 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHC 280 L TF S + + +++ V+ TN P L+ + D++ Sbjct: 2391 LFTFGTTATALGSEPLFTSDWAQLQQRVAQIAVNGATNYLPALELVEQSLRDLKASDPAR 2450 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++I++F TDG N Q+ A++ Sbjct: 2451 YNAS---RRIVLFQTDGSNSDRN---QTRAITATARR 2481 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 48/267 (17%), Positives = 98/267 (36%), Gaps = 48/267 (17%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSI-SAISRYKIPLK--FCTFIPWYTNSRHIVMPITS 160 V+ D R+ SL VP ++ + + + K+P F I T V P T+ Sbjct: 288 VSQDGDQARTISLKADTVPADKDFELTWKAAPGKMPSAGLFREVIDGKTYLLAFVTPPTA 347 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +++ V+D S SM S I+ A +S+ + ++ + Sbjct: 348 PDTAAPPAKR--EVVFVIDNSGSM------SGPSIEQARQSLALAISKLNPDD------R 393 Query: 221 SGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ F + + ++F ++ L+ G T P L+ A Sbjct: 394 FNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLTADG-GTEMLPALQAA--------- 443 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ + +VF+TDG +++ L+ A + A V+ +GI Sbjct: 444 -LRNQGPVATGALRQVVFLTDG----AIGNERQLFQEITANRSDARVFTVGIG------- 491 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIG 362 ++PN++++ + F+ IG Sbjct: 492 ----SAPNTYFMTKAAEIGRGTFTAIG 514 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 49.8 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 39/232 (16%), Positives = 78/232 (33%), Gaps = 23/232 (9%) Query: 143 TFIPWYTNSRHIVMPITSSV----KVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDM 197 +P + + +P + K ++ ++++V+D S SM+ +K D+ Sbjct: 376 GLLPGEPKT-TLSLPSDQILDLILKTWAELRKPANVLLVIDRSGSMQQTVPGTGKSKGDL 434 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHLQRKIKYL---SKF 253 A ++ L E + V V S + +E L + ++ Sbjct: 435 AKEAAAEALAEFRGQDQVGLWVFSAARRQGERDWQEVVPLGRMTEAHRSLLRERLLGLTL 494 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 T AY ++ G R+ +V MTDG+N Sbjct: 495 SGGTGLYNTTAAAYEKMT---GSRRGDAINA------VVVMTDGKNERPGGLDLDGLIAK 545 Query: 314 --EAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHI 361 ++ V+ IG LR A+ + Y +P+++ D F+ + Sbjct: 546 LGARREESVRVFTIGYGEDADQNVLRRIAEAADGAAYDSSDPNTIGDIFTEV 597 >gi|84996511|ref|XP_952977.1| thrombospondin-related protein [Theileria annulata strain Ankara] gi|71532874|emb|CAJ20069.1| thrombospondin-related protein, putative [Theileria annulata] Length = 606 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 48/231 (20%), Positives = 84/231 (36%), Gaps = 35/231 (15%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 K +S LD+ I++D S S+ I + + I + +++ + + P N V Sbjct: 199 GIKKPSSYCHRELDLTILVDESSSIY------IEEWNKLIPFLKSLVRSINISP---NYV 249 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIK----YLSKFGVSTNSTPGLKYAYN-QIFDMQ 274 +VTFS + W +S L K L+ NS P Y + Q + Sbjct: 250 HLSMVTFS------TSIRWLISFLDPASKDEQLALAVLDKLKNSKPVFGYTFTGQALNFI 303 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N K I+ +TDG + T Q+ + G + +G+ + E Sbjct: 304 SEAVYMFGARRNSPKGIIIITDGSSTQTNVTSQASAL---LRDAGVTILVVGVGKAKESE 360 Query: 335 F--LRACASPNS--FYLVENPHSMY--------DAFSHIGKDIVTKRIWYD 373 + C++ + + N + + I KD V K IW D Sbjct: 361 CRGIVGCSTKGECPLFFMTNWDEIIRKVGELMAEVCETIPKDAVCKPIWSD 411 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 84/252 (33%), Gaps = 26/252 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV----- 119 N+ K K D R+ FV ++++ Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 120 ---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S +K ++ P + + + + S + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIKHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSM------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +D+M+V D+S SM E + +KI + + + + ++ + N + G+ Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 226 FSNKIEEFFLLE 237 F+ E + Sbjct: 247 FAMGAEHPKENK 258 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 65/183 (35%), Gaps = 7/183 (3%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R+ + G + ++ AI P++ +G+ +E + + + L D ++ A+ + Sbjct: 15 FRSLAKDQGGSVAVIAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAAS---VR 71 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G+ R ++ + + S G + S+++ Sbjct: 72 YRAGDARALMETAALRSARVTGYQPSI--GTITTGVQAGSTAGSGTVSVELTETHSRLFS 129 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMP--ITSSVKVNSQTDARLDMMIVLDVSRSME 185 S+ + + + + + + + +V V TD +L+ V+ S + + Sbjct: 130 SVFTTDPVVLSARAVAELKGGSRACVLALSPTAPGAVTVTGSTDVQLNGCSVVSNSNASD 189 Query: 186 SFF 188 +F Sbjct: 190 AFL 192 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 56/151 (37%), Gaps = 13/151 (8%) Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQI-------FDMQGM 276 F + ++ +S + L +TN+ G+ + + + + Sbjct: 305 FDDPDCGVSPIQAHLSTRADYRQALDTLYAAFNTNTAEGVMWGWRLLSPEWQGRWRQGAA 364 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 E + +KI+V +DGE+++ D++ L C E K++G +Y + Sbjct: 365 ALPRPYELQDNRKIMVLFSDGEHMTEAALRDRKQLLLCREMKRKGIQIYTVAF--EGDTR 422 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 F+ CAS S ++ + + I Sbjct: 423 FVAQCASDRSLAFKATKSNIRTVLTRLASSI 453 >gi|313903839|ref|ZP_07837228.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313466027|gb|EFR61552.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 1151 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 65/194 (33%), Gaps = 33/194 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + + + +V+D S SM K+ MA+++ + + + + +V Sbjct: 463 NRKNLPTVALTLVIDRSGSMAGL------KLQMAVEAARRVAQLLTPAD------RLAVV 510 Query: 225 TFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F ++ LE + R ++ T+ GL A + ++ +H Sbjct: 511 LFDSQAYVTRPLEPVRNPGEVDRAFPAAAQ--GGTSLGSGLAAALPLMEGVKADVRH--- 565 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 ++ +TDG E +++G + A+ I L A Sbjct: 566 --------VIALTDG----VSEPFDVTGLARAFRRQGVTLSAVAIGPDADRNTLAQLARE 613 Query: 341 SPNSFYLVENPHSM 354 + Y +P + Sbjct: 614 GGGALYEAADPGQL 627 >gi|114587336|ref|XP_001172570.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 isoform 1 [Pan troglodytes] Length = 890 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|257453795|ref|ZP_05619073.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] gi|257448722|gb|EEV23687.1| von Willebrand factor type A domain protein [Enhydrobacter aerosaccus SK60] Length = 260 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 65/160 (40%), Gaps = 8/160 (5%) Query: 175 MIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++VLD+S SM + +IDM + I A ++ N V+ +V + Sbjct: 45 VLVLDLSGSMAIRSGNGDKRRIDMLNEGIEAFYHDLMKDETARNRVRLAIVIVGGVNDTA 104 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK-IIV 292 L+ + + G+ T G+ A N I + R + NY + ++ Sbjct: 105 ELMMDWTDAIDFFPIKFRENGM-TPLGQGMLLALNLI---EQERINLRDNGINYTRPWVI 160 Query: 293 FMTDGENLSTKED-QQSLYYCNEA-KKRGAIVYAIGIRVI 330 MTDG +++ Q ++ C++A + I+Y I I Sbjct: 161 AMTDGLPTDSQDVWQAAINQCHQAEQNNQCIIYPIAIDAG 200 >gi|222101616|gb|ACM44013.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 29/170 (17%) Query: 167 QTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q +LD IV+D S S+ ++ + I + ++ V L N ++ L T Sbjct: 39 QCKKQLDFSIVVDESASISNDQWEGQM------IPFLRNLIHTVDL---DNTDIRLSLTT 89 Query: 226 FSNKIEEFFL-LEWGVSHLQRKIKYLSKFGVS------TNSTPGLKYAYNQIFDMQGMRQ 278 +S + F L+ S + + L + T + L Y I Sbjct: 90 YSTPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRALNYVRKAILPY----- 144 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N K ++ +TDG + + + + G V IG+ Sbjct: 145 ----GRKNVPKALLLITDG---VSSDGSYTAQVAAMLRDEGVNVMVIGVG 187 >gi|304314705|ref|YP_003849852.1| cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] gi|302588164|gb|ADL58539.1| predicted cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] Length = 663 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 65/201 (32%), Gaps = 27/201 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 AR +++V+D+S SM S K I +E+ + D + +V F Sbjct: 474 GARASIVLVVDISGSMFSE-----KKAARVKGLIERFIEDAQRHKD-----RISVVGFRG 523 Query: 229 KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + S + + + + G +T G+K + + Y Sbjct: 524 RDAKVIIPSTARASSFRDTVDSI-RVGGTTPMAQGIKRGLEIL--------REEKRHSEY 574 Query: 288 KKIIVFMTDGENL---STKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLR--ACA 340 +V ++DG ++++ ++ I + A A Sbjct: 575 VPFMVILSDGMPNVGVERNPKREAVEAAARLREEDIPSAVINFEQGSRGGRDLNMEIALA 634 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 S S+Y + + A I Sbjct: 635 SGGSYYDLHDLEDPSMAVPRI 655 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 41/252 (16%), Positives = 86/252 (34%), Gaps = 31/252 (12%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 + + N +IS + + T + ++ I + + +D+ Sbjct: 67 QGYMTAMKYNLKDNISIQASSHTLMNQQNAALMITIKSNDILLI--NQRGQECVRQGVDL 124 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S SM+ KI + K++ ML ++ + L+ F K+ Sbjct: 125 VCLIDHSGSMQG------EKIKLVRKTLKQMLTFLQPCD------RLCLIMFDCKVYRLT 172 Query: 235 LL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + V + I L G T+ G+K A + + R++ N A Sbjct: 173 RLMRVTQENVQKFRVAISSLQARG-GTDIGNGMKMALSIL----KHRKYKNPVSA----- 222 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 I ++DG + +E + + + G + + A FY V Sbjct: 223 IFLLSDGVDEGAEERVRDDLIQYNIRDSF-TIKTFGFGRDCCPKIMSEIAHYKEGQFYFV 281 Query: 349 ENPHSMYDAFSH 360 N ++ + F+ Sbjct: 282 PNLTNIDECFAE 293 >gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 85/261 (32%), Gaps = 48/261 (18%) Query: 103 FVNDIDDIVRSTSLDIVVV--PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 V +++ ++ P +E +I +I K W I Sbjct: 171 SVRGTINVIDPHNVIFATKEFPNDESITIET----QIKDKDNNIAIWSDGYIAISTFTYF 226 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 KV+S ++ ++D S SM S + I A +N + + + + Sbjct: 227 ETKVHSNSE----FYFIIDCSGSM------SGSCIQNAKLCLNIFMHSLPIG------CR 270 Query: 221 SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ F + E VS +++ + T+ LKY Sbjct: 271 FSIIKFGSDYEVALHPCDYTDENVSEAMKQLNNIDAEMGGTDILSPLKYVME-------- 322 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHE 334 T + K + +TDG+ D + C A++ +++IGI + Sbjct: 323 ----LTPKQGFIKQVFLLTDGQ------DSNTNELCALAQENRTNNRIFSIGIGSGADKD 372 Query: 335 FLRACA--SPNSFYLVENPHS 353 + + S ++ V++ S Sbjct: 373 LIINVSQKSGGNYVFVDDDES 393 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 43/112 (38%), Gaps = 4/112 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + ++ L + V + ++++H+ KT L + +D + + AAT I++ Sbjct: 10 KKQQGLVAVMITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAAT-ILDNSKDK 68 Query: 73 NRKKLKGGDILCRIKN---TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + G L + + F + ND D + + D Sbjct: 69 DAVDAEVGTALNAMAASTGNQEIDFSTASISIDYSNDPKDFTGTATFDSTDD 120 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 65/167 (38%), Gaps = 18/167 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES------FFDSSITKIDMAIKSINA 204 + I++ T ++N Q ++ +V+D S SM+ + ++I++ID+ ++ Sbjct: 702 GKDILIGDTGGTELNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKN 761 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNST 260 + + +K NV ++ F +E +S L KI L G + Sbjct: 762 LADSLKGHDGKINV---SIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTNYED 818 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 LK FD Q + N F+TDG+ + D + Sbjct: 819 AFLKT--TSWFDTQSVTYGKAQGYENLT---YFLTDGDPTFSNRDTK 860 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 45/276 (16%), Positives = 97/276 (35%), Gaps = 24/276 (8%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G + I+ A+ + + G ++ + ++T L+++ D + + A T M + K Sbjct: 19 GNVAIIFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSATLAATTPAMMQQTTATAKT 78 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + ++ +++ N + TS V + +++ + Sbjct: 79 VATSMFAAQVAQINRLTY-------NSANLTVTVNDDTSASPV-----KTRTVTVSYLAQ 126 Query: 137 IPLKFCTFIPWYTNSRHIVMPI-TSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSSITK 194 + F +F H+ I T + T +D +VLD S SME + + Sbjct: 127 VGNAFGSF-------YHVPTSIFTVKASSTASTARNIDFYLVLDNSPSMELPATTAGLAS 179 Query: 195 IDMAIKSINAMLEEVKLIPDVN-NVVQSGLV-TFSNKIEEFFLLE-WGVSHLQRKIKYLS 251 + A + A E P+ G + +++ L V +++ S Sbjct: 180 MTAATGCVFACHENTYSDPENTVQYPGYGTIDSYTYAKNAGIALRIDNVREAAKRLASTS 239 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + +S N AY +D ++ +T AN Sbjct: 240 QAMMSANGATYRLAAYAFNYDTTQLQALTSTTSANV 275 >gi|114587346|ref|XP_001172688.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Pan troglodytes] Length = 900 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 41/266 (15%), Positives = 88/266 (33%), Gaps = 39/266 (14%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + Sbjct: 187 LETESTFMTNQLVDALTTWQNKTKAHIRFKP-------TVSQQQKSPEQQETVLDGNLII 239 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 240 RYDVDRAVSGGSIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 293 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 294 REALIKILDDLSPRDQFNLIVFSTEASQWRPSLVPAS--AENVNKARSFAVGIQALG-GT 350 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N + A Q+ D + + I+ +TDG+ + + +S+ N ++ Sbjct: 351 NINDAMLMAV-QLLDSSNQEEQLPEGSVSL---IILLTDGDPTVGETNPRSIQ--NNVRE 404 Query: 318 RGAIVYA---IGIRVIRSHEFLRACA 340 + Y+ +G S+ FL A Sbjct: 405 AVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|221128911|ref|XP_002157051.1| PREDICTED: similar to tenascin C, partial [Hydra magnipapillata] Length = 2678 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 61/195 (31%), Gaps = 24/195 (12%) Query: 142 CTFIPWYTNSRHIVMPITSSVKV--NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 C NS +V T + +S + + +IV+D S S+ ++ + Sbjct: 2007 CGNCSQCDNSVTVVDSRTCNNVPCDSSCRSSMISYIIVIDSSSSVTDYYWG------LER 2060 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVST 257 + V + GLV F++ F + I +S T Sbjct: 2061 NFVKRFAASVGFTSNTT----FGLVNFASTARTEFGCRSFTDATSFNAAIDSVSMITGGT 2116 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 GL A N + + KK+++F TDG+ K+ + + Sbjct: 2117 AINAGLTQAINLM----------QSPACRGKKVLLFTTDGQENIEKDPSKISATYETVRS 2166 Query: 318 RGAIVYAIGIRVIRS 332 +Y + + Sbjct: 2167 LANYIYIPYVNNPEA 2181 >gi|221115448|ref|XP_002154505.1| PREDICTED: similar to polydom [Hydra magnipapillata] Length = 2514 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 62/192 (32%), Gaps = 26/192 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++DVS S+ + + ++++L ++ + P + +VTF I + Sbjct: 166 DIIFLIDVSGSISDDG------FNTEREFVSSLLSKISVQPSAA---RIAVVTFGRDINK 216 Query: 233 FFL------LEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L+ + K + TN L+ A + D ++ Sbjct: 217 DIDYIDYGYLDKNKCTFNEEFKRVKHRKEGWTNINGALQKA-KALLDSANEKKFK---RH 272 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRV-IRSHEFLRACASP 342 N + V +TDG + +++IG+ + + Sbjct: 273 NVNTVAVLLTDG---GWNYGGSPYDTATNLRTGFHYVDIFSIGVGHWLDRKQLKNIAGKE 329 Query: 343 NSFYLVENPHSM 354 + + ++ Sbjct: 330 ENVIIAKDFSDF 341 >gi|162447313|ref|YP_001620445.1| surface-anchored VWFA domain-containing protein [Acholeplasma laidlawii PG-8A] gi|161985420|gb|ABX81069.1| surface-anchored VWFA domain protein [Acholeplasma laidlawii PG-8A] Length = 486 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 115/296 (38%), Gaps = 32/296 (10%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS---LDIVVVPQNEGYSI 129 +++ ++ I + N N+S F+ + R+ + I + Y+ Sbjct: 43 HQEIIENPFIDVSVNNKSNISLSANTASYSFIRSQINSGRAVDRNAVRIEEMVNFFNYNY 102 Query: 130 SAISRYKIPLKFCTFIPWYT-NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + F + + N+ ++ I K D +++I+LDVS SM Sbjct: 103 N-QPETDKTFGFKSELIQTPWNNETHLLLIGLETKQVDLGDIPSNIVILLDVSGSM---- 157 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK 246 S+ K+ +A K++ ++E++K ++ LVT+S+ + F + ++++ + Sbjct: 158 -SATNKLSLAKKAMELLIEQMKPNDVIS------LVTYSSGEKVVFKGKSIDDMAYMTSQ 210 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKED 305 I+ L G ST GL AY + I+ TDG+ N+ Sbjct: 211 IRLLKASG-STAGKKGLDMAYKV-----AEEYFIEGGNNR----IILATDGDFNVGISST 260 Query: 306 QQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFLRAC-ASPNSFYLVENPHSMYDAF 358 + Y +E ++ G Y G + + R A +++ +++ S AF Sbjct: 261 DMLIEYISEKRESGIYFSAYGFGYGNFKDEKLERVAKAGNGTYHYIDDIISARKAF 316 >gi|313233701|emb|CBY09871.1| unnamed protein product [Oikopleura dioica] Length = 663 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 72/202 (35%), Gaps = 28/202 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 RLD+ ++LDVS ++ S + D A+L E + N VQ + +FS+ Sbjct: 459 GRLDLAVLLDVSGTIASNPNKDQDTFDFFQ----ALLNEFDTV----NQVQLSITSFSDD 510 Query: 230 IEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L +K + G T+ GL+ A + + + Sbjct: 511 AVVDLPMGHYNEPDLFGAVKNVDWVGRLTDINEGLQTALSTMNTTD-----------DVP 559 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN---- 343 I++F++DG ++ + + G V ++G + F+ + Sbjct: 560 DIMIFVSDGFDSFDPGAIGDNAA---DISNAGVDVVSVGFGLNGFVNFMALVTVADNEGA 616 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + + + + I + + Sbjct: 617 NVFTASTGDELLSQTTAILEAL 638 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 45/234 (19%), Positives = 81/234 (34%), Gaps = 34/234 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + P S K+ + D++IVLD+S SM + SS ++D A + I + + Sbjct: 82 TLAGPSISRKKMTYLSSGA-DIVIVLDISPSMGAIEFSSKNRLDFAKELIEYFVYQ---- 136 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQI 270 GLV F+ + L +K+ Y+ G + G+ A + + Sbjct: 137 ---RENDNIGLVAFAKEASLIVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL 193 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 KK ++ +TDG S + + N A+ +Y++GI Sbjct: 194 -----------KHSEAPKKSVIVLTDGVVNSDEVYKD--QVINLAQGLNVKIYSVGIGSD 240 Query: 331 RSHEFLRACASPNSF----YLVENPHSMYDA-------FSHIGKDIVTKRIWYD 373 S + V +P +++ F +G D K D Sbjct: 241 EELNVGFKLRSGKFYQGVLKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQD 294 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 5/178 (2%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 KG +L ++ L I+ V I+++H KT L + +D + + A + + Sbjct: 7 RTQKGITLVLISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVD 66 Query: 73 NRKKLKGGDILCRIKN--TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + + +SF + F +D+ V + S + + Y Sbjct: 67 QAEAAVIATLSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPT-GEYDIYVRV 125 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 A++ I N + S+ + + M D + ++E + Sbjct: 126 AVTDMGISQYLSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCG--DPNGTVEDAW 181 >gi|119585667|gb|EAW65263.1| inter-alpha (globulin) inhibitor H3, isoform CRA_b [Homo sapiens] Length = 890 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 67/203 (33%), Gaps = 19/203 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L ++++LDVS SM S KI ++ ML+ + + ++TF Sbjct: 15 KAKPLPVVLLLDVSGSM------SGEKIRNVNDAVRDMLDTFSDTENGETEIHVAIITFG 68 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +++ L + + G T L+ A I D + Y Sbjct: 69 SQVALHQPLASASDIHWQDLSA----GGMTPLGTALQMAKAMIEDKDVV------PSRAY 118 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSF 345 + +V ++DG E + + + + A+ I L + N Sbjct: 119 RPTVVLVSDGGPNDAWEKPLNAFISD-GRSAKCDRLAMAIGADADEAVLGKFIEGTSNRL 177 Query: 346 YLVENPHSMYDAFSHIGKDIVTK 368 + EN + D F + + + Sbjct: 178 FYAENAKQLRDFFKFVTMSVTIR 200 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 13/96 (13%), Positives = 34/96 (35%), Gaps = 9/96 (9%) Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRAC 339 + K ++ ++DG+ N+ K+ G + + + L+ Sbjct: 550 KPNPASVKHMIIISDGDPSPPSG-----TILNQYKQAGIKITTVAVGTHGPAGSTPLQNI 604 Query: 340 A--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 A + +Y+ NP ++ F + + ++ D Sbjct: 605 ANATGGKYYVATNPKALPRIFQIEARRVARPLVYED 640 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 59/454 (12%), Positives = 130/454 (28%), Gaps = 115/454 (25%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ-- 64 +I F + +G + ++ A+ L + +G ++ S +++ L + D + V A ++ Sbjct: 6 SIARFSRDRRGNIAVIFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTS 65 Query: 65 ---IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + G D RI NG +N + ++S V Sbjct: 66 PAYVAAGAMSGDGVISSGADDALRIF-------------NGNLNGLTGYTLASSSATVTK 112 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + S+ + T + + TS+ + +D ++LD S Sbjct: 113 ASDV-----VTSQVTFSAQISTMFMKVVGMSAMAVGGTSTATASMPK--YIDFYLLLDNS 165 Query: 182 RSME----------------------------------------SFFDSSITKIDMAIKS 201 SM + T+ID+ + Sbjct: 166 PSMGVGATPTDVSAMIAATANKSSDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSA 225 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF--------FLLEWGVSHLQRKIKYLSKF 253 +++ +N + + F + I ++ Sbjct: 226 TQQLMDTATATATYSNQFRMAIYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVK 285 Query: 254 GVSTNSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSL- 309 G + N+ +Y IF + + + +K + F++DG + + Sbjct: 286 GQNDNNDQ--DTSYTAIFPAINNEISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKT 343 Query: 310 ----------YYCNEAKKRGAIV---YAIGIRVIRSH----------------------- 333 C K RG + Y + + + Sbjct: 344 GNRCQSPINPALCKTLKDRGIKIAVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIA 403 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ACASP ++ V + +A + + K V Sbjct: 404 QNMEACASPGFYFEVSPTQGIAEAMNALFKKAVA 437 >gi|124008506|ref|ZP_01693199.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123986014|gb|EAY25864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 425 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 84/243 (34%), Gaps = 33/243 (13%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 N+ A K L + + + Q L++ +V+D S S Sbjct: 2 NKTLQFEAHLENKYWLTS-----HIKQELFVYLSLKGGKAPEKQERIPLNISLVVDRSGS 56 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVS 241 M S K++ K+++ +++ +K ++ +V + ++I+ Sbjct: 57 M------SGDKLNYVKKAVDFVIDNLKSDDVLS------IVQYDDEIDVVASSAKVTNKK 104 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENL 300 L K+K + + TN + G+ Y Q+ Q ++ ++DG N Sbjct: 105 ALHEKVKGIQARNM-TNLSGGMMEGYAQVKSTQSNGYVNR---------VLLLSDGLANA 154 Query: 301 STKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDA 357 +Q + ++ G + G+ + + + ++Y ++ P + Sbjct: 155 GITAPEQLQQIAQKKFREAGIALSTFGVGSDFNEVLMTNLSEYGGANYYFIDMPDKIPQI 214 Query: 358 FSH 360 F+ Sbjct: 215 FAQ 217 >gi|254443293|ref|ZP_05056769.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257601|gb|EDY81909.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 632 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 67/208 (32%), Gaps = 20/208 (9%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + S + + A L+++ V+D S SM S +++ KS+ ++ ++ + Sbjct: 123 LGFVSGIDATTFKPAPLNLVAVVDKSGSM------SGDPLELVRKSLRQVVSQLGSDDQL 176 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + +V G T +E + I + G + GL+ Y Q Sbjct: 177 S-IVLYGSST-HIHLEPTKTSTENRDQIIASIDRIQSHGSTAMEA-GLELGYQV--ARQS 231 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS-LYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TD + D + K + IG+ V E Sbjct: 232 ADAFVGKTR------VMLFTDERPNVGRTDATGFMAMAESGSKSDIGLTTIGVGVHFGAE 285 Query: 335 FLRACAS--PNSFYLVENPHSMYDAFSH 360 +S + + ++ SM F Sbjct: 286 LAEKISSVRGGNLFFFDDDESMETTFRK 313 >gi|330789584|ref|XP_003282879.1| hypothetical protein DICPUDRAFT_51972 [Dictyostelium purpureum] gi|325087163|gb|EGC40543.1| hypothetical protein DICPUDRAFT_51972 [Dictyostelium purpureum] Length = 536 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 61/193 (31%), Gaps = 10/193 (5%) Query: 175 MIVLDVSRSMES-----FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +IV+D+S SM + + ++ + + + GLV F + Sbjct: 117 VIVIDLSGSMSAPAFMGSYKPGELEMKRIEFAQALFQTFIDKMVSYELPAVCGLVCFGSV 176 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + F + ++ + T+ + A I D + K Sbjct: 177 AKLTFGITKNFDSFSTELGEIQANMGGTSLWEAIVLAAKTIVDFRNNPPSDIKLAEPEKL 236 Query: 290 II-VF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 VF +TDG++ S + Y KK ++ +I I I + + S ++ Sbjct: 237 FCRVFCLTDGQDTSNYPLYDAYSY---LKKNRIVLDSIPIGEISQLLLALSTGTGGSCFV 293 Query: 348 VENPHSMYDAFSH 360 + F Sbjct: 294 ADTAQEGIGLFER 306 >gi|330798620|ref|XP_003287349.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] gi|325082616|gb|EGC36092.1| hypothetical protein DICPUDRAFT_94383 [Dictyostelium purpureum] Length = 559 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 41/199 (20%), Positives = 76/199 (38%), Gaps = 22/199 (11%) Query: 175 MIVLDVSRSME------SFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 MIVLD+S SM S +T+I+ A ++++ GLV F Sbjct: 139 MIVLDLSGSMRLSAFKGSKVPGELEMTRIEFAQAIFQTFIDKMVSYELSAA---CGLVCF 195 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + F + ++ + +T+ + A I D R + N + A Sbjct: 196 GSSAQLTFGITRNFDSFSNELGEIQANMGNTHLWEAIILAAKTIVD---FRNNPNIKLAA 252 Query: 287 YKKII--VF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--AS 341 +K++ VF ++DGE+ S L + KK ++ I I + L A A+ Sbjct: 253 PEKLLCRVFCLSDGEDNSNSSTM--LDAYDYLKKNNVVLDCIPIGL-EGRSRLSALSTAT 309 Query: 342 PNSFYLVENPHSMYDAFSH 360 S ++ ++ + F Sbjct: 310 GGSCFIADSSQEGVELFER 328 >gi|296225412|ref|XP_002758286.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Callithrix jacchus] Length = 860 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 74/177 (41%), Gaps = 15/177 (8%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIE 231 + V+DVS SM K++ +++ +LE++K +N ++ SG + T+ + Sbjct: 284 SVAFVIDVSGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLV 337 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + +K + G+ TN GL + + + +H E + I+ Sbjct: 338 QATPE--NLQEAKMFVKSIDDRGM-TNINDGLLRGISMLNKARE--EHRVPERSTS--IV 390 Query: 292 VFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + +TDG+ N+ ++ A + +Y +G ++ FL A N + Sbjct: 391 IMLTDGDANVGESRPEKIQENVRNAIRGKFPLYNLGFGNNLNYNFLENMALENQGFA 447 >gi|290961218|ref|YP_003492400.1| hypothetical protein SCAB_68641 [Streptomyces scabiei 87.22] gi|260650744|emb|CBG73860.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 239 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 58/178 (32%), Gaps = 26/178 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM ++ + + + ++ D V Sbjct: 26 SLDRHGLRGQRAAVYLVIDHSGSMRPYYKDG--SVQALADRVLGLSAQLD--DDGTVPV- 80 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMR 277 V FS ++ + G++ +I + T+ + + D Sbjct: 81 ---VFFSTGVDAVTEI--GLADHHGRIDRIVAGLGHMGRTDYHLAMDAVIDHYLDCGAED 135 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +VF TDG S ++ Y C A+ + IG RS +F Sbjct: 136 ----------PAFVVFQTDGGPTSRLAAER--YMCKAARLPLFWQF-IGFGDPRSKQF 180 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 83/232 (35%), Gaps = 32/232 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ + DA + ++LDVS SM + +++ A ++ N +L+ Sbjct: 16 VAFPAGAAADEPTGRDAP-KVNLLLDVSGSMRARDIDGQSRMAAAKQAFNEVLDATPK-- 72 Query: 214 DVNNVVQSGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +V +++ + + ++ L+ + + + L+ G T P Sbjct: 73 EVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPLD--RTEAKAAVATLTPTGW-TPIGPS 129 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 L A + + G K IV ++DGE+ D + AK G + Sbjct: 130 LLKAADDLEGGNGS------------KRIVLISDGEDTCAPLDPCEVAREIAAKGIGLTI 177 Query: 323 YAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +G+ +C A+ ++ VE+ + D + + + Sbjct: 178 DTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELTDRVNELVDRAAEPVVT 229 >gi|164687487|ref|ZP_02211515.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] gi|164603261|gb|EDQ96726.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] Length = 273 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 14/202 (6%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++V + L + ++D S SM S KI ++ +L E++ + ++ Sbjct: 24 LEVKPISKKNLVIFFLVDTSGSM------SGKKIGTLNTTMEELLPELRGLGGATTDIKL 77 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++TFS+ E V Q L G++ L A+ ++ + ++ N Sbjct: 78 AVMTFSSGCEWITKEPMSVDDYQYWT-RLKAEGLTD-----LGEAFTELSNKLSRKEFLN 131 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CA 340 +Y +I +TDG + N K G V A+G+ E L+ Sbjct: 132 APSLSYAPVIFLLTDGYATDDALEGLKTLQHNNWYKYGLKV-ALGLGEKFDEELLKKFTG 190 Query: 341 SPNSFYLVENPHSMYDAFSHIG 362 +P + + I Sbjct: 191 NPELVVTAKTSDQLSKLVKTIA 212 >gi|119481411|ref|XP_001260734.1| von Willebrand domain protein [Neosartorya fischeri NRRL 181] gi|119408888|gb|EAW18837.1| von Willebrand domain protein [Neosartorya fischeri NRRL 181] Length = 941 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 61/183 (33%), Gaps = 25/183 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM + KID ++ L+ + + N +F I+ Sbjct: 290 IIFVIDRSGSM-------MDKIDTLKSALRVFLKSLPVGVCFNICSFGSAHSF-LWIQSL 341 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 F + + + T ++ A H +D + ++ Sbjct: 342 FYTAESLQEALSFVDGVRADMGGTE----MQEAV-------EATVHSRMKDKELE--VLI 388 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENP 351 +TDG+ + + A A +++GI SH + A + + LV N Sbjct: 389 LTDGQIW--NQQTLFKFIRETAADNSARFFSLGIGNGASHSLVEGIARAGNGFSQLVVNY 446 Query: 352 HSM 354 + Sbjct: 447 EEL 449 >gi|109112823|ref|XP_001117651.1| PREDICTED: integrin alpha-E-like [Macaca mulatta] Length = 956 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 61/158 (38%), Gaps = 21/158 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEE 232 + I+LD S S++ A I+ M+ + + N LV + I+ Sbjct: 367 IAIILDGSGSIDP------PDFQRAKDFISNMMRNFYEKCFECN----FALVQYGGVIQT 416 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + K++ +++ G T + +++ + IF + + K+ Sbjct: 417 EFDLRDSQDVMASLAKVQNITQVGSVTKTASAMQHVLDNIFTSSHGSRRKAS------KV 470 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V +TDG D ++ N K G +AIG+ Sbjct: 471 MVVLTDGGIFEDPLDLTTVI--NSPKMHGVERFAIGVG 506 >gi|51247575|pdb|1T6B|Y Chain Y, Crystal Structure Of B. Anthracis Protective Antigen Complexed With Human Anthrax Toxin Receptor Length = 189 Score = 49.8 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 67/196 (34%), Gaps = 27/196 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 13 AFDLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQA 60 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ L + T GLK A QI G++ Sbjct: 61 TIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSS-------- 112 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 II+ +TDG+ ++ GA VY +G+ + R S + V Sbjct: 113 -IIIALTDGKLDGLVPSYAEKE-AKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPV 170 Query: 349 ENPHSMYDAFSHIGKD 364 + + A I Sbjct: 171 KGG---FQALKGIINS 183 >gi|327281097|ref|XP_003225286.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 292 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 78/203 (38%), Gaps = 32/203 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S ++ + +K P+ + L+ FS+ +E Sbjct: 51 DIVFLLDGSTSVR---PSDFIEMKAFVAL------MMKRFPENTHF---ALLQFSSHFQE 98 Query: 233 FFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + ++ +++ S+ + G++ A +G R K+ Sbjct: 99 HFDFRHFQRNRDPDHLMREVNQLRGSSYTATGIRKATELFTTQKGARA-------TAKRF 151 Query: 291 IVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACAS---P 342 +V +TDGE E + + N AK +AIG+ + L + S P Sbjct: 152 LVVVTDGEKFGDMLEYAEVIEEANRAK---ITRFAIGVGIVFTSRVAQRELHSIGSHPVP 208 Query: 343 NSFYLVENPHSMYDAFSHIGKDI 365 + ++V + + D + + + I Sbjct: 209 DHVFVVRHFTGLRDIQTQLKEKI 231 >gi|219125320|ref|XP_002182931.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405725|gb|EEC45667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 523 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 65/171 (38%), Gaps = 28/171 (16%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTD--ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 I ++ H I + D +D+++VLDVS SM K+ + K++ Sbjct: 39 IDSEVSTNHFCASIHARTMPKEDEDCRTPIDLIVVLDVSGSMTGN------KLKLCKKTL 92 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTN 258 +L ++ + GL++F + F + + +KI+ L+ G TN Sbjct: 93 TMLLRVLQTQD------RFGLISFGSDARVEFPAQAMSKQNKASALQKIQSLTTRGC-TN 145 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + L A ++ E +N + + F+TDG D L Sbjct: 146 MSAALGLAVQEL---------KIIEKSNPVRSLFFLTDGLANEGISDLDGL 187 >gi|72180809|ref|XP_798930.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] gi|115975272|ref|XP_001180569.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] Length = 964 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 66/168 (39%), Gaps = 12/168 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SM S TK+ +++ +L+++ N + S V F Sbjct: 351 IIFVIDISGSM------SGTKLAQVKDALSTILDDMSETDKFNILPFSDDVHFLESTGML 404 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + V +R + L + +TN + N + + ++ +++ Sbjct: 405 YSTKENVRRAKRFVMGLQEM-DNTNLHKAIISGVNMLRAESE----QDPQEEEIVSMLIV 459 Query: 294 MTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG + D+ + +EA ++ IG + FLR + Sbjct: 460 LTDGNPNHGEIDKTIIERNVHEAINGDFSLFCIGFGADADYPFLRRLS 507 >gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] Length = 740 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 49/300 (16%), Positives = 88/300 (29%), Gaps = 43/300 (14%) Query: 56 RSLVHAATQIMNEGNGNNRKKLKGG-----DILCRIKNTWNMSFRNELRDNGFVNDIDDI 110 SL+ + ++ N GG I + + + V Sbjct: 225 ASLLAGSADLIAPTADPNMVARAGGGLNPVSITVNLDPGFAPEAISSPYHAVSVRGSGST 284 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 T D VP N + + + P+ F + +++ IT Sbjct: 285 RTVTLADG-AVPANRDFELRWSASGDAPMLGL-FKQRHGELEYVMATITPPALERVGEAP 342 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +M+ V+D S SM + A +S+ LE ++ + ++ F + + Sbjct: 343 PREMIFVIDNSGSMAG------ESMPAARRSLLYALETLRPQD------RFNVIRFDDTM 390 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F V I F T P L+ A + +RQ Sbjct: 391 TELFASA--VQASDSNIAAAKTFTHNLMANGGTEMLPALRAALRDRAPDERVRQ------ 442 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++F+TDG + E K V+ +GI + +R A Sbjct: 443 ------VIFLTDG--ALSNEADMMEEINRNRKDS--RVFMVGIGSAPNTYLMRRMAEAGR 492 >gi|309790583|ref|ZP_07685138.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] gi|308227385|gb|EFO81058.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] Length = 430 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 49/130 (37%), Gaps = 24/130 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + + ++V+D S SM+ KID A + A + V+ Q L+ F++ Sbjct: 90 GSAIRSVLVIDRSGSMDE-----GNKIDGARDAAQAFVGMVRSDD------QVALIGFND 138 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ + L+ I+ L G T + + + D G R Sbjct: 139 QVVVLEPFTDDQAILEAAIRRLRADG-GTALYDSIVEGVDLLRDQPGRRA---------- 187 Query: 289 KIIVFMTDGE 298 ++ +TDG+ Sbjct: 188 --LLVLTDGQ 195 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 38/206 (18%), Positives = 80/206 (38%), Gaps = 31/206 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + A +D++ VLDVS SM K+++ ++ ++ ++ + + Sbjct: 75 AAADARAPVDLVTVLDVSGSMRG------QKLELVKTAMEFVIRNLRQQD------RLAI 122 Query: 224 VTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V+FS++ + L+ G + ++ L G T PGLK + + + R Sbjct: 123 VSFSDEPKVHLGLKRMTHDGRAAALSAVEKLRSLG-GTEIRPGLKAGF-DLLSRRKNRNP 180 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRV-IRSHEF 335 ++ I+ ++DG +N T + + L + + V+ G Sbjct: 181 VSS--------IMLLSDGMDNAITFKRCKVLPVDSYLEDCSERVPVHTFGFGSDHDPEAM 232 Query: 336 LRAC-ASPNSFYLVENPHSMYDAFSH 360 L A+ SF V+ ++ AF+ Sbjct: 233 LSIAEATGGSFCYVQEESTVQHAFAQ 258 >gi|156405834|ref|XP_001640936.1| predicted protein [Nematostella vectensis] gi|156228073|gb|EDO48873.1| predicted protein [Nematostella vectensis] Length = 250 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 36/221 (16%), Positives = 64/221 (28%), Gaps = 26/221 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + D+ +++D S S+ + + AI + Sbjct: 17 PEETSSQVCSKPS-DVALLIDASGSIGRRRWPKVVEFTQAI---------INSFNVSEEG 66 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQI 270 G++ +S K E G I G T LK A Q+ Sbjct: 67 SHVGIILYSTKTELLVKFNTFQGSELTADNINAKVAAVNYRDWGGLTYIDRALKLANEQL 126 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIR- 328 F +G + K+ V TDG+ K L ++ K + VY +GI Sbjct: 127 FSPEG----GMRASKDILKVAVVFTDGKQTKDKGPFTELQIASQPLKDKDVQVYGLGIGD 182 Query: 329 --VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 I E P + + + + + I + + Sbjct: 183 ETTIDVQEMQEMANKPENVLTAKTFEELKNLAAQITQGVCE 223 >gi|47222734|emb|CAG01701.1| unnamed protein product [Tetraodon nigroviridis] Length = 505 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 73/205 (35%), Gaps = 25/205 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S+ S + + + E ++ + V+ G++ + Sbjct: 38 DVLFLLDSSGSVSSYEHSRM---------LAFLSELLQPFSLGEDQVRVGVLQVGTEPRL 88 Query: 233 FFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F + S LQ ++ + TN+ LK A ++ A ++ Sbjct: 89 EFGFDAHATQSSLQGSLRNIKPLRGDTNTVEALKVAQERVLRPGVPG----GARAGLPRV 144 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN----SFY 346 +V++TDG N + E ++ G V + +P F Sbjct: 145 LVWLTDGVNPGDISGPMA-----ELREEGVAVLVVSTGHSNYLVLREIVTAPVENHLHFV 199 Query: 347 LVENPHSMYDAF-SHIGKDIVTKRI 370 +++ + D I + I +R+ Sbjct: 200 DIDDMSIITDDLRDAIIELIRAERL 224 >gi|119386037|ref|YP_917092.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376632|gb|ABL71396.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 855 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 36/224 (16%), Positives = 78/224 (34%), Gaps = 44/224 (19%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + +TS++ + R ++VLD S SM D I KI +A + ++ + + Sbjct: 10 IALTSALAPAALAQERPSTILVLDASGSMWGQIDG-INKITIARDVVGDIVSDFPADQN- 67 Query: 216 NNVVQSGLVTFSNK-------IEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAY 267 G VT+ ++ IE G + + ++ L+ G++ + Sbjct: 68 -----LGFVTYGHRERGQCADIETLVEPAPGTAAEIAGIVEGLNPRGMTPMT-------- 114 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK---KRGA--IV 322 + + +H ++ ++DG + C A+ + G Sbjct: 115 DAVVTAAQALRHTEQAAT-----VILVSDGIETCNPD------PCAAARALEEAGVDFTA 163 Query: 323 YAIGIRV-IRSHEFLR-AC---ASPNSFYLVENPHSMYDAFSHI 361 + IG V + L+ C + F +N + +A + Sbjct: 164 HVIGFDVRGEADALLQMQCIAEETGGRFLTADNAQELNEALREV 207 >gi|330469792|ref|YP_004407535.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812763|gb|AEB46935.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 565 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 64/202 (31%), Gaps = 32/202 (15%) Query: 174 MMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 M+ ++DVS SM+ + T+ + ++ L + GL FS +++ Sbjct: 368 MLCIIDVSGSMKKAVPTANGATRQQVTAEAARRGLNLFD------DSWSIGLWVFSTRLD 421 Query: 232 ---------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L S L+R + ++ T L AY + + + Sbjct: 422 GSRDYRQVVPTGPLSRQRSTLERSLDTITSSSGDTGLYDTLLAAYKDVQQNWEPGKVNS- 480 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHEFLRA 338 IV TDG+N Q + + V IGI S L Sbjct: 481 --------IVLFTDGKNEDADGISQRQLLAELKRIKDPDQPIQVIIIGIGTEVSKAELDT 532 Query: 339 CASP--NSFYLVENPHSMYDAF 358 A ++ +P + D F Sbjct: 533 IAQSAGGGAFVAADPTKIGDIF 554 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 67/175 (38%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLVAQLVDKMQND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ + T + A +G+ Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLATRSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + ++ A ++G V +G+ + Sbjct: 191 -----------RAIIVITDGENH----EGGAVEAAKAATEKGIQVNVLGVGMPDG 230 >gi|32474636|ref|NP_867630.1| hypothetical protein RB7099 [Rhodopirellula baltica SH 1] gi|32445175|emb|CAD75177.1| conserved hypothetical protein-containing vWFA domain [Rhodopirellula baltica SH 1] Length = 885 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 100/260 (38%), Gaps = 30/260 (11%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI-----PWYTNSRHIVMPITSSVKV 164 + R S+ I + Y + + P+ F + + PW N+R + + I + + Sbjct: 438 LPRPDSVRIEELINYFDYQYTP-PSAEDPVPFSSAMAVASCPWNENNRLVRVGIQAK-DI 495 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + R +++ ++D S SM K+ + I+ + +L+++K V VV +G Sbjct: 496 DRKERPRCNLVFLIDTSGSM-----KRPNKLPLVIEGMKVLLDQLKNRDRVAIVVYAG-- 548 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 S+ + + R + LS G STN GL+ AY R++ + Sbjct: 549 --SSGLVLDSTPVKQKKKIIRALSALSA-GGSTNGGAGLQLAYQT------ARENFIEDG 599 Query: 285 ANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-- 341 N ++ +DG+ N+ Q + K G + +G + ++ + S Sbjct: 600 VNR---VILCSDGDFNVGMTGTDQLVAEATRQSKSGTELTVLGFGMGNHNDAMMERISNS 656 Query: 342 -PNSFYLVENPHSMYDAFSH 360 ++ V+ + Sbjct: 657 GAGNYAFVDTIAEAKKVLAD 676 >gi|239908149|ref|YP_002954890.1| hypothetical protein DMR_35130 [Desulfovibrio magneticus RS-1] gi|239798015|dbj|BAH77004.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 328 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 57/166 (34%), Gaps = 14/166 (8%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + +DMM +D+S SM + + + + ++ + + L + + G Sbjct: 75 DAPTYAGRGVDMMFAVDLSPSMAAMDIPAEGRTITRLAAVAEAAKTLALSRPGD---RIG 131 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 LV F + + ++ L+ G K A + R + Sbjct: 132 LVAFGARAYLVVPP---TTDRAALVQALASLDTGAA---GRKTAMGDAVGLAAKRLDESP 185 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 A K +V DG + + + + A + G V+A+G+ Sbjct: 186 GQA---KAVVVFGDGRSNA--GETDPVPAAQAAVRHGVAVFAVGVG 226 >gi|225873423|ref|YP_002754882.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] gi|225793805|gb|ACO33895.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] Length = 339 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/259 (11%), Positives = 76/259 (29%), Gaps = 30/259 (11%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 + + + +K + H S QT L + I Sbjct: 62 TLTTNVNEVDLVFTVTDSHGNFVKNLKESDFALLDDHRAPAAVYSFTQ--QTQLPLRLGI 119 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 ++D S S+ + +++ L +V V+ F Sbjct: 120 LVDTSTSIRE-------RFQFEQQAVTNFLLQVLRPKTDEAFVE----GFDEAPNFILNW 168 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L I+ L G T + A + + ++ I+ ++D Sbjct: 169 SNNLDTLSSAIQDLHP-GGGTALYDAVYSACRD------KLLNAASGPIYVRRAIILVSD 221 Query: 297 GENLSTKED-QQSLYYCNEAKKRGAIVYAIGI----RVIRSHEFLRACA--SPNSFYLVE 349 G++ + ++ C A+ +YA+ + LR A + + Sbjct: 222 GDDNQSHAYLTDAIKECQRAQTA---IYAVSTDTDPTPDPGDDILRKMAEETGGRAFFPR 278 Query: 350 NPHSMYDAFSHIGKDIVTK 368 ++ +F+ + ++ ++ Sbjct: 279 VITNLPASFNSVEDELRSQ 297 >gi|153833207|ref|ZP_01985874.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148870478|gb|EDL69393.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 515 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 41/278 (14%), Positives = 92/278 (33%), Gaps = 47/278 (16%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 KG + + L I +E + + L + + + + E + Sbjct: 15 KGVAAVWMGLLLVPIMGFTFWAVEGTRYVQETSRLRDSAEAAAMA----VTIEDQAGSAS 70 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 L + +++ +M+ + + DD V + Y+++A + + Sbjct: 71 TLAAKYVESYVRDIKSMNVSAQ----RYYRAADDRAG--------VLEYIQYTVNAKTTH 118 Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT-K 194 +FIP + + + + D +D++ V D S SM + + K Sbjct: 119 DSWF-ASSFIPSFDEQQDLAGRSLARKYPAYLADNNIDIVFVSDFSGSMREQWGFNRHIK 177 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVV----------------------QSGLVTFSNKIEE 232 ID +I+ + + V + G V ++ + E Sbjct: 178 IDDLKTAISQISNNILCTSTRQEYVDGEWKDVCDEPGEDTTSDKLLNRVGFVPYNVRTRE 237 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++ W ++ ++ Y G +T+ +P Y YN I Sbjct: 238 --IIGWNQANTTSQLNY--TNGYNTHLSP---YTYNDI 268 >gi|89100218|ref|ZP_01173085.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085068|gb|EAR64202.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 456 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 53/319 (16%), Positives = 105/319 (32%), Gaps = 30/319 (9%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 GN +N KL+ + ++ + D D + + V Sbjct: 49 GNADNPSKLESPPLPVSLEEIF-SYPEGSFSSEDLSIDTSDTEAALNAVPAVPEDASDEE 107 Query: 129 ISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 I + Y L K P + S + ++ A ++ IVLD S SM + Sbjct: 108 IKELFGYLYSLYKMEYQDPRAIMESASEVEGPESEGGSQESPASFNVEIVLDASGSMANK 167 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ--SGLVTFSNKIEEFFLLE-------W 238 S T++++A +SI + +++ V G + S+K E + Sbjct: 168 LGSK-TRMELAKESIKEFASSLPEEANISLRVYGHKGTGSDSDKKMSCSSNELVYPPQPY 226 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 L + + G + + + + Q +D +V +DG Sbjct: 227 NEGELNSALDKFNPAGWTPLA--------QSLIEAQKDLAQFEGQDNKNMVYVV--SDG- 275 Query: 299 NLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLR--ACASPNSFYLVENPHS 353 D + + K G +V IG V + + L A A+ ++ V + Sbjct: 276 --IETCDGNPVEAAKDLKDSGVAPVVNIIGFDVKGKDQQQLEEVAKAAGGTYQNVTSQQQ 333 Query: 354 MYDAFSHIGKDIVTKRIWY 372 + + + ++ + R WY Sbjct: 334 LQNELNKAVEESLKWRTWY 352 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 49.8 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 70/201 (34%), Gaps = 27/201 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +S A LD++ V+DVS SM ID A ++ ++ ++ + + Sbjct: 59 SSSSTDRAGLDLVAVIDVSGSM------DGDGIDKAKTALQFVIRKLSDLD------RLC 106 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +VTFS L + + Q ++K L G+ + +K + R+ Sbjct: 107 IVTFSTNATRLCPLRFVTAAAQAELKALV-DGLKADGMTNMKAGLETSMSVVDGRRLAAG 165 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS- 341 + ++ M+DG + + K VY G L A A Sbjct: 166 RAVS----VMLMSDGYQNDGGDARDV-----HLKN--VPVYTFGFGASHDSNLLEAIARK 214 Query: 342 --PNSFYLVENPHSMYDAFSH 360 +F V + ++ FS Sbjct: 215 SLGGTFNYVADSANLTGPFSQ 235 >gi|296481520|gb|DAA23635.1| inter-alpha globulin inhibitor H2 polypeptide [Bos taurus] Length = 946 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 70/178 (39%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDHFS------VVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ + I+ + G TN L A + + + + + Sbjct: 359 RNDLVSATKTQVADAKNYIEKIQPSG-GTNINEALLRAIFILNEANNLGM-LDPNSVSL- 415 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ ++DG+ K + + ++++GI ++FL+ ++ N Sbjct: 416 --IILVSDGDPTVGELKLSKIQKNVKQNIRDN-ISLFSLGIGFDVDYDFLKRLSNDNR 470 >gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 [Bos taurus] gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus] Length = 946 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 70/178 (39%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTEDHFS------VVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ + I+ + G TN L A + + + + + Sbjct: 359 RNDLVSATKTQVADAKNYIEKIQPSG-GTNINEALLRAIFILNEANNLGM-LDPNSVSL- 415 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ ++DG+ K + + ++++GI ++FL+ ++ N Sbjct: 416 --IILVSDGDPTVGELKLSKIQKNVKQNIRDN-ISLFSLGIGFDVDYDFLKRLSNDNR 470 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 72/205 (35%), Gaps = 32/205 (15%) Query: 173 DMMIVLDVSRSM--ESFFDS---SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+++ +D+S+SM E F D+ ++ + + PD + GL+ F Sbjct: 93 DILLAVDISQSMDSEDFRDAQGRPASRWQAVQAVVGDFI---DKRPDD----RLGLIVFG 145 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + L+ ++ + N+ G D G+ Sbjct: 146 AGAYPQAPLTRDHAALRLLLQRTAVGMAGPNTALG---------DAIGLGIRMLDHARER 196 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-------HEFLRACA 340 KI++ +TDG T A + +V+ IGI + + LR A Sbjct: 197 DKILILLTDG--NDTASAVPPARAAELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIA 254 Query: 341 --SPNSFYLVENPHSMYDAFSHIGK 363 + F+ + S+ + ++ + + Sbjct: 255 RIAGGRFFRARDQASLQEVYATLDR 279 >gi|332216203|ref|XP_003257234.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Nomascus leucogenys] Length = 890 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRVPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 41/120 (34%), Gaps = 8/120 (6%) Query: 8 IRNFFYN--YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +R + KG ++++ AI L + + + ++V I+ + L S D S + A + Sbjct: 1 MRRLGADNTEKGAVSVIVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKC 60 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + + + + N + +D R+ S+ Sbjct: 61 ARDTANADCST------TSTLAGSLANQNSLDGMSNVYSIQLDKTARTVSVTTSAKETGS 114 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 74/214 (34%), Gaps = 21/214 (9%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ K ++ A ++ ++LD S SM D ++K + A K I+ + Sbjct: 17 LTSPAFAAEKTETEAKAPANVAVLLDASGSMAKRIDG-VSKFNSAKKEISKFASSLPEGT 75 Query: 214 DVNNVV--QSGLVTFSNKIEEFFLLE--WGVSHL--QRKIKYLSKFGVSTNSTPGLKYAY 267 V V G S K++ + +G Q + L+ G + + Sbjct: 76 QVKMSVFGSEGNNKNSGKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIA----- 130 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + DA +K++ +TDGE + E +K V IG Sbjct: 131 ---KALNEAKSSFDQLDAKGEKVVYLLTDGE---ETCGGNPIKTAKELQKDNITVNVIGF 184 Query: 328 RVIRSHE-FLRACAS--PNSFYLVENPHSMYDAF 358 ++ L A A ++ + F Sbjct: 185 DYKEGYKGQLNAIAKVGGGEYFPAYTQKDVEKIF 218 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 68/173 (39%), Gaps = 20/173 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++ + + + ++ + + Sbjct: 79 GSKLETIKREGVDIVFAIDVSKSMLAE-DIAPNRLEKSKQLVTQIINNLASD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ K + + + ++ ++ +S + + A Q Sbjct: 131 VGIIAYAGKAFPQLPITTDYASAKMFLQNMNTDMLS-SQGTAINEAIELAKTYYDDDQQT 189 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N +++V ++DGE+ S + NE G ++ IG+ + Sbjct: 190 N-------RVLVIISDGEDHSEAAANVAEEASNE----GIRIFTIGVGDAKGG 231 >gi|223694808|gb|ACN18090.1| von Willebrand factor type A [uncultured bacterium BLR5] Length = 347 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 84/228 (36%), Gaps = 39/228 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +TD L + +++DVS S E + A I ++ K Q+ ++ Sbjct: 85 QRETDLPLSIALLIDVSASEERTLP---DEKAAARSFIETIIRSSKD--------QAAII 133 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLS----------------KFGVSTNSTP--GLKYA 266 F++ L V + + ++ L G T + P G Sbjct: 134 PFTDYAYLEQGLTPNVLAIYQALQRLEVALPSYVGSGRKISGISSGPGTIANPREGSTAI 193 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++ + G + ++ I+ +TDG++ S++ + + ++A + ++YAIG Sbjct: 194 WDAVTVSAGEILTRSPGR--RRRAIILLTDGQDTSSRVTRGTAI--DKALEAETVIYAIG 249 Query: 327 IRVIR----SHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 I + L A + + + + F+ I K++ ++ Sbjct: 250 IGDSKYEGIDKGALNNVAERTGGRAFFPKRGADLTSVFTEIEKELRSQ 297 >gi|194391096|dbj|BAG60666.1| unnamed protein product [Homo sapiens] Length = 698 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 72/176 (40%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N + SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFTLFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 12/113 (10%) Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 LK+A N +F +H + N K++++ +TDGE+ + + E + +G Sbjct: 1 AKALKHA-NALFT----EEHGSRIKQNVKQMLIVITDGESHDHDQLNDTAL---ELRNKG 52 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 ++A+G+ E + N+ V+N + D F+ + + Sbjct: 53 ITIFAVGVGKANQKELEGMAGNKNNTIYVDNFDKLKDVFT----LVQERMCTE 101 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 94/275 (34%), Gaps = 27/275 (9%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVV 121 N G + + + +F ++ + N ++ + ++ I + Sbjct: 57 NVGYNTEEYNIINESGFQSVSTSPLSTFAADVDTASYANIRRFITDGELPPADAVRIEEM 116 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYT-NSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y + P T I N +M I K T +++ ++DV Sbjct: 117 LNYFYYDYP-QPKDDEPFSVTTEISSCPWNPDTKLMQIGLQAKNTDTTTKPSNLVFLIDV 175 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM+ K+ + + + +E+K ++ V +G ++ + Sbjct: 176 SASMDE-----PDKLPLVKNAFLLLCDELKENDTISIVTYAG----TDSVVLEGAKGSDK 226 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-N 299 + I+ L+ G ST + G+K AY +I + + N +V TDG+ N Sbjct: 227 KSIMSAIEDLTA-GGSTAGSDGIKTAY-KIAEKYFKTEGNNR--------VVLATDGDLN 276 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + + + + K+ + +G + Sbjct: 277 VGITSEGELIKLIKKEKESNIFLSVLGFGTDNIKD 311 >gi|217970531|ref|YP_002355765.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507858|gb|ACK54869.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 833 Score = 49.4 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 59/175 (33%), Gaps = 26/175 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + I++D S SM+ I A +++ A++ ++ + L F + + Sbjct: 267 PLAVKILVDCSGSMQG------DSIAAARRALQAIVAGLREGE------RFSLSRFGSTV 314 Query: 231 EEFFLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 E W S QR L T L + + + Sbjct: 315 EHRSRALWRTSPATRLAGQRWAAQLQADLGGTEMEKALD---STLALAGDASVSPGAGEG 371 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG+ +++ +A+ G V+ +GI + LR A Sbjct: 372 AAPVDLLLITDGQ---IHAIDRTVA---KARALGHRVFVVGIGSAPAEGVLRRLA 420 >gi|326775386|ref|ZP_08234651.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326655719|gb|EGE40565.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 424 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------ 226 + +VLDVS SM + +++ A ++ N +L+ +V+ +++ + Sbjct: 40 QVELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAAP--EEVHLGIRTLGADYPGEDRK 97 Query: 227 -----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + ++ L+ + + + L+ G T P L A + + +G R+ Sbjct: 98 VGCKDTKQLYPVGPLD--RTEAKAAVATLAPTGF-TPIGPALLGAADDLEGGEGSRR--- 151 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRAC 339 IV +TDGE+ D + A+ +V +G+ + C Sbjct: 152 ---------IVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQL--TC 200 Query: 340 ---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ ++ V++ + + + Sbjct: 201 IAEATGGTYTAVQHKEELSGRVKQLVDRAAEPVVT 235 >gi|291446035|ref|ZP_06585425.1| predicted protein [Streptomyces roseosporus NRRL 15998] gi|291348982|gb|EFE75886.1| predicted protein [Streptomyces roseosporus NRRL 15998] Length = 537 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 81/192 (42%), Gaps = 23/192 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++K + T R + +VLD S SM F+ D S + ++ A L+E + Sbjct: 333 AALKAHGLTGLRARVYLVLDRSGSMRPFYKDGSAQHLGDRTLALAAHLDE-------DAT 385 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V+ +V FS I+ LE +S + ++ L G+ + A ++ Sbjct: 386 VR--VVFFSTDIDGTGSLE--LSGHEGRVDELHA-GLGRLGRTHYERAVEEVVADYE--- 437 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLR 337 +A +++F TDG + + +Q+L + A+ A G + + +FLR Sbjct: 438 ---KAEATGPALVIFQTDGPPDAKQAARQALS--DAARLPLFFQFVAFGDQDAKGFDFLR 492 Query: 338 ACASPNS-FYLV 348 +PN+ F+ Sbjct: 493 KLDAPNAGFFHA 504 >gi|257462368|ref|ZP_05626782.1| magnesium chelatase [Fusobacterium sp. D12] gi|317060032|ref|ZP_07924517.1| magnesium chelatase [Fusobacterium sp. D12] gi|313685708|gb|EFS22543.1| magnesium chelatase [Fusobacterium sp. D12] Length = 605 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 68/199 (34%), Gaps = 32/199 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEE 232 ++ V+D S SM + ++ +I ++L+ + LV F K EE Sbjct: 425 ILFVVDSSGSMGAK-----KRMRAVKGAIFSLLQ-----DAYEKRDKVALVAFRKKSAEE 474 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +++++ L G T GL AY I + +D ++V Sbjct: 475 LLSMTRSIELAKKQLQNL-ATGGKTPLAEGLFKAYQLI-------RQLKKKDGEIYPLLV 526 Query: 293 FMTDGENL----STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-----ASPN 343 ++DG ++SL + KK G ++ I L A Sbjct: 527 LISDGRANISLHGRDPIEESLEMARKIKKEGIS--SVVIDTEEGFTLLEMAKNISEAMGA 584 Query: 344 SFYLVEN--PHSMYDAFSH 360 +Y +EN M Sbjct: 585 EYYRLENIQAEDMLKLLKK 603 >gi|239942573|ref|ZP_04694510.1| hypothetical protein SrosN15_16393 [Streptomyces roseosporus NRRL 15998] gi|239989034|ref|ZP_04709698.1| hypothetical protein SrosN1_17140 [Streptomyces roseosporus NRRL 11379] Length = 531 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 42/192 (21%), Positives = 81/192 (42%), Gaps = 23/192 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++K + T R + +VLD S SM F+ D S + ++ A L+E + Sbjct: 327 AALKAHGLTGLRARVYLVLDRSGSMRPFYKDGSAQHLGDRTLALAAHLDE-------DAT 379 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V+ +V FS I+ LE +S + ++ L G+ + A ++ Sbjct: 380 VR--VVFFSTDIDGTGSLE--LSGHEGRVDELHA-GLGRLGRTHYERAVEEVVADYE--- 431 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLR 337 +A +++F TDG + + +Q+L + A+ A G + + +FLR Sbjct: 432 ---KAEATGPALVIFQTDGPPDAKQAARQALS--DAARLPLFFQFVAFGDQDAKGFDFLR 486 Query: 338 ACASPNS-FYLV 348 +PN+ F+ Sbjct: 487 KLDAPNAGFFHA 498 >gi|194227183|ref|XP_001916967.1| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Equus caballus] Length = 946 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTED------QFSVVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V+ ++ I+ + G TN L A + + + + + Sbjct: 359 RNDLVSATTTQVADAKKYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENRG 471 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAHRIYGNQDTSSQLKKF 489 >gi|182434868|ref|YP_001822587.1| hypothetical protein SGR_1075 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463384|dbj|BAG17904.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 424 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 77/215 (35%), Gaps = 35/215 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------ 226 + +VLDVS SM + +++ A ++ N +L+ +V+ +++ + Sbjct: 40 QVELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAAP--EEVHLGIRTLGADYPGEDRK 97 Query: 227 -----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + ++ L+ + + + L+ G T P L A + + +G R+ Sbjct: 98 VGCKDTKQLYPVGPLD--RTEAKAAVATLAPTGF-TPIGPALLGAADDLEGGEGSRR--- 151 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRAC 339 IV +TDGE+ D + A+ +V +G+ + C Sbjct: 152 ---------IVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQL--TC 200 Query: 340 ---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ ++ V++ + + + Sbjct: 201 IAEATGGTYTAVQHKEELSGRVKQLVDRAAEPVVT 235 >gi|75750453|ref|YP_319892.1| hypothetical protein ATV_gp61 [Acidianus two-tailed virus] gi|123849288|sp|Q3V4Q4|Y892_ATV RecName: Full=Putative VWFA domain-containing protein ORF892 gi|74474836|emb|CAI59910.1| hypothetical protein [Acidianus two-tailed virus] Length = 892 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 69/213 (32%), Gaps = 33/213 (15%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P++ F+ Y N + S+ + D + V+D S SME Sbjct: 696 PIQQKMFLLDYLNG--------ALSIHKSEEKKQGDFLFVIDSSGSMEGN---------K 738 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ L K N++ + TFS +E + + + ++ + + T Sbjct: 739 IATALAIPLVTYKKYKGKRNIL---VETFS---DEPSPI-YNIKNIANVLGSMKF--GGT 789 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N + YA I E + ++ +TDGE+ D + + KK Sbjct: 790 NIGSAVLYALKNIDKPDSDYDRKLRESLRKTRTLILLTDGEDEIP--DDIAREINSLKKK 847 Query: 318 RGAIV--YAIGIRVIRSHEFLRACASPNSFYLV 348 + Y I + C + Y V Sbjct: 848 NKVELLCYGIDLGERGLKTLKEIC---DEVYAV 877 >gi|116201805|ref|XP_001226714.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] gi|88177305|gb|EAQ84773.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] Length = 777 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 69/204 (33%), Gaps = 28/204 (13%) Query: 171 RLDMMIVLDVSRSM------------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 D+++ +D+S SM E+ D+ + ID+ + ++ + Sbjct: 72 PCDLVLSIDISGSMADEAPAPSKPGGEAGEDTGLRVIDLVKHAARTIVATLDSRD----- 126 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF+N+ + + I+ + F S+ + + + + Sbjct: 127 -RLGIVTFTNRSKVGIPPYENKAKTLENIESMEPF-----SSTNMWHGIRDGLSLFSEAE 180 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 +T ++ +TDG + + + A ++ G L++ Sbjct: 181 GGSTGRVPA---LLVLTDGMPNYMCPPKGYVPMLRSMEPLPATIHTFGFGYELRSGLLKS 237 Query: 339 CAS--PNSFYLVENPHSMYDAFSH 360 A ++ + + + F H Sbjct: 238 IAEVGGGNYSFIPDAGMLGTVFIH 261 >gi|262091909|gb|ACY25458.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 621 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 86/238 (36%), Gaps = 32/238 (13%) Query: 135 YKIPLKFCTF------IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 Y++P F IP + + ++++ V D S SM Sbjct: 217 YELPDGPYEFGVTVDLIPHPLQREDFHLLRIGLQAPVIERTHPVNVIFVADASGSM---- 272 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWG--VSHLQR 245 + +ID+A ++ A+ + +PD++ + G++ FS + I F+ LQ Sbjct: 273 -AEGNRIDIARAALQALWASL--VPDLD---RVGMIQFSVDPIPASFVPHTRPDSEFLQA 326 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 I L + TN G+ D + + +V ++DG D Sbjct: 327 SIDRLLPY-YGTNVQAGIDLGVQLANDARQAW-------PDSDNYVVLISDGVANVDATD 378 Query: 306 QQSL--YYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNS--FYLVENPHSMYDAF 358 ++ + + + IG+ + ++ L A + +Y +++P ++ F Sbjct: 379 PFAILRSAGEDDESNPIRLITIGVGIGHYNDVLLEQLAQYGNGWYYYIDSPEQAWETF 436 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 264 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 317 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A N + G Sbjct: 318 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGNMLSVPNG------ 371 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 372 -----HKKVIVLLTDGVPT 385 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 50/283 (17%), Positives = 89/283 (31%), Gaps = 82/283 (28%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSME---------------------SFFDSSIT 193 + I+ + QT A +D I+LD + SM D +I Sbjct: 5 SITISGTATAEYQTAAFMDFYILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMDKTIN 64 Query: 194 ------------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-----FLL 236 +ID+ ++ A+ + K ++ + G+ TF K E+ L Sbjct: 65 NYTIAKSLGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTTISGL 124 Query: 237 EWGVSHLQRKIKYLS---------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ ++ + TN + N I D G T + + Sbjct: 125 TSDLTKVKNYTDAVDLMTIPYQNYNNDQITNFDSAM-TQMNTIIDQAGDG----TSNISA 179 Query: 288 KKIIVFMTDGENLSTKEDQQSLYY-------------CNEAKKRGAIV------------ 322 +KI+ F++DG S K + C K RG + Sbjct: 180 EKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPS 239 Query: 323 ---YAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSH 360 Y I+ + ++ACASP ++ V + DA Sbjct: 240 NSWYNTWIKPFQGEIPTKMQACASPGFYFEVSPTEGITDAMKA 282 >gi|288563|emb|CAA47439.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 885 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 281 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 334 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 335 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 387 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 388 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 443 >gi|217970127|ref|YP_002355361.1| von Willebrand factor type A [Thauera sp. MZ1T] gi|217507454|gb|ACK54465.1| von Willebrand factor type A [Thauera sp. MZ1T] Length = 840 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 57/175 (32%), Gaps = 26/175 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + I++D S SM+ I A +++ A++ ++ + L F + + Sbjct: 266 PLAVKILVDCSGSMQG------DSIAAARRALQAIIAGLREGE------RFSLSRFGSTV 313 Query: 231 EEFFLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 E W S QR L T L + A Sbjct: 314 EHRSRALWRTSAATRQAGQRWAMQLQADLGGTEMENALASTL-ALAGDAEPSPGTEEGAA 372 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG+ +++ A+ G ++ +GI + LR A Sbjct: 373 AVD--LLLITDGQ---IHAIDRTVK---RARALGNRIFVVGIGSAPAEGVLRRLA 419 >gi|62897073|dbj|BAD96477.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] gi|62898698|dbj|BAD97203.1| inter-alpha (globulin) inhibitor H3 variant [Homo sapiens] Length = 890 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE+++ +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|309358766|emb|CAP33788.2| hypothetical protein CBG_15420 [Caenorhabditis briggsae AF16] Length = 876 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 32/190 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++I+ D S + T + N ++ V+ +P + V+ G+V +S + Sbjct: 31 PVLDIIILFDTSG-------GNDTVFEQQK---NWTIKIVRDLPVHEDAVRVGIVQYSER 80 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + LE +K++ G T + L A ++IFD G + Sbjct: 81 NDIITHLE--------TLKFMP--GEDTRTGVALSKADDEIFDYDG------GARLKATR 124 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPNSFYLV 348 +I+ TDG ++ +++G +Y I + I E L ++ + Sbjct: 125 LIIVFTDGLSMDKPT-----LAAKALRRKGVKIYTISVNSIGFVPEMLGIVGDADNVFGP 179 Query: 349 ENPHSMYDAF 358 + + + + Sbjct: 180 TDENRIEERL 189 >gi|227552253|ref|ZP_03982302.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895164|ref|ZP_05674817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293378024|ref|ZP_06624201.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227178583|gb|EEI59555.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257831729|gb|EEV58150.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292643342|gb|EFF61475.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 498 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 55/136 (40%), Gaps = 22/136 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I ++ ++ + + + + G V +S + Sbjct: 310 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGYSIE 363 Query: 230 IEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + ++ G ++ ++K ++ T + L+ A N + G Sbjct: 364 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNG--------- 414 Query: 285 ANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 415 --HKKVIVLLTDGVPT 428 >gi|218196035|gb|EEC78462.1| hypothetical protein OsI_18329 [Oryza sativa Indica Group] Length = 614 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 62/151 (41%), Gaps = 21/151 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ +T+ + A +D++ VLDVS SME +++ +++ + ++ Sbjct: 12 VLIQVTAPPVLEGTARAGVDVVAVLDVSGSMEG------ERLEHVKEAMEIFIGKLGPDD 65 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +V+F+ + L + G + + + L G STN L Sbjct: 66 ------RLSVVSFATSVRRLTELTYMSEQGRAVAKEIVDGLVADG-STNMGAALLEGAMI 118 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + D +G R N ++F++DG N Sbjct: 119 LRDRKGARDESNGRVGC----MMFLSDGTND 145 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 77/201 (38%), Gaps = 36/201 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM++ ++ M+E + + NN+V + + K Sbjct: 700 VAYVVDYSSSMKAVDPTNY--------RGKKMIEFINQLKAKNNIV----IETNTK---A 744 Query: 234 FLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +L G + L++ + SK +T+ G+ A + + D K I Sbjct: 745 TILGEGTTDAVLKKDLYKASKEKGATDIFAGIDIALTKF-----------SNDTKTAKAI 793 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACASP--NSFYL 347 V ++DG+ + NEAKK+G +Y + + + L ++ ++Y Sbjct: 794 VVVSDGK----TSKSKMTKVINEAKKQGVKIYTVSMGKKSQINDATLMQVSTETGGAYYY 849 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 + ++ F + I+ K Sbjct: 850 ALDNLQLHQVFQKLIDAILCK 870 >gi|133925809|ref|NP_002208.3| inter-alpha-trypsin inhibitor heavy chain H3 preproprotein [Homo sapiens] gi|166203665|sp|Q06033|ITIH3_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|77748471|gb|AAI07605.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|77748473|gb|AAI07606.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] gi|78070482|gb|AAI07815.1| Inter-alpha (globulin) inhibitor H3 [Homo sapiens] Length = 890 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE+++ +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 63/198 (31%), Gaps = 27/198 (13%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 K + I + + +A + + V DVS SM+ +++ K Sbjct: 340 KYGFNGLDEYRSEFVPASGDVLIQAQKLWKEKKNANICAVFVADVSGSMDGEPLNNLKK- 398 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLS 251 ++LE K I N GLV++S+ + + S+ + L Sbjct: 399 --------SLLEGQKYIGKDN---LIGLVSYSDDVYINLPISRFDLNNRSYFVGAVGGLQ 447 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 G +T + G+ A + + D K I ++DGE Sbjct: 448 A-GGATATFDGIAVAMKML-------EEQLALDPKLKPKIFVLSDGETNRGHSLNDIRKL 499 Query: 312 CNEAKKRGAIVYAIGIRV 329 + G +Y IG Sbjct: 500 VE---ESGIPIYTIGYNA 514 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 55/149 (36%), Gaps = 30/149 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM+ KI+ A +++ +L + N +V + + +E F Sbjct: 309 VIFVVDRSGSMQG------KKIEQAREAMRYVLNNLHEGDTFN------IVAYDSTVESF 356 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L G STN + L A+ + ++ Sbjct: 357 KPELQKFDDATRKSALAYVDGL-YAGGSTNISGALDSAFAMLT------------GSDRP 403 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 I+F+TDG + + ++ + + K Sbjct: 404 NYILFLTDGLPTAGETNEGKIVELAKQKN 432 >gi|114579064|ref|XP_515640.2| PREDICTED: hypothetical protein LOC459430 isoform 2 [Pan troglodytes] Length = 1294 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+++ G STN+ LK A F + R I Sbjct: 562 EVNEDNLKQAQSWIRHIK-IGSSTNTLSALKTA----FADKETRA------------IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKLFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|85706702|ref|ZP_01037794.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] gi|85668760|gb|EAQ23629.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] Length = 240 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 81/234 (34%), Gaps = 51/234 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLI 212 ++P +S+ D M+V D S SM ++ T+I A ++ +++ Sbjct: 12 ILPGPASMAATGCAT---DAMLVFDGSGSMAEVGHDPTAPTRIIEARVALR------RVM 62 Query: 213 PDVNNVVQSGLVTF--------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 P++ + GL+++ S F + + + I+ L+ G++ + Sbjct: 63 PEIAPYRRIGLLSYGAGGSHPCSGITRHFGPMPDAGAAVVAGIEALTPGGLT-----PIA 117 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + ++ G R H I+V +TDG N + +L A+ R V+ Sbjct: 118 ASVAAAAEVLGYRTHPG--------IVVLVTDG-NETCGGTPCALGTALAAEARDLTVHV 168 Query: 325 IGIRV--------------IRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHI 361 IG RV + C A+ F E + A Sbjct: 169 IGFRVVHDPFSWNSPEAKGYDGQTVAK-CLADATGGLFVSTETVDELVAALRET 221 >gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Pan troglodytes] Length = 1312 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 63/174 (36%), Gaps = 22/174 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK++ ++N +L +++ N+ ++FS+ + Sbjct: 284 VVFVIDVSSSMFG------TKMEQTKMAMNVILSDLQA----NDYFNI--ISFSDTVNVW 331 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQI-FDMQGMRQHCNTEDANY 287 ++ + ++ YL T+ L A + + Q + + Sbjct: 332 KAGGSIQATIQNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQEPGRGPSVGRIPL 391 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+TDGE L +A ++++ LR + Sbjct: 392 ---IIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLS 442 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 71/194 (36%), Gaps = 21/194 (10%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-------SFFDSSITKID 196 + W + I P+ + +Q R DMMI +D+S+SME S S ++ Sbjct: 66 LLSWIVITLCIARPVLIGEPIVTQKAGR-DMMIAVDLSQSMEQKDYLLPSEEPSQDGQLK 124 Query: 197 MA--IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 A + + A+ + + + GL+ F + V Q ++ + Sbjct: 125 AASNVSRLVALKSLLSSFSQQRDGDRLGLIVFGSGAYLQVPFTEDVRLWQTLLEQMDTQM 184 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + G D G+ ++I++ +TDG + S++ D + Sbjct: 185 AGPATAIG---------DAVGLSIRAFERSNTSQRILLLVTDGSDTSSRLDP--VDAARV 233 Query: 315 AKKRGAIVYAIGIR 328 A G ++ +G+ Sbjct: 234 AAAEGIEIFTLGMG 247 >gi|145595544|ref|YP_001159841.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145304881|gb|ABP55463.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 316 Score = 49.4 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 35/231 (15%) Query: 154 IVMPITSSVKVNSQTDARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 +++ T+ + + T L+ +M+ +DVS SM++ D + ++ A ++ + E+ Sbjct: 66 LLVLATALARPSVDTRQPLERATVMLAVDVSLSMQAD-DVAPNRLAAAQEAAQQFVAELP 124 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + GLV+F+ + I L ST + + I Sbjct: 125 DSYN------LGLVSFAKAANVLVPPTKDRQAVLAAINGL-ALAESTATGEAVFTCLEAI 177 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + IV ++DG S + +Q+ A V I Sbjct: 178 RSVPADGAAGIPPAR-----IVLLSDGYRTSGRSVEQAAAAAQAA---NVAVSTIAFGTD 229 Query: 331 RSH------------EFLRAC----ASPNSFYLVENPHSMYDAFSHIGKDI 365 + L + FY + + + +G I Sbjct: 230 GGQVDIGGQRQRVPVDRLALADLAETTDGYFYEAASVSELKQVYQDMGSSI 280 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 52/156 (33%), Gaps = 23/156 (14%) Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGM-RQHCNTED 284 + + + W + + L + T T G+ N + + N Sbjct: 372 DNVNKKTTQAWFDKNNRAVASALKQIIPRGGTAVTSGIFIGTNLMMEKNKDFEAMPNKIG 431 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLY----YCNEAKKR--------------GAIVYAIG 326 N ++I++ ++DGE+ +D C K++ A G Sbjct: 432 TNTRRILMILSDGEDNIPSKDTLVKLMEAGLCTRVKEKIDGLQDSNYPKVETRIAFVAFG 491 Query: 327 IRVI-RSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + E + C + +Y V + +++DAF I Sbjct: 492 FNPPQKQQEVWKKCV-GDQYYSVSSKQALFDAFKQI 526 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/282 (14%), Positives = 92/282 (32%), Gaps = 51/282 (18%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ F N G TI+TA+ + L + ++ + I K L D++ + + Sbjct: 15 VKQFLQNEHGVYTIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAEDNQ 74 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---------------FVNDIDDIVR 112 + +K ++ + F + ++ D + Sbjct: 75 YRKNKDHSDVKRQNVSQQEIEREGRDFSSAKVQAQWKKRNQELVQGLVKLYLRSDDSKGQ 134 Query: 113 STSLDIV-------------VVPQNEGYSISAIS-RYKIPLKFCTFIPWYTNSRHIVMPI 158 S + P+N+ + +I+ + ++ ++PW Sbjct: 135 KNSSPVTIKEPFLAECLEEKTQPKNKNGTAKSIACVVQGSVQRKFWLPWGQTLVSSSQLY 194 Query: 159 TSSVKVNSQT---------DARLDMMIVLDVSRSMESFFDS----------SITKIDMAI 199 V +NS +D+M+V D+SRSM ++ +ID Sbjct: 195 DGRVGINSGKTYAVKEKQITIPIDLMMVTDLSRSMMWAINATGNNPPEVNYPNRRIDALR 254 Query: 200 KSINAMLEEVKLIP---DVNNVVQSGLVTFSNKIEEFFLLEW 238 +++ + + + DV+ + G V+F+ + L Sbjct: 255 EAVEGIEKILLPAQNKGDVSPYNRMGFVSFAAGTRQRDELTN 296 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 65/172 (37%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++ A + ++ + + + Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPDRLSKAKQMLSKLTDGFSND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N + Sbjct: 131 VGLIVFAGDAFTQLPITSDYVSAKMFLSSINPSMVSTQ-GTAIGAAIN-------LAVRS 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T K I+ +TDGEN + ++ A ++G V +G+ + Sbjct: 183 FTPSETSDKAIILITDGENH----EDDAVKAAAAAAEKGIHVNIVGMGDPKG 230 >gi|195438080|ref|XP_002066965.1| GK24276 [Drosophila willistoni] gi|194163050|gb|EDW77951.1| GK24276 [Drosophila willistoni] Length = 1252 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 59/391 (15%), Positives = 126/391 (32%), Gaps = 67/391 (17%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIF-FMKTVLHSMIDRSLVH 60 FS+ + +RN F + + I L + + EV + F + + ++D + Sbjct: 70 FSVVVELRNRFKDNDIEVKRKDGIIL-----LRELAAEVKNFMDFKRNAVMRLMDSAEQA 124 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKN-------------------------TWNMSFR 95 A +++ +G + RI N+S Sbjct: 125 ALSELDGQGANAESPLSQQHYDARRINEYNADGKLADGARHMDIRFMRRFERLPVNLSLS 184 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL--KFCTFIPWYTNS-- 151 + L +G D D+ + + P + ++ F+ + + Sbjct: 185 SILVPHGVDLDEADVKSALQWSAHLDPLFQN-NLEQDPALSWQYFGSSSGFLRRFPGTAW 243 Query: 152 -------RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 ++ + + D+MI+LD S SM D+A+ + Sbjct: 244 PPEGSKGSKLIHDFRTHNWFVQAASSPKDIMILLDASSSMSE------KSFDLAMATAFN 297 Query: 205 MLEEVKLIPDVN-----NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 +L+ + VN VV+S + F +++ V ++ +K + + N Sbjct: 298 ILDTLGEDDYVNLITFSEVVKSPVPCFKDRMVRATP--DNVQEIKSAVKAIK-LQDTANF 354 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 T GL+YA++ + A I+ +T+ + S KE + + + Sbjct: 355 TAGLEYAFSLLHKYNQSGAGSQCNQA-----IMLITESTSESHKEIIKQYNWPHMP---- 405 Query: 320 AIVYAIGIRVIRS-HEFLRACASPNSFYLVE 349 ++ I L A N + V+ Sbjct: 406 VRIFTYLIGKDSGSRSNLHEMACSNKGFFVQ 436 >gi|123438167|ref|XP_001309871.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121891616|gb|EAX96941.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 957 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 66/182 (36%), Gaps = 18/182 (9%) Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D+SRSME + K+ A K I+ + + V GL+ FS+ ++ L Sbjct: 462 DLSRSME-----NSNKLKNAKKVISILAKNVFEFGIGTMW---GLINFSSTVKTVLPLTA 513 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG- 297 S + S+ G T +K A I N K IV +TDG Sbjct: 514 IASEFSMAVNEDSELGDDTKLFEAIKVASETITSKSEYF-------DNVYKRIVVVTDGI 566 Query: 298 EN-LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN-SFYLVENPHSMY 355 +N K D+ I+ I I S + + A+ +F+ + ++ Sbjct: 567 DNDNHYKSDESLQKLTKILTDNKIILDVIFIDESDSRAAVMSQATGGLAFFFKGSEQNLM 626 Query: 356 DA 357 ++ Sbjct: 627 ES 628 >gi|109039132|ref|XP_001085463.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3 [Macaca mulatta] Length = 891 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE++K +N ++ SG + T+ + + Sbjct: 285 VAFVIDISGSMAG------RKLEQTKEALLRILEDMKEEDYLNFILFSGDVSTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEAKTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRVPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 392 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 447 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 60/171 (35%), Gaps = 19/171 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 TDA +M +LD+S SM + + T+I A K I + + ++ GL S Sbjct: 36 TDAGSSIMFLLDISPSMAAKDMNGETRIAAAKKIIRKFVAK----YPGDSF---GLTALS 88 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ LS + + G+ A + + + Sbjct: 89 SSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLAVSSAYMTRTKLNSS------- 141 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 IV +TDGEN + + + ++ + Y IGI + L Sbjct: 142 --YIVLLTDGENNTGEINPKTA--AEVLVNKNIGFYVIGIGS-SGYTTLEY 187 >gi|269796840|ref|YP_003316295.1| von Willebrand factor type A-like protein [Sanguibacter keddieii DSM 10542] gi|269099025|gb|ACZ23461.1| von Willebrand factor type A-like protein [Sanguibacter keddieii DSM 10542] Length = 341 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 19/141 (13%), Positives = 54/141 (38%), Gaps = 11/141 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 A +D+ V+D + SM +D ++D A + +++EE+ + Sbjct: 67 TSTDSAAAAVDVFFVVDRTGSMAAEDYDGESKRLDGAKADVLSVVEEIPGA-------RY 119 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++F ++ L ++ + + + + +++ Q Sbjct: 120 SVISFDSQATRQLPLTTDTRAVRAWTETADRELTYRSRGSLVDRPLDELT---RALQGSV 176 Query: 282 TEDANYKKIIVFMTDGENLST 302 + +++ F++DGEN ++ Sbjct: 177 EQRPANVRLVFFLSDGENTAS 197 >gi|317486519|ref|ZP_07945343.1| von Willebrand factor type A domain-containing protein [Bilophila wadsworthia 3_1_6] gi|316922248|gb|EFV43510.1| von Willebrand factor type A domain-containing protein [Bilophila wadsworthia 3_1_6] Length = 523 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 95/292 (32%), Gaps = 35/292 (11%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 +T L + I+ I+ E + C + + ++ + Sbjct: 233 EETELQNSIEDIFPKTLGTILKERLSVQAGDTE--TEHCTVAKP--RNITPDVIPPDMLR 288 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 +ID I + S+ + + Q+ S + L T + + + T +V +N Sbjct: 289 NIDRITKGLSVRLQGLMQSLSLSAP-YPSTRGRLNTAKLFRIKTGNPKVFIQKTEAVAIN 347 Query: 166 SQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + I+LD S SM + + + + + I N + Sbjct: 348 T------SLHILLDASASMYGKRMELATASCHAIASACSGIRGLNITITAFNGNHRGDAC 401 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L + +I + T P L + Q+ + R Sbjct: 402 S-------VYPLLKSGQPVHARINLMPS--GGTPLAPALWWVMQQLLFTREQR------- 445 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 K+++ +TDG+ Q+++ ++ G VY +G+ +FL Sbjct: 446 ----KMLLVLTDGQPHDMNATQKAIETASKI---GLEVYGLGMLDRSIGDFL 490 >gi|282900972|ref|ZP_06308905.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] gi|281194063|gb|EFA69027.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] Length = 464 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 44/243 (18%), Positives = 82/243 (33%), Gaps = 46/243 (18%) Query: 139 LKFCTFIPWYTNSRHI-VMPITSSVKVNSQTDARLDMMIVLDVSRSMES----------- 186 F+P T + + VM +K + + V+D S SM Sbjct: 9 TPHREFMPADTEGQKLFVMLKLRPLKDIATSLPPTTFTFVIDTSGSMYEVVAGESKPTGV 68 Query: 187 ----------FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 +KID+ I+S+ A++ +L + +V F++ L Sbjct: 69 TYEQDGKEYREVTGGKSKIDIVIESLLALVNSGRL----KQQDRIAIVQFNDSASSIIGL 124 Query: 237 EWGV---SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L F T GL+ A++ + + + K++++F Sbjct: 125 TSATEIKKTKKTAINNLRNFSGGTRMGLGLRRAFDILSEQE----------MTVKRVLLF 174 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA---CASPNSFYLVEN 350 TDG+ ++ Q N+ R V A+G+ + + L C FY+V Sbjct: 175 -TDGQTF---DEDQCQSIANDFATRNIPVTALGVGEEFNEDLLSHLSDCTGGKLFYVVSG 230 Query: 351 PHS 353 Sbjct: 231 IAK 233 >gi|225024147|ref|ZP_03713339.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] gi|224943172|gb|EEG24381.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] Length = 573 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 55/354 (15%), Positives = 112/354 (31%), Gaps = 58/354 (16%) Query: 42 HIFFMKT---VLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 + + L +M ++ AA ++ + G ++ + R Sbjct: 50 DYYESRNARFSLRAMPQAAVKSAAPADLSANAAPMAEAAVGSVATRQMAPPRQNTERYGH 109 Query: 99 RDNGFVNDIDDIVRST-SLDIVVV---------------PQNEGYSISAISRY-KIPL-K 140 + V+ + + ST S+D+ P + I I Y K Sbjct: 110 YEPNPVHAVAEQPVSTFSIDVDTGSYANIRRFLTQTGRLPPADAVRIEEIINYFDYGYAK 169 Query: 141 FCTFIPWYTNSRHIVMPITSSVK----------VNSQTDARLDMMIVLDVSRSMESFFDS 190 P+ ++ + P S K V+ +++ ++DVS SM S Sbjct: 170 PTDGKPFAVHTETVDSPFRSGAKLIRIGIQAKEVSQAALPPANLVFLVDVSGSMYS---- 225 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIK 248 K+ M ++ + + + + LVT+++ + G + + Sbjct: 226 -RDKLPMVKYTLCTLAHQTRAQD------RITLVTYADGNKVVLPPTPGNQRQKILAALD 278 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQ 307 L G ST ++ A Q +R N I+ TDG+ N+ + Sbjct: 279 SLKA-GGSTAGENAIQQA-YQAAQRAYIRNGINR--------ILLATDGDFNVGITDFNT 328 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSMYDAF 358 E +K G + +G +E L A A ++ +++P Sbjct: 329 LRSMVAEKRKSGISLTTLGFGSGNYNERLMEQLADAGDGNYSYIDSPEEAQKVL 382 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 34/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M +VLD S SM + +++ ++ L ++ I + + V G+V F + Sbjct: 307 MCLVLDKSGSM-----AGSNRLNRMNQAAKHFL--MQTIENGSWV---GMVHFDSTAYIK 356 Query: 234 FLL--EWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + ++ L T+ G+K A+ + G + + Sbjct: 357 SKLIQITSSNERNKLLESLPTAASGGTSICRGIKSAFQVLT---GTYPQIDGSE------ 407 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 IV +TDGE+ + C +E ++ GAI++ I + + + + Sbjct: 408 IVLLTDGEDN-------TAGSCVDEVRQSGAIIHFIALGPSADQAVIEMSTITGGKHKYA 460 Query: 349 ENPHS---MYDAFSH 360 + + + DAF+ Sbjct: 461 SDEAANNGLIDAFAA 475 >gi|62531155|gb|AAH92555.1| LOC594926 protein [Xenopus (Silurana) tropicalis] Length = 895 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/279 (16%), Positives = 99/279 (35%), Gaps = 31/279 (11%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +NE G + +G + + + SF E F ID ++ + Sbjct: 181 INEPQGISFLDAQGTFMTNDLLPLVHKSFSGEKGHVSFKPTIDQQRSCSNCSTTQLD--G 238 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++++ + P N + S +K + ++ ++D S SM Sbjct: 239 DFTVTYDVNRETPGNI-----QVVNGYFVHFFAPSKLKEVPKN-----IIFIIDRSISMI 288 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQ 244 K+ +++ +L++VK N V+ G+ + + + ++ + Sbjct: 289 GL------KMQQTKEALLKILDDVKEHDHFNFVIFDWGVEIWEQSLVKATPE--NLNRAK 340 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 ++ L G TN L A + + D + A+ I+FMTDG+ + + Sbjct: 341 AYVRNLYPKGW-TNINDALLSAIS-LLDQAHDARSVPKRSASL---IIFMTDGQPSTGER 395 Query: 305 DQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 + + A+ +Y++G V + FL + Sbjct: 396 NLDKIQ--ENARNAIRGKYSLYSLGFGVGVDYPFLEKLS 432 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 68/213 (31%), Gaps = 29/213 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + V + D + IV+D S SME I ++ N + + N + Sbjct: 66 ITVFNGEDGPVTAGIVIDNSASMEPKRAEVIAAAMAFARASNTRDQMF--VVHFNERARL 123 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 GL E + L+ I G ST + A Q Sbjct: 124 GL-------PERTPFTGKIKELETAISSFD-VGGSTALYDAILLA-----------QSHI 164 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE----FLR 337 ++I++ +TDG + S+K + + K G ++YAIGI + L Sbjct: 165 RGGVYGRRILLVITDGGDNSSKATLE--EAVDAVAKAGVVIYAIGIYDPNDKDQNPKVLA 222 Query: 338 ACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + + + I D+ + Sbjct: 223 HLAEVTGGEAFFPTALSDITRICEEIAADVRRQ 255 >gi|327459721|gb|EGF06061.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1057] Length = 450 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 90/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A T+I + K GD++ KN + + N V Sbjct: 52 EAVSSPIEAAGTEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETS 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 112 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM ID+ +IN + + + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATF 221 Query: 227 SNKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQ 269 S+++ F + + K L+ G T TP A NQ Sbjct: 222 SDEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 279 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 280 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 334 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I + A+ +EN Sbjct: 335 IEG--IYKNFATEGRVLDIEN 353 >gi|238060728|ref|ZP_04605437.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237882539|gb|EEP71367.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 580 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 46/226 (20%), Positives = 77/226 (34%), Gaps = 33/226 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLI 212 + I +V S M+ V+DVS SM+ S + T+ + + + + L Sbjct: 363 PVAIDRAVSSWSIATQSGRMLCVIDVSGSMKQPVPSANNATREQVTVAAASRGLNLFD-- 420 Query: 213 PDVNNVVQSGLVTFSNKIEEFF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + GL TFS ++ LL S L++ + + T + Sbjct: 421 ----DSWSIGLWTFSTELVGTLDYRELVPINLLSSNRSRLEQGLATIRPSSGDTGLYDTM 476 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK--EDQQSLYYCNEAKK--RG 319 AY +Q + +V TDG+N Q+ L +A R Sbjct: 477 LAAYK---TVQEDWEPGRVNS------VVLFTDGKNEDANGISQQKLLAELKQAADPERP 527 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF-SHIG 362 V IGI S L + + ++ E+P + D F I Sbjct: 528 VQVVIIGIGNDVSKSELDSITKVTGGGSFITEDPTKIGDIFLKAIA 573 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 70/189 (37%), Gaps = 16/189 (8%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 ++ + ++I+ A+ L I +G ++ + +++ L S D + V + +Q + Sbjct: 27 RLGHDERANISIIFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAASVASISQ---QS 83 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 G N D ++ N N + SL+ V+ + Sbjct: 84 LGYNAALQMTSDGTVQVAVEEATKLFNGNAANSLGYT------NLSLNAQVMKTGVKLAA 137 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFF 188 + +P F T + + + +T + K S LD ++LDVS SM Sbjct: 138 TVAFSADVPTTFMTVVGYRK------LTVTGTSKSTSSLPPYLDFYLMLDVSGSMGLPST 191 Query: 189 DSSITKIDM 197 D+ T++ Sbjct: 192 DAEQTRLAA 200 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 38/213 (17%), Positives = 59/213 (27%), Gaps = 52/213 (24%) Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL--------QR 245 + D ++ +L N + GL F + + L ++ Sbjct: 288 RADAVGAAVQQLLVTANATQKTPNQFRIGLYPFVRYLYAYSPLTASINGSPTTPGTINHA 347 Query: 246 KIKYLSKFGVSTNSTPG-----LKYAY---NQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S+ N++ G + A+ N I G N + +TDG Sbjct: 348 AANLASQLDTGANASLGSGGTHFENAFPTMNGIITSVGDGSASNKTQP----YVFLITDG 403 Query: 298 ENL-----------STKEDQQSLYYCNEAKKRGAIVYAIGI---------RVIRSHEF-- 335 S C K RG IV + I S +F Sbjct: 404 AQNPQVYWNGSWSGSNSATTMDTSKCTTLKSRGIIVSVLYIPYQPIQNPTSFANSEDFYA 463 Query: 336 ----------LRACASPNSFYLVENPHSMYDAF 358 L+ACASP FY +P + A Sbjct: 464 NANIPKIPPSLQACASPGYFYTANSPADITAAL 496 >gi|166031603|ref|ZP_02234432.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] gi|166028580|gb|EDR47337.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] Length = 685 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L + +V DVS SM + ++ A + ++ + V+ D ++V+ L +F Sbjct: 338 NEKEALKVDMVADVSGSM------DGSPLNEAKQVMSDFVGSVQ--FDAGDLVE--LTSF 387 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S + + L I L G T+ L A ++ G R Sbjct: 388 STGVCLEQEFSDDAATLTNDINNLVT-GDMTSLYDALYTAVERVAAQNGARC-------- 438 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSF 345 ++ TDG + + ++ N A + V+ IGI I + + + Sbjct: 439 ----VIAFTDGNDNYSNCTKE--DVVNVANRYHVPVFIIGIGSIDYADVNDIATQTGGMY 492 Query: 346 YLVENPHSMYDAFSHI 361 Y V + SM + I Sbjct: 493 YNVSDVTSMDKIYEEI 508 >gi|294674672|ref|YP_003575288.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294472605|gb|ADE81994.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 348 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 46/133 (34%), Gaps = 12/133 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++DVS SM IT+++ + +++E++ P V ++ F+ K Sbjct: 3 RLPIYFLIDVSESM---VGDPITQVE---NGMRNIIQELRTDPYALETVFVSVIAFAGKA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L + G T+ L Y + + D I Sbjct: 57 KSLSPLT----ELYKFYPPKFPIGGGTSLGTALNYLMDDMEKSVQKTTSEMKGDWKP--I 110 Query: 291 IVFMTDGENLSTK 303 I TDG Sbjct: 111 IFLFTDGTPTDNP 123 >gi|326442267|ref|ZP_08217001.1| hypothetical protein SclaA2_14434 [Streptomyces clavuligerus ATCC 27064] Length = 748 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 70/200 (35%), Gaps = 24/200 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + +VLD S SM ++ D S + + ++ A L+E + Sbjct: 556 RAGVAAKGDARVYLVLDRSGSMRPYYKDGSAQNLGERVLALAAHLDEA---------ARV 606 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +V FS +I+ L + + +I L S + + ++ + Sbjct: 607 EVVFFSTEIDGTGELT--ATAHRGRIDELHG------SLGRMGRT-SYHLAVEEVLARHA 657 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRAC 339 ++VF TDG S K + + G + + F LR Sbjct: 658 KAAPGAPALVVFQTDGAPES-KTAATAALASAATAEPGMFWQFVAFGERDAKAFDYLRRL 716 Query: 340 ASPNS-FYLVE-NPHSMYDA 357 A+PN+ F+ P ++ DA Sbjct: 717 AAPNAGFFHAGPEPAALTDA 736 >gi|224065911|ref|XP_002191398.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Taeniopygia guttata] Length = 809 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 74/177 (41%), Gaps = 26/177 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ V+D S SM KI+ ++ +L++++ + +TF+NK Sbjct: 243 FPKNVIFVIDRSGSMTG------RKIEQTRDALLKILQDLRQEDHFS------FITFNNK 290 Query: 230 IEE-----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + E E V+ ++ L+ G T+ + L A + +G+ + + Sbjct: 291 VVEWKSSLLPATEENVASAAALVQTLTARG-GTDISGALLAAVGVLEKAEGLPERSIS-- 347 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +I+ +TDG+ S +++ + + EA ++ +G S++FL A Sbjct: 348 -----MIILLTDGQPTSGEKNVEVIQEKVQEAINGKYALFCLGFGFDVSYKFLEKMA 399 >gi|188580652|ref|YP_001924097.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] gi|179344150|gb|ACB79562.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] Length = 723 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 68/202 (33%), Gaps = 32/202 (15%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F + ++ +T D++ V+D S SM I A S+ Sbjct: 308 FRERVVGAETVLALVTPPEGAAPAVALPRDVVFVIDNSGSMGG------ASIRQAKASLL 361 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTN 258 L+ ++ + ++ F + + F E ++ +R + L G + Sbjct: 362 IGLDRLRPGD------RFNVIRFDHSFDTLFPDVVPADESHLARAKRFVSGLEASGGTEM 415 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 P ++ ED + IVF+TDG ++ ++ A++ Sbjct: 416 LAP-----------LRAALADATPEDTARLRQIVFLTDG----AIGNEAQIFSAIAAERG 460 Query: 319 GAIVYAIGIRVIRSHEFLRACA 340 + ++ +GI + + A Sbjct: 461 RSRLFMVGIGSAPNGYLMSHAA 482 >gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 710 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 93/286 (32%), Gaps = 45/286 (15%) Query: 88 NTWNMSFRNELRDN-GFVNDIDDIVRSTSLDIVVVPQ-------NEGYSISAISRYK--- 136 S N+L +N F I + L++ N + R Sbjct: 146 KYQKGSVTNDLPENFEFTIKIMTMKEIKKLNVTAKGNMKIIDNHNAIFETKTTPRDDAII 205 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 I S + I++ + + + ++D S SM ++I+ Sbjct: 206 IETSIKDEDKGIAISSDGYIAISTYPYFEGKVENNSEFYFLIDCSGSMYG------SRIE 259 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH----LQRKIKYLSK 252 A +N ++ + I + ++ F EE F + + R+IK L Sbjct: 260 NAKFCLNLLIHSL-PIDS-----RFSIIKFGTSYEEIFPICDYTNKNVKIAMRQIKDLDA 313 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T+ L+Y Y Q R+ I +TDG+ + + Sbjct: 314 DMDGTDILSPLEYVYTQTTKNGYHRK------------IFLLTDGQ----VHNSDVICSL 357 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYD 356 + K+ +YAIG+ ++ + S ++ L+ + +M + Sbjct: 358 AQEKRDNNRIYAIGLGSGADPGLIKNVSLKSWGNYVLIADKDNMNE 403 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 64/162 (39%), Gaps = 18/162 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM+ ++DVSRSM + D ++ A I+ + V + ++ F+ K Sbjct: 90 DMVFLIDVSRSMRAA-DVQPDRLTQAKFEISRISRAVTGG-------RRAIILFAAKPLV 141 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + S T+ L+ A+N + R + +K Sbjct: 142 QCPLTADMDAFEALLGMASPDLIEAQGTDFRSALELAHNVLEPSSESRLASAAKG---EK 198 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 I+V ++DGE+ + N K V+AIG+ +IR Sbjct: 199 IMVLLSDGEDHA----GDLPAAANRIKNGRIHVFAIGVGLIR 236 >gi|38348336|ref|NP_940912.1| inter-alpha-trypsin inhibitor heavy chain H5-like protein precursor [Homo sapiens] gi|74762375|sp|Q6UXX5|ITH5L_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5-like protein; Short=Inter-alpha inhibitor H5-like protein; Flags: Precursor gi|37181448|gb|AAQ88537.1| ITI-like protein [Homo sapiens] gi|57208182|emb|CAI42356.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|57209935|emb|CAI42344.1| inter-alpha (globulin) inhibitor H5-like [Homo sapiens] gi|119613592|gb|EAW93186.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_a [Homo sapiens] gi|162318154|gb|AAI57043.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] gi|162318494|gb|AAI56206.1| Inter-alpha (globulin) inhibitor H5-like [synthetic construct] Length = 1313 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 63/174 (36%), Gaps = 22/174 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK++ ++N +L +++ N+ ++FS+ + Sbjct: 284 VVFVIDVSSSMFG------TKMEQTKTAMNVILSDLQA----NDYFNI--ISFSDTVNVW 331 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQI-FDMQGMRQHCNTEDANY 287 ++ + ++ YL T+ L A + + Q + + Sbjct: 332 KAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPL 391 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+TDGE L +A ++++ LR + Sbjct: 392 ---IIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLS 442 >gi|118356595|ref|XP_001011553.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89293320|gb|EAR91308.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 899 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 49/335 (14%), Positives = 102/335 (30%), Gaps = 72/335 (21%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDN---GFVNDIDDIVRSTSLDIVVVPQ 123 + NN + + R N +NE ++ + + ++ + + Q Sbjct: 162 EKSEINNNSDISQNNQKSRSTNPKKREIKNEQQEEIEETKQDYVKQTPKTVVIKKELPAQ 221 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + + S+ + F T + + +PI S+ + LD+M ++D + S Sbjct: 222 VKQFIDQYYSKNEEARNFITNFHFLKDDMTYFLPIFQSLFPKME----LDLMFIVDCTGS 277 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SNKIEEFFLLEWG 239 M S+ ID I ++ +K + + + V + ++ F Sbjct: 278 MSSW-------IDAVKLEITGIVAAIKNQHHGSQI-RVSFVGYRDYGDSERYSIFNFSED 329 Query: 240 VSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + Q I + G + + G K A +Q + Q K V + D Sbjct: 330 LEKFQDFISKVQACGGNDAAEDVAGGFKQANSQNWKSQA-------------KYAVLLAD 376 Query: 297 ---------GE--------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 G+ + K+G +YA+ I Sbjct: 377 APAHGIQYHGDKADFYDRYPKGDPDGIDLKKEFQNLIKKGVKLYAVEIM----------- 425 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + MYD F K++ +++ K Sbjct: 426 ---------NSTKMMYDIFQQYNKEVNGQQLDITK 451 >gi|262164956|ref|ZP_06032694.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] gi|262027336|gb|EEY46003.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] Length = 403 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 78/207 (37%), Gaps = 21/207 (10%) Query: 25 IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH--AATQIMNEGNGNNRKKLKGGDI 82 + P + ++L +++S F L + + + A+ + +E N + +K+ + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLALIASPKEDDENNVSYARKVVDRYV 60 Query: 83 LCRIKNT-WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + I + + + +G V + T + +++ + I + Sbjct: 61 VDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSW---------ISYEN 111 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + P +T + S +D+ ++D+S SM + + + ++ID + Sbjct: 112 ISLKPEFT--------VNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKEV 163 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSN 228 I ++ ++K + L+ + N Sbjct: 164 ITRVVNDLKGFDTEVK-SRVSLLAYHN 189 >gi|254390370|ref|ZP_05005587.1| hypothetical protein SSCG_02914 [Streptomyces clavuligerus ATCC 27064] gi|294813849|ref|ZP_06772492.1| toxic cation resistance protein [Streptomyces clavuligerus ATCC 27064] gi|197704074|gb|EDY49886.1| hypothetical protein SSCG_02914 [Streptomyces clavuligerus ATCC 27064] gi|294326448|gb|EFG08091.1| toxic cation resistance protein [Streptomyces clavuligerus ATCC 27064] Length = 791 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 70/200 (35%), Gaps = 24/200 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + +VLD S SM ++ D S + + ++ A L+E + Sbjct: 599 RAGVAAKGDARVYLVLDRSGSMRPYYKDGSAQNLGERVLALAAHLDEA---------ARV 649 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +V FS +I+ L + + +I L S + + ++ + Sbjct: 650 EVVFFSTEIDGTGELT--ATAHRGRIDELHG------SLGRMGRT-SYHLAVEEVLARHA 700 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRAC 339 ++VF TDG S K + + G + + F LR Sbjct: 701 KAAPGAPALVVFQTDGAPES-KTAATAALASAATAEPGMFWQFVAFGERDAKAFDYLRRL 759 Query: 340 ASPNS-FYLVE-NPHSMYDA 357 A+PN+ F+ P ++ DA Sbjct: 760 AAPNAGFFHAGPEPAALTDA 779 >gi|195051568|ref|XP_001993124.1| GH13254 [Drosophila grimshawi] gi|193900183|gb|EDV99049.1| GH13254 [Drosophila grimshawi] Length = 1237 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 70/188 (37%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+A + +L+ + VN +VV++ Sbjct: 255 ASSPKDIMILLDASSSMSE------KSFDLATSTAFNILDTLGEDDYVNLITFSDVVKTP 308 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ V ++ +K + + N T GL+YA++ + Sbjct: 309 VPCFKDRMVRATP--DNVQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 365 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S KE + + + ++ I L A Sbjct: 366 NQA-----IMLITESTSESHKEIIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHEMAC 416 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 417 SNKGFFVQ 424 >gi|327270790|ref|XP_003220171.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 866 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 90/242 (37%), Gaps = 39/242 (16%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + YK + + + +SR + P + +T R+ + +VLD S M Sbjct: 273 KMCSYKSTWEVISGSADFASSRPLGAPPPDPIIHLMKTRERI-VCLVLDASAQM-----G 326 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 +++ I++ L + + G+VTF++K L+ S ++R+ L Sbjct: 327 KDNRLNRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQAGLQKIFSDIERQ--SL 379 Query: 251 SKFGVSTNSTP-----GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + +T + G+ A+ + C IV +T+GE Sbjct: 380 TSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE---------IVLLTNGEGSD---- 426 Query: 306 QQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE--NPHSMYDAFSH 360 L C ++ + + I++ I S+E L +FY + + + + DAF Sbjct: 427 ---LSPCLSKIQSQEIIIHTIAFGSKASNELEKLADMTGGKTFYATDSLDSNGLIDAFGG 483 Query: 361 IG 362 I Sbjct: 484 IS 485 >gi|312068041|ref|XP_003137027.1| hypothetical protein LOAG_01440 [Loa loa] gi|307767808|gb|EFO27042.1| hypothetical protein LOAG_01440 [Loa loa] Length = 808 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 56/163 (34%), Gaps = 18/163 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D +LD++ +LD S S+ + + + V +I Q ++ +S+ Sbjct: 37 DRQLDLLFILDGSGSVSGN---------TFATQMAMLNKIVDMIEIGPKNTQIAVMQYSS 87 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++E F L+ ++ + +T + L A + N +D Sbjct: 88 YTRVEFNFSANPNKESLRASLQKIRHISGTTKTGKALDKALHVFRHDSSFGARLNQDDVA 147 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++ +TDG + ++ G + +GI Sbjct: 148 QVAVV--VTDGHSHDDPIPAAVR-----LRQAGVQILTLGIGA 183 >gi|301113718|ref|XP_002998629.1| alpha kinase [Phytophthora infestans T30-4] gi|262111930|gb|EEY69982.1| alpha kinase [Phytophthora infestans T30-4] Length = 748 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 75/195 (38%), Gaps = 44/195 (22%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + +++ LD++IV+D + SM + ID +I ++++ VK N Sbjct: 113 VVKEIQLKYAKANALDLVIVMDCTGSMGPW-------IDEVKSAIVSIIDNVKKDHPSAN 165 Query: 218 VVQSGLVTFSN-----KIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQ 269 V+ G V + + K + + L V+ +++ I L+ FG + GL+ A Sbjct: 166 -VRVGFVAYRDFCDGVKRLQVYHLTSDVAAVRKFIFRLAAFGGGDGPEDIPGGLEAALAM 224 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTD----GENLSTKEDQQ--------SLYYCNEAKK 317 F+ Q R IV + D G + ED + S C + ++ Sbjct: 225 PFNAQARR-------------IVLVGDSPCHGSRFNDGEDDRRYLFQIQQSPDICAQMRE 271 Query: 318 ---RGAIVYAIGIRV 329 RG I I+ Sbjct: 272 MVRRGIDFTIIEIQP 286 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICRGLRSAFTVI-----KKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISECFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 75/214 (35%), Gaps = 21/214 (9%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ K ++ A ++ ++LD S SM D ++K + A K I+ + Sbjct: 17 LTSPAFAAEKTETEAKAPANVAVLLDASGSMAKRIDG-VSKFNSAKKEISKFASSLPEGT 75 Query: 214 DVNNVV--QSGLVTFSNKIEEFFLLE--WGVSHL--QRKIKYLSKFGVSTNSTPGLKYAY 267 V V G S K++ + +G Q + L+ G + + Sbjct: 76 QVKMSVFGSEGNNKNSGKVQSCEAIRNVYGFQSFNEQSFLNSLNTIGPTGWTPIA----- 130 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + DA +K++ +TDGE + E +K V IG Sbjct: 131 ---KALNEAKSSFDQLDAKGEKVVYLLTDGE---ETCGGNPIKTAKELQKENITVNVIGF 184 Query: 328 RVIRSHE-FLRACAS--PNSFYLVENPHSMYDAF 358 ++ L A A ++ V + F Sbjct: 185 DYKEGYKGQLNAIAKVGGGEYFPVYTQKDVEKIF 218 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 26/134 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K++ +++ +++ + + +V FS Sbjct: 124 PLDLVTVLDVSGSMAG------KKMERVKRAMGFLIDNLGSDD------RLSVVAFSTDA 171 Query: 231 EEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L G + +R ++ L+ G STN GL A + R+H N + Sbjct: 172 RRIIRLTRMSDDGKAAAKRAVESLAASG-STNIRGGLDVAAMVL----DGRRHKNAVAS- 225 Query: 287 YKKIIVFMTDGENL 300 ++ ++DG++ Sbjct: 226 ----VILLSDGQDN 235 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|74011920|ref|XP_548489.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Canis familiaris] Length = 897 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 78/192 (40%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 ++ V+DVS SM K++ ++ +L ++K +N ++ SG ++T+ + + + Sbjct: 291 VVFVIDVSGSMHG------RKMEQTKDALLKILGDMKGEDYLNFILFSGDVITWKDDLVQ 344 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + II+ Sbjct: 345 ATPE--NIEEARIFVKNIHDRGL-TNINDGLLRGISMLNRARE--EHRVPERSTS--III 397 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N+ ++ A +Y +G ++ FL + A N Y Sbjct: 398 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESMALENHGLARRIY 457 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 458 EDSDANLQLQGF 469 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/252 (13%), Positives = 81/252 (32%), Gaps = 39/252 (15%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I A+ + ++L +++VS + + + +D + + + Sbjct: 20 RRLRRDDRGAIAIQFALLALPLSILLFGVLDVSRLSLQRRQMQDALDAATL-----MAAR 74 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 L + M+ + F + Sbjct: 75 SAATASADLDTTGDAAFLAEIAGMNLGLTASSSTFSVGTGN------------------- 115 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I LK W + TS V +S+ L++ +VLD++ SM Sbjct: 116 -RVIGTATATLKPIIANLWQAGDFTVT--ATSEVVRSSKN---LEVALVLDITGSM---- 165 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRK 246 S T+I + + +++ V + LV ++ ++ + + G ++ Sbjct: 166 --SGTRIADLKVAASDLVDIVIRDTQTPFYSKVALVPYAAGVNVDTYADMARGPIPVRN- 222 Query: 247 IKYLSKFGVSTN 258 I ++ T+ Sbjct: 223 ISNVAWLATGTS 234 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 63/204 (30%), Gaps = 44/204 (21%) Query: 210 KLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 N + ++S + L + L+ +I S G ST GL + Sbjct: 425 STAFVGTNYPSTSADSYSPNPCPSAKITPLSSDKTALKAQINNYS-VGGSTAGQIGLAWG 483 Query: 267 YNQIFDM------QGMRQHCNTEDANYKKIIVFMTDGENLSTKED--------------- 305 + + ++ + + K+++ MTDG + + Sbjct: 484 WYMVAPNFGYIWPSASQRPAAYKSKDLMKVVIMMTDGAFNTPYCNGVIAANAGIGSGSDE 543 Query: 306 -------------QQSLYYCNEAKK--RGAIVYAIGIRVIR---SHEFLRACAS-PNSFY 346 Q+ C K +Y +G V + FL CAS + + Sbjct: 544 DHINCNATNGDPFAQARALCTVIKNSANDITLYTVGFAVGSDYTAKTFLTDCASDSSKAF 603 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 + +F+ I ++I + RI Sbjct: 604 FPATGSELKASFTAIAREISSLRI 627 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 16/166 (9%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F+ +++G + LT I +P++ ++I+V + T L + +D + A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL-- 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV-VVPQNEG 126 +G + + + +I N F + + + +V + G Sbjct: 64 --DGRDDAITRARTAIEKIAN-----------SAAFSGGGTGMSLGSHISVVYNAGNDAG 110 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL 172 +++ + IP T IP + + V Q + Sbjct: 111 STVTVLFLKNIPANDDTAIPSSMQTTVASEASYAWVIAKPQAMQTI 156 >gi|229822438|ref|YP_002883964.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229568351|gb|ACQ82202.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 399 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 47/129 (36%), Gaps = 11/129 (8%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + V+D + SM ++ ++D + A+ E + + ++ + ++ Sbjct: 71 LFFVVDRTGSMAAEDWNGGAPRLDGVRNDLVALTEAMAGA-------RYSIIGWDSQATR 123 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L ++ L + + ++ + + D N + +++ Sbjct: 124 QLPLTTDARAVRSWADTLRQEVSAYSAGTAVDRPLEALRDALEGAAERNPGNV---RLVF 180 Query: 293 FMTDGENLS 301 F++DGEN + Sbjct: 181 FLSDGENTN 189 >gi|15897953|ref|NP_342558.1| hypothetical protein SSO1089 [Sulfolobus solfataricus P2] gi|13814278|gb|AAK41348.1| Hypothetical protein SSO1089 [Sulfolobus solfataricus P2] Length = 436 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 72/209 (34%), Gaps = 36/209 (17%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V I V + + +I++D S SM K++ A++S +L + Sbjct: 102 VGFIVYIVPQQGAITSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLLYSLNEGDY 155 Query: 215 VNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V L+ F ++ ++ G+ I + G +T + +A N Sbjct: 156 VT------LILFSNHPEIKYQGPAKGI------ITFDVGKGYTTRLHEAVNFALN----- 198 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + K I+ +TDG+ + + + + + IGI + Sbjct: 199 ------LAKQSQVPNK-IIMLTDGKPTDKR-NVKDYEKFDIPPNT--QIITIGIGSDYNE 248 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSH 360 L+ A S FY +++ + D F Sbjct: 249 RILKKLADKSSGKFYHLKDISELPDVFES 277 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 79/214 (36%), Gaps = 29/214 (13%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 M ++V N+ +++ +++D S S++ D + A ++++ ++ Sbjct: 42 DGQYSFTMEGKTTVTANT-----VNVAVIIDSSGSVDD--DEWDMSMAFAKDAVSSFADQ 94 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 L + + + + +++ F+ LE + + KY T+ G+ Sbjct: 95 -NLFTNGGSASIAQFSSSASEGGTFYSLEDFNAFVDGNTKY---SSGGTDIIDGIAKGRE 150 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + ++ TDG++ S K + + A+ G IVYA+G+ Sbjct: 151 LLKASPATTS-----------FMIVTTDGQSSSPKAE------ADAARDEGTIVYAVGVG 193 Query: 329 VIRSHE-FLRACASPNSFYLVENPHSMYDAFSHI 361 + E L + + V+ + A + I Sbjct: 194 TGPTQEILLDIGGEEANVFDVDGFDELDVALAGI 227 >gi|261414285|gb|ACX83559.1| plant-derived anthrax toxin receptor-Fc fusion protein [synthetic construct] Length = 171 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 68/195 (34%), Gaps = 27/195 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ + + I ++++ V+ ++ + FS++ Sbjct: 1 DLYFVLDKSGSVANNW-----------IEIYNFVQQLAERF-VSPEMRLSFIVFSSQATI 48 Query: 233 FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + ++ L + T GLK A QI G++ I Sbjct: 49 ILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLKTSS---------I 99 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I+ +TDG+ ++ GA VY +G+ + R S + V+ Sbjct: 100 IIALTDGKLDGLVPSYAEKE-AKISRSLGASVYCVGVLDFEQAQLERIADSKEQVFPVKG 158 Query: 351 PHSMYDAFSHIGKDI 365 + A I I Sbjct: 159 G---FQALKGIINSI 170 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 78/201 (38%), Gaps = 36/201 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM++ ++ M+E + + NN+V + + K Sbjct: 699 IAYVVDYSSSMKAVDPTNY--------RGKKMIELINQLKAKNNIV----IETNTK---A 743 Query: 234 FLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +L G + L++ + SK +T+ G+ A + + D+ K I Sbjct: 744 TVLGEGTTENVLKKDLYKASKEKGATDIFAGIDIALTKF-----------SNDSKTSKAI 792 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACAS--PNSFYL 347 V ++DG+ + NEAKK+G VY + + + L +S +++ Sbjct: 793 VVVSDGK----TSKSKMTKVINEAKKQGVKVYTVSMGKKSQVNDATLMQLSSETSGAYFH 848 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 + ++ F + I+ K Sbjct: 849 AIDNMQLHQVFQKLIDTILCK 869 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLRTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|256084538|ref|XP_002578485.1| hypothetical protein [Schistosoma mansoni] gi|238663860|emb|CAZ34723.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 828 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 89/269 (33%), Gaps = 40/269 (14%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + +++ + +N+ V + + L +V+ + S+S Y P + Sbjct: 211 DVKSTHDSFTWNCKNDSNH-SLVKLSSEFIPDHDLQMVLSLSDPLTSLS-SLEYGDPNQS 268 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + V S D R + + ++D S SME I A S Sbjct: 269 SILAMNCLMVQLL----PDIPNVVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTS 318 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKY---LSKFGVS 256 + L+ + + + ++ F + F ++ L + Y L+ Sbjct: 319 LLLFLKSLPMS------CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGG 372 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEA 315 T + LK A H + K I+F+TDG+ + + ++A Sbjct: 373 TEAYNALKAAL-----------HSTPSGEGWFKQIIFLTDGDVGNADEVIGLVRMNVDKA 421 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + V+ IG+ S + A + Sbjct: 422 R-----VFTIGLGQGVSTALIGGVARAGN 445 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|319952925|ref|YP_004164192.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421585|gb|ADV48694.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 211 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 49/139 (35%), Gaps = 10/139 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +LD S SM I+ +++ M+ ++ + + +VTF ++ Sbjct: 3 RLPVYFLLDTSGSM------VGEPIEALNNALSGMINTLRSDAQASETLWISIVTFDREV 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +E L S +I T + L Y + ++ D Sbjct: 57 KEIMPLTDLQSFQLPEITC--PQSGPTFTGKALDYLHEKVTKDLRKGTPEQKGDWKPLLF 114 Query: 291 IVFMTDGENLSTKEDQQSL 309 TDG+ ++ ++ + Sbjct: 115 --LFTDGKPSDVQQYKEVI 131 >gi|291448990|ref|ZP_06588380.1| secreted protein [Streptomyces roseosporus NRRL 15998] gi|291351937|gb|EFE78841.1| secreted protein [Streptomyces roseosporus NRRL 15998] Length = 420 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 76/214 (35%), Gaps = 35/214 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------- 226 + +VLDVS SM + +++ A ++ N +L+ V V +++ + Sbjct: 35 VELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAVP--EQVQLGIRTLGADYPGEDRKV 92 Query: 227 ----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ L+ + + + L+ G T P L A + + +G R+ Sbjct: 93 GCKDTRQLYPVGPLD--RTEAKTAVATLAPTGF-TPIGPALLGAADDLEGGEGSRR---- 145 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRAC- 339 IV +TDGE+ D + A+ +V +G+ + C Sbjct: 146 --------IVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQL--TCI 195 Query: 340 --ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ ++ V++ + + + Sbjct: 196 AEATGGTYTAVQHKEELSGRVKQLVDRAAEPVVT 229 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ ++DVS SM ++D+ S+ ++ ++ V V +G Sbjct: 240 EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ ++ I L G ST G+ AY +I + N Sbjct: 295 ----NAGVKLEATPGSDKQKIREAIDELEASG-STAGGEGIMLAY-KIAQKNFISGGNNR 348 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ N+ D++ + +K G + +G + + Sbjct: 349 --------IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKD 393 >gi|239945536|ref|ZP_04697473.1| hypothetical protein SrosN15_31402 [Streptomyces roseosporus NRRL 15998] gi|239992004|ref|ZP_04712668.1| hypothetical protein SrosN1_32194 [Streptomyces roseosporus NRRL 11379] Length = 416 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 76/214 (35%), Gaps = 35/214 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF------- 226 + +VLDVS SM + +++ A ++ N +L+ V V +++ + Sbjct: 31 VELVLDVSGSMRTRDIDGQSRMAAAKQAFNDVLDAVP--EQVQLGIRTLGADYPGEDRKV 88 Query: 227 ----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ L+ + + + L+ G T P L A + + +G R+ Sbjct: 89 GCKDTRQLYPVGPLD--RTEAKTAVATLAPTGF-TPIGPALLGAADDLEGGEGSRR---- 141 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRAC- 339 IV +TDGE+ D + A+ +V +G+ + C Sbjct: 142 --------IVLITDGEDTCGPLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQL--TCI 191 Query: 340 --ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ ++ V++ + + + Sbjct: 192 AEATGGTYTAVQHKEELSGRVKQLVDRAAEPVVT 225 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 59/189 (31%), Gaps = 24/189 (12%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F+ +++G + LT I +P++ ++I+V + T L + +D + A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDG 65 Query: 68 EGNGNNRKKLKGGDILCRIK---------------------NTWNMSFRNELRDNGFVND 106 + +R + I N + N ND Sbjct: 66 RDDAISRAQTAIEKIANSAAFSGGGAGMSLGSHISVTYDAGNDAGSTVTVFFLKNIPAND 125 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I S V Y+ + F + + ++ +I + ++ Sbjct: 126 DTSIPDSMK---TTVASEASYAWVIAKPQAMQTIFPVPVGFTRSTINIAADAVAVYHASA 182 Query: 167 QTDARLDMM 175 + + Sbjct: 183 CDVTPIFIC 191 >gi|125575071|gb|EAZ16355.1| hypothetical protein OsJ_31817 [Oryza sativa Japonica Group] Length = 579 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 84/206 (40%), Gaps = 38/206 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K+ + K++ +++ + + +V+FS + Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L E G + +R ++ L +TN GL+ A + G R+H N + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLV-DDSATNIGDGLRVAGRVL----GDRRHKNAVSS- 243 Query: 287 YKKIIVFMTDGENLST---KEDQQS---LYYCNEAKKRG----AIVYAIGIRVIRSHEFL 336 ++ ++DG++ + + S L + A G A ++ G + Sbjct: 244 ----VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAM 299 Query: 337 RACA--SPNSFYLVENPHSMYDAFSH 360 A + +F VEN ++ D+F+ Sbjct: 300 NTIAESTGGTFSFVENEAAIQDSFAQ 325 >gi|125532270|gb|EAY78835.1| hypothetical protein OsI_33940 [Oryza sativa Indica Group] Length = 606 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 84/206 (40%), Gaps = 38/206 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K+ + K++ +++ + + +V+FS + Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L E G + +R ++ L +TN GL+ A + G R+H N + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLV-DDSATNIGDGLRVAGRVL----GDRRHKNAVSS- 243 Query: 287 YKKIIVFMTDGENLST---KEDQQS---LYYCNEAKKRG----AIVYAIGIRVIRSHEFL 336 ++ ++DG++ + + S L + A G A ++ G + Sbjct: 244 ----VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGRLAPIHTFGFGADHDAAAM 299 Query: 337 RACA--SPNSFYLVENPHSMYDAFSH 360 A + +F VEN ++ D+F+ Sbjct: 300 NTIAESTGGTFSFVENEAAIQDSFAQ 325 >gi|22758317|gb|AAN05521.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432564|gb|AAP54179.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 606 Score = 49.4 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 84/206 (40%), Gaps = 38/206 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ VLDVS SM K+ + K++ +++ + + +V+FS + Sbjct: 142 PLDLVTVLDVSGSMAG------RKLALVKKAMGFVIDNLGPAD------RLCVVSFSTEA 189 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L E G + +R ++ L +TN GL+ A + G R+H N + Sbjct: 190 SRRTRLLRMSEVGKATAKRAVESLV-DDSATNIGDGLRVAGRVL----GDRRHKNAVSS- 243 Query: 287 YKKIIVFMTDGENLST---KEDQQS---LYYCNEAKKRG----AIVYAIGIRVIRSHEFL 336 ++ ++DG++ + + S L + A G A ++ G + Sbjct: 244 ----VILLSDGKDSYVVPRRGNGMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAM 299 Query: 337 RACA--SPNSFYLVENPHSMYDAFSH 360 A + +F VEN ++ D+F+ Sbjct: 300 NTIAESTGGTFSFVENEAAIQDSFAQ 325 >gi|323699770|ref|ZP_08111682.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323459702|gb|EGB15567.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 2034 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 64/211 (30%), Gaps = 36/211 (17%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + V +T LD+++++D S SM+ ++ +D K I A+ Sbjct: 398 EGGSPARVVSVERTRLPLDIVLLVDSSGSMKGQMRNA---LDATRKFIAAL--------P 446 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + V+ +V F K + + + G + + D Sbjct: 447 ADARVR--VVDFDTKPRALPGET--RDAALKGLAGIKANGATC------------LNDAV 490 Query: 275 GMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + H ++ TDG + + K G V+ IG Sbjct: 491 LLGLHMLAGAKRP--ALLVFTDGFDANFNDTGPGSKATRREVLDAVKTGGVPVFTIGFGK 548 Query: 330 IRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 L A S +Y +P ++ AF Sbjct: 549 GHDVSTLDRIASLSGGRYYPASDPGALDKAF 579 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 67/192 (34%), Gaps = 22/192 (11%) Query: 178 LDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +D+S SM+ F ++D ++ L + + V+ G +TFS++ + Sbjct: 612 VDISGSMDKTPDFSGCNYRMDKVKAILHDFLAAL-----PDEVLAQG-MTFSDQNVIEQV 665 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + + L G T + + R + ++F+T Sbjct: 666 TTANTGEMLAAMNDLYADG-GTEIAGAVAAVLETQRAIPSTR-----------RYLLFIT 713 Query: 296 DGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVENPHS 353 D ED+ + + G +GI + F RA S S+ L E+P Sbjct: 714 DAALDVEPEDKLFFETTLAKLRDEGVYCLWVGIGELDPAPFKRAAEISGGSYVLTEDPAE 773 Query: 354 MYDAFSHIGKDI 365 + AF + DI Sbjct: 774 LGRAFDGLVADI 785 >gi|302796872|ref|XP_002980197.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] gi|300151813|gb|EFJ18457.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] Length = 542 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 79/207 (38%), Gaps = 31/207 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + A +D++ VLDVS SM K+++ ++ ++ ++ + Sbjct: 72 DSSDDARAPVDLVTVLDVSGSMRG------QKLELVKTAMEFVIRNLRQQD------RLA 119 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V+FS++ + L+ G ++ L G T PGLK + + + R Sbjct: 120 IVSFSDEPKVHLGLKRMTYDGREAALSAVEKLRTLG-GTEIRPGLKAGF-DLLSRRRNRN 177 Query: 279 HCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRV-IRSHE 334 ++ I+ ++DG +N T + + L + + V+ G Sbjct: 178 PVSS--------IMLLSDGMDNAITFKRCKVLPVDSYLEDCSERVPVHTFGFGSDHDPEA 229 Query: 335 FLRAC-ASPNSFYLVENPHSMYDAFSH 360 L A+ SF V+ ++ AF+ Sbjct: 230 MLSIAEATGGSFCYVQEESTVQHAFAQ 256 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLRTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|225310537|emb|CAQ19229.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 215 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 70/191 (36%), Gaps = 18/191 (9%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N R PITS+V ++ + L++ ++D S S + + + + Sbjct: 40 RNGRRKSKPITSNVIPKNKDENCNLELAFLVDSSESAKDNHGQE----KSFVTDLVNHIP 95 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 ++L ++ L+ +S+ + E+ F GV Q ++ + G T +T + Sbjct: 96 NIRLQTGQGLNFRTALLQYSSHVITEQSFKDWRGVPSFQSRVASIPFIGHGTYTTYAI-- 153 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 R + K+ + M G + D S +AK +G + + Sbjct: 154 -------TNLTRIYLEESGPGTVKVAILMYGGASHPKNPDIFSAL--ADAKNQGIKFFIV 204 Query: 326 GIRVIRSHEFL 336 G+ + E L Sbjct: 205 GLTSAANMEKL 215 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 65/173 (37%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ ++DVS SM ++D+ S+ ++ ++ V V +G Sbjct: 240 EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ ++ I L G ST G+ AY +I + N Sbjct: 295 ----NAGVKLEATPGSDKQKIREAIDELEASG-STAGGEGIMLAY-KIAQKNFISGGNNR 348 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ N+ D++ + +K G + +G + + Sbjct: 349 --------IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKD 393 >gi|39652254|emb|CAC79611.1| inter-alpha-trypsin inhibitor heavy chain H3 [Homo sapiens] Length = 886 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 73/176 (41%), Gaps = 15/176 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 + V+D+S SM K++ +++ +LE+++ +N ++ SG + T+ + + Sbjct: 281 VAFVIDISGSMAG------RKLEQTKEALLRILEDMQEEDYLNFILFSGDVSTWKEHLVQ 334 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G+ TN GL + + + +H E + I++ Sbjct: 335 ATPE--NLQEARTFVKSMEDKGM-TNINDGLLRGISMLNKARE--EHRIPERSTS--IVI 387 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG+ N+ ++ A +Y +G ++ FL A N + Sbjct: 388 MLTDGDANVGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLENMALENHGFA 443 >gi|148692829|gb|EDL24776.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_c [Mus musculus] Length = 658 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 73/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 297 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 344 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++ S +TN GL + QG ++ + Sbjct: 345 KGSLVPASNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 400 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 401 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 459 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 460 QRIYEDHDATQQLQGF 475 >gi|120537195|ref|YP_957252.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120327030|gb|ABM21337.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 584 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/319 (11%), Positives = 92/319 (28%), Gaps = 50/319 (15%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + G + + L+ D+ N + ++ R + L N Sbjct: 295 SNGASTHDRILQETDLPEDTANQLRDALSSQARKDQSGQHFQIDSAVGELGGNDRGDNGQ 354 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRH-----------------IVMPITSSVKVNSQTD 169 ++ I + + T ++ + + Sbjct: 355 LQAGILTSSAIRSRLIGLLQAQTRQSQRLHIRGRRVDGRRLSRLASGDARVFIQRDETSR 414 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++LD S SM+ ++A ++ ++ + IP + + F Sbjct: 415 PETSVHVLLDCSGSMQHQQ-------EIANQATVSLALAISTIPKCD----IAVSMFPGN 463 Query: 230 IEEFFLLEWGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + ++ + G T + YA ++ RQ Sbjct: 464 GGSVSPMIRRGQPVRPNLGRFCVDSGGGTPLAEAMLYAARELTASHKARQ---------- 513 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +++ +TDG+ + + + Y N K YAIGI F ++ ++ Sbjct: 514 -VMIVITDGDPNNPQA----VNYLNGLIKGHIDTYAIGIGSPAVKRFF------ENWCVI 562 Query: 349 ENPHSMYDAFSHIGKDIVT 367 + + A I +++ Sbjct: 563 SDVSQLQSALFSIASNVLE 581 >gi|320103074|ref|YP_004178665.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319750356|gb|ADV62116.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 602 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 63/163 (38%), Gaps = 17/163 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D ++V+DVSRSM + D+ ++++ A NA+++ P + L+ FS++ Sbjct: 39 DWVLVIDVSRSM-AARDAPPSRLETAQTVANALIDLAMRSPHD----RVALIVFSDRAVI 93 Query: 233 FFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L ++R + L ++ + A N + + + A Sbjct: 94 RTPLTRRADLVRRALARLRPGELRPGGSHLAEAVTVAQNLLIRSKRLDDPAFPRSAA--- 150 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 I ++DGE S + SL + A V A+G Sbjct: 151 -IWLLSDGE--SPRPPGISLLSASPAP---IHVVAVGRAEPPG 187 >gi|308472999|ref|XP_003098726.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] gi|308268326|gb|EFP12279.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] Length = 380 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 68/178 (38%), Gaps = 25/178 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 LD+++V+D S+ M S +T +++ + + + P + GLVT+++ Sbjct: 36 LDVVLVVDNSQEMGSQRLHDVTSNILSVFGADTRIGSNSVEP---RTTRVGLVTYNSAAT 92 Query: 229 ---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTED 284 + +F + + I +L + +S A Q+ ++QG+R H Sbjct: 93 LNADLNQFQSFSDLRNGV---ISFLKVAANTKDSYLATGLAMAAQVLNVQGLRDH----- 144 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 Y+K+I+ S D + K G + + L + +SP Sbjct: 145 --YQKVIIVY--ASKYSGYGDLDPQPIADRLKGSGVKIITVAYG---DETVLESLSSP 195 >gi|167518794|ref|XP_001743737.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777699|gb|EDQ91315.1| predicted protein [Monosiga brevicollis MX1] Length = 874 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 72/209 (34%), Gaps = 28/209 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++D++ VLD S S+ ++A + + + + ++ + ++ + F + Sbjct: 269 QVDVLFVLDASGSVG------QANFNLAQQFVISAVSQLDVGLAA---IRVAGMMFHAEA 319 Query: 231 EEFFLLEWGVSHLQ-----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + S Q Y T + L + + Sbjct: 320 LPQFDFDDYTSAAQVQNAVANFNYPVNENWGTATGNALDSIRTNLLQASAGYRGGEV--- 376 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNS 344 ++ F+TDG + + +S + GA V AIGI I + S ++ Sbjct: 377 ----VVYFITDGVSQESPSVVESA--AQALRATGAQVMAIGITDQIDETQLEVIAGSADN 430 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 V + ++ +A +D + R+ D Sbjct: 431 VITVADFANLNEAV----RDDLVSRVCID 455 >gi|148692830|gb|EDL24777.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_d [Mus musculus] Length = 651 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 73/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 290 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 337 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++ S +TN GL + QG ++ + Sbjct: 338 KGSLVPASNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 393 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 394 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 452 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 453 QRIYEDHDATQQLQGF 468 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 65/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D +++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLQKAKRLVAQLVDKMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ + T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + ++ +TDGEN + ++ +A ++G V +G+ + Sbjct: 180 SRSFTPQEGVGRAVIVITDGENH----EGDAVEAAKDAAEKGIQVNVLGVGMPEG 230 >gi|146455221|dbj|BAF62177.1| complement factor B [Triakis scyllium] Length = 765 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 66/182 (36%), Gaps = 25/182 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ +L + I+LDVS S++ AI ++ + ++ + V + G Sbjct: 254 KLALGQGQKLHIYIILDVSGSIKKVEFCD------AISALTSFIDMMSRFEVVIHY---G 304 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK----------FGVSTNSTPGLKYAYNQIFD 272 +VTF ++ + + S + + K TN T LK Y + Sbjct: 305 VVTFGSRSQIIVNIGHAESGFPDYVADVPKDLKYEDVVMGNNKGTNMTGALKTVYEMMSF 364 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENL---STK---EDQQSLYYCNEAKKRGAIVYAIG 326 + ++ + +I+ TDG S K ++ + +++ VY G Sbjct: 365 QKASMKNRQLAWIEVRHVIMIFTDGRTNMGGSPKPMMDNIEGFLEVKNSREDFLDVYVFG 424 Query: 327 IR 328 + Sbjct: 425 LG 426 >gi|87307101|ref|ZP_01089247.1| hypothetical protein DSM3645_01570 [Blastopirellula marina DSM 3645] gi|87290474|gb|EAQ82362.1| hypothetical protein DSM3645_01570 [Blastopirellula marina DSM 3645] Length = 1670 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/310 (14%), Positives = 96/310 (30%), Gaps = 43/310 (13%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ--NEGYSISAISRYK 136 D I+ T + ++ +N + +D V G+ A ++ Sbjct: 1019 DDDARRGIEWTGGKTRSIDVLVEADLNSKTLEKDAAEIDARVDGAVFFRGHLSKADTKVV 1078 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKI 195 I + + + I +S +++ Q + IVLD S SM + Sbjct: 1079 IHRRPSNHLVSNPAPDAANLAIVASPELHQQYGSGDAAISIVLDASGSMGAAAGQPFGPQ 1138 Query: 196 DMAIKSINAMLEEVKLIPDVNNV----------VQSGLVTFSNKIEE-FFLLEWGVSHLQ 244 +++ A+ ++ IPD V Q +V I+ + W Sbjct: 1139 TKYAEAVRALDRLLETIPDGTQVSVWTFGQAMGSQKTVVEAERTIQRLVAPIIWNSKDKS 1198 Query: 245 RKIKYLSKFGVST----NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-EN 299 + + + N +P L+ + +G N K ++ +TDG +N Sbjct: 1199 QYQQLVKAITYPQVEPWNESPILRA----MISAKGDLTGVNGP-----KTMLVITDGADN 1249 Query: 300 ---LSTKEDQQSLYYCNEAKK----RGAIVYAIGIR--------VIRSHEFLRACASPNS 344 + + + G + +G + + E + P Sbjct: 1250 RFVNDSTVNPRGRSVAEALFDIFDGSGISIQVVGFKVVSAEAALAQKQFELVEQLYPPGG 1309 Query: 345 FYLVENPHSM 354 FYL++ ++ Sbjct: 1310 FYLIDRVEAL 1319 >gi|307329476|ref|ZP_07608637.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306884885|gb|EFN15910.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 733 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 74/209 (35%), Gaps = 28/209 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +V+ + R + +VLD S SM +F D ++ + +++A + +P Sbjct: 521 AVQAHGLAGQRAVVYLVLDRSGSMRGYFKDGTVQHLAEQALALSAQFDATGTVP------ 574 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V FS I+ L L + + + S + N + + +H Sbjct: 575 ---VVFFSTDIDGTADL-----ELDQHTGRIEELHG---SLGHMGRT-NYHTAVNAVIEH 622 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVI--RSHEF 335 ++F TDG + +++L C A G ++ + + +F Sbjct: 623 YKASGTTAPAFVIFQTDGAPTAKTAAEKAL--CEAA---GLPLFWQFLAFGDPEAKGFDF 677 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKD 364 LR + + + E F H G D Sbjct: 678 LRKLDTAPALAVPEKRAVDNTGFFHAGLD 706 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 84/252 (33%), Gaps = 26/252 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV----- 119 N+ K K D R+ FV ++++ Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 120 ---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S ++ ++ P + + + + S + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSM------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +D+M+V D+S SM E + +KI + + + + ++ + N + G+ Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLKGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 226 FSNKIEEFFLLE 237 F+ E + Sbjct: 247 FAMGAEHPKENK 258 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 89/241 (36%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW T++ + + + +V + +++ ++DVS SM + K+ + S Sbjct: 220 LAPTPWNTDTLLLRIGVAGR-EVPTAALPAANLVFLVDVSGSM-----GAPDKLPLLQSS 273 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNS 259 + ++ +++ + LVT++ G + I L + G T Sbjct: 274 LKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSL-QSGGGTAG 326 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY + +R N I+ TDG+ N+ + Q E ++ Sbjct: 327 ASGIELAY-KAAQQAYLRGGINR--------ILLATDGDFNVGVTDFDQLKGMVAEKRRS 377 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP--------HSMYDAFSHIGKDIVT 367 G + +G ++ L A A ++ +++ H + + I +D+ Sbjct: 378 GVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKI 437 Query: 368 K 368 + Sbjct: 438 Q 438 >gi|118387578|ref|XP_001026893.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89308660|gb|EAS06648.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1074 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/124 (16%), Positives = 50/124 (40%), Gaps = 16/124 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + + +D++ V+D S SM KI+M +++ +++++ Sbjct: 346 ISLNTKGNFDAKAYQRPPIDLICVMDNSGSMHG------EKINMLKETLLYLIDQLDEKD 399 Query: 214 DVNNVVQSGLVTFSNKIEE---FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + GLV F++++ + L++ I + G T+ G+ A+ I Sbjct: 400 ------RLGLVLFNSEVTFRPMKSMDTTNKLKLKQYISDIRAQG-GTDINLGMTEAFKFI 452 Query: 271 FDMQ 274 + Sbjct: 453 KTRK 456 >gi|73997822|ref|XP_534930.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-4 subunit isoform 1 [Canis familiaris] Length = 1121 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+D S SM+ ++ +A +I+ +L+ + N+ V + + Sbjct: 269 ATSPKDIVIVVDTSGSMKGL------RMTIAKHTISTILDTL----GENDFVNIIAYSDY 318 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + +E F H ++ + L GV P L A+ + Q RQ Sbjct: 319 IHYVEPCFKGTLVQADRDNREHFKQLVDELMVKGVGI-VNPALTEAFQILKQFQEARQGS 377 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG K + + + R V+ I Sbjct: 378 LCNQA-----IMLITDGAVEDYKPVFEKYNWPD----RKIRVFTYLIGR 417 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ ++DVS SM ++D+ S+ ++ ++ V V +G Sbjct: 240 EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + ++ I L G ST G+ AY +I + N Sbjct: 295 ----NAGEKLASTPGSDKQKIREAIDELEASG-STAGGEGIMLAY-KIAQKNFISGGNNR 348 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ N+ D++ + +K G + +G + + Sbjct: 349 --------IILCTDGDFNVGVSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKD 393 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/241 (16%), Positives = 89/241 (36%), Gaps = 36/241 (14%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 PW T++ + + + +V + +++ ++DVS SM + K+ + S Sbjct: 199 LAPTPWNTDTLLLRIGVAGR-EVPTAALPAANLVFLVDVSGSM-----GAPDKLPLLQSS 252 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNS 259 + ++ +++ + LVT++ G + I L + G T Sbjct: 253 LKLLVRQLRKQD------RITLVTYAGSTAVVLPPTSGAQQTRIVEAIDSL-QSGGGTAG 305 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 G++ AY + +R N I+ TDG+ N+ + Q E ++ Sbjct: 306 ASGIELAY-KAAQQAYLRGGINR--------ILLATDGDFNVGVTDFDQLKGMVAEKRRS 356 Query: 319 GAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENP--------HSMYDAFSHIGKDIVT 367 G + +G ++ L A A ++ +++ H + + I +D+ Sbjct: 357 GVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSALEARKVLTHELGSTLATIARDVKI 416 Query: 368 K 368 + Sbjct: 417 Q 417 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 73/227 (32%), Gaps = 50/227 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL---------- 223 ++ VLD S SM T+I+ +++N++L++++ D+ +VV+ Sbjct: 307 VIFVLDTSGSMAG------TRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKP 360 Query: 224 ------VTFSNKIEEFFL-----LEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYN 268 F++ EE + S ++ I S A + Sbjct: 361 YKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMD 420 Query: 269 --QIFDMQGMRQHCNTEDANYKK------IIVFMTDGENLSTKEDQQSLYYCNEAK---- 316 ++ + + II+F+TDGE ++ D + + Sbjct: 421 VTSSTNINDALLLALKNSQSVQSRVRLTPIIIFLTDGEPTASVTD--TTEILKNVRKGNS 478 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPN-----SFYLVENPHSMYDAF 358 ++ + ++FL +S N Y + F Sbjct: 479 DDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGF 525 >gi|297285600|ref|XP_001085586.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Macaca mulatta] Length = 935 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 87/264 (32%), Gaps = 35/264 (13%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + + Sbjct: 187 LETESTFMTNQLVDALTTSQNKTKAHIQFKP-------TLSQQQKSPEQQETVLDGHLII 239 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 240 RYDVDRAVSGGTIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 293 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 294 REALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPAS--AENVNEARSFAAGIQALG-GT 350 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAK 316 N + A Q+ D + + I+ +TDG+ + + +S+ EA Sbjct: 351 NINEAMLVAV-QLLDSSNQEERLPDGSVSL---IILLTDGDPTVGETNPRSIQKNVREAV 406 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA 340 ++ +G S+ FL A Sbjct: 407 SGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|325687363|gb|EGD29384.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK72] Length = 450 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 92/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A T+I + K GD++ KN + + N V+ Sbjct: 52 EAVSSPIEAAGTEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVHVETT 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V + + + + Y +S Sbjct: 112 TVDVTKTEVAPGKVDKAPDPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM S ID+ +IN + + + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGST-------IDIVRANINEFVRNITK---EGITARFGLATF 221 Query: 227 SNKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQ 269 S+++ F + + K L+ G T TP A NQ Sbjct: 222 SDEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 279 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 280 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 334 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I + A+ +EN Sbjct: 335 IEG--IYKNFATEGRVLDIEN 353 >gi|237728578|ref|ZP_04559059.1| TerY3 [Citrobacter sp. 30_2] gi|226910056|gb|EEH95974.1| TerY3 [Citrobacter sp. 30_2] Length = 346 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 58/206 (28%), Gaps = 16/206 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM + + ++ +++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESM------IGENLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G T+ L+ QI H D + Sbjct: 57 RTIVPL----HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + A+K + A+G+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEVKRWKDHYARK--VNLIAVGLGPSADLNILRQLTENVMLFTESQ 168 Query: 351 PHSMYDAFSHIGKDIVT--KRIWYDK 374 I + + + DK Sbjct: 169 EGDFTRFIKWITASVTAHSRSVGDDK 194 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 39/118 (33%), Gaps = 15/118 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ F + L ++ ++ ++ + G T + + ++ Sbjct: 34 RVGLLVFGQRAYALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRL------- 86 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 E ++++V +TDG N + L AK G V+ I Sbjct: 87 ----REQKQGQRVVVLLTDGVNTA--GVLNPLKAAELAKAEGVRVHTIAFGGSGGDSL 138 >gi|115767164|ref|XP_001193368.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] gi|115976266|ref|XP_001179968.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] Length = 797 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 63/201 (31%), Gaps = 31/201 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++VLD+S SME + D I+S + V IP + + G+V F + + Sbjct: 263 SVVLVLDISGSMEGN------RFDRMIQSSVVYIMSV--IPTGS---KLGIVVFDSTSQI 311 Query: 233 FFLLEWGVSHLQRK--IKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L R+ + L T G+ + Sbjct: 312 SGNLTDITETASRQRLVNALPPSPVGGTCIGCGILSGIEVLGSYAQGG------------ 359 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 I+ ++DGE + +E K G I+ I I + L S + + Sbjct: 360 YIILLSDGEETDAPFIMDTY---DEIKNSGVIIDTITISDSADQQMEDLSTNTSGIANFC 416 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 ++ + I + Sbjct: 417 SDDARTGIRLIQAFQSTITER 437 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 52/163 (31%), Gaps = 1/163 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRSLHRQRGAVAIVVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + + + S L D+ + V P N Y Sbjct: 72 ISLSVAEADGIAAGHVNFA-FFQKSAVQMLTDSNVTFSDALTNPFLTKTAVSTPANVKYV 130 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + I F + + S+ + + + Sbjct: 131 KCTATLSGIANWFVGVLNAMPGVQVANATQVSASAIATVAAGQ 173 >gi|313676404|ref|YP_004054400.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312943102|gb|ADR22292.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 618 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 89/238 (37%), Gaps = 30/238 (12%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPIT 159 F+N + ++ I + Y + P + PW + + + + Sbjct: 184 FINSGQN-PPKDAVRIEEMINYFNYDYK-QPSGQDPFSINTEVSQAPWNKKHQLVHIGLQ 241 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 V + ++ +++ +LDVS SM + K+ + + ++++++ V+ VV Sbjct: 242 GKV-IPTENLPASNLVFLLDVSGSMFA-----QNKLPLLKSGLKMLVDQLREEDKVSIVV 295 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMR 277 +G L + + I+ L + G ST G++ AY +I ++ Sbjct: 296 YAGAAG-------CVLPPTSGNEKDKIIEALQNLQAGGSTAGGAGIELAY-KIAKENFIK 347 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N I+ TDG+ N+ ++ + +K G + +G + + Sbjct: 348 EGNNR--------IILATDGDFNVGASSNEAMEDLIEKKRKEGVFLTVLGFGMGNYKD 397 >gi|299138555|ref|ZP_07031734.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599801|gb|EFI55960.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 381 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 82/224 (36%), Gaps = 38/224 (16%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + ++ D + + I++D S SM K ++ +++ + + Sbjct: 127 GVAQEIRGFKHEDLPVSLGILIDSSGSMYE-------KSAAVNEASLDLVKLSNPLDEAF 179 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 LV FS++ ++ LQ+ + Y+ G + L A D Sbjct: 180 ------LVDFSSEAYIDQDFTNSIAKLQQGLAYIHTSGGT-----ALYDAVVASAD---- 224 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGI-------- 327 + + + K++++ +TDGE+ ++ + ++ + G +Y IG+ Sbjct: 225 --YLSKNAKHPKQVLLIVTDGEDNASSASLESAIRRVQDL--DGPAIYCIGLLFGDDVSR 280 Query: 328 -RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + E L+ A + Y ++ + + +DI T+ Sbjct: 281 SEAKHAREVLQELAQQTGGQAYFPKSLKDVDGLTREVAQDIRTQ 324 >gi|221042208|dbj|BAH12781.1| unnamed protein product [Homo sapiens] Length = 900 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 63/171 (36%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEGRLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 38/86 (44%), Gaps = 4/86 (4%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + N +G ++++ A+ + + + + I+V H+F + L +M+D + + AA Q+ Sbjct: 8 VPRTRRNTRGAVSVMAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQL-- 65 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMS 93 + +L + N + Sbjct: 66 --KRADSPANLNAAVLGTVSNIGAKN 89 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 64/172 (37%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++ A + ++ + + + Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPDRLSKAKQMLSKLTDGFSND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N Sbjct: 131 VGLIVFAGDAFTQLPITSDYVSAKMFLSSINPSMVSTQ-GTAIGAAINLAM-------RS 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T K I+ +TDGEN + ++ A ++G V +G+ + Sbjct: 183 FTPSETSDKAIILITDGENH----EDDAVKAAAAAAEKGIHVNIVGMGDPKG 230 >gi|149915102|ref|ZP_01903630.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] gi|149810823|gb|EDM70662.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] Length = 444 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 66/212 (31%), Gaps = 27/212 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + S+ ++ L++ +VLD S SM + A ++ + ++ ++ Sbjct: 25 IRIVAPSAPVTETEPRPPLNLALVLDRSSSMRG------QPLHEAKRAADQIVAGLRPSD 78 Query: 214 DVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + +V F N E F + + + G+ T G Q Sbjct: 79 ------RLAIVAFDNATEVMFSGGPRGDGQAARAALSRIHARGM-TALHDGWLLGVEQSI 131 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 M+ + ++DG N+ + C + G G+ + Sbjct: 132 AMREAGTPAR---------VFLLSDGVANVGLTDASAIAADCTRMAEHGITTSTCGLGMG 182 Query: 331 RSHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + + + A + Y E + D F Sbjct: 183 FNEDLMAEMARAGRGNAYYGETAEDLQDPFEQ 214 >gi|261819805|ref|YP_003257911.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261603818|gb|ACX86304.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 659 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/203 (15%), Positives = 72/203 (35%), Gaps = 30/203 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + N ++ V+D + SM + I+ ++I+ + + ++ + + Sbjct: 217 TGQNEKNMMKGFSASVVFVIDSTISMGPY-------IERTKEAIDKIYQRIEK-EQLQDK 268 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-AYN 268 V+ GLV + + ++ LE+ + + + +T S+ + AY Sbjct: 269 VKFGLVAYRSSVKVVPGLEYDSKMYVDPNTVKSGKDFLAKVHDLKQATVSSSKMDEDAYG 328 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTD-----GENLSTKEDQQSLYYCNEAKKRGAIVY 323 + + +V +TD G + + + EA RG +Y Sbjct: 329 GVMMALDKVDWTQFGA----RYVVLITDAGALEGSDSLSSTSLDAAQVRQEAAYRGVALY 384 Query: 324 AIGIRVIRSHEFLRACASPNSFY 346 A+ ++ + AS + Y Sbjct: 385 AMHLKAPNGK---KNHASAEAQY 404 >gi|156602970|ref|XP_001618750.1| hypothetical protein NEMVEDRAFT_v1g153509 [Nematostella vectensis] gi|156200182|gb|EDO26650.1| predicted protein [Nematostella vectensis] Length = 133 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 42/133 (31%), Gaps = 18/133 (13%) Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 F + + + ST + L+Y ++I+ I++ Sbjct: 6 TFGKYNNKKDVLEAVDKMPYPKGSTYTGRALQYMNDEIY--------RKATRVGVPNILI 57 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLVE 349 +TDG K + G +Y+IG+ L A A+ + + V+ Sbjct: 58 VLTDG-----KAHDSVAEPAKALRDIGIEIYSIGVGESYDKAELDAIATDPDASHVFSVD 112 Query: 350 --NPHSMYDAFSH 360 N +S+ Sbjct: 113 FKNMNSIVSTLDA 125 >gi|83594486|ref|YP_428238.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] gi|83577400|gb|ABC23951.1| von Willebrand factor, type A [Rhodospirillum rubrum ATCC 11170] Length = 575 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 67/187 (35%), Gaps = 20/187 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M+ V+D S SM + ++I A +SI + + + ++ G+V+FS+ + Sbjct: 394 MIFVVDGSGSMSEGIAGAPSRISAAKRSIRDTVNAL------HKDIRVGMVSFSDCMATQ 447 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + ++ +R+ + +++ Sbjct: 448 NSKYYSAAERPAFLAGVDAITPERATSLA-----------ASIRRGGALATRRSETVMMV 496 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENP 351 ++DGE+ D + +A+K I++ I + + R AS + + Sbjct: 497 VSDGEDT-CGGDPCAAARAVKAEKSNVIIHVIDLSGGGNSGVARCIASAGGGRVFTPGSA 555 Query: 352 HSMYDAF 358 + + Sbjct: 556 AQVTSSL 562 >gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 416 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/233 (15%), Positives = 79/233 (33%), Gaps = 28/233 (12%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + L +P ++++ +T+ +V + ++ V+D S SM Sbjct: 4 IQIDLALDKTYLLPGNKQVAYLMVKLTAPKQVE-KERPVQNLSFVIDRSGSMAG------ 56 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYL 250 K+D K++ + + +V F + + L+ ++ + Sbjct: 57 EKLDYTKKAVAFAVGHLSPQD------YCSVVAFDDMVTMVASSHQVANKDALKMAVESI 110 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSL 309 G STN + G+ ++ Q ++ +TDG N+ + + Sbjct: 111 YP-GGSTNLSGGMLLGVREVKLAHKENQINR---------VLLLTDGMANVGVTDHSALV 160 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 E G + G+ + L+A A +FY +E P + F Sbjct: 161 EKSREMAAGGVNLSTFGLGEDFEEDLLQAMVEAGGGNFYYIEKPDQIPGIFEQ 213 >gi|328885837|emb|CCA59076.1| hypothetical protein SVEN_5790 [Streptomyces venezuelae ATCC 10712] Length = 865 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 73/241 (30%), Gaps = 46/241 (19%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA----RLDMMIVLDVSRSMESFFDSSIT 193 P T + +P T + + +D +V+D S S+ + T Sbjct: 26 PFAATTASGAPGGAGAPALPRTPAAPADPGDPGEGPDPVDFAVVVDQSASLSDKDLARET 85 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-------KIEEFFLLE----WGVSH 242 + + +++ ++ ++ F + + E L G Sbjct: 86 EAAGLL-----------SQGEISERSRAAVIGFGSSEKPGQSPVREVCPLTVADAAGRER 134 Query: 243 LQRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 L ++ L ++ G T+ ++ A +++ G K++ +TDG Sbjct: 135 LSDCVQELGRRDAARMGPGTDFPAAIRQAVSRL-TADGTAGGTGAAGKPAPKVVFLLTDG 193 Query: 298 --------------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 E+ + +++ A+ G ++ +G L A A Sbjct: 194 KLDVKDSPEYGTDPESRQSNGEKRLTEELARARAAGVQIWPLGFGSEIDRAALTAMAEGG 253 Query: 344 S 344 Sbjct: 254 Y 254 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 62/485 (12%), Positives = 122/485 (25%), Gaps = 140/485 (28%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 I F + KGG+ I+ + + + L G+ + + + L +A Sbjct: 6 KIAGFTRDDKGGVAIIMGLAVIPLVLASGLAADYA--------IVQAAKSRLDASADAAA 57 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + ++ + F + Q +G Sbjct: 58 LAAIKTAQTTIAELSATNPNPRPQAIAAAMSQAEKSFYAQAGKRAADLLGKPAIDVQIKG 117 Query: 127 YSISAISRYK--IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 ++A Y +P F + + LD ++LDVS SM Sbjct: 118 QEVTANVAYSAAMPSNFGRIAGVKLMNYN------GGAGAQLTMAKFLDFYLLLDVSGSM 171 Query: 185 E----------------------------------------SFFDSSITKIDMAIKSINA 204 S ++ +ID ++ Sbjct: 172 GLPSTPAGEAALAAKNPDDLAQYPTGCRFACHFAGSQGYNVSRANNIQLRIDAVGAAVAQ 231 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNST 260 ++E+ K + + G+ F F L + ++ I Y +T+ Sbjct: 232 LMEKAKDTATLPKQYRVGVYPFVTHANAFVDLTDNLRGDQYSVESAINY-DPATRTTDFG 290 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDAN---------------------------------- 286 L + +F + + Sbjct: 291 RLLDAGKDWVFARDLNPNYKANPNIPADVTPMGAGGSHIHNIFQDINAKIPSVGDGSGAS 350 Query: 287 -YKKIIVFMTDGENLSTKEDQQSLYY-----------------------CNEAKKRGAIV 322 + + F++DG S + + CN K RG V Sbjct: 351 SPQPFVFFVSDGMQNSQSFVSATGTWPGVTPYPTPPGQTVSIRAMDPTLCNVLKARGITV 410 Query: 323 YAIGI---------------------RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + I V +RACASPN +++ + P + DA + Sbjct: 411 SVLEIPYPTFTNPKPFAAAQEFKANDAVPNLSGAMRACASPNFYFMADTPEGIADAMKKM 470 Query: 362 GKDIV 366 + V Sbjct: 471 FEQAV 475 >gi|297473702|ref|XP_002686779.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Bos taurus] gi|296488587|gb|DAA30700.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Bos taurus] Length = 1085 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 68/170 (40%), Gaps = 34/170 (20%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S++ MLE + VN + +F Sbjct: 247 GAASPKDMLILVDVSGSVSGLT------LKLIRTSVSEMLETLSDDDFVN------VASF 294 Query: 227 SNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L+ + ++ G+ T+ G +A+ Q+ + R + Sbjct: 295 NSNAQDVSCFQHLVQANVRNKKVLKDAVNNITAKGI-TDYKKGFSFAFEQLLNYNVSRAN 353 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR 328 CN KII+ TDG ++++ + K + V+ + Sbjct: 354 CN-------KIIMLFTDG------GEERAQEIFTKYNKDKKVRVFTFSVG 390 >gi|195115455|ref|XP_002002272.1| GI17296 [Drosophila mojavensis] gi|193912847|gb|EDW11714.1| GI17296 [Drosophila mojavensis] Length = 1099 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+A + +L+ + VN VV++ Sbjct: 150 ASSPKDIMILLDASSSMSE------KSFDLATATAFNILDTLGEDDYVNLITFSEVVKTP 203 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 204 VPCFKDRMVRATP--DNIQEIKSAVKAVK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 260 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S KE + + + ++ I L A Sbjct: 261 NQA-----IMLITESTSESHKEIIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHEMAC 311 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 312 SNKGFFVQ 319 >gi|91201036|emb|CAJ74094.1| hypothetical protein kuste3333 [Candidatus Kuenenia stuttgartiensis] Length = 395 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 84/248 (33%), Gaps = 34/248 (13%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + Q + + K P +F+P +++ + + ++ +I+ Sbjct: 18 LTGCAQLKSITFEK----KHPEPLESFVPQDLSAKSLSVKYAPKIE---------SFVII 64 Query: 178 LDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 LD S SME+ + + K D+A I+ M + + + + +V G F+ + F Sbjct: 65 LDASASMETAYTGIVNKGHPKFDVAKDIISRMNKTLPEVDVNSALVTFGHGFFTPLKKTF 124 Query: 234 --FLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L+ + S+ + + A NQ+ G Sbjct: 125 IVYELTHHSRDLLENALNMAIYPKGSSPAGNAIADASNQLLTSVGQNA------------ 172 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVE 349 ++F++DGENL ++ E Y I + E LR A + Sbjct: 173 VIFVSDGENLIGTPLEKIKDL-KELYGEKTCFYTIHVGNTPEGKEALRKLARSATCGFSV 231 Query: 350 NPHSMYDA 357 + + Sbjct: 232 TADEIASS 239 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 69/202 (34%), Gaps = 27/202 (13%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + + ++ LD+++VLDVS SM K+ +++ ++ Sbjct: 34 ALTADEVTAVVELNATSSTAVREGLDLVVVLDVSGSMRG------EKLQSMKRAMQFVIM 87 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 ++ + + +V+FS+ L Q ++K + G+ N +K Sbjct: 88 KLTPVD------RLSVVSFSSSATRHCPLRSVTQAAQAELKGIV-DGLVANGGTNIKAGL 140 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + R N M+DG+ D + + N A VY G Sbjct: 141 DTALAVIAGRATTKARTPNVF----LMSDGQQTD--GDARQVDPGNVA------VYTFGF 188 Query: 328 RVIRSHEFLRACA--SPNSFYL 347 H L A SP + Sbjct: 189 GKDADHALLSDVAKKSPGGTFN 210 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 55/136 (40%), Gaps = 22/136 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD+++V+D S SM +I ++ ++ + + + + G V +S + Sbjct: 310 TPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGYSIE 363 Query: 230 IEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + ++ G ++ ++K ++ T + L+ A N + G Sbjct: 364 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNG--------- 414 Query: 285 ANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 415 --HKKVIVLLTDGVPT 428 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/316 (12%), Positives = 109/316 (34%), Gaps = 34/316 (10%) Query: 56 RSLVHAATQ-IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-----DIDD 109 + ++ I+ K + + +F ++ + N + Sbjct: 126 AAAPSTHSETILAMPQATESYKPINENGFLSVGQQPVTTFSVDVDRAAYSNVRRFLNNGQ 185 Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNS 166 + ++ I + Y R + P+ T PW + + + + + V++ Sbjct: 186 MPPEDAVRIEEMINYFDYDYP-QPRGEHPVAIVAETTDSPWNPGLKLVHIGLQAK-TVSA 243 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + +++ ++DVS SM K+ + ++ + +++++ ++ V +G Sbjct: 244 ENLSASNLVFLIDVSGSMNE-----ANKLPLLKQAFKLLADQLRVEDKISIVAYAGSAGM 298 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + L G ST G++ AY+ ++H + N Sbjct: 299 VLAPTSGSE----KKTIKDALDKLEA-GGSTAGGEGIELAYDL------AKKHFLPKGNN 347 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACAS--P 342 ++ TDG+ N+ + + E +K G + +G + + + A Sbjct: 348 R---VILATDGDFNVGISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADKGN 404 Query: 343 NSFYLVENPHSMYDAF 358 ++ ++N F Sbjct: 405 GNYAYIDNIQEARKVF 420 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 42/235 (17%), Positives = 81/235 (34%), Gaps = 47/235 (20%) Query: 149 TNSRHIVMPITSSVKV------NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 S I +++K +S + +D+++ LDVS SM + K+D+ +++ Sbjct: 108 LQSGEFTGQICATIKARDLPQRDSFARSPIDIVVALDVSGSMR------VEKLDLCKETL 161 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTN 258 + +L E+ + L++FS ++ I LS G TN Sbjct: 162 HLLLRELHHDD------RFALISFSEDAVIEVPMQKVNERNKQQALHAIDRLSVKGR-TN 214 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN----- 313 + A + + N + + +TDG + +++ Sbjct: 215 IASAVSLAAQVVNGVAE---------PNKVRSVFLLTDG--NANTGYTEAIDLVKLTSIF 263 Query: 314 -EAKKR----GAIVYAIGIRVIRSHEFLRACA---SPNSFYLVENPHSMYDAFSH 360 EA + ++ G + LR A S SFY V + + AF Sbjct: 264 VEANRNPHTPPISLHTFGYGPEPDQKLLRGMAMATSGGSFYSVRDNSQVSSAFGD 318 >gi|17533689|ref|NP_496746.1| C-type LECtin family member (clec-66) [Caenorhabditis elegans] gi|3876685|emb|CAB03059.1| C. elegans protein F35C5.9, confirmed by transcript evidence [Caenorhabditis elegans] Length = 394 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 72/202 (35%), Gaps = 29/202 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---N 228 LD++ V+D S++M + + I ++ ++ D + G +T++ Sbjct: 55 LDIVFVVDNSKNMNLYNVYNT------ISNLFNPFVQIGTGYDDPRSTRVGFITYNWNAT 108 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + +F+ L+ S L +I+ LS T GL+ A N I G+R Sbjct: 109 DVADFYKLQ-SYSDLSSQIQALSVTPLSRADETYIDTGLQAAINMINATGGLR------- 160 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASP 342 NYKK++V T N + + K G + + + AS Sbjct: 161 DNYKKVVVLFTSKYNYYHTYPED---LTDYLKSTGVTIITVNTGGDSYTTQSLRDKIASK 217 Query: 343 NSFYLVEN---PHSMYDAFSHI 361 + + + + A I Sbjct: 218 GMAFAMSDGNTTAELQKAVLAI 239 >gi|304437812|ref|ZP_07397761.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369169|gb|EFM22845.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 255 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 68/201 (33%), Gaps = 30/201 (14%) Query: 166 SQTDARLDMMIVLDVSRSMES---------------------FFDSSITKIDMAIKSINA 204 + R+ + + LD S SM + + ++I K I Sbjct: 14 TNPTPRVPVCLCLDTSGSMAAVEAGSYVGTGETIRQDGKLWQIVEGGKSRIQELQKGIEM 73 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 E ++ + + +VTF E L + ++ + L G++ G+ Sbjct: 74 FFEAIRTDILAADSAEISIVTFD---NEAKCLLDFANIERQTVPQLHANGLTAM-GEGVN 129 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ--SLYYCNEAKKRGAIV 322 A + + Q +++ + Y+ +V M+DGE + + + V Sbjct: 130 LALDLL--AQRKKEYQDKGVDYYQPWLVLMSDGEPNGDPTELRRATQRVTELVNAGKLTV 187 Query: 323 YAIGIRVIRSHEFLRACASPN 343 + IGI + L A SP Sbjct: 188 FPIGIGSEPGMDAL-AALSPK 207 >gi|114587348|ref|XP_001172665.1| PREDICTED: similar to PK-120 precursor isoform 1 [Pan troglodytes] Length = 849 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 263 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEASQWRPSLVP 316 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 317 AS--AENVNKARSFAVGIQALG-GTNINDAMLMAV-QLLDSSNQEEQLPEGSVSL---II 369 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 370 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 418 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 62/208 (29%), Gaps = 47/208 (22%) Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM + D +++ ++ +K PD + GLV FS + L Sbjct: 104 GSMLAE-DFKPNRLEAGKNIA---IDFIKNRPDD----RIGLVIFSGESFTQCPLTIDHD 155 Query: 242 HLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 L + T GL A N++ K+++ +TDG N Sbjct: 156 VLINLYHDIKNGMIEDGTAIGMGLATAVNRL-----------RGSEAKSKVVILLTDGVN 204 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-----------------------IRSHEFL 336 + + AK+ G VY +GI I Sbjct: 205 NAGSIPPITA--AEIAKQFGIRVYTVGIGTQGYAPYPVPSPYGGVVYQRMEVQIDEPTLT 262 Query: 337 RACA-SPNSFYLVENPHSMYDAFSHIGK 363 + A + ++ N ++ + I + Sbjct: 263 KIAAITGGKYFRATNNDALTRIYKQIDQ 290 >gi|256084540|ref|XP_002578486.1| hypothetical protein [Schistosoma mansoni] gi|238663861|emb|CAZ34724.1| loss of heterozygosity 11 chromosomal region 2 gene a protein homolog (mast cell surface antigen 1) (masa-1), putative [Schistosoma mansoni] Length = 652 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 89/269 (33%), Gaps = 40/269 (14%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 + +++ + +N+ V + + L +V+ + S+S Y P + Sbjct: 211 DVKSTHDSFTWNCKNDSNH-SLVKLSSEFIPDHDLQMVLSLSDPLTSLS-SLEYGDPNQS 268 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + V S D R + + ++D S SME I A S Sbjct: 269 SILAMNCLMVQLL----PDIPNVVSSKDMRYEFVFLIDRSGSMEG------DNISYAKTS 318 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKY---LSKFGVS 256 + L+ + + + ++ F + F ++ L + Y L+ Sbjct: 319 LLLFLKSLPMS------CRFQIIGFGSDFAALFPEPTDYSEGSLNTAMNYQKDLNADMGG 372 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEA 315 T + LK A H + K I+F+TDG+ + + ++A Sbjct: 373 TEAYNALKAAL-----------HSTPSGEGWFKQIIFLTDGDVGNADEVIGLVRMNVDKA 421 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + V+ IG+ S + A + Sbjct: 422 R-----VFTIGLGQGVSTALIGGVARAGN 445 >gi|148656025|ref|YP_001276230.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568135|gb|ABQ90280.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 932 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 78/231 (33%), Gaps = 27/231 (11%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA---IKSIN 203 W +P+T + + T + ++IV+D+S SM D I K+ +A + I Sbjct: 386 WRGTPVEAALPVTMDIPIYR-TMPPVSVVIVIDISGSMAMTEDG-IPKLSLALDGARRIA 443 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++L + + + + G+V L ++ + G N L Sbjct: 444 SLLRDEDELTILPFDDRPGVVV-------GPLPGSQRDKAIEQMSQVRLGGSGINIHDAL 496 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + +H I+ +TDG T + + +L + + Sbjct: 497 VAAARYVRASDRPIRH-----------IITITDG--NDTVQQEGALDIVRALRDERVTLT 543 Query: 324 AIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 +I + F+R A+ +L E + D + I+ I Sbjct: 544 SIAVGQGSHVPFIRDMAAVGGGRTFLTERAADLPDLLLDEAEMIIQPSIIE 594 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 67/175 (38%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++ ++ LD+S SM + D + ++++ + K I+ ++E N+ V Sbjct: 79 GSKMETVKRQGVETVVALDISNSMLAE-DVTPSRLEKSKKLISRLVETFN-----NDKVA 132 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 + F+ + + + ++ ++ T+ + A Sbjct: 133 M--IVFAGEAFTQLPITSDYISAKMFLETITPSLISTQGTDIRGAIDLAMKSFT------ 184 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + IV +TDGEN + ++ EA K+G V+ +G+ Sbjct: 185 -----PNEGVGRAIVLITDGENH----EGGAVEAAQEAAKKGVRVFVLGVGSPDG 230 >gi|297708165|ref|XP_002830849.1| PREDICTED: collagen alpha-1(VI) chain-like [Pongo abelii] Length = 143 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/110 (20%), Positives = 42/110 (38%), Gaps = 6/110 (5%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP---DVNNVVQSGLV 224 D +D+ VLD S S+ +D ++ ++ D N V +G + Sbjct: 32 QDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLRDRYYRCDRNLVWNAGAL 91 Query: 225 TFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +S+++E L G L+ + + FG T + +K Q+ Sbjct: 92 HYSDEVEIIQGLTRMPGGRDALKSSVDAVKYFGKGTYTDCAIKKGLEQLL 141 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRLAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|114579066|ref|XP_001156305.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 1059 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYQSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+++ G STN+ LK A F + R I Sbjct: 562 EVNEDNLKQAQSWIRHIK-IGSSTNTLSALKTA----FADKETRA------------IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKLFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|166406787|gb|ABY87357.1| collagen type XXI-like protein [Haliotis diversicolor] Length = 380 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 62/190 (32%), Gaps = 24/190 (12%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D+ +V+D SR D S +ID+ + E L N V+ G+V+ Sbjct: 4 GCGGKPADVYLVMDTSR------DVSQKEIDLQKRY---FKELSSLFYLGQNTVRLGVVS 54 Query: 226 FSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + L Q+ I + G + L Y F + RQ Sbjct: 55 SGSNSLTAIQLGSYKSPHDFQKAIDITPRVGGQRDHAKTLDYLRRTAFAPEVARQSVAH- 113 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-- 341 +++FMT+G + + KK G V+ + + L AS Sbjct: 114 ------VVIFMTNG---FSHNAMATRIEARLLKKAGVYVFTVNLGGRWDKSELTGMASNP 164 Query: 342 -PNSFYLVEN 350 + + ++ Sbjct: 165 ASDFVFSSDD 174 >gi|171912902|ref|ZP_02928372.1| hypothetical protein VspiD_17020 [Verrucomicrobium spinosum DSM 4136] Length = 652 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 64/162 (39%), Gaps = 26/162 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I +D S+SM + D S ++ A + +LE + P+ + G++ F+ + Sbjct: 92 IFIAIDTSKSMLAD-DVSPNRLGRAKLAAQDLLERL---PND----RVGVIAFAGRSYLQ 143 Query: 234 FLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + I+ L ++ ++ A I ++G R+H Sbjct: 144 APLTNDHEAVIECIQSLDHTTIPRGGSSIASAIQLAVETIDKVKG-REHG---------- 192 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +V TDG+ D+ +L A ++G IV +G+ Sbjct: 193 MVLFTDGQET----DEATLTAARMAAQKGLIVIPVGVGTTEG 230 >gi|50085104|ref|YP_046614.1| putative tellurium resistance protein [Acinetobacter sp. ADP1] gi|49531080|emb|CAG68792.1| conserved hypothetical protein; putative tellurium resistance protein [Acinetobacter sp. ADP1] Length = 352 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 65/203 (32%), Gaps = 22/203 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM I + I ++++++ P V ++ F+ + Sbjct: 3 RLPVFFVLDCSESMAG------QNIQQMQQGIQLIMQKLRQDPYALETVYVSVIAFAGIV 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 L + G T+ L++ N +H K Sbjct: 57 RTLVPLV----EVFAYYSTKLPLGGGTHLGKALEHLMN------EFDRHLMKTTEETKGD 106 Query: 289 -KIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K I ++ TDG +D + A++ I A+G+ + L+ + Sbjct: 107 WKPIAYLFTDGRPTDECKDAIYRWQKKYARRCTLI--ALGMGKNVDYATLKQLTEHCIAF 164 Query: 347 LVENPHSMYDAFSHIGKDIVTKR 369 N F I +V + Sbjct: 165 DELNEKDFKKFFQWISASVVAQS 187 >gi|226357799|ref|YP_002787539.1| Von Willebrand factor type A domain protein, precursor [Deinococcus deserti VCD115] gi|226320042|gb|ACO48035.1| putative Von Willebrand factor type A domain protein, precursor [Deinococcus deserti VCD115] Length = 669 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 64/177 (36%), Gaps = 11/177 (6%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + +++S +Q+ + + ++LD S SM S T++ A + + Sbjct: 8 HFLTRLSLALFLSASGAGQAQSGPTM-IQLILDSSGSMFSRLPGGDTRMATAQAVMTDFI 66 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-KY 265 + P++N GL + I S L ++ L + + K Sbjct: 67 GRLPNDPNLN----VGLRLYGAAINAADPGACQDSKLVLPMRGLDRSALLAAVRGARPKG 122 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 A ++ +Q Q + ++I+V +TDG+ + RG V Sbjct: 123 ATPIVYSLQQAAQ--DFPTTAGRRIVVLVTDGQ---ESCQGDVKGALEAFRSRGLEV 174 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 48/120 (40%), Gaps = 15/120 (12%) Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D S SM +I ++ AI S ++++ V+ LV++ ++ Sbjct: 149 IDQSSSMG---GQNIARLKSAIASGQRFVKKMLPKGTATEGVRIALVSYDHEPHRLSDFT 205 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + L +KI+ L+ T++ GLK A N + K I+ M+DG Sbjct: 206 KDTAFLCQKIRALTPI-WGTHTQGGLKMARNIMAT-----------STAVDKHIILMSDG 253 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 27/63 (42%), Gaps = 3/63 (4%) Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIR---SHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 ++ AK G ++ IG + ++ L+ A+ + + P ++ AF +I Sbjct: 359 NAAINEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTATPANLAAAFDNIA 418 Query: 363 KDI 365 + I Sbjct: 419 QTI 421 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 49/360 (13%), Positives = 121/360 (33%), Gaps = 35/360 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV--------V 120 N+ K + + R+ FV ++++ Sbjct: 70 RKANDYKLGGSNPNDDSFNISSEVGKRDHAIVTTFVKTFLPQTNDDKMNLIPVCKTVNNT 129 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDARLDMM 175 + S ++ ++ P + + + + S + + +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPKQVDVASKSKAFKKNTFNIPIDLM 189 Query: 176 IVLDVSRSMESFFDSSIT-------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS- 227 +V D+S SM D++ T KI + + + + ++ L D N + + F+ Sbjct: 190 VVADLSGSMNFDLDNNETKKTGKPSKISILKEVLVELADKTLLSEDANQHNRIYVTPFAL 249 Query: 228 -NKIEEFF---LLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR--Q 278 +I + W S ++ K L+K N + +I + + Sbjct: 250 GAEINKNSCALPYSWDIESSSKIENIKKILNKENSQYNRADLINNLVYRISTKETLNNIN 309 Query: 279 HCNTEDANYKKIIVFMTD----GENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + K + D + T+ D + Y K GA + + G+ V ++ Sbjct: 310 GKQKYNVTFPKNTFCLKDMKTSNQGWYTRSDKSKFTSYVQSIKASGATLASSGVLVAANN 369 >gi|313200528|ref|YP_004039186.1| von willebrand factor type a [Methylovorus sp. MP688] gi|312439844|gb|ADQ83950.1| von Willebrand factor type A [Methylovorus sp. MP688] Length = 333 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 76/220 (34%), Gaps = 47/220 (21%) Query: 174 MMIVLDVSRSMESFFDS----------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +M+VLD S SM F + +K+ A + + +E + ++V GL Sbjct: 86 IMVVLDRSASMNDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQ-----QDLV--GL 138 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNT 282 +TFS L L+ + G+ T GL A + Sbjct: 139 ITFSTSPVFVLPLTHDQVALKAALDSAEAQGMGFTAVARGLGMALDYFKGKPVTGA---- 194 Query: 283 EDANYKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVY-------AIGIRVIRS-- 332 ++I+ ++D G ++ ++ + + A +Y +I ++ Sbjct: 195 ------RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNSPSIFVQPGEGDD 248 Query: 333 ---------HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 HE+ + P Y ENP ++ A +GK Sbjct: 249 AASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGK 288 >gi|257093736|ref|YP_003167377.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046260|gb|ACV35448.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 452 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 72/213 (33%), Gaps = 28/213 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + + + + +V+D S SM S + A++ + ++++ Sbjct: 29 IRVQAPDPLATEKKARKPYHLALVIDRSGSM------SGPPLAEAVRCAKHIADQLEPTD 82 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + LV F ++++ L + + G STN G + + + Sbjct: 83 IAS------LVVFDDRVQTLVPPRPVGDRQALHLALSRVHS-GGSTNLHGGWQAGADGLL 135 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRV 329 G ++ ++DG N+ D + C +A +RG G+ Sbjct: 136 PAAGQAALAR---------VILLSDGNANVGEITDPAGIAALCAQAAERGVSTSTYGLGS 186 Query: 330 IRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + + + A + Y + +++ F+ Sbjct: 187 HFNEDLMVEMAKRGGGNHYYGDTAADLFEPFAA 219 >gi|253998453|ref|YP_003050516.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] gi|253985132|gb|ACT49989.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] Length = 333 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/220 (18%), Positives = 76/220 (34%), Gaps = 47/220 (21%) Query: 174 MMIVLDVSRSMESFFDS----------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +M+VLD S SM F + +K+ A + + +E + ++V GL Sbjct: 86 IMVVLDRSASMNDDFAGKYIQSTDAVQNPSKLKAAREVLAKFIESRQ-----QDLV--GL 138 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNT 282 +TFS L L+ + G+ T GL A + Sbjct: 139 ITFSTSPVFVLPLTHDQVALKAALDSAEAQGMGFTAVARGLGMALDYFKGKPVTGA---- 194 Query: 283 EDANYKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVY-------AIGIRVIRS-- 332 ++I+ ++D G ++ ++ + + A +Y +I ++ Sbjct: 195 ------RLILLVSDGGAHIDSQTQDRIRSWFASEHAALAWIYIRGTNSPSIFVQPGEGDD 248 Query: 333 ---------HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 HE+ + P Y ENP ++ A +GK Sbjct: 249 AASAPEYFLHEYFKTLGVPYQAYEAENPRALEAALRDVGK 288 >gi|221127354|ref|XP_002168164.1| PREDICTED: similar to microneme 1, partial [Hydra magnipapillata] Length = 285 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 59/150 (39%), Gaps = 13/150 (8%) Query: 219 VQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V+ ++ +S+ +I W K+ + T + LK A +F + Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCNE 65 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 QH N K+++ +TDG++ K Q + + + ++G+ E Sbjct: 66 CSLQH------NIPKLLIVLTDGQSTFPKSTQFEAHLIKV--ENDLTIISVGVSDQVDIE 117 Query: 335 FLRACASP-NSFYLVENPHSMYDAFSHIGK 363 L++ A+ + +L+ + + D + I K Sbjct: 118 ELKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 45/237 (18%), Positives = 78/237 (32%), Gaps = 45/237 (18%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDM---MIVLDVSRSMESFFDSSITKIDMAIK 200 F+ + + +V +TS++ + MIVLD S SM D KI +A Sbjct: 3 FLIRFLATLALVCSVTSTMHAQENAVPSAKVGKVMIVLDGSNSMWGQVDGEA-KITIAKD 61 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--------FFLLEWGVSHLQRKIKYLSK 252 LI + ++ V GL+ + ++ + + L K++ +S Sbjct: 62 V------MTDLISNWDDAVDLGLMVYGHRRKGDCSDIEVVALPGKVNRPALIDKVQSISP 115 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G + S L A + + +V ++DG + C Sbjct: 116 RGKTPISKTLLLAATSVGYFSGKSS-------------VVLVSDGLETCDAD------PC 156 Query: 313 NEAKKRGAI-----VYAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHI 361 +AK G I V+ IG V C + F+ N + DA Sbjct: 157 AQAKALGIINPGFDVHVIGFDVTEEEFKSLQCIATETGGKFFRANNAEELKDALRRT 213 >gi|124003889|ref|ZP_01688737.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123990944|gb|EAY30411.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 704 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 89/239 (37%), Gaps = 37/239 (15%) Query: 117 DIVVVPQNE--GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 + + + S + Y PW + + + + + + ++ + ++ Sbjct: 294 TPTKDKEGKLQTHPFSVNTEY-------GTCPWNPHHKLLQIGLQGE-NLQTKNASPANL 345 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++D S SM+S K+ + +S +L+++ ++ + +V ++ Sbjct: 346 VFLVDASGSMDSE-----DKLPLLKRSFKVLLKQL-----TDSRTKIAIVAYAGASGLVL 395 Query: 235 LLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 VSH ++ + L + G ST G++ AY + + ++ Sbjct: 396 P-ATSVSHREKILTALENIESGGSTAGGEGIELAYKI-----AQQAFIAGGNNR----VI 445 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACASPNSFY 346 TDG+ N+ D++ + + +K G + +G + E L + N +Y Sbjct: 446 LATDGDFNVGLSSDEELMQLISNKRKSGVYLTCLGFGTGNLNDSMMEKLTNAGNGNYYY 504 >gi|312073180|ref|XP_003139404.1| hypothetical protein LOAG_03819 [Loa loa] gi|307765434|gb|EFO24668.1| hypothetical protein LOAG_03819 [Loa loa] Length = 444 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 55/160 (34%), Gaps = 26/160 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ ++D S + K D+ I ++ ++ + L + +S Sbjct: 263 LDVVFLMDFSGGVSD-------KRDVYIDFVSILIRSLDL---NRTSAHVAAIYYSGPKR 312 Query: 232 EFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + G +T + + YA N+ + G R+ Sbjct: 313 ARTLFHLRKHSRTEEAIKDLHQAPSNGGTTRTGEAIYYAINEFSEKFGARKGA------- 365 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 KK+I+ TDG + + A +G + A+ + Sbjct: 366 KKMIIIFTDGYSQDNPAEAS-----RAAHIKGIELKAVSV 400 >gi|197124353|ref|YP_002136304.1| von Willebrand factor A [Anaeromyxobacter sp. K] gi|196174202|gb|ACG75175.1| von Willebrand factor type A [Anaeromyxobacter sp. K] Length = 480 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 76/220 (34%), Gaps = 28/220 (12%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 H+V+ + + + + ++ VLDVS SM K+ A +SI +++ Sbjct: 16 EAKDAHLVVSLVAPHGNARAERSPVCVIPVLDVSGSMHG------EKLHFATQSIMKLVD 69 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGL 263 + G+V FS ++E L+ + L +TN GL Sbjct: 70 HLAPGD------FCGVVVFSTEVETLAAPTEMTQDRKDALKVALGRLRPR-HNTNLAGGL 122 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIV 322 + + + + + ++ TDG N + L EA A V Sbjct: 123 ------LAGLDHAKVTKVPDGMPVR--VILFTDGLANEGPATSPEGLCALLEANLGTASV 174 Query: 323 YAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 A G E LR ++ ++ V +P AF+ Sbjct: 175 SAFGYGDDADQELLRELSTLGRGNYAYVRSPEDALTAFAR 214 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 43/247 (17%), Positives = 86/247 (34%), Gaps = 34/247 (13%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM- 174 + I + + YS S +K L T I+ + + + TD +D+ Sbjct: 88 IGITEILRGRIYSQSDCLSFKKVLIIVVVGSICTG---IIAGVCEILYELNGTDYDVDIA 144 Query: 175 ---MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 M+V+D S SM++ K D + + N +LE + N V GL+ FS I Sbjct: 145 QETMLVIDDSSSMKT-----SDKNDRRLTAANELLEHID----GNRKV--GLIRFSKDIH 193 Query: 232 EFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +++ S L +++ T+ L N A Sbjct: 194 CYIPMDYLKVNKSTLNHELEN-KAKEGGTDINDALYAVLN---------AFDKVGTATGS 243 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++ +TDG++ + +++ + N + I + F++ S Sbjct: 244 RSVILLTDGKSTTNVDEEYLINRANSM---NIQINVISLGNHTDKAFIKRITSSTGGKAA 300 Query: 349 ENPHSMY 355 + Y Sbjct: 301 KTSSDFY 307 >gi|149919655|ref|ZP_01908134.1| hypothetical protein PPSIR1_07278 [Plesiocystis pacifica SIR-1] gi|149819598|gb|EDM79026.1| hypothetical protein PPSIR1_07278 [Plesiocystis pacifica SIR-1] Length = 435 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/192 (17%), Positives = 61/192 (31%), Gaps = 36/192 (18%) Query: 159 TSSVKVNSQTDARL--DMMIVLDVSRSM----ESFFDSSITKIDMAIKSINAMLEEVKLI 212 + S A L D++ +LD S SM D T+ +++ +++E Sbjct: 145 EGACVEQSGEAAPLPPDVLFLLDRSGSMLEVGFDVQDPDKTRWQALYEAVEGVVDE---- 200 Query: 213 PDVNNVVQSGLVTFSNK----------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 D + + G TFS + + L L L++ T + Sbjct: 201 -DADQQIAFGAKTFSTQGFGACGVSPTPDVPIALSNAAQLLSTIPGPLAQVNGGTPTNLA 259 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG----ENLSTKEDQQSLYYCNEAKKR 318 L N + Q K ++ +TDG EN ++ A++ Sbjct: 260 LTNTMNYMESYQADG----------DKFVILITDGRIGCENDDDAATAAAVSTITAARED 309 Query: 319 -GAIVYAIGIRV 329 G Y + I Sbjct: 310 HGITTYVVCIAP 321 >gi|312082747|ref|XP_003143572.1| von Willebrand factor type A domain-containing protein [Loa loa] gi|307761265|gb|EFO20499.1| von Willebrand factor type A domain-containing protein [Loa loa] Length = 307 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 63/193 (32%), Gaps = 20/193 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ +LD S S++ K I I ++ V + + L+ FS Sbjct: 72 PLDLVFMLDSSGSLK-------NKFQDEIDIIRRIVNHVTIGEPA---TRVMLIQFSGVQ 121 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E F + + L T +Y + + + GMR ++ Sbjct: 122 HLEFNFKKFKDRDDILGALDVLRHVSGITRMGDAFEYTLSMLNEKNGMR------SSDVP 175 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYL 347 KII ++DG +D + + +YA G + +E L P Sbjct: 176 KIIYLLSDGRTHDYPKDTEMAEQLRQQIDN-VDIYAYGTGEYVAINELLAITKDPKKIVT 234 Query: 348 VENPHSMYDAFSH 360 +N + F Sbjct: 235 NQNLDDLEPMFDR 247 >gi|145596106|ref|YP_001160403.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145305443|gb|ABP56025.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 576 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 43/226 (19%), Positives = 74/226 (32%), Gaps = 33/226 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLI 212 + + +V S M+ V+DVS SM S +++ + + + L Sbjct: 360 PLAVERAVSSWSIATQSGRMLCVIDVSGSMREPVASANGVSRQQVTLDAAGRGLHLFD-- 417 Query: 213 PDVNNVVQSGLVTFSN---------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + Q GL FS ++ E L S L++ + + T + Sbjct: 418 ----DSWQIGLWEFSTNLGSGRDYRRLVEIGPLSSQRSELEQALAQIQPTRGDTGLFDTV 473 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK----KRG 319 AY + + Q + IV TDG+N Q + +R Sbjct: 474 LAAYEAVQEDWDEGQVNS---------IVLFTDGKNDDDNGISQQQLIAELERIKDPERP 524 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF-SHIG 362 V IGI S L + + ++ E+P + D F I Sbjct: 525 VQVVLIGIGADVSKAELESITEVTGGGSFITEDPTKIGDIFLKAIA 570 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 74/201 (36%), Gaps = 32/201 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM IV+D S SM + I+ AI++ +++ GLV+FS++ Sbjct: 310 GDIDMQIVMDRSGSMYG------SPINNAIQAAKTLVDA-----TAEGSTAMGLVSFSSR 358 Query: 230 --------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ + G+ L+ I + G ST G A + + Q Sbjct: 359 SSVKQDFAVQQIPKPDTGIKQALKAAIDNIYASG-STALFDGSSLALDNLITYQTAAASG 417 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRA 338 ++ + DG++ S+ +++ ++ + +++ G A Sbjct: 418 APG------VVFVLADGDDNSSIKNESTVITAYQ--NANVPIFSFGYGSASPTGPLVTMA 469 Query: 339 CASPNSFY-LVENPHSMYDAF 358 A+ ++ + DAF Sbjct: 470 NATGGKYFSSPTTLSEIIDAF 490 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/252 (13%), Positives = 84/252 (33%), Gaps = 26/252 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV----- 119 N+ K K D R+ FV ++++ Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDSQMVTTFVKAFLPQTNDDKMNLIPICKT 126 Query: 120 ---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S ++ ++ P + + + + S + + Sbjct: 127 VNNTSGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSM------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +D+M+V D+S SM E + +KI + + + + ++ + N + G+ Sbjct: 187 IDLMVVADLSGSMKDGIKGEKLNGGTNSKIYILREVLKELADKSLFTQEANEYNRIGITA 246 Query: 226 FSNKIEEFFLLE 237 F+ E + Sbjct: 247 FAMGAEHPKENK 258 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 53/429 (12%), Positives = 120/429 (27%), Gaps = 108/429 (25%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 FF G + + A+ + ++ KT L + +D +++ +Q N Sbjct: 19 FFRARSGQVAVTFALVTLPVMFATAAAVDYGRRNAAKTQLDAALDGAVLAVMSQKTNTIP 78 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + ++ K + T V + + S Sbjct: 79 TTTLQNMETQFRTEAAKVP-----------------GVTVTSFTPGAPVNTSKTLSLTAS 121 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----- 185 + K L IP MP++ + T ++ ++LD S SM Sbjct: 122 YTATVKTSLASMMQIP--------AMPVSGTSSATRNTSQYINYYLLLDNSPSMGLAATD 173 Query: 186 ---------------------------------------SFFDSSITKIDMAIKSINAML 206 + ++ +I + ++++A++ Sbjct: 174 ADVQNMKIATNGCAFACHQHTFDKKGNITGDDQNDNYHIALRNNIKLRIQVLREAVSALV 233 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEF-----FLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ + + Q + TF++ + + + + I + +++ Sbjct: 234 DQANVSMLLPQQFQMEMWTFNDSVTQTKLQAMTPTLNNIKNAAPNIDIAYAYYNQSDNQT 293 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSL----------- 309 + A ++ T D + + +TDG E+ QS Sbjct: 294 DFERAIARMNTTIPASGDGLTPDKPI-RFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFIG 352 Query: 310 ----YYCNEAKKRGAIV---------------YAIGIRVIRSH--EFLRACASPNSFYLV 348 C+ K + + Y +R S L+ACAS ++ V Sbjct: 353 PLSPSTCSALKNKNVKIGIIYTQYLPIYDNDFYNRYVRPYESQIGPSLQACASDGMYFPV 412 Query: 349 ENPHSMYDA 357 + A Sbjct: 413 TTNGDITAA 421 >gi|167535583|ref|XP_001749465.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772093|gb|EDQ85750.1| predicted protein [Monosiga brevicollis MX1] Length = 1630 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 76/206 (36%), Gaps = 27/206 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKI 230 +D++ +LD S S+ S D TK + + M+E + + + + GL +S + Sbjct: 258 IDLVFLLDGSNSV-SEPDWIRTK-----QFVVDMIEYFNVNTTYSTIGTRVGLAWYSTNV 311 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F L+ Q ++ + T + L Y +F Q+ + + Sbjct: 312 SKEFDLDTYDSSEEAQCAVQRTKREIGETYTGKALTYVEENMFAA----QYGMRTTMDIE 367 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---------RVIRSHEFLRAC 339 +V ++DG K + + + G ++ IGI + E Sbjct: 368 HYLVLISDG-----KSTDDIVLPARDLRDNGITIFTIGIISSSTSSSDAAAQRLELRTIA 422 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDI 365 S + F+ +E+ + + + + + Sbjct: 423 GSEDRFFEIESIQFVSEVVDSLRETL 448 >gi|158335198|ref|YP_001516370.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305439|gb|ABW27056.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 573 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 66/199 (33%), Gaps = 30/199 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++IV+D S SM TK+ ++ L+ + V L+ Sbjct: 385 QTAAKKPSQVVIVVDSSGSMTG------TKLAAVQSTLQTYLKGLGPKEKVT------LI 432 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F + + + ++ + ++++ N+ Y+ I Q + Sbjct: 433 DFDSVVRKPVSVDGSPEGQSKGLEFVVALKADGNT-----KLYDSILAAQTWLTQNLRPN 487 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG------AIVYAIGIRVIRSHE---F 335 A ++ +TDGE+ QQ + KK G ++ +G Sbjct: 488 AI--NAVIVLTDGEDSG--SGQQLPQLLSALKKSGFEGEQRIAIFTVGYGNAGDFAPDVL 543 Query: 336 LRACASPNSFYLVENPHSM 354 + + +Y +P S+ Sbjct: 544 KQIAEANGGYYRQGDPASI 562 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 50/144 (34%), Gaps = 22/144 (15%) Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKY 265 V+ + + G V +S + F + + + IKY K + ++ L+ Sbjct: 101 VRNLDMPKTGARVGTVIYSKRSIVLFDFKDNKTDIILQLNTIKYREKPNRALSTGQALEL 160 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A ++F N KKI + +T K + K G I+Y + Sbjct: 161 AQKRLFQNA---------RRNSKKIAILVT-----GEKSQDDVIIPSKLMKDSGVIIYTV 206 Query: 326 GIRVIRSH---EFLRACASPNSFY 346 G+ + + +SP+ Y Sbjct: 207 GVG--EGYFLPQLESIASSPSYVY 228 >gi|149915863|ref|ZP_01904387.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] gi|149810186|gb|EDM70032.1| hypothetical protein RAZWK3B_07284 [Roseobacter sp. AzwK-3b] Length = 235 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 75/241 (31%), Gaps = 51/241 (21%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAML 206 + + + +TS + D M+VLD S SM F ++ T+ID A ++ Sbjct: 1 MGALALTLALTSPAGAVT--GCARDAMLVLDGSASMSEIGFDPTAPTRIDEARAAL---- 54 Query: 207 EEVKLIPDVNNVVQSGLVTF--------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 + +P + V + GL+T+ S F ++ + + L G++ Sbjct: 55 --AQAMPRIAVVRRVGLLTYGPGGTDACSGIDLRFGPIDDAAGPIIAAVDALRPGGLTP- 111 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKK 317 I+V +TDG E + ++A Sbjct: 112 ------------LAASVQAAAEALNYRTTPGIVVLVTDGNETCGGRPCALGTALADQA-- 157 Query: 318 RGAIVYAIGIRV--------------IRSHEFLRACASP---NSFYLVENPHSMYDAFSH 360 G V+ IG RV + C S + + + +A Sbjct: 158 EGLTVHVIGFRVQYDPFAWDNPEAQTYDGGAVVAKCLSDRTGGLYVDTQTVEELTEALEA 217 Query: 361 I 361 + Sbjct: 218 V 218 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 69/208 (33%), Gaps = 33/208 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+P+TS + + L +++V+D S SM I A K+ +L Sbjct: 396 VLPVTSRYE-KEKEQPSLALVLVIDKSGSMN---GQPIVLAREASKAAAELLSSRD---- 447 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q G++ F + L + +I + G TN P + + + Sbjct: 448 -----QVGVIAFDGSAKLVTDLTSAANKGEVLSQIDGIGA-GGGTNLYPAMVMGRDMLGI 501 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +H ++ ++DG++ + S G + + + + Sbjct: 502 ASAKIKH-----------MIVLSDGQSQGGDFEGISSELAQM----GVTISTVSLGQGAA 546 Query: 333 HEFLRACASPNS--FYLVENPHSMYDAF 358 + + A A + Y+ N M F Sbjct: 547 VDLMAAIAQIGNGRAYVTNNAEEMPRIF 574 >gi|194686290|ref|XP_001789141.1| PREDICTED: matrilin 1, cartilage matrix protein-like [Bos taurus] Length = 191 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 43/105 (40%), Gaps = 11/105 (10%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 SRS+ + + ++ ++E + + P+ + GLV +++ +++ F L Sbjct: 23 SRSVRP------VEFEKVKVFLSQVIESLDVGPNA---TRVGLVNYASSVKQEFPLRAHS 73 Query: 241 SHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S + + ++ + T + +++A + + + + Sbjct: 74 SKAELLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRPRSPD 118 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 75/212 (35%), Gaps = 29/212 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I I K NS +A +D++ V+D S SM S KI+ +S+ +L + Sbjct: 168 ITKDIEQYAKNNSSIEAGIDLICVIDKSGSM------SGQKIESVQQSLVQLLNFLSEKD 221 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQI 270 + L+ F + L+ ++ K G STN G A+ QI Sbjct: 222 ------RLCLIVFDGGAKRHTPLKTLTEGNKKYFKGAIAAISAGGSTNIAAGTDIAFQQI 275 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +Q I ++DG+ + + + +++ G Sbjct: 276 ------QQRKMKNQVTS---IFLLSDGQ---DSGAAERIQKQKDRISDVVTIHSFGYGND 323 Query: 331 RSHEFL-RAC-ASPNSFYLVENPHSMYDAFSH 360 + + + C SFY +EN + + F+ Sbjct: 324 HDADLMSKICKVGQGSFYYIENVKLLDEFFAD 355 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 74/201 (36%), Gaps = 32/201 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM IV+D S SM + I+ AI++ +++ GLV+FS++ Sbjct: 310 GDIDMQIVMDRSGSMYG------SPINNAIQAAKTLVDA-----TAEGSTAMGLVSFSSR 358 Query: 230 --------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ + G+ L+ I + G ST G A + + Q Sbjct: 359 SSVKQDFAVQQIPKPDTGIKQALKAAIDNIYASG-STALFDGSSLALDNLITYQTAAASG 417 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRA 338 ++ + DG++ S+ +++ ++ + +++ G A Sbjct: 418 APG------VVFVLADGDDNSSIKNEATVITAYQ--NANVPIFSFGYGSASPTGPLVTMA 469 Query: 339 CASPNSFY-LVENPHSMYDAF 358 A+ ++ + DAF Sbjct: 470 NATGGKYFSSPTTLSEIIDAF 490 >gi|3182932|sp|Q28902|COCA1_RABIT RecName: Full=Collagen alpha-1(XII) chain gi|13195730|gb|AAB34889.2| type XII collagen [Oryctolagus cuniculus] Length = 639 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 42/126 (33%), Gaps = 13/126 (10%) Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + L G +T + L + Q F Q +KI V +TDG K Sbjct: 7 QAVANLPYKGGNTLTGMALNFIRQQNFKTQ------AGMRPRARKIGVLITDG-----KS 55 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + K G ++AIGI+ E P+ N +D+ S I D Sbjct: 56 QDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FDSLSKIVDD 113 Query: 365 IVTKRI 370 + Sbjct: 114 LTINLC 119 >gi|332707319|ref|ZP_08427370.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] gi|332353913|gb|EGJ33402.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] Length = 464 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 70/228 (30%), Gaps = 42/228 (18%) Query: 133 SRYKIPLKFCTFIPWYTNSRHI-VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 Y F+P T + + VM K + V+D S SM + Sbjct: 5 PLYVTITPHREFLPADTPDQKLFVMLKLRPTKEVAVELPSTSFAFVIDTSGSMYEVVEGD 64 Query: 192 I---------------------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 TKID+ I+S+ ++ +L + ++ F ++ Sbjct: 65 TKPTGRVYTQDGNDYEEVIGGKTKIDIVIESLLNLVRSNQLGGSD----RIAIIQFDDQA 120 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S L+ I+ L + T G++ + +H Sbjct: 121 STIVGLTPATETSQLEAGIEKLRNYSGGTCMGEGMEQTLTMLSGQTMTSRHA-------- 172 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + TDG+ ++++ + + G + A+G+ + Sbjct: 173 ---LIFTDGQAF---DEEECRELAKQFSENGIPITALGVGDYNEDLLI 214 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 58/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 339 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A +++ G Sbjct: 340 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSRLSVPNG------ 393 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 394 -----HKKVIVLLTDGVPT 407 >gi|297285598|ref|XP_002802831.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Macaca mulatta] Length = 900 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 87/264 (32%), Gaps = 35/264 (13%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + + Sbjct: 187 LETESTFMTNQLVDALTTSQNKTKAHIQFKP-------TLSQQQKSPEQQETVLDGHLII 239 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 240 RYDVDRAVSGGTIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 293 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 294 REALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPAS--AENVNEARSFAAGIQALG-GT 350 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAK 316 N + A Q+ D + + I+ +TDG+ + + +S+ EA Sbjct: 351 NINEAMLVAV-QLLDSSNQEERLPDGSVSL---IILLTDGDPTVGETNPRSIQKNVREAV 406 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA 340 ++ +G S+ FL A Sbjct: 407 SGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 132/396 (33%), Gaps = 43/396 (10%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 L + +G ++TAI L ++ ++++ + L ++D + + + Sbjct: 9 LGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDHVIYRTSPKN 68 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + R+ + I +I+ N +L + G + I ++ + + Sbjct: 69 LYDL----REAGRDNFIRHQIEKALNTYNSRDLSNTGSIESIV----KDAVILTKNVNSL 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM-IVLDVS--R 182 + ++ + + + I+ KV + + L +M D Sbjct: 121 PLQFTVDIALSTTVQLRGSLLQMFSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLA 180 Query: 183 SMESFFDSSITKIDMAIKS--------INAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EF 233 S DS + S N +++ N +S+ + Sbjct: 181 SRGKVADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFC-MAPYHYSSILYWAV 239 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQ------HCNTED 284 L + V + +Y +T +K ++ + G Q HC Sbjct: 240 GTLTYSVDNKTTTREYYKDPYYATWDHFPYSFIKNVFDMTSNQFGDGQVLTNTNHCFPHG 299 Query: 285 ANYKKIIVFMTDGENLSTKED-----QQSLYYCNEAKKR------GAIVYAIGIRV-IRS 332 A+ K ++ + G LS ++ L C+ KR ++++G + Sbjct: 300 ASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGRDAITIFSVGFSPDQDT 359 Query: 333 HEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 LR CAS P+ +Y + + ++ + ++++T Sbjct: 360 RYTLRQCASDPSKYYEINSDENVMPIAKSLARNVIT 395 >gi|327270792|ref|XP_003220172.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 921 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 45/242 (18%), Positives = 89/242 (36%), Gaps = 39/242 (16%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + YK + + + +SR + P + +T R+ + +VLD S M Sbjct: 273 KMCSYKSTWEVISGSADFASSRPLGAPPPDPIIHLMKTQERI-VCLVLDASAQM-----G 326 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 ++ I++ L + + G+VTF++K L+ S ++R + L Sbjct: 327 KDNRLSRLIQAAKLFLLHI-----IEKGSWVGIVTFNSKGNIQAGLQRIFSDIER--EGL 379 Query: 251 SKFGVSTNSTP-----GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + +T + G+ A+ + C IV +T+GE Sbjct: 380 TSHLPTTAAGDCNICEGVNAAFQVFSQKLTSTEGCE---------IVLLTNGEGSD---- 426 Query: 306 QQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE--NPHSMYDAFSH 360 L C ++ + + I++ I S+E L +FY + + + + DAF Sbjct: 427 ---LSPCLSKNQSQEIIIHTIAFGSKASNELEKLADMTGGKTFYATDSLDSNGLIDAFGG 483 Query: 361 IG 362 I Sbjct: 484 IS 485 >gi|296194628|ref|XP_002745016.1| PREDICTED: integrin alpha-1 [Callithrix jacchus] Length = 1129 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 43/293 (14%), Positives = 92/293 (31%), Gaps = 74/293 (25%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 + +L N + D+ ++ + + +V G ++ Y + + Sbjct: 88 VKLDLPVNTSIPDVTEVKENMTFGSTLVTNPNGGFLACGPLYAYRCGHLHYTTGICSDVS 147 Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +S+ + +LD++IVLD S S+ + D +N +LE + + P Sbjct: 148 PTFQVVNSIAPVRECSTQLDIVIVLDGSNSIYPW--------DSVTAFLNDLLERMDIGP 199 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V A + Sbjct: 200 KQTQVC--------------------------------------------DSAKEAFTEA 215 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIR 331 +G R+ KK++V +TDGE+ ++ + C ++I I R Sbjct: 216 RGARRG-------VKKVMVIVTDGESHDNHRLKKVIQDCE---DENIQRFSIAILGSYNR 265 Query: 332 SH-------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + E +++ AS F+ V + ++ +G+ I D+ Sbjct: 266 GNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKTLGERIFALEATADQ 318 >gi|313215187|emb|CBY42862.1| unnamed protein product [Oikopleura dioica] Length = 289 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ V + M S F + I +IN + ++ ++ F++ Sbjct: 99 GPRDVLFVAHYTTYMGSTFADISAFFENIISTIN--------VEPSDSSIRFAFSFFNHA 150 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDAN 286 EFF +W + + + N LK A + + G + +T Sbjct: 151 YIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPEFGKGRRIDTVGT- 209 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 +V +T + + ++ K++ V +G+ + L AS Sbjct: 210 ----VVLLT-----NAASTDEVNEMADQLKEKVDRVIVVGLGYAFGQDELAGIASSPTKE 260 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + Y+ E + I +I Sbjct: 261 NLYIAEESSDLAGLVKTIADEI 282 >gi|225872657|ref|YP_002754114.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793405|gb|ACO33495.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 410 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 87/236 (36%), Gaps = 43/236 (18%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 F W N ++ S D + M I++D S SM+ K++ K+ Sbjct: 161 DFHVWEDNKPQVIASF-------SHEDLPVSMGILVDNSGSMQ-------NKLNAVDKAA 206 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 ++ + +V FS++ ++ L++ + + G T Sbjct: 207 LDLVRASNPDDEAF------IVNFSDQAYLDQGFTSSIAKLEQGLAHTEARG-GTALYDA 259 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAI 321 + + +++ + + + K++++ +TDGE+ ++ + QQ++ G Sbjct: 260 IVASADEL----------SKDARHPKQVLLVVTDGEDDASTMNLQQAIQRVQAL--HGPE 307 Query: 322 VYAIGIRVIRSHEFLRAC---------ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 +YAIG+ S + + Y + ++ + + + KDI + Sbjct: 308 IYAIGLLYDDSGDEAHRARKALEQLTEQTGGLAYFPRSLENVDEVAAEVAKDIRNQ 363 >gi|297285602|ref|XP_002802832.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 3 [Macaca mulatta] Length = 888 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/264 (15%), Positives = 87/264 (32%), Gaps = 35/264 (13%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + + N V+ + T I P + + K P + T + + Sbjct: 175 LETESTFMTNQLVDALTTSQNKTKAHIQFKP-------TLSQQQKSPEQQETVLDGHLII 227 Query: 152 RHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSITKIDMA 198 R+ V S + + + +++ V+D S SM S KI Sbjct: 228 RYDVDRAVSGGTIQIENGYFVHYFAPEGLTTMPKNVVFVIDKSGSM------SGRKIQQT 281 Query: 199 IKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +++ +L+++ N +V + + + V+ + + G T Sbjct: 282 REALIKILDDLSPRDQFNLIVFSTEATQWRPSLVPAS--AENVNEARSFAAGIQALG-GT 338 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAK 316 N + A Q+ D + + I+ +TDG+ + + +S+ EA Sbjct: 339 NINEAMLVAV-QLLDSSNQEERLPDGSVSL---IILLTDGDPTVGETNPRSIQKNVREAV 394 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA 340 ++ +G S+ FL A Sbjct: 395 SGRYSLFCLGFGFDVSYAFLEKLA 418 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 79/218 (36%), Gaps = 27/218 (12%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I + + SR+ F + IV ++ S +V ++ +++ + Sbjct: 37 IFADKRFRDELFDSRSRFSRFFPFLYLMASLFLIISIVDVLSGSEEVETKQKMN-NVIFL 95 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LDVS SM ++ + + ++ N ++ + + + G++ F+ + L Sbjct: 96 LDVSNSM----NAQDVEQNRLQQAKNLIINAMGKMTNDK----VGIIVFAGEASSIMPLT 147 Query: 238 WGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + ++ + + T+ ++ A ++ ++ + +V + Sbjct: 148 TDFTAVETYVGGVETSIVKMQGTDFLKAMQTAADKFRNVAKGS-----------RKVVLL 196 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +DGE+ E + A + G V ++GI Sbjct: 197 SDGEDNEGNEKAAAKL----ANREGIRVISVGIGSEEG 230 >gi|226487570|emb|CAX74655.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 837 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 53/320 (16%), Positives = 102/320 (31%), Gaps = 57/320 (17%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRI-----KNTWNMSFRNELRD 100 MKT ++ ++N +K + ++F L Sbjct: 151 MKTDTAEFSHSEIIMTLPSVINPRYYPKEQKHSEEFDPFTELVLVNSVPYTITFEAHLWM 210 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQN----------EGYSISAISRYKIPLKFCTFIPWYTN 150 + D+ I S +LD P N + + + PL + + + Sbjct: 211 QSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDPLTSLVSLEYGDS 270 Query: 151 SRHIVMP--------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 ++ ++ + V S D R + + ++D S SME I A S+ Sbjct: 271 NQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEG------DNISYAKTSL 324 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEWGVSHLQRKIKY---LSKFGVST 257 L+ + VN Q + F + F ++ L + Y L+ T Sbjct: 325 LLFLKSL----PVNCRFQI--IGFGSNFAALFSEPTDYSEDSLNAAMNYQKDLNADMGGT 378 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 + LK A H + + K I+F+TDG+ + + ++A+ Sbjct: 379 EAYNALKSAL-----------HSSPSGEGWFKQIIFLTDGDVGNADEVIGLVRMNVDKAR 427 Query: 317 KRGAIVYAIGIRVIRSHEFL 336 V+ IG+ S + Sbjct: 428 -----VFTIGLGQGVSTALI 442 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 49/177 (27%), Gaps = 29/177 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F + KGG+ I+T LP++ + IE + ++ D + A N Sbjct: 12 LRLFCDDRKGGIAIMTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGALAYSN 71 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV-VVPQNEG 126 + + +N +D SL + + Sbjct: 72 ANSEDQMTDAALRVAK--------------------LNGVDAANVVVSLTASPKDSRVQA 111 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + + + L + N + + +I LD S S Sbjct: 112 VHVDVTTTNTLFLAPILGVDSKLNIATEAYSSLGAAESGC--------IIALDKSAS 160 >gi|260805092|ref|XP_002597421.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] gi|229282686|gb|EEN53433.1| hypothetical protein BRAFLDRAFT_122634 [Branchiostoma floridae] Length = 705 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 63/186 (33%), Gaps = 29/186 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ VLD S S+E + + ++ +++ ++ I D + G + + + Sbjct: 506 DVVFVLDRSSSIE---------LSIFNQAKQFIVDTLQCIADRGVQIGVGYIVYDCVPKT 556 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S + I Y G +T + ++ Y ++ + Sbjct: 557 IITLGTYTSDDPAVSGIIHYEMTEGGTTRTPLAIR--YMRLTS---------KFRDGAAR 605 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLV 348 V +TDG+ D S G +YA+ I + S + + Sbjct: 606 AAVILTDGQTEGDAADDASDAR-----DAGIEMYAVAIGSFVDGSALQAIAGSGANVFDS 660 Query: 349 ENPHSM 354 +P ++ Sbjct: 661 SDPCAL 666 >gi|82593966|ref|XP_725227.1| ookinete protein [Plasmodium yoelii yoelii str. 17XNL] gi|23480151|gb|EAA16792.1| ookinete protein-related [Plasmodium yoelii yoelii] gi|110681630|dbj|BAE98270.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium yoelii] Length = 2401 Score = 48.6 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 75/210 (35%), Gaps = 25/210 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFS 227 + D+ ++LD S S+ K +K + E+ VK + N + G++ F+ Sbjct: 61 QSYYDLTLILDESGSI---------KKKHWVKYVVPFTEQIVKGLKVGENDIHVGILLFA 111 Query: 228 ----NKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + I + + + L +K+ L+ + G T LKY+ N+ + R Sbjct: 112 LKNRDYITFDNDIRYKKTELLKKVNDLNDDYRSGSDTYILEALKYSMNKYSMSKNAR--- 168 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + K+ + TDG + + + Y +++ + +G+ + Sbjct: 169 ----DDAPKVTILFTDGNDRHASKSEFHKMYSEY-QEKHIKLLVLGVSAAEEKKLKVIAG 223 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + ++ ++I + K Sbjct: 224 CEEHSSCPSAMKAEWETINNITNKLTNKIC 253 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 70/203 (34%), Gaps = 19/203 (9%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + +P +N +P + + D+ ++LD S S+ S+ + Sbjct: 786 SILPPESNKPETEVPTPQCIGNDCFCHDIYDLTVILDESGSIGSYN---------WKNQV 836 Query: 203 NAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 E+ + + + V G++ F+ +F + S+ + + L K + + Sbjct: 837 YPFTEQFINNLEISEDKVHVGIMLFAQFNRDFVMFSDKESYDKEHMMKLIKGLKDSYKSG 896 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKR 318 G Y + H ++ K+ + TDG N + + SL Y K+ Sbjct: 897 GYTYIIEALNYGLENYTHHKDSRSDVPKVTMLFTDGNNTNPGDKLLSDASLLY----KEE 952 Query: 319 GAIVYAIGIRVI--RSHEFLRAC 339 + +G+ + L C Sbjct: 953 NVKLLVVGVGASTMANLRLLAGC 975 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 46/255 (18%), Positives = 85/255 (33%), Gaps = 31/255 (12%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 +G N + S + P++ G + + + T P + + P Sbjct: 518 ESSGVTNPTETPESSGVTNPTETPESSGVTNPTETPESSGVTNPTETPESSGVTNPTNPD 577 Query: 159 TSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 N + D+ ++LD S S+ + I ++ + + D Sbjct: 578 NVVSCQNDEDCYCKDFYDVTLILDESASIGESRW-----VLEVIPFAKDIINHLNIDYDS 632 Query: 216 NNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNST---PGLKYA-Y 267 V G++ FS+ + + L +KI L + + LKYA Y Sbjct: 633 ---VHVGVLLFSHYALDLVPFSDEARYNKISLLKKIDSLKTNYGNGYESFIVKTLKYALY 689 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N I D +N KI + TDG N S++ D + + + IG+ Sbjct: 690 NYIKDS---------GRSNAPKITMLFTDG-NDSSESDMDMYNIGSLYRTERVKLLVIGV 739 Query: 328 RVIRSHEF--LRACA 340 + ++ L CA Sbjct: 740 SMASENKLKQLVGCA 754 >gi|325473817|gb|EGC77005.1| BatB protein [Treponema denticola F0402] Length = 286 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 63/160 (39%), Gaps = 22/160 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +M V D+S+SM S D ++I + + + +LE++ + V GLV + Sbjct: 46 VSVMFVSDISKSM-SLQDIQPSRIAVQRQFLKILLEKMHKTSPESAV---GLVITKGEGV 101 Query: 232 EFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + L I LS TN G+ A + + +G Sbjct: 102 LSVPLSFEKNALSSAINALSPLILSSTGTNLEAGVLRALDSFGENRGNS----------- 150 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 KIIV TDG S L+ + KK AI+ +G Sbjct: 151 KIIVLCTDGGETSGS----LLHAAEKIKKTDAILIIVGFG 186 >gi|94971019|ref|YP_593067.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94553069|gb|ABF42993.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 391 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 74/237 (31%), Gaps = 23/237 (9%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISA------ISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 + P + + ++ K + S + Sbjct: 31 TSSPPAEERQPEAQPDTPSFHVEVKEVTLPVTVRDKHGKIVQTLNKEDFSLVQDGKTQTI 90 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + T L + +++D S S+ + + L+++ P Sbjct: 91 TQFRRDTKLPLTLGLLVDTSYSVRDELPAE-------KTASEKFLDDMLAQPKDQAF--- 140 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG----VSTNSTPGLKYAYNQIFDMQGMR 277 L+ F ++E L L R I L G S++ + Q++D + Sbjct: 141 -LIHFDREVELMTDLTSSKDKLHRGIGELETSGPPSQSSSDDGQRHRRGGTQLYDAIYLA 199 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + +K IV +TDGE+ +KE A++ AIVYAI + + Sbjct: 200 ASEILQKQQGRKAIVVLTDGEDRGSKETLTDAV--EAAQRADAIVYAIYFKGEQEQS 254 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 75/232 (32%), Gaps = 38/232 (16%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 + +P+ ++ + E SH + L ++++ + +T+ N + + Sbjct: 1 MSFMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTE--NNYRNDRASNNRNN 58 Query: 81 DILCRIKNTW--NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 ++ ++ + F N + + + SL + Y ++ ++ Y Sbjct: 59 YLVTSYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQM-------NYQLALLNSY--- 108 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES-----FFDSSIT 193 LK W N SS+ +D++ V D S SM+ ++ IT Sbjct: 109 LKQTPSPTWDVNENGAARKYLSSI------AEPIDVVFVTDFSGSMDLPFGDIERNNRIT 162 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 K+D + + +N G V WG + Sbjct: 163 KLDELKAIFVKLNNRIFSNDGINT---IGFV----------PFSWGTKRISA 201 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 381 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 440 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 441 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 471 >gi|123233471|emb|CAM28080.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 637 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 42/126 (33%), Gaps = 13/126 (10%) Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + L G +T + L + Q F Q +KI V +TDG K Sbjct: 5 QAVANLPYKGGNTLTGMALNFIRQQNFRTQ------AGMRPRARKIGVLITDG-----KS 53 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + K G ++AIGI+ E P+ N +++ S I D Sbjct: 54 QDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDT-HAYNVAD-FESLSRIVDD 111 Query: 365 IVTKRI 370 + Sbjct: 112 LTINLC 117 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 72/196 (36%), Gaps = 36/196 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGYDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDTTATIV 356 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + ++ G + L + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIK-GSDERNTLMAGLPTYALGGTSICSGIKYAFQVI---GELHSQLDGSE----- 407 Query: 290 IIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYL 347 +V +TDGE+ + C +E K+ GAIV+ I + + + + S + Sbjct: 408 -VVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGSAADEAVIEMSNITGGSHFY 459 Query: 348 VENPHS---MYDAFSH 360 + + DAF Sbjct: 460 ASDEAQNNGLIDAFGA 475 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 62/168 (36%), Gaps = 24/168 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N + LD+++V+D+S + + + ++ + + P + + Sbjct: 235 NPISGCELDLVLVMDLSTTTHPIYQHY-------KEMAVELVGRLLIGP---RFTRVAFI 284 Query: 225 TFS----NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 TFS ++ + I+ + G +T G++ A Q ++ Sbjct: 285 TFSSVGKSRTHFNLNRYDNAQQIIEAIRRVESTGGTTAVGEGIRIA-----TQQQEKRMG 339 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + KK ++ TDG N ++ S AK G ++Y+IG Sbjct: 340 GRPISIAKKAMLIFTDGWSNKGPDPEEMSKE----AKAAGFVLYSIGY 383 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 74/209 (35%), Gaps = 35/209 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D++ V+D S SM +KI+M ++ M +++K + LV Sbjct: 42 ERKERKGIDLICVVDKSGSMAG------SKIEMVKSTLAFMFDQLKPTD------RIALV 89 Query: 225 TFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 F + I G S ++ + + G TN + L +G+R Sbjct: 90 EFDSNISTSLQFTNMNESGRSKAKQVVSNIRA-GSCTNLSGAL---------FEGLRLIG 139 Query: 281 NTEDANYKKIIVFMTDG------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +AN ++ TDG N + + + E + V+ G Sbjct: 140 QRTNANEVTSLLLFTDGLANEGITNTNEIVKKMTTMIHEEIRTN-LTVFTFGFGTDTDAN 198 Query: 335 FLRACASPNS--FYLVENPHSMYDAFSHI 361 L + + + +Y ++ + AF ++ Sbjct: 199 MLTSISQAGNGLYYFLQTTDDIPKAFGNV 227 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 61/163 (37%), Gaps = 9/163 (5%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G + ++ AI++ I +VLG I+V +++F + L + D + + + + ++ + N Sbjct: 16 RGSVAVMAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSP 74 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISR 134 ++ N F + R +G I+ TS + +S Sbjct: 75 -----AMMAAQSNALANGF--DYRASGQTLSIECGRWDTSATPYFNSTFTPLNAVSVSVT 127 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 ++P F + S + S+ K + + + Sbjct: 128 QQVPYIFLGRFFGKSGSTGATVAAFSTAKAINIDSFTIGTTLA 170 >gi|124249351|ref|NP_032432.2| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Mus musculus] gi|74146297|dbj|BAE28922.1| unnamed protein product [Mus musculus] gi|148692827|gb|EDL24774.1| inter-alpha trypsin inhibitor, heavy chain 1, isoform CRA_a [Mus musculus] Length = 907 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 73/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 290 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 337 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++ S +TN GL + QG ++ + Sbjct: 338 KGSLVPASNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 393 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 394 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 452 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 453 QRIYEDHDATQQLQGF 468 >gi|326778242|ref|ZP_08237507.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326658575|gb|EGE43421.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 528 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 77/191 (40%), Gaps = 21/191 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++K + T R + +VLD S SM F+ D S + ++ A L+E +P Sbjct: 328 AALKAHGLTGLRARVYLVLDRSGSMRPFYRDGSAQHLGDRALALAAHLDEDATVP----- 382 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS I+ +E +S + ++ L G+ A ++ Sbjct: 383 ----VVFFSTDIDGTGAIE--LSGHEGRVDELHA-GLGRLGRTHYHRAVEEVV------A 429 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H +A +++F TDG + + +Q+L A G + + +FLR Sbjct: 430 HYEKTEATGPALVIFQTDGPPDAKQAARQALAEAARL-PLFFQFVAFGEQDGKGFDFLRK 488 Query: 339 CASPNS-FYLV 348 +PN+ F+ Sbjct: 489 LGAPNTGFFHA 499 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 78/220 (35%), Gaps = 40/220 (18%) Query: 163 KVNSQTDARLDMMIVLDVSRSMES-----FFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++V+D+S SME ++++ A K + + + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATKSGEQLSRLTAAKKVLRDFVTQ-------RQ 139 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + + L + G STN + Sbjct: 140 GDRFGLILFG---DAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ + + +I++ +TDG T + A +G +Y I + + Sbjct: 188 GLGIKVFEQSPSTSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPEN 245 Query: 333 -------HEFLRACAS--PNSFYLVENPHSMYDAFSHIGK 363 + + +S ++ + + +A+ I + Sbjct: 246 VGEQPLDMDVVSRVSSLTQARSFVAIDQSQLNEAYQVIDQ 285 >gi|189518186|ref|XP_001331201.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Danio rerio] Length = 969 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 70/204 (34%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+D S SM TK+ +++ ++ E++ + N VTFSN+I Sbjct: 317 VVFVIDTSASM------LGTKMKQTKQALFTIINELRPNDNFN------FVTFSNRIRVW 364 Query: 232 ------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + ++ I Y+ T+ G++ + D + + Sbjct: 365 QPGKLVPVTPIS--IRDAKKFI-YMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSV 421 Query: 286 NYKKIIVFMTDGENLSTKEDQQS-LYYCNEAKKRGAIVYAIGIRVIRSHEFLR-----AC 339 + I+F+TDG + + A + ++ IG+ + L C Sbjct: 422 SL---IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLLERMSLDNC 478 Query: 340 ASPNSFYLVENPHSMYDAF-SHIG 362 + + M F IG Sbjct: 479 GTMRRIPEDADASLMLKGFYDEIG 502 >gi|307154013|ref|YP_003889397.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984241|gb|ADN16122.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 349 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/230 (17%), Positives = 81/230 (35%), Gaps = 37/230 (16%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM---ESFFDSSITKIDMAIKSINA 204 + + ++++ + +MI+LD S SM + D + K+D+A +SI Sbjct: 38 CLLTANPSPSLSTTTSPTPIAKSP--VMIILDASGSMVKEKEKIDGKL-KLDIAKESITT 94 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS--TPG 262 ++ ++ G + +E ++ + L T + T Sbjct: 95 IINSSDASNLELSLTALG----HKSKDCKDNIEIFEEKNKKILDVLPAIKGGTETPLTEA 150 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------ENLSTKEDQQSLYYCNEAK 316 ++ A ++ D + K IIV +TDG + + + L + K Sbjct: 151 IRQASSKFKDKKQ------------KNIIVLLTDGVESCKKDKNHNPDQKAPLEEVDSLK 198 Query: 317 KRG----AIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 K G + +G ++ ++E L A Y EN + A S Sbjct: 199 KEGFNFILNIITLG-KIKTNNEILEELAKAGGGKLYKPENTEELNKALSE 247 >gi|28948724|pdb|1N3Y|A Chain A, Crystal Structure Of The Alpha-X Beta2 Integrin I Domain Length = 198 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/215 (18%), Positives = 79/215 (36%), Gaps = 32/215 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + D++ ++D S S+ S + + A++ + + Q Sbjct: 1 GSHMASRQEQDIVFLIDGSGSISS------RNFATMMNFVRAVISQFQ-----RPSTQFS 49 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L+ FSNK + F E S+ + + + T + ++ +++F Sbjct: 50 LMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQGFTYTATAIQNVVHRLFHAS------ 103 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSHEFL 336 + KI++ +TDG+ D + + +A G I YAIG + S + L Sbjct: 104 YGARRDAAKILIVITDGKKEGDSLDYKDVIPMADA--AGIIRYAIGVGLAFQNRNSWKEL 161 Query: 337 RACA---SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A S + VE+ +DA I + K Sbjct: 162 NDIASKPSQEHIFKVED----FDALKDIQNQLKEK 192 >gi|71028594|ref|XP_763940.1| thrombospondin-related protein [Theileria parva strain Muguga] gi|68350894|gb|EAN31657.1| thrombospondin-related protein [Theileria parva] Length = 931 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 72/176 (40%), Gaps = 9/176 (5%) Query: 173 DMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D ++VLD S SM ++ + + ++ + + S + L + + V+ + + + Sbjct: 271 DFVLVLDESESMSNYNWKKYVKEVTLLLASSISHLNKDNTLSIVH-YSNVPTLRLNFQKI 329 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-KI 290 + + + + + +G S + L+Y Q+ + + + +++ K+ Sbjct: 330 DPEAFQNTLDKINEMFQMRRSYGKS-YTGKALEYVRQQLLHLPEIPEGSSSDSPKASNKV 388 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIRSHEFLRACASPNS 344 ++ MTDG + K+ +++ + G V+A+G + L C + Sbjct: 389 VILMTDG---AAKDIEKAYNESLALRYNGVELFVFAVGFVNEENCRKLVGCPNEGH 441 >gi|295395241|ref|ZP_06805449.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] gi|294972003|gb|EFG47870.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] Length = 324 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/224 (15%), Positives = 69/224 (30%), Gaps = 44/224 (19%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + + + + D+M+ LD S SM S+ + +D I + E Sbjct: 76 ISRPVTTETLNPEQKQRDVMLCLDASGSMASY---NAKILDTYADLIKSFKGE------- 125 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G+ F++ F L ++ ++ G +N G+ + Y D Sbjct: 126 ----RIGMTVFNSAAVSVFPLTTDYEMASEFLED-AQLGFESNGLRGINF-YQGTVDRSI 179 Query: 276 MRQHC------------NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + D + ++F TD + + + AKK VY Sbjct: 180 DGSSLIGDGLASCLNNFDRNDEERSRSVIFATDNQLAGNPIY-ELMEAAELAKKHKVRVY 238 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 A+ P F+ + A G ++ T Sbjct: 239 ALA---------------PKGFFAASKLDELKQAAELTGGEMFT 267 >gi|149197491|ref|ZP_01874542.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] gi|149139509|gb|EDM27911.1| hypothetical protein LNTAR_00880 [Lentisphaera araneosa HTCC2155] Length = 890 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 60/155 (38%), Gaps = 18/155 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + IV+D S SM TK+++A + +E + + V+ + T ++ I Sbjct: 388 LSIVMDRSGSMGMTVKGGKTKMELANEGAAQTIELLGAMDSVSV---IAVDTEAHAIVPQ 444 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +L+ + + + + G GL+ ++ Q+ +G +K ++ Sbjct: 445 TVLKDAP-EIASQARRVKSQGGGIYVYTGLEESWRQLEGREG------------QKHVIL 491 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +D + ++E + + K G V I + Sbjct: 492 FSD--SNDSEEPGRYKELLADMKDEGMTVSVIALG 524 >gi|182437588|ref|YP_001825307.1| hypothetical protein SGR_3795 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466104|dbj|BAG20624.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 537 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 77/191 (40%), Gaps = 21/191 (10%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++K + T R + +VLD S SM F+ D S + ++ A L+E +P Sbjct: 337 AALKAHGLTGLRARVYLVLDRSGSMRPFYRDGSAQHLGDRALALAAHLDEDATVP----- 391 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS I+ +E +S + ++ L G+ A ++ Sbjct: 392 ----VVFFSTDIDGTGAIE--LSGHEGRVDELHA-GLGRLGRTHYHRAVEEVV------A 438 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H +A +++F TDG + + +Q+L A G + + +FLR Sbjct: 439 HYEKTEATGPALVIFQTDGPPDAKQAARQALAEAARL-PLFFQFVAFGEQDGKGFDFLRK 497 Query: 339 CASPNS-FYLV 348 +PN+ F+ Sbjct: 498 LGAPNTGFFHA 508 >gi|168702184|ref|ZP_02734461.1| hypothetical protein GobsU_21830 [Gemmata obscuriglobus UQM 2246] Length = 638 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/286 (13%), Positives = 96/286 (33%), Gaps = 34/286 (11%) Query: 94 FRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 F ++ + N + + ++++ + YS + P+ F + Sbjct: 187 FSADVNTASYANVRRMLNEGTLPPASAVFLAEFVNYFPYSYAPPPAGADPVAFHVEMGPC 246 Query: 149 TNSRHIVMPITSSV--KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + ++ ++ +++ ++D S SM+ ++ + KS+ ++ Sbjct: 247 PWNAKHHLLRVGVQAHQIPAEKLPPRNLVFLVDTSGSMQQE-----NRLPLVQKSLELLV 301 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLK 264 E++ + +VT++ G + + L G TN G+K Sbjct: 302 EKLTEKD------RVSVVTYAGDSRVALPPTSGADKKAILDVVTGLQANG-GTNGEGGIK 354 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY R N ++ TDG+ N+ ++ + + E +K + Sbjct: 355 KAYQ------FARDTFLDGGVNR---VILCTDGDFNVGVVDNGELVKLIEEQRKSKVFLT 405 Query: 324 AIGIRVIRSH-EFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIV 366 +G + + L+ A+ ++ F G +V Sbjct: 406 VLGYGMGNYKDDRLKELANHGNGHHAYIDTLDEAKKVFVEQGGALV 451 >gi|156742722|ref|YP_001432851.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234050|gb|ABU58833.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 452 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 43/223 (19%), Positives = 85/223 (38%), Gaps = 29/223 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPI------TSSVKVNSQTDARLDMMIVLDVSR 182 + A P + + SR + +PI S ++ S + +D + V +S Sbjct: 33 VQASVTGAAPSRPVNWALVADVSRSMRIPIVDETQFRSLLRTGSAQEMLVDGVPVWQLSG 92 Query: 183 SM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 S+ + D++ + +D +++++ ++E + + LV F++ G Sbjct: 93 SVPQEVRDTASSALDYVVRALHTIVERLDHHD------RLALVVFADHALLLIPGMVGAD 146 Query: 242 HLQ--RKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + R I+ L G TN G+ A NQI N + + I+ +TDG Sbjct: 147 RVTLVRAIERLPGLNLGDGTNLADGIALALNQI--------RANRDGRCADR-IILLTDG 197 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 T++ L ++A + IG+ + L A A Sbjct: 198 ---FTRDSAACLALADQAADEHIAITTIGLGGEFQDDLLTAIA 237 >gi|120406920|ref|YP_956749.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] gi|119959738|gb|ABM16743.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 248 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 66/189 (34%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +V DVS SM I +S+ + + P + + VQ G++ FS+ Sbjct: 19 LPFWLVCDVSASMGPH-------IGTLNQSLRDFRDSLATNPVLADKVQFGVIDFSDTAT 71 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 E L S L + + T A+ + + + Y + Sbjct: 72 EVIPLGDFSSA------DLERHQLRTRGGTSYGQAFTTVQQIIERDLAAGADRFRYFRPA 125 Query: 292 V-FMTDGENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 V F+TDG+ +E + L + ++A +G Y + + A+ Sbjct: 126 VFFLTDGQPTDRHWREAFRDLTFFDQASGQGFRSYPLFVPFGIGDA---DAATLAELVHP 182 Query: 349 ENPHSMYDA 357 ++ ++ A Sbjct: 183 QDRSVLFMA 191 >gi|281356510|ref|ZP_06243002.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281317202|gb|EFB01224.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 783 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 62/165 (37%), Gaps = 26/165 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M++ DVS+SM + D + ++++ A + ++E P+ + GLV F+ K Sbjct: 91 DLMVIFDVSKSMLA-TDIAPSRLEHAKFLLRQLVES---APND----RFGLVAFAGKAYL 142 Query: 233 FFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + I L+ TN L+ A G + Sbjct: 143 ACPLTSDSLAFTQYIDELNTDTVPLGGTNLEAALRVAEQAFKAAAGGNRG---------- 192 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ TDG+ L+ +E +KR ++ +G+ Sbjct: 193 -ILLFTDGDELAGNS----AALVDELRKRQIPLFIVGLGDPEVGA 232 >gi|313239054|emb|CBY14036.1| unnamed protein product [Oikopleura dioica] Length = 442 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 65/202 (32%), Gaps = 24/202 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ V + M S F + I +IN + ++ ++ F++ Sbjct: 164 GPRDVLFVAHYTTYMGSTFADISAFFENIISTIN--------VEPSDSSIRFAFSFFNHA 215 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDAN 286 EFF +W + + + N LK A + + G + +T Sbjct: 216 YIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGT- 274 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 +V +T + + ++ K++ V +G+ + L AS Sbjct: 275 ----VVLLT-----NAASTDEVNEMADQLKEKVDRVIVVGLGYAFGQDELAGIASSPTKE 325 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + Y E + I +I Sbjct: 326 NLYTAEESSDLAGLVRTIADEI 347 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 76/234 (32%), Gaps = 35/234 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 H++ KV++ ++ +DVS SM ++++ +S+ ++E+ Sbjct: 171 RQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMN-----MENRLELVKRSLAMLVEK 225 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + VV + + S + I L G STN GL+ A Sbjct: 226 LDSRDTLAIVV------YGDTARTVLEPTRIMDRSRILEAINALHPEG-STNVQAGLQVA 278 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 Y +R+ + ++ +DG N + ++G + + Sbjct: 279 Y--AIAASQVREGATSR-------VILCSDGVANNGITQADSIFQSVKAYAQQGVRLTTV 329 Query: 326 GIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAF--------SHIGKDIVTK 368 G + ++ L S + V+ F I +D+ + Sbjct: 330 GFGMGNYNDELMERLSHVGDGQYAYVDALPEARRIFIEQFTGTLQLIARDVKVQ 383 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/284 (13%), Positives = 76/284 (26%), Gaps = 80/284 (28%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLE-- 207 + + + I + +D+M+ D S SM + D ++ +D M E Sbjct: 506 NDTVTVTIQLIGDGWALQPDPIDVMLTADRSGSMLRDYPDRMVSLMDALEDFGIEMKEGW 565 Query: 208 ------------EVKLIPDVNNV-----------------VQSGLVT-FSNKIEEFFLLE 237 +I N +G +++ L Sbjct: 566 DRLGLASFGTYGNADIIDYGNRYWAGYDNSYYDDWEYISEHYAGNDKNYNDYATIDLNLT 625 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 S ++K L G T GL Y+ + D + K +V ++DG Sbjct: 626 EDFSDYNTEVKALVPDG-GTPMRKGLYYSIKHLRDN---------GRDDAVKAVVVLSDG 675 Query: 298 ENLS-----------------------------TKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +Q + AK +++I Sbjct: 676 DYNYYGDPLARGSGGTKWDWSDMQEKYYTFSDLNSSEQDMRIF---AKDNDIKIFSIAYA 732 Query: 329 ---VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 L+A A + +Y + + + + I ++ Sbjct: 733 DGISSEGKAVLQALAEGTGGKYYYAPSGEDLEEIYEDIAGELKE 776 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 67/175 (38%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++ ++ LD+S SM + D + ++++ + K I+ ++E N+ V Sbjct: 79 GSKMETVKRQGVETVVALDISNSMLAQ-DVTPSRLEKSKKLISRLVETFN-----NDKVA 132 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 + F+ + + + ++ +S T+ + A Sbjct: 133 M--IVFAGEAFTQLPITSDYVSAKMFLETISPSLITTQGTDIRGAIDLAMKSFT------ 184 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + IV +TDGEN + ++ EA K+G V+ +G+ Sbjct: 185 -----PNEGVGRAIVLITDGENH----EGGAIEAAQEAAKKGMRVFVLGVGSPDG 230 >gi|168008926|ref|XP_001757157.1| predicted protein [Physcomitrella patens subsp. patens] gi|162691655|gb|EDQ78016.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1068 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 78/258 (30%), Gaps = 39/258 (15%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV----MPITSS 161 D V + P +++ S L R +V T + Sbjct: 206 GTTDTVDPAHIS-TFNPNQARVTLTVDSLLDCDLVIVINSLGLDQPRALVERHPSHGTHA 264 Query: 162 VKVNSQTDARLD------MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + Q L M+ ++D S SM T+I A +++ L + Sbjct: 265 IALTFQPRFALQPLRTSEMIFLVDRSGSM------MGTQIKQAGEALELFLRSIPFENHY 318 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFD 272 N+V G ++ +E+ L++ + Y + T + + Sbjct: 319 FNIVGFGS---NHNFLFPTSVEYTEDSLKKAVHYAQTIQANMGGTEIANAFFEVFQR--- 372 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRG-AIVYAIGIRV 329 N I +TDG ++ +S+ + A+ V+ +G+ Sbjct: 373 ----------RRRNVPTQIFLLTDGMVWDAEQLTKSIIEAVDDGARNNSPVRVFTLGVGN 422 Query: 330 IRSHEFLRACASPNSFYL 347 SH + + A Y Sbjct: 423 AVSHHLIESVARAGGGYA 440 >gi|74315933|ref|NP_001028276.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|72679321|gb|AAI00122.1| Inter-alpha (globulin) inhibitor H3 [Danio rerio] Length = 892 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 72/202 (35%), Gaps = 33/202 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ ++D S SM+ KI+ ++ +L ++ GL+TFS Sbjct: 263 QRIPKNVVFIIDQSGSMQGN------KIEQTRMAMLRILSDLAKDD------YFGLITFS 310 Query: 228 NKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + I+ + V + +K + G +T+ + A N I Q+ Sbjct: 311 SHIQAWKPELLKATAENVEEAKTFVKQIRS-GGATDINGAVLNAVNMI------NQYTQE 363 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 A+ I++ +TDG+ S + ++ A +Y +G EFL + Sbjct: 364 GSAS---ILILLTDGDPTSGVTNPVTIQQNVKTAIGGKYPLYCLGFGFNVRFEFLEKMSL 420 Query: 342 PNS-----FYLVENPHSMYDAF 358 N+ Y + F Sbjct: 421 ENNGAARRIYEDSDADLQLQGF 442 >gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120] gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120] Length = 427 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 74/233 (31%), Gaps = 36/233 (15%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 +S+ + S+V + + L++ ++LD S SM + M +++ Sbjct: 12 TNIDANQLSSQRQLAISISAVAEQFEQNLPLNLCLILDQSGSMHGQP------LKMVVEA 65 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + +L+ ++ + +V F S + + ++ +I+ + T Sbjct: 66 VEKLLDRLQPGD------RISVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVI 119 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL----------STKEDQQSL 309 GL+ ++ +TDG + ++ L Sbjct: 120 AEGLQQGITELMKGTRGAVSQA----------FLLTDGHGEDSLKIWKWEIGPDDSRRCL 169 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 + +A K + +G + + L A + +E P F+ Sbjct: 170 EFAKKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQAVHHFNR 222 >gi|320589835|gb|EFX02291.1| von willebrand factor type a domain containing protein [Grosmannia clavigera kw1407] Length = 735 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/260 (15%), Positives = 86/260 (33%), Gaps = 30/260 (11%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 D S SL++ +P ++G I P I+S + Q Sbjct: 47 DSSSSASLNVYRLPSDDGILIKVKPPA-YPGNIP--------------SISSQRGLAGQG 91 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQSGL 223 +++V+DVS SM+ TK + + +L+ VK ++ +N G+ Sbjct: 92 HVPCSIVLVIDVSGSMQEDAPVPATKGEPMESNGLTVLDLVKHAARTILETLNEHDCLGI 151 Query: 224 VTFSNKIEEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 VTFS +L + + ++ + T + + + + Sbjct: 152 VTFSEDANVLLMLTPMTQVNKAKALQVILDLEPLTVTNL-----WKGLTAGIEIFSSKAQ 206 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 + I+ +TDG Q + K A ++ G L++ + Sbjct: 207 FSSVPS--IMLLTDGLPNFMHPPQGYIPKLRTFGKLPAPIHTFGFGYNLRSGLLKSISEL 264 Query: 341 SPNSFYLVENPHSMYDAFSH 360 + ++ + + + F H Sbjct: 265 TGGNYAFISDAGMLGTVFIH 284 >gi|317488474|ref|ZP_07947025.1| von Willebrand factor type A domain-containing protein [Eggerthella sp. 1_3_56FAA] gi|316912406|gb|EFV33964.1| von Willebrand factor type A domain-containing protein [Eggerthella sp. 1_3_56FAA] Length = 551 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 76/192 (39%), Gaps = 25/192 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW ++ +VM + ++ + +++ ++DVS SM+ K+ + S + Sbjct: 162 PWNDQTKLLVMGFATEKDGDASSAGA-NLVFLIDVSGSMD-----DPDKLPLVKDSFATL 215 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 +E + + +VT+++ ++ + + R + L G STN GL Sbjct: 216 VEGL------TERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVDGLVAEG-STNGEAGL 268 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 + AY ++ + + N +V +DG+ N+ + + + ++ G + Sbjct: 269 EQAY-RLAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVERKRETGVYL 319 Query: 323 YAIGIRVIRSHE 334 +G + Sbjct: 320 SVLGFGSGNYKD 331 >gi|294140885|ref|YP_003556863.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327354|dbj|BAJ02085.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 765 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 41/237 (17%), Positives = 80/237 (33%), Gaps = 40/237 (16%) Query: 138 PLKFCTFIPWYTNSRHIV--MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 P + W + IV + + ++ R + +V+D S SM I Sbjct: 227 PNSYVGEGLWAKDGEQIVALTSFYPQLDIQTKRQPRC-IKMVVDCSGSMTG------DSI 279 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGL---VTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 + A ++ A++E++ D N++ G + F+ ++ + +++ ++ L Sbjct: 280 NQASIALQAIVEQL-ADDDWFNIILFGSHHKLIFNKSVQATP---NNLQRVEKTLQNLRA 335 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T L+ AY+ N I+ +TDG+ Y Sbjct: 336 DFGGTEMDSALQAAYSS------------KTPKNIPTDILLITDGQIWDQD------YLL 377 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS-----PNSFYLVENPHS-MYDAFSHIGK 363 A+ + +G+ S FL AS EN S + F I + Sbjct: 378 TNAQASKHRHFVVGVGSAVSEAFLSKLASETGGASEFVTPNENMSSRIVRHFERIKQ 434 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 77/218 (35%), Gaps = 37/218 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +D+++V+D S SM K+DM +++ M++++K + Sbjct: 682 QQAQKERKGVDLVLVVDKSGSMAG------QKLDMVKSTLSFMVDQLKEKD------RVA 729 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V F +++ L G ++ +S G TN + L + + Q + Sbjct: 730 IVEFDTQVKTNLDLTKMDIEGKKKAKQVSSAISP-GSCTNLSGALFTSLKLLASRQQEKN 788 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--------RGAIVYAIGIRVI 330 + ++ TDG L+ + + + ++ G Sbjct: 789 EVTS--------VILFTDG--LANRGLISTNEILQNMQDLMDELLSTSNVTIHTFGFGQD 838 Query: 331 RSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIV 366 L + A + + +E + AF ++ ++V Sbjct: 839 TDANMLTSIAQKGNGLYDYLETADDIPKAFGNVIGNLV 876 >gi|126724455|ref|ZP_01740298.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] gi|126705619|gb|EBA04709.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 66/207 (31%), Gaps = 35/207 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGLVTFSNK 229 + ++++D S SM + ++ E V + G Sbjct: 165 PMSFVLLVDRSGSMAEIMPEVREAAKEFVAALPDTAECSVSSFAGDWDFSHRGPEGALTC 224 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E F + + G +TN L+ AY + + + +++K Sbjct: 225 KPENFAFDN------------IQPGGTTNIYGPLREAYGWLSESE---------RTDHQK 263 Query: 290 IIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEFLRACASPNSFYL 347 ++ +TDG N + Q+L ++A Y + +LR+ A ++++ Sbjct: 264 AVILLTDGRANDDAASESQTLAMKDDA-------YTFVYYMGDSDDRWLRSLA--DNYFS 314 Query: 348 --VENPHSMYDAFSHIGKDIVTKRIWY 372 + F+ + + + Sbjct: 315 GGGHVSAQLERYFNVVSDAYSAQTVLE 341 >gi|118918435|ref|NP_659429.4| von Willebrand factor A domain-containing protein 3B [Homo sapiens] gi|296439299|sp|Q502W6|VWA3B_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3B Length = 1294 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 562 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|73949158|ref|XP_535195.2| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Canis familiaris] Length = 946 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 75/197 (38%), Gaps = 30/197 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q ++ F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAED------QFSVIDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V+ ++ I+ + G TN L A + + + D Sbjct: 359 RNDLVSATRTQVTDAKKYIEKIQPSG-GTNINEALLRAIFILNEANNLGL----LDPESV 413 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--- 343 +I+ ++DG+ K + + ++++GI ++FL+ ++ N Sbjct: 414 SLIILVSDGDPTVGELKLSKIQKNVKQHIRDN-ISLFSLGIGFDVDYDFLKRLSNENRGI 472 Query: 344 --SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 473 AQRIYGNQDTSSQLKKF 489 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 72/179 (40%), Gaps = 27/179 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTED------QFSVVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 359 RNDLVSATKTQITDAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNLGMLNPDSVS 414 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS 344 +IV ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 415 -LIVLVSDGDPT--VGELKLSKIQKNVKQNIQDNISLFSLGIGFDVDYDFLKRLSNENR 470 >gi|170758512|ref|YP_001787434.1| hypothetical protein CLK_1496 [Clostridium botulinum A3 str. Loch Maree] gi|169405501|gb|ACA53912.1| hypothetical protein CLK_1496 [Clostridium botulinum A3 str. Loch Maree] Length = 647 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 65/178 (36%), Gaps = 37/178 (20%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+++ V+D S SM + +KI+ A K++ L I D N ++S + K+ Sbjct: 436 RIEISFVIDNSGSM------NASKIEAARKALAVTLLS---IDDFNRYLKSNAEQLNQKV 486 Query: 231 EEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 E W S + R I L +T+ L+ N+I Sbjct: 487 EVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIIKLDATDGATDDASCLREISNRI 545 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +Q + KII +TDG + ++++ E + VYA I Sbjct: 546 TSIQESELKKGKQ----TKIIFEITDGASSFPGSAKEAIQ---ELLSKNVEVYAFQIG 596 >gi|260807878|ref|XP_002598735.1| hypothetical protein BRAFLDRAFT_95851 [Branchiostoma floridae] gi|229284009|gb|EEN54747.1| hypothetical protein BRAFLDRAFT_95851 [Branchiostoma floridae] Length = 475 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 33/167 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ V+D S S+ +I ++ I L+ + +Q G++ + + Sbjct: 78 IDIVFVIDGSWSIGPAV---FERIKGFVRDIVGCLDIGE--------IQVGVIHYDCLPK 126 Query: 232 EFFLLEWGVSHLQRKIK-----YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L G ++ ++ ++ G T + ++Y N I Sbjct: 127 LDIQL--GSYTMKTDLQDAILTEITFDGEVTRTGNAIQYMKNTI-----------PFRGY 173 Query: 287 YKKIIVFMTDGENLSTKED----QQSLYYCNEAKKRGAIVYAIGIRV 329 + V +TDG E +A+ G +Y+I Sbjct: 174 VPRAAVIVTDGRTQLDVEGHQFHDDFAESARDARDCGIELYSIAAGR 220 >gi|156358475|ref|XP_001624544.1| predicted protein [Nematostella vectensis] gi|156211331|gb|EDO32444.1| predicted protein [Nematostella vectensis] Length = 638 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 59/185 (31%), Gaps = 29/185 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+ ++D S S+ + + IN + V+ + + GL FS + Sbjct: 120 LDLGFLIDGSMSVGGEMN--------FKQIINFVWNVVQSFQVSAHFTRVGLAIFSLEAF 171 Query: 232 EFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + + ++Y G T GL A +F+ G Sbjct: 172 VIFNFHTYYDKNNILDALNNVQYPGTDGPGTYIGRGLHVAKEYLFEASG------RPRVP 225 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 + I+V + + + ++ G +Y +G+ + + A AS Sbjct: 226 HALIVV-------AAGRSLDDVITPSLSLRRMGVDIYTVGVGKFYTKLQMHAMASFPHSE 278 Query: 344 SFYLV 348 + Sbjct: 279 HVFGA 283 >gi|198437668|ref|XP_002125059.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 983 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 73/205 (35%), Gaps = 38/205 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 +I+LD SRSME+ F+S ID+A + I+ +LE ++ + + F + Sbjct: 207 FLILLDSSRSMENTFNSKPM-IDIARELIDILLETLRPND------KISAIGFRHEALRS 259 Query: 229 ----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + F E L+ ++ ++ G S+ + A+ F + Sbjct: 260 QGCFRNQLAFASETNKEKLRSFLRNITPMGESSYT-----VAFQSAFQLLEQDYIKYKNK 314 Query: 285 ANYKKIIVFM-TDGENLSTKE---DQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFLRAC 339 ++ +K ++ + +DG+ D S+ K +++ I Sbjct: 315 SDTEKYVILLISDGQPKEAYGRMQDVYSIIEQQNLKINNSVSIFSYAIGRN--------- 365 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKD 364 + + + + S + Sbjct: 366 ---GHVFAINEVSDVRETISTLNHA 387 >gi|123228966|emb|CAI21016.2| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H5 (ITIH5) [Danio rerio] Length = 906 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 70/204 (34%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+D S SM TK+ +++ ++ E++ + N VTFSN+I Sbjct: 254 VVFVIDTSASM------LGTKMKQTKQALFTIINELRPNDNFN------FVTFSNRIRVW 301 Query: 232 ------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + ++ I Y+ T+ G++ + D + + Sbjct: 302 QPGKLVPVTPIS--IRDAKKFI-YMISVTGGTDINGGIQTGSALLSDYLSSKDESHHHSV 358 Query: 286 NYKKIIVFMTDGENLSTKEDQQS-LYYCNEAKKRGAIVYAIGIRVIRSHEFLR-----AC 339 + I+F+TDG + + A + ++ IG+ + L C Sbjct: 359 SL---IIFLTDGRPTVGVLQSPTIISNTKTAVQEKFCLFTIGMGDDVDYRLLERMSLDNC 415 Query: 340 ASPNSFYLVENPHSMYDAF-SHIG 362 + + M F IG Sbjct: 416 GTMRRIPEDADASLMLKGFYDEIG 439 >gi|317483712|ref|ZP_07942659.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] gi|316925028|gb|EFV46167.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] Length = 1111 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 50/322 (15%), Positives = 104/322 (32%), Gaps = 42/322 (13%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV---RSTSLDIVVV 121 + N+ + ++ K G + I ++ + N+ I + ++ + Sbjct: 435 LHNDMDRDDASVSKDGLHITEITIGGKTYTSHDASTDISYNETTKISIDWQKGTISVTNT 494 Query: 122 PQNE-----GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 +N GY + + T + I + ++ Sbjct: 495 GKNSESIQFGYGVEDRHGATDSADITVNVTATTGAGSIGDDLLQGATTTENVAMSYNISF 554 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKI----E 231 VLD S SM S + A +++ +E++ I + + ++ +V FS+ + Sbjct: 555 VLDKSGSMGSSYS-------TAKEAVANYIEKLWDDIQNTDAIINIQVVKFSSSVGWGDN 607 Query: 232 EFFLLEWGVSH------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F L+ ++ L + K +TN L A F+ Q N Sbjct: 608 NTFTLDKSTTYKELQAFLSAHVTNNDKASGNTNYEDALLKA-ESWFNSQEENGFANR--- 663 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-----AIVYAIGIRVIRSHEF--LRA 338 + F++DGE + G V+AIGI +++ L Sbjct: 664 -----LYFISDGEPNRPYGKPVERAEAVYDRIVGDSAHPVDVHAIGILGNGANDLDVLNK 718 Query: 339 CASPNSFYLVENPHSMYDAFSH 360 + + + N +YDA + Sbjct: 719 FDNTDGADQIRNAGELYDAIAS 740 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 64/209 (30%), Gaps = 37/209 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDS------------------SITKIDMAIKSINAMLE 207 S + V+D S SM + +T I + LE Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ + LV F + L + L+ I L F T G++ Sbjct: 95 RLVSSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMALGMEE 154 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A N + + D + ++ ++F TDG+ + + + G + A+ Sbjct: 155 ALNILKNC----------DLSSRRTLIF-TDGQTFDESD---CRDLATQFAEAGIPITAL 200 Query: 326 GIRVIRSHEFLRACA--SPNSFYLVENPH 352 G+ + L + + + V Sbjct: 201 GVGEYN-EDLLLYLSDRTGGRVFNVVETQ 228 >gi|126723497|ref|NP_001075477.1| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Oryctolagus cuniculus] gi|75056157|sp|Q9GLY5|ITIH3_RABIT RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor gi|11041718|dbj|BAB17302.1| inter-alpha-trypsin inhibitor heavy chain3 [Oryctolagus cuniculus] Length = 903 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 70/169 (41%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 ++ V+DVS SM K++ ++ +LE+++ +N ++ S + T+ + + Sbjct: 285 VVFVIDVSGSMYG------RKLEQTKDALLKILEDMREEDHLNFILFSSDVTTWKEHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + G STN GL + + + +H E + I++ Sbjct: 339 ATPE--NLQEARAFVKSIQDQG-STNLNDGLLRGISMLNTARE--EHRVPERSTS--IVI 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG+ N ++ A +Y +G ++ FL + A Sbjct: 392 MLTDGDANSGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLESLA 440 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 59/152 (38%), Gaps = 33/152 (21%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS-------------SITKIDMAIKSINAMLEEVKLIP 213 + A + + V D S SME D + ++ID+ + M+ +++ I Sbjct: 197 EGQANVAISFVFDASGSMEFSLDGTEKVNPYSNNPLKNRSRIDILREKTKKMMADLQPIG 256 Query: 214 DVNNVVQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 +V+ LV F++ + F L+ G++ + I L+ +TN GL+Y Sbjct: 257 NVS----VNLVQFNSHASFVQQNFIELDKGLTSINSAIDNLNP-EHATNPGDGLRYGMVS 311 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + K +V +TDG S Sbjct: 312 LQSNAAQL-----------KYVVLLTDGVPNS 332 >gi|291223809|ref|XP_002731900.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 992 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 73/217 (33%), Gaps = 36/217 (16%) Query: 155 VMPITSSVKVNSQTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 +P+ +V +++ R+ +++VLD S SME ++ ++ +A + Sbjct: 298 NLPVDPTVTIDTTPRFRIVQSRERRLVLVLDRSGSMEG------VRLTKLRQAASAFIRN 351 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLS-KFGVSTNSTPGLKY 265 + G+V FS +E L G + I+ L G T+ G+ Sbjct: 352 -----TICEGSYLGIVEFSEFAQELAPLTLVNGSDSREGLIRRLPHSVGGWTSIGAGIMK 406 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + +I+ ++DG ++L + + G + I Sbjct: 407 GIEVLSTNGQNPEGG---------LIMAISDGGENRAPTLSEALQAVD---ESGVTIDTI 454 Query: 326 GIRVIRSHEFLRACASPN--SFYLV--ENPHSMYDAF 358 A SF+ ++ + DAF Sbjct: 455 AYSEQADENLASLAARTGGMSFFYSGDDDSTVLEDAF 491 >gi|223462563|gb|AAI50654.1| Von Willebrand factor A domain containing 3B [Homo sapiens] Length = 1294 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 562 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 80/234 (34%), Gaps = 55/234 (23%) Query: 172 LDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +++ DVS SM F+ + ++D + +++ +K P+ + G+V F Sbjct: 94 IGIVVAFDVSLSMRIRDFYIGNR-QVDRMTAAKRVLVDFIKGRPND----RIGIVAFGGA 148 Query: 230 IEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W ++++ R + + G + + Sbjct: 149 PYNPCPPTLDHDWLLNNMDRIQTGIMEDGTA--------------IGSGIAAAARRLDQL 194 Query: 286 NYK-KIIVFMTDGENLSTK-EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---------- 333 K K+I+ MTDG N S K Q + G ++AI I H Sbjct: 195 EVKSKVILLMTDGANNSGKLSPQDAARLAATL---GIRIHAISIGTPGMHPIYMPNGPPI 251 Query: 334 ---------EFLRACAS--PNSFYLVENPHSMYDAFSHIGK----DIVTKRIWY 372 E L+ A+ SF+ E+ ++ F + + +I ++I Sbjct: 252 NSGRQEFDPETLQEVANIGSGSFFRAEDLSTLERIFKTVDEMERTEIERRKIVE 305 >gi|126336625|ref|XP_001380258.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 819 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 53/363 (14%), Positives = 123/363 (33%), Gaps = 42/363 (11%) Query: 35 GMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM-- 92 G +IE + + + +D + +++ E ++ + K Sbjct: 103 GRVIETTEKRLEQFHVSVNVDPAANATF-ELVYEELLKRHLGKYELMLMIQPKQLVKQLQ 161 Query: 93 ---SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY----KIPLKFCTFI 145 + NDI + + + I+ K P K T + Sbjct: 162 VDIYIFEPQGISSLENDITFMTKKLEDALTKTQNKTEVHIAFKPSLAQQQKEPWKLNTVV 221 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARL-------------DMMIVLDVSRSMESFFDSSI 192 R+ V +T++ + + + +++ ++D S SM Sbjct: 222 DGKFIVRYDVDRVTTAGDIQIENGYFVHNFAPTQLPMVPKNIVFLIDKSGSMAG------ 275 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 KI ++ +L+++K N + SG VT K E L+ + + + Sbjct: 276 RKIKKTKAALIKILDDLKPEDHFNMITFSGHVT-RWKPELVLALDEHLKEAKTFLSNTPA 334 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 GV TN + A + + + ++ + +I+ +TDG++ + Q ++ Sbjct: 335 LGV-TNVNGAVLAAVSMLDESNKKKELPEGSVS----MIILLTDGDSTEGETKLQKIHEN 389 Query: 313 NEAKKRG-AIVYAIGIRVIRSHEFLRACASPN-----SFYLVENPH-SMYDAFSHIGKDI 365 +A RG ++ +G ++ FL A N + + + D + + + Sbjct: 390 VKAAIRGQYHLFCLGFGFDINYVFLERLALDNGGMARHIFEGLDAELQLQDFYQEVANPL 449 Query: 366 VTK 368 +T+ Sbjct: 450 LTQ 452 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 77/199 (38%), Gaps = 34/199 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M +VLDVS SM S+ +++ ++ L ++ + N G+V FS++ Sbjct: 309 MCLVLDVSGSMTSY-----DRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 358 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S ++R ++ L T+ G+K A+ + + Sbjct: 359 HELIQINSDIERNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 I+ ++DGE D + +E K G+IV+ I + + + + + + L Sbjct: 410 ILLLSDGE------DSTAKDCIDEVKDSGSIVHFIALG-PSADLAVTNMSILTGGNHKLA 462 Query: 349 ENPHS---MYDAFSHIGKD 364 + + DAF + + Sbjct: 463 TDEAQNNGLIDAFGALASE 481 >gi|115488386|ref|NP_001066680.1| Os12g0431700 [Oryza sativa Japonica Group] gi|77554879|gb|ABA97675.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113649187|dbj|BAF29699.1| Os12g0431700 [Oryza sativa Japonica Group] gi|125536450|gb|EAY82938.1| hypothetical protein OsI_38156 [Oryza sativa Indica Group] gi|125579179|gb|EAZ20325.1| hypothetical protein OsJ_35934 [Oryza sativa Japonica Group] Length = 524 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 64/205 (31%), Gaps = 30/205 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + ++ LD++ V+DVS SM KI+ K++ ++ Sbjct: 37 ALTADSVTAVVEIKATSSTAVREGLDLVAVVDVSGSMRGH------KIESVKKALQFVIM 90 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLK 264 ++ + + +VTF + + L + ++ + K T+ GL Sbjct: 91 KLTPVD------RLSIVTFESSAKRLTKLRAMTQDFRGELDGIVKSLIANGGTDIKAGLD 144 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + D I M+DG+ + E VY Sbjct: 145 LGLAVLADRVFTESRTAN--------IFLMSDGKLEGKTSGDPTQVNPGE-----VSVYT 191 Query: 325 IGIRVIRSHEFLRACA--SPNSFYL 347 G H+ L A SP Y Sbjct: 192 FGFGHGTDHQLLTDIAKNSPGGTYS 216 >gi|332361361|gb|EGJ39165.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1056] Length = 451 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 90/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A T+I + K GD++ KN + + N V Sbjct: 53 EAVSSPIEAAGTEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETA 112 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 113 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 172 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM ID+ +IN + + + GL TF Sbjct: 173 TKAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATF 222 Query: 227 SNKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQ 269 S+++ F + + K L+ G T TP A NQ Sbjct: 223 SDEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 280 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 281 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 335 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I + A+ +EN Sbjct: 336 IEG--IYKNFATEGRVLDIEN 354 >gi|304312981|ref|YP_003812579.1| hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] gi|301798714|emb|CBL46947.1| Hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] Length = 979 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 69/202 (34%), Gaps = 39/202 (19%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM +++D+S SM+ ++ + + +L K + G+ TF ++ Sbjct: 88 DMRVLIDISGSMKKTDPQNLR-----VPATKLLLNLAKPGS------RMGIWTFGQHVDR 136 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + K K + + S+ L A D Q D +++ +V Sbjct: 137 LVPLAT----VDAKWKQAAAREANRISSSSLYTAIGDALDA--AIQGDLKPDPAWERSVV 190 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKR--------------GAIVYAIGIRVIRSHEFLRA 338 ++DG S + +++ G V+A+ + EFL+ Sbjct: 191 LLSDG------MVDISKNPADNQREQQRIFQEVVPRLVAGGYKVHAVALSEQADIEFLKR 244 Query: 339 C--ASPNSFYLVENPHSMYDAF 358 A+ F + + + F Sbjct: 245 LAEATKGHFSIAHSADQLMHVF 266 >gi|297491587|ref|XP_002698991.1| PREDICTED: Anthrax toxin receptor-like [Bos taurus] gi|296472055|gb|DAA14170.1| Anthrax toxin receptor-like [Bos taurus] Length = 534 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 66/204 (32%), Gaps = 59/204 (28%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+ +LD+S S+ + + + + ++ + VK + N ++ +TFS Sbjct: 76 HSSFDLYFILDMSGSVNNNWMNIYSFVE----------DLVKKFENPN--LRMSFITFST 123 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + + + L + ++ ++ + + Sbjct: 124 EGHTVMQLTSDRNEVHDGLIRLQNIVPTGDTY--MQEGFKK------------------- 162 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++A++ GA VY +G++ H+ + SP + V Sbjct: 163 -----------------------ADKARQMGATVYCVGVKNFEEHQLIEIADSPYHVFAV 199 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWY 372 + A I + TK Sbjct: 200 NQG---FKALKDIVDPLETKSCIE 220 >gi|119918678|ref|XP_001251487.1| PREDICTED: Anthrax toxin receptor-like [Bos taurus] Length = 534 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 66/204 (32%), Gaps = 59/204 (28%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+ +LD+S S+ + + + + ++ + VK + N ++ +TFS Sbjct: 76 HSSFDLYFILDMSGSVNNNWMNIYSFVE----------DLVKKFENPN--LRMSFITFST 123 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L + + + L + ++ ++ + + Sbjct: 124 EGHTVMQLTSDRNEVHDGLIRLQNIVPTGDTY--MQEGFKK------------------- 162 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++A++ GA VY +G++ H+ + SP + V Sbjct: 163 -----------------------ADKARQMGATVYCVGVKNFEEHQLIEIADSPYHVFAV 199 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWY 372 + A I + TK Sbjct: 200 NQG---FKALKDIVDPLETKSCIE 220 >gi|77735553|ref|NP_001029472.1| calcium-activated chloride channel regulator 4 [Bos taurus] gi|74268230|gb|AAI03388.1| Chloride channel accessory 4 [Bos taurus] gi|296489202|gb|DAA31315.1| chloride channel accessory 4 [Bos taurus] Length = 933 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 73/198 (36%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIE 231 + +VLD S SM ++ +++ ++ L + V N G+V F + I+ Sbjct: 308 VCLVLDKSGSM-----AASNRLNRMNQAAQHFL-----LQTVENGSWVGMVHFDSSASIK 357 Query: 232 EFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S ++ ++ L T+ G++ + I + Sbjct: 358 SNLIQIISSSERRKLLESLPTAASGGTSICSGIESGFQAIRNADFQIDGSE--------- 408 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLVE 349 IV +TDGE D + + K+ GAI++ I + + + A+ + Sbjct: 409 IVLLTDGE------DSTAKSCIEKVKQSGAIIHFIALGPDAAQAVKEMSIATGGKYIYAS 462 Query: 350 NPHS---MYDAFSHIGKD 364 + + DAF+ + + Sbjct: 463 DEGQNNGLIDAFAALASE 480 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---NKI 230 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E + G + + T+ GL+ A+ I +++ Sbjct: 357 NELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|329894134|ref|ZP_08270119.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] gi|328923306|gb|EGG30626.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] Length = 460 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 76/201 (37%), Gaps = 45/201 (22%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ V+D S SM I A +++ L+ ++ N +V F+ Sbjct: 100 AAPPREVVFVIDTSGSMAG------QSIVAARRALVESLKSLRPEDAFN------IVEFN 147 Query: 228 NK-----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ + + + ++H R I+ L G + ++ A++ + Q Sbjct: 148 HEASALFAQPYPAENYALAHAIRFIRSLEADGGTE-----IEAAFDLTLALPTDAQKL-- 200 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + I+F+TDG + E + E + R ++ +GI +SP Sbjct: 201 ------RQIIFITDGSVSNESELLAKIN--RELEDR--RLFTVGIG-----------SSP 239 Query: 343 NSFYLVENPHSMYDAFSHIGK 363 N +++ E + FS+I Sbjct: 240 NRYFMEEAARAGRGTFSYIAN 260 >gi|159043452|ref|YP_001532246.1| hypothetical protein Dshi_0900 [Dinoroseobacter shibae DFL 12] gi|157911212|gb|ABV92645.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12] Length = 640 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 84/246 (34%), Gaps = 53/246 (21%) Query: 159 TSSVKVNSQTDARLDM--MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + D+ + VLD ++SME + I + + + ++++ ++ Sbjct: 205 PPTAPEKGEGAGPFDVGIVFVLDTTQSMEPY-------IAATQRVLQSTVQDI-AGTEIG 256 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHL------------------QRKIKYLSKFGVSTN 258 +V G++ F + + LE+ L ++ + G + + Sbjct: 257 ELVNFGVIGFRDNTDAVPELEYRTKVLMPLARRADQAPVIAAIGSATQVARANSPGFNED 316 Query: 259 STPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ-----SLYYC 312 S G++ A ++I +D G + + ++ +TD ++ Sbjct: 317 SLAGVEDAIDEIDWDQLGAGDPIDA------RYVILVTDAGPKDPRDPNARSEIGVRELQ 370 Query: 313 NEAKKRGAIVYAIGIRVIRSHEF-----------LRACASPNSFYLVENPHSMYDAFSHI 361 +A+ R +V + ++ L A ++ +E AF+ Sbjct: 371 TDAEGRRIVVMTLHLKTPVGGAANHEYAEARYRELSRFAGREYYFPIEGGSE--QAFAAT 428 Query: 362 GKDIVT 367 + +VT Sbjct: 429 AQRLVT 434 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 71/210 (33%), Gaps = 34/210 (16%) Query: 169 DARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++++V+DVS SM + D+ +K+D+A ++ L + + GL FS Sbjct: 409 RKPANVLMVIDVSGSMGAGVPDTGRSKLDLAKQAAINALPQFGPHD------KVGLWMFS 462 Query: 228 NKIEE----------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 K + + L+ ++ L+ G T AY + D Sbjct: 463 TKRDGEKDHLELAPLDTVDAAQRKTLRTRLDGLTPDG-GTGLYDTALAAYQHVRDRHSGE 521 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKE----DQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++F+TDG+N + A++ ++ I Sbjct: 522 AINA---------VIFLTDGKNEDNNSLSLENLLPDLRAESAEES-VRMFTIAYGQDADL 571 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHI 361 L+ + + + Y S+ F+ + Sbjct: 572 GVLKQISETTNAAAYDSRESGSIDQVFTAV 601 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 73/197 (37%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---NKI 230 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E + G + + T+ GL+ A+ I +++ Sbjct: 357 NELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|108761535|ref|YP_628568.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108465415|gb|ABF90600.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 445 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 75/209 (35%), Gaps = 29/209 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A + + + LD S SM + + A+ L+ + GL+TF Sbjct: 62 RAPVAINLALDRSASMRG------------VPLLAAVQAAQALVERASPRDYLGLLTFDA 109 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + E+ + + + ++ + G T ++ A + A Sbjct: 110 EPEQVLPMRAMDPNARAQLLKVLARLESGEGTALHEAVERA-------SESARRVLVPGA 162 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGIRVIRSHEFLRACASPN 343 + ++ +TDGE A+ + G +++A+G+ + L A SP+ Sbjct: 163 RPQ--VLMLTDGEPSVGPSQLAEFKT-QGARIAESGVMLHALGLGRHYLPDILEALTSPS 219 Query: 344 --SFYLVENPHSMYDAFSHIGKDIVTKRI 370 F V++P + A +G ++ + + Sbjct: 220 GTGFVHVDDPEGLPMAVGQLGAELFGEVV 248 >gi|226947182|ref|YP_002802255.1| ppkA-like protein [Azotobacter vinelandii DJ] gi|226722109|gb|ACO81280.1| ppkA-related protein [Azotobacter vinelandii DJ] Length = 656 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 76/200 (38%), Gaps = 32/200 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +VN+ ++ V+D + SM + ID ++I + + ++ + V+ G Sbjct: 217 EVNTLKGFSAAVVFVIDSTISMGPY-------IDRTREAIRKIYQRIE-QEKLLEQVRFG 268 Query: 223 LVTFSNKIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-AYNQIFD 272 LV + + I+E LE+ + + ++ +T S+ AY Sbjct: 269 LVAYRSNIKEVPALEYVSKLYVDPAKVQGGEDFLAKMAALKPATVSSSRFDEDAY----- 323 Query: 273 MQGMRQHCNTEDANY-KKIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 MR + + + IV ++ DG + + + EAK RG +Y++ Sbjct: 324 AGVMRALDGIDWTRFGARYIVLVSDAGALDGTDPLSATGLDAAQVRLEAKHRGVAIYSLH 383 Query: 327 IRVIRSHEFLRACASPNSFY 346 ++ + AS + Y Sbjct: 384 LKTPSG---AKNHASAEAQY 400 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 65/180 (36%), Gaps = 31/180 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++V+D+S SME ++++ A K + + + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATESGEQLSRLTAAKKVLRNFVTQ-------RQ 139 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + + L + G STN + Sbjct: 140 GDRFGLILFG---DAAFIQTPFTADQNVWLNLLDEAETGMAGQSTNLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ + + +I++ +TDG T + A +G +Y I + + Sbjct: 188 GLGIKVFEQSPSTSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPEN 245 >gi|91776250|ref|YP_546006.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91710237|gb|ABE50165.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] Length = 326 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/231 (15%), Positives = 72/231 (31%), Gaps = 32/231 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P+T + + +++V+D S SM+ F + T A + + Sbjct: 64 AGPVTDAQYIERIGQGA-QLVLVIDRSASMDEAFSGAETSGVAGESKAAAAERLITHFVN 122 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDM 273 + G+VTFSN L + I+ + TN GL A Sbjct: 123 ERSNDMFGMVTFSNSAMHALPLTDSREAILAAIRAAGGAALFQTNIGSGLTAA------- 175 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCNEA---------KKRGAI-V 322 + + + I+ ++DG + Q+ + ++ GA + Sbjct: 176 ---LAQFDKTPDSGSRAIILLSDGGGRMGANTQQKIRDWLERMNVTLYWIVLRQPGATSI 232 Query: 323 YAIGIRVIRSH---------EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + + +F + + Y E+P S+ A I + Sbjct: 233 FDTSYVPPEDNPLPPALELHDFFKTLRTGYQAYEAEDPTSLAAAIEDINRK 283 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 59/152 (38%), Gaps = 33/152 (21%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS-------------SITKIDMAIKSINAMLEEVKLIP 213 + A + + V D S SME D + ++ID+ + M+ +++ I Sbjct: 197 EGQANVAISFVFDASGSMEFSLDGTEKVNPYSNNPLKNRSRIDILREKTKKMMADLQPIG 256 Query: 214 DVNNVVQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 +V+ LV F++ + F L+ G++ + I L+ +TN GL+Y Sbjct: 257 NVS----VNLVQFNSHASFVQQNFIELDKGLTSINSAIDNLNP-EHATNPGDGLRYGMVS 311 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + K +V +TDG S Sbjct: 312 LQSNAAQL-----------KYVVLLTDGVPNS 332 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 14 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 67 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 68 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 121 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 122 -----HKKVIVLLTDGVPT 135 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 67/178 (37%), Gaps = 23/178 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ V+D S SM KI +++ +L ++ N LV F++ I + Sbjct: 252 SVVFVIDKSGSM------DGRKIVQTREALLKVLGDLNPEDQFN------LVVFNSMISQ 299 Query: 233 FFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + V ++ + + G TN + A + + D R+ + Sbjct: 300 WQPSLLKATQENVGSAKKFVLDIRASG-GTNINEAVLAAVH-LLDESNQRELLPENSVSM 357 Query: 288 KKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ +TDGE + + +++ + ++ +G S+ FL A NS Sbjct: 358 ---IILLTDGEPTVGETNPENIQQNIQRSLDGKYALFCLGFGFDVSYSFLEKMALDNS 412 >gi|148230547|ref|NP_001081234.1| complement factor B [Xenopus laevis] gi|602779|dbj|BAA06179.1| complement factor B [Xenopus laevis] Length = 747 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 77/244 (31%), Gaps = 35/244 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S V D +++ IVLD S+S+ + D A + +E++ Sbjct: 218 NVDTTNLEDRSDRSVRILKDGLMNIFIVLDTSKSVGQN------RFDEAKSASILFIEKM 271 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSK----FGVSTNSTP 261 ++++++K L + ++ TN+ Sbjct: 272 SNYDIKPRYC---IISYASKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY------- 311 L Y + + + R+ + +I+ MTDG+ N+ ++ Sbjct: 329 ALHAIYEHLIEQELAYEREGKKEDFMKIHNVILLMTDGKFNMGGDPREEMKLIKRFLDVG 388 Query: 312 --CNEAKKRGAIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSH--IGK 363 + ++ VY G+ ++ + ++N M + F I Sbjct: 389 IRKDNPREEYLDVYVFGLGSDIDQPEINDLASKKEKEVHTFHLQNVEKMKEFFELMIIED 448 Query: 364 DIVT 367 D+ Sbjct: 449 DVFD 452 >gi|226487568|emb|CAX74654.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 53/320 (16%), Positives = 102/320 (31%), Gaps = 57/320 (17%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRI-----KNTWNMSFRNELRD 100 MKT ++ ++N +K + ++F L Sbjct: 146 MKTDTAEFSHSEIIMTLPSVINPRYYPKEQKHSEEFDPFTELVLVNSVPYTITFEAHLWM 205 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQN----------EGYSISAISRYKIPLKFCTFIPWYTN 150 + D+ I S +LD P N + + + PL + + + Sbjct: 206 QSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDPLTSLVSLEYGDS 265 Query: 151 SRHIVMP--------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 ++ ++ + V S D R + + ++D S SME I A S+ Sbjct: 266 NQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEG------DNISYAKTSL 319 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEWGVSHLQRKIKY---LSKFGVST 257 L+ + VN Q + F + F ++ L + Y L+ T Sbjct: 320 LLFLKSL----PVNCRFQI--IGFGSNFAALFSEPTDYSEDSLNAAMNYQKDLNADMGGT 373 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAK 316 + LK A H + + K I+F+TDG+ + + ++A+ Sbjct: 374 EAYNALKSAL-----------HSSPSGEGWFKQIIFLTDGDVGNADEVIGLVRMNVDKAR 422 Query: 317 KRGAIVYAIGIRVIRSHEFL 336 V+ IG+ S + Sbjct: 423 -----VFTIGLGQGVSTALI 437 >gi|160878421|ref|YP_001557389.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160427087|gb|ABX40650.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 551 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 87/272 (31%), Gaps = 28/272 (10%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD-----IVVVPQNE 125 KN +F ++ + N + +D I + Sbjct: 88 STEEYNAVIEQGYQSTKNHPLSTFSADVDTASYSNIRRMLKEGRRVDTGAVRIEEMLNYF 147 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y + P T + + + + + K++ A +++ ++DVS S Sbjct: 148 NYDYK-LPEGDSPFGITTELSDCPWNPDTKLFLAGIQTEKIDFSKSAPSNLVFLIDVSGS 206 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+ + ++ + E + ++ V +G ++ + + Sbjct: 207 MMDE-----DKLPLVQRAFLLLTENLTEKDRISIVTYAG----NDTVVLSGAKGNQKEKI 257 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLST 302 Q I L G ST + G++ AY Q+ + N ++ TDG+ N+ Sbjct: 258 QNAITELEA-GGSTFGSKGIETAY-QLAMENYIEGGNNR--------VILATDGDLNVGV 307 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + E +K G + +G + Sbjct: 308 TSESELTNLIEEKRKSGVALSVLGFGTGNIKD 339 >gi|282601472|ref|ZP_05981787.2| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] gi|282569002|gb|EFB74537.1| von Willebrand factor type A domain protein [Subdoligranulum variabile DSM 15176] Length = 246 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 63/194 (32%), Gaps = 31/194 (15%) Query: 167 QTDARLDMMIVLDVSRSMES--------------------FFDSSITKIDMAIKSINAML 206 R+ + + LD S SM + +++D K I Sbjct: 7 NPTPRVPICLCLDTSGSMGAVQGDCVDTGKTLFEDGRQWNLVTGGTSRLDELQKGIKLFY 66 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 V+ + +VTF ++ + Q + L+ G + + Sbjct: 67 NSVREDEVARYAAEICIVTFDSEAKCRMDFAN--LDRQSDLPELTATGDT-----AMGEG 119 Query: 267 YNQIFDMQG--MRQHCNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKK-RGAIV 322 N D+ R++ + ++ +V MTDG + + ++++ C + + + V Sbjct: 120 VNLALDLLESRKREYQDKGVDYFQPWLVLMTDGVPNGNEGEFERAVQRCRDMEAQKKLTV 179 Query: 323 YAIGIRVIRSHEFL 336 + I I L Sbjct: 180 FPIAIGDEGDQTAL 193 >gi|224051388|ref|XP_002199708.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 417 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 67/181 (37%), Gaps = 22/181 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P + K D + D+ ++D S ++ + ++ + + + + + Sbjct: 16 PKKTLDKKAGNKDCKADIAFLIDGS------YNIGQRRFNLQKNFVGKVAVMLGIGTEGP 69 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +V G+V S + F L+ + IK L G ++N+ LKY + F ++ Sbjct: 70 HV---GVVQASEHPKIEFYLKNFTATKEVLFAIKELGFRGGNSNTGKALKYTAQKFFSLE 126 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N KIIV DG E+ + A++ G V+ + + + E Sbjct: 127 ------NGARKGIPKIIVVFLDGWPSDDLEEAGIV-----AREFGVNVFIVSVAKPTTEE 175 Query: 335 F 335 Sbjct: 176 L 176 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 65/198 (32%), Gaps = 24/198 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + ++ ++ + + ++ + V T++ + Sbjct: 232 SVNIGFLIDGSSSIGEI------NFRLMLEFVSNVAKAFEISDIGSKVAAVQF-TYNQRK 284 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + I + T + + + +F + N Sbjct: 285 EFGFTDHVTKEKVLSAIHNIQYMSGGTATGDAISFTTRTVFGPVKDGPNKN--------F 336 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYL 347 ++ +TDG+ AKK G V+++G+ + L+ AS + + Sbjct: 337 LIVLTDGQ-----SYDDVTGPAAAAKKAGITVFSVGVAWAPLDD-LKEMASEPRESHTFF 390 Query: 348 VENPHSMYDAFSHIGKDI 365 + I + I Sbjct: 391 TREFTGLEQMVPDIIRGI 408 >gi|149437043|ref|XP_001515962.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 948 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 71/197 (36%), Gaps = 30/197 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+DVS SM K+ ++++ +L++++ Q ++ F++ + Sbjct: 314 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAED------QFSVIDFNHNVRSW 361 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ I+ + G TN L A + + + + + Sbjct: 362 KDNLVPATDLMTTDAKKYIEKIQPNG-GTNINEALLRAIFILREASNLGM-LDPNSVSL- 418 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-- 344 I+ ++DG+ K + ++++GI ++FL + N Sbjct: 419 --IILVSDGDPTVGELKPTVIQKNVKKNMRDN-ISLFSLGIGFDVDYDFLERLSRENHGM 475 Query: 345 ---FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 476 AQRIYGNQDTSSQLKQF 492 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 76/234 (32%), Gaps = 35/234 (14%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 H++ KV++ ++ +DVS SM ++++ +S+ ++E+ Sbjct: 123 RQGYHVLHVGLQGQKVSAAERLPAHLVFTIDVSGSMN-----MENRLELVKRSLAMLVEK 177 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + + VV + + S + I L G STN GL+ A Sbjct: 178 LDSRDTLAIVV------YGDTARTVLEPTRIMDRSRILEAINALHPEG-STNVQAGLQVA 230 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 Y +R+ + ++ +DG N + ++G + + Sbjct: 231 Y--AIAASQVREGATSR-------VILCSDGVANNGITQADSIFQSVKAYAQQGVRLTTV 281 Query: 326 GIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAF--------SHIGKDIVTK 368 G + ++ L S + V+ F I +D+ + Sbjct: 282 GFGMGNYNDELMERLSHVGDGQYAYVDALPEARRIFIEQFTGTLQLIARDVKVQ 335 >gi|327474184|gb|EGF19594.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK408] Length = 450 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 53/321 (16%), Positives = 90/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A ++I + K GD++ KN + + N V Sbjct: 52 EAVRSPIEAAGSEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETS 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 112 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM ID+ +IN + + + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATF 221 Query: 227 SNKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQ 269 S+++ F + + K L+ G T TP A NQ Sbjct: 222 SDEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 279 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 280 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 334 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I + A+ +EN Sbjct: 335 IEG--IYKNFATEGRVLDIEN 353 >gi|254443409|ref|ZP_05056885.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257717|gb|EDY82025.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 257 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 70/211 (33%), Gaps = 30/211 (14%) Query: 157 PITSSVKVNSQTDARLDMMIVL-DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P V + D L V+ D S SM + KI+ +++ + + Sbjct: 55 PAIQEANVLMEADPVLTNYYVIFDASGSMNELVAQQLPKIEAGKQALVTFANNLPEDAN- 113 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDM 273 GL+TF + + E L G + Q I +S+ T + Y + + Sbjct: 114 -----LGLLTF-DPVRELLPL--GRGNRQAFIGSVSQIRAKGRTPLVESIVTGYRVLTEQ 165 Query: 274 QGMRQHCNTEDANYKKII-VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + Y + + V +TDG + ++ ++ V IG V Sbjct: 166 AQRQSG-------YGRYVLVIVTDGASSDGNPAGVAMEV---TRESPIEVQTIGFGVAD- 214 Query: 333 HEFLRACASPN--SFYLVENPHSMYDAFSHI 361 A P + +P ++ DA + + Sbjct: 215 ----HALNLPGVTQYVTASSPKALIDALNQV 241 >gi|116625432|ref|YP_827588.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228594|gb|ABJ87303.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 75/205 (36%), Gaps = 31/205 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D L +V D SRSM+ ++ A +S+ L + L+ FS Sbjct: 88 ADTPLSTGVVFDSSRSMK-------NRLQDARQSVEQFLRTGSTGDEYF------LIRFS 134 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ + + R++ + G + + D + + + + N+ Sbjct: 135 DEAKMLAPFTADTEEIARQLGSIEAKG------------WTALNDSIVLAANQSRKARNH 182 Query: 288 KKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSF 345 +K ++ ++D G+N S + + ++ VYA+ I RS + C + Sbjct: 183 RKALLVISDGGDNNSRYTVGEMISI---LREADLRVYAVSI-FERSQLLEKICEETGGRA 238 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRI 370 V + D + +++ ++ I Sbjct: 239 LWVRKLGDLPDIMERLSQEMRSEYI 263 >gi|327403933|ref|YP_004344771.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319441|gb|AEA43933.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 375 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/147 (23%), Positives = 57/147 (38%), Gaps = 25/147 (17%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P+ S KVN LD++I LD+S SM + ++++ A ++I +L ++K Sbjct: 114 PVAGSRKVNGSKRV-LDLVICLDISNSMNTQDMGGNDVSRLTAAKQAIGELLNQLKGE-- 170 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIF 271 + +V F+N L + I + TN L+ A Q Sbjct: 171 -----RIAVVIFANDAYTQLPLTMDYGAAKLFIPDIETSMISDQGTNVGRALEIAQEQF- 224 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGE 298 +D K I+ +TDGE Sbjct: 225 -----------KDTESGKAILVITDGE 240 >gi|325831996|ref|ZP_08165093.1| von Willebrand factor type A domain protein [Eggerthella sp. HGA1] gi|325486317|gb|EGC88769.1| von Willebrand factor type A domain protein [Eggerthella sp. HGA1] Length = 548 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 77/192 (40%), Gaps = 25/192 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW ++ +VM + ++ +++ ++DVS SM+ K+ + S A+ Sbjct: 162 PWNDQTKLLVMGFATEKDGDASPTGA-NLVFLIDVSGSMD-----DPDKLPLVKDSFAAL 215 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 +E + + +VT+++ ++ + + R + L G STN GL Sbjct: 216 VEGL------TERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVDSLVAEG-STNGEAGL 268 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 + AY ++ + + N +V +DG+ N+ + + + + ++ G + Sbjct: 269 EQAY-RLAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVEQKRETGVYL 319 Query: 323 YAIGIRVIRSHE 334 +G + Sbjct: 320 SVLGFGSGNYKD 331 >gi|326680240|ref|XP_001920592.3| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4 [Danio rerio] Length = 1087 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 63/165 (38%), Gaps = 19/165 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++IV+DVS SM+ K+ +A +IN +L+ + VN + + V++ Sbjct: 244 ATSPKDLVIVVDVSGSMKGL------KMTIAKHTINTILDTLGENDFVNVIAYTDYVSYV 297 Query: 228 NKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L++ H + + L G +K ++ + D + Q Sbjct: 298 EPCFKGTLVQADLDNREHFKLLVDELHVKGE-AKVKKAMKESFRILADARANGQGSLCNQ 356 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + ++ V+ I Sbjct: 357 A-----IMLITDGAMEDFQSVFEEFNWPDK----KVRVFTYLIGR 392 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 73/227 (32%), Gaps = 50/227 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL---------- 223 ++ VLD S SM T+I+ +++N++L++++ D+ +VV+ Sbjct: 240 VIFVLDTSGSMAG------TRIEQTKQAMNSILDQLRKDEDIFSVVEFSSGVTEWDLRKP 293 Query: 224 ------VTFSNKIEEFFL-----LEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYN 268 F++ EE + S ++ I S A + Sbjct: 294 YKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYDDILAYPVTEQSVKRAKEFVAAMD 353 Query: 269 --QIFDMQGMRQHCNTEDANYKK------IIVFMTDGENLSTKEDQQSLYYCNEAK---- 316 ++ + + II+F+TDGE ++ D + + Sbjct: 354 VTSSTNINDALLLALKNSQSVQSRVRLTPIIIFLTDGEPTASVTD--TTEILKNVRKGNS 411 Query: 317 KRGAIVYAIGIRVIRSHEFLRACASPN-----SFYLVENPHSMYDAF 358 ++ + ++FL +S N Y + F Sbjct: 412 DDVVSIFCLAFGTGTDYQFLTKISSQNRGFARKIYEAADATLQLKGF 458 >gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Ailuropoda melanoleuca] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 76/197 (38%), Gaps = 30/197 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q ++ F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAED------QFSVIDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + ++ I+ + G TN L A IF + + + Sbjct: 359 RNDLVSATKTQIVDAKKYIEKIQPSG-GTNINEALLRA---IFILNEANNLGMLDPESVS 414 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--- 343 +I+ ++DG+ K + + ++++GI ++FL+ ++ N Sbjct: 415 -LIILVSDGDPTVGELKLSKIQKNVKQNIRDN-IALFSLGIGFDVDYDFLKRLSNENRGI 472 Query: 344 --SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 473 AQRIYGNQDTSSQLKKF 489 >gi|295394688|ref|ZP_06804906.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] gi|294972458|gb|EFG48315.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] Length = 538 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 61/203 (30%), Gaps = 17/203 (8%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-----Q 220 + M +D S SM +++ ++ + + + Sbjct: 342 ANGRKPATMFFQIDTSGSMRG------ERLEQLKTALGILSGTSAKNDTERFLAIQPREK 395 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 LV FS++++ + K++ + T + G Y+ + Sbjct: 396 LKLVEFSHEVKSTDGYRLTDNGSADKVRKDLDTKIQTLTAEGGTAIYSTLQTTLE-SAKK 454 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFLRAC 339 D ++VF TDG N + + + + + V+A+ S E Sbjct: 455 EKSDDKITSVVVF-TDGMNEHGISFRAFKDWYSNNQDVQDIPVFAVSFGNADSDELQELV 513 Query: 340 A-SPNSFYLVENPHSMYDAFSHI 361 + + + + AF I Sbjct: 514 SLTGGRVFDGN--ADLTAAFKDI 534 >gi|149197907|ref|ZP_01874956.1| BatB [Lentisphaera araneosa HTCC2155] gi|149139128|gb|EDM27532.1| BatB [Lentisphaera araneosa HTCC2155] Length = 718 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 68/194 (35%), Gaps = 34/194 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKI 230 ++ +LD+S SM + D +++ + I + ++ + GLV F S I Sbjct: 88 SVVFILDISNSMRAE-DVYPNRLEKSKNLIAECVSSLEE-------HRVGLVVFAGSASI 139 Query: 231 EEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L++ + + Y S T L A +++F D K Sbjct: 140 KCPLTLDYDFFLKMLDTVNYDSVAHGGTRIEDALMKACDKLFS----------GDTQQHK 189 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNE-AK---------KRGAIVYAI---GIRVIRSHEFL 336 I+ ++DG + D+ + A+ K GA + I G + E L Sbjct: 190 DIILISDGGDQGELLDKAIETVNEKQARLMLIGMGDEKNGAPIPTIDGNGYMMYEGREVL 249 Query: 337 RACASPNSFYLVEN 350 S YL E Sbjct: 250 TKLESETMLYLSEQ 263 >gi|156393752|ref|XP_001636491.1| predicted protein [Nematostella vectensis] gi|156223595|gb|EDO44428.1| predicted protein [Nematostella vectensis] Length = 419 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 57/171 (33%), Gaps = 20/171 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ S++ + K I +L + + D + ++TF + + Sbjct: 45 DLLFLLDTSGSL------SLSNFNTEKKFIRNLLNVIAVGFDA---TRVEIITFGSDVNR 95 Query: 233 FFLL------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + K + TN + AY + N Sbjct: 96 RVPFISEAHEKDTKCTFNEKFANVVHEWGMTNMRGAFEKAYEVC-----KGTWSGKKRLN 150 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K ++ +TDG ++ + + + G + A G+ S L+ Sbjct: 151 IKTTVILITDGHWNWPWQNPDPVPKAQQLIREGVEILAFGVGYGISLSNLQ 201 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 51/167 (30%), Gaps = 27/167 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + V DVS SM ++ S+ ++ + GLV++ Sbjct: 372 EKKKPICAVFVADVSGSM------MGEPLNNLKDSL------LRGQYYIGEDNMIGLVSY 419 Query: 227 SNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 SN + + + + L G T + G+ A I Q Sbjct: 420 SNDVNIDLPIAKFDLNQRASFAGAVNDLQA-GGGTATFDGIAVAMKMI-------QEQRA 471 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 D N + +I ++DGE G +Y IG Sbjct: 472 ADPNIRPVIFVLSDGETNKGHPLNDIKGIVE---DTGIPIYTIGYNA 515 >gi|12850100|dbj|BAB28591.1| unnamed protein product [Mus musculus] Length = 431 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 52/141 (36%), Gaps = 13/141 (9%) Query: 234 FLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L +++ ++ L K G T G + A QI+ + +I Sbjct: 1 MKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRTAS-------VI 53 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 + +TDGE L S N ++ GAIVY +G++ + R S + + V + Sbjct: 54 IALTDGE-LHEDLFFYSEREANRSRDLGAIVYCVGVKDFNETQLARIADSKDHVFPVNDG 112 Query: 352 HSMYDAFSHIGKDIVTKRIWY 372 + A I I+ K Sbjct: 113 ---FQALQGIIHSILKKSCIE 130 >gi|295840348|ref|ZP_06827281.1| secreted protein [Streptomyces sp. SPB74] gi|295827934|gb|EFG65721.1| secreted protein [Streptomyces sp. SPB74] Length = 418 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 76/221 (34%), Gaps = 40/221 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A + +VLDVS SM + +++ A +S N +L+ V DV +++ + Sbjct: 30 KAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAVP--EDVELGIRTLGADYPG 87 Query: 229 KIEEFFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + E + + + + LS G T P L A + + Sbjct: 88 EDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGW-TPIGPALLGAAEDLRGGEAA--- 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY--AIGIRV-IRSH 333 K IV +TDGE D C A++ +G + +G+ ++ Sbjct: 144 ---------KRIVLITDGE------DTCRRDPCEVAREIAAKGVHLVVDTLGLVPDAKTR 188 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + L C A+ ++ V + + + + Sbjct: 189 DQLS-CIAEATGGTYTTVRHTKDLSGRVKQLVHRAADPVVT 228 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 59/171 (34%), Gaps = 18/171 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +D++ +LDVS SM D S ++ A I + + ++ + Sbjct: 80 GSRPVERKGVDVLFMLDVSNSM-LVADVSPDRLTRAKSGILRISKGLRDG-------RQA 131 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ F+ + + + + +S T L A + Sbjct: 132 LLLFAGSPLVQCPMTTDHAAFEALLGMVSTELVSDQGTAFDSALNLAMRLF---ERTEPP 188 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + ++ +K+IV ++DGEN S + K+ G V+ I + Sbjct: 189 GDVKEVQGEKVIVLLSDGENHSGNF----RAVADALKQSGVSVFTIVLGKP 235 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 339 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A N + G Sbjct: 340 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNG------ 393 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 394 -----HKKVIVLLTDGVPT 407 >gi|224029935|gb|ACN34043.1| unknown [Zea mays] Length = 598 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 90/216 (41%), Gaps = 31/216 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAMLEEVKLIPD 214 + + + A +D++ ++++++SM S +++D+ ++ ++ ++ Sbjct: 33 VRVEAPSSMKNHAPIDLVTLININQSMSWPAASQTEIPSRLDLLKNAMKFIIRQLGDDD- 91 Query: 215 VNNVVQSGLVTFSNKI---EEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +V F++K+ +LE G +++K+ L G T P L++A Sbjct: 92 -----RLAIVAFNDKVIKENTTGILEISGSGRMAIEKKVDGLVAMG-DTAFKPSLEHAV- 144 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIG 326 ++ D + ++ IV ++DG + +K +S+ + + R V+ G Sbjct: 145 KLLDDRADKKRAG--------FIVLISDGLDGQSKWGDESITPTDPIRGLLRKYPVHTFG 196 Query: 327 IRVIRSHEFLRACA--SPNSFYLV--ENPHSMYDAF 358 + + L A S + + +N + +AF Sbjct: 197 LGKAHDPKALHYIADISYGIYSSIVTDNLDKIIEAF 232 >gi|182412149|ref|YP_001817215.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177839363|gb|ACB73615.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 859 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 23/190 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + N I M + ++ L++ ++LD S SME T ++ Sbjct: 485 QAAHPLLQQRNLVRIAMKVPAA---GRGAGQPLNLTVLLDTSGSME------RTDRATSV 535 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 ++ +L + D + L+ F+ + LL ++ Q + L +T Sbjct: 536 RAALGVLASLLTPDD-----RVTLIGFAR---QPRLLAESLAGDQA--RQLVDLASTTPF 585 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKR 318 T G + R+H N N IV +TDG D L +++ Sbjct: 586 TGGTNLEAALSLAGELARRHHNAAAQNR---IVLITDGAANLGNADPAQLATRIETLRQQ 642 Query: 319 GAIVYAIGIR 328 G A G+ Sbjct: 643 GIAFDACGVG 652 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 63/470 (13%), Positives = 131/470 (27%), Gaps = 139/470 (29%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N +G + +L A+ + ++G+ ++ KT L + D + + N Sbjct: 16 SNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIAAN 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++ + + +F F V + L+IV Q ++ A Sbjct: 76 AQQSDVMTAGIKAGEYQALKAFNVNASKVPFA-----TVSLSQLEIVRSGQ----TLDAT 126 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-------- 184 Y ++ + ++ + + +S + LD +++DVS SM Sbjct: 127 VSYTATVQSTFGRLFGLSATTLTNRVNASADIA----GYLDFYLMVDVSGSMGLPTTDSD 182 Query: 185 ---------ESFFDSSIT------------------KIDMAIKSINAMLEEVKLIPDVNN 217 E + + D ++ A+L+ P V N Sbjct: 183 AALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK-PIVPN 241 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G+ F N++ L + L+ + + A+ + D + Sbjct: 242 QYRIGIYPFINRLATLAPLSDTTT-------SLASLKTTADCGKAWPLAFTNLLDTGSTQ 294 Query: 278 QHCN-------------------------------TEDANYKKIIVFMTDGENLSTKE-- 304 N + AN K + +TDG S Sbjct: 295 LFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSS 354 Query: 305 --DQQSL-------------------------YYCNEAKKRG--AIVYAIGIRVIRSH-- 333 D+++ C + KK G + I ++S+ Sbjct: 355 WKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTN 414 Query: 334 -------------------EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + LR CASP F+ + + + + Sbjct: 415 EGTIVWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQ 464 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 78/211 (36%), Gaps = 38/211 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM + +++ ++ L ++ + +G+VTF + Sbjct: 307 LILVLDKSGSM-----AGEDRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + QR I L G T+ GL+ A+ I + H + Sbjct: 357 SALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVI--KKKFSTHGSE-------- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLV 348 IV +TDGE ++ C +E K+ GAI++ + + + + Sbjct: 407 IVLLTDGE-------DSTISSCFDEVKQSGAIIHTVALGPSADPGLEKLAEMTGGMKTSA 459 Query: 349 ENPHS---MYDAFSHIGK---DIVTKRIWYD 373 + + DAFS + I + I + Sbjct: 460 TDNAQNNGLIDAFSALSSGNGAITQRSIQLE 490 >gi|104782921|ref|YP_609419.1| RTX toxin [Pseudomonas entomophila L48] gi|95111908|emb|CAK16632.1| putative RTX toxin [Pseudomonas entomophila L48] Length = 2350 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 77/204 (37%), Gaps = 19/204 (9%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S+V+ + + ++++V+DVS SM S +T++++A ++IN +L++ + D+ Sbjct: 1550 ESAVRSITPGEVDSNILLVVDVSSSMNSGSGVPGLTRLELAKQAINTLLDKYDDMGDIK- 1608 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 VQ VTFS + +S + I L+ G + + K + F G Sbjct: 1609 -VQI--VTFSTGATMQTPVWVSISEAKSLIAGLTAGGSTYYDSAATKA--QEAFVSAGKL 1663 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIR---VIRS 332 + + F +DGE + G +A+G+ + Sbjct: 1664 VGA-------QNVSYFFSDGEPSGGHSITAVRETTWETFLDDNGIKSFAVGMGSGVNAGN 1716 Query: 333 HEFLRACASPNSFYLVENPHSMYD 356 + L S ++ + Sbjct: 1717 LDPLAYDGSSHTDTNAVVVTDLNQ 1740 >gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDD------QFSVVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNMGLLNPDSVS 414 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 +I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 415 -LIILVSDGDPT--VGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRG 471 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|3024068|sp|Q61703|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus] gi|122889675|emb|CAM13914.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 311 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDD------QFSVVDFNHNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNMGLLNPDSVS 414 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 +I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 415 -LIILVSDGDPT--VGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRG 471 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|313242832|emb|CBY39591.1| unnamed protein product [Oikopleura dioica] Length = 674 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 66/202 (32%), Gaps = 24/202 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ V ++ S+ I ++I + + + ++ ++ F++ Sbjct: 396 GPRDVLFV----AHYTTYMGSTFADISAFYENIISTIN----VEPSDSSIRFAFSFFNHA 447 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDAN 286 EFF +W + + + N LK A + + G + +T Sbjct: 448 YIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIGRALKGAADTMTPAFGKGRRIDTVGT- 506 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---N 343 +V +T + + ++ K++ V +G+ + L AS Sbjct: 507 ----VVLLT-----NAASTDEVNEMADQLKEKVDRVIVVGLGYAFGQDELAGIASSPTKE 557 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 + Y E + I +I Sbjct: 558 NLYTAEESSDLAGLVRTIADEI 579 >gi|126737457|ref|ZP_01753192.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] gi|126722042|gb|EBA18745.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] Length = 479 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 73/202 (36%), Gaps = 37/202 (18%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK------ 229 VLD S SM + KI +A + + +L V L P+ GL+ + ++ Sbjct: 28 FVLDASGSM-WGQIEGVAKITIAQQVLQKLL--VDLSPNQE----VGLMAYGHRQKGDCS 80 Query: 230 -IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 IE+ G + + + ++ G + S ++ A + + Sbjct: 81 DIEQLIAPAAGTREAISKAVDAITPKGKTPLSAAVIQAA-EGLHLSEE------------ 127 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIR--SHEFLRACA--S 341 K ++ ++DGE + E + G ++AIG + + L+ A + Sbjct: 128 KATVILISDGE---ETCGRDPCAIGAELEAAGVDFTLHAIGFGIADDAARAQLQCLAENT 184 Query: 342 PNSFYLVENPHSMYDAFSHIGK 363 + + + A SH+ K Sbjct: 185 GGVYLDAKGAEGLSAALSHVTK 206 >gi|229490509|ref|ZP_04384348.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229322588|gb|EEN88370.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 684 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 74/220 (33%), Gaps = 39/220 (17%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ + + A +++V+D+S SM + K+ A +S++ ++ ++ Sbjct: 29 TPTTPPTRAVSAAPAALLMVMDLSGSMNDNDANGKNKLTGAKQSLSRIVG-----DTASS 83 Query: 218 VVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSK------FGVSTNSTPGLKYAYN 268 GL T+ + + L +++ L T + P L+ + + Sbjct: 84 STPLGLWTYPTAGSNCDPGSFLAGADGGVRKDTDTLMAQVSGLKADGGTPTGPALRASVD 143 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-----GAIVY 323 + +V ++DGE+ + C+ AK+ V Sbjct: 144 SLKANGITTA-----------TVVLISDGESNCGQA------PCDTAKQIVAEGFDVTVE 186 Query: 324 AIGIRV-IRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 A+G ++ + L AS + + + M Sbjct: 187 ALGFQLSGQGRTELECIASTTGGRYSDIADVDEMQKRLKE 226 >gi|156097055|ref|XP_001614561.1| CTRP adhesive protein (invasive stage) [Plasmodium vivax SaI-1] gi|148803435|gb|EDL44834.1| CTRP adhesive protein (invasive stage), putative [Plasmodium vivax] Length = 2061 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 74/210 (35%), Gaps = 27/210 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFSNKIE 231 D+ ++LD S S+ ++ + E+V ++ + + G++ F+ K++ Sbjct: 333 DLTLILDESGSIT---------LNKWKIDVVPFAEKVVSNLNISKDKIHVGIMRFAIKVK 383 Query: 232 EF----FLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E + S L +K L G T L++ + +H N Sbjct: 384 EDVSYGQETRYDKSALINVVKELRDKYGSGQGTRLVDALEH------SLTNFTRHPN-NR 436 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N K+ + TDG EN D +++ + +K + +G+ Sbjct: 437 PNAPKVTILFTDGNENYRRPSDVRNIGL--KYRKENVRLIVVGVYKATIKSLKMLAGCGE 494 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + +D + I + I K D Sbjct: 495 NEHCPQVIKCDWDQLTSITEVITDKICDID 524 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 67/209 (32%), Gaps = 16/209 (7%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN-VVQSGLVTFS 227 + D+ ++LD S S+ K + +E++ +V+ V G++ F+ Sbjct: 835 QSMYDLTLILDESASIGH---------SNWKKQVYPFVEKIVSNLEVSESKVHVGIMLFA 885 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + +F S+ + + T Y + + A+ Sbjct: 886 KHMRDFVRFSEKESYEKDSLMRKVPELKGTYKAGSHTYIVESLEYGLQHYTKGASSRADV 945 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRAC-ASPNS 344 K+ + TDG N S D+ + KK + +G+ L C + Sbjct: 946 PKVTILFTDG-NNSKSGDEILSNVNSLYKKENVKLLVVGVGAASMPKLRLLGGCHKTEGD 1004 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 +D+ I + +V K D Sbjct: 1005 CPFAIKTE--WDSLKDISQGMVDKICNTD 1031 >gi|110681632|dbj|BAE98271.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium vivax] Length = 2106 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 74/210 (35%), Gaps = 27/210 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFSNKIE 231 D+ ++LD S S+ ++ + E+V ++ + + G++ F+ K++ Sbjct: 333 DLTLILDESGSIT---------LNKWKIDVVPFAEKVVSNLNISKDKIHVGIMRFAIKVK 383 Query: 232 EF----FLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E + S L +K L G T L++ + +H N Sbjct: 384 EDVSYGQETRYDKSALINVVKELRDKYGSGQGTRLVDALEH------SLTNFTRHPN-NR 436 Query: 285 ANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 N K+ + TDG EN D +++ + +K + +G+ Sbjct: 437 PNAPKVTILFTDGNENYRRPSDVRNIGL--KYRKENVRLIVVGVYKATIKSLKMLAGCGE 494 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + +D + I + I K D Sbjct: 495 NEHCPQVIKCDWDQLTSITEVITDKICDID 524 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 68/209 (32%), Gaps = 16/209 (7%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN-VVQSGLVTFS 227 + D+ ++LD S S T K + +E++ +V+ V G++ F+ Sbjct: 835 QSMYDLTLILDESAS---------TGHSNWKKQVYPFVEKIVSNLEVSESKVHVGIMLFA 885 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + +F S+ + + T Y + + A+ Sbjct: 886 KHMRDFVRFSEKESYEKDSLMRKVPELKGTYKAGSHTYIVESLEYGLQHYTKGASSRADV 945 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRAC-ASPNS 344 K+ + TDG N S D+ + KK + IG+ L C + Sbjct: 946 PKVTILFTDG-NNSKSGDEILSNVNSLYKKENVKLLVIGVGAASMPKLRLLGGCHKTEGD 1004 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 V +D+ I + +V K D Sbjct: 1005 CPFVIKTE--WDSLKDISQGMVDKICNTD 1031 >gi|306840900|ref|ZP_07473644.1| norD protein [Brucella sp. BO2] gi|306289103|gb|EFM60361.1| norD protein [Brucella sp. BO2] Length = 633 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T ++YA ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRYATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|157412073|ref|YP_001481413.1| TerY1 [Escherichia coli APEC O1] gi|99867098|gb|ABF67743.1| TerY1 [Escherichia coli APEC O1] Length = 239 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 30 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSA 83 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L ++ L+ G +T+ L + I D + Sbjct: 84 RQAVPL---TDLLSFQMPALTASG-TTSLGEALSLTASSIAKEVQKTTADTKGDWRP--L 137 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 138 VFLMTDGSPNDD 149 >gi|302405156|ref|XP_003000415.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] gi|261361072|gb|EEY23500.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] Length = 662 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 74/226 (32%), Gaps = 38/226 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI-----------TKIDMAIKSIN 203 P + A D+++V+DVS SM+ + + + +D+ + Sbjct: 74 TTPREALQSGKRIPRAPCDIVLVIDVSGSMDDAAPAPVIPGQKDENTGLSILDLTKHAAR 133 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNS 259 +LE + + G+V F+ + L + KI+ L + Sbjct: 134 TILETLDERD------RLGIVAFTTNAKVILSLVEMNPDNKVSAKDKIENLQPLNGTNM- 186 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 ++ I + + C++ ++ +TDG L Y + + G Sbjct: 187 -------WHGITEGIKLFSDCDSSSGRVP-AMMVLTDG---LPNSGCPRLGYIPKLRDMG 235 Query: 320 ---AIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 A ++ G L++ A ++ + + + F H Sbjct: 236 QLPATIHTFGFGYHIRSGLLKSIAEIGGGNYAFIPDAGMIGTVFVH 281 >gi|257790817|ref|YP_003181423.1| von Willebrand factor type A [Eggerthella lenta DSM 2243] gi|257474714|gb|ACV55034.1| von Willebrand factor type A [Eggerthella lenta DSM 2243] Length = 555 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 77/192 (40%), Gaps = 25/192 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW ++ +VM + ++ +++ ++DVS SM+ K+ + S A+ Sbjct: 162 PWNDQTKLLVMGFATEKDGDASPTGA-NLVFLIDVSGSMD-----DPDKLPLVKDSFAAL 215 Query: 206 LEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 +E + + +VT+++ ++ + + R + L G STN GL Sbjct: 216 VEGL------TERDRVSVVTYASGERVLLEGVPGDDKRRIMRAVDSLVAEG-STNGEAGL 268 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 + AY ++ + + N +V +DG+ N+ + + + + ++ G + Sbjct: 269 EQAY-RLAESSFIEGGVNR--------VVMASDGDLNVGISSESELHDFVEQKRETGVYL 319 Query: 323 YAIGIRVIRSHE 334 +G + Sbjct: 320 SVLGFGSGNYKD 331 >gi|149925131|ref|ZP_01913438.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] gi|149813994|gb|EDM73632.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] Length = 413 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 86/271 (31%), Gaps = 56/271 (20%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLI- 212 P +V++ + ++M+V+D S SM + +D + + N + V + Sbjct: 64 TGPDCGTVEITPEYVPP-NVMLVVDASGSMVNNSWDHDLDPNTPQVTRWNTLHGVVDNVM 122 Query: 213 PDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN- 268 + + + +G+ F + + + + + GV ++ + A Sbjct: 123 TNFGSTMYAGVQRFPSADACPDATPQSSNCYNLGSCIVGTQPEVGVGADNGDAILAAIPG 182 Query: 269 -------------QIFDMQGMRQHCNTEDANYKKIIVFMTDGENL---------STKEDQ 306 + H ++ + I+F+TDG + Sbjct: 183 PGAGNTEIVGGTPATLGINSAVDHLLAQNETNPRYILFITDGAANCNTDLPYPEYIESYD 242 Query: 307 QSLYYCNEA--KKRGAIVYAIGI------------RVIRSHEFLR------ACASPNS-- 344 ++L EA + G Y +GI ++ F+R A +P + Sbjct: 243 ETLPTTVEAAFEDEGITTYVVGIDIVDMLQGAGTDGSPEANPFVRLNDVALAGGAPKNEG 302 Query: 345 -----FYLVENPHSMYDAFSHIGKDIVTKRI 370 F+ N + DA I + + I Sbjct: 303 MDAEKFFNTTNQQELLDALQVILEGVTECVI 333 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 66/421 (15%), Positives = 124/421 (29%), Gaps = 107/421 (25%) Query: 10 NFFYNYKGGMTILT-AIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 N +G MTI A+ P +F+ +G ++ + + + Sbjct: 9 NALKKQQGSMTISFLALLFPALFIAVGTLMVSAQVMVS----------------NRAAQA 52 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + + ++ L+ + + I + + GYS Sbjct: 53 ADSAALACAFADTATLPMMRAYQEYYKPTLK-GVSGLEPEIIGSECRISL-------GYS 104 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 +S PL Y SV + Q+ ++++VLDVS SM S Sbjct: 105 LS-------PLLPNFQYESYATKVTATGGGYKSVVESKQSSIPTELVLVLDVSGSMGSN- 156 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNV---VQSGLVTFSNKIEEFFLL--------- 236 I ++ L ++ + N V +V F + + Sbjct: 157 ------IQSLKSILSNALNTIQSQSNNANDLDSVSISIVPFDSGVAAQRPPWLSKEAAGI 210 Query: 237 --------EWG------------VSHLQRKIKYLSKFGVSTN---STPGL---------K 264 G H Q+ +K+ G ++ S+P L + Sbjct: 211 YCIDGLNYRNGNFSAALTVDNLATLHSQQPVKFAKPNGWLSDCNQSSPMLPLTSVFSRVR 270 Query: 265 YAYNQIFDMQGMRQH--------------------CNTEDANYKKIIVFMTDGENLSTKE 304 + N + G R + ++ +V TDG + Sbjct: 271 NSINSLTANGGTRSFHGLLWGVRQLIPSWQQAWGINVSTVPETRRKLVLFTDGADEGDTF 330 Query: 305 DQQS-LYYCNEAKKR-GAIVYAIGIRVIRSH--EFLRACASPNSFYLVENPHSMYDAFSH 360 DQ +C A + G + IG V S +F R +P+ + N + + FS Sbjct: 331 DQLVNAGFCTTAINQYGIEMNFIGYGVSSSRIAQFERCAGNPSRVFSATNTTQLNEYFSD 390 Query: 361 I 361 I Sbjct: 391 I 391 >gi|302543871|ref|ZP_07296213.1| putative toxic cation resistance protein [Streptomyces hygroscopicus ATCC 53653] gi|302461489|gb|EFL24582.1| putative toxic cation resistance protein [Streptomyces himastatinicus ATCC 53653] Length = 232 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 71/199 (35%), Gaps = 34/199 (17%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM +++ D ++ +++A + + +P +V FS I+ Sbjct: 29 VYLVLDRSGSMRNYYKDGTVQHFAEQTLALSASVNDSGTLP---------IVFFSTDIDG 79 Query: 233 FFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + V +++ L TN ++ +H Sbjct: 80 TAEIA--VDDYNGRVEELHASLGHMGRTNYHRAIEAVI----------EHYKASGTKDPA 127 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVI--RSHEFLRACASPNSF 345 ++VF TDG +++L C A G ++ +G + +FLR + + Sbjct: 128 LVVFQTDGAPTGKAAAEKAL--CEAA---GLPLFWQFVGFGDPEAKGFDFLRKLGTSPAL 182 Query: 346 YLVENPHSMYDAFSHIGKD 364 + AF H G D Sbjct: 183 AVPAKRALDNAAFFHAGLD 201 >gi|297279075|ref|XP_001109489.2| PREDICTED: calcium-activated chloride channel regulator 4-like [Macaca mulatta] Length = 931 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 72/205 (35%), Gaps = 54/205 (26%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 + +VLD S SM + +++ K+ L ++ V N G+V F + Sbjct: 307 VCLVLDKSGSM-----AGYDRLNQMNKAAKYFLLQI-----VENGSWVGMVHFDSTATII 356 Query: 230 ---------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 E LL W ++ T+ G+K A+ I + H Sbjct: 357 NKPIQIISSDERNTLLAWLPTYAS----------GGTSICSGIKSAFQVI---GELSSHL 403 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RA 338 + + +V +TDGE+ + C +E K+ GAIV+ I + + + Sbjct: 404 DGSE------VVLLTDGEDYTASS-------CIDEVKRSGAIVHFIALGTAADKAVIEMS 450 Query: 339 CASPNSFYLVENPHS---MYDAFSH 360 + + + + DAF Sbjct: 451 KITGGRHFYASDKAQNNGLIDAFGD 475 >gi|297380203|gb|ADI35090.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori v225d] Length = 217 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 69/202 (34%), Gaps = 10/202 (4%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + ++LD S SM + T+I++ I M+E +K + ++T Sbjct: 9 TMEERFIPVFLLLDTSGSMSHSL-GNGTRIEVLNLCIQKMIETLKQEAKKELFSKMAIIT 67 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F E +L ++ T + A + I D Sbjct: 68 FG---ENGAVLHTPFDDIKNINFKPLSASGGTPLDQAFRLAKDLIEDKDTF------PTK 118 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 YK + ++DGE K + + + + ++ ++I I + + + Sbjct: 119 FYKPYSILVSDGEPNDDKWQKALSDFHHYGRSAKSVCWSIFIGNRNDNPQVNKEFGKDGV 178 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + ++ + F + + I Sbjct: 179 FYADDVEKLVGLFEIMTQTISK 200 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/216 (19%), Positives = 75/216 (34%), Gaps = 27/216 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + P S K+ ++ D++IVLD+S SM + SS + + A + I + Sbjct: 82 TLAGPSISKKKMTYLSNGA-DIVIVLDISPSMGAIEFSSKNRFEFAKELIKYFAYQ---- 136 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQI 270 GLV F+ + L RK+ Y+ G + G+ A + + Sbjct: 137 ---RENDNIGLVAFAKEASLIVPLTIDRDFFSRKLDDIYIMDLGNGSALGLGVSIALSHL 193 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 K+ ++ +TDG S + + N A+ +Y+IGI Sbjct: 194 -----------KHSEAPKRSVIVLTDGVVNSDEVYKD--QVINLAQGLNVKIYSIGIGSG 240 Query: 331 RSHEFLRACASPNSF----YLVENPHSMYDAFSHIG 362 S + V +P + + S G Sbjct: 241 EELSVGFKLRSGKFYQGTLREVYDPSMLLEIASKTG 276 >gi|297690912|ref|XP_002822848.1| PREDICTED: von Willebrand factor-like, partial [Pongo abelii] Length = 1315 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 87/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 939 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 998 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + + LD++++LD S S F Sbjct: 999 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAADCSQPLDVILLLDGSSS---FPA 1055 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1056 SYFDEMKSFAKA---FISKANIGPH---LTQVSVLQYGSITTIDVP--WNVAPEKAHLLS 1107 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H + K +++ +TD Sbjct: 1108 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGVS-KAVVILVTD------ISV 1156 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1157 DSVDAAADAARSNRVAVFPIGIGDRYDAAQLRILAGP 1193 >gi|271968449|ref|YP_003342645.1| von Willebrand factor type A domain-containing protein [Streptosporangium roseum DSM 43021] gi|270511624|gb|ACZ89902.1| von Willebrand factor type A domain protein [Streptosporangium roseum DSM 43021] Length = 490 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 66/201 (32%), Gaps = 26/201 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + ++ V+DVS SM ++D+ ++++ +++++ Q Sbjct: 139 KAEPEARRPANLTFVVDVSGSM-----GEPGRLDLVREALHKLVDQLGPGD------QVS 187 Query: 223 LVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V FS ++ G L I L STN GL Y + R Sbjct: 188 IVAFSTQARLVLSMTPATGRDQLHAAIDRL-GVEDSTNLETGLTAGY-----AEAARAF- 240 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++ ++DG N Q L E+ R + +G+ + + Sbjct: 241 ---RPAATNRVILLSDGLANTGDTTWQGILDRVAESAGRQITLLCVGVGRDYGDQLMEQL 297 Query: 340 AS--PNSFYLVENPHSMYDAF 358 A + V + F Sbjct: 298 ADNGDGAAVYVSSADDARKVF 318 >gi|213625177|gb|AAI69984.1| Complement factor B [Xenopus laevis] Length = 747 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 77/244 (31%), Gaps = 35/244 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S V D +++ IVLD S+S+ + D A + +E++ Sbjct: 218 NVDTTNLEDRSDRSVRILKDGLMNIFIVLDTSKSVGQN------RFDEAKSASILFIEKM 271 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSK----FGVSTNSTP 261 ++++++K L + ++ TN+ Sbjct: 272 SNYDIKPRYC---IISYASKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY------- 311 L Y + + + R+ + +I+ MTDG+ N+ ++ Sbjct: 329 ALHAIYEHLIEQELAYEREGKKEDFMKIHNVILLMTDGKFNMGGDPREEMKLIKRFLDVG 388 Query: 312 --CNEAKKRGAIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSH--IGK 363 + ++ VY G+ ++ + ++N M + F I Sbjct: 389 IRKDNPREEYLDVYVFGLGSDIDQPEINDLASKKEKEVHTFHLQNVDKMKEFFELMIIED 448 Query: 364 DIVT 367 D+ Sbjct: 449 DVFD 452 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 63/470 (13%), Positives = 130/470 (27%), Gaps = 139/470 (29%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N +G + +L A+ + ++G+ ++ KT L + D + + N Sbjct: 20 SNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIAAN 79 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++ + + +F F V + L+IV Q ++ A Sbjct: 80 AQQSDVMTAGIKAGEYQALKAFNVNASKVPFA-----TVSLSQLEIVRSGQ----TLDAT 130 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-------- 184 Y ++ + + + + +S + LD +++DVS SM Sbjct: 131 VSYTATVQSTFGRLFGLSVTTLTNRVNASADIA----GYLDFYLMVDVSGSMGLPTTDSD 186 Query: 185 ---------ESFFDSSIT------------------KIDMAIKSINAMLEEVKLIPDVNN 217 E + + D ++ A+L+ P V N Sbjct: 187 AALLASKSVEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASK-PIVPN 245 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + G+ F N++ L + L+ + + A+ + D + Sbjct: 246 QYRIGIYPFINRLATLAPLSDTTT-------SLASLKTTADCGKAWPLAFTNLLDTGSTQ 298 Query: 278 QHCN-------------------------------TEDANYKKIIVFMTDGENLSTKE-- 304 N + AN K + +TDG S Sbjct: 299 LFTNNDPTTGTGSGGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSS 358 Query: 305 --DQQSL-------------------------YYCNEAKKRG--AIVYAIGIRVIRSH-- 333 D+++ C + KK G + I ++S+ Sbjct: 359 WKDKKTYPGNPSKFAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTN 418 Query: 334 -------------------EFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + LR CASP F+ + + + + Sbjct: 419 EGTIVWENNRVNGFSPTLADPLRQCASPGLFFTANSAKDITASLGAMFDQ 468 >gi|90406967|ref|ZP_01215158.1| putative RTX toxin [Psychromonas sp. CNPT3] gi|90312009|gb|EAS40103.1| putative RTX toxin [Psychromonas sp. CNPT3] Length = 3350 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 78/190 (41%), Gaps = 13/190 (6%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLI 212 I++ T ++ ++ A ++++VLD+S SM S S ++D A +++ +LE I Sbjct: 2744 IIVEETVNLVISEPEIAETNLILVLDISGSMNGSIEGSDQNRLDFAKTALSNLLE----I 2799 Query: 213 PDVNNVVQSGLVTFSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + VV LV F N I L+ G L ++Y+ + A + Sbjct: 2800 QNTLGVVNVNLVAFENNIFSSHWVTLDGGPEGLASILQYIENLNADAYGGTNYQDALKTV 2859 Query: 271 FDMQGMRQHCNTEDANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 Q + D + K IVF++DG+ + + N+ I + +G Sbjct: 2860 MS--EFEQGVASGDIDVSKDTNIVFLSDGKPGQSIINNPVEQEWNDFTSNYNIDSINTVG 2917 Query: 327 IRVIRSHEFL 336 I++ + L Sbjct: 2918 IQISAGDQVL 2927 >gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus leucogenys] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ + Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAED------RFSVIDFNQNVRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEASNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 471 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|328858486|gb|EGG07598.1| hypothetical protein MELLADRAFT_77518 [Melampsora larici-populina 98AG31] Length = 503 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 61/192 (31%), Gaps = 20/192 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+ +LD + SM S+ ++ I++ I + +L ++ GL+ + + Sbjct: 30 LDLCFILDTTGSMGSYITAATQNIELICDEI---INSERLASP--ECLRIGLIAYRDHPP 84 Query: 232 EF-------FLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCN 281 + F +Q +K L G T + A + Q R Sbjct: 85 QDRSYITLKFDFTSNPKIVQEHLKSLWASGGGDGPEAVTAAMHEALTLDWRPQASRMAVL 144 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 DA I + DG + L + G ++ + S S Sbjct: 145 ITDAPPHGIGEY-GDGFSKGDPSGHDPLKLARTMAQNGISLFVVACEPAFSG----YMYS 199 Query: 342 PNSFYLVENPHS 353 + F + N S Sbjct: 200 NDFFRAITNITS 211 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 61/154 (39%), Gaps = 24/154 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS-------SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 Q + + V D S SM + +++D+ K M++++ I +V+ V Sbjct: 193 QGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKDLAEIGNVS-VN 251 Query: 220 QSGLVTFSNKIEEFFL-LEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQG 275 G T + I++ F L+ G + + I L+ GV TN GL+Y + Sbjct: 252 LVGFSTSAKYIQQNFSNLDNGTNTIIATITKPENLNPDGV-TNPGDGLRYGMISLQSQPA 310 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K IV +TDG + D ++L Sbjct: 311 QL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 70/172 (40%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++++ ++ +I +D+S SM + D + +++D + + +++ + Sbjct: 79 GTKISNEQRTGIETIIAMDISNSMLAE-DITPSRLDRSKMMVENLVDHFTND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + + ++T T + A + H Sbjct: 131 IGLLVFAGDAFVQLPITSDYVSAKMFLSSIDPSMMATQGTD-IARAIDMAT-------HS 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T++ K I+ +TDGE+ + +L AKK G VY +G+ + Sbjct: 183 FTQEEGIGKAIIVITDGEDH----EGGALESAEAAKKAGMRVYVLGVGSTQG 230 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 61/193 (31%), Gaps = 17/193 (8%) Query: 177 VLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 VLD S SM + DS + + + ++ + V T F + Sbjct: 348 VLDTSGSMAGARIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTGPGAPRTFTV 407 Query: 236 LEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 E + +IK ++ T L AY + + D + IV Sbjct: 408 PEENPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVL-------EPLMAADPDRFTSIV 460 Query: 293 FMTDGEN---LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLV 348 MTDGEN S + SL A K+ V+ + S E + + + Sbjct: 461 LMTDGENANGSSLPDFLTSLASLPPAMKQ-VPVFTVLFGEGSSDELTQVATRTGGKVFDA 519 Query: 349 ENPHSMYDAFSHI 361 N + F I Sbjct: 520 RNV-QLSRVFQEI 531 >gi|62088356|dbj|BAD92625.1| inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) variant [Homo sapiens] Length = 699 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 260 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 313 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 314 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 366 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 367 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 415 >gi|118357010|ref|XP_001011757.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89293524|gb|EAR91512.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 842 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/139 (23%), Positives = 54/139 (38%), Gaps = 18/139 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 K + T+ D+ VLD + SM S+ I I +L +K VN+ ++ Sbjct: 60 TKQETNTEDSCDLAFVLDCTGSMGSW-------IQRCKTQIINILNTLKQQF-VNSTFRA 111 Query: 222 GLVTF-----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDM 273 G V + +N+IE+F + L+ I L G + GL A +++ + Sbjct: 112 GFVAYRDHCDTNRIEKFSFISSQYEALENFIGKLVATGGGDGPEDLAGGLYTAIHELSWI 171 Query: 274 QGMRQHCNTEDANY--KKI 290 + DA KK Sbjct: 172 SKTKNLFIIADAPCHGKKY 190 >gi|73970088|ref|XP_531792.2| PREDICTED: hypothetical protein XP_531792 [Canis familiaris] Length = 1465 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N VQ + + + Sbjct: 710 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAVAWQEKLV 762 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + + G STN+ L+ A+ ++I Sbjct: 763 EINEDNLRGAQSWIRDI-QIGSSTNTLHALQIAFAD----------------KETQVIYL 805 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E + K + +Y I ++ FL+ AS Sbjct: 806 LTDGRPDQPPEMVI-----EQVKVFQKIPIYTISFNYNDEIANGFLKELAS 851 >gi|38347866|ref|NP_941115.1| hypothetical protein SMR0040 [Serratia marcescens] gi|190410209|ref|YP_001965710.1| hypothetical protein pK29_p040 [Klebsiella pneumoniae] gi|226807603|ref|YP_002791297.1| hypothetical protein pEC-IMP_036 [Enterobacter cloacae] gi|226809913|ref|YP_002791607.1| hypothetical protein pEC-IMPQ_035 [Enterobacter cloacae] gi|38259343|emb|CAE51568.1| hypothetical protein SMR0040 [Serratia marcescens] gi|146151002|gb|ABQ02768.1| conserved hypothetical protein [Klebsiella pneumoniae] gi|226425828|gb|ACO53921.1| hypothetical protein [Enterobacter cloacae] gi|226426139|gb|ACO54231.1| hypothetical protein [Enterobacter cloacae] Length = 668 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 70/203 (34%), Gaps = 24/203 (11%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + R K + TF + + + S+ + +++V D+S SM Sbjct: 454 QVKMRGRNKTHERGITFDGNRLILSQMGVRDVFRAQSESKNRGHVGLVLVRDISASM--- 510 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + I + + + + + + + F K EF +++ ++ + Sbjct: 511 -ANDERYIHAIKSDLA-----LSMAAESLSKMHVSNIVFPFKESEFEIIKTFDQSVEESL 564 Query: 248 KY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 L G T + L A + + D Q R KII +TDG K + Sbjct: 565 SKFTLGCKGYYTPTGSALMAAVDLLLDSQFDR-----------KIIFLITDGYPN--KSE 611 Query: 306 QQSLYYCNEAKKRGAIVYAIGIR 328 + +AK G + +GI+ Sbjct: 612 FTIVEVMEKAKCNGIEIVGVGIK 634 >gi|313895388|ref|ZP_07828945.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312976283|gb|EFR41741.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 647 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 62/166 (37%), Gaps = 19/166 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 R IV+ + + + ++++ V+D S SM ++ ++ ++ A+L E Sbjct: 439 GRRIVIARSDLRRAVRERKRGVNILFVVDASGSM-----AARARMRAVKGAMLALLREAY 493 Query: 211 LIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + D + GLV F ++ E L V QR ++ L G T GL A Sbjct: 494 VRRD-----RVGLVAFRRDRAETLLPLTRSVELAQRLLRELPT-GGRTPLAAGLSEALLH 547 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + + ++V +TDG + E + A Sbjct: 548 L-------AGAARRGELAETLLVLLTDGRATAAPEGEDPAQAALTA 586 >gi|15488640|gb|AAH13465.1| inter-alpha trypsin inhibitor, heavy chain 1 [Mus musculus] Length = 909 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 292 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 339 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + Q ++ S +TN GL + QG ++ + Sbjct: 340 KGSLVPVSNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 396 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 454 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 455 QRIYEDHDATQQLQGF 470 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 17/174 (9%) Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 E + L+ D ++ + G V F+ ++ + L + + K+ N +P + Sbjct: 70 EVLSLMDDASSKDRFGFVGFNKEVTKELALTNNIVQAKSKLNEF-----GKNISPYMAND 124 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++ ++ + + K+IV MT G ++ ++ S +A + ++ I Sbjct: 125 LSK--GLEKAVDELTKKSTSNDKVIVIMTVGNSIY---NEVSKKLAAKAYEEDITIHTIS 179 Query: 327 IRVI--RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIG---KDIVTKRIWYD 373 FL A + ++ N + D S + ++ + ++ D Sbjct: 180 FGDPLYADAPFLTEIAKLTGGNYTHSPNAAFLKDVLSKLSIPVQNFSGREVYSD 233 >gi|260796039|ref|XP_002593012.1| hypothetical protein BRAFLDRAFT_201563 [Branchiostoma floridae] gi|229278236|gb|EEN49023.1| hypothetical protein BRAFLDRAFT_201563 [Branchiostoma floridae] Length = 862 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/142 (19%), Positives = 57/142 (40%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM T +D+A ++ L+ ++ + + LV+F + Sbjct: 4 ILFVIDTSASMNQRTYMGTTLLDVAKGAVETFLK-LRQRDPGSRADRYMLVSFEDPP-AA 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI- 290 W +H+ ++K L G++T LK A+ + ++ + + Sbjct: 62 IKAGWKENHVAFMNELKNLQATGMTTM-GQALKQAF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 291 -----IVFMTDGENLSTKEDQQ 307 +V +TDG L++ Q Sbjct: 120 LEPAMVVTITDGSKLTSSSGVQ 141 >gi|156394499|ref|XP_001636863.1| predicted protein [Nematostella vectensis] gi|156223970|gb|EDO44800.1| predicted protein [Nematostella vectensis] Length = 175 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 55/160 (34%), Gaps = 21/160 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S S+ + + + + + + P + ++ L+ +S+ + Sbjct: 18 DIAFLMDSSGSIG------VRDYKKEKQFVQGLSDIFDISPGQS---RASLIIYSDFPKL 68 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F LE GV++ + +K L T L A D +I Sbjct: 69 IFDLEDGVTNQNITSVLKNLEYLRGRTRIDKALMMAEEVFADA----------RPTVPRI 118 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 +TDG+ + KK G +Y IG+ Sbjct: 119 AFILTDGKQTQDYDAIPLDVSSQRLKKMGVKIYVIGVGPY 158 >gi|118594676|ref|ZP_01552023.1| MxaC protein, putative [Methylophilales bacterium HTCC2181] gi|118440454|gb|EAV47081.1| MxaC protein, putative [Methylophilales bacterium HTCC2181] Length = 324 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 79/274 (28%), Gaps = 51/274 (18%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT------DARLDMM 175 N YS + + + + +S I + + + S + Sbjct: 24 SSNNNYSWNEMIPSDPLSTIIGVLLKFLSSVIIALLVVLLAEPFSDQKVVERVGEGAQIG 83 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +VLD S SM+ F K + + L + N + G++TFSN Sbjct: 84 LVLDRSASMDDPFSGGGDKAGETKSAAASRL-IIDFFEARTNDM-VGVITFSNSAMFVLP 141 Query: 236 LEWGVSHLQRKIKYL-------SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L ++ + + G S+ L +N++ D Sbjct: 142 LTENKEAIKAAVTATAGNALFQTNIGAGLTSSAAL---FNEVADSG-------------S 185 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNE----------AKKRGAIVYAIGIRVIRSHEF-- 335 + ++ ++DG Q+ + + ++ G ++ + Sbjct: 186 RAVILLSDGAGRIDANTQQKIRDWFDRFDIGLYWIVLKQEGGISIFDEDYVPRDEDQLPP 245 Query: 336 -------LRACASPNSFYLVENPHSMYDAFSHIG 362 SP Y E+P S+ A I Sbjct: 246 QIELYEYFSTFRSPFKAYEAEDPKSLETAIKDIS 279 >gi|118083233|ref|XP_417223.2| PREDICTED: similar to von Willebrand factor precursor (vWF) [Gallus gallus] Length = 2797 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/246 (12%), Positives = 80/246 (32%), Gaps = 30/246 (12%) Query: 110 IVRSTSLDIVVVPQNEGYSIS---AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I TS++++ + ++ IS+ P+ ++ + +V+ S K + Sbjct: 1601 IRHPTSINVIPIGITPRANVQELRMISQPNRPIILQSYSTLIEEAPELVLQSCCSHKTWT 1660 Query: 167 Q----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +D+M +LD S S+ ++ + + A +E N V Sbjct: 1661 EIPELCNKPMDIMFLLDGSPSIG------ASEFEEMKNFVRAFIES---ADISNTSVHVS 1711 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQH 279 ++ ++ + L L + + + T + + Sbjct: 1712 VLQYARENNLEISLNKPQETEKLIEMVHSIQQREQGPTRLGKAIDFVVQNALSES----- 1766 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 + K+++ + G K + A+ ++ IG+ + LR Sbjct: 1767 -YGGRPDASKVVIVIVSG-----KSEDTVETAALTARMNKVSLFPIGVGNGYDEQQLRTL 1820 Query: 340 ASPNSF 345 P++ Sbjct: 1821 TGPSAV 1826 >gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP-6] gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound [Methanosaeta concilii GP-6] Length = 726 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 73/213 (34%), Gaps = 33/213 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 IS KI L +F + +++ VKVN + D+++VLD S SM+ Sbjct: 234 FYTISSEKIGLNLLSFKEEGQDGFFLLLAAP-DVKVNEEEIVVKDIILVLDTSGSMQG-- 290 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL---LEWGVSHLQR 245 K+D A ++ +L+ + + + +V+F+ F + Sbjct: 291 ----EKMDQAKEAARYVLDHLNPLD------RFAIVSFATTTRSFSPSLEPAAQADKGKD 340 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + L G ST+ + A + ++F+TDG Sbjct: 341 FLDRLEAMG-STDINRAMIEA-------------VGLAEEVRPTTLIFLTDGLPTEGVTV 386 Query: 306 QQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ EA +++ G+ + L Sbjct: 387 TGAILDNVAREAPDN-VRIFSFGVGDDVDTDLL 418 >gi|294011131|ref|YP_003544591.1| hypothetical protein SJA_C1-11450 [Sphingobium japonicum UT26S] gi|292674461|dbj|BAI95979.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 418 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 75/231 (32%), Gaps = 19/231 (8%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G + ++ A +P++ G+ + K + D + + A + Sbjct: 8 RLLRDRTGNVLMMAAASMPLLVGAAGLATDTVQWTLWKRQIQRQADSAALAGAYAVAQGF 67 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N ++ + + T + N F + + VV+ + Sbjct: 68 NASDSATA-DISRMALVALTQTPTIENAPTSGPFAGNAQAVR-------VVLQTSAELPF 119 Query: 130 SAISRYKIPLKFCTFIPWYTNSRH---IVMPITSSVKVNSQTDARLDM--MIVLDVSRSM 184 S I K P+ + S + + TS+V + Q +A +++ IV + S Sbjct: 120 SKILGVKAPVIYGEATAAVVGSGDYCVVSLEKTSTVGITLQGNATVNLGCGIVTNSRASN 179 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 + S T + ++ + + L+ +S +++ + Sbjct: 180 AVYAGGSSTVAATPVAAVGGLTSSSNYVSPTT------LLPYSIPVQDPYA 224 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 42/113 (37%), Gaps = 2/113 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + + + L ++ + G++I++ +F KT L S +D + AA ++ +G + Sbjct: 7 KKERGVILPIVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQEL--DGAAD 64 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + N + + V D + + S V Sbjct: 65 ALTRATSAGLTAGNANKVQYQKASASLIDTDVTFSDSLTGAFSSTFTPVANAR 117 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 61/154 (39%), Gaps = 24/154 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS-------SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 Q + + V D S SM + +++D+ K M++++ I +V+ V Sbjct: 193 QGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKDLAEIGNVS-VN 251 Query: 220 QSGLVTFSNKIEEFFL-LEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQG 275 G T + I++ F L+ G + + I L+ GV TN GL+Y + Sbjct: 252 LVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGV-TNPGDGLRYGMISLQSQPA 310 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K IV +TDG + D ++L Sbjct: 311 QL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|257892784|ref|ZP_05672437.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] gi|257829163|gb|EEV55770.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] Length = 677 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 81/259 (31%), Gaps = 56/259 (21%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + +Q +D+++V+D S SM K+ A+K + EE+ Sbjct: 96 LFDVTLAIKGNQLKKPIDLVMVIDYSSSMTG------EKLSNALKGLQEFGEELD-DSLE 148 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + ++ G+V ++ + ++ L+ ++ ++ T GL + + + Sbjct: 149 SGNIRIGIVAYNRFVYSTDDFLTDINQLEYFLRNTAESHTGTFMQKGLLEGQSLLEEKSR 208 Query: 276 MRQH------------------------------CNTEDANYKKIIV-FMTD-------G 297 + + + + F TD G Sbjct: 209 PEAEKMLVHIGDDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKYQTSG 268 Query: 298 ENLSTKEDQQSLYYCNEA--------KKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 + S N+A K G Y++ E++ R AS + YL Sbjct: 269 SSTDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLS 328 Query: 349 --ENPHSMYDAFSHIGKDI 365 EN + +A I I Sbjct: 329 IDENLTGLGNALKEIANGI 347 >gi|163782258|ref|ZP_02177256.1| hypothetical protein HG1285_05710 [Hydrogenivirga sp. 128-5-R1-1] gi|159882291|gb|EDP75797.1| hypothetical protein HG1285_05710 [Hydrogenivirga sp. 128-5-R1-1] Length = 624 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 81/211 (38%), Gaps = 31/211 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L +++D+S SM K + + ++ ++L +++ + + + Sbjct: 440 ERKDLAFELLIDISTSM--------KKEEKFVNALRSLLLVSEVLNKLKMPFSIKVFNEN 491 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + F ++ ++ + ++ +S G T+ + + + + Sbjct: 492 VYDLKDFEEDYRLAKAK-IMELISSVGGGTDLGKAINIGLESL--------ELYIKSTHR 542 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K I++ TDGE ++ + + K++ I +G+ V + + ++ ++ Sbjct: 543 KGILILFTDGEPTKGMRGEELKSFILQMKQKFPI---VGVGVGSATQLVK------DYFD 593 Query: 348 -----VENPHSMYDAFSHIGKDIVTKRIWYD 373 VE+ + AFS I ++ + + + + Sbjct: 594 RTGVSVEDVSKLPAAFSFIVENQLKRLLSVN 624 >gi|20381126|gb|AAH28814.1| Itih1 protein [Mus musculus] Length = 911 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 294 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 341 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + Q ++ S +TN GL + QG ++ + Sbjct: 342 KGSLVPVSNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 397 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 398 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 456 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 457 QRIYEDHDATQQLQGF 472 >gi|282900951|ref|ZP_06308884.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] gi|281194042|gb|EFA69006.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] Length = 575 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 54/177 (30%), Gaps = 27/177 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K T + +M V+D S SM + ++ L + N V G Sbjct: 391 KTQKDTGKTVYLMAVIDTSGSMY----------GGPLNAVKDGLRIASQQINPGNYV--G 438 Query: 223 LVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LVT+ ++ L I L G + G+ A +++ Sbjct: 439 LVTYGDQPVNLVKLAPFDDLQHKRFLAAIDNLQADGATAM-YDGMMVALSEL-------V 490 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + N K ++ +TDG+ ++ G VY I + E Sbjct: 491 QQKKTNPNGKFYLLLLTDGQTNQGFNFEEVKEIIQY---SGVRVYPIAYGEVNEAEL 544 >gi|302560610|ref|ZP_07312952.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] gi|302478228|gb|EFL41321.1| von Willebrand factor type A domain-containing protein [Streptomyces griseoflavus Tu4000] Length = 449 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 58/175 (33%), Gaps = 32/175 (18%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T I +T+ + +++++D S SM D TK+ A + Sbjct: 37 ATGGGTIGSAVTAPHLYSPGEGPSAAVVLMVDCSGSM----DYPPTKMRNARDA------ 86 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--------VSHLQRKIKYLSKFGVSTNS 259 I + + V+ ++ ++ +E + G ++ ++ LS G T Sbjct: 87 TAAAIDALRDGVRFAVIGGTHVAKEVYPGGGGLAVAGPGTREQAKQALRKLSA-GGGTAI 145 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYC 312 L A + +H + +TDG N ++ +L C Sbjct: 146 GTWLGLADRLLASADVAIRHG-----------ILLTDGRNEHESQEDLKAALEAC 189 >gi|255039144|ref|YP_003089765.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951900|gb|ACT96600.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 935 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/193 (21%), Positives = 72/193 (37%), Gaps = 27/193 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A +M+++LDVS SM S + K+ + +SI ++L V+ ++ +V +S K Sbjct: 755 APNNMVLLLDVSSSMNSPY-----KMPLLKRSIKSLLTLVRPEDMIS------IVLYSGK 803 Query: 230 IEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 G S + R I L G T+ G+K AY + Q +R N Sbjct: 804 ARVVLKPTSGAKASEISRMIDLLQSDG-DTDGNEGIKLAYKTA-NKQYIRGGNNR----- 856 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLRACASP-NSF 345 IV TDGE D+ A++ ++ G + + S+ Sbjct: 857 ---IVLATDGE--FPVSDEVMDMIRQNARQDVYLSIFTFGRHEHTGQKLKKLSELGMGSY 911 Query: 346 YLVENPHSMYDAF 358 V + + Sbjct: 912 AHVTDASADLQLI 924 >gi|308472863|ref|XP_003098658.1| hypothetical protein CRE_04224 [Caenorhabditis remanei] gi|308268258|gb|EFP12211.1| hypothetical protein CRE_04224 [Caenorhabditis remanei] Length = 392 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 68/191 (35%), Gaps = 15/191 (7%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S+ M + ++ + I S+ + + + GLVT+++ Sbjct: 39 LDVVAVVDNSQGMT---NEGLSNVAADIFSVFSSGTRIGSNSSEPRTTRLGLVTYNSAAT 95 Query: 232 EFFLLEWGVS--HLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L S + I LS +T+S + Q NT +YK Sbjct: 96 QKADLNKFQSIGDVANGIGNALSTVVDTTDSYLATGL---ILAAKMFNEQSVNTNRGHYK 152 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSF 345 ++++ + L N K G + + + L+ ASP F Sbjct: 153 RVVIVF--ASEYKGIGELDPLPVANRLKLSGVNIITVAYQQAGDDGLLQGLSQVASPG-F 209 Query: 346 YLVENPHSMYD 356 V NP ++ Sbjct: 210 SFVNNPLNLVT 220 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 16/166 (9%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F+ +++G + LT I +P++ +II+V + T L + +D + A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGAREL-- 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV-PQNEG 126 G D T N F + + + + + G Sbjct: 64 ----------DGRDDAITRAQTAIEKISN---SAAFSAGGTGMSLGSHISVTYDAGNDAG 110 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL 172 +++ + IP T IP + + V Q + Sbjct: 111 STVTVLFLKDIPANDDTPIPSSMETTVASEASYAWVIAKPQAMQTI 156 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 46/152 (30%), Gaps = 13/152 (8%) Query: 223 LVTFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLK--YAYNQIFDMQGMRQH 279 T+S+ ++ + SH+ + G T + GL+ + G+ Sbjct: 387 AWTYSDGQQDPPGFDIPTTSHMPQCAGNTFCSGSGTYTHLGLQLIKDNQVQYQADGLMDG 446 Query: 280 CNTEDANYKKII-VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHE 334 + + +TDG+ + Q E G Y IG + Sbjct: 447 AEFPTNDETIYFNILITDGQYNGYSTNAQVQGELEEMYNDGITTYVIGFGDGVDTPAAMA 506 Query: 335 FLRACAS-----PNSFYLVENPHSMYDAFSHI 361 L+ A ++Y N + A + I Sbjct: 507 QLQNMAQWGSGDSENYYDANNQAELEAALTSI 538 >gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 [Mus musculus] gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 950 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDD------QFSVVDFNHNVRTW 362 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNMGLLNPDSVS 418 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 +I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 419 -LIILVSDGDPT--VGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRG 475 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 476 IAQRIYGNQDTSSQLKKF 493 >gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Macaca mulatta] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNNVVQSGLVTFSN 228 ++ V+DVS SM K+ ++++ +L++++ D N V++ Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRT------W 358 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + VS +R I+ + G TN L A + + + + + Sbjct: 359 RNDLISATKTQVSDAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENRG 471 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|75910929|ref|YP_325225.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75704654|gb|ABA24330.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 608 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 59/166 (35%), Gaps = 12/166 (7%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNS-----QTDARLDMMIVLDVSRSMESFFDSS 191 + + + +T+ I + + S+ S +D ++V+D S SM + Sbjct: 1 MNFNYLRYFLQFTSIPSICLLLILSISQFSKVLAQNPKGGIDWIVVVDTSASMRG-VGGT 59 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 SIN + KL V VT + L++ I L Sbjct: 60 RNIFAQVKNSINEFVNTAKLGDTVTIYNFDSDVTLQAQEIPIVSNPD-RGKLKQIINNLK 118 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 GV T++ ++ A + + Q NT D IVF+TDG Sbjct: 119 ADGVRTHTGKAVQQA---LSTSAKLNQRPNTADRTVS--IVFLTDG 159 >gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus] Length = 950 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDD------QFSVVDFNHNVRTW 362 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNMGLLNPDSVS 418 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 +I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 419 -LIILVSDGDPT--VGELKLSKIQKNVKQSIQDNISLFSLGIGFDVDYDFLKRLSNENRG 475 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 476 IAQRIYGNQDTSSQLKKF 493 >gi|332365023|gb|EGJ42788.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK355] Length = 462 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 35/154 (22%), Positives = 61/154 (39%), Gaps = 24/154 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDS-------SITKIDMAIKSINAMLEEVKLIPDVNNVV 219 Q + + V D S SM + +++D+ K M++++ I +V+ V Sbjct: 193 QGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKDLAEIGNVS-VN 251 Query: 220 QSGLVTFSNKIEEFFL-LEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQG 275 G T + I++ F L+ G + + I L+ GV TN GL+Y + Sbjct: 252 LVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGV-TNPGDGLRYGMISLQSQPA 310 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K IV +TDG + D ++L Sbjct: 311 QL-----------KYIVLLTDGIPNAYLVDSRAL 333 >gi|322434933|ref|YP_004217145.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162660|gb|ADW68365.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 347 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 79/219 (36%), Gaps = 37/219 (16%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +K S DA + + IV D+S SM +K A K+++ L + Sbjct: 113 IKTFSTQDAPVTIGIVFDLSGSM-------TSKFGRARKALSEFLRTSNPADEFF----- 160 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +V F++K V ++ ++ L T + N++ + + R Sbjct: 161 -VVGFNDKPAVIVDYTSDVEDVEARMVMLKPENR-TALIDAVYLGVNKLKEAKYDR---- 214 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHE---- 334 K ++ ++DG + ++ + L ++ +Y+IGI E Sbjct: 215 -------KALLIVSDGGDNRSRYTEGELR--RVVRESDVQIYSIGIYDAYAPTEEEQLGP 265 Query: 335 -FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 L+ + + + V + M D S I ++ + + Sbjct: 266 VLLKDISEMTGGRMFPVTDIADMADIASRISAELRNEYV 304 >gi|153947567|ref|YP_001399499.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|152959062|gb|ABS46523.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] Length = 212 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 60/172 (34%), Gaps = 14/172 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I I AM+ ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E+ L + + T + L+ I + Q + + + Sbjct: 57 REYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCVDRDIQRSDGDQKGDWRP 111 Query: 291 IVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +VF MTDG +++ K+ + A + HE L+ S Sbjct: 112 LVFLMTDGTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAKHEHLKQLTS 161 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 39/107 (36%), Gaps = 4/107 (3%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 G + IL I L +F + G++I++ + + S+ D + + Q+ N Sbjct: 108 RGGAVLILIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTT- 166 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 ++ D + +W R EL D N +S I Sbjct: 167 TRIAVVDRAAEMSESWGPH-RIELEDG--YNLGGSGKPESSQTINRD 210 >gi|168185240|ref|ZP_02619904.1| hypothetical protein CBB_2323 [Clostridium botulinum Bf] gi|237795478|ref|YP_002863030.1| hypothetical protein CLJ_B2256 [Clostridium botulinum Ba4 str. 657] gi|182671719|gb|EDT83680.1| hypothetical protein CBB_2323 [Clostridium botulinum Bf] gi|229261279|gb|ACQ52312.1| hypothetical protein CLJ_B2256 [Clostridium botulinum Ba4 str. 657] Length = 647 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 47/303 (15%), Positives = 91/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFWKK-LIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + YK L T + + Sbjct: 386 KDGQVKGKLDVDSFTNFYPDFVEAEKKGNYK-NLPIFNRYLLETQADILP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+ + K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEASRKALAVTLLS---IDDFNRYLKSNAEQ 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S + R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N+I +Q + KII +TDG + ++++ E + VYA Sbjct: 541 ISNKITSIQERELKKGKQ----IKIIFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] Length = 946 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + +V F++ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VVDFNHNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A + + M + + Sbjct: 359 RNDLVSATKTQIADAKRYIEKIQPNG-GTNINEALLRAIFILNEANNMGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 I+ ++DG+ + + K+ ++++GI ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKQNIQDNVSLFSLGIGFDVDYDFLKRLSNENRG 471 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|332360778|gb|EGJ38585.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK355] Length = 452 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 52/323 (16%), Positives = 90/323 (27%), Gaps = 43/323 (13%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A T+I + K GD++ KN + + N V Sbjct: 52 EAVRSPIEAAGTEIAADTTKPVVEESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETS 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 112 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM ID+ +IN + + + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATF 221 Query: 227 SNKIE------------------EFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAY 267 S+++ + L++ + + G T TP A Sbjct: 222 SDEVYGRNSGSKDEDTVLTRFGFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PAL 279 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 NQI + KK +V +TD E K G + Sbjct: 280 NQIISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATV 334 Query: 328 RVIRSHEFLRACASPNSFYLVEN 350 + I + A+ +EN Sbjct: 335 KAIEG--IYKNFATEGRVLDIEN 355 >gi|226487566|emb|CAX74653.1| Loss of heterozygosity 11 chromosomal region 2 gene A protein homolog [Schistosoma japonicum] Length = 832 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 48/272 (17%), Positives = 92/272 (33%), Gaps = 52/272 (19%) Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN----------EGYSISAISRYKIP 138 + ++F L + D+ I S +LD P N + + + P Sbjct: 194 PYTITFEAHLWMQSKILDVKSIHDSFTLDRTNNPNNVIVKLASGFVPDHDLQMVISLSDP 253 Query: 139 LKFCTFIPWYTNSRHIVMP--------ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 L + + +++ ++ + V S D R + + ++D S SME Sbjct: 254 LTSLVSLEYGDSNQSSLLAMNCLMVQILPDIPNVVSSKDMRNEFVFLIDRSGSMEG---- 309 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEWGVSHLQRKIK 248 I A S+ L+ + VN Q + F + F ++ L + Sbjct: 310 --DNISYAKTSLLLFLKSL----PVNCRFQI--IGFGSNFAALFSEPTDYSEDSLNAAMN 361 Query: 249 Y---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKE 304 Y L+ T + LK A H + + K I+F+TDG+ + + Sbjct: 362 YQKDLNADMGGTEAYNALKSAL-----------HSSPSGEGWFKQIIFLTDGDVGNADEV 410 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++A+ V+ IG+ S + Sbjct: 411 IGLVRMNVDKAR-----VFTIGLGQGVSTALI 437 >gi|298294144|ref|YP_003696083.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296930655|gb|ADH91464.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 356 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 75/215 (34%), Gaps = 39/215 (18%) Query: 174 MMIVLDVSRSME-SFFDSSITKIDMAIKSINA-MLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++ D S SM+ SF D + T + + +L E + + G+ FS Sbjct: 96 LVLLFDRSSSMDNSFADRAPTGDQESKSAAAKRLLAEFVARRPHD---RIGVAAFSTSPM 152 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + I + + G++ T+ GL A++ D + Sbjct: 153 PVLPLTDHHEVVDAAIDAIDRPGLAFTDVGRGLALAFSYFADDTDETS----------RA 202 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG---------IRVIRSHE------- 334 ++ ++DG L + Q +L + A + +Y + V + Sbjct: 203 VLLVSDGAALIDRRVQDALR--DAAARTPVHLYWLFLRSRGSPGIFEVPPGEDTPQANPE 260 Query: 335 -----FLRACASPNSFYLVENPHSMYDAFSHIGKD 364 FL+ P + +P ++ DA + I + Sbjct: 261 RHLHLFLQGLGLPYRAFEATSPQAVADAIAEIDRQ 295 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 76/197 (38%), Gaps = 30/197 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + +V F++ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VVDFNHNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + M + + Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNMGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--- 343 I+ ++DG+ K + + ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPTVGELKLSKIQKNVKENIRDN-ISLFSLGMGFDVDYDFLKRLSNENRGI 472 Query: 344 --SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 473 AQRIYGNQDTSSQLRKF 489 >gi|3024048|sp|Q61702|ITIH1_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|695632|emb|CAA49841.1| inter-alpha-inhibitor H1 chain [Mus musculus] Length = 907 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 74/196 (37%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +LE+++ + + + LV F +K++ Sbjct: 290 LVFVIDISGSMEG------QKVRQTKEALLKILEDMRPVDNFD------LVLFGSKVQSW 337 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + Q ++ S +TN GL + QG ++ + Sbjct: 338 KGSLVPVSNANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 393 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL ++ N+ Sbjct: 394 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSTENNGWA 452 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 453 QRIYEDHDATQQLQGF 468 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 70 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 119 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 120 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 169 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 170 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 222 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 223 SDQVQNNGLIDAFGALS 239 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 264 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 317 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 318 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 371 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 372 -----HKKVIVLLTDGVPT 385 >gi|81892746|sp|Q6Q473|CLCA4_MOUSE RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel regulator 6; Short=mClca6; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|50882459|gb|AAS86332.2| calcium activated chloride channel [Mus musculus] Length = 924 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M +VLDVS SM S+ +++ ++ L ++ + N G+V FS++ Sbjct: 308 MCLVLDVSGSMTSY-----DRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 357 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S ++R ++ L T+ G+K A+ + + Sbjct: 358 HELIQINSDIERNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE--------- 408 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLVE 349 I+ ++DGE D + +E K G+IV+ I + + + + + L Sbjct: 409 ILLLSDGE------DSTAKDCIDEVKDSGSIVHFIALGPLADLAVTNMSILTGGNHKLAT 462 Query: 350 NPHS---MYDAFSHIGKD 364 + + DAF + + Sbjct: 463 DEAQNNGLIDAFGALASE 480 >gi|326504464|dbj|BAJ91064.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 720 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 39/224 (17%), Positives = 75/224 (33%), Gaps = 35/224 (15%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 P + N + +++ + + LD++ VLDVS+SM + + K+ A+ + Sbjct: 38 APLHENRQQVLLEVIDASSAGGDRRLGLDLVAVLDVSKSMRK--NDRLGKMKTAMHFVIN 95 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNST 260 L + + +V FS + E L + L + L TN Sbjct: 96 KLGHMD---------RLSIVKFSEEAERLCPLLSVTPPNKARLNHLVDGLQVIDP-TNIR 145 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 GL+ + + + + I ++DG+ + Sbjct: 146 DGLEAGLSVLAGRRITGGRVAS--------IFLLSDGDENRGHATTVDV--------SDV 189 Query: 321 IVYAIGIRVIRSHEFLRACA---SPNSFYLVENPHSMYDAFSHI 361 VY G + L A +F V++ +M + FS I Sbjct: 190 PVYTFGFGTDYDPKVLDEIARRSKGGTFNFVDDEENMTEPFSQI 233 >gi|149608697|ref|XP_001516114.1| PREDICTED: similar to hCG2002731, partial [Ornithorhynchus anatinus] Length = 180 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 57/168 (33%), Gaps = 24/168 (14%) Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMR 277 G+V + + F L + + + G T + G+ A + F Sbjct: 1 VGIVQYGENVTHEFNLNKYTTTEEVLTASNEIIQRGGRQTMTALGIDTARKEAFT----- 55 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH-- 333 KK++V +TDGE+ + ++ + C +AI I R + Sbjct: 56 -EPRGARRGVKKVMVIVTDGESHDNHDLEKVIQDCE---NENIQRFAIAILGSYNRGNLS 111 Query: 334 -----EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 E +++ AS F+ V + ++ +G+ I D Sbjct: 112 TEKFVEEIKSIASEPTEKHFFNVSDELALLTIVEALGERIFALEATTD 159 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 28/58 (48%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 IR F + G + ++ + LP++ G +E + I + L +D + + AA ++ Sbjct: 5 IRRFLADRTGAVALIIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAGEL 62 >gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 [Homo sapiens] gi|229462889|sp|P19823|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; AltName: Full=Inter-alpha-trypsin inhibitor complex component II; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|55958062|emb|CAI12957.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 946 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 471 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|82702351|ref|YP_411917.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] gi|82410416|gb|ABB74525.1| von Willebrand factor, type A [Nitrosospira multiformis ATCC 25196] Length = 888 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 54/177 (30%), Gaps = 20/177 (11%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + N+ R + +VLD S SM I+K+ ++ N + ++ Sbjct: 394 IAANTTARPRSAVALVLDRSGSMNEDAGDGISKVQKLREAANVFISAMQPADG------I 447 Query: 222 GLVTFSN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV F+ ++ E R I G + + Q + Sbjct: 448 GLVRFNEAAQRLMEIQEAGAAPGGTGRTIALEHIAGSDIDPAGATSIGDGVVNGKQMLDD 507 Query: 279 HCNTEDANYK-KIIVFMTDGENLSTK--EDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T Y +V +TDG D N YA+G+ + + Sbjct: 508 AQATAGTPYDVTAMVVLTDGMWNRPPPLADVMGSITANT--------YAVGLGLPSN 556 >gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens] gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens] gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens] Length = 946 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 359 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 471 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 806 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 51/257 (19%), Positives = 87/257 (33%), Gaps = 45/257 (17%) Query: 111 VRSTSLDIVVVPQNE---GYSIS-AISRYKIPLKF--------CTFIPWYTNSR--HIVM 156 S +++ N GY S + L F + + T + ++ Sbjct: 237 SPSHEVEVKRDGSNRATVGYEASEVKPDADLQLYFAPEKDEIGVNLMAYKTGDEDGYFLL 296 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + V ++ D++ VLD S SM S K++ A K++ +E + Sbjct: 297 LASPGVDAKAKQIVSKDVVFVLDTSGSM------SGKKMEQAKKALQFCVESLNDGD--- 347 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQR-----KIKYLSKFGVSTNSTPGLKYAYNQIF 271 + ++ FS + E F VS R IK L G T LK A + Sbjct: 348 ---RFEIIRFSTESEPLFDKLAAVSKENREKAGDFIKNLKAMG-GTAIDEALKKALS--L 401 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + R ++VF+TDG T ++ Q L E K ++ GI Sbjct: 402 ESKEGRPF----------VVVFLTDGLPTVGTTDEDQILKGMQERNKEKRRIFCFGIGTD 451 Query: 331 RSHEFLRACASPNSFYL 347 + L A + Sbjct: 452 VNTHLLDRIAEETRAFS 468 >gi|169642421|gb|AAI60698.1| Unknown (protein for IMAGE:4964804) [Xenopus laevis] Length = 519 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 36/250 (14%), Positives = 79/250 (31%), Gaps = 34/250 (13%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S V D +++ IVLD S+S+ + D A + +E++ Sbjct: 218 NVDTTNLEDRSDRSVRILKDGLMNIFIVLDTSKSVGQN------RFDEAKSASILFIEKM 271 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSK----FGVSTNSTP 261 ++++++K L + ++ TN+ Sbjct: 272 SNYDIKPRYC---IISYASKAISVVSLRDPDSNNADAVMEHLEEFQYDRHEDKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY------- 311 L Y + + + R+ + +I+ MTDG+ N+ ++ Sbjct: 329 ALHAIYEHLIEQELAYEREGKKEDFMKIHNVILLMTDGKFNMGGDPREEMKLIKRFLDVG 388 Query: 312 --CNEAKKRGAIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSH-IGKD 364 + ++ VY G+ ++ + ++N M + F I +D Sbjct: 389 IRKDNPREEYLDVYVFGLGSDIDQPEINDLASKKEKEVHTFHLQNVEKMKEFFELMIIED 448 Query: 365 IVTKRIWYDK 374 V K Sbjct: 449 DVFDTCGLSK 458 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 79/270 (29%), Gaps = 38/270 (14%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 D+ + + E ++ + Y+ + I+ Sbjct: 918 TDEAHETRQGPVASDEAAEPLTLKVLPEYE---------KYALGKEACRAVISIKASAEV 968 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + A + + VLD S SM +I++ ++ + +++++ G+V++ Sbjct: 969 KQRAHVALTCVLDRSGSMGG------ERIELVRETCHFLIDQLTADD------YLGIVSY 1016 Query: 227 SNKIEEFFLLEWGVSHLQR----KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 SN + E L +R I L+ G T GL+ Q + Sbjct: 1017 SNTVREDVPLLRMTPEARRLAHTMISSLTLHG-GTALYAGLEAGVKQQMAAASELKALAA 1075 Query: 283 EDANYKKI--IV----FMTDGENLSTKEDQQSL----YYCNEAKKRGAIVYAIGIRVIRS 332 IV TDG+ + + + V+ G S Sbjct: 1076 AAGGGSDSSRIVHSCFLFTDGQATTGPCTVNEIMGQMTSLQSPADQNITVHTFGFGDDHS 1135 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 E L+ A +Y + + F Sbjct: 1136 VELLQGVAEAQSGVYYYISCADDIPSGFGD 1165 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 40/119 (33%), Gaps = 1/119 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRGLHRQQGAVAIIVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + + D+ + V P N Y Sbjct: 72 ISLQVAEADGIAAGHANFA-FFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKY 129 >gi|226498336|ref|NP_001143188.1| hypothetical protein LOC100275688 [Zea mays] gi|195615532|gb|ACG29596.1| hypothetical protein [Zea mays] Length = 599 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 88/216 (40%), Gaps = 31/216 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAMLEEVKLIPD 214 + + + A +D++ ++++++SM S +++D+ ++ ++ ++ Sbjct: 33 VRVEAPSSMKNHAPIDLVTLININQSMSWPAASQTEIPSRLDLLKNAMKFIIRQLGDDD- 91 Query: 215 VNNVVQSGLVTFSNKI---EEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + +V F++K+ +LE G +++K+ L G T P L++A Sbjct: 92 -----RLAIVAFNDKVIKENTTGILEISGSGRMAIEKKVDGLVAMG-DTAFKPSLEHAVK 145 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIG 326 + D ++ IV ++DG + +K +S+ + + R V+ G Sbjct: 146 LLDDRDDKKRAG---------FIVLISDGLDGQSKWGDESITPTDPIRGLLRKYPVHTFG 196 Query: 327 IRVIRSHEFLRACA--SPNSFYLV--ENPHSMYDAF 358 + + L A S + + +N + +AF Sbjct: 197 LGKAHDPKALHYIADISYGIYSSIVTDNLDKIIEAF 232 >gi|169604987|ref|XP_001795914.1| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15] gi|160706682|gb|EAT86573.2| hypothetical protein SNOG_05509 [Phaeosphaeria nodorum SN15] Length = 1180 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 85/239 (35%), Gaps = 24/239 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRY-KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 S SL + ++ + + I++ +P P N R ++ + + ++ Sbjct: 237 TKASASLTLGTAELDKDFVMQVIAKNTGVPKAVLENHPTIANHRALMTTLVPKFSLPAEK 296 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ V D S SM + T +++A +++ L+ + + N +F Sbjct: 297 P---ELVFVCDRSGSM------NGTSMELAKQALKVFLKSLPVGVKFNICSFGSSYSFLW 347 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 K + +L +++ +F + YN + ++ + + Sbjct: 348 KKSA----SYNQENLDEAVRHAEQFSAN----------YNGTEMLAPLKATIDQRYKDMP 393 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 I+ +TDG+ + L K+ V+ +GI SH + A + + Sbjct: 394 LDIILLTDGQIWDQERLFSYLNEAITGSKQPVRVFTLGIGNGVSHALIEGVAKAGNGFS 452 >gi|307288765|ref|ZP_07568743.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|306500234|gb|EFM69573.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0109] gi|315166099|gb|EFU10116.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1302] Length = 711 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 114 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 163 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 214 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 215 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 258 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 59/165 (35%), Gaps = 20/165 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M+VLD S SM T++D A + + E+ + GLVT+ ++E Sbjct: 70 MLVLDSSGSMNVQDAGGQTRLDAAKDATKKFVSEL------GGTIPLGLVTYGGTVDEAP 123 Query: 235 LLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK-IIV 292 + G + + K V + T + + + G E+ + IV Sbjct: 124 ENQEAGCQDI--HVVSGPKEDVGDSFTGPIDALQAKGYTPIGDSLKKAAEELGGQHGTIV 181 Query: 293 FMTDGENLSTKEDQQSLYYCNEAK---KRGA--IVYAIGIRVIRS 332 ++DG + C AK ++G ++ IG V Sbjct: 182 LVSDGIDTCAPP-----PVCEVAKELHEQGIDLVINTIGFNVDEE 221 >gi|3982895|gb|AAC83698.1| complement factor Bf-2 [Oncorhynchus mykiss gairdneri] Length = 749 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 79/221 (35%), Gaps = 28/221 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ +L++ I +D+S S+E + + A ++ ++ +V N Sbjct: 229 KITVDKAGKLNIYIAMDISDSIEE------DQFNKARNAVKKLITKVSSFAVSPNYE--- 279 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK------FGVSTNSTPGLKYAYNQIFDMQG- 275 ++ ++ + E + + ++ ++ + +G N L A+ I + Sbjct: 280 ILFLASDVFEVVNILDFLGEKRKTLEDVLADLDNFNYGDRQNVGTNLNLAFKTILERMAI 339 Query: 276 MRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCN-------EAKKRGAIVYAIGI 327 +Q T +++F TDG N+ + D E + + +Y G+ Sbjct: 340 QKQRNETLFKEVHHVLIFFTDGAFNMGGRPDDTVAKVREMVYMNQKEERDKYLDIYVFGV 399 Query: 328 RVI----RSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 + + + ++ +++ + + F I + Sbjct: 400 GREIFDEDIQPLVTKRDNEDHYFKLKDGTELEETFDKIIDE 440 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 64/179 (35%), Gaps = 26/179 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +D S SM + I + + + +L+E ++ V + F Sbjct: 33 EEPPTRQIALCIDASGSM---AGNDIEQARAGAEWVFGLLDE-------DDYVSI--IAF 80 Query: 227 SNKIEEF-FLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 N++ WG + + ++ G T+ GL A + D+ Sbjct: 81 DNEVTTVLAPTRWGTISRETAVDAVADISAGGGTDMYSGLLEAKASLQDL--------PT 132 Query: 284 DANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 D N + ++ ++DG +N E +L E G + A GI E +R + Sbjct: 133 DDNTARRVLLLSDGKDNSHDPEAFGTL--AREIDTEGIRIKAAGIGSDYREETIRTLGT 189 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 339 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 340 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 393 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 394 -----HKKVIVLLTDGVPT 407 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 58/185 (31%), Gaps = 15/185 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 226 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 282 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 283 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 333 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 334 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 391 Query: 356 DAFSH 360 F Sbjct: 392 AVFRQ 396 >gi|291295700|ref|YP_003507098.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470659|gb|ADD28078.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 354 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 58/189 (30%), Gaps = 38/189 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D SRSM + D + +++ A + +E L P Q GLV+FS+ Sbjct: 88 IVLAVDTSRSMLA-TDLNPNRLEAAKATARKFIE---LAPP---TTQIGLVSFSDSASAL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR---------------- 277 + L I+ L +T+ + + +R Sbjct: 141 VMPTTDRQKLLEAIERLKP-AQNTSIENAIITGVRMLPGRNTLRPPAELQPPGLSQPDPL 199 Query: 278 ---------QHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY----CNEAKKRGAIVY 323 Q +V ++DG N+S+ + AK +Y Sbjct: 200 QGIPDLPLPQQAQPPANLPPGSLVILSDGASNVSSNPTLPTRTTLEVAARFAKNANVRLY 259 Query: 324 AIGIRVIRS 332 + Sbjct: 260 TFPMGQPGG 268 >gi|55958063|emb|CAI12958.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 935 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 300 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 347 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 348 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 404 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 405 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 460 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 461 IAQRIYGNQDTSSQLKKF 478 >gi|333030668|ref|ZP_08458729.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741265|gb|EGJ71747.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 342 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D S +++ + I+ +++E+ + Sbjct: 79 GTKLETVEKQGIEVMIALDISNSMLAQ-DVSPSRLAKSKLLISKLVDELHND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + + +S T+ + A +G+ Sbjct: 131 VGLILFAGDAFTQLPITNDFVSAKMFLSSISPNLIERQGTSIGKAVDLATRSFTSQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGE+ L A G V +G+ Sbjct: 191 -----------RTIILITDGEDHEPGA----LEAVKRAVDAGIQVNVMGVGSPDG 230 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 64/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +K + +++MI LDVS SM + D ++++ A + ++ KL+ D+N+ + Sbjct: 79 GLKQEKEKRKGIEVMIALDVSNSMLAQ-DVLPSRLENAKQILS------KLVDDMNDD-K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + + + +S T L A Sbjct: 131 VGLVVFAGDAYTQLPITVDYVSAKMFLSNISPELVPRQGTAIGSALDLAIKSFGAKSEAG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K I+ +TDGEN + ++ A + IV IG+ Sbjct: 191 -----------KAIILITDGENH----EDDAIGAAKLAAENNIIVNVIGMGKTDG 230 >gi|4103173|gb|AAD01701.1| von Willebrand factor [Bos taurus] Length = 549 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 75/220 (34%), Gaps = 24/220 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P ++ LD++++LD S + S + D A + + L P Sbjct: 202 DGPHLPTLAPAPDCSQPLDVVLLLDGSST------SPASYFDEMKSFAKAFISKANLGP- 254 Query: 215 VNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + Q ++ + + ++ + + +HL + + + G + L +A I Sbjct: 255 --QLTQVSVLQYGSNTNVDVPWDIHVDKAHLLSLVDPMQREGGPSQVGQALSFAARYITS 312 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H A+ K++V + G ++ + E + + V+ IGI Sbjct: 313 Q----VHGARPSAS--KVVVILVTGSSMDSVEAAAAAA-----RSNRVAVFPIGIGDQYD 361 Query: 333 HEFLRACASPNSFYLVENPHSMYD--AFSHIGKDIVTKRI 370 LR A P + V + D + +G + Sbjct: 362 AAQLRVLAGPGASSNVAELQRIEDLPSMVALGNSFFQRLC 401 >gi|148252253|ref|YP_001236838.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] gi|146404426|gb|ABQ32932.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] Length = 755 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 42/266 (15%), Positives = 86/266 (32%), Gaps = 34/266 (12%) Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 I R++ + + ++ D R +L V P + + ++ Sbjct: 265 KITVRLQAGFALGEVKSHHHQVTIDSPDAKTRIVTLAEGVAPADRDFELTWKPASVAMPS 324 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F ++ +++ IT S D++ V+D S SM T I A Sbjct: 325 VGLFHEQVGDADYLLAFITPPAVAASAQRPQPRDVIFVIDNSGSMGG------TSIRQAK 378 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFG 254 S+ L ++ + ++ F + + F V + L G Sbjct: 379 ASLLYALGRLQPND------RFNVIRFDDTMTVLFPSSVPADAEHVGSATSFVSALEARG 432 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T P ++ A + D++ + +VF+TDG + +QQ Sbjct: 433 -GTEMVPAMRAALTD-----------DGSDSDRVRQVVFLTDG---AIGNEQQLFETITA 477 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACA 340 + R ++ +GI + + A Sbjct: 478 MRGRS-RIFMVGIGSAPNTYLMTRAA 502 >gi|332288897|ref|YP_004419749.1| hypothetical protein UMN179_00822 [Gallibacterium anatis UMN179] gi|330431793|gb|AEC16852.1| conserved hypothetical protein [Gallibacterium anatis UMN179] Length = 345 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 71/200 (35%), Gaps = 15/200 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D+S SM I + ++ ++ +++ P V ++ F+ Sbjct: 3 RLPVYLVIDISESMAG------ENIRQMQEGMSRLVNQLRRDPYALESVYISVIGFAGAA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L + L G T+ L + ++I + + + ++K + Sbjct: 57 GTLAPL---TELLNFYLPRLP-IGSGTSIGTALNHVMDRI--DKEIIPSTAEQKGDWKPL 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + FM+DG + +++ K A + IGI L A + Sbjct: 111 VYFMSDGSSTDDTS--KAIQRWKSLFKHRAKLINIGIGKFADLSTLNEVADLTYRLDDAD 168 Query: 351 PHSMYDAF-SHIGKDIVTKR 369 +Y A I I ++ Sbjct: 169 IERVYQALCETIATSISSQS 188 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 60/171 (35%), Gaps = 27/171 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K N + + + VLD S SM S ++ S+ ++ + G Sbjct: 382 KTNKNSGKPIAAVFVLDTSGSM------SGAPLNSLKASLRNSIKYINSSN------YIG 429 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V++S+ + L ++ + L+ G +T + L A + D Sbjct: 430 VVSYSSNVNVDLELAKFDLNQQAYFMGAVDSLTASG-NTATFSALSQAMIMLRDF----- 483 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++ N ++ ++DG++ S E A+ +Y IG Sbjct: 484 --TKDNPNVSPMVFLLSDGQSNSGSEFSDIDGAIATAQ---IPIYTIGYNA 529 >gi|255530103|ref|YP_003090475.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255343087|gb|ACU02413.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 613 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 74/191 (38%), Gaps = 21/191 (10%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 PW N + + + + K+ + +++ ++DVS SM K+ + I S Sbjct: 215 APWNANHKLVQIGLQGK-KIPTDNLPASNLVFLIDVSGSMN-----QPNKLPLLIASFKL 268 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + E+++ V VV +G ++ + + ++ + LS G + G++ Sbjct: 269 LTEQLRPEDKVAIVVYAG----NSGLVLPSTPGNEKTKIKEALNKLSAGGST-AGGAGIQ 323 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY 323 AY D ++ N I+ TDG+ N+ D+ E +K G + Sbjct: 324 LAYQVATD-NFIKGGNNR--------IILATDGDFNVGASSDKDMESLIEEKRKSGVFLT 374 Query: 324 AIGIRVIRSHE 334 +G + + Sbjct: 375 VLGYGMGNMKD 385 >gi|257083477|ref|ZP_05577838.1| predicted protein [Enterococcus faecalis Fly1] gi|256991507|gb|EEU78809.1| predicted protein [Enterococcus faecalis Fly1] Length = 711 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 114 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 163 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 214 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 215 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 258 >gi|258651507|ref|YP_003200663.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554732|gb|ACV77674.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 593 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 70/218 (32%), Gaps = 35/218 (16%) Query: 174 MMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 M+ V+D+S SM + +ID+A + L + D GL FS ++ Sbjct: 387 MLAVMDISGSMLAQVPGTNGADRIDLAKDAAARGLGLYRADSD------IGLWEFSTRL- 439 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI------------FDMQGMRQH 279 + I L G + L A N + + R Sbjct: 440 --SPTSDHRELIP--ISSLGPDGQGSTGAARLAAALNGLQAIPDGGTGLYDTVLDATRTV 495 Query: 280 CNTEDANYKKIIVFMTDGENLSTKE----DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 D + +++ +TDG N S + R V +I Sbjct: 496 RAGYDPDRVNVVLLLTDGMNDDVNSITMDQLLSTLAAEQDPARPVPVISIAFGPDSDVAA 555 Query: 336 LRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L+ A+ + YL ++P + + F D V +R+ Sbjct: 556 LQQISRATGGATYLSQDPRQIGEIFL----DAVGQRLC 589 >gi|119622317|gb|EAX01912.1| hypothetical protein MGC26733, isoform CRA_a [Homo sapiens] Length = 1080 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 562 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 73/199 (36%), Gaps = 31/199 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M V+D S SM++ + T + ++++ P + F+N+ Sbjct: 712 VMFVVDHSGSMKARDAKNYT-----ANKVKQTIKQIGANPSH-------VYRFNNRPNHE 759 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + I L + ST +K T + K IV Sbjct: 760 ATDK---ADIVSSIDSLLTYKNENRSTNIVK--------ALETAIGNFTTNQYTSKAIVL 808 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACASP--NSFYLVE 349 +TDG + S +Q +AK +G ++ + + + + L+ +S ++ + Sbjct: 809 VTDGYSNSNGLEQVLR----DAKLKGIAIHTVSVGSYTTVNEKLLKDISSETNGTYQNIT 864 Query: 350 NPHSMYDAFSHIGKDIVTK 368 + +++ + I I+ K Sbjct: 865 SIENLHGSLQAIITTILCK 883 >gi|285808484|gb|ADC36008.1| putative von Willebrand factor type A domain protein [uncultured bacterium 259] Length = 297 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 58/167 (34%), Gaps = 30/167 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + S +D+ +VLD S SM + ++ ++++ +L+E Sbjct: 63 PQQITSLSIESTPIDLTLVLDTSSSMT----GMMERLKGDVRAVEGLLDESD-------- 110 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++GL+TFS+ + E + ST + A R Sbjct: 111 -RAGLITFSSSVREVSPMH---ERGDPAPAAALAPAGSTAFYQAVVAALLSATTPG--RP 164 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 H + M+DG++ + D + A++ ++Y + Sbjct: 165 HLA----------LVMSDGDDNISLLDGA--DVSDLARRSETVLYVV 199 >gi|30268323|emb|CAD89964.1| hypothetical protein [Homo sapiens] Length = 1060 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 562 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 650 >gi|282897675|ref|ZP_06305674.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197354|gb|EFA72251.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 464 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 69/204 (33%), Gaps = 43/204 (21%) Query: 170 ARLDMMIVLDVSRSMESFFDSSI---------------------TKIDMAIKSINAMLEE 208 V+D S SM + +KID+ I+S+ ++ Sbjct: 42 PPTSFTFVIDTSGSMYEVVAGDVEDTGVTYQQDGKEYKQVTGGKSKIDIVIESLLRLVNS 101 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 KL + +V F + + L ++ IK L F T GL+ A Sbjct: 102 GKL----KQQDRVSIVQFDDSASQIIGLTSATETKQIETAIKKLRDFSGGTRMGLGLRRA 157 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++ + + + K+ ++F TDG+ ++ Q N R + A+G Sbjct: 158 FDILSEQE----------MTVKRALLF-TDGQTF---DEDQCQSIANHFATRNIPITALG 203 Query: 327 IRVIRSHEFLRACA--SPNSFYLV 348 + + + L + + + V Sbjct: 204 VGEEFNEDLLTHLSDYTGGKLFYV 227 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 18/192 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+++V+D S+ M +T I + SI + ++ + P + G+VT+++ Sbjct: 34 LDVVLVVDNSKGMTK---DGLTAISANLASIFSD-AQIGINPSNPKTTRIGMVTYNSNAT 89 Query: 232 EFFLLEWG---VSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDAN 286 L + + + +S+ + S L+ A N ++ ++ Sbjct: 90 VDAHLNYDWPKNNDVLNFYSTMSEISEDSTSYVAHGLQAAQNLLYSESFGS-----NRSH 144 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACASPNS 344 YKK+I+ + N K RG + I E L +SP Sbjct: 145 YKKVIIVC--ASTFKGTGKNDPIPVANRLKGRGVKILTIAYDQGDEKVVEELAKISSPGL 202 Query: 345 FYLVENPHSMYD 356 + + ++ + Sbjct: 203 SFKQDANMTLIE 214 >gi|331266438|ref|YP_004326068.1| hypothetical protein SOR_1070 [Streptococcus oralis Uo5] gi|326683110|emb|CBZ00728.1| conserved hypothetical protein [Streptococcus oralis Uo5] Length = 863 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 52/309 (16%), Positives = 97/309 (31%), Gaps = 84/309 (27%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITK 194 K T I + + + ITS + +QTD LD+++V D+S SM +SF +I++ Sbjct: 47 KTITPISGQDDKYELSLDITSKLGTETQTD-PLDVVLVADLSGSMQNQDVQSFDGRTISR 105 Query: 195 IDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE-------------EFFLLEWG 239 ID ++ LI + N+ + +V F KI+ F W Sbjct: 106 IDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWDGNKWRLFRPYWP 165 Query: 240 VSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQG----------------- 275 + + +S G S N+ YN Sbjct: 166 YERMTKYYDGVSPWDDANTILGWSNNARAAKTAVYNMSIAGGNSIGTESGIGTGTNIGAG 225 Query: 276 ---MRQHCNTEDANYKKIIVFMTDG----------------ENLSTKEDQQSLYYCNEAK 316 Q + +N KK+++ ++DG N + ++ + Sbjct: 226 LTLANQLMGSARSNAKKVVILLSDGFANMVYDANGYTIYNYNNEDPNIETAPQWFWDRLN 285 Query: 317 KRGAIV-----------YAIGIRVIRSHEFL---------RACASPNSFYLVENPHSMYD 356 + Y+I R + + + + PN + + D Sbjct: 286 NNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYIRQHNASIPNEILSANDEDQLRD 345 Query: 357 AFSHIGKDI 365 +F +I I Sbjct: 346 SFKNITDKI 354 >gi|319778364|ref|YP_004129277.1| PpkA [Taylorella equigenitalis MCE9] gi|317108388|gb|ADU91134.1| PpkA [Taylorella equigenitalis MCE9] Length = 656 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 72/189 (38%), Gaps = 30/189 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D + SM+ + I+ +++ + ++++ D+ + V+ GLV F + E Sbjct: 224 VVFVIDSTISMDPY-------INKTREAVRELYKQIEK-DDLLDQVKFGLVAFRSSTEAV 275 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA------YNQIFDMQGMRQHCNTEDANY 287 LE+ S + + + LK A +N+ + + ++ Sbjct: 276 PDLEY-TSKMYVNPNEVKDGKDFMDKVASLKQAKVSSKEFNEDSYAGINQALNDINWNDF 334 Query: 288 -KKIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + IV +T DG N + + EA+ +G +Y + L+ A Sbjct: 335 GARYIVLITDAGAIDGNNPLSSTGFGAQQLRQEAQHKGVAIYTLH---------LKTQAG 385 Query: 342 PNSFYLVEN 350 + E+ Sbjct: 386 SKNHQEAES 394 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 66/180 (36%), Gaps = 31/180 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDS-----SITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++V+D+S SME S ++++ A + + + + Sbjct: 123 EVQTREAFGRDVLMVVDLSGSMEEKDFSTAAGEQLSRLTAAKRVLRNFVTQ-------RQ 175 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + + L + G STN + Sbjct: 176 GDRFGLILFG---DAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAI--------- 223 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ + + +I++ +TDG T + A +G +Y I + + Sbjct: 224 GLGIKVFEQSPSTSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPEN 281 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 59/172 (34%), Gaps = 27/172 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD+++ +D S SM + D ++ A + ++ + + GLV F+ Sbjct: 87 QRGLDILVAIDTSNSMLAE-DIQPNRLARARLAALDLMHRARTD-------RMGLVAFAG 138 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + + I L T + A + Sbjct: 139 TAFLQCPLTLDDAAFSQSIDSLDTRTISEGGTALAEAINTARETF-----------KNEK 187 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + K++V TDGE+ D ++ +A G +++ IGI E LR Sbjct: 188 DNHKVLVLFTDGEDQ----DMGAVSAAEKAAAEGMLIFTIGIGTPDG-ELLR 234 >gi|73745523|emb|CAI61969.2| putative TerY1 protein [Escherichia coli] Length = 239 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 30 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSSA 83 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L ++ L+ G +T+ L + I D + Sbjct: 84 RQAVPL---TDLLSFQMPALTASG-TTSLGEALTLTASSIAKEVQKTTADTKGDWRP--L 137 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 138 VFLMTDGSPNDD 149 >gi|221040994|dbj|BAH12174.1| unnamed protein product [Homo sapiens] Length = 951 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 166 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 218 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 219 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 261 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 262 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 307 >gi|239617868|ref|YP_002941190.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239506699|gb|ACR80186.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 612 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 76/212 (35%), Gaps = 34/212 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +D++IVLD S SM + ++ A +K + + ++ LVTF ++ Sbjct: 68 SDMDIVIVLDSSGSMRNVIGIMDDLMEKA----------IKKLKEEGLRLRYALVTFGDE 117 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTN---STPGLKYAYNQIFDMQGMRQHCNTEDAN 286 I E ++ + FG + S L A N FD++ Sbjct: 118 IRELKGFTSRDDVFISWLRDVVPFGGGDDPEISLDALSMASNLPFDVKA----------- 166 Query: 287 YKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 KK+IV +T+ E+ + + +E +G + + V E+++ S Sbjct: 167 -KKVIVLITNAPAHFVEDGTAYSNVSVNGLIDELNGKGVELL---LLVPPEPEYVKISES 222 Query: 342 -PNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 F+ + AF + R+ Sbjct: 223 LGGKFFNIFKIAGPNKAFEELANLNFRTRLVE 254 >gi|75812639|ref|YP_320257.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] gi|75705395|gb|ABA25068.1| hypothetical protein Ava_A0010 [Anabaena variabilis ATCC 29413] Length = 405 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 63/177 (35%), Gaps = 20/177 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++++LD S SM KI+ AIK+I + +K D Q +V F Sbjct: 82 IIVLLDFSGSMNKLDSRGTKKIEGAIKAIRQLTSVLK---DRGENTQVAIVPFGEAGANC 138 Query: 229 ----KIEEFFL---LEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR-- 277 + + L LQ + YL+ STN LK A + + R Sbjct: 139 PQGYPVNKDTLDKFFAANDFKLQNNLDYLASLTPCASTNLYEPLKKAVKFLANTSDSRFY 198 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSH 333 ++ + I+ ++DG + E Q K+ IV+ +G + Sbjct: 199 LPKDSPQTPPRLSIILLSDGYHNFANEAQDFQSLTTLLKRNTNIIVHTLGYGLTPEQ 255 >gi|260778728|ref|ZP_05887620.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] gi|260604892|gb|EEX31187.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] Length = 463 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 43/303 (14%), Positives = 83/303 (27%), Gaps = 56/303 (18%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 KG + +++ I P + L G I+ F +K L Sbjct: 11 RKSKGLVALISVIAAPFLILATGTAIDSGRAFLVKAKL-------------------FAA 51 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + + D D ++ SI Sbjct: 52 VDAAGIAAARAVAEGEDAARDAAIKFYNANLPTDYHDSTTASPTVTFGYDSFGNISIDLS 111 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + ++ F + + T+ +D+++V+D + S+ + Sbjct: 112 ASAEVSTTFLGVF----GHSSLEISATAQTVRR-----PVDLVLVVDNTTSLRLGSIGDV 162 Query: 193 TK--IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KY 249 T + + I E + LV ++ E + H + I Sbjct: 163 TDDVVARSKDFITNFNESFD---------RIALVKYAYGAEVPVAFQSSRGHSRSDITTE 213 Query: 250 LSKFGVS-------TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + F TNS+ G+ A + + N D K+IVF TDG + Sbjct: 214 IDAFDFGSLSSLQYTNSSEGIYLALDAL---------RNVTDPANLKVIVFFTDGAPNTF 264 Query: 303 KED 305 + Sbjct: 265 ASE 267 >gi|260803822|ref|XP_002596788.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] gi|229282048|gb|EEN52800.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] Length = 547 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 75/214 (35%), Gaps = 35/214 (16%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S+++ +P + D++ ++D S S+ + +A++ I L+ Sbjct: 19 TTSQYLPVPQPEPQPPGPSSGCAADIVFIVDDSSSI------LGPRFGLALQFIIDFLQC 72 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR-----KIKYLSKFGVSTNSTPGL 263 + + G++ ++ L G+ + I L++ G + + L Sbjct: 73 F-----TDQDIGIGVILYNCVPRTGIPL--GMYTISNPGLPFAISNLTQEGGLSRTGHAL 125 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---ENLSTKEDQQSLYYCNEAKKRGA 320 + + D R + + ++DG + + + A+ G Sbjct: 126 SF----MTDTSKFRTGI-------PRTAILLSDGFPQSDANAQAMDDYEAQAEAARDAGI 174 Query: 321 IVYAIGIRVIR--SHEFLRAC-ASPNSFYLVENP 351 +YA+G+ + + L S + + +NP Sbjct: 175 DLYAVGVGAAGLVNWDVLETITGSSDRVFRSDNP 208 >gi|195579432|ref|XP_002079566.1| GD21946 [Drosophila simulans] gi|194191575|gb|EDX05151.1| GD21946 [Drosophila simulans] Length = 1100 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 150 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 203 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 204 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 260 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 261 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 311 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 312 SNKGFFVQ 319 >gi|195475490|ref|XP_002090017.1| GE19394 [Drosophila yakuba] gi|194176118|gb|EDW89729.1| GE19394 [Drosophila yakuba] Length = 1136 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 150 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 203 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 204 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 260 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 261 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 311 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 312 SNKGFFVQ 319 >gi|195338633|ref|XP_002035929.1| GM14254 [Drosophila sechellia] gi|194129809|gb|EDW51852.1| GM14254 [Drosophila sechellia] Length = 1119 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 150 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 203 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 204 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 260 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 261 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 311 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 312 SNKGFFVQ 319 >gi|194857574|ref|XP_001968984.1| GG24200 [Drosophila erecta] gi|190660851|gb|EDV58043.1| GG24200 [Drosophila erecta] Length = 1136 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 150 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 203 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 204 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 260 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 261 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 311 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 312 SNKGFFVQ 319 >gi|161076922|ref|NP_001097164.1| CG4587, isoform C [Drosophila melanogaster] gi|320545115|ref|NP_001188817.1| CG4587, isoform D [Drosophila melanogaster] gi|157400167|gb|ABV53684.1| CG4587, isoform C [Drosophila melanogaster] gi|318068460|gb|AAF53476.3| CG4587, isoform D [Drosophila melanogaster] Length = 1243 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 254 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 307 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 308 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 364 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 365 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 415 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 416 SNKGFFVQ 423 >gi|161076920|ref|NP_001097163.1| CG4587, isoform B [Drosophila melanogaster] gi|157400166|gb|ABV53683.1| CG4587, isoform B [Drosophila melanogaster] Length = 1209 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 203 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 256 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 257 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 313 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 314 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 364 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 365 SNKGFFVQ 372 >gi|254881906|ref|ZP_05254616.1| tellurium resistance protein [Bacteroides sp. 4_3_47FAA] gi|319641091|ref|ZP_07995795.1| tellurium resistance protein [Bacteroides sp. 3_1_40A] gi|254834699|gb|EET15008.1| tellurium resistance protein [Bacteroides sp. 4_3_47FAA] gi|317387335|gb|EFV68210.1| tellurium resistance protein [Bacteroides sp. 3_1_40A] Length = 348 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 62/167 (37%), Gaps = 16/167 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++DVS SM I + +++E++ P ++ F+ K Sbjct: 3 RLPVYFLVDVSESMVG------APIQQVQDGMRMIVQELRTDPYALETAYISVIAFAGKA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L + G T+ L++ + DM TE K Sbjct: 57 KCVSPLT----ELYKFYPPTFPIGGGTSLGNALEFLMD---DMDKTLVRTTTEQKGDWKP 109 Query: 291 IVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 IVF+ TDG + + + N+ + + A + AI I + + L Sbjct: 110 IVFLFTDGNPTDNPSNAFTR-WNNKYRGK-ANIVAISIGDNVNTQLL 154 >gi|206575542|ref|YP_002235853.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] gi|206570386|gb|ACI12032.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] Length = 212 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I + AML ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHSVNVGVQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E+ L I+ T + L+ + D + Sbjct: 57 REYVPLTALADFQFSDIE--VPSAGGTFTGAALECLIQCVDRDIRRSDGEQKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + MTDG +++ ++ + A + HE L+ Sbjct: 113 VFLMTDGTPSDAWAYGEAVK--EVQRRSFGSIIACAVGPKAGHEHLK 157 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 48/120 (40%), Gaps = 15/120 (12%) Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D S SM +I ++ AI S ++++ V+ LV++ ++ Sbjct: 149 IDQSSSMG---GQNIARLKSAIASGQRFVKKMLPKGMATEGVRIALVSYDHEPHRLSDFT 205 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + L +KI+ L+ T++ GLK A N + K I+ M+DG Sbjct: 206 KDTAFLCQKIRALTPI-WGTHTQGGLKMARNIMAT-----------STAVDKHIILMSDG 253 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 27/63 (42%), Gaps = 3/63 (4%) Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIR---SHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 ++ AK G ++ IG + ++ L+ A+ + + P ++ AF +I Sbjct: 359 NAAINEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDENHFFTATPANLAAAFDNIA 418 Query: 363 KDI 365 + I Sbjct: 419 QTI 421 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 43/153 (28%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-------------- 299 T + GLK+ + + + + +D +K+I+ +TDG+ Sbjct: 264 ASGTVISEGLKWGRHVLTPEAPFTEGSSAKD--IRKVIIVLTDGDTEDGKCGGSYAINYT 321 Query: 300 --------------------LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLR 337 K + L + K+ G V+AI S + ++ Sbjct: 322 PNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKVKEAGIEVFAIRFGDSDSVDVSLMK 381 Query: 338 ACASP-----NSFYLVENPHSMYDAFSHIGKDI 365 + AS + +Y + + + D F IG+ + Sbjct: 382 SIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQL 414 >gi|38639543|ref|NP_943312.1| hypothetical protein LV052 [Klebsiella pneumoniae] gi|168998770|ref|YP_001688038.1| TerY2 [Klebsiella pneumoniae NTUH-K2044] gi|38016641|gb|AAR07662.1| hypothetical protein LV052 [Klebsiella pneumoniae] gi|238549791|dbj|BAH66142.1| hypothetical protein KP1_p253 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I + AML ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHSVNVGVQAMLSALRQDPYALESVHISIITYDNEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E+ L I+ T + L+ + D + Sbjct: 57 REYVPLTALADFQFSDIE--VPSAGGTFTGAALECLIQCVDRDIRRSDGEQKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + MTDG +++ ++ + A + HE L+ Sbjct: 113 VFLMTDGTPSDAWAYGEAVK--EVQRRSFGSIIACAVGPKAGHEHLK 157 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 15/186 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 390 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 446 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 447 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 497 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 498 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 555 Query: 356 DAFSHI 361 F + Sbjct: 556 AVFRQV 561 >gi|221044732|dbj|BAH14043.1| unnamed protein product [Homo sapiens] Length = 560 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 187 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 240 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 241 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 293 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 294 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 342 >gi|118579690|ref|YP_900940.1| magnesium chelatase [Pelobacter propionicus DSM 2379] gi|118502400|gb|ABK98882.1| protoporphyrin IX magnesium-chelatase [Pelobacter propionicus DSM 2379] Length = 689 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 55/137 (40%), Gaps = 23/137 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM + ++ + ++ ++L + + G+++F + E Sbjct: 505 LLFVVDASGSM-----GARGRMAASKGAVMSLL-----LDAYQKRDRVGMISF--RKNEA 552 Query: 234 F---LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F V + ++ + G T + + +Y Q+ + D + I Sbjct: 553 FVNLPPTTSVELAGKLLEEMP-VGGRTPLSAAIAKSYEQL-------RGVLGRDPTARPI 604 Query: 291 IVFMTDGENLSTKEDQQ 307 ++F+TDG++ D + Sbjct: 605 VIFITDGKSNVALGDGR 621 >gi|322697455|gb|EFY89235.1| U-box domain-containing protein [Metarhizium acridum CQMa 102] Length = 757 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 53/155 (34%), Gaps = 27/155 (17%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVK 210 + D+++VLDVS SME + +T +D+ + ++E + Sbjct: 40 VPPRAPAAQIEHVPCDIVLVLDVSTSMEDDAPVPGEAEETGLTVLDLTKHAALTIIETL- 98 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 N + G+V+F+ L +RKIK L G STN G++ Sbjct: 99 -----NEKDRLGIVSFATNSTIVQTLTHMDISNKEEARRKIKALDPNG-STNLWHGIRDG 152 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + ++ +TDG Sbjct: 153 IQAFEQSSENGNI---------RAMMVLTDGMPNH 178 >gi|305680383|ref|ZP_07403191.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659914|gb|EFM49413.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 880 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 69/221 (31%), Gaps = 38/221 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIK-SINAMLEEV-KLIPDVNNVVQSGLVTFSNKI 230 D++ V+D S S++ T + ++ ++ ++ K + N + L F Sbjct: 60 DLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKLSGFGQG- 118 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD------MQGMRQHCNTED 284 +G + G T G N +F M + D Sbjct: 119 YRSQPDVYG-----GWVNVRDHAGDLTPPIQGFDQRNNDVFTDYGTALNGAMADLASRPD 173 Query: 285 ANYKKIIVFMTDG----ENLSTKEDQQSLYYC------NEAKKRGAIVYAIGIRVIRS-- 332 K I+F TDG + + C + + ++ +G+ Sbjct: 174 PESCKAILFFTDGKLTVQGDQKADIVAQKAICSADGQVKKLRDANIQLFTVGLIPSGEES 233 Query: 333 -HEFLRACASP----------NSFYLVE-NPHSMYDAFSHI 361 + LR+ + +F+ E N S++ AF I Sbjct: 234 PEQILRSMSEGNDCAIDTVPNGAFFNAESNAASLFSAFRSI 274 >gi|261856580|ref|YP_003263863.1| Vault protein inter-alpha-trypsin domain protein [Halothiobacillus neapolitanus c2] gi|261837049|gb|ACX96816.1| Vault protein inter-alpha-trypsin domain protein [Halothiobacillus neapolitanus c2] Length = 671 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/124 (22%), Positives = 47/124 (37%), Gaps = 24/124 (19%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 V+DVS SM F ++ A + +L +K N + SG +++ Sbjct: 315 FVVDVSGSMYGFP------LNTASDLMRELLSSLKPQETFNILFFSG----GSRVLSPTP 364 Query: 236 LEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ +LQR + + + G T P LK A+ + + IV Sbjct: 365 LQATPENLQRAMTMMRSIQGGGGTELLPALKTAF------------AMPRTEDTARSIVV 412 Query: 294 MTDG 297 +TDG Sbjct: 413 ITDG 416 >gi|225022540|ref|ZP_03711732.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] gi|224944663|gb|EEG25872.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] Length = 880 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 35/221 (15%), Positives = 69/221 (31%), Gaps = 38/221 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIK-SINAMLEEV-KLIPDVNNVVQSGLVTFSNKI 230 D++ V+D S S++ T + ++ ++ ++ K + N + L F Sbjct: 60 DLIFVIDESASLKGHGGKPATDPNNIRVPAMQDLVTQLGKFAQESNADINVKLSGFGQG- 118 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD------MQGMRQHCNTED 284 +G + G T G N +F M + D Sbjct: 119 YRSQPDVYG-----GWVNVRDHAGDLTPPIQGFDQRNNDVFTDYGTALNGAMADLASRPD 173 Query: 285 ANYKKIIVFMTDG----ENLSTKEDQQSLYYC------NEAKKRGAIVYAIGIRVIRS-- 332 K I+F TDG + + C + + ++ +G+ Sbjct: 174 PESCKAILFFTDGKLTVQGDQKADIVAQKAICSADGQVKKLRDANIQLFTVGLIPSGEES 233 Query: 333 -HEFLRACASP----------NSFYLVE-NPHSMYDAFSHI 361 + LR+ + +F+ E N S++ AF I Sbjct: 234 PEQILRSMSEGNDCAIDTVPNGAFFNAESNAASLFSAFRSI 274 >gi|260799772|ref|XP_002594858.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] gi|229280095|gb|EEN50869.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] Length = 930 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 33/110 (30%), Gaps = 15/110 (13%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++++E ++ I + +T + L A + Sbjct: 235 NSRMEFNLGEYMNMTTTLSAINAIKYQKGATYTPKALDRARKEALWRGEA---------- 284 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 K+++ ++DG G IVYAIG+ E L Sbjct: 285 VPKVMIVLSDGR-----SAIDVTEASKALADAGIIVYAIGVGRADHDELL 329 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 12/77 (15%), Positives = 30/77 (38%), Gaps = 6/77 (7%) Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEA---KKRGAIVYAIGIRVIRSHEFLRACA 340 + + V +TDG + Y ++A + G VY++ + + + A Sbjct: 450 RDDVPRAAVIITDGSAQGDADGLVMGDYADQADEARDAGITVYSVPNGIPGFEDIVALEA 509 Query: 341 ---SPNSFYLVENPHSM 354 P++ + + +P + Sbjct: 510 ISGGPDNVFSMYDPCQL 526 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 55/169 (32%), Gaps = 24/169 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D++ + D S S+ T ++ I+ L + V G+ Sbjct: 19 KANAQCSFDVVFLADSSSSVRPH---GTTAANIYIRDAIDCLTMYNVYD-----VGVGVT 70 Query: 225 TFSNKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F+ E L G L + ++ G + ++ ++ D R Sbjct: 71 HFTCTSTEHIPLGHYTLGDPELTAAVGNIAHTGGLSRIGLAIR----KMKDTTSFRSG-- 124 Query: 282 TEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ V +TDG + + + EA++ G +YA+ Sbjct: 125 -----VPRVAVVLTDGVGQGDANERVDDYAEQAEEAREAGITLYAVATG 168 >gi|262272104|gb|ACY40027.1| MIP14152p [Drosophila melanogaster] Length = 603 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 49 ASSPKDIMILLDASSSMTE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 102 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 103 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 159 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 160 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 210 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 211 SNKGFFVQ 218 >gi|302552815|ref|ZP_07305157.1| predicted protein [Streptomyces viridochromogenes DSM 40736] gi|302470433|gb|EFL33526.1| predicted protein [Streptomyces viridochromogenes DSM 40736] Length = 842 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 55/163 (33%), Gaps = 11/163 (6%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+++ ++ +LD S SM +D S + +++ + E + V V Sbjct: 640 SALERYGNAHGPGRVLFLLDSSGSMARLWDGSSGGPGILKQTLTGLGGEDE--YGVWAVH 697 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G LL +G + + L V + + M+ Sbjct: 698 GLG----DGGGTHTELLGFGTHARKDAERALDTAKVRDAEADPPAALLDALDFMRE---- 749 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 D ++IV++TDGE+ + + A+ G V Sbjct: 750 -RGGDDERPQLIVYLTDGEDDGRLSGDRRQQVADRARAAGVPV 791 >gi|256786708|ref|ZP_05525139.1| hypothetical protein SlivT_19662 [Streptomyces lividans TK24] gi|289770603|ref|ZP_06529981.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|289700802|gb|EFD68231.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 475 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 71/217 (32%), Gaps = 26/217 (11%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSIN 203 P T + +++K T AR + +VLD S SM ++ D S + ++ Sbjct: 267 APGLTTAYKAAG---AALKKRDLTGARAKVHLVLDRSASMRPYYKDGSAQALAEQTLALA 323 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 A L V + FS +++ L ++ + KI L + Sbjct: 324 AHL------DPEATVH---VTFFSTEVDGTGDLT--LTDHENKIDELHA-ALGRMGRTSY 371 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + H +++F TDG + QSL A Sbjct: 372 HAAVEAVLA-----HHAKEAAPGTPALVIFQTDGAPDAKTPATQSLTDA-AANHPNVHFS 425 Query: 324 AIGIRVIRSHEF--LRACASPN-SFYLV-ENPHSMYD 356 + + F LR + N S +L E P + D Sbjct: 426 FVAFGDPENKAFDYLRKLKTGNASHFLAGETPKELTD 462 >gi|21222202|ref|NP_627981.1| hypothetical protein SCO3791 [Streptomyces coelicolor A3(2)] gi|10432495|emb|CAC10329.1| hypothetical protein SCH63.38 [Streptomyces coelicolor A3(2)] Length = 445 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 71/217 (32%), Gaps = 26/217 (11%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSIN 203 P T + +++K T AR + +VLD S SM ++ D S + ++ Sbjct: 237 APGLTTAYKAAG---AALKKRDLTGARAKVHLVLDRSASMRPYYKDGSAQALAEQTLALA 293 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 A L V + FS +++ L ++ + KI L + Sbjct: 294 AHL------DPEATVH---VTFFSTEVDGTGDLT--LTDHENKIDELHA-ALGRMGRTSY 341 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + H +++F TDG + QSL A Sbjct: 342 HAAVEAVLA-----HHAKEAAPGTPALVIFQTDGAPDAKTPATQSLTDA-AANHPNVHFS 395 Query: 324 AIGIRVIRSHEF--LRACASPN-SFYLV-ENPHSMYD 356 + + F LR + N S +L E P + D Sbjct: 396 FVAFGDPENKAFDYLRKLKTGNASHFLAGETPKELTD 432 >gi|317029635|ref|XP_001392021.2| von Willebrand and RING finger domain protein [Aspergillus niger CBS 513.88] Length = 1011 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 92/296 (31%), Gaps = 45/296 (15%) Query: 41 SHIFFMK--TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN-E 97 I K + + D ++ + + W S + Sbjct: 375 GSILIKKHLNSMEAAADEPILAL------NLSVSELPCFYLRFPNRSQLELWRRSLLDLH 428 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 ++ N+ D S + + S S Y T I YTN Sbjct: 429 PMESTSPNNDYDYDNSGAEEEEYRNNPIRRQASINSSYGAARSINTAITDYTNM------ 482 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + LD+++V+ VS SM+ KI + ++ +++ + Sbjct: 483 --GTESPAPSIHIPLDLVVVIPVSSSMQGL------KITLLRDALRFLVQSLGPRD---- 530 Query: 218 VVQSGLVTFSN-----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLVTF + I WG + ++ + G + ++ A N D Sbjct: 531 --RMGLVTFGSSSGGVPIVGMTTKSWG--GWPKILESIRPVGQKSLRADVVEGA-NVAMD 585 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + R+ N I+ ++D ST + + + A+ +++ G+ Sbjct: 586 LLMQRKSSNPIST-----ILLISD---TSTSDPDSVDFVVSRAEAAKVNIHSFGLG 633 >gi|330918891|ref|XP_003298384.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] gi|311328422|gb|EFQ93524.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] Length = 1367 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 16/177 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + + LD S S S +T++D+ + +A + + L + ++ V GLVTFS K Sbjct: 939 FKVYLGLDSSAS---KQKSHMTRLDVLKQMFDAYINRL-LAYNFHSHV--GLVTFSTKAL 992 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + V + + K+ L G + ++ I Q Q + K I Sbjct: 993 VAQKITNAVENFRHKLNNLKASGDTA--------IWDSIALAQDQIQQYAEQYPGSKLRI 1044 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + ++DGE+ K ++ + + V + + + E C+ + Sbjct: 1045 ICISDGEDN--KSQNTAVDLASRLIRDDITVDSFCLDDHSNKELQTLCSLSGGYSFA 1099 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 76/195 (38%), Gaps = 35/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 + +VLDVS SM SS +I ++++ + ++ + N G+V F S+ Sbjct: 305 VSLVLDVSGSM-----SSSNRIGRQLQAVELFVVQI-----IENGAHVGIVKFSSSASVV 354 Query: 233 FFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKY--AYNQIFDMQGMRQHCNTEDANYK 288 L++ + ++K L G TN G++ A N+ FD Sbjct: 355 SSLVKINTQAQRDQLKSLIPRTAGGGTNICAGIRAGIALNKNFDGSSYGTE--------- 405 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYL 347 IV +TDGE ++ + + GAI++ I + + E + + Sbjct: 406 --IVLLTDGE-----DNLDTSLCFKDITDSGAIIHVIALGPNAAKELETIANMTGGLRFN 458 Query: 348 VEN---PHSMYDAFS 359 + + + DAFS Sbjct: 459 ALDKVEANELIDAFS 473 >gi|288919483|ref|ZP_06413814.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349086|gb|EFC83332.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 533 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/231 (14%), Positives = 65/231 (28%), Gaps = 30/231 (12%) Query: 150 NSRHIVMPITSSVKVNSQT--------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 H+ +P + V + A + VLD+S SME S + I Sbjct: 310 GDTHVELPFPGTAAVADELILAYLNHFRAPTHAIFVLDLSGSMEGDRISDLRS--ALIGL 367 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--------FFLLEWGVSHLQRKIKYLSKF 253 A + LV F + + + L+R ++ + Sbjct: 368 TGADSSLTARFTSFRAREKITLVPFDSGVNRISDFAVTDPSPDSPELKELRRAVEGFNAG 427 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G + Y+ + D +Y +V +TDGEN + L + Sbjct: 428 GDTA--------IYSALRAAYDRAAADLARDGSYYTSVVLLTDGENTTGASADDFLAHYR 479 Query: 314 EAK--KRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVENPHSMYDAFSHI 361 R + + + + + + + + D F I Sbjct: 480 SLSPAARAVPTFTVLFGDADPDALRQIADVTGGTVFDAGSTS-LPDVFKDI 529 >gi|190410234|ref|YP_001965735.1| terY1 [Klebsiella pneumoniae] gi|146151027|gb|ABQ02793.1| terY1 [Klebsiella pneumoniae] Length = 239 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 30 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSA 83 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L ++ L+ G +T+ L + I D + Sbjct: 84 RQAVPL---TDLLSFQMPALTASG-TTSLGEALSLTASSIAKEVQKTTADTKGDWRP--L 137 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 138 VFLMTDGSPNDD 149 >gi|108758819|ref|YP_634592.1| hypothetical protein MXAN_6470 [Myxococcus xanthus DK 1622] gi|108462699|gb|ABF87884.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 914 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 75/200 (37%), Gaps = 26/200 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVT 225 Q A L + +++D S SM TK+++A + + A L + +V+ ++V + Sbjct: 390 QRRASLALSVLMDASCSMGMTVPDGRTKMELAAEGVVAALTLLNPKDEVSVHMVDTAA-- 447 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++I +E G+ + G L+ +I + +H Sbjct: 448 --HEIFPLSPVEAGLPL--DAVARGFSGGGGIYVGEALRAGRTEILRSEKPTRH------ 497 Query: 286 NYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLRACA-- 340 ++ +D + +D Q++L +++ V IG+ V + LR A Sbjct: 498 -----VLLFSDAADSEEPDDYQRTLA---HLREQEVTVSVIGLGVPSDPDADLLREVAHR 549 Query: 341 SPNSFYLVENPHSMYDAFSH 360 Y E+ S+ FS Sbjct: 550 GGGRVYFAEDAMSLPRIFSQ 569 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 49/255 (19%), Positives = 83/255 (32%), Gaps = 48/255 (18%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIV-MPITSSVKVNSQTDARLDMM-----------IVLD 179 Y+ P +P +S+ I+ P S + TDA +D VLD Sbjct: 119 TDTYRRPTTANATLPAELSSQTIIEAPFPGS---KTVTDALIDAYTNQFRVPGETTFVLD 175 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----SGLVTFSNKIEEFFL 235 VS SM +I + +++ ++ N ++ ++ FS E Sbjct: 176 VSGSM------LGQRITLLKDTMSDLISGGATTDLANVSLRGREKVSIIPFSFGPHEVIS 229 Query: 236 LEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 G LQ++++ L G T + AY + +Y Sbjct: 230 ETLGAVGSPSRIDLQQRVEALQADG-GTGIYDAVLAAY------------AESAGGDYIP 276 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKR--GAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 IV MTDGE + + Q L N V+ I + + A + + Sbjct: 277 SIVLMTDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGKTF 336 Query: 347 LVENPHSMYDAFSHI 361 N + +AF I Sbjct: 337 DAIN-GDLDEAFKEI 350 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 24/192 (12%), Positives = 58/192 (30%), Gaps = 38/192 (19%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 I ++ K ++ + + K + G+V F + L + ++IK Sbjct: 13 IDRLSAVKKVLSDFVAKRKGD-------RLGVVLFGDHAYLQTPLTADRQTVIQQIKQTV 65 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 V + G D G+ + +++++ ++DG N + + Sbjct: 66 IGLVGQRTAIG---------DGIGLGTKTFVDSDAPQRVMILLSDGSNTA--GVLDPIEA 114 Query: 312 CNEAKKRGAIVYAIGIRVIR-------------------SHEFLRACA-SPNSFYLVENP 351 AKK A +Y +G+ + + ++ + Sbjct: 115 AEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDA 174 Query: 352 HSMYDAFSHIGK 363 + + I K Sbjct: 175 EQLEKIYDTINK 186 >gi|146338842|ref|YP_001203890.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. ORS278] gi|146191648|emb|CAL75653.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. ORS278] Length = 637 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 83/210 (39%), Gaps = 32/210 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + +++DVS S +++ D + +D+ +++ + + D +++V TF+++ Sbjct: 447 DLAVTLLVDVSLSTDAWIDG-VRVLDVEKEALLVLAHGLSACGDSHSIV-----TFTSRR 500 Query: 231 EEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + LE ++R+I L G T +++A ++ Sbjct: 501 NSWVRLETLKAFGETMSGQVERRIGALRP-GYYTRIGTAVRHAAGELAA----------- 548 Query: 284 DANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K+++V +TDG+ + + EA++ G V+ + + + ++ Sbjct: 549 RPERKRLLVVLTDGKPNDVDHYEGRFAIEDTRKAVQEARRAGVAVFGVTVD-KSAQSYVP 607 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 P + +V N + A + + + Sbjct: 608 TLFGPAGYAIVGNIRRLPAALPALYRQLAH 637 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 31/160 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM +S +++ ++ L ++ + +G+VTF + Sbjct: 307 LVLVLDKSGSM-----ASGDRLNRLNQASKLFLLQI-----IEKGSWAGMVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + QR I L G T+ GL+ A+ I ++ Sbjct: 357 SELIQIETDAQRNSLITRLPTVAGGGTSICSGLRTAFTVI-----KKKFSTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRV 329 IV +TDGE+ ++ C +E K+ GAI++ + + Sbjct: 407 IVLLTDGEDN-------TISTCFDEVKQSGAIIHTVALGP 439 >gi|154302354|ref|XP_001551587.1| hypothetical protein BC1G_09961 [Botryotinia fuckeliana B05.10] gi|150855449|gb|EDN30641.1| hypothetical protein BC1G_09961 [Botryotinia fuckeliana B05.10] Length = 254 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 70/201 (34%), Gaps = 14/201 (6%) Query: 165 NSQTDARLDMMIVLDVSRSME------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 N ++I LD S SM + ++ ++ ++ V + + Sbjct: 2 NPNNGPDEAIVICLDKSISMRALLGNNWIGNGQVSTLNRFDEAKQVFRNVVSRLSAYHLN 61 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V GLVT +K E+ + + K++ T+ L AY+ + Q ++ Sbjct: 62 VHVGLVTLGSKAEQEAHISPIGKEFRNKLENSQATDHRTSLFDPLCVAYSMLISHQ--KR 119 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H ++ K I+ +TDGE+ + C ++ +I I + + Sbjct: 120 H---PESKMKLRIIALTDGEDN--SSVFTAEDTCKMLYYDDIVLDSIVIGGSSYGDLFKI 174 Query: 339 CA-SPNSFYLVENPHSMYDAF 358 + + + ++ F Sbjct: 175 LKYTGGYVFKPTSQMLLFQTF 195 >gi|254383386|ref|ZP_04998738.1| hypothetical protein SSAG_03040 [Streptomyces sp. Mg1] gi|194342283|gb|EDX23249.1| hypothetical protein SSAG_03040 [Streptomyces sp. Mg1] Length = 686 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 69/190 (36%), Gaps = 20/190 (10%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ +T AR + +VLD S SM F+ D + + + + D + V + Sbjct: 494 LRAKGKTGARAKVYLVLDRSGSMRPFYK------DGSAQFLADHTLALAAHLDADATVHT 547 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V FS ++ L G ++ G ++ ++ + +H Sbjct: 548 --VFFSTDVDGTAELTLGSHDAAWAEARHAELGRMGRTS--------YHVAVEAVLEHYR 597 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRAC 339 + ++VF TDG + + +Q++ A + ++ +FLR Sbjct: 598 KDGGTGPALVVFQTDGAPDNRQPARQAI-IDAAATAPDVHWQFVAFGDHENKAFDFLRKL 656 Query: 340 ASPNS-FYLV 348 + N+ F+ Sbjct: 657 DAGNAGFFHA 666 >gi|88601588|ref|YP_501766.1| magnesium chelatase, ChlI subunit [Methanospirillum hungatei JF-1] gi|88187050|gb|ABD40047.1| protoporphyrin IX magnesium-chelatase [Methanospirillum hungatei JF-1] Length = 619 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 66/222 (29%), Gaps = 23/222 (10%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 I S P + Y I+ + ++ I + + + + + Sbjct: 373 QKISDSGRYFRSKNPSGKIYDIAFDATFRAAAPH--QITRSNGTLALNISVQDIRVKERK 430 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++ V+D S SM + ++ ++ ++L+ N Q L++F Sbjct: 431 RKSGRTIIFVVDSSGSM-----GAAKRMSAVKGAVLSLLK-----DAYINRDQVALISFR 480 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E L G T + G+ + I ++ H Sbjct: 481 GPGAEVLLKPTRSGMTAYHQLAHLPTGGQTPLSSGIYTTVSLIRTIRRKNSHDEP----- 535 Query: 288 KKIIVFMTDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAI 325 ++ ++DG + D + A+ A Y I Sbjct: 536 --FVIIISDGRANHARSDNDPVAEAWMAAAAARNEKAHYYVI 575 >gi|302536534|ref|ZP_07288876.1| VWA domain-containing protein [Streptomyces sp. C] gi|302445429|gb|EFL17245.1| VWA domain-containing protein [Streptomyces sp. C] Length = 532 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 66/198 (33%), Gaps = 27/198 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ--SGLVTFSNKIEE 232 + VLD S SME +I ++ + + L+ F +K+++ Sbjct: 347 VYVLDTSGSMEEG-----DRIGRLRSALTDLTGTGSSGTGQRFRDREEVTLLPFGDKVKK 401 Query: 233 -------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ +K L G T LK AY QH +A Sbjct: 402 VLTHVVEPGNPGPALDAIRGDVKSLRPEG-GTAVYASLKAAY----------QHLGEGNA 450 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNS 344 + IV MTDG++ +D S Y ++ V+A+ E + Sbjct: 451 DAFTSIVLMTDGQSGDKVKDFDSFYAGLPEAQKRTPVFAVLFGDSDRKELTHITELTGGR 510 Query: 345 FYLVENP-HSMYDAFSHI 361 + + S+ AF I Sbjct: 511 LFDATDGNSSLAGAFEEI 528 >gi|297193194|ref|ZP_06910592.1| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] gi|297151687|gb|EDY64371.2| conserved hypothetical protein [Streptomyces pristinaespiralis ATCC 25486] Length = 336 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 70/216 (32%), Gaps = 19/216 (8%) Query: 87 KNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIP 146 D V ++ ++ ++ + + V + A + + Sbjct: 70 SEPVAADAEEPQTDAPAVAEVAEVAQTEAAEPTVTDEPTVTDAPAEPGGRAAIGLARLKT 129 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAM 205 +++K T AR + +VLD S SM +F D S ++ +++A Sbjct: 130 RAPQLVDAYKAAGAALKKAELTGARARVYLVLDRSGSMRPYFKDGSAQRLGEQALALSAH 189 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 L+ + +V FS +I+ L ++ L G+ Sbjct: 190 LDADATVD---------VVFFSTEIDGTGELTLDSHE--GRVDELHA-GLGRMGRTNYHL 237 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 A ++ + H + +++F TDG S Sbjct: 238 AVEEVMSL-----HAKKN-TDAPALVIFQTDGAPES 267 >gi|227519070|ref|ZP_03949119.1| conserved hypothetical protein [Enterococcus faecalis TX0104] gi|227073490|gb|EEI11453.1| conserved hypothetical protein [Enterococcus faecalis TX0104] Length = 711 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 114 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 163 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 214 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 215 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 258 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 22/206 (10%) Query: 136 KIPLKFCTFI--PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 + L F P + +T + V+ DA + +VLD S+SM+ S Sbjct: 544 DLDLSFFMPAGTPNSDPRSPLTDCVTHTATVHHDFDAYDTVALVLDRSKSMDQD-QGSRK 602 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQS-------GLVTFSNKIEEFFLLEWGV----SH 242 +I+ A ++I + V V ++ + F ++ E + Sbjct: 603 RIEWAQRAILKWTDHVADGGSVQASLRKFNQDAPPAVFGFKTVLDAVVGGETATEIDSTA 662 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 ++ ++ + G ST + A + + + + +A I +TDGE T Sbjct: 663 IEEYLEDIEPDG-STAIGDAIDAAVAALMAHDDLDPNSSNNNA-----IFLITDGE--QT 714 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIR 328 D+ +A K VY + Sbjct: 715 SGDKDVCDALEDAAKDDVPVYIAPVG 740 >gi|119613593|gb|EAW93187.1| inter-alpha (globulin) inhibitor H5-like, isoform CRA_b [Homo sapiens] Length = 523 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 63/174 (36%), Gaps = 22/174 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+DVS SM TK++ ++N +L +++ N+ ++FS+ + Sbjct: 174 VVFVIDVSSSMFG------TKMEQTKTAMNVILSDLQA----NDYFNI--ISFSDTVNVW 221 Query: 232 -EFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQI-FDMQGMRQHCNTEDANY 287 ++ + ++ YL T+ L A + + Q + + Sbjct: 222 KAGGSIQATIQNVHSAKDYLHCMEADGWTDVNSALLAAASVLNHSNQEPGRGPSVGRIPL 281 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+F+TDGE L +A ++++ LR + Sbjct: 282 ---IIFLTDGEPTAGVTTPSVILSNVRQALGHRVSLFSLAFGDDADFTLLRRLS 332 >gi|114569600|ref|YP_756280.1| von Willebrand factor, type A [Maricaulis maris MCS10] gi|114340062|gb|ABI65342.1| von Willebrand factor, type A [Maricaulis maris MCS10] Length = 555 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 86/233 (36%), Gaps = 24/233 (10%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSV 162 + + + ++ I + Y + P T PW +++ + + I Sbjct: 119 EAGRLPPTDAVRIEEMVNYFDYDYALPPGPDEPFATHVTVTPTPWNADTQLMHIGIQG-Y 177 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ R +++ ++DVS SM S K+ +A+++++ +++E+ V VV + Sbjct: 178 EIIPDERPRANLVFLIDVSGSMNS-----PDKLPLAVQAMHLLVDELHPDDTVALVVYAS 232 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + + R + LS G + AY + + N Sbjct: 233 A----SGVVLPPTEARNAREIHRALDSLSAGGSTAGGAGL-ALAY-DLAEQNFDEDAVNR 286 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ +TDG+ N+ +D++ + + G + +G ++ Sbjct: 287 --------VMLLTDGDFNVGVTQDERLEDFVARKRDSGIYLSVMGFGRGNYND 331 >gi|67677889|gb|AAH97100.1| Cfb protein [Danio rerio] Length = 761 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 82/229 (35%), Gaps = 29/229 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K++ +LD+ I +D S S++ K IK++ + ++ P+ Sbjct: 248 KISLDRGGKLDIYIAVDASDSID---PKDFDKAKKIIKTLIEKISYYEVSPNYE------ 298 Query: 223 LVTFSNKIEEFFLLEWGVS--------HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ F+ +++ + + + + + + + Y +I D Sbjct: 299 ILMFATDVDQIVKMRDFKTNEKARNIVKVFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 358 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGE-----NLSTKED-QQSLYYCNEA-KKRGAIVYAIG 326 + Q N ED + +I+ TDG+ N K D ++L N A ++ +Y G Sbjct: 359 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 418 Query: 327 IRVIRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + E + S F+ + + + + F + D + Sbjct: 419 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFDLMLDDSTVVGLC 467 >gi|22127458|ref|NP_670881.1| hypothetical protein y3584 [Yersinia pestis KIM 10] gi|108809185|ref|YP_653101.1| hypothetical protein YPA_3194 [Yersinia pestis Antiqua] gi|108810628|ref|YP_646395.1| hypothetical protein YPN_0463 [Yersinia pestis Nepal516] gi|145597694|ref|YP_001161770.1| hypothetical protein YPDSF_0382 [Yersinia pestis Pestoides F] gi|167467475|ref|ZP_02332179.1| putative tellurium resistance protein [Yersinia pestis FV-1] gi|21960552|gb|AAM87132.1|AE013961_4 hypothetical [Yersinia pestis KIM 10] gi|108774276|gb|ABG16795.1| hypothetical protein YPN_0463 [Yersinia pestis Nepal516] gi|108781098|gb|ABG15156.1| hypothetical protein YPA_3194 [Yersinia pestis Antiqua] gi|145209390|gb|ABP38797.1| hypothetical protein YPDSF_0382 [Yersinia pestis Pestoides F] Length = 233 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 65/189 (34%), Gaps = 14/189 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ S + + RL + +++D S SM I I AM+ ++ P Sbjct: 7 LILTPLSLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDP 60 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V ++T+ N+ E+ L + + T + L+ I + Sbjct: 61 YALESVHLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCV 115 Query: 274 QGMRQHCNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q + + + +VF MTD +++ K+ + A + Sbjct: 116 DRDIQRSDGDQKGDWRPLVFLMTDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAK 173 Query: 333 HEFLRACAS 341 HE L+ S Sbjct: 174 HEHLKQLTS 182 >gi|45442680|ref|NP_994219.1| hypothetical protein YP_2915 [Yersinia pestis biovar Microtus str. 91001] gi|45437546|gb|AAS63096.1| conserved hypothetical protein [Yersinia pestis biovar Microtus str. 91001] Length = 233 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 65/189 (34%), Gaps = 14/189 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ S + + RL + +++D S SM I I AM+ ++ P Sbjct: 7 LILTPLSLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDP 60 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V ++T+ N+ E+ L + + T + L+ I + Sbjct: 61 YALESVHLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCV 115 Query: 274 QGMRQHCNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q + + + +VF MTD +++ K+ + A + Sbjct: 116 DRDIQRSDGDQKGDWRPLVFLMTDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAK 173 Query: 333 HEFLRACAS 341 HE L+ S Sbjct: 174 HEHLKQLTS 182 >gi|120556589|ref|YP_960940.1| vault protein inter-alpha-trypsin subunit [Marinobacter aquaeolei VT8] gi|120326438|gb|ABM20753.1| Vault protein inter-alpha-trypsin domain protein [Marinobacter aquaeolei VT8] Length = 712 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 63/194 (32%), Gaps = 34/194 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 ++ V+D S SM I A ++ L+ ++ + ++ F+++ + Sbjct: 356 LLFVIDTSGSMAG------ESIRQARSALLRGLDTLRPGD------RFNVIQFNSQAHAL 403 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + G ++ + ++ L+ G T L A D H Sbjct: 404 YTQPVPANGHYLARARDYVQDLTADG-GTEMAGALSLAMG--MDGSESSGHVQQ------ 454 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFY 346 +VFMTDG + + + ++ + I + FLR A + Sbjct: 455 --MVFMTDGAVGNESALFDQIRTGLGNR----RLFTVAIGSAPNMHFLREAARWGRGQYT 508 Query: 347 LVENPHSMYDAFSH 360 V + + A Sbjct: 509 AVHSAAEVDKALGK 522 >gi|257415126|ref|ZP_05592120.1| predicted protein [Enterococcus faecalis AR01/DG] gi|257156954|gb|EEU86914.1| predicted protein [Enterococcus faecalis ARO1/DG] Length = 711 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 114 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 162 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 163 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 214 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 215 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 258 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 55/165 (33%), Gaps = 8/165 (4%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + +G + ++ A+ P + L+ +E+ + + L + D + + A ++ + Sbjct: 14 FGRDERGNIALIFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTAALAGANELALAID 73 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 + I + W + + D + + +V+ + + Sbjct: 74 DAAAIERAKVFIDGHVSE-WKSAPAVTPEIAVILRDKQRV-----IQVVLKGHTPSFFAN 127 Query: 131 AISR--YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 + +K + T ++ + S +N + RL Sbjct: 128 MLPPGGWKYHAEARAVAVGLTPLCVLITGSSGSKMLNVKDSGRLS 172 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 57/139 (41%), Gaps = 22/139 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 339 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 340 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 393 Query: 282 TEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 394 -----HKKVIVLLTDGVPT 407 >gi|159039503|ref|YP_001538756.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157918338|gb|ABV99765.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 583 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 70/207 (33%), Gaps = 33/207 (15%) Query: 174 MMIVLDVSRSMESFFDS--SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 M+ V+DVS SM+ ++ + + + L + Q GL FS Sbjct: 386 MLCVIDVSGSMKGSVAGAGGASRQQVTLDAARRGLSLFD------DSWQIGLWEFSTNLG 439 Query: 229 ------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ E L S L++ + + T + AY + ++ + Sbjct: 440 SGRDYRRLVEIGPLSNQRSRLEQALTQIQPTRGDTGLFDTVLAAYEAV------QEEWDP 493 Query: 283 EDANYKKIIVFMTDGENLSTK--EDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRA 338 N IV TDG+N QQ L K R V IGI S L + Sbjct: 494 GQVNS---IVLFTDGKNDDDNGISQQQLLAELERIKDAERPVQVVLIGIGADVSKAELES 550 Query: 339 CA--SPNSFYLVENPHSMYDAF-SHIG 362 + ++ E+P + D F I Sbjct: 551 ITKVTGGGSFVTEDPTKIGDIFLKAIA 577 >gi|156383255|ref|XP_001632750.1| predicted protein [Nematostella vectensis] gi|156219810|gb|EDO40687.1| predicted protein [Nematostella vectensis] Length = 174 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 56/162 (34%), Gaps = 25/162 (15%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+ +++ S ++ + F+ +K M + + +V GLV FS Sbjct: 15 QTPIDLAFLVEGSTAVGANFN-------ALMKFTTNMAKNFDISVSATHV---GLVLFST 64 Query: 229 KIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L + + L I ++ G TN+ GL+ + +FD T N Sbjct: 65 TASVKITLNEFYDIVKLTSAINLITIQGGFTNTGKGLQAVKSDLFD--------ATTRKN 116 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++++ +T + VY +G+ Sbjct: 117 VPRVLIVLT-----VASSLDDVRAPSQALRDNSVTVYVVGVG 153 >gi|87199536|ref|YP_496793.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] gi|87135217|gb|ABD25959.1| hypothetical protein Saro_1518 [Novosphingobium aromaticivorans DSM 12444] Length = 520 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 2/170 (1%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR+ + G + + A+ L + ++G+ +VS + + + L + D++ + A TQ+ Sbjct: 6 IRSGLKDETGAVAVTYALALTGLIAMVGIGYDVSQVLTLDSELQNAADQAALAAVTQLDR 65 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV-PQNEG 126 + +R K ++ + +++ GF + + Sbjct: 66 QSGAIDRAKNAANSLIANNTLLAKDAGVVTIKEAGFAFYATRADAEAGANPTTDNARARF 125 Query: 127 YSISAIS-RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + + L S + + + + + Sbjct: 126 AKVEVNGRSAQYALTPIGSAVASWVSPTLSAAAVAGIGSAICKVPPMKIC 175 >gi|17536325|ref|NP_495348.1| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|3258584|gb|AAC24429.1| Hypothetical protein T19D12.4a [Caenorhabditis elegans] Length = 1028 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 81/226 (35%), Gaps = 21/226 (9%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + + + + D V+ ++ ++ + F +S+ + + Sbjct: 321 SYLKSYADARAADEVISKESWTELANSDPFATLICSFFKLPTTQMSSQLEYLRTERDAVI 380 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ LDM+I D S S+ S + ++ + K N+ + G++ Sbjct: 381 DTPKCEVLDMIIAFDTSESLSSLIVPQY------VDFAKKLVAQYKY---GNDNTRVGII 431 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 TFS+ + E L G + + I + G TN T A N +FD + Sbjct: 432 TFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKN-LFDTES------- 483 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +AN K++ +TDG + ++ K I + +G Sbjct: 484 -NANRNKVLFILTDGVPTVDTYTDEVAAG-DKLKSISVISFFVGYS 527 >gi|307352799|ref|YP_003893850.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] gi|307156032|gb|ADN35412.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] Length = 651 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 64/206 (31%), Gaps = 23/206 (11%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 ++ A R P + + I + + + + V+D S S Sbjct: 422 GPDIALDATIRAAAPFQLERGAEGRDLAIKIDISDIREKVRERKIGNTI--LFVVDASGS 479 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M + ++ +I ++L + + GLV F K E L L Sbjct: 480 M-----GAQQRMTAVKGAILSLL-----VDAYQKRDRVGLVVFRGKTAELLLPPTSSVEL 529 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE----N 299 RK G T GL A+ + ++ + ++ ++DG+ Sbjct: 530 ARKCMQELPVGGKTPLAHGLSKAFEVL------QRELMINKNTMPR-LILISDGKANVGM 582 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAI 325 S ++ N +++ Y I Sbjct: 583 TSDSPLNDAIGIANHIREKEIASYVI 608 >gi|298491707|ref|YP_003721884.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233625|gb|ADI64761.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 426 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 77/234 (32%), Gaps = 33/234 (14%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 +N R + + I+S L++ ++LD S SM I+ I+++ Sbjct: 16 ATQLSNQRQLAISISSIAD-ELDPSLPLNLCLILDKSGSMHG------EPINTVIQAVEQ 68 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +L +++ ++ V +G ++++ + ++ ++ K G T GL Sbjct: 69 LLAQLQPGDHISIVAFAG----TSEVIIPNQIVQDAESIKCQLHKRLKAGGGTIIAEGLS 124 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL----------STKEDQQSLYYCNE 314 ++ +TDG + ++ L + Sbjct: 125 LGITELLKGTKGAVSQA----------FLLTDGHGDRGLKIWKWEMGPNDKKRCLELAQK 174 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIV 366 A + + G + + L A + +E P D FS + K I Sbjct: 175 ATRVSLTLNTFGFGNDWNQDLLEKIADAGGGTLAYIERPQQAVDQFSRLLKRIQ 228 >gi|115537688|ref|NP_872008.2| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|82654523|gb|ABB88212.1| Hypothetical protein T19D12.4b [Caenorhabditis elegans] Length = 1015 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 81/226 (35%), Gaps = 21/226 (9%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + + + + D V+ ++ ++ + F +S+ + + Sbjct: 308 SYLKSYADARAADEVISKESWTELANSDPFATLICSFFKLPTTQMSSQLEYLRTERDAVI 367 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ LDM+I D S S+ S + ++ + K N+ + G++ Sbjct: 368 DTPKCEVLDMIIAFDTSESLSSLIVPQY------VDFAKKLVAQYKY---GNDNTRVGII 418 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 TFS+ + E L G + + I + G TN T A N +FD + Sbjct: 419 TFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKN-LFDTES------- 470 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +AN K++ +TDG + ++ K I + +G Sbjct: 471 -NANRNKVLFILTDGVPTVDTYTDEVAAG-DKLKSISVISFFVGYS 514 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 49/255 (19%), Positives = 84/255 (32%), Gaps = 48/255 (18%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIV-MPITSSVKVNSQTDARLDMM-----------IVLD 179 Y+ P +P +S+ I+ P S + TDA +D VLD Sbjct: 290 TDTYRRPTTANATLPAELSSQTIIEAPFPGS---KTVTDALIDAYTNQFRVPGETTFVLD 346 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ----SGLVTFSNKIEEFFL 235 VS SM +I + +++ ++ N ++ ++ FS E Sbjct: 347 VSGSM------LGQRITLLKDTMSDLISGGATTDLANVSLRDREKVSIIPFSFGPHEVIS 400 Query: 236 LEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 G + LQ++++ L G T + AY + +Y Sbjct: 401 ETLGAVGSPSRTDLQQRVEALQADG-GTGIYDAVLAAY------------AESAGGDYIP 447 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKR--GAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 IV MTDGE + + Q L N V+ I + + A + + Sbjct: 448 SIVLMTDGELTAGRTYDQFLTEWNALPSNIRSIPVFVILYGEANVADMEQLAATTGGETF 507 Query: 347 LVENPHSMYDAFSHI 361 N + +AF I Sbjct: 508 DAIN-GDLDEAFKEI 521 >gi|21740064|emb|CAD39048.1| hypothetical protein [Homo sapiens] Length = 803 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 18 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 70 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 71 EVNEDNLEQAQSWIRDIK-IGSSTNTLSALKTAF------------ADKETQA----IYL 113 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K+ + +Y I ++ FL+ A+ Sbjct: 114 LTDGRPDQPPETVI-----DQVKRFQEIPIYTISFNYNDEIANRFLKEVAA 159 >gi|254492197|ref|ZP_05105371.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxidans DMS010] gi|224462522|gb|EEF78797.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxydans DMS010] Length = 2740 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/160 (20%), Positives = 59/160 (36%), Gaps = 16/160 (10%) Query: 155 VMPITSSVKVNSQTDARLD--MMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKL 211 +P+ S + + + ++ +M++LDVS SM + +T++ + IKS +L++ Sbjct: 2061 DVPVASPEENSGEATLEVNTNLMMILDVSGSMNDSANFQGMTRLQVMIKSSLELLDQYDA 2120 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 D V ++TF+ W + I G +TN L A N Sbjct: 2121 YGD----VMVNIITFATSASNPSG-GWVTVDQAKAIILGLTAGGNTNYDDALNDAINAFA 2175 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 + N FM+DGE S + Sbjct: 2176 LGGKLGDGQNISY--------FMSDGEPNSNNVSNSATVP 2207 >gi|166366825|ref|YP_001659098.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089198|dbj|BAG03906.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 460 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 60/193 (31%), Gaps = 41/193 (21%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSS---------------------ITKIDMAIKSINA 204 +QT + ++D S SM I KIDM I+S+ Sbjct: 35 AQTRPSTTFVFLIDTSGSMYEIVAGDFQPTGETYMQDGKEYTRVVGGIAKIDMVIESLRT 94 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPG 262 ++ + P+ + L+ F ++ L L+ I L F T G Sbjct: 95 LINSGRFTPED----RIALIQFDDQASTLIGLTPVTQTRQLEDAIAKLRNFSGGTCMGRG 150 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + A + + +H + TDG+ ++ + +G + Sbjct: 151 INQALALLANQSMTSRHT-----------MIFTDGDTF---DEDDCQNLAQQFASQGISI 196 Query: 323 YAIGIRVIRSHEF 335 A+G+ + Sbjct: 197 TALGVGEFNENLL 209 >gi|225444686|ref|XP_002277641.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|297738541|emb|CBI27786.3| unnamed protein product [Vitis vinifera] Length = 756 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 68/201 (33%), Gaps = 35/201 (17%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + R +++ V+D+S SM ++ +++A L ++ + + +G Sbjct: 316 QQTGKVFRKEVVFVVDISGSMRGKL------LEDTKNALSAALSKLDSKDSFSIIAFNGE 369 Query: 224 VT-FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + FS+ ++ + + + I G TN + A G Sbjct: 370 IFIFSSSVQLATKEA--IENAIQWISMNFIAGGDTNILLPMNKAMELFSHSPGSIP---- 423 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-----KKRGAI--VYAIGIRVIRSHEF 335 II +TDG + + C+ + +Y GI + +H F Sbjct: 424 -------IIFLITDG------SVEDERHICDVMTSYLTNEESIHPRIYTFGIGLYCNHYF 470 Query: 336 LRACA--SPNSFYLVENPHSM 354 L+ A + + +S+ Sbjct: 471 LKMLAMIGRGHYDAAYDANSI 491 >gi|241589598|ref|YP_002979623.1| von Willebrand factor type A [Ralstonia pickettii 12D] gi|240868310|gb|ACS65969.1| von Willebrand factor type A [Ralstonia pickettii 12D] Length = 696 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 42/288 (14%), Positives = 97/288 (33%), Gaps = 33/288 (11%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF-----RNELRDNGFV 104 L + + V AA G N + G + + + + S + + Sbjct: 396 LEDPVTSTAVQAA-----LEAGYNPGEDLGEALQAMLTSIASTSEEPPVVSEAIPFTSEI 450 Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 D +IV+ + + + + ++ A + + + + + M + + Sbjct: 451 EDNGEIVQRVKRETIALRRKLTTALEAPTLSQCWEGYAGERLSNLGALRLPMGDLNIFER 510 Query: 165 NSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAI---KSINAMLEEVKLIPDVNNVVQ 220 ++ T + + ++LD S SM+ +I MAI ++ L ++ + Sbjct: 511 ETRRTSVDVAIQLLLDRSGSMQEE-----ERIAMAIDACTALGLALSQIDGVQIATAAFP 565 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +G + + VS +I + G +T + + + + R Sbjct: 566 AGEIDCDRNVLVLNGFGESVSRRAARIAAVRADGANTPLADAMMFGQYSLLSTKATR--- 622 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +I++ +TDGE S + + C G V+ +GI Sbjct: 623 --------RILLPITDGEPDSRSAVAEVVESCARW---GVEVFGVGIG 659 >gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 694 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 87/262 (33%), Gaps = 48/262 (18%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + +T ++ V S+ I+ + F Y ++R + Sbjct: 190 TIPTTDFNVDVFLNKNPVSVKEITGNTQVVNFKINPLNYLSNRKTDVK------------ 237 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++ +LD S SM + +I+ AIK+++ L ++ +V+ G TF++ Sbjct: 238 ---SIVFLLDCSGSM-----TIDNRIENAIKAMDLFLHSLEPGVKF-EIVRFG-STFNSL 287 Query: 230 IEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ IK S T +K YN++ Sbjct: 288 FDFKLTEYNDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNELS---------------- 331 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSF 345 ++ +TDG S + + ++++G+ +R AS Sbjct: 332 PDVLFVLTDG------AVDNSQAVLDFVRDSSTKIFSLGLGAGADMNLVRNLASFTGGVS 385 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 V + + D+ + +D Sbjct: 386 EHVLDASQLRDSIIRLLEDSTN 407 >gi|307323133|ref|ZP_07602343.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890622|gb|EFN21598.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 422 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 57/155 (36%), Gaps = 33/155 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-----------EVKLIPDVNNVVQSG 222 + +VLDVS SM + T++ A ++ N +L+ + + + Sbjct: 40 VELVLDVSGSMRARDVDGDTRMAAAKQAFNEVLDATPEEVRLGIRTLGANYPGKD--RVA 97 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 S ++ ++ + + + L G T L+ A + +G R+ Sbjct: 98 GCRDSEQLYPVGQVD--RTEAKAAVATLRPTGW-TPIGLALRGASKDLSSGEGTRR---- 150 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 IV +TDGE+ + D C+ A++ Sbjct: 151 --------IVLITDGEDSCGQPD-----PCDVARE 172 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 62/162 (38%), Gaps = 26/162 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I +D+SRSM + D +++D A ++LE++ + GL+ FS Sbjct: 94 ILIAIDLSRSMLA-PDVKPSRLDRAKLLTQSLLEKLSGE-------RVGLIVFSGTAFLQ 145 Query: 234 FLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L L+ + L TN + A + Sbjct: 146 SPLSSDYEILREFLPALDPTFLPEGGTNYDALINTALTAFGATGAA-----------DRF 194 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ ++DGE +T++D +S E K RG V A+G+ Sbjct: 195 LIILSDGE--ATEDDWRSHV--AELKNRGIRVIALGVGTTAG 232 >gi|299140485|ref|ZP_07033623.1| BatB protein [Prevotella oris C735] gi|298577451|gb|EFI49319.1| BatB protein [Prevotella oris C735] Length = 342 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 85/201 (42%), Gaps = 20/201 (9%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 +S+Y+ +KF + T ++ + K++ ++++I LD+S SM + D + Sbjct: 50 VSKYRPTIKFWLLLSAMTILILMIARPQAGTKISHDKRNGIEVIIALDISNSMLAE-DVT 108 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 ++++ + I +++ N+ V GLV F+ + + ++ + Sbjct: 109 PSRLEKSKLLIENLVDHF-----TNDKV--GLVVFAGDAFVQLPITSDYVSAKMFLQNIK 161 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 ++T T + + Q+ + N K I+ +TDGE+ + +L Sbjct: 162 PSLIATQGTD--------LARAIELSQNSFMQRDNIGKAIIVITDGEDH----EGGALEA 209 Query: 312 CNEAKKRGAIVYAIGIRVIRS 332 A K+G+ V+ +GI + Sbjct: 210 AKAAHKKGSNVFILGIGDPKG 230 >gi|294661523|ref|YP_003579976.1| hypothetical protein KP-KP15_gp109 [Klebsiella phage KP15] gi|292660684|gb|ADE34932.1| hypothetical protein [Klebsiella phage KP15] Length = 738 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 66/200 (33%), Gaps = 20/200 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIEE 232 + V DVS SM + ++ ++++ + + ++ Q G V K+ Sbjct: 20 VFVCDVSGSMYNELPRIR---QHLKNNLATLVKQDDTVSILYFSSRGQFGSVFVGEKVSS 76 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L S + I + T L+ A D+Q + N+ ++ Sbjct: 77 VSDL----SKINDAIDRYLRPTGCTGFVEPLQLAVEVATDLQADNGNLNS--------LI 124 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--FYLVEN 350 F+TDG + + D L C+ + + + L A + VE Sbjct: 125 FLTDGYDNCWRTDDI-LKACSTLPLTFNNIAFLEYGYYVNRPLLEKMAEATNALHKFVEG 183 Query: 351 PHSMYDAFSHIGKDIVTKRI 370 + AF I + R+ Sbjct: 184 FDAYVPAFDEIITAQTSARV 203 >gi|118365082|ref|XP_001015762.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89297529|gb|EAR95517.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 755 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 40/245 (16%), Positives = 80/245 (32%), Gaps = 38/245 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-----RLDMMIVLDVSRS 183 +S ++ K+ + + ++ I + +D++ ++D S S Sbjct: 1 MSVNNQNKMKFNLISELK-AIKPGNLRDFICGIELELENNNNNDARLPVDIICLIDNSGS 59 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----G 239 M K + KS+ +L+ ++ Q LV+FS+ + L Sbjct: 60 MAG------KKAQLVRKSLKYLLKILEKGD------QISLVSFSSTAKTLCPLTQVNDEN 107 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 ++ IK ++ G T PG K + + R+ I+ +TDGE Sbjct: 108 KQQIKSAIKQINGQG-GTFVIPGFKEVTKILNSRKEQREQT---------FILLLTDGEF 157 Query: 300 LSTKEDQQSLYYCNEAKKRGAI----VYAIGIRVIRSHEFLRACAS--PNSFYLVENPHS 353 + + +Y G + E L+ A + L+ N Sbjct: 158 GDIDSGKVIQNINRLFTQSEIQKTPYIYTYGYGDDVNPEILQEIAQKFQGKYCLISNVQQ 217 Query: 354 MYDAF 358 + D F Sbjct: 218 VTDWF 222 >gi|18858247|ref|NP_571413.1| complement factor B [Danio rerio] gi|1015970|gb|AAB19093.1| complement factor B [Danio rerio] Length = 737 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 84/229 (36%), Gaps = 29/229 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K++ +LD+ I +D S S++ K IK++ + ++ P+ Sbjct: 248 KISLDRGGKLDIYIAVDASDSID---PKDFDKAKKIIKTLIEKISYYEVSPNYE------ 298 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ F+ +++ + ++ + + + + + + Y +I D Sbjct: 299 ILMFATDVDQIVKMRDFKTNEKARKILKIFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 358 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGE-----NLSTKED-QQSLYYCNEA-KKRGAIVYAIG 326 + Q N ED + +I+ TDG+ N K D ++L N A ++ +Y G Sbjct: 359 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 418 Query: 327 IRVIRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + E + S F+ + + + + F + D + Sbjct: 419 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFDLMLDDSTVVGLC 467 >gi|224051386|ref|XP_002199700.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 565 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 65/198 (32%), Gaps = 24/198 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ ++D S S+ + ++ ++ + + ++ + V T++ + Sbjct: 380 SVNIGFLIDGSSSIGEI------NFRLMLEFVSNVAKAFEISDIGSKVAAVQF-TYNQRK 432 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + I + T + + + +F + N Sbjct: 433 EFGFTDHVTKEKVLSAIHNIQYMSGGTATGDAISFTTRTVFGPVKDGPNKN--------F 484 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYL 347 ++ +TDG+ AKK G V+++G+ + L+ AS + + Sbjct: 485 LIVLTDGQ-----SYDDVTGPAAAAKKAGITVFSVGVAWAPLDD-LKEMASEPRESHTFF 538 Query: 348 VENPHSMYDAFSHIGKDI 365 + I + I Sbjct: 539 TREFTGLEQMVPDIIRGI 556 >gi|157412070|ref|YP_001481410.1| TerY3 [Escherichia coli APEC O1] gi|99867095|gb|ABF67740.1| TerY3 [Escherichia coli APEC O1] Length = 346 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 56/206 (27%), Gaps = 16/206 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM + + ++ +++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESM------IGENLKKMTDGLQMIIGDLRKDPHALETAWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G T+ L+ QI H D + Sbjct: 57 RTIVPL----HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + A + A+G+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEVKRWKDHYA--SKVNLIAVGLGPSADLNTLRQLTENVMLFTESQ 168 Query: 351 PHSMYDAFSHIGKDIVT--KRIWYDK 374 I + + + DK Sbjct: 169 EGDFTRFIKWITASVTAHSRSVGDDK 194 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 64/209 (30%), Gaps = 37/209 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDS------------------SITKIDMAIKSINAMLE 207 S + V+D S SM + +T I + LE Sbjct: 35 SASRPSTTFSFVIDTSGSMYEVLEGEETIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLE 94 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ + LV F + L + L+ I L F T G++ Sbjct: 95 GLVSSGQADSRDRIALVRFDDSASVLLPLTASTDTASLKNAIGQLRNFSGGTRMALGMEE 154 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A N + + D + ++ ++F TDG+ + + + G + A+ Sbjct: 155 ALNILKNC----------DLSSRRTLIF-TDGQTFDESD---CRDLATQFAEAGIPITAL 200 Query: 326 GIRVIRSHEFLRACA--SPNSFYLVENPH 352 G+ + L + + + V Sbjct: 201 GVGEYN-EDLLLYLSDRTGGRVFNVVETQ 228 >gi|269969412|sp|C7G0B5|PIF_PINFU RecName: Full=PIF; Contains: RecName: Full=Pif97; Contains: RecName: Full=Pif80; AltName: Full=Aragonite-binding protein; Flags: Precursor gi|256252203|dbj|BAH97338.1| Pif177 [Pinctada fucata] Length = 1007 Score = 47.5 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 37/201 (18%), Positives = 74/201 (36%), Gaps = 21/201 (10%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V V S D+++ +D S + D K+ AML V+ + +N ++ Sbjct: 18 VGVKSDECKTADVVVNVDAS---DDVSDQDFDKLK------RAMLMMVRGLSIDDNQIRL 68 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G+VT+ +++ + L+ L R I+Y+ K + G+ A R + Sbjct: 69 GMVTYGSEVCDSIPLQGDRLDLARTIRYMKKPTGPSKPFKGMGEA---------RRMFSS 119 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACA 340 N + M G ++ E + + ++A+ V AIG+ E Sbjct: 120 RGRYNVPH--ITMNLGGDIVDTEVKDLMDETDKARDEDIKVMAIGLGAKVDRDEIESIAY 177 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 + Y +++ + I Sbjct: 178 DRDQAYFMDDEDDLIRKVKEI 198 >gi|315173854|gb|EFU17871.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX1346] Length = 683 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 86 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 134 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 135 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 186 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 187 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 230 >gi|146296332|ref|YP_001180103.1| Ig domain-containing protein [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409908|gb|ABP66912.1| Ig domain protein, group 2 domain protein [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 1831 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 66/214 (30%), Gaps = 27/214 (12%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 +I+ + + F +Y + + IT + D++ V+D + Sbjct: 802 NESSKTITVETDH--------FSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTG 853 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM +ID ++IN ++++K V GLVT+ + + G Sbjct: 854 SMSD-------EIDAVKQNINNFVDKLKTKDIS---VNLGLVTYKDITCDGPNSTVGHGF 903 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + + G G + D N K IV +TD Sbjct: 904 FSSADDFKNALGSIKVDGGG--DTPETLIDALET-ARLLGFRENSTKFIVVLTDA--NYK 958 Query: 303 KEDQQSLYYCNE----AKKRGAIVYAIGIRVIRS 332 E++ + +E K IV + Sbjct: 959 LENRFGIKSADEIIERLKSDNIIVSVVSTMSFEG 992 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 70/189 (37%), Gaps = 24/189 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +++ + +++ ++DVS SM SS K+ + KS + +++ + ++ V SG Sbjct: 201 EIDMEEAPVSNLVFLIDVSGSM-----SSRNKLPLLQKSFDELVDSLPDEGTISIVTYSG 255 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 K+ +++ I L G TN G++ AY + Sbjct: 256 ----EEKVVLSGEPMSNKKGIKKAIDKLHANGC-TNGQAGMQKAYEI-----AQKYFIEG 305 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLR 337 + ++ TDG+ N+ + + + K G + +G + L Sbjct: 306 GNNR----VIMATDGDLNVGISDLDDLEKFITDKKDEGVFLSILGFGEGNYKDDKMQTLA 361 Query: 338 ACASPNSFY 346 C + N Y Sbjct: 362 DCGNGNYSY 370 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 33/80 (41%), Gaps = 7/80 (8%) Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI------RSHEFLRACASPNSF-YLVENP 351 N+ D + AK G VY IG + L CAS +S ++ + Sbjct: 55 NIRNALDALTTQAYTNAKAAGISVYTIGFSTPSDSIDDKGLSLLSNCASSSSQAFVANDA 114 Query: 352 HSMYDAFSHIGKDIVTKRIW 371 +++ AF+ I K + + R+ Sbjct: 115 NTLISAFNQIAKSVGSLRLT 134 >gi|116753762|ref|YP_842880.1| magnesium chelatase [Methanosaeta thermophila PT] gi|116665213|gb|ABK14240.1| protoporphyrin IX magnesium-chelatase [Methanosaeta thermophila PT] Length = 669 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 82/204 (40%), Gaps = 34/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM ++ +++ A ++ ++L + + G+V F + + Sbjct: 478 VLFVVDASGSM-----GAMRRMESAKGAVLSLL-----MDSYQKRDRIGMVAF--RGNDA 525 Query: 234 FLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 LL S + +K L+ G T + GL A I Q ++ + +I Sbjct: 526 DLLLPPCSSVDLAMKRLAELPTGGRTPLSAGLSKALRVI-------QGELIKNKETRPMI 578 Query: 292 VFMTDGE-NLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRAC-----A 340 V ++DG N+S D ++ + EA++ G I V+ S L C A Sbjct: 579 VLVSDGRANVSISSDPKKEIVQIAEEARRLGVHTVVIDTEVVGSSFMEMRLGYCRDIAEA 638 Query: 341 SPNSFYLVENPHSMYDAFSHIGKD 364 + +Y + + ++ S I + Sbjct: 639 AGGRYYPISDITP--ESLSRIVEQ 660 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/237 (12%), Positives = 80/237 (33%), Gaps = 19/237 (8%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV--------V 120 N+ K + + R+ FV ++++ Sbjct: 70 RKDNDYKLSGSNKENDSFDISSEVGKRDSQMVTKFVKAFLPQTNDDKMNLIPICKTVNNT 129 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDARLDMM 175 + S ++ ++ P + + + + S + + +D+M Sbjct: 130 SGKGHTSSSEVTCTVSGTIEHKSWFPLKVGTLEVIPQQVNVASQSRAFKKNTFNIPIDLM 189 Query: 176 IVLDVSRSME------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +V D+S SM ++ +K+ + + + E+ L D N + + F Sbjct: 190 VVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTPF 246 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 60/176 (34%), Gaps = 26/176 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V++Q +D+ +DVS SM + D +I A + + ++++ Sbjct: 80 THAPVSAQQTVGVDLAFCIDVSNSM-AARDVKPDRIGFAKQIVTHTMQQLAGS------- 131 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 + ++ F+ L + + + + TN L+ + + Sbjct: 132 RIAMIVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQGTNLGQALERSAQALSAPSRA 191 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K ++ +TDGE ++ + K++G Y + I + Sbjct: 192 G-----------KAVILLTDGE-DHEGGLEEGIK---RLKEQGIKAYVVTIGLPEG 232 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 42/232 (18%), Positives = 83/232 (35%), Gaps = 23/232 (9%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVKVNS 166 + ++ + + Y A + P T + PW + S S Sbjct: 18 LPPMDAVRVEEMINYFNYEDMAAEQRDTPFGITTEVAANPWNPEDKASAYRHQSLAAQGS 77 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++DVS SM S K+ + +S+ + + V+ VV +G Sbjct: 78 PKCPPPIWVFLVDVSGSMHS-----PDKLPLLKRSLRLLSRSLDADDRVSLVVYAGA--- 129 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S + E + +++ ++ LS G STN G++ AY R+ N Sbjct: 130 SGVVLEPTP-GNKRATIEQALQQLSA-GGSTNGGAGIRLAY------AKAREAFIEGGIN 181 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++ TDG+ N+ T Q + + ++ G + +G ++ L Sbjct: 182 R---VILATDGDFNVGTVNHQALIDLIKQQRQAGIALTTLGFGGGNYNDHLM 230 >gi|74185538|dbj|BAE30236.1| unnamed protein product [Mus musculus] Length = 411 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 45/133 (33%), Gaps = 14/133 (10%) Query: 176 IVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-- 230 IVLD S SM + S + + ++ + + GL+T++ Sbjct: 260 IVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKV 319 Query: 231 ------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E ++W L + K TN+ L+ Y+ + G Sbjct: 320 LVRVSDERSSDVDWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNR 378 Query: 285 ANYKKIIVFMTDG 297 + +I+ MTDG Sbjct: 379 TRH--VIIIMTDG 389 >gi|38347889|ref|NP_941138.1| putative tellurium resistance protein [Serratia marcescens] gi|190410231|ref|YP_001965732.1| terY3 [Klebsiella pneumoniae] gi|226807624|ref|YP_002791318.1| TerY3 [Enterobacter cloacae] gi|226809934|ref|YP_002791628.1| TerY3 [Enterobacter cloacae] gi|38259366|emb|CAE51591.1| putative tellurium resistance protein [Serratia marcescens] gi|146151024|gb|ABQ02790.1| terY3 [Klebsiella pneumoniae] gi|226425849|gb|ACO53942.1| TerY3 [Enterobacter cloacae] gi|226426160|gb|ACO54252.1| TerY3 [Enterobacter cloacae] Length = 346 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 56/206 (27%), Gaps = 16/206 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM + + ++ +++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESM------IGENLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G T+ L+ QI H D + Sbjct: 57 RTIVPL----HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + A + A+G+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEVKRWKDHYA--SKVNLIAVGLGPSADLNILRQLTENVMLFTESQ 168 Query: 351 PHSMYDAFSHIGKDIVT--KRIWYDK 374 I + + + DK Sbjct: 169 EGDFTRFIKWITASVTAHSRSVGDDK 194 >gi|254881903|ref|ZP_05254613.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|319641094|ref|ZP_07995798.1| von Willebrand factor [Bacteroides sp. 3_1_40A] gi|254834696|gb|EET15005.1| von Willebrand factor [Bacteroides sp. 4_3_47FAA] gi|317387338|gb|EFV68213.1| von Willebrand factor [Bacteroides sp. 3_1_40A] Length = 212 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 54/167 (32%), Gaps = 16/167 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + ++ ++ P ++TF++ Sbjct: 3 RLPVYLLLDTSGSMYG------EPIEAVKNGVQTLISTLRSDPYALETAYISIITFNSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ L + Q I T L+ +I D + Sbjct: 57 QQVTPLTELAAFQQPNIDA----SGCTALGGALELLSQKIDSEITKTTAEVKGDWRP--L 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I MTDG D S K++ +V A S + L+ Sbjct: 111 IFIMTDG----VPTDDISRGLAEFKKRKCGMVVACAAGQGASTDTLK 153 >gi|308472823|ref|XP_003098638.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] gi|308268238|gb|EFP12191.1| hypothetical protein CRE_04225 [Caenorhabditis remanei] Length = 417 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/199 (18%), Positives = 64/199 (32%), Gaps = 20/199 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S M + + + I S+ + + + GLVT+++ + Sbjct: 40 LDVIAVVDNSHGMT---NGGVQSVAANIASVFSSGTRIGSNSTEPRTTRVGLVTYNSGAK 96 Query: 232 EFFLLEWGV---SHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDAN 286 L K LS +T+S L A + Q NT + Sbjct: 97 LDADLNKFQDLDGLYNGVFKDLSDVVDTTDSYLATGLNAAEELL-----QSQSLNTTRDH 151 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPN 343 YKK+I+ + + N K G ++ + LR ASP Sbjct: 152 YKKVIIVY--ASEYKGSGELDPVPVANRLKGSGVVIVTVAYDQGGDEGLLRDLANIASPG 209 Query: 344 SFY--LVENPHSMYDAFSH 360 Y N ++ Sbjct: 210 FAYSNAPNNAGNLVGQIQD 228 >gi|225575062|ref|ZP_03783672.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] gi|225037732|gb|EEG47978.1| hypothetical protein RUMHYD_03151 [Blautia hydrogenotrophica DSM 10507] Length = 393 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 68/170 (40%), Gaps = 33/170 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + +DM+++LD S SM+ K + +++ A+LE++ ++ V Sbjct: 103 MENHSAVDMVLLLDGSGSMQG-------KKEPCVQATEALLEQMDEQS------RAQAVA 149 Query: 226 FSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F++ + L E G L + ++ G T L +A N + + + + Sbjct: 150 FASCVLGNTELLPLDEEGRETLIKFVEGTDIIG-GTEFGQPLTFALNSLEEKKETGRIQA 208 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ ++DGE ++L E K++ ++Y I + Sbjct: 209 ---------VILLSDGEGPFP----ETLE--EEYKEKDVVLYTIRMDAGE 243 >gi|167623905|ref|YP_001674199.1| vault protein inter-alpha-trypsin subunit [Shewanella halifaxensis HAW-EB4] gi|167353927|gb|ABZ76540.1| Vault protein inter-alpha-trypsin domain protein [Shewanella halifaxensis HAW-EB4] Length = 761 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 57/309 (18%), Positives = 103/309 (33%), Gaps = 34/309 (11%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E + KL L L N + I + SL + + Q Sbjct: 159 EPHQVPEHKLLAPYALNFELTIQGQLASAAL--NSPTHAITKHYDNESLKVSLNNQTAAM 216 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN-SQTDARLDMMIVLDVSRSMES 186 I + P + W + +V + +++ QT + + +V+D S SM Sbjct: 217 DRDLIVEFTQPKYYAGEGLWSKDDDKVVALTSFYPQIDLPQTTSSRCIKMVVDCSGSM-- 274 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 SIT+ +A+K I +L E D N++ G S E + ++ Sbjct: 275 -LGDSITQAGIALKQILKLLNE----DDWFNIILFGSHHKSLFSESVKANRANLDIAAKE 329 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + L+ T L AY+ + I+ +TDGE ++ Sbjct: 330 LANLNADLGGTEMLSALNAAYDSAAPAELASN------------ILLITDGEIWGEEQ-- 375 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-----SPNSFYLVENPHS-MYDAFSH 360 C +A++ + +G+ S FL+ A + EN S + F Sbjct: 376 ---LIC-KAQESNHRHFVVGVGSAVSEAFLKQLADKTGGASEFVTPNENMSSRIVQHFCR 431 Query: 361 IGKDIVTKR 369 I + +T+ Sbjct: 432 IKQSKLTQS 440 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/314 (12%), Positives = 101/314 (32%), Gaps = 37/314 (11%) Query: 38 IEVSHIFFMKTVLHSMIDRSL-VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM---- 92 I + + K + +L + + I + ++ + + Sbjct: 63 ISLGDVRKSKMDISVADVSALEMTTSRSISSLRMESSPYDNLQDEEYSQFSENRFRLANE 122 Query: 93 ----SFRNELRDNGFVNDI-----DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 +F ++ + N + ++ I + Y+ + + P+K T Sbjct: 123 DPLSTFSVDVDAASYSNMRRFINRGSLPNKDAVRIEELINYFSYNYTEPTGND-PIKIST 181 Query: 144 FIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + +V + + + + + ++DVS SM T++D+ S Sbjct: 182 EVGNCPWNSQNRLVKIGLKARSMAGENLPASNFVFLIDVSGSMY-----GATRLDLVKSS 236 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + + ++ V VV +G ++ + L+ G + Sbjct: 237 LKLLTNNLREKDRVAIVVYAGSAG----EVLPSTSGANKQKIKEALDNLNAGGST-TGGA 291 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 G++ AY +I ++ N I+ TDG+ N+ + L + +K G Sbjct: 292 GIQLAY-KIAKQNFIKGGNNR--------IILCTDGDFNVGVSSNDGLLALIEQERKSGV 342 Query: 321 IVYAIGIRVIRSHE 334 + +G + + Sbjct: 343 FLSILGYGMGNYKD 356 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 65/180 (36%), Gaps = 31/180 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++V+D+S SME ++++ A K + + + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATEAGEQLSRLTAAKKVLRDFVTQ-------RQ 139 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + + L + G STN + Sbjct: 140 GDRFGLILFG---DAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ + + +I++ +TDG T + A +G +Y I + + Sbjct: 188 GLGIKVFEQSPSTSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPEN 245 >gi|241267206|ref|XP_002406336.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215496880|gb|EEC06520.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 519 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 67/204 (32%), Gaps = 34/204 (16%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P++ TF T + S K +++VLDVS SM ++ Sbjct: 128 PVQTGTFPTNRTVFTTFRLFQRSDEKSQR-------VVLVLDVSHSMRPRVGED--RLAF 178 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSK 252 + N M+ + +++ G+VTFS + + L + + I L Sbjct: 179 LQCATNHMIRHM-----LHDYQALGIVTFSGRCQVAHPLVVLNTTDARDGIAKVIDGLV- 232 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKEDQQSLYY 311 G T+ GL A ++ + G +VF+ TDG+ + L Sbjct: 233 LGAGTSIGCGLSKA-TEMLEGNGTSARGG---------LVFLVTDGDENYKPWIVEQLPI 282 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEF 335 G V + + + Sbjct: 283 ---LVSSGVKVSTFALGTLAEKKL 303 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/214 (11%), Positives = 59/214 (27%), Gaps = 26/214 (12%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM------- 66 N KG + A+ L ++ + + +++ ++F K L + D + ++ Sbjct: 11 NEKGFAIVYIALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAGALAGVAKLSDTVTARQ 70 Query: 67 --------------NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVR 112 + N GDI+ + + + N + + R Sbjct: 71 SAKLFAERNKAAGESVKVALNETNSADGDIVVGYWDKVTRTMSATVPTGKVANAVKVVAR 130 Query: 113 STSLDIV-VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 T+ + +N+ + PI + S T Sbjct: 131 RTTETGTGISAENKQVDLFFGQVLNWGQMSAKAEAIACRPAKPSAPIVLCQDLCSSTTFP 190 Query: 172 LDMMI----VLDVSRSMESFFDSSITKIDMAIKS 201 + D S ++ + T+ K+ Sbjct: 191 FKVYFNQTIATDPSGALNPLYTVGWTEFSPTSKA 224 >gi|118366231|ref|XP_001016334.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89298101|gb|EAR96089.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 730 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 69/167 (41%), Gaps = 24/167 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 FIP +++ ++ T+ + S+ ++++D S SM K+++A + Sbjct: 239 LINFIPPQISTQENLLTKTTDQLIKSE------FVLIIDRSGSMYG------PKMELAKE 286 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 S+ L+ + + + N++ G + + V + ++I S TN + Sbjct: 287 SLIFFLKSL-PVGSIYNIISFGSTCEIMFDQSVQFNDQNVQNSIQQIDQFSANLGGTNVS 345 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L++ Y +FD G+R K I +TDGE E Q+ Sbjct: 346 KALEHVYLNLFDQYGLR-----------KKIFIITDGEFTDRNETQE 381 >gi|281208683|gb|EFA82859.1| type A von Willebrand factor domain-containing protein [Polysphondylium pallidum PN500] Length = 2327 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 35/248 (14%), Positives = 75/248 (30%), Gaps = 22/248 (8%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 V A + + K++ + +I + + N + I T Sbjct: 822 AVSGAQVTQSSTSTVQVKEIGKSNFTVQIGIEMPYNIV---KLNSPTHQIRSKKTHTKAT 878 Query: 118 IVVVP-QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM-PITSSVKVNSQTDARLDMM 175 + + ++ G + + + P ++ + H M + + + + Sbjct: 879 VELDRVESLGTNFQLLIGLEDPYSPRMWVEVDNSGHHASMLAFYPKLDIEHGDQPSI-VT 937 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 IVLD+S SM + ++++ + ++ + N+VQ G + IE Sbjct: 938 IVLDLSASMHG------DPFEDMMRAVRLTITNLRGMNIKFNIVQFGDIFDWLFIEHVPP 991 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 E + I L T L + M + IV T Sbjct: 992 TEANLQLAWSHINSLRPSYGGT----ALHLPLQSLILMSDQTPSNRPHN------IVLFT 1041 Query: 296 DGENLSTK 303 DG+ + Sbjct: 1042 DGQIANPP 1049 >gi|226327211|ref|ZP_03802729.1| hypothetical protein PROPEN_01077 [Proteus penneri ATCC 35198] gi|225204429|gb|EEG86783.1| hypothetical protein PROPEN_01077 [Proteus penneri ATCC 35198] Length = 178 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 52/167 (31%), Gaps = 12/167 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I I ML ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHAVNVGIQTMLNALRQDPYALESVHISIITYDNEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 EF L I T + L+ + D + Sbjct: 57 REFIPLTALEDFQFSDIT--VPSSGGTFTGAALECLIQCVDRDIKRSDGDQKGDWRP--L 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + MTDG +++ K+ + A + H+ L+ Sbjct: 113 VFLMTDGTPSDAYAYGEAIK--EVKKRSFGSIIACAVGPKAKHDHLK 157 >gi|293342302|ref|XP_001059971.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|293354106|ref|XP_344486.4| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|149034121|gb|EDL88891.1| similar to hypothetical protein 4933430J11 (predicted) [Rattus norvegicus] Length = 535 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 74/204 (36%), Gaps = 45/204 (22%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D+ ++LD S S+ + I + E + N + +S L+ + Sbjct: 62 QSSFDLYLILDKSGSVAKNW-----------IYIYSFAEGLVKKFTKNEIHKSLLLLKNI 110 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ G+ +I+ + G S Sbjct: 111 EPQGLTHMQKGLIKANEQIQKSAARGHRAVS----------------------------- 141 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 +IV +TDG L K ++ +A+K GAI+Y +G+ + + + +PN+ + V Sbjct: 142 -VIVALTDGL-LLLKPYLDTMEEAKKARKLGAIIYTVGVFMYSKQQLVNIANNPNNNFGV 199 Query: 349 ENPHSMYDAFSHIGKDIVTKRIWY 372 + + A + +V+K Sbjct: 200 DGG---FAALDSVIDPLVSKSCAE 220 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 64/165 (38%), Gaps = 18/165 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES------FFDSSITKIDMAIKSINA 204 + I++ T ++N Q ++ +V+D S SM+ + ++I++ID+ ++ Sbjct: 1359 GKDILIGDTGGTQLNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKN 1418 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNST 260 + + +K NV ++ F +E +S L KI L G + Sbjct: 1419 LADSLKGHDGKINV---SIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTNYED 1475 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 LK FD Q + N F+TDG+ + + Sbjct: 1476 AFLKT--TSWFDTQSVTYGKAQGYENLT---YFLTDGDPTFSNRN 1515 >gi|3024046|sp|P97278|ITIH1_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|1694688|dbj|BAA13938.1| inter-alpha-trypsin inhibitor heavy chain 1 [Mesocricetus auratus] Length = 914 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 70/196 (35%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +L +VK + LV F ++++ Sbjct: 296 LVFVIDISGSMEG------QKVKQTKEALLKILGDVKPGDSFD------LVLFGSRVQSW 343 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + Q ++ S +TN GL + QG ++ + Sbjct: 344 KGSLVPATQANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNKAQGSHPELSSPAS--- 399 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL + NS Sbjct: 400 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFNFLEVMSMENSGWA 458 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 459 QRIYEDHDATQQLQGF 474 >gi|313216074|emb|CBY37452.1| unnamed protein product [Oikopleura dioica] Length = 392 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 72/200 (36%), Gaps = 35/200 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N Q DA D++ V+D S S + +D + ++ E + + V+ G+V Sbjct: 41 NPQLDANYDLVFVVDRSDSTPEEY------LDAYKSFMKKIVIERPVSDNN---VRVGIV 91 Query: 225 TFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 T+++ +E LE S + I L +G + + + +Q+ Sbjct: 92 TYASDVEFEVNLEDSNSQADILNIIDNLRIYGSGRKTANAIDFLLDQV---------DRK 142 Query: 283 EDANYKKIIVFMTDGENLSTKED-------QQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + + +T G + + Q+ +CN Y+IG+ S E Sbjct: 143 GREHIPMVSILLTSGISDEDRLKVILAGWRVQAATWCNA--------YSIGVGPFISEEE 194 Query: 336 LRACASPNSFYLVENPHSMY 355 + + Y + + + Sbjct: 195 MFLLSGDLQHYQLISSPDLL 214 >gi|306845564|ref|ZP_07478133.1| norD protein [Brucella sp. BO1] gi|306273885|gb|EFM55712.1| norD protein [Brucella sp. BO1] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|261313650|ref|ZP_05952847.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261302676|gb|EEY06173.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] Length = 391 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 198 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 251 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 252 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 300 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 301 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 357 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 358 AYLPALFGRGGYALVANLAKLPVALPAIYR 387 >gi|256015034|ref|YP_003105043.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] gi|255997694|gb|ACU49381.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|254719982|ref|ZP_05181793.1| hypothetical protein Bru83_10644 [Brucella sp. 83/13] gi|265984989|ref|ZP_06097724.1| norD [Brucella sp. 83/13] gi|306837885|ref|ZP_07470746.1| norD protein [Brucella sp. NF 2653] gi|264663581|gb|EEZ33842.1| norD [Brucella sp. 83/13] gi|306407055|gb|EFM63273.1| norD protein [Brucella sp. NF 2653] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|254712101|ref|ZP_05173912.1| hypothetical protein BcetM6_01657 [Brucella ceti M644/93/1] gi|254715172|ref|ZP_05176983.1| hypothetical protein BcetM_01672 [Brucella ceti M13/05/1] gi|261216880|ref|ZP_05931161.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261319749|ref|ZP_05958946.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260921969|gb|EEX88537.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261292439|gb|EEX95935.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|260794975|ref|XP_002592482.1| hypothetical protein BRAFLDRAFT_68970 [Branchiostoma floridae] gi|229277702|gb|EEN48493.1| hypothetical protein BRAFLDRAFT_68970 [Branchiostoma floridae] Length = 806 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 56/170 (32%), Gaps = 29/170 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM KI A +++ L+ + P G + + + Sbjct: 276 FIFILDRSGSMSGN------KIKNARETLLLFLKSL---PIGCYFNIVGFGSTHESLFKG 326 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 S + + + T L+Y Y Q A + + + Sbjct: 327 SEKYDNKSLKTACKALGKMEADLGGTEILQPLQYVYKQ------------PPIAGHPRQL 374 Query: 292 VFMTDGENLSTKEDQQ-SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDGE T+ + + + A+ +++GI S ++ A Sbjct: 375 FLLTDGEVWDTQACVREVAKHADSAR-----CFSVGIGEGASTALVKGVA 419 >gi|161620327|ref|YP_001594213.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163844435|ref|YP_001622090.1| hypothetical protein BSUIS_B0256 [Brucella suis ATCC 23445] gi|225628708|ref|ZP_03786742.1| Protein norD [Brucella ceti str. Cudo] gi|254699541|ref|ZP_05161369.1| hypothetical protein Bsuib55_01579 [Brucella suis bv. 5 str. 513] gi|254702662|ref|ZP_05164490.1| hypothetical protein Bsuib36_01702 [Brucella suis bv. 3 str. 686] gi|254706206|ref|ZP_05168034.1| hypothetical protein BpinM_04195 [Brucella pinnipedialis M163/99/10] gi|254711500|ref|ZP_05173311.1| hypothetical protein BpinB_14857 [Brucella pinnipedialis B2/94] gi|256029869|ref|ZP_05443483.1| hypothetical protein BpinM2_04290 [Brucella pinnipedialis M292/94/1] gi|256059516|ref|ZP_05449716.1| hypothetical protein Bneo5_04090 [Brucella neotomae 5K33] gi|256158038|ref|ZP_05455956.1| hypothetical protein BcetM4_04255 [Brucella ceti M490/95/1] gi|256253005|ref|ZP_05458541.1| hypothetical protein BcetB_01592 [Brucella ceti B1/94] gi|260167029|ref|ZP_05753840.1| hypothetical protein BruF5_01342 [Brucella sp. F5/99] gi|260568430|ref|ZP_05838899.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|261220099|ref|ZP_05934380.1| protein norD [Brucella ceti B1/94] gi|261319110|ref|ZP_05958307.1| norD [Brucella pinnipedialis B2/94] gi|261323486|ref|ZP_05962683.1| protein norD [Brucella neotomae 5K33] gi|261749993|ref|ZP_05993702.1| protein norD [Brucella suis bv. 5 str. 513] gi|261753246|ref|ZP_05996955.1| protein norD [Brucella suis bv. 3 str. 686] gi|261756415|ref|ZP_06000124.1| von Willebrand factor [Brucella sp. F5/99] gi|265986887|ref|ZP_06099444.1| protein norD [Brucella pinnipedialis M292/94/1] gi|265996553|ref|ZP_06109110.1| protein norD [Brucella ceti M490/95/1] gi|294853265|ref|ZP_06793937.1| norD [Brucella sp. NVSL 07-0026] gi|161337138|gb|ABX63442.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163675158|gb|ABY39268.1| Hypothetical protein, conserved [Brucella suis ATCC 23445] gi|225616554|gb|EEH13602.1| Protein norD [Brucella ceti str. Cudo] gi|260155095|gb|EEW90176.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|260918683|gb|EEX85336.1| protein norD [Brucella ceti B1/94] gi|261298333|gb|EEY01830.1| norD [Brucella pinnipedialis B2/94] gi|261299466|gb|EEY02963.1| protein norD [Brucella neotomae 5K33] gi|261736399|gb|EEY24395.1| von Willebrand factor [Brucella sp. F5/99] gi|261739746|gb|EEY27672.1| protein norD [Brucella suis bv. 5 str. 513] gi|261742999|gb|EEY30925.1| protein norD [Brucella suis bv. 3 str. 686] gi|262550850|gb|EEZ07011.1| protein norD [Brucella ceti M490/95/1] gi|264659084|gb|EEZ29345.1| protein norD [Brucella pinnipedialis M292/94/1] gi|294818920|gb|EFG35920.1| norD [Brucella sp. NVSL 07-0026] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|148558518|ref|YP_001257290.1| norD protein [Brucella ovis ATCC 25840] gi|148369803|gb|ABQ62675.1| norD protein [Brucella ovis ATCC 25840] Length = 633 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 85/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|315150240|gb|EFT94256.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0012] Length = 711 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 66/171 (38%), Gaps = 31/171 (18%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K + Sbjct: 114 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM---------KGE 159 Query: 197 MAIKSINAMLEEVKLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 ++N + + + + D + V+ G+V ++ L + + + L F Sbjct: 160 KLNNALNGLQQFGEELSDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFL 212 Query: 255 VSTN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 +T S ++ ++G R N KK++V + DG ++ Sbjct: 213 RTTAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 258 >gi|257886572|ref|ZP_05666225.1| predicted protein [Enterococcus faecium 1,141,733] gi|257822626|gb|EEV49558.1| predicted protein [Enterococcus faecium 1,141,733] Length = 689 Score = 47.5 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/259 (14%), Positives = 81/259 (31%), Gaps = 56/259 (21%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + +Q +D+++V+D S SM K+ A+K + EE+ Sbjct: 96 LFDVTLAIKGNQLKKPIDLVMVIDYSSSMTG------EKLSNALKGLQEFGEELD-DSLE 148 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + ++ G+V ++ + ++ L+ ++ ++ T GL + + + Sbjct: 149 SGNIRIGIVAYNRFVYSTDDFLTDINQLEYFLRNTAESHTGTFMQKGLLEGQSLLEEKSR 208 Query: 276 MRQH------------------------------CNTEDANYKKIIV-FMTD-------G 297 + + + + F TD G Sbjct: 209 PEAEKMLVHIGDDSANRSYLPKENAQVFHNSGEIVDYNGYHTDQYVTEFQTDSEKYQTSG 268 Query: 298 ENLSTKEDQQSLYYCNEA--------KKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 + S N+A K G Y++ E++ R AS + YL Sbjct: 269 SSTDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLS 328 Query: 349 --ENPHSMYDAFSHIGKDI 365 EN + +A I + Sbjct: 329 IDENLTGLGNALKEIANGM 347 >gi|327542748|gb|EGF29214.1| conserved hypothetical protein, membrane [Rhodopirellula baltica WH47] Length = 824 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 62/171 (36%), Gaps = 12/171 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL---IPDVNNVVQSGLV 224 T ++ +D S SM S + D +S+ + + + I + + ++ Sbjct: 80 TGQPARLIFAVDQSLSMGVSDSSDLQSPDRLQRSLERLFGKDEKDGWIESLRSTHLIDVM 139 Query: 225 TFS---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR---- 277 +F + + + E + + G +TN + L+ + R Sbjct: 140 SFDATAHSRWKSYDTEDAATESPSGTFTIEANGSTTNLSAALRPVLPATATSEPNRSSDF 199 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N+E +V MTDG + + +D L + + + G ++ +G+ Sbjct: 200 ASNNSETTGASLAVVLMTDGRDSAGVDDASDLAW--QLSQSGWQIHTLGVG 248 >gi|308501381|ref|XP_003112875.1| CRE-CLEC-160 protein [Caenorhabditis remanei] gi|308265176|gb|EFP09129.1| CRE-CLEC-160 protein [Caenorhabditis remanei] Length = 632 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 66/190 (34%), Gaps = 38/190 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ V+DVS M + + M IN ++ ++ L P++ VQ GL+ +SN+ E Sbjct: 282 IDIVFVIDVSEGMG------LGGLMMVKAEINTLVGQMSLDPNIQKHVQVGLIKYSNEAE 335 Query: 232 EFFLLE------------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 F W LQ + + + N GL+ A I M+ + Sbjct: 336 IVFKPSDYDDEDEFTEDLWSDPRLQ-DVDKVDEV----NLHLGLQKAAKMIGSMRRGVRK 390 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA- 338 A + + + + + ++ G + + S + Sbjct: 391 VVVVYAA------------SYNDEGNDDARQIAANIRESGYEIITVAFVEPESSNLVMKI 438 Query: 339 --CASPNSFY 346 ASP + Sbjct: 439 GEIASPRMNF 448 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 36/217 (16%), Positives = 81/217 (37%), Gaps = 30/217 (13%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 P+ ++ S+Y+ +KF + K++++ ++ +I LD+ Sbjct: 39 DPELLKLQMADTSKYRPSVKFWLLQAALALLIIMFARPQMGSKISNEKRRGIETIIALDI 98 Query: 181 SRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 S SM E S + K + ++ L+ + N + GLV F+ + + Sbjct: 99 SNSMMAEDVVPSRLAKSKLMVE---------NLVDNFTND-KIGLVVFAGEAFVQLPITS 148 Query: 239 GVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + ++ TN ++ + + + + I+ +T Sbjct: 149 DYVSAKMFLQNADPSLITTQGTNIAQAIRLSMSSFTQQDKVG-----------RAIILIT 197 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 DGE+ + ++L EA+K+G VY +G+ + Sbjct: 198 DGEDH----EGEALEAAKEARKKGINVYILGVGETKG 230 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/151 (22%), Positives = 56/151 (37%), Gaps = 21/151 (13%) Query: 170 ARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +D +M++LD+S SM T++ A ++I +++ L D V+ LVTFS Sbjct: 1418 APIDTNLMVILDLSGSMGQETP---TRLSRAKEAIQNLIDGYDLYGD----VRVQLVTFS 1470 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 W + + + + STN L A N + N Sbjct: 1471 TT--GASQQAWMTAAEAKALVQNLQAAGSTNYDAALAAAMNGFSATGKLDGAQN------ 1522 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + F+TDGE D + N + Sbjct: 1523 --VSYFLTDGEPTL--GDGNTAQLANSSNSS 1549 >gi|296487102|gb|DAA29215.1| von Willebrand factor [Bos taurus] Length = 2850 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 75/220 (34%), Gaps = 24/220 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P ++ LD++++LD S + S + D A + + L P Sbjct: 1712 DGPHLPTLAPAPDCSQPLDVVLLLDGSST------SPASYFDEMKSFAKAFISKANLGP- 1764 Query: 215 VNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + Q ++ + + ++ + + +HL + + + G + L +A I Sbjct: 1765 --QLTQVSVLQYGSNTNVDVPWDIHVDKAHLLSLVDPMHREGGPSQVGNALSFAARYITS 1822 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H A+ K++V + G ++ + E + + V+ IGI Sbjct: 1823 Q----VHGARPSAS--KVVVILVTGSSMDSVEAAAAAA-----RSNRVAVFPIGIGDQYD 1871 Query: 333 HEFLRACASPNSFYLVENPHSMYD--AFSHIGKDIVTKRI 370 LR A P + V + D + +G + Sbjct: 1872 AAQLRVLAGPGASSNVAELQRIEDLPSMVALGNSFFQRLC 1911 >gi|296269618|ref|YP_003652250.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092405|gb|ADG88357.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 223 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 53/135 (39%), Gaps = 11/135 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ L +V D S SME +D + + + E+ P V + + G++ FS Sbjct: 2 SEQVLPFYLVCDESYSMEG------PPLDAINQELPEIYREIASNPVVADRARLGIIGFS 55 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ E L ++ + I L+ G TN A + Q ++ Y Sbjct: 56 DRAEVLLPLSD-LNDVHS-IPQLAPRG-GTNYGAAF--ALLKSTIEQDVQALKQAGHRPY 110 Query: 288 KKIIVFMTDGENLST 302 + + F+TDG+ Sbjct: 111 RPCVFFLTDGQPTYE 125 >gi|61806576|ref|NP_001013521.1| hypothetical protein LOC541376 [Danio rerio] gi|60649627|gb|AAH90437.1| Zgc:113325 [Danio rerio] Length = 450 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 76/194 (39%), Gaps = 22/194 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + D NV + + Sbjct: 214 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVIEMLDTL-SDDDYVNVARFNEKAY 266 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L++ + + + + + + +T+ G ++A++Q+ + Sbjct: 267 AVVPCFTTLVQANIKNKKIFKEAVMNMQAKGTTDYKTGFQFAFDQLLNDTSA------PR 320 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LR--ACAS 341 AN K+I+ TDG ++ + + N R V+ + L+ AC + Sbjct: 321 ANCNKMIMMFTDGGEDRAQDIFEKYNWPN----RTVRVFTFSVGQHNYDVTPLQWIACFN 376 Query: 342 PNSFYLVENPHSMY 355 ++ + + ++ Sbjct: 377 KGYYFEIPSIGAIR 390 >gi|328887902|ref|NP_001192237.1| von Willebrand factor [Bos taurus] Length = 2809 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 75/220 (34%), Gaps = 24/220 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P ++ LD++++LD S + S + D A + + L P Sbjct: 1671 DGPHLPTLAPAPDCSQPLDVVLLLDGSST------SPASYFDEMKSFAKAFISKANLGP- 1723 Query: 215 VNNVVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + Q ++ + + ++ + + +HL + + + G + L +A I Sbjct: 1724 --QLTQVSVLQYGSNTNVDVPWDIHVDKAHLLSLVDPMHREGGPSQVGNALSFAARYITS 1781 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H A+ K++V + G ++ + E + + V+ IGI Sbjct: 1782 Q----VHGARPSAS--KVVVILVTGSSMDSVEAAAAAA-----RSNRVAVFPIGIGDQYD 1830 Query: 333 HEFLRACASPNSFYLVENPHSMYD--AFSHIGKDIVTKRI 370 LR A P + V + D + +G + Sbjct: 1831 AAQLRVLAGPGASSNVAELQRIEDLPSMVALGNSFFQRLC 1870 >gi|168065273|ref|XP_001784578.1| predicted protein [Physcomitrella patens subsp. patens] gi|162663855|gb|EDQ50597.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1185 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 61/169 (36%), Gaps = 29/169 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM+ T I A +++ L + N++ G ++K Sbjct: 467 LIFVVDRSGSMQG------TPIKQAGQALELFLRSIPCEDHYFNIIGFGD---NHKTLFP 517 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + L + ++Y + A+ +IF +H + I Sbjct: 518 KSTPYNEETLTKGLRYAQALEAD-MGGTEMMSAFEEIF------EHR---RRDVPTQIFL 567 Query: 294 MTDGENLSTKEDQQSLYYC-NEAK-----KRGAIVYAIGIRVIRSHEFL 336 +TDGE D SL C +AK V+++GI SH + Sbjct: 568 LTDGEIW----DVDSLIECIRDAKKEEKSDNFVRVFSLGIGSNVSHHLV 612 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 L +++V+D S S S +K+++A ++ + + G+V F + Sbjct: 405 PSLGLVLVIDKSGS-MSEGSGGYSKVELAKEAAIQATSILGPLDMA------GVVAFDDT 457 Query: 230 IEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + +Q I + G T+ P L AY + D Sbjct: 458 AQWVVEFQAVKDKDAIQDDIATIRADG-GTSIYPALALAYTALKDAHTK----------- 505 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSF 345 K I+ +TDG++ +T + + + G + + + L A+ + Sbjct: 506 FKHIILLTDGQSATTGDY---YFLSRRMARAGITMSTVAVGEGADTLLLEQLAAWGQGRY 562 Query: 346 YLVENPHSMYDAF-SHIGKDIVTKRI 370 Y + ++ F K I + + Sbjct: 563 YFSDEISNIPRIFTKETMKAIKSYLV 588 >gi|333028697|ref|ZP_08456761.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] gi|332748549|gb|EGJ78990.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] Length = 418 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 77/221 (34%), Gaps = 40/221 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A + +VLDVS SM + +++ A +S N +L+ V DV +++ + Sbjct: 30 KAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAVP--EDVELGIRTLGADYPG 87 Query: 229 KIEEFFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + E + + + + LS G T P L A + + + Sbjct: 88 EDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGW-TPIGPALLGAADDLRGGEAS--- 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY--AIGIRV-IRSH 333 K IV +TDGE D C A++ +G + +G+ ++ Sbjct: 144 ---------KRIVLITDGE------DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTR 188 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + L C A+ ++ V + + + + Sbjct: 189 DQLS-CIAEATGGTYTTVRHTKDLSGRVKQLVHRAADPVVT 228 >gi|318058741|ref|ZP_07977464.1| hypothetical protein SSA3_12405 [Streptomyces sp. SA3_actG] Length = 418 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 77/221 (34%), Gaps = 40/221 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A + +VLDVS SM + +++ A +S N +L+ V DV +++ + Sbjct: 30 KAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAVP--EDVELGIRTLGADYPG 87 Query: 229 KIEEFFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + E + + + + LS G T P L A + + + Sbjct: 88 EDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGW-TPIGPALLGAADDLRGGEAS--- 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY--AIGIRV-IRSH 333 K IV +TDGE D C A++ +G + +G+ ++ Sbjct: 144 ---------KRIVLITDGE------DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTR 188 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + L C A+ ++ V + + + + Sbjct: 189 DQLS-CIAEATGGTYTTVRHTKDLSGRVKQLVHRAADPVVT 228 >gi|296228555|ref|XP_002807723.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Callithrix jacchus] Length = 1197 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 77/244 (31%), Gaps = 50/244 (20%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-----------DMMIVLDVSRSMESFFDSSITKIDMA 198 S I + V + + L D++IVLD S S+ + +++ Sbjct: 161 GSSVFSSGICAHVDASFRPQGSLAPTAQRCPTYMDVVIVLDGSNSIYPW-----SEVQTF 215 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGV 255 ++ + L I +Q GLV + + L + + + G Sbjct: 216 LRRLVGKL----FIDPEQ--IQVGLVQYGESPVHEWSLGDFRTKEEVVSAAKNLSRREGR 269 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + + A + F + ++ V +TDGE D + C Sbjct: 270 ETKTAQAIMVACTEGFSQS------HGGRPEAARLXVVVTDGE----SHDGEDTKGCEAG 319 Query: 316 KKRGAIVYAIGI------RVIRSHEFLR----ACASPNS--FYLVENPHSMYDAFSHIGK 363 + Y I + R FLR + P+ F+ V + ++ D +G Sbjct: 320 R---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDPDERFFFNVTDEAALTDIVDALGD 376 Query: 364 DIVT 367 I Sbjct: 377 RIFG 380 >gi|302517652|ref|ZP_07269994.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426547|gb|EFK98362.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 418 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 77/221 (34%), Gaps = 40/221 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A + +VLDVS SM + +++ A +S N +L+ V DV +++ + Sbjct: 30 KAPPQVELVLDVSGSMRAKDIDGASRMSAAKQSFNEVLDAVP--EDVELGIRTLGADYPG 87 Query: 229 KIEEFFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + E + + + + LS G T P L A + + + Sbjct: 88 EDRETGCKDTRQLYPVGHPDRTEAKAAVATLSPTGW-TPIGPALLGAADDLRGGEAS--- 143 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY--AIGIRV-IRSH 333 K IV +TDGE D C A++ +G + +G+ ++ Sbjct: 144 ---------KRIVLITDGE------DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTR 188 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + L C A+ ++ V + + + + Sbjct: 189 DQLS-CIAEATGGTYTTVRHTKDLSGRVKQLVHRAADPVVT 228 >gi|50355937|ref|NP_571316.1| complement component bfb [Danio rerio] gi|49904460|gb|AAH76051.1| Complement component bfb [Danio rerio] Length = 456 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 25/219 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQ 220 K+ +LD+ I +DVS S+ + A + I +LE++ + ++ Sbjct: 245 KIYLNKGGKLDIYIAVDVSDSIND--------LKKAKQIIKTLLEKISYYEVSPSYEILM 296 Query: 221 SGLVTFSNKIEEFFL---LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + F + +S + + + + Y I D Sbjct: 297 FATDVYQIVKMRDFKTEEVAGSLSKVFEDLDKFDFDKKLDQKSSNIAKLYQTILDSMSNE 356 Query: 278 QHCNTED-ANYKKIIVFMTDGE-NLSTKEDQQSLYY------CNEAKKRGAIVYAIGIRV 329 Q N ED K +++ TDG+ N+ + + ++ +Y G+ Sbjct: 357 QIRNKEDFLQTKHVVIVFTDGQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFGVGK 416 Query: 330 IRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKD 364 E L S F+ +++ + F + D Sbjct: 417 DVRTEDLNGLVSEKENERHFFKLQDLDEVQKTFDSLMND 455 >gi|187939945|gb|ACD39081.1| hypothetical protein PACL_0293 [Pseudomonas aeruginosa] Length = 223 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 71/209 (33%), Gaps = 16/209 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K QT L ++++ D S SM S KI+ K + M+ + +Q Sbjct: 6 KFQVQTARPLPIIVLADTSGSM-----SVDGKIEALNKGLKDMISSFAGESRLRAEIQVS 60 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++TF + E L LQ +++ T L A I D + Sbjct: 61 VITFGGSLAELNLPLTPAHQLQSFTPLVAEGM--TPLGGALSLASEMIEDKD------SI 112 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 YK +IV ++DG + + N + A +A+ I L A+ Sbjct: 113 PSRAYKPVIVLVSDGYPNDDWQGPFARLV-NGERSSKATRFAMAIGADADEVMLSDFAND 171 Query: 343 NS--FYLVENPHSMYDAFSHIGKDIVTKR 369 + EN ++ F + + + Sbjct: 172 PEAPLFHAENARDIHRFFRAVTMSVSARS 200 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/225 (16%), Positives = 78/225 (34%), Gaps = 26/225 (11%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 R L + P+ + IS+Y+ +KF + +V K++ Sbjct: 29 RRKAKLKKLGDPELLKQLMPNISKYRPTVKFVLMLAALALLIVMVARPQMGSKISHDKRH 88 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ +I LD+S SM D +++D + I +++ + GL+ F+ Sbjct: 89 GIETIICLDISNSMLCQ-DVVPSRLDKSKMLIENLVDNFNND-------KIGLIVFAGDA 140 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ ++ TN + A + N Sbjct: 141 FVQLPITTDYVSAKMFLQNITPGLIQTQGTNIGAAIDLASKSFT-----------QQENV 189 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + K+G V+ +GI + Sbjct: 190 GRAIIVITDGENHEPGAQEAAAAA----NKKGINVFILGIGNTKG 230 >gi|310825488|ref|YP_003957846.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309398560|gb|ADO76019.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 414 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 73/212 (34%), Gaps = 30/212 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + +++ +T + N+ A + + +++D S SM + A+ + +++ + Sbjct: 14 QGKEVLLLVTLEAEENT-PRAPVAVNLLIDRSASMRG------APLVAAVDAAQSLVAQA 66 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 GL+ F E+ + + L ++ L T S L Sbjct: 67 GPRD------YIGLLAFDGVPEQLLPVRAMEPDAKTELSERLSSLE-----TGSGTALHE 115 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYA 324 A A K ++ +TDGE + + G ++A Sbjct: 116 AVE---LGSSSLHRVLIPGARRK--LLLLTDGEPSVGPAALADFKTLGAKVAESGVTLHA 170 Query: 325 IGIRVIRSHEFLRACASPN--SFYLVENPHSM 354 +G+ E L A +SP+ F ++ ++ Sbjct: 171 LGLGRHYIPEMLEALSSPSGTGFAHADDAEAL 202 >gi|149199576|ref|ZP_01876610.1| von Willebrand factor, type A [Lentisphaera araneosa HTCC2155] gi|149137372|gb|EDM25791.1| von Willebrand factor, type A [Lentisphaera araneosa HTCC2155] Length = 512 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/327 (12%), Positives = 101/327 (30%), Gaps = 43/327 (13%) Query: 45 FMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV 104 K S S A + + + + K D+ + + + + Sbjct: 201 HAKQKYPSKTKPSKSQAQQAEPKQSSLESLVQSKAQDLPDDMGTAISKQIKAHHNLQDYG 260 Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + + +N A L+ S S+ ++ Sbjct: 261 ETCQVAEPKAKTQVALSTKNIQEVDRASCGLNARLQALLQASQLKRSFPARRGRLSTQRL 320 Query: 165 NSQTDAR-------------LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 N + + +++D S SM+ ++ + ++ A+L+ ++ Sbjct: 321 NRLNFSPKIFLRHEQRLGLNTSVHLLMDASGSMQG-------RMSLGSQASYALLKALRS 373 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +P + S + F + + + K + +T P L + Q Sbjct: 374 VPGIK----SAMTCFPGSSLQPSVHQVLKYDQPLHNKISVRPDGTTPLAPALWWLMQQ-- 427 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +KII+ +TDG+ + ++++ +A G +Y IGI+ Sbjct: 428 ---------DALRTETRKIIIILTDGDPDCVEMSKKAIA---KATSLGFEIYGIGIQCSS 475 Query: 332 SHEFLRACASPNSFYLVENPHSMYDAF 358 + L P++ ++ + + A Sbjct: 476 IKQLL-----PDAHKIIHDLKDLAPAL 497 >gi|115380522|ref|ZP_01467490.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] gi|115362473|gb|EAU61740.1| hypothetical membrane associated protein [Stigmatella aurantiaca DW4/3-1] Length = 424 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 73/212 (34%), Gaps = 30/212 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + +++ +T + N+ A + + +++D S SM + A+ + +++ + Sbjct: 24 QGKEVLLLVTLEAEENT-PRAPVAVNLLIDRSASMRG------APLVAAVDAAQSLVAQA 76 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 GL+ F E+ + + L ++ L T S L Sbjct: 77 GPRD------YIGLLAFDGVPEQLLPVRAMEPDAKTELSERLSSLE-----TGSGTALHE 125 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYA 324 A A K ++ +TDGE + + G ++A Sbjct: 126 AVE---LGSSSLHRVLIPGARRK--LLLLTDGEPSVGPAALADFKTLGAKVAESGVTLHA 180 Query: 325 IGIRVIRSHEFLRACASPN--SFYLVENPHSM 354 +G+ E L A +SP+ F ++ ++ Sbjct: 181 LGLGRHYIPEMLEALSSPSGTGFAHADDAEAL 212 >gi|332249219|ref|XP_003273761.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor-like [Nomascus leucogenys] Length = 2813 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + S LD++ +LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPASDCSQPLDVIFLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGPH---LTQVSVLQYGSITTIDVP--WNVAPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|309791907|ref|ZP_07686390.1| magnesium chelatase ATPase subunit D [Oscillochloris trichoides DG6] gi|308226079|gb|EFO79824.1| magnesium chelatase ATPase subunit D [Oscillochloris trichoides DG6] Length = 657 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 51/170 (30%), Gaps = 39/170 (22%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ--SGLVT 225 + A V+D S SM ++ A ++NA+L++ V + L+ Sbjct: 459 SKAGTLFCFVVDASGSMALH------RMRQAKGAVNALLQQA-------YVHRDHVALLA 505 Query: 226 F-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F + + V +R + L G T L AY + H T Sbjct: 506 FRGERADLLLPPSQSVELAKRALDVLPT-GGGTPLAAALLSAYQVAEQARSRGIHRTT-- 562 Query: 285 ANYKKIIVFMTDGENLST--------------KEDQQSLYYCNEAKKRGA 320 +V +TDG + + C+ K G Sbjct: 563 ------LVLITDGRPNVPLQPMPGQSKDERMAQARTEVQTLCSRLKSAGI 606 >gi|294674674|ref|YP_003575290.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294471870|gb|ADE81259.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 212 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 44/136 (32%), Gaps = 20/136 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + ++ ++ P ++TF++ Sbjct: 3 RLPVYLLLDTSGSMYG------EPIEAVKNGVQTLVSTLRSDPYALETAYISIITFNSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 ++ L L + T L Q + A K Sbjct: 57 QQIAPLT----ELASFQPPVIDASGCTALGEALN------LLAQKVDTEIVKTTAEVKGD 106 Query: 289 -KIIVF-MTDGENLST 302 K IVF MTDGE Sbjct: 107 WKPIVFIMTDGEPTDD 122 >gi|290995707|ref|XP_002680424.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094045|gb|EFC47680.1| vWFA domain-containing protein [Naegleria gruberi] Length = 382 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 61/159 (38%), Gaps = 22/159 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ +D++IV+D + SM +ID A K++ +L +K ++ +++ Sbjct: 61 SENIVDLVIVMDCTGSMSG-------EIDAAKKTVQTILSSLK--DHFKTDLRFSAISYR 111 Query: 228 NKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +++ F + + I +S G + L A + +M ++ Sbjct: 112 DHSDDYAVREFPFTKDIQKAKSYIDTMSAQGGG-DHPEALASALKVVNEMPFNKKG---- 166 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 KKI +++ D +Q + Y K V Sbjct: 167 ----KKICIWIADAPPHGMNSNQGADSYPEGCKDENGKV 201 >gi|220678711|emb|CAX12780.1| novel protein similar to H.sapiens ANTXR1, anthrax toxin receptor 1 (ANTXR1) [Danio rerio] Length = 270 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 55/156 (35%), Gaps = 16/156 (10%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGM 276 ++ + FS + L + R + L + G T GL+ A QI+ Sbjct: 2 LRMSFIVFSTRGTTIMRLTENRDDITRGLNTLKREIPGGDTYMNLGLEEANVQIY----- 56 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 H N A+ I+ +TDGE + + A+ GAIVY +G++ + Sbjct: 57 --HGNYGAASV---IIALTDGELND-HQFVTAQQEAQRARSMGAIVYCVGVKDFNETQLA 110 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 + + V + A + I+ K Sbjct: 111 TIADTIEHVFPVIGG---FQALEGMIDSIIKKSCIE 143 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 132/396 (33%), Gaps = 43/396 (10%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 L + +G ++TAI L ++ ++++ + L ++D + + + Sbjct: 9 LGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTAWLQEVLDHVIYRTSPKN 68 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + R+ + I +I+ N +L + G + I ++ + + Sbjct: 69 LYD----LREAGRDNFIRHQIEKALNTYNSRDLSNIGSIESIV----KDAVILTKNVNSL 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM-IVLDVS--R 182 + ++ + + + I+ KV + + L +M D Sbjct: 121 PLQFTVDIALSTTVQLRGSLLQMFSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLA 180 Query: 183 SMESFFDSSITKIDMAIKS--------INAMLEEVKLIPDVNNVVQSGLVTFSNKIE-EF 233 S DS + S N +++ N +S+ + Sbjct: 181 SRGKVADSKVHPPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFC-MAPYHYSSILYWAV 239 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQ------HCNTED 284 L + V + +Y +T +K ++ + G Q HC Sbjct: 240 GTLTYSVDNKTTTREYYKDPYYATWDHFPYSFIKNVFDMTSNQFGDGQVLTNTNHCFPHG 299 Query: 285 ANYKKIIVFMTDGENLSTKED-----QQSLYYCNEAKKR------GAIVYAIGIRV-IRS 332 A+ K ++ + G LS ++ L C+ KR ++++G + Sbjct: 300 ASQNKYMLMLAIGNQLSRSSVEKEKIEKVLQDCHYMHKRHRTGRDAITIFSVGFSPDQDT 359 Query: 333 HEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 LR CAS P+ +Y + + ++ + ++++T Sbjct: 360 RYTLRQCASDPSKYYEINSDENVMPIAKSLARNVIT 395 >gi|315052466|ref|XP_003175607.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] gi|311340922|gb|EFR00125.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] Length = 740 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 38/218 (17%), Positives = 79/218 (36%), Gaps = 36/218 (16%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 +D +I+ + + P N G + + + I N +++ I Sbjct: 1 MATSSSDDGFEIIEDHQIPL--RPANAG---TTAVTENLTQEPILTIHSIPNKDSMIVSI 55 Query: 159 TSSVKVNSQ-TDARLDMMIVLDVSRSMESFF----------DSSITKIDMAIKSINAMLE 207 +K + D+++V+D+S SM S D+ ++ +D+ + ++E Sbjct: 56 QPPLKPENDVPHVPCDIVLVIDISGSMNSSAPIPTGERGGEDTGLSILDLTKHAARTIIE 115 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGL 263 + + +VTF +++ F L++ S + R I L STN G+ Sbjct: 116 TLNEND------RLAVVTFCTEVKVAFELDFMNKENKSMVLRAINKL-YGTSSTNLWHGI 168 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 K + N + ++V +TDG Sbjct: 169 KEGLKVLTAT--------PVRENVQSLLV-LTDGAPNH 197 >gi|221042220|dbj|BAH12787.1| unnamed protein product [Homo sapiens] Length = 648 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 64/171 (37%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+D S SM S KI +++ +L+++ N +V + + + Sbjct: 275 VVFVIDKSGSM------SGRKIQQTREALIKILDDLSPRDQFNLIVFSTEATQWRPSLVP 328 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V+ + + G TN + A Q+ D + + I+ Sbjct: 329 AS--AENVNKARSFAAGIQALG-GTNINDAMLMAV-QLLDSSNQEERLPEGSVSL---II 381 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA---IGIRVIRSHEFLRACA 340 +TDG+ + + +S+ N ++ + Y+ +G S+ FL A Sbjct: 382 LLTDGDPTVGETNPRSIQ--NNVREAVSGRYSLFCLGFGFDVSYAFLEKLA 430 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 15/186 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 187 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 243 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 244 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 294 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 295 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 352 Query: 356 DAFSHI 361 F + Sbjct: 353 AVFRQV 358 >gi|108945862|gb|ABG23473.1| integrin subunit alpha 11 [Tupaia belangeri] Length = 139 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 56/126 (44%), Gaps = 14/126 (11%) Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS-TNSTP 261 ++ +K +Q G+V + + F L V + ++ + G + T + Sbjct: 8 LINILKKFYIGPGQIQVGVVQYGEDVVHEFHLNDYRNVKDVVEAASHIEQRGGTETRTAF 67 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 G+++A ++ F G + KK+++ +TDGE+ + + ++ + +++K Sbjct: 68 GIEFARSEAFQKGGRKG--------AKKVMIVITDGESHDSPDLEKVIQ---QSEKDNVT 116 Query: 322 VYAIGI 327 YA+ + Sbjct: 117 RYAVAV 122 >gi|68535931|ref|YP_250636.1| putative secreted protein [Corynebacterium jeikeium K411] gi|68263530|emb|CAI37018.1| putative secreted protein [Corynebacterium jeikeium K411] Length = 550 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 58/203 (28%), Gaps = 28/203 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D +LD S SM + I K I+ E L+ FS+K+ Sbjct: 360 PADSNFLLDTSGSMRGNRLGDLKSI--LNKLIDGTAGEGNNPKGFARRETITLMPFSSKV 417 Query: 231 -------EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L+ + L G T + AY+++ + G Sbjct: 418 ADGYTQENYDPDNAQQKQDLRGYVNGLQPRGE-TAIYDAVLRAYDRVGEGGGSLNS---- 472 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACA 340 IV MTDG + S + + V+ I E +R A Sbjct: 473 -------IVLMTDGASNSGTSRKDFITKMTRMMDETNHKIPVFVILYGEASEEE-MRFLA 524 Query: 341 --SPNSFYLVENPHSMYDAFSHI 361 + + V + AF I Sbjct: 525 NFTGGKVFNVR-GGDLAKAFEEI 546 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 15/186 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 187 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 243 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 244 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 294 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 295 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 352 Query: 356 DAFSHI 361 F + Sbjct: 353 AVFRQV 358 >gi|53733375|gb|AAH83523.1| Cfb protein [Danio rerio] Length = 653 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 84/229 (36%), Gaps = 29/229 (12%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K++ +LD+ I +D S S++ K IK++ + ++ P+ Sbjct: 245 KISLDRGGKLDIYIAVDASDSID---PKDFDKAKKIIKTLIEKISYYEVSPNYE------ 295 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRK--------IKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 ++ F+ +++ + ++ + + + + + + Y +I D Sbjct: 296 ILMFATDVDQIVKMRDFKTNEKARKILKIFEDLDNFNYDKKGDRTGTNIAKLYLKILDSM 355 Query: 275 GMRQHCNTED-ANYKKIIVFMTDGE-----NLSTKED-QQSLYYCNEA-KKRGAIVYAIG 326 + Q N ED + +I+ TDG+ N K D ++L N A ++ +Y G Sbjct: 356 SLEQVQNKEDFLQTQHVIIVFTDGQANMGGNPKPKVDLIKNLVIKNNASRENKLDLYVFG 415 Query: 327 IRVIRSHEFLRACASPN----SFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + E + S F+ + + + + F + D + Sbjct: 416 VGKDVKKEDMNGLVSEKKDERHFFKLPDLDEVQNTFDLMLDDSTVVGLC 464 >gi|29832735|ref|NP_827369.1| toxic cation resistance protein [Streptomyces avermitilis MA-4680] gi|29609855|dbj|BAC73904.1| putative toxic cation resistance protein [Streptomyces avermitilis MA-4680] Length = 239 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 49/157 (31%), Gaps = 23/157 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM ++ + + + + D V Sbjct: 26 SLHKHGLDGQRAAVYLVVDYSGSMRPYYKDG--SVQALTDRVLGLSAHLD--DDGTVPV- 80 Query: 221 SGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V FS I+ L H++ + L G T+ + + D Sbjct: 81 ---VFFSTDIDAVTDIALADHQGHIELIVAGLGHMGK-TSYHLAMDAVIDHYLDSGSKD- 135 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +VF TDG S ++ Y C A Sbjct: 136 ---------PAFVVFQTDGGPTSKPAAER--YLCKAA 161 >gi|326478189|gb|EGE02199.1| U-box domain-containing protein [Trichophyton equinum CBS 127.97] Length = 741 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 69/187 (36%), Gaps = 31/187 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFF 188 + + + I N +V+ I +K + D+++V+D+S SM S Sbjct: 27 TVTIEENLTQEPILSIHPIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAA 86 Query: 189 ----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D+ ++ +D+ + ++E + N + +VTF ++ F L+ Sbjct: 87 PIPTGERGGEDTGLSILDLTKHAAKTIIETL------NEKDRLAVVTFCTEVNVAFELDS 140 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + I L STN G+K N + N N + ++V + Sbjct: 141 MNKENKSTVLSAIDKL-YGKSSTNLWHGIKKGLNVLAT--------NPVRGNIQSLLV-L 190 Query: 295 TDGENLS 301 TDG Sbjct: 191 TDGAPNH 197 >gi|326474578|gb|EGD98587.1| hypothetical protein TESG_05957 [Trichophyton tonsurans CBS 112818] Length = 741 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 69/187 (36%), Gaps = 31/187 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFF 188 + + + I N +V+ I +K + D+++V+D+S SM S Sbjct: 27 TVTIEENLTQEPILSIHPIPNKDSMVVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAA 86 Query: 189 ----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D+ ++ +D+ + ++E + N + +VTF ++ F L+ Sbjct: 87 PIPTGERGGEDTGLSILDLTKHAAKTIIETL------NEKDRLAVVTFCTEVNVAFELDS 140 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + I L STN G+K N + N N + ++V + Sbjct: 141 MNKENKSTVLSAIDKL-YGKSSTNLWHGIKKGLNVLAT--------NPVRGNIQSLLV-L 190 Query: 295 TDGENLS 301 TDG Sbjct: 191 TDGAPNH 197 >gi|198424466|ref|XP_002124191.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 48/291 (16%), Positives = 91/291 (31%), Gaps = 42/291 (14%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS-RYKIPLKFCTFIPWYTNSRHIVM 156 L N + + ++ + ++P + R KI + P + + Sbjct: 116 LNTNAVPTGDNQNKSADAIGMTMIPLGGNKTAVCSPGRRKICGRSLRLSPGACYTGTLGA 175 Query: 157 PITSSVKVN--SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +T LD++ V+D S S+ D++ T + I + + E Sbjct: 176 AMTMKTGWGEIPCFKNYLDLVYVVDSSNSIS---DANFTIMKQIIVNASEAFEA-----S 227 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--------------LQRKIKYLSKF--GVSTN 258 + + Q ++ + N F + R IK +T Sbjct: 228 IGDTTQVAVLQYGNLDSAAFDHTDSKYYKSPTKLGDCNDIDCFNRAIKANMTHLNAANTF 287 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 ++ ++ A F KKI+V +TDG+ + S Sbjct: 288 TSLAIRRAVEFDFAQS-------KNKDKAKKILVLITDGQANFQSQLIVSYRLTQ---SH 337 Query: 319 GAIVYAIGIRVIRSHEFLRACA----SPNSFYLVENPHSMYDAFSHIGKDI 365 VYAIG+ ++S LR A S N + A ++ + I Sbjct: 338 NITVYAIGV-ALKSDAELRISANGGVSKERVLDANNYSELSKALRNLTETI 387 >gi|166363914|ref|YP_001656187.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] gi|166086287|dbj|BAG00995.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] Length = 581 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 73/202 (36%), Gaps = 30/202 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +V+D S SME K+ ++ ++ + + L++F+ Sbjct: 396 AKKPSQVAVVIDTSGSMEG------QKLTSVKNTLLNYVQNLGPKE------RIALISFN 443 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + I E ++E R I+++ + S+ G + + ++ + Q+ T+ N Sbjct: 444 SVINEPVIIEGTPQGRDRGIEFIGQL----RSSGGTRLYDSALYARNWLSQNLRTDTINA 499 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG------AIVYAIGIRVIRS---HEFLRA 338 ++ +TDGE+ ++ + L E +K G + IG + Sbjct: 500 ---VLILTDGEDSGSQINLDQLE--QELQKSGFSSDQRIAFFTIGYGKEGEFNPQALQKI 554 Query: 339 CASPNSFYLVENPHSMYDAFSH 360 +Y +P ++ Sbjct: 555 AEVNGGYYRQGDPATISTVMGD 576 >gi|237741293|ref|ZP_04571774.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13] gi|294784500|ref|ZP_06749789.1| von Willebrand factor type A domain protein [Fusobacterium sp. 3_1_27] gi|229430825|gb|EEO41037.1| conserved hypothetical protein [Fusobacterium sp. 4_1_13] gi|294487716|gb|EFG35075.1| von Willebrand factor type A domain protein [Fusobacterium sp. 3_1_27] Length = 228 Score = 47.1 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 64/177 (36%), Gaps = 23/177 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 AR+ + + LD S SME I+ + + E +K + +VTF Sbjct: 15 NPTARVPVCLCLDTSGSMEG------KPIEELNEGVRLFYEAIKEDETALYSAEISVVTF 68 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR--QHCN 281 I +F+ L +Q LS +G + + A N D+ R ++ + Sbjct: 69 GGNAECIRDFYSL-----EIQPDAPRLSAYGKT-----PMGEAVNIGLDLLEKRKEEYKD 118 Query: 282 TEDANYKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 Y+ +V MTDG E+ + + ++ V+ IGI L Sbjct: 119 KGVDYYQPWLVLMTDGGPNGDNEELSRAIRRTVDLVNQKKLTVFPIGIGEEADMNVL 175 >gi|331019621|gb|EGH99677.1| serine/threonine-protein kinase PpkA [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 663 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 75/228 (32%), Gaps = 34/228 (14%) Query: 157 PITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ SQ + ++ V+D + SM+ + I+ ++I + ++ ++ Sbjct: 221 PAQTAANATSQLKEFSAAVVFVIDSTISMDPY-------IERTREAIAKVYAQIAK-ENL 272 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ GLV F + + LE+ V+ + + GLK A Sbjct: 273 GRQVKFGLVAFRSSTQAVPGLEY-VTKMYADPNTVKDGADFLAKAAGLKQAKVSSKSFNE 331 Query: 276 MR-----QHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Q + D + + +V +T DG++ + + EA G +Y Sbjct: 332 DSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIY 391 Query: 324 AIGIRVIRS-----------HEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ + S Y + + AF Sbjct: 392 TLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPVDAGDLN-AFGS 438 >gi|322434934|ref|YP_004217146.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162661|gb|ADW68366.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 373 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 65/172 (37%), Gaps = 27/172 (15%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + ++ D + + +++D S SM ++ +D A + + + Sbjct: 125 GVPQTINSFKHQDLPVSIGLLIDSSGSMYDKRNA----VDKASIDLVKLSNPMDEEF--- 177 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 LV FS + V L + + Y+ G + AY+ + Sbjct: 178 ------LVDFSTEAFIDTDFTTSVDKLSQGLSYIKSSGGTA--------AYDALVASAD- 222 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGI 327 + N K++++ +TDGE+ ++ +QS+ + G ++Y +G+ Sbjct: 223 --YLTKNAKNTKQVLIIITDGEDNASSATLEQSIRRIQDL--DGPVIYCVGL 270 >gi|28870056|ref|NP_792675.1| ppkA-like protein [Pseudomonas syringae pv. tomato str. DC3000] gi|28853302|gb|AAO56370.1| ppkA-related protein [Pseudomonas syringae pv. tomato str. DC3000] Length = 653 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 75/228 (32%), Gaps = 34/228 (14%) Query: 157 PITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ SQ + ++ V+D + SM+ + I+ ++I + ++ ++ Sbjct: 211 PAQTAANATSQLKEFSAAVVFVIDSTISMDPY-------IERTREAIAKVYAQIAK-ENL 262 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ GLV F + + LE+ V+ + + GLK A Sbjct: 263 GRQVKFGLVAFRSSTQAVPGLEY-VTKMYADPNTVKDGADFLAKAAGLKQAKVSSKSFNE 321 Query: 276 MR-----QHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Q + D + + +V +T DG++ + + EA G +Y Sbjct: 322 DSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIY 381 Query: 324 AIGIRVIRS-----------HEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ + S Y + + AF Sbjct: 382 TLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPVDAGDLN-AFGS 428 >gi|307544656|ref|YP_003897135.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] gi|307216680|emb|CBV41950.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] Length = 612 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 68/200 (34%), Gaps = 35/200 (17%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM+ D S +L V L+P V +G+ TF +++ L Sbjct: 47 SGSMKRNDP------DQLAASAMELL--VSLLPGG---VSAGVWTFGERVDNPLPLGEVS 95 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 + + L P L Y Q D++ + + +AN + ++ MTDG Sbjct: 96 EQWRERALALP---------PAL-RDYQQYTDIETALEQAASAEANGWRHLILMTDGVID 145 Query: 298 ----ENLSTKEDQQSLY-----YCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFY 346 D+ S RG +V+AI + A + Sbjct: 146 LPPSRGNKPGIDEASRQRLVESMAPRFADRGVVVHAIAFSDEADLALVEQLAQRTGGLAS 205 Query: 347 LVENPHSMYDAFSHIGKDIV 366 + ++P S+ AF I + I Sbjct: 206 VAKSPESLLGAFLDIIERIF 225 >gi|295698033|ref|YP_003602690.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295060145|gb|ADF64882.1| putative tellurium resistance protein [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 346 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 56/206 (27%), Gaps = 16/206 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM + + ++ +++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESM------IGENLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G T+ L+ QI H D + Sbjct: 57 RTIVPL----HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + A + A+G+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEVKRWKDHYA--SKVNLIAVGLGPSADLNTLRQLTENVMLFTESQ 168 Query: 351 PHSMYDAFSHIGKDIVT--KRIWYDK 374 I + + + DK Sbjct: 169 EGDFTRFIKWITASVTAHSRSVGDDK 194 >gi|260581676|ref|ZP_05849473.1| conserved hypothetical protein [Haemophilus influenzae NT127] gi|260095269|gb|EEW79160.1| conserved hypothetical protein [Haemophilus influenzae NT127] Length = 345 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 64/200 (32%), Gaps = 15/200 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D+S SM + + I+ ++ +++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAG------ENLRQMQEGISHLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L G T+ L + + + D Sbjct: 57 GTLAPL---TELMSFYPPRLP-IGSGTSIGAALNHLMDSLEKDIVRSTPEKKGDWKPLIY 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I M+DG S + + A + IGI + + L + E+ Sbjct: 113 I--MSDGSPTDDPAQAISRW--KHHFQNKAKLINIGIGKFANLDTLSEISDLTYRLDDED 168 Query: 351 PHSMYDAF-SHIGKDIVTKR 369 +Y A + I+++ Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/225 (12%), Positives = 68/225 (30%), Gaps = 17/225 (7%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + G ++ A+ P++ +G+ E + + + L D S A + Sbjct: 16 RALKRDVSGTAAVIAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVR---- 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV-------- 120 + K L N+ S + N ++SL++++ Sbjct: 72 KRAGDPKSQIDAAALNIALNSGMSSSLGNMLANSPPTSGIKAGDTSSLEVILTEVRPRLF 131 Query: 121 ----VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 + A++ + C S + + ++ V + A Sbjct: 132 SSVFSSEPVSIRARAVASIVAGSQACVLALSPAASGAVTLSGSTVVNLKGCDIASNSTAA 191 Query: 177 -VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 +S S + + + A+ +I L + + VV+ Sbjct: 192 DAFLMSGSSAALSAGCASSVGGAVTTIQLKLTQCATVKTSAPVVR 236 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 62/171 (36%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM K ++ +L++V+ I +N ++ S + + Sbjct: 311 VIFVIDVSGSMYG------QKTRQTKRAFTTILDDVRPIDRINIILFSSYAHVWREDQMV 364 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 ++ +R + LS G TN L A + +H + I+ Sbjct: 365 EATSDNIAAAKRHVNGLS-VGGGTNIYDSLMKAVEILL------EHDTGDAMPL---IIM 414 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +TDG+ + + + + +++IG FL + N Sbjct: 415 LTDGQVGNAAAIVRDVTSVIGGR---LSLFSIGFGNGVDFPFLEKLSLSNQ 462 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 66/196 (33%), Gaps = 27/196 (13%) Query: 172 LDMMIVLDVSRSMESFF------------DSSITKIDMA--IKSINAMLEEVKLIPDVNN 217 LD++ V+D SR M S TK +A I S+ + L Sbjct: 38 LDVIAVVDNSRGMTVNGLNYVSIFSLTMPSSLFTKFQIASNIASVFGFGTRIGLNASEPR 97 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GLVT+++ + L S + + ++T + + Sbjct: 98 TTRLGLVTYNSVATQMADLNQYQSLHDAFNRIFDDLSNTVDTTESYLSTGLTLAEKMFND 157 Query: 278 QHCNTEDANYKKIIV-----FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 Q N+ A+Y+K+++ + T+GE+ + K G + + Sbjct: 158 QSVNSTRAHYQKVVIVYASKYQTNGESNPES-------IADRLKLSGVKIITVAYGNAYG 210 Query: 333 -HEFLRACASPNSFYL 347 + L ASP + Sbjct: 211 LMKSLSIIASPGFAFS 226 >gi|326912615|ref|XP_003202644.1| PREDICTED: von Willebrand factor-like [Meleagris gallopavo] Length = 2810 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 59/178 (33%), Gaps = 23/178 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+M +LD S S+ +++ + + A +E N V ++ ++ + Sbjct: 1668 PMDVMFLLDGSPSIG------VSEFEEMKNFVQAFIES---ADISNTSVHVSVLQYAREN 1718 Query: 231 EEFFLLE--WGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +L + + + + T + + Sbjct: 1719 NLEISLNKPQETENLIKMVHSIKQREQGPTRLGKAIDFVVQNALSES------YGGRPGA 1772 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 K+I+ + G++ T E + A+ ++ IGI + LR P++ Sbjct: 1773 SKVIIVIVSGKSEDTMETAALI-----ARMNKVSLFPIGIGNGYDEQQLRTLTGPSAV 1825 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 42/228 (18%), Positives = 84/228 (36%), Gaps = 26/228 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS-INAML 206 N + + + + + + +L +++LDVS S++S + K+ A K+ + + Sbjct: 211 EENDQRVGVESYLTSEQIPKESVQLRTVLLLDVSTSIDS---GEMEKLKTAAKASLISYE 267 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPG 262 + K + Q + +F ++I ++ L+ I + G STN Sbjct: 268 NDTKTSRLLPGQQQVAIYSFDSEITLLTDYTSDINLLEAAIDTIPNSVLERGNSTNLLGA 327 Query: 263 LKYA---YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKR 318 ++ A +N D+ + + V +TDGE N ++ Sbjct: 328 MEIAAERWNDQIDLIAVERG----------YAVLLTDGEHNFDSRSPADIEADLTNFFGT 377 Query: 319 GAIVYAIGIRVIRSHEFLRAC-ASPNSFYLV---ENPHSMYDAFSHIG 362 VYAI + + E L A AS V E+ + F+ + Sbjct: 378 RKKVYAIAVGNNVNLENLEAITASSEQVLTVNSFESAEELEAVFTEVA 425 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 56/142 (39%), Gaps = 16/142 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++V+D+S SME + + + + L ++ V GLV +S+ Sbjct: 303 PVDIVLVVDMSGSMEGAREG------AIKQGVKSFLSSIENTA-YAQYVNVGLVGYSSPG 355 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDANY-- 287 ++ ++ L+ G + L+ + F G+RQ + Sbjct: 356 YIS-----NSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASG 410 Query: 288 -KKIIVFMTDGENLSTKEDQQS 308 +K+I+ MTDG + + + Sbjct: 411 NEKMIILMTDGVPTFSNKVSSA 432 >gi|123482632|ref|XP_001323846.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121906718|gb|EAY11623.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 720 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 41/282 (14%), Positives = 99/282 (35%), Gaps = 41/282 (14%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 F + + DI T ++I+ ++ I + + T I S Sbjct: 161 EFSTTIMTTKELRDIHVTANET-MNIIDSHNATF--VTKIPPKEDAIIIETLIKDEDKSI 217 Query: 153 HIVMPITSSVKVNSQTDARLD----MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + S+ + Q + +++ ++D S SM S ++I+ A +N ++ Sbjct: 218 AVSSDGYISISIYPQFEGKVEQKSEFYFIIDCSGSM------SGSRIENAKFCLNILIHS 271 Query: 209 VKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLK 264 + + + ++ F ++ E + ++ +++ I ++ T+ L+ Sbjct: 272 LPIG------CRFSIIQFGNSYKEVVSICDYSNKNVKYAMSAIARINADMGGTDILSPLE 325 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 Y + + +R+ I +TDGE D E + ++A Sbjct: 326 YVFKKKLGKGFIRK------------IFLLTDGE--VHNSDMICSRVQKERENN--RIFA 369 Query: 325 IGIRVIRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKD 364 IG+ ++ S ++ L+ + +M + I K Sbjct: 370 IGLGSGADPGLIKNISAKSGGNYVLIADDDNMNNMIVEIMKS 411 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 29/51 (56%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI 54 + +NIR + KG ++ A+FL ++ ++ + ++V ++ K ++ ++ Sbjct: 1 MKMNIRKWIKEEKGSYIVIFALFLTVLLGMISLAVDVGMMYLKKNRMYEIV 51 >gi|229491170|ref|ZP_04384998.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229321908|gb|EEN87701.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 614 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 66/211 (31%), Gaps = 40/211 (18%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ ++ ++ V+D S SM + + A ++ A + + Sbjct: 39 STPTAQAEETTS-SVLFVVDTSGSMAG------SPLAQAKDALRAGIGALSSGQAA---- 87 Query: 220 QSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 GL +F + + L L+ G +T + L+ A + Sbjct: 88 --GLRSFAGDCGNGGQLLVPVATDNRDQLNNATNQLTA-GGTTPTPDALRAAAGDLPSTG 144 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRS 332 + I+ ++DG++ + + G V+A+G Sbjct: 145 -------------DRTIILISDGQSTCGDPCAVATELKTQL---GIDFRVHAVGFNAPDV 188 Query: 333 HEFLRAC---ASPNSFYLVENPHSMYDAFSH 360 E +C A+ ++ N + DA S Sbjct: 189 AESELSCIANATGGRYFTATNTTELSDAISA 219 >gi|255576003|ref|XP_002528897.1| protein binding protein, putative [Ricinus communis] gi|223531651|gb|EEF33477.1| protein binding protein, putative [Ricinus communis] Length = 705 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 25/164 (15%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 T H+V+ + + +D+++VLDVS+ M K+ + + + ++ Sbjct: 284 GRTYQTHVVVLRIRAPPYTAARRPPIDLVMVLDVSQRM------CGVKLQVMKRIMRVVM 337 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPG 262 + + +V FS + L G +R I L G ++ Sbjct: 338 SSLNSND------RLSIVAFSATSKRLSPLKRMTADGRRSARRIIDALGSTGQGMSANDA 391 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 LK A I R+ N + I+ +++G++ + + Sbjct: 392 LKKAAKVI----EDRRVKNPVAS-----IIIISNGQDDRSHINS 426 >gi|218779355|ref|YP_002430673.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760739|gb|ACL03205.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 504 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 73/199 (36%), Gaps = 22/199 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG--LVTFSNKIEE 232 M+VLD S+SM+ D+ ++ A + ++ + + +V G + E Sbjct: 31 MLVLDCSQSMQRTLDTRPG-MENARPAALRFVKSL-PQNSLAGIVAYGQNAAKGCDNAEV 88 Query: 233 FFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + L IK + G + + L+ A+ Q G+ Q C +I Sbjct: 89 LVPLAPYDRRALISAIKKVQPQGKAPLAA-ALRKAWEQ---GAGLSQGC---------VI 135 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--FLRAC-ASPNSFYLV 348 +TDG D S+ +A+ G IV IG+ R +R AS ++ Sbjct: 136 TLITDGW-DDCWGDPVSMVEDLKARGAGIIVNIIGVAPNREDAAKLMRLARASGGAYRAA 194 Query: 349 ENPHSMYDAFSHIGKDIVT 367 + + + + + Sbjct: 195 DTRADLILKAAETAESVSA 213 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/59 (27%), Positives = 28/59 (47%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 L+L G + IL A+ PI L++ M I++ + + + SM D + + AA Sbjct: 7 LSLAALRRAREQGGNVAILFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAA 65 >gi|325287696|ref|YP_004263486.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323150|gb|ADY30615.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 366 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 71/221 (32%), Gaps = 10/221 (4%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 I +S + ++VLD+S+S+ + +D + + + V Sbjct: 104 NKISTTESSKKISPNGQIFNNSTLLVLDLSKSVLD------SSLDELKAASISFINNVMP 157 Query: 212 IPDVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + + + + +K+ E L L I+ ++ S ST A Sbjct: 158 EEEDKSDSFTMSIHWFDGEDKLHELNPLTSSRDELVSAIESITSTISSDASTDLYGAAIR 217 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +D +V TDG + +++ + + ++IG+ Sbjct: 218 STDLASKYLEDNTAKDVIGAASVVLFTDGTDQASRYTKAAALKKVTDADPNISFFSIGLG 277 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E L+ S + N + F+ I I + Sbjct: 278 SEIDAEILKELGKTFSVF-ATNKEELETTFNEISTKISERA 317 >gi|312878214|ref|ZP_07738140.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] gi|311795008|gb|EFR11411.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor lactoaceticus 6A] Length = 1831 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIV-----MPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + Y K T + + +++ + IT + D++ V+D + SM Sbjct: 798 KATYNESSKTITVETNHFSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTGSMSD 857 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +ID ++IN ++++K V GLVT+ + + G Sbjct: 858 -------EIDAVKQNINNFVDKLKTKDIS---VNLGLVTYKDITCDGPNSTVGHGFFSSA 907 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + G G + D N K IV +TD E++ Sbjct: 908 DDFKNALGSIKVDGGG--DTPETLIDALET-ARLLGFRENSTKFIVVLTDA--NYKLENR 962 Query: 307 QSLYYCNE----AKKRGAIVYAIG 326 + +E K IV + Sbjct: 963 FGIKSADEIIERLKSDNIIVSVVS 986 >gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata lugens] Length = 315 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 71/223 (31%), Gaps = 39/223 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD-------VNNVVQSGLVT 225 ++ VLD+S SM KI ++ +L ++ N V S T Sbjct: 60 QVVFVLDISGSMFG------EKIKQLKDAMLKILSDLNPQDHFSIVLFSDNAYVWSKAKT 113 Query: 226 -----------FSNKIEEFFLLEWGVSHLQRKI-------KYLSKFGVSTNSTPGLKYAY 267 ++ E +L+ + + + K + T ST + Sbjct: 114 AVMKKILDEGFYNLDNETLAILDDHRNEILQATPDNVKTAKEFVELIKPTTSTNIIDGLR 173 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAI 325 + ++ ++ +T + I+ F+TDGE D + + +Y++ Sbjct: 174 KGLKLVKEGKETLDTTKEPSQPIMFFLTDGEPNVDLTDPVEIVNETSSLNEQLKTPIYSL 233 Query: 326 GIRVIRSHEFLRACASPN-----SFYLVENPH-SMYDAFSHIG 362 FL+ + N + Y + + + + I Sbjct: 234 AFGQGADITFLKKLSKANHGFARNIYEGSDATLQLNNFYKEIS 276 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 64/222 (28%), Gaps = 56/222 (25%) Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL---QR 245 D ++D ++N + V N + GL F + ++ L +S Sbjct: 287 DCIQLRLDAVGYAVNQLFTTANTTKKVANQFRIGLYPFIRYLYSYYPLTTNISGSTSDSS 346 Query: 246 KIKYLSK-----FGVSTNSTPG-----LKYA---YNQIFDMQGMRQHCNTEDANYKKIIV 292 I Y + +TN++ G + A N + G T + Sbjct: 347 TINYAAANLATLLDTNTNASLGSGGTHIDTALSSVNSLITSVGDGS-ATTNTLP---YVF 402 Query: 293 FMTDG---------ENLSTKEDQQSLYY-----CNEAKKRGAIVYAIGI----------- 327 +TDG N S + C K RG I+ + I Sbjct: 403 LVTDGAQDPQVKGVPNGSWSGSNHATTINPTTSCTPLKNRGIIISVLYIPYQTINPVNAS 462 Query: 328 -----------RVIRSHEFLRACASPNSFYLVENPHSMYDAF 358 + L+ACASP FY P + A Sbjct: 463 FAGDEDDYANNNIPNIPPSLQACASPGFFYTANTPADITSAL 504 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 55/185 (29%), Gaps = 35/185 (18%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS----------MIDRSLV 59 F + K + A+ L I +G + S +K L S ++ + Sbjct: 16 RFVGDSKANIATTFALALLPILTAIGCGTDYSMAMRLKVKLQSAADAASIASISVNSAGY 75 Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 AA + ++G+ N + N + TS Sbjct: 76 AAAMAMTSDGSVTAGVNEAD----------------NIFKGNASTFGGYTLTSETS---T 116 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 V S +P F T I + + + + ++ + LD + LD Sbjct: 117 VTKTRSTLSSQVQFTAAVPTTFLTVIGYQSITVSGSSSSSVTLPLY------LDFYLTLD 170 Query: 180 VSRSM 184 VS SM Sbjct: 171 VSGSM 175 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 55/133 (41%), Gaps = 24/133 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++ V+D S SM TK+++ K++ ++ +++ + LVT+ Sbjct: 40 RPTLDIVAVIDKSGSMAG------TKLELVKKTLETLVAQLRACD------RLALVTYDT 87 Query: 229 KIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ L G S + + + + G STN + GL N + + + + Sbjct: 88 EVTLDLALSPMDDKGRSKATQVVNGI-RDGSSTNLSGGLLEGLNILRNRPTDSRREVSS- 145 Query: 285 ANYKKIIVFMTDG 297 ++ +TDG Sbjct: 146 ------VLLLTDG 152 >gi|319639064|ref|ZP_07993821.1| PilC protein [Neisseria mucosa C102] gi|317399642|gb|EFV80306.1| PilC protein [Neisseria mucosa C102] Length = 1123 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 63/191 (32%), Gaps = 6/191 (3%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + P I A + L + I + + +N Q + ++M + Sbjct: 7 MQNAPSGRHRLIHASVASALALISFSVQANTQQFAKIPFYLQNETSINGQPKVKHNIMFL 66 Query: 178 LDVSRSMESFFDSSITKI--DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +D S SM+ T I D I L+ V Q GL T N Sbjct: 67 IDDSGSMQWNVQGKETSIWADKRITITKEALKSVLKEYGEKQRFQWGLQTLHNNGRTDTP 126 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 E G + + ++ ++TP + Y + + N + K ++ M+ Sbjct: 127 DEEGFTDDWKDVQRRVDGIDPGHATPITRRYYEVVKNF----VMPNIKYRCQKSYVIVMS 182 Query: 296 DGENLSTKEDQ 306 DG+ + +Q Sbjct: 183 DGDANMSCSNQ 193 Score = 36.3 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 24/76 (31%), Gaps = 12/76 (15%) Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRACAS-PNSFYLVENP 351 DG+ K S + +G E+L AS P+ ++ Sbjct: 349 DGDPSDPKGVDYSKQL--------VQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKK 400 Query: 352 HSMYDAFSHIGKDIVT 367 + +AF I +I Sbjct: 401 EDLLEAFKTIVDNIEN 416 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 29/59 (49%), Gaps = 2/59 (3%) Query: 8 IRNFFYNYKGGMTILTAIF-LPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +R F N +G + I+ A LP+ V G +E+S +K+ L DR+ + A + Sbjct: 18 LRRFANNERGNVMIIFAAAALPMAIGVAG-ALEISQYSQLKSQLQEASDRAALSAMAAL 75 >gi|315186713|gb|EFU20471.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 331 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 42/236 (17%), Positives = 79/236 (33%), Gaps = 54/236 (22%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ + D++++ D+SRSM D +++D+A + ++ + + Sbjct: 77 GMEPETVKRRNADIVVLFDISRSM-LVRDVPPSRLDVAKEIALMLVSRISGA-------R 128 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G+V F K E L + L+ I LS T+ GL A Sbjct: 129 WGVVAFKGKGELLLPLTPDLLGLEDAIGLLSPVLLRSPGTDVASGLSRALE--------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---- 333 E +N +++++ ++DGE + A+ G V+ +GI Sbjct: 180 --AFPEQSNRQRLVILLSDGE----ALTGEIGPVLELARNLGVAVHTVGIGTESGGPVPL 233 Query: 334 --------------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 L+ A + F+ VEN F H+ I Sbjct: 234 EGEEVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVENAEG--QTFQHVVSTIEE 287 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/225 (15%), Positives = 70/225 (31%), Gaps = 35/225 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK---SINAMLEEVKLIPDVNN 217 S+ N L +++++D S SM T ++ + NA+ V + Sbjct: 82 SLAPNPHESQGLSILLLMDNSGSMYDTLSGDPTGDPALMRTTYARNALRTFVGSSFHAGD 141 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V TF+ + + + + + G S L +A + G R Sbjct: 142 SV--SFATFNTNVVLHADEAGDPVVMDMLLSGIRRPGTDE-SYTELYHALADMALPVGER 198 Query: 278 QHCNTEDANYKKIIVFMTDGE----------------NLSTKEDQQSLYYCNEAKKRGAI 321 + ++ ++DGE N D+ Y + G Sbjct: 199 SGR--------RAVIVLSDGEDYSYATHSGNPHPIYGNQQLSPDEVVEEYI----RNGVT 246 Query: 322 VYAIGIRVIRSHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDI 365 +YAI + + A + + Y ++ + + I + I Sbjct: 247 LYAIHFGLEKDQYLGEMALKTGGAVYDAKDQEELTGIYHDIRQKI 291 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 56/142 (39%), Gaps = 16/142 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++V+D+S SME + + + + L ++ V GLV +S+ Sbjct: 297 PVDIVLVVDMSGSMEGAREG------AIKQGVKSFLSSIENTA-YAQYVNVGLVGYSSPG 349 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY-NQIFDMQGMRQHCNTEDANY-- 287 ++ ++ L+ G + L+ + F G+RQ + Sbjct: 350 YIS-----NSGYITVPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASG 404 Query: 288 -KKIIVFMTDGENLSTKEDQQS 308 +K+I+ MTDG + + + Sbjct: 405 NEKMIILMTDGVPTFSNKVSSA 426 >gi|66775067|gb|AAY56127.1| complement component Bf/C2-B [Ginglymostoma cirratum] Length = 757 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 61/171 (35%), Gaps = 28/171 (16%) Query: 147 WYTNSRHIVMPITSSVKVNS-----QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 + + I +T + + + +L + I+LDVS S++ + K+ Sbjct: 231 RFLSESKIDSQVTEEDESHGRKLILGQNHKLHIYILLDVSGSIQK---------EDFKKA 281 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKI------------EEFFLLEWGVSHLQRKIKY 249 INA+ V +I V GLV F ++ + +L+ + I Sbjct: 282 INALTTFVTMIKQFEVGVNYGLVMFGSRSCVEVNIAHDDVSDSDSVLQILPTLKYEDITR 341 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 S TN T LK + + + + E + I+ TDG N Sbjct: 342 YS--DAGTNMTGALKTIFEMMVLKKASMKDKQAEWREVRHAIMIFTDGRNN 390 >gi|194758427|ref|XP_001961463.1| GF14980 [Drosophila ananassae] gi|190615160|gb|EDV30684.1| GF14980 [Drosophila ananassae] Length = 1138 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 151 ASSPKDIMILLDASSSMSE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKTP 204 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 205 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGVGSQC 261 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I L A Sbjct: 262 NQA-----IMLITESTSESHKDVIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHDMAC 312 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 313 SNKGFFVQ 320 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 15/186 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 205 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 261 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 262 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 312 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 313 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 370 Query: 356 DAFSHI 361 F + Sbjct: 371 AVFRQV 376 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 71/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 307 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 356 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A I +++ Sbjct: 357 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSASTVI-----RKKYPTDGSE----- 406 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 407 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 459 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 460 SDQVQNNGLIDAFGALS 476 >gi|308063814|gb|ADO05701.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Sat464] Length = 217 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 70/202 (34%), Gaps = 10/202 (4%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + + +++D S SM + T+I+ I M+E +K + ++T Sbjct: 9 TMEERFIPVFLLVDTSGSMNESL-GNCTRIEALNLCIQKMIETLKQEAKKELFSKMAIIT 67 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F E +L ++ T + A N I D Sbjct: 68 FG---ENGAVLHTPFDDVKNINFKPLSASGGTPLDQAFRLAKNLIEDKDTF------PTK 118 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 YK + ++DGE K + + ++ + ++ ++I I ++ + + Sbjct: 119 FYKPYSILVSDGELNDGKWQKALSDFHHDGRSTKSVCWSIFIGDREANPQVNKDFGKDGV 178 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + ++ + F + + I Sbjct: 179 FYADDVEKLVGLFEIMTQTISK 200 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/183 (10%), Positives = 56/183 (30%), Gaps = 16/183 (8%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN-EGNG 71 +G + L + L +F + + +++ + +T + D + + + N +G Sbjct: 10 RGRRGTILPLLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGVA 69 Query: 72 NNRKKLKGGDILCRI------KNTWNMSFRNELRDNGFVNDIDD----IVRSTSLDIVVV 121 N + + +++ ++ D + T + Sbjct: 70 YNNLPAAVTAAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGGA 129 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 G + R + + T+ + ++ + D+ VLD++ Sbjct: 130 MSAPGGNSWTAMRVTLGVSQPTYFMRVFGVNSMPSGAVATAVYRPR-----DIAFVLDMT 184 Query: 182 RSM 184 SM Sbjct: 185 GSM 187 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 63/178 (35%), Gaps = 26/178 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFF----DSSITKIDMAIKSINAMLEEVKLIPDVN 216 SV + ++++V+D S SM + S ++++A ++N M++ Sbjct: 3191 SVVITEAGLPPFNLVMVIDTSGSMLWQIGTSTNGSPNRLELAKDALNHMIDS-------- 3242 Query: 217 NVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 V G+ I+ + I L G TN L A NQ+ Sbjct: 3243 -YVALGVPLVFTVIDFASGAVLIPQTSDPDVAKASISGLPTDGGGTNYNAPLVLAQNQLT 3301 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 N A Y+ + F++DG + E + + VYA+G+ V Sbjct: 3302 AD-----LANPALAGYETKVYFLSDG---APNEGNVPAGWTSFVNSNNVEVYAVGLNV 3351 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 15/186 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 205 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 261 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 262 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 312 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 313 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 370 Query: 356 DAFSHI 361 F + Sbjct: 371 AVFRQV 376 >gi|313680435|ref|YP_004058174.1| von willebrand factor type a [Oceanithermus profundus DSM 14977] gi|313153150|gb|ADR37001.1| von Willebrand factor type A [Oceanithermus profundus DSM 14977] Length = 747 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 77/219 (35%), Gaps = 31/219 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ Q A L ++VLDVS SM + +K+ A++ +++ Sbjct: 322 LEPEPQEGAAL--VLVLDVSGSMG---LGAPSKLARAVEGARKLVDAAGPEDT------L 370 Query: 222 GLVTFSNK---IEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 G+VTF+++ + + + + ++ L G + Q+ Sbjct: 371 GIVTFASRSRWLLAPKAMTYRAKREAETRLDALEARGGT------------QLATAYAAA 418 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + I+ ++DG+ Q++L +A RG + + FL Sbjct: 419 AEALEPLDARTRWILVLSDGQLEDDP--QRTLARARQAAARGVKTLTLALGADADRPFLA 476 Query: 338 ACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 A F + +P ++ + +G++ + + Sbjct: 477 RLAREGGGRFLDLADPAALPQVLALLGEEAFKPPVVEGR 515 >gi|156404157|ref|XP_001640274.1| predicted protein [Nematostella vectensis] gi|156227407|gb|EDO48211.1| predicted protein [Nematostella vectensis] Length = 476 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 68/164 (41%), Gaps = 20/164 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV------NN 217 V + + D+++V+D S SM +++ +A ++ +L+ + V + Sbjct: 203 VEAASPQPKDVILVVDYSGSMGG------SRLPIAKEAAKTVLDTLNPRDRVAFLAFESG 256 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA--YNQIFDMQG 275 V + + + K E+ F + I L KF ++ G YA +N FD+ Sbjct: 257 VRRVKVTSGDAKDEKCFESSLAKAS-PVNIDILKKFLDGEYASGGTMYAVAFNAAFDIL- 314 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + + +I+FMTDG + +D ++ + + +G Sbjct: 315 DKYYKEKNTTRRP-VILFMTDG---APNDDPGTILNTVKMRNQG 354 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/185 (16%), Positives = 59/185 (31%), Gaps = 15/185 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 226 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 282 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 283 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 333 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ Y +++ R +++I + N+ + Sbjct: 334 DGRSNSDHQDEFESSY--KSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 391 Query: 356 DAFSH 360 F Sbjct: 392 AVFRQ 396 >gi|553810|gb|AAA61295.1| von Willebrand factor [Homo sapiens] Length = 958 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 599 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 658 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 659 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 715 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 716 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 767 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 768 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 816 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 817 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 853 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 47.1 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 77/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 311 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 358 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F L + + +R I+ + G TN L A + + + + + Sbjct: 359 RNDLFQLQKHRLQIAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 415 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 416 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 471 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 472 IAQRIYGNQDTSSQLKKF 489 >gi|323484869|ref|ZP_08090225.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] gi|323401865|gb|EGA94207.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] Length = 2032 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 41/364 (11%), Positives = 110/364 (30%), Gaps = 77/364 (21%) Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 + +++ + + E+ +G T + V +++GY + + Y Sbjct: 983 FYVDEVQSQGAKAVKWEVSSSGTGTYPQVSGSKTEIYEVT-KESQGYLLQCTNIYGELNP 1041 Query: 141 FCTFIPWYTNS----RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKI 195 W + + + +T + D++ V+D S SM + DS + Sbjct: 1042 QVAKRAWKDYTADDEYDVTLEVTGDSIQTTIGGGTADIVFVIDKSSSMNDWDDSLDDYRW 1101 Query: 196 DMAIKSINAMLEEVKLIPDVN-------------------NVVQSGLVTFSNKIEEFF-- 234 + +++ + ++K+ + + ++ V + ++ Sbjct: 1102 NKLESTVDRFINKLKITSPNSKISFIEYQSSDLGTYQETYDDIRKVAVADTESADKNLEG 1161 Query: 235 -LLEWGVSHLQ-RKIKYL--------SKFGVSTNSTPGL---KYAYNQIFDMQGMRQHCN 281 L++ + + I + + T+S G + A +++ + N Sbjct: 1162 NKLKYFRTLQKWTAISEVGSKPYGSAPGYNQGTHSAGGYLGAERALDRLKTYNPDEYNSN 1221 Query: 282 TEDANYKKIIVFMTDG------ENLSTKEDQQSLYYCNEAKKR------------GAIVY 323 + K I+++ DG N T+ S N + A +Y Sbjct: 1222 S------KYIIYLADGTAGYYVNNNGTQAGSGSGGNANARRAAITQSGELKKKHPDATIY 1275 Query: 324 AIGI-------------RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + + +FY N + + F ++ + + + Sbjct: 1276 TVAFGSDSSANMNWMKPGAYNGNSDNPYNPNVTAFYSAANTKELEETFDNLAAQVGSSAV 1335 Query: 371 WYDK 374 K Sbjct: 1336 TNPK 1339 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 66/180 (36%), Gaps = 31/180 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +V ++ D+++V+D+S SME ++++ A K + + + Sbjct: 87 EVQTREAFGRDVLMVVDLSGSMEEKDFATESGEQLSRLTAAKKVLRDFVTQ-------RQ 139 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFD 272 + GL+ F + F+ + + + L + G STN + Sbjct: 140 GDRFGLILFG---DAAFIQTPFTADQEVWLNLLDEAETGMAGQSTNLGDAI--------- 187 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 G++ + + +I++ +TDG T + A +G +Y I + + Sbjct: 188 GLGIKVFEQSPATSQDQIMLVLTDG--NDTGSFVSPVDAAKIAAAKGIRIYVIAMGDPEN 245 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 30/239 (12%), Positives = 75/239 (31%), Gaps = 22/239 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRK--------------KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 + K G + N + + + +T Sbjct: 70 RKTTDYKLGGSNPSDESFNISSEVGKRDHAIVTAFVKTFLPQTDEKNMHLTPLCKTINNT 129 Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 S ++S +K + + + S + + +D+ Sbjct: 130 SGKGHTSSSEVTCTVSGTVEHKSWFPLKVGNL-EVIPKQVNVASQSRAIKKNTFNIPIDL 188 Query: 175 MIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 M+V D+S SM D +KI + + + + ++ L D N + + F Sbjct: 189 MVVADLSGSMNFDLDNNEIKKTGKPSKISILKEVLVELADKTLLSEDANQNNRIYVTPF 247 >gi|118579649|ref|YP_900899.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118502359|gb|ABK98841.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 337 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 58/169 (34%), Gaps = 13/169 (7%) Query: 168 TDARLDMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+ I +DVS+SM + K+ +N I + G+ Sbjct: 77 NRSGIDLAIGIDVSKSMLAEDETLPPEGKKLFSIPNRLNRARYCALTILSALKGERVGVF 136 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F++K L + I L ST STPG I + + + Sbjct: 137 LFASKGVPIVPLT-NDYGYCQYI--LKHANDSTISTPGSDLG-QAITTGIYLFEESSRTS 192 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVIRS 332 K IV ++DGE ED + A +G +Y +G + Sbjct: 193 V---KSIVLISDGE--DINEDSSVMHEAAQRAAAKGIAIYTVGTGRGQG 236 >gi|330901495|gb|EGH32914.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 218 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 54/144 (37%), Gaps = 24/144 (16%) Query: 173 DMMIVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D+++ +DVS SM+ + ++++ + + + LE K + GL+ F + Sbjct: 91 DLLVAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGS 143 Query: 229 KIEEFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + ++ + + G +T + A ++ Sbjct: 144 QAFVQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPA 192 Query: 287 YKKIIVFMTDGENLSTKEDQQSLY 310 + +V +TDG N + + D + Sbjct: 193 TSRALVLVTDGANNAGQIDPITAA 216 >gi|42407699|dbj|BAD08847.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408121|dbj|BAD09261.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] Length = 703 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 81/224 (36%), Gaps = 43/224 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S + A +D++ VLDVS SME + K+ + +++ + Sbjct: 251 LHVKAPSIAAEAAPARASVDLVTVLDVSGSMEGY------KLALLKRAMGLLG------- 297 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + + +V+FS L G + + ++ L G TN GL A + Sbjct: 298 PGD---RLAVVSFSYSARRVIRLTRMSEGGKASAKSAVESLHADGC-TNILEGLVEA-AK 352 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--------KEDQQSLYYCNEAKKRG-- 319 +FD + R + ++ ++DG++ S+ K+ G Sbjct: 353 VFDGRRYRNAVAS--------VILLSDGQDNYNVNGGWGASNSKNYSVLVPPSFKRSGDR 404 Query: 320 -AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 V+ G + A + +F +EN + DAF+ Sbjct: 405 RLPVHTFGFGTDHDASAMHTIAEETGGTFSFIENQAVVQDAFAQ 448 >gi|313228360|emb|CBY23511.1| unnamed protein product [Oikopleura dioica] Length = 691 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 55/169 (32%), Gaps = 24/169 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ V+D S S+ + D + M++ + + D N + ++ FS+ Sbjct: 204 DIIFVVDSSGSVGADG------FDKIKVFLKNMIDYIWV--DANTTSRIAIIVFSDDAST 255 Query: 233 --FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L++ I + T + L A + N K Sbjct: 256 IWSFSDSADPVALEQTIDNTAYKNSGTETEKALDAALAIM---------DNEGRPYAVKT 306 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIRSHEFLR 337 I TDG + QS+ K G I + + G + L+ Sbjct: 307 IALFTDGRANNKAAYLQSVQ---NIKDSGIILNLQSFGFQSGAGSFALQ 352 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 55/142 (38%), Gaps = 28/142 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----DRIGEVKIGVDRFVDTL-SDSGITDKINMGYVGY 339 Query: 227 SNKIEEFFLLEWGV------SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 S+ + G ++ ++K ++ T + GL+ A + + G Sbjct: 340 SSDGYNY---SNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNG--- 393 Query: 279 HCNTEDANYKKIIVFMTDGENL 300 +KK+IV +TDG Sbjct: 394 --------HKKVIVLLTDGVPT 407 >gi|332796610|ref|YP_004458110.1| von Willebrand factor type A [Acidianus hospitalis W1] gi|332694345|gb|AEE93812.1| von Willebrand factor type A [Acidianus hospitalis W1] Length = 381 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 75/202 (37%), Gaps = 32/202 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +I+LD S SME KI+ A +E K IP N V + F++ +E Sbjct: 41 IILLDTSGSMEGL------KIEKAKSGA---IELFKRIPQGNKVT---FIKFASTVEVVR 88 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L +I+ ++ G ++ T L A+ I+ + Sbjct: 89 EFSD-PLDLTNEIQNIAANGQTSLFT-ALLTAFKIAVKYS------------MPAYILLL 134 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPH 352 TDG + + + Y + +G + + GI + E L+ + + ++FY ++ Sbjct: 135 TDG---NPTDVTDTRTYEKMSIPQGVQIISFGIGDDYNEELLKILSDRTGSTFYHIQEAS 191 Query: 353 SMYDAFSHIGK-DIVTKRIWYD 373 + + K I K + D Sbjct: 192 EIPEKLPRAAKTKIAGKNVTVD 213 >gi|303242523|ref|ZP_07329002.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302589941|gb|EFL59710.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 1855 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 68/202 (33%), Gaps = 34/202 (16%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +ISA + Y C + + I + + D++ +D + SM Sbjct: 721 TISASTPY---FSICGIVDIC---KLIESLMCDDESGGILDKGKADIIFAIDSTGSMGD- 773 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---KIEEFFLLEWGV--SH 242 KI+ I ++N EE+ N ++ GL+ + + E W + Sbjct: 774 ------KIENVITNVNEFAEELSK----NVEIRFGLIDYKDIYEVGETTINCGWFTDVNE 823 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---EN 299 L++++ + +G + ++ R +A KK IV +TD + Sbjct: 824 LKKRVDEILVYGGGDVPESAVD-------ALEEARTMGFRPNA--KKFIVLLTDADYKDG 874 Query: 300 LSTKEDQQSLYYCNEAKKRGAI 321 + + K G I Sbjct: 875 THFTDVTTMAQEIDLLKNDGII 896 >gi|259490072|ref|NP_001159273.1| hypothetical protein LOC100304363 [Zea mays] gi|223943141|gb|ACN25654.1| unknown [Zea mays] Length = 459 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 35/202 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 A LD+++VLD+S SM TK++ ++ + E I + ++TF Sbjct: 39 GDTAPLDIVVVLDISGSMRG------TKLEHMKHAMTRFIIEKLGIRGD----RLAIITF 88 Query: 227 SNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +K + F L V ++ L G TN GL+ + + +G + + Sbjct: 89 ESKAHKVFDLSSMLPDQVKKAVAVVEGLKA-GGDTNIKAGLEAGLDVLKTRRGHSHNASC 147 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL------ 336 I M+DG K + V G + L Sbjct: 148 --------IFLMSDGHENVDKARTLLDRVGEHS------VVTFGFGEKSDEQLLYDIAYH 193 Query: 337 RACASPNSFYLVENPHSMYDAF 358 + + E+ + + AF Sbjct: 194 SHAGTYHHVREKEDENQLMKAF 215 >gi|217978821|ref|YP_002362968.1| hypothetical protein Msil_2684 [Methylocella silvestris BL2] gi|217504197|gb|ACK51606.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 429 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 27/48 (56%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDR 56 R F Y+ GG+ ++ + LP+I ++ + E+S + + K + S+ D Sbjct: 20 RRFRYDRGGGVALMIGLALPVIIGMIALGTEISFLLYKKFQMQSVADS 67 >gi|145503451|ref|XP_001437701.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404855|emb|CAK70304.1| unnamed protein product [Paramecium tetraurelia] Length = 588 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 22/170 (12%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 ++D S SM S ++I A +++ L+ + N++ G +S E Sbjct: 344 FIIDRSGSM------SGSRISKAKEALILFLKSL-PQDSEFNIISFGSNFYSLWNESKMY 396 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + ++ + T LK YN+ + + + Sbjct: 397 SQNSLEQAINHVQSMDANLGGTRIIVPLKEMVYNKYYGASNKTTLN----------VFLL 446 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 TDGE+ + + + + +Y +GI S +R A + Sbjct: 447 TDGEDFADPI----IDLVQKNNRAQTRIYTLGIGEGCSQYLIRRVAEVGN 492 >gi|296108821|ref|YP_003615770.1| Magnesium chelatase [Methanocaldococcus infernus ME] gi|295433635|gb|ADG12806.1| Magnesium chelatase [Methanocaldococcus infernus ME] Length = 283 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 69/178 (38%), Gaps = 26/178 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 ++++ + I + ++ ++ V+D S SM ++ ++++A +I ++L Sbjct: 81 KNKNLALYIEKEDLREKIREKKISSYILFVVDTSGSM-----GALRRMELAKGAIRSLL- 134 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKY 265 + + G++ F + + L+ S ++ K L G T + Sbjct: 135 ----VDAYQKRNRVGMIVF--RKDSADLILPFTSSVELAEKSLRDVPTGGRTPLSKAFLK 188 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ---QSLYYCNEAKKRGA 320 AY F+ + + I+VF++D + ++ + C + ++G Sbjct: 189 AYET-FEKELRKNPNIIP------IMVFISDFKPNVAIKNDFIKEIYEICEKIHEKGI 239 >gi|45767838|gb|AAH67716.1| Col14a1 protein [Danio rerio] Length = 747 Score = 46.7 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 11/86 (12%) Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 K++V +TDG E + G I++AIG E + + S Sbjct: 17 IPKVLVVLTDGR-----SQDDVNKISKEMQMEGYIIFAIGFADADYGELVNIASKPSERH 71 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 + V++ DAF I + ++T Sbjct: 72 VFFVDD----LDAFKKIEEQLITFVC 93 >gi|300788143|ref|YP_003768434.1| hypothetical protein AMED_6298 [Amycolatopsis mediterranei U32] gi|299797657|gb|ADJ48032.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 534 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 73/200 (36%), Gaps = 27/200 (13%) Query: 174 MMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +++DVS SM + ++ ++ + +++ + K Q ++ FS +++ Sbjct: 337 VQVLIDVSGSMNAQVPGTNLNRMQVTMEAAAKAMHLFKPA------TQLRMLAFSTRLDG 390 Query: 233 FF------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDA 285 + HL L K G + G Y+ + D + R+ Sbjct: 391 DKDYRELLPMASVAQHLAS--GALEKLGQVKATPDGGTGLYDSVLDTYRTARREWEPGRL 448 Query: 286 NYKKIIVFMTDGENLSTKEDQQS-----LYYCNEAKKRGAIVYAIGIRVIRSHEFLR--A 338 N ++ MTDG N + ++ L +A+ R + +GI L Sbjct: 449 NL---VIVMTDGRNEDPRGISRADLLTELAGLQDAR-RPIPLIGVGIGPDADKAELDQLT 504 Query: 339 CASPNSFYLVENPHSMYDAF 358 A+ +L +P + D F Sbjct: 505 AATGGQAFLAPDPAKITDVF 524 >gi|293374377|ref|ZP_06620704.1| LPXTG-motif cell wall anchor domain protein [Turicibacter sanguinis PC909] gi|292647040|gb|EFF65023.1| LPXTG-motif cell wall anchor domain protein [Turicibacter sanguinis PC909] Length = 1099 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/280 (12%), Positives = 82/280 (29%), Gaps = 46/280 (16%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + ++ +L I +N ++ N + + + + + Sbjct: 173 SLDSDITFTANIRLLNPYIAKGTENLYSSYNWNGYQASAAADSNTSDEYTRLTSSMSSEN 232 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSR 182 + + + + ++++ V S +++ VLDVS Sbjct: 233 DGAVWLDKSVTLDSNFNYIDNLAISNGDFNVILSALGQGYNVESTAKTPANVVFVLDVSG 292 Query: 183 SM-------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 SM E S T+ +K++N ++++ + G+VTFS ++F Sbjct: 293 SMKYKLGKDEWATKDSETRAYAMVKAVNLAIKQLVSQNPQ---TKIGIVTFSEGSQQFLP 349 Query: 236 LEWGVS--------------------------HLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 L S ++ K+ T + G+K + Sbjct: 350 LNNYTSTGDYLTYTGKNYGGWWEPSGNISISEAVEPKVSKTVNVTGGTYTQAGIKAGADM 409 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + + + ++DGE ++L Sbjct: 410 LISAANSDRVPA---------LFILSDGEPTYGNTSYETL 440 >gi|159900723|ref|YP_001546970.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893762|gb|ABX06842.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 421 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 73/206 (35%), Gaps = 29/206 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ VLD S SM K+ ++ L + V+ +V F + Sbjct: 41 QMPVNVSFVLDHSGSM------KGDKMRCVREATQRALGLMGPQDIVS------VVIFDH 88 Query: 229 KIEE--FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E V+ LQ ++ + G T P L+ A N+I R+ N + Sbjct: 89 RRETIISAQPVRNVAALQAEVGKIKDAG-GTKIAPALEAALNEI------RRSQNANTIS 141 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNS 344 I+ +TDG+ ++ L E K + A+G+ + + L A S Sbjct: 142 R---IILLTDGQTEGERD---CLRLAEEIGKASVPLTALGVGDDWNEDLLIEMANRSGGV 195 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 NP+ + F + + + Sbjct: 196 AEYFSNPNDIASFFQGAVQQAQSAVV 221 >gi|17229534|ref|NP_486082.1| hypothetical protein alr2042 [Nostoc sp. PCC 7120] gi|17131133|dbj|BAB73741.1| alr2042 [Nostoc sp. PCC 7120] Length = 608 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/131 (22%), Positives = 47/131 (35%), Gaps = 7/131 (5%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D ++V+D S SM + SIN + ++ V VT Sbjct: 36 NPKGGIDWIVVVDTSASMRG-VGGTRNIFTQVKNSINEFVNTARVGDTVTIYNFDSDVTL 94 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L++ I L GV T++ ++ A + + Q NT D Sbjct: 95 QAQEIPIASNPD-RGKLKQIINNLKADGVRTHTGKAVQQA---LSTSAKLNQRPNTADRT 150 Query: 287 YKKIIVFMTDG 297 IVF+TDG Sbjct: 151 VS--IVFLTDG 159 >gi|198415896|ref|XP_002125135.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ciona intestinalis] Length = 1580 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 67/195 (34%), Gaps = 29/195 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++VLDVS SM ++ M +S + +PD + V G+V F + Sbjct: 337 FVLVLDVSGSMSGN------RLLMMRQSAGDFIST--SLPDGDKV---GIVQFHSSANLM 385 Query: 234 FLLEWGVSHLQR-KIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + S L R I G ST G+ A N++ N Sbjct: 386 MEIRQISSQLDRVAIAAGIPGIAGGSTCIGCGIYAAMNEMERHDANETCGN--------- 436 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVE 349 I+ +TDG+ A ++ +V AI + + A+ ++ + Sbjct: 437 IIVLTDGKENQPPYVNDVSQL---AIQKNCVVNAILFTTTENSALVDLVTATGGQWFFAQ 493 Query: 350 --NPHSMYDAFSHIG 362 + + +F+ I Sbjct: 494 DRDLKRLMGSFAVIA 508 >gi|156370019|ref|XP_001628270.1| predicted protein [Nematostella vectensis] gi|156215242|gb|EDO36207.1| predicted protein [Nematostella vectensis] Length = 1552 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 8/136 (5%) Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ E +++R I L T LK + ++F N N KK+ Sbjct: 4 NKYSGAEMNNVNIKRDIDELRLERGLTFIDKALKISAEKLFT------EKNGMRLNRKKV 57 Query: 291 IVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLV 348 + +TDG K L ++ K +G VY++GI I E + + + Sbjct: 58 ALVLTDGIQTKDKGPFTPLQKASQPMKDKGVEVYSLGIGSDIDVSELITFASGEKYVFNA 117 Query: 349 ENPHSMYDAFSHIGKD 364 ++ + +I + Sbjct: 118 KSFDELQLQVENITQA 133 >gi|304382531|ref|ZP_07365026.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] gi|304336362|gb|EFM02603.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] Length = 340 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 65/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++ +I LD+S SM + D ++++ + I M++ N+ V Sbjct: 79 GSKITHEKRNGIEAIIALDISNSMLAQ-DVVPSRLEKSKMLIENMVDNF-----TNDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + ++ + T+ + A Sbjct: 132 -GLIVFAGDAFIQLPITSDFVSAKMFLQNIDPSLIATQGTDIAKAIDMAMKSFT------ 184 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + N K I+ +TDGE+ + +L AK +G V+ +G+ + Sbjct: 185 -----QQENVGKAIIVITDGEDH----EGGALEAAKTAKAKGYNVFILGVGTAKG 230 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 83/211 (39%), Gaps = 27/211 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ--SGLVTFSN 228 ++ I+LD S SM + + T++D A ++I A E + +V V G S+ Sbjct: 171 SYNVEIILDASGSMAAKSNGK-TRMDAAKEAIQAFAESLPEQANVALRVYGHKGSGKESD 229 Query: 229 KIEEF--FLLEWG--VSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNT 282 K L +G + ++ + L++F + + L+ A + + G + Sbjct: 230 KTLSCGSSELVYGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLSKLPGDKNTNM- 288 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLR-- 337 I ++DG D + + + I+ IG V + L+ Sbjct: 289 --------IFLVSDG---IETCDGDPVEAAKQLAQSEITPIINVIGFGVDGPGQQQLKEV 337 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A A+ + L+++ + D F+ GK+I K Sbjct: 338 AKAAGGRYVLIQDQKELQDEFNR-GKEIANK 367 >gi|260786375|ref|XP_002588233.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] gi|229273393|gb|EEN44244.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] Length = 1313 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 66/196 (33%), Gaps = 31/196 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + IVLDVS S++ + + +N E K I + GL+TFS+ Sbjct: 308 VSIVLDVSGSID---------MGTLLPRLN--QEASKYIRSFADGSMVGLITFSDTAAVD 356 Query: 234 FLLEWGV--SHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L SH Q I L ST+ G++ + + Sbjct: 357 HALTELTADSHRQSLITALPSSTYGSTSIGAGIQAGLSMLKPTGQGG------------T 404 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 IV MTDG+ + Q + K + I I L + S SFY Sbjct: 405 IVLMTDGQENTAPMIQDVWPSVLQQK---VTLVTIAIGEYADMSLEDLASQTSGLSFYDT 461 Query: 349 ENPHSMYDAFSHIGKD 364 E+ + + F+ I Sbjct: 462 EDASHLSEIFTAISSQ 477 >gi|114800018|ref|YP_760837.1| hypothetical protein HNE_2140 [Hyphomonas neptunium ATCC 15444] gi|114740192|gb|ABI78317.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 583 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 76/249 (30%), Gaps = 51/249 (20%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 IP +F P + V + +++ +A +++LD S SM TKI+ Sbjct: 5 IPKRFARRAPGLRAAMSAVAVLAAAMPAAQAQEAESASILILDASGSMWGQLKGGTTKIE 64 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL-----EWGVSH---LQRKIK 248 +A ++ G++ + ++ E GV + L ++ Sbjct: 65 VARDVMSEYFRTRNAAEP------LGVIAYGHRRRGDCADIEVIAETGVQNAATLSSRVN 118 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + G+ T T L+ A QI I+ +TDG Sbjct: 119 QIRPNGM-TPLTDALRMAQKQIPKTAERAD------------IILVTDG----------- 154 Query: 309 LYYCNE--------AKKRGAIV--YAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMY 355 L C G + + +G + +C A+ + + Sbjct: 155 LETCKADPCALAAELAAEGIEIRAHVVGFGLTEQEAASLSCIPEATGGLLLRPQTGQELS 214 Query: 356 DAFSHIGKD 364 DA I + Sbjct: 215 DALGQIAEA 223 >gi|91775043|ref|YP_544799.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91709030|gb|ABE48958.1| MxaC family protein, potentially involved in Ca2+ insertion into quinoproteins [Methylobacillus flagellatus KT] Length = 326 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 56/173 (32%), Gaps = 13/173 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S KV ++ ++V+D S SM+ F T A + D Sbjct: 68 GSDKVVNRVGKGAQTVMVIDRSVSMDHPFAGDATSGRAGEIKSGAARRLITQFIDSRPDD 127 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQ 278 G+V F+N + + I + G++ TN G+ A + +Q Sbjct: 128 MMGVVAFTNSALYGVKITANRDAIHAAINAATSAGINQTNIGAGITQAASLFDSIQSSGS 187 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + I+ ++DG + E + + +Y I +R Sbjct: 188 ----------RAIILLSDG--AGKLSPRVKARIREELRDKDIKLYWIVLREPD 228 >gi|340361|gb|AAB59512.1| von Willebrand factor prepropeptide [Homo sapiens] Length = 2074 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 849 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 908 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 909 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 965 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 966 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1017 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1018 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1066 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1067 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1103 >gi|73669282|ref|YP_305297.1| protoporphyrin IX magnesium-chelatase [Methanosarcina barkeri str. Fusaro] gi|72396444|gb|AAZ70717.1| protoporphyrin IX magnesium-chelatase [Methanosarcina barkeri str. Fusaro] Length = 688 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 39/274 (14%), Positives = 85/274 (31%), Gaps = 41/274 (14%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD----------NGFVNDIDDIVRSTSL 116 E N ++ + G + + +++ NG + +ST Sbjct: 378 EEQNSDDPQDSTGEQPDASSETIFATGEGYQIKQFSPDFRRENRNGSGRRSKTLTKSTQG 437 Query: 117 DIVVVP----QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL 172 + + + A R P + + N IV+ + + Sbjct: 438 RYIKSKIPEEKTTDIAFDATLRAAAPYQ----LVREKNGNSIVIHDLDFRQKVREKKIGN 493 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIE 231 ++ V+D S SM + ++ + ++ +ML + + GL+ F E Sbjct: 494 LVLFVVDASGSM-----GAQQRMVASKGAVLSML-----MDAYQKRDKVGLIAFKGTGAE 543 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 V Q+ ++ L G + S +K Y I +R+ +T + Sbjct: 544 LLLPPTSSVEMAQKYLEELPTGGKTPLSHGLMK-GYETIHA--ELRRDPDTCP-----FM 595 Query: 292 VFMTDGE----NLSTKEDQQSLYYCNEAKKRGAI 321 V ++DG Q++ + +K Sbjct: 596 VLISDGRANVSMNGEPPLQETKTIASMFRKECIQ 629 >gi|327470147|gb|EGF15611.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK330] Length = 450 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 84/298 (28%), Gaps = 39/298 (13%) Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV---VVPQNEGYSI 129 K GD++ KN + + N V V T ++ V E + Sbjct: 75 EESTAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETSTVDVTKTEVAPAKVDKAPEPITR 134 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + Y +S D++ V+D S SM S Sbjct: 135 RTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTVTKAGSADIVFVVDRSGSMGST-- 192 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-------------EFFLL 236 ID+ +IN + + + GL TFS+++ F Sbjct: 193 -----IDIVRANINEFVRNITK---EGITARFGLATFSDEVYGRNSGSKDEDTVLTRFGS 244 Query: 237 EWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + K L+ G T TP A NQI + KK +V Sbjct: 245 SYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQIISTYDWSKSSKN-----KKFVV 297 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +TD E K G ++ I + A+ +EN Sbjct: 298 LLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAIEG--IYKNFATEGRVLDIEN 353 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 77/199 (38%), Gaps = 34/199 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM SS +++ ++ L ++ + N G+V FS++ Sbjct: 309 VCLVLDVSGSM-----SSSDRLNRMNQAAKYFLSQI-----IENRSWVGMVHFSSQATIV 358 Query: 234 FLLEWGVSHLQRK--IKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S ++R ++ L T+ G+K A+ + + Sbjct: 359 HELIQMNSDIERNKLLQTLPTSAIGGTSICSGIKTAFQVFKNGEYQTDGTE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLV 348 I+ ++DGE D + +E K G+IV+ I + + + + + + L Sbjct: 410 ILLLSDGE------DSTAKDCIDEVKDSGSIVHFIALG-PSADLAVTNMSILTGGNHKLA 462 Query: 349 ENPHS---MYDAFSHIGKD 364 + + DAF + + Sbjct: 463 TDEAQNNGLIDAFGALASE 481 >gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa] gi|3024050|sp|O02668|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa] Length = 935 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 78/208 (37%), Gaps = 32/208 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 S +++ V+DVS SM K+ ++++ +L++++ Q L Sbjct: 290 PESMDPIPKNILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAED------QFSL 337 Query: 224 VTFSNKI-----EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V F++ I + + V+ + I+ + G TN L A + + + Sbjct: 338 VDFNHNIRTWRNDLVSATKTQVADAKTYIEKIQPSG-GTNINEALLRAIFILNEANNLGL 396 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEF 335 + + I+ ++DG+ + Q K+ ++++GI ++F Sbjct: 397 -LDPNSVSL---IILVSDGDPT--VGELQLSKIQKNVKQNIQDNVSLFSLGIGFDVDYDF 450 Query: 336 LRACASPN-----SFYLVENPHSMYDAF 358 L+ ++ N Y ++ S F Sbjct: 451 LKRLSNDNRGMAQRIYGNQDTASQLKKF 478 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 56/165 (33%), Gaps = 1/165 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G ++++ AI I + +G ++ + +K L ID +++ + N+ Sbjct: 15 RRFGWASDGQISMIFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLAGVVR-PND 73 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + + G R SF + + + S + S Sbjct: 74 QQISTAAAVFSGAYRGRFDTAATASFASNTDGSLTGTATTSVKTSFLNVMGTSALGVTAS 133 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 +A + + C + N++ +++ + + S L Sbjct: 134 ATAKAGAQAQSSVCILLVSTVNAQSLLVNSGAQLNAPSCEIHVLS 178 >gi|303241024|ref|ZP_07327534.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591449|gb|EFL61187.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 569 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 50/314 (15%), Positives = 102/314 (32%), Gaps = 30/314 (9%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMS-FRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ N N+ K K I W F L DN + + + + Q Sbjct: 265 MVASMNVNSPGKDKLVAIYPSEGTIWGDVCFTKVLSDNITEDQSKALDEVKKILLSKDLQ 324 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSR------HIVMPITSSVKV-----NSQTDARL 172 +G + + I + +P + + R Sbjct: 325 TKGMNEYFFRPADSSIPLSNAISVQNGVNPSEPQTTLELPGIDVINGILNDWQTNMKKRA 384 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE---EVKLIPDVNNVVQSGLVTFSNK 229 +++ V+D S SM S ID A +I + + K +++ L+TF+ Sbjct: 385 NVLFVIDTSGSM------SGEPIDNARSAIQNLFNKEAQEKNYTSIDDEDTISLMTFNTD 438 Query: 230 I-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + +S + I LS G +T+ + A + ++ + Sbjct: 439 VSDVYTVKGKDISEMSVVIDSLSASG-NTHLYDAVDKAITEHQALKQSESEKKID----- 492 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACASPNSFYL 347 IIV ++DG + +++ L + K+ ++ IG + Y Sbjct: 493 -IIVVLSDGADTNSQIQFSQLESMLKQKEGNLPVIITIGYGNVDKDVLESISDKTGGKYY 551 Query: 348 VENPHSMYDAFSHI 361 NP ++ F I Sbjct: 552 EGNPDTIKKVFEEI 565 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 69/198 (34%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +S +++ ++ L +V V N G+V F + Sbjct: 256 VCLVLDKSGSM-----ASSDRLNRMNQAAKYFLLQV-----VENGSWVGMVHFDSTASVK 305 Query: 234 FLLEW--GVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S + ++ L G T G++ A+ Sbjct: 306 SQLIQIKSNSERNQLLQSLPTVAGGGTQICLGIRAAFQVFKQQNSEIDGSE--------- 356 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACASPNSFYLV 348 IV +TDGE+ + + + G I++ I + ++ + +Y Sbjct: 357 IVLLTDGEDSTASSCVNEVI------ESGTIIHFIALGPSADASVIQMSNLTGGSHYYAS 410 Query: 349 ENPHS--MYDAFSHIGKD 364 +N + + DAF + + Sbjct: 411 DNAQNNGLIDAFGALTSE 428 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 72/204 (35%), Gaps = 44/204 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--------KLIPDVNNVVQSGLVT 225 + +VLDVS SM S+ +++ ++ L+++ + + V+S L+ Sbjct: 308 VCLVLDVSGSMGSY-----DRLNRMNQAAKFFLQQILESRSWAGMVHFHSSATVKSELIQ 362 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ +E LLE + T+ G++ A+ + + Sbjct: 363 INSDVERNQLLETLPTSAS----------GGTSICSGIRTAFQVFKNKGYQTGGND---- 408 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 I+ ++DGE D + +E K GA+V+ I + A + Sbjct: 409 -----ILLLSDGE------DSTAKDCLDEVKDSGAVVHFIALGKAFDQSI-SNMANVTGG 456 Query: 344 SFYLVENPHS---MYDAFSHIGKD 364 + + DAF + + Sbjct: 457 KQLFATDEAQNNGLIDAFGALASE 480 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 78/245 (31%), Gaps = 40/245 (16%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ F N G I+TA+ + L + ++ + I K L D++ + + Sbjct: 15 VKQFSKNEHGVYAIITALLAFPLLLFVAFTVDGTGILLDKARLAQATDQAALLLIAEDNQ 74 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + +K ++ + +F N + ++V+ + ++G Sbjct: 75 YRKNKDHSDVKRQNVSQQEIEREGRNFSNAKVQAQWKKRNQELVQGVCKLYLRSDDSKGQ 134 Query: 128 SISAISRYKIPL---------KFCTF-----------------IPWYTNSRHIVM----- 156 S+ K P + W + +V Sbjct: 135 KNSSPVTIKEPFLAECLEEKTQPKNQNGTAKSVACVVQGSVQRKFWLPWGQTLVSSNQLH 194 Query: 157 -------PITSSVKVNSQTDARLDMMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLE 207 + Q +D+M+V D+S SM+ + D K + I ++ ++ Sbjct: 195 DGRVGINSGKTYAVKEKQITIPIDLMMVTDLSGSMKWYIDRKGDAHKPNRRIDALVEVVG 254 Query: 208 EVKLI 212 EV+ I Sbjct: 255 EVQNI 259 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 46/140 (32%), Gaps = 22/140 (15%) Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMTDGENL 300 ++ + + G T T G+ N + + + N +++++ ++DGE+ Sbjct: 382 NVSEALGKIEPLG-GTAVTSGMLIGINLMTSKNSEPEAAPSKLNTNTRRVLLILSDGEDN 440 Query: 301 STKE----DQQSLYYCNEAKKR--------------GAIVYAIGIRVIRSH-EFLRACAS 341 E + C E K + A G + + + C Sbjct: 441 QPSEKTLVNLMGAGLCREIKDKMNSLQDPKYGQVEPRVAFIAFGTNLPDNQLNAWKQCV- 499 Query: 342 PNSFYLVENPHSMYDAFSHI 361 +Y V + + DAF I Sbjct: 500 GKHYYSVFSKQGLLDAFKQI 519 >gi|291242484|ref|XP_002741138.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 765 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 78/249 (31%), Gaps = 28/249 (11%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P + S S + + L F +R I + + S +++VLD S Sbjct: 235 PNRQNRICSGQSAWGVMLNHTDFKDGNNPTRIIDDTTPNFIIKRSGVRR---VVLVLDTS 291 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM +I +S +E I D + V G +++ +++G Sbjct: 292 GSM------DGDRIQRLHQSATYFIET--RIEDGSFVGIVGFSSYAVIHSGITEIKYGFQ 343 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENL 300 + +T+ GL+ A + D + + ++ +TDG EN Sbjct: 344 RGEIASNVPQVASGATSIGDGLRVALQVLQDGNVTSEGAS---------LLLITDGIENT 394 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 + E G V I + L+ + + ++ V + + Sbjct: 395 YP----LLMNVMQEVYDSGVRVDTIAYT-EAAQSTLQELSDNTGGLYFYVPDNDTSTAFI 449 Query: 359 SHIGKDIVT 367 + I Sbjct: 450 DSLAATISE 458 >gi|290987786|ref|XP_002676603.1| predicted protein [Naegleria gruberi] gi|284090206|gb|EFC43859.1| predicted protein [Naegleria gruberi] Length = 755 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 62/190 (32%), Gaps = 27/190 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSM--------ESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +++ +LDVS SM S ++ +++D+ S+ ++E + Sbjct: 123 PTGGQRQPCNLVCILDVSGSMGSSAEDLSSSNENTGFSRLDLVKHSVRTLIELM------ 176 Query: 216 NNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFD 272 N Q L+ FS+ L ++ I+ L G STN GL+ + Sbjct: 177 NEKDQISLIPFSDSARMELPLTKMDAVGKKKAIEKLEHLGPEGSTNVWDGLRLGMESSLN 236 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVI 330 N A ++ TDGE K+ +++ G Sbjct: 237 --------NPLCAKTNTCLILFTDGEPNINPPRGIVPTLEKYIKEHPLNSTIHSFGFGYS 288 Query: 331 RSHEFLRACA 340 L+ A Sbjct: 289 LDSALLKDIA 298 >gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens] Length = 543 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ + ++ F+ I Sbjct: 73 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFS------VIDFNQNIRTW 120 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + V+ +R I+ + G TN L A + + + + + Sbjct: 121 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEANNLGL-LDPNSVSL- 177 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 I+ ++DG+ + + K+ ++++G+ ++FL+ ++ N Sbjct: 178 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSNENHG 233 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ S F Sbjct: 234 IAQRIYGNQDTSSQLKKF 251 >gi|168278088|dbj|BAG11022.1| von Willebrand factor precursor [synthetic construct] Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|89191868|ref|NP_000543.2| von Willebrand factor preproprotein [Homo sapiens] gi|260158888|gb|ACX32324.1| von Willebrand factor preproprotein [synthetic construct] Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|119609220|gb|EAW88814.1| von Willebrand factor, isoform CRA_a [Homo sapiens] Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|119609221|gb|EAW88815.1| von Willebrand factor, isoform CRA_b [Homo sapiens] Length = 2764 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1539 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1598 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1599 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1655 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1656 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1707 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1708 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1756 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1757 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1793 >gi|340356|gb|AAB59458.1| von Willebrand factor [Homo sapiens] Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|37947|emb|CAA27972.1| unnamed protein product [Homo sapiens] Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|317373549|sp|P04275|VWF_HUMAN RecName: Full=von Willebrand factor; Short=vWF; Contains: RecName: Full=von Willebrand antigen 2; AltName: Full=von Willebrand antigen II; Flags: Precursor Length = 2813 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|269125745|ref|YP_003299115.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310703|gb|ACY97077.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 228 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 73/203 (35%), Gaps = 19/203 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ L +V D S SM + + ++ E+ P V + + +++FS Sbjct: 2 SEQILPFYLVCDESYSMAGNP------LQEINDQLPQIVTEIASNPTVADKARLCIISFS 55 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG--MRQHCNTEDA 285 + E L ++ + + + L+ G + S A+ + D +R Sbjct: 56 DTAEVLLPLAD-LNDVHQ-VPQLAPKGAT--SYGA---AFTLLRDTIERDIRDLKAAGHV 108 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLRACASPNS 344 ++ + F+TDG+ D + + AK G + E LRA A+ + Sbjct: 109 PFRPTVFFLTDGQPTD--SDWATAHQRLTAKDFGPRPTILAFGFGDVRPETLRAVATFRA 166 Query: 345 FYLVENPHSMYDAFSHIGKDIVT 367 ++ +A K ++ Sbjct: 167 -FIANGELDPRNALREFAKQLLN 188 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 32/76 (42%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + I A+ L + +G+ ++ +F +KT L + +D + AA ++ G+ R Sbjct: 11 SGAVIITVALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDALTRA 70 Query: 76 KLKGGDILCRIKNTWN 91 G K + Sbjct: 71 TSAGKTAADLNKINFQ 86 >gi|332260454|ref|XP_003279304.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Nomascus leucogenys] Length = 1160 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 423 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 475 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 476 EVNEDNLEQAQSWIRDMK-IGSSTNTLSALKTAF------------ADKETQA----IYL 518 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +Y I ++ FL+ A+ Sbjct: 519 LTDGRPDQPPEMVI-----DQVKVFQEIPIYTISFNYNDEIANRFLKEVAA 564 >gi|317493250|ref|ZP_07951672.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918643|gb|EFV39980.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 544 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 79/195 (40%), Gaps = 25/195 (12%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW +S+ + + I ++ +N+ +++ ++DVS SM S K+ + S+ + Sbjct: 141 PWNAHSKLLRIAIKAT-DINATALPPANLVFLIDVSGSM-----SDEDKLPLVKNSLKLL 194 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGL 263 + +++ + +V +S + + G S + I LS ST G+ Sbjct: 195 VNKMRDQD------KISIVIYSGETKTVLPPTSGKDKSDILSAINQLSAG-GSTAGGSGI 247 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 AY Q+ + ++ N I+ TDG+ N+ + QQ + K G + Sbjct: 248 DLAY-QMAEKGFIKNGINR--------IILATDGDFNVGITDTQQLEEKIKKKSKNGINL 298 Query: 323 YAIGIRVIRSHEFLR 337 +G ++ L Sbjct: 299 TTLGFGQGNYNDSLM 313 >gi|297622708|ref|YP_003704142.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297163888|gb|ADI13599.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 329 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 87/266 (32%), Gaps = 36/266 (13%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI---------T 159 + R + D N ++SA+ P+ P T + +P T Sbjct: 40 TVERDANGDPTG---NVALNVSALDSAGEPIAGRLSNPRATVTSVTPLPGLAAAQRYTAT 96 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ V+ ++ ++ +D S SM D ++D A + LE V Sbjct: 97 ATITVDITVQEVINAVLNMDRSGSMRLN-DPERLRVDAAK----SFLERVTPED------ 145 Query: 220 QSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + ++ F S+ LL+ S L + G N+ ++ + D Sbjct: 146 RIAVMEFPGQSSGFRASTLLQGFTSDKALLEAALDRVGQRGNTPI-----WDSLLDT-LD 199 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + ++++ TDGE A + V+ IG+ L Sbjct: 200 LHAADEGGQGASRVVLLFTDGEREG--GQVAFGEALAAALESDVRVFTIGLGSDIDTAEL 257 Query: 337 R--ACASPNSFYLVENPHSMYDAFSH 360 + A + +F V + + + F Sbjct: 258 QELAAETGGTFANVASAAELEELFQR 283 >gi|119609222|gb|EAW88816.1| von Willebrand factor, isoform CRA_c [Homo sapiens] Length = 2752 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 86/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S F Sbjct: 1648 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|320529910|ref|ZP_08030987.1| von Willebrand factor type A domain protein [Selenomonas artemidis F0399] gi|320137928|gb|EFW29833.1| von Willebrand factor type A domain protein [Selenomonas artemidis F0399] Length = 643 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 61/166 (36%), Gaps = 19/166 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 R IV+ + + + +++ V+D S SM ++ ++ ++ A+L E Sbjct: 435 GRRIVIARSDLRRAVRERKRGANILFVVDASGSM-----AARARMRAVKGAMLALLREAY 489 Query: 211 LIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + D + GLV F ++ E L V QR ++ L G T GL A Sbjct: 490 VRRD-----RVGLVAFRRDRAETLLPLTRSVELAQRLLRELPT-GGRTPLAAGLSEALLH 543 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + + ++V +TDG + E + A Sbjct: 544 L-------AGAARRGELAETLLVLLTDGRATAAPEGEDPAQAALTA 582 >gi|154252742|ref|YP_001413566.1| vault protein inter-alpha-trypsin subunit [Parvibaculum lavamentivorans DS-1] gi|154156692|gb|ABS63909.1| Vault protein inter-alpha-trypsin domain protein [Parvibaculum lavamentivorans DS-1] Length = 755 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 44/218 (20%), Positives = 81/218 (37%), Gaps = 48/218 (22%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +VM S V + R + + V+D S SM S + A +S+ L+ +K Sbjct: 330 LVMLTPPSGSVQPEAKPR-EAIFVIDNSGSM------SGPSMVQAKESLLWALDRLKPGD 382 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 N ++ F + + F ++ ++ +K L G T P L+ Sbjct: 383 TFN------VIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANG-GTEMLPALRA--- 432 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + D N D + IVF+TDG N + + + ++ +G Sbjct: 433 SLID-------RNVNDGTRLRQIVFLTDGAISNEAELFHEITSNLGRS------RLFTVG 479 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 I ++PNS+++ + F+HIGK+ Sbjct: 480 IG-----------SAPNSYFMTRASEAGRGTFTHIGKE 506 >gi|86131266|ref|ZP_01049865.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85818677|gb|EAQ39837.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 288 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 37/157 (23%), Positives = 61/157 (38%), Gaps = 20/157 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V DVS S FF + I + + N + GL+ F Sbjct: 72 EEERELTMMLVADVSGS--EFFGTD----KQFKNEIVTEIAATLAFSAMQNNDKIGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +++IE F + G SH+ R I+ L +F S+ T L A + ++ + Sbjct: 126 TDEIELFIPPKKGKSHVLRIIRELLEFKPSSKKTD-LAQAIKYLSNVMKKKAIVFVLSD- 183 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 F+TDG +Q N+ G +Y Sbjct: 184 ------FITDG------YEQTMKIAANKHDITGIRIY 208 >gi|302869502|ref|YP_003838139.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504036|ref|YP_004082923.1| von willebrand factor type a [Micromonospora sp. L5] gi|302572361|gb|ADL48563.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315410655|gb|ADU08772.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 572 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/221 (17%), Positives = 73/221 (33%), Gaps = 32/221 (14%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLI 212 I+++ S M+ V+DVS SM+ + ++ + + + + L Sbjct: 355 PGAISTATTTWSVATQSGRMLCVIDVSGSMKKPVATANGASREQVTVAAASQGLGLFD-- 412 Query: 213 PDVNNVVQSGLVTFSNKIEEFF---------LLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + GL TFS ++ L LQR + + +T + Sbjct: 413 ----DSWSIGLWTFSTNLQGSQDWSELVGIKPLSSNRGSLQRGLASIKPSSGNTGLYDTM 468 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK--EDQQSLYYCNEAKKRG-- 319 AY ++ + + IV TDG+N + L N+ K Sbjct: 469 LAAYKKVQQDWEPGKVNS---------IVLFTDGKNEDDDGISQKALLDQLNKLKDDEQP 519 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 V IGI + L + + ++ +P + + F Sbjct: 520 VQVIIIGIGTEVNRAELESITKVTGGGAFVTTDPSKIGEIF 560 >gi|157375479|ref|YP_001474079.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] gi|157317853|gb|ABV36951.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] Length = 2812 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 44/294 (14%), Positives = 92/294 (31%), Gaps = 27/294 (9%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 + D S + + N+ ++ + + + + ++ + D Sbjct: 1972 TAGDGSNITTGNLLDNDSGVSSSTHITEVEGVAAVNGVITVTTALGELAVYSEDSTDHRA 2031 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPL-KFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + + A + L + + I ++ A Sbjct: 2032 GDYEYKLTANSTDGDI---ASESFDYTLTNSLGSNSSASLTVKISDDAPVVHDISQNLQA 2088 Query: 171 RLDMM-----IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 D++ +VLDVS SM + T +++AI ++ A++ EV +VN +V Sbjct: 2089 NADVVTTNLTLVLDVSGSMGDPVGNGQTYLEVAIDALTALINEVDSTGNVN----IQIVN 2144 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F + L V+ ++ L +G + L N G Q Sbjct: 2145 FHSNTGSSGWLIDDVAGAISYLESLVTYGPTHYDA-ALNAVMNSGSLPDGADQSL----- 2198 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIV-YAIGIRVIRSHEFL 336 + F++DG +E +L + + G + IGI + L Sbjct: 2199 -----LYFISDGSPSPGQEVDPALQSIWESYLVNSGYKTAFGIGIGSAGLSDLL 2247 >gi|150024242|ref|YP_001295068.1| BatB protein [Flavobacterium psychrophilum JIP02/86] gi|149770783|emb|CAL42248.1| BatB protein [Flavobacterium psychrophilum JIP02/86] Length = 346 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 66/173 (38%), Gaps = 20/173 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 KV +D++ +DVS+SM D + +++ + + ++ ++ + + Sbjct: 78 GTKVEKVKREGIDIVFAIDVSKSMLCE-DIAPNRLEKSKQVVSQIINNLGSD-------R 129 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + + + ++ ++ VS+ + + + + Sbjct: 130 IGIIAYAGSAFPVLPITTDYNVAKMFLQSMTSDMVSSQGS--------NLDEAIKLSAKY 181 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N K+++ +TDGE+ S + + KK G + IGI Sbjct: 182 FEGSPNTSKLMIMLTDGEDHSEGAESAAEEA----KKIGMKIITIGIGTTAGG 230 >gi|304394408|ref|ZP_07376331.1| von Willebrand factor, type A [Ahrensia sp. R2A130] gi|303293848|gb|EFL88225.1| von Willebrand factor, type A [Ahrensia sp. R2A130] Length = 689 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 70/214 (32%), Gaps = 32/214 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT + ++M+V D S SM + KI++A ++++L E + + G++ + Sbjct: 24 QTQSSRNVMVVFDGSGSMWGQIEGRA-KIEIARDVLSSVLGE------TTSNMTIGMIAY 76 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM-RQHCNTEDA 285 ++ I+ + G + ++ P + N I + A Sbjct: 77 GHR----------KKGQCSDIETVVAPGPAASTVPEMIARANAIKPKGKTPLSDAVRKAA 126 Query: 286 NYKKI------IVFMTDGENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIRVIRSHEF 335 + +V +TDG + E ++ G V G+ + Sbjct: 127 ESLRYTENEATVVLVTDG---IETCNADPCALATELEESGVDFTTHVVGFGLSKDEGRQV 183 Query: 336 LRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + F ++ + A D + Sbjct: 184 ACLAANTGGKFISADDADELKAALDETLSDFEEE 217 >gi|229523713|ref|ZP_04413118.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] gi|229337294|gb|EEO02311.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] Length = 886 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 54/134 (40%), Gaps = 9/134 (6%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+T ++ ++LDVS SM + T++ + +S ++E+ + + VQ L Sbjct: 267 ESETKQGANVQLMLDVSGSMGRDAGNGKTRLQVMKESAIQLIEQYQALGQTK--VQLILF 324 Query: 225 TFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + I+ L W + + I LS G T+ +K A + Sbjct: 325 SSDASIKTASGLLWMTVAEAKNYINALSANG-GTDYDDAIKLAQESWSGTINGQPLSGAT 383 Query: 284 DANYKKIIVFMTDG 297 + +Y F++DG Sbjct: 384 NVSY-----FLSDG 392 >gi|156933957|ref|YP_001437873.1| hypothetical protein ESA_01783 [Cronobacter sakazakii ATCC BAA-894] gi|156532211|gb|ABU77037.1| hypothetical protein ESA_01783 [Cronobacter sakazakii ATCC BAA-894] Length = 346 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 56/206 (27%), Gaps = 16/206 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM + + ++ +++ P ++ F+ Sbjct: 3 RLPVFFVLDCSESM------IGENLKKMTDGLQMIVGDLRKDPHALETAWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G T+ L+ QI H D + Sbjct: 57 RTIVPL----HEIASFYPPRLPVGGGTSLGAALRELTVQIDTQVRKTTHEAKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + + A + A+G+ LR + Sbjct: 111 VYLLTDGRPTDDTTAEVKRWKDHYA--SKVNLIAVGLGPSADLNILRQLTENVMLFTESR 168 Query: 351 PHSMYDAFSHIGKDIVT--KRIWYDK 374 I + + + DK Sbjct: 169 EGDFTRFIKWITASVTAHSRSVGDDK 194 >gi|238063244|ref|ZP_04607953.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885055|gb|EEP73883.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 265 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 9/143 (6%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM + D ++ A + ++ + + GLV F+ Sbjct: 88 VMVAVDVSTSMLAS-DVKPDRLSAAKDAARRFVDGLP------DEFNVGLVAFAGSAAVL 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L I+ L G + + A N +D + I+ Sbjct: 141 VPPGTDREALHDGIERL-AEGSTGVQGTAIGEAINTSLGAVRGLDSQAAKDLPPAR-IIL 198 Query: 294 MTDGENLSTKEDQQSLYYCNEAK 316 ++DG N S + ++ +AK Sbjct: 199 LSDGANTSGMDPMEAAAEAVDAK 221 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 37/109 (33%), Gaps = 11/109 (10%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI------ 65 + KG + I + + ++ + ++V ++ K L +++D + + AA I Sbjct: 11 YRAQKGVVAIFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQ 70 Query: 66 -----MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 NN L G + + F + L + N Sbjct: 71 AEAIVAGNEAINNNLTLDGFGSMTIDNTDIHYEFSDSLPFDSSTNTATS 119 >gi|325689802|gb|EGD31806.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK115] Length = 450 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 60/200 (30%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--TALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|323138702|ref|ZP_08073768.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322396057|gb|EFX98592.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 333 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 46/132 (34%), Gaps = 24/132 (18%) Query: 173 DMMIVLDVSRSMESFF------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++++D S SM F +K A + + ++ G+ F Sbjct: 84 QIVMLIDRSGSMNETFAGRNPSGGEESKAAAAKRILKTFVDS-------RGRDLVGVAAF 136 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S L + + I + + G+ TN GL A + DA Sbjct: 137 STSPMLVTPLSDHIDATKAAIDAMDRPGLDYTNIARGLAMALSMFRAS----------DA 186 Query: 286 NYKKIIVFMTDG 297 ++ + I+ ++DG Sbjct: 187 DHSRAILLISDG 198 >gi|284173928|ref|ZP_06387897.1| hypothetical protein Ssol98_04615 [Sulfolobus solfataricus 98/2] gi|261602665|gb|ACX92268.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 356 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 71/208 (34%), Gaps = 34/208 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V I V + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFIVYIVPQQGAITSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLLYSLNEGDY 75 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V L+ FSN E + + I + G +T + +A N Sbjct: 76 VT------LILFSNHPEIKYQ-----GPAKGIITFDVGKGYTTRLHEAVNFALN------ 118 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 119 -----LAKQSQVPNK-IIMLTDGKPTDKR-NVKDYEKFDIPPNT--QIITIGIGSDYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSH 360 L+ A S FY +++ + D F Sbjct: 170 ILKKLADKSSGKFYHLKDISELPDVFES 197 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 14/119 (11%), Positives = 40/119 (33%), Gaps = 1/119 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RRGLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTSA 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + + D+ + V P N Y Sbjct: 72 ISLQVAEADGIAAGHANFA-FFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVANPANVKY 129 >gi|298704728|emb|CBJ28324.1| conserved unknown protein [Ectocarpus siliculosus] Length = 877 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 67/201 (33%), Gaps = 35/201 (17%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSIN--AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 VLD+ RS + + +++ + ++ + + V+ GLV+F + ++ Sbjct: 273 VLDLQRS-DPYRSRKQSRLSLTRMDVSKQLFHAFINRQQAYDLPVEVGLVSFGDDVDVPC 331 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA------------------------YNQI 270 + + ++ L+ G +T + A + Sbjct: 332 EPTPLFENFRDEVDTLTPAG-NTKLFDAISEACTLLEKWQTEWVEKADKRKEEENRKRKA 390 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ +V ++DG+ TK + C +K G IV +I + Sbjct: 391 AGGDQANHGPVPDEKRPVLRVVVLSDGK--DTKSTISAHAVCGRLQKVGVIVDSITVGTE 448 Query: 331 RSHEFLRAC---ASPNSFYLV 348 ++++ C A+ + Sbjct: 449 KNNQL--KCLSLATGGYAFHP 467 >gi|256959601|ref|ZP_05563772.1| predicted protein [Enterococcus faecalis Merz96] gi|256950097|gb|EEU66729.1| predicted protein [Enterococcus faecalis Merz96] Length = 370 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 75 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 123 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 124 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 175 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 176 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 219 >gi|22298517|ref|NP_681764.1| hypothetical protein tlr0974 [Thermosynechococcus elongatus BP-1] gi|22294697|dbj|BAC08526.1| tlr0974 [Thermosynechococcus elongatus BP-1] Length = 241 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 72/206 (34%), Gaps = 21/206 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + ++LD S SME I+ + + EV ++V+ G++TF++ Sbjct: 22 LPVYLLLDTSSSMEG------APIESLHQGLEQFQREVSSDQFARDIVKVGVITFASD-- 73 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L+ G+ + + T + D +R + ++K + Sbjct: 74 -AQLVTGGLVPISDFQPPMLTASGVTRLDLAF-TVLLESIDRDVVRPVKGGQKGDWKPAV 131 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGA----------IVYAIGIRVIRSHEFLRACAS 341 +TDG + A+ ++ A+G + L+A ++ Sbjct: 132 FVLTDGRPTDRHG-IATDELWRPARDALVNRPKGEIKPSVIVAVGCGPHVDDDTLKAIST 190 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVT 367 +F + + + F ++ + + T Sbjct: 191 GTAFKMGTSEAAFVALFQYLSQSLTT 216 >gi|293384268|ref|ZP_06630156.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387001|ref|ZP_06631569.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312906299|ref|ZP_07765310.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312909645|ref|ZP_07768499.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078416|gb|EFE15780.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083543|gb|EFE20506.1| putative von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310627674|gb|EFQ10957.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311290047|gb|EFQ68603.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 383 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 27/169 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K + ++ + + + QT++ +D+++V+D S SM K++ Sbjct: 88 YLKKSVAAVTGQQGLFNVTLDVKGN-----QTESPIDLVLVIDYSSSM------KGEKLN 136 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A+K + EE+ + V+ G+V ++ L + + + L F + Sbjct: 137 NALKGLQQFGEEL-SDSLTDGHVRIGIVAYNR-------LTYSTADFSTDMNDLEDFLRN 188 Query: 257 TN---STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S ++ ++G R N KK++V + DG ++ Sbjct: 189 TAEPHSGTFMQKGL-----LEGQRLLAEKSRPNAKKMLVHIGDGSANAS 232 >gi|311105413|ref|YP_003978266.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] gi|310760102|gb|ADP15551.1| hemolysin-type calcium-binding repeat family protein 3 [Achromobacter xylosoxidans A8] Length = 2061 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 33/147 (22%), Positives = 59/147 (40%), Gaps = 25/147 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDS---SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 A ++ +VLD+S SM+ ++ S T+++ A K++ ++LE + + + L+T Sbjct: 1440 GASYNIALVLDLSGSMDYYWGSGSNQETRLETAKKALKSLLE--NQLATHDGTINVSLIT 1497 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL----------KYAYNQIFDMQG 275 F S LQ+ I L+ V L A+N+ Sbjct: 1498 F----------ADSSSKLQKAISGLTPDNVDDMVNILLGLKAGGGTPYGAAFNETKSWFD 1547 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLST 302 + + YK + F+TDGE S Sbjct: 1548 GQPTEDANGNAYKNLTFFLTDGEPSSE 1574 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 34/92 (36%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F G + + A+ PI V ++ S +F + L SM D + V A + Sbjct: 7 RFLRARGGNLATMAALVSPIFLAVAAFSVDTSSLFLERRQLQSMADFAAVAGAASLSQAN 66 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDN 101 + R+ G + ++ S N DN Sbjct: 67 DAVLRQLRANGLDPVLMTGAYDPSVVNGKTDN 98 >gi|114643008|ref|XP_001160508.1| PREDICTED: von Willebrand factor isoform 2 [Pan troglodytes] gi|114643010|ref|XP_508945.2| PREDICTED: von Willebrand factor isoform 3 [Pan troglodytes] Length = 2813 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 87/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + S LD++++LD S S F Sbjct: 1648 PILIQDFESLPREAPDLVLQRCCSGEGLQIPTLSPASDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|126722991|ref|NP_062242.2| inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] gi|59808174|gb|AAH89806.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Rattus norvegicus] gi|149034208|gb|EDL88978.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_a [Rattus norvegicus] Length = 933 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 90/280 (32%), Gaps = 46/280 (16%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 L+ + ++ + I + + + ++ ++ K ++F + S Sbjct: 166 EQLVKHLQMDIYIFEPQGISTLETESTFMTQELANALTTSQNKTKAHIQFKPTLSQQRKS 225 Query: 152 RH-----IVMPITSSVKVN-SQTDARLDMM---------------------IVLDVSRSM 184 ++ + T V+ S T + + V+D S SM Sbjct: 226 QNEQDTVLDGDFTVRYDVDRSSTGGTIQIENGYFVHHFAPEDLPTMAKNVLFVIDKSGSM 285 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEWGVSHL 243 KI +++ +L+++ N +V SG + + + +L Sbjct: 286 AG------KKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATE-----ENL 334 Query: 244 QRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 R + Y S TN + A + + + +I+ +TDGE Sbjct: 335 NRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPTV 390 Query: 302 TKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACA 340 + + + + + G ++ +G ++ FL A Sbjct: 391 GETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLA 430 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 58/185 (31%), Gaps = 15/185 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 208 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGDVNI-LIPFETEAHRPVKA 264 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ GL A +++ Q+ IV MT Sbjct: 265 TGTSTSDLLHEADATDASG-GTDIYEGLLSALDELPSESEASQYTTA--------IVLMT 315 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 316 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 373 Query: 356 DAFSH 360 F Sbjct: 374 AVFRQ 378 >gi|296274053|ref|YP_003656684.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296098227|gb|ADG94177.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 2811 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 87/264 (32%), Gaps = 31/264 (11%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD--NGFVNDIDDIV 111 +D S T N + NTW ++ +D + + Sbjct: 1721 LDGSESLTVTITNVPDNATLTTNNSSYTLTNNHNNTWTVNLPEGAKDVSDSITMTVPKGT 1780 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + L I +I Y + ++ + SS + + Sbjct: 1781 ENIDLGITARATEANDNIDGD-NYAETTDSDAVVYSEDETQTLNFDGASSAAIATN---- 1835 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++I LDVS SM S D + ++ +A +++ M+ E + VN LVTF++ + Sbjct: 1836 --VVITLDVSGSMTSN-DEHVNRLALAKEALAKMINEYESQGSVN----VKLVTFND--D 1886 Query: 232 EFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + W I LS G TN + YN + R + Sbjct: 1887 GHAVNTWMSAKDAISAINNLSS-GGKTNYEDAVYETYNNYTEPSADRT-----------V 1934 Query: 291 IVFMTDGENLSTKEDQQSLYYCNE 314 F++DGE KE+ + CN Sbjct: 1935 AYFISDGEPT--KENNEGCDPCNN 1956 >gi|222530086|ref|YP_002573968.1| Ig domain-containing protein group 2 domain-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222456933|gb|ACM61195.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor bescii DSM 6725] Length = 1831 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIV-----MPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + Y K T + + +++ + IT + D++ V+D + SM Sbjct: 798 KATYNESSKTITVETNHFSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTGSMSD 857 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +ID ++IN ++++K V GLVT+ + + G Sbjct: 858 -------EIDAVKQNINNFVDKLKTKDIS---VNLGLVTYKDITCDGLNSTVGHGFFSSA 907 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + G G + D N K IV +TD E++ Sbjct: 908 DDFKNALGSIKVDGGG--DTPETLIDALET-ARLLGFRENSTKFIVVLTDA--NYKLENR 962 Query: 307 QSLYYCNE----AKKRGAIVYAIG 326 + +E K IV + Sbjct: 963 FGIKSADEIIERLKSDNIIVSVVS 986 >gi|2292988|emb|CAA72155.1| Inter-alpha-inhibitor H4 heavy chain [Rattus norvegicus] Length = 932 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 62/171 (36%), Gaps = 19/171 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEE 232 ++ V+D S SM KI +++ +L+++ N +V SG + + + Sbjct: 274 VLFVIDKSGSMAG------KKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQ 327 Query: 233 FFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +L R + Y S TN + A + + + + Sbjct: 328 ATE-----ENLNRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----L 378 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACA 340 I+ +TDGE + + + + + G ++ +G ++ FL A Sbjct: 379 IILLTDGEPTVGETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLA 429 >gi|221121786|ref|XP_002165500.1| PREDICTED: similar to procollagen, type XIV, alpha 1 [Hydra magnipapillata] Length = 3126 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/298 (12%), Positives = 85/298 (28%), Gaps = 48/298 (16%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 L + + + + + + + T ++ V + + PL Sbjct: 673 SLKAVTELFKSAISEKAPPGKYEKPV--VKNDTVVNFVKDDEPTKSEVGQTHSRFFPLGA 730 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS-RSMESFFDSSITKIDMAIK 200 + + ++LD+ V+D S S ++ D ++ + Sbjct: 731 LVSK----------GSKPPCICPDKICSSKLDLAFVIDASAGSEQNGKDKMAETMEFGRR 780 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-----EFFLLEWGVSHLQRKIKYLSKFGV 255 +A + N+ GLVT++ + F ++ + ++ G Sbjct: 781 VASAF-----KVDQENS--HLGLVTYATDAQIMLNFHHFNDPDTLTEARDAVRVKPHTGK 833 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + L A +FD ++ +++ MTDG Sbjct: 834 --YTGQALSLAKEGLFDKG--------HRSDALDVLILMTDG-----PSSDDVTEPSRAL 878 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + G + A+GI + L AS + + YD I + Sbjct: 879 RDMGVKIIAVGIGNQIDRKQLNDIASDPDDEHVFTAD-----YDNLGTIIRRTQRAAC 931 >gi|260785822|ref|XP_002587959.1| hypothetical protein BRAFLDRAFT_87354 [Branchiostoma floridae] gi|229273114|gb|EEN43970.1| hypothetical protein BRAFLDRAFT_87354 [Branchiostoma floridae] Length = 602 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 71/203 (34%), Gaps = 32/203 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++ LD+S S+ + ++A + A ++ + ++ + ++ ++ + Sbjct: 397 AIDIVLALDLSSSI------PQDQFELARDFMVAFVDC--EVFQEKDI-RIAVLNYTCEA 447 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + +F L + +I L + T + + + ++ G H Sbjct: 448 DTYFDLAPIAYGMSYEIGQLMRGDGGITRTGHAINHM--RLTSKFGADAHHTA------- 498 Query: 290 IIVFMTDGENLS-----------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 V +TDG+ + + + +A+ G +YA+ + + L Sbjct: 499 --VILTDGQTMICYTVLLSGLPFPQSEDDQQAAAADARNAGIELYAVEFGYLVNGMALAT 556 Query: 339 CASPNSFYLVENPHSMYDAFSHI 361 S V + DA I Sbjct: 557 MTGDPSGSRVFDTSQACDAAQKI 579 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 67/190 (35%), Gaps = 16/190 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEF 233 MIVLD S SM + T+ID A ++ +LI D+++ GL + E Sbjct: 1 MIVLDNSGSMTAQDAGGQTRIDAAKQA------STQLINDISDRTDVGLTYYGGNTGETE 54 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +E G + I G + + + F G + IV Sbjct: 55 ADVEMGCQDV--TILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGNIVL 112 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRACA--SPNSFYLV 348 ++DG T D E + G ++ IG+ V + E L A ++ Sbjct: 113 VSDGIANCTPPD--VCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADA 170 Query: 349 ENPHSMYDAF 358 + S+ DA Sbjct: 171 SDAQSLTDAL 180 >gi|324993014|gb|EGC24934.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK405] gi|327462310|gb|EGF08637.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1] gi|327489663|gb|EGF21455.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1058] Length = 450 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 60/200 (30%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--TALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|302561188|ref|ZP_07313530.1| LOW QUALITY PROTEIN: toxic cation resistance protein [Streptomyces griseoflavus Tu4000] gi|302478806|gb|EFL41899.1| LOW QUALITY PROTEIN: toxic cation resistance protein [Streptomyces griseoflavus Tu4000] Length = 239 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 71/212 (33%), Gaps = 37/212 (17%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM+ ++ + + + + D V Sbjct: 26 SLTKHGMDGLRAAVYLVVDYSGSMKPYYRDG--SVQALADRVLGLSAHLD--DDGTVPV- 80 Query: 221 SGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V FS ++ L ++R + L G T+ + + D Sbjct: 81 ---VFFSTDVDAVTDIALADHRGRIERIVAGLGHMGR-TSYHLAMDAVIDHYLDSG---- 132 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--- 335 + ++VF TDG ++ ++ Y C A+ + +G +F Sbjct: 133 ------STAPALVVFQTDGGPVNKLAAEK--YLCKAARLPLFWQF-VGFGDPNGKQFDYL 183 Query: 336 --LRACASPNS-------FYLV-ENPHSMYDA 357 L A A P F+ ENP + DA Sbjct: 184 RGLDALAVPAKRVIDNAGFFHAGENPGEVTDA 215 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 67/220 (30%), Gaps = 61/220 (27%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL----------------L 236 T++D ++N +++ + + G+ FS+ + Sbjct: 1184 TRLDALKNAVNQFIDDTAKKSPNS---KIGITVFSSTDDYNRPYGNHGTSVSLGEVGTAD 1240 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 V+ L+ +K L G T+ GL+ A N++ M K +V TD Sbjct: 1241 SAKVTELKNFVKDLKANG-GTDPAVGLEDAKNKLDAMVDTN----------PKYVVLFTD 1289 Query: 297 GEN------LSTKEDQQSLYYCNEAK----------KRGAIVYAIGIRVIRSHEFLRA-- 338 G+ ++ + + E K K VY IG + + + Sbjct: 1290 GKPTGGGNKWNSNAQKNAETQAGELKTGLRNNVDNAKNPYTVYTIGFALNDEGDRAKTFL 1349 Query: 339 -------------CASPNSFYLVENPHSMYDAFSHIGKDI 365 +S + ++ S+ F I I Sbjct: 1350 SGGTYDGKKDPGIASSSDCAKTADDAASLTQIFQSISSTI 1389 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 73/198 (36%), Gaps = 18/198 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFD---SSITKIDMAIKSINAMLEEVKLI--PDVNNVVQSG 222 LD+++ +DVS SM + + T+ + S+ ++ +++ + G Sbjct: 67 NHVPLDLVLSIDVSGSMGADAPVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDDHDRLG 126 Query: 223 LVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYA---YNQIFDMQG 275 +VTFS + L + + + +++ L + TN G++ +N Sbjct: 127 IVTFSTSSKVVRELTYMTPANKAKILKQLDALQPLSM-TNLWHGIRDGLSLFNNNLKAVN 185 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 R++ + ++ +TDG +Q + + A ++ G Sbjct: 186 DRRNPGSGRVPA---LLVLTDGMPNHQCPNQGYVAKLRQWSTLPASIHTFGFGYSLRSGL 242 Query: 336 LRACAS--PNSFYLVENP 351 L++ A ++ + + Sbjct: 243 LKSIAEVGGGNYSFIPDA 260 >gi|146284562|ref|YP_001165515.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320695|gb|ABP62841.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 651 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 75/211 (35%), Gaps = 30/211 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + ++ + N ++ V+D + SM + ID ++I+ + ++++ Sbjct: 201 NGSATTSAKATDEKNMMKGFSASVVFVIDSTISMGPY-------IDRTKEAIDKIYKQIE 253 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VSHLQRKIKYLSKFGVSTNSTP 261 + + V+ GLV + + ++ LE+ V + + + +T S+ Sbjct: 254 K-EQLQDKVKFGLVAYRSSVKAVPGLEYDAKMYVDPNTVKDGKDFLAKVHDLKQATVSSS 312 Query: 262 GLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMT-----DGENLSTKEDQQSLYYCNEA 315 + AY + + +V +T DG + + + EA Sbjct: 313 KVDEDAYAGVMTALDKVDWTQFGA----RYVVLITDAGALDGTDSLSSTHLGAEQVRQEA 368 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 RG +Y + ++ + AS + Y Sbjct: 369 AYRGVALYTLHLKTPDGK---KNHASAAAQY 396 >gi|320106178|ref|YP_004181768.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924699|gb|ADV81774.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 365 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 73/226 (32%), Gaps = 42/226 (18%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + ++ D + + I++D S SM K + K+ + + Sbjct: 103 GVPQTIASFRHEDLPVSLGILIDSSGSMYD-------KREAVGKASLDFVRLSNPKDEAF 155 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQ 274 LV FS +E F+ + S +++ L K T + + + + Sbjct: 156 ------LVDFS---DEAFIDQDFTSDVKKLEDGLGYVKASGGTAIYDAVVASADYL---- 202 Query: 275 GMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIG--IRVIR 331 K++++ +TDG +N S + ++ E G ++Y +G Sbjct: 203 ------AKNAKLPKQVLLIVTDGDDNASGSTLEDAIRRVQEL--DGPVIYCVGLLFGPDS 254 Query: 332 SHE-------FLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + L A + Y + + + +DI + Sbjct: 255 NKSESRHARRVLETLAAQTGGLAYFPRKLSEVDSIATEVAQDIRQQ 300 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 37/194 (19%), Positives = 64/194 (32%), Gaps = 21/194 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEE 232 I+LD S SM D ++K +MA I +++K V V G S K++ Sbjct: 40 AILLDASGSMAKRIDG-VSKYNMAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQS 98 Query: 233 FFLLE--WGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +G ++ + L+ G + + ++ + N Sbjct: 99 CESIRGVYGFQRFDKQSFLNSLNGIGPTGWTPIA--------KALEDAKASFNGVHKLGS 150 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRAC--ASPNSF 345 K + +TDGE + E +K+ V IG L A A + Sbjct: 151 KSVYLLTDGE---ETCGGDPIKTAKELRKQHIKVNVIGFDFNEGFNGQLHAIAGAGGGKY 207 Query: 346 YLVENPHSMYDAFS 359 Y + M F Sbjct: 208 YEAHSQKDMNRIFK 221 >gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] Length = 670 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 80/214 (37%), Gaps = 37/214 (17%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F Y + ++ + + ++ S +++ V+D S SM ++ A ++++ Sbjct: 292 FSEEYKGEHYALVMLRTPDEMTSGPRMPREVVFVIDTSGSMAG------QRMYHAKQALS 345 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ-----RKIKYLSKFGVSTN 258 +E + + +V F+N+ F S + + L + G T Sbjct: 346 QAVERLSPDD------RFNVVEFNNQHSRLFSSMRSASAINVKQALNWVGRL-QGGGGTM 398 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 P ++ A + D +RQ ++ +TD + E + + +++ Sbjct: 399 MLPAVEDALSVRSDPAYLRQ------------VILITDASVGNEAEILRVVER----QRK 442 Query: 319 GAIVYAIGIRVIRSHEFLRACA---SPNSFYLVE 349 GA ++ +GI V + LR A + Y+ Sbjct: 443 GARLFTVGIGVSPNSYLLRKAAQVGQGDYVYIAS 476 >gi|149375210|ref|ZP_01892982.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] gi|149360574|gb|EDM49026.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] Length = 658 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 80/222 (36%), Gaps = 38/222 (17%) Query: 154 IVMPITSSVKVNSQTDAR--LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + +P T+ + D+ I++D+S SM++ +++ + ++ + + Sbjct: 12 LALPTTALAQQAPTLQLPDSADVRIIVDISGSMKTNDPNNLRR-----PAVRLLARMLPG 66 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + G+ TF + +W ++R ++ + TN ++ A Sbjct: 67 QANA------GVWTFGQYVNMLVPHGKVTDDWRGLAVERS-DEINSVALRTNLGEAIQVA 119 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSL---YYCNEAKKR 318 + + + + +TDG +N S + ++ +E R Sbjct: 120 SDDYLLGSDSLDNTD---------FILLTDGKVDISDNESANDRERERILGALLDELSSR 170 Query: 319 GAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 GA ++ + + L++ A + + L + ++ AF Sbjct: 171 GATLHTVALSEEADLALLKSLAERTGGRYALASSADALTLAF 212 >gi|114643012|ref|XP_001160253.1| PREDICTED: von Willebrand factor isoform 1 [Pan troglodytes] Length = 2290 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 41/277 (14%), Positives = 87/277 (31%), Gaps = 35/277 (12%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + N ++I + + + V N + I Sbjct: 1588 HSFLVSQGDREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 1647 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + S LD++++LD S S F Sbjct: 1648 PILIQDFESLPREAPDLVLQRCCSGEGLQIPTLSPASDCSQPLDVILLLDGSSS---FPA 1704 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQR 245 S ++ K+ + + + P + Q ++ + + W V +HL Sbjct: 1705 SYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSITTIDVP--WNVVPEKAHLLS 1756 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + G + L +A + H A+ K +++ +T Sbjct: 1757 LVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGAS-KAVVILVT------DVSV 1805 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 1806 DSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 1842 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/224 (15%), Positives = 73/224 (32%), Gaps = 24/224 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS- 221 K + ++ + I+LD S SM + K+D+A ++I + + + ++ Sbjct: 133 KQGKEQAKQVHVEILLDASGSMAGRIRDGV-KMDLAKEAIENFVS--DMPENAKISLRVY 189 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G + K ++ + Y+ KFG + NS + + + Sbjct: 190 GHKGSNRKQDQKESCA-STEVVYPHGSYVKGKFGKALNSFEP--TGWTPLAAAMEEARQD 246 Query: 281 NTE--DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIR-SHEF 335 + + II ++DG + V IG V + Sbjct: 247 LKPYAGEDAENIIYVVSDG---IETCGGDPVKAAKSLYNSDIQAVVNIIGFDVDDAGQQA 303 Query: 336 LRACAS--PNSFYLVENPHSMYDAF----SHIGKDIVTKRIWYD 373 L+ A + + +F I K++ ++WY+ Sbjct: 304 LKKVAEAGGGEYKTANTREELNQSFGIDWDEIEKEVS--KVWYN 345 >gi|281205104|gb|EFA79297.1| type A von Willebrand factor domain-containing protein [Polysphondylium pallidum PN500] Length = 811 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 81/259 (31%), Gaps = 27/259 (10%) Query: 90 WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR---YKIPLKFCTFIP 146 +F N ++ + +I + R P F P Sbjct: 213 LKSNFDKFEVSNYQYKITKTTDSLNNIITIESEDGLPKNIMVMVRLLESDAPQSFVETSP 272 Query: 147 WYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 N + + + +S + + + + V+D S SME T+I + + ++ + Sbjct: 273 ADPNQYAVALSFKPKLDCDSLKVRQKAEFIFVVDCSGSMEG------TRIALTRRVLHFL 326 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLK 264 + + N+++ G + + VS I + T LK Sbjct: 327 VRSLPFSSVKFNIIRFGSNFNKMFPQSKIYDDDSVSDSAMYIDNKVHADLGGTELYQPLK 386 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 +Q DA Y + I+ +TDG + + + + + ++ Sbjct: 387 SILSQ------------KHDAEYPRQIIVLTDG---GVSGKGHLIDFVRK-EDSTSRIFT 430 Query: 325 IGIRVIRSHEFLRACASPN 343 +GI + +R+ A Sbjct: 431 VGIGNGADQDLVRSLAEAG 449 >gi|126662673|ref|ZP_01733672.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] gi|126626052|gb|EAZ96741.1| hypothetical protein FBBAL38_04940 [Flavobacteria bacterium BAL38] Length = 288 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 46/158 (29%), Positives = 68/158 (43%), Gaps = 22/158 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+D+S S ESF + K D + I A L + GL+ F Sbjct: 72 EEERELTMMLVVDISGS-ESFGTKNQQKRD-MVTEIAATLAFSATQNNDK----IGLLLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++IE F + G SH+ R I+ L +F +N T L A + + Sbjct: 126 SDQIELFIPPKKGKSHVLRIIRELIEFQPKSNKTD-LAQALKYLSGVLK----------- 173 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVFM + K+ QQ+L + G VY Sbjct: 174 -KKAIVFMI--SDFMVKDYQQTLKIAAKKHDVTGIRVY 208 >gi|226290246|gb|EEH45730.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 757 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 25/206 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 D+++ +DVS SM+ T K + S+ + + +I +N + G+V Sbjct: 73 PCDIVLCIDVSGSMQLSAPLPTTNESGKREETGLSVLDLTKHAARTIIETLNENDRLGVV 132 Query: 225 TFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 TFSN E + + ++ L STN GLK N + ++ Sbjct: 133 TFSNDAEVAYKISHMDDTNKKAALEAVEALQPL-ASTNLWHGLKLGLNVLGEVD------ 185 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + + +TDG+ Q + K R +++ G L Sbjct: 186 --LRPRNVQALYVLTDGQPNHMCPTQGYVPKLRPILERQKDRLPLIHTFGFGYDIRSGLL 243 Query: 337 RACAS--PNSFYLVENPHSMYDAFSH 360 ++ A ++ + + + F H Sbjct: 244 QSIAEVGGGTYSFIPDAGMIGTVFVH 269 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 39/197 (19%), Positives = 72/197 (36%), Gaps = 35/197 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM ++ +++ ++ L + + G+VTF + Sbjct: 20 VCLVLDKSGSM-----ATGNRLNRLNQAGQLFLLQTVELGSW-----VGMVTFDSAAHVQ 69 Query: 234 FLL--EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S K L T+ GL+ A+ I +++ Sbjct: 70 SELIQINSGSDRDTLAKRLPAAASGGTSICSGLRSAFTVI-----RKKYPTDGSE----- 119 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYL 347 IV +TDGE+ ++ C NE K+ GAI++ + + + E L Y Sbjct: 120 IVLLTDGEDN-------TISGCFNEVKQSGAIIHTVALGPSAAQELEELSKMTGGLQTYA 172 Query: 348 VENPHS--MYDAFSHIG 362 + + + DAF + Sbjct: 173 SDQVQNNGLIDAFGALS 189 >gi|134076517|emb|CAK39712.1| unnamed protein product [Aspergillus niger] Length = 688 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 45/296 (15%), Positives = 92/296 (31%), Gaps = 45/296 (15%) Query: 41 SHIFFMK--TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRN-E 97 I K + + D ++ + + W S + Sbjct: 52 GSILIKKHLNSMEAAADEPILAL------NLSVSELPCFYLRFPNRSQLELWRRSLLDLH 105 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 ++ N+ D S + + S S Y T I YTN Sbjct: 106 PMESTSPNNDYDYDNSGAEEEEYRNNPIRRQASINSSYGAARSINTAITDYTNM------ 159 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + LD+++V+ VS SM+ KI + ++ +++ + Sbjct: 160 --GTESPAPSIHIPLDLVVVIPVSSSMQGL------KITLLRDALRFLVQSLGPRD---- 207 Query: 218 VVQSGLVTFSN-----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + GLVTF + I WG + ++ + G + ++ A N D Sbjct: 208 --RMGLVTFGSSSGGVPIVGMTTKSWG--GWPKILESIRPVGQKSLRADVVEGA-NVAMD 262 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + R+ N I+ ++D ST + + + A+ +++ G+ Sbjct: 263 LLMQRKSSNPIST-----ILLISD---TSTSDPDSVDFVVSRAEAAKVNIHSFGLG 310 >gi|159038794|ref|YP_001538047.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157917629|gb|ABV99056.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 316 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 65/208 (31%), Gaps = 32/208 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+ +DVS SM++ D S ++ A ++ +EE+ + GLV+F+ Sbjct: 89 VMLAIDVSLSMQAD-DVSPNRLAAAQEAAQQFVEELPASYN------LGLVSFAKAANVL 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L ST + + I + IV Sbjct: 142 VPPTKDRQAVVTAVDGLV-LAESTATGEAVFTCLEAIRSVPADGAAGIPPAR-----IVL 195 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-----EFLR----------- 337 ++DG S + +++ A + I + R Sbjct: 196 LSDGYRTSGRSVEEAAAAAQAANVAVS---TIAFGTDGGQVDIGGQLQRVPVDRFALAEL 252 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDI 365 A + FY + + + + +G I Sbjct: 253 AATTEGHFYEAASVNELKQVYQDMGSSI 280 >gi|15921064|ref|NP_376733.1| hypothetical protein ST0830 [Sulfolobus tokodaii str. 7] gi|15621848|dbj|BAB65842.1| 381aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 381 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 57/166 (34%), Gaps = 28/166 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +I+LD S SM KI+ A + +L IP N + +TFS+ + Sbjct: 44 IILLDTSGSMAGI------KIETAKQGALQLLN---KIPPGNKIT---FITFSSTVNTLI 91 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + I ++ G +T L A ++H I+ + Sbjct: 92 EFADTSGSVGETISSVTAQG-NTVLYTALSTAIQI------AKKHGIPG------YIILL 138 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 TDG + + + Y G V + GI + + L+ A Sbjct: 139 TDG---NPTDLTNTDAYEKLQFPDGFKVISFGIGDDYNEQLLKVLA 181 >gi|99078203|ref|YP_611461.1| von Willebrand factor, type A [Ruegeria sp. TM1040] gi|99035341|gb|ABF62199.1| von Willebrand factor type A [Ruegeria sp. TM1040] Length = 477 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 75/204 (36%), Gaps = 43/204 (21%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK------ 229 +VLD S SM D + KI +A + +L+ + N + GL+ + ++ Sbjct: 28 LVLDASGSMWGQIDG-VAKITIAQDVMQHLLKTL----PENQ--ELGLMAYGHRRKGDCN 80 Query: 230 -IEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 IE+ G + + + +S G + S ++ A + + + Sbjct: 81 DIEQLIAPAAGSRQAISQAVTQISPKGKTPLSAAVMQAA-DALRSSEE------------ 127 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCN-----EAKKRGAIVYAIGIRVIR--SHEFLRACA 340 K ++ ++DGE + L C EA+ ++AIG + + L+ A Sbjct: 128 KATVILISDGE------ETCGLDPCAVGAELEARGVDFTLHAIGFGIADDAARAQLQCLA 181 Query: 341 --SPNSFYLVENPHSMYDAFSHIG 362 + + + + A + + Sbjct: 182 ENTGGFYRDASSASELTAALAQVA 205 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 50/368 (13%), Positives = 117/368 (31%), Gaps = 59/368 (16%) Query: 35 GMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM---------NEGNGNNRKKLKGGDILCR 85 G + E + + ++ E N + Sbjct: 48 GDVAEEDNEMQVDGIMQESATNMATADCESGFYDDYLQCPDTEEWNTNEYSYTEENPWMN 107 Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD--IVVVPQNEGYSISAISRYKIPLKFCT 143 ++ + +F ++ + I +D +V + + Y Y PL Sbjct: 108 VQTSPLSTFAADVDTASYTQIRSAIENGYDIDPSMVRIEEMLNY-----FHYDYPLPKDD 162 Query: 144 FI----------PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 PW +++ + + ++ ++ ++ +++ ++DVS SM Sbjct: 163 EKFAVYTEYMDCPWNEDTK-LALVSMNTQAIDFKSAPASNLVFLIDVSGSMF-----DDN 216 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLS 251 K+ + +++ + ++ + +VT++ E G + I+ L Sbjct: 217 KLPLVQQALTML------AENLTEKDRVSIVTYAGSDEVVLQGVSGDDYHEISSAIEGLE 270 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLY 310 +G STN + G++ AY + + ++ TDG+ N+ + Q Sbjct: 271 AYG-STNGSAGIETAY-----ALAKKYFIKGGNNR----VILCTDGDLNVGLTSEGQLEK 320 Query: 311 YCNEAKKRGAIVYAIGIRVIRSH----EFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 E K G + G+ E L + N Y SM++A + ++ Sbjct: 321 LITEKKDSGVFLSTFGVGYGNYKDNKLELLADKGNGNYAY----IDSMFEAKKALVDELG 376 Query: 367 TKRIWYDK 374 + K Sbjct: 377 ANMVTVAK 384 >gi|291242482|ref|XP_002741137.1| PREDICTED: chloride channel calcium activated 3-like [Saccoglossus kowalevskii] Length = 975 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM +I +S +E I D + V G +++ + Sbjct: 342 VVLVLDTSGSM------DGDRIQRLYQSATYFIE--NRIEDGSFVGIVGFSSYAVILASM 393 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L++G + K + +T+ G++ A + D + + ++ Sbjct: 394 TELKYGYQRSEVSSKVPQEADGATSIGGGVRLALQVLQDGNVTSEGAS---------LLL 444 Query: 294 MTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY----LV 348 +TDG EN + E G V I + ++ Y Sbjct: 445 ITDGVENTYP----FLMNVMQEVYDSGVRVDTIAFTEAAQSTLQQLSSNTGGLYNYVPED 500 Query: 349 ENPHSMYDAFSHIGKD 364 +N + D+ + + Sbjct: 501 DNSTAFIDSLAATINE 516 >gi|226326039|ref|ZP_03801557.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] gi|225205581|gb|EEG87935.1| hypothetical protein COPCOM_03857 [Coprococcus comes ATCC 27758] Length = 823 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 17/198 (8%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIP-LKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + ++ +V ++ G + S P L +I + + + SS K + + Sbjct: 388 TTTKNIYLVYTAKSTGSTGSGTIEDTAPQLSHQKYIKKKDGNNYDLTLNVSSKKGTTSSK 447 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + D+++++D S SM S+ K + ++N +++ + V+ + LVTF Sbjct: 448 KKFDIVLIMDTSTSM-----SNNNKWRNSKTAVNKLIDTLSSQTTVDVNYR--LVTFGTT 500 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + G + Y K TN GL + A+ +K Sbjct: 501 AQIQTNWTTGETVKSTLSNYSIKEDQGTNYEDGLVKTKEALSSGT---------RADAEK 551 Query: 290 IIVFMTDGENLSTKEDQQ 307 IIVF+TDG+ K Sbjct: 552 IIVFLTDGQPTFYKSGTS 569 >gi|15805929|ref|NP_294628.1| hypothetical protein DR_0904 [Deinococcus radiodurans R1] gi|6458631|gb|AAF10487.1|AE001943_7 hypothetical protein DR_0904 [Deinococcus radiodurans R1] Length = 369 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 27/135 (20%), Positives = 46/135 (34%), Gaps = 26/135 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EE 232 + VLD S SM D D S+ ++ V Q L+TF + + E Sbjct: 112 VFVLDTSGSMRGIGDGKADIFDRVKDSVEQYVDSVHPD-------QVKLITFDSGVRGER 164 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII- 291 + L + L+ + L G +T + A + + + + Sbjct: 165 SYDLPAQATALRSDLDGLKANGNNTYLYRSMHAALSPLAAS--------------DRYVT 210 Query: 292 -VF-MTDGENLSTKE 304 VF MTDG + + Sbjct: 211 SVFVMTDGIDNDPQP 225 >gi|159477953|ref|XP_001697073.1| collagen-related protein [Chlamydomonas reinhardtii] gi|158274985|gb|EDP00765.1| collagen-related protein [Chlamydomonas reinhardtii] Length = 387 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 60/196 (30%), Gaps = 23/196 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + +++++ ++D S S+ + + + + ++ Sbjct: 101 PEELRASVEKLATRLVGDVQQVNVVFLVDGSGSV------NAEEFEAMLGFCVDASNQLA 154 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 +Q +V FSN + L L++ + + + TN L A Q Sbjct: 155 ESVPN---LQVAVVQFSNDVRVEVGLAPLDSEALRKTTREMVRMNGGTNVAVALTKA-GQ 210 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + + +V +TDG S + ++ ++ V V Sbjct: 211 LLKRDAA--------PDAMRHVVLLTDGRVDSYQAH-EARQVADQLADEQRHVSLFAYGV 261 Query: 330 IRS---HEFLRACASP 342 R E L P Sbjct: 262 GRGVDRAELLHIIGGP 277 >gi|2342594|emb|CAA73785.1| matrilin-3 precursor [Homo sapiens] Length = 310 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 28/91 (30%), Gaps = 12/91 (13%) Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 +N K+ + +TDG Q A+ G +YA+G+ L Sbjct: 1 EAGAREPSSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMAS-L 54 Query: 337 RACAS---PNSFYLVEN---PHSMYDAFSHI 361 + AS + VE + F Sbjct: 55 KMMASEPLEEHVFYVETYGVIEKLSSRFQET 85 >gi|196230799|ref|ZP_03129660.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] gi|196225140|gb|EDY19649.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] Length = 1545 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 79/236 (33%), Gaps = 31/236 (13%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVM--PITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 PL F T Y +++ + + Q L+++++LD S SME Sbjct: 1158 EPSPGAPLAFVTERARYPFAQNRDLLRFAVKTAAAGRQPGRPLNIVLLLDRSGSME---- 1213 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKI 247 ++++ ++++ + + ++ + +VTF+ + V + ++ Sbjct: 1214 -RADRVNIVREALSVLAKHLQPQD------KLSIVTFARTPHLWADAVAGDKVHDVIARV 1266 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--STKED 305 ++ G TN L AY H + N ++ TDG D Sbjct: 1267 NEITPEG-GTNLEAALDLAYETAH------HHFAVDSTNR---VILFTDGAANLGDVNPD 1316 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACA--SPNSFYLVENPHSMYDAF 358 + +K+G + GI + + L + + + P F Sbjct: 1317 ALTKKV-EAQRKQGIALDCFGIGWEGYNDDLLEQLTRNADGRYGFINTPEDAAANF 1371 >gi|119510964|ref|ZP_01630086.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] gi|119464403|gb|EAW45318.1| hypothetical protein N9414_01285 [Nodularia spumigena CCY9414] Length = 412 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 70/230 (30%), Gaps = 21/230 (9%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--ARLDM 174 + V I + + P+ + + + + + + Sbjct: 31 NPTVNDDRVTIRIKVKNEAERPVMGLQDTDFKLLVDQKEVRFATRDWKSPEESVPPPAWI 90 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN------ 228 +++LD S SM KI AI ++ ++ + Q +V F Sbjct: 91 LVMLDFSGSMNQLDSGGTKKITGAINAVRQFIQVLS---QRGGNTQISIVPFGESGSRCQ 147 Query: 229 --KIEEFFL---LEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + + L LQ + YL+K STN L A + + R + Sbjct: 148 GYPVNKETLDKFFAANDFKLQNHLNYLAKSTPCASTNLYEPLNQAVRFLATSEEERFYVA 207 Query: 282 TEDANYK--KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIR 328 + I+ ++DG + E Q N ++ V+ +G Sbjct: 208 ENSDQPQPRLSIILLSDGYHNKANEQQDFTALTNLFRRNPLITVHTLGYG 257 >gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42] gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 780 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 42/293 (14%), Positives = 103/293 (35%), Gaps = 43/293 (14%) Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 ++ + +K + + + + D R+ SL VP ++ + ++ + Sbjct: 243 ARINPVSLTVNLKAGFPLGEVKSSFHDVDIGQDGDQARTISLKGDAVPADKDFELTWKAA 302 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 F +++ +T ++ ++ +++ V+D S SM S Sbjct: 303 PGKTPSAGLFREAIDGKTYLLAFVTPPTAPDAAAPSKREVVFVIDNSGSM------SGQS 356 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKY 249 I+ A +S+ + + + ++ F + + ++F ++ Sbjct: 357 IEQARQSLALAISRLNPND------RFNVIRFDDTMTDYFKGLVAATPDNREKAVAYVRS 410 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 L+ G + + A QG + +VF+TDG ++Q L Sbjct: 411 LTADGGTE-----MLPALEDALRNQGPVASGAL------RQVVFLTDG----AIGNEQQL 455 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + A + A V+ +GI ++PN++++ + F+ IG Sbjct: 456 FQEITANRGDARVFTVGIG-----------SAPNTYFMTKAAEVGRGTFTQIG 497 >gi|315634664|ref|ZP_07889948.1| PpkA protein [Aggregatibacter segnis ATCC 33393] gi|315476612|gb|EFU67360.1| PpkA protein [Aggregatibacter segnis ATCC 33393] Length = 657 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 70/207 (33%), Gaps = 41/207 (19%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 ++ + P + T ++ V+D S SM+ + + + I + I Sbjct: 206 QQKTQSKIDPNANKSDPQVLTGFSAAVVFVVDASISMDPYINRTRDVIKQVSEQIEK--- 262 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-------------HLQRKIK----YL 250 ++ V+ GLV F + + LE+ LQ+ ++ + Sbjct: 263 -----ENLGKQVKFGLVGFRSSTKAVPGLEYVSKMFVDPSTVKDTQDFLQKAVELKQAKV 317 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD-----GENLSTKED 305 S S +S G+ A N+I + IV +TD G N + Sbjct: 318 SSKEFSEDSFAGISQALNEINWNNFGG-----------RYIVLVTDAGALDGNNPLSSTG 366 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + EA+ RG +Y + ++ Sbjct: 367 MDAKQLRLEAQHRGVAIYTLHLKTPAG 393 >gi|302872519|ref|YP_003841155.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor obsidiansis OB47] gi|302575378|gb|ADL43169.1| Ig domain protein group 2 domain protein [Caldicellulosiruptor obsidiansis OB47] Length = 1789 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIV-----MPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + Y K T + + +++ + IT + D++ V+D + SM Sbjct: 798 KATYNESSKTITVETNHFSVYYLINLKKYLDITGLKSGTVSPSGQADIVFVIDTTGSMSD 857 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +ID ++IN ++++K V GLVT+ + + G Sbjct: 858 -------EIDAVKQNINNFVDKLKTKDIS---VNLGLVTYKDITCDGPNSTVGHGFFSSA 907 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + G G + D N K IV +TD E++ Sbjct: 908 DDFKNALGSIKVDGGG--DTPETLIDTLET-ARLLGFRENSTKFIVVLTDA--NYKLENR 962 Query: 307 QSLYYCNE----AKKRGAIVYAIG 326 + +E K IV + Sbjct: 963 FGIKSADEIIERLKSDNIIVSVVS 986 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 25/145 (17%), Positives = 56/145 (38%), Gaps = 25/145 (17%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-------EFFLL 236 + F + +++D K N +++ K + +V +S+K + FL Sbjct: 714 VYEFNEKDRSRLDSVKKVANDFVDKFKNDENTE----IAIVRYSSKANIVLDGSNKIFLN 769 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 ++++I L G TN G++ +Y+ + + D + +K ++ MTD Sbjct: 770 GKDNEIIKKRINSLKADG-GTNIGDGIRKSYSIL----------DKCDKDSEKYMILMTD 818 Query: 297 GENLSTKEDQQSLYY---CNEAKKR 318 G + ++ C +K Sbjct: 819 GVPTAYTCYANTIKASNNCKYSKDN 843 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 71/220 (32%), Gaps = 45/220 (20%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + + SS + + + + ++ V+ + N+ GLV+F + ++ ++ Sbjct: 164 KCYRQSSYNEKNRLQHAKESAIKFVQKFENDKNIS-IGLVSFDTTANSQKDITSKLNEVE 222 Query: 245 RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------ 297 I L +TN GLK A + + + K ++ M+DG Sbjct: 223 DSINSLKVADNGATNIEAGLKSAQQLL----------KKGNKDADKYVILMSDGFPTAFD 272 Query: 298 ----------------ENLSTK---------EDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +N + S+ N KK G + IG + Sbjct: 273 YAGEKVEKNFNYHEIQDNTFINFGYYDYSGYAMKHSINQANSLKKDGINSFIIGFSEGAN 332 Query: 333 HEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 E L A+ + +N ++ A+ + + I Sbjct: 333 SEKLNNIAKAAGGEYEEAKNTDTLNGAYDKLETKVKAPLI 372 >gi|297714304|ref|XP_002833596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like, partial [Pongo abelii] Length = 384 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 76/198 (38%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNNVVQSGLVTFSN 228 ++ V+DVS SM K+ ++++ +L++++ D N V++ Sbjct: 22 ILFVIDVSGSMWG------VKMKQTVEAMKTILDDLRAEDHFSVIDFNQNVRT------W 69 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + V+ +R I+ + G TN L A + + + + + Sbjct: 70 RNDLISATKTQVADAKRYIEKIQPSG-GTNINEALLRAIFILNEASNLGL-LDPNSVSL- 126 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN-- 343 I+ ++DG+ + + K+ ++++G+ ++FL+ +S N Sbjct: 127 --IILVSDGDPT--VGELKLSKIQKNVKENIQDNISLFSLGMGFDVDYDFLKRLSSENRG 182 Query: 344 ---SFYLVENPHSMYDAF 358 Y ++ S F Sbjct: 183 IAQRIYGNQDTSSQLKKF 200 >gi|308472943|ref|XP_003098698.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] gi|308268298|gb|EFP12251.1| hypothetical protein CRE_04174 [Caenorhabditis remanei] Length = 416 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 65/199 (32%), Gaps = 20/199 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S M + + + I S+ + + + GLVT+++ + Sbjct: 39 LDVIAVVDNSHGMT---NGGVQSVAANIASVFSSGTRIGSNSTEPRTTRVGLVTYNSAAK 95 Query: 232 EFFLLEWGV---SHLQRKIKYLSKFGVSTNS--TPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L K LS +T+S GL A + Q NT + Sbjct: 96 LDADLNKFQDLDGLYNGVFKDLSDVVDTTDSFLATGLNAAEELL-----QSQSLNTTRDH 150 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPN 343 YKK+I+ + + N K G ++ + LR ASP Sbjct: 151 YKKVIIVY--ASEYKGSGELDPVPVANRLKGSGVVIVTVAYDQGGDEGLLRDLANIASPG 208 Query: 344 SFY--LVENPHSMYDAFSH 360 Y N ++ Sbjct: 209 FAYSNAPNNAGNLVGQIQD 227 >gi|254412101|ref|ZP_05025876.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196181067|gb|EDX76056.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 570 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 60/170 (35%), Gaps = 33/170 (19%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++V+D S SM+ K+ ++ + + + L+ F+ Sbjct: 385 AKKPSQVVVVVDTSGSMQGN------KLPAVQNTLQNYINSLGSKD------KIALIDFN 432 Query: 228 NKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++I + L+ + G + I L +G T YA N + D Sbjct: 433 DEISQPVLVEGTDAGRNRGLEFISGLQAYG-GTKLYDAALYARNWLQDN---------PR 482 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG------AIVYAIGIR 328 + ++ +TDGE+ ++ + L E ++ G + IG Sbjct: 483 PDAINAVLILTDGEDSGSQINLNQLE--QELQQSGFNSDQRIAFFTIGYG 530 >gi|162420405|ref|YP_001605638.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165928280|ref|ZP_02224112.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165937836|ref|ZP_02226397.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009072|ref|ZP_02229970.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167398987|ref|ZP_02304511.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167421939|ref|ZP_02313692.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426433|ref|ZP_02318186.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487811|ref|ZP_06204885.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|162353220|gb|ABX87168.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165914248|gb|EDR32864.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|165919722|gb|EDR37055.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992411|gb|EDR44712.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166960076|gb|EDR56097.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051491|gb|EDR62899.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167054656|gb|EDR64463.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270336315|gb|EFA47092.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 222 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 62/183 (33%), Gaps = 14/183 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + RL + +++D S SM I I AM+ ++ P V Sbjct: 2 SLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDPYALESV 55 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++T+ N+ E+ L + + T + L+ I + Q Sbjct: 56 HLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCVDRDIQR 110 Query: 280 CNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + + + +VF MTD +++ K+ + A + HE L+ Sbjct: 111 SDGDQKGDWRPLVFLMTDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAKHEHLKQ 168 Query: 339 CAS 341 S Sbjct: 169 LTS 171 >gi|23500008|ref|NP_699448.1| norD protein [Brucella suis 1330] gi|81751586|sp|Q8FX38|NORD_BRUSU RecName: Full=Protein norD gi|23463592|gb|AAN33453.1| norD protein [Brucella suis 1330] Length = 633 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 84/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S EA+ +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEARAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 L A + LV N + A I + Sbjct: 600 AHLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|4996347|dbj|BAA78416.1| complement B/C2-A2 [Cyprinus carpio] Length = 750 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/230 (12%), Positives = 79/230 (34%), Gaps = 31/230 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD-------- 214 K+ +LD+ I +D S S++ + A +I +++++ P Sbjct: 250 KITMNQGGKLDIYIAVDASESIDK------KDFENAKTTIKMLIDKISYYPVSPNYEILM 303 Query: 215 -VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +V + + + + + ++ + + + V + + AY I + Sbjct: 304 FATDVTRITSM----RDFKNNEDARNLMNIFKDLDDFNYERVGVKTGTNIAKAYTAILES 359 Query: 274 QGMRQHCNTEDANYKKIIVFM-TDGENL---STKEDQQ------SLYYCNEAKKRGAIVY 323 + + N N + IV + TDG+ + + + + + N K V+ Sbjct: 360 IKLEELNNAAIFNETQHIVILFTDGQANMGGNPRPKVEQIKHLVTKNHPNREKNLDLYVF 419 Query: 324 AIG--IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 +G + + + F+ +++ + F + + + + Sbjct: 420 GVGDDVNQEDINGLVSQRDQEKYFFKLKDLTEVQKMFDDMIDESTSVGLC 469 >gi|326673940|ref|XP_001921532.3| PREDICTED: complement factor B [Danio rerio] Length = 862 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 85/226 (37%), Gaps = 35/226 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + RL++ I++D S S+ S A K+I ++ ++ +VN V+++ Sbjct: 367 EEGRLNVFILMDTSGSI------SQDTFQAAKKAIIELVRKLDS-YEVNMKFDI--VSYA 417 Query: 228 NKI-EEFFLLEWGVSHLQRKIKYLSKFGV-------STNSTPGLKYAYNQIFDMQGMRQH 279 ++ E + + + ++ LS+F T+ + L+ Y Q+ ++ ++ Sbjct: 418 SEPREIVSITSFNSHDVDFVLRKLSEFSDEVHENRRGTDLSKALERVYGQLALLRENKKS 477 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY----------CNEAKKRGAIVYAIGIR 328 E N I++ TDG N+ + ++ VY + Sbjct: 478 HFNETQN---ILIIATDGHSNMGPNPQIMLNKIRSLLGYKPSSVDHTQEELLDVYVFAVG 534 Query: 329 VIRSHEFLRACASP----NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + L + AS +++++ + F+ + D + Sbjct: 535 KDVNRKDLTSFASSKKGEKHVFVLQDYQQLGYVFNQMISDSAVTKC 580 >gi|256376610|ref|YP_003100270.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255920913|gb|ACU36424.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 559 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 26/195 (13%) Query: 175 MIVLDVSRSMESFFDSSITK-IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + VLDVS SME + + + + S ++ + VV L+ F+ Sbjct: 379 VYVLDVSGSMEGDRMAQLKRALSRLTGSDESLTGQYCRFRSREEVV---LLPFNQAPLAP 435 Query: 234 FLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 V + ++ + G T L+ AY Sbjct: 436 QEFSVDVGAPRETLERIRGAVEGLVAGGDTAVYDSLERAY-----------GVVGSSPER 484 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 +V MTDGEN + + Y A+ + V+ I + + + + Sbjct: 485 FTSVVLMTDGEN---RVGRTFAEYREFARGKAVPVFPIVFGEASRAKMGEIAEITGGAVW 541 Query: 347 LVENPHSMYDAFSHI 361 N S+ AF I Sbjct: 542 DA-NSESLERAFCQI 555 >gi|218886590|ref|YP_002435911.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757544|gb|ACL08443.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 2478 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 43/266 (16%), Positives = 89/266 (33%), Gaps = 46/266 (17%) Query: 100 DNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP-- 157 D G N I I ++ + + P + +++ + + + S +V+ Sbjct: 1613 DGGTGNLIGSIDGHPAIILTLTPDSGSHAVGETVHVNVEVTLVDPLLHADGSTDVVVSGI 1672 Query: 158 -----------ITSSVKVNSQTDARL--------------DMMIVLDVSRSMESFFDSSI 192 ++ +V VN + +++++LDVS SM + + Sbjct: 1673 HVSGTDGVSPAVSGTVSVNVDDASPTAVNDVHDYQGGITSNVVVMLDVSGSMNDDANGNA 1732 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 D + + ++ D V LV F + + G + +Q+ + L Sbjct: 1733 PGTDSRLTMAIDAINQLLHAYDDLGAVNVKLVWFDDSAQTHTGWLMGTTAVQQALTILEG 1792 Query: 253 F---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV-FMTDGENLST------ 302 G +TN + +Q+ T K + F++DGE Sbjct: 1793 NLNGGGATNYDAAINLVMSQL---------GTTGTPPADKTVAYFLSDGEPNRPDGSEGI 1843 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIR 328 +Q+ + A +VYA+GI Sbjct: 1844 SGSEQTTWETFLAANNFDMVYALGIG 1869 >gi|153873859|ref|ZP_02002297.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152069676|gb|EDN67702.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 367 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 66/169 (39%), Gaps = 20/169 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ ++DVS SM S + + ++ + ++ V+ VV +G Sbjct: 2 PPANLVFLVDVSGSMRSNHKLA-----LLKSALKLLSNQLTEKDKVSLVVYAGAAG---- 52 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + ++ L+ G ST+ + G+ AYN + + ++ N Sbjct: 53 VVLEPTPGHQSVKINGALERLTA-GGSTHGSAGIHLAYN-LAEQAFIKNGINR------- 103 Query: 290 IIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I+ TDG+ N+ T + + E +K G + +G ++ L Sbjct: 104 -ILLATDGDFNVGTVDFEALKNLVEEKRKSGISLTTLGFGRGNYNDQLM 151 >gi|141795362|gb|AAI39710.1| LOC563828 protein [Danio rerio] Length = 835 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 85/226 (37%), Gaps = 35/226 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + RL++ I++D S S+ S A K+I ++ ++ +VN V+++ Sbjct: 340 EEGRLNVFILMDTSGSI------SQDTFQAAKKAIIELVRKLDS-YEVNMKFDI--VSYA 390 Query: 228 NKI-EEFFLLEWGVSHLQRKIKYLSKFGV-------STNSTPGLKYAYNQIFDMQGMRQH 279 ++ E + + + ++ LS+F T+ + L+ Y Q+ ++ ++ Sbjct: 391 SEPREIVSITSFNSHDVDFVLRKLSEFSDEVHENRRGTDLSKALERVYGQLALLRENKKS 450 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY----------CNEAKKRGAIVYAIGIR 328 E N I++ TDG N+ + ++ VY + Sbjct: 451 HFNETQN---ILIIATDGHSNMGPNPQIMLNKIRSLLGYKPSSVDHTQEELLDVYVFAVG 507 Query: 329 VIRSHEFLRACASP----NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + + L + AS +++++ + F+ + D + Sbjct: 508 KDVNRKDLTSFASSKKGEKHVFVLQDYQQLGYVFNQMISDSAVTKC 553 >gi|115749084|ref|XP_001197592.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115959785|ref|XP_001193076.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 966 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 56/176 (31%), Gaps = 22/176 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 TS + Q L +++VLD S SM + D I+ ++ + N+ Sbjct: 301 TSPNFIVVQPSGSLRIVLVLDTSGSM------DGERFDKMIRGAKNFIQSIV---PNNSY 351 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMR 277 V + + ++ + V + L T G+ A Sbjct: 352 VAIVEFNYESIVDSYMTELTSVISRKDLASLLPTLADGATCIGCGIVTAIQVAQYNDMDS 411 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + ++ ++DGE ++ ++ + G IV++I + Sbjct: 412 RGV---------YLILLSDGEENHGTPIADTM---DDIEGSGVIVHSIAFYEADTQ 455 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 54/145 (37%), Gaps = 30/145 (20%) Query: 171 RLDMMIVLDVSRSMESFF----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 D+++V+D+S SM S D+ ++ +D+ + ++E + N + Sbjct: 69 PCDIVLVIDISGSMNSAAPIPTGERGGEDTGLSILDLTKHAAKTIIETL------NEKDR 122 Query: 221 SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +VTF ++ F L+ S + I L STN G+K N + Sbjct: 123 LAVVTFCTEVNVAFELDSMNKENKSTVLGAIDKL-YGKSSTNLWHGMKKGLNILATN--- 178 Query: 277 RQHCNTEDANYKKIIVFMTDGENLS 301 + ++ +TDG Sbjct: 179 ------PAQGKIQSLLVLTDGAPNH 197 >gi|331239018|ref|XP_003332163.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309311153|gb|EFP87744.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 501 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 68/203 (33%), Gaps = 24/203 (11%) Query: 157 PITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 P+ + + D LD+ +LD + SM S+ ++ I++ I + +L Sbjct: 12 PVPTLEDRLANMDFSEKMLDLCFILDTTGSMGSYITAATQNIELICDEI---INSERLAS 68 Query: 214 DVNNVVQSGLVTFSNKIEEF-------FLLEWGVSHLQRKIKYLSKFGVS---TNSTPGL 263 ++ GL+ + + + F ++ +K L G T + Sbjct: 69 P--ECLRIGLIAYRDHPPQDMSYVTLKFAFTSNPKAVKENLKTLWASGGGDGPEAVTAAM 126 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A + Q + DA I + DG + L + + G ++ Sbjct: 127 HEALTLDWRPQASKMAVLITDAPPHGIGEY-GDGFSRGDPSGHDPLQLARKMAQTGISLF 185 Query: 324 AIGIRVI-----RSHEFLRACAS 341 + S++F RA AS Sbjct: 186 VVACEPAFSGYSYSNDFFRAIAS 208 >gi|301168170|emb|CBW27759.1| hypothetical protein BMS_2997 [Bacteriovorax marinus SJ] Length = 605 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 49/277 (17%), Positives = 111/277 (40%), Gaps = 30/277 (10%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---FVNDIDDIVRSTSLDIV 119 T +++ ++ + + T+++ N N + + ++ + Sbjct: 139 TNLLSREKYDHIEASSYSRVSDNPLTTFSIDVDTASYANVRRFINNGVRPNKGAVRVEEL 198 Query: 120 VVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 + + Y+ S + I LK T PW + + + + + + + ++ +++ +L Sbjct: 199 INYFSYDYTFS-NPEHPIDLKLDLTNSPWNKDRKVVRVALKAD-TPKTAINSSKNLVFLL 256 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 DVS SM SS K+ + +SI +L +K V+ VV +G S + E + Sbjct: 257 DVSGSM-----SSPNKLPLLKESIKLLLRNLKGDDKVSIVVYAGS---SGVVLEPTSVSD 308 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 V + + + L + G STN G+ AY ++ + + ++ N ++ TDG+ Sbjct: 309 KV-KIHKALNQL-QSGGSTNGGAGIVAAY-KLAEEEFIKNGVNR--------VILATDGD 357 Query: 299 NL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +T + +AKK +Y + + + Sbjct: 358 FNVGTTSRYELVDLIQEKAKKN---IYLTVLGLGMGN 391 >gi|237747941|ref|ZP_04578421.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229379303|gb|EEO29394.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 1843 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/190 (19%), Positives = 69/190 (36%), Gaps = 17/190 (8%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD--SSITKIDMAIKSINA 204 + + +V + V + D ++ +V+DVS SM+ D S ++++ + +++ Sbjct: 1160 GHDVNDVLVGDFAGNTTVGAVQDVVNNIYLVVDVSASMKDGMDPGSDVSRLKVVADALDK 1219 Query: 205 MLEEVKLIP-DVNNVVQSGLVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFG-VST 257 + E++ + D + V L FS + +W K L F T Sbjct: 1220 LFEQMNVADADPHTTVNVTLSIFSGGGDTPKEYNLSVSDWAGKTADDIYKMLDDFKHGDT 1279 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N L A D Y K ++F+TD ++ Y ++ Sbjct: 1280 NFELPLGNA-------SEWADSQAKGDGIYNK-VIFVTDANGTEILNTGKAQDYVDKILG 1331 Query: 318 RGAIVYAIGI 327 G V AI I Sbjct: 1332 SGVDVEAIAI 1341 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 73/197 (37%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L ++ + + + V G+V F + Sbjct: 20 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL--LQTVENGSWV---GMVHFDSTATIV 69 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 70 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 120 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + + S + Sbjct: 121 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 171 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 172 YVSDEAQNNGLIDAFGA 188 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 59/162 (36%), Gaps = 18/162 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+M+ +D+S SM + D + ++ A + + GLV F + Sbjct: 81 GRGVDIMLAVDLSESM-AALDMPLPDRTVSRLEAVAQAAARFAADRPGD--RIGLVAFGS 137 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + L + + LS G T + A Q+ G+ Sbjct: 138 RAYVVLPPTDDRAALTQALSRLSVGAAGRRTAMGDAVGLAVKQLDRAPGL---------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + ++VF G+ LS + + + A RG V+ +G+ Sbjct: 188 ARLVVVF---GDGLSNAGEVRPVEAAKAAAARGIAVFTVGVG 226 >gi|297666856|ref|XP_002811720.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Pongo abelii] Length = 968 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 369 IYILIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWREQLA 421 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 422 EVNEDSLEQAQSWIRDMK-IGSSTNTLSALKTAF------------ADKETQA----IYL 464 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +Y I ++ FL+ A+ Sbjct: 465 LTDGRPDQPPETVI-----DQVKLFQEIPIYTISFNYNDEIANRFLKEVAA 510 >gi|325674365|ref|ZP_08154054.1| hypothetical protein HMPREF0724_11836 [Rhodococcus equi ATCC 33707] gi|325555045|gb|EGD24718.1| hypothetical protein HMPREF0724_11836 [Rhodococcus equi ATCC 33707] Length = 609 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/171 (20%), Positives = 64/171 (37%), Gaps = 18/171 (10%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS----GLVTFSNKIE 231 I+LDVS SM + D T++ + +++A + + DV V S G + K+ Sbjct: 424 ILLDVSGSMG-YTDGDGTRLSNTVDALSARIAALPTSSDVGLWVYSRGLDGAKPYLVKV- 481 Query: 232 EFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L G ++ ++ L +T++ + A++ D + + Sbjct: 482 PTGPLSDGDRSQRIEAALRSLRP-ATATSTYASVIAAHDSAVDGFVDGRPNS-------- 532 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG N T Q L V + I E LR+ A Sbjct: 533 -VLLVTDGPNDDTSVGTQKLMQSLTGAAHPVRVDVVSIGENSDQETLRSMA 582 >gi|167041680|gb|ABZ06425.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_009L19] Length = 317 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 84/241 (34%), Gaps = 53/241 (21%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + ++ D + +++VLD S S+ + + ++ MA +++ L Sbjct: 67 LDNDVPQQIESVLLEDVPITLLLVLDTSGSV---VGAPLAQLLMAAEAVAEALRPDD--- 120 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + GLVTFS+ + + L ++ + G + Y+ F Sbjct: 121 ------RVGLVTFSHNVRVVVEPPSLPASLPDALRRVRATGGTAL--------YDATFAA 166 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIR 331 +R+ + +++ +DG+ T N A++ +VYA+ + Sbjct: 167 FALRE-----RTVGRTLMLVFSDGD--DTTSWLDPRDVLNTAQRSDVVVYAVNLAGVAPD 219 Query: 332 S----------------------HEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 S ++L A + S ++ ++ + AF+ + + Sbjct: 220 SWQERQGRRSARRWFATEPHLFRGQYLPVLAEETGGSVFVAQDTGRLRAAFARVVDEFRR 279 Query: 368 K 368 + Sbjct: 280 R 280 >gi|300772297|ref|ZP_07082167.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] gi|300760600|gb|EFK57426.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] Length = 256 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 42/131 (32%), Gaps = 10/131 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +LD S SM I +++ M+ ++ + ++TF ++ Sbjct: 48 RLPVYFLLDTSGSMHG------EPIQALNNALSGMINNLRTDAQAAETLWISMITFDREV 101 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +E L S +I T + L+ Y+ D Sbjct: 102 KEIVPLTALESFQLPEISC--PESGPTFTGKALEILYDTATREVIKGSPEQKGDWRPLLF 159 Query: 291 IVFMTDGENLS 301 I TDG+ Sbjct: 160 I--FTDGKPSD 168 >gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/260 (12%), Positives = 85/260 (32%), Gaps = 40/260 (15%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 V+ +++ S + + +I + K K + + + Sbjct: 181 SVDGTKNVIDSHNATFETNEAPKKDAIFIETPIKDEDKSIAVSSDGYIAISTNPSFSGKI 240 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + NS+ V+D S SM S +I A+K + ++ + L + Sbjct: 241 ESNSE------FYFVVDCSGSM------SGARIINAVKCMRLFIQSLPLG------CRFS 282 Query: 223 LVTFSNKIEEF-FLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ F E ++ ++ + + ++ T+ L++ Sbjct: 283 IIKFGTSFETVLQPCDYSDENVDKALNLLKSVNAKMGGTDILSPLQH------------I 330 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + K I +TDGE + + + + +++IG+ ++ Sbjct: 331 AGLKPQPGFVKQIFLLTDGEVNNP---DITCATALKNRNEN-RIFSIGLGSGADPGLIKG 386 Query: 339 CA--SPNSFYLVENPHSMYD 356 A S ++ ++ + +M + Sbjct: 387 LAKKSGGNYIMIADEDNMNE 406 >gi|146280055|ref|YP_001170212.1| hypothetical protein Rsph17025_4055 [Rhodobacter sphaeroides ATCC 17025] gi|145558296|gb|ABP72907.1| hypothetical protein Rsph17025_4055 [Rhodobacter sphaeroides ATCC 17025] Length = 634 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 82/237 (34%), Gaps = 37/237 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 PL + + S + + ++ + + ++ VLD ++SM F I+ Sbjct: 181 HPLNYEPNLLMKVASVPLEDELPAATREGPAGEFDAGVVFVLDTTKSMGPF-------IE 233 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ------RKIKYL 250 ++++ ++ ++ + G+V F + LE+ L + L Sbjct: 234 RTQRALSRVVNDLNGTETGARI-NFGVVAFRDNTAAVPGLEYRTKLLTPLARRFAQAPVL 292 Query: 251 SKFGVSTN----STPGLKYAYNQIFDMQGMRQHCN---TEDANYK-KIIVFMTDGENLST 302 +TN S+PG + + ++ + + + I+ +TD Sbjct: 293 QSISEATNVAEASSPGFNE--DSLAGVEDAVDSIDWDQDGRDPFDGRYIILVTDAGPKDP 350 Query: 303 KE-DQQS----LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 ++ + +S +A+ RG ++ + ++ + N Y E + Sbjct: 351 RDPNARSQIGPAEIQRDAEGRGIVIMTLHLKTASG--------AGNHEYAAEQYRKL 399 >gi|110798896|ref|YP_694951.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens ATCC 13124] gi|110673543|gb|ABG82530.1| von Willebrand factor type A domain protein [Clostridium perfringens ATCC 13124] Length = 580 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 87/247 (35%), Gaps = 53/247 (21%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++V D S SME +++ L++++ + L+ FS + Sbjct: 76 PVDIIMVADKSGSMEYEMP-------TLKRAMKNFLDDIES--SFGDRANISLIEFSGEN 126 Query: 231 EEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-----FDMQGMR 277 + + + ++ I T+ +K ++I D++ Sbjct: 127 KTYIGRYCDFNGFNCYEKYFIKGSIDDAKVLCDYTSEYSTVKSKIDKITAYGRTDIEAAL 186 Query: 278 QHCNTE----DANYKKIIVFMTDGEN---------LSTKEDQQSLYYCNEAK-------- 316 + + ++N KK +VF TDG D Y K Sbjct: 187 ELVKKKLDDRNSNNKKYVVFFTDGLPIQLLNIETREYPSLDYIEKYIIPHTKEYFYEKGF 246 Query: 317 --KRGAIVYAIGIRVIR--------SHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 K Y+IG+ R + +F+++ + S+++ ++ + + ++ I +I+ Sbjct: 247 LDKNKVNFYSIGLFTGRRFDSEKKIAKDFIKSINNSGSYFITDDSNRLDSVYNDIAMNII 306 Query: 367 TKRIWYD 373 + D Sbjct: 307 NENRIMD 313 >gi|55962354|emb|CAI11851.1| novel protein (zgc:56119) [Danio rerio] gi|56207241|emb|CAI21014.1| novel protein (zgc:56119) [Danio rerio] Length = 946 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 77/207 (37%), Gaps = 30/207 (14%) Query: 165 NSQTDARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNN 217 L ++ V+DVS SM K+ ++++ A+L+++ + D N+ Sbjct: 295 APSDLTPLSKNIVFVIDVSGSMWGL------KMKQTVEAMKAILDDLSIDDYFSIIDFNH 348 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V+ +S + + ++ V ++ I+ + G TN L A + Sbjct: 349 NVRC----WSEDLVQASSIQ--VDEAKKYIQNIKPNG-GTNINEALLRAIQMLIKA---- 397 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIRVIRSHEFL 336 H D +I+ ++DG+ + ++ + + K ++++GI +FL Sbjct: 398 SHHGLIDPRSVSMIILVSDGDPTVGEIKLSTIQKNVKLRMKEEFSLFSLGIGFDVDFDFL 457 Query: 337 RACASPN-----SFYLVENPHSMYDAF 358 A N Y +N F Sbjct: 458 ERIAMDNRGIAQRIYANQNAAEQLKTF 484 >gi|218129580|ref|ZP_03458384.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] gi|217988310|gb|EEC54633.1| hypothetical protein BACEGG_01157 [Bacteroides eggerthii DSM 20697] Length = 212 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + ++ ++ P ++TF++ Sbjct: 3 RLPVYLLLDTSGSMYG------EPIEAVKNGVQTLISTLRGDPYALETAYISIITFNSVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ L + Q +I+ T L ++ D + Sbjct: 57 QQVTPLTELSAFQQPQIEA----SGCTALGEALTLLAQKVDTEIVKTTQEVKGDWKP--L 110 Query: 291 IVFMTDGENLST 302 + MTDGE Sbjct: 111 VFLMTDGEPTDD 122 >gi|260823627|ref|XP_002606182.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] gi|229291521|gb|EEN62192.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] Length = 951 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 75/202 (37%), Gaps = 29/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ D S SM + +++ M+ +V + V +V ++ ++ Sbjct: 82 VIVAADKSGSMSGNPWRQV------QQALLYMIGDVASVNPS---VALDVVIYN---DKA 129 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 LL++ S+ Q + ++ G+++ A++ I D K ++VF Sbjct: 130 SLLQYAGSY-QDAVNRVNADGMTS-----FAAAFSCIKDCLKTEIQ---GTPVSKTVVVF 180 Query: 294 MTDGE---NLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEF---LRACASPNSF 345 MTDG N D+ + + G AIV+ +G + F LR + Sbjct: 181 MTDGADTCNRGADIDRSVRSWKEALARLGHEAIVHVVGFSAQHDYNFLGRLRNTGTTAGL 240 Query: 346 YLVENPHSMYDAFSHIGKDIVT 367 + P +A +++ Sbjct: 241 FRYTEPSDGTEALKAKLQELFD 262 >gi|145634726|ref|ZP_01790434.1| hypothetical protein CGSHiAA_03958 [Haemophilus influenzae PittAA] gi|229844728|ref|ZP_04464867.1| hypothetical protein CGSHi6P18H1_03939 [Haemophilus influenzae 6P18H1] gi|145267892|gb|EDK07888.1| hypothetical protein CGSHiAA_03958 [Haemophilus influenzae PittAA] gi|229812442|gb|EEP48132.1| hypothetical protein CGSHi6P18H1_03939 [Haemophilus influenzae 6P18H1] Length = 345 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 64/200 (32%), Gaps = 15/200 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D+S SM + + I+ ++ +++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAG------ENLRQMQEGISRLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L G T+ L + + + D Sbjct: 57 GTLAPL---TELMSFYPPRLP-IGSGTSIGAALNHLMDSLEKDIMRSTPEKKGDWKPLIY 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I M+DG S + + A + IGI + + L + E+ Sbjct: 113 I--MSDGSPTDDPAQAISRW--KHHFQNKAKLINIGIGKFANLDTLNEISDLTYRLDDED 168 Query: 351 PHSMYDAF-SHIGKDIVTKR 369 +Y A + I+++ Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|145632218|ref|ZP_01787953.1| hypothetical protein CGSHi3655_07184 [Haemophilus influenzae 3655] gi|144987125|gb|EDJ93655.1| hypothetical protein CGSHi3655_07184 [Haemophilus influenzae 3655] Length = 345 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 64/200 (32%), Gaps = 15/200 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+D+S SM + + I+ ++ +++ P V ++ F+ Sbjct: 3 RLPVYLVVDISESMAG------ENLRQMQEGISRLVNQLRRDPYALESVYLSVIAFAGAA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L G T+ L + + + D Sbjct: 57 GTLAPL---TELMSFYPPRLP-IGSGTSIGAALNHLMDSLEKDIMRSTPEKKGDWKPLIY 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I M+DG S + + A + IGI + + L + E+ Sbjct: 113 I--MSDGSPTDDPAQAISRW--KHHFQNKAKLINIGIGKFANLDTLNEISDLTYRLDDED 168 Query: 351 PHSMYDAF-SHIGKDIVTKR 369 +Y A + I+++ Sbjct: 169 IEKVYRALCESVADSILSQS 188 >gi|220917018|ref|YP_002492322.1| tyrosinase [Anaeromyxobacter dehalogenans 2CP-1] gi|219954872|gb|ACL65256.1| tyrosinase [Anaeromyxobacter dehalogenans 2CP-1] Length = 874 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 65/180 (36%), Gaps = 26/180 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++ N+ M++VLD S SM +S + +T+ D+ S + ++ + Sbjct: 332 ALRANTIARPTAAMVMVLDRSNSMTFDSGVGAGVTRADVLRFSAPTAVVVLE------DT 385 Query: 219 VQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + +F + + G + I+ + + I + Sbjct: 386 NAMAVCSFDHDAHPGIPMTEAAGAGKLTISAAIQSYAPNPNG----------WTSIGEGV 435 Query: 275 GMRQHCNTEDANYK-KIIVFMTDGENLSTKEDQQSLYYCNE--AKKRGAIVYAIGIRVIR 331 + +++ K +V +TDGE ++ ++ + A G VYAIG+ Sbjct: 436 ALAHGIVAPVTDHEVKALVVLTDGEENHGPHARRYIHDVEDLIASLNG-RVYAIGLGRAE 494 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +VLDVS SM++ +++ A ++ N +++ V +V +++ Sbjct: 26 GPSARPAPKVELVLDVSGSMKTRDIDGQSRMSAAKQAFNDVIDAVP--EEVELGIRTLGA 83 Query: 225 TF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + ++ ++ + + + LS G T P L A + + Sbjct: 84 DYPGEDKARGCKDTRQLYPVGPID--RTEAKTAVATLSPTGW-TPIGPALLGAADDLDGD 140 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIR 331 +G + IV ++DGE+ D + AK ++ +G+ Sbjct: 141 EGGSRR-----------IVLISDGEDTCGPLDPCEVAREIAAKGVDLVIDTLGLVPNAKI 189 Query: 332 SHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + +C A+ ++ V++ + D + + Sbjct: 190 RQQL--SCIAGATGGTYTAVQHKEDLSDKVKQLVDRAADPVVT 230 >gi|253998579|ref|YP_003050642.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] gi|253985258|gb|ACT50115.1| von Willebrand factor type A [Methylovorus sp. SIP3-4] Length = 326 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 72/235 (30%), Gaps = 38/235 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T S + T +++++D S SM+ F +I A + Sbjct: 63 VAGPATPSQPIER-TGVGAQLVLIIDRSASMDDPFSGAIASGRAGESKAAAAERLITRFV 121 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS----TNSTPGLKYAYNQ 269 + G+VTFSN L + + + G S TN GL Sbjct: 122 NERKNDMFGMVTFSNSAMHVLPL---TESKEAILAAIRAAGGSALFQTNIGSGLTTG--- 175 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEA---------KKRG 319 + + + I+ ++DG ++ + + ++ G Sbjct: 176 -------LAQFDKTPDSGSRAIILLSDGGGRIGAATQEKIRDWLDRMHVTLYWIVLRQPG 228 Query: 320 -AIVYAIGIRVIRS---------HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ + +++ + S Y E+P S+ A I + Sbjct: 229 SISIFDETYKTPDDRPPPPAIELNDYFKTLRSGYQPYEAEDPQSLAAAIQDINRK 283 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 79/223 (35%), Gaps = 34/223 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +VLDVS SM++ +++ A ++ N +++ V +V +++ Sbjct: 27 GPSARPAPKVELVLDVSGSMKTRDIDGQSRMSAAKQAFNDVIDAVP--EEVELGIRTLGA 84 Query: 225 TF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + ++ ++ + + + LS G T P L A + + Sbjct: 85 DYPGEDKARGCKDTRQLYPVGPID--RTEAKTAVATLSPTGW-TPIGPALLGAADDLDGD 141 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIR 331 +G + IV ++DGE+ D + AK ++ +G+ Sbjct: 142 EGGSRR-----------IVLISDGEDTCGPLDPCEVAREIAAKGVDLVIDTLGLVPNAKI 190 Query: 332 SHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + +C A+ ++ V++ + D + + Sbjct: 191 RQQL--SCIAGATGGTYTAVQHKEDLSDKVKQLVDRAADPVVT 231 >gi|83645400|ref|YP_433835.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83633443|gb|ABC29410.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 749 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/228 (14%), Positives = 73/228 (32%), Gaps = 41/228 (17%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 C+ + W + + +P D+ +++DVS SM+ + + Sbjct: 14 CSLMLWRVAAAQMELP------------PPADVRVLIDVSGSMKKNDPKN------LRRP 55 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ E+ D V G + W + + +S + TN Sbjct: 56 ALNLVTELLPEGDSAGVWTFGQYVNELAPHQVVDPGWRRLAKDKA-REISSTALYTNIGA 114 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD---------GENLSTKEDQQSLYYC 312 L+ A + + + + ++ + +TD GEN++ ++ + Sbjct: 115 ALEKA------SEDFVEGKDYSNTHF----ILLTDGVVDISQKPGENVAERDRVLTQVLK 164 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAF 358 A GA ++ I + L+ S + EN + F Sbjct: 165 RVA-GFGAKIHTIALSRNADQMLLQRLSIGSNGINAIAENSEQLSRVF 211 >gi|328884707|emb|CCA57946.1| hypothetical protein SVEN_4660 [Streptomyces venezuelae ATCC 10712] Length = 535 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 63/194 (32%), Gaps = 26/194 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SNKI 230 + VLD S SM + +++ ++ + + + +V + V ++ + Sbjct: 357 VYVLDTSGSM------NGDRLERLKTALVELTGDFRDREEVTLMPFGSAVKRDEVRTHTV 410 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + ++ + L+ G T L+ AY + G Sbjct: 411 DPASPRQA-LDAIRADARKLTASG-GTAIYSSLQEAYRSLGKSSGDTFTS---------- 458 Query: 291 IVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYL 347 IV MTDGEN S Y A ++ V+ I E + + Sbjct: 459 IVLMTDGENTDGAPAAAFDSFYGSLPAGQQRTPVFPILFGDSDRAELGHLADLTGGKLFD 518 Query: 348 VENPHSMYDAFSHI 361 S+ AF I Sbjct: 519 AHQ-GSLDGAFEEI 531 >gi|302517857|ref|ZP_07270199.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426752|gb|EFK98567.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 610 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 72/220 (32%), Gaps = 46/220 (20%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+D S SM +++D+ S+ L + D GL FS +++ Sbjct: 404 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 457 Query: 233 F-------------FLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDMQG 275 GV+ + + G +T AY Q Sbjct: 458 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 517 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS------LYYCNEAKKRGAIVYAIGIRV 329 + +V +TDG N + + A+ + AI + Sbjct: 518 ADKFNA---------VVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGP 566 Query: 330 IRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ++A A+ S + V++P +++ I IV Sbjct: 567 EAAGDDVKAIGSATGGSGFKVDDPAQIHEV---INNAIVE 603 >gi|90423420|ref|YP_531790.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB18] gi|90105434|gb|ABD87471.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB18] Length = 333 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 82/227 (36%), Gaps = 41/227 (18%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKS-INAMLEEVKLIPDVNNVVQ 220 + + Q + +++V+D S SM +SF S T + + L I + + Sbjct: 74 QSSQQIGSGAHIVLVIDRSSSMDDSFAGSRPTAEQASKSAEARRFLRSFVSIGEHDMF-- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQH 279 G+ FS + L + I + + G+S T+ G+ A + H Sbjct: 132 -GVAIFSTAPLQALPLTSHREAVLAAIDAIDRPGLSETDIARGIAMA---------LAMH 181 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------- 332 + A + IV ++DG + + Q+ L +KR +Y + +R + Sbjct: 182 DDDPSA-ASRAIVLVSDGAGVIDRRVQEKLRAA--FRKRPINLYWVFLRTANARGIFEPP 238 Query: 333 ----------------HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 H F ++ P + E P S+ DA + I + Sbjct: 239 GAGERDVPQVAPERHLHRFFQSLKIPYRAFEAERPESIGDAIAEIAR 285 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/225 (16%), Positives = 75/225 (33%), Gaps = 37/225 (16%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 L+ T+ +M +T V ++ D + LDVS SM+ K+ Sbjct: 249 RLEVLTYRENEDKPGTFMMVMTPGVDLHPLEGGA-DFVFALDVSGSMQG-------KLHT 300 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSK 252 + + ++K + +V F+N + E + ++ L+ Sbjct: 301 LASGVKKAIGQLKPED------RFRVVAFNNTAFDLNRGWVSATEANLRETFARLDQLNS 354 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G TN G+ A ++ DA+ ++ +TDG D ++ Y Sbjct: 355 NG-GTNVYAGVHLALERL-------------DADRVATLILVTDGVTNQGIVDPKAFY-- 398 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMY 355 K+ Y + + ++ AS S+ V N + Sbjct: 399 KLMHKQDLRFYGFLLGNSSNWPLMQLMCDASGGSYRAVSNSDDII 443 >gi|163845716|ref|YP_001633760.1| magnesium chelatase [Chloroflexus aurantiacus J-10-fl] gi|222523421|ref|YP_002567891.1| magnesium chelatase [Chloroflexus sp. Y-400-fl] gi|163667005|gb|ABY33371.1| Magnesium chelatase [Chloroflexus aurantiacus J-10-fl] gi|222447300|gb|ACM51566.1| Magnesium chelatase [Chloroflexus sp. Y-400-fl] Length = 696 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 50/133 (37%), Gaps = 19/133 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM + ++ ++ ++L Q GLV+F Sbjct: 514 VCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYARV 563 Query: 234 -FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L V QR+++ + G T + G+ A+ + + DA ++V Sbjct: 564 LLPLTNSVELAQRRLQSMPT-GGKTPLSRGMLTAF-------ELLERARRRDAEVVPLMV 615 Query: 293 FMTDGENLSTKED 305 +TDG+ + D Sbjct: 616 LLTDGQANVSISD 628 >gi|330839962|ref|YP_004414542.1| von Willebrand factor type A [Selenomonas sputigena ATCC 35185] gi|329747726|gb|AEC01083.1| von Willebrand factor type A [Selenomonas sputigena ATCC 35185] Length = 215 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 60/148 (40%), Gaps = 12/148 (8%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 RL + ++LD S SM I+ + I A+L E++ P ++TF++ Sbjct: 5 GRRLPVYLLLDCSGSM------MGEPIEAVRQGIKALLSELRGDPQALETAYLSVITFAS 58 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ + L + K L G T LK + +R++ T+ +++ Sbjct: 59 QVRQTTKL---TELMLFKEPRLEAEGC-TLMGGALKLLAECVRT--EVRKNTETQKGDWR 112 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAK 316 ++ +TDG ++ +Q+ K Sbjct: 113 PLVFLLTDGSPTDLEDFRQAAAEIKSLK 140 >gi|194385074|dbj|BAG60943.1| unnamed protein product [Homo sapiens] Length = 506 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 58/161 (36%), Gaps = 19/161 (11%) Query: 219 VQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQI 270 V ++TF+++ + L + I L + G TN+ L Y + Sbjct: 45 VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMM 104 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYA 324 + + + I+ +TDG++ + + + + N+ + +YA Sbjct: 105 NNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYA 164 Query: 325 IGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 IG+ + +E +++++ +++ F H Sbjct: 165 IGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 205 >gi|15890597|ref|NP_356269.1| nitric oxide reductase NorD protein [Agrobacterium tumefaciens str. C58] gi|15158850|gb|AAK89054.1| nitric oxide reductase NorD protein [Agrobacterium tumefaciens str. C58] Length = 632 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 92/226 (40%), Gaps = 31/226 (13%) Query: 154 IVMPITSSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 I S +++ + + L + I++DVS S +++FD+ +D+ +++ + + Sbjct: 422 IAAGGQGSDRIHVASRPQAHDLAVTILVDVSLSTDAWFDNRRV-LDVEKEALLVLAHGLS 480 Query: 211 LIPDVNNVV-----QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 D+++++ + V + + ++R+I L G T P +++ Sbjct: 481 ACGDIHSILTFTSRRRSWVRVETVKDFDEPMG---HTIERRIAALKP-GFYTRIGPAIRH 536 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRG 319 A ++ + R K+++ +TDG+ + S +EA++ G Sbjct: 537 ASAKLHERPERR-----------KLLLLLTDGKPNDVDHYEGRFAIEDSRRAVSEARRSG 585 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 V+ + + ++ ++ A N + +V + A I + + Sbjct: 586 VSVFGVTVDS-KAQSYIPAMFGQNGYAIVSRIAKLPSALPAIYRSL 630 >gi|239908150|ref|YP_002954891.1| hypothetical protein DMR_35140 [Desulfovibrio magneticus RS-1] gi|239798016|dbj|BAH77005.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 595 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 35/175 (20%), Positives = 61/175 (34%), Gaps = 29/175 (16%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S+ RL +M+VLD SRSM + D ++ A + + +L + + Sbjct: 90 GKPDAASKAPPRLRLMVVLDCSRSMLA-RDVPPDRLGAAKRLVLDVLARLPGLD------ 142 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGM 276 GLV F+ + + L + L+ T+ GL+ A + G+ Sbjct: 143 -VGLVGFAGRAWLACPPTPDRAGLALFLDGLTPEAAPLGGTDPAKGLEAA---GLALAGV 198 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R ++ +TDGE Q G VYA+ + + Sbjct: 199 RPAA----------VLLVTDGEATVKPAGQTRTVL-----PPGVPVYAVAVGSGQ 238 >gi|281423275|ref|ZP_06254188.1| BatB protein [Prevotella oris F0302] gi|281402611|gb|EFB33442.1| BatB protein [Prevotella oris F0302] Length = 342 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 32/173 (18%), Positives = 73/173 (42%), Gaps = 20/173 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + K++ ++++I LD+S SM + D + ++++ + I +++ N+ V Sbjct: 78 AGTKISHDKRNGIEVIIALDISNSMLAE-DVTPSRLEKSKLLIENLVDHF-----TNDKV 131 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GLV F+ + + ++ + ++T T + + Q+ Sbjct: 132 --GLVVFAGDAFVQLPITSDYVSAKMFLQNIKPSLIATQGTD--------LARAIELSQN 181 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + N K I+ +TDGE+ + +L A K+G+ V+ +GI + Sbjct: 182 SFMQRDNIGKAIIVITDGEDH----EGGALEAAKAAHKKGSNVFILGIGDPKG 230 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 23/173 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S K+ +++ +L+++ N L++FS ++ Sbjct: 244 VIFVIDKSGSM------SGRKMQQTREALIKILDDLSPKDQFN------LISFSGDAAQW 291 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L V+ + + G T+ + A Q+ + ++ + Sbjct: 292 KPLLVPASAENVNQARSYAAGIQAHG-GTDINEAVLMAV-QLLNSAKQKELMPEGTVSL- 348 Query: 289 KIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+ +TDG+ + + + EA ++ +G S+ FL A Sbjct: 349 --IILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLA 399 >gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1] gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1] Length = 718 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 69/187 (36%), Gaps = 31/187 (16%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 V+DVS SM + ++++ K + +L + N ++ +G + + Sbjct: 292 FVVDVSGSMNGYP------MEVSKKLLRNLLVNLPETDHYNILLFAG----GSSVLAPEP 341 Query: 236 LEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L ++Q+ I +L+ + G T LK AY + A + +V Sbjct: 342 LACTKENIQKGINFLTNERGGGGTRLLNALKTAY--------ALPRMDKTSA---RSMVV 390 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACA--SPNSFYLVEN 350 +TDG ++++ + G A V+ GI + L A S + ++ Sbjct: 391 ITDG---YVSVERKAFEMIEQ--NLGQANVFTFGIGSGVNRYLLEGMAKISNSETFIATE 445 Query: 351 PHSMYDA 357 + D Sbjct: 446 MNEANDV 452 >gi|148253599|ref|YP_001238184.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. BTAi1] gi|146405772|gb|ABQ34278.1| NorD protein required for nitric oxide reductase (Nor) activity [Bradyrhizobium sp. BTAi1] Length = 637 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 81/210 (38%), Gaps = 32/210 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L + +++DVS S +++ D + +D+ +++ + + D ++++ TF+++ Sbjct: 447 DLAVTLLVDVSLSTDAWVDG-VRVLDVEKEALLVLAHGLSACGDSHSIL-----TFTSRR 500 Query: 231 EEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + LE ++R+I L G T +++A H + Sbjct: 501 NSWVRLETVKAFGEPMSGQVERRIGALRP-GYYTRIGTAVRHA-----------AHELAD 548 Query: 284 DANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K++++ +TDG+ + + EA++ G V+ + I + + Sbjct: 549 RPERKRLLLVLTDGKPNDVDHYEGRFAIEDTRKAVQEARRAGVAVFGVTIDAA-AQSYFP 607 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +V N + A + + + Sbjct: 608 TLFGRGGYAIVGNIRRLPAALPALYRQLAH 637 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 38/107 (35%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F+ + +G + LT I +P++ ++I+V + T L + +D + A ++ Sbjct: 7 INRFWDDNRGYVIALTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGARELDG 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 + R + + + D + ST Sbjct: 67 RDDAITRADAAIEALANSAAFGGGGNGMSLGSHITVAYDAGNAAGST 113 >gi|167526046|ref|XP_001747357.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774192|gb|EDQ87824.1| predicted protein [Monosiga brevicollis MX1] Length = 889 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 35/192 (18%), Positives = 74/192 (38%), Gaps = 28/192 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ LD S S+ + + S+ +L + V+Q G +F ++++ Sbjct: 557 IVLALDDSTSVGY---AGFQRQVTWASSLAQLLTF------DSGVIQMGAFSFDDRVQPI 607 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN-TEDANYKKIIV 292 V L ++ G +++ L + G R + + A++++I+V Sbjct: 608 SRFT-SVEVLVDTLQSAKWTGGASSLAEALSF--------TGARYFQDLSLPADHRRILV 658 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----VIRSHEFLRACASPNSFYL- 347 + DG T Q + ++ G ++AI ++ V + L A S Y Sbjct: 659 ILLDGAADDTVA--QVARQATQLRQMGVTIFAIAVQDSNDVQQQEAQLMAAVSSTVEYHL 716 Query: 348 --VENPHSMYDA 357 V N ++ DA Sbjct: 717 MRVPNMEALMDA 728 >gi|313200651|ref|YP_004039309.1| von willebrand factor type a [Methylovorus sp. MP688] gi|312439967|gb|ADQ84073.1| von Willebrand factor type A [Methylovorus sp. MP688] Length = 316 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 72/235 (30%), Gaps = 38/235 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T S + T +++++D S SM+ F +I A + Sbjct: 53 VAGPATPSQPIER-TGIGAQLVLIIDRSASMDDPFSGAIASGRAGESKAAAAERLITRFV 111 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS----TNSTPGLKYAYNQ 269 + G+VTFSN L + + + G S TN GL Sbjct: 112 NERKNDMFGMVTFSNSAMHVLPL---TESKEAILAAIRAAGGSALFQTNIGSGLTTG--- 165 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNEA---------KKRG 319 + + + I+ ++DG ++ + + ++ G Sbjct: 166 -------LAQFDKTPDSGSRAIILLSDGGGRIGAATQEKIRDWLDRMHVTLYWIVLRQPG 218 Query: 320 -AIVYAIGIRVIRS---------HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 ++ + +++ + S Y E+P S+ A I + Sbjct: 219 SISIFDETYKTPDDRPPPPAIELNDYFKTLRSGYQPYEAEDPQSLAAAIQDINRK 273 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 110/335 (32%), Gaps = 52/335 (15%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 V+ I++E G + D + + W + EL+ D ++ +I Sbjct: 74 VYTYEAIVDEAGGLYQSPQPNEDNYVKKHDIWTEDVQKELKTIKPALD----ENASEEEI 129 Query: 119 VVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + Y + Y P + + + P T + + +++ IV Sbjct: 130 QHLFNQFLYIV----GYDYTPFETIDRFSYVIFKNDMENPFTHE---KIEENMNVNVEIV 182 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD S SM T +++A +SI +L E+ N V G+ F +K + + Sbjct: 183 LDASGSMVKKI-GDKTMMEIAKESIKKVLSEM----PANAKV--GIRVFGHKGDNTASKK 235 Query: 238 W---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 V +++ ++ + G T+ ++Y + + G + Sbjct: 236 DESCGSNELIYPIGDLNVEGIEKALEPIQPTGW-TSIAKSIEYGVEDLKALDGEKTLN-- 292 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGIRV-IRSHEFLR-- 337 I+ +TDG + + K ++ IG V + L+ Sbjct: 293 -------ILYIITDG---IETCGGNPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQI 342 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A A+ + V + + + I + + W Sbjct: 343 ADAAGGYYSSVNDANKLTGELYRINELAFSDYKWE 377 >gi|225028941|ref|ZP_03718133.1| hypothetical protein EUBHAL_03231 [Eubacterium hallii DSM 3353] gi|224953716|gb|EEG34925.1| hypothetical protein EUBHAL_03231 [Eubacterium hallii DSM 3353] Length = 291 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 69/193 (35%), Gaps = 14/193 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + ++DVS SM TKI ++ +L + + + + V+ ++ FS Sbjct: 48 AKKSMTIFFMIDVSGSM------KGTKIGSLNSTMEELLPSLIGVGEASTDVKIAIMKFS 101 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 +E + Q L G++ + A+ ++ ++ ++ Sbjct: 102 TDVEWVTPEPVKIEEYQYW-NRLEADGLT-----FMGDAFMELSKKLSRSTFLSSPSLSF 155 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASPNSFY 346 +I ++DG + N+ + G + A+GI + + LRA + Sbjct: 156 APVIFLLSDGSPNDDWKKGLDTLKQNKWFQHGLKI-ALGIGSKVNMDVLRAFTGNDELAV 214 Query: 347 LVENPHSMYDAFS 359 +N + + Sbjct: 215 QAKNADQLRELIK 227 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 46/275 (16%), Positives = 99/275 (36%), Gaps = 21/275 (7%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 + AI++ + +VLG +I++++++ K L ++D + + A + ++ +G Sbjct: 1 MAAIWVMVAIVVLG-VIDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKNNVT 59 Query: 82 ILCRIKN-----TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 I + T + G R S V P + + + ++ Y+ Sbjct: 60 SSANINDKGYAFTLISATATANPTAGNDQIAVSCGRWDSATAYVTPASASANAAQVTAYR 119 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV-LDVSRSMESFFDSSITKI 195 F + + R V+ T++ + + + + L+ S S + D +T + Sbjct: 120 QVNYFFLGLLSQLSGRQAVVSATATARAAAIDTFSVGSTLANLNTSSS--AILDPLLTGL 177 Query: 196 DMAIKSINAMLEEVKLIPDVNNV--------VQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 A ++N L + + N Q G S+ LL G++ I Sbjct: 178 LGATTNVNVGLANYQALAGANVTLGQLATVATQLGTAGMSSPASVGKLL--GLNLTVSDI 235 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L+ V +N+T G + G + N Sbjct: 236 LSLTATAVGSNTTVG--TVLTALKTSVGANVNANK 268 >gi|145482427|ref|XP_001427236.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394316|emb|CAK59838.1| unnamed protein product [Paramecium tetraurelia] Length = 1189 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 36/228 (15%), Positives = 70/228 (30%), Gaps = 24/228 (10%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF--DSSITKID 196 L+ +P ++ ++++LD+S SM+ + +T++ Sbjct: 707 LQQIGSMPQDDEDDDSEQNKEKVMEPKITNPPNEAIVVLLDISGSMDELYYDSEDLTRMG 766 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + + + + D+ +V LV F N+I E + + G Sbjct: 767 VVKAFFSTFADRT-MAYDLKHV--ISLVYFDNRIIEKCSFTELFILFKDLVNKAQPTGR- 822 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI---IVFMTDGENLSTKEDQQSLYYCN 313 TN LKYA NQ+ Y K I+ +TDG++ + Sbjct: 823 TNLYRALKYAENQLL----------KFKQTYPKCLLRIIALTDGQDNDNHPLDPIKVAES 872 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFS 359 K + V L+ A+ + + F Sbjct: 873 ILKN---EILLDSFVVSDDCTDLKKITKATGGQCFSPQTIQEGLKLFE 917 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 67/190 (35%), Gaps = 16/190 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEF 233 MIVLD S SM + T+ID A ++ +LI D+++ GL + E Sbjct: 41 MIVLDNSGSMTAQDAGGQTRIDAAKQA------STQLINDISDRTDVGLTYYGGNTGETE 94 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +E G + I G + + + F G + IV Sbjct: 95 ADVEMGCQDV--TILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGNIVL 152 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRACA--SPNSFYLV 348 ++DG T D E + G ++ IG+ V + E L A ++ Sbjct: 153 VSDGIANCTPPD--VCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADA 210 Query: 349 ENPHSMYDAF 358 + S+ DA Sbjct: 211 SDAQSLTDAL 220 >gi|322806307|emb|CBZ03875.1| centrosomal protein Cep290 [Clostridium botulinum H04402 065] Length = 647 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 92/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFWKK-LIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + + YK L T + + Sbjct: 386 KDGQVKGKLDVDSFINFYPDFVEAEKKGNYK-NLPIFNRYLLETQADILP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+ + K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEASRKALAVTLLS---IDDFNKYLKSNAEQ 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S + R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ +Q + KI+ +TDG + ++++ E + VYA Sbjct: 541 ISNRMTSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAIQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|307102430|gb|EFN50705.1| hypothetical protein CHLNCDRAFT_28788 [Chlorella variabilis] Length = 344 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 86/242 (35%), Gaps = 37/242 (15%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS------------ 190 + + +R + +S++ + RL+++++LDVS SM F S Sbjct: 57 DPLRNASAARRRSLLQAASLQASDFRRPRLNLVLLLDVSGSMGESFMSYYYDAAGTQQNL 116 Query: 191 -----SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVS 241 + TKID+A + ++ +L+ + V +V FS + L + Sbjct: 117 TAEELNTTKIDVAKEVLSGVLDLLAPNDSVA------IVLFSTRACTPQPLSRVSCLDIP 170 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 LQ +I ST+ + GL A ++ T+ N I+ +TD + Sbjct: 171 ALQAQIDKDMHATSSTSLSAGLDLAIAELKKCSEGMSASLTDTENR---IMVITDQQPN- 226 Query: 302 TKEDQQSLYYCNEAK---KRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYD 356 D + + G IG+ + + E + + ++Y V P Sbjct: 227 -SGDYTTGGLAARLRKDADDGIFTTIIGVGLDLNSELAESISKVRGANYYSVHRPGEFRR 285 Query: 357 AF 358 Sbjct: 286 RL 287 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 67/202 (33%), Gaps = 32/202 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-----VKLIPDVNNVVQS-------G 222 + VLD S SM + ++ +++ + D V G Sbjct: 355 VYVLDTSGSM------NGDRLAQLKRALGQLAGSDVSPTGDRFRDREEVTLMPFGSRVKG 408 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + T + E + ++ ++ + L+ G T L+ AY+ + Q + Sbjct: 409 VRTHTVPGERPAPV---LAAIRADAEALTADG-DTAIFSSLQAAYDHLA------QRRSA 458 Query: 283 EDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + IV MTDGEN +T D + Y +R A V+ I + Sbjct: 459 LGDDRFTSIVLMTDGENTTGATASDFDAYYRRLRGPERTAPVFPIVFGDSDRSQLQSIAT 518 Query: 341 -SPNSFYLVENPHSMYDAFSHI 361 + + S+ AF I Sbjct: 519 LTGGRLFDATK-GSLDGAFEEI 539 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 41/190 (21%), Positives = 67/190 (35%), Gaps = 16/190 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEF 233 MIVLD S SM + T+ID A ++ +LI D+++ GL + E Sbjct: 41 MIVLDNSGSMTAQDAGGQTRIDAAKQA------STQLINDISDRTDVGLTYYGGNTGETE 94 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 +E G + I G + + + F G + IV Sbjct: 95 ADVEMGCQDV--TILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGNIVL 152 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRACA--SPNSFYLV 348 ++DG T D E + G ++ IG+ V + E L A ++ Sbjct: 153 VSDGIANCTPPD--VCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADA 210 Query: 349 ENPHSMYDAF 358 + S+ DA Sbjct: 211 SDAQSLTDAL 220 >gi|312136517|ref|YP_004003854.1| magnesium chelatase [Methanothermus fervidus DSM 2088] gi|311224236|gb|ADP77092.1| Magnesium chelatase [Methanothermus fervidus DSM 2088] Length = 269 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 72/208 (34%), Gaps = 25/208 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D+S SM S + K+ +K I + + + G ++ Sbjct: 75 IVLVVDISGSMSS--EDKAEKVKGVLKRI-----MLDAQRHKDKLAIIGFKGREARVILP 127 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + KI+ +S G +T GLK A + + + Y I+V Sbjct: 128 S--TRRIFSFKDKIENIS-VGGTTPMAHGLKKAIEIL-------KREKKKSNEYVPILVL 177 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK---RGAIVYAIGI----RVIRSHEFLRACASPNSFY 346 ++DG ++ + AK+ I R+ F A S +Y Sbjct: 178 LSDGMPNIALKNSPTRDVLELAKELNRSDIHTVIINFEKRIMRGRNFNFELALTSGGKYY 237 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWYDK 374 V N ++ + I K + DK Sbjct: 238 EVGN-RNIVETVGSIIKHERNSLFYNDK 264 >gi|158337841|ref|YP_001519017.1| hypothetical protein AM1_4727 [Acaryochloris marina MBIC11017] gi|158308082|gb|ABW29699.1| conserved hypothetical protein [Acaryochloris marina MBIC11017] Length = 971 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 39/190 (20%), Positives = 73/190 (38%), Gaps = 33/190 (17%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I P+ + N + DMM +LD S S+ S+++ + ++ AML + Sbjct: 557 ITTPVNPDIDNNFCQVSS-DMMFILDKSGSV------SLSERRLQRDAVMAMLNYLV--- 606 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVS------HLQRKIKYLSKFGVSTNSTPGLKYAY 267 D N + G+V F + + Y++ G +TN G + A Sbjct: 607 DNNITSRVGIVRFDSTSATVIGYTDVTAANLPTFESALNTNYVNIGGGATNWEAGFQQAI 666 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + + ++ F DG N + ++L + K+ GA +Y IG Sbjct: 667 S---------LGVSPGSPD---VVFFFADGNINSGGSPNDEALQF----KQAGAHIYGIG 710 Query: 327 IRVIRSHEFL 336 I+ + +FL Sbjct: 711 IQSLDIDDFL 720 >gi|118096709|ref|XP_001233876.1| PREDICTED: similar to tumor suppressor candidate 4 [Gallus gallus] Length = 1208 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 72/190 (37%), Gaps = 38/190 (20%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++DVS S+ + + S+ ML+ + ++ V F Sbjct: 81 GASSPKDMVIIVDVSGSVSGLT------LKLMKTSVCEMLDTL----SDDDYVNVASKVF 130 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + ++ + G +T+ G +YA++Q+ + R +CN Sbjct: 131 -----------------KEDVQGMVVKG-TTDYKAGFEYAFDQLQNSNITRANCN----- 167 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSF 345 K+I+ TDG ++ + + N K +++G ACA+ + Sbjct: 168 --KMIMMFTDGGEDRVQDVFEKYNWPN--KTVRVFTFSVGQHNYDVTPLQWMACANKGYY 223 Query: 346 YLVENPHSMY 355 + + + ++ Sbjct: 224 FEIPSIGAIR 233 >gi|15603687|ref|NP_246761.1| PpkA [Pasteurella multocida subsp. multocida str. Pm70] gi|12722246|gb|AAK03906.1| PpkA [Pasteurella multocida subsp. multocida str. Pm70] Length = 674 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 81/210 (38%), Gaps = 46/210 (21%) Query: 149 TNSRHIVMPITSSVKV---NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 T+++ + + T R ++ V+D + SM+ + I+ ++I + Sbjct: 216 TSTKTVSSDAKKTATADNPQEVTGFRAAVVFVIDSTISMDPY-------INRTREAIKQV 268 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-------------GVSHLQRKIK---- 248 ++++ ++ V+ GLV F + + LE+ G +Q+ Sbjct: 269 YDQIEK-ENLGEQVKFGLVAFRSSTKAVKGLEYTSKMFVDPTTVKDGKDFMQKVASLKQA 327 Query: 249 YLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIVFMTD-----GENLST 302 +S S ++ G+ A N+I ++ G R +V +TD G+N + Sbjct: 328 KVSSKEFSEDAYAGINQALNEIQWNQFGAR------------YMVLITDAGAIEGDNPIS 375 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + EA+ RG +YA+ ++ Sbjct: 376 TTGLDAKQLRLEAQHRGVAIYALHLKTPSG 405 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 40/118 (33%), Gaps = 10/118 (8%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 ++T +I V + I+ +H+ K L + +D + + AT + + Sbjct: 20 VMTVSMA-VIVGVAALSIDANHLMVSKNRLQNALDTAALAGATVA--------NRTYEED 70 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ-NEGYSISAISRYKI 137 D I +N D + DD SL I N G+S + S Sbjct: 71 DAKEAIVEAYNKVTSAAGNDELVLAASDDGTSLKSLTIEYSDNANSGFSSNFPSSADY 128 >gi|8131972|gb|AAF73158.1|AF149771_1 ookinete protein [Plasmodium berghei] gi|5139521|emb|CAB45562.1| CTRP protein [Plasmodium berghei] gi|5420459|dbj|BAA82322.1| adhesive protein-like molecule [Plasmodium berghei] Length = 1905 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 37/177 (20%), Positives = 65/177 (36%), Gaps = 26/177 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++LD S S+ F + + I A IN + + V G++ FS+ + Sbjct: 512 DVTLILDESSSIGEFRWT-MEVIPFAKDVINNLNIDYDS-------VHVGVLLFSHYALD 563 Query: 233 FFLLE----WGVSHLQRKIKYLSKFGVSTNST---PGLKYAYNQIFDMQGMRQHCNTEDA 285 + L +KI L + + + LKYA + G Sbjct: 564 LVPFSDEARYNKYTLIKKIDSLKTNYGNGHESFIVKTLKYALSNYTKGSG--------RT 615 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACA 340 N KI + TDG N S++ D + + + IG+ + ++ L CA Sbjct: 616 NAPKITMLFTDG-NDSSESDIDMYNIGSLYRTERVKLLVIGVSMASENKLKQLVGCA 671 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 65/204 (31%), Gaps = 22/204 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++LD S S+ ++ L +++ + +V F+ +F Sbjct: 734 LTVILDESGSIGAYNWEKQ-----VYPFTEKFLNNLEISENKVHVGIMLFAQFNRDFVKF 788 Query: 234 F-LLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + +L ++IK L K G T L Y H ++ K Sbjct: 789 SDKESYDKENLMKQIKGLKESYKSGGYTYIIEALNYGLANYT-------HHEASRSDVPK 841 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRAC-ASPNSFY 346 + + TDG N + D+ K+ + +G+ + L C + + Sbjct: 842 VTMLFTDG-NNTNPGDKLLSDVSLLYKQENVKLLVVGVGASTMANLRLLAGCHKTDGNCP 900 Query: 347 LVENPHSMYDAFSHIGKDIVTKRI 370 L +D I K + K Sbjct: 901 LATKTE--WDNLQDISKLMADKIC 922 >gi|237728581|ref|ZP_04559062.1| TerY1 [Citrobacter sp. 30_2] gi|226910059|gb|EEH95977.1| TerY1 [Citrobacter sp. 30_2] Length = 212 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 44/132 (33%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 3 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAHVSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L ++ L+ G +T+ L + I D + Sbjct: 57 RQAVPL---TDLLSFQMPALTASG-TTSLGEALSLTASSIAKEVQKTTADTKGDWRP--L 110 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 111 VFLMTDGSPNDD 122 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 58/171 (33%), Gaps = 29/171 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + + V D S SM ++ S+ + + + NN V G Sbjct: 365 KKTKDNGKDIIAVFVADCSGSM------DGDPMNQLKNSLTNGAQYI----NDNNYV--G 412 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV++SN + + S+ Q + L G T S + A I + Sbjct: 413 LVSYSNSVTIEVPIAQFDLNQRSYFQGAVNNLIASG-GTASYDAVVVAVKMITE------ 465 Query: 279 HCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + K ++ ++DG N D+ + + G VY IG Sbjct: 466 -AKAQHPDAKCMLFLLSDGYANNGYSMDEITSAL----RTSGIPVYTIGYG 511 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATVV 356 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIKSSDERNTLMAGLPTYPR--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 407 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + + S + Sbjct: 408 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRDADEAVIEMSKITGGSHF 458 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 459 YVSDEAQNNGLIDAFGA 475 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 25/229 (10%), Positives = 69/229 (30%), Gaps = 37/229 (16%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + +G + + + ++ ++ G ++V +T L +DR Sbjct: 28 RFARSEEGSILVFGLMLFILMLMLGGFAVDVMSFEAKRTDLQQAVDRC------------ 75 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + + + ++ + Y Sbjct: 76 ----ALTAAALAQTRDPEEVVEDCMLKAGKADY-------------VTLIDHDEGLNYR- 117 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFF 188 + + P K + P + + +++++VLDVS SM + Sbjct: 118 EVVVTAQQPTKPL--FAHMLGIDSLTAPAATKAEQKVTN---VEIVMVLDVSGSMVRDSY 172 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 K+ + ++ + L D+N+ + +V ++ ++ L Sbjct: 173 SRPTDKLKNLKAAAKEFVDTM-LAKDLNHRISIAIVPYNGQVNLGKSLR 220 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 2/68 (2%) Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASP-NSFYLVENPHSMYDAFSHIG 362 D Q CN AK ++ I ++ C++ +S Y + + AF I Sbjct: 494 DAQLQALCNLAKSNNVTIFGIAFEAPANGKTQIQNCSTSRSSHYFDASGLEIQTAFRAIA 553 Query: 363 KDIVTKRI 370 I R+ Sbjct: 554 SQISYLRL 561 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 50/136 (36%), Gaps = 8/136 (5%) Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 G + +I + G + S G + A + N N I++F+TDGE Sbjct: 657 GYEAVLAQIDDMEAIGGTVYS-DGYEAAKDWFGGKTSPDSLQNNGGEN---IVIFVTDGE 712 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRACASPNS-FYLVENPHSMY 355 + + + A V +GI + + + L + N+ + +E+ + Sbjct: 713 PNNEWSAKNAYNQLVAAVDN-ITVETVGIAITDKDATDLLNGLTTNNNGAHFIEDASKLG 771 Query: 356 DAFSHIGKDIVTKRIW 371 D F I DI T + Sbjct: 772 DVFGEIVSDITTSTVT 787 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 42/110 (38%), Gaps = 6/110 (5%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + V + + T L++ +V+D S SM T++ +++ + Sbjct: 488 GNSGHDNFVGDPGGATESTQTTYTDLNVALVVDTSGSM------DGTRMSETKEALKDLC 541 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 +++K D V L+ FS + +E + K+ + G + Sbjct: 542 DQLKEHADEGADVNLSLIGFSGALNINLPVEDITGIEENYRKFTLRDGDT 591 >gi|296269297|ref|YP_003651929.1| vault protein inter-alpha-trypsin domain-containing protein [Thermobispora bispora DSM 43833] gi|296092084|gb|ADG88036.1| Vault protein inter-alpha-trypsin domain protein [Thermobispora bispora DSM 43833] Length = 796 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 72/209 (34%), Gaps = 34/209 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++VLD S SM K + A +++ + + ++ F N I Sbjct: 299 PRDVVLVLDRSGSM------------AGWKMVAARRAAARIVDTLTERDRLAVLAFDNVI 346 Query: 231 EEFFL--LEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E F L + + L G + P + +G R Sbjct: 347 ERAFPDGLTAATDRARYRAVEFLARLEARGGTEMLAPLEEALTALAAAAEGGR------- 399 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SP 342 ++V +TDG+ D+ + ++ G V+A+GI + FL A Sbjct: 400 ---DAVLVLVTDGQ----VGDEDRILERMASRIGGVRVHAVGIDRAVNAAFLGRLAVLGA 452 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 LVE+ + +A HI + I + Sbjct: 453 GRCELVESEDRLDEAMEHIHRRIGAPLVT 481 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/273 (13%), Positives = 91/273 (33%), Gaps = 33/273 (12%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL------VHAATQIM 66 + +G ++ A FL + I+++ I KT + + ++ + + A Q Sbjct: 18 QSRRGAFMVMAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITDAVQTT 77 Query: 67 NEGNGNNRKKLKGGDILCRIK--NTWNMSFRNELRDNGFVNDIDDIV-------RSTSLD 117 +G G + I ++ + + +++ D+ + Sbjct: 78 ADGIGGSDNVSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTFH 137 Query: 118 IVVVPQNEGYSISAISRYKIPL-------KFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 +V + Y++ ++ K + F + + + + ++ + ++ Sbjct: 138 MVWGENAKPYNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTAAVTTSAIAFIEAR--- 194 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D+++VLD S SM +DS + + E L +V SG T+S+ Sbjct: 195 --DIVLVLDYSGSMS--YDSEFDAMSSYR--LGKSAVEANLDDIWETLVDSGA-TYSDSG 247 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 + F G + ++ L Sbjct: 248 KLKFP-ATGYGRINSEVGTYISSTNDDYIYRAL 279 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 60/204 (29%), Gaps = 18/204 (8%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G I + L I +E + + L + + + ++ Sbjct: 16 GVAAIWMGLLLVPIMGFTFWAVEGTRYVQESSRLRDSAEAAAMA--------VTIEDQPG 67 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 ++N L F D + Y+++A + + Sbjct: 68 AARALATKYVENYVRDIKSTNLSAQRFYQAED--------KGTGALEYIQYTVNARTTHD 119 Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI-TKI 195 +FIP + + + + D +D++ V D SRSM + SS KI Sbjct: 120 SWF-ASSFIPSFDKQQELAGRSLARKYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKI 178 Query: 196 DMAIKSINAMLEEVKLIPDVNNVV 219 D +I+ + + V Sbjct: 179 DDLKTAIDQISNNILCKSTRQEYV 202 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 54/180 (30%), Gaps = 12/180 (6%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID-------RSLV 59 +I+ G +TIL A+ L ++ ++ + ++++ + +K L + D SL+ Sbjct: 6 SIKRLHSQR-GVVTILFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGAGSLI 64 Query: 60 HAATQIMNEGNGNNRKKLKGG---DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 + Q N + + I N G+ N I S + Sbjct: 65 DTSLQTFNWSAATAKAQEFADVNSADGKTIGQHRQEQDVNVAIQPGYWNLITPSFTSNTG 124 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 + ++ F I + ++V + M I Sbjct: 125 LVTHTGDGNIPAVQVTITLSHLKFFFAPILGIPEG-TVQATAIAAVSPPTGGTGLFPMAI 183 >gi|290474195|ref|YP_003467072.1| serine/threonine protein kinase [Xenorhabdus bovienii SS-2004] gi|289173505|emb|CBJ80285.1| serine/threonine protein kinase [Xenorhabdus bovienii SS-2004] Length = 663 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 71/199 (35%), Gaps = 30/199 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + N ++ V+D + SM + ID ++I + + ++ ++ V+ G Sbjct: 221 EKNMMKGFSASVVFVIDSTISMGPY-------IDRTKQAIEKIYQRIE-QENLLGQVKFG 272 Query: 223 LVTFSNKIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-AYNQIFD 272 LV + + + LE+ + + + +T S+ + AY + Sbjct: 273 LVAYRSNVTAVPDLEYDSKMYVDPNTVKDGKDFLAKVRDLKQATVSSSKVNEDAYAGVMT 332 Query: 273 MQGMRQHCNTEDANYKKIIVFMTD-----GENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + ++ +TD G++ + + EA RG +YA+ + Sbjct: 333 ALDKVDWTKFGA----RYVILVTDAGALEGDDKLSSTQLDATQVRLEATYRGVALYALHL 388 Query: 328 RVIRSHEFLRACASPNSFY 346 + H + AS + Y Sbjct: 389 KTPSGH---KNHASAENQY 404 >gi|283787709|ref|YP_003367574.1| tight adherence protein TadG [Citrobacter rodentium ICC168] gi|282951163|emb|CBG90854.1| putative tight adherence protein TadG [Citrobacter rodentium ICC168] Length = 647 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/255 (14%), Positives = 82/255 (32%), Gaps = 22/255 (8%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + + + + G +TI+ AI P + ++ + ++ + + K L +++ Sbjct: 13 IKLPSTLSSENGAITIVYAIIFPALLAMVALALDGALMINRKARLADASSEAILA----- 67 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + F N N + +V S I + Sbjct: 68 -----ISAVDNRLVDSVAIDNNKQIAKDFVNYYLPNNQAEQLKVVVTSFDRTIEKGYIDY 122 Query: 126 GYSISAISRYKIPLKFCTF--IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 +ISA +PL F + ++ K + D + V+D S S Sbjct: 123 KIAISATLPTLLPLGHLGFSAFDRSVTVGNFDNNSGNARKFVTVISDPADYVFVVDFSDS 182 Query: 184 MESFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 M S + T++ M + + ++ + N Q +V F + F ++ V Sbjct: 183 MNSSYIDQGRVTTRLAMLKQVVREVISG-----NKNPDSQFAIVPFDIGV--PFRIKDSV 235 Query: 241 SHLQRKIKYLSKFGV 255 + ++ G Sbjct: 236 NTTSSYANKENEGGG 250 >gi|229520644|ref|ZP_04410068.1| hypothetical protein VIF_001170 [Vibrio cholerae TM 11079-80] gi|229342468|gb|EEO07462.1| hypothetical protein VIF_001170 [Vibrio cholerae TM 11079-80] Length = 1495 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 44/282 (15%), Positives = 88/282 (31%), Gaps = 23/282 (8%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 G + + + + N V D + S + + S IP Sbjct: 786 NGAAISWVLSADKQTLTGSAGGNKVVEFTLDNQGNVSSTLHAPVDHANKSGEDTLTINIP 845 Query: 139 LKFCTFIPWYTNSRHIVMPITSSV--------KVNSQTDARLDMMIVLDVSRSMESFFDS 190 L+ + + + I S+ ++ ++LDVS SM + Sbjct: 846 LEAKN-AAGAIGTGKVTLVIEDDAPVAKEVFHVAESELKQGANVQLILDVSGSMAWGAGN 904 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKY 249 T++++ +S +L++ + + +VQ + + I+ W V + I Sbjct: 905 GKTRLEVMQESAKLLLDQYSSNSNGHVLVQLVVFNHAASIKTSDNSYWMSVKDAKAYIDD 964 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-------NLST 302 LS + ++ A Q + + +Y F++DG N +T Sbjct: 965 LSARWQTDYDH-AIELAEKFWSGNQNVSPLSGATNVSY-----FLSDGRPEGDDNGNPNT 1018 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ED + + N K A G+ L A Sbjct: 1019 IEDDELESWINHLKANQITALAYGMGNNVPQGELDKVAYDGH 1060 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 41/295 (13%), Positives = 93/295 (31%), Gaps = 44/295 (14%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 +G + + L I + +E + + L + + + Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIA--------VTI 67 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 ++ L G ++N L F D+ + + Y+++A Sbjct: 68 EDQPDLARGLATQYVENYVRDIKSTNLSAQRFHQTEDEGAG--------ILEYIQYTVNA 119 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 + + +FIP + + + + D +D++ V D S SM+ + SS Sbjct: 120 KTTHDSWF-ASSFIPSFDEQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMDDRWGSS 178 Query: 192 I-TKIDMAIKSINAMLEEVKLIPDVNNVV----------------------QSGLVTFSN 228 KID +I+ + ++ V + G V F+ Sbjct: 179 RHKKIDDLKTAIDQISSKILCTSTDLEYVDGEWKEVCDEPGEDTTGDKLLNRVGFVPFNV 238 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + E ++ G ++ ++ Y + VS S + + Y + + + N Sbjct: 239 RTRE--IVSGGQANATSQLSYKPNYKPNVSPYSYNDVNWDYWRAYSQNEVLNCAN 291 >gi|325959993|ref|YP_004291459.1| Magnesium chelatase [Methanobacterium sp. AL-21] gi|325331425|gb|ADZ10487.1| Magnesium chelatase [Methanobacterium sp. AL-21] Length = 711 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 72/233 (30%), Gaps = 35/233 (15%) Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT--------DARLDMMIVLDVSRSM 184 + K P I R + + V ++ A+ + +V+D+S SM Sbjct: 478 VKSKFPKDMGNDIAIDATLRAAALRSEGEITVKTEDLRQKVRKHGAKASIAVVVDISGSM 537 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-HL 243 K +N ++E N + +V F K + + Sbjct: 538 Y-----GEKKAVRVKDILNNLIE-----DAARNGDKVSVVGFKGKDALIIIPTTRRAVSF 587 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--- 300 + +I+ L G +T G+K + + Y +++ +TDG Sbjct: 588 KEQIENLK-IGGTTPLASGMKRGFEIL--------KKEKFRDEYVPMMLILTDGMPNVAI 638 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAI----GIRVIRSHEFLRACASPNSFYLVE 349 S ++ K+ I ++ R A AS +Y +E Sbjct: 639 SKSPVDDAIDIAGSLKENEIHTIIINFEQAVKYGRDMNMELAVASGGRYYDLE 691 >gi|262199538|ref|YP_003270747.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082885|gb|ACY18854.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 523 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 62/150 (41%), Gaps = 17/150 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++S++ + L ++ +DVS SM + D ++ + + A+++++ Sbjct: 116 LQVGMSSTIDPATFERPSLTIVATVDVSGSMGWGYADDQVSAGSLTRNLLGALVDQLGPE 175 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + +VT+ ++++ L + I LS+ G STN GL+ Sbjct: 176 D------RIAIVTYGSRVDTALTLRSAGQKDEIHTAIDKLSEAG-STNMEAGLQR----- 223 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + + +I++F TD + Sbjct: 224 -AYAIASEAAADGETDSTRIMLF-TDVQPN 251 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 65/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKQETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A +G+ Sbjct: 131 IGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLAVRSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + IV +TDGEN + + ++G V +G+ + Sbjct: 191 -----------RAIVVITDGENHEGGAVEAAKVA----AEKGIQVSVLGVGMPDG 230 >gi|319956804|ref|YP_004168067.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319419208|gb|ADV46318.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 398 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 42/204 (20%), Positives = 72/204 (35%), Gaps = 34/204 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF- 233 MI+ D S SM D KID+A +++ +L + GL + ++ + Sbjct: 24 MIIFDASGSMWGQVDGER-KIDIARRAMRDLLRDWNPQIP------LGLTVYGHRRKGDC 76 Query: 234 ----FLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G QR I+ + + T LK Q + Sbjct: 77 NDIETPIPVGPLDRQRMIRAVEGIRPKGKTPIARALKRVAAQFRGSED------------ 124 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEFLRAC---ASP 342 I+ ++DG+ D E +K G A IG V R+ + AC A+ Sbjct: 125 PATIILISDGK---ESCDADPCATARELRKSGIRFVAHVIGFHVDRTTDRQLACIARATG 181 Query: 343 NSFYLVENPHSMYDAFSHIGKDIV 366 S++ N ++ A + + K I Sbjct: 182 GSYFSARNAAALNRAITQVAKKIT 205 >gi|325914146|ref|ZP_08176499.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] gi|325539649|gb|EGD11292.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] Length = 525 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 75/209 (35%), Gaps = 35/209 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++DVS SM + K+ + S+ + +++ + LVT++ Sbjct: 159 LVFLVDVSGSM-----GAPDKLPLLQSSLKLLTRQLRAQD------RITLVTYAGNTAVV 207 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 G + I L + G T G++ AY + +R N I Sbjct: 208 LPPTPGNQQARIVEAIDSL-QSGGGTAGASGIELAY-KAAQQSYLRDGINR--------I 257 Query: 292 VFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYL 347 + TDG+ D +L E ++ G + +G ++ L A A ++ Sbjct: 258 LLATDGDFNVGVTDFDTLKGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAY 317 Query: 348 VENP--------HSMYDAFSHIGKDIVTK 368 +++P H + I +D+ + Sbjct: 318 IDSPLEARKVLTHELGATLETIARDVKIQ 346 >gi|310824705|ref|YP_003957063.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397777|gb|ADO75236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 881 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 83/277 (29%), Gaps = 52/277 (18%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + R L ++ + ++ F +++ + V + S+ Sbjct: 195 KIDLRVDLGREVVVESPSHALTLNKEGHQVRVSFAR--NEVALDRDFVLTVRSPDTSASL 252 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + +++ K TF +V + + + ++ V+D S SME Sbjct: 253 TPLVTHRLGEKPGTFAL------TVVPDLLGLATGPKRQE----VVFVVDTSGSMEG--- 299 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHL----Q 244 + A ++ L ++ + ++ F + F Sbjct: 300 ---ESLPQAQGALRLCLRHLREGD------RFNIIAFDTSFQSFAPQPAVFTQKTLEQAD 350 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 R + L G + P L A + ++V +TDG+ + Sbjct: 351 RWVAALRANGGTELLQPMLAAV-----------------QAAPEGVVVLLTDGQVGNEAE 393 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 Q L A+ +Y+ GI S L+ A Sbjct: 394 ILQAVLRARKTAR-----IYSFGIGTNVSDALLKDMA 425 >gi|304437159|ref|ZP_07397120.1| magnesium chelatase [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369821|gb|EFM23485.1| magnesium chelatase [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 627 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ ++D S SM + ++ M +I ++L+E + V GL+ F ++ E Sbjct: 449 ILFLVDASGSM-----GARERMRMVKGAILSLLQE---AYQKRDCV--GLIAFRRDRAET 498 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + V +++++ L G T GL +A + +++ N ++V Sbjct: 499 LLPMTRSVELAEKQLRELPT-GGRTPLAEGLAHAMQMLHELERRGGGKN--------VLV 549 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKK 317 +TDG + + D A++ Sbjct: 550 LVTDGRANTKEGDAGVQRALQAAEE 574 >gi|115372024|ref|ZP_01459336.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] gi|115370989|gb|EAU69912.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Stigmatella aurantiaca DW4/3-1] Length = 843 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/277 (13%), Positives = 83/277 (29%), Gaps = 52/277 (18%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + R L ++ + ++ F +++ + V + S+ Sbjct: 157 KIDLRVDLGREVVVESPSHALTLNKEGHQVRVSFAR--NEVALDRDFVLTVRSPDTSASL 214 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + +++ K TF +V + + + ++ V+D S SME Sbjct: 215 TPLVTHRLGEKPGTFAL------TVVPDLLGLATGPKRQE----VVFVVDTSGSMEG--- 261 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-LEWGVSHL----Q 244 + A ++ L ++ + ++ F + F Sbjct: 262 ---ESLPQAQGALRLCLRHLREGD------RFNIIAFDTSFQSFAPQPAVFTQKTLEQAD 312 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 R + L G + P L A + ++V +TDG+ + Sbjct: 313 RWVAALRANGGTELLQPMLAAV-----------------QAAPEGVVVLLTDGQVGNEAE 355 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 Q L A+ +Y+ GI S L+ A Sbjct: 356 ILQAVLRARKTAR-----IYSFGIGTNVSDALLKDMA 387 >gi|332706718|ref|ZP_08426779.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] gi|332354602|gb|EGJ34081.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] Length = 972 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 64/190 (33%), Gaps = 22/190 (11%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 H + + ++ ++ D++ V+D S S D + ++ + IN + Sbjct: 274 GNHFAVYLIPALDYSTDEIVPKDVVFVIDTSGS--QMGDPLLKSQELMRRFINGL----- 326 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + V+ L + ++ I Y+++ + + L + Sbjct: 327 ---NPKDTFTIIDVSDRATQLSTKPLSNSPQNCRKAINYINQLKANGGTY--LLKGIRHL 381 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ + + IV ++DG + Q L + K G +Y+ G+ Sbjct: 382 LNLPAAPEGRL-------RSIVLLSDG---YISNENQVLAEVQQQLKPGNRIYSFGVGSS 431 Query: 331 RSHEFLRACA 340 + L A Sbjct: 432 PNRFLLNRLA 441 >gi|297567411|ref|YP_003686383.1| hypothetical protein Mesil_3037 [Meiothermus silvanus DSM 9946] gi|296851860|gb|ADH64875.1| conserved hypothetical protein [Meiothermus silvanus DSM 9946] Length = 351 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/216 (13%), Positives = 65/216 (30%), Gaps = 38/216 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LD S+SM + D + +++ A + Q GL+TFS+ Sbjct: 88 VVIALDASKSMLA-GDLNPNRLEAARAIAKEFVRLAPA------TTQIGLITFSDSASVV 140 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN------------ 281 + LQ + + +T+ + + + ++ Sbjct: 141 VAPTTDRAVLQEALDNVKPV-QNTSLPSAIVTGVRLLPGRKEVQPPKELQPQNPQNPQPQ 199 Query: 282 ----------TEDANYKKIIVFMTDGENLSTK----EDQQSLYYCNE-AKKRGAIVYAIG 326 ++ ++DG +Q +L + A+ G +YA Sbjct: 200 NPLVQPDTPPIPREFPPGSLLVISDGATNVNSNPRLPNQTALEAAAKFAQDNGVKIYAFA 259 Query: 327 IRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + +R +++ P S+ G Sbjct: 260 VG-KEGGAVVRI--EGQDYFVPFEPRSLQQLAERTG 292 >gi|292669496|ref|ZP_06602922.1| magnesium-chelatase subunit [Selenomonas noxia ATCC 43541] gi|292648859|gb|EFF66831.1| magnesium-chelatase subunit [Selenomonas noxia ATCC 43541] Length = 636 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 71/191 (37%), Gaps = 25/191 (13%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 ++SA R P + ++ +V+ + +++ ++D S SM Sbjct: 410 DLALSATLRAAAPYQRARQ-----GTQTVVIRPEDVRVWVRAKRSAANILFLVDASGSM- 463 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQ 244 + ++ M +I A+L+E + GL+ F ++ E + V + Sbjct: 464 ----GARERMRMVKGAILALLQEA-----YQKRDRVGLIAFRRDRAETLLPMTRSVELAE 514 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 ++++ L G T GL A + +++ K ++V +TDG + ++ Sbjct: 515 KQLRDLPT-GGRTPLAEGLACALQTLRELERRGSE--------KTVLVLITDGRTNTARD 565 Query: 305 DQQSLYYCNEA 315 + A Sbjct: 566 GDGGVQRALRA 576 >gi|194333615|ref|YP_002015475.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194311433|gb|ACF45828.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 5009 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 93/256 (36%), Gaps = 26/256 (10%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---FVNDI 107 + + + +++ + ++ + ++L G N Sbjct: 4398 EVALTYAASDTNSDGIDDTITATGPDGTVFTLKVETDGSYTFTLADQLDHTGGGLSGNGD 4457 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 D I ++V ++G S++ + + I ++ +P+T S Sbjct: 4458 DQIKTLDFSSVLVATDSDGDSVTVDNGFTITVQDDVPS---------AIPVTESATATPI 4508 Query: 168 TDARLDMMIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +M+++D S SM+ T++ + + ++++ + + D V+ +V F Sbjct: 4509 DTN---IMLIMDTSGSMDWPSGIPGYTRLQATVAAARQLVDKYEALGD----VRVNIVEF 4561 Query: 227 SNKIEE-FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + G + R L++ G STN L A + D G+ Q + ++ Sbjct: 4562 DTDGNKWTTGWVDGATADSRLTALLTQGGGSTNFDDALLTAMDAWDDTTGLTQIPDAQNV 4621 Query: 286 NYKKIIVFMTDGENLS 301 +Y F++DG+ + Sbjct: 4622 SY-----FLSDGDPTA 4632 >gi|206575540|ref|YP_002235854.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] gi|206570384|gb|ACI12030.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] Length = 346 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 60/188 (31%), Gaps = 15/188 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ +++ + ++ +++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESM---IGENLKRMN---DGLQKIVGDLRKDPHALETVWVSVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G TN L+ QI D + Sbjct: 57 RTIVPL----HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTQEEKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + AKK I A+GI LR + N EN Sbjct: 111 VYLLTDGRPTDDISAEVKRWNEFYAKKVNMI--AVGIGSSVDLSVLRQL-TENVLSFTEN 167 Query: 351 PHSMYDAF 358 + F Sbjct: 168 QEGDFSRF 175 >gi|125570256|gb|EAZ11771.1| hypothetical protein OsJ_01640 [Oryza sativa Japonica Group] Length = 1034 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 62/179 (34%), Gaps = 10/179 (5%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ + K+ + +V + S A +D++ VLDVS Sbjct: 9 TVQVTAYAKVDAVHVDAVKQRPGVPVLVRVVAPPPAAASSERAPIDLVAVLDVSC---CG 65 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + ++D+ K++ +++++ + V + K + + G R + Sbjct: 66 GLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMV 125 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + LK A + +G + H IV ++DG++ S D Sbjct: 126 QSSLTVTGENKLSTALKKAATIL---EGRKDHDKK----RPGFIVLISDGDDASVLNDA 177 >gi|325578369|ref|ZP_08148504.1| hypothetical protein HMPREF9417_1245 [Haemophilus parainfluenzae ATCC 33392] gi|325160105|gb|EGC72234.1| hypothetical protein HMPREF9417_1245 [Haemophilus parainfluenzae ATCC 33392] Length = 667 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 27/181 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 ++ V+D + SM+ + I+ +I + E+++ ++ V+ GLV F + Sbjct: 234 GFSAAVVFVIDSTISMDPY-------INRTRDAIKKVYEKIEK-ENLGKQVKFGLVAFRS 285 Query: 229 KIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQ 278 + LE+ + + ++ +T S+ AY + Sbjct: 286 STKAVPGLEYTSKMFVDPSTVKDGKDFMDKVANLKQATVSSKEFSEDAYAGVSQALNEIN 345 Query: 279 HCNTEDANYKKIIVFMTD-----GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + +V +TD G+N + + EA+ RG +Y + ++ Sbjct: 346 WNNFGG----RYLVLITDAGAIEGDNPISTTGLDAKQLRLEAQHRGVALYTLHLKTPSGK 401 Query: 334 E 334 Sbjct: 402 N 402 >gi|255950230|ref|XP_002565882.1| Pc22g19800 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592899|emb|CAP99268.1| Pc22g19800 [Penicillium chrysogenum Wisconsin 54-1255] Length = 896 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 37/252 (14%), Positives = 84/252 (33%), Gaps = 52/252 (20%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS-RYKIPLKFCTFIPWYTNSRHI 154 ++ ++ F ++ I ++ I + I+ + + +P F P N + + Sbjct: 208 ADMPESTFESNYASIKLRENVTIDED-----FVITVNADKQDLPFAFLETHPTLPNQKAL 262 Query: 155 VMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 ++ S+ L ++ V+D S SM + ++ L+ + L Sbjct: 263 MV----SLVPKFSLPPDLSEIVFVVDRSGSMTDN-------MHTLRSALGLFLKSLPLGV 311 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ---------RKIKYLSKFGVSTNSTPGLK 264 N L++F + F W S + + K + T GL+ Sbjct: 312 PFN------LISFGS----SFEAIWARSKVSTRESLEEALQHTKNIQADLGGTEILSGLE 361 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A + + + + ++ +TDGE + E + N+ + Sbjct: 362 AAVEKRYQDKVLE-------------VLVLTDGEVWNQSEVFDLVNQANQQHST--RFFT 406 Query: 325 IGIRVIRSHEFL 336 +G+ SH + Sbjct: 407 LGLGDSVSHSLI 418 >gi|308502682|ref|XP_003113525.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] gi|308263484|gb|EFP07437.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] Length = 861 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 80/226 (35%), Gaps = 22/226 (9%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 +P P+ V++ LD++I D+S S+ + K K I Sbjct: 342 QLPVTPKKSFYNYPLAELDTVDTPKCQVLDIIIAFDISESLSRII---LPKYVAFAKRIV 398 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTP 261 A + N+ + G++TF++ + E L+ GV + + I + G T+ T Sbjct: 399 AQYKYKD-----NDFTRVGVLTFNDIVTEKLTLQKGVDLATINAAIDSVEYLGGLTDVTA 453 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 LK A +F + H K+++ ++D + + G Sbjct: 454 ALKAA-KDLFSKESDNAHS--------KVLIVLSDAVPTVDTYADE-IAAGQALSAAGVA 503 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + +G + N Y+ + +F+ I + I+T Sbjct: 504 TFFVGYNHYSDDVLKQLGQVTNPAYVFGDMSD--ASFNGITQQILT 547 >gi|257092459|ref|YP_003166100.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257044983|gb|ACV34171.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 769 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 70/209 (33%), Gaps = 36/209 (17%) Query: 164 VNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + L ++++LD+S+S+ D T ++++ ++++ + ++ + D Sbjct: 571 SHRTNGRNLAVLVLLDLSQSLGDKAGDGQQTVLELSQEAVSLLAWSIEQLGDA-----LA 625 Query: 223 LVTFSNKIEEF--------FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + F + + WG S ++ ++ + G ST +++A Sbjct: 626 IAGFHSDTRHDVRYLHLKGYSERWGDS-VKSRLAAMQA-GYSTRIGGAIRHA-------- 675 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 H KK+++ +TDG + L + K + + Sbjct: 676 ---AHYLAAQKAEKKLLLILTDGRPSDVDVQDERLLVEDARK---------AVGEVGRRG 723 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGK 363 C S + +S I + Sbjct: 724 IFTYCISLDRQADAYVSEIFGQRYSVIDR 752 >gi|270208337|ref|YP_003329114.1| Bee1 [Enterococcus faecalis] gi|72388797|gb|AAZ68037.1| Bee1 [Enterococcus faecalis] Length = 1083 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 49/358 (13%), Positives = 102/358 (28%), Gaps = 89/358 (24%) Query: 87 KNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ----NEGYSISAISRYKIPLKFC 142 N + S +++ N D + + ++ N ++ + I LK Sbjct: 257 TNQYTKSDSSDMIKNYNYGSAGDSDSNVEIQNILEGNLNFDNGYHAYDVGNNQNINLKKI 316 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 I + I + + +T +D+ V+D S SM ++ + ++ Sbjct: 317 VIPTDNPTQFQIQLDI---IGGSLKTRKNVDVAFVVDKSGSM-----ANNDRWKNLKSAL 368 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 N + L + VQ G+ FS+ E + + + FG + + G Sbjct: 369 NTFAHGL-LDDNPGGSVQLGIAGFSSVQETTTYFPY--YKINVPYGKVGNFGGTQGNYTG 425 Query: 263 LKYAYNQIFDMQGMRQHCNT---------------------EDANYKKIIVFMTDG---- 297 + + + + + + + +K+++ +TDG Sbjct: 426 FTTSVDAFLNHALLNESPSGGTPTFLGLDAGLELLTNSQYNGRKDAQKVLIILTDGLPTW 485 Query: 298 ----------------------ENLS----TKEDQQSLYYCN---------EAKKRGA-- 320 +N K +LY N + Sbjct: 486 GPTTTYTSSSSGIRQSGMTRSSDNNGRVETFKATNTTLYIGNGSETGPDISTVRSNTISH 545 Query: 321 ---------IV--YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + YAIG V ++ L A P Y + + + I K I Sbjct: 546 GKKRTEQNPEIDRYAIGFGVTDVNDILDALG-PQGKYSATSHTDLNYVLNQIKKKIQD 602 >gi|26340740|dbj|BAC34032.1| unnamed protein product [Mus musculus] Length = 941 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|333028467|ref|ZP_08456531.1| putative von Willebrand factor [Streptomyces sp. Tu6071] gi|332748319|gb|EGJ78760.1| putative von Willebrand factor [Streptomyces sp. Tu6071] Length = 588 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 72/220 (32%), Gaps = 46/220 (20%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+D S SM +++D+ S+ L + D GL FS +++ Sbjct: 382 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 435 Query: 233 F-------------FLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDMQG 275 GV+ + + G +T AY Q Sbjct: 436 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 495 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS------LYYCNEAKKRGAIVYAIGIRV 329 + +V +TDG N + + A+ + AI + Sbjct: 496 ADKFNA---------VVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGP 544 Query: 330 IRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ++A A+ S + V++P +++ I IV Sbjct: 545 EAAGDDVKAIGSATGGSGFKVDDPAQIHEV---INNAIVE 581 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 37/196 (18%), Positives = 72/196 (36%), Gaps = 36/196 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 70 VCLVLDKSGSM-----GGYDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDTTATIV 119 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + ++ G + L + T+ G+KYA+ I + + + Sbjct: 120 NKLIQIK-GSDERNTLMAGLPTYALGGTSICSGIKYAFQVI---GELHSQLDGSE----- 170 Query: 290 IIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYL 347 +V +TDGE+ + C +E K+ GAIV+ I + + + + S + Sbjct: 171 -VVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGSAADEAVIEMSNITGGSHFY 222 Query: 348 VENPHS---MYDAFSH 360 + + DAF Sbjct: 223 ASDEAQNNGLIDAFGA 238 >gi|187778564|ref|ZP_02995037.1| hypothetical protein CLOSPO_02159 [Clostridium sporogenes ATCC 15579] gi|187772189|gb|EDU35991.1| hypothetical protein CLOSPO_02159 [Clostridium sporogenes ATCC 15579] Length = 647 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 37/178 (20%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+++ V+D S SM + +KI+ + K++ L I D N ++S + K+ Sbjct: 436 RIEISFVIDNSGSM------NASKIEASRKALAVTLLS---IDDFNRYLKSNAEQLNQKV 486 Query: 231 EEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 E W S + R I L +T+ L+ N+I Sbjct: 487 EVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLREISNRI 545 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +Q + KI+ +TDG + ++++ E + VYA I Sbjct: 546 TSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAFQIG 596 >gi|115976464|ref|XP_001187868.1| PREDICTED: similar to Synaptotagmin IX [Strongylocentrotus purpuratus] Length = 1426 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 66/189 (34%), Gaps = 30/189 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IEE 232 ++VLD+S SME ++ + ++ ++E V + + G++ FS + Sbjct: 254 VLVLDISGSMEG------ERLVLLQQASANIIENVLPMGS-----RLGIIAFSEYATVRH 302 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + L K T+ G++ A ++ + + + Sbjct: 303 NIVEIDSQDTRRSLLNSLPKKADGRTSIGRGVELAVQKLREYESDPAGS---------TL 353 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPNSFYLVE 349 + +TDGE S Q L K V + + E L +SF + Sbjct: 354 IVITDGEQNSHPYIQDVLQ-----KTDNLTVNTVAVGGEADGELELLAVHTGGDSFSHTD 408 Query: 350 NPHSMYDAF 358 +Y +F Sbjct: 409 KSSEIYLSF 417 >gi|157412050|ref|YP_001481390.1| hypothetical protein APECO1_O1R37 [Escherichia coli APEC O1] gi|99867075|gb|ABF67720.1| conserved hypothetical protein [Escherichia coli APEC O1] Length = 670 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 69/203 (33%), Gaps = 24/203 (11%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + R K + TF + + + S+ + +++V D+S SM Sbjct: 456 QVKMRGRNKTHERGITFDGNRLILSPMGVRDVFRAQSESKNRGHVGLVLVRDISASM--- 512 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + I + + + + + + + F K EF +++ ++ + Sbjct: 513 -ANDERYIHAIKSDLA-----LSMAAESLSKMHVSNIVFPFKESEFEIIKTFDQSVEESL 566 Query: 248 KY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 L G T + L A + + D Q R KII +TDG K + Sbjct: 567 SKFTLGCKGYYTPTGSALMAAVDLLLDSQFDR-----------KIIFLITDGYPN--KSE 613 Query: 306 QQSLYYCNEAKKRGAIVYAIGIR 328 +AK G + +GI+ Sbjct: 614 FTIGEVMEKAKCNGIEIVGVGIK 636 >gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 427 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 74/230 (32%), Gaps = 37/230 (16%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 ++ R + + I++ + + + L++ ++LD S SM + M I ++ Sbjct: 16 ATQLSSQRQLAISISAVAE-QFEQNLPLNLCLILDQSGSMHG------KPLKMVIAAVER 68 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +L+ ++ + +V FS + + ++ +I+ + T G Sbjct: 69 LLDRLQPGD------RISVVAFSGSATVIIPNQIVEDPESIKTQIRKKLQASGGTVIAEG 122 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL----------STKEDQQSLYYC 312 L+ ++ +TDG + ++ + Sbjct: 123 LQQGITELMKGTRGAVSQA----------FLLTDGHGEDSLKIWKWEIGPDDSRRCQEFA 172 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 +A K + +G + + L A + +E P F+ Sbjct: 173 KKAAKINLTINTLGFGNNWNQDLLETIADAGGGTLAHIERPEQAVHHFNR 222 >gi|318059857|ref|ZP_07978580.1| hypothetical protein SSA3_18051 [Streptomyces sp. SA3_actG] gi|318076736|ref|ZP_07984068.1| hypothetical protein SSA3_08427 [Streptomyces sp. SA3_actF] Length = 588 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/220 (16%), Positives = 72/220 (32%), Gaps = 46/220 (20%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+D S SM +++D+ S+ L + D GL FS +++ Sbjct: 382 LLVVVDSSPSMAELVPGRGQSRMDVTKASLLQALAQFTSADD------IGLWEFSTRLDG 435 Query: 233 F-------------FLLEWGVSHLQRKIKYL----SKFGVSTNSTPGLKYAYNQIFDMQG 275 GV+ + + G +T AY Q Sbjct: 436 DRDYRELVPTDRLGARKGEGVTQRDKLTAAFGALQPQTGGATGLYDTTLAAYQQASKGYA 495 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS------LYYCNEAKKRGAIVYAIGIRV 329 + +V +TDG N + + A+ + AI + Sbjct: 496 ADKFNA---------VVLLTDGTNEDPGSLTRGALLTKLRDLADPARP--LPLVAIAVGP 544 Query: 330 IRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + ++A A+ S + V++P +++ I IV Sbjct: 545 EAAGDDVKAIGSATGGSGFKVDDPAQIHEV---INNAIVE 581 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 71/197 (36%), Gaps = 26/197 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SM + +++ +S++ + + I D G+VTF + Sbjct: 167 VVLVLDVSGSM-----TGQGRMERLRRSVSTYI--LSTIEDGA---WLGIVTFRGTSHKI 216 Query: 234 FLLEW-GVSHLQRKIKYLSKFG-------VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 LE ++ +I ++ G V TN T + A + Q D+ Sbjct: 217 CDLEQLNGDSVREEILNMTLDGLTNRTGKVGTNITRAVTLAVQIL---GPAVQDRKLGDS 273 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHEFLRACASPN 343 + ++ +TDG + + + AK G ++ I + L + Sbjct: 274 TGPRQMILITDGRDRRLNNSVIFMLQNDTAK--GVVIDTIALGDGAEEGLPLLSE-VTGG 330 Query: 344 SFYLVENPHSMYDAFSH 360 F+ + + A Sbjct: 331 QFFFSPDSDAGGSALDD 347 >gi|189347154|ref|YP_001943683.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341301|gb|ACD90704.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 6006 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 40/183 (21%), Positives = 64/183 (34%), Gaps = 28/183 (15%) Query: 174 MMIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++I LDVS SM + + T++D+A++SI ML + D V LV F Sbjct: 5453 LLITLDVSGSMSRNLNNDSHPTGNDPTRMDIAVESIAEMLSQ----YDYRGDVSVKLVIF 5508 Query: 227 SNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S + EW + + L G TN ++ A + + GM Sbjct: 5509 STNGQSLTTAEWVTVEEAKIMLNSLVANG-GTNYDGAIEAADDAFVNTNGMIA------- 5560 Query: 286 NYKKIIVFMTDGENLST--KEDQQSLYYCNE------AKKRGAIVYAIGIRVIRSHEFLR 337 N I F++DG + + + YA+GI + Sbjct: 5561 NADNIAYFISDGLPSLPTITAGDIGIQPAEQTIWETFLENNDVTSYAVGIGRMDVAAATS 5620 Query: 338 ACA 340 A A Sbjct: 5621 ALA 5623 >gi|156404063|ref|XP_001640227.1| predicted protein [Nematostella vectensis] gi|156227360|gb|EDO48164.1| predicted protein [Nematostella vectensis] Length = 245 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 64/189 (33%), Gaps = 32/189 (16%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++S++ + + +D+ IVLD S S+ + + N ++ L P Sbjct: 19 STSIQRETCSKK-IDLAIVLDASASVGE------RNYKLYKQFANKLISRFTLAP---GK 68 Query: 219 VQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQG 275 + + FS + L + L++K+ L T +K A G Sbjct: 69 TRIAVSYFSAYYHQVTALNDVIPKAELRQKVLELPYEASFTKINLAIKTLAAEVFTTRNG 128 Query: 276 MR-----------------QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 R K + VF+TD + S+ + Y +++ Sbjct: 129 ARLSEPAPPKGTLEDDDDDDDILKRKPGAKLVTVFLTD--SFSSSGLEAMEYRAGILREK 186 Query: 319 GAIVYAIGI 327 G ++ +G+ Sbjct: 187 GIEIFVVGV 195 >gi|148692824|gb|EDL24771.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_d [Mus musculus] Length = 943 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 277 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 381 LIILLTDGDPT 391 >gi|256420216|ref|YP_003120869.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035124|gb|ACU58668.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 211 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 64/196 (32%), Gaps = 14/196 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM +I+ + ++ +++ P V ++TF ++ Sbjct: 3 RLPVYLLLDTSGSMRG------ERIEAVKNGLQVLVSKLRQDPFALESVWISIITFDREV 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ L S +I + TN L+ +++ D Sbjct: 57 KQLLPLTALESLQLPEIT--TPESGPTNMGAALEMLCSKLDAEVAKGSDTQKGDWRPLLF 114 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYLVE 349 MTDG+ ++ + + K + + + FL+ Sbjct: 115 --LMTDGKPSDLAAFREVVP---KVKSKNLAALVACAAGAEAQDSFLKELTDNVVHLDTA 169 Query: 350 NPHSMYDAFSHIGKDI 365 + ++ F + I Sbjct: 170 DSSTLMSFFKWVSASI 185 >gi|16741341|gb|AAH16500.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] gi|62204734|gb|AAH92258.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 941 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|116623920|ref|YP_826076.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227082|gb|ABJ85791.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 300 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 83/230 (36%), Gaps = 42/230 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + S D + + +VLDVS SM K+ A ++A+L + Sbjct: 58 LEEGVPQDIASFSTEDLPVSIGLVLDVSGSMRQ-------KLATARAFLSALLGGAEDRD 110 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + L+T +++ + L + L ++ G ST + Y I M Sbjct: 111 ESL------LLTCADRPDLQTGLTPDLERLSSLVRATRSGG----STALIDTIYLSIERM 160 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG------- 326 + R N +K+++ ++DG++ ++ + + A + A +Y+I Sbjct: 161 RSAR--------NSRKVLIVVSDGQDNFSRHTRS--ELISRAIESEAQIYSIATPEPPHF 210 Query: 327 ------IRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + R L+ A+ + ++ S + + + + Sbjct: 211 QKAIELVDANRGLILLQDLAHATGGIYLPLDAASSTTAVAERLARTLHEQ 260 >gi|219848104|ref|YP_002462537.1| magnesium chelatase [Chloroflexus aggregans DSM 9485] gi|219542363|gb|ACL24101.1| Magnesium chelatase [Chloroflexus aggregans DSM 9485] Length = 696 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 53/157 (33%), Gaps = 23/157 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM + ++ ++ ++L Q GLV+F Sbjct: 514 VCFVVDASWSMAAE-----ERMQATKAAVLSLLR-----DAYQRRDQVGLVSFQRDYARV 563 Query: 234 -FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L V QR+++ + G T + GL A+ + + DA ++V Sbjct: 564 LLPLTNSVELAQRRLQSMPT-GGKTPLSRGLLTAF-------ELLERARRRDAEVVPLMV 615 Query: 293 FMTDGENL----STKEDQQSLYYCNEAKKRGAIVYAI 325 +TDG+ Q++ R I Sbjct: 616 LLTDGQANVSISDLPPQQEAYRIAEMIADRQIQAIVI 652 >gi|152991131|ref|YP_001356853.1| hypothetical protein NIS_1388 [Nitratiruptor sp. SB155-2] gi|151422992|dbj|BAF70496.1| hypothetical protein [Nitratiruptor sp. SB155-2] Length = 928 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 38/166 (22%), Positives = 67/166 (40%), Gaps = 19/166 (11%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + ++M++LDVS SM T++ +++ +L+E + D V LV FS+ Sbjct: 560 PQFNIMLILDVSGSMGWDSGDGTTRLSKEVEAAQKLLQEYSKLGD----VAVKLVLFSSD 615 Query: 230 IE---EFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + ++ W + + I L +T+ + A Q+FD + N + Sbjct: 616 VSNQAQYIPQNW--MSVDKAIGMLDNLYADGTTDYVNAIDGAM-QLFDQKDGTFFDNGAN 672 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIR 328 Y FM+DGE E +L + N + I AIG Sbjct: 673 RVY-----FMSDGEPSYGGEIDGTLQHQWENFLIQHDIIANAIGFG 713 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 82/192 (42%), Gaps = 31/192 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S D++IVLDVS+ M+ + + S+ L + + Q GL+ Sbjct: 512 DSTNAVPNDIVIVLDVSQEMKD-------SLTIVKNSLFNKLLNKDALKISKS--QYGLI 562 Query: 225 TFSNKIEEFFLLEWGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TFSN++++ L +++L IK L + + TP + ++ I + N+ Sbjct: 563 TFSNQVKQEIPLTDNITNLNDNYIKSL----ATDSLTPNISKTFDSITKV------LNSG 612 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA----- 338 A+ KK ++F++ G+ T + + + +G + ++ + + + L Sbjct: 613 RADAKKNVIFISTGQASYTDSELS------KLRDKGYNIVSLSMNNKTTEDSLYKFHNLL 666 Query: 339 CASPNSFYLVEN 350 + + ++ V N Sbjct: 667 AGNADGYFNVVN 678 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 48/134 (35%), Gaps = 22/134 (16%) Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 S E ++ KID ++ N +E +K +P++ G+V +S+ + G Sbjct: 135 TSGEHNITANNKKIDELKRAANGFIERMKDVPNLK----IGIVAYSSIATINPNSKSGTK 190 Query: 242 HLQRKIKYLSKFGVSTNSTPG---------LKYAYNQIFDMQGMRQH---------CNTE 283 ++ S + NS L N + + G ++ Sbjct: 191 KVKSLDSNSSHDVTNYNSLGANFLQSNDSRLTSVINNLEALGGTNIGEGMRKAVYMLDSG 250 Query: 284 DANYKKIIVFMTDG 297 D + K IV MTDG Sbjct: 251 DKSASKTIVLMTDG 264 >gi|226531069|ref|NP_001152771.1| inter alpha-trypsin inhibitor, heavy chain 4 isoform 2 [Mus musculus] Length = 941 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|148692823|gb|EDL24770.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_c [Mus musculus] Length = 927 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 277 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 381 LIILLTDGDPT 391 >gi|2739028|gb|AAC25786.1| PK-120 precursor [Mus musculus] Length = 942 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|300933821|ref|ZP_07149077.1| hypothetical protein CresD4_07088 [Corynebacterium resistens DSM 45100] Length = 676 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 69/198 (34%), Gaps = 27/198 (13%) Query: 175 MIVLDVSRS-MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++V+D+S S M T++D A K+ +++ + PD N G+V + + E Sbjct: 53 VLVMDLSDSMMTKDAGGGGTRLDAAKKAATGLIDAL---PDSAN---MGMVVYG-QQESN 105 Query: 234 FLL--EWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 G ++ + ++K + + Y I + + ++ Sbjct: 106 APNNRAAGCKDVETISPVGPINKGELKDRISNFKAKGYTPI--GNSLLKAAEELGKEGER 163 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKK-RG----AIVYAIGIRVIRSHEFLRAC---AS 341 IV ++DG + C AKK G ++ +G R C S Sbjct: 164 SIVLVSDGHDTCAPP-----PVCEVAKKLAGEGYNLTIHTVGFHADRKARKELECIAKTS 218 Query: 342 PNSFYLVENPHSMYDAFS 359 + EN + ++ Sbjct: 219 GGQYLSAENASELSNSMK 236 >gi|226531047|ref|NP_061216.2| inter alpha-trypsin inhibitor, heavy chain 4 isoform 1 [Mus musculus] gi|26340986|dbj|BAC34155.1| unnamed protein product [Mus musculus] gi|148692825|gb|EDL24772.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_e [Mus musculus] gi|151357764|emb|CAO78004.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|63100270|gb|AAH94457.1| Inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 942 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|47189394|emb|CAF87133.1| unnamed protein product [Tetraodon nigroviridis] Length = 123 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/107 (17%), Positives = 35/107 (32%), Gaps = 13/107 (12%) Query: 222 GLVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 GL +S + L S Q ++ G +T + L + ++ + Sbjct: 7 GLSQYSGDPRTEWQLNSFPSKQQLLEAVRNFKYKGGNTFTGQAL------LHVLEENLRP 60 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 A +V +TDG K ++ K G + A+G Sbjct: 61 EAGARAAAPTFLVLLTDG-----KSQDDAVAAAGRLKAAGVEILAVG 102 >gi|297480083|ref|XP_002691213.1| PREDICTED: von Willebrand factor A domain containing 3B [Bos taurus] gi|296482857|gb|DAA24972.1| von Willebrand factor A domain containing 3B [Bos taurus] Length = 1347 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM K+D+ + +++E N VQ + + + Sbjct: 508 VYILIDTSHSM-------TGKLDLVKNKVIQLIQEQLKHKKKFNFVQFDAQASAWQEKLV 560 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ L+ A+ ++I Sbjct: 561 EINEDNLKGAQAWIRDIK-IGSSTNTLSALQIAFAD----------------EETQVIYL 603 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG DQ ++ K + +Y I ++ FL+ AS Sbjct: 604 LTDGRP-----DQSPEMVMDQVKVFQKIPIYTISFCYNDEIANGFLKDLAS 649 >gi|221126641|ref|XP_002157470.1| PREDICTED: similar to microneme 1 [Hydra magnipapillata] Length = 403 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 59/150 (39%), Gaps = 13/150 (8%) Query: 219 VQSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 V+ ++ +S+ +I W K+ + T + LK A +F + Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCNE 65 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 QH N K+++ +TDG++ K Q + + + ++G+ E Sbjct: 66 CSLQH------NIPKLLIVLTDGQSTFPKLTQFEAHLIKV--ENNLTIISVGVSDQVDIE 117 Query: 335 FLRACASP-NSFYLVENPHSMYDAFSHIGK 363 L++ A+ + +L+ + + D + I K Sbjct: 118 ELKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|148380001|ref|YP_001254542.1| hypothetical protein CBO2040 [Clostridium botulinum A str. ATCC 3502] gi|153932216|ref|YP_001384299.1| hypothetical protein CLB_1981 [Clostridium botulinum A str. ATCC 19397] gi|153936064|ref|YP_001387836.1| hypothetical protein CLC_1986 [Clostridium botulinum A str. Hall] gi|148289485|emb|CAL83582.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC 3502] gi|152928260|gb|ABS33760.1| hypothetical protein CLB_1981 [Clostridium botulinum A str. ATCC 19397] gi|152931978|gb|ABS37477.1| hypothetical protein CLC_1986 [Clostridium botulinum A str. Hall] Length = 647 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 92/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFWKK-LIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + + YK L T + + Sbjct: 386 KDGQVKGKLDVDSFINFYPDFVEAEKKGNYK-NLPIFNRYLLETQADILP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+ + K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEASRKALAVTLLS---IDDFNKYLKSNAEQ 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S + R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIIKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ +Q + KI+ +TDG + ++++ E + VYA Sbjct: 541 ISNRMTSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 66/170 (38%), Gaps = 17/170 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM K+ +++ +L +++ N V+ G V Sbjct: 274 VVFVIDKSGSMAG------RKMRQTREAMVQILGDLRPEDQFNLVIFDGHVF----QWMP 323 Query: 234 FLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 LL+ ++++ K L +TN + A ++ D ++ + + Sbjct: 324 ALLQASSQNVEQAKKFTSLISAMGATNINDAVLLAV-KMLDDSNRKEKLPPGSVSM---V 379 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACA 340 + +TDG+ + + + + +A G +Y +G ++ FL A Sbjct: 380 ILLTDGDATDGETNPKKIQENVKAAIGGSYHLYCLGFGFDVNYAFLEKLA 429 >gi|67473840|ref|XP_652669.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56469543|gb|EAL47283.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 390 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 20/165 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ + D + SM S+ +++ I+ I +I I V+ LV + + Sbjct: 4 LDLVFLCDTTGSMGSYLNAAQQSIEKIINTI---------IQSEKCNVRFALVEYKDHPP 54 Query: 232 EFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCN 281 + + ++ ++ + + G + A N + + + Sbjct: 55 QDSSFAFRLTDFTDSMKCIKEAVNKMDANGGGDTPESVCCAMHCASNLKYREKAAKVIVW 114 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA I DG L E + ++Y +G Sbjct: 115 IGDAPPHGFI-REGDGFPKGCPCGCDLLKEVRECMNKDIVIYCVG 158 >gi|67482407|ref|XP_656553.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56473758|gb|EAL51167.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 382 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 20/165 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ + D + SM S+ +++ I+ I +I I V+ LV + + Sbjct: 4 LDLVFLCDTTGSMGSYLNAAQQSIEKIINTI---------IQSEKCNVRFALVEYKDHPP 54 Query: 232 EFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCN 281 + + ++ ++ + + G + A N + + + Sbjct: 55 QDSSFAFRLTDFTDSMKCIKEAVNKMDANGGGDTPESVCCAMHCASNLKYREKAAKVIVW 114 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA I DG L E + ++Y +G Sbjct: 115 IGDAPPHGFI-REGDGFPKGCPCGCDLLKEVRECMNKDIVIYCVG 158 >gi|309356625|emb|CAP36866.2| hypothetical protein CBG_19659 [Caenorhabditis briggsae AF16] Length = 346 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/182 (17%), Positives = 65/182 (35%), Gaps = 22/182 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S M T + I S+ +++ + P+ + G+VT++ Sbjct: 39 LDIVCVVDNSAGM--------TNVAALISSLFVDGQQLGIQPNNPRTTRVGIVTYNQGAH 90 Query: 232 EFFLLEWGVS---HLQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDANY 287 L S + + + L++ S++S A N + + Q NT +Y Sbjct: 91 VIADLNNFTSVDELIDKTFRVLNQVSTSSDSYLHAGLEAANDLLEQ----QSFNTARGHY 146 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNS 344 +K+++ T Q L K+ + + L A+P Sbjct: 147 QKLVIVYT--SEYKGTGSQDPLPLATRMKQT-VSIATVAYNREDDLGVLSELTKIATPGY 203 Query: 345 FY 346 + Sbjct: 204 NF 205 >gi|282896561|ref|ZP_06304580.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] gi|281198552|gb|EFA73434.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] Length = 587 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 51/177 (28%), Gaps = 27/177 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +M V+D S SM ++++ L + N V G Sbjct: 403 KTQKDAGKTVYLMTVIDTSGSMS----------GGPLEAVKNGLRIASQQINPGNYV--G 450 Query: 223 LVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV++ ++ L I L G + Y+ + Sbjct: 451 LVSYGDQPINLVKLAPFDDLQHKRFLAGIDGLEADGATAM--------YDGVMVGLSELL 502 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + N K ++ +TDG+ +Q G VY I + E Sbjct: 503 QQRKTNPNGKFYLLLLTDGQTNQGFNFEQVKEIIEY---SGVRVYPIAYGEVNEAEL 556 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 62/176 (35%), Gaps = 27/176 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K + +D++ LD+S+SM D + +++ + ++E++ + Sbjct: 79 GTKTQAVRRQSIDIIFALDISQSMLCQ-DIAPSRLIQGQRLCQQLIEKLSGN-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + L + ++ + T+ L A G R Sbjct: 131 LGVILFAGEAYMQVPLTTDYEAVSLLLQSANPDMISSQGTSIGEALAIAQTNTSKSNGNR 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 +++ +TDGE+ + + Q+ + G ++ IGI Sbjct: 191 ------------VVLVITDGEDHEARAEAQARQA----ARAGMKIFTIGIGSEEGG 230 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 62/149 (41%), Gaps = 22/149 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM + +I ++ ++ + + + + G V + Sbjct: 264 KDITPLDLVLVVDWSGSMNNN-----DRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 317 Query: 227 SNK--IEEFFLLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ ++ G ++ ++K ++ T + GL+ A + + G Sbjct: 318 SSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNG------ 371 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLY 310 +KK+IV +TDG + + Q+ Sbjct: 372 -----HKKVIVLLTDGVPTFSYKVQRVRA 395 >gi|313151177|ref|NP_001100761.2| inter-alpha-trypsin inhibitor heavy chain H1 [Rattus norvegicus] gi|149034204|gb|EDL88974.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_a [Rattus norvegicus] gi|187469475|gb|AAI66831.1| Itih1 protein [Rattus norvegicus] Length = 904 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 71/196 (36%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +L ++K + + + LV F ++++ Sbjct: 287 LVFVIDISGSMEG------QKVKQTKEALLKILGDMKPVDNFD------LVLFGSQVQSW 334 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++ S +TN GL + QG ++ + Sbjct: 335 KGSLVPASHANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNRAQGSHPELSSPAS--- 390 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL + N+ Sbjct: 391 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWA 449 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 450 QRIYEDHDATQQLQGF 465 >gi|77413359|ref|ZP_00789553.1| cell wall surface anchor family protein [Streptococcus agalactiae 515] gi|77160601|gb|EAO71718.1| cell wall surface anchor family protein [Streptococcus agalactiae 515] gi|221193497|gb|ACM07781.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193501|gb|ACM07783.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193513|gb|ACM07789.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193519|gb|ACM07792.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193529|gb|ACM07797.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193531|gb|ACM07798.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193561|gb|ACM07813.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193581|gb|ACM07823.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193589|gb|ACM07827.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193597|gb|ACM07831.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193603|gb|ACM07834.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193607|gb|ACM07836.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193609|gb|ACM07837.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193611|gb|ACM07838.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193613|gb|ACM07839.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193617|gb|ACM07841.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193621|gb|ACM07843.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193639|gb|ACM07852.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193643|gb|ACM07854.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193663|gb|ACM07864.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193665|gb|ACM07865.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193667|gb|ACM07866.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193680|gb|ACM07872.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193699|gb|ACM07881.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193711|gb|ACM07887.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193719|gb|ACM07891.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193725|gb|ACM07894.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193743|gb|ACM07903.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193745|gb|ACM07904.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193747|gb|ACM07905.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193751|gb|ACM07907.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193753|gb|ACM07908.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|22537555|ref|NP_688406.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|76797665|ref|ZP_00779934.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|22534437|gb|AAN00279.1|AE014256_11 cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|76586959|gb|EAO63448.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|221193505|gb|ACM07785.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193507|gb|ACM07786.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193517|gb|ACM07791.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193523|gb|ACM07794.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193537|gb|ACM07801.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193549|gb|ACM07807.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193553|gb|ACM07809.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193557|gb|ACM07811.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193559|gb|ACM07812.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193563|gb|ACM07814.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193571|gb|ACM07818.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193583|gb|ACM07824.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193587|gb|ACM07826.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193591|gb|ACM07828.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193593|gb|ACM07829.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193599|gb|ACM07832.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193605|gb|ACM07835.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193627|gb|ACM07846.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193651|gb|ACM07858.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193653|gb|ACM07859.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193674|gb|ACM07869.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193691|gb|ACM07877.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193707|gb|ACM07885.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193713|gb|ACM07888.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193715|gb|ACM07889.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193717|gb|ACM07890.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193723|gb|ACM07893.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193729|gb|ACM07896.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193731|gb|ACM07897.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193735|gb|ACM07899.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193739|gb|ACM07901.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.9 bits (107), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|301156267|emb|CBW15738.1| ppka [Haemophilus parainfluenzae T3T1] Length = 650 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 27/181 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 ++ V+D + SM+ + I+ +I + E+++ ++ V+ GLV F + Sbjct: 217 GFSAAVVFVIDSTISMDPY-------INRTRDAIKKVYEKIEK-ENLGKQVKFGLVAFRS 268 Query: 229 KIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-AYNQIFDMQGMRQ 278 + LE+ + + ++ +T S+ AY + Sbjct: 269 STKAVPGLEYTSKMFVDPSTVKDGKDFMDKVANLKQATVSSKEFSEDAYAGVSQALNEIN 328 Query: 279 HCNTEDANYKKIIVFMTD-----GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N + +V +TD G+N + + EA+ RG +Y + ++ Sbjct: 329 WNNFGG----RYLVLITDAGAIEGDNPISTTGLDAKQLRLEAQHRGVALYTLHLKTPSGK 384 Query: 334 E 334 Sbjct: 385 N 385 >gi|260575021|ref|ZP_05843022.1| von Willebrand factor type A [Rhodobacter sp. SW2] gi|259022643|gb|EEW25938.1| von Willebrand factor type A [Rhodobacter sp. SW2] Length = 558 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 38/201 (18%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----- 229 +IVLD S SM D K+++A ++++ +L V + GL+ + ++ Sbjct: 32 IIVLDGSGSMWGQIDGRA-KLEIAREALSGVLAGVAPE------TELGLMVYGHRSKGDC 84 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 IE G +F T T ++ A ++ + Sbjct: 85 NDIELMVPPAAGTGPAIAAAAAGMQFLGKTPLTEAVRRAAAELRSTEE------------ 132 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIRVIRSHEFLRAC---A 340 K ++ +TDG D E + G V G+ + AC A Sbjct: 133 KATVILITDG---IETCDADPCALGAELEASGVDFTAHVVGFGLTAEEGKQV--ACLADA 187 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 + + ++ S+ +A Sbjct: 188 TGGLYIEAKDAGSLTEALKTT 208 >gi|221193678|gb|ACM07871.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193701|gb|ACM07882.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|221193645|gb|ACM07855.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|332290703|ref|YP_004429312.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332168789|gb|AEE18044.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 351 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 61/196 (31%), Gaps = 10/196 (5%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEF 233 +VLD+S S+ S T + ++ ++ + V + + + F + E Sbjct: 116 LVLDLSNSVLS------TSLQELKQASSSFISNVMPAIP-TDSFKMAIYWFDGEDVLHEL 168 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + LQ I +++ + ST + ED +V Sbjct: 169 QPLTTSAAMLQEAIDGITEDISNDPSTDLYGAVIKAAEIADNIIDVSENEDLFAAASVVI 228 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 TDG + + + ++ + IG+ L S + N Sbjct: 229 FTDGTDQAARYTEEEAVDAVSNADDDVSFFTIGLGSEIDEGILSKIGQTESVF-ASNAAE 287 Query: 354 MYDAFSHIGKDIVTKR 369 + F+ I + + Sbjct: 288 LESVFNDISSGVAGQA 303 >gi|296118874|ref|ZP_06837447.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] gi|295967972|gb|EFG81224.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] Length = 674 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 61/186 (32%), Gaps = 14/186 (7%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEF 233 +++D S SM + T++D A ++ N ++ + G++ + ++ Sbjct: 59 ALIMDASDSMLAEDVDGGTRLDAAKQAANQLVNSLP------ETAVMGMLAYGASGSNAP 112 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 E G + S G A +R + + + I+ Sbjct: 113 DNRERGCQDIDVLAPVERIDNEELKSEIGALEAQGYTPMGNALRAAADELGSEGDRSIIL 172 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRG--AIVYAIGIRVIRSHEFLRAC---ASPNSFYLV 348 ++DG + + E G ++ +G + + + C AS ++ Sbjct: 173 VSDGIDTCAPP--PACEVAEELAGDGFDLAIHTVGFKPDEAAQAELECISEASGGTYVEA 230 Query: 349 ENPHSM 354 EN + Sbjct: 231 ENAEEL 236 >gi|302142112|emb|CBI19315.3| unnamed protein product [Vitis vinifera] Length = 626 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 26/140 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM +K+ M +++ ++ + + +V FS Sbjct: 284 RAPIDLVTVLDVSASMTG------SKLQMLKRAMRLVISSLGPSD------RLAIVAFSA 331 Query: 229 KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L G +R I L ++ L+ A + R+ N Sbjct: 332 SPRRLLPLRRMTAHGQRSARRIIDRLVCS-QGSSVGEALRKATKVL----EDRRERNPVA 386 Query: 285 ANYKKIIVFMTDGENLSTKE 304 + I+ ++DG++ Sbjct: 387 S-----IMLLSDGQDDRVHS 401 >gi|225458992|ref|XP_002285586.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 738 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 26/140 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM +K+ M +++ ++ + + +V FS Sbjct: 334 RAPIDLVTVLDVSASMTG------SKLQMLKRAMRLVISSLGPSD------RLAIVAFSA 381 Query: 229 KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L G +R I L ++ L+ A + R+ N Sbjct: 382 SPRRLLPLRRMTAHGQRSARRIIDRLVCS-QGSSVGEALRKATKVL----EDRRERNPVA 436 Query: 285 ANYKKIIVFMTDGENLSTKE 304 + I+ ++DG++ Sbjct: 437 S-----IMLLSDGQDDRVHS 451 >gi|254491098|ref|ZP_05104279.1| von Willebrand factor type A domain protein [Methylophaga thiooxidans DMS010] gi|224463611|gb|EEF79879.1| von Willebrand factor type A domain protein [Methylophaga thiooxydans DMS010] Length = 326 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 52/152 (34%), Gaps = 26/152 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSI------TKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++T + ++ VLD S SM F +K A + ++ + N Sbjct: 76 TRTGSGAHIVFVLDRSASMNETFGGETPDEDEQSKAKAARRILSNFVT--------NRPH 127 Query: 220 QS-GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMR 277 G+ FS + L + +Q I + G++ TN GL + Sbjct: 128 DLFGVAGFSTQPFYISPLTEHKTAIQAAINSMETPGLAFTNVAKGLGMGLSYFKAQPHTG 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 ++IV ++DG Q++L Sbjct: 188 S----------RVIVLVSDGAATLDHRAQKTL 209 >gi|221193615|gb|ACM07840.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|221193525|gb|ACM07795.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 69/186 (37%), Gaps = 26/186 (13%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + G + I + + + +I GM +++ + L +DR+ + AA Sbjct: 11 FQRFRNDEDGALIIFSLMMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA----- 65 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 L +I R+ L G ++ + + V + Sbjct: 66 --------------DLDQIMPP-ADVVRDYLDKAGMLHFLQGEPTVSQGINYRVVSAQAS 110 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 + A+ Y +P F + + ++ T+ +V +++ +VLDVS SM S Sbjct: 111 APMALFFYDLPRIFTSPFSPGMTAINVSGASTAEERVT-----DVEVSLVLDVSSSMNS- 164 Query: 188 FDSSIT 193 DS+I Sbjct: 165 LDSNIK 170 >gi|147816411|emb|CAN77438.1| hypothetical protein VITISV_007401 [Vitis vinifera] Length = 757 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 26/140 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +D++ VLDVS SM +K+ M +++ ++ + + +V FS Sbjct: 353 RAPIDLVTVLDVSASMTG------SKLQMLKRAMRLVISSLGPSD------RLAIVAFSA 400 Query: 229 KIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L G +R I L ++ L+ A + R+ N Sbjct: 401 SPRRLLPLRRMTAHGQRSARRIIDRLVCS-QGSSVGEALRKATKVL----EDRRERNPVA 455 Query: 285 ANYKKIIVFMTDGENLSTKE 304 + I+ ++DG++ Sbjct: 456 S-----IMLLSDGQDDRVHS 470 >gi|158321348|ref|YP_001513855.1| magnesium chelatase [Alkaliphilus oremlandii OhILAs] gi|158141547|gb|ABW19859.1| Magnesium chelatase [Alkaliphilus oremlandii OhILAs] Length = 629 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 35/281 (12%), Positives = 83/281 (29%), Gaps = 38/281 (13%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS---FRNELRDNGFVNDIDDIVRSTSL 116 +A + ++ G+ + + + + F +L + + S Sbjct: 319 SSADKPISNGDYDGDSEHGDSSAEGSVSSHTVEDIEAFGEDLSMDMVWQSRFAVKGSGKR 378 Query: 117 DIVVVPQNEGYSIS--------AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN--S 166 + V EG I + + N + + I S Sbjct: 379 NKVKTDSKEGRYIRYRIPKGRPKDIAFDATFRIAACSQGGRNREGLSLVIRSGDIREKVR 438 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ V+D S SM + ++ ++ ++L G++ F Sbjct: 439 EKHTGATILFVVDASGSMGAKR-----RMGAVKGAVLSLLN-----DAYQKRDNVGIIAF 488 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L + K L+ G T GL AY + + ++ Sbjct: 489 --RKDGADTLLNITRSVDLAQKCLTNLPTGGKTPLASGLYKAY-------ELLKIDRIKN 539 Query: 285 ANYKKIIVFMTDGENL----STKEDQQSLYYCNEAKKRGAI 321 A+ + IV ++DG+ S + + + + + Sbjct: 540 ADALQYIVLVSDGKGNVPLFSENAIEDAYHVGEKIRNENIK 580 >gi|326430897|gb|EGD76467.1| hypothetical protein PTSG_07584 [Salpingoeca sp. ATCC 50818] Length = 985 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 74/219 (33%), Gaps = 29/219 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVVQ-----SG 222 + + +V+DVS SM+ + K + K L+ VK + N +Q Sbjct: 218 KTPVHICLVIDVSGSMDRH-ATQRNKYNQLEKFEQTYLDIVKHAARSIANFLQNEDNYLS 276 Query: 223 LVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS +E GVS + L TN G+ + Sbjct: 277 VVAFSENASVTLPMERMTEAGVSRATDAVGALQPC-SCTNLGDGVLRGMQLLLKG----- 330 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSL----YYCNEAKKRGAIVYAIGIRVIRSHE 334 T+ + +++ +TDGE + + L A++ + G I S Sbjct: 331 ---TDTTKAQPVLMVLTDGEPNEGNDARDVLRSFRNTYKHARRFIVNTFGFGFEQIDS-P 386 Query: 335 FLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 L A+ ++ V + + AF I + + + Sbjct: 387 LLSELAALGGGTYAFVPDSSFVGTAF--INATVAAQLVL 423 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 35/75 (46%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R F+ +++G + LT I +P++ +II+V + T L + +D + A ++ Sbjct: 6 VRRFWNDHRGYVIALTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGARELDG 65 Query: 68 EGNGNNRKKLKGGDI 82 + R + I Sbjct: 66 RDDAITRAQTAIEKI 80 >gi|218189604|gb|EEC72031.1| hypothetical protein OsI_04925 [Oryza sativa Indica Group] Length = 1128 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 62/179 (34%), Gaps = 10/179 (5%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ + K+ + +V + S A +D++ VLDVS Sbjct: 9 TVQVTAYAKVDAVHVDAVKQRPGVPVLVRVVAPPPAAASSERAPIDLVAVLDVSC---CG 65 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + ++D+ K++ +++++ + V + K + + G R + Sbjct: 66 GLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMV 125 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + LK A + +G + H IV ++DG++ S D Sbjct: 126 QSSLTVTGENKLSTALKKAATIL---EGRKDHDKK----RPGFIVLISDGDDASVLNDA 177 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 64/172 (37%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM TK+ +++N +L++++ ++ ++S+ + Sbjct: 237 IVFIIDKSGSMGG------TKMRQTKQAMNTILKDLR----DHDRFNVMPFSYSSTMWRP 286 Query: 234 FLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + ++ G TN + A + + + + Sbjct: 287 NEMVLATRENIESARTYVRRSINAGGGTNINQAIIDAADLLRRVTDDQ----PNSPRSAS 342 Query: 290 IIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +I+F+TDG ++ + + + A + ++ +G FL A Sbjct: 343 LIIFLTDGLPSVGESKPRNIMVNVKNAIREQVSLFCLGFGKDVDFPFLEKMA 394 >gi|193788347|dbj|BAG53241.1| unnamed protein product [Homo sapiens] Length = 325 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 57/161 (35%), Gaps = 19/161 (11%) Query: 219 VQSGLVTFSNKIEEFFL-LEWGVSHLQRKIKYLS-------KFGVSTNSTPGLKYAYNQI 270 V ++TF+++ + L + I L + G TN+ L Y + Sbjct: 78 VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHENGTGTNTYAALNSVYLMM 137 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYA 324 + + + I+ +TDG++ + + + + N+ + +YA Sbjct: 138 NNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYA 197 Query: 325 IGIRV-----IRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 IG+ +E +++++ +++ F H Sbjct: 198 IGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 238 >gi|47207521|emb|CAG14087.1| unnamed protein product [Tetraodon nigroviridis] Length = 1048 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 28/138 (20%), Positives = 52/138 (37%), Gaps = 23/138 (16%) Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMTDGENLS 301 ++ + G T + + A + F + G R+ K+++ +TDGE+ Sbjct: 26 VEAAKNISRQEGRETRTAYAIHKACTEAFSPERGAREGA-------TKVMIVVTDGESHD 78 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSHE-------FLRACAS---PNSFYLVE 349 +E +L C R YAI + IR + ++ AS F+ V Sbjct: 79 GEELPDALQECE---DRNITRYAIAVLGHYIRRQQDPETFISEIKYIASDPDDKYFFNVT 135 Query: 350 NPHSMYDAFSHIGKDIVT 367 + ++ D +G I T Sbjct: 136 DEAALNDIVDALGDRIFT 153 >gi|32474270|ref|NP_867264.1| en/Spm-like transposon protein [Rhodopirellula baltica SH 1] gi|32444808|emb|CAD74810.1| similar to En/Spm-like transposon protein [Rhodopirellula baltica SH 1] Length = 850 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 62/171 (36%), Gaps = 12/171 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL---IPDVNNVVQSGLV 224 T ++ +D S SM S + D +S+ + + + I + + ++ Sbjct: 106 TGQPARLIFAVDQSLSMGVSDSSDLQSPDRLQRSLERLFGKDEKDGWIESLRSTHLIDVM 165 Query: 225 TFS---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR---- 277 +F + + + E + + G +TN + L+ + + Sbjct: 166 SFDATAHSRWKSYDTEDAATESPSGTFTIEANGSTTNLSAALRPVLPATATSEPNQSRDI 225 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 N+E +V MTDG + + +D L + + + G ++ +G+ Sbjct: 226 ASNNSETTGASLAVVLMTDGRDSAGVDDASDLAW--QLSQSGWQIHTLGVG 274 >gi|221369934|ref|YP_002521030.1| hypothetical protein RSKD131_4097 [Rhodobacter sphaeroides KD131] gi|221162986|gb|ACM03957.1| Hypothetical Protein RSKD131_4097 [Rhodobacter sphaeroides KD131] Length = 1043 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 57/161 (35%), Gaps = 22/161 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGL--VTFSNK 229 + I LDVS SM S T++ + A++ E+ + PD N ++ L + Sbjct: 260 IYITLDVSGSM------SGTRMAAQKAGVAALIREIGASVDPDRPNDIRIVLWNAGLAGS 313 Query: 230 IEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 IE + + L+ + LS TN A + F G ++ Sbjct: 314 IERRNMEPDDYTALEDWMLALSNSTSGGTNFNAAFAEA-STFFAGGGSKR---------- 362 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +I++F+TDGE + + I + Sbjct: 363 RIVIFVTDGEPSPVSSVDAAEATIASLPPADIFGFNIALAN 403 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 47/277 (16%), Positives = 91/277 (32%), Gaps = 28/277 (10%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 N + I D + L I + + + + +F + + Sbjct: 319 NSYQAVIMDNIPQEDLSIDKMEILKNFVTEKGGTLLVLGGDKSFSAGNYQGTPLEEILPV 378 Query: 161 SVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ + + ++IVLD S SM S+ + K+++A +S +L+ ++ Sbjct: 379 TLRPEQILKKSNVAIVIVLDASGSMGSYSGGDM-KMELAKESAQLVLDLLEEKD------ 431 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F + + L+ I +S G T P LK A + Sbjct: 432 YFGLIAFDHSYQWIVPLQPLTNKEETASLISKISP-GGGTALYPPLKSAGEALIKAPIKS 490 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +H I+ +TDG+ Y V IGI + L+ Sbjct: 491 KH-----------IIAITDGQTEGGDFYNLVKYLAKY----KITVSTIGIGEDANIPLLK 535 Query: 338 ACASPNS--FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A+ + FY N ++ K ++ I Sbjct: 536 DIANWGNGRFYHTWNIRNLPQLLLSETKALLRSNIVE 572 >gi|149034205|gb|EDL88975.1| inter-alpha trypsin inhibitor, heavy chain 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 899 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 71/196 (36%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SME K+ +++ +L ++K + + + LV F ++++ Sbjct: 287 LVFVIDISGSMEG------QKVKQTKEALLKILGDMKPVDNFD------LVLFGSQVQSW 334 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++ S +TN GL + QG ++ + Sbjct: 335 KGSLVPASHANLQAAQDFVRRFS-LAGATNLNGGLLRGIEILNRAQGSHPELSSPAS--- 390 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDGE + Q L A + +Y +G FL + N+ Sbjct: 391 -ILIMLTDGEPTEGETDRSQILKNVRNAIRGRFPLYNLGFGHDLDFSFLEVMSIENNGWA 449 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 450 QRIYEDHDATQQLQGF 465 >gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens CN-32] gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32] Length = 757 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 39/199 (19%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +VM + ++ + ++++V+D S SM I A ++ L +K Sbjct: 356 YSLVMVLPPKTDEHALSTLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKA 409 Query: 212 IPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHL---QRKIKYLSKFGVSTNS---TPGL 263 N ++ F++++ + L +HL ++ I L G + S L Sbjct: 410 EDSFN------IIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMSLALNAAL 463 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAI 321 N++ + + ++FMTDG N D Sbjct: 464 PRGINRLSESSQSLRQ-----------VIFMTDGSVGNEQALFDLIRYQIGES------R 506 Query: 322 VYAIGIRVIRSHEFLRACA 340 ++ +GI + F++ A Sbjct: 507 LFTVGIGSAPNSHFMQRAA 525 >gi|77411517|ref|ZP_00787861.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|77162443|gb|EAO73410.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|221193483|gb|ACM07774.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193485|gb|ACM07775.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193487|gb|ACM07776.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193491|gb|ACM07778.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193503|gb|ACM07784.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193509|gb|ACM07787.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193511|gb|ACM07788.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193515|gb|ACM07790.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193527|gb|ACM07796.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193535|gb|ACM07800.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193541|gb|ACM07803.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193545|gb|ACM07805.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193551|gb|ACM07808.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193565|gb|ACM07815.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193569|gb|ACM07817.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193573|gb|ACM07819.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193579|gb|ACM07822.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193585|gb|ACM07825.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193601|gb|ACM07833.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193619|gb|ACM07842.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193629|gb|ACM07847.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193631|gb|ACM07848.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193635|gb|ACM07850.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193637|gb|ACM07851.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193655|gb|ACM07860.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193669|gb|ACM07867.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193676|gb|ACM07870.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193683|gb|ACM07873.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193685|gb|ACM07874.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193693|gb|ACM07878.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193695|gb|ACM07879.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193703|gb|ACM07883.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193705|gb|ACM07884.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193709|gb|ACM07886.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193721|gb|ACM07892.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193727|gb|ACM07895.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193749|gb|ACM07906.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|221193567|gb|ACM07816.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|323351676|ref|ZP_08087330.1| peptidoglycan binding domain protein [Streptococcus sanguinis VMC66] gi|322122162|gb|EFX93888.1| peptidoglycan binding domain protein [Streptococcus sanguinis VMC66] Length = 450 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 60/200 (30%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE----------------EFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQI 270 +++ + L++ + + G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPTELEKALAAIRIASGGDTPETPT--PALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|221193661|gb|ACM07863.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|213962900|ref|ZP_03391160.1| von Willebrand factor type A [Capnocytophaga sputigena Capno] gi|213954557|gb|EEB65879.1| von Willebrand factor type A [Capnocytophaga sputigena Capno] Length = 347 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 40/196 (20%), Positives = 68/196 (34%), Gaps = 16/196 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +LDVS SM I+ + +++E+K P V ++ F+ K Sbjct: 3 RLPIYFLLDVSESM------VGDPIEHVQDGMATIIKELKADPFALETVWLSIIGFAGKS 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L+ ++ KI G T+ GL N I D Sbjct: 57 KVITPLQDIITFYPPKI----PIGGGTSLASGLNELMNAIDREVVKTTLERKGDWKP--- 109 Query: 291 IVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +VF+ TDG Q++ N +R + AI + ++ L + Sbjct: 110 LVFLFTDGIPTDDPA--QAIERWNAHYRRKVNLVAISLGENTNYNLLGQLTDQVLQFNNT 167 Query: 350 NPHSMYDAFSHIGKDI 365 N + + F I I Sbjct: 168 NAAAYKEFFKWITASI 183 >gi|260827156|ref|XP_002608531.1| hypothetical protein BRAFLDRAFT_92387 [Branchiostoma floridae] gi|229293882|gb|EEN64541.1| hypothetical protein BRAFLDRAFT_92387 [Branchiostoma floridae] Length = 1634 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 42/202 (20%), Positives = 65/202 (32%), Gaps = 32/202 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + S L + V+D S SM + + ++T I M++ +K Q Sbjct: 180 EAASMRRTPLRFVAVIDESGSMNNKVGEDNMTLIQRMKVFAELMVQNLKEDD------QM 233 Query: 222 GLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +VTF+ I+ + G + IK L G TN + GL A G Sbjct: 234 AIVTFATDIQVKLPMTQLNEDGKAQALEAIKTLRTRGQ-TNLSDGLLAALE--MFQSGGL 290 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--------AIVYAIGIRV 329 H IV TDG + L KK + IG Sbjct: 291 FHNG---------IVLFTDGAANQGITNADHLIQAFNEKKTSVCGEACIPISTFTIGQYR 341 Query: 330 IR-SHEFLRACASPNSFYLVEN 350 +E + S F+L ++ Sbjct: 342 PHLLYEVSQKLGSDAFFWLSDD 363 >gi|126433420|ref|YP_001069111.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126233220|gb|ABN96620.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 233 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S ++ D R+ +++ DVS SM+ I + A ++ + V Sbjct: 7 SVALNDANPDPRVACVVLADVSGSMQG------EPIAALERGFAAFTRYLQNEVLASKRV 60 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + +VTF + ++ + L + + T + + D+ R+H Sbjct: 61 EVAVVTFGTVATVLVPM--------QEARTLQPVAFTASGTTNMAAGIHLALDILEDRKH 112 Query: 280 CNTED--ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 Y+ I+ +TDG+ D+ RG V+A+G ++ L Sbjct: 113 AYKAAGLQYYRPWILLLTDGKPNLDGFDEAVARLNAVESARGVTVFAVGAGPRVDYQQL 171 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 39/291 (13%), Positives = 97/291 (33%), Gaps = 41/291 (14%) Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW 147 N ++ N++ + +N+ D+ + + + Y++ + F Sbjct: 21 NQPQLNQENQVSEPTKINNDDEPIDQNFKKLGDGKNKQKYNLEKGLSLDVKTLHKHFQFS 80 Query: 148 YTNSRHIVMPITSSVKVNSQTDAR----------------LDMMIVLDVSRSMESFFDSS 191 ++ I + ++ ++ + LD++ V+D S SM S Sbjct: 81 SLTNQSIPVMVSVKTLDKTEDAPKGDQEAVKQESLENRPNLDLICVIDNSGSM------S 134 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKYL 250 KI K++ +LE + D + + ++S ++ + I + Sbjct: 135 GEKIQNVKKTLEYLLELLG---DNDRLCLILFNSYSTRLCHLMKTNNSNKPAFKEIINKI 191 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G T+ G++ A+ + + N I ++DG+ D + Sbjct: 192 QATG-GTDINSGMELAFRVL---------KERKYQNPVSSIFLLSDGQ--DGSADLRVRQ 239 Query: 311 YCNE-AKKRGAIVYAIGIRVIRSHEFL-RACA-SPNSFYLVENPHSMYDAF 358 + +++ G + + C+ +FY VE + + + F Sbjct: 240 SLERHLPQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 290 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 56/135 (41%), Gaps = 22/135 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 14 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 67 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 68 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 121 Query: 282 TEDANYKKIIVFMTD 296 +KK+IV +TD Sbjct: 122 -----HKKVIVLLTD 131 >gi|221193499|gb|ACM07782.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193641|gb|ACM07853.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193647|gb|ACM07856.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193649|gb|ACM07857.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 64/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N+++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNQEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|40062983|gb|AAR37853.1| magnesium-chelatase, 60 kDa subunit [uncultured marine bacterium 443] Length = 593 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 74/233 (31%), Gaps = 29/233 (12%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE-----GYSISAISRYKIPLK 140 + + R+ +R N N + V + E I A R P + Sbjct: 317 LATLLKRAERSRMRQNKAGNSGQIAQDKHRGRPIGVRRGELKRGHRIDIPATLRAAAPFQ 376 Query: 141 FCTFIPWYTNSRHIVMPI-TSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKIDMA 198 +N + + I ++V + M+ +D S S S+ ++ A Sbjct: 377 AVRARWDTSNHQQASLYIDPQDIRVKRFEQRKSSVMIFAVDASGS------SAHQRMAEA 430 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +I +L ++ + L++F + + + +R + L G T Sbjct: 431 KGAIELLL-----ADCYSHRTEVALISFKGESADLLLPPTRSLVRAKRTLAQLP-GGGGT 484 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 L+ Y+ ++ M V +TDG + ++ +S Sbjct: 485 PMAAALQTIYDVATTVEAMGATP---------TYVLLTDGASNVARDGTKSRT 528 >gi|124007374|ref|ZP_01692081.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987207|gb|EAY26947.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 1088 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 62/181 (34%), Gaps = 24/181 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M++LDVS SM S K+ + +S ++ ++ DV+ V+ +G K Sbjct: 914 LMLLLDVSGSMSSK-----DKLPLLKESFKYLISIMRPQDDVSIVIYAGDAAIVLKPTSA 968 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + I L G TN G K AY + + ++ N I+ Sbjct: 969 S----NQEQINAVIDKLRSRGK-TNVKAGFKLAYK--WMSKNFKEGGNNR-------IIL 1014 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 TDGE + + +G V++ G + + A Y N Sbjct: 1015 ATDGE---FPISKYIYKLVEKRATKGINLSVFSFGSMTKKFETLEKLVAKGKGNYEQVNA 1071 Query: 352 H 352 Sbjct: 1072 R 1072 >gi|15929704|gb|AAH15276.1| Inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+DVS SM S KI +++ +L++VK +N ++ + + T+ + + + Sbjct: 282 IVFVIDVSGSM------SGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQ 335 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + TN GL + + H E + II+ Sbjct: 336 ATP--ANLKEAKTFVKNI-HDQSMTNINDGLLKGIEMLNKARED--HTVPERSTS--III 388 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ A +Y +G ++ FL A N Y Sbjct: 389 MLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIY 448 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 449 EDSDANLQLQGF 460 >gi|166212064|ref|ZP_02238099.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166206810|gb|EDR51290.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] Length = 222 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/183 (17%), Positives = 62/183 (33%), Gaps = 14/183 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S + + RL + +++D S SM I I AM+ ++ P V Sbjct: 2 SLILRTTSDMRRLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDPYALESV 55 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++T+ N+ E+ L + + T + L+ I + Q Sbjct: 56 HLSIITYDNQAREYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCVDRDIQR 110 Query: 280 CNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + + + +VF MTD +++ K+ + A + HE L+ Sbjct: 111 SDGDQKGDWRPLVFLMTDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVSAKAKHEHLKQ 168 Query: 339 CAS 341 S Sbjct: 169 LTS 171 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 42/119 (35%), Gaps = 3/119 (2%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + G +T TA+ LP I + + M+ ++ + ++ L + D + + AA + Sbjct: 1 MLRDESGSVTAATAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAARYTGERPS 60 Query: 71 GNNRKKLKGGDILCRIKNTWNMS--FRNELRDNGFVN-DIDDIVRSTSLDIVVVPQNEG 126 + ++ +D F + D D R + + V + + Sbjct: 61 AVRQARVSVAVNDSFRAERLASRQIELGRWQDGSFSDMDASDPRRPNAARVTVRSEAKT 119 >gi|116192639|ref|XP_001222132.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51] gi|88181950|gb|EAQ89418.1| hypothetical protein CHGG_06037 [Chaetomium globosum CBS 148.51] Length = 952 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 57/180 (31%), Gaps = 31/180 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ + D S SME + +++ L+ + + V+ + +F ++ E Sbjct: 290 IVFLCDRSGSMEG------PNVRHLKTALHLFLKSLPVG------VKFNICSFGSRHEFL 337 Query: 234 FLLEWGV------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F + R + S T+ ++ N+ + + Sbjct: 338 FGNKSQTYDQSSLEKASRYVDGFSADFGGTDIYHPMEDVINRRYADMDLE---------- 387 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + +TDGE + L V+ +GI SH + A+ + + Sbjct: 388 ---VFLLTDGEIWDQSALFELLNKKVAESNGKIRVFTLGIGSGASHALVEGVAAAGNGFS 444 >gi|313886050|ref|ZP_07819788.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924580|gb|EFR35351.1| double-transmembrane N-terminal domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 342 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 26/176 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V+SQ +D+ +DVS SM + D +I A + + ++E+ Sbjct: 80 THAPVSSQQTIGVDLAFCIDVSNSM-AARDVKPDRIGFAKQIVTHTMQELAGS------- 131 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 + +V F+ L + + + + TN L+ + + Sbjct: 132 RVAMVVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQGTNLGQALERSAQALSAPSRA 191 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K ++ +TDGE ++ + + KK+ Y + I + Sbjct: 192 G-----------KAVILLTDGE-DHEGGLEEGI---DRLKKQEIKAYVVTIGLPDG 232 >gi|291226976|ref|XP_002733462.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 394 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 88/246 (35%), Gaps = 45/246 (18%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I + S ++ A LD+ +D + SM S+ I A +++ ++EE+ + Sbjct: 2 SIKVNKVSQPELAKHDSAVLDLAFAMDCTGSMGSY-------IATAQQNVRTIVEEI--V 52 Query: 213 PDVNNVVQSGLVTF--SNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + V+ LV + E F+ VS ++ + S G ++ + Sbjct: 53 CKEKSDVRLALVEYRDHPPQESTFVTRVHDFTSSVSRMKSWLDAASASGGG-DTPEAVAD 111 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMT-----------DGENLSTKEDQQSLYYCNE 314 A +QI + + KI VF+ DG + + N+ Sbjct: 112 ALHQILKLDW--------RQDSTKICVFIADAPPHGLGCLGDGFQNGCPDGLDPIQTGNQ 163 Query: 315 AKKRGAIVYAIGIRVIRS--HEFLRACA--SPNSFYLVENPHSMYDAFSHIG---KDIVT 367 ++ +Y +G S EF A A + + + N + IG ++I Sbjct: 164 LAEKCITMYMVGCEPSISPYKEFFMAIAHMTGGQYVPLRNAALLSQVI--IGGAQEEISL 221 Query: 368 KRIWYD 373 +R+ + Sbjct: 222 QRLMEE 227 >gi|332970881|gb|EGK09858.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 442 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 46/331 (13%), Positives = 104/331 (31%), Gaps = 41/331 (12%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 ++V I++ + ++ + + E+ D + Sbjct: 11 AVVLTVALIVSGCSTGEKEANQKKPEKKKETIPKAAHEPKEIMKQKPGRFSGDNYDPKKV 70 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA------ 170 + + Q ++S Y + + + T V + + Sbjct: 71 EEALD-QFPD-NLSTEEAYDRLVYLLGENYRPKYEELMSLDPTIQVNEKTPDNKIDVPSI 128 Query: 171 -RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++++ I+LD S SM D + K+D+A ++I A +V V+ V G + K Sbjct: 129 EQMNVEILLDASGSMAGRVDGGV-KMDLAKQAIRAFASDVPEGAQVSLRVY-GHKGSNQK 186 Query: 230 IEE----------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + L + S ++ + G T ++ A + + G R Sbjct: 187 KDKAVSCQSNELVYPLKSYDSSQFEQSLNQFKPTGW-TPLASAIQAAREDLKEWAGARN- 244 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIR-SHEFL 336 I+ ++DG + + + G V IG V + L Sbjct: 245 ----------IVYVVSDG---VETCGGDPVREAKKLGESGIEPMVKIIGFDVDDAGQQQL 291 Query: 337 RAC--ASPNSFYLVENPHSMYDAFSHIGKDI 365 + A+ S+ V + + + + I Sbjct: 292 KKVAEAADGSYQTVTSGDDLKEYLKGEKESI 322 >gi|148692822|gb|EDL24769.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Mus musculus] Length = 904 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 277 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 324 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 325 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 380 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 381 LIILLTDGDPT 391 >gi|47212619|emb|CAF92825.1| unnamed protein product [Tetraodon nigroviridis] Length = 533 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 78/247 (31%), Gaps = 60/247 (24%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ ++ + L + + S S+++ + + + + E+ K + +++ Sbjct: 14 SLCIHDSDTSCLHLYFLCRRSGSVKNHW-------EEIYYFVENLAEKFK-----SPMLR 61 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 +TFS++ L ++++ + L + G T GL+ A + Sbjct: 62 MSFITFSSRASTIMKLTENRINIRKGLNALKREIPGGDTIMHLGLQKA----------NE 111 Query: 279 HCNTEDANYKKIIVFMTDGENLST---KEDQQSLYY------------------------ 311 E+ +I+ +TDGE Q+ Sbjct: 112 QIKRENFGPASVIIALTDGELQEDELISAQQEVAASRSMSAVAAVSALAAFSIPDVMDPF 171 Query: 312 ------CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDI 365 A+ GAIVY +G++ + + + V + A + I Sbjct: 172 PVFPQQAETARSLGAIVYCVGVKDFNETQLATIADTIEHVFPVLGG---FHALRGVINSI 228 Query: 366 VTKRIWY 372 + K Sbjct: 229 IKKSCIE 235 >gi|329847546|ref|ZP_08262574.1| hypothetical protein ABI_06110 [Asticcacaulis biprosthecum C19] gi|328842609|gb|EGF92178.1| hypothetical protein ABI_06110 [Asticcacaulis biprosthecum C19] Length = 243 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 59/170 (34%), Gaps = 21/170 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE---VKLIPDVNNV 218 V+ ++ + I+LD + SM S +D ++T ++ + E +K + Sbjct: 20 VRAHTLPPQSVSTYILLDRTGSMSSIWDEALTSVNAYAVDLGKETPEDAGLKTSITLAVF 79 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G + F +W + S ++D G Sbjct: 80 DHQGGLQFDVLRSHVVPAQWR----------------NITSAEASPRGMTPLYDAIGRMV 123 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKE--DQQSLYYCNEAKKRGAIVYAIG 326 +D + ++V MTDGE S+ E + + A+ RG V +G Sbjct: 124 SLAEKDNPKQAVLVIMTDGEENSSTEFNQGAAKAALDRARARGWEVVFLG 173 >gi|301617432|ref|XP_002938150.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Xenopus (Silurana) tropicalis] Length = 985 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 54/137 (39%), Gaps = 23/137 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+D+S SM+ ++ +A +I+ +L+ + N+ V + Sbjct: 217 ATSPKDIVIVVDISGSMKGL------RMTIAKHTISTLLDTL----GENDFVNIIAYNDY 266 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV T L A+ + + + Q Sbjct: 267 VHYIEPCFKGILVQADRDNREHFKQLVDELHAKGVGT-VNKALIEAFKILKEFREAGQGG 325 Query: 281 NTEDANYKKIIVFMTDG 297 A I+ +TDG Sbjct: 326 LCNQA-----IMLITDG 337 >gi|259505645|ref|ZP_05748547.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] gi|259166776|gb|EEW51330.1| secreted Mg-chelatase subunit [Corynebacterium efficiens YS-314] Length = 530 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 63/202 (31%), Gaps = 33/202 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAML----EEVKLIPDVNNVVQSGLVTFS- 227 D VLDVS SM T++++ ++ M+ + + ++ F+ Sbjct: 345 DTTFVLDVSGSMAG------TRMELLRSTMLEMISGEASSLTGDVSLRERENVTIIPFNF 398 Query: 228 --NKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + E G Q + ++ T L AY Q+ + Sbjct: 399 SPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQV------EPGASIP 452 Query: 284 DANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 IV MTDGE S Q+ KKR V+ I E Sbjct: 453 S------IVLMTDGEQTSGLSFGHFQRLYSELPTEKKR-IPVFVILYGEANITEMENLAG 505 Query: 341 -SPNSFYLVENPHSMYDAFSHI 361 + + N + +AF I Sbjct: 506 LTGGKTFDAMN-GGLEEAFKEI 526 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 34/92 (36%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F G + + A+ P+ V ++ S +F + L +M D + V A + Sbjct: 7 RFLGARGGNLATMAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAASLSQAN 66 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDN 101 R+ G + + ++ S N DN Sbjct: 67 EAVLRQLQANGVDPVLMTDGYDPSIVNGKADN 98 >gi|225686099|ref|YP_002734071.1| protein norD [Brucella melitensis ATCC 23457] gi|256262776|ref|ZP_05465308.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|225642204|gb|ACO02117.1| Protein norD [Brucella melitensis ATCC 23457] gi|263092583|gb|EEZ16818.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|326410427|gb|ADZ67491.1| protein NorD [Brucella melitensis M28] gi|326553720|gb|ADZ88359.1| protein NorD [Brucella melitensis M5-90] Length = 633 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 87/210 (41%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK--EDQQSLYYCNEA----KKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ E + +L C A + +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDCRRAAGEVRAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|260785838|ref|XP_002587967.1| hypothetical protein BRAFLDRAFT_87362 [Branchiostoma floridae] gi|229273122|gb|EEN43978.1| hypothetical protein BRAFLDRAFT_87362 [Branchiostoma floridae] Length = 192 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 69/195 (35%), Gaps = 28/195 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++ LD+S S+ + ++A + A ++ + ++ + ++ ++ + Sbjct: 7 AIDIVLALDLSSSI------PQDQFELARDFMVAFVDC--EVFQEKDI-RIAVLNYTCEA 57 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + +F L + +I L + T + + + ++ G H Sbjct: 58 DTYFDLAPIAYGMSYEIGQLMRGDGGITRTGHAINHM--RLTSKFGAESHHAA------- 108 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 V +TDG++ ++ + G +YA+ + L A + S Sbjct: 109 --VILTDGQSEDDQQTAAADAR-----AAGIGLYAVEFGKYVNMYALEAMTTSGSRVF-- 159 Query: 350 NPHSMYDAFSHIGKD 364 DA I D Sbjct: 160 TTSQACDAAQKIVDD 174 >gi|115525137|ref|YP_782048.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] gi|115519084|gb|ABJ07068.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] Length = 332 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 79/215 (36%), Gaps = 39/215 (18%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+D S SM +SF S T + + + ++ G+ FS + Sbjct: 84 IVLVIDRSSSMDDSFAGSRPTAQEASKSAEAR--RFLRSFVANAEHDMFGVAIFSTSPLQ 141 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + +++ G+S T+ G+ A + H + + + I Sbjct: 142 ALPLTSHREAVLAAVDAIARPGLSETDIARGIAMA---------LSMHDDDPS-SASRAI 191 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS------------------- 332 + ++DG + + Q+ L +KR +Y + +R + Sbjct: 192 ILVSDGAGVIDRLVQEKLRAA--FRKRPVHLYWVFLRTANALGIFDPPAAGERDVPQVAP 249 Query: 333 ----HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 H F ++ P + E P ++ DA + IG+ Sbjct: 250 ERHLHRFFQSLRIPYRAFEAERPEAIGDAIAEIGR 284 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + ++ ++D+S+SM + ++++ + ++ ++ Sbjct: 80 GKEISQEKESSSRSILFLVDISKSMNVRDMNEQSRLEYSKWWAKKLMNDIPGD------- 132 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGM 276 + GL+TFS L + + L+ G TN L +A Q Sbjct: 133 RFGLITFSRIANIECPLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALDHAQKQF------ 186 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 E+ +++V ++DGE K +K+ V I + + + Sbjct: 187 -----KENERDSRVVVLLSDGETDGNKW----RESLEALQKKKIPVNVISLGDPKREGLV 237 >gi|312139824|ref|YP_004007160.1| hypothetical protein REQ_24380 [Rhodococcus equi 103S] gi|311889163|emb|CBH48477.1| putative secreted protein [Rhodococcus equi 103S] Length = 581 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 63/171 (36%), Gaps = 18/171 (10%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS----GLVTFSNKIE 231 I+LDVS SM + D T++ + +++A + + DV V S G + K+ Sbjct: 396 ILLDVSGSMG-YTDGDGTRLSNTVDALSARIAALPTSSDVGLWVYSRGLDGAKPYLVKV- 453 Query: 232 EFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L G ++ ++ L +T++ + A++ D + + Sbjct: 454 PTGPLSDGDRRQRIEAALRSLRP-ATATSTYASVIAAHDSAVDGFVDGRPNS-------- 504 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG N T Q L V + I LR+ A Sbjct: 505 -VLLVTDGPNDDTSVGTQKLMQSLTGAAHPVRVDVVSIGENSDQATLRSMA 554 >gi|161784288|sp|Q61704|ITIH3_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H3; Short=ITI heavy chain H3; Short=ITI-HC3; Short=Inter-alpha-inhibitor heavy chain 3; Flags: Precursor Length = 889 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+DVS SM S KI +++ +L++VK +N ++ + + T+ + + + Sbjct: 285 IVFVIDVSGSM------SGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + TN GL + + H E + II+ Sbjct: 339 ATP--ANLKEAKTFVKNI-HDQSMTNINDGLLKGIEMLNKARED--HTVPERSTS--III 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ A +Y +G ++ FL A N Y Sbjct: 392 MLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIY 451 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 452 EDSDANLQLQGF 463 >gi|159110717|ref|NP_032433.2| inter-alpha-trypsin inhibitor heavy chain H3 precursor [Mus musculus] Length = 889 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+DVS SM S KI +++ +L++VK +N ++ + + T+ + + + Sbjct: 285 IVFVIDVSGSM------SGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQ 338 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + TN GL + + H E + II+ Sbjct: 339 ATP--ANLKEAKTFVKNI-HDQSMTNINDGLLKGIEMLNKARED--HTVPERSTS--III 391 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ A +Y +G ++ FL A N Y Sbjct: 392 MLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIY 451 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 452 EDSDANLQLQGF 463 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 74/205 (36%), Gaps = 22/205 (10%) Query: 133 SRYKIPLKFCTFIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 P+ T I ++ +V +++ +++ ++DVS SM Sbjct: 209 PAGDDPVNIITEIAAAPWNKKHKLVQIGLQGKTISTAKLPSSNLVFLIDVSGSMN----- 263 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 K+ + + S + ++++ V VV +G ++ + + ++ + L Sbjct: 264 DSNKLPLLVSSFKLLTDQLRKTDRVAIVVYAG----NSGLVLPSTSGDQKTTIKDALNKL 319 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSL 309 S G + G++ AY + + + ++ TDG+ N+ D+ Sbjct: 320 SAGGST-AGGAGIRLAYEV-----AAKNYIKGGNNR----VILATDGDFNVGASSDEDME 369 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHE 334 E +K G + +G + + Sbjct: 370 KLIEEKRKSGVFLTVLGFGMGNLKD 394 >gi|12836422|dbj|BAB23649.1| unnamed protein product [Mus musculus] Length = 902 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 27/56 (48%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + KG + I + + ++ + ++V ++ K L +++D + + AA I + Sbjct: 11 YRAQKGVVAIFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDS 66 >gi|332798629|ref|YP_004460128.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] gi|332696364|gb|AEE90821.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] Length = 185 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 16/160 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 + F G + ++ A + +I +G+ ++VS I+ + + +++ QI+ E Sbjct: 6 KKFLQKEDGNILLIFAGSMVLIAFFIGICLDVSMIYVKRNSMQNIL---------QIIRE 56 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 + ++ D S E N D IV + P Y Sbjct: 57 ERFTYQDTIRYSDNPALTTYHIAYSAAAE-------NGFDGIVTVYFHEEDPEPNYRSYQ 109 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 + + + P F T ++ + S S Sbjct: 110 VRILLTDECPFYFGRIFGLDTVPLNVSLDGGESYGEGSAD 149 >gi|194211147|ref|XP_001917810.1| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 909 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 37/195 (18%), Positives = 71/195 (36%), Gaps = 34/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM + ++ ++ L ++ I + + V G+V F + Sbjct: 307 VCLVLDKSGSM-----AGSNRLSRMNQAAKHFL--LQTIENGSWV---GMVHFDSIALVI 356 Query: 234 FLLEW--GVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + + L T+ G+K A+ I DM Sbjct: 357 SNLTQIISSNERNKLFESLPTEAVGGTSICAGIKSAFEVITDMYSQIDGSE--------- 407 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 IV +TDGE+ + C +E K+ GAI++ I + + + + Sbjct: 408 IVLLTDGEDN-------TAGSCVDEVKQSGAIIHFIALGPSADQAVIEMSTITGGKHKYA 460 Query: 349 ENPHS---MYDAFSH 360 + + + DAF+ Sbjct: 461 SDEAANNGLIDAFAA 475 >gi|151357765|emb|CAO78005.1| inter alpha-trypsin inhibitor, heavy chain 4 [Mus musculus] Length = 903 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 51/131 (38%), Gaps = 20/131 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D S SM S KI +++ +L+++ N L+ FS + + Sbjct: 275 VIFVIDKSGSM------SGKKIQQTREALVKILKDLSPQDQFN------LIEFSGEANQW 322 Query: 233 -FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ +L + + Y S+ TN + A + + + Sbjct: 323 KQSLVQATEENLNKAVNYASRIRAHGGTNINNAVLLAVELLDRSNQAELLPSKSVS---- 378 Query: 290 IIVFMTDGENL 300 +I+ +TDG+ Sbjct: 379 LIILLTDGDPT 389 >gi|148692826|gb|EDL24773.1| inter-alpha trypsin inhibitor, heavy chain 3 [Mus musculus] Length = 886 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+DVS SM S KI +++ +L++VK +N ++ + + T+ + + + Sbjct: 283 IVFVIDVSGSM------SGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQ 336 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + TN GL + + H E + II+ Sbjct: 337 ATP--ANLKEAKTFVKNI-HDQSMTNINDGLLKGIEMLNKARED--HTVPERSTS--III 389 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ A +Y +G ++ FL A N Y Sbjct: 390 MLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIY 449 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 450 EDSDANLQLQGF 461 >gi|695636|emb|CAA49843.1| inter-alpha-inhibitor H3 chain [Mus musculus] Length = 886 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/192 (18%), Positives = 72/192 (37%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEE 232 ++ V+DVS SM S KI +++ +L++VK +N ++ + + T+ + + + Sbjct: 282 IVFVIDVSGSM------SGRKIQQTREALLKILDDVKEDDYLNFILFSTDVTTWKDHLVQ 335 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + +K + TN GL + + H E + II+ Sbjct: 336 ATP--ANLKEAKTFVKNI-HDQSMTNINDGLLKGIEMLNKARED--HTVPERSTS--III 388 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDG+ N ++ A +Y +G ++ FL A N Y Sbjct: 389 MLTDGDANTGESRPEKIQENVRNAIGGKFPLYNLGFGNNLNYNFLETLALENHGLARRIY 448 Query: 347 LVENPHSMYDAF 358 + + F Sbjct: 449 EDSDANLQLQGF 460 >gi|149034246|gb|EDL89016.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_b [Rattus norvegicus] Length = 924 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 72/192 (37%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++T++ Sbjct: 74 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYN 121 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 122 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNVLSDFNHTGQ 180 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ +R ++ I + + L+ Sbjct: 181 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLK 231 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 232 WMACANKGFFTQ 243 >gi|149034245|gb|EDL89015.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Rattus norvegicus] Length = 930 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 72/192 (37%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++T++ Sbjct: 74 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYN 121 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 122 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNVLSDFNHTGQ 180 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ +R ++ I + + L+ Sbjct: 181 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLK 231 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 232 WMACANKGFFTQ 243 >gi|126340361|ref|XP_001365240.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain2 [Monodelphis domestica] Length = 951 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 69/178 (38%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ Sbjct: 312 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRAED------QFSVVDFNHNVRNW 359 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + ++ ++ I+ + G TN L A + + + + + Sbjct: 360 RDDLVLASKAQITDAKKYIEKIQPNG-GTNINEALLRAIFILNEASNLGM-LDPNSVSL- 416 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ ++DG+ K Q + ++++GI ++FL + N Sbjct: 417 --IILVSDGDPTVGELKLSQIQKNVKQSMQDN-ISLFSLGIGFDVDYDFLERLSQENH 471 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 356 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 407 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + S + Sbjct: 408 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 458 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 459 YVSDEAQNNGLIDAFGA 475 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 41/295 (13%), Positives = 99/295 (33%), Gaps = 33/295 (11%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRI--KNTWNMSFRNELRDNGFVN---- 105 S++D + +I +G + ++ + + K+ +F + + N Sbjct: 86 SVMDSASNSEYVKIGYGESGYDTREYDYQEEHRFVSAKDFPLSTFAADCDTASYSNIRSY 145 Query: 106 -DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR--HIVMPITSSV 162 + + + ++ + + Y + T ++ ++M ++ Sbjct: 146 IEEGMLPPAGAVRVEEMINYFDYDYVSGPEAGKKFAVYTEYADCPWNKDTKLMMVGLNTA 205 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ +++ ++D S SM K+ +A K+ + E + + Sbjct: 206 AIDMSEKKASNLVFLIDTSGSMYEE-----NKLPLAQKAFKMLAENLDEND------RIS 254 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +VT++ G + L G STN + GL AY I + Q ++ Sbjct: 255 IVTYAGSDTVVLNGVAGSEAYTICEALDSLEASG-STNGSAGLITAYE-IAEQQFIKDGN 312 Query: 281 NTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N ++ TDG+ N+ + + E K G + +G + Sbjct: 313 NR--------VILATDGDLNVGLTSESDLVGLITEEKDSGIFLSVLGFGSDNLKD 359 >gi|83312058|ref|YP_422322.1| hypothetical protein amb2959 [Magnetospirillum magneticum AMB-1] gi|82946899|dbj|BAE51763.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1196 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 56/140 (40%), Gaps = 21/140 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +LD+++++DVS SM+ T + ++N++ +++ + N ++ Sbjct: 74 TAPSGKAAPQQLDIVVLVDVSGSMK-------TSLPHIAAAVNSLADQL--LESGNGNIR 124 Query: 221 SGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV F W + + ++ L+ + ++ + +R+ Sbjct: 125 LSLVRFDVSAVIDTTPHWTADREVVAQGLRKLAMIDGANDTREAFRR----------LRE 174 Query: 279 HCNTEDANYKKIIVFMTDGE 298 + +K+ VF TDGE Sbjct: 175 LLDESRPGARKMAVFFTDGE 194 >gi|198430567|ref|XP_002120241.1| PREDICTED: similar to procollagen, type XII, alpha 1 [Ciona intestinalis] Length = 218 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 56/173 (32%), Gaps = 26/173 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+++ +D S+SM A + + ++ P + + FS I+ Sbjct: 28 MDLVVAVDSSKSMGK------PNFVEAKRLVVRLINSFNTSPT---RTRVSMYKFSTFID 78 Query: 232 EFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L ++++ + G + L +A+ F R N Sbjct: 79 RRRELYLNEKMGNKRLQRKEVWGMPYNGYGSYVGRALSHAFRVSFT---ERLGDRPGSRN 135 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFLRA 338 ++ +TDG + + ++YAIG+ + L+ Sbjct: 136 V---LLVITDGR-----SYDGVKVISKRLRNLKDVMIYAIGVVPVNERGILQR 180 >gi|158298451|ref|XP_318625.3| AGAP009598-PA [Anopheles gambiae str. PEST] gi|157013887|gb|EAA14534.4| AGAP009598-PA [Anopheles gambiae str. PEST] Length = 1124 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 71/194 (36%), Gaps = 26/194 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNNVVQSG 222 + D++I+LD S SM S + +A+ + +A+L+ + ++ + Sbjct: 149 ASSPKDVIILLDSSGSM------SGKEYQLAVATASAILDTLGDDDFFNLISFSDQSRVI 202 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +K+ V ++ I + + + N + L+ A+ + Q Sbjct: 203 VPCFQDKMVRATP--DNVKEVKTAINAV-ECENTANFSAALETAFELLRKYNQSSQGSQC 259 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEFLR-AC 339 A I+ +TDG + + E + + + ++ IG + AC Sbjct: 260 NQA-----IMLITDGPSDTFMEVIKHYNHPHMP----VRIFTYLIGTDKSGGKNLYKMAC 310 Query: 340 ASPNSFYLVENPHS 353 + F + +P Sbjct: 311 ENKGFFVQINSPEE 324 >gi|152207243|gb|ABS30732.1| voltage-gated calcium channel alpha2-delta subunit 1 [Anopheles gambiae] Length = 1256 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 71/194 (36%), Gaps = 26/194 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNNVVQSG 222 + D++I+LD S SM S + +A+ + +A+L+ + ++ + Sbjct: 264 ASSPKDVIILLDSSGSM------SGKEYQLAVATASAILDTLGDDDFFNLISFSDQSRVI 317 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +K+ V ++ I + + + N + L+ A+ + Q Sbjct: 318 VPCFQDKMVRATP--DNVKEVKTAINAV-ECENTANFSAALETAFELLRKYNQSSQGSQC 374 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEFLR-AC 339 A I+ +TDG + + E + + + ++ IG + AC Sbjct: 375 NQA-----IMLITDGPSDTFMEVIKHYNHPHMP----VRIFTYLIGTDKSGGKNLYKMAC 425 Query: 340 ASPNSFYLVENPHS 353 + F + +P Sbjct: 426 ENKGFFVQINSPEE 439 >gi|153867800|ref|ZP_01998040.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] gi|152144880|gb|EDN71960.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] Length = 276 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 72/197 (36%), Gaps = 21/197 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++M+ V+D + SS++++ A +++ L ++ ++ L F + Sbjct: 20 VEMIFVVD----VSGSMSSSLSEMVQAARTVANEL----ATSNMPGQIRFALTVFDSHNN 71 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKI 290 L ++ + L+ G S S + +A NQ+ R H KI Sbjct: 72 IKTSLNDDINTFYGGLNSLTIDGGSDIS---MAFAPINQLLSQ--ARPHAA-------KI 119 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 +VF TDG + + + + +G ++A+ + PN N Sbjct: 120 VVFYTDGYVFHSGKMDAIVNDAEALRNQGVQIFAVSPPEDDASAMSLITGYPNRVLRPNN 179 Query: 351 PHSMYDAFSHIGKDIVT 367 + + F ++ +V Sbjct: 180 LPDIVNRFRYVADAVVG 196 >gi|332216482|ref|XP_003257380.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Nomascus leucogenys] Length = 1398 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 594 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 641 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 642 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 700 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 701 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 751 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 752 WMACANKGFFTQ 763 >gi|332164672|ref|NP_001193680.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] gi|296474881|gb|DAA16996.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] Length = 1091 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RMTIAKQTVSSILDTLGDDDFFN------IIAYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|291393868|ref|XP_002713441.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Oryctolagus cuniculus] Length = 1352 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 518 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 565 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 566 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 624 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 625 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 675 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 676 WMACANKGFFTQ 687 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/115 (20%), Positives = 44/115 (38%), Gaps = 12/115 (10%) Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYA 324 AY+ +D+ + + A F TDG + + + +K+ + Sbjct: 237 AYDTSWDIDDGLAFFSKKLAEQD----FKTDGTDAAGKSWNGDPKDPADYSKQL-VQTFT 291 Query: 325 IGIR---VIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIV--TKRIWYD 373 +G +L+ AS P +Y + P S+ F+ I + I TK + YD Sbjct: 292 VGFGQGITPTGKRYLQLAASRPEYYYEADKPESLSKVFNDIVEQIKSGTKNVGYD 346 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 60/152 (39%), Gaps = 19/152 (12%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + + + K++ Q + + ++M+ +D S SM+ + S T+I + +++ +LEE Sbjct: 39 KVPLYLQTESKIDKQPEVKHNIMLFIDDSGSMDEYIGSK-TRIQITKDALSKVLEEHGST 97 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + L T +N G + + ++K + G + ++ + Sbjct: 98 FNWA------LQTLNNSGGSDTPDAKGFTIPATEIANRVKRIKAKGGTPTTSRYFELVSQ 151 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + R K +V M+DG+ Sbjct: 152 IVMPNVKYRCQ--------KSYVVLMSDGDAN 175 >gi|149197817|ref|ZP_01874866.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] gi|149139038|gb|EDM27442.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] Length = 1078 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 46/273 (16%), Positives = 102/273 (37%), Gaps = 35/273 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTD 169 V ++ + I N Y S + + + Y + + + + ++ + + Sbjct: 675 VEASHVRIEEFINNFDYHYSVPKKEAFKIDSELSDHKVYAGVKLLRVGVQG-QRLGADSQ 733 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSN 228 V+D S SM + ++ + K++ M + + +V + + G+ +N Sbjct: 734 KPGSYTFVIDNSGSMAAE-----NRLPLIQKTLPNMFKAMNQDDEVTILSCEGGVTNLAN 788 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +I S L+ +K + G N + G++ AY ++ N Sbjct: 789 RITAS-----NHSQLETAVKNIEA-GTVANLSVGIEEAY-KLAAQNFRSGAVNR------ 835 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACAS--PNS 344 ++ ++DG +L KE Q+ L ++ +K+G IG+ FL A+ Sbjct: 836 --VILLSDGIASLGEKEAQEVLKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGV 893 Query: 345 FYLVENPHSMYD--------AFSHIGKDIVTKR 369 +Y ++ M D +F I +D+ + Sbjct: 894 YYFGDSKEQMNDILVNNFEASFKTIARDVKIQL 926 >gi|25028093|ref|NP_738147.1| hypothetical protein CE1537 [Corynebacterium efficiens YS-314] gi|23493377|dbj|BAC18347.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] Length = 531 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 63/202 (31%), Gaps = 33/202 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAML----EEVKLIPDVNNVVQSGLVTFS- 227 D VLDVS SM T++++ ++ M+ + + ++ F+ Sbjct: 346 DTTFVLDVSGSMAG------TRMELLRSTMLEMISGEASSLTGDVSLRERENVTIIPFNF 399 Query: 228 --NKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + E G Q + ++ T L AY Q+ + Sbjct: 400 SPGEPITATVDEVGGPQRQELVDGVTALQAEGGTGIYDALLRAYEQV------EPGASIP 453 Query: 284 DANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 IV MTDGE S Q+ KKR V+ I E Sbjct: 454 S------IVLMTDGEQTSGLSFGHFQRLYSELPTEKKR-IPVFVILYGEANITEMENLAG 506 Query: 341 -SPNSFYLVENPHSMYDAFSHI 361 + + N + +AF I Sbjct: 507 LTGGKTFDAMN-GGLEEAFKEI 527 >gi|262200403|ref|YP_003271611.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262083750|gb|ACY19718.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 423 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 61/188 (32%), Gaps = 29/188 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + A + +VLD S SM S + A +++ ++ ++ G Sbjct: 30 EGKVTDRAPAALQVVLDRSGSM------SGPPLAGAQRALAGVIGQLDPRDVF------G 77 Query: 223 LVTFSNKIE---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK--YAYNQIFDMQGMR 277 +VTF + + L + + + G + S+ L+ + G+R Sbjct: 78 VVTFDDDAQVVLPAAPLADKARAVDA-VGSIVPGGCTDLSSGYLRGLQELRRATASAGIR 136 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ ++DG N ++ + +A G I +G L Sbjct: 137 GG----------TVLVISDGHVNRGIRDLDEFASITAKAAADGIITSTLGYGRGYDETLL 186 Query: 337 RACASPNS 344 A A + Sbjct: 187 SAIARSGN 194 >gi|224065915|ref|XP_002191423.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain H3, partial [Taeniopygia guttata] Length = 869 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 98/275 (35%), Gaps = 27/275 (9%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E G + +G I ++NT +F + F +D + T + + + Sbjct: 178 EPQGIAELEAEGTFITNELQNTIKKTFSGKKGHISFKPTLDQ--QRTCANCSESVLDGDF 235 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ + P N + +++ S+ ++ VLD S SM Sbjct: 236 TVRYDVKRTTPDN-----LQIVNGYFVHFFAPTNLPKLSKN-----IIFVLDTSGSM--- 282 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIEEFFLLEWGVSHLQRK 246 S +I+ +++ +L+++K N ++ ++ + + + + ++ Sbjct: 283 ---SGREIEQTKEALLKILDDIKEDDFFNIILFDSEISTWKETLIKATPE--NLDEARKF 337 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 ++++S G+ TN GL + + + II+ +TDG+ + Sbjct: 338 VQHISAQGL-TNLHGGLMRGIDILNAAHEENLVPKRSAS----IIIMLTDGQPNVGLSNT 392 Query: 307 QSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + +A +Y +G + FL A Sbjct: 393 HEIENAVKKAIDGRYTLYNLGFGSGVDYGFLERMA 427 >gi|197337244|ref|YP_002158801.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] gi|197314496|gb|ACH63945.1| iron-regulated protein FrpC [Vibrio fischeri MJ11] Length = 3927 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 79/232 (34%), Gaps = 25/232 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV----- 164 I+ + + V N + LK +NS +++ Sbjct: 3383 ILPVGTGNQTVDFSNLSIQVPKSES-DFTLKATATAIEASNSDTSSGDAEATLPQDILVD 3441 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + + LD S SM ++ + ++S +LE+++ P+ + VQ L+ Sbjct: 3442 QGGSLPTTLITLALDSSGSMGDVEVDDKERMQLVLESSIKLLEDIQNQPN-SGTVQVQLI 3500 Query: 225 TFSNK-----IEEFFLLEW-----GVSHLQRKIKYLS-----KFGVSTNSTPGLKYAYNQ 269 F N+ + L W ++HLQ I+ G T+ + + Y Sbjct: 3501 DFDNRHSDETNDTATSLGWYSVADAITHLQDAIEIEMISDPYHIGGGTDYSEAI---YAI 3557 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + + + + +N +I F++DG N + + + + Sbjct: 3558 LDGYSNDKVPNDVDLSNTNDVIYFISDGHNNEVIAADLLVQWNEFIEDKEVK 3609 >gi|148974876|ref|ZP_01811856.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] gi|145965385|gb|EDK30634.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] Length = 418 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/202 (9%), Positives = 65/202 (32%), Gaps = 21/202 (10%) Query: 29 IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKN 88 ++ + + +++S T L + + + + N + D Sbjct: 1 MMVIFMAFSMQMSQQMLAHTRLLEAAEVASLALIASPREDEENNVKYARYLVDRYVVDNT 60 Query: 89 TWNMSFRNE---LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI 145 +G V ++ + + A ++Y ++I Sbjct: 61 DDVDVAVYTSICEYKDGCVQASGELAP-----------FSDFVVRATAKY------TSWI 103 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + ++ +D+ + D S SM + + + K+D+ ++I + Sbjct: 104 AYEDVNLKPEFSVSGRAVTRKYLPQPVDVYFIGDFSGSMGNPWKNGKMKLDVVKETIKRV 163 Query: 206 LEEVKLIPDVNNVVQSGLVTFS 227 +++++ + + L+ ++ Sbjct: 164 VDDIEEF-NSEEKSRVALLGYN 184 >gi|294653580|ref|NP_714599.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630706|gb|AAN51614.2| BatB [Leptospira interrogans serovar Lai str. 56601] Length = 318 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 56/141 (39%), Gaps = 15/141 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + ++ +D++ ++DVS SM++ DSS T++ + + ML + + G Sbjct: 53 EKKEESFKGVDILFLVDVSLSMQA-IDSSPTRLAKFKEVLLRMLPSLSGN-------RFG 104 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ F+ + + VS ++ L V T +Q F Sbjct: 105 MIVFAGSPFLYCPMTTDVSAFSDYVRGLDVDMVGDRGTD-----LSQAFTKAEALLRSEK 159 Query: 283 EDANYKKIIVFMTDGENLSTK 303 +I++ +TDGE+ + Sbjct: 160 --VFRNRILILVTDGEDQNDP 178 >gi|260810987|ref|XP_002600204.1| hypothetical protein BRAFLDRAFT_118258 [Branchiostoma floridae] gi|229285490|gb|EEN56216.1| hypothetical protein BRAFLDRAFT_118258 [Branchiostoma floridae] Length = 421 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 58/198 (29%), Gaps = 34/198 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD+M ++D S S+ S +A + I + + G + ++ Sbjct: 227 DLDVMFLVDGSSSIGSEG------FALAKEYIAHFINCFDC-------PRIGFILCKCEV 273 Query: 231 EEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + V + + T +K+ D + Sbjct: 274 NTSIPLCYYDYVRDATTAVGEIETCPSLTRIGYCIKH-MQCTTDWKEG----------VP 322 Query: 289 KIIVFMTDGENLST---KEDQQSLYYCNEAKKRGAIVYAIGIRVI---RSHEFLRACASP 342 I+V +TDG T + + + Y A+ G VYA I S Sbjct: 323 SIVVILTDGRIYGTIDGRYTEDVIGYAEAARDAGMEVYAGAIGREVFVDETALEEISGSE 382 Query: 343 NSFYLV--ENPHSMYDAF 358 + + E+P + Sbjct: 383 DRTFSTFDEDPSVLVAIL 400 >gi|284989144|ref|YP_003407698.1| cobaltochelatase subunit [Geodermatophilus obscurus DSM 43160] gi|284062389|gb|ADB73327.1| cobaltochelatase subunit [Geodermatophilus obscurus DSM 43160] Length = 674 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 47/132 (35%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM S +++ ++ ++L + + GLVTF + E Sbjct: 492 VLFVVDASGSM-----GSRSRMTAVKGAVLSLL-----LDAYQRRDKVGLVTFRGAEAEL 541 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 W V ++ L G T GL A+ + + ++V Sbjct: 542 ALPPTWSVEAAAARLTALPT-GGRTPLAAGLLRAHETLRVERVRDAQRRP-------LLV 593 Query: 293 FMTDGENLSTKE 304 +TDG + Sbjct: 594 VVTDGRATGARG 605 >gi|324991258|gb|EGC23192.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK353] Length = 451 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 37/200 (18%), Positives = 59/200 (29%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM I + +IN + + + GL TFS Sbjct: 174 KAGSADIVFVVDRSGSMGGT-------IGIVRANINEFVRNITK---EGITARFGLATFS 223 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 224 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--TALNQI 281 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 282 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 336 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 337 EG--IYKNFATEGRVLDIEN 354 >gi|296816633|ref|XP_002848653.1| von Willebrand factor type A domain-containing protein [Arthroderma otae CBS 113480] gi|238839106|gb|EEQ28768.1| von Willebrand factor type A domain-containing protein [Arthroderma otae CBS 113480] Length = 1002 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 77/223 (34%), Gaps = 30/223 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD---MMIVLDVSRSME 185 IS I+ K P P I++ I + SQ +D ++ ++D S SM Sbjct: 280 ISTIASAK-PRASLETHPSIEGHSAIMIEIPPDFMLESQE--PVDDKEIIFLVDRSGSMA 336 Query: 186 SFFDSSITKIDMAIKSI-NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 I+ + ++S+ + L + ++ +S ++ Sbjct: 337 GKIHGLISSMQFYLRSLPMSTLFNICSFGSSYQLLWEQSRAYSEIT---------LNEAL 387 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + S T+ P L +H + + K I+ +TDGE +E Sbjct: 388 YYVSSFSSNLGGTDLLPAL--------------EHVVLQQNHSSKDIIVLTDGEVWRLEE 433 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + + + K+ +A+GI SHE + A+ Y Sbjct: 434 TIRFVRLTHIVSKKAIRFFALGIGNAVSHELVEGIANSGGGYA 476 >gi|156358477|ref|XP_001624545.1| predicted protein [Nematostella vectensis] gi|156211332|gb|EDO32445.1| predicted protein [Nematostella vectensis] Length = 1269 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 59/185 (31%), Gaps = 29/185 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+ ++D S S+ + + IN + V+ + + GL FS + Sbjct: 914 LDLGFLIDGSMSVGGEMN--------FKQIINFVWNVVQSFQVSAHFTRVGLAIFSLEAF 965 Query: 232 EFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F + + ++Y G T GL A +F+ G Sbjct: 966 VIFNFHTYYDKNNILDALNNVQYPGTDGPGTYIGRGLHVAKEYLFEASG------RPRVP 1019 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---PN 343 + I+V + + + ++ G +Y +G+ + + A AS Sbjct: 1020 HALIVV-------AAGRSLDDVITPSLSLRRMGVDIYTVGVGKFYTKLQMHAMASFPHSE 1072 Query: 344 SFYLV 348 + Sbjct: 1073 HVFGA 1077 >gi|317056550|ref|YP_004105017.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448819|gb|ADU22383.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 279 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 46/138 (33%), Gaps = 18/138 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM F S + ++ +L D + + TF ++ + + Sbjct: 38 VAVVLDVSGSMTKAFQSGM-----VQATLERLLPLAMAFDDDGS---MEVWTFDHEFKRY 89 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + T P L+ E A ++F Sbjct: 90 PPIT-RTNFYDYIKDNKISARGGTMYAPVLRDV---------GSYFVQEEPAKIPTYVIF 139 Query: 294 MTDGENLSTKEDQQSLYY 311 +TDG+N + +++ Sbjct: 140 ITDGDNADESDTDKAIKI 157 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 69/462 (14%), Positives = 130/462 (28%), Gaps = 123/462 (26%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N +G + +L A+ + ++G+ ++ KT L + D + + N Sbjct: 16 SNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIAAN 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++ + ++ +F F V + L+IV Q ++ A Sbjct: 76 AQQSDVTASGIKAGESQALKAFNANASKVPFA-----TVSLSQLEIVRSGQ----TLDAT 126 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSS 191 Y ++ + ++ + + +SV + S LD +++DVS SM DS Sbjct: 127 VSYTATVQSTFGRTFGLSATTLTNRVNASVDLAS----YLDFYLMVDVSGSMGLPTKDSD 182 Query: 192 IT------------------------------KIDMAIKSINAMLEEV---KLIPDVNNV 218 KI + ++N + E+ P V N Sbjct: 183 AEALAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPVVPNQ 242 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMR 277 + G+ F N++ L + L + T L Q+F + Sbjct: 243 YRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKIWPLAFTNLLDTGSTQLFTNNDPK 302 Query: 278 QHCNTEDANYKK-----------------------IIVFMTDGENLS------------- 301 + +++ + +TDG S Sbjct: 303 TGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTFS 362 Query: 302 ---------TKEDQQSLYY-------CNEAKKRGAIV---YA-------------IGIRV 329 D C + K GA + Y I Sbjct: 363 GNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWEN 422 Query: 330 IRSHEF-------LRACASPNSFYLVENPHSMYDAFSHIGKD 364 R ++F LR CASP FY + + + K Sbjct: 423 GRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQ 464 >gi|256397309|ref|YP_003118873.1| hypothetical protein Caci_8209 [Catenulispora acidiphila DSM 44928] gi|256363535|gb|ACU77032.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 177 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 52/163 (31%), Gaps = 15/163 (9%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI- 65 ++R + +G + + I+ P++ L++ +++V + +T D++ A I Sbjct: 15 SLRRALRDDRGSLAMAVVIWAPVVVLLMAFVVDVGLLISDRTQASDYADQAARRVAQDID 74 Query: 66 --------MNEGNGNNRKKLKGGD-ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 + NG + D + + + T + Sbjct: 75 QGWLKTHNVRGPNGEDPGIKVNVDPDTGDCVPDAEQYLLDNQITDTTITSCQVTGNPTEV 134 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 D+ V P+ I+ + + F F ++ T Sbjct: 135 DLYVNPR-----ITVTLQMQYKPLFVGFALKGDSTVTGTGSAT 172 >gi|328883597|emb|CCA56836.1| hypothetical protein SVEN_3550 [Streptomyces venezuelae ATCC 10712] Length = 642 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 31/151 (20%), Positives = 55/151 (36%), Gaps = 30/151 (19%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + +K AR + +V+D S SM +F D S+ ++ + ++ A L+E + Sbjct: 447 AQLKKQGLAGARAAVYLVVDRSGSMRGYFKDGSVQRLAEQVTALAAHLDEDATVTT---- 502 Query: 219 VQSGLVTFSNKIEEFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V FS I+ L V + + L + N Sbjct: 503 -----VFFSTDIDGTVDLTPADLTPTRVDEVNATLGRLGRT--------------NYHRA 543 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 ++ + H DA+ ++VF TDG S Sbjct: 544 VEEVLAHHEKADASRPALVVFQTDGAPESKT 574 >gi|296223008|ref|XP_002757441.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Callithrix jacchus] Length = 1289 Score = 45.5 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 VYILIDTSHSMKS-------KLDLVKDKIIKFIQEQLKYKSRFNFVKFDGQAVAWREQLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G STN+ LK A+ + E I Sbjct: 562 EVNEENLEQAQSWIRDMK-IGSSTNTLSALKTAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +Y I ++ FL+ A+ Sbjct: 605 LTDGRPDQPPEMVI-----DQVKVFQEIPIYTISFNYNDEIANRFLKEIAA 650 >gi|332974518|gb|EGK11438.1| PilC protein [Kingella kingae ATCC 23330] Length = 1328 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 62/147 (42%), Gaps = 20/147 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSS--ITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ V A+ ++M++LD S SM + S T+ + S++ ++++ Sbjct: 40 TNTTVRGMQGAKPNIMLLLDDSGSMRAEVPGSYGQTRQGILRNSLSKIVDKYGS------ 93 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GLV+F++ + L G S+ + I+ G + T LK A N + + Sbjct: 94 RINWGLVSFNDSSSRY-NLSLGTSYLTVANAIRNFPASGTTPTITSYLK-AVNMLNEGIK 151 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLST 302 R K +V ++DG++ Sbjct: 152 YRCQ--------KSYVVLLSDGDSNWP 170 >gi|156346879|ref|XP_001621559.1| hypothetical protein NEMVEDRAFT_v1g221836 [Nematostella vectensis] gi|156207635|gb|EDO29459.1| predicted protein [Nematostella vectensis] Length = 377 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 54/131 (41%), Gaps = 16/131 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +LD+ I+LDVS SME + A + ++ +K I GL+TF+N Sbjct: 191 QEQLDIAILLDVSMSMEWG-------LSQAQNFTSLVIGSLKDIISEGGTH-VGLITFAN 242 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD-MQGMRQHCNTEDANY 287 + E L+ S + + +T + LK A F +GMR Sbjct: 243 EAEIVIGLDDIRSRDWTAVIHKVMLSGNTYANKALKLAETTFFTEAKGMRPGSA------ 296 Query: 288 KKIIVFMTDGE 298 K+++ ++DG Sbjct: 297 -KVVITLSDGR 306 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 356 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 407 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + S + Sbjct: 408 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 458 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 459 YVSDEAQNNGLIDAFGA 475 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 308 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 357 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 358 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 408 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + S + Sbjct: 409 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 459 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 460 YVSDEAQNNGLIDAFGA 476 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 356 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 407 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + S + Sbjct: 408 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 458 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 459 YVSDEAQNNGLIDAFGA 475 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 69/197 (35%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 307 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATIV 356 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ G+KYA+ I + + + Sbjct: 357 NKLIQIKSSDERNTLMAGLPTYPL--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 407 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + S + Sbjct: 408 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRAADEAVIEMSKITGGSHF 458 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 459 YVSDEAQNNGLIDAFGA 475 >gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] Length = 676 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 65/172 (37%), Gaps = 34/172 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--E 231 M+ V+D S SM ++ A K++ L + N ++ F N + Sbjct: 327 MVFVVDTSGSMHG------QSMEQAKKALFYALSLLDSDDSFN------IIGFDNIVTPM 374 Query: 232 EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L ++ + +R I L G + ++ A N + D Sbjct: 375 SDKPLIASDFNLRRAERFIYSLEADGGTE-----IQGALNAVLDGSEFDGFV-------- 421 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + +VF+TDG ++ +L+ ++K + ++ +GI + F+R A Sbjct: 422 RQVVFLTDGS----VSNEDALFKNIQSKLGDSRLFTVGIGSAPNSFFMRRAA 469 >gi|321460552|gb|EFX71593.1| hypothetical protein DAPPUDRAFT_255504 [Daphnia pulex] Length = 983 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 67/192 (34%), Gaps = 27/192 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++V+D+S SM+ F +I +SI + ++ D+ N G+V FS E Sbjct: 330 FVVVMDISGSMKEF-----DRIGKLSESIRSWIKT-----DLRNGSHLGMVQFSATAEIL 379 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L R+ I L + +T GL+ A + + N + + Sbjct: 380 SELTMISDEKSREEMIAKLPKQLQAATCIGCGLELAVQMLNE--------NKGTSETGGV 431 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVE 349 IV +TDG+N + K V I + Y V+ Sbjct: 432 IVLVTDGKNS--PGYLHISDVQEDILKAKIRVITIAFGEKADKNLEDLARQTDGKSYFVK 489 Query: 350 NPH---SMYDAF 358 + ++ +AF Sbjct: 490 DEDGGAALQEAF 501 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 67/189 (35%), Gaps = 22/189 (11%) Query: 178 LDVSRSMESFFDSSITK--IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 LD S SM + K +DM + A L ++ P ++ FS+ ++ + Sbjct: 358 LDYSGSMGDNGGETGVKKAMDMILNQSTAKLYFLQATPQDK----IAVIAFSDSVKAEWY 413 Query: 236 LEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 G +S L + I+ L G + TP + A Q+ CN + Sbjct: 414 ATGGDLSSMSTLDQNIQKLQAGGGTDIYTPVM-TALQQL--AGADVSQCNPA-------V 463 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 + MTDG++ + + + +++I + L+ + + L + Sbjct: 464 ILMTDGQSNTGRTFTNVQSTYKSI-GKDIPIFSIEFGAADPTQ-LKQFGTLSKAALFDGR 521 Query: 352 HSMYDAFSH 360 + AF Sbjct: 522 KDLVAAFKQ 530 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 56/135 (41%), Gaps = 22/135 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 264 KDITPLDLVLVVDWSGSMNDN-----NRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 317 Query: 227 SNKIEEFF--LLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ + ++ G ++ ++K ++ T + L+ A + + G Sbjct: 318 SSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNG------ 371 Query: 282 TEDANYKKIIVFMTD 296 +KK+IV +TD Sbjct: 372 -----HKKVIVLLTD 381 >gi|206575582|ref|YP_002235851.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] gi|206570426|gb|ACI12072.1| tellurite resistance protein TerY [Klebsiella pneumoniae 342] Length = 212 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 42/132 (31%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 3 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L + S G +T L N+I D + Sbjct: 57 RQTVPL---TDLLNFNLPSFSASG-TTALGEALSLTANRIDAEVQKTTAETKGDWRP--L 110 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 111 VFLMTDGGPTDD 122 >gi|262193497|ref|YP_003264706.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262076844|gb|ACY12813.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 583 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 72/211 (34%), Gaps = 32/211 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +V+D S S + + ++ ++ + M + V +++V+ S + Sbjct: 98 DVALVIDNSGSEQGY-------LEWLQEAAHVMTDAVMGRDGRSSLVRV-----STDSDI 145 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE----DANYK 288 L ++ I L T G++ + ++ D + K Sbjct: 146 RLGLTEDEEAIRGAIDELYILNGWTALYDGIRLGNETLGAAAATHSDYDSMDDFCDTDRK 205 Query: 289 KIIVFMTDG-ENLSTKEDQQ-----------SLYYCNEAKKRGAI--VYAIGIRVIRSHE 334 +V TDG EN S E + +L ++ + +Y +G+ H Sbjct: 206 LAVVAFTDGNENNSANERLRSDEYPGDGIDTTLEDLHDLRVADVRTPIYTVGLGDEVDHG 265 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 L A + + +++ + F I + Sbjct: 266 GLEELAGYTGGRHHRIDSAADLPATFEVISE 296 >gi|325694418|gb|EGD36328.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK150] Length = 450 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 60/200 (30%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 627 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 53/144 (36%), Gaps = 15/144 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM K+ +++ +L++++ N + S V+F + Sbjct: 302 VLFVIDVSGSM------DGAKMGQTKEALRVILDDMRSFDRFNILTFSYEVSFWKENMMI 355 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + + L G TN GL + R + E+ +V Sbjct: 356 LATQENILEAKNFVNNLRASG-GTNFNGGLVEGVEML-----RRVTDDAENTERSAFLVI 409 Query: 294 M-TDGENLSTKEDQQSLYYCNEAK 316 M TDG+ + Q AK Sbjct: 410 MLTDGQPT--SGETQLTKIQENAK 431 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 63/160 (39%), Gaps = 31/160 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD S SM + +++ ++ L ++ + +G+VTF + Sbjct: 63 LILVLDKSGSM-----AGGDRLNRLNQASQLFLLQI-----IEKGSWTGMVTFDSSATIQ 112 Query: 234 FLLEWGVSHLQRK--IKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + QR I L G T+ GL+ A+ I + Sbjct: 113 SALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVIKNKFSTDGSE---------- 162 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRV 329 IV +TDGE ++ C +E K+ GAI++ + + Sbjct: 163 IVLLTDGE-------DSTISSCFDEVKQSGAIIHTVALGP 195 >gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella putrefaciens 200] Length = 757 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 70/196 (35%), Gaps = 33/196 (16%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +VM + ++ + ++++V+D S SM I A ++ L +K Sbjct: 356 YSLVMVLPPKTDEHALSTLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKA 409 Query: 212 IPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYA 266 N ++ F++++ + L +HL ++ I L G + + A Sbjct: 410 EDSFN------IIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTE-----MALA 458 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVYA 324 N R +++ + ++FMTDG N D ++ Sbjct: 459 LNAALPRGINRLSESSQSL---RQVIFMTDGSVGNEQALFDLIRYQIGES------RLFT 509 Query: 325 IGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 510 VGIGSAPNSHFMQRAA 525 >gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein [Shewanella amazonensis SB2B] gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Shewanella amazonensis SB2B] Length = 713 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 75/232 (32%), Gaps = 39/232 (16%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + + + T + T R D + VLD S SM + T ++ + + + Sbjct: 306 PKVSTKGTVKLTFTPGDDLGPVTQGR-DWVFVLDKSGSMNGKY---ATLVEGVRQGLGKL 361 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNST 260 + + + F +EF ++ ++ +S G T+ Sbjct: 362 --------PAQDRFRI--ILFDESTQEFSKGFVPVDSNNINQALAWVEGISP-GNGTDLY 410 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRG 319 GLK A + DA+ +V +TDG N+ E ++ L +++ Sbjct: 411 QGLKRALTPL-------------DADRSTGVVLITDGVANVGVTEKRRFLEL---MQQQD 454 Query: 320 AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 ++ + + L S V N + +I + + Sbjct: 455 VRLFTFIMGNSANTPLLVPMTRLSNGVATSVSNADDIVGHLMNITSKLTHQA 506 >gi|282863860|ref|ZP_06272918.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561561|gb|EFB67105.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 455 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 76/195 (38%), Gaps = 29/195 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + +K T R + +VLD S SM F+ D S + ++ A L+ +P Sbjct: 251 AKLKAQGLTGLRATVYLVLDRSGSMRPFYKDGSAQHLGDRALALAAHLDTDATVP----- 305 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS ++ + G++ + +I L G+ A ++ Sbjct: 306 ----VVFFSTDVDGTGTV--GLTSHEGRIDELHA-GLGRLGRTHYHRAVEEVV------A 352 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSH--E 334 H ++ +++F TDG + +Q+L + A+ +Y + S + Sbjct: 353 HYEASGSSDPALVIFQTDGAPDARVAAKQALV--DAAR---LPLYFQFVAFGDEDSKAFD 407 Query: 335 FLRACASPNS-FYLV 348 FLR +PN+ F+ Sbjct: 408 FLRRLDAPNAGFFHA 422 >gi|253996156|ref|YP_003048220.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253982835|gb|ACT47693.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 2114 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 46/255 (18%), Positives = 85/255 (33%), Gaps = 45/255 (17%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLK--------FCTFIPWYTNSRHIVMPITSS 161 V S + +VV Y+ + + + + + + ++ + I Sbjct: 1279 TVTSATGTLVVNANTGAYTYTLNAATTEGVNDKPTFNYVLTDSVTGQSTNANLTVNIVDD 1338 Query: 162 VKVNSQTDARLD---------MMIVLDVSRSM----------ESFFDSSITKIDMAIKSI 202 V L ++IVLD S SM +S +D+S ++++A ++I Sbjct: 1339 APVGGNITQTLQAASAALTYNVVIVLDRSGSMAQDANGLWSNQSGYDASTNRMEIAKEAI 1398 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 ++ + +VN VTFS+ E V+ R + + G + ST Sbjct: 1399 AQLIARYDGLGNVN----VKFVTFSSDAVESEWYIDNVTGAVRYVDNVQAGGGTQYST-- 1452 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGA 320 + A+ F+TDG+ S E +L + N G Sbjct: 1453 ---------ALNETMSGFTQPVADKTLFY-FITDGQPNSGYEVDATLQTQWQNFVAANGN 1502 Query: 321 IVYAIGIRVIRSHEF 335 I + IGI Sbjct: 1503 ISFGIGIGTASLSSL 1517 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 82/258 (31%), Gaps = 37/258 (14%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT-NSRHIVMPITSSV 162 + DI+DI +L I V + E I ++ LK T + + S Sbjct: 48 ITDIEDIDDKVTLQIQVTGE-ESKPIMGLAESDFQLKVLDKKNNKTYQGKQLPFDWKSPR 106 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSIT-----------KIDMAIKSINAMLEEVKL 211 + +++++D S SM D + K+D AI ++ ++ Sbjct: 107 ETTPPDAW---IVVLIDFSGSMNCSQDLNTKCDAKAVAKGKRKLDAAINALGTFIKLASE 163 Query: 212 IPDVNNVV-----------QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG-----V 255 + + G + K+ L + + + +L Sbjct: 164 RKGNTYLSIVPFGVEGKNDKPGACDYYPKVTSETLDNFNLVQDVKLTNFLGSLADKTPCA 223 Query: 256 STNSTPGLKYAYNQIFDMQGMR-QHCNTEDANYK----KIIVFMTDGENLSTKEDQQSLY 310 +TN LK + + R + E K I+ ++DG + ++ + Sbjct: 224 TTNFYQALKETVKFFKNDKEGRFYPKDKEGKPLKPQPRLSIILLSDGFDNNSNYQEVQKT 283 Query: 311 YCNEAKKRGAIVYAIGIR 328 N + +V+ +G Sbjct: 284 LANLQNNKDIVVHTLGYG 301 >gi|34783791|gb|AAH56811.1| Zgc:112265 protein [Danio rerio] Length = 927 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 67/196 (34%), Gaps = 33/196 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ ++D S SM KI ++ +L+++ GL+TF +I+ Sbjct: 277 VVFIIDRSGSMHG------RKIRQTRSALLTILKDLDEDDHF------GLITFDAEIDFW 324 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + +K + G TN + + I Sbjct: 325 RRELLQATKANRENAESFVKRIQDRGA-TNINDAVLAGVDMI---------NRNPRKGTA 374 Query: 289 KIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + + + ++ + EA +Y +G + +FL + N+ Sbjct: 375 SILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVA 434 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 435 RRIYEDSDADIQLQGF 450 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 37/265 (13%), Positives = 91/265 (34%), Gaps = 47/265 (17%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR----- 171 ++ + Y++ + F + ++ I + ++ ++ + Sbjct: 17 NVGDGKNKQKYNLDKGLSLDVKTLHKHFQFSSSTNQSIPIMVSVKTLDKTEDAPKGNQEA 76 Query: 172 -----------LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 LD++ V+D S SME KI K++ +LE + + Sbjct: 77 VKQDILENRPNLDLICVIDNSGSMEG------EKIQNVKKTLEYLLELLGDND------R 124 Query: 221 SGLVTFSNKIEEFFLL--EWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+ F++K + L + + I + G T+ G++ A+ + Sbjct: 125 LCLILFNSKATKLCHLMRTNNSNKPAFKEIINKIEANG-GTDINSGMELAFRVL----KD 179 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRVIRSHEF 335 R++ N + + ++DG+ D + + +++ G Sbjct: 180 RKYHNPVSSVF-----LLSDGQ--DGSADLKVRQSLERHLPQECFTIHSFGFGSDHDGPL 232 Query: 336 L-RACA-SPNSFYLVENPHSMYDAF 358 + + C+ +FY VE + + + F Sbjct: 233 MNKICSLKDGNFYYVEKINQVDEFF 257 >gi|297579199|ref|ZP_06941127.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536793|gb|EFH75626.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 1778 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 54/134 (40%), Gaps = 9/134 (6%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+T ++ ++LDVS SM + T++ + +S ++E+ + + VQ L Sbjct: 1159 ESETKQGANVQLMLDVSGSMGRDAGNGKTRLQVMKESAIQLIEQYQALGQTK--VQLILF 1216 Query: 225 TFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + I+ L W + + I LS G T+ +K A + Sbjct: 1217 SSDASIKTASGLLWMTVAEAKNYINALSANG-GTDYDDAIKLAQESWSGTINGQPLSGAT 1275 Query: 284 DANYKKIIVFMTDG 297 + +Y F++DG Sbjct: 1276 NVSY-----FLSDG 1284 >gi|290474520|ref|YP_003467400.1| putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] gi|289173833|emb|CBJ80615.1| Putative tellurium resistance protein (Hypothetical) [Xenorhabdus bovienii SS-2004] Length = 223 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 71/209 (33%), Gaps = 16/209 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K QT L ++++ D S SM S KI+ + + ML K + +Q Sbjct: 6 KFQIQTARPLPVIVLADTSGSMASDG-----KIEALNQGLKDMLLSFKDESRLRAEIQVS 60 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++TF N E L LQ I ST L A I D Sbjct: 61 VITFGNNKAEVNLPLAPAHLLQDFIP--LSAEGSTPLGGALSLASQMIEDKS------VI 112 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 YK ++V ++DG E S + + + A +A+ I + L A+ Sbjct: 113 PSRAYKPVVVLVSDGYPNDDWEGPFS-SFASGERSSKATRFAMSIGGDADEDMLSEFAND 171 Query: 343 NS--FYLVENPHSMYDAFSHIGKDIVTKR 369 + EN + F + + + Sbjct: 172 PEAPLFRAENASDIRRFFRAVTMSVSVQS 200 >gi|313247647|emb|CBY15808.1| unnamed protein product [Oikopleura dioica] Length = 765 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 70/201 (34%), Gaps = 22/201 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + VLD S SM ID+A +I+ L++V+ Q L+T + Sbjct: 4 LCFVLDTSASMNRRTAQGQRVIDLAKTAIDNFLKKVRAKEPSARTDQYLLLTTDREQTVR 63 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN-----YK 288 + + R++ L S + + A++ + + + + Sbjct: 64 VGPKDSPTIFWRELHNLQADDAS-DVGYAITTAFDILNMERFDHELDSIGRGRQPWSIRP 122 Query: 289 KIIVFMTDGENLSTKEDQQS-LYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 ++ ++DG L+ K ++ L + I + + + C + Y Sbjct: 123 SAVIVLSDGGMLTNKSLTRTDLELPDG----------ISVSIPS----VSLCEVTGGKSY 168 Query: 347 LVENPHSMYDAFSHIGKDIVT 367 V +P ++ + I ++ T Sbjct: 169 QVAHPAAVMETLQAINGELQT 189 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 45/273 (16%), Positives = 92/273 (33%), Gaps = 44/273 (16%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI-SRYKIPLKFCTF 144 N +M E N + + +V Q + S YK P K Sbjct: 232 QSNAQSMEAVQEGDANAPASSPSAYRLDKDI-VVYWRQQQNLPGSVDLITYKEPGKDKG- 289 Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 ++ +T + + T+ R D +VLD S SM F T ++ K Sbjct: 290 --------TFMLTVTPGDDLPAITEGR-DWTLVLDRSGSMSGKFS---TLLEGLRKGFAK 337 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG--VSTNST 260 + N+ V+ ++ F++ E ++ +LQ+ + + G TN Sbjct: 338 F--------NRNDRVR--VIMFNDNATEVTNGWVQATPENLQQVVGAVENAGPSGGTNLM 387 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 ++ A + DA+ I +TDGE + + + + +K+ Sbjct: 388 SAIQSALTGL-------------DADRTNAIWLVTDGE--ANVGETKQKAFIELLEKKDI 432 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 ++ + + L A ++ + + +S Sbjct: 433 RLFTFIMGNSANRPLLEAITKHSNGFAISVSNS 465 >gi|226949333|ref|YP_002804424.1| hypothetical protein CLM_2254 [Clostridium botulinum A2 str. Kyoto] gi|226843333|gb|ACO85999.1| hypothetical protein CLM_2254 [Clostridium botulinum A2 str. Kyoto] Length = 647 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 93/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFWKK-LIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + + YK L T + + Sbjct: 386 KDGQVKGKLDVDSFINFYPDFVEAEKKGNYK-NLPIFNRYLLETQADILP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+++ K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEVSRKALAVTLLS---IDDFNKYLKSNAEQ 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S + R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ +Q + KI+ +TDG + ++++ E + VYA Sbjct: 541 ISNRMTSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|94265396|ref|ZP_01289149.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] gi|93454098|gb|EAT04430.1| von Willebrand factor, type A [delta proteobacterium MLMS-1] Length = 756 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 74/207 (35%), Gaps = 41/207 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + L++ I++D S SM S ++ A +++ +L+ K N +V Sbjct: 258 PEKRLPLNIKILIDCSSSM------SGDSMEQARQALGEILKHFKADDHFN------IVM 305 Query: 226 FSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 F + E F + + R ++ L T L+ Q Sbjct: 306 FGDSCEALFERQVKATPANLKRAGRLLQVLDADMGCTELYEALQM----------TLQIP 355 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ I+ +TDGE + + + +AK R ++++G+ S + ++ A Sbjct: 356 GPDNTPAD--ILLITDGEVWN---HDRVVGLVKKAKHR---IFSVGVGSCVSEDLVKRLA 407 Query: 341 --SPNSFYLV----ENPHSMYDAFSHI 361 S + LV + + F I Sbjct: 408 GISGGACELVAPNEQMAEKIIRHFQRI 434 >gi|332360731|gb|EGJ38540.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK49] Length = 478 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 38/198 (19%), Positives = 60/198 (30%), Gaps = 36/198 (18%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D++ V+D S SM ID+ +IN + + + GL TFS++ Sbjct: 203 GSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFSDE 252 Query: 230 IE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQIFD 272 + F + + K L+ G T TP A NQI Sbjct: 253 VYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQIIS 310 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + KK +V +TD E K G ++ I Sbjct: 311 TYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAIEG 365 Query: 333 HEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 366 --IYKNFATEGRVLDIEN 381 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 38/287 (13%), Positives = 83/287 (28%), Gaps = 56/287 (19%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 N N I + R N I + + + Sbjct: 295 SNSNAVTSAIDQHIANSTIMILPIYDRMVGSGNNASVRIVNFGSFIIVASGRDKNRPYFD 354 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHI-----VMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + + C+ +P ++ + ++ + + ++VLD S S Sbjct: 355 MIYLGPPTRQYNVCSQMPPPPAETNLLDLAGNVSFYPEYQIIPTSQKPIQYVVVLDASGS 414 Query: 184 MESFFDSSIT-----------------------------------KIDMAIKSINAMLEE 208 M + FD +I +A K++ ++ Sbjct: 415 MSANFDGQCNNSGGVKQCANGPSGFPDVQVSNTGYDYWWTTESQRRIYVAKKALERLV-T 473 Query: 209 VKLIPDVNNVV------QSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLS------KFG 254 + +P Q +V F++ + + + L+ I L+ + Sbjct: 474 LSNMPGNPGYTNTRPSDQMAVVWFNDGVSSSQTQAFTNNPTTLKNYITTLNNVNGNYRSA 533 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 TN GL A + ++ N + YK++++F+TDG + Sbjct: 534 GGTNGAGGLYRA-SLLYQNAPKTVSFNGTNVEYKRVVLFVTDGVSNY 579 >gi|258647262|ref|ZP_05734731.1| BatB protein [Prevotella tannerae ATCC 51259] gi|260852911|gb|EEX72780.1| BatB protein [Prevotella tannerae ATCC 51259] Length = 339 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 79/213 (37%), Gaps = 29/213 (13%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPIT---SSVKVNSQTDARLDMMIVLDVSRSME 185 I+ + Y F+ + +++ + + + S+ +++ ++LDVS SM Sbjct: 44 IALVPAYSRRKPLVKFVLLLSALITLIVLLARPQNGLTTTSENKKGIEVAVMLDVSNSML 103 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + D S +++ A ++ +++ ++ N+ + G+ F+ + + + Sbjct: 104 AQ-DVSPNRLERAKLLVSTLIDRMQ-----NDKIALGV--FAGEAYPQLPITGDYGAAKL 155 Query: 246 KIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + ++ TN + A D + K I+ +TDGE+ Sbjct: 156 FLNSITPGMVTLQGTNLAAAINLADKSFTDKK-----------RVGKAIIIITDGEDHQG 204 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ + E + VY +GI + Sbjct: 205 GAEEAAAAAAKEGR----KVYILGIGNPGGAQI 233 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 50/294 (17%), Positives = 94/294 (31%), Gaps = 51/294 (17%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 N + D + + + Q G + I + +PL + + + + I K Sbjct: 304 NVVIDFKPYSQISNLNSYQ--GVIMDNIPQEDLPLDKMELLKNFVIDKGGTLLILGGDKS 361 Query: 165 NSQ----------------------TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 S + + ++IVLD S SM S+ + K+++A +S Sbjct: 362 FSAGNYHGTPLEEILPLTLKPEQILKKSNVAIIIVLDASGSMGSYSGGDM-KMELAKESA 420 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNST 260 +L+ ++ GL+ F + + L+ I +S G T Sbjct: 421 QLVLDLLEDKD------YFGLIAFDHSYQWIVPLQPLTNKEEAASLISRISP-GGGTALY 473 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 P LK A + + +H I+ +TDG+ Sbjct: 474 PPLKSAGESLLKVPIKSKH-----------IIAITDGQTEGGDFYNLVRNLAKY----KI 518 Query: 321 IVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKRIWY 372 V IGI + L+ A+ + FY N ++ K ++ I Sbjct: 519 TVSTIGIGEDANIPLLKDIANWGNGRFYHTWNIRNLPQLLLSETKALLRSNIVE 572 >gi|189536038|ref|XP_693697.3| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Danio rerio] Length = 1157 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 85/256 (33%), Gaps = 30/256 (11%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 S+ +G S I +Y + LK + + + V + D+ Sbjct: 240 SIQQQRNVSPKGLSADFIIQYDVELKDPMGDIQVDDGYFVHYFAPRGLPVVPK-----DV 294 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V+D+S SM TKI ++ ++L +++ N L+TFS+ + + Sbjct: 295 IFVIDISGSM------IGTKIKQTKAAMVSILSDLREGDYFN------LITFSDDVHTWK 342 Query: 235 LLEW------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 V + ++ + G + + L A + + Sbjct: 343 KDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQRV 402 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPN---- 343 +I+F+TDGE + + + + N K G ++ + LR A N Sbjct: 403 PMIIFLTDGEATIGETETDVILH-NAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVA 461 Query: 344 -SFYLVENPHSMYDAF 358 Y ++ F Sbjct: 462 RMVYEDDDAAIQLKGF 477 >gi|108758859|ref|YP_629631.1| hypothetical protein MXAN_1374 [Myxococcus xanthus DK 1622] gi|108462739|gb|ABF87924.1| hypothetical protein MXAN_1374 [Myxococcus xanthus DK 1622] Length = 424 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 60/165 (36%), Gaps = 24/165 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 ++++ VLD + SM + + KI SI + + + + P + GLV + Sbjct: 77 ARPEIEVVFVLDTTGSMSGLLEGAKRKIY----SIASRIAQGRPTPHLK----VGLVAYR 128 Query: 227 ---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN---STPGLKYAYNQIFDMQGMRQHC 280 + + + F L + + ++ G GL A +++ + Sbjct: 129 DVGDDYVTKRFDLSDDLDTVFANLRKFEAGGGGDTPEHVGRGLGEAVSKLSWSK------ 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + K I + D +D ++ +A+++ +V + Sbjct: 183 ---NREVMKAIFLVGDAPPAQRNDDWDFKHWAKKAREKHIVVNTV 224 >gi|154496734|ref|ZP_02035430.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] gi|150273986|gb|EDN01086.1| hypothetical protein BACCAP_01027 [Bacteroides capillosus ATCC 29799] Length = 786 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 65/178 (36%), Gaps = 29/178 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV---KLIPDVNN----VVQSG 222 ++++++D S SM + +A+ + + + L+ + G Sbjct: 121 QPYNIVLLIDKSGSMNATDPER-----LAVSAAGMFVNSLYNESLMDQATGAGGPRSRVG 175 Query: 223 LVTFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +++FS + + E VS + +I ++ V+T +T L A + G Sbjct: 176 VISFSADAQTETIPVELTSEAEVSFVAGEIDAITYDKVNTGATD-LGRAV-----LSGTE 229 Query: 278 QHCNTEDANYKKIIVFMTDGENLS------TKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +D K +I+ TDG + + A++ G +Y +G+ Sbjct: 230 MLRGAQDGVRKDMIILFTDGYTDALTPEGMERSSAMMAEGLEAARQLGCEIYVVGLNY 287 >gi|156741405|ref|YP_001431534.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232733|gb|ABU57516.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 329 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 62/190 (32%), Gaps = 30/190 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 + V+D S SM + ++ ++ ++L + GLV+F Sbjct: 144 VCFVVDASWSMAAE-----ERMRATKAAVLSLLR-----DAYQRRDRVGLVSFQRDYATL 193 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L V QR+++ + G T + GL Y + + +D ++V Sbjct: 194 LLPLTNSVDLAQRQLQQMPT-GGKTPLSRGLLLGYEVL-------ERARRQDPEVMPLLV 245 Query: 293 FMTDGENLSTKED----QQSLYYCNEAKKRGAIVYAIG-----IRVIRSHEFLRACASPN 343 +TDG+ + D +S + + I + + Sbjct: 246 LLTDGQANVSMSDLPPQAESYALADFIAAQSIPAVVIDTEHPIFERGLARQLAHHLK--G 303 Query: 344 SFYLVENPHS 353 S+Y +E+ Sbjct: 304 SYYRLEDVQE 313 >gi|6753236|ref|NP_033915.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Mus musculus] gi|81872883|sp|Q9Z1L5|CA2D3_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|4186073|emb|CAA09423.1| calcium channel alpha-2-delta-C subunit [Mus musculus] gi|147898121|gb|AAI40360.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] gi|151555329|gb|AAI48701.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] Length = 1091 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 71/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++T++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|156358479|ref|XP_001624546.1| predicted protein [Nematostella vectensis] gi|156211333|gb|EDO32446.1| predicted protein [Nematostella vectensis] Length = 688 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 66/186 (35%), Gaps = 25/186 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + V+ + +++ +LD S S+ + IT IK++ + Sbjct: 6 MHVSIVSGGEVELGFMLDTSTSLGGEANLKIT--LDFIKAVYGSFTISSSAY------RV 57 Query: 222 GLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+V F + + F S ++ + + G +T + GL +F Sbjct: 58 GVVIFGSSAKVAFDFSKFSSSAEIESGLSEIKLIGGATAAGQGLTTCNTALFS------- 110 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 ++ KK+++ + G++ + + K G ++ +G+ L Sbjct: 111 --KARSSAKKMLLVLIAGKSSDD------VGVASSMKTSGISIFVLGMGKAIDKTQLNMM 162 Query: 340 ASPNSF 345 AS S+ Sbjct: 163 ASQESY 168 >gi|325696397|gb|EGD38287.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK160] Length = 450 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 60/200 (30%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|282897191|ref|ZP_06305193.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197843|gb|EFA72737.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 232 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 64/165 (38%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ +I+ + + + +E+ V+ +VTF + + Sbjct: 32 VLLLDTSGSMQG------DRIEALNQGLLSFKDELVKNTLAARRVEVAIVTFDSYV---S 82 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 +++ V+ Q L+ G++T G+ A I Q + Y + VFM Sbjct: 83 VVQDFVTVDQFTPPILTAQGLTTM-GAGINKALEII---QERKSQYRANGIAYYRPWVFM 138 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + ++ + + + + +G+ H Sbjct: 139 ITDGEPQGEIDEVIEEATQRLRGDEFNKKVAFFTVGVENANMHRL 183 >gi|222100527|ref|YP_002535095.1| hypothetical protein CTN_1553 [Thermotoga neapolitana DSM 4359] gi|221572917|gb|ACM23729.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359] Length = 824 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 53/166 (31%), Gaps = 29/166 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ ++D + SME I+ + + L+ + I + + + + E Sbjct: 102 LDIVFLIDKTGSMEDH-------IESIKRQLKNFLDRLMKIGTD---FRIVIAEYGVEDE 151 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG-----LKYAYNQIFDMQGMRQHCNTEDAN 286 + GV + + G L +AY+ + + Sbjct: 152 PEWPSGAGVDTFYDSVMFEEISREIEEIGTGGEGWDLTWAYDAFLWALNLDW-----RES 206 Query: 287 YKKIIVFMTD-------GENLSTKE--DQQSLYYCNEAKKRGAIVY 323 +KI+V +TD G N + K+ G +Y Sbjct: 207 ARKIVVIITDVYVDSVFGPNWYYTSGCNTSMRAVDLALKESGIHLY 252 >gi|171911941|ref|ZP_02927411.1| Vault protein inter-alpha-trypsin domain protein [Verrucomicrobium spinosum DSM 4136] Length = 679 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 47/138 (34%), Gaps = 24/138 (17%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 K + D + VLDVS SM F I+ + + ++ +L+ + N + + Sbjct: 306 AKWEAGQTPPRDYLFVLDVSGSMNGFP------IETSKRLMSDLLKGLNPGDTFNILHFA 359 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQH 279 + + L ++ K LS+ T P L+ A Sbjct: 360 S----DSAVLSPKPLAATPENIHLATKDLSRHRGNGGTELLPALQRAL------------ 403 Query: 280 CNTEDANYKKIIVFMTDG 297 + + IV +TDG Sbjct: 404 ATPREVGVSRSIVILTDG 421 >gi|327272219|ref|XP_003220883.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Anolis carolinensis] Length = 1068 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 60/169 (35%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+D+S SM+ + +A +I +L+ + N+ V + Sbjct: 220 ATSPKDIVIVVDISGSMKGLL------LTIAKHTIVTILDTL----GENDFVNIIAYNDY 269 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + +E F H ++ + L GV T L A+ + + + Q Sbjct: 270 VHYVESCFKGILVQADRDNREHFKQLVDELHAKGVGT-VNKALTEAFRILREFRNAGQGG 328 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 329 LCNQA-----IMLITDGAMEDYEYVFEKFNWPD----RKVRVFTYLIGR 368 >gi|307719356|ref|YP_003874888.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] gi|306533081|gb|ADN02615.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] Length = 331 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 37/236 (15%), Positives = 78/236 (33%), Gaps = 54/236 (22%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ + D++++ D+SRSM D +++++A + ++ + + Sbjct: 77 GMEQETVKRRNADIVLLFDISRSM-LVRDVPPSRLEVAKEIALMLVSRISGA-------R 128 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G+V F K E L + L+ I L+ T+ GL A Sbjct: 129 WGVVAFKGKGELLLPLTPDLLGLEDAIGLLTPVLLRSPGTDVASGLSRALE--------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH---- 333 + +N +++++ ++DGE + A+ G V+ +GI Sbjct: 180 --AFPQQSNRQRLVILLSDGE----ALTGEIGPVLELARNLGVAVHTVGIGTESGGPVPL 233 Query: 334 --------------------EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVT 367 L+ A + F+ V + F H+ I Sbjct: 234 EGEDVLKKPSGEPVISRLDASLLKRIAEITGGRFFSVRDAEG--QTFQHVVSTIEE 287 >gi|224369820|ref|YP_002603984.1| OmpA2 [Desulfobacterium autotrophicum HRM2] gi|223692537|gb|ACN15820.1| OmpA2 [Desulfobacterium autotrophicum HRM2] Length = 503 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 65/199 (32%), Gaps = 29/199 (14%) Query: 169 DARLDMMIVL-DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D IV+ D S SME F + K +A ++ M + + + GL++F Sbjct: 131 APSVDNFIVIFDASSSMEEFSNGQ-EKFTVAKAFVDRMNQVLPEMGQNA-----GLISFG 184 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + ++ +T G ++++ G + Sbjct: 185 HDPSISRKPTLNTDGMKPY------------TTAGFAAEFDRVAAPGGTSALPLALNTAA 232 Query: 288 --------KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRA 338 K ++ ++DG+N+S K + ++ +Y + + L Sbjct: 233 MDLDGVLTKTAVIVVSDGKNISPKSIDAATQLKDKFGSS-LCIYTVLVGNDAGGAALLDN 291 Query: 339 CASPNSFYLVENPHSMYDA 357 ++ ++ N + A Sbjct: 292 ISNVSTCGFSSNADDLMTA 310 >gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 794 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 70/194 (36%), Gaps = 43/194 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + + + ++ F + + ++ Sbjct: 356 VVFVIDNSGSM------SGPSIEQAKQSLALAISRLTPND------RFNVIRFDDTMTDY 403 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F ++ L G + + ++ ++ Sbjct: 404 FKGLVAATPDNREKAIAYVRGLPADGGTEM-----------LPALEDALRNQGPVATGAL 452 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +VF+TDG ++Q L+ A + A V+ +GI ++PN++++ Sbjct: 453 RQVVFLTDG----AIGNEQQLFQEITANRGDARVFTVGIG-----------SAPNTYFMT 497 Query: 349 ENPHSMYDAFSHIG 362 + F+ IG Sbjct: 498 KAAEIGRGTFTQIG 511 >gi|148692793|gb|EDL24740.1| calcium channel, voltage-dependent, alpha2/delta subunit 3 [Mus musculus] Length = 1091 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 71/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++T++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|320103513|ref|YP_004179104.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319750795|gb|ADV62555.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 342 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 35/228 (15%), Positives = 69/228 (30%), Gaps = 50/228 (21%) Query: 174 MMIVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +M+VLD S SM + D ++ + A ++ + N+V GLV F+ Sbjct: 94 LMVVLDRSSSMNAPVGPGDRGPSRFEAARIALAEFI-----AGRPNDV--IGLVGFAALP 146 Query: 231 EEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + L+ + TN + D + + Sbjct: 147 DLAAVPSLDRDFLRAALLAQEIARPLEDGTN-----------LGDALALAADALRDQDAL 195 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIRSHEFLR--------- 337 K++V +TDG N + E G ++ + + E L Sbjct: 196 SKVVVLVTDGRNSPALPNPLDPQVAAELLDDLGITLHILALGTEDPTELLETETTSRPAT 255 Query: 338 ----ACAS------------PNSFYLVENPHSMYDAFSHIGKDIVTKR 369 AS + + NP ++ A + I + ++ Sbjct: 256 EPRTNLASLGSELERLAQRAGGQVFEITNPGALRAALAAIDRLETSRL 303 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 25/173 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + ++ M+VLD+S SM + D + +++D A ++ +++ + + G Sbjct: 81 KQENVQKRGIEAMLVLDISNSMMAQ-DIAPSRLDYAKMLLSQLIDRLTDD-------KMG 132 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ F+ + + +K + T + A D + Sbjct: 133 LIVFAGDAFIQMPITSDKVSAKMFLKTIQPDLIQRQGTAIGSAIDLAVKSFNDTKQSGG- 191 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I +TD EN + + A+ + V +GI Sbjct: 192 ---------RAIFLLTDAENHEDNAVEAAKM----ARDKNITVNVVGIGTPEG 231 >gi|85857862|ref|YP_460064.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85720953|gb|ABC75896.1| von Willebrand factor type A like domain [Syntrophus aciditrophicus SB] Length = 785 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 50/125 (40%), Gaps = 24/125 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + ++DVS SM F +D++ K + ++ ++ N ++ SG + + Sbjct: 312 IFIVDVSGSMHGFP------LDISKKLLANLIGSLRPTDRFNVLLFSG----GSSLMSEE 361 Query: 235 LLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L ++++ I + + G T P LK A + + N + +V Sbjct: 362 SLPATPENVRKAISVIEQQRGGGGTEILPALKRALS------------LKKRENDSRTVV 409 Query: 293 FMTDG 297 +TDG Sbjct: 410 IVTDG 414 >gi|268324441|emb|CBH38029.1| putitive magnesium-chelatase subunit [uncultured archaeon] Length = 705 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 22/156 (14%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M V+D S SM + +++ A ++ ++L + + G+V F + Sbjct: 509 MFVVDASGSMGANR-----RMESAKGAVLSLL-----LDSYQQRDKVGMVAFKGDQADVL 558 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L S L + G T GL+ N + + + I++ + Sbjct: 559 LPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAIP-------ILLLI 611 Query: 295 TDGENL-----STKEDQQSLYYCNEAKKRGAIVYAI 325 +DG S + +Q+ L +A+ +G V I Sbjct: 612 SDGRANVSAGGSKELEQELLALAEQARAKGIYVIVI 647 >gi|297194950|ref|ZP_06912348.1| toxic cation resistance protein [Streptomyces pristinaespiralis ATCC 25486] gi|197723405|gb|EDY67313.1| toxic cation resistance protein [Streptomyces pristinaespiralis ATCC 25486] Length = 238 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 23/158 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S++ + R + +V+D S SM+ ++ + + + L D V Sbjct: 25 SLQKHGLHGQRAAVYLVVDYSGSMKDYYKDG--SVQALADRVLGL--SANLDDDGTVPV- 79 Query: 221 SGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V FS ++ LE + + L G T+ + + D Sbjct: 80 ---VFFSTDVDAVTDIALENHHGRIDEIVSGLGHMGK-TSYHLAMDAVIDHYLDSGSTD- 134 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 ++VF TDG ++ ++ Y C AK Sbjct: 135 ---------PALVVFQTDGGPINKPAAER--YLCKAAK 161 >gi|307591429|ref|YP_003900228.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986283|gb|ADN18162.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 426 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 38/179 (21%), Positives = 64/179 (35%), Gaps = 22/179 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++LD+S SM TKI+ AIK+I E K D Q +V F + + Sbjct: 108 IIVLLDMSGSMAKEDSRGTTKIEGAIKAIREFTEIAK---DRGGNTQVSIVPFGDPGKNC 164 Query: 234 --FLLEWGV---------SHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR--- 277 + ++ + LQ + L+ STN L A + R Sbjct: 165 AGYPIDSNTLDNFSRVDDAKLQIFLDNLASLSPCASTNLYEPLSKAVRFLGKKNDSRFYP 224 Query: 278 --QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSH 333 N + + I+ ++DG + E Q KK IV+ +G + Sbjct: 225 LDSSGNPIEPQPRLSIILLSDGYHNKPNEAQDFQSLNQLLKKNNQIIVHTLGYGLTAQQ 283 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 64/173 (36%), Gaps = 23/173 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S K+ +++ +L+++ N L++FS ++ Sbjct: 277 VIFVIDKSGSM------SGRKMQQTREALIKILDDLSPKDQFN------LISFSGDAAQW 324 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L V+ + + G T+ + A Q+ + ++ + Sbjct: 325 KPLLVPASAENVNQARSYAAGIQAHG-GTDINEAVLMAV-QLLNSAKQKELMPEGTVSL- 381 Query: 289 KIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 I+ +TDG+ + + + EA ++ +G S+ FL A Sbjct: 382 --IILLTDGDPTMGETNPARIQRNVKEAIDGQYSLFCLGFGFDVSYAFLEKLA 432 >gi|168179747|ref|ZP_02614411.1| hypothetical protein CBN_2105 [Clostridium botulinum NCTC 2916] gi|182669274|gb|EDT81250.1| hypothetical protein CBN_2105 [Clostridium botulinum NCTC 2916] Length = 647 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 93/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFWKK-LIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + + YK L T + + Sbjct: 386 KDGQVKGKLDVDSFINFYPDFVEAEKKGNYK-NLSIFNRYLLETQADILP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+ + K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEASRKALAVTLLS---IDDFNKYLKSNAEK 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S++ R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSNIIRSIVKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ +Q + KI+ +TDG + ++++ E + VYA Sbjct: 541 ISNRMTSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAIQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|119468931|ref|ZP_01611956.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] gi|119447583|gb|EAW28850.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] Length = 664 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 64/172 (37%), Gaps = 34/172 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--E 231 M+ V+D S SM I+ A K++ L ++ N ++ F N + Sbjct: 320 MVFVVDTSGSMHG------QSIEQAKKALFYALSLLESDDSFN------IIGFDNNVTAM 367 Query: 232 EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L ++ + +R I L G T L + +RQ Sbjct: 368 SDRPLIASDFNLRRAERFIYSLEADG-GTEIQGALDAVLDGSTFDGFVRQ---------- 416 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +VF+TDG ++ +L+ +AK + ++ +GI + F+R A Sbjct: 417 --VVFLTDGS----VSNEATLFKNIQAKLGDSRLFTVGIGSAPNSFFMRRAA 462 >gi|38639545|ref|NP_943314.1| TerY [Klebsiella pneumoniae] gi|38016643|gb|AAR07664.1| TerY [Klebsiella pneumoniae] Length = 212 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 42/132 (31%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 3 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L + S G +T L N+I D + Sbjct: 57 RQAVPL---TDLLSFNLPSFSASG-TTALGEALSLTANRIDAEVQKTTAETKGDWRP--L 110 Query: 291 IVFMTDGENLST 302 + MTDG Sbjct: 111 VFLMTDGGPTDD 122 >gi|121583396|ref|YP_973827.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596650|gb|ABM40085.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 212 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 48/134 (35%), Gaps = 16/134 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM S I+ + ++ ++ P ++TF ++ Sbjct: 3 RLPVYLLVDTSGSM------SGEPIEAVKNGVQVLVSTLRQDPYALETAFLSIITFDSEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-K 289 + L +I + G + L A + + D M T + K Sbjct: 57 RQVVPL---TELANFQIPAIVATGTT-----ALGSALSLLADKIEMEVGRTTAEVKGDWK 108 Query: 290 IIVF-MTDGENLST 302 IVF MTDG Sbjct: 109 PIVFIMTDGSPTDD 122 >gi|332708216|ref|ZP_08428205.1| hypothetical protein LYNGBM3L_12190 [Lyngbya majuscula 3L] gi|332353010|gb|EGJ32561.1| hypothetical protein LYNGBM3L_12190 [Lyngbya majuscula 3L] Length = 453 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 42/245 (17%), Positives = 80/245 (32%), Gaps = 35/245 (14%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCT--FIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 L+ V + I + Y P+ T Y N+ + Sbjct: 43 LNATVEDEQVKIRIQVKNEYDQPVTNLTDENFQVYVNNDKVTFKPKHWKNPTKAEQPSAW 102 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++ +LD+S SM + D + + AM E +K D ++ Q +V F Sbjct: 103 IIFLLDMSGSM-AKPDKPGSSQSKLEGATAAMREFIKNTADRSSHTQVAIVPFGEPHPKN 161 Query: 230 ------------IEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 + +FF + G S LQ + L+ +TN L A N + + +G Sbjct: 162 CPKGGYPVNQATLNKFFPV--GDSELQNYLGELASKKPCAATNVYKPLTEAVNFLSEQRG 219 Query: 276 M-------RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-----KRGAIVY 323 + + + I+ ++DG + ++ + K IV+ Sbjct: 220 DPRFYPKKEKFWQPQPQQPRLSIILLSDGYHSTSATKEAEEQEFETLKRLIKVNDNIIVH 279 Query: 324 AIGIR 328 +G Sbjct: 280 TLGYG 284 >gi|47227140|emb|CAG00502.1| unnamed protein product [Tetraodon nigroviridis] Length = 762 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 89/256 (34%), Gaps = 34/256 (13%) Query: 134 RYKIPLK----FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 Y PL+ F + I + + + K+ + L++ I +D+S S++ Sbjct: 214 TYDTPLEVSQAFGSAIKESLTTLESINDVQGERKIRISKNGTLNIYIAVDISESIQK--- 270 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK- 248 ++ A K+I ++ ++ N L+ FS+++ E + + IK Sbjct: 271 ---DHVESAKKAILKLITKISSFSVSPNYE---LLFFSSELSEVVNILDFFENQPVDIKG 324 Query: 249 YLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHC--NTEDANYKKIIVFMTDG-ENLSTKE 304 L+KF V+ + L A+ I + + ++ I+ TDG N+ Sbjct: 325 RLTKFKVNEEHTGTDLNLAFKTILVRMALIKQRVGEKAFEEHRHAIIVFTDGVYNMGGSP 384 Query: 305 -------------DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP---NSFYLV 348 ++ + +Y GI LR + F+ + Sbjct: 385 LPTVAKIKHMVYMNKIDEETGQNPRDEYLDIYIFGIGAEIYESDLRPLTAGTGGEHFFKL 444 Query: 349 ENPHSMYDAFSHIGKD 364 ++ + F +I + Sbjct: 445 MEIQNLQETFDNIIDE 460 >gi|89100236|ref|ZP_01173103.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085086|gb|EAR64220.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 476 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 78/220 (35%), Gaps = 29/220 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ + +++LD S SM D K+++A ++ + + + Sbjct: 153 LPEPEQAAQSETGEEQEDGKALLLLDASSSMLLDVDGKQ-KMEIAKSAVRSFAKTIGEEN 211 Query: 214 DVNNVVQSGLVTFSNK--------IEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 DV+ V T +K I+E + L + + ++ + G T +K Sbjct: 212 DVSLYVYGHAGTQEDKDKQISCTTIDEVYPLQSYNEESFFKAVEGVEAKGW-TPLAGAIK 270 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVY 323 A D +G + IV +DG E +++ + + R + Sbjct: 271 AAREASMDYEG----------DITLYIV--SDGAETCDGNPVEEARLFAETNESRMVNI- 317 Query: 324 AIGIRV-IRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 IG ++ + L+ A + +N + + S+ Sbjct: 318 -IGFNADAKAEDQLKKVAEAGKGEYIGADNADQLNSSISN 356 >gi|301381604|ref|ZP_07230022.1| ppkA-related protein [Pseudomonas syringae pv. tomato Max13] gi|302059660|ref|ZP_07251201.1| ppkA-related protein [Pseudomonas syringae pv. tomato K40] gi|302131911|ref|ZP_07257901.1| ppkA-related protein [Pseudomonas syringae pv. tomato NCPPB 1108] Length = 663 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 74/228 (32%), Gaps = 34/228 (14%) Query: 157 PITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ SQ + ++ V+D + SM+ + I+ ++I + ++ ++ Sbjct: 221 PAQTAANATSQLKEFSAAVVFVIDSTISMDPY-------IERTREAIAKVYAQIAK-ENL 272 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ GLV F + + LE+ V+ + + LK A Sbjct: 273 GRQVKFGLVAFRSSTQAVPGLEY-VTKMYADPNTVKDGADFLAKAADLKQAKVSSKSFNE 331 Query: 276 MR-----QHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Q + D + + +V +T DG++ + + EA G +Y Sbjct: 332 DSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIY 391 Query: 324 AIGIRVIRS-----------HEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ + S Y + + AF Sbjct: 392 TLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPVDAGDLN-AFGS 438 >gi|196006922|ref|XP_002113327.1| hypothetical protein TRIADDRAFT_57367 [Trichoplax adhaerens] gi|190583731|gb|EDV23801.1| hypothetical protein TRIADDRAFT_57367 [Trichoplax adhaerens] Length = 933 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 31/196 (15%), Positives = 66/196 (33%), Gaps = 27/196 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLDVS SM ++ ++ L + + + + V G++TFS+ Sbjct: 316 VLVLDVSGSMRG------KPMEQLQQAATNFL--LNVAQNGSFV---GIITFSSAASIRS 364 Query: 235 LLE--WGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + QR I L T+ G++ + G + + Sbjct: 365 SLVQINDDADRQRLILLLPSGASGSTSIGAGIQAGVKILKASVGNKSPSGG-------TL 417 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLV-- 348 + ++DG + + V I + S + C + + V Sbjct: 418 IVLSDGRENRSPTIADVKK---QVLDNKITVQGISFGSLASVKLQSLCYETGGLSFFVPN 474 Query: 349 ENPHSMYDAFSHIGKD 364 ++ + +AF+ I + Sbjct: 475 DDTARLVNAFTTIAEK 490 >gi|168998772|ref|YP_001688040.1| TerY1 [Klebsiella pneumoniae NTUH-K2044] gi|238549793|dbj|BAH66144.1| tellurite resistance protein [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 212 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 52/152 (34%), Gaps = 15/152 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L +K P ++TF + Sbjct: 3 RLPVYLLLDTSGSMHG------EPIEAVKNGVQTLLTTLKQDPYALETAYVSVITFDSTA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L + S G +T L N+I D + Sbjct: 57 RQAVPL---TDLLSFNLPSFSASG-TTALGEALSLTANRIDAEVQKTTAETKGDWRP--L 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 + MTDG +D + +A K+G +V Sbjct: 111 VFLMTDG---GPTDDWRKGVNEFKAAKKGVVV 139 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 53/171 (30%), Gaps = 22/171 (12%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G + I+ A +P + L +G I++ + + + +D +++ Sbjct: 18 SDARGTIGIMFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMAVV---------- 67 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 R + SF L +D I T L P +++A Sbjct: 68 --GNTSFDADFGRQAFKTSASFAYALDGAAPGSDPLTI---TRLTFTQNPDG---TVTAE 119 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 + + + I R + + S+ K + VL + Sbjct: 120 ATATMRTTVVSII----GVRSLDLSFRSTAKGTTTLKISTITFKVLSAQGA 166 >gi|213966921|ref|ZP_03395071.1| ppkA-related protein [Pseudomonas syringae pv. tomato T1] gi|213928243|gb|EEB61788.1| ppkA-related protein [Pseudomonas syringae pv. tomato T1] Length = 653 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 74/228 (32%), Gaps = 34/228 (14%) Query: 157 PITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ SQ + ++ V+D + SM+ + I+ ++I + ++ ++ Sbjct: 211 PAQTAANATSQLKEFSAAVVFVIDSTISMDPY-------IERTREAIAKVYAQIAK-ENL 262 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ GLV F + + LE+ V+ + + LK A Sbjct: 263 GRQVKFGLVAFRSSTQAVPGLEY-VTKMYADPNTVKDGADFLAKAADLKQAKVSSKSFNE 321 Query: 276 MR-----QHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Q + D + + +V +T DG++ + + EA G +Y Sbjct: 322 DSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIY 381 Query: 324 AIGIRVIRS-----------HEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ + S Y + + AF Sbjct: 382 TLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPVDAGDLN-AFGS 428 >gi|32475537|ref|NP_868531.1| hypothetical protein RB8767 [Rhodopirellula baltica SH 1] gi|32446079|emb|CAD75908.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 291 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/97 (22%), Positives = 41/97 (42%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+++D S S++ S + + + A L + N + GL FS +E Sbjct: 77 LAVMLMVDCSASLDFGTQSQTKR--ELVTELGATLAM----SAIKNNDRVGLTLFSEDVE 130 Query: 232 EFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 + F G H+ R I+ + G T+ L++ Sbjct: 131 KSFPPRQGSRHVLRLIREMLTHPSSGSGTDVGAALEH 167 >gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1] gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1] Length = 757 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 70/196 (35%), Gaps = 33/196 (16%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +VM + ++ + ++++V+D S SM I A ++ L +K Sbjct: 356 YSLVMVLPPKTDEHALSTLPRELILVIDTSGSMAG------DSIVQAKSALLYALNGLKA 409 Query: 212 IPDVNNVVQSGLVTFSNKIEE--FFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYA 266 N ++ F++++ + L +HL ++ I L G + + A Sbjct: 410 EDSFN------IIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTE-----MALA 458 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVYA 324 N R +++ + ++FMTDG N D ++ Sbjct: 459 LNAALPRGINRLSESSQSL---RQVIFMTDGSVGNEQALFDLIRYQIGES------RLFT 509 Query: 325 IGIRVIRSHEFLRACA 340 +GI + F++ A Sbjct: 510 VGIGSAPNSHFMQRAA 525 >gi|314923048|gb|EFS86879.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966820|gb|EFT10919.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093260|gb|EFT65236.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103482|gb|EFT75458.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327646|gb|EGE69422.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL103PA1] Length = 322 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + + + T + + + G ++ IV Sbjct: 149 MPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|45655624|ref|YP_003433.1| BatB [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602595|gb|AAS72070.1| BatB [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 347 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 56/141 (39%), Gaps = 15/141 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + ++ +D++ ++DVS SM++ DSS T++ + + ML + + G Sbjct: 82 EKKEESFKGVDILFLVDVSLSMQA-IDSSPTRLAKFKEVLLRMLPSLSGN-------RFG 133 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ F+ + + VS ++ L V T +Q F Sbjct: 134 MIVFAGSPFLYCPMTTDVSAFSDYVRGLDVDMVGDRGTD-----LSQAFTKAEALLRSEK 188 Query: 283 EDANYKKIIVFMTDGENLSTK 303 +I++ +TDGE+ + Sbjct: 189 --VFRNRILILVTDGEDQNDP 207 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 70/462 (15%), Positives = 130/462 (28%), Gaps = 123/462 (26%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N +G + +L A+ + ++G+ ++ KT L + D + + N Sbjct: 16 SNAEGSINVLFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVTAKEFIAAN 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 ++ + ++ +F F V + L+IV Q ++ A Sbjct: 76 AQQSDVTASGIKAGESQALKAFNANASKVPFA-----TVSLSQLEIVRTGQ----TLDAT 126 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSS 191 Y ++ + ++ + + +SV + S LD +++DVS SM DS Sbjct: 127 VSYTATVQSTFGRIFGLSATTLTNRVNASVDLAS----YLDFYLMVDVSGSMGLPTKDSD 182 Query: 192 IT------------------------------KIDMAIKSINAMLEEV---KLIPDVNNV 218 KI + ++N + E+ P V N Sbjct: 183 AEVLAMQSKEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTPVVPNQ 242 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMR 277 + G+ F N++ L + L V T L Q+F + Sbjct: 243 YRIGIYPFINQLATLAPLTDTTTSLAALRTAAQCDKVWPLAFTNLLDTGSTQLFTNNDPK 302 Query: 278 QHCNTEDANYKK-----------------------IIVFMTDGENLS------------- 301 + +++ + +TDG S Sbjct: 303 TGTGSGGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTFS 362 Query: 302 ---------TKEDQQSLYY-------CNEAKKRGAIV---YA-------------IGIRV 329 D C + K GA + Y I Sbjct: 363 GNPSKFAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWEN 422 Query: 330 IRSHEF-------LRACASPNSFYLVENPHSMYDAFSHIGKD 364 R ++F LR CASP FY + + + K Sbjct: 423 GRVNQFSPTLADPLRKCASPGFFYTANTQDDITASLGAMFKQ 464 >gi|113867619|ref|YP_726108.1| hypothetical protein H16_A1608 [Ralstonia eutropha H16] gi|113526395|emb|CAJ92740.1| conserved hypothetical protein containing von Willebrand factor (vWF) type A [Ralstonia eutropha H16] Length = 547 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 61/195 (31%), Gaps = 29/195 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I +D+SR+M++ D + T+++ A I A+L ++ + ++ Sbjct: 99 LAIAVDLSRTMDA-VDVTPTRLERAKLKIKALLAR-------RGGGRTAIYAYAGSTHLV 150 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + LQ + L T P Q + I+F Sbjct: 151 LPLTDDTNLLQTFVDALQ-----TRIMPVPGRDMAQALRTIDA----DLAREPVPGTILF 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 +TDG + + + + + + I + LR A + + Sbjct: 202 LTDG--VDPAAGRAFRAQADSGRSQPV---VLAIGTEQGGP-LRNAA--GGYVEQDGARV 253 Query: 354 M----YDAFSHIGKD 364 AF G D Sbjct: 254 FARLDVAAFKRFGDD 268 >gi|330879011|gb|EGH13160.1| serine/threonine-protein kinase PpkA [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 663 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 74/228 (32%), Gaps = 34/228 (14%) Query: 157 PITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ SQ + ++ V+D + SM+ + I+ ++I + ++ ++ Sbjct: 221 PAQTAANATSQLKEFSAAVVFVIDSTISMDPY-------IERTREAIAKVYAQIAK-ENL 272 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ GLV F + + LE+ V+ + + LK A Sbjct: 273 GRQVKFGLVAFRSSTQAVPGLEY-VTKMYADPNTVKDGADFLAKAADLKQAKVSSKSFNE 331 Query: 276 MR-----QHCNTEDANYK--KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVY 323 Q + D + + +V +T DG++ + + EA G +Y Sbjct: 332 DSYAGVMQAIDKVDWSPFGARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIY 391 Query: 324 AIGIRVIRS-----------HEFLRACASPNSFYLVENPHSMYDAFSH 360 + ++ + S Y + + AF Sbjct: 392 TLHLKTAAGAKDHAKAEAQYQALSTYTGTNTSLYYPVDAGDLN-AFGS 438 >gi|145592113|ref|YP_001154115.1| VWA containing CoxE family protein [Pyrobaculum arsenaticum DSM 13514] gi|145283881|gb|ABP51463.1| VWA containing CoxE family protein [Pyrobaculum arsenaticum DSM 13514] Length = 357 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 59/187 (31%), Gaps = 36/187 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + + LDVS SM+ + TK+ +A +I ++++ + L F+ + Sbjct: 200 VYVALDVSGSMKEYMGG-ATKLKIAKDAIARYIKQMAELRGN-----VSLTLFNADADYM 253 Query: 234 FLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + +L+ +I T L ++ R H + Sbjct: 254 WTPHPAHRYLKEMLEILRYVYSMGGTEIASAL-----ELLHADAARSH-----------V 297 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVE 349 V ++DG ++ L + + ++ + L+ A + + + Sbjct: 298 VIISDGRTNDP---EKVLQLAKKFR----RIHTVA---AERSRLLKQIAKITGGKYRELN 347 Query: 350 NPHSMYD 356 + Sbjct: 348 PTLDLLS 354 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 9/51 (17%), Positives = 27/51 (52%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +G + I+ + L ++ +G+ +++ ++ K+ L + +D + AA + Sbjct: 18 RGAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVT 68 >gi|324995677|gb|EGC27589.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK678] Length = 450 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 93/321 (28%), Gaps = 41/321 (12%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 + + A +I + K GD++ KN + + N V Sbjct: 52 EAVSSPIEVAGAEIAADTTKPVVEESAAKPGDLIDVSKNVSPVDVKESQEGNQTVRVETA 111 Query: 110 IVRSTSLDIV---VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 V T ++ V E + + + Y +S Sbjct: 112 TVDVTKTEVAPAKVDKAPEPITRRTENTLSGTDENGNPYTQYERVDKTTTITYTSTPPTV 171 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ V+D S SM ID+ +IN + + + GL TF Sbjct: 172 TKAGSADIVFVVDRSGSMGGT-------IDIVRANINEFVRNITK---EGITARFGLATF 221 Query: 227 SNKI--------EEFFLLEWGVS--------HLQRKIKYLS-KFGVSTNSTPGLKYAYNQ 269 S+++ +E +L S L++ + + G T TP A NQ Sbjct: 222 SDEVFGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQ 279 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 I + KK +V +TD E K G ++ Sbjct: 280 IISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKA 334 Query: 330 IRSHEFLRACASPNSFYLVEN 350 I + A+ +EN Sbjct: 335 IEG--IYKNFATEGRVLDIEN 353 >gi|268324906|emb|CBH38494.1| conserved hypothetical protein [uncultured archaeon] Length = 709 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 57/156 (36%), Gaps = 22/156 (14%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M V+D S SM + +++ A ++ ++L + + G+V F + Sbjct: 513 MFVVDASGSMGANR-----RMESAKGAVLSLL-----LDSYQQRDKVGMVAFKGDQADVL 562 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L S L + G T GL+ N + + + I++ + Sbjct: 563 LPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHRDEEAIP-------ILLLI 615 Query: 295 TDGENL-----STKEDQQSLYYCNEAKKRGAIVYAI 325 +DG S + +Q+ L +A+ +G V I Sbjct: 616 SDGRANVSAGGSKELEQELLALAEQARAKGIYVIVI 651 >gi|260577971|ref|ZP_05845896.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] gi|258603897|gb|EEW17149.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] Length = 551 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 65/209 (31%), Gaps = 40/209 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVVQ---SGLVTF 226 D +LD S SM ++ +N +++ + + L+ F Sbjct: 361 PADSNFLLDTSGSMRGN------RLADLKGILNRLIDGTAGEAGNPKGFGRRETITLMPF 414 Query: 227 SNKI------EEFFLLEWGVS-HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 S+K+ E + S L+ + L G T + AY+++ + G Sbjct: 415 SSKVADGYTQEHYDPDSAEQSRGLRDYVNGLQPRGE-TAIYDAVLRAYDRVGEGGGSLNS 473 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK------KRGAIVYAIGIRVIRSH 333 IV MTDGE+ S Q+ + + K R V+ I Sbjct: 474 -----------IVLMTDGESNSGTNRQEFI---TKMKRKMAETDRKIPVFVILYGEASED 519 Query: 334 EFLRACA-SPNSFYLVENPHSMYDAFSHI 361 E + + + AF I Sbjct: 520 EMNFLADFTGGKVFNAR-SGDLSKAFEEI 547 >gi|281358203|ref|ZP_06244686.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281315293|gb|EFA99323.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 232 Score = 45.1 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 71/184 (38%), Gaps = 29/184 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAI----KSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++ ++DVS SM + +++D+ +++ + + I + FS+ Sbjct: 62 VVFLVDVSGSMGAVTPEGGSRLDVMKRELRRAVGSAVASANRIGAPKEAGNFRVWAFSSG 121 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 ++ F LE + ++ L++F G STN + A+ +I ++ Q Sbjct: 122 LQLFPDLEPCGFRDRSAVERLNRFVGALGAGGSTN----MLMAWRKILELTKHGQLD--- 174 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACAS 341 + F++DG+ ++ + K +AIG+ LR A A Sbjct: 175 ------TVYFLSDGDPSDCSAEELTRLLTRLPKDVTVHCFAIGLDS----SLLREIAAAH 224 Query: 342 PNSF 345 ++ Sbjct: 225 NGNY 228 >gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing protein [Rhizobium etli CNPAF512] Length = 794 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 76/191 (39%), Gaps = 37/191 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + + + ++ F + + ++ Sbjct: 356 VVFVIDNSGSM------SGQSIEQARQSLALAISRLSKDD------RFNVIRFDDTMTDY 403 Query: 234 F--LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L+ + ++ I Y+ G++ + + A QG + + Sbjct: 404 FNGLVAASPDNREKAITYVR--GLTADGGTEMLPALEDALRNQGPVASGAL------RQV 455 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 VF+TDG ++Q L+ A + A V+ +GI ++PN++++ + Sbjct: 456 VFLTDG----AIGNEQQLFQEISANRGDARVFTVGIG-----------SAPNTYFMTKAA 500 Query: 352 HSMYDAFSHIG 362 F+ IG Sbjct: 501 EIGRGTFTAIG 511 >gi|315639746|ref|ZP_07894885.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] gi|315484523|gb|EFU74980.1| conserved hypothetical protein [Enterococcus italicus DSM 15952] Length = 1228 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 53/156 (33%), Gaps = 26/156 (16%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+++V+D+S SME ++ + I + ++ V+ V G V Sbjct: 351 EIKDIKPIDIVLVIDMSGSMEP------SRAPAVRQGIKDFFKYIQDA-GVSQYVNVGFV 403 Query: 225 TFSN---------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 +S+ E + K F T + G+K + + Sbjct: 404 GYSSINQSNYVGLIQEGIQPISNTTHTNNINTKLNRTFIGGTFTQLGIKTGADMLTSYNS 463 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 + K+++ +TDG + + +S Sbjct: 464 TNE----------KMMILLTDGVPTYSYKVNESTTI 489 >gi|308473011|ref|XP_003098732.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] gi|308268332|gb|EFP12285.1| hypothetical protein CRE_04177 [Caenorhabditis remanei] Length = 396 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 63/181 (34%), Gaps = 8/181 (4%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SR M + S++ +++ N + + P + GLVT+++ Sbjct: 37 LDVIAVVDNSRGMTNKGLSNVASSILSVFGENTRIGSNSVEP---RTTRLGLVTYNSVAS 93 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L S + ++ + + + Q N+ A+YK+++ Sbjct: 94 QKADLNQYQSIADAYTGVFDALSTTVDTIQSYLATGLALAERMLVDQTVNSTRAHYKRVM 153 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPNSFYLV 348 + + + L K + + S L+ ASP + Sbjct: 154 IVY--ASEYNGNGESDPLPLAERLKLSNINIITVAYEQPGSVGLLQGLTQIASPGFSFSS 211 Query: 349 E 349 E Sbjct: 212 E 212 >gi|296225455|ref|XP_002758485.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 1 [Callithrix jacchus] gi|296225457|ref|XP_002758486.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 2 [Callithrix jacchus] Length = 1091 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|219127467|ref|XP_002183956.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404679|gb|EEC44625.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 694 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 56/154 (36%), Gaps = 29/154 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++++VLD S SM + ++ + + ++E + + + +V FS Sbjct: 195 DSGPKNVVLVLDTSSSM-----GNYNRLGLLQDAAIRIVETLSVGD------RIAIVQFS 243 Query: 228 NKIEEF----FLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F W L+ ++ L + TN+ + + D H Sbjct: 244 SQAKPFESKGQTFFWATKENKIALKTYVEDL-ELNEGTNTLDAFNKTFAVLDDSIDQELH 302 Query: 280 CNTEDANYKKIIVFMTDG----ENLSTKEDQQSL 309 A ++F+TDG TK + ++ Sbjct: 303 NECITA-----VLFLTDGVVSPVMNETKSETETK 331 >gi|159899109|ref|YP_001545356.1| FHA domain-containing protein [Herpetosiphon aurantiacus ATCC 23779] gi|159892148|gb|ABX05228.1| FHA domain containing protein [Herpetosiphon aurantiacus ATCC 23779] Length = 785 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 71/216 (32%), Gaps = 35/216 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN--NVVQSGLVTFS 227 + + +V+D S SM F I+ ++ N +++ + + V Sbjct: 91 TPIKIALVMDQSGSMNPF-------IEEVKRAANQAIDQAPANAKIAVFTFTRMNSV--- 140 Query: 228 NKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + F + ++ I Y S+ G T A + + NT Sbjct: 141 DVYLPAFDFSDDRNAVKDYINQNYRSEPGGETCLYTAAHQATDFLL---------NTLKP 191 Query: 286 NYKKIIVFMTDGENLSTK----EDQQSLYYCNEA---KKRGAIVYAIGI-----RVIRSH 333 ++ I+ TDG++ D+ + +A + +Y IG+ I+ Sbjct: 192 EERRAIILFTDGKDEDINGNQCSDKTVIDVTTKAGPTQGTKTPIYTIGLCSADCARIQPE 251 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + + + LV M AF I I ++ Sbjct: 252 SLRQISENTQAISLVGPRDQMSAAFITIMDSIKNQK 287 >gi|28212256|ref|NP_783185.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Rattus norvegicus] gi|81871225|sp|Q8CFG5|CA2D3_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|27450708|gb|AAO14654.1|AF486278_1 calcium channel alpha-2 delta-3 subunit [Rattus norvegicus] Length = 1085 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 72/192 (37%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++T++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IITYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNVLSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ +R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPERKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|7105926|emb|CAB75962.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 997 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 157 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 204 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 205 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 263 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 264 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 314 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 315 WMACANKGFFTQ 326 >gi|314988185|gb|EFT32276.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] Length = 322 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + + + T + + + G ++ IV Sbjct: 149 MPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|226307531|ref|YP_002767491.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] gi|226186648|dbj|BAH34752.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] Length = 233 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 75/213 (35%), Gaps = 21/213 (9%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q + L +V DVS SME I ++ A+ E+ P + ++ + +++ Sbjct: 1 MQEASLLPFYLVFDVSYSMEPV-------IGEVNNAMRALKNEILKDPILGDIARVCVLS 53 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 FS++ + + + + T+ P +I + + E Sbjct: 54 FSDEARIDVPMCDLADDTRITREDFLQVRGGTSFAPIFDLIGERIAADIADLKG-HGEGK 112 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRS-HEFLRACASP 342 ++ + F+TDG + S + + Y + + + E LRA P Sbjct: 113 VFRPTVFFVTDGVPTDAVHEWNSAFT----RLTSVKAYPNLVPFGLGDADEEVLRAITFP 168 Query: 343 -----NSFYLVENPHSMYDAFSHIGKDIVTKRI 370 F++ S A I + IVT+ + Sbjct: 169 PYRQDGYFFMANAGTSAEQAMQAITR-IVTQSV 200 >gi|317012798|gb|ADU83406.1| hypothetical protein HPLT_05065 [Helicobacter pylori Lithuania75] Length = 219 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 69/204 (33%), Gaps = 13/204 (6%) Query: 175 MIVLDVSRSMESFFD----SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++LD S SM T+I + I M+E +K + +VTF Sbjct: 18 FLLLDTSGSMNESLGDRTRDDRTRIGVLNLCIQKMIETLKQEAKKELFNKMAIVTFGENG 77 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ K LS G + L A+ D+ + YK Sbjct: 78 AVLHTPFDDIKNIN--FKPLSASGGT-----PLDQAFKLAKDLIEYKD--TFPTKFYKPY 128 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + ++DGE K + + ++ + ++ ++I I ++ + + + ++ Sbjct: 129 SILVSDGEPNDDKWQEPLFNFHHDGRSAKSVCWSIFIGDREANPQVNKDFGKDGVFYADD 188 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 + F + + I K Sbjct: 189 VEKLVKLFEIMTQTISKGSASIKK 212 >gi|168998769|ref|YP_001688037.1| TerY3 [Klebsiella pneumoniae NTUH-K2044] gi|238549790|dbj|BAH66141.1| hypothetical protein KP1_p252 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] Length = 346 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 60/188 (31%), Gaps = 15/188 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ +++ + ++ +++ P V ++ F+ Sbjct: 3 RLPVFFVLDCSESM---IGENLKRMN---DGLQKIVSDLRKDPHALETVWVSVIAFAGIA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G TN L+ QI D + Sbjct: 57 RTIVPL----HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTLEEKGDWKP--V 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + AKK I A+GI LR + N EN Sbjct: 111 VYLLTDGRPTDDISAEVKRWNEFYAKKVNMI--AVGIGPSVDLSVLRQL-TENVLSFTEN 167 Query: 351 PHSMYDAF 358 + F Sbjct: 168 QEGDFSRF 175 >gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] Length = 865 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 52/167 (31%), Gaps = 36/167 (21%) Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL---- 235 VS SM+ +D A +++ L + + G++ FS+ Sbjct: 299 VSGSMQG------ESLDQAKRALRLCLRHLAEGD------RFGVIAFSSDFRALEPSLAP 346 Query: 236 -LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + + L G + P L DA +++V + Sbjct: 347 FTQATLKAADAFVDGLRADGGTEMLNPLLAAV-------------GMLGDAERDRVVVLL 393 Query: 295 TDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 TDG+ + + + + +G +Y GI S + A Sbjct: 394 TDGQVGNEAQIVDRVVQ-----RGKGVRIYTFGIGTNVSDVLVNDLA 435 >gi|146162766|ref|XP_001010021.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146146274|gb|EAR89776.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 783 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 43/107 (40%), Gaps = 17/107 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 LD++ V+D+S SM K++ K+I ++ + + L+ F+ Sbjct: 81 DRQPLDLIFVIDLSISMRG------KKMNQLKKTICNLINFLNEND------RMALIGFN 128 Query: 228 NKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 N + F L + + + + G+ TN T G+ A Q+ Sbjct: 129 NSAQNLFPLSHLTQQNKKKVTQILNSILPMGL-TNITAGMMEAIKQL 174 >gi|119585707|gb|EAW65303.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Homo sapiens] Length = 992 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 157 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 204 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 205 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 263 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 264 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 314 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 315 WMACANKGFFTQ 326 >gi|160897741|ref|YP_001563323.1| hypothetical protein Daci_2300 [Delftia acidovorans SPH-1] gi|160363325|gb|ABX34938.1| conserved hypothetical protein [Delftia acidovorans SPH-1] Length = 277 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 66/203 (32%), Gaps = 36/203 (17%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDVN 216 S + + + V+D S SM S +K D +++ ++ V Sbjct: 2 SYQAQISRTSPTAFLFVVDQSGSMADKMSSGRSKAEFVADALNRTLVNLVTRCSKSEGVR 61 Query: 217 NVVQSGLVTFS--------------------NKIEEFF--------LLEWGVSH-LQRKI 247 + G++ +S ++IE ++ G ++ I Sbjct: 62 DYFDVGVIGYSGTTVGNGFTGPLSGKILNAISEIERSPLRVEDRKRKMDDGAGGIIETSI 121 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 K+ F S N + A + + + C+ +Y ++ +TDGE+ + Sbjct: 122 KFPVWFEPSANGGTPMHAALTK--AAEELVAWCDAHPESYPPTVLHVTDGESSDGDPEAI 179 Query: 308 SLYYCNEAKKRG-AIVYAIGIRV 329 + + G +V I + Sbjct: 180 ASHLGQIRTNDGEVVVLNIHVSA 202 >gi|73985413|ref|XP_533789.2| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Canis familiaris] Length = 1128 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 288 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 335 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 336 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 394 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 395 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 445 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 446 WMACANKGFFTQ 457 >gi|38639542|ref|NP_943311.1| hypothetical protein LV053 [Klebsiella pneumoniae] gi|38016640|gb|AAR07661.1| hypothetical protein LV053 [Klebsiella pneumoniae] Length = 351 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 60/188 (31%), Gaps = 15/188 (7%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ +++ + ++ +++ P V ++ F+ Sbjct: 8 RLPVFFVLDCSESM---IGENLKRMN---DGLQKIVSDLRKDPHALETVWVSVIAFAGIA 61 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + G TN L+ QI D + Sbjct: 62 RTIVPL----HDIVSFYPPRLPVGGGTNLATALRELTTQIDSQVRKTTLEEKGDWKP--V 115 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 + +TDG + + AKK I A+GI LR + N EN Sbjct: 116 VYLLTDGRPTDDISAEVKRWNEFYAKKVNMI--AVGIGPSVDLSVLRQL-TENVLSFTEN 172 Query: 351 PHSMYDAF 358 + F Sbjct: 173 QEGDFSRF 180 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 38/174 (21%), Positives = 71/174 (40%), Gaps = 19/174 (10%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++IVLD+S SM ++ ++++++A ++I+A+L++ + D V+ LVTFS+ + Sbjct: 2083 ILIVLDISGSMADASGVPGLSRLELAKQAISALLDKYDDLGD----VKVQLVTFSSNATD 2138 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + W + + G TN + YN + N Sbjct: 2139 RTSV-WVDVATAKTLLAGLSAGGGTNYDAAVATMYNAFNTSGKLTGAQNVGY-------- 2189 Query: 293 FMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 F +DG+ D+ +L +A YAIG+ S+ L A Sbjct: 2190 FFSDGKPNEGDIGTADEATLKAFLDA--NNIKNYAIGLGSGVSNANLDPLAYDG 2241 >gi|25011520|ref|NP_735915.1| hypothetical protein gbs1478 [Streptococcus agalactiae NEM316] gi|24413059|emb|CAD47137.1| Unknown [Streptococcus agalactiae NEM316] gi|221193659|gb|ACM07862.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193672|gb|ACM07868.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193737|gb|ACM07900.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] gi|221193741|gb|ACM07902.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 901 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 38/185 (20%), Positives = 63/185 (34%), Gaps = 18/185 (9%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 KT + + D+ N G N ++L I S++ E N Sbjct: 119 KTTIQNSGDK----------NSTIGQNHEELDKQYPPTGIYEDTKESYKLEHVKGSVPNG 168 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 + V +EG I I + + N I + ++ V Sbjct: 169 ------KSEAKAVNPYSSEGEHIREIPEGTLSKRISEVGDLAHNKYKIELTVSGKTIVKP 222 Query: 167 -QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ VLD S SM + + + A K+ A+ VK I N+ + LVT Sbjct: 223 VDKQKPLDVVFVLDNSNSMN-NDGPNFQRHNKAKKAAEALGTAVKDILGANSDNRVALVT 281 Query: 226 FSNKI 230 + + I Sbjct: 282 YGSDI 286 >gi|310824462|ref|YP_003956820.1| hypothetical protein STAUR_7237 [Stigmatella aurantiaca DW4/3-1] gi|309397534|gb|ADO74993.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 913 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 64/198 (32%), Gaps = 22/198 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q + + +++D S SM TK+++A + + L + D + + Sbjct: 390 QRRTSVALSVLMDCSCSMGVTVPDGRTKMEVAAEGVVGALTLLNEKDDASVH-----MVD 444 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E F L G K+ G L+ QI +H Sbjct: 445 TEPHEIFSLSSVGEGLPLNKVARGFSGGGGIFVGEALREGKTQILRSDKATRH------- 497 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACAS--P 342 ++ +D + +D ++ ++ V IG+ + + LR A Sbjct: 498 ----VLLFSDAADSEEPDDYRATLAA--LRRENVTVSVIGLGTPKDSDADLLREVAQLGG 551 Query: 343 NSFYLVENPHSMYDAFSH 360 Y E+ S+ FS Sbjct: 552 GRIYFAEDALSLPRIFSQ 569 >gi|182625113|ref|ZP_02952890.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] gi|177909733|gb|EDT72159.1| von Willebrand factor type A domain protein [Clostridium perfringens D str. JGS1721] Length = 620 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 69/175 (39%), Gaps = 28/175 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQKPDEDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKASMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLS----- 251 + + + IP+ + +V F ++EE F + + + + Sbjct: 114 VDKV------IDNIPNS----RIAVVAFGTEVEEVFSFNDKNNFTSKEEYKNAIKDSYYY 163 Query: 252 -KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 +TN + A N+IF + ++N KK ++F +DG ++++ Sbjct: 164 ITGRGNTNIEGTWRRA-NEIFKNELN------NNSNSKKDVIFFSDGYPNVSRDE 211 >gi|149921119|ref|ZP_01909577.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] gi|149818006|gb|EDM77465.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] Length = 428 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 82/282 (29%), Gaps = 77/282 (27%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-----ESFFDSS---ITKIDMAIKSI 202 S P V + +M+V+D S SM + D + +T+ + + Sbjct: 78 SSTTSDPNCGEVSIVPTYVPP-QVMLVVDASGSMVNNSWDHDLDPNTPQVTRWNTLHGVV 136 Query: 203 NAMLEEV------------------KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + +++ P +N SG + E L+ G S + Sbjct: 137 STVMDNFGPAMYAGIQRFPSEEACPDATPMSSNCYNSGSCIVGTQPEVGVSLDNGAS-VI 195 Query: 245 RKIKYLSKFG----VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 I + T +T G+ A + H + + + ++ +TDG Sbjct: 196 AAIPGPTAGNTEIVGGTPATKGMNSAVS----------HLEQQPEAFPRYVLLITDGAAN 245 Query: 301 STKE----------DQQSLYYCNEAKKRGAIVYAIGI------------RVIRSHEFLR- 337 + D+ A G + +GI ++ F R Sbjct: 246 CDQALSFPDYIEQYDETLPTTVQAAFDGGITTFVVGIDIEDMLQGVGTDGSPEANPFERL 305 Query: 338 -----ACASPNS-------FYLVENPHSMYDAFSHIGKDIVT 367 A +P + FY N + DA I ++ Sbjct: 306 NDVAIAGGAPKNEGMDLEKFYNTTNQQELLDAIQAILGEVTD 347 >gi|153002167|ref|YP_001367848.1| putative outer membrane adhesin-like protein [Shewanella baltica OS185] gi|151366785|gb|ABS09785.1| putative outer membrane adhesin like proteiin [Shewanella baltica OS185] Length = 1215 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 70/202 (34%), Gaps = 34/202 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM IV+D S SM + ID A ++ +++ GLV+FS + Sbjct: 310 GDIDMQIVMDRSGSMFG------SPIDNAKQAAKILVDA-----TAEGSTAMGLVSFSGR 358 Query: 230 --------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ + GV L+ I + G ST G + A + + Q Sbjct: 359 SSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANG-STALFDGSQLALDNLSAYQASAASG 417 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR 337 ++ + DG +N S K + + A +++ G Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 338 ACASPNSFY-LVENPHSMYDAF 358 A A+ ++ + DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|115377018|ref|ZP_01464236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365996|gb|EAU65013.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 884 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 64/198 (32%), Gaps = 22/198 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 Q + + +++D S SM TK+++A + + L + D + + Sbjct: 361 QRRTSVALSVLMDCSCSMGVTVPDGRTKMEVAAEGVVGALTLLNEKDDASVH-----MVD 415 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + E F L G K+ G L+ QI +H Sbjct: 416 TEPHEIFSLSSVGEGLPLNKVARGFSGGGGIFVGEALREGKTQILRSDKATRH------- 468 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRACAS--P 342 ++ +D + +D ++ ++ V IG+ + + LR A Sbjct: 469 ----VLLFSDAADSEEPDDYRATLAA--LRRENVTVSVIGLGTPKDSDADLLREVAQLGG 522 Query: 343 NSFYLVENPHSMYDAFSH 360 Y E+ S+ FS Sbjct: 523 GRIYFAEDALSLPRIFSQ 540 >gi|74318654|ref|YP_316394.1| rubisco activation protein cbbO [Thiobacillus denitrificans ATCC 25259] gi|74058149|gb|AAZ98589.1| rubisco activation protein cbbO [Thiobacillus denitrificans ATCC 25259] Length = 773 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 66/188 (35%), Gaps = 29/188 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIP 213 P + + +M++LD+S S+ T ++++ ++++ + ++ + Sbjct: 562 ATPDPRINMSHRSDGRDIAVMLLLDLSESLNEKAAGAGQTILELSQEAVSLLAWSIEKLG 621 Query: 214 DV------NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 D ++ + + F K W ++ ++ + G ST +++A Sbjct: 622 DPFAIAGFHSNTRHDVRYFHIKGYSE---RWN-DDVKARLAAMEA-GYSTRMGAAMRHA- 675 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAI 321 H + KK+++ +TDG + + E ++G Sbjct: 676 ----------AHYLSARPADKKLMLILTDGRPSDVDAADERLLVEDARQAVKELDRQGIF 725 Query: 322 VYAIGIRV 329 Y I + Sbjct: 726 AYCISLDA 733 >gi|118357550|ref|XP_001012024.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293791|gb|EAR91779.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 853 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 58/171 (33%), Gaps = 31/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM S I A +++ L+ + I NVV G S + Sbjct: 316 FIFLLDRSGSM------SGQSIKQACEALVLFLQSL-PIDSYFNVVSFGS---SFEKLFP 365 Query: 234 FLLEWGVSHLQRK---IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 ++ +L++ I S T L +N + Y K Sbjct: 366 SSQKYNSQNLEQAVQIINQYSANLGGTEIYQPLHNVFN------------EKKIEGYNKQ 413 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACA 340 I +TDG+ + K+ + K +++IG + ++ A Sbjct: 414 IFLLTDGQVDNPKQVVNLIK-----KNNKFSRIHSIGFGNDADKQLIQETA 459 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 27/52 (51%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 N +G + I++ + + + ++++ I+ KT L + ID + AA ++ Sbjct: 5 NDRGNVAIISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALEL 56 >gi|167524114|ref|XP_001746393.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775155|gb|EDQ88780.1| predicted protein [Monosiga brevicollis MX1] Length = 1705 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 62/179 (34%), Gaps = 23/179 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +++V D+++VLD+S S++ + + + +A +IN++ P Sbjct: 743 VSPSPSATVVPQFCDFTEQDVVLVLDMSGSVDVADYNRM--LQVARATINSL-------P 793 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAY---N 268 +V++ LV L G S L + L + S AY Sbjct: 794 INEGLVRAALVLLKGTPSSPVSLAQGTSREALLDGVDNLVYS-STRPSAAATTLAYVRET 852 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + G R T ++ TDG++ ++ A G + +G Sbjct: 853 VLTAANGYRGGQAT--------VLLFTDGDSQEPFTQVEAEAQQLRALPGGVKIATVGF 903 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 63/180 (35%), Gaps = 19/180 (10%) Query: 169 DARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 DA +DM++V+D S SM + ++ S+ + + +++ + F Sbjct: 275 DAPIDMVLVIDGSGSMGHAAYNRSLDNAAALVSRLALASSNIRVAAVELKGTPITIFDFD 334 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-DMQGMRQHCNTEDAN 286 + ++H R ++ +N L + ++I +G R + Sbjct: 335 EYSSTNDMRNAFLAHKSRYVE-----NRGSNVAGTLDFIRSEILVSNKGHRLFADIP--- 386 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRACASPN 343 I+F TD +++ D + G ++ GI + + + L S Sbjct: 387 ---TIIF-TDAR--ASQSDIFIEEAAQRLRSVGGRLFVFGIDINTEAQEQQLLFIAGSNE 440 >gi|119510959|ref|ZP_01630081.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464398|gb|EAW45313.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 464 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 72/227 (31%), Gaps = 42/227 (18%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSIT----- 193 F+P T + + + + + V+D S SM T Sbjct: 11 PHREFMPAETAGQKLFLMLKLRPTKDVAAIPPSTTFTFVIDTSGSMYEIVAGDTTPTGKT 70 Query: 194 ----------------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 KID+ +S+ A++ +L + +V F + + L Sbjct: 71 YTVDGKEYTQVTGGKSKIDIVSESLLALIRSGRLGASD----RIAIVQFDDTASQIIGLT 126 Query: 238 WGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + L+ I L F T GL+ A + + Q ++ ++F T Sbjct: 127 SATEINKLEDAIAQLRTFSGGTRMGLGLRRALEMLNNQQ----------MTVRRTLLF-T 175 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 DG+ +D Q ++ + A+G+ + + L + Sbjct: 176 DGQTF---DDDQCRAIASDFATNNIPITALGVGEDFNEDLLTHLSDS 219 >gi|146304836|ref|YP_001192152.1| protoporphyrin IX magnesium-chelatase [Metallosphaera sedula DSM 5348] gi|145703086|gb|ABP96228.1| protoporphyrin IX magnesium-chelatase [Metallosphaera sedula DSM 5348] Length = 600 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 57/136 (41%), Gaps = 21/136 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +T + ++++LD SRSM+ +I +A + +L++ + + GLVTF Sbjct: 424 ETQGAIPILLLLDSSRSMDFSR-----RILVAKAILRELLQKAYQVRS-----KVGLVTF 473 Query: 227 S-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S ++ L + ++ + + G + S L A + + R+ N Sbjct: 474 SGSEARYDVPLTRNLRKVEEFVNGVRPAGKTPMS-MALYLALQIVNRERRSRRKLNP--- 529 Query: 286 NYKKIIVFM-TDGENL 300 +VF+ +DG+ Sbjct: 530 -----LVFLISDGKAN 540 >gi|54112397|ref|NP_060868.2| voltage-dependent calcium channel subunit alpha-2/delta-3 [Homo sapiens] gi|74723683|sp|Q8IZS8|CA2D3_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|22770596|gb|AAN06673.1| voltage-gated calcium channel alpha(2)delta-3 subunit [Homo sapiens] gi|187950675|gb|AAI37506.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] gi|187953583|gb|AAI37503.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] Length = 1091 Score = 44.8 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|330834639|ref|YP_004409367.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566778|gb|AEB94883.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 383 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 41/195 (21%), Positives = 71/195 (36%), Gaps = 31/195 (15%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + T +++LD S SME KI+ A K +E +K IP N V +T Sbjct: 32 TSTATGFHYIVLLDTSGSMEGL------KIESAKKGA---IELLKRIPQGNKV---SFIT 79 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 FS+++ L +I L+ G + T L A+N + + G Sbjct: 80 FSSRVNVVREFVD-PEDLTNEIVNLNAGGQTALFT-ALLTAFN-LHNKHG---------- 126 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG + +D Y + G+ + L++ A S Sbjct: 127 -VPSYVILLTDG---NPTDDTNIETYRRLQIPSSVQAVSFGLGDDYNESILKSLADKSGG 182 Query: 344 SFYLVENPHSMYDAF 358 FY V + + + Sbjct: 183 VFYHVSDAMEIPERL 197 >gi|270487809|ref|ZP_06204883.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|270336313|gb|EFA47090.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] Length = 207 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 41/132 (31%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L ++ P ++TF + Sbjct: 3 RLPVYLLLDTSGSMTG------EPIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L K+ L G +T L I + D Sbjct: 57 RQAVPL---TDLLNFKLPELVANG-TTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVF 112 Query: 291 IVFMTDGENLST 302 I MTDG Sbjct: 113 I--MTDGSPTDD 122 >gi|194389314|dbj|BAG61618.1| unnamed protein product [Homo sapiens] Length = 525 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 157 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 204 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 205 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 263 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 264 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 314 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 315 WMACANKGFFTQ 326 >gi|108760718|ref|YP_633803.1| hypothetical protein MXAN_5665 [Myxococcus xanthus DK 1622] gi|108464598|gb|ABF89783.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 293 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+V+DVS S + F TK ++A + + + N + GLV FS+++E Sbjct: 77 LTVMLVVDVSAS-KEFGSRERTKSEVAAEVAAQI-----AFSAIANNDRVGLVLFSDRVE 130 Query: 232 EFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKY 265 + G +H+ R + G T+ GL Y Sbjct: 131 KVVPPRKGRTHVLRLVSDILTFQPQGRGTDLAAGLNY 167 >gi|94986631|ref|YP_594564.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] gi|94730880|emb|CAJ54243.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] Length = 530 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 63/171 (36%), Gaps = 24/171 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 +K S T + I+LD+S SM S I++A + ++ + IP ++ Sbjct: 349 MKTESVTGIDTAVHILLDISGSMTSC-------IELAGAACYSVALALAAIPGIS----V 397 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHC 280 G+ F + + L + G TNS + + + Sbjct: 398 GVSAF---------PADYKEDVAATVYPLLRHGKRITNSFAAEAHGSTPMTEALWWVLGM 448 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + ++KI+ +TDG + ++++ AK+ G V IGI Sbjct: 449 LSTRPEHRKIVFVVTDGYPDDPETAKETIAV---AKRMGIEVLGIGIDAPA 496 >gi|149773087|emb|CAO01893.1| collagen type VI alpha 5 [Mus musculus] Length = 620 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 54/141 (38%), Gaps = 21/141 (14%) Query: 220 QSGLVTFS----------NKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYN 268 + L+++S K E F S ++ I L + L++A Sbjct: 20 RVALLSYSPSESSRRKGRVKTEFAFTTYDNQSIMKNYIYTSLQQLNGDATIGLALQWAME 79 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +F T + K+I+ ++ GEN KE +++ AK +G +V+ I + Sbjct: 80 GLFLG--------TPNPRKHKVIIVISAGENHEEKEFVKTVAL--RAKCQGYVVFVISLG 129 Query: 329 VIRSHEFLRACASPNSFYLVE 349 + E + P +L++ Sbjct: 130 STQRDEMEELASYPLDHHLIQ 150 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 50/335 (14%), Positives = 106/335 (31%), Gaps = 52/335 (15%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 V+ I++E G + D + + W + EL+ D Sbjct: 74 VYTYEAIVDEAGGLYQSPDAREDNYLKKHDMWKEDVQRELKKIEPALGEDASEEEI---- 129 Query: 119 VVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Q+ + I+ Y P + + + P T + + +++ IV Sbjct: 130 ----QHLFKQLLYIAGYDYTPFETIDRFSYVIFKNDMENPFTHE---KIEENMNVNVEIV 182 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD S SM T +++A +SI +L E+ + G+ F +K + + Sbjct: 183 LDASGSMVKKI-GDKTMMEIAKESIKQVLSEMPT------NAKVGVRVFGHKGDNTASKK 235 Query: 238 W---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 V +++ ++ + G T+ ++Y + + G + Sbjct: 236 DESCGANELIYPIEDLNVEGIEKALEPIQPTGW-TSIAKSIEYGVEDLKALDGEKTLN-- 292 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGIRV-IRSHEFLR-- 337 I+ +TDG + + K ++ IG V + L+ Sbjct: 293 -------ILYIITDG---IETCGGNPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQI 342 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A A+ + V + + I + + W Sbjct: 343 ADAAGGYYSSVNDADKLTGELYRINELAFSDYKWE 377 >gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 791 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 74/194 (38%), Gaps = 43/194 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + ++ + ++ F + + ++ Sbjct: 356 VVFVIDNSGSM------SGPSIEQARQSLALAISKLNPDD------RFNVIRFDDTMTDY 403 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F ++ L+ G T P L+ A ++ + Sbjct: 404 FKGLVTATPDNREKAIGYVRGLTADG-GTEMLPALQAA----------LRNQGPVASGAL 452 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +VF+TDG ++Q L+ A + A V+ +GI ++PN++++ Sbjct: 453 RQVVFLTDG----AIGNEQQLFQEITANRGDARVFTVGIG-----------SAPNTYFMT 497 Query: 349 ENPHSMYDAFSHIG 362 + F+ IG Sbjct: 498 KAAEMGRGTFTAIG 511 >gi|88601603|ref|YP_501781.1| magnesium chelatase, ChlI subunit [Methanospirillum hungatei JF-1] gi|88187065|gb|ABD40062.1| protoporphyrin IX magnesium-chelatase [Methanospirillum hungatei JF-1] Length = 680 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 66/208 (31%), Gaps = 26/208 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P ++ A R P + IV+ ++ + V+D S Sbjct: 443 PTGRDIALDATIRAVSPYQKMRLSDSLA----IVIRSDEVLQKKRIGKTATATLFVVDAS 498 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGV 240 SM +++ A +I ++LE N + GLV F + + L + Sbjct: 499 GSM-----GVEQRMEAAKGAIFSLLE-----DSYQNRDRVGLVAFRGEGADVVLPLTSSI 548 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-- 298 +++ L G T GL+ + + R+ ++V +TDG Sbjct: 549 DLAYQRLSELPT-GGKTPLAAGLQKSLTILM-----REKQKYPSLLP--LLVLITDGRAN 600 Query: 299 -NLSTKEDQQSLYYCNEAKKRGAIVYAI 325 K + ++ K G I Sbjct: 601 VGNGGKLKDEIGLITDDLVKAGIETVII 628 >gi|227551457|ref|ZP_03981506.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257895933|ref|ZP_05675586.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|293377027|ref|ZP_06623237.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] gi|227179401|gb|EEI60373.1| possible pilus subunit protein [Enterococcus faecium TX1330] gi|257832498|gb|EEV58919.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com12] gi|292644243|gb|EFF62343.1| von Willebrand factor type A domain protein [Enterococcus faecium PC4.1] Length = 477 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 61/149 (40%), Gaps = 22/149 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD+++V+D S SM +I ++ ++ + + + + G V + Sbjct: 286 KDITPLDLVLVVDWSGSMNDN-----DRIGEVKIGVDRFVDTL-ADSGITDKINMGYVGY 339 Query: 227 SNK--IEEFFLLEWGV-SHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 S++ ++ G ++ ++K ++ T + GL+ A + + G Sbjct: 340 SSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNG------ 393 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLY 310 +KK+IV +TDG + + Q+ Sbjct: 394 -----HKKVIVLLTDGVPTFSYKVQRVRA 417 >gi|17537921|ref|NP_496259.1| C-type LECtin family member (clec-60) [Caenorhabditis elegans] gi|3881710|emb|CAA88985.1| C. elegans protein ZK666.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 406 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 63/203 (31%), Gaps = 22/203 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 LD++ V+D S M + +T I I S+ + + P + GLVT++ Sbjct: 42 LDVVAVVDNSIGMT---NGGLTSIAANIASVVSSGTRIGTNPSEPRTTRLGLVTYNKAAA 98 Query: 229 ---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ L+ ++ R + +S S GL A + + Sbjct: 99 IQADLNQYQSLDDVYDNVFRALSSVSTSEES-YLANGLARA-EDVLEAGKQGY----NRT 152 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---LRACASP 342 +Y+++++ + K G V + L ASP Sbjct: 153 HYQRVVIVY--ASAYKGSGALDPVPVAERLKTSGVTVITVAYDQDGDGALLADLAKIASP 210 Query: 343 NSFYLVENPHSMYDAFSHIGKDI 365 Y N I + Sbjct: 211 P--YNFTNTEDNGQVIGEIQDAL 231 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 41/119 (34%), Gaps = 1/119 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R+ + +G + I+ + L ++ +G+ +++ ++ ++ L + D + AA + + Sbjct: 12 RHSLHRQQGAVAIIVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTS- 70 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + G + + D+ + V P N Y Sbjct: 71 AISLQVAEADGIAAGHANYAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKY 129 >gi|315105441|gb|EFT77417.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] Length = 322 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + + + T + + + G ++ IV Sbjct: 149 MPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|260797297|ref|XP_002593640.1| hypothetical protein BRAFLDRAFT_235690 [Branchiostoma floridae] gi|229278866|gb|EEN49651.1| hypothetical protein BRAFLDRAFT_235690 [Branchiostoma floridae] Length = 91 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 39/97 (40%), Gaps = 11/97 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++++LD S S+ S+ + + ++E + P Q G++ +S Sbjct: 4 RTPLDIILLLDGSGSVGSY------NFEKVKQFSQKLVETFDIGPSG---TQIGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 ++ + F + S L I ++ T + + Sbjct: 55 RVRQEFSMNSFQSKETLSNAIDDIAYMRGGTRTGRAI 91 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 50/149 (33%), Gaps = 17/149 (11%) Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE------WGVSHLQRKIKYLSK 252 +++ +L +++ + ++FS++I + V ++ + L Sbjct: 328 KEALLTILGDLRPAD------RFNFISFSSRIRVWQPGRLVPATPSAVRDAKKFVVMLPT 381 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYY 311 G T+ ++ + + + + I+F+TDG+ L Sbjct: 382 SGGGTDIDGAIQTG-SSLLRDHLSGRDAGPNSVSL---IIFLTDGQPTVGEVRPGAILGN 437 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 A + ++ IG+ + L A Sbjct: 438 ARAAVRDKFCIFTIGMGDDVDYRLLERMA 466 >gi|84385675|ref|ZP_00988706.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] gi|84379655|gb|EAP96507.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] Length = 520 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 31/209 (14%), Positives = 68/209 (32%), Gaps = 18/209 (8%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 +G + + L I + +E + + L + + + Sbjct: 16 LQRQQGVAAVWMGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIA--------VTI 67 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 ++ G ++N L + F D+ V + Y+++A Sbjct: 68 EDQPDQARGLATKYVENYVRDIKSTNLSADRFHQAEDEGAG--------VLEYIQYTVNA 119 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 + + +FIP + + + + D +D++ V D S SM + SS Sbjct: 120 KTTHDSWF-ASSFIPSFDQQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMNDRWGSS 178 Query: 192 IT-KIDMAIKSINAMLEEVKLIPDVNNVV 219 KID +I+A+ ++ + V Sbjct: 179 RHIKIDDLKTAIDAISSKILCTSIKQDYV 207 >gi|326789665|ref|YP_004307486.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540429|gb|ADZ82288.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 564 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 72/239 (30%), Gaps = 29/239 (12%) Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS--RHI 154 +R + + I D+ + Q A + S + Sbjct: 313 GIRHDSPIYAIGDLSAEKQAILREFVQFATSETYAAKAVEYGFNGLDEYKSEMKSVDKQQ 372 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + K ++ + V D+S SM+ +++ K +++ + I Sbjct: 373 ITKAQKLWKEKKNGTKKISAVFVADISGSMDGEPLNNLKK---------SLITGAEYIGK 423 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 N+ GLVT+S+ + + S + L G T + G+ A + Sbjct: 424 DNS---IGLVTYSDDVNINLPVGKFDLNHRSLFTGAVMDLEASG-GTATFDGMIVALKML 479 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + + + K ++ ++DGE K VY IG Sbjct: 480 MEEKAA-------NPDAKLMLFVLSDGETNRGHSLNDIEGILRTLK---IPVYTIGYNA 528 >gi|160876887|ref|YP_001556203.1| outer membrane adhesin-like protein [Shewanella baltica OS195] gi|160862409|gb|ABX50943.1| outer membrane adhesin like proteiin [Shewanella baltica OS195] gi|315269091|gb|ADT95944.1| outer membrane adhesin like proteiin [Shewanella baltica OS678] Length = 1215 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 70/202 (34%), Gaps = 34/202 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM IV+D S SM + ID A ++ +++ GLV+FS + Sbjct: 310 GDIDMQIVMDRSGSMFG------SPIDNAKQAAKILVDA-----TAEGSTAMGLVSFSGR 358 Query: 230 --------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ + GV L+ I + G ST G + A + + Q Sbjct: 359 SSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANG-STALFDGSQLALDNLSAYQASAASG 417 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR 337 ++ + DG +N S K + + A +++ G Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 338 ACASPNSFY-LVENPHSMYDAF 358 A A+ ++ + DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|7024361|emb|CAB75878.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 519 Score = 44.8 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 157 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 204 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 205 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 263 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 264 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 314 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 315 WMACANKGFFTQ 326 >gi|327330197|gb|EGE71946.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL097PA1] Length = 322 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + + + T + + + G ++ IV Sbjct: 149 MPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|326426493|gb|EGD72063.1| hypothetical protein PTSG_00082 [Salpingoeca sp. ATCC 50818] Length = 571 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 52/135 (38%), Gaps = 18/135 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTF 226 T D++IVLD S SM + + + A+ + +LE + + V VV + Sbjct: 179 TSGPKDIVIVLDRSGSMATN-----NRWETAMDAAETVLETLTIADFVAIVVFDTSASQV 233 Query: 227 SNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L V L+ + + G STN + A++ + +C+T Sbjct: 234 CGTTIPCGSLVQATADNVGTLRTLLANFNPDG-STNFESAFQVAFSVLKQTGERTSNCHT 292 Query: 283 EDANYKKIIVFMTDG 297 I+FMTDG Sbjct: 293 A-------ILFMTDG 300 >gi|302336994|ref|YP_003802200.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634179|gb|ADK79606.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 324 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 61/185 (32%), Gaps = 29/185 (15%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIP 213 M T + + Q + D++ +D+SRSM S + + + I+++ Sbjct: 71 MGFTGTGIDSEQFPEKKDVVFAVDLSRSMLASDVVPSRLERTKVLIRTVL---------- 120 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQI 270 D ++ + GLV F++ + V L + LS TN G+ A Sbjct: 121 DNSSGNRYGLVVFTDLGLVMVPVTEDVQSLVSAVDALSPDLLSSAGTNIAAGISAA---- 176 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 E +++IV +DGE S + + + I + Sbjct: 177 -------GQAFPEGERRQRLIVVFSDGEEHSGDPKEITASL---RRDHHITTSVIALGSA 226 Query: 331 RSHEF 335 Sbjct: 227 DGAAV 231 >gi|291288403|ref|YP_003505219.1| outer membrane adhesin like proteiin [Denitrovibrio acetiphilus DSM 12809] gi|290885563|gb|ADD69263.1| outer membrane adhesin like proteiin [Denitrovibrio acetiphilus DSM 12809] Length = 3226 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 73/232 (31%), Gaps = 31/232 (13%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 ++ + S + N G I+ ++ P+ N + + ++ ++ Sbjct: 2439 ENFNGADSFTYTISDGNGGTDIATVTVNVAPV----------NDAPVAVDDNATFNESTV 2488 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITK-IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++++VLD S SM S + + I L V L+ + Sbjct: 2489 EIGSVNLVLVLDSSGSMGSNGMALAESALKALIGKYGDSLNSVMLVDFDTD--------- 2539 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + +W + + S+N G Y+ N Sbjct: 2540 ARLVYSADGDKWMTGD-----EAQAALDNSSNFRSGGYTDYDDAISAVQDNYSGAPAADN 2594 Query: 287 YKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGA-IVYAIGIRVIRSHE 334 + F++DGE S D + + + + VYA+G+ S Sbjct: 2595 T--YVYFLSDGEPSSYSYAINDTERAEWVSFLEGSDIDQVYAVGVGRGVSQT 2644 >gi|288940556|ref|YP_003442796.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288895928|gb|ADC61764.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 610 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 73/223 (32%), Gaps = 45/223 (20%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + V + A D+ +++DVS SM ++ + VN ++ Sbjct: 18 LIAVQAPLAAPADVRLLIDVSGSMRQNDPRNL---------------RAPALQLVNELIP 62 Query: 221 SGLVT----FSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +G + F+ + E W L ++ + G+ T+ ++ A Sbjct: 63 AGAIAGVWLFAEQTEVLIPPAPVDDAW-KKRLAGRLARIHSRGLFTDIERAIRTATEDWT 121 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY--------CNEAKKRGAIVY 323 ++ ++ TDG +K++ +S K G V+ Sbjct: 122 ----------KTPPEGERHLILFTDGLVDVSKDEAESAASRERILSEQIESLKSEGVKVH 171 Query: 324 AIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKD 364 AIG+ +R A + + ++ ++ F + + Sbjct: 172 AIGLSDQIDEPLMRLLATQTDGWLEVAQDAETLQRLFLRVLEQ 214 >gi|223461391|gb|AAI41092.1| Cacna2d4 protein [Mus musculus] Length = 1091 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A +I +L+ + N+ V + Sbjct: 265 ATSPKDIVILVDISGSMKGL------RMAIAKHTITTILDTL----GENDFVNIIAYNDY 314 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q +Q Sbjct: 315 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVS-QALIEAFEILKQFQESKQGS 373 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 374 LCNQA-----IMLITDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 413 >gi|221131887|ref|XP_002158380.1| PREDICTED: similar to HyTSR1 protein [Hydra magnipapillata] Length = 1620 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 38/203 (18%), Positives = 66/203 (32%), Gaps = 34/203 (16%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 N S Y I + C + V S ++ + VLD S Sbjct: 108 SNSTSSYQCTPDYMISYEPCNIA--------------ACSIVKSCSNINFTFVFVLDTSS 153 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWGV 240 S+++ + + A + VN+ V +V + + K+E Sbjct: 154 SVDAKQWQDEKNL------VLAFVNS--SSFGVNSNVDVAVVNYGSSAKVEADCGTFKSY 205 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-EN 299 S + + L G T GL A + Q C + +I+ +TDG EN Sbjct: 206 STFETFMNNLKLLGGGTAINKGLLAA-------EIAFQRCQKLNLEP--VIILLTDGFEN 256 Query: 300 LSTKEDQQSLYYCNEAKKRGAIV 322 + + ++ N KK +V Sbjct: 257 IDHQTPDSNIANENRIKKEALLV 279 >gi|163754946|ref|ZP_02162067.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] gi|161325013|gb|EDP96341.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] Length = 593 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 49/378 (12%), Positives = 120/378 (31%), Gaps = 53/378 (14%) Query: 14 NYKGGMTILT---AIFLPIIFLVLGMIIEVSHIFFMKTVLHS-------------MIDRS 57 ++ G IL I + I+++ + ++ Sbjct: 52 DFDGNYKILANKGQILVFSYVGFTSQEIKITQPVVNVQLKQGEALECVTVVAYGVAKEKK 111 Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN--MSFRNELRDNGFVN-----DIDDI 110 + A ++ + NR+ K + + + +F ++ + N + + Sbjct: 112 ALAYAVSTVHGSSVENRESYKKINENRFKRTSLAPLSTFSIDVDRAAYSNIRRMINNGEK 171 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNSQ 167 + ++ I + Y A K P+ PW ++R + + + ++ + Sbjct: 172 IPVNAVKIEEMINYFEYDY-AQPTDKHPMVIHTELGQTPWNKDTRLLKIALKGK-EIPLE 229 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + ++DVS SM + K+ + + M+ +++ V VV +G Sbjct: 230 EIPPSNFTFLIDVSGSMFA-----QNKLPLLKSAFTLMVNKMRPEDKVAIVVYAGAAG-- 282 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + E + + L G + G++ AY + ++ N Sbjct: 283 -MVLEPTS-GNNKQKILDALNNLRAGGST-AGGAGIELAYKTATE-NFIKNGNNR----- 333 Query: 288 KKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----EFLRACASP 342 ++ TDG+ N+ + + ++ G + +G + E L + Sbjct: 334 ---VIMATDGDFNVGATSETAMETLIEQKRETGVYLSVLGFGMGNYQDDRLELLADKGNG 390 Query: 343 NSFYLVENPHSMYDAFSH 360 N Y V+ + F Sbjct: 391 NHAY-VDTMQEAHKIFGK 407 >gi|148667191|gb|EDK99607.1| mCG142165 [Mus musculus] Length = 1025 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A +I +L+ + N+ V + Sbjct: 265 ATSPKDIVILVDISGSMKGL------RMAIAKHTITTILDTL----GENDFVNIIAYNDY 314 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q +Q Sbjct: 315 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVS-QALIEAFEILKQFQESKQGS 373 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 374 LCNQA-----IMLITDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 413 >gi|124517718|ref|NP_001028554.2| voltage-dependent calcium channel subunit alpha-2/delta-4 [Mus musculus] gi|162319140|gb|AAI56407.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 4 [synthetic construct] Length = 1144 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A +I +L+ + N+ V + Sbjct: 293 ATSPKDIVILVDISGSMKGL------RMAIAKHTITTILDTL----GENDFVNIIAYNDY 342 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q +Q Sbjct: 343 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVS-QALIEAFEILKQFQESKQGS 401 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 402 LCNQA-----IMLITDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 441 >gi|81871840|sp|Q5RJF7|CA2D4_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-4; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-4; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-4; Flags: Precursor gi|55832800|tpg|DAA05529.1| TPA_exp: putative voltage-gated calcium channel alpha(2)delta-4 subunit [Mus musculus] Length = 1116 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A +I +L+ + N+ V + Sbjct: 265 ATSPKDIVILVDISGSMKGL------RMAIAKHTITTILDTL----GENDFVNIIAYNDY 314 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q +Q Sbjct: 315 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVS-QALIEAFEILKQFQESKQGS 373 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 374 LCNQA-----IMLITDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 413 >gi|50842461|ref|YP_055688.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427106|ref|ZP_06428822.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130538|ref|YP_003581201.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840063|gb|AAT82730.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159575|gb|EFD07763.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375881|gb|ADD99735.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|313764513|gb|EFS35877.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772104|gb|EFS38070.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313792200|gb|EFS40301.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|313801849|gb|EFS43083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807458|gb|EFS45945.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809968|gb|EFS47689.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313813000|gb|EFS50714.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816054|gb|EFS53768.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818503|gb|EFS56217.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820269|gb|EFS57983.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822922|gb|EFS60636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825146|gb|EFS62860.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827717|gb|EFS65431.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830297|gb|EFS68011.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833671|gb|EFS71385.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|313838673|gb|EFS76387.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] gi|314915507|gb|EFS79338.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314918208|gb|EFS82039.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] gi|314920023|gb|EFS83854.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925156|gb|EFS88987.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932037|gb|EFS95868.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955907|gb|EFT00307.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958219|gb|EFT02322.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960060|gb|EFT04162.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962859|gb|EFT06959.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967773|gb|EFT11872.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973302|gb|EFT17398.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975980|gb|EFT20075.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978483|gb|EFT22577.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314984001|gb|EFT28093.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] gi|314989988|gb|EFT34079.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078074|gb|EFT50125.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315080702|gb|EFT52678.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] gi|315084374|gb|EFT56350.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085715|gb|EFT57691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088865|gb|EFT60841.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096217|gb|EFT68193.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098475|gb|EFT70451.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101165|gb|EFT73141.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315108386|gb|EFT80362.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326129|gb|EGE67919.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA2] gi|327331996|gb|EGE73733.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA3] gi|327443198|gb|EGE89852.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445983|gb|EGE92637.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448037|gb|EGE94691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450841|gb|EGE97495.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453082|gb|EGE99736.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453813|gb|EGF00468.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753529|gb|EGF67145.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754260|gb|EGF67876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754489|gb|EGF68105.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760649|gb|EGF74216.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL099PA1] gi|332675378|gb|AEE72194.1| hypothetical protein PAZ_c10190 [Propionibacterium acnes 266] Length = 322 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + + + T + + + G ++ IV Sbjct: 149 MPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|47228304|emb|CAG07699.1| unnamed protein product [Tetraodon nigroviridis] Length = 1434 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 58/171 (33%), Gaps = 31/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM S TKI ++ M+ +K +P + G T + Sbjct: 417 LLFLVDRSGSM------SGTKIQSVKEA---MVIALKSLPPGTKLNIVGFGTTIKPLFTS 467 Query: 234 FLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L V+ LQ ++ + TN L + Y Q +Y + + Sbjct: 468 SKLSTDVTILQACEYLQRMRADMKGTNLLGALSWVYQQ------------PMQRSYPRQV 515 Query: 292 VFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDG N++ + C + +G+ L+ A Sbjct: 516 FIITDGCVSNVAKVLELVRRNACAG------RCFGLGLGPKACRRLLQGVA 560 >gi|332291976|ref|YP_004430585.1| protein of unknown function DUF58 [Krokinobacter diaphorus 4H-3-7-5] gi|332170062|gb|AEE19317.1| protein of unknown function DUF58 [Krokinobacter diaphorus 4H-3-7-5] Length = 288 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 36/157 (22%), Positives = 61/157 (38%), Gaps = 20/157 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V DVS S FF + + + A L + + GL+ F Sbjct: 72 EEERELTMMLVADVSGS--EFFGTDKQFKSEIVTEVAATLAFSAMQNNDK----IGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +++IE F + G SH+ R I+ L +F + T L A + ++ + Sbjct: 126 TDEIELFIPPKKGKSHVLRIIRELLEFKPKSKKTD-LSQAIKYLSNVMKKKAIVFVLSD- 183 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 F+TDG +Q N+ G +Y Sbjct: 184 ------FITDG------YEQTMKIAANKHDITGIRIY 208 >gi|300795696|ref|NP_001178680.1| voltage-dependent calcium channel subunit alpha-2/delta-4 [Rattus norvegicus] Length = 1145 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A ++ +L+ + N+ V + Sbjct: 293 ATSPKDIVILVDMSGSMKGL------RMAIAKHTVTTILDTL----GENDFVNIIAYNDY 342 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q RQ Sbjct: 343 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVS-QALIEAFQILKQFQESRQGS 401 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 402 LCNQA-----IMLVTDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 441 >gi|293347024|ref|XP_001056965.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Rattus norvegicus] Length = 1179 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A ++ +L+ + N+ V + Sbjct: 293 ATSPKDIVILVDMSGSMKGL------RMAIAKHTVTTILDTL----GENDFVNIIAYNDY 342 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q RQ Sbjct: 343 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVS-QALIEAFQILKQFQESRQGS 401 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 402 LCNQA-----IMLVTDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 441 >gi|256111841|ref|ZP_05452806.1| NorD protein [Brucella melitensis bv. 3 str. Ether] gi|265993299|ref|ZP_06105856.1| protein norD [Brucella melitensis bv. 3 str. Ether] gi|262764169|gb|EEZ10201.1| protein norD [Brucella melitensis bv. 3 str. Ether] Length = 633 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S E + +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|218662246|ref|ZP_03518176.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli IE4771] Length = 487 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 70/194 (36%), Gaps = 43/194 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + + + ++ F + + ++ Sbjct: 49 VVFVIDNSGSM------SGPSIEQARQSLALAISRLNPDD------RFNVIRFDDTMTDY 96 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F ++ L+ G + + A QG Sbjct: 97 FKGLVAATPDNREKAIAYVRGLTADGGTE-----MLPALEDALRNQGPVASGAL------ 145 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +VF+TDG ++Q L+ A + A V+ +GI ++PN++++ Sbjct: 146 RQVVFLTDG----AIGNEQQLFQEITANRGDARVFTVGIG-----------SAPNTYFMT 190 Query: 349 ENPHSMYDAFSHIG 362 + F+ IG Sbjct: 191 KAAEIGRGTFTAIG 204 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 68/202 (33%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM K+ +++ +L++++ N ++ FS +++ + Sbjct: 337 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 384 Query: 234 FLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ I L G TN G+ + + + Sbjct: 385 QQNQMVKVSPNNIRDAKKFIYSLYPTGE-TNINEGIHVGAQLL---NNYLASNGKHEKSV 440 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRACAS 341 +++F+TDG E + L A + +++IG L C Sbjct: 441 S-LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGM 499 Query: 342 PNSFYLVENPHSMYDAF-SHIG 362 E+ S F IG Sbjct: 500 MRRIQENEDAASQLKGFYDEIG 521 >gi|109039062|ref|XP_001082066.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Macaca mulatta] Length = 691 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 251 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 298 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 299 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 357 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 358 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 408 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 409 WMACANKGFFTQ 420 >gi|4039081|gb|AAC97485.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 69/179 (38%), Gaps = 30/179 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDV-----NNVVQS 221 +D+ +++D S S+ + + ITK ++ I S++ + + L ++ +++ Sbjct: 42 SVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRL 99 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G I++ L + Y +TN T L + D Sbjct: 100 GS---GQSIDKRQALSKVTELRKTYTPY-----GTTNMTAALDEVQKHLNDR-------- 143 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++++ MTDG ++L N+ K+R + IGI +H+F R A Sbjct: 144 VNREKAIQLVILMTDG---VPNSKYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 199 >gi|4039079|gb|AAC97484.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 69/179 (38%), Gaps = 30/179 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDV-----NNVVQS 221 +D+ +++D S S+ + + ITK ++ I S++ + + L ++ +++ Sbjct: 42 SVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRL 99 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G I++ L + Y +TN T L + D Sbjct: 100 GS---GQSIDKRQALSKVTELRKTYTPY-----GTTNMTAALDEVQKHLNDR-------- 143 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++++ MTDG ++L N+ K+R + IGI +H+F R A Sbjct: 144 VNREKAIQLVILMTDG---VPNSKYRALEVANKLKQRNVSLAVIGIGQGINHQFNRLIA 199 >gi|17989341|ref|NP_541974.1| NorD protein [Brucella melitensis bv. 1 str. 16M] gi|256043169|ref|ZP_05446110.1| NorD protein [Brucella melitensis bv. 1 str. Rev.1] gi|260564388|ref|ZP_05834873.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|265989599|ref|ZP_06102156.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] gi|81850090|sp|Q8YBA4|NORD_BRUME RecName: Full=Protein norD gi|17985210|gb|AAL54238.1| nord protein [Brucella melitensis bv. 1 str. 16M] gi|260152031|gb|EEW87124.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|263000268|gb|EEZ12958.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] Length = 633 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S E + +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVALPAIYR 629 >gi|297671074|ref|XP_002813673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Pongo abelii] Length = 987 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 241 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 288 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 289 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 347 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 348 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 398 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 399 WMACANKGFFTQ 410 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 37/214 (17%), Positives = 73/214 (34%), Gaps = 21/214 (9%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P ++ K ++ +A ++ ++LD S SM D ++K + A K I+ + Sbjct: 17 LTSPAFAAEKKEAEAEAPANVAVLLDASGSMAKRIDG-VSKFNSAKKEISKFASSLPEGT 75 Query: 214 DVNNVV--QSGLVTFSNKIEEFFLLE--WGVSHL--QRKIKYLSKFGVSTNSTPGLKYAY 267 V V G S K++ + +G Q + L+ G + + Sbjct: 76 QVKMSVFGSEGNNKNSGKVQSCESIRNVYGFQSFNEQSFLNSLNAIGPTGWTPIA----- 130 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + + D +K++ +TDGE + E K V IG Sbjct: 131 ---KALNEAKSSFDQLDTKGEKVVYLLTDGE---ETCGGNPIKTAKELHKDNITVNVIGF 184 Query: 328 RVIRSHE-FLRACAS--PNSFYLVENPHSMYDAF 358 ++ L A A ++ + F Sbjct: 185 DYKEGYKGQLNAIAKVGGGEYFPAYTQKDVEKIF 218 >gi|212703143|ref|ZP_03311271.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] gi|212673409|gb|EEB33892.1| hypothetical protein DESPIG_01182 [Desulfovibrio piger ATCC 29098] Length = 512 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 70/188 (37%), Gaps = 25/188 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I+LD S SM I +A+++ A+ + ++ + ++ + + + + Sbjct: 340 VHILLDSSGSMHG------ESIKLAVQACYAVGKALEHLSGISLGI-TSFPAYRDGKIGV 392 Query: 234 FLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 F L V H Q+ + + T L + Q+ ++ R K+++ Sbjct: 393 FPL---VRHGQKMTDRMQMQAHGGTPLAEALWWVMRQMLILRETR-----------KVVL 438 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG Q L +K G VY IG++ L +S E Sbjct: 439 ILTDGVPDDV---TQCLQALEALRKTGVEVYGIGMKFDCISSLLPDTSSRVISRFEELSP 495 Query: 353 SMYDAFSH 360 ++++ H Sbjct: 496 ALFEVLQH 503 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 47/114 (41%), Gaps = 7/114 (6%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + KG + + AI L ++F LG+ ++ H+F ++ L + D + + A + Sbjct: 6 KLIRSEKGMVIVYVAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTRP 65 Query: 70 NGNNRKKLKGGDILCRIKNTW-------NMSFRNELRDNGFVNDIDDIVRSTSL 116 + ++ N + ++ + D G+ N +I++ T+L Sbjct: 66 TDPTQLPTLQWEVARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTL 119 >gi|123498822|ref|XP_001327483.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121910413|gb|EAY15260.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 729 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 66/186 (35%), Gaps = 34/186 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++D S SME ++I A +N ++ + + + ++ F + E Sbjct: 235 FYFIIDCSGSMEE------SRIKNAKFCLNLLIHSLPVG------CRFSIIKFGSMYEVV 282 Query: 234 FLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 V+ +I + T+ LK+ +Q ++Q Sbjct: 283 LPTCDYTDENVAKAMEQINQMDANMEGTDILSPLKFVSDQSTKEGFIKQ----------- 331 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYL 347 + +TDGE++ T + +L N R ++ IGI ++ A S + L Sbjct: 332 -VFLLTDGEDIHT-DQIYALVQANRTNNR---IFTIGIGSGADRNLIKNIARISGGNNAL 386 Query: 348 VENPHS 353 +E+ Sbjct: 387 IEDNDE 392 >gi|328946175|gb|EGG40320.1| peptidoglycan binding domain protein [Streptococcus sanguinis SK1087] Length = 450 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 65/200 (32%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM + ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGAT-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKI--------EEFFLLEWGVS--------HLQRKIKYLS-KFGVSTNSTPGLKYAYNQI 270 +++ +E +L S L++ + + G T TP A NQI Sbjct: 223 DEVFGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEMKEDPSIPTVADTLAALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|289548270|ref|YP_003473258.1| von Willebrand factor type A [Thermocrinis albus DSM 14484] gi|289181887|gb|ADC89131.1| von Willebrand factor type A [Thermocrinis albus DSM 14484] Length = 621 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 81/239 (33%), Gaps = 40/239 (16%) Query: 147 WYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 N + + + + + Q L +V+DVS SM K + + + Sbjct: 411 RRLNLKKLSVEVPTGRGRIYQRRVVPERKELAFKLVIDVSSSMR--------KEEKLVSA 462 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNST 260 + A+L + + + + L FS ++ + + + K + G +TN Sbjct: 463 MKALLLVAETLSSMGMPLSVDL--FSERVMTLKDFDEDYRNFRSKFMQIPSMVGGATNIE 520 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L A++ + +++ +DGE + + + KK Sbjct: 521 LALLKAFDHLSSYCKTTHRRG--------VLILFSDGEPTRGLKGTELKKIIDTMKKE-- 570 Query: 321 IVYA-IGIRVIRSHEFLRACASPNSFYL-----VENPHSMYDAFSHIGKDIVTKRIWYD 373 Y +G+ V +S ++ ++ V + + AFS + ++ + D Sbjct: 571 --YPLVGVGVGQSRNYI------EEYFDRTAIKVSDISRLPSAFSFVLENYFRRLTSVD 621 >gi|66045865|ref|YP_235706.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63256572|gb|AAY37668.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 663 Score = 44.8 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 70/210 (33%), Gaps = 33/210 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D + SM+ + I+ ++I+ + ++ ++ V+ GLV F + + Sbjct: 239 VVFVIDSTISMDPY-------IERTREAISKVYAQIAK-ENLGRQVKFGLVAFRSSTQAV 290 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR-----QHCNTEDANYK 288 LE+ V+ + + LK A Q + D + Sbjct: 291 PGLEY-VTKMYADPNTVKDGADFLAKAADLKQAKVSSKSFNEDSYAGVMQAIDKVDWSPF 349 Query: 289 --KIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 + +V +T DG++ + + EA G +Y + ++ Sbjct: 350 GARYVVLITDAGALDGDDKLSGTGLNAEQVRIEASNPGVAIYTLHLKTAAGAKDHANAEA 409 Query: 333 --HEFLRACASPNSFYLVENPHSMYDAFSH 360 + S Y + + DAF Sbjct: 410 QYQALSTYTGTNTSLYYPVDAGDL-DAFGS 438 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 68/172 (39%), Gaps = 21/172 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEE 232 ++ ++D+S SM S +I +++ +L+++K N ++ V T+ + + Sbjct: 284 VIFIIDISGSM------SGREIQQTREALLKILDDIKEDDHFNFILFGSDVHTWKETLIK 337 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK--- 289 + ++ ++ + G+ TN G+ + + + K+ Sbjct: 338 ATPE--NLDEARKFVRGIDTKGL-TNLYGGMMKGIDMLNAAHE-------GNLVPKRSAS 387 Query: 290 IIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 II+ +TDG+ + Q + +A + +Y +G + FL A Sbjct: 388 IIIMLTDGQPNVGISNTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMA 439 >gi|320537260|ref|ZP_08037220.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145888|gb|EFW37544.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 319 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 72/215 (33%), Gaps = 23/215 (10%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS---SVKVNSQTDARLDMMIVLDV 180 N YS + + +P I + IV+ K+ S + ++ +D+ Sbjct: 31 NAAYSFAQHKKSILPFLIIRNILFAFAWIFIVLAAAGPLWGSKIKSVRRQGVSVVFAVDI 90 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S+SM D +++++A + ++ L+ L + Sbjct: 91 SKSMT-LKDVKPSRLNLAKGFCEFLTVKLSNAS-------CALLAVKGDSVLSVPLTFEH 142 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 L + I+ LS T S L+ A + ++ K I+ TDGE Sbjct: 143 EVLLKAIESLSPSSY-TASGTNLQKALLKAAA-------VFPKNRATAKTIILCTDGE-- 192 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + + L E ++ G + +G I + Sbjct: 193 --QSEGNILEAAKEIQRHGIQLIIVGFGTIEGGDV 225 >gi|284037552|ref|YP_003387482.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283816845|gb|ADB38683.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 320 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 61/168 (36%), Gaps = 23/168 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D +++DVSRSM++ D T+++ I + + + + GL+ + + Sbjct: 79 DTFLLVDVSRSMDA-GDIVPTRLERVKYDIQQLCDTLPAD-------RFGLILAAPQSIL 130 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L L++ I+ + T T + A ++ D Q + Sbjct: 131 LSPLTADHDALKQFIREVHTSISPTGETDLCNAIAMARQKLIDDSSTHQSV--------R 182 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 IV +DGEN S+ E + + G + +G+ + Sbjct: 183 AIVLFSDGENFSSCEQTELARL----RSFGLPLVTVGVGTEAGASIRK 226 >gi|156363275|ref|XP_001625971.1| predicted protein [Nematostella vectensis] gi|156212829|gb|EDO33871.1| predicted protein [Nematostella vectensis] Length = 331 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 32/131 (24%), Positives = 54/131 (41%), Gaps = 16/131 (12%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +LD+ I+LDVS SME + A + ++ +K I GL+TF+N Sbjct: 145 QEQLDIAILLDVSMSMEWG-------LSQAQNFTSLVIGSLKDIISEGGTH-VGLITFAN 196 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD-MQGMRQHCNTEDANY 287 + E L+ S + + +T + LK A F +GMR Sbjct: 197 EAEIVIGLDDIRSRDWTAVIHKVMLSGNTYANKALKLAETTFFTEAKGMRPGSA------ 250 Query: 288 KKIIVFMTDGE 298 K+++ ++DG Sbjct: 251 -KVVITLSDGR 260 >gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli Brasil 5] Length = 784 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 76/191 (39%), Gaps = 37/191 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + + + ++ F + + ++ Sbjct: 356 VVFVIDNSGSM------SGQSIEQARQSLALAISRLSKDD------RFNVIRFDDTMTDY 403 Query: 234 F--LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L+ + ++ I Y+ G+S + + A QG + + Sbjct: 404 FNGLVAASPDNREKAITYVR--GLSADGGTEMLPALEDALRNQGPVASGAL------RQV 455 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 VF+TDG ++Q L+ A + A V+ +GI ++PN++++ + Sbjct: 456 VFLTDG----AIGNEQQLFQEISANRGDARVFTVGIG-----------SAPNTYFMTKAA 500 Query: 352 HSMYDAFSHIG 362 F+ IG Sbjct: 501 EIGRGTFTAIG 511 >gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] Length = 794 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 76/191 (39%), Gaps = 37/191 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM S I+ A +S+ + + + ++ F + + ++ Sbjct: 356 VVFVIDNSGSM------SGQSIEQARQSLALAISRLSKDD------RFNVIRFDDTMTDY 403 Query: 234 F--LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F L+ + ++ I Y+ G+S + + A QG + + Sbjct: 404 FNGLVAASPDNREKAITYVR--GLSADGGTEMLPALEDALRNQGPVASGAL------RQV 455 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 VF+TDG ++Q L+ A + A V+ +GI ++PN++++ + Sbjct: 456 VFLTDG----AIGNEQQLFQEISANRGDARVFTVGIG-----------SAPNTYFMTKAA 500 Query: 352 HSMYDAFSHIG 362 F+ IG Sbjct: 501 EIGRGTFTAIG 511 >gi|217974748|ref|YP_002359499.1| outer membrane adhesin-like protein [Shewanella baltica OS223] gi|217499883|gb|ACK48076.1| outer membrane adhesin like proteiin [Shewanella baltica OS223] Length = 1215 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 70/202 (34%), Gaps = 34/202 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +DM IV+D S SM + ID A ++ +++ GLV+FS + Sbjct: 310 GDIDMQIVMDRSGSMFG------SPIDNAKQAAKILVDA-----TAEGSTAMGLVSFSGR 358 Query: 230 --------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ + GV L+ I + G ST G + A + + Q Sbjct: 359 SSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANG-STALFDGSQLALDNLSAYQASAASG 417 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLR 337 ++ + DG +N S K + + A +++ G Sbjct: 418 APG------VVFVLADGDDNNSIKSESSVITAYQNA---NVPIFSFGYGSASPTGPLVTM 468 Query: 338 ACASPNSFY-LVENPHSMYDAF 358 A A+ ++ + DAF Sbjct: 469 ANATGGKYFSSPTTLAEIIDAF 490 >gi|301756599|ref|XP_002914168.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Ailuropoda melanoleuca] Length = 1127 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 61/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+D S SM+ ++ +A +I+ +L+ + N+ V + + Sbjct: 269 ATSPKDIVIVVDTSGSMKGL------RMTIAKHTISTILDTL----GENDFVNIIAYSDY 318 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV L A+ + Q RQ Sbjct: 319 IHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGV-VNQALTEAFQILKQFQEARQGS 377 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 378 LCNQA-----IMLITDGAVEDYEPVFEKYNWPD----RKVRVFTYLIGR 417 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 38/132 (28%), Gaps = 41/132 (31%) Query: 238 WGVSHLQRKIKYLSKFGVS------------TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S + L G + ++T K A + D+ Sbjct: 107 NNRSGFVSYLNGLVARGGTYHDIGMIWGARFLSTTGLFKSATPETNDVTDPDNPAKIRGF 166 Query: 286 NYKKIIVFMTDGE--------------------NLSTKEDQQSLYY---------CNEAK 316 + KK ++FMTDG+ N S D +L CN AK Sbjct: 167 SVKKYMIFMTDGDMSPTWNDYSAYGIEYLDGRVNGSPTTDNAALLARHLQRFRMACNAAK 226 Query: 317 KRGAIVYAIGIR 328 +G ++ I Sbjct: 227 AKGIDIWVIAFS 238 >gi|218193011|gb|EEC75438.1| hypothetical protein OsI_11969 [Oryza sativa Indica Group] Length = 516 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 29/259 (11%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 ND + + + + G + + + ++ P N + +++ + + Sbjct: 4 NDDEQSAPVANNNGGTTTRPPGITGVPVGQVELTKYHNAEAPLDPNDQEVLLELKGT--S 61 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + A LD++ VLDVS SM K+D ++ ++ ++ + + +V Sbjct: 62 SATSRAALDLIAVLDVSTSMAG------DKLDRMKAALLFVIRKLDDVD------RLSIV 109 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TFSN L + R G++ + ++ + R+ Sbjct: 110 TFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRS 169 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--- 341 N ++ M+DG+ + D L G V+ G+ L+A A Sbjct: 170 VN----VMLMSDGQQN--RADATRLDP------GGVPVHTFGLGADHDPAVLQAIAGKSR 217 Query: 342 PNSFYLVENPHSMYDAFSH 360 F+ V + ++ FS Sbjct: 218 EGMFHYVADGVNLTAPFSQ 236 >gi|150021377|ref|YP_001306731.1| von Willebrand factor, type A [Thermosipho melanesiensis BI429] gi|149793898|gb|ABR31346.1| von Willebrand factor, type A [Thermosipho melanesiensis BI429] Length = 349 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 70/200 (35%), Gaps = 36/200 (18%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++T ++D+MIVLD + SM + I+ S+ ++E ++ + G++ Sbjct: 100 ETETRNQVDIMIVLDTTGSMYNA-------IEGVKNSVQNLIETLQA---SGLDAKVGII 149 Query: 225 TFSNKI---EEFFLLEW----GVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQ 274 F + + + W + + ++ +G + N G+ YA+N Sbjct: 150 PFDDAVPSKDITLTPNWLDLSDAGSAKDFVSNITAYGGADFPENPYAGIMYAFN------ 203 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + A+ +KII+ +TD E + ++ I + Sbjct: 204 -----NASWRASSQKIIILITDASAHYKSETYPGDAEGETLYDKDKVIDTI-----QGFV 253 Query: 335 FLRACASPNSFYLVENPHSM 354 + P +Y + Sbjct: 254 TVHGAFIPGYYYSSTDSEDF 273 >gi|149034209|gb|EDL88979.1| inter alpha-trypsin inhibitor, heavy chain 4, isoform CRA_b [Rattus norvegicus] Length = 706 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 38/280 (13%), Positives = 90/280 (32%), Gaps = 46/280 (16%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 L+ + ++ + I + + + ++ ++ K ++F + S Sbjct: 166 EQLVKHLQMDIYIFEPQGISTLETESTFMTQELANALTTSQNKTKAHIQFKPTLSQQRKS 225 Query: 152 RH-----IVMPITSSVKVN-SQTDARLDMM---------------------IVLDVSRSM 184 ++ + T V+ S T + + V+D S SM Sbjct: 226 QNEQDTVLDGDFTVRYDVDRSSTGGTIQIENGYFVHHFAPEDLPTMAKNVLFVIDKSGSM 285 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEWGVSHL 243 KI +++ +L+++ N +V SG + + + +L Sbjct: 286 AG------KKIQQTREALIKILKDLSTQDQFNIIVFSGEANQWEQLLVQATE-----ENL 334 Query: 244 QRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 R + Y S TN + A + + + +I+ +TDGE Sbjct: 335 NRAVDYASKIPAQGGTNINKAVLSAVELLDKSNQAELLPSKSVS----LIILLTDGEPTV 390 Query: 302 TKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACA 340 + + + + + G ++ +G ++ FL A Sbjct: 391 GETNPKIIQKNTQEAINGRYSLFCLGFGFDVNYPFLEKLA 430 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 35/83 (42%), Gaps = 5/83 (6%) Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--AS 341 + +I++ +TDG N + S N A + I+Y +G+ + S L+ ++ Sbjct: 19 STDASRIMIVVTDGYNN--QSGASSATVINNAIEENVIIYCVGVGSVNS-TVLKNISEST 75 Query: 342 PNSFYLVENPHSMYDAFSHIGKD 364 +Y + + F +I + Sbjct: 76 GGCYYYINQFSQLNGIFENIISE 98 >gi|153940147|ref|YP_001391360.1| hypothetical protein CLI_2102 [Clostridium botulinum F str. Langeland] gi|152936043|gb|ABS41541.1| hypothetical protein CLI_2102 [Clostridium botulinum F str. Langeland] gi|295319388|gb|ADF99765.1| hypothetical protein CBF_2088 [Clostridium botulinum F str. 230613] Length = 647 Score = 44.8 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 45/303 (14%), Positives = 91/303 (30%), Gaps = 56/303 (18%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 ++ + +D + A ++ K +++ W + + V Sbjct: 330 IQNAMQGKVD---LEAYGISQSDQQLFQFYSNKMKLEREQMRQFW-EKLIGDAKKEVSVK 385 Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + +D + + YK L T + Sbjct: 386 KDGQVKGKLDVDSFINFYPDFVEAEKKGNYK-NLPIFNRYLLETQADIFP---------- 434 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 R+++ V+D S SM +KI+ + K++ L I D N ++S Sbjct: 435 ----ERIEISFVIDNSGSMNE------SKIEASRKALAVTLLS---IDDFNKYLKSNAEQ 481 Query: 226 FSNKIEEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKY 265 + K+E W S + R I L +T+ L+ Sbjct: 482 LNQKVEVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLRE 540 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ +Q + KI+ +TDG + ++++ E + VYA Sbjct: 541 ISNRMTSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAF 593 Query: 326 GIR 328 I Sbjct: 594 QIG 596 >gi|269217610|ref|ZP_06161464.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] gi|269212545|gb|EEZ78885.1| putative von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 848 str. F0332] Length = 321 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 78/232 (33%), Gaps = 34/232 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S ++ S ++ +S A D+++ LDVS S+E+F + + E V Sbjct: 70 SSAVIAARPSELRTHSSKLASRDLILCLDVSGSVEAFD----------AQVLKTFAEMVD 119 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--------TNSTPG 262 + LV ++ F L + ++ ++ + K T ++ G Sbjct: 120 QFHGE----RVALVVWNRSSTVVFPLSDDYAMVKEELGRVRKVLQGGRDVDDLYTRTSAG 175 Query: 263 LKYAYNQIFDMQ--GMRQHCNTEDANYKKIIVFMTDGENLST-KEDQQSLYYCNEAKKRG 319 ++ + Q + D + + I+F TD N + + A+KR Sbjct: 176 DRFTGASLIGDGLVSCTQSFDYADKDRSRTILFATD--NWLEGSPAFELKEAASIAQKRK 233 Query: 320 AIVYAIGI-RVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 V + I + + R+ Y A I K++ + Sbjct: 234 IRVLGLLISGYPQGRDEFRSTIQGIGGKVYDARGAR----AGDQIVKEVQAQ 281 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 56/170 (32%), Gaps = 5/170 (2%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G +TI+ AIFL ++ + +++ + K L S D + + A ++N Sbjct: 14 RGAVTIMFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGAGSLVNSSLSTYDW 73 Query: 76 KLK--GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA-- 131 G +L + + + G+ N D + + V + Sbjct: 74 DAAEQKGLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVPVTGDVPAVRATVAL 133 Query: 132 -ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 ++ PLK N +I +++ + + DV Sbjct: 134 TSTQNNGPLKLFFAPFLGINESNIPASAIAAIYPPAGGVGMFPFTLGKDV 183 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 95/278 (34%), Gaps = 47/278 (16%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 +N+ D+ + + + + Y++ + F + ++ I + ++ Sbjct: 49 INNDDEPIDQNFIKLGDGKNKQKYNLEKGLSLDVKTLHKHFQFSSSTNQSIPVMVSVKTL 108 Query: 164 VNSQTDAR----------------LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 ++ + LD++ V+D S SM S KI K++ +LE Sbjct: 109 DKTEDAPKGDQEAVKQESLENRPNLDLICVIDNSGSM------SGEKIQNVKKTLEYLLE 162 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 + + L+ F++ L + + I + G T+ G+ Sbjct: 163 LLGDND------RLCLILFNSYATRLCHLMKTNNSNKPAFKEIINKIYSTG-GTDINSGM 215 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIV 322 + A+ + R++ N + + ++DG+ D + + + Sbjct: 216 ELAFRVL----KDRKYQNPVSSVF-----LLSDGQ--DGSADLRVRQSLERHLPQECFTI 264 Query: 323 YAIGIRVIRSHEFL-RACA-SPNSFYLVENPHSMYDAF 358 ++ G + + C+ +FY VE + + + F Sbjct: 265 HSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQVDEFF 302 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 79/242 (32%), Gaps = 42/242 (17%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVN-SQTDARLDMMIVLDVSRSMESFFDSSITK------ 194 T P T R + + ++++ + + R + + VLD S SM S + Sbjct: 375 LTPRPRMTPERTLDATVLTALQGSFAGVHQRGNTLAVLDTSGSMNEEVPGSAGRSRLSVA 434 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 +D A +I E+ L GL FS ++ +W L + G Sbjct: 435 LDAAKSAIPLFAEDSDL----------GLWQFSTRLRGDQ--DW--EELVPLGPMGERLG 480 Query: 255 VSTNSTPGLKYA-----------YNQIFDM-QGMRQHCNTEDANYKKIIVFMTDGENLST 302 T S + Y+ + M QH N +V +TDG+N Sbjct: 481 AGTRSQAVMDAVNRIEPRGDTGLYDTALAAFRYMNQHYVPGRPNQ---VVLLTDGKNSDP 537 Query: 303 KE----DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA--CASPNSFYLVENPHSMYD 356 + + + +R V IG L A+ Y +P+++++ Sbjct: 538 GSIALDELVRILRREYSPQRPVQVITIGYGADTDLAALSRISAATGAETYPALDPNTIFE 597 Query: 357 AF 358 Sbjct: 598 VL 599 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 7/75 (9%), Positives = 24/75 (32%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 + G + ++ + P++ L ++ ++ + ++ D + V A I Sbjct: 13 LRDRSGNVAVMAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAVTAVANISRANAA 72 Query: 72 NNRKKLKGGDILCRI 86 ++ Sbjct: 73 AALTMADNRQGNVQL 87 >gi|330995096|ref|ZP_08319013.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332879554|ref|ZP_08447249.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|329576672|gb|EGG58175.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|332682520|gb|EGJ55422.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 289 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 47/102 (46%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+V+DVS S+ S + +A + ++ + G++ F Sbjct: 72 EEERELTVMLVIDVSGSLSFGTVSQTKREMVAEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE+F + G H+ R I+ L + ST+ ++Y Sbjct: 126 SDRIEKFIPPKKGRKHILRIIRELLDFTPQSKSTDIGQAIEY 167 >gi|167386410|ref|XP_001737745.1| hypothetical protein [Entamoeba dispar SAW760] gi|165899349|gb|EDR25974.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 209 Score = 44.8 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 57/176 (32%), Gaps = 21/176 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ + D + SM S+ ++ I+ I +I I V+ LV + + Sbjct: 4 LDLVFLCDTTGSMGSYLRAAQESIEKIISTI---------IQSEKCDVRFALVEYRDHPP 54 Query: 232 E----FFLLEWGV---SHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCN 281 + F L ++ + ++ G G+ A N + + Sbjct: 55 QDNSFAFRLTDFTDSMKAIKEAVNRMNAKGGGDEPESVCCGMHCASNLKYRESAAKVIVW 114 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 DA I DG L E K+ ++Y +G +LR Sbjct: 115 IGDAPPHGFI-REGDGFPKGCPCGYDLLKEVRECMKKDIVIYCVG-AEPLGSRYLR 168 >gi|313238221|emb|CBY13313.1| unnamed protein product [Oikopleura dioica] Length = 759 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 52/126 (41%), Gaps = 15/126 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ V+D S SM D + I + E L ++V+ G++T+SN Sbjct: 522 PLDILFVVDRSTSMR----GENQFFDASKAWIENFIGEFDL-----SLVKIGVITYSNNA 572 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L+ L++ K + + L A N + + R ED++ + I Sbjct: 573 TLDIPLKN--HLLEKITKRMDGIELDEKVGSDLSDAANLVRKVVSRRD----EDSDAESI 626 Query: 291 IVFMTD 296 ++ ++D Sbjct: 627 VIILSD 632 >gi|217976666|ref|YP_002360813.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217502042|gb|ACK49451.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 346 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 57/163 (34%), Gaps = 25/163 (15%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI------TKIDMAIKSINAMLE 207 + P V D + +++D S SM F +K + A + + + Sbjct: 65 LTGPRRGGELVEKIGDGA-ETALLIDRSGSMNETFAGRPPEGGEESKANAARRVLQDYVR 123 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYA 266 + G+V FS L + + + + + G+ T+ GL A Sbjct: 124 R-------RQHDRFGVVGFSTSPIRMLPLTDNMDAVLGAVAAIDRPGLDYTDIGRGLSTA 176 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + + + G ++++ ++DG + + QQ+L Sbjct: 177 LSLLSEDAGQS----------PRVLLLVSDGAAVIDPKVQQTL 209 >gi|212704357|ref|ZP_03312485.1| hypothetical protein DESPIG_02412 [Desulfovibrio piger ATCC 29098] gi|212672216|gb|EEB32699.1| hypothetical protein DESPIG_02412 [Desulfovibrio piger ATCC 29098] Length = 331 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 54/174 (31%), Gaps = 24/174 (13%) Query: 173 DMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 V+D S SM D KI++A ++ + + + + + + + Sbjct: 33 SFDFVVDYSGSMMMKNDKMKQDKIEVAKIALKRVNAAIPALDFKGGLH---TIAPNGTVI 89 Query: 232 EFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E W + + I L FG TN GL+ Y Sbjct: 90 EQGP--WNRAAMDSGINTLRSGFATFGRMTNMGDGLQT-YEPFLSSMERSA--------- 137 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG+N + + + V ++ + ++A A Sbjct: 138 --ALILVTDGDNNRGMDLVEVARQVYATQRNMVIHVISLA-DTPQGEATVKAIA 188 >gi|222150614|ref|YP_002559767.1| hypothetical protein MCCL_0364 [Macrococcus caseolyticus JCSC5402] gi|222119736|dbj|BAH17071.1| hypothetical protein [Macrococcus caseolyticus JCSC5402] Length = 1102 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 63/171 (36%), Gaps = 23/171 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-V 219 + S T +D+++V+D S SM + + T+ + ++EV N + Sbjct: 494 TQGRASVTQETVDIVLVVDNSASMGTIIGNGKTRWQSMKDDVFQFIDEVTQANTAANTKI 553 Query: 220 QSGLVTFSN----KIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQ 274 + +V F++ +I F + V + Y+ G T + GL N++ + Sbjct: 554 RIDVVNFASALRPEINSGFSGDPAVIKSKFYPSYVPSGQGGGTFTQQGLISGSNKLDTSR 613 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRG 319 KK++V +TDG S + + N K G Sbjct: 614 AS-----------KKVMVVLTDGAPTLSYKGVSATGSESITSFSNTIKGNG 653 >gi|239617869|ref|YP_002941191.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] gi|239506700|gb|ACR80187.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] Length = 1706 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 48/339 (14%), Positives = 113/339 (33%), Gaps = 49/339 (14%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM--SFRN 96 + H+F + T L+S +D T I + + N + L+ + + R Sbjct: 166 DTVHVFEISTKLYSFVDPD-----TPIYSRSSKVNVEGLRENFSQDFMVTFLSHLGEIRY 220 Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 +L + + I +++I P+ + A + P+ + Sbjct: 221 QLLNRVYSERTGKIE-FLNINIEDYPEIK-ILFRAYTDINKPISEEVLLHSDAYILEPSG 278 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I + + L+ ++ +D S SM+ ++ A + + L L+P+ + Sbjct: 279 RIDLQSLEALKKEPSLNFVLEVDRSGSMKPV-------MEKAKDAASYFL---DLLPENS 328 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 L+ F +IE L+R + + G + Y+ + + Sbjct: 329 E---LALIAFDTEIEVLKNFTRDREQLKRALAIIKARGATPL--------YDTVAKGIEL 377 Query: 277 RQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VI 330 + + ++ +TDG + + ++ A++ +++AIG+ I Sbjct: 378 LSERSGP-----RFLILVTDGVDANYGDTAPGSEKTLSEVIRLARENNVVIFAIGLGTRI 432 Query: 331 RSHEFLRACASPNSFYLVENP--------HSMYDAFSHI 361 S +L +S+ + F +I Sbjct: 433 DEFSLGTLARSTGGMFLKSPTIDNLKTAFNSLLETFENI 471 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 68/202 (33%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM K+ +++ +L++++ N ++ FS +++ + Sbjct: 369 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 416 Query: 234 FLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ I L G TN G+ + + + Sbjct: 417 QQNQMVKVSPNNIRDAKKFIYSLYPTGE-TNINEGIHVGAQLL---NNYLASNGKHEKSV 472 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRACAS 341 +++F+TDG E + L A + +++IG L C Sbjct: 473 S-LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGM 531 Query: 342 PNSFYLVENPHSMYDAF-SHIG 362 E+ S F IG Sbjct: 532 MRRIQENEDAASQLKGFYDEIG 553 >gi|290993444|ref|XP_002679343.1| hypothetical protein NAEGRDRAFT_79080 [Naegleria gruberi] gi|284092959|gb|EFC46599.1| hypothetical protein NAEGRDRAFT_79080 [Naegleria gruberi] Length = 2704 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 54/152 (35%), Gaps = 17/152 (11%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + S Q LD++ V+D S S+ + + + ++ Sbjct: 2129 TSASAVVSVQYTKCQCGLNLDIIFVIDGSGSITA---DNFVNMKNFASNLTTGFGSNISP 2185 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 N G++ F++ I L+ G S + IK +++ TNS G+ A N + Sbjct: 2186 TGTN----IGIIQFASTITTHLDLKSGTSQSSVLTAIKNMAQIQSGTNSISGINTAVNMM 2241 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 +K+++ +TDG + Sbjct: 2242 LASGRKSIS--------QKLLIHITDGLSNDP 2265 >gi|170755302|ref|YP_001781603.1| hypothetical protein CLD_2585 [Clostridium botulinum B1 str. Okra] gi|169120514|gb|ACA44350.1| hypothetical protein CLD_2585 [Clostridium botulinum B1 str. Okra] Length = 647 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 64/178 (35%), Gaps = 37/178 (20%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 R+++ V+D S SM +KI+ + K++ L I D N ++S + K+ Sbjct: 436 RIEISFVIDNSGSMSE------SKIEASRKALAVTLLS---IDDFNKYLKSNAEQLNQKV 486 Query: 231 EEFFLLEWG--------------------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 E W S + R I L +T+ L+ N++ Sbjct: 487 EVLSE-TWFFGSKYYNVKEFNDKNVKEKEKSDIIRSIVKLDATDGATDDASCLREISNRM 545 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +Q + KI+ +TDG + ++++ E + VYA I Sbjct: 546 TSIQESELKKGKQ----IKIVFEITDGASSFPGSAKEAVQ---ELLSKNVEVYAFQIG 596 >gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis HAW-EB4] gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis HAW-EB4] Length = 850 Score = 44.8 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 36/223 (16%), Positives = 73/223 (32%), Gaps = 42/223 (18%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 K T +VM + + + ++++V+D S SM S I A Sbjct: 423 KQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLVIDTSGSM------SGDAIIQAK 476 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF-----LLEWGVSHLQRKIKYLSKFG 254 ++ L ++ N ++ F++ +E + + Q I L G Sbjct: 477 SALKYALAGLRPQDSFN------VLQFNSTVERWSRHVMPATAINLGRAQNYINGLQADG 530 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-----------------IVFMTDG 297 + S L A + + R H + + + ++F+TDG Sbjct: 531 GTEMS-LALDAA---LTKLDNDRGHNSKPVHDDDRYQSSNETLEQSAATPLRQVLFITDG 586 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 E + N+ + ++ IGI + F++ A Sbjct: 587 --AVANESRLFEQIKNQLGES--RLFTIGIGSAPNAHFMQRAA 625 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 4/101 (3%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 IL I L +F + G++I++ + + S+ D + + Q+ G ++ Sbjct: 2 ILVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQL-AMGADQTTTRIAVV 60 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 D + +W R EL D +ND D +S I Sbjct: 61 DRAAEMSESWGPH-RIELEDGYDLND--DGKPESSQTINRD 98 >gi|198473081|ref|XP_001356166.2| GA18279 [Drosophila pseudoobscura pseudoobscura] gi|198139291|gb|EAL33226.2| GA18279 [Drosophila pseudoobscura pseudoobscura] Length = 1138 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 70/188 (37%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 151 ASSPKDIMILLDASSSMSE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKAP 204 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 205 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGSGSQC 261 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I S L A Sbjct: 262 NQA-----IMLITESTSESHKDIIKQYNWPHMP----VRIFTYLIGSDSSSRSNLHDMAC 312 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 313 SNKGFFVQ 320 >gi|195175237|ref|XP_002028364.1| GL15442 [Drosophila persimilis] gi|194117953|gb|EDW39996.1| GL15442 [Drosophila persimilis] Length = 1149 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 70/188 (37%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S SM D+ + + +L+ + VN VV++ Sbjct: 151 ASSPKDIMILLDASSSMSE------KSFDLGMATAFNILDTLGEDDFVNLITFSEVVKAP 204 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ + ++ +K + + N T GL+YA++ + Sbjct: 205 VPCFKDRMVRATP--DNIQEIKSAVKAIK-LQDTANFTAGLEYAFSLLHKYNQSGSGSQC 261 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACAS 341 A I+ +T+ + S K+ + + + ++ I S L A Sbjct: 262 NQA-----IMLITESTSESHKDIIKQYNWPHMP----VRIFTYLIGSDSSSRSNLHDMAC 312 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 313 SNKGFFVQ 320 >gi|145547190|ref|XP_001459277.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427101|emb|CAK91880.1| unnamed protein product [Paramecium tetraurelia] Length = 603 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 78/213 (36%), Gaps = 33/213 (15%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+ + + N+ +D++ V+DVS SM KI++ S+ +++ + Sbjct: 103 VLSLKTKKVSNNLDRPPIDLVCVVDVSGSM------IGRKINLVKDSLRYLMKILGPED- 155 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + ++ F+ L++ I L STN + G+ A + Sbjct: 156 -----RICIIVFTTVAHIVTSFIRNTQENKPLLKKAILELKGL-ASTNISDGMNKA---L 206 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-KRGAIVYAIGIR- 328 + ++ + N I ++DG++ +Q+ K + +++ G Sbjct: 207 WMLKNRKY------KNPVSCIFLLSDGQDDYKGAEQRVFDQLQLLKIEEKFVIHTFGYGQ 260 Query: 329 ---VIRSHEFLRACASPNSFYLVENPHSMYDAF 358 ++ + +FY ++N + D F Sbjct: 261 DHDAYVMNQIAKY--REGNFYYIDNINKASDYF 291 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 44.8 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 68/202 (33%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM K+ +++ +L++++ N ++ FS +++ + Sbjct: 318 VVFVIDTSASMLGN------KMKQTKEALFTILKDLRPQDHFN------IIGFSKRVKVW 365 Query: 234 FLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + ++ I L G TN G+ + + + Sbjct: 366 QQNQMVKVSPNNIRDAKKFIYSLYPTGE-TNINEGIHVGAQLL---NNYLASNGKHEKSV 421 Query: 288 KKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-----LRACAS 341 +++F+TDG E + L A + +++IG L C Sbjct: 422 S-LMIFLTDGRATIGEIESPKILGNTKNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGM 480 Query: 342 PNSFYLVENPHSMYDAF-SHIG 362 E+ S F IG Sbjct: 481 MRRIQENEDAASQLKGFYDEIG 502 >gi|150005796|ref|YP_001300540.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294776175|ref|ZP_06741664.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|149934220|gb|ABR40918.1| aerotolerance-related membrane protein [Bacteroides vulgatus ATCC 8482] gi|294449998|gb|EFG18509.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] Length = 340 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + + ++ +I LD+S SM + D + +++D + K I+ +++ + Sbjct: 79 GSKMETVKRSGVEAVIALDISNSMLAE-DVTPSRLDKSKKLISRLVDTFNND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + ++ ++ T+ ++ A +G+ Sbjct: 131 VGLIVFAGDAFTQLPITSDYVSAKMFLETINPSLITTQGTDIGAAIRLAMKSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + + G V+ +G+ Sbjct: 191 -----------RAIIVITDGENHEGGAVEAAQEAAEK----GMQVFVLGVGSPDG 230 >gi|260812900|ref|XP_002601158.1| hypothetical protein BRAFLDRAFT_143984 [Branchiostoma floridae] gi|229286449|gb|EEN57170.1| hypothetical protein BRAFLDRAFT_143984 [Branchiostoma floridae] Length = 581 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 58/181 (32%), Gaps = 29/181 (16%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 SS+ + + + +LD S SM S I A +++ L+ + V N+V Sbjct: 341 SSIPPRDPSKIPGNFIFILDRSGSM------SGANIAGARETLLLFLKSLPTC-CVFNIV 393 Query: 220 QSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G ++ V IK + TN L++ ++ RQ Sbjct: 394 SFG-SSYKPMFSTSVPYTQQNVDKASADIKKMRADMGGTNILSPLQWVFSAPVTSGYPRQ 452 Query: 279 HCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +TDG N T D N +A+GI S + Sbjct: 453 ------------VFLLTDGSVSNTGTVIDLVRKNAYNT------RCFALGIGPKASRSLV 494 Query: 337 R 337 + Sbjct: 495 Q 495 >gi|156096226|ref|XP_001614147.1| sporozoite surface protein 2 [Plasmodium vivax SaI-1] gi|148803021|gb|EDL44420.1| sporozoite surface protein 2 [Plasmodium vivax] Length = 556 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 69/179 (38%), Gaps = 30/179 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDV-----NNVVQS 221 +D+ +++D S S+ + + ITK ++ I S++ + + L ++ +++ Sbjct: 42 SVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRL 99 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G I++ L + Y +TN T L + D Sbjct: 100 GS---GQSIDKRQALSKVTELRKTYTPY-----GTTNMTAALDEVQKHLNDR-------- 143 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++++ MTDG ++L N+ K+R + IG+ +H+F R A Sbjct: 144 VNREKAIQLVILMTDG---VPNSKYRALEVANKLKQRNVSLAVIGVGQGINHQFNRLIA 199 >gi|118081932|ref|XP_414993.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 360 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 53/143 (37%), Gaps = 16/143 (11%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ + + K L V+ + I+ +S G S L +A Sbjct: 15 IRMAVALYGEKPRMSIELTDYVTIEEILVAIQEISIKGSSLKVGSALAFA-------AHA 67 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 T N K++V +T G++ ED+ + G V+A+GI+ HE Sbjct: 68 MSQPATLRDNAAKVVVLITSGKSSDLVEDK-----AQVLQDAGVTVFAVGIKDADKHELN 122 Query: 337 RACASP--NSFYLVENPHSMYDA 357 + + P V++ H ++ A Sbjct: 123 KIASEPTAEHVIYVDDFHLLHSA 145 >gi|224367946|ref|YP_002602109.1| predicted serine/threonine protein kinase PpkA [Desulfobacterium autotrophicum HRM2] gi|223690662|gb|ACN13945.1| predicted serine/threonine protein kinase PpkA [Desulfobacterium autotrophicum HRM2] Length = 727 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 81/235 (34%), Gaps = 43/235 (18%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V ++D++ V D + SM + I + I + +++ P + Sbjct: 236 AGVGAEVFKAMKMDVVFVSDTTVSMRPY-------IQATLDVIRQVAQDLGSDPATAQSL 288 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHL-------QRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G+ + + +++ + + + ++ LSK V+ + YA +F Sbjct: 289 RFGVWGYRDPVDKIPGIGYTTFNYTPALLPANDFVQALSKVDVTQVDSD--DYA-EDVFS 345 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE--------NLSTKEDQQSLYYCNEAKKRGAIVYA 324 +IIV + D N S + Q ++A +YA Sbjct: 346 GINDAISQTAWTQGAIRIIVLLGDAPGHEPGHKYNASGQSAQTLRAISDDA---SIYLYA 402 Query: 325 IGIRVIRSH--------EFLRACASPNS-----FYLVENPHSMYDAFSHIGKDIV 366 + ++ R+ +F++ A+P ++ + + F I DI Sbjct: 403 MHVKAPRAKKYHESAEMQFMKLSANPGMQGGNAYFSM--SSENIEGFKRIASDIT 455 >gi|21756834|dbj|BAC04967.1| unnamed protein product [Homo sapiens] Length = 231 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 49/111 (44%), Gaps = 10/111 (9%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q L+ +SN ++ F + ++ + + + T + G+ Q+F Sbjct: 50 QFALMQYSNLLKIHFTFTQFRTSPSQQSLVDPIVQLKGLTFTATGILTVVTQLF------ 103 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 H N + KKI++ +TDG+ + + +A+K G I YAIG+R Sbjct: 104 HHKNGARKSAKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVR 152 >gi|89896615|ref|YP_520102.1| hypothetical protein DSY3869 [Desulfitobacterium hafniense Y51] gi|89336063|dbj|BAE85658.1| hypothetical protein [Desulfitobacterium hafniense Y51] Length = 641 Score = 44.4 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 55/172 (31%), Gaps = 21/172 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + ++ V+D S SM + ++ +I ++L Sbjct: 438 LAVRSEDLREKVREKRTGATLLFVVDASGSMGAKR-----RMGAVKGAILSLLN-----D 487 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 G++ F E L L +K G T GL AY + Sbjct: 488 AYQKRDSIGMIAFRKDGAEVLLNITRSVDLAQKCLETLPTGGKTPLAAGLAKAYELLKVD 547 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENL----STKEDQQSLYYCNEAKKRGAI 321 + ++ + I+ ++DG+ S + + +L + + G Sbjct: 548 R-------IKNPEALQYIILVSDGKANLPLFSDQALKDALIVGKKIRHEGIR 592 >gi|256425856|ref|YP_003126509.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256040764|gb|ACU64308.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 588 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 85/240 (35%), Gaps = 34/240 (14%) Query: 135 YKIPLKFCTFI----------PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 Y PL PW + R + + + VN + +++ ++DVS SM Sbjct: 196 YSYPLPPVGQTLAIYSNYATCPWAEDHRLLQIAVRGK-SVNLDSLPPSNLVFLIDVSGSM 254 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ 244 + K+ + + ++ ++ V V +G+ + S + Sbjct: 255 -----AMPNKLPLLQAAFRILVNNLRSNDHVAIVAYAGVPG----VILPSTPGSAKSKIL 305 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 I YLS G + +K AY QI + +++ N ++ TDG+ N+ Sbjct: 306 NAIDYLSAGGAT-AGEAAIKLAY-QIAEENFIKEGNNR--------VILATDGDFNVGQT 355 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACAS--PNSFYLVENPHSMYDAFSH 360 D K+ G ++ +G + + L +S +F ++N F+ Sbjct: 356 SDHDMEQLILGKKETGVLLTCLGFGMKNYKDSKLETLSSKGNGNFAYIDNLEEASKIFAR 415 >gi|269128710|ref|YP_003302080.1| Vault protein inter-alpha-trypsin domain-containing protein [Thermomonospora curvata DSM 43183] gi|268313668|gb|ACZ00043.1| Vault protein inter-alpha-trypsin domain protein [Thermomonospora curvata DSM 43183] Length = 795 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 82/229 (35%), Gaps = 43/229 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + + S D++I+LD S SM + K + A Sbjct: 279 GESGTFTLTVLPPS----ERCAPRPRDVVILLDRSGSMHGW------------KMVAARR 322 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS--------HLQRKIKYLSKFGVSTN 258 +++ + + +++F + +E L+ G+S + L G + Sbjct: 323 AAARIVDTLTGRDRFAVLSFDDMVERPAGLDGGLSPATDRNRFRAVEHLAGLQARGGTEL 382 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + P + A +DA +++V +TDG+ ++ L + Sbjct: 383 AAPLREGA-------------ALLDDAGRDRVLVLITDGQ----VGNEDQLLALIDPFLN 425 Query: 319 GAIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDI 365 G ++A+GI + FL A+ LVE+ + +A HI I Sbjct: 426 GLRIHAVGIDQAVNAGFLGRLATAGQGRLELVESEDRLDEAMEHIHHRI 474 >gi|146162207|ref|XP_001008983.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146146475|gb|EAR88738.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 790 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 66/185 (35%), Gaps = 35/185 (18%) Query: 136 KIPLKFCTFIPWYTNSR-HIVMPITSSV--KVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 KI L+ +++P + K D+MIVLD + SM+S+ Sbjct: 2 KIQLQMFQDNLSEIYEDFQVILPYEQKINQKKKVSNLTSCDLMIVLDCTGSMQSW----- 56 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-----SNKIEEFFLLEWGVSHLQRKI 247 I ++I +++ VK + ++ G+V + + +IE + + +L++ I Sbjct: 57 --ISACKQNILKIIQNVKQQFQGSE-LRVGIVAYRDHCDTQRIEMYPFINQDYENLEKFI 113 Query: 248 KYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD----GENL 300 L G + L+ A N K ++ +TD G+ Sbjct: 114 SSLKATGGGDEPEDLAGALETAINLFEWKSQA------------KTMIIITDAPCHGDTY 161 Query: 301 STKED 305 D Sbjct: 162 HNSND 166 >gi|1915956|emb|CAA72309.1| inter-alpha-inhibitor heavy-chain 1 [Sus scrofa] Length = 779 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 37/192 (19%), Positives = 68/192 (35%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SME K+ +++ +L ++K ++V G S + Sbjct: 162 VVFVIDISSSMEG------QKVKQTKEALLKILSDLKPGD-YFDLVLFGSAVQSWRGSLV 214 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + ++ S STN GL + QG E +N I++ Sbjct: 215 QASTANLDAARSYVRQFS-LAGSTNLNGGLLRGIEILNKAQGSL----PEFSNRASILIM 269 Query: 294 MTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDGE D+ + + + R +Y +G FL A N+ Y Sbjct: 270 LTDGEPTEGVTDRSQILKNVRDAIRGRF-PLYNLGFGHDVDWNFLEVMALENNGRAQRIY 328 Query: 347 LVENPHSMYDAF 358 + F Sbjct: 329 EDHDAAQQLQGF 340 >gi|291242941|ref|XP_002741338.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 788 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 76/202 (37%), Gaps = 38/202 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SM S ++I +++ + + +++ + G VTFS ++ Sbjct: 298 IVLVLDVSGSM-----SLKSRIIKLQQAVYTFI-----MDEISLGIDVGCVTFS---DKA 344 Query: 234 FLLEW----GVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ W + + L G +T L + Q Q + Sbjct: 345 NIISWLMPINSDEDREEFLALVMPTLNTGGNTAIGSALIAGVQVL--SQNDTQPADGG-- 400 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---P 342 I+ +TDG+ + ++ + + + G IV + + + L AS Sbjct: 401 ----ILFLVTDGQENVPQFIEEVI---DNVIESGVIVDTLAWGLFAEEK-LETIASGTKG 452 Query: 343 NSFYLVENPHS--MYDAFSHIG 362 +S+Y E S +AF + Sbjct: 453 SSYYYSEQAQSNAHVEAFMEVA 474 >gi|182439279|ref|YP_001826998.1| hypothetical protein SGR_5486 [Streptomyces griseus subsp. griseus NBRC 13350] gi|326779931|ref|ZP_08239196.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|178467795|dbj|BAG22315.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] gi|326660264|gb|EGE45110.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 249 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 55/157 (35%), Gaps = 21/157 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S++ + R + +V+D S SM +++ + + + + D V Sbjct: 36 SLEKHGLGGHRAAVYLVVDHSGSMRPYYNDG--SVQALADRVLGLSSHLD--DDGTVPV- 90 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQ-RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V FS ++ + G H + +I+ T+ + + D Sbjct: 91 ---VFFSTDVDAVTDIALGNHHGRIDEIRAGLGHMGKTSYHLAMDAVIDHYLDSG----- 142 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 + ++VF TDG ++ ++ Y C AK Sbjct: 143 -----STAPALVVFQTDGGPINKLAAER--YLCKAAK 172 >gi|153830633|ref|ZP_01983300.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] gi|148873874|gb|EDL72009.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] Length = 1426 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 57/180 (31%), Gaps = 33/180 (18%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSS--------------------ITKIDMAI 199 +K + + ++ +++D S SM + +++ + I Sbjct: 807 GGIKTSVEPGKNYNIALIVDTSGSMRYDLAGNQNATYDGWNDSWSQTPEQYAASRMKLTI 866 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGV 255 ++ + ++ D + V L+ F+ + +S L KI L G Sbjct: 867 DALKVLATQLA---DHDGTVNITLIGFNGTAADALPFNNLSAANLSDLIGKINLLIADG- 922 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 TN A G E+ Y F+TDG+ D + + + A Sbjct: 923 GTNYEDAFIEATKWFNTQPGSETDLKFENLTY-----FLTDGDPTVHNGDSSTGFTTDYA 977 >gi|291569126|dbj|BAI91398.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 396 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 72/227 (31%), Gaps = 22/227 (9%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + N + P+ + + + I S T +++++D Sbjct: 19 INNDNVTLRLQVTDGRDRPVIQLQKSDFQVITDDEPVGIKSWKSPQESTPPPAWIVVLVD 78 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS------------ 227 +S SM S +ID A+ + LE++ D + +V F Sbjct: 79 LSGSMNELDTSGKRRIDGALDATRRFLEQIS---DRGGDTKVAIVPFGEGGRNCPGFEVT 135 Query: 228 -NKIEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHC--NT 282 I F + + YL+ +T+ L A + + Q R + ++ Sbjct: 136 QRGINSKFFPANDIKQTN-FLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDS 194 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIR 328 + ++ ++DG + E Q + IV+ +G Sbjct: 195 GRLEPRLSVILLSDGFHNQPNEQQDFDNLITLLERNNNIIVHTLGYG 241 >gi|269954956|ref|YP_003324745.1| LPXTG-motif cell wall anchor domain-containing protein [Xylanimonas cellulosilytica DSM 15894] gi|269303637|gb|ACZ29187.1| LPXTG-motif cell wall anchor domain protein [Xylanimonas cellulosilytica DSM 15894] Length = 2724 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 40/232 (17%), Positives = 68/232 (29%), Gaps = 44/232 (18%) Query: 154 IVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSI----------TKIDMAIKSI 202 + + S + T+ +D+++VLD S SM +S ++ ++ Sbjct: 137 VALSALGSTRHVMSTEPVPIDVVLVLDNSYSMTQCVESDAFCRGPNVWQDSRAAAMADAV 196 Query: 203 NAMLEEVKLIPDVNNV----------VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 N+ ++ + N V V +GLV N W + Sbjct: 197 NSAIQIIHEDDPDNRVAIVLFGTNSEVVTGLV---NPQPTGSRYVWLTQSGNSNTNGVLT 253 Query: 253 F--GVSTNS--TPGLKYAYNQI---------FDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 G T + T G Y I + + +I+F TDGE Sbjct: 254 LHVGNQTRTIGTSGSDTQYTNIQRGLTTGLGLLASQAKAAVTGSEQRVPNLILF-TDGEP 312 Query: 300 LSTKEDQQSLYYCNEA----KKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + Q A G+ Y G S L+ A + Y Sbjct: 313 TRSSTSQSWWNPSGSAVQGPSSTGSAFYGNGFLAALSASLLK--AKVEAHYN 362 >gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 733 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 53/337 (15%), Positives = 107/337 (31%), Gaps = 59/337 (17%) Query: 47 KTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 ++ ++I+ S AT + + + D+L R L V Sbjct: 222 RSHERTLINDSGWAPATNQVADAPEITPPTVAKEDLLTPSHQA---RIRVHLNPGLPVES 278 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I+ + I Q GY +S S +P+ + W V Sbjct: 279 IESVTHR----IQWTQQTNGYEVSLESNKDVPMDKDFTLTWRVRQGSEPEAALFKEIVGD 334 Query: 167 QTDARLDMM----------------IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 A+L +M V+D S SME I A ++ L+ + Sbjct: 335 DVYAQLLLMPPQFSDEGLSLPRELIWVVDTSGSMEG------VSIQQARDAVLQALDTLT 388 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ F++ + F + +R ++ L G T L Sbjct: 389 PRD------RFNVIEFNSHARKLFPQAVPAQERALQQARRFVRGLKADG-GTEIAEALDR 441 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 A + +RQ +VF+TDG ++ +L+ + + + ++ + Sbjct: 442 ALSDAAPEGYVRQ------------VVFLTDGS----VGNELALFKQIDQQLGDSRLFTV 485 Query: 326 GIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 GI + F+R A ++ + + + D + Sbjct: 486 GIGPSPNRFFMRKAAQFGRGAYSHINDTAEVSDKIAE 522 >gi|125602048|gb|EAZ41373.1| hypothetical protein OsJ_25890 [Oryza sativa Japonica Group] Length = 757 Score = 44.4 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 81/224 (36%), Gaps = 43/224 (19%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S + A +D++ VLDVS SME + K+ + +++ + Sbjct: 305 LHVKAPSIAAEAAPARASVDLVTVLDVSGSMEGY------KLALLKRAMGLLG------- 351 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + + +V+FS L G + + ++ L G TN GL A + Sbjct: 352 PGD---RLAVVSFSYSARRVIRLTRMSEGGKASAKSAVESLHADGC-TNILEGLVEA-AK 406 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--------KEDQQSLYYCNEAKKRG-- 319 +FD + R + ++ ++DG++ S+ K+ G Sbjct: 407 VFDGRRYRNAVAS--------VILLSDGQDNYNVNGGWGASNSKNYSVLVPPSFKRSGDR 458 Query: 320 -AIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSH 360 V+ G + A + +F +EN + DAF+ Sbjct: 459 RLPVHTFGFGTDHDASAMHTIAEETGGTFSFIENQAVVQDAFAQ 502 >gi|254882022|ref|ZP_05254732.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319640970|ref|ZP_07995678.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] gi|254834815|gb|EET15124.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317387415|gb|EFV68286.1| aerotolerance-like membrane protein [Bacteroides sp. 3_1_40A] Length = 340 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + + ++ +I LD+S SM + D + +++D + K I+ +++ + Sbjct: 79 GSKMETVKRSGVEAVIALDISNSMLAE-DVTPSRLDKSKKLISRLVDTFNND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + ++ ++ T+ ++ A +G+ Sbjct: 131 VGLIVFAGDAFTQLPITSDYVSAKMFLETINPSLITTQGTDIGAAIRLAMKSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + + G V+ +G+ Sbjct: 191 -----------RAIIVITDGENHEGGAVEAAQEAAEK----GMQVFVLGVGSPDG 230 >gi|89054212|ref|YP_509663.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88863761|gb|ABD54638.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 1356 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 67/201 (33%), Gaps = 36/201 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----- 229 ++VLD S SM D + KI +A + I ML D+ + V GL + ++ Sbjct: 29 ILVLDGSGSMWGQIDG-VNKIVIAREVIAEMLA------DMADDVSLGLTVYGHRQRGSC 81 Query: 230 --IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 IE G + + ++ G + + + A + + Sbjct: 82 TDIETIVAPAPGTQGRILDAVNAINPRGRTPMT-DAVIAAAQSLRSTEEAA--------- 131 Query: 287 YKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRACA-- 340 ++ ++DG EN + + A V IG V + ++ A Sbjct: 132 ---TVILVSDGIENCNPDPCAIAAELEATGVDFTAHV--IGFDVASEPEARAQMQCIADN 186 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 + F +N + A + Sbjct: 187 TGGQFLTADNATELSQALEQV 207 >gi|89100226|ref|ZP_01173093.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085076|gb|EAR64210.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 463 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 52/370 (14%), Positives = 113/370 (30%), Gaps = 27/370 (7%) Query: 18 GMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKL 77 I IF++ E + K D + + + + N + + Sbjct: 3 NFIKKLTILFLSIFIITACSKESGNNTEQKQS--KGNDAAAIGQVDESEDTENAEDAEDA 60 Query: 78 KGGDILCRIKNT-WNMSFRNELRDNGFVNDIDDIVRS---TSLDIVVVPQNEGYSISAIS 133 K L K ++ F + I +L V E Sbjct: 61 KEQINLAENKKIPASLEEIISYPKGPFTAEDTQIKDPEVQQALSKVPELPEEASEEELND 120 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + + ++ + +S+ ++ I+LD S SM + S Sbjct: 121 LFAYLYSLFRKEYRDPREAIVSLTVSGPESEGSSEEKGSFNVEIILDSSGSMANKM-GSQ 179 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKY 249 T++++A SI +P+ NV GL + +K + + + L + Sbjct: 180 TRMELAKASIKKF---ASALPEEANV---GLRVYGHKGTGSDADKKMSCASNELVYAPQP 233 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + ++T + + + + + K I+ ++DG D Sbjct: 234 YIEAELNTALNKFKPAGWTPLAQSLMEAQKDLEAYKGEKNKNIVYVVSDG---IETCDGN 290 Query: 308 SLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIG 362 + K G +V IG V + + L A A+ ++ V++ + + Sbjct: 291 PVEAAASLKDSGVAPVVNIIGFDVNGKDQQQLEEVAQAAGGTYQNVKSQEQLDNELEKAI 350 Query: 363 KDIVTKRIWY 372 ++ WY Sbjct: 351 EESGKWTKWY 360 >gi|320094784|ref|ZP_08026529.1| hypothetical protein HMPREF9005_1141 [Actinomyces sp. oral taxon 178 str. F0338] gi|319978281|gb|EFW09879.1| hypothetical protein HMPREF9005_1141 [Actinomyces sp. oral taxon 178 str. F0338] Length = 352 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 16/145 (11%), Positives = 54/145 (37%), Gaps = 11/145 (7%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S +Q + +++ +V+D + SM +D +ID + + +LE++ Sbjct: 54 GPSVEGEATQVSSSVEVYMVIDRTGSMVAEDWDGKKPRIDGVRQDVATILEKMAGS---- 109 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + ++++ + + L + + + + ++ + + + Sbjct: 110 ---RFSIISWDSGVRTELPLTTDSTAVTSYMATFVQELSESSQGSSPDRPASHLATILEK 166 Query: 277 RQHCNTEDANYKKIIVFMTDGENLS 301 + + ++ + + TDGE + Sbjct: 167 NKQKHPQNL---RTLFVFTDGETSN 188 >gi|268610164|ref|ZP_06143891.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 285 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 69/202 (34%), Gaps = 15/202 (7%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + L + ++D S SM K+ + ++ E++ + + + V+ Sbjct: 41 LSATGVSRKSLVIFFLIDTSGSM------KGKKMGELNTVMEELIPEIRRVGEADTEVKV 94 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++TFS + + + + L GV++ + A+ ++ N Sbjct: 95 AVLTFSTDVRWMYSTPIPIEDFEWA--RLRANGVTS-----MGAAFKELSLRMSRNSFLN 147 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACA 340 + ++ +I MTDG + N K G A+GI + + L Sbjct: 148 SPSLSFAPVIFLMTDGYPSDDYREGLKELQSNSWYKFGLKA-ALGIGNEANDDVLAEFTG 206 Query: 341 SPNSFYLVENPHSMYDAFSHIG 362 S ++ + + I Sbjct: 207 SKDTVVHAYSGGQLAQMIKIIA 228 >gi|325982790|ref|YP_004295192.1| PEP motif putative anchor domain-containing protein [Nitrosomonas sp. AL212] gi|325532309|gb|ADZ27030.1| PEP motif putative anchor domain protein [Nitrosomonas sp. AL212] Length = 333 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 34/218 (15%), Positives = 70/218 (32%), Gaps = 21/218 (9%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK--IDMAIKSINAMLEEVK 210 +V P ++ + LD+ +VLD S SM + + A ++ L Sbjct: 46 QVVNPFGNANASGIVGGSGLDLALVLDSSGSMGAVDSGKTLNQWLQEASTALVNALPAAS 105 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 V + S + ++ L G + + I + G TN G+ A ++ Sbjct: 106 TSVSVIDFDSSAAI-----LQGLTPLSSGSAAVISAINAIDASG-GTNIGAGIDSAAAEL 159 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 A +++V ++DG + ++ +G+ Sbjct: 160 TGAN--------HTAGSTQMMVVVSDGFSSGDPASSALAALGAGVDA----IHTVGLPGH 207 Query: 331 RSHEFLRACASPNSFY-LVENPHSMYDAFSHIGKDIVT 367 + S N Y + S+ D F+ ++V Sbjct: 208 DAFTMQNIATSGNGIYTNASSLTSLIDLFNGTSGNLVG 245 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 33/78 (42%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F+ +++G + LT I +P++ ++I+V + T L + +D + A ++ Sbjct: 8 RFWNDHRGYVIALTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGARELDGRD 67 Query: 70 NGNNRKKLKGGDILCRIK 87 + R + I Sbjct: 68 DAITRAQTAIEKIANSAA 85 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 12/53 (22%), Positives = 29/53 (54%), Gaps = 1/53 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++T IF+ ++ VLGM +++ ++FF + L + D + + ++ Sbjct: 22 RRERGAFAMMTIIFMTVMIAVLGM-LDIGNVFFQRRDLQRIADMAALAGVQRL 73 >gi|281349604|gb|EFB25188.1| hypothetical protein PANDA_001994 [Ailuropoda melanoleuca] Length = 805 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 61/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+D S SM+ ++ +A +I+ +L+ + N+ V + + Sbjct: 269 ATSPKDIVIVVDTSGSMKGL------RMTIAKHTISTILDTL----GENDFVNIIAYSDY 318 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV L A+ + Q RQ Sbjct: 319 IHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGV-VNQALTEAFQILKQFQEARQGS 377 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 378 LCNQA-----IMLITDGAVEDYEPVFEKYNWPD----RKVRVFTYLIGR 417 >gi|212693196|ref|ZP_03301324.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|237709938|ref|ZP_04540419.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725395|ref|ZP_04555876.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753590|ref|ZP_06088945.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664301|gb|EEB24873.1| hypothetical protein BACDOR_02706 [Bacteroides dorei DSM 17855] gi|229436082|gb|EEO46159.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456031|gb|EEO61752.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235304|gb|EEZ20828.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 340 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + + ++ +I LD+S SM + D + +++D + K I+ +++ + Sbjct: 79 GSKMETVKRSGVEAVIALDISNSMLAE-DVTPSRLDKSKKLISRLVDTFNND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GL+ F+ + + ++ ++ T+ ++ A +G+ Sbjct: 131 VGLIVFAGDAFTQLPITSDYVSAKMFLETINPSLITTQGTDIGTAIRLAMKSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + + G V+ +G+ Sbjct: 191 -----------RAIIVITDGENHEGGAVEAAQEAAEK----GMQVFVLGVGSPDG 230 >gi|167376657|ref|XP_001734086.1| hypothetical protein [Entamoeba dispar SAW760] gi|165904540|gb|EDR29755.1| hypothetical protein EDI_315310 [Entamoeba dispar SAW760] Length = 373 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 57/176 (32%), Gaps = 21/176 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ + D + SM S+ ++ I+ I +I I V+ LV + + Sbjct: 4 LDLVFLCDTTGSMGSYLRAAQESIEKIISTI---------IQSEKCDVRFALVEYRDHPP 54 Query: 232 E----FFLLEWGV---SHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCN 281 + F L ++ + ++ G G+ A N + + Sbjct: 55 QDNSFAFRLTDFTDSMKAIKEAVNRMNAKGGGDEPESVCCGMHCASNLKYRESAAKVIVW 114 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 DA I DG L E K+ ++Y +G +LR Sbjct: 115 IGDAPPHGFI-REGDGFPKGCPCGYDLLKEVRECMKKDIVIYCVG-AEPLGSRYLR 168 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 44.4 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 52/156 (33%), Gaps = 21/156 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V DVS SM ++ +S+ + + V GLV++S + Sbjct: 394 VFVADVSGSMAG------EPLNRLKQSLINGSKYISSD------VSIGLVSYSTDVNINL 441 Query: 235 LLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + ++ + + N+ ++ I M + ++ N K ++ Sbjct: 442 PIAKFDLNQRSLFVGAVESLAAGGNTAT-----FDAIIVATKMLKEEKAKNPNAKLMLFV 496 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 ++DG + K G +Y IG Sbjct: 497 LSDGVTNYGHSLNDIK---DMMKTFGIPIYTIGYNA 529 >gi|291443930|ref|ZP_06583320.1| toxic cation resistance protein [Streptomyces roseosporus NRRL 15998] gi|291346877|gb|EFE73781.1| toxic cation resistance protein [Streptomyces roseosporus NRRL 15998] Length = 294 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 23/159 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +S++ + R + +VLD S SM+ ++ + + + + D V Sbjct: 80 ASLEKHGLGGHRAAVYLVLDYSGSMKPYYQDG--SVQALADRVLGLSSHLD--DDGTVPV 135 Query: 220 QSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V FS ++ L+ + + + L G T+ + + D Sbjct: 136 ----VFFSTDVDAVTDIALDNHHGRIDKIVAGLGHMGK-TSYHLAMDAVIDHYLDSG--- 187 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 + ++VF TDG ++ ++ Y C AK Sbjct: 188 -------STAPALVVFQTDGGPINKLAAER--YLCKAAK 217 >gi|239940501|ref|ZP_04692438.1| hypothetical protein SrosN15_05838 [Streptomyces roseosporus NRRL 15998] gi|239986983|ref|ZP_04707647.1| hypothetical protein SrosN1_06727 [Streptomyces roseosporus NRRL 11379] Length = 250 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 23/159 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +S++ + R + +VLD S SM+ ++ + + + + D V Sbjct: 36 ASLEKHGLGGHRAAVYLVLDYSGSMKPYYQDG--SVQALADRVLGLSSHLD--DDGTVPV 91 Query: 220 QSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V FS ++ L+ + + + L G T+ + + D Sbjct: 92 ----VFFSTDVDAVTDIALDNHHGRIDKIVAGLGHMGK-TSYHLAMDAVIDHYLDSG--- 143 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 + ++VF TDG ++ ++ Y C AK Sbjct: 144 -------STAPALVVFQTDGGPINKLAAER--YLCKAAK 173 >gi|149440247|ref|XP_001521494.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3, partial [Ornithorhynchus anatinus] Length = 390 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 39/284 (13%), Positives = 97/284 (34%), Gaps = 37/284 (13%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 E G + + + + SF + F +D + + L+ N + Sbjct: 120 EPQGISSLDAEATFMTNDLLGVLTKSFSGKKGHVSFKPTLDQ--QRSCLNCTTSMMNGDF 177 Query: 128 SISAISRYKIPLKFC---TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 +I+ + P + + + + + +++ V+DVS SM Sbjct: 178 TITYDVVRESPANVQIVNGYFVHFFAPKDLPV-------------VPKNVVFVIDVSGSM 224 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEEFFLLEWGVSHL 243 K+ +++ +LE++K +N ++ S + T+ + + + + Sbjct: 225 YG------RKLVQTKEALLKILEDMKEEDYLNFILFSSEITTWKDTLIKATPE--NLKKA 276 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + +K + G+ TN GL + + + + +I+ +TDGE + Sbjct: 277 KEFVKNIKDEGL-TNINDGLMRGIKMLNEARETNVVPKRSTS----LIIMLTDGE--ANV 329 Query: 304 EDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS 344 + ++ + +Y +G ++ L A N Sbjct: 330 GEIRADKIQENVRNAIGGKFPLYNLGFGYDLNYNLLEKMALENH 373 >gi|332520546|ref|ZP_08397008.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332043899|gb|EGI80094.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 698 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 36/273 (13%), Positives = 90/273 (32%), Gaps = 30/273 (10%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDI-----VRSTSLDIVVVPQN 124 N + + +F ++ + N I + + ++ I + Sbjct: 235 IANESYTKIHENKFKLTNTSALSTFSIDVDKASYSNVRRMINNAQHIPADAVKIEEMINY 294 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD--ARLDMMIVLDVSR 182 Y+ S + + P T + + + + ++ ++DVS Sbjct: 295 FNYNYS-QPKDEHPFAIHTEVAQTPWNNQTKLVKIGLQGKTYENKELPAANLTFLIDVSG 353 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM ++ + + ++++++ V+ VV +G +E + Sbjct: 354 SMSH-------ELKLLKSAFKLLVDQLRDKDKVSIVVYAGAAG--VVLEPTSGKD--KKK 402 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLS 301 + + + L + G ST G+ AY ++ + + N ++ TDG+ N+ Sbjct: 403 ILKALNKL-QSGGSTAGGAGINLAY-KLAEENFNKNGNNR--------VILATDGDFNVG 452 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +Q E +K G + +G + Sbjct: 453 ASSNQAMEDLIIEKRKSGVFLSVLGFGYGNYKD 485 >gi|291386245|ref|XP_002710074.1| PREDICTED: von Willebrand factor A domain containing 3B [Oryctolagus cuniculus] Length = 1381 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +++D S SM+S K+D+ I ++E N V+ + + + Sbjct: 509 IYVLIDTSHSMKS-------KLDLVKDKIIQFIQEQLKYKSKFNFVKFDGQAVAWQEKLA 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q I+ + G +TN+ L+ A+ + I Sbjct: 562 EINEENLEQAQSWIRDIK-VGSTTNTLDALQIAFTD----------------KETQAIYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ + + +YAI ++ FL+ AS Sbjct: 605 LTDGRPDQPPE-----MVMDQVRVFQKIPIYAISFNYHDEIANTFLKELAS 650 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 44.4 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 56/155 (36%), Gaps = 20/155 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I+LD+SRSME+ D ++++ A I ++ + V G S F Sbjct: 92 ITILLDLSRSMEAD-DVWPSRLERAKLEIEDFVKNTDNLS----VALVGFAGTSFVASPF 146 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 S++ ++ S T L A N + KK I+ Sbjct: 147 TQDMETFSYILNELNTKSVTLQGTRIADALVTAKNTFNV-----------NIPGKKSIIL 195 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +TDGE+ + D E K VY +G+ Sbjct: 196 ITDGEDHAGYFD----NILKELKDNDISVYTVGVG 226 >gi|224142781|ref|XP_002324729.1| predicted protein [Populus trichocarpa] gi|222866163|gb|EEF03294.1| predicted protein [Populus trichocarpa] Length = 751 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 76/219 (34%), Gaps = 41/219 (18%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +S+ R +++ V+D+S SME ++ +++A L + N + +G Sbjct: 318 HSRKVFRKEIVFVVDISGSMEG------APLEGTKIALSAALTNLDSKDSFNIIAFNGET 371 Query: 225 T-FSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 FS+ +E ++R ++++S G TN LK A + G Sbjct: 372 YLFSSSMELASEDT-----VERAVEWMSMNLIAGGDTNILVPLKQATEMLSKSGGSIP-- 424 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-----GAI--VYAIGIRVIRSH 333 I +TDG + + C+ K + GI +H Sbjct: 425 ---------FIFLVTDG------AVEDERHICDIMKSHITGGGSIHPRICTFGIGSYCNH 469 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 FLR A S + + S+ + I + I Sbjct: 470 HFLRMLAMISRGQYDAAYDIDSVESRMQKLLSRISSTII 508 >gi|297673849|ref|XP_002814962.1| PREDICTED: anthrax toxin receptor 2 isoform 2 [Pongo abelii] Length = 411 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 2 RLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 61 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 62 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 109 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 110 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 143 >gi|133777718|gb|AAI14375.1| ANTXR2 protein [Homo sapiens] Length = 446 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 37 RLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 96 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 97 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 144 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 145 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 178 >gi|79154032|gb|AAI07877.1| ANTXR2 protein [Homo sapiens] Length = 411 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 2 RLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 61 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 62 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 109 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 110 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 143 >gi|55733508|emb|CAH93432.1| hypothetical protein [Pongo abelii] Length = 435 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 26 RLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 85 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 86 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 133 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 134 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 167 >gi|288928459|ref|ZP_06422306.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331293|gb|EFC69877.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] Length = 554 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 63/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ ++ +I +D+S SM + D ++++ + I +++ + Sbjct: 79 GTKITHDKRNGIETIIAVDISNSMMAQ-DVVPSRLEKSKLLIENLVDNFTHD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 GLV F+ + + ++ + T+ + + Sbjct: 131 IGLVVFAGDAFVQLPITTDYVSAKMFLQNIDPALIATQGTDIAKAINLSMRSF------- 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ + K I+ +TDGE+ + +L A +RG V+ +GI + Sbjct: 184 ----SQQKDIGKAIIVITDGEDH----EGGALEAAKAANERGIHVFILGIGSTKG 230 >gi|254468097|ref|ZP_05081503.1| von Willebrand factor, type A [beta proteobacterium KB13] gi|207086907|gb|EDZ64190.1| von Willebrand factor, type A [beta proteobacterium KB13] Length = 326 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 70/222 (31%), Gaps = 53/222 (23%) Query: 173 DMMIVLDVSRSMESFFDSSI---------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + +VLD S SM+ F S TK A + I +E K G+ Sbjct: 81 QIGLVLDRSASMDDPFSGSTQFDDEGVGETKSAAAARLIINFVESRKND-------MIGV 133 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV-STNSTPGLKYAYNQIFDMQGMRQHCNT 282 +TFSN L S + + + + TN GL + +++F Sbjct: 134 ITFSNSAMFVLPLTQNKSAITGAVNATAGNALFQTNIGAGLS-SVSELFAKVEDSGS--- 189 Query: 283 EDANYKKIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--------- 332 + ++ ++DG Q+ + + +Y I +R Sbjct: 190 ------RAVILLSDGAGRIDAPTQQKIRDWFDRF---DIGLYWIVLRQPGGISIFDENLK 240 Query: 333 -------------HEFLRACASPNSFYLVENPHSMYDAFSHI 361 ++ + SP Y E+P S+ A I Sbjct: 241 IRDETQPPPQIELFDYFKTFRSPFQAYEAEDPASLEKAIKDI 282 >gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia nassauensis DSM 44728] Length = 831 Score = 44.4 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 79/211 (37%), Gaps = 36/211 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++++LD S SM + K+ A ++ +++ + + + F + Sbjct: 303 PRDVVVLLDRSGSMGGW------KMVAARRAAARIVDTLSSAD------RFAVRCFDTAM 350 Query: 231 EE-----FFLLEWGVSHLQ-RKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 L G + R +++L ++ T+ L A + + + R Sbjct: 351 TSPEGLDPNGLSAGTDRNRFRAVEHLAGTETRGGTDILKPLSTAVDLLTAGEKGR----- 405 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 ++I+ +TDG+ EDQ + G V+ +GI + FL A Sbjct: 406 -----DRVIILVTDGQ--VGNEDQILREL--TGRLSGMRVHVVGIDKAVNAGFLHRLALV 456 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 LVE+ + +A +HI + IV + Sbjct: 457 GRGRCELVESEDRLDEATAHIHRRIVAPVVT 487 >gi|315186710|gb|EFU20468.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 289 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 45/98 (45%), Gaps = 9/98 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + +V+DVSRSM S + +K D +L + + + N + G + F++++E Sbjct: 77 LVLFLVVDVSRSMRS-GSNQYSKFDCL-----EILFSIFSLVTLENNDRVGALFFTDEVE 130 Query: 232 EFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYA 266 + G +H+ ++ + G ++ L+ A Sbjct: 131 DVIPPRKGKTHVLALLRRFRDMRPRGRGSDLALALRTA 168 >gi|226228456|ref|YP_002762562.1| hypothetical protein GAU_3050 [Gemmatimonas aurantiaca T-27] gi|226091647|dbj|BAH40092.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 642 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 73/194 (37%), Gaps = 21/194 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T PW + + + + S ++ + + +++ ++DVS SM+S K+ + +S Sbjct: 245 VTTAPWQPRHQLVRIALQSR-RIETASLPPNNLVFLIDVSGSMQS-----PDKLPLVKQS 298 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++++++ V V +G + + + I+ L G + Sbjct: 299 LRLLVDQMRPQDRVAIVAYAGAAG----LVLPSTSGDEKETIIQAIERLEAGGST-AGGA 353 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGA 320 G++ AY R+H N ++ +DG+ N+ D + + G Sbjct: 354 GIELAYRT------AREHFMDHGNNR---VILASDGDFNVGVSSDGELERLIERKRTEGT 404 Query: 321 IVYAIGIRVIRSHE 334 + +G + Sbjct: 405 YLTILGFGTGNYQD 418 >gi|171693029|ref|XP_001911439.1| hypothetical protein [Podospora anserina S mat+] gi|170946463|emb|CAP73264.1| unnamed protein product [Podospora anserina S mat+] Length = 1378 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 69/204 (33%), Gaps = 21/204 (10%) Query: 138 PLKFCTFIPWYTNSRHIVMPIT-SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 P + P N + +++ I S + +++++ D+S SM+ K+ Sbjct: 324 PQAWIEKHPTLPNQQALMVTIPPGFTTRTSNPTDQTEILLLADLSGSMDD-------KLT 376 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++ L+ + S K + +++G + Q ++ + Sbjct: 377 SLRAAMQFFLKGI----PNGRKFNVWCFGSSYKSWQPHSVDYGEASYQSASSWVDTNFHA 432 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 L A I + R + I+ +TDGE E + + + Sbjct: 433 NMGGTELLPAVQAIVTARDKRLPTD---------IIILTDGETWRLDETLEYIRKQRDLT 483 Query: 317 KRGAIVYAIGIRVIRSHEFLRACA 340 + G +A+GI SH + A Sbjct: 484 EGGIRFFALGIGPAVSHALVEGIA 507 >gi|295132196|ref|YP_003582872.1| hypothetical protein ZPR_0316 [Zunongwangia profunda SM-A87] gi|294980211|gb|ADF50676.1| conserved hypothetical protein [Zunongwangia profunda SM-A87] Length = 288 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 44/162 (27%), Positives = 63/162 (38%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+DVS S FF + I I A L + GL+ F Sbjct: 72 EEERELTMMLVVDVSGS--EFFGTQSQFKKEVITEIAATLAFSATQNNDK----IGLLMF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE + + G SH+ R I+ L T+ GLKY N + Sbjct: 126 SDQIENYIPPKKGKSHVLRIIRELLEFQPKSKKTDIGLGLKYLSNVMK------------ 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D + QQ+L G VY Sbjct: 174 ----KKAIVFVLSD---FMGDDYQQTLKITGNRHDVTGIRVY 208 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 57/185 (30%), Gaps = 15/185 (8%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM + + K +++ + I + V L+ F + Sbjct: 187 VVDYSGSMSGEGKNGVVK--GLNAALDPDQAKKSYIEPASGGVNI-LIPFETEAHRPVKA 243 Query: 237 E-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 S L + G T+ L A +++ Q+ IV MT Sbjct: 244 TGTSTSDLLHEADATDASG-GTDIYEVLLSALDELPSESEASQYTTA--------IVLMT 294 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMY 355 DG + S +D+ +++ R +++I + N+ + Sbjct: 295 DGRSNSDHQDE--FESAYKSRGRDLPIFSIMFGDADPSQLKSLATLSNAKVFDGRSGDLA 352 Query: 356 DAFSH 360 F Sbjct: 353 AVFRQ 357 >gi|218130608|ref|ZP_03459412.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] gi|217986952|gb|EEC53283.1| hypothetical protein BACEGG_02197 [Bacteroides eggerthii DSM 20697] Length = 247 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 57/190 (30%), Gaps = 13/190 (6%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM KI ++ +L + I N + + +E Sbjct: 17 LFFVIDTSGSMAGN------KIGAVNDAVENVLPMLDEISASNPDAEIKVAA----LEFS 66 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 W + + V+ + L A ++ T ++ I+ Sbjct: 67 SGCNW-LYDEPKLASEFVWQDVTASGLTSLGAACQELNTKLSRNGFMQTPSGSFAPAIIL 125 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLVENPH 352 ++DG S N K + AI I + L + + + + V N Sbjct: 126 LSDGGPTDDFYGGLSKLKANNWFKNAIKI-AIAIGDDADKDVLTQFTGTNEAVFTVHNID 184 Query: 353 SMYDAFSHIG 362 ++ + Sbjct: 185 ALKQIIRVVA 194 >gi|104779436|ref|YP_605934.1| surface adhesion protein [Pseudomonas entomophila L48] gi|95108423|emb|CAK13117.1| Surface adhesion protein [Pseudomonas entomophila L48] Length = 5862 Score = 44.4 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 69/202 (34%), Gaps = 17/202 (8%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + S+ ++ + + T + + + ++ Sbjct: 5275 TSTESVGGSTATTEGTIKVTVYPQTYSTSNLSSDSDNITGTDGNDVVVADVSGLHVVPGQ 5334 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGLVTFSN 228 ++ ++D S SM S + +D A KS+ ++ + + + D + V LV F+ Sbjct: 5335 NYNLAFIVDTSGSMGS------SGVDAAKKSLESVFKTLAASVKGDQSGTVNILLVDFAT 5388 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +++ + + LQ + L+ + TN K N + + Sbjct: 5389 QVKSSVAVTLNDAGLQTLLNALNNLRADGGTNYEDAFKTTAN--WFQNLKDGGNTGSNQT 5446 Query: 287 YKKIIVFMTDGENLSTKEDQQS 308 + F+TDG+ + ++ S Sbjct: 5447 F-----FITDGKPTYYQANENS 5463 >gi|332299343|ref|YP_004441264.1| double-transmembrane region domain protein [Porphyromonas asaccharolytica DSM 20707] gi|332176406|gb|AEE12096.1| double-transmembrane region domain protein [Porphyromonas asaccharolytica DSM 20707] Length = 342 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 59/176 (33%), Gaps = 26/176 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + V+SQ +D+ +DVS SM + D +I A + + ++E+ Sbjct: 80 THAPVSSQQTIGVDLAFCIDVSNSM-AARDVKPDRIGFAKQIVTHTMQELAGS------- 131 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGM 276 + +V F+ L + + + + TN L+ + + Sbjct: 132 RVAMVVFAGGAYIRLPLTPDLPTARTFLADIQPGMVSNQGTNLGQALERSAQALSAPSRA 191 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 K ++ +TDGE ++ + KK+ Y + I + Sbjct: 192 G-----------KAVIILTDGE-DHEGGLEEGI---ERLKKQEIKAYVVTIGLPDG 232 >gi|302384159|ref|YP_003819982.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302194787|gb|ADL02359.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 591 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 61/177 (34%), Gaps = 30/177 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + ++ V+D S S +++ ++ ++ +L E + Q L+ F Sbjct: 404 EQRSESTVIFVVDASGS------AALQRLAETKGAVELLLAEAYV-----RRTQVALIAF 452 Query: 227 SNK-IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + ++ ++++ L G +T GL+ A I + Sbjct: 453 RGEGADLLLPPTRSLARARKQLAEL-AGGGATPLAAGLEMA--AILAVAE----RAKGRT 505 Query: 286 NYKKIIVFMTDGENLS--------TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 +VFMTDG T+ +Q +L C + G I I + Sbjct: 506 PL---LVFMTDGRGNIALDGGAFRTRAEQDALNACRRIRAAGLRAALIDISPRPRGD 559 >gi|332027662|gb|EGI67730.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Acromyrmex echinatior] Length = 1211 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 79/206 (38%), Gaps = 35/206 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++D S SM + ++A IN +L+ + N+ V +TFS Sbjct: 218 ATSPKDILILMDTSGSMTGI------RREIAKHVINNILDTL----GNNDFVNI--ITFS 265 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 N+ +E G L+R I L+ + N + L A+ + + R+ Sbjct: 266 NETKEVVSCFDGTLVQANLANVRELKRAISNLNTERI-ANFSLALTTAFELLETFRNERE 324 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE----AKKR-GAIVYA--IGIRVIR 331 A I+ +TDG + K+ ++ + + K V+ IG V Sbjct: 325 GARCNQA-----IMLITDGVPYNYKDIFEAYNWKDNPDEPLKADMPVRVFTYLIGREVAD 379 Query: 332 SHEF-LRACASPNSFYLVENPHSMYD 356 E ACA+ + + P + + Sbjct: 380 VREVQWMACANRGYYVHLCTPAEVRE 405 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 47/330 (14%), Positives = 98/330 (29%), Gaps = 78/330 (23%) Query: 30 IFLVLGMIIEVSHIFFMK---TVLHSMIDRSLVHAATQIMNEGNGNNR-KKLKGGDILCR 85 + + LG+ I + + ++ +V+ + +EG ++ + L + Sbjct: 15 LIIFLGLGIVTTITSYATDDVNQEQNIEVGPIVNEESTEGDEGQKDDSYEDLLDESVRTS 74 Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD-IVVVPQNEGYSISAISRYKIPLKFCTF 144 N + L + FV ++ + + P N RYK+ Sbjct: 75 ESNKLETVIQTNLNQSLFVTSASNVKPQIDFTYLGITPINPMQGQEFTVRYKLTPNPFQH 134 Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 ++++VLD S SM S TK+ + Sbjct: 135 ----------------------NISKPKEIVLVLDGSGSM------SGTKLTNLKNAAKD 166 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK------------------ 246 ++ +K + ++ +V FS+ + G + ++ Sbjct: 167 FIDRLKGVDNLK----VAIVVFSSNATINPISVSGTTKIKSTDKSSESSIPNYKTLQNEY 222 Query: 247 ------------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 I + G TN+ GL+ A + D+ K I+ M Sbjct: 223 FLDINDSRLITMINNIDAQG-GTNTGDGLRKA-EYLLSQ--------KGDSVANKTIILM 272 Query: 295 TDGENLSTKEDQQS-LYYCNEAKKRGAIVY 323 +DG +S + Y E K ++ Sbjct: 273 SDGLPTYYSGSTESGVNYYKEIKDDVVGIF 302 >gi|310830566|ref|YP_003965667.1| Stress protein [Paenibacillus polymyxa SC2] gi|309250033|gb|ADO59599.1| Stress protein [Paenibacillus polymyxa SC2] Length = 256 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 34/161 (21%), Positives = 57/161 (35%), Gaps = 20/161 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + N T + IVLD S SM S + + ++S+ L V L D N + Sbjct: 47 LTKNGYTGGPARVGIVLDQSGSMSSLYRNGT------VQSVVERLLPVALRFDDNGELDV 100 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L +K ++ R+I K ++ T+ P + + + + Sbjct: 101 WLFADRSKRLNPVSDRDFYQYVDREIMGKRENQIWGGTSYAPVMHDVFQK---------Y 151 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 E +N I F+TDG N + + E K G Sbjct: 152 VVEEPSNLPTFITFITDGNN---SDQAATTKLMKELSKHGI 189 >gi|68448495|ref|NP_001020335.1| inter-alpha (globulin) inhibitor H4 [Danio rerio] gi|67677852|gb|AAH96879.1| Zgc:112265 [Danio rerio] Length = 915 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 67/196 (34%), Gaps = 33/196 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ ++D S SM +I ++ +L+++ GL+TF +I+ Sbjct: 271 VVFIIDRSGSMHG------RRIRQTRSALLTILKDLDEDDHF------GLITFDAEIDFW 318 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + +K + G TN + + I Sbjct: 319 KRELLQATKANRENAESFVKRIQDRGA-TNINDAVLAGVDMI---------NRNPRKGTA 368 Query: 289 KIIVFMTDGENLSTKED-QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + + + ++ + EA +Y +G + +FL + N+ Sbjct: 369 SILILLTDGDPTAGETNIEKIMANVKEAIGSKFPLYCLGFGYDVNFDFLTKMSLENNAVA 428 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 429 RRIYEDSDADIQLQGF 444 >gi|323444971|gb|EGB01833.1| hypothetical protein AURANDRAFT_69450 [Aureococcus anophagefferens] Length = 376 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 57/157 (36%), Gaps = 21/157 (13%) Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 +++ + GLV + + L + + ++ L+ G STN GL+ ++ Sbjct: 35 SLDDTCRLGLVVYDAQARVVVGLARVTPAHVAKVHAALEKLAP-GTSTNLWGGLELGVDE 93 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRG---AIVYAI 325 + G + ++ +TDG N S E + + K G V+A Sbjct: 94 LVGGAGDNA----------RAVLLLTDGVPNNSPPEGEVAALRAKRLTKDGSETVAVFAA 143 Query: 326 GIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSH 360 G + L + A F V + + +F+H Sbjct: 144 GFGYALKSDLLLSLAREGGGLFSFVPDAGMVGTSFNH 180 >gi|59714345|ref|YP_207120.1| RTX repeat-containing calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] gi|59482593|gb|AAW88232.1| RTX (repeats in toxin) calcium-binding cytotoxin RtxA2 [Vibrio fischeri ES114] Length = 3933 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 79/232 (34%), Gaps = 25/232 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV----- 164 I+ + + V N + LK +NS +++ Sbjct: 3389 ILPVGTGNQTVDFSNLSIQVPKSES-DFTLKATATAIEASNSDASSGDAEATLPQDILVD 3447 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + + LD S SM ++ + ++S +LE+++ P+ + VQ L+ Sbjct: 3448 QGGSLPTTLITLALDSSGSMGDVEVDDKERMQLVLESSIKLLEDIQNQPN-SGTVQVQLI 3506 Query: 225 TFSNK-----IEEFFLLEW-----GVSHLQRKIKYLS-----KFGVSTNSTPGLKYAYNQ 269 F N+ + L W ++HLQ I+ G T+ + + Y Sbjct: 3507 DFDNRHSDETNDTATSLGWYSVADAITHLQDAIEIEMISDPYYIGGGTDYSEAI---YAI 3563 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + + + + +N +I F++DG N + + + + Sbjct: 3564 LDGYSNDKVPNDVDLSNTNDVIYFISDGHNNEVIPADLLVQWNEFIEDKEVK 3615 >gi|317122082|ref|YP_004102085.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592062|gb|ADU51358.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 791 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 50/133 (37%), Gaps = 27/133 (20%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ +VLD S SM ++I A +L V+ + ++TF ++ Sbjct: 607 PVDVCLVLDASASMAG------SRIRAAKDLAQQLL--------VSTRDRVAVITFQERV 652 Query: 231 EEFF-LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L + ++R + + +G+ T GL+ A + + Sbjct: 653 VQVQVPLTRNTTRVERGLSQIQPYGL-TPLAQGLEAALAYLAQSRARNP----------- 700 Query: 290 IIVFMTDGENLST 302 ++V +TDG Sbjct: 701 LLVLITDGIPTVP 713 >gi|168699404|ref|ZP_02731681.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 367 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 66/169 (39%), Gaps = 21/169 (12%) Query: 173 DMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D +IV+D+SRSM + + ++ + A +L ++ + G+V F+ + Sbjct: 112 DTVIVIDLSRSMLAEDMADPGAKSRWEAARSGALDLLAAMER----RGGHRVGVVLFAAR 167 Query: 230 IEEFFLLEWGVSHLQRKIKYLS--------KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + L H + ++ ++ + G ++T G ++ + Sbjct: 168 PKLVCPLTTDYKHARAVLRAVNGRFPPPECRPGPEADATSGTRFG-----AALVAAVAAH 222 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + IV ++DG+ + D++ + N A+ V+ +G+ Sbjct: 223 DPRFVGAQDIVLISDGD-DPEESDREWVRGANAARTANVPVHTVGVGNP 270 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 44.4 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 65/195 (33%), Gaps = 25/195 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEE 232 I+LD S SM D ++K +MA + I +++K V V G S K++ Sbjct: 39 AILLDASGSMAKRIDG-VSKYNMAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKVQS 97 Query: 233 FFLLE--WGVSHLQRK--IKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +G R+ + L+ G + T L+ A + + Sbjct: 98 CESIRGVYGFQRFDRQSFLNSLNGIGPTGWTPIAKALEDAKASFTGLHKLGS-------- 149 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRAC-ASPN 343 K + +TDGE + E +K+ V IG + A Sbjct: 150 --KSVFLLTDGE---ETCGGDPVKTAKELRKQHIKVNVIGFDFKEGFNGQLHEIAKAGGG 204 Query: 344 SFYLVENPHSMYDAF 358 +Y + M F Sbjct: 205 KYYEAHSQKDMNRIF 219 >gi|238927210|ref|ZP_04658970.1| magnesium chelatase [Selenomonas flueggei ATCC 43531] gi|238884992|gb|EEQ48630.1| magnesium chelatase [Selenomonas flueggei ATCC 43531] Length = 636 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 60/149 (40%), Gaps = 20/149 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A +++ ++D S SM + ++ M +I A+L+E + V GL+ F Sbjct: 447 KRAAANILFLVDASGSM-----GARERMRMVKGAILALLQE---AYQKRDCV--GLIAFR 496 Query: 228 -NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ E + V +++++ L G T GL A + +++ Sbjct: 497 RDRAETLLPMTRSVELAEKQLRDLPT-GGRTPLAEGLACAVQTLRELERRGSE------- 548 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 K +++ +TDG + ++ + A Sbjct: 549 -KTVLILITDGRTNTARDGDDGVQRALRA 576 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 36/94 (38%), Gaps = 2/94 (2%) Query: 8 IRNFFYNYKGGMTILTA-IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++ F+ + +G I TA LPI+ + G +++ + L + +D + + T+ Sbjct: 4 LKAFWSSERGNFAITTAFAMLPIMIGLAG-AVDLIGTSHDASQLQNSLDAAGLAIGTKFS 62 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD 100 + + ++L + + + Sbjct: 63 PDMAAGDVQQLGLQFFAANMSAADQQEYLGSVSA 96 >gi|77552209|gb|ABA95006.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 605 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 62/179 (34%), Gaps = 10/179 (5%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ + K+ + +V + S A +D++ VLDVS Sbjct: 9 TVQVTAYAKVDAVHVDAVKQRPGVPVLVRVVAPPPAAASSERAPIDLVAVLDVSC---CG 65 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + ++D+ K++ +++++ + V + K + + G R + Sbjct: 66 GLGPVNRMDLLKKAMGFVIDKLGEHDRLAVVPVQASAAIAEKHDLVEMNAEGRKEATRMV 125 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + LK A + +G + H IV ++DG++ S D Sbjct: 126 QSSLTVTGENKLSTALKKAATIL---EGRKDHDKK----RPGFIVLISDGDDASVLNDA 177 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 44.4 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 38/195 (19%), Positives = 71/195 (36%), Gaps = 34/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM S +++ ++ L + V N G+V F++ Sbjct: 307 VCLVLDTSGSMLSH-----NRLNRMNQAAKQFL-----MQTVENGSWVGMVRFNSTATIL 356 Query: 234 FLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L + T+ G+K A+ I + + + Sbjct: 357 NKLIQIISSNERNTLLEKLPTRAQGGTSICSGIKSAFQVI---GELYSQLDGSE------ 407 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLV 348 IV +TDGE+ + C +E K+ GAIV+ I + + + + S Sbjct: 408 IVLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGKDADKAVIEMSNITGGSHLYA 460 Query: 349 ENPHS---MYDAFSH 360 + + DAF Sbjct: 461 SDEAENNGLIDAFGA 475 >gi|296446540|ref|ZP_06888482.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296255894|gb|EFH02979.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 333 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 58/180 (32%), Gaps = 35/180 (19%) Query: 173 DMMIVLDVSRSMESFFDSSI------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++ ++D S SM F +K A + + ++ + G+V F Sbjct: 84 DVVFLIDRSGSMNETFAGRTPSGSEESKASAARRLLQGFVDR-------RGHDRVGVVGF 136 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S + + I + + G+ TN GL A + Sbjct: 137 STAPMLLMPMSDHREAIAAAIDAVDRPGLDYTNIGRGLAMA----------LALIGSGAP 186 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 + + IV ++DG + Q L E KK +Y + FLR S + Sbjct: 187 DRSRAIVLVSDGAGVIDPRIQDDLR--AEMKKANVNLYWL---------FLRTAGSAGIY 235 >gi|228982639|ref|ZP_04142898.1| hypothetical protein bthur0002_57720 [Bacillus thuringiensis Bt407] gi|228776822|gb|EEM25130.1| hypothetical protein bthur0002_57720 [Bacillus thuringiensis Bt407] Length = 174 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 64/167 (38%), Gaps = 4/167 (2%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT 63 LN I+ F N KG TI AI + + +G I++S I + S + V Sbjct: 3 LNKKIKTFLENKKGVATIEIAICAMLFIIAIGAFIDLSAILSKINSISSTN--AYVSRVV 60 Query: 64 QIMNEGNGNNRKKLKGGDILCRI--KNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + KG I + +N N R+ L+++ + ID ++S+++ + Sbjct: 61 GAQGGVKTRTPENFKGEYIHSKELYQNVKNSLERSGLKESDWSMYIDGRRLTSSINVPLK 120 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 +I + K L TN++ + SS KV + Sbjct: 121 DYGNEITIKLKTNLKWDLLSNFIPGDLTNNQTSERTVVSSFKVRTGD 167 >gi|3236372|gb|AAC23668.1| type VI collagen alpha 3 subunit N7 domain [Mus musculus] Length = 200 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 67/185 (36%), Gaps = 21/185 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S ++ + + ++ + + D ++ GLV FS+ Sbjct: 7 DIIFLLDGSDNVGKNNFPYV------RDFVTNLVNSLDVGSDN---IRVGLVQFSDTPVT 57 Query: 233 FFLLEW--GVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 F L+ S L ++ L K G N+ L Y I + A+ + Sbjct: 58 EFSLDTYQTKSELLAHLRRLQLKGGSGLNAGSALSY----IHANHFTEAGGSRTRAHVPQ 113 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +++ + G + L + + G + +G E +P+ Y+++ Sbjct: 114 LLLLLMAG-----PSEDAYLQAADAMVRSGVLTLCVGTTRADKAESEHIAFNPSLVYVMD 168 Query: 350 NPHSM 354 + S+ Sbjct: 169 DFRSL 173 >gi|150260369|ref|ZP_01917097.1| hypothetical protein YPE_2670 [Yersinia pestis CA88-4125] gi|218927792|ref|YP_002345667.1| hypothetical protein YPO0595 [Yersinia pestis CO92] gi|229837271|ref|ZP_04457434.1| hypothetical protein YPS_1185 [Yersinia pestis Pestoides A] gi|229840485|ref|ZP_04460644.1| hypothetical protein YPH_2827 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842971|ref|ZP_04463122.1| hypothetical protein YPF_1319 [Yersinia pestis biovar Orientalis str. India 195] gi|229900821|ref|ZP_04515945.1| hypothetical protein YP516_0480 [Yersinia pestis Nepal516] gi|115346403|emb|CAL19275.1| conserved hypothetical protein [Yersinia pestis CO92] gi|149289777|gb|EDM39854.1| hypothetical protein YPE_2670 [Yersinia pestis CA88-4125] gi|229682160|gb|EEO78252.1| hypothetical protein YP516_0480 [Yersinia pestis Nepal516] gi|229690037|gb|EEO82095.1| hypothetical protein YPF_1319 [Yersinia pestis biovar Orientalis str. India 195] gi|229696851|gb|EEO86898.1| hypothetical protein YPH_2827 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705394|gb|EEO91404.1| hypothetical protein YPS_1185 [Yersinia pestis Pestoides A] gi|320016845|gb|ADW00417.1| hypothetical protein YPC_3985 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 212 Score = 44.4 bits (103), Expect = 0.034, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 59/172 (34%), Gaps = 14/172 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I I AM+ ++ P V ++T+ N+ Sbjct: 3 RLPVYLLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E+ L + + T + L+ I + Q + + + Sbjct: 57 REYIPLTALENF--QFTDITVPSAGGTFTGAALEC---LIHCVDRDIQRSDGDQKGDWRP 111 Query: 291 IVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +VF MTD +++ K+ + A + HE L+ S Sbjct: 112 LVFLMTDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAKHEHLKQLTS 161 >gi|323693762|ref|ZP_08107958.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] gi|323502183|gb|EGB18049.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] Length = 1560 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/286 (10%), Positives = 79/286 (27%), Gaps = 68/286 (23%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLI 212 + + +T + D++ V+D S SM +D + + ++++ + ++K+ Sbjct: 585 VTLEVTGDSIQTTIGGGTADIVFVIDKSSSMNQWDYDLGGNRWTILKETVDRFINKLKIT 644 Query: 213 PDVN-------------NVVQSGLV-------------------TFSNKIEEFFLLEWGV 240 + N ++G +S K + Sbjct: 645 SPNSKISFIEYQSSDLTNYTRTGTYDEIRNVAMANTENADKNLEGYSLKYFRTLQRWTAI 704 Query: 241 SHL-QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE- 298 S + + T+S G A + ++ + K ++++ DG Sbjct: 705 SEVGSKPYGSAPGGNQGTHSAGGYLGAERALDRLKKYNPEEYNSNV---KYVIYLADGTA 761 Query: 299 -----------------NLSTKEDQQSLYYCNEAKKRGAIVYAIGI-------------R 328 N + + + + K A +Y + Sbjct: 762 GFYVDSYGYRDGAGSGGNTNARRAAITQSGELKKKHPDATIYTVAFGSDSSANMNWMKPG 821 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + +FY N + F + + + + + K Sbjct: 822 AYNGNITNPYNPNVTAFYSATNTEELEKTFDSLAEQVGSSAVTNPK 867 >gi|308473880|ref|XP_003099163.1| CRE-CLEC-65 protein [Caenorhabditis remanei] gi|308267636|gb|EFP11589.1| CRE-CLEC-65 protein [Caenorhabditis remanei] Length = 376 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 75/202 (37%), Gaps = 28/202 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE---EVKLIPDVNNVVQSGLVTFS- 227 LD++I++D + SI +++ ++I+ + ++ + G +T++ Sbjct: 36 LDVVIIVD---------NCSIMNLNLVYETISGLFNKNLQIGTGYTDPRSTRVGFITYNY 86 Query: 228 ---NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + L W + L +I+ L ++ + + A ++ M Sbjct: 87 NATDVADFYKLQSW--ADLDSQIQRLKMTPLARTTLSKMDTA---LYAAINMINSTAGFR 141 Query: 285 ANYKKIIVFMTD--GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACAS 341 NYKK+++ T G S K RG V + + +L+ AS Sbjct: 142 DNYKKMVIVFTSVHGSYQKNPPKDVSKIL----KARGIPVVTVNTGSSSDTQSYLKNIAS 197 Query: 342 PNSFYLVENPHSMYDAFSHIGK 363 N Y + + ++ + + Sbjct: 198 DNMAYAMADGNATQEILKAMTD 219 >gi|90409149|ref|ZP_01217268.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] gi|90309757|gb|EAS37923.1| hypothetical protein PCNPT3_10636 [Psychromonas sp. CNPT3] Length = 226 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 70/178 (39%), Gaps = 26/178 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES----FFDSSITKID 196 F + W + PI + +Q +R +M++ LD+S SM S+T++ Sbjct: 56 FILLLAWTLVIVALAKPIYYGEPIRAQQQSR-NMILSLDLSGSMNEVDMRLDGQSVTRLS 114 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + + + + GL+ F++ L + + + ++I+ ++ G+ Sbjct: 115 LVKSLLKKFVAT-------RQGDRLGLILFADHAYLQTPLTFDLKTIAQRIEE-TQIGL- 165 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 G++ A + + R N +I++ +TDG N + + + C + Sbjct: 166 ----VGIRTAIGESIAIAIKRFVKNKNKQ---RILILLTDGSNTAGR-----IKPCKQ 211 >gi|22127455|ref|NP_670878.1| hypothetical protein y3581 [Yersinia pestis KIM 10] gi|45442682|ref|NP_994221.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51597754|ref|YP_071945.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108809183|ref|YP_653099.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|108810630|ref|YP_646397.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|145597696|ref|YP_001161772.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|150260367|ref|ZP_01917095.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|153949002|ref|YP_001399501.1| tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162420775|ref|YP_001605636.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165928282|ref|ZP_02224114.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165937805|ref|ZP_02226366.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|166008850|ref|ZP_02229748.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211902|ref|ZP_02237937.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398450|ref|ZP_02303974.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422005|ref|ZP_02313758.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426387|ref|ZP_02318140.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467477|ref|ZP_02332181.1| putative tellurium resistance protein [Yersinia pestis FV-1] gi|170022821|ref|YP_001719326.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896945|ref|YP_001874057.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|218927794|ref|YP_002345669.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|229837273|ref|ZP_04457436.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|229840487|ref|ZP_04460646.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842969|ref|ZP_04463120.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229900823|ref|ZP_04515947.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|294502682|ref|YP_003566744.1| putative tellurium resistance protein [Yersinia pestis Z176003] gi|21960549|gb|AAM87129.1|AE013961_1 hypothetical [Yersinia pestis KIM 10] gi|45437548|gb|AAS63098.1| putative tellurium resistance protein [Yersinia pestis biovar Microtus str. 91001] gi|51591036|emb|CAH22700.1| Putative tellurium resistance protein [Yersinia pseudotuberculosis IP 32953] gi|108774278|gb|ABG16797.1| tellurium resistance protein [Yersinia pestis Nepal516] gi|108781096|gb|ABG15154.1| putative tellurium resistance protein [Yersinia pestis Antiqua] gi|115346405|emb|CAL19277.1| putative tellurium resistance protein [Yersinia pestis CO92] gi|145209392|gb|ABP38799.1| tellurium resistance protein [Yersinia pestis Pestoides F] gi|149289775|gb|EDM39852.1| putative tellurium resistance protein [Yersinia pestis CA88-4125] gi|152960497|gb|ABS47958.1| putative tellurium resistance protein [Yersinia pseudotuberculosis IP 31758] gi|162353590|gb|ABX87538.1| putative tellurium resistance protein [Yersinia pestis Angola] gi|165914217|gb|EDR32833.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. IP275] gi|165919724|gb|EDR37057.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165992189|gb|EDR44490.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206648|gb|EDR51128.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166960142|gb|EDR56163.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167050954|gb|EDR62362.1| putative tellurium resistance protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167054610|gb|EDR64417.1| putative tellurium resistance protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169749355|gb|ACA66873.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699971|gb|ACC90600.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|229682162|gb|EEO78254.1| putative tellurium resistance protein [Yersinia pestis Nepal516] gi|229690035|gb|EEO82093.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. India 195] gi|229696853|gb|EEO86900.1| putative tellurium resistance protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705396|gb|EEO91406.1| putative tellurium resistance protein [Yersinia pestis Pestoides A] gi|262360712|gb|ACY57433.1| putative tellurium resistance protein [Yersinia pestis D106004] gi|262364659|gb|ACY61216.1| putative tellurium resistance protein [Yersinia pestis D182038] gi|294353141|gb|ADE63482.1| putative tellurium resistance protein [Yersinia pestis Z176003] Length = 212 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 41/132 (31%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L ++ P ++TF + Sbjct: 3 RLPVYLLLDTSGSMTG------EPIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L L K+ L G +T L I + D Sbjct: 57 RQAVPL---TDLLNFKLPELVANG-TTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVF 112 Query: 291 IVFMTDGENLST 302 I MTDG Sbjct: 113 I--MTDGSPTDD 122 >gi|297527215|ref|YP_003669239.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256131|gb|ADI32340.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 190 Score = 44.0 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 18/152 (11%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ ++D S SM F D KI + + L + D V+ G V F +K Sbjct: 5 IVFIIDTSYSMSRKFNDFVPNKIRAVKEVLAYALTRL---FDKYKDVRVGAVVFFSKAYP 61 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + I+ L G + GL A + G+++ + Sbjct: 62 ILSLTISREGVLMSIRELEILGEGSAPGDGLIEAVKMMRRKNGLKE------------TI 109 Query: 293 FMTDGE-NLSTKEDQQSLYYCNE-AKKRGAIV 322 +TDG N + ++Y N K + Sbjct: 110 MITDGGFNEGIPLNIAAIYAANSGVKTSFIAI 141 >gi|308472979|ref|XP_003098716.1| hypothetical protein CRE_04176 [Caenorhabditis remanei] gi|308268316|gb|EFP12269.1| hypothetical protein CRE_04176 [Caenorhabditis remanei] Length = 356 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 61/177 (34%), Gaps = 16/177 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SR M + +I I S+ + L + GLVT+++ Sbjct: 37 LDVIAVVDNSRGMT---VDGLNEIASNIASVFGFGTRIGLNASEPRTTRLGLVTYNSVAT 93 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L S + ++T ++ + Q N+ A+Y+K++ Sbjct: 94 QKADLNQYQSIGDVFHGIFYALSNTVDTTESYLATGLELAEKMFNDQSVNSIRAHYQKVV 153 Query: 292 V-----FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASP 342 + + T GE + K G + + + L ASP Sbjct: 154 IVYAATYQTKGEMDPES-------IADRLKMSGVKIITVAYGDAYGLMKSLSVIASP 203 >gi|223938464|ref|ZP_03630357.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223892883|gb|EEF59351.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 723 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 51/134 (38%), Gaps = 30/134 (22%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A ++M+ VLD S SM S I A +I L++++ ++ FS Sbjct: 395 RAPMEMVFVLDCSGSM------SGEPIAQAKAAIRHALKQLQPGDSFQ------IINFSE 442 Query: 229 KIEE--FFLLEWGVSHLQR---KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + LE ++++ ++ L+ G + G+K A + Sbjct: 443 HASQLGAKPLEATPENIRKGLAYVEALNSDGPTEMI-EGIKAALD------------FPH 489 Query: 284 DANYKKIIVFMTDG 297 D + + F+TDG Sbjct: 490 DPERLRFVCFLTDG 503 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 51/335 (15%), Positives = 108/335 (32%), Gaps = 52/335 (15%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 V+ I++E G + + + + W + + EL+ D Sbjct: 74 VYTYEAIVDEAGGLYQSPDARENNYLKKHDIWKENVQRELKKIEPALSEDASEEEI---- 129 Query: 119 VVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 Q+ + I+ Y P + + + P T + + +++ IV Sbjct: 130 ----QHLFKQLLYIAGYDYTPFETIDRFSYVIFKNDMENPFTHE---KIEENMNVNVEIV 182 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 LD S SM T +++A +SI +L E+ N V G+ F +K + + Sbjct: 183 LDASGSMVKKI-GDKTMMEIAKESIKQVLSEM----PANAKV--GIRVFGHKGDNTASKK 235 Query: 238 W---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 V +++ ++ + G T+ ++Y + + G + Sbjct: 236 DESCGANELIYPIGDLNVEGIEKALEPIQPTGW-TSIAKSIEYGVEDLKALDGEKTLN-- 292 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGIRV-IRSHEFLR-- 337 I+ +TDG + + K ++ IG V + L+ Sbjct: 293 -------ILYIITDG---IETCGGNPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQI 342 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 A A+ + V + + I + + W Sbjct: 343 ADAAGGYYSSVNDADKLTGELYRINELAFSDYKWE 377 >gi|114594054|ref|XP_001145008.1| PREDICTED: hypothetical protein isoform 3 [Pan troglodytes] Length = 411 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 2 RLSFIVFSSQASIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 61 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 62 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 109 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 110 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 143 >gi|195398375|ref|XP_002057797.1| GJ17905 [Drosophila virilis] gi|194141451|gb|EDW57870.1| GJ17905 [Drosophila virilis] Length = 411 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 24/188 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-----NVVQSG 222 + D+MI+LD S M ++A+ + +L+ + VN +V++ Sbjct: 151 ASSPKDIMILLDASSGMSE------KSFELAMATAFNILDTLGEDDFVNLITFSEMVKTP 204 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + F +++ V ++ +K + + N T GL+YA++ + Sbjct: 205 VPCFKDRMVRATP--DNVQEIKSAVKAVK-LQDTANFTAGLEYAFSLLHKYNQSGAGSQC 261 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 A I+ +T+ + S KE + + + ++ I L A Sbjct: 262 NQA-----IMLITESTSESHKEIIKQYNWPHMP----VRIFTYLIGSDSGSRSNLHEMAC 312 Query: 342 PNSFYLVE 349 N + V+ Sbjct: 313 SNKGFFVQ 320 >gi|171911164|ref|ZP_02926634.1| hypothetical protein VspiD_08320 [Verrucomicrobium spinosum DSM 4136] Length = 854 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 48/341 (14%), Positives = 108/341 (31%), Gaps = 39/341 (11%) Query: 40 VSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 V+ F + L + D + A E G R L + + R + Sbjct: 247 VAGAFHYEARLMAADDAVAGNNAASQWVEVQGGPRVVL----ATAYENDPLAQALRAQGF 302 Query: 100 DNGFVNDID--DIVRSTSLDIVVVPQNEGYSISA--ISRYKIPLKFCTFIPWYTNSRH-- 153 + + D+ ++ T +VV+ Y + + +H Sbjct: 303 EVEAITDLGLLNVGTLTGTKVVVLNNVPAYRLDPRFTKALDFFVNHQGGGLAMVGGKHSF 362 Query: 154 -----IVMPITSSVKVNSQTDA-----RLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 P+ + V+ + + M IV+D S SM + T + + Sbjct: 363 AAGGYFGSPVEPLLPVSMELKQEHRKLAVAMAIVMDRSGSMS--MTAPGTSLVKMQLANE 420 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++L+ D + + + +++ + LQ ++ + G GL Sbjct: 421 GAARGIELLGDSDMACVYAVDSEPHEVSPLVAVGSNRGTLQNAVRRVESTGGGIYVYQGL 480 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 K A+ ++ + ++H I+ D + + ++L + K V Sbjct: 481 KRAWAELEKAKVGQRH-----------IILFADAADAEEPGEYKALL--EKMSKEKGTVS 527 Query: 324 AIGIRVIRSH--EFLRACASPNS--FYLVENPHSMYDAFSH 360 IG+ + +FL+ A + + +P + F+ Sbjct: 528 VIGLGTEKDSDADFLKDVALRGNGRIFFNSDPKELPALFAQ 568 >gi|87308731|ref|ZP_01090870.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] gi|87288442|gb|EAQ80337.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] Length = 625 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 69/211 (32%), Gaps = 14/211 (6%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 F+ NL R KG +L A+ + ++F + I+ I +T + + +D + + A Sbjct: 4 FACNLTNRR--SARKGIFVVLAAVVMVVLFGFISFGIDTGLISLEQTRMQNAVDAAALAA 61 Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV------RSTS 115 + +I + I + + ++N DIV S Sbjct: 62 SQEITSAVAQAGDSGGDPNSISISFAKQMAVDVAAA--NGVYLNADRDIVFGKRTYDPGS 119 Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS--SVKVNSQTDARLD 173 + Y++ + ++ + + + S V + D Sbjct: 120 GEWAYDWTTGPYNVVKVEAHRDQPNLEAPDGRVPLAFGWAVGVPSIPLVTSATSFVEARD 179 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 M++VLD S SM DS I+ Sbjct: 180 MVVVLDFSGSMND--DSQFKAINRLGNDAVT 208 >gi|311105412|ref|YP_003978265.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] gi|310760101|gb|ADP15550.1| hemolysin-type calcium-binding repeat family protein 2 [Achromobacter xylosoxidans A8] Length = 1396 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 66/188 (35%), Gaps = 19/188 (10%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAML 206 + +++ + N ++ ++LD+S SM ++ T +D A K++ +L Sbjct: 742 SGNDVLLGDQGGMAKNVIAGTSYNIALLLDLSYSMGWANSANPDGDTALDAAKKALKHLL 801 Query: 207 EEVKLIPDVNNVVQSGLVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNS--T 260 E + + L+TF+++ + L ++ + L N+ Sbjct: 802 ES--QLATHEGTINVSLITFNDEDIRVQKSISDLT--PDNVDEIVSSLLHLETGPNTPYG 857 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 L + H N YK + F+TDGE K + A+ Sbjct: 858 AALHETKKWFDGQPTVDDHGN----PYKNLTYFLTDGEPTPEKNYDADAEFNKLAEISDV 913 Query: 321 IVYAIGIR 328 AIG+ Sbjct: 914 H--AIGVG 919 >gi|17537919|ref|NP_496258.1| C-type LECtin family member (clec-59) [Caenorhabditis elegans] gi|3881709|emb|CAA88984.1| C. elegans protein ZK666.5, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 396 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 61/187 (32%), Gaps = 11/187 (5%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++ T + ++ LD++ V+D S M +T + I SI + ++ Sbjct: 15 YLSSSYTDRICGQDLSNLWLDVVAVVDNSAGMTK---GGLTSVAANIASIFSKNTQIGTN 71 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVS--HLQRKIK-YLSKFGVSTNSTPGLKYAYNQ 269 P + LVT++ L S + I L+ S S +Q Sbjct: 72 PTSPKTTRLALVTYNVDATTAADLNKFQSIDDIYSGINSALATISSSEESYLA--RGLSQ 129 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + +H A+Y+K+++ D + K G + + Sbjct: 130 AEKVFQAGKHGF-NRAHYQKVVIVY--ASTYKGSGDLNPVPVAQRLKTSGVTIITVAYDQ 186 Query: 330 IRSHEFL 336 + + L Sbjct: 187 NKDGDIL 193 >gi|83594481|ref|YP_428233.1| serine/threonine protein kinase [Rhodospirillum rubrum ATCC 11170] gi|83577395|gb|ABC23946.1| serine/threonine protein kinase [Rhodospirillum rubrum ATCC 11170] Length = 665 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 57/158 (36%), Gaps = 20/158 (12%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P+ + V + D R ++ V+D S SM+ + D + +D +K + + Sbjct: 216 PLKRRMNVAAMADFRAGVVFVIDASSSMDPYIDKTRQVMDDVLKRVQ--------AEGLA 267 Query: 217 NVVQSGLVTFSNKIEE---------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + V+ GLV F + ++ F + + K ++ + ST A Sbjct: 268 DKVRFGLVAFRDDPKKVKGVEYLTRTFADPNSTADAKAFSKAIAPLKATPISTRAF--AE 325 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + + ++ +TD + + + Sbjct: 326 DSFAAIDTALNTIDWSGFAA-RYLILVTDASSRGARAE 362 >gi|54293612|ref|YP_126027.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] gi|53753444|emb|CAH14899.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] Length = 1169 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 59/214 (27%), Gaps = 64/214 (29%) Query: 218 VVQSGLVTFSNKIEEFFLLEWG---------VSHLQRKIKY----LSKFGVSTNSTPGLK 264 VQ G +SN+ + + + I L ST +T Sbjct: 315 YVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVNNAINAFLPHLKPETNSTATTEIKA 374 Query: 265 YAYNQIFDMQGMRQ-------HCNTEDANYKKIIVFMTDG-------------------- 297 A R + + K+ I+ ++DG Sbjct: 375 AAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIILISDGLPTQDLQSRYWPPLGSAAAT 434 Query: 298 --------------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-------IRSHEFL 336 N +++ ++ K G +++ IG+ + L Sbjct: 435 GYGVTATFNADGSLNNTNSQALSDAINEIKALKNDGVLIFIIGMGAGVDPAVNPEAAATL 494 Query: 337 RACA---SPNSFYLVENPHSMYDAFSHIGKDIVT 367 RA A ++Y +P ++ + + I +I Sbjct: 495 RAMAVAGGTENYYPATSPEALVSSLNSILSNIQN 528 >gi|84498148|ref|ZP_00996945.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381648|gb|EAP97531.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 533 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 72/203 (35%), Gaps = 29/203 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +V+ + + + +V+D S SM+ ++ + S+ + E ++ Sbjct: 176 EVDVREWQPVALTMVVDTSGSMD-----IRERLGLVKSSLALLAENLRPDDT------IA 224 Query: 223 LVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +VT+ + + E + + I L G STN GL Y+Q R+ Sbjct: 225 IVTYQTDATPLLEPTPVRD-TDTILAAIDRLEA-GGSTNLEAGLLLGYDQ------AREA 276 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-- 336 N ++ +DG N+ + + + +RG + +G + + L Sbjct: 277 YKQGATNV---VLLASDGVANVGVTDGGRLATAIRDNGRRGIHLVTVGYGMGNYSDHLME 333 Query: 337 RACASPNSFYL-VENPHSMYDAF 358 + + FY ++ F Sbjct: 334 QLADQGDGFYEYIDTFEEARKLF 356 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 55/176 (31%), Gaps = 13/176 (7%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R + G +T+L ++ LPI + + I++S + +K L D + + A + + Sbjct: 1 MRRLLRDRSGQITVLASLLLPIGLGIAALAIDLSTLQLVKHRLKVAADAASLAAVAVLPD 60 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 +R D T +++R + + P N Sbjct: 61 TTTALDRALSIAADNAGTGAGTVTA--ASDVRFGSYNSAAKSFTPGA------TPANA-V 111 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 ++A + +++ S +VLD S S Sbjct: 112 QVTASRNQAHGNPVVLAFAKALGWSTPDISVSAVAVRFS----PAYCFLVLDPSAS 163 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNNVV 219 V + T +++VLD S SM + + I K++ A + ++++ I + Sbjct: 515 VQASTGDECRVVLVLDTSGSMGT--SNRIDKVNSAATAFVNLVDDGISIGIVTFTGSPTT 572 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + L + + + L R I L+ G T GL+ + Sbjct: 573 RHALTQINTQADRDSL---------RDIFQLTASG-GTCIGCGLEQGLEVLMAHPSGSAD 622 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 IIV MTDG++ + + G V + I E Sbjct: 623 GG--------IIVLMTDGQDSGIQNHI----IRQTLQDMGVRVNTVAIGEDAYGEL 666 >gi|74146354|dbj|BAE28944.1| unnamed protein product [Mus musculus] Length = 452 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 54/132 (40%), Gaps = 22/132 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ +L++++ Q +V F++ + Sbjct: 315 ILFVIDVSGSMWGI------KMKQTVEAMKTILDDLRTDD------QFSVVDFNHNVRTW 362 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ +R I+ + G TN L A IF + + Sbjct: 363 RNDLVSATKTQIADAKRYIEKIQPSG-GTNINEALLRA---IFILNEASNMGLLNPDSVS 418 Query: 289 KIIVFMTDGENL 300 +I+ ++DG+ Sbjct: 419 -LIILVSDGDPT 429 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 9/50 (18%), Positives = 26/50 (52%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 G + I+ + L ++ +G+ +++ ++ K+ L + +D + AA + Sbjct: 19 GAVAIIVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVT 68 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 1/43 (2%) Query: 327 IRVIRSHEFLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTK 368 + L+ CAS S Y E ++ AF IG T+ Sbjct: 1 MAPEGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQ 43 >gi|2935363|gb|AAC05096.1| complement component BfB [Danio rerio] Length = 359 Score = 44.0 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 68/214 (31%), Gaps = 25/214 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQ 220 K+ +LD+ I +DVS S+ + A + I +LE++ + ++ Sbjct: 150 KIYLNKGGKLDIYIAVDVSDSIND--------LKKAKQIIKTLLEKISYYEVSPSYEILM 201 Query: 221 SGLVTFSNKIEEFFL---LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + F + +S + + + Y I D Sbjct: 202 FATDVYQIVKMRDFKTEEVAGSLSKVFEDLDKFDFDKKLDQKGSNIAKLYQTILDSMSNE 261 Query: 278 QHCNTED-ANYKKIIVFMTDGE-NLSTKEDQQSLYY------CNEAKKRGAIVYAIGIRV 329 Q N ED K +++ TDG+ N+ + + ++ +Y G+ Sbjct: 262 QIRNKEDFLQTKHVVIVFTDGQANMGGNPKPKVHLIRNLVLKNDANRENKLDLYVFGVGK 321 Query: 330 IRSHEFLRACASPN----SFYLVENPHSMYDAFS 359 E L S F+ +++ + F Sbjct: 322 DVRTEDLNGLVSEKENERHFFKLQDLDEVQKTFD 355 >gi|253996767|ref|YP_003048831.1| hypothetical protein Mmol_1398 [Methylotenera mobilis JLW8] gi|253983446|gb|ACT48304.1| conserved hypothetical protein [Methylotenera mobilis JLW8] Length = 463 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 61/166 (36%), Gaps = 14/166 (8%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 +G + ++ AI L ++ +LG+++++ H++ KT L + D S + A ++ + G Sbjct: 12 MQRQRGAVALIVAICLVLLVGMLGLVLDLGHLYVTKTELQNAADSSSLSGARELNGKVTG 71 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 I + + ++N +N V ++++ + P + + Sbjct: 72 --------------INSAITRAIEAAGKNNFNLNSTAVTVNASNMWVGSCPSDGCMVPIS 117 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 TF+ T R I + + T M + Sbjct: 118 SVTTDALAGDKTFLKVDTGLRTINTWFIQVLPGVANTTQTFGMAVA 163 >gi|149566038|ref|XP_001520798.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4 [Ornithorhynchus anatinus] Length = 808 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 56/137 (40%), Gaps = 23/137 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+DVS SM+ ++ +A ++ +L+ + N+ V + + Sbjct: 241 ATSPKDIVIVVDVSGSMKGL------QMTIAKHTVATILDTL----GENDFVNIIAYSDY 290 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + +E F H ++ + L GV + S L+ A+ + ++ Q Sbjct: 291 VHYLEPCFQGILVQADRDNREHFKQLLDELQAKGVGSVS-KALREAFTVLQQVRDAGQGA 349 Query: 281 NTEDANYKKIIVFMTDG 297 A I+ +TDG Sbjct: 350 LCNQA-----IMLITDG 361 >gi|332823606|ref|XP_003311226.1| PREDICTED: complement C2 isoform 3 [Pan troglodytes] Length = 538 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + L SH+ K G TN+ L Y + + + Sbjct: 85 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 144 Query: 279 HCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDG++ + + + + N+ + +YAIG+ + Sbjct: 145 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 204 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++++ +++ F H Sbjct: 205 DWRELNELGSKKDGERHAFILQDTKALHQVFEH 237 >gi|327272010|ref|XP_003220779.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Anolis carolinensis] Length = 919 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 91/272 (33%), Gaps = 47/272 (17%) Query: 113 STSLDIVVVPQNE-GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 T + Y + ++ F + Y ++ Sbjct: 241 PTCSSTTIDGNFVLTYDVKRDTKAGELEIFNGYFIHYFAPDNL-------------EPLP 287 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 +++ V+DVS SM K+ ++++ +L++++ Q ++ F++ Sbjct: 288 KNILFVIDVSGSMWGL------KMRQTVEAMKTILDDLRSND------QFSVLDFNHNVR 335 Query: 229 --KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + ++ I+ + G TN L A IF ++ D + Sbjct: 336 CWRDSLVQASNAQTEAAKKYIEGIHPNG-GTNINDALLRA---IFILKEASNMGM-LDPS 390 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPN 343 +IV ++DG+ + ++ KK ++ +GI ++FL+ A N Sbjct: 391 STSMIVLVSDGDPTVGELKLPTIQ--KNVKKNIQDDISLFCLGIGFDVDYDFLKRLAQEN 448 Query: 344 S------FYLVENPHSMYDAFSHIGKDIVTKR 369 + F E M F+ + ++ K Sbjct: 449 NGMAHRVFGNQETSSQMRKFFNQVSTPLLKKL 480 >gi|300772296|ref|ZP_07082166.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] gi|300760599|gb|EFK57425.1| tellurium resistance protein [Sphingobacterium spiritivorum ATCC 33861] Length = 212 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +VLD S SM S I+ + M+ ++ P ++TF + Sbjct: 3 RLPVYLVLDTSGSM------SGEPIEAVKNGVQVMISSLRQNPQAIETAFLSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L + I+ +T+ LK N I + + D Sbjct: 57 RQLIPLTDLGAFQMVDIRA----TGTTSLGEALKVVSNCIDNEVAKTTSESKGDWKPLVF 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I MTDG + Q L + K Y I + Sbjct: 113 I--MTDGIPTD--DWQSGLREFQKRKTA----YTIACAAGSGAD 148 >gi|295849303|ref|NP_001171534.1| complement C2 isoform 3 [Homo sapiens] gi|194390502|dbj|BAG60565.1| unnamed protein product [Homo sapiens] Length = 538 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + L SH+ K G TN+ L Y + + + Sbjct: 85 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 144 Query: 279 HCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDG++ + + + + N+ + +YAIG+ + Sbjct: 145 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 204 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++++ +++ F H Sbjct: 205 DWRELNELGSKKDGERHAFILQDTKALHQVFEH 237 >gi|123283203|emb|CAI17449.2| complement component 2 [Homo sapiens] gi|123857991|emb|CAM25861.1| complement component 2 [Homo sapiens] gi|168983783|emb|CAQ06834.1| complement component 2 [Homo sapiens] gi|168984349|emb|CAI41857.2| complement component 2 [Homo sapiens] gi|168984417|emb|CAQ09273.1| complement component 2 [Homo sapiens] gi|168985078|emb|CAQ07482.1| complement component 2 [Homo sapiens] gi|168985956|emb|CAQ07112.1| complement component 2 [Homo sapiens] Length = 525 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 52/153 (33%), Gaps = 16/153 (10%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + L SH+ K G TN+ L Y + + + Sbjct: 72 YSYDFPEDVAPALGTSFSHMLGATNPTQKTKDHENGTGTNTYAALNSVYLMMNNQMRLLG 131 Query: 279 HCNTEDANYKKIIVFMTDGENL------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDG++ + + + + N+ + +YAIG+ + Sbjct: 132 METMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 191 Query: 333 -----HEFLRACASPNSFYLVENPHSMYDAFSH 360 +E +++++ +++ F H Sbjct: 192 DWRELNELGSKKDGERHAFILQDTKALHQVFEH 224 >gi|308472817|ref|XP_003098635.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] gi|308268235|gb|EFP12188.1| hypothetical protein CRE_04180 [Caenorhabditis remanei] Length = 779 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 64/175 (36%), Gaps = 17/175 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----- 226 LD++ V+D S M + + I I ++ + + + GLVT+ Sbjct: 31 LDVVAVVDNSIGMT---NEGLANIAANICTVFSAGTRIGTQASEPQTTRVGLVTYNVNAQ 87 Query: 227 -SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + +F L+ +++ + +S S ST A +F+ + T + Sbjct: 88 QNADLNKFQSLDDLYNNVFADLSSVSTSAQSFLSTG--LAAAESLFEYENFG----TNRS 141 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +YKK+++ + + + L N K G + + L+ A Sbjct: 142 HYKKVVIVY--ASSYAAGGEMDPLPVANRLKTSGVNIITVAYDQTGDGLLLKQLA 194 >gi|307107471|gb|EFN55714.1| hypothetical protein CHLNCDRAFT_134017 [Chlorella variabilis] Length = 611 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 67/177 (37%), Gaps = 18/177 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITK-IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 L ++ ++D S S+ +T + A++ + +P + + V ++ FSN I Sbjct: 429 LCVVFLVDGSGSVTEEDFRVMTGFMLAAVRGLAGGAAGQGAVPPLRSKV--AVIQFSNDI 486 Query: 231 E-EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E ++ V + + +++ TN ++ A Q A + Sbjct: 487 RVEQGPVDVDVGAFEALMAGMARMNGGTNIALAVQKA----------GQLLKPLSATAHR 536 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACASPN 343 ++V +TDG S + +++ A V V R E LR CA+ + Sbjct: 537 VLVLLTDGRIDSHQS-REARDMAARLGDEQANVRVHAYGVGRGVDKQELLRICAARD 592 >gi|148657120|ref|YP_001277325.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569230|gb|ABQ91375.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 774 Score = 44.0 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 80/261 (30%), Gaps = 48/261 (18%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 N +++ + IP + + I P V++++ A Sbjct: 261 ANNTNSALTIATPIAIPASMSNPALMFWHRYDIASPDRGEVEISTNNGATWQ-------- 312 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN-----VVQSGLVTFSNKI-----E 231 ++ + + + EV + D + F + + Sbjct: 313 ------------RLRLFTGTATSWRREVIRLDDYRGQTIRMRFRLVTDAFGVRDGWYIDD 360 Query: 232 EFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 EW + Q I L+ G T+ GL+ + + +T + + + Sbjct: 361 VTIGPEWDDVRARAQAAIDTLNSRGA-TSIGGGLQSSQRML----------DTANPDLPR 409 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACA-SPNSFYL 347 +I+ ++DG+ + L + + V+ IG+ + L A + ++ Sbjct: 410 VIILLSDGQENTRPFVADVLP---QIRAAQTTVHTIGLGRDADQQLMLSIAAQTGGTYNY 466 Query: 348 VENPHSMYDAFSHIGKDIVTK 368 P + ++ I + + Sbjct: 467 APTPEQLSGIYNTISGAVSNR 487 Score = 36.3 bits (82), Expect = 7.9, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 41/129 (31%), Gaps = 17/129 (13%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL- 172 T D+ V + +I +I+ + LK + + I Sbjct: 93 TKQDVTVTIGGKPATIISITPLRYLLKVQAPPQPARGRYDLEVTIRGFTAKEPGAVTYFG 152 Query: 173 ----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ +D S SM K+ A ++ ++ ++ G+V F++ Sbjct: 153 VDNVSVILTIDRSGSMAGN------KMVAAREASRQFVDLMQAGDG------IGIVGFND 200 Query: 229 KIEEFFLLE 237 + L Sbjct: 201 GVVMPLNLT 209 >gi|209808861|ref|YP_002264399.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] gi|208010423|emb|CAQ80774.1| hemolysin-type calcium-binding protein [Aliivibrio salmonicida LFI1238] Length = 2890 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 46/282 (16%), Positives = 93/282 (32%), Gaps = 38/282 (13%) Query: 50 LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDID 108 + + A+ ++ + K+ G +I + + E + NG + Sbjct: 2278 IQETETSLDLSASVTDIDGSEYLDEIKISGLPDGAQIIDKNSNEPLGEFKTINGEQVWVI 2337 Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV-MPITSSVKVNSQ 167 DI ++ I Y + + + + P Sbjct: 2338 DITGQSTQTINYDNLVVRYPSTETLDIDVSAVAKESSNNSEATTTVNASPSQDIYADQGG 2397 Query: 168 TDARLDMMIVLDVSRSME--------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + +VLD S SM+ S D T++++ +++ AML+ VK + + + V Sbjct: 2398 ETPTTLISLVLDSSGSMDHKPFKSDNSNPDQDKTRMELVLEASIAMLDNVK-VQEGSEEV 2456 Query: 220 QSGLVTF-----SNKIEEFFLLEWGVSHLQRKIKYLSKF---------------GVSTNS 259 + LV F S++ ++ L W +Q I L+ T+ Sbjct: 2457 KVQLVDFDDQKHSSQDKDVESLGWFT--VQSAIDALNAALVDIAEKDKDEHFYPKGGTDY 2514 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 G+ + D Q T D ++ F++DG+N Sbjct: 2515 EEGIYAVMSGYQDTQITNITGETND-----VVYFLSDGDNNG 2551 >gi|261880543|ref|ZP_06006970.1| von Willebrand factor [Prevotella bergensis DSM 17361] gi|270332766|gb|EFA43552.1| von Willebrand factor [Prevotella bergensis DSM 17361] Length = 289 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 49/102 (48%), Gaps = 7/102 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ ++ + + A + ++ + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDFGTNTQMKRDMAAEIAATIAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAY 267 S++IE++ + G H+ I+ + F +N T G+ AY Sbjct: 126 SDQIEKYIPPKKGRKHILYIIREMLHFRPVSNKTDVGMATAY 167 >gi|121715878|ref|XP_001275548.1| von Willebrand domain protein [Aspergillus clavatus NRRL 1] gi|119403705|gb|EAW14122.1| von Willebrand domain protein [Aspergillus clavatus NRRL 1] Length = 859 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 59/177 (33%), Gaps = 29/177 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF---SNKI 230 M+ V+D S SM+ KI ++ L+ + L N +F +++ Sbjct: 284 MIFVIDRSGSMDD-------KIATLKTALKVFLKSLPLGVCFNICSFGSSYSFLWDRSRL 336 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L+ ++ ++ + T ++ A D + Sbjct: 337 YDAASLQDALN----FVETVDSDMGGTEMQGAVEAAV-----------GGRLPDKELE-- 379 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 ++ +TDG+ K + A A +++GI SH + A + + Sbjct: 380 VLILTDGQIWRQK--VLFEFVRRTAADDSARFFSLGIGDAASHSLIEGIARAGNGFS 434 >gi|154484357|ref|ZP_02026805.1| hypothetical protein EUBVEN_02070 [Eubacterium ventriosum ATCC 27560] gi|149734834|gb|EDM50751.1| hypothetical protein EUBVEN_02070 [Eubacterium ventriosum ATCC 27560] Length = 731 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 48/121 (39%), Gaps = 7/121 (5%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD+++VLD+S SM D + D +K++N + + + N + +V ++ K Sbjct: 113 PLDVVLVLDISGSMS--VDGDGKRADNLVKAVNKTITYLMKEDENN---RVAVVAYNTKA 167 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L G ++ K Y + + S Y ++ + + N K Sbjct: 168 TKILPL--GRYYVGSKADYSNTNSYFSVSKSSKNDGYKKLDIGKIKNESTNNLTDKASKS 225 Query: 291 I 291 I Sbjct: 226 I 226 >gi|39933646|ref|NP_945922.1| hypothetical protein RPA0569 [Rhodopseudomonas palustris CGA009] gi|39647492|emb|CAE26013.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 372 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 60/184 (32%), Gaps = 25/184 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S +Q +++ VLD + SM + + KI +I L + + ++ Sbjct: 20 SFATAAQARPSVEVAFVLDTTGSMSGLIEGAKRKIWSIATTI--------LDDNPDADIR 71 Query: 221 SGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDM 273 GLV + + + L + L ++ L G + L A N++ Sbjct: 72 MGLVAYRDIGDDYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKLHWR 131 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 QG + IVF+ Q + Y A R + ++ + Sbjct: 132 QGGD----------TRRIVFLVGDAPPHMDYAQDTKYPETLAVARQKDIIVNAVQAGDAR 181 Query: 334 EFLR 337 + R Sbjct: 182 DTAR 185 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 42/98 (42%), Gaps = 13/98 (13%) Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG----IRVIRSH 333 H N + KKI++ +TDG+ + + +A+K G I YAIG + + Sbjct: 5 HHKNGARKSAKKILIVITDGQKYKDPLEYSDVIP--QAEKAGIIRYAIGVGHAFQGPTAR 62 Query: 334 EFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + L +S + + V+N + A I K + K Sbjct: 63 QELNTISSAPPQDHVFKVDN----FAALGSIQKQLQEK 96 >gi|16550415|dbj|BAB70976.1| unnamed protein product [Homo sapiens] gi|119626255|gb|EAX05850.1| anthrax toxin receptor 2, isoform CRA_c [Homo sapiens] Length = 245 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR 277 + + FS++ L + + ++ L + T GLK A QI G++ Sbjct: 2 RLSFIVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKAGGLK 61 Query: 278 QHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 II+ +TDG + L ++ ++ GA VY +G+ + Sbjct: 62 TSS---------IIIALTDGKLDGLVPSYAEKEAKI---SRSLGASVYCVGVLDFEQAQL 109 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 R S + V+ + A I I+ + Sbjct: 110 ERIADSKEQVFPVKGG---FQALKGIINSILAQSCTE 143 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 23/221 (10%), Positives = 76/221 (34%), Gaps = 38/221 (17%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 N N G + + LP++ V+ + ++ S I ++ + + + + + + E Sbjct: 6 MNRLKNSCGAAAFIFILILPVLICVMALSLQASQILLAQSKITEASEVTSLA-LSALSEE 64 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 R + + L+ ++ DD+ ++ Q+ + Sbjct: 65 ----------------RAQQKLSSYATRVLKH--YLVGTDDVKGQATM------QSSTFQ 100 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + +F + + + S+ + + +D+ + D+S SM Sbjct: 101 FQTDLVGEATHEF--WFKHKPQADTFKVSGASTSRKH--KPQPMDVYFITDLSESMN--- 153 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 S +++ + +I ++ ++ ++ + ++ + Sbjct: 154 RSEPSRLTIVKDAIRQVVSKLPKGS------RAAFIGYNTE 188 >gi|297159760|gb|ADI09472.1| hypothetical protein SBI_06352 [Streptomyces bingchenggensis BCW-1] Length = 545 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 57/176 (32%), Gaps = 25/176 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 H ++ + + + A +IV+D S SM TKI A K+ A + ++ Sbjct: 61 HAIVRVEARGLGPAAARAAASEVIVIDCSGSMTW----PPTKIMAARKATVAAVSALRKG 116 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY---NQ 269 + +V + + E + G++ K S +K A Sbjct: 117 ------TRFAVVQGTERAEVVYPPTGGMAVAGPDTKA-------AASHAAMKLAALGGTA 163 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ-----QSLYYCNEAKKRGA 320 I + + + E + + +TDG+N + C RG Sbjct: 164 IGTWLDLARRLHAEQPTALRHTLLLTDGKNEHEDPGDLARVLDACAPCFTCDARGI 219 >gi|259156141|gb|ACV96089.1| von Willebrand factor, type A [Providencia alcalifaciens Ban1] Length = 551 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 26/167 (15%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SN 228 + +++D+S SM + + + +A ++ A++ ++ IP V V++ Sbjct: 379 VHLLVDISGSMGKPIGEGNRKYFHVANEAALALVMALEGIPGV-----VPAVSYFPGIHQ 433 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ L + V H + T + +A N + + R Sbjct: 434 EVSVALLPKQSVRHRAAYFDQKPR--GCTPMAQAMWFAANSLLAQKQKR----------- 480 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG+ + C ++ G + IGI+ F Sbjct: 481 KLMIVLTDGDPDDWAATHDIIDRC---RRSGFELLGIGIQTRSVERF 524 >gi|284799403|ref|ZP_06390123.1| PilC protein [Neisseria subflava NJ9703] gi|284797751|gb|EFC53098.1| PilC protein [Neisseria subflava NJ9703] Length = 1126 Score = 44.0 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 61/191 (31%), Gaps = 6/191 (3%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + P I A + L + + + V+ Q + ++M + Sbjct: 7 MQNAPSGRYRLIHASVASALALISISVQANTQQFAQTPFYLQNKTDVSGQPKVKHNIMFL 66 Query: 178 LDVSRSMESFFDSSITKIDMAIK--SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +D S SM+ T + + L+ V Q GL T N Sbjct: 67 IDDSGSMQWNVQGKETSVRADKRITITKEALKSVLKEYGEKQRFQWGLQTLHNNGRTDTP 126 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 E G + + ++ ++TP + Y + + N + K ++ M+ Sbjct: 127 DEGGFTDDWKDVQRRVDGIDPGHATPITRRYYEVVKNF----VMPNIKYRCQKSYVIVMS 182 Query: 296 DGENLSTKEDQ 306 DG+ + +Q Sbjct: 183 DGDANMSCSNQ 193 >gi|291223817|ref|XP_002731904.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 978 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 65/195 (33%), Gaps = 29/195 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 +++VLDVS SME + + + + I ++ + G+V F E Sbjct: 328 VVLVLDVSGSMEGKPLTQL--LQVCTNYIRNTID---------DGSYLGVVKFEGIAETL 376 Query: 233 -FFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +L G + I L + T+ G+ + + + Sbjct: 377 VDLMLIDGPQIRETIIANLPTYAGGRTSIGGGVLKGIEVLSNFG--------NEDTTGGY 428 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLV 348 I+ ++DGE S + + G ++ + + SF+ Sbjct: 429 IILVSDGEETSEPIIDD---IWGDIELAGVVIDTVAFSDKADKKLESLATKTGGLSFFFS 485 Query: 349 --ENPHSMYDAFSHI 361 ++ ++YDAF+ Sbjct: 486 GNDDTTALYDAFTST 500 >gi|315049865|ref|XP_003174307.1| von Willebrand domain-containing protein [Arthroderma gypseum CBS 118893] gi|311342274|gb|EFR01477.1| von Willebrand domain-containing protein [Arthroderma gypseum CBS 118893] Length = 943 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 50/300 (16%), Positives = 95/300 (31%), Gaps = 54/300 (18%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + + +I+N + + I+ R + + FV D + ++ Sbjct: 225 LSISVEILNADKIDWIESYTHDIIVKR-----RVKEKAFQSWADFVADSPSHRSPSVANV 279 Query: 119 VVVPQNEGYS------ISAISRYKI--PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD- 169 + + IS IS K+ P P ++ I + + Q Sbjct: 280 RLPSGHTSLKWDFVLRISTISSAKLFKPRASLEIHPLIEGHSATMIEIPAEFMLQRQASV 339 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +++ ++D S SM S KI I ++ L + L N + +F + Sbjct: 340 NNKEIIFLVDRSGSMSS-------KIPGLISAMQIYLRSLPLSIPFN------ICSFGS- 385 Query: 230 IEEFFLLEWGVSHLQR---------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 F L W S + S T+ P L+ A Q Sbjct: 386 ---TFELLWKESKAYSDTTLDEALTHVSKFSASLGGTDLLPALERAVMQ----------- 431 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ +TDGE +E + + + K+ +++GI SHE + A Sbjct: 432 ---QQQSFLDVIVLTDGEVWRLEETLRFVKLTHTVSKKAVRFFSLGIGHAVSHELVEGIA 488 >gi|227827418|ref|YP_002829197.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584633|ref|YP_002843134.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619574|ref|YP_002914399.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459213|gb|ACP37899.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019682|gb|ACP55089.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380643|gb|ACR41731.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 356 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 72/207 (34%), Gaps = 34/207 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V + S + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFVIYITPQQSSVTSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLL------YN 69 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +N L+ FSN E Q K + F V T L A N ++ Sbjct: 70 LNEGNYVTLILFSNHPEI---------KYQGPAKGIITFDVGKGYTTRLHEAVNFTINL- 119 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 120 ------AKQSQVPTK-IIMLTDGKPTDKR-NVKDYEKLDIPPNT--QIITIGIGNDYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFS 359 L+ A S FY +++ + + F Sbjct: 170 ILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 44.0 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 26/257 (10%), Positives = 65/257 (25%), Gaps = 18/257 (7%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ + + G +++ + + PQ Sbjct: 3 RLKHRSAQSGASDHAGRKGARPLRSVLASLCAVAMSLGMASASVAAFADDRQPAATADPQ 62 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSR 182 S P + + M +T +++ LD+ +VLDVS Sbjct: 63 AATASA---GNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSG 119 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM + ++ + S+N + V+ +++ G + Sbjct: 120 SMNE-LSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAGQKY 178 Query: 243 -------------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + +++ LK A D + + + Sbjct: 179 TVTYNWIGGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQ 238 Query: 290 IIVFMTDGENLSTKEDQ 306 + + G+N + Sbjct: 239 VALIKYAGKNSDKIGND 255 >gi|309790222|ref|ZP_07684794.1| hypothetical protein OSCT_0745 [Oscillochloris trichoides DG6] gi|308227807|gb|EFO81463.1| hypothetical protein OSCT_0745 [Oscillochloris trichoides DG6] Length = 472 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 58/172 (33%), Gaps = 20/172 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + RLD++ +LD + SM +I I SI ++ P + Sbjct: 242 GLRGAQIDSTLRLDLLFMLDTTGSMGDELY----RIQETIDSIAQRIDAFNPRPQI---- 293 Query: 220 QSGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GLV + + + ++ + ++ LS G ++ + A Sbjct: 294 RYGLVAYKDEGDDYVTRPVAFTTDLAAFRAELNALSAQGGG-DTPEAVDAALENSILKME 352 Query: 276 MRQHCNTEDANYKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 D +++ + D G ++ + L EA RG +Y I Sbjct: 353 WS------DTPAVRLVFLVADAGPHIFPQIQFTYLDGAREAVARGVKIYPIA 398 >gi|260834995|ref|XP_002612495.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] gi|229297872|gb|EEN68504.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] Length = 443 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 66/220 (30%), Gaps = 33/220 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQSGLVTFSNK 229 LD ++ LD S SM + K + L V+ + + +V F Sbjct: 2 PLDTVLCLDTSGSMNGRG------MAELKKGVRHFLLGVQETANKMSLRENVAVVEFGGG 55 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + L G +T GL A +I G+ + Sbjct: 56 ARIIQPLSGNYGTVMQSVDNLKA-GGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPR- 113 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRG---------AIVYAIGIRVIRSHEFLRACA 340 ++ MTDG + + L G + +G + L+A A Sbjct: 114 -VILMTDG---YPDDKENVLKAALSFGPAGWQAVGLPHPIPIACVGCGDDVDKDLLQAIA 169 Query: 341 S-PNSFYLVENPHSMYDAFS----------HIGKDIVTKR 369 N Y++ + + + F G D+ R Sbjct: 170 KLTNGMYILGDVSQLSEFFRRQVLLIRFAAQFGHDMQKLR 209 >gi|2326547|emb|CAA04501.1| Matrilin-3 [Homo sapiens] Length = 303 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 12/84 (14%) Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS-- 341 +N K+ + +TDG Q A+ G +YA+G+ L+ AS Sbjct: 1 SSNIPKVAIIVTDGRPQD-----QVNEVAARAQASGIELYAVGVDRADMAS-LKMMASEP 54 Query: 342 -PNSFYLVEN---PHSMYDAFSHI 361 + VE + F Sbjct: 55 LEEHVFYVETYGVIEKLSSRFQET 78 >gi|254695378|ref|ZP_05157206.1| Von Willebrand factor, type A [Brucella abortus bv. 3 str. Tulya] gi|261215750|ref|ZP_05930031.1| norD protein [Brucella abortus bv. 3 str. Tulya] gi|260917357|gb|EEX84218.1| norD protein [Brucella abortus bv. 3 str. Tulya] Length = 633 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S E + +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVAMPAIYR 629 >gi|192289002|ref|YP_001989607.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] gi|192282751|gb|ACE99131.1| von Willebrand factor type A [Rhodopseudomonas palustris TIE-1] Length = 372 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 60/184 (32%), Gaps = 25/184 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S +Q +++ VLD + SM + + KI +I L + + ++ Sbjct: 20 SFATAAQARPSVEVAFVLDTTGSMSGLIEGAKRKIWSIATTI--------LDDNPDADIR 71 Query: 221 SGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDM 273 GLV + + + L + L ++ L G + L A N++ Sbjct: 72 MGLVAYRDIGDDYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKLHWR 131 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 QG + IVF+ Q + Y A R + ++ + Sbjct: 132 QGGD----------TRRIVFLVGDAPPHMDYAQDTKYPETLAVARQKDIIVNAVQAGDAR 181 Query: 334 EFLR 337 + R Sbjct: 182 DTAR 185 >gi|78189842|ref|YP_380180.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172041|gb|ABB29137.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 334 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 60/159 (37%), Gaps = 19/159 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LDVSRSM++ D+A + +E+ I + GL+ F+ Sbjct: 90 DVLFILDVSRSMQAT--------DVAPNRLMRAKQEIAAISQNVQGGRRGLLIFAASPLL 141 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + T P A + IFD+ +T + Sbjct: 142 HCPLTTDRDGFATLLNMAAPELIEEQGTRLQPAFALA-STIFDVANESNAASTRGV---Q 197 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +IV ++DGE+ + + + K+ ++ IG+ Sbjct: 198 VIVLLSDGEDHDSNVQRAAQQLA----KQSVQLFVIGVG 232 >gi|62317822|ref|YP_223675.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|83269804|ref|YP_419095.1| von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189023075|ref|YP_001932816.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237817362|ref|ZP_04596354.1| Protein norD [Brucella abortus str. 2308 A] gi|254691324|ref|ZP_05154578.1| Von Willebrand factor, type A [Brucella abortus bv. 6 str. 870] gi|254699113|ref|ZP_05160941.1| Von Willebrand factor, type A [Brucella abortus bv. 2 str. 86/8/59] gi|254732556|ref|ZP_05191134.1| Von Willebrand factor, type A [Brucella abortus bv. 4 str. 292] gi|256256509|ref|ZP_05462045.1| Von Willebrand factor, type A [Brucella abortus bv. 9 str. C68] gi|260545056|ref|ZP_05820877.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260756933|ref|ZP_05869281.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260760365|ref|ZP_05872713.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260763605|ref|ZP_05875937.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260882746|ref|ZP_05894360.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297249867|ref|ZP_06933568.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] gi|75495379|sp|Q576X0|NORD_BRUAB RecName: Full=Protein norD gi|114152095|sp|Q2YJT9|NORD_BRUA2 RecName: Full=Protein norD gi|62198015|gb|AAX76314.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|82940078|emb|CAJ13118.1| Von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189021649|gb|ACD74370.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237788175|gb|EEP62391.1| Protein norD [Brucella abortus str. 2308 A] gi|260098327|gb|EEW82201.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260670683|gb|EEX57623.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260674026|gb|EEX60847.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260677041|gb|EEX63862.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260872274|gb|EEX79343.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297173736|gb|EFH33100.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] Length = 633 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 84/210 (40%), Gaps = 34/210 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +++DVS S +++ D+ +D+ +++ + + D + ++TF+ Sbjct: 440 QANDLAVTLLVDVSLSTDAWVDNRRV-LDVEKEALLVLANGIAACGD-----RCSILTFT 493 Query: 228 NK--------IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++ + F +G + ++ +I L G T +++A ++ Sbjct: 494 SRRRSWVRVETVKDFDESFGPT-VEHRIAALKP-GFYTRMGAAMRHATAKL--------- 542 Query: 280 CNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 E N KK+++ +TDG+ + S E + +G V+A+ + + Sbjct: 543 --AEQPNRKKLLLLLTDGKPNDVDHYEGRFALEDSRRAAGEVRAKGVNVFAVTVDR-EAS 599 Query: 334 EFLRACASPNSFYLVENPHSMYDAFSHIGK 363 +L A + LV N + A I + Sbjct: 600 AYLPALFGRGGYALVANLAKLPVAMPAIYR 629 >gi|260789918|ref|XP_002589991.1| hypothetical protein BRAFLDRAFT_224797 [Branchiostoma floridae] gi|229275178|gb|EEN46002.1| hypothetical protein BRAFLDRAFT_224797 [Branchiostoma floridae] Length = 103 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 35/96 (36%), Gaps = 11/96 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ ++ K+ K++ + G+V +S+ Sbjct: 1 DLFFVLDGSGSVS---GTNFDKVKQFTKNVANAFNISASA------TRVGVVQYSDSNTL 51 Query: 233 FFLLEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYA 266 F L+ I + G TN+ L++A Sbjct: 52 EFNLDDHADKPSTLVAIDRILYHGGGTNTGSALEFA 87 >gi|163889361|gb|ABY48131.1| zinc finger protein [Medicago truncatula] Length = 691 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 54/147 (36%), Gaps = 21/147 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + V A +D++IVLDV +M S K+ + ++ ++ + Sbjct: 275 LKLKTPAPAPVKVLRRAPVDVVIVLDVGGAM------SGQKLRLMKNTMRLVISSLNATD 328 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + +V FS + L+ G +R ++ L+ ++ P A + Sbjct: 329 ------RLSIVAFSGGSKRLLPLKRMTGGGQRSARRIVEALAAIDQIRDAVPAKNDALKK 382 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTD 296 + R+ N IV ++D Sbjct: 383 AAKVLEDRREKNPVAC-----IVVLSD 404 >gi|193208855|ref|NP_505147.3| C-type LECtin family member (clec-218) [Caenorhabditis elegans] gi|163644494|gb|AAY43994.2| C-type lectin protein 218 [Caenorhabditis elegans] Length = 389 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 66/189 (34%), Gaps = 20/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGLVTFSNKI 230 LD+ +++D S M S + ++ + S+ + V + ++T+S+ Sbjct: 37 LDVYLLIDNSAKMGS---VGLLEVASNVNSVFGFTQIRVGSNYPDKRGARVSVLTYSDSP 93 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S L I L ST+ L+ + + +M + + Sbjct: 94 TVHANLSDFKSTDELTSMIYALKP---STSYDSNLQSSLKLVKNMMNYKDINAPRNNTQT 150 Query: 289 KIIVFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIG-IRVIRSHEF-----LRACAS 341 II++ D D+ ++ + ++ K G + + I L+ AS Sbjct: 151 VIIIYAGD----YVDYDEPTIAQFGDQLKADGVKIITVADISNTDHQHVSKLKWLKELAS 206 Query: 342 PNSFYLVEN 350 + + + + Sbjct: 207 EGNGFNIND 215 >gi|319952787|ref|YP_004164054.1| hypothetical protein Celal_1241 [Cellulophaga algicola DSM 14237] gi|319421447|gb|ADV48556.1| protein of unknown function DUF58 [Cellulophaga algicola DSM 14237] Length = 288 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 43/159 (27%), Positives = 66/159 (41%), Gaps = 24/159 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V+DVS S E F S+ I + + N + GL+ F Sbjct: 72 EEERELTMMLVVDVSGS-ELFGSSNQ-----FKNEIITEISATLAFSALQNNDKIGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++IE F + G SH+ R I+ L +F +N T L A + Sbjct: 126 SDQIELFIPPKKGKSHVLRIIRELIEFTPKSNKTD-LAVALKYLT------------RVM 172 Query: 287 YKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D T++ Q+L + G VY Sbjct: 173 KKKAIVFVLSD---FITEDYLQTLRITGKKHDVTGIRVY 208 >gi|256420212|ref|YP_003120865.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035120|gb|ACU58664.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 212 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +VLD S SM S I+ + ++ ++ P L+TF + Sbjct: 3 RLPVYLVLDTSGSM------SGEPIEAVKNGVQVLISTLRQDPYALETAFLSLITFDSDA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L S ++ L G T+ L+ + I D + Sbjct: 57 RQLVPLTDLSSF---QMPELKASG-GTSLGSALELVADSINREVAKSTPDVKGDWKP--L 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + MTDG T ++ N + + + + Sbjct: 111 VFLMTDGIPTDTWQNGL-----NAFQNTKIGI-TVACAAGNGADV 149 >gi|329935922|ref|ZP_08285724.1| hypothetical protein SGM_1216 [Streptomyces griseoaurantiacus M045] gi|329304613|gb|EGG48489.1| hypothetical protein SGM_1216 [Streptomyces griseoaurantiacus M045] Length = 573 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 69/193 (35%), Gaps = 25/193 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ + T AR + +VLD S SM ++ D S + ++ A L V Sbjct: 379 ALRTHGLTGARAKVHLVLDRSASMRPYYKDGSAQALGEQTLALAAHL------DPEATVH 432 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + FS +++ L ++ + K+ L G+ A + Sbjct: 433 ---VTFFSTELDGTGELT--LAAHENKVDELHA-GLGRMGRTSYHAAVEAVLAKHAEEAP 486 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA-IGIRVIRSHEF--L 336 ++VF TDG + Q+L + A + + ++ + + F L Sbjct: 487 GTPA------LVVFQTDGAPDAKTPATQALK--DAAAEHPEVFFSFVAFGEHENKAFDYL 538 Query: 337 RACASPN-SFYLV 348 R + N SF+ Sbjct: 539 RKLKTENTSFFHA 551 >gi|325475174|gb|EGC78359.1| hypothetical protein HMPREF9353_00373 [Treponema denticola F0402] Length = 380 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 71/189 (37%), Gaps = 24/189 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + I+ I +K + LD++ LD + SM+ ++ K+I Sbjct: 202 GEMIYAKGPEDILSTIKEILKKGEKDH--LDLLFALDSTESMKD-------DVEEVRKNI 252 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNK-----IEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ML E +P + LV + + + E + + ++ + FG Sbjct: 253 SSMLAE--TLPQYKTY-RIALVLYKDYREDFLVREACVFTDNLKKFEKALYGFKVFGGR- 308 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AK 316 + Y IF G+RQ DA+ K ++ + D + + ++ A Sbjct: 309 ---DIPEAVYEGIFL--GLRQSWRALDADVDKKLILIGDAPPHPKPRGKVTKEDVDKLAA 363 Query: 317 KRGAIVYAI 325 ++G +Y I Sbjct: 364 EKGVKIYPI 372 >gi|194221204|ref|XP_001915782.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Equus caballus] Length = 1055 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 215 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 262 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 263 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNILSDFNHTGQ 321 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 322 GSICIQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 372 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 373 WMACANKGFFTQ 384 >gi|42525712|ref|NP_970810.1| hypothetical protein TDE0194 [Treponema denticola ATCC 35405] gi|41815723|gb|AAS10691.1| conserved hypothetical protein [Treponema denticola ATCC 35405] Length = 380 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 71/189 (37%), Gaps = 24/189 (12%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + I+ I +K + LD++ LD + SM+ ++ K+I Sbjct: 202 GEMIYAKGPEDILSTIKEILKKGEKDH--LDLLFALDSTESMKD-------DVEEVRKNI 252 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNK-----IEEFFLLEWGVSHLQRKIKYLSKFGVST 257 ++ML E +P + LV + + + E + + ++ + FG Sbjct: 253 SSMLAE--TLPQYKTY-RIALVLYKDYREDFLVREACVFTDNLKKFEKALYGFKVFGGR- 308 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AK 316 + Y IF G+RQ DA+ K ++ + D + + ++ A Sbjct: 309 ---DIPEAVYEGIFL--GLRQSWRALDADVDKKLILIGDAPPHPKPRGKVTKEDVDKLAA 363 Query: 317 KRGAIVYAI 325 ++G +Y I Sbjct: 364 EKGVKIYPI 372 >gi|313225207|emb|CBY21001.1| unnamed protein product [Oikopleura dioica] Length = 489 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 19/140 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K LD++I++DV + ++ ++ K NA + + + Sbjct: 103 KQEPAPKCDLDVVIMIDVCSCGWDIWQGVVSYVEAIAKKFNAEIGV------SSEEARIS 156 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V +S E + ++ ++ +S T T GL A ++ Sbjct: 157 IVQYSKSQETVISFADNIEDIKSAVEQMSPENFVAEGTYVTNGLDQAISEF--------- 207 Query: 280 CNTEDANYKKIIVFMTDGEN 299 N N KK++V + DG N Sbjct: 208 -NKARPNSKKVLVTVADGYN 226 >gi|313221456|emb|CBY32206.1| unnamed protein product [Oikopleura dioica] Length = 489 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 51/140 (36%), Gaps = 19/140 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K LD++I++DV + ++ ++ K NA + + + Sbjct: 103 KQEPAPKCDLDVVIMIDVCSCGWDIWQGVVSYVEAIAKKFNAEIGV------SSEEARIS 156 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V +S E + ++ ++ +S T T GL A ++ Sbjct: 157 IVQYSKSQETVISFADNIEDIKSAVEQMSPENFVAEGTYVTNGLDQAISEF--------- 207 Query: 280 CNTEDANYKKIIVFMTDGEN 299 N N KK++V + DG N Sbjct: 208 -NKARPNSKKVLVTVADGYN 226 >gi|308472877|ref|XP_003098665.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] gi|308268265|gb|EFP12218.1| hypothetical protein CRE_04175 [Caenorhabditis remanei] Length = 396 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 69/199 (34%), Gaps = 24/199 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D SR M + + ++ I S+ + L + GLVT+++ Sbjct: 38 LDVIAVVDNSRGMT---VAGLNEVAANIASVFGSGTRIGLNASEPRTTRLGLVTYNSVAT 94 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNST---PGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + + I V TN + GL+ A + D Q N+ A+ Sbjct: 95 QKADLNQYQSIGDVFHGIFDALSNIVDTNESYLATGLELAERMLID-----QSVNSTRAH 149 Query: 287 YKKIIVFMT---DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASP 342 Y+K+++ DG + L K G + + + ASP Sbjct: 150 YQKVVIVYASEYDG-----NGELDPLPIAERLKLSGVKIITVAYGNAYGLTKSFSNIASP 204 Query: 343 NSFYLVENPHSMYDAFSHI 361 + N + I Sbjct: 205 G--FAFSNSDNQGKLIGQI 221 >gi|224534313|ref|ZP_03674891.1| conserved hypothetical protein [Borrelia spielmanii A14S] gi|224514415|gb|EEF84731.1| conserved hypothetical protein [Borrelia spielmanii A14S] Length = 370 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 80/198 (40%), Gaps = 31/198 (15%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 KF + N IV I +K + + LD+++V+DV+ SM+S I++ Sbjct: 196 TKFSDEVVIANNGIDIVDKIKKILKNSEDSVYDLDLVLVIDVTDSMKSN-------IEIL 248 Query: 199 IKSINAMLE-EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS---HLQRKIKYLSKFG 254 + + +++E +++ + GLV + + +E+F + + +L +KY++ G Sbjct: 249 KEHLFSIIEPQLQKFKSY----RIGLVFYKDYLEDFLTKAFDFNTIPYLNNILKYVNVGG 304 Query: 255 VS---TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLY 310 G+ A Q FD + R + I+ + D + Sbjct: 305 GGDYPEAVFEGIDAAVTQ-FDWRAER-----------RFIIVIGDAPPHEYPRGSIVYKD 352 Query: 311 YCNEAKKRGAIVYAIGIR 328 N AK++ +Y I + Sbjct: 353 VINSAKEKDITIYGIIFQ 370 >gi|294674673|ref|YP_003575289.1| tellurium resistance protein [Prevotella ruminicola 23] gi|294473007|gb|ADE82396.1| putative tellurium resistance protein [Prevotella ruminicola 23] Length = 211 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 61/190 (32%), Gaps = 16/190 (8%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + I++D S SM I+ + M+ ++ P V ++TF ++ Sbjct: 4 LPVYILIDTSGSM------KGEPIESVKVGLADMVATLRQDPYALETVCISIITFDREVS 57 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L + I L+ T+ LK ++I D I Sbjct: 58 QVLPLTELENLQMPDI--LTPDTGPTHLGKALKMLCDKIAVEVKRGTPEQKGDWRPLLFI 115 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +TDG+ ++ Q + A + A E L+ + + ++ Sbjct: 116 --LTDGKPSDVQDYNQIIPKVKSL--NFASIVACAAGPKAKVEPLQQLT--DQVFRLDTM 169 Query: 352 HSMYDAFSHI 361 + +F Sbjct: 170 DA--TSFKKF 177 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 45/281 (16%), Positives = 93/281 (33%), Gaps = 31/281 (11%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF--VNDIDDIVRSTSLDIVVVPQNE 125 E G I + N SF + F D + S ++ Sbjct: 190 EPQGIKELNADVSFITNDLMNVLTTSFSGKKGHVSFKPTFDQQRSCPTCSTSLLNGDFII 249 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y ++ + + + + +++ + +++ V+DVS SM Sbjct: 250 TYDVNREQPGNVQV-VNGYFVHFFAPQNLPV-------------VPKNVVFVIDVSGSM- 294 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEEFFLLEWGVSHLQ 244 S K+ +++ +L +VK +N ++ S V T+ + + + Sbjct: 295 -----SGRKLVQTKEALLKILSDVKKDDFLNFILFSSDVRTWKENLVPATPE--NLKAAE 347 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTK 303 + + G TN GL + + M + + II+ +TDGE N+ Sbjct: 348 EFVHQIQATG-GTNINDGLLRGIEMVNKAREMGTVLDRSTS----IIIMLTDGEANVGES 402 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ A +Y +G ++ FL A N+ Sbjct: 403 RVEKIQENVRNAIGGKYPLYNLGFGYDVNYNFLERMALENN 443 >gi|74001590|ref|XP_848825.1| PREDICTED: similar to Collagen alpha 1(VI) chain precursor [Canis familiaris] Length = 206 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 24/114 (21%), Positives = 39/114 (34%), Gaps = 10/114 (8%) Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I + L+ K+ + FG T + +K ++ H K Sbjct: 101 ISPLTPMPADRDALKAKVDAVKYFGKGTYTDCAIKKGLEELLVGG---SHLKEN-----K 152 Query: 290 IIVFMTDGEN-LSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 ++ +TDG KE L NEAK G V+++ I L A+ Sbjct: 153 YLIVVTDGHPLEGYKEPCGGLEDAVNEAKHLGVKVFSVAITPDHLEPRLSIIAT 206 >gi|313226010|emb|CBY21153.1| unnamed protein product [Oikopleura dioica] Length = 1546 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 82/220 (37%), Gaps = 28/220 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + LD++ ++D S S+ S TKI I ++L + Sbjct: 1194 AVPCLTDDVNEICKEQMLDIVFLIDGSDSITS---DEFTKIQKWISKFTSLL----SPSE 1246 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + ++ +V FS+ + + + ++ + + +T + L Y + Sbjct: 1247 LTQALRVVVVQFSDIVLTNVDYQLNTASDLAEFDLRLAVIRQIAKNTMTGKALSYVAEKT 1306 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 A K ++ TDG + ++D ++L ++ + A+G+ Sbjct: 1307 LPS---------LRAESLKTMITFTDGGSRDHIRRDDIKTLQ------EQFDFMLAVGVG 1351 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + LR ++ V+ +++ D +I + + +K Sbjct: 1352 PSARDDELRMLSTKGLSIHVKEYNALADLVENILQSVASK 1391 >gi|187736264|ref|YP_001878376.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426316|gb|ACD05595.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 754 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 36/176 (20%), Positives = 69/176 (39%), Gaps = 27/176 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I P+ +V +R +++I +D SRSM S D+S T++ A + +L+ + Sbjct: 71 SIARPVDGYTEVKEIPKSR-NILIAIDCSRSMLSK-DASPTRLGRAKTAAYDLLDAL--- 125 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG----VSTNSTPGLKYAYN 268 +N G++ FS L + L+ I+ L +FG TN ++ A Sbjct: 126 -PGDNF---GIIIFSGDAVLLMPLTHDHNALKETIEQL-QFGWVSQGGTNLENVVRLALQ 180 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + K +V ++DGE+ + A++ I+ Sbjct: 181 TFKRDKEA---------DAKNALVILSDGEDTVN----ITYKTAEAARQHQLIIVT 223 >gi|157375629|ref|YP_001474229.1| vault protein inter-alpha-trypsin subunit [Shewanella sediminis HAW-EB3] gi|157318003|gb|ABV37101.1| vault protein inter-alpha-trypsin domain protein [Shewanella sediminis HAW-EB3] Length = 770 Score = 44.0 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 72/185 (38%), Gaps = 26/185 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+ ++++V+D S SM S + ++ A K++ L + N ++ Sbjct: 368 KSRNRVSRELILVIDTSGSM------SGSAMEQAKKAMKYALAGLGSDDTFN------VI 415 Query: 225 TFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD----MQG 275 F++K+ + R + L+ G T L++A Q + Sbjct: 416 EFNSKVSSLSKGPIPASTKNIEMANRFVHSLTSDG-GTEMALALEHALGQESGGSSWQET 474 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Q + E + + ++FMTDG ++ L+ + + + ++ +GI + F Sbjct: 475 GLQGKDEESTSRLRQVLFMTDG----AVGNEAELFKLIKYRIGKSRLFTLGIGSAPNSHF 530 Query: 336 LRACA 340 ++ A Sbjct: 531 MQRAA 535 >gi|313200801|ref|YP_004039459.1| outer membrane adhesin-like protein [Methylovorus sp. MP688] gi|312440117|gb|ADQ84223.1| outer membrane adhesin like proteiin [Methylovorus sp. MP688] Length = 1543 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 58/154 (37%), Gaps = 15/154 (9%) Query: 174 MMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I LD+S SM + T++ AI+SI +L++ + D + LV FS + Sbjct: 1033 LLITLDISGSMRTHDGVGDTTRLASAIQSIKVLLDKYDALGD----TRISLVVFSTTAAQ 1088 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + ++ + G +TN L A + D + N + + Sbjct: 1089 VGTDWMTIDQAKAQLDQILVKGPGGNTNYDSALANAMDAFDDPGKL--------TNAQNV 1140 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 F++DGE + SL A + Sbjct: 1141 AYFISDGEPNTGSGSNTSLTGGTNTNSSDAGIQT 1174 >gi|326772465|ref|ZP_08231749.1| conserved hypothetical protein [Actinomyces viscosus C505] gi|326637097|gb|EGE37999.1| conserved hypothetical protein [Actinomyces viscosus C505] Length = 367 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 65/193 (33%), Gaps = 13/193 (6%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKID-MAIKSINAMLEEVK 210 + + + + + + +++ +V+D + SM E + +D A ++ + +++ Sbjct: 64 LALAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGPDGVDQSASTRLDGVRADMR 123 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 I + + ++ N L + + I + S + L+ A + Sbjct: 124 AIREAFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVSSHATGSSLEVALPVL 183 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 D +++ +DGE ++ + + A G ++ ++ Sbjct: 184 ---GQTLAQARQSDPKDIRLVYIFSDGE---ATDNGRGAQTADNA---GISWQSLA-GLV 233 Query: 331 RSHEFLRACASPN 343 L ++ Sbjct: 234 DGGAVLGYGSTEG 246 >gi|164425557|ref|XP_960517.2| hypothetical protein NCU05555 [Neurospora crassa OR74A] gi|157070974|gb|EAA31281.2| predicted protein [Neurospora crassa OR74A] Length = 766 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 68/212 (32%), Gaps = 40/212 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK----------------IDMAIKSINAMLEEVKLIPD 214 D+++ +DVS SM + T +D+ + ++ + Sbjct: 68 PCDIVLAIDVSGSMSADAPVPTTASADYTNEQPEHNGLSVLDLVKHAARTIVSTLNSSD- 126 Query: 215 VNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G+VTFS + + L +R + + F +TN G+ ++ Sbjct: 127 -----RLGIVTFSTEAKVLQPLMPMTALNKKKTERNLGGMQPF-SATNLWGGIVEGL-KL 179 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 FD Q R ++ +TDG Q + + A ++ G Sbjct: 180 FDGQSGRMPA----------LMVLTDGMPNHMCPAQGYVAKLRAMETLPAAIHTFGFGYS 229 Query: 331 RSHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L++ A + + + + F H Sbjct: 230 LRSGLLKSVAEIGGGGYSFIPDAGMIGTVFVH 261 >gi|320016843|gb|ADW00415.1| putative tellurium resistance protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 212 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 25/131 (19%), Positives = 40/131 (30%), Gaps = 12/131 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + ++LD S SM I+ + +L ++ P ++TF + Sbjct: 4 LPVYLLLDTSGSMTG------EPIEAVKNGVQMLLSTLRQDPYALETAYVSVITFDSSAR 57 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L L K+ L G +T L I + D I Sbjct: 58 QAVPL---TDLLNFKLPELVANG-TTALGDALSLTAKCIGNEVQKTTADTKGDWRPLVFI 113 Query: 292 VFMTDGENLST 302 MTDG Sbjct: 114 --MTDGSPTDD 122 >gi|83648073|ref|YP_436508.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83636116|gb|ABC32083.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 261 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 68/193 (35%), Gaps = 26/193 (13%) Query: 175 MIVLDVSRSMESFFDSSIT-KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--IE 231 IV D S SM++ K+D+A ++ +E++ + G+ F + E Sbjct: 75 YIVFDGSGSMDNTDCGDGKRKLDVAKTAVKKFVEQLPADAN------VGVYAFDGQGVGE 128 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L +++ I L G T + GL+ + G + + Sbjct: 129 RTHLATQNRPLVKQMIDQLVA-GGGTPLSAGLEDGKAALTAQAGKQLGYGEYH------L 181 Query: 292 VFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN-SFY-LV 348 V +TDG +L + D + ++ IG + H + P +FY Sbjct: 182 VIITDGLASLGYETDGAVQTI---LQDTPINIHTIGFCIGDDHSLHQ----PGLTFYRSA 234 Query: 349 ENPHSMYDAFSHI 361 +P S+ + + Sbjct: 235 SDPDSLMAGLNEV 247 >gi|268530522|ref|XP_002630387.1| Hypothetical protein CBG04326 [Caenorhabditis briggsae] Length = 1016 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 73/200 (36%), Gaps = 26/200 (13%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S Y++P P+ ++ LD++I D+S S+ Sbjct: 340 VCSFYQLPAAPK------AGFEFFNYPLGELDAGDTTKCQVLDIIIAFDISESLSRII-- 391 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIK 248 + + ++ + K N+ + G++TF++++ E L G ++ + I Sbjct: 392 ----LPKYVAFAERLVAQYK--YKANDFTRVGVLTFNDQVTEKLTLVNGNDLAAVNAAID 445 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + G T+ T LK A Q+F + DA+ K+++ ++D + Sbjct: 446 SVQYVGGLTDVTKALKTA-QQLFTTES--------DASRSKVLIVLSDAVPTVDTYTDE- 495 Query: 309 LYYCNEAKKRGAIVYAIGIR 328 + + G + IG Sbjct: 496 IQAGKQLSAMGVATFFIGYN 515 >gi|48374067|ref|NP_001001537.1| inter-alpha-trypsin inhibitor heavy chain H4 [Sus scrofa] gi|3024051|sp|P79263|ITIH4_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H4; Short=ITI heavy chain H4; Short=ITI-HC4; Short=Inter-alpha-inhibitor heavy chain 4; AltName: Full=Inter-alpha-trypsin inhibitor family heavy chain-related protein; Short=IHRP; AltName: Full=Major acute phase protein; Short=MAP; Flags: Precursor gi|1836014|gb|AAB46821.1| IHRP [Sus scrofa] gi|4096979|gb|AAD00024.1| inter-alpha-trypsin inhibitor family heavy chain-related protein [Sus scrofa] gi|1588326|prf||2208343A inter-alpha-trypsin inhibitor Length = 921 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 40/130 (30%), Gaps = 20/130 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---KI 230 ++ V+D S SM KI +++ +L ++ N LV+FS + Sbjct: 273 VIFVIDTSGSMRG------RKIQQTREALIKILGDLGSRDQFN------LVSFSGEAPRR 320 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 V + + G TN + A + Sbjct: 321 RAVAASAENVEEAKSYAAEIHAQG-GTNINDAMLMAVQLLERANREELLPARSVT----F 375 Query: 291 IVFMTDGENL 300 I+ +TDG+ Sbjct: 376 IILLTDGDPT 385 >gi|330500925|ref|YP_004377794.1| PpkA-like protein [Pseudomonas mendocina NK-01] gi|328915211|gb|AEB56042.1| PpkA-related protein [Pseudomonas mendocina NK-01] Length = 1008 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 91/272 (33%), Gaps = 38/272 (13%) Query: 105 NDIDDIVRSTSLDIVVV--PQNEGYSI------SAISRYKIPLKFCTFIPWYTNSRHIVM 156 ND + + + +L+ PQ++ Y + A P++ + Sbjct: 514 NDPEQVPQVVALEPTASAVPQDQFYLMPIFDFREAFDAEGQPVQLLNIASIDPGNAAQSQ 573 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + R +++V+D S SM+ + ID + ++ + +++ +++ Sbjct: 574 TPAGKPATATDSGFRTGIVLVVDTSVSMQPY-------IDRVRQVVSELQSQLQARGELD 626 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHL---------QRKIKYLSKFGVSTNSTPGLKY-A 266 N V GLV + N E L++ L QR ++ + ++ S+ A Sbjct: 627 N-VSFGLVGYRNSTERTPGLQYLSKTLVSLQDGGDPQRFLRAAEQVQATSVSSHSFNEDA 685 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---ENLSTKEDQQS--LYYCNEAKKRGAI 321 + + N ++I+ ++D + A + Sbjct: 686 FAGVMQAVEGMDWSNYGG----RLILLVSDAGALRKNDPHSSTRMNEAEVRQAALSKQIK 741 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 ++A+ +R AS Y V S Sbjct: 742 IFALHLRTPAGKA---NHASAEQQYRVLTADS 770 >gi|293354104|ref|XP_001061161.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] Length = 550 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 75/205 (36%), Gaps = 18/205 (8%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ V D S+++ S ++ T + ++ ++L+P + + F + Sbjct: 71 GDFDIYFVFDKSKAVTS-WNDLYTSWEDLVEKYVKYACTLQLLPKQH----MSFILFDTR 125 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++++ L +TN GL+ A Q+ N+ Sbjct: 126 ASIKTPLISERWKIRQELTKLDSQVLEGATNLQKGLEKA-------NEQIQNVNSGATKR 178 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ GE + ++S ++A+ GA VY +G+ + + + ++ Sbjct: 179 SSLIIAAVWGELT-PQVLKESKDEADKARSMGAYVYCVGLNSSERQKLNEITDNEENVFI 237 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 ++ + F I +I T Sbjct: 238 LDGQSN---RFHSIVDEIGTMACLE 259 >gi|293342304|ref|XP_002725205.1| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] Length = 603 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 75/205 (36%), Gaps = 18/205 (8%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 D+ V D S+++ S ++ T + ++ ++L+P + + F + Sbjct: 71 GDFDIYFVFDKSKAVTS-WNDLYTSWEDLVEKYVKYACTLQLLPKQH----MSFILFDTR 125 Query: 230 IEEFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L +++++ L +TN GL+ A Q+ N+ Sbjct: 126 ASIKTPLISERWKIRQELTKLDSQVLEGATNLQKGLEKA-------NEQIQNVNSGATKR 178 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +I+ GE + ++S ++A+ GA VY +G+ + + + ++ Sbjct: 179 SSLIIAAVWGELT-PQVLKESKDEADKARSMGAYVYCVGLNSSERQKLNEITDNEENVFI 237 Query: 348 VENPHSMYDAFSHIGKDIVTKRIWY 372 ++ + F I +I T Sbjct: 238 LDGQSN---RFHSIVDEIGTMACLE 259 >gi|118096863|ref|XP_414338.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-3 subunit [Gallus gallus] Length = 1090 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 250 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 297 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + + Q Sbjct: 298 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNMLNEFNHTGQ 356 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 357 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 407 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 408 WMACANKGFFTQ 419 >gi|75812633|ref|YP_320251.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705389|gb|ABA25062.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 464 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 39/236 (16%), Positives = 80/236 (33%), Gaps = 44/236 (18%) Query: 139 LKFCTFIPWYTNSRHI-VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT---- 193 F+P T + + +M K + + V+D S SM T Sbjct: 10 TPHREFMPAETEGQKLFLMLKLRPTKEVAVSRPPTTFAFVIDTSGSMYEIVTGETTPTGV 69 Query: 194 -----------------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 KID+ I+S+ A++ +L + +V F + + L Sbjct: 70 TYTQDAKEYSQVTGGKSKIDIVIESLLALVRSGRLEASD----RVAIVQFDDTASQIIDL 125 Query: 237 EWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S L+ I L F T GL+ A + + + +D ++ ++F Sbjct: 126 TPATQVSQLENAIAQLRSFSGGTRMGLGLRRALDML----------SGQDMAVRRTLLF- 174 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLV 348 TDG+ ++ ++ + + A+G+ + L ++ + + V Sbjct: 175 TDGQTF---DEDICRALASDFATKNIPITALGVGEDFKEDLLSHLSDSTGGTLFYV 227 >gi|330839047|ref|YP_004413627.1| Magnesium chelatase [Selenomonas sputigena ATCC 35185] gi|329746811|gb|AEC00168.1| Magnesium chelatase [Selenomonas sputigena ATCC 35185] Length = 648 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 55/148 (37%), Gaps = 21/148 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +++ ++D S SM + ++ M ++ A+L E + GL+ F Sbjct: 461 RASANILFLVDASGSMGAK-----ERMKMVKGAVLALLREA-----YQKRDRVGLIAFRR 510 Query: 229 K-IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E + V ++ ++ L G T GL A ++ D ++ T Sbjct: 511 TSAETLLPMTRSVELAEKALRSLPT-GGKTPLAEGLAAAL-KMMDELSRKEGAET----- 563 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEA 315 ++V +TDG + ++ A Sbjct: 564 --VLVLVTDGR-TNVSAAGKAKEEALRA 588 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 26/257 (10%), Positives = 65/257 (25%), Gaps = 18/257 (7%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ + + G +++ + + PQ Sbjct: 29 RLKHRSAQSGASDHAGRKGARPLRSVLASLCAVAMSLGMASASVAAFADDRQPAATADPQ 88 Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSR 182 S P + + M +T +++ LD+ +VLDVS Sbjct: 89 AATASA---GNVDAPQHTKRISKNDDGTYTLSMDVTGKSDESTEQQVVPLDIALVLDVSG 145 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM + ++ + S+N + V+ +++ G + Sbjct: 146 SMNE-LSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAISWGRCTTWQDQDSAGQKY 204 Query: 243 -------------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + +++ LK A D + + + Sbjct: 205 TVTYNWIGGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFLDQVEDQNQRINDPGKKVQ 264 Query: 290 IIVFMTDGENLSTKEDQ 306 + + G+N + Sbjct: 265 VALIKYAGKNSDKIGND 281 >gi|260887186|ref|ZP_05898449.1| magnesium-chelatase, subunit D/I family [Selenomonas sputigena ATCC 35185] gi|260863248|gb|EEX77748.1| magnesium-chelatase, subunit D/I family [Selenomonas sputigena ATCC 35185] Length = 657 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 55/148 (37%), Gaps = 21/148 (14%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +++ ++D S SM + ++ M ++ A+L E + GL+ F Sbjct: 470 RASANILFLVDASGSMGAK-----ERMKMVKGAVLALLREA-----YQKRDRVGLIAFRR 519 Query: 229 K-IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E + V ++ ++ L G T GL A ++ D ++ T Sbjct: 520 TSAETLLPMTRSVELAEKALRSLPT-GGKTPLAEGLAAAL-KMMDELSRKEGAET----- 572 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEA 315 ++V +TDG + ++ A Sbjct: 573 --VLVLVTDGR-TNVSAAGKAKEEALRA 597 >gi|189347157|ref|YP_001943686.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] gi|189341304|gb|ACD90707.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] Length = 2825 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 68/192 (35%), Gaps = 20/192 (10%) Query: 155 VMPITSSVKVNSQTDARLD--MMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLEEVK 210 +PI + V S T +D + I+LDVS SM T++ A ++I +++ + Sbjct: 2138 DVPIPT-VVSESFTAKPIDTNLTIILDVSGSMNDVIPNSGGKTRLQFAKEAIASLINQYD 2196 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + DV VQ + S + W +I LS G STN L + Sbjct: 2197 ALGDVK--VQIITFSASAQPIIVSGQVWLDPGTAITQINGLSASG-STNYDDAL---VDV 2250 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE-----DQQSLYYCNEAKKRGAIVYA 324 + + + N F++DG + + + + YA Sbjct: 2251 MTTYNNAGKLSTPQSQNVG---YFLSDGVPNTPATSPGINSPEEQQWESFLTTNNINNYA 2307 Query: 325 IGIRVIRSHEFL 336 IG+ + L Sbjct: 2308 IGVGAGATQATL 2319 >gi|309364385|emb|CAP25058.2| hypothetical protein CBG_04326 [Caenorhabditis briggsae AF16] Length = 862 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 73/200 (36%), Gaps = 26/200 (13%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 S Y++P P+ ++ LD++I D+S S+ Sbjct: 355 VCSFYQLPAAPK------AGFEFFNYPLGELDAGDTTKCQVLDIIIAFDISESLSRII-- 406 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIK 248 + + ++ + K N+ + G++TF++++ E L G ++ + I Sbjct: 407 ----LPKYVAFAERLVAQYK--YKANDFTRVGVLTFNDQVTEKLTLVNGNDLAAVNAAID 460 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + G T+ T LK A Q+F + DA+ K+++ ++D + Sbjct: 461 SVQYVGGLTDVTKALKTA-QQLFTTES--------DASRSKVLIVLSDAVPTVDTYTDE- 510 Query: 309 LYYCNEAKKRGAIVYAIGIR 328 + + G + IG Sbjct: 511 IQAGKQLSAMGVATFFIGYN 530 >gi|258624109|ref|ZP_05719059.1| von Willebrand factor type A domain protein [Vibrio mimicus VM603] gi|258583540|gb|EEW08339.1| von Willebrand factor type A domain protein [Vibrio mimicus VM603] Length = 598 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 70/188 (37%), Gaps = 22/188 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ ++ IP Sbjct: 403 VTGDTRVFLKPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIP 462 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF + F +++ G + R ++ K T + YA ++ Sbjct: 463 GVNP-----AVTFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFEL 517 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + R K+++ +TDG+ S + + C + V IG+ Sbjct: 518 TKTREER-----------KMLIVVTDGQPQSAPACRSVIDLCE---RSDVEVIGIGVETA 563 Query: 331 RSHEFLRA 338 + Sbjct: 564 AVSGLFQK 571 >gi|222637454|gb|EEE67586.1| hypothetical protein OsJ_25118 [Oryza sativa Japonica Group] Length = 755 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 57/188 (30%), Gaps = 42/188 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM+ ++ +++ L E+ N ++TF++++ F Sbjct: 335 VVFVIDTSGSMQGHP------LENVKNAMSTALSELTEGDYFN------IITFNDELHSF 382 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E ++ I G T+ L A + + Sbjct: 383 SSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSAHDVLPQ--------- 433 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-------IVYAIGIRVIRSHEFLRACAS 341 I MTDG C K + G+ + +H FLR AS Sbjct: 434 --IFLMTDGSVDDEH------NICQTVKTELISRGSKSPRISTFGLGLYCNHYFLRMLAS 485 Query: 342 PNS-FYLV 348 Y Sbjct: 486 IGRGHYDA 493 >gi|218200012|gb|EEC82439.1| hypothetical protein OsI_26857 [Oryza sativa Indica Group] Length = 863 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 57/188 (30%), Gaps = 42/188 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM+ ++ +++ L E+ N ++TF++++ F Sbjct: 443 VVFVIDTSGSMQGHP------LENVKNAMSTALSELTEGDYFN------IITFNDELHSF 490 Query: 234 FL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E ++ I G T+ L A + + Sbjct: 491 SSCLEKVNEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSAHDVLPQ--------- 541 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-------IVYAIGIRVIRSHEFLRACAS 341 I MTDG C K + G+ + +H FLR AS Sbjct: 542 --IFLMTDGSVDDEH------NICQTVKTELISRGSKSPRISTFGLGLYCNHYFLRMLAS 593 Query: 342 PNS-FYLV 348 Y Sbjct: 594 IGRGHYDA 601 >gi|251799254|ref|YP_003013985.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247546880|gb|ACT03899.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 562 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 25/170 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + + + V DVS SM+ + ++ ++ L I G Sbjct: 379 KEKKNGNKPITAVFVTDVSGSMD---GEPLNRLKESLLKGQKYLGRDNSI---------G 426 Query: 223 LVTFSNKIEEFFLLE-WGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 LV++S+ + + + + + ++ + G T + G+ A + D Sbjct: 427 LVSYSDGVTIKLPIAKYDTNQQSLFVGAVNSLQSGGGTATFDGIVVALKLLEDQLAT--- 483 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 D N K +I ++DGE Q K +Y IG Sbjct: 484 ----DPNTKPVIFVLSDGETNQGYTLQDIKGLIETYK---IPIYTIGYNA 526 >gi|123454693|ref|XP_001315098.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121897764|gb|EAY02875.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 37/172 (21%), Positives = 67/172 (38%), Gaps = 35/172 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D V+D S SM +I+ A+K + +L+ + + + +V F ++ + Sbjct: 240 DYFFVIDCSGSM------KGDRIEKAVKCMRLILQSLPM------KCRFSIVCFGSEFQT 287 Query: 233 FFLL-EWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + E+ ++ IKY+ T+ L+Y IF GM Sbjct: 288 LLPIVEYNNENVLLAMNLIKYIQAIMGGTDIYHPLEY----IFSQNGMT----------- 332 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 K I +TDGE ++ + E K+ G +Y +GI + +R A Sbjct: 333 KKIFLLTDGE---DSNSEEIIRLVQENKQFG-NIYTVGIGIGADSGLIRNLA 380 >gi|84386030|ref|ZP_00989060.1| hypothetical protein V12B01_13425 [Vibrio splendidus 12B01] gi|84379346|gb|EAP96199.1| hypothetical protein V12B01_13425 [Vibrio splendidus 12B01] Length = 310 Score = 43.6 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 67/206 (32%), Gaps = 16/206 (7%) Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 I + + V + G + + Y+I T + W +R VK+ S+ Sbjct: 34 QPINSALTGRHVSKLRGRGLNFEEMRHYQIGDDIRT-MDWKVTNRT----GKPHVKIFSE 88 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R ++ +++D SM + + A + + + + G + ++ Sbjct: 89 ERER-NVYVMVDQRTSMFFGSTGRMKSVVAAEVAALIAWKVIDSTD------RVGAIIYN 141 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + +H+ + + ++ + N H + Sbjct: 142 DSTALPISPQRSANHVLKILNEIASKNQQLKAGKAQDTQSNSF----AKLFHQAQRLVKH 197 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCN 313 +++ +TDG + + ++Q C Sbjct: 198 DGLVILITDGYGYNERSEEQIKALCQ 223 >gi|145502983|ref|XP_001437469.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404619|emb|CAK70072.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 81/206 (39%), Gaps = 19/206 (9%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +T S ++ Q D ++++ D+S SM S F D ++++ +A ++ L + Sbjct: 103 VTESPQIIQQQDPAEAIVVLYDISGSMSSQFFGDKELSRMGAVNAFFSAFADKT-LAFEF 161 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 N++V+ LV F + + + + + + + G ST + YA N++ +++ Sbjct: 162 NHIVK--LVWFGSTLFDKCEFTSDFNKFIKLVDDANP-GGSTKCYDAIDYAINKLLEVK- 217 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV--YAIGIRVIRSH 333 D + I+ +TDGE+ ++ +L I+ + +G + Sbjct: 218 ----QKYPDIVLR--ILALTDGEDNASASKPNTLV--QRIFDHKIIIDSFVVGDNCVGLK 269 Query: 334 EFLRACASPNSFYLVENPHSMYDAFS 359 AS Y + F Sbjct: 270 TLTH--ASNGRCYCPRDLGQGMSLFE 293 >gi|239990323|ref|ZP_04710987.1| hypothetical protein SrosN1_23653 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 50/140 (35%), Gaps = 30/140 (21%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-- 232 + VLD S SM ++ ++N + + + Q L+ F + +++ Sbjct: 347 VYVLDTSGSM------KGRRLAQLKSALNGLTGDFR------EREQVTLLPFGSTVKQVR 394 Query: 233 -----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G + ++ LS G T L AY+ H + + Sbjct: 395 THTVDPADPKAGPAAIRADAAALSAEG-DTAIYSSLAAAYD----------HLGPDTESA 443 Query: 288 KKIIVFMTDGENLSTKEDQQ 307 IV MTDGEN + + + Sbjct: 444 FTSIVLMTDGENTAGRSAAE 463 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 36/77 (46%), Gaps = 4/77 (5%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G ++++ A+ + + + + I+V H+F + L +++D + + AA Q+ + Sbjct: 17 GAISVMAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQL----KRADSAA 72 Query: 77 LKGGDILCRIKNTWNMS 93 +L ++N + Sbjct: 73 NLNAAVLGTVRNIGAKN 89 >gi|56696061|ref|YP_166415.1| hypothetical protein SPO1165 [Ruegeria pomeroyi DSS-3] gi|56677798|gb|AAV94464.1| conserved domain protein [Ruegeria pomeroyi DSS-3] Length = 257 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 11/162 (6%) Query: 173 DMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D MIV D S SM + +I A ++++ +P + + + GLVT+ + Sbjct: 41 DAMIVFDASGSMAEMGYNGLDRPRILDAREALH------DALPRIAALRRLGLVTYGAAM 94 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + G N ++ N + + Sbjct: 95 DGDADGDLCKRVSMPFTPSPNAAGQILNLIDAIEPDGNTALTDAVNLAARVFDQPPRPGV 154 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKR-GAIVYAIGIRVIR 331 IV +TDG+ T + A+ G V+ IG RV Sbjct: 155 IVLVTDGD--ETCGGAPCALAADLARDTPGLTVHVIGFRVRS 194 >gi|91977980|ref|YP_570639.1| TadE-like [Rhodopseudomonas palustris BisB5] gi|91684436|gb|ABE40738.1| TadE-like [Rhodopseudomonas palustris BisB5] Length = 181 Score = 43.6 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI-DRSLV 59 +R F N +G I A+ PI F +L IIEV+ +FF VL + + D + + Sbjct: 13 MRRFGRNRRGSAAIEFALIAPIFFGLLFAIIEVALMFFAGQVLETAVQDSARL 65 >gi|326472319|gb|EGD96328.1| von Willebrand RING finger domain-containing protein [Trichophyton tonsurans CBS 112818] Length = 1004 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 83/267 (31%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N + + W + N+ +++ ++ Y I Sbjct: 379 SVNELPRFHLRFSTLHQLDIWKRAL------GDLQNEGVVTRPDPEVEVPASEED-DYRI 431 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITS---SVKVNSQTDARLDMMIVLDVSRSMES 186 S + + + +S TS S LD+++V+ VS SM+ Sbjct: 432 SKTPSKRESSGYSSSYGAGKSSNTATTDYTSPGREPIPVSAFHVPLDIVVVIPVSSSMQG 491 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + ++ +++ + + GLVTF + + W G Sbjct: 492 L------KISLLRDTLRFLVQNLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 539 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 540 GKIFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 590 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 591 DPESGDFVVSRAEAAK-VGIHTFGLGL 616 >gi|189347764|ref|YP_001944293.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341911|gb|ACD91314.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 340 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 60/166 (36%), Gaps = 18/166 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LDVS SM + D +++ A + + V+ + L+ F+ Sbjct: 90 DLVFMLDVSNSMLA-RDVLPDRLEAARDAALRIGGSVREG-------RRALLLFAGSPLV 141 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + T+ P ++ A G + A ++ Sbjct: 142 QCPLTYDREAFSALLGMATPALIEEQGTSFLPAVELALKLFT---GSVPLDSDGTAEGER 198 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 I+V ++DGE+ + + + ++ G ++ +G + Sbjct: 199 IVVLLSDGEDH----EGATAAAAAKLRRNGVSLFVLGFGSRNGADI 240 >gi|294340316|emb|CAZ88697.1| putative Rubisco activation protein CbbO [Thiomonas sp. 3As] Length = 752 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 72/205 (35%), Gaps = 32/205 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + + ++LD+S S+ S T ++++ ++++ + I + + + Sbjct: 554 SHRTNGRDIAVTLLLDLSASLNDKVKGSDQTILELSQEAVSLL---AWAIDRLGDPLAIA 610 Query: 223 LVTFSNKIE------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + E + F WG ++ ++ + ST L++A Sbjct: 611 GFQSNTRHEVRYQHFKGFGEPWG-DEVKARLAAMQAN-YSTRMGAALRHA---------- 658 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 H + KK+++ +TDGE + L +A++ +R + + Sbjct: 659 -AHSLRARKSDKKLLLILTDGEPADIDVQDERLLI-EDARQ--------AVRELERNGLF 708 Query: 337 RACASPNSFYLVENPHSMYDAFSHI 361 C S + ++ I Sbjct: 709 TYCISLDPRADAYVSDIFGRRYTVI 733 >gi|256395173|ref|YP_003116737.1| Magnesium chelatase [Catenulispora acidiphila DSM 44928] gi|256361399|gb|ACU74896.1| Magnesium chelatase [Catenulispora acidiphila DSM 44928] Length = 719 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 48/131 (36%), Gaps = 19/131 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ +I ++L + + GLVTF + E Sbjct: 527 VLFVVDASGSM-----AARQRMAAVKGAILSLL-----LDAYQRRDKVGLVTFRGREAEV 576 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 W V ++ L G + + L+ A + + MR +++ Sbjct: 577 ALPPTWSVDAGAARLTSLPTGGRTPLAAGLLQAA--DVLRVGRMRDPNRRP------LVL 628 Query: 293 FMTDGENLSTK 303 +TDG Sbjct: 629 VITDGRATQAS 639 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 32/87 (36%), Gaps = 10/87 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS----------MIDRSLVH 60 F + G I+TA+ + + GM ++ +H ++T L++ V Sbjct: 8 FISDRSGNFGIMTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVA 67 Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIK 87 AA + G + K + ++ Sbjct: 68 AAMAMNGNGTISLGKTDARDIFMSQVS 94 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 29/50 (58%), Gaps = 1/50 (2%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + VLG +++ ++FF++ L + D + + A ++ Sbjct: 1 RGSFALVAAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 49 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 58/176 (32%), Gaps = 28/176 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNNVV 219 V + T +++VLD S SM + + I K++ A + ++++ I + Sbjct: 301 VQASTGDECRVVLVLDTSGSMGT--SNRIDKVNSAATAFVNLVDDGISIGIVTFTGSPTT 358 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + L + + + L R I L+ G T GL+ + Sbjct: 359 RHALTQINTQADRDSL---------RDIFQLTASG-GTCIGCGLEQGLEVLMAHPSGSAD 408 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 IIV MTDG++ + + G V + I E Sbjct: 409 GG--------IIVLMTDGQDSGIQNHI----IRQTLQDMGVRVNTVAIGEDAYGEL 452 >gi|115665362|ref|XP_001180845.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] gi|115941435|ref|XP_001179810.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 763 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 32/178 (17%), Positives = 60/178 (33%), Gaps = 26/178 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 TS + Q L +++VLD S SM+ + K I ++ I N+ Sbjct: 153 TSPNFIVVQPSGSLRIVLVLDTSGSMDG---------ERFNKMIIGAKNFIQSIVPNNSY 203 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVS-TNSTPGLKYAYNQIFDMQG 275 V V F+ + S + RK L + T G++ A Sbjct: 204 VAI--VEFNYGAIVDSNMTELTSAISRKDLASLLPTYADGATCIGCGIQTAIQVAQYNGM 261 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + ++ ++DG+ S +L ++ + G IV++I + Sbjct: 262 DSRGV---------YLILLSDGQENSGTLIADTL---DDIEDSGVIVHSIAFYEADTQ 307 >gi|254427823|ref|ZP_05041530.1| PQQ enzyme repeat domain protein [Alcanivorax sp. DG881] gi|196193992|gb|EDX88951.1| PQQ enzyme repeat domain protein [Alcanivorax sp. DG881] Length = 961 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 40/289 (13%), Positives = 83/289 (28%), Gaps = 99/289 (34%) Query: 165 NSQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 NS+ + +++ ++D S SME +S+ TK+ + + +++ + D GL Sbjct: 27 NSENKPKANVLFLIDTSGSMERDAPNSNRTKMQELQSAFSTVIDNLGSDID------VGL 80 Query: 224 VTFSNKIEE-------FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL------------- 263 F+ + F+ + + +IK + T++TP + Sbjct: 81 AKFNGGYDNSGYGGYVFYPVSEMNDTARAEIKQIVNGLEGTSNTPTMEAYSEAARYMLGY 140 Query: 264 --------------------------------------KYAYNQIFDMQGMRQHCNTEDA 285 KY + ++D + Sbjct: 141 SPTDYAERGEAVENEPRRAVNVERKRVCYQWCQYPGYEKYRWEYVYDSKYDSPTNLKNQC 200 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYY----CN-----EAK---------------KRGAI 321 IIV MTDG+ S + CN +AK ++ Sbjct: 201 ESNHIIV-MTDGDPTSDSDYSSVTKITGGGCNGSYSCQAKLAKWLYNDDKDVEGSRKSVK 259 Query: 322 VYAIGIRVIRSHEFLRACAS---------PNSFYLVENPHSMYDAFSHI 361 + + + V ++ R + +N + F I Sbjct: 260 TWQVALGVGKNSSQARNMKNVAEAGLGDPSAEVRFADNADELAAEFKEI 308 >gi|313216810|emb|CBY38048.1| unnamed protein product [Oikopleura dioica] Length = 1481 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 82/220 (37%), Gaps = 28/220 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + LD++ ++D S S+ S TKI K I+ + + Sbjct: 1129 AVPCLTDDVNEICKEQMLDIVFLIDGSDSITS---DEFTKIQ---KWISKFTSSLSP-SE 1181 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + ++ +V FS+ + + + ++ + + +T + L Y + Sbjct: 1182 LTQALRVVVVQFSDIVLTNVDYQLNTASDLAEFDVRLAVIRQIAKNTMTGKALSYVAEKT 1241 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 A K ++ TDG + ++D ++L ++ + A+G+ Sbjct: 1242 LPS---------LRAESLKTMITFTDGGSRDHIRRDDIKTLQ------EQFDFMLAVGVG 1286 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + LR ++ V+ +++ D +I + + +K Sbjct: 1287 PSARDDELRMLSTKGLSIHVKEYNALADLVENILQSVASK 1326 >gi|110681634|dbj|BAE98272.1| circumsporozoite protein/thrombospondin-related anonymous protein-related protein [Plasmodium gallinaceum] Length = 1875 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 25/171 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNK-- 229 D+ ++LD S S+ K+D + + E+ + + N V G+V F K Sbjct: 79 DLTLILDESASI---------KLDNWVNKVVPFTSEIIENLSISENEVHVGIVLFGIKSR 129 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ +++ L +K+ L T L+Y+ + R Sbjct: 130 DYVKFNENIKYKKEDLLKKVDKLKSNYGNVGGTYIIKALEYSMKNYTKNKNSR------- 182 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + K+ + TDG N + +D+Q +K + IG+ ++ Sbjct: 183 PDAPKVTILFTDG-NDNVAKDKQLYDTGLSYRKENVKLLVIGVSKAIHNKL 232 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 14/170 (8%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQSGLVTFSNKIE 231 D+ +VLD S S+ + K + EE+ +++ + + G++ F+ Sbjct: 517 DLTLVLDESASISDYR---------WKKEVVPFSEELIKNLNISFSKIHVGVMLFAEYNR 567 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 EF VS+ + ++ K + N + K+ Sbjct: 568 EFVRFSDKVSYEKENLQKKIKELQGDYIRGKKSHIINAL-SYALTYYSKYPGRPEAPKVT 626 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--EFLRAC 339 + TDG S + D++ +K + IG+ + + + L AC Sbjct: 627 LLFTDGY-DSDQSDEELYNVGLLYRKNNVKLLVIGVSMANENKLKLLVAC 675 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 67/206 (32%), Gaps = 25/206 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIE 231 D+ ++LD S S+ S + EEV + + + +G++ F+ ++ Sbjct: 294 DLTLILDESASIGS---------KKWKSDVVPFAEEVINNLSISKDKIHAGVMRFAIYMK 344 Query: 232 E----FFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + L + I+ L G T+ L+Y + +H N Sbjct: 345 TDVSYDQEARYMKNDLIKIIRGLKDKYGNGGGTHIVEALEY------SLTNFTKHPN-NR 397 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + K+ + TDG S + ++ + +G+ + + + Sbjct: 398 IDAPKVTILFTDGSENS-RPISAVRDIGLLYRRENIKLLVLGVHSVDINNLKLLAGCKEN 456 Query: 345 FYLVENPHSMYDAFSHIGKDIVTKRI 370 + +D +I K K Sbjct: 457 TDCPFAMKAEWDELKNISKIFTDKIC 482 >gi|325272563|ref|ZP_08138930.1| hypothetical protein G1E_06477 [Pseudomonas sp. TJI-51] gi|324102296|gb|EGB99775.1| hypothetical protein G1E_06477 [Pseudomonas sp. TJI-51] Length = 1125 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 63/204 (30%), Gaps = 11/204 (5%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I S + A + + + + IV ++ V Sbjct: 536 ISTATESVGGSAITTGNIPVTVYPATYKASVGTSGNDAMTGSEGNDIIVADVSGLNVVAG 595 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 + ++ ++D S SM I A + ++ +K L D + V LV Sbjct: 596 KN---YNIAFIVDSSGSMTD------DSIAAAKSQLASVFSTLKASLGSDTSGTVNIFLV 646 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F +++ + + +++ + K G Y + T + Sbjct: 647 DFDSQVNMNVAVNLADPNALSQLQAVLKSMQGGTEGGGTNYEDAFKTTANFFKSTMATGN 706 Query: 285 ANYKKIIVFMTDGENLSTKEDQQS 308 A + F+TDG+ + ++ + Sbjct: 707 AGADNLTYFITDGKPTFYQRNEVT 730 >gi|299138149|ref|ZP_07031329.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600079|gb|EFI56237.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 349 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 50/266 (18%), Positives = 87/266 (32%), Gaps = 49/266 (18%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 +S+ S Y L W + + T + + L + I+LD S S ++ Sbjct: 64 FSVRDKSGYVTGLHQQDCGLWEDGQQQTIKNFT------QEKNLPLTIGILLDTSGSQKN 117 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + S L EV D L++F ++ L+R Sbjct: 118 VLP-------LEQDSGARFLSEVLKPKDEAF-----LISFDVNVDLLSDYTNSAHELKRA 165 Query: 247 IKY-----------LSKFGVST--NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 I + G S P Y+ ++ + H + +KI+V Sbjct: 166 IDKASINAASSSAGVPGIGGGPFPTSHPRGTLLYDAVY----LAAHDKLQSQTGRKILVL 221 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI----------GIRVIRSHEFLRACASPN 343 +TDG + ++E +S A+K AI+Y I G E L + Sbjct: 222 LTDGGDQGSQETLKSAT--EAAQKANAILYVILIADRANFSYGFNADGQMEQLAH-ETGG 278 Query: 344 SFYLV-ENPHSMYDAFSHIGKDIVTK 368 V N + +AF I ++ T+ Sbjct: 279 RVINVGNNGKKLEEAFDQIQDELRTQ 304 >gi|254172494|ref|ZP_04879169.1| PKD domain protein [Thermococcus sp. AM4] gi|214033423|gb|EEB74250.1| PKD domain protein [Thermococcus sp. AM4] Length = 1418 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 75/229 (32%), Gaps = 47/229 (20%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 +++ S + + D++ V D + SM+ +I +++N ++E + Sbjct: 71 ATIIQKSGSISPADIVFVFDDTGSMDD-------EIGTMKRNVNELVESL---EGYGIRA 120 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYA----YNQIFD 272 + LVTF + VS + + L G + + A Y+++ Sbjct: 121 RYALVTFKDSPSLRLPFTTNVSLFTQTVSKLYASGGGDTPEDDLDAIAMALRLNYSRLS- 179 Query: 273 MQGMRQHCNTEDANYKKIIVFMTD------GENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +KI++ +TD G+ + +++G ++ + Sbjct: 180 ---------------QKILILITDAPTHYAGDGSGYSDYTIP-EIAEMLREQGVLLIVVS 223 Query: 327 --IRVIRSHEFLRACA--SPNSFYLVENPH---SMYDAFSHIGKDIVTK 368 I +R A + + + + + IG+ + Sbjct: 224 PNFGSINPKNDVRELAVLTGGLWIDIHSADFGRILQKVIDSIGESYEIQ 272 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 91/270 (33%), Gaps = 33/270 (12%) Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN--------SQT 168 + P + + + S + + + + T+ + ++ + Sbjct: 881 TVRNSPVSAVITYTPTSNQQHLISPFGLNGKFVIEYDVFRDRTTEMVIDQSYFAHFITSN 940 Query: 169 DARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ ++DVS SM KID +++N +L + + + + V+ Sbjct: 941 LPPMSKRVVFLIDVSGSMFGI------KIDQVRQAMNTILHGLAETDFFSVIAFNSSVSR 994 Query: 227 SNKIEEFFLLEWGVS-HLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + +L G + ++ + +L+ T+ ++ A Q+FD Sbjct: 995 WSPSGTAAVLASGTTANINSAMNFLNTTVVTRGGTDILQAVEAAI-QLFDSAATGGTNTA 1053 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACAS 341 D +V +TDG ++ RG + IG + LR A+ Sbjct: 1054 SD-----FMVLLTDGRPTDGTVSSTAIISAIRNLNRGRFGINTIGFGTLVDMNLLRKIAA 1108 Query: 342 PNS------FYLVENPHSMYDAFSHIGKDI 365 NS F + + + + + I + I Sbjct: 1109 QNSGTSIQIFIDLNSYAQISNFYEEISQPI 1138 >gi|189190514|ref|XP_001931596.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973202|gb|EDU40701.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 1331 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 69/189 (36%), Gaps = 17/189 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L ++LD + S S + +T++D+ + +A + V + GLVTF Sbjct: 1044 NASQPLVFKLLLD-TLSTPSNETNHLTRLDVLKQMFDAYINRVLA---YSFQPHIGLVTF 1099 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + K + + V + + K+ L+ +G + ++ + Q Q + N Sbjct: 1100 NTKTQVAQKITNAVENSRHKLNNLAAYGDTA--------IWDSVALAQDQIQQHAKQYPN 1151 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNS 344 K I+ ++DGE+ + G +V + + + L+ +S Sbjct: 1152 AKLRIICISDGEDN--TSLNTVEDVAKRLTRCGIVVDSFCLGN-TGNLRLQTLSSLTEGY 1208 Query: 345 FYLVENPHS 353 + + Sbjct: 1209 VFAPKTLDE 1217 >gi|123475372|ref|XP_001320864.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121903678|gb|EAY08641.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 66/172 (38%), Gaps = 35/172 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D V+D S SME +I A+K ++ ML+ + + + + F + ++ Sbjct: 240 DYFFVIDCSGSMEG------KRIQKAVKCLHLMLQSLPM------KCRFSIFCFGTEFKQ 287 Query: 233 FFLLE-WGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++ IK + TN LK IF ++GM Sbjct: 288 IMPIAEYNNENVLLAMNMIKNIQADMYGTNIYDPLK----CIFSLEGMT----------- 332 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 K I +TDGE + ++ L E KK G +Y +GI +R A Sbjct: 333 KKIFLLTDGEVNNA---EEILNLAEENKKFG-NIYTVGIGSGADQCLIRDLA 380 >gi|72160717|ref|YP_288374.1| protoporphyrin IX magnesium-chelatase [Thermobifida fusca YX] gi|71914449|gb|AAZ54351.1| protoporphyrin IX magnesium-chelatase [Thermobifida fusca YX] Length = 673 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 59/154 (38%), Gaps = 20/154 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 +++ +D S SM ++ ++ ++ A+L + + GL+TF +++ Sbjct: 492 VLLCVDASGSM-----AARRRMSQVKTAVLALL-----LDAYRRRDRVGLITFRNHEATL 541 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK A + D + + ++V Sbjct: 542 TLPPTGSVDVAAARLRELPAGGRTPLAEGLLKAA--------ETLRRERLRDPDRRALLV 593 Query: 293 FMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAI 325 +TDG K ++SL+ + + G I Sbjct: 594 VVTDGRATAGPKAVERSLHAADHLAREGVSALVI 627 >gi|313812314|gb|EFS50028.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] Length = 323 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 53/156 (33%), Gaps = 18/156 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ LDVS SME + + IN ++ I D + G V F + Sbjct: 95 DVMLCLDVSGSME----------GVDRQVINTYIQLADHISDD----RIGFVMFDSSAVT 140 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR---QHCNTEDANYKK 289 F L ++ +K + + P ++Y G+ + D + Sbjct: 141 VFPLTHDRDSVKAGLKQAGERLGRADLDPAVRYGPGGSLVGDGLASCVSRFDQLDQPRSR 200 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 IV TD + A KR +V+ I Sbjct: 201 SIVLATDNMVAGPSVYTVPQAV-DLAVKRHIMVFGI 235 >gi|302894803|ref|XP_003046282.1| hypothetical protein NECHADRAFT_33272 [Nectria haematococca mpVI 77-13-4] gi|256727209|gb|EEU40569.1| hypothetical protein NECHADRAFT_33272 [Nectria haematococca mpVI 77-13-4] Length = 728 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 40/109 (36%), Gaps = 14/109 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D + SM S+ I A + +++++ V+ +V + + ++ Sbjct: 63 DLLFLIDTTGSMYSY-------IKAARDQVKRIMDDITKAFFNEAEVRIAVVGYKDHADK 115 Query: 233 FFL----LEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQ 274 + + ++ I G + G+ A N + Q Sbjct: 116 PNIQFLDFTTSIDDVRSFIDKFKASGGGDAPEDVLGGIDQAINATWKNQ 164 >gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter fumaroxidans MPOB] gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter fumaroxidans MPOB] Length = 680 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 46/249 (18%), Positives = 90/249 (36%), Gaps = 42/249 (16%) Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 Y + ++ L P + +V+ +S+K ++ +D VLD+S SM Sbjct: 261 YRLDDEVPARVELIPYRKGPDSAGTFMVVVTPAASLKRIAE---GVDWTFVLDISGSMTG 317 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQ 244 KI I+ ++ +L ++ N+ + VTF+ +F + ++Q Sbjct: 318 ------RKITTLIEGVSRVLGKM----SANDRFRI--VTFNTTAADFTGGYVPASPENVQ 365 Query: 245 RKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLS 301 ++ + + G ST GL AY ++ D + IV +TDG N+ Sbjct: 366 TWMQRVKQIQAGGSTALFDGLDLAY-RLLDGERTTG------------IVLVTDGVCNVG 412 Query: 302 TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + L K+ ++ I + + A + + + S I Sbjct: 413 PTRHDEFLGL---LKQHDVRLFTFVIGNSANQPLMDRLAKESGGFAMNVSES-----DDI 464 Query: 362 -GKDIVTKR 369 G+ I K Sbjct: 465 AGRLIQAKA 473 >gi|149049604|gb|EDM02058.1| similar to putative voltage-gated calcium channel alpha(2)delta-4 subunit (predicted) [Rattus norvegicus] Length = 700 Score = 43.6 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 62/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++I++D+S SM+ ++ +A ++ +L+ + N+ V + Sbjct: 293 ATSPKDIVILVDMSGSMKGL------RMAIAKHTVTTILDTL----GENDFVNIIAYNDY 342 Query: 227 SNKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + IE F H ++ + L GV S L A+ + Q RQ Sbjct: 343 VHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGIVS-QALIEAFQILKQFQESRQGS 401 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + ++ + + R V+ I Sbjct: 402 LCNQA-----IMLVTDGAVEDYEPVFETYNWPD----RKVRVFTYLIGR 441 >gi|289679370|ref|ZP_06500260.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae FF5] Length = 120 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 24/136 (17%) Query: 176 IVLDVSRSME----SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +DVS SM+ + ++++ + + + LE K + GL+ F + Sbjct: 1 MAVDVSGSMDYPDMQWKSDEVSRLVLVQQLLGDFLEGRKGD-------RVGLILFGTQAF 53 Query: 232 EFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + ++ + + G +T + A ++ + Sbjct: 54 VQAPLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL-----------RMRPATSR 102 Query: 290 IIVFMTDGENLSTKED 305 +V +TDG N + + D Sbjct: 103 ALVLVTDGANNAGQID 118 >gi|123232279|emb|CAM16354.1| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H family (plasma Kallikrein-sensitive glycoprotein) (ITIH) [Danio rerio] Length = 860 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 85/256 (33%), Gaps = 30/256 (11%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 S+ +G S I +Y + LK + + + V + D+ Sbjct: 237 SIQQQRNVSPKGLSADFIIQYDVELKDPMGDIQVDDGYFVHYFAPRGLPVVPK-----DV 291 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V+D+S SM TKI ++ ++L +++ N L+TFS+ + + Sbjct: 292 IFVIDISGSM------IGTKIKQTKAAMVSILSDLREGDYFN------LITFSDDVHTWK 339 Query: 235 LLEW------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 V + ++ + G + + L A + + Sbjct: 340 KDRTVRATRQNVRDAKEFVRKIIAAGWTNINAALLSAAKLLNPSTRSSSSTGRAPSSQRV 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPN---- 343 +I+F+TDGE + + + + N K G ++ + LR A N Sbjct: 400 PMIIFLTDGEATIGETETDVILH-NAQKSLGLVSLFGLAFGDDADFPMLRRLALENRGVA 458 Query: 344 -SFYLVENPHSMYDAF 358 Y ++ F Sbjct: 459 RMVYEDDDAAIQLKGF 474 >gi|153006798|ref|YP_001381123.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] gi|152030371|gb|ABS28139.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] Length = 381 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 12/79 (15%), Positives = 32/79 (40%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + ++ AI L ++ + + + V H+ ++ L + D + A + +G Sbjct: 7 RRERGAVAVIVAIVLLVLGGFMALSLNVGHLLSVRGELQNASDAGALGGAWDLDCTVDGI 66 Query: 73 NRKKLKGGDILCRIKNTWN 91 + ++ D R + Sbjct: 67 AKARVAALDYATRHSTDYQ 85 >gi|307719359|ref|YP_003874891.1| MoxR protein [Spirochaeta thermophila DSM 6192] gi|306533084|gb|ADN02618.1| MoxR protein [Spirochaeta thermophila DSM 6192] Length = 289 Score = 43.6 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 23/98 (23%), Positives = 46/98 (46%), Gaps = 9/98 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L + +V+DVSRSM F + +K D + + L+ NN + G + F++++E Sbjct: 77 LVLFLVVDVSRSMR-FGSNQYSKFD----YLEILFSIFSLVTLENND-RVGALFFTDEVE 130 Query: 232 EFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYA 266 + L G +H+ ++ + G ++ L+ A Sbjct: 131 DVILPRKGKTHVLALLRRFRDMRPRGRGSDLALALRTA 168 >gi|239943867|ref|ZP_04695804.1| hypothetical protein SrosN15_22911 [Streptomyces roseosporus NRRL 15998] gi|291447330|ref|ZP_06586720.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291350277|gb|EFE77181.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 396 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 50/140 (35%), Gaps = 30/140 (21%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-- 232 + VLD S SM ++ ++N + + + Q L+ F + +++ Sbjct: 216 VYVLDTSGSM------KGRRLAQLKSALNGLTGDFR------EREQVTLLPFGSTVKQVR 263 Query: 233 -----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + G + ++ LS G T L AY+ H + + Sbjct: 264 THTVDPADPKAGPAAIRADAAALSAEG-DTAIYSSLAAAYD----------HLGPDTESA 312 Query: 288 KKIIVFMTDGENLSTKEDQQ 307 IV MTDGEN + + + Sbjct: 313 FTSIVLMTDGENTAGRSAAE 332 >gi|326927888|ref|XP_003210120.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Meleagris gallopavo] Length = 1069 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 229 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 276 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + + Q Sbjct: 277 EELHYVEPCLNGTLVQADRTNKEHFREHLDKLFAKGIG-MLDIALNEAFNMLNEFNHTGQ 335 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 336 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 386 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 387 WMACANKGFFTQ 398 >gi|325964113|ref|YP_004242019.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] gi|323470200|gb|ADX73885.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] Length = 352 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 27/74 (36%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +G TIL A+ + + + ++V ++ K + + D + + A + N N Sbjct: 22 ERGAATILVAVLMVALLGCAALAVDVGAMYAEKAQIQNGADATSLAIAEECANGVNCAVA 81 Query: 75 KKLKGGDILCRIKN 88 + N Sbjct: 82 MAAPANRLADANAN 95 >gi|228982551|ref|ZP_04142810.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] gi|228776734|gb|EEM25042.1| hypothetical protein bthur0002_56840 [Bacillus thuringiensis Bt407] Length = 690 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 53/165 (32%), Gaps = 19/165 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVVQSGLVTFSNKIEEF 233 +++D S SM D + K+ + K+ + E +K IP + + + + Sbjct: 504 YMLVDGSGSMMDRVDENNNKVSCSQKACAVIEEGIKGFIPFKSTIFR--ASGRNVMHTTV 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + Y G + ++ A ++ + +++V Sbjct: 562 SEFSNDSKFNESWNAYFEANGGN-MDGFSIRIAIKELL-----------QRPERNRLLVI 609 Query: 294 MTDGENLSTKE----DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++DG + EA+K G V +I R E Sbjct: 610 LSDGLPSAYPSQKLGQADVKDAVKEARKNGIKVVSICFGSKRHRE 654 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 15 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 64 >gi|311694769|gb|ADP97642.1| von Willebrand factor, type A [marine bacterium HP15] Length = 581 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 81/226 (35%), Gaps = 45/226 (19%) Query: 154 IVMPITSSVKVNSQTDARL----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 V+ + + + D +L D+ I++D+S SM+ ++ A++ + +L E Sbjct: 19 FVLSLPAGLNAQESDDVQLPEQSDVRIIVDISGSMKDTDPDNLR--QPAVRLLARLLPEG 76 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 G+ TF + W +QR + ++ + TN ++ Sbjct: 77 ASA---------GVWTFGQYVNMLVPHREVNNAWREMAIQRSAQ-INSVALRTNLGAAIE 126 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK-------- 316 A + F D + I +TDG+ D + E + Sbjct: 127 TASDDYFTD---------GDLSRTHFI-LLTDGK--VDISDDPAKNTAEETRILDTIVAD 174 Query: 317 --KRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 +RGA + + + +FL+A A S F + + ++ AF Sbjct: 175 LIERGATFHPVALSEAADTDFLKALATDSGGRFQVADTADALNLAF 220 >gi|224066048|ref|XP_002192868.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha2/delta subunit 3 [Taeniopygia guttata] Length = 1090 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 250 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 297 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + + Q Sbjct: 298 EELHYVEPCLNGTLVQADRANKEHFREHLDKLFAKGIG-MLDIALNEAFNMLNEFNHTGQ 356 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + +++ R ++ I + + L+ Sbjct: 357 GSICSQA-----IMLITDG----AVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLK 407 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 408 WMACANKGFFTQ 419 >gi|118591412|ref|ZP_01548810.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118436084|gb|EAV42727.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 657 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 68/211 (32%), Gaps = 37/211 (17%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +QTD D++ VLD S SM D K ++A + + + + ++G+ Sbjct: 16 ARAQTDTSPDLLFVLDSSNSMWGQIDG-TAKAEIARSAFEGFVAGLP------DGTRAGV 68 Query: 224 VTFSNK-------IEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + + ++ +E + + L +K L+ G + + + Sbjct: 69 MAYGHRRKADCGDVETLVPVSDLDRAKLVESVKALTPRGKTPIT--------ETLRQAAE 120 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSH 333 + + ++ ++DG G + IG + Sbjct: 121 LLAQNDRPGR-----LILISDG---IETCGGDPCALAEALASSGVDFKAHVIGFDIASKA 172 Query: 334 EFLR-ACA---SPNSFYLVENPHSMYDAFSH 360 + + AC + +++ + + +A Sbjct: 173 DQAKIACIAHLTGGTYWNARDADGLNEALKE 203 >gi|118096699|ref|XP_414253.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Gallus gallus] Length = 886 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 42/281 (14%), Positives = 97/281 (34%), Gaps = 39/281 (13%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF--VNDIDDIVRSTSLDIVVVPQNE 125 E G + + +G I ++N SF ++ F D + S ++ Sbjct: 190 EPQGISELEAEGTFITNDLQNVIKKSFSHKKGHISFKPTLDQQRTCENCSQSLLDGDFIV 249 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 Y + + + + + + ++ +++ ++D+S SM Sbjct: 250 KYDVKRTTPDNLQI-VNGYFVHFFAPTNLP-------------KLPKNVIFIIDISGSM- 294 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 S +I+ +++ +L+++K N + F + + W + ++ Sbjct: 295 -----SGREIEQTREALLKILDDIKEDDHFN------FILFGSDVHI-----WKETLIKA 338 Query: 246 KIKYLSKFGVSTNS--TPGLKYAYNQIFDMQGMRQHCNTEDANYKK---IIVFMTDGENL 300 + L + S T G+ Y I M + + K+ II+ +TDG+ Sbjct: 339 TPENLDEARKFVRSIDTEGMTNLYGGIMKGIDMLNAAHEGNLVPKRSASIIIMLTDGQPN 398 Query: 301 STKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 + Q + +A + +Y +G + FL A Sbjct: 399 VGISNTQDIQTHVKKAIEGKYTLYNLGFGYGVDYNFLEKMA 439 >gi|51244368|ref|YP_064252.1| hypothetical protein DP0516 [Desulfotalea psychrophila LSv54] gi|50875405|emb|CAG35245.1| conserved hypothetical protein [Desulfotalea psychrophila LSv54] Length = 3413 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 58/147 (39%), Gaps = 16/147 (10%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLI 212 + I+++++ +S ++++IVLD S SM+ + T+I++A +++++M + + Sbjct: 2669 VGTNISNNIEDSSAASQTINLIIVLDKSGSMDYSLEGTDSTRIEVAKEALSSMFDSYDNL 2728 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +VN V F + E+ + + T L + Sbjct: 2729 GNVN----IQFVPFDSNAEKSEWYIDDKHGANTYLDNIIPDNGGTCYDNALDKTAEGYWA 2784 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGEN 299 + ++ F++DGE Sbjct: 2785 PPAAKT-----------LVYFISDGEP 2800 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 71/197 (36%), Gaps = 38/197 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD S SM +++ ++ L + V N G+V F + Sbjct: 70 VCLVLDKSGSM-----GGKDRLNRMNQAAKHFL-----LQTVENGSWVGMVHFDSTATVV 119 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + + T+ G+KYA+ I + + + Sbjct: 120 NKLIQIKSSDERNTLMAGLPTYPR--GGTSICSGIKYAFQVI---GELHSQLDGSE---- 170 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFY 346 ++ +TDGE+ + C +E K+ GAIV+ I + + + + S + Sbjct: 171 --VLLLTDGEDNTASS-------CIDEVKQSGAIVHFIALGRDADEAVIEMSKITGGSHF 221 Query: 347 LVENPHS---MYDAFSH 360 V + + DAF Sbjct: 222 YVSDEAQNNGLIDAFGA 238 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 79/219 (36%), Gaps = 38/219 (17%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++ IT + T D + VLD+S SM K+ ++ L +++ Sbjct: 286 TFMLSITPGDDLPPITTGS-DWVFVLDISGSMS-------AKLATLGDGVSQALGKLRGG 337 Query: 213 PDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 + +V F ++ EE ++ +++++ K + + TN GL A N Sbjct: 338 D------RFRIVLFDDRAEELTSGFVDATPNNIRQYTKKVMQLQSRGGTNLFGGLSLALN 391 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + DA+ IV +TDG K Q+ + + + ++ + Sbjct: 392 PL-------------DADRPTGIVLVTDGVANVGKTQQK--DFIDLLENHDVRLFTFVMG 436 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L A + ++ + + +S I I+ Sbjct: 437 NSANRPMLTAMTNASNGFAISVSNS-----DDIAGQILN 470 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 14 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 63 >gi|317419026|emb|CBN81064.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Dicentrarchus labrax] Length = 836 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 62/168 (36%), Gaps = 22/168 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+D S SM K+ +++ A+L+++ L+ F ++I+ Sbjct: 223 VVFVIDRSGSMR-------RKMVQTREALLAILKDLHEED------YFALIQFDDRIDSW 269 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L + ++ + + Y+ + T+ + + R+ + + Sbjct: 270 QKSLTKATKENVDQAMIYVQQINYRGGTDINQAVLTGVEMLL---KDRREKKLPERSVD- 325 Query: 290 IIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 +I+ +TDG N + A + ++ +G + FL Sbjct: 326 MIILLTDGMPNSGESHLPRIQENVRSAIRGNMSLFCLGFGNDVDYSFL 373 >gi|221112074|ref|XP_002154802.1| PREDICTED: similar to HyTSR1 protein [Hydra magnipapillata] Length = 1620 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 37/203 (18%), Positives = 63/203 (31%), Gaps = 34/203 (16%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 N S Y I + C + V S + + VLD S Sbjct: 108 SNSTSSYQCTPDYMISYEPCNIA--------------ACSIVKSCSSINFTFVFVLDSSS 153 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 S+++ + + A + VN V +V F + S+ Sbjct: 154 SVDALQWQDEKNL------VLAFVNS--SSFGVNPNVDVAVVNFGKTAKVVADCGTFKSY 205 Query: 243 --LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-EN 299 + + L+ G T GL A + Q C + +I+ +TDG EN Sbjct: 206 STFETFMNNLNLLGGGTAINQGLLAA-------EIAFQRCQKLNLEP--VIILLTDGFEN 256 Query: 300 LSTKEDQQSLYYCNEAKKRGAIV 322 + + ++ N KK +V Sbjct: 257 IDNQTPDSNIANENRIKKEALLV 279 >gi|330503959|ref|YP_004380828.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] gi|328918245|gb|AEB59076.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] Length = 3977 Score = 43.6 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 31/137 (22%), Positives = 58/137 (42%), Gaps = 15/137 (10%) Query: 174 MMIVLDVSRSMESF---FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +M+VLD+S SM+ TK+ +A ++ +++ + DV +V+ VTF+N Sbjct: 3157 LMVVLDLSGSMDDAPSGVSGFSTKLALAKDAVQRLIDSYDNLGDV--MVRI--VTFANTA 3212 Query: 231 EEFFLLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + S + + L + STN L A N +D G + +Y Sbjct: 3213 SAVGNVWMTASDAKAWLTALANNAGNGSTNYDDALIKAMNA-YDSTGKLTGTGVQSVSY- 3270 Query: 289 KIIVFMTDGENLSTKED 305 F++DG+ + + Sbjct: 3271 ----FLSDGQPTLSNAN 3283 >gi|227329343|ref|ZP_03833367.1| hypothetical protein PcarcW_19177 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 382 Score = 43.6 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 51/331 (15%), Positives = 98/331 (29%), Gaps = 68/331 (20%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 E+SH+F L +M + A + G N + DI+ ++ + + R Sbjct: 103 EISHLFNDPQALQAMEPTPALAKA-LLSLRGRMNEETREAVRDIIRKVVDEILRTLRP-- 159 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 F N + + QN + + + ++ ++ Sbjct: 160 ---TFTNALTGRCNRFRRSPIPSSQN----------FDWRATIAANLKHFDREKNRLVIE 206 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESF--FDSSITKIDMAIKSINAMLEEVKLIPDVN 216 T Q D+++ +D S SM S + + I A+ + Sbjct: 207 TPHFNSRMQRHMPWDVILCVDQSASMSSSVMYAAVCASILAALPA--------------- 251 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 V+ L+ F ++ + L + + + G TN ++Y Q+ Sbjct: 252 --VRVSLIVFDTQVVDLSHLAHDPVDVLMTV----QLGGGTNIAKAMQYCEQQV------ 299 Query: 277 RQHCNTEDANYKKIIV-FMTDGENLST-----------KEDQQSLYYCNEAKKRGAIVYA 324 N K+ IV ++D E Q +L VY Sbjct: 300 --------QNPKRTIVTLISDFEEGGALNRLLSCVQRMHSQQITLLGLAALDDAAQPVYD 351 Query: 325 IGIRVIRSHEFLRACA-SPNSF--YLVENPH 352 I + + A +P F +L E Sbjct: 352 TAIGQKLADRGMHVAALTPEHFAQWLAEVMR 382 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 43.6 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 13/98 (13%) Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV----IRSH 333 H N + KKI++ +TDG+ + + + +A+K G I YAIG+ + Sbjct: 5 HHKNGARKSAKKILIVITDGQKYKDPLEYRDVIP--QAEKAGIIRYAIGVGRAFQEPTAR 62 Query: 334 EFLRACASP---NSFYLVENPHSMYDAFSHIGKDIVTK 368 + L S + + V+N + A S I K + K Sbjct: 63 QELNTIGSAPPQDHVFKVDN----FAALSSIQKQLQEK 96 >gi|328473926|gb|EGF44743.1| putative RTX toxin [Vibrio parahaemolyticus 10329] Length = 1540 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 76/259 (29%), Gaps = 29/259 (11%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 T +N L N + + N D + + V Sbjct: 939 SGLTLEVNGKTVTENSDGSYTVDLSSEPNQTSHQLSGTITVPADYNGSLDFDVTATAGSV 998 Query: 120 VVPQNEGYSISAISRYKIPLKFC------TFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 V N S +F I ++ IV + V Q + Sbjct: 999 EVNNNTQMGADTASVSVRDYEFVSGTHGDNNIVGSDDNDVIVGDVQGLQIVEGQD---YN 1055 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM ++T++ +N ++E + V L TFS + ++ Sbjct: 1056 IAFMLDTSGSMGYDVGRAVTELKTV---LNTLIESASG--PHSGKVNVLLTTFSTESKQV 1110 Query: 234 FLLEWGVSHLQRKIKY-----LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ + + +++ + + +TN G + A N ++ N Sbjct: 1111 LELDLSSDNAKSQVESILDAIVKRGDGNTNYEAGFQSALNWF-------DSADSGATNLS 1163 Query: 289 KIIVFMTDGENLSTKEDQQ 307 I +DG + Sbjct: 1164 YFI---SDGRPNQATVNDA 1179 >gi|1575519|gb|AAC47463.1| thrombospondin-related anonymous protein [Plasmodium vivax] Length = 510 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 68/179 (37%), Gaps = 30/179 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDV-----NNVVQS 221 +D+ +++D S S+ + + ITK ++ I S++ + + L ++ +++ Sbjct: 3 SVDLYLLVDGSGSIG--YPNWITKVIPMLNGLINSLSLSRDTINLYMNLFGNYTTELIRL 60 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 G I++ L + Y +TN T L+ + D Sbjct: 61 GS---GQSIDKRQALSKVTELRKSYSPY-----GTTNMTAALEEVQKHLNDR-------- 104 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++++ MTDG +L + K+R + IGI +H+F R A Sbjct: 105 VNREKAIQLVILMTDG---IPNSKYTALEVAKKLKQRNVSLAVIGIGQGINHQFNRLIA 160 >gi|308473944|ref|XP_003099195.1| CRE-CLEC-63 protein [Caenorhabditis remanei] gi|308267668|gb|EFP11621.1| CRE-CLEC-63 protein [Caenorhabditis remanei] Length = 414 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 35/213 (16%) Query: 157 PITSSVKVNSQTDARL-----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 P V+ + L D+++V+D S+ M ++ IT++ I ++ + Sbjct: 41 PTPGGSNVDRECGGDLANLWLDIVVVVDNSKGMT---NAGITQVAANIATVFGNGTRIGN 97 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 + GLVT++ E L +H+Q I L ST ++ + + Sbjct: 98 QYTDPRSTRVGLVTYNKVATEVADL----NHIQS-IDDLYSVVFSTLTSVS-SEDDSYLA 151 Query: 272 DMQGMRQHCNTE------DANYKKIIVFMT-----DGENLSTKEDQQSLYYCNEAKKRGA 320 G + +NYK++++ DG+N + + K G Sbjct: 152 TGIGAAEKVFQNGRNGNVRSNYKRLVLVYASAYKGDGQN-------DPIPVSDRLKSSGV 204 Query: 321 IVYAIGIRVIRSHEFLRAC---ASPNSFYLVEN 350 ++ I L ASPN + E+ Sbjct: 205 VISTIAFDQDGDEALLAGLAQIASPNYAFTSED 237 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 59/162 (36%), Gaps = 26/162 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVVQSGLVT--FSNKI 230 + I LD+SRSM++ D ++++ A I ++ L + + + F+ + Sbjct: 92 ITIALDLSRSMDAD-DVWPSRLERAKLEIEKFVKNTDNLAVSLVGFAGTSFIACPFTQDM 150 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E F + +S ++ T L A N DA KK Sbjct: 151 ETFSYILDNLSTKSVTLQ-------GTRIADALVTAKNTFNV-----------DAVSKKS 192 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 IV +TDGE+ D E K VY IG+ + Sbjct: 193 IVLITDGEDHGGYFD----EVLKELKDMNISVYTIGVGTSQG 230 >gi|153822291|ref|ZP_01974958.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229509084|ref|ZP_04398571.1| hypothetical protein VCE_000486 [Vibrio cholerae B33] gi|229608756|ref|YP_002879404.1| hypothetical protein VCD_003678 [Vibrio cholerae MJ-1236] gi|255743768|ref|ZP_05417726.1| hypothetical protein VCH_000062 [Vibrio cholera CIRS 101] gi|126520187|gb|EAZ77410.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229353841|gb|EEO18776.1| hypothetical protein VCE_000486 [Vibrio cholerae B33] gi|229371411|gb|ACQ61834.1| hypothetical protein VCD_003678 [Vibrio cholerae MJ-1236] gi|255738629|gb|EET94016.1| hypothetical protein VCH_000062 [Vibrio cholera CIRS 101] gi|259156250|gb|ACV96197.1| von Willebrand factor, type A [Vibrio cholerae Ban5] gi|259156425|gb|ACV96370.1| von Willebrand factor, type A [Vibrio cholerae Ind5] Length = 551 Score = 43.6 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 26/167 (15%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SN 228 + +++D+S SM + + + +A ++ A++ ++ IP V V++ Sbjct: 379 VHLLVDISGSMGKPIGEGNRKYFHVANEAALALVMALEGIPGV-----VPAVSYFPGIHQ 433 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 ++ L + V H + T + +A N + + R Sbjct: 434 EVSVALLPKQSVRHRAAYFDQKPR--GCTPMAQAMWFAANSLLAQKQKR----------- 480 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 K+++ +TDG+ + C ++ G + IGI+ F Sbjct: 481 KLMIVLTDGDPDDWAATHDIVDRC---RRSGFELLGIGIQTRSVERF 524 >gi|325679008|ref|ZP_08158606.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109512|gb|EGC03730.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 270 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 65/201 (32%), Gaps = 15/201 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + L + ++D S SM K+ + ++ E++ + + + V+ Sbjct: 27 GATAVSKKSLVIFFLIDTSGSM------KGKKMGQLNTVMEELIPEIRRVGEADTDVKVA 80 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++TF + +W S I+ + + A+ ++ N+ Sbjct: 81 VLTFDTDV------KWMYSAPIS-IEEFEWARLGAQGVTSMGAAFTELAARMSRNSFLNS 133 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACAS 341 ++ ++ MTDG + N K G A+GI + + L S Sbjct: 134 PSLSFAPVMFLMTDGYPSDDYKAGLKALQANSWYKFGLKA-ALGIGDEANDDMLEEFTGS 192 Query: 342 PNSFYLVENPHSMYDAFSHIG 362 ++ + I Sbjct: 193 KDTVVHAYTGGQLAAMIKIIA 213 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 85/286 (29%), Gaps = 68/286 (23%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSI 202 N I + + + +S D+++V D+S SM + T ++ A + Sbjct: 63 DNRYKINLDLATGKAQDSVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRLTKAKNAA 122 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNS 259 L K+ + N + +VTF+ L + ++ I+ + + TN Sbjct: 123 IEFLNNSKISGNKKN--RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNI 180 Query: 260 TPGLKYAYNQI------------FDMQGMRQHCNTEDANYKKIIV------FMTD----- 296 GL A + G + +V TD Sbjct: 181 QAGLYKARTVLKNAKSENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGY 240 Query: 297 ------GEN-----------------------LSTKEDQQSLYYCNE----AKKRGAIVY 323 GEN + + AKK G ++ Sbjct: 241 SGRYTFGENAINYDSVIKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIF 300 Query: 324 AIGIRVIRS-HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 IG S + FL+ A+ Y + + + +I +IVT+ Sbjct: 301 TIGYTTGSSVNSFLKNVATQGEGYAYSSSSDLSGIYENIANEIVTR 346 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 54/358 (15%), Positives = 108/358 (30%), Gaps = 79/358 (22%) Query: 85 RIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF 144 + T M ++ E ++G+ L N +++ + K Sbjct: 522 TVTETVPMEYKLESTESGYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDE 581 Query: 145 IPWYTNS-----RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 Y + + ++ +VK + + D++++LD S SM+ + S +++ Sbjct: 582 KTNYLPAGSETINNYEYNVSGNVKSTTVKAPQ-DVVLLLDKSGSMDESMNGS-SRLTHLK 639 Query: 200 KSINAMLEEVKLIPDVNNVVQS-------GLVTFSNKIE----EFFLLEWGVSHLQR--K 246 ++ + ++ + V G +T +N ++ + W + Sbjct: 640 NNVIKFITKLYEHNPDSRVSVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGG 699 Query: 247 IKYLSKFGVS---------------------------TNSTPG---LKYAYNQIFDMQGM 276 IK + G + T+ PG +YN +D G Sbjct: 700 IKNIKASGGTQIDLGLYEVRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYND-YDDNGY 758 Query: 277 RQHCNT-EDANYKKIIVFMTDGENLS-----------------------TKEDQQSLYYC 312 R A++ K +T G N + Sbjct: 759 RVGAEALNQADFIKFSGNLT-GINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTN 817 Query: 313 NEAKKRGAIVYAIGIRVIRS---HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 K G ++ IG+ S FL AS + N +M +AF+ I I T Sbjct: 818 RSGKGLGKTIFTIGLNSNNSSLFDSFLTRLASEGHYTKANNSSAMENAFNSIFTSITT 875 >gi|295669664|ref|XP_002795380.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] gi|226285314|gb|EEH40880.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] Length = 773 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 73/206 (35%), Gaps = 25/206 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 D+++ +DVS SM+ T K + S+ + + +I +N + G+V Sbjct: 73 PCDIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIETLNENDRLGVV 132 Query: 225 TFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 TFSN E + + ++ L STN GLK + + + Q+ Sbjct: 133 TFSNDAEVAYKISHMDDTNKKAALEAVEALQPL-ASTNLWHGLKLGLSVLGKVDLRPQNV 191 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + + +TDG+ Q + K R +++ G L Sbjct: 192 --------QALYVLTDGQPNHMCPRQGYVPKLRPILERQKDRLPLIHTFGFGYDIRSGLL 243 Query: 337 RACAS--PNSFYLVENPHSMYDAFSH 360 ++ A ++ + + + F H Sbjct: 244 QSIAEVGGGTYSFIPDAGMIGTVFVH 269 >gi|225387829|ref|ZP_03757593.1| hypothetical protein CLOSTASPAR_01599 [Clostridium asparagiforme DSM 15981] gi|225046072|gb|EEG56318.1| hypothetical protein CLOSTASPAR_01599 [Clostridium asparagiforme DSM 15981] Length = 547 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 57/167 (34%), Gaps = 24/167 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 SV Q+ + ++DVS SM+ + ++ + +I+ L E N+ + Sbjct: 71 SVSTAEQSGLPKTIYCLVDVSGSMK----GRMEQVKETLTAISGGLNE-------NDNLV 119 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G + G ++ +I L G T+ GL + G++ Sbjct: 120 IGKMG---NQITDSAFLSGQEEIKAQIDSLQYTGEDTDLYSGL---------IHGLKFLQ 167 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + +V ++DG + +K VY + + Sbjct: 168 QEPEVKTLRALVVLSDG-CDDQGAGSTWKEAYDAVEKADIPVYTVAV 213 >gi|125717894|ref|YP_001035027.1| peptidoglycan binding domain-containing protein [Streptococcus sanguinis SK36] gi|125497811|gb|ABN44477.1| Peptidoglycan-binding domain-containing protein, putative [Streptococcus sanguinis SK36] Length = 450 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 63/200 (31%), Gaps = 36/200 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D++ V+D S SM S ID+ +IN + + + GL TFS Sbjct: 173 KAGSADIVFVVDRSGSMGST-------IDIVRANINEFVRNITK---EGITARFGLATFS 222 Query: 228 NKIE-------------EFFLLEWGVSHLQRKIKYLSKF----GVSTNSTPGLKYAYNQI 270 +++ F + + K L+ G T TP A NQI Sbjct: 223 DEVYGRNSGSKDEDTVLTRFGSSYFTTDPAELEKALAAIRIASGGDTPETPT--PALNQI 280 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + KK +V +TD E + + K G ++ I Sbjct: 281 ISTYDWSKSSKN-----KKFVVLLTDAEIKKPLQFPRVADTLRALKAAGIERTVATVKAI 335 Query: 331 RSHEFLRACASPNSFYLVEN 350 + A+ +EN Sbjct: 336 EG--IYKNFATEGRVLDIEN 353 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 43.2 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|262091908|gb|ACY25457.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 667 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 67/211 (31%), Gaps = 27/211 (12%) Query: 134 RYKIP-----LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 RY +P T I + + + + + L++ +VLD S SM Sbjct: 256 RYDLPDVDDSFAVTTDIVEHPIHSDLHLVRIGTQAPEFVDNTPLNVTLVLDASGSMRE-- 313 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-GVSHLQRKI 247 ++++A + A+ ++ + +V FS + L I Sbjct: 314 ---GNRVEIARAAAEAIRRGLRDQD------RLAVVHFSTDVFAVVDHRRPNARQLSNSI 364 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L STN GL + E N I+ M+DG D Sbjct: 365 GSLRPRN-STNVQAGLNRGV-------ELADEVRRERPNAYNYIILMSDGVANVDATDPF 416 Query: 308 SLYYCNEAKKRG--AIVYAIGIRVIRSHEFL 336 ++ E + + IG+ + ++ L Sbjct: 417 AILASAEDRDSSNPLRLITIGVGIQNYNDVL 447 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 85/286 (29%), Gaps = 68/286 (23%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT------KIDMAIKSI 202 N I + + + +S D+++V D+S SM + T ++ A + Sbjct: 63 DNRYKINLDLATGKAQDSVEAIGNDIVLVFDISNSMAEDEHGNSTSSNDKKRLTKAKNAA 122 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNS 259 L K+ + N + +VTF+ L + ++ I+ + + TN Sbjct: 123 IEFLNNSKISGNKKN--RYSIVTFNYYGTVEQNLTSNLETAKQAIRDVELGNNSDGGTNI 180 Query: 260 TPGLKYAYNQI------------FDMQGMRQHCNTEDANYKKIIV------FMTD----- 296 GL A + G + +V TD Sbjct: 181 QAGLYKARTVLKNAKSENGIIILLSDGGATGSYKLNNERNNGYLVNDYSEATATDKALGY 240 Query: 297 ------GEN-----------------------LSTKEDQQSLYYCNE----AKKRGAIVY 323 GEN + + AKK G ++ Sbjct: 241 SGRYTFGENAINYDSVIKGGRNDFTLDLYLNNSHYSLNNAAATLAENEALLAKKSGNTIF 300 Query: 324 AIGIRVIRS-HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 IG S + FL+ A+ Y + + + +I +IVT+ Sbjct: 301 TIGYTTGSSVNSFLKNVATQGEGYAYSSSSDLSGIYENIANEIVTR 346 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 55/358 (15%), Positives = 108/358 (30%), Gaps = 79/358 (22%) Query: 85 RIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF 144 + T M ++ E ++G+ L N +++ + K Sbjct: 522 TVTETVPMEYKLESTESGYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDE 581 Query: 145 IPWYTNS-----RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 Y S + ++ +VK + + D++++LD S SM+ + S +++ Sbjct: 582 KTNYLPSGSETINNYEYNVSGNVKSTTVKAPQ-DVVLLLDKSGSMDESMNGS-SRLTHLK 639 Query: 200 KSINAMLEEVKLIPDVNNVVQS-------GLVTFSNKIE----EFFLLEWGVSHLQR--K 246 ++ + ++ + V G +T +N ++ + W + Sbjct: 640 NNVIKFITKLYEHNPDSRVSVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGG 699 Query: 247 IKYLSKFGVS---------------------------TNSTPG---LKYAYNQIFDMQGM 276 IK + G + T+ PG +YN +D G Sbjct: 700 IKNIKASGGTQIDLGLYEVRNQLSSATGENNRSVIVFTDGQPGNKGFNTSYND-YDDNGY 758 Query: 277 RQHCNT-EDANYKKIIVFMTDGENLS-----------------------TKEDQQSLYYC 312 R A++ K +T G N + Sbjct: 759 RVGAEALNQADFIKFSGNLT-GINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTN 817 Query: 313 NEAKKRGAIVYAIGIRVIRS---HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 K G ++ IG+ S FL AS + N +M +AF+ I I T Sbjct: 818 RSGKGLGKTIFTIGLNSNNSSLFDSFLTRLASEGHYTKANNSSAMENAFNSIFTSITT 875 >gi|322433016|ref|YP_004210265.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] gi|321165243|gb|ADW70947.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] Length = 213 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 56/174 (32%), Gaps = 13/174 (7%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ + D R + +LD S SM ID + + + +++ V+ Sbjct: 4 LEFATNPDPRCACIFLLDTSGSM------KGAPIDALNEGLRSFEHDIQDDALARRRVEI 57 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++TF + W + + + G T + I + ++ Sbjct: 58 AIITFGGITRQIQP--WVSAGAFQA--PVLTTGGGTPMGEAMYEGVRMINIRKA--EYKA 111 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + Y+ + +TDG ++ A RG +A+G+ Sbjct: 112 AGLSYYQPWVFLITDGTPTDEWLQAAAVVRRETA-ARGLTFFAVGVGDADMDTL 164 >gi|254459050|ref|ZP_05072473.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084321|gb|EDZ61610.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 615 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 63/165 (38%), Gaps = 23/165 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+MI LD+S SM + D ++++A + +L E + G++ F+ Sbjct: 82 KAKSADIMIALDISDSMLAE-DVYPNRLELAKEKALTLLSEAPSE-------RVGIMAFA 133 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + +K L T S + I D+ Q + Sbjct: 134 KNSYLVSPLSFDTGAVSFLLKQL-----DTTSITQKGTDFLSILDVFNTSQENDG----- 183 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 +K ++ ++DG + + L AKK +V+ +G+ ++ Sbjct: 184 EKYLLILSDGGDSKEFSKEIEL-----AKKSNIVVFILGVGTVKG 223 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 43.2 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 14/75 (18%), Positives = 28/75 (37%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR +G + I+ I + + L + E + K L S+ D + + A ++ + Sbjct: 7 IRRLCTGTEGAVAIMVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAFELGS 66 Query: 68 EGNGNNRKKLKGGDI 82 N + D Sbjct: 67 GSNSSVISAAAIQDA 81 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 77/221 (34%), Gaps = 37/221 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L + I+LD S S ++ +K++ +E L++F Sbjct: 69 EKKLPLTIGILLDTSGSQQNVLPLEQQSGAEFLKTVLTPKDEAF------------LISF 116 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLS-KFGVST-----NSTPGLKYAYNQIFDMQGMRQHC 280 ++ ++R I + G T NSTP Y+ ++ + Sbjct: 117 DINVDLLSDYTNSPREIKRSIDKATINTGAGTGSVTGNSTPKGTLLYDAVYLAAHDKLRQ 176 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR----------VI 330 KI+V +TDG + ++E ++ A+K AIVY I I Sbjct: 177 EAGR----KILVMLTDGGDQGSQETLKTAT--EAAQKANAIVYVILIADRGFYSGGGFSF 230 Query: 331 RSHEFLRACA--SPNSFYLV-ENPHSMYDAFSHIGKDIVTK 368 + + A + V N + DAF I ++ T+ Sbjct: 231 GGDRDMESLAHDTGGRVINVGNNGRKLEDAFDQIQDELRTQ 271 >gi|218261909|ref|ZP_03476582.1| hypothetical protein PRABACTJOHN_02253 [Parabacteroides johnsonii DSM 18315] gi|218223703|gb|EEC96353.1| hypothetical protein PRABACTJOHN_02253 [Parabacteroides johnsonii DSM 18315] Length = 285 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S +++ K + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGS------RDFGSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S+KIE+F + G H+ I+ L TN LKY N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFQPDKKQTNIAQALKYLTNAI 172 >gi|123390689|ref|XP_001299929.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121880878|gb|EAX86999.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 661 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 38/172 (22%), Positives = 67/172 (38%), Gaps = 35/172 (20%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D V+D S SME +I+ A+K + ML+ + ++ + +V F ++ + Sbjct: 240 DYFFVIDRSASMEG------DRIEKAVKCMRLMLQSLPMM------CRFSIVCFGSEFQS 287 Query: 233 FFLL-EWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + E+ ++ IK ++ T+ L+Y IF GM Sbjct: 288 LLPIVEYNNENVLLAMNLIKNINANMGGTDIYHPLEY----IFSQNGMT----------- 332 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 K I +TDGE + + E K+ G +Y +GI +R A Sbjct: 333 KKIFLLTDGE---DSNSEDIIRLVQENKQFG-NIYTVGIGSGADSGLIRNLA 380 >gi|327270796|ref|XP_003220174.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 948 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 51/254 (20%), Positives = 94/254 (37%), Gaps = 38/254 (14%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 + YK + + +SR + P S +N + +VLD S M Sbjct: 253 KMCSYKSTWEVIMNSADFVSSRALGAP-PSDPTINLMKMQERSVCLVLDTSGKM-----G 306 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IK 248 ++ ++ L +++I + V G+VTF+N+ LL+ V+ R+ Sbjct: 307 KDNRLGRLNQAAKLFL--LQIIESGSWV---GIVTFNNEAATKTLLQKIVNDGVRQTLTS 361 Query: 249 YLSKFGVSTN-STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 YL + G+ + + + C IV +T GE Sbjct: 362 YLPTTAAGESKICDGVLAGFQVFLNKYPSSEGCE---------IVLLTHGE-------DP 405 Query: 308 SLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE--NPHSMYDAFSHI- 361 ++ C + + G+I++ I S+E L +FY + + + + DAFS I Sbjct: 406 AIRSCFPQIQNSGSIIHTIAFGSGTSNELEKLADMTGGLAFYATDSLDSNGLMDAFSGIS 465 Query: 362 --GKDIVTKRIWYD 373 G DI + I + Sbjct: 466 SGGGDISQQSIQLE 479 >gi|196228417|ref|ZP_03127284.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196227820|gb|EDY22323.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 279 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 64/181 (35%), Gaps = 37/181 (20%) Query: 174 MMIVLDVSRSMESFF---DSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTF 226 + ++D S SME F +S+ K D +IN +L+ + V + Q+G++ + Sbjct: 15 FLFLIDQSGSMEDPFGGGESTRKKADAVADAINKLLQNLVIKCAKSEGVRDYYQTGVIGY 74 Query: 227 SNKI---------------------------EEFFLLEWGVSHL-QRKIKYLSKFGVSTN 258 ++ E ++ G L ++K+K+ F Sbjct: 75 GERVGPAFSGALAGRDMVPISDIASTPARIEERAKKVDDGAGGLAEQKVKFPIWFDAVAK 134 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + + A Q + +++ I++ +TDGE+ + N A Sbjct: 135 NGTPMCQALTQ--AKGILSAWIAQHPSSFPPIVINITDGESTDGDPTGVARELTNLATND 192 Query: 319 G 319 G Sbjct: 193 G 193 >gi|194016713|ref|ZP_03055326.1| von Willebrand factor, type A domain containing protein [Bacillus pumilus ATCC 7061] gi|194011319|gb|EDW20888.1| von Willebrand factor, type A domain containing protein [Bacillus pumilus ATCC 7061] Length = 638 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 69/210 (32%), Gaps = 21/210 (10%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRS 183 I +K + + I S+ ++ + A +++D S S Sbjct: 395 IQKTLEHKKTWPKTDLHAGRLSKKLIRYFTESNPRLFYKKQAPSSEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D K I E +K + + +V G +N E + + + Sbjct: 455 M-------FDKMDETKKGIVLFHEALKSVQVPHQIV--GFWEDTNDASETSQPNYF-NTV 504 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 L G S S + + ++ M + +K ++ +DGE + Sbjct: 505 VSFKDSLFDAGPSIMSLEP-EEDNRDGYAIRQMTKMILKRREE-QKFLIVFSDGEPAAFS 562 Query: 304 EDQ----QSLYYCNEAKKRGAIVYAIGIRV 329 +Q + EA+K+G V + + Sbjct: 563 YEQNGIVDTHEAVLEARKKGIEVINVFLSN 592 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|15674025|ref|NP_268200.1| hypothetical protein L107379 [Lactococcus lactis subsp. lactis Il1403] gi|12725093|gb|AAK06141.1|AE006434_3 unknown protein [Lactococcus lactis subsp. lactis Il1403] Length = 1450 Score = 43.2 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 55/141 (39%), Gaps = 28/141 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 +D+++V+D+S SM+ K + ++ L ++ + V G+V +S+ Sbjct: 325 PVDIVLVIDMSGSMQG------AKETAVRQGVSDFLSTIQNTA-YADYVNVGIVGYSSPG 377 Query: 230 ---------IEEFFLLEWGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 I SH++ + L +F T + GL+ + Sbjct: 378 NYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTEML--------- 428 Query: 280 CNTEDANYKKIIVFMTDGENL 300 + ++ +K+++ MTDG Sbjct: 429 -EQDSSDNQKMMILMTDGVPT 448 >gi|332970883|gb|EGK09860.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 448 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 43/237 (18%), Positives = 82/237 (34%), Gaps = 32/237 (13%) Query: 142 CTFIPWYTNSRHIVMPITSSVKV----NSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 + + + VK S + L + I+LD S SM D + K+++ Sbjct: 104 IKEMDRFNTGLELTGDTPGGVKGPDGQESAEEKPLHVSILLDASGSMAGQVDGGM-KMNL 162 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---------FLLEWGVSHLQRKIK 248 A ++ + V+ V G ++K ++ + L G ++ + Sbjct: 163 AKAAVERFASSLPENAKVSLWVY-GHKGSNSKKDKPVSCKSTEEVYPL--GTYQEEKFSQ 219 Query: 249 YLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L +F + T +K A ++ Q + EDA + IV +DG + Sbjct: 220 SLDQFRATGWTPIAASMKAAREEL-------QKNSGEDATHMLYIV--SDGVETCGGDPV 270 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACAS--PNSFYLVENPHSMYDAFSH 360 N++K + V IG V + L+ A + VE+ + F Sbjct: 271 AEAKKLNQSKIKAV-VNIIGFDVDDAGQQALQKVAEAGGGEYETVESEQDLRSYFDE 326 >gi|226314649|ref|YP_002774545.1| hypothetical protein BBR47_50640 [Brevibacillus brevis NBRC 100599] gi|226097599|dbj|BAH46041.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 513 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 40/262 (15%), Positives = 93/262 (35%), Gaps = 47/262 (17%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P N+ ++I A S P+ N + + + I +++ + +++ V+DVS Sbjct: 138 PTNQTFAIQADS---------GPSPFQKNLQIVRIGIKGK-ELSPKERKPANLVFVIDVS 187 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFLLEWG 239 SM ++++ KS++ ++++++ G+V + + ++ Sbjct: 188 GSMNQE-----NRLELVKKSLHVLVDQLQPTDS------VGIVVYGSEGRVLLPPTSTED 236 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + I L G STN+ GL Y M A + ++ +DG Sbjct: 237 KQAILSAIDELQPEG-STNAEQGLVLGY-------EMAARSFKPPAINR--VILCSDGVA 286 Query: 300 L--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFYLVENPHSM 354 T + + A+K + + G + ++ + A S+ ++ Sbjct: 287 NVGETGAEGILRSIEDYARKD-IYLSSFGFGMGNYNDVMMEQLANKGEGSYAYIDTFSEA 345 Query: 355 YDAF--------SHIGKDIVTK 368 F I +D+ + Sbjct: 346 RRIFTESLTGTLQTIARDVKIQ 367 >gi|148998551|ref|ZP_01825991.1| hypothetical protein CGSSp11BS70_06123 [Streptococcus pneumoniae SP11-BS70] gi|307067820|ref|YP_003876786.1| hypothetical protein SPAP_1196 [Streptococcus pneumoniae AP200] gi|147755549|gb|EDK62596.1| hypothetical protein CGSSp11BS70_06123 [Streptococcus pneumoniae SP11-BS70] gi|301794141|emb|CBW36551.1| unnamed protein product [Streptococcus pneumoniae INV104] gi|306409357|gb|ADM84784.1| hypothetical protein SPAP_1196 [Streptococcus pneumoniae AP200] gi|332202866|gb|EGJ16934.1| hypothetical protein SPAR120_0979 [Streptococcus pneumoniae GA47901] Length = 167 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 25/115 (21%), Positives = 52/115 (45%), Gaps = 9/115 (7%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSS--IT 193 K T I + + + ITS + +Q++ LD+++V D+S SME + S I+ Sbjct: 47 KTITPISGQKDQYELSLDITSKLGTETQSE-PLDVVLVADLSGSMEERDVWSYSSRRYIS 105 Query: 194 KIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 +I+ ++ + LI + N+ + +V F+ KI+ + + + Sbjct: 106 RIEALKHTLKGVNGRQGLIDTILSNSQNRLSIVGFAGKIDNQYNDRYYNEYYLSY 160 >gi|114321541|ref|YP_743224.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] gi|114227935|gb|ABI57734.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] Length = 972 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 66/196 (33%), Gaps = 22/196 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--F 226 D L++ +++D S SM S I A + +++ V+ VV+ Sbjct: 326 DDDLEISLIVDTSGSM------SGAPIINARTAGRTLVDVVEPGRTAMGVVRFSASASVV 379 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 I L+ I L G++ + + ++ + D + Sbjct: 380 HPMIAIPDPGTAEKDQLKDAIDSLPASGLTAMFDGLI-------LGLDELQDYSAANDTD 432 Query: 287 YKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ ++D G+N S + Q++ +A + A G LR A + Sbjct: 433 AGQVAFLLSDGGDNSSAATEPQTVQAYQDA---NVPIIAFGYGSFAPTGVLRRLADNTGG 489 Query: 344 SFYL-VENPHSMYDAF 358 F+ + +AF Sbjct: 490 EFFASPTTLAEIQEAF 505 >gi|147898495|ref|NP_001088330.1| inter-alpha (globulin) inhibitor H2 [Xenopus laevis] gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis] Length = 935 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 73/198 (36%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ + ++ ++LE++ Q G++ F++ I Sbjct: 306 ILFVIDVSGSMWGL------KMKQTVDAMKSILEDLNSDD------QFGIIDFNHNIRCW 353 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + + ++ + G TN L A IF ++ + N Sbjct: 354 KDELVYASSVEKGDASKYVQRIQPNG-GTNINDALLRA---IFILKEASN-KGLLEQNSV 408 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 +IV ++DG+ + + + ++++GI ++FL A N Sbjct: 409 SLIVLVSDGDPT--VGELKLPKIQKNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHG 466 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ + F Sbjct: 467 MAQRIYGNQDTAAQLKEF 484 >gi|311978223|ref|YP_003987343.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] gi|82000061|sp|Q5UQ35|YR811_MIMIV RecName: Full=Putative ariadne-like RING finger protein R811 gi|55417421|gb|AAV51071.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308205062|gb|ADO18863.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] Length = 990 Score = 43.2 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 62/177 (35%), Gaps = 35/177 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK--SINAMLEEVKLI--PDVNNVVQS 221 A +D+ IV+D + SM +F S + ++ I +++ + +I D + V + Sbjct: 1 MDNQAVVDLAIVVDATGSMGTFLSSLSESLQQIVQIIDITNVIQNINIIMYRDYCDSVIT 60 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD--------- 272 + +KI++ I+ L G ++ K A N + D Sbjct: 61 ASSGWVSKIDDLIPF----------IRGLRASGGG-DTPEAGKTAANNLLDVVKNNTIVI 109 Query: 273 --MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYCNEAKKRGAIVYAI 325 H + N+ + I N D+ + C+ R IVY I Sbjct: 110 WYADAPPHHKSNARDNFAREI-------NTLIGSDKIFDWIELCDTLAARNIIVYPI 159 >gi|282900192|ref|ZP_06308148.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194941|gb|EFA69882.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 232 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 55/165 (33%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ +I+ + + + +E+ V+ +VTF + + Sbjct: 32 VLLLDTSGSMQG------DRIEALNQGLLSFKDELVKNTLAARRVEVAIVTFDSHVNVVQ 85 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + + T G+ + I Q + Y + VFM Sbjct: 86 DFVT----VDQFTPPILTAQGLTTMGAGINKSLEII---QERKSQYRANGIAYYRPWVFM 138 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + +Q + + + + +G+ Sbjct: 139 ITDGEPQGEIDEVIEQATQRLRGDESNKKVAFFTVGVENANMDRL 183 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 39/196 (19%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +MIVLD S SM D KI +A + LI + ++ V GL + ++ + Sbjct: 16 VMIVLDGSNSMWGQVDGEA-KITIAKEV------MTDLITNWDDSVDLGLTVYGHRRKGD 68 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + V + K+ + + TP K ++ + + + ++ +V Sbjct: 69 CA-DIEVVAMPGKVDRQALIDKVQSITPRGKTPISKTLSLAALSVGFFSGKSS----VVL 123 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAI-----VYAIGIRVIRSHEFLRAC---ASPNSF 345 ++DG + C +AK G I V+ IG V C + F Sbjct: 124 VSDGLETCNAD------PCAQAKSLGIINPGFDVHVIGFDVTEEEFKSLQCIATETGGKF 177 Query: 346 YLVENPHSMYDAFSHI 361 + N + DA Sbjct: 178 FRANNAEELKDALRQT 193 >gi|145516893|ref|XP_001444335.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411746|emb|CAK76938.1| unnamed protein product [Paramecium tetraurelia] Length = 1460 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 63/153 (41%), Gaps = 19/153 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SME F + A K + A L+E++ P+ + ++ F++ + Sbjct: 1277 ILILDDSGSMEGAF------FEAAKKGLVAFLQEIQKNPES----RVTIILFNH--QARC 1324 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 ++++ + Q + K + G T+ LK A+++I + + I F Sbjct: 1325 VVDYEIPDAQVQQKEIQFRGGGTDFDEPLKLAFDKIANNPDFDNFSSHS-------IFFY 1377 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 TDG+ + + + K+ + A Sbjct: 1378 TDGQAQYPTKAMEKVKQFPSDKREKIELVACSF 1410 >gi|332283431|ref|YP_004415342.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] gi|330427384|gb|AEC18718.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] Length = 585 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 42/209 (20%), Positives = 74/209 (35%), Gaps = 36/209 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK--- 229 D++I+ D S SM D + KI A K + + VK P+ N GL+ + ++ Sbjct: 26 DVLIIYDASGSMWGQVDG-VNKIVTARKVMGEL---VKSWPENTN---LGLIAYGHRSAG 78 Query: 230 ----IEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 IE + + + ++ G + S + A D+ R H T Sbjct: 79 SCSDIETMIEPQRVDRDAFIKTVNAITPKGKTPISASLKQAA-----DVLQYRDHNAT-- 131 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRACA- 340 +V ++DG E K++G + +G + +E L A Sbjct: 132 ------VVLISDG---LESCHGDPCAVAAELKEKGVDFKAHVVGFDLDQEGNEALSCIAK 182 Query: 341 -SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + F N + DA + +V K Sbjct: 183 NTGGIFVPASNADELQDALQQVQAKVVQK 211 >gi|298255345|ref|ZP_06978931.1| collagen adhesion protein [Streptococcus pneumoniae str. Canada MDR_19A] Length = 171 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 11/103 (10%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSME----SFFDSS- 191 K T I + + + ITS + +QT+ LD+++V D S SME + S Sbjct: 47 KTITPISGQEDQYELSLDITSKLGTETQTETQSEPLDVVLVADFSGSMEERDVWSYSSRR 106 Query: 192 -ITKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE 231 I++I+ ++ ++ LI + N+ + +V F+ KI+ Sbjct: 107 YISRIEALKHTLKGVIGRQGLIDTILSNSQNRLSIVGFAGKID 149 >gi|288940760|ref|YP_003443000.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896132|gb|ADC61968.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 351 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 55/168 (32%), Gaps = 14/168 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +LD S SM ++ K++ ++++ ++ ++ P V ++ F+ Sbjct: 3 RLPVFFLLDCSESM---VGDNLRKMEEGLQAV---VQSLRADPHALETVHISVIAFAGFA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L V L G T+ L ++I D I Sbjct: 57 KTIVPL---VELFSFYPPRLP-LGGGTSLGAALDALMSEIDRSVVKTTPERKGDWRP--I 110 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + TDG E + A I A+G+ L+ Sbjct: 111 VYLFTDGHPTDDPEPAIERWRARYAGSATLI--AVGMGGDADFSALKR 156 >gi|242780420|ref|XP_002479591.1| von Willebrand domain protein [Talaromyces stipitatus ATCC 10500] gi|218719738|gb|EED19157.1| von Willebrand domain protein [Talaromyces stipitatus ATCC 10500] Length = 952 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 59/192 (30%), Gaps = 29/192 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ + D S SM ++KI I +N L + N + +F ++ Sbjct: 345 IIFMADRSGSM-------MSKIPSLINVMNIFLRSLPEKCSFN------ISSFGSRPTWL 391 Query: 233 -FFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + T L+ + H N ++ + Sbjct: 392 WPSSKRYSQEDMDIASQHVNKFQANYGGTEIYGALESVLD----------HYNKQN-DVP 440 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++ +TDGE + + +++GI SH + Y Sbjct: 441 TSVILLTDGEVWDVDNVIKLVRKAVSESDTNIRFFSLGIGGQVSHRLVEGIGEQGGGYAE 500 Query: 349 ENPHSMYDAFSH 360 P S+ +++ Sbjct: 501 IVPESLMNSWQE 512 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 8 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 57 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 9 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 58 >gi|119512000|ref|ZP_01631096.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463351|gb|EAW44292.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 202 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 51/130 (39%), Gaps = 14/130 (10%) Query: 176 IVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++D S SMES D T+ + ++ A+ + + + FS + + + Sbjct: 11 LIIDKSGSMESPGDLPGKTRWETMQENTLALARKCDQLDPDG----INVYLFSGRHKFYS 66 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + ++I + STN L+ A++ F + N E I+ + Sbjct: 67 EV---TASKVQQIFQENHPAGSTNLGGVLQAAFDDFFKRREQGLTPNGE------TIIVV 117 Query: 295 TDGENLSTKE 304 TDGE K Sbjct: 118 TDGEPDDRKP 127 >gi|116619435|ref|YP_821591.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222597|gb|ABJ81306.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 377 Score = 43.2 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 59/163 (36%), Gaps = 22/163 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTF 226 + ++D+S + + +I ++ L+P +G L+++ Sbjct: 97 EKQPIVAGFLVDLSNNSRIHW-------KTYQDAILELV--WNLLPGDKRY--TGYLISY 145 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 N + W + K++ + G + Y+ I+ R+ E Sbjct: 146 GNTADIAVNTTWDSDKIADKVRKMKPGGGAAL--------YDAIYLACTRRELVKGEPYE 197 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 +++IV + DG + ++K + + A++ +YA+ Sbjct: 198 PRRVIVVIGDGHDNASKHNLE--EVLELAQRNLVTIYAVSTMA 238 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 43.2 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 63/177 (35%), Gaps = 24/177 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ ++D S SM KI+ ++ +L ++ GL+TF Sbjct: 263 QRIPKNVVFIIDQSGSMHG------RKIEQTRTALIHILNDLAEDD------FFGLLTFD 310 Query: 228 NKIEEFFLLEWGVSHLQRK---IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + I +W +Q ++ F + + + + H Sbjct: 311 SNI-----FQWKRELVQATKANLESAKTFARNIRANGATNINAAVLKGSSMLNAHPREGS 365 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYY-CNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 A+ I++ +TDG+ + + + +++ A +Y +G + EFL + Sbjct: 366 AS---ILILLTDGDPTTGETNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEKMS 419 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 39/111 (35%), Gaps = 19/111 (17%) Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + T+ GL+ + I I+ +TDGE+ Sbjct: 52 REASGGTSICSGLRAGFQAIIHSNQSTSGSE---------IILLTDGEDDQISS------ 96 Query: 311 YC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVENPHSMYDAF 358 C E K+ GA+++ I + + E L FY ++ + + DAF Sbjct: 97 -CFEEVKQSGAVIHTIALGPSAARELETLSNMTGGYRFYANKDINGLTDAF 146 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|229578924|ref|YP_002837322.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|284997528|ref|YP_003419295.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|228009638|gb|ACP45400.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|284445423|gb|ADB86925.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 356 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 71/207 (34%), Gaps = 34/207 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V + S + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFVIYITPQQSSVTSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLL------YN 69 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +N L+ FSN E + + I + G +T + + N Sbjct: 70 LNEGNYVTLILFSNHPEIKYQ-----GPAKGIITFDVGKGYTTRLHEAVSFTIN------ 118 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 119 -----LAKQSQVPTK-IIMLTDGKPTDKR-NVKDYEKLDIPPNT--QIITIGIGNNYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFS 359 L+ A S FY +++ + + F Sbjct: 170 ILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 10 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 59 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 43.2 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|316974386|gb|EFV57878.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 560 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 23/168 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+E + TK+ I S +L + + G+V F+ + Sbjct: 37 DLIFLIDSSTSVE---RRNFTKLTHLIGSTVGLLNI------GPDDTRVGIVQFTGHPKA 87 Query: 233 FFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L + L +I L T+ +++ ++ F E N + Sbjct: 88 EFDLAHYSNRTSLIDRIGNLKHITGVTHLGQAIQHVADKEFIPS------KGERPNVANV 141 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLR 337 + ++DG + + A+ R + I I + L+ Sbjct: 142 LAILSDGLSQDDVSQPSMV-----ARARNIEPIIVAIGKFISKEQLLQ 184 >gi|302652520|ref|XP_003018108.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] gi|291181717|gb|EFE37463.1| hypothetical protein TRV_07884 [Trichophyton verrucosum HKI 0517] Length = 240 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 67/183 (36%), Gaps = 31/183 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFF 188 + + + I N +++ I +K + D+++V+D+S SM S Sbjct: 27 TVTMEENLTQEPILSIHSIPNKDSMIVSIQPPLKPGNDVPHVPCDIVLVIDISGSMNSAA 86 Query: 189 ----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D+ ++ +D+ + ++E + N + +VTF ++ F L++ Sbjct: 87 PIPTGEKGGEDTGLSILDLTKHAAKTIIETL------NEKDRLAVVTFCTEVNVAFELDY 140 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + + L STN G+K N + + ++ + Sbjct: 141 MNKENKSTVLSAVDKL-YGKSSTNLWHGIKKGLNVLATN---------PAQGKIQSLLVL 190 Query: 295 TDG 297 TDG Sbjct: 191 TDG 193 >gi|301609302|ref|XP_002934202.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 904 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 40/216 (18%), Positives = 82/216 (37%), Gaps = 44/216 (20%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P+ S + S + +VLDVS SM S +I+ ++ L ++ Sbjct: 286 NIPVPSFSLLQSSDRV---VTLVLDVSGSMASGG-----RIERLYQAAEVFLMQI----- 332 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHL-QRKIKYLSKFG--VSTNSTPGLKYAYNQIF 271 V G++TFS I L + + ++++K L T+ G++ Sbjct: 333 VEEGSYVGILTFSTSISLLSNLVQVIENTQRKQLKSLLPTNAFGGTDICLGIR------- 385 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 +G++ + + ++Y +V +TDGE+ + + GAI++ I + Sbjct: 386 --EGIKINRQYDGSSYGTELVLLTDGEDNY-----DTSKCFPDITNSGAIIHVIALGPNA 438 Query: 332 SHEF---------LRACASPNSFYLVENPHSMYDAF 358 + LR A+ + + +AF Sbjct: 439 AKALETIVDMTEGLRFLATDK-----VDTQGLIEAF 469 >gi|218460899|ref|ZP_03500990.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 373 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 61/192 (31%), Gaps = 19/192 (9%) Query: 85 RIKNTWNMSFRNELRDNGFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPL 139 + +F ++ + + S+ + + Y + P Sbjct: 165 SVATDPVSTFSADVDSASYSFVRRSLTGGAMPDPQSVRVEEMINYFPYDWPRPENAEQPF 224 Query: 140 KFC---TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 K PW ++ + + I + T +++ ++DVS SM+ K+ Sbjct: 225 KATVTVMPTPWNHDTELMHVAIKG-YDIAPATAPHANLVFLIDVSGSMDE-----PDKLP 278 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + S ++ +K V V +G + S + I L G S Sbjct: 279 LLKSSFRLLVNRLKADDTVAIVTYAGNAGTVLEPTRVSE----KSKILSAIDRLEA-GGS 333 Query: 257 TNSTPGLKYAYN 268 T G++ AY+ Sbjct: 334 TGGAEGIEAAYD 345 >gi|83753977|pdb|2ADF|A Chain A, Crystal Structure And Paratope Determination Of 82d6a3, An Antithrombotic Antibody Directed Against The Von Willebrand Factor A3-Domain Length = 196 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 61/177 (34%), Gaps = 26/177 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++++LD S S F S ++ K+ + + + P + Q ++ + + Sbjct: 10 QPLDVILLLDGSSS---FPASYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSI 60 Query: 230 IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W V +HL + + + G + L +A + H A Sbjct: 61 TTIDVP--WNVVPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGA 114 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + K +++ +T + A+ V+ IGI LR A P Sbjct: 115 S-KAVVILVT------DVSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 164 >gi|3402054|pdb|1AO3|A Chain A, A3 Domain Of Von Willebrand Factor gi|3402055|pdb|1AO3|B Chain B, A3 Domain Of Von Willebrand Factor Length = 187 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 61/177 (34%), Gaps = 26/177 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++++LD S S F S ++ K+ + + + P + Q ++ + + Sbjct: 3 QPLDVILLLDGSSS---FPASYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSI 53 Query: 230 IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W V +HL + + + G + L +A + H A Sbjct: 54 TTIDVP--WNVVPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGA 107 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + K +++ +T + A+ V+ IGI LR A P Sbjct: 108 S-KAVVILVT------DVSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 157 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 43.2 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|326407579|gb|ADZ64650.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis CV56] Length = 1433 Score = 43.2 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 54/141 (38%), Gaps = 28/141 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 +D+++V+D+S SM K + ++ L ++ + V G+V +S+ Sbjct: 308 PVDIVLVIDMSGSM------QGAKETAVRQGVSDFLSTIQNTA-YADYVNVGIVGYSSPG 360 Query: 230 ---------IEEFFLLEWGVSHLQRKIKYL-SKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 I SH++ + L +F T + GL+ + Sbjct: 361 NYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTEML--------- 411 Query: 280 CNTEDANYKKIIVFMTDGENL 300 + ++ +K+++ MTDG Sbjct: 412 -EQDSSDNQKMMILMTDGVPT 431 >gi|260800503|ref|XP_002595169.1| hypothetical protein BRAFLDRAFT_139524 [Branchiostoma floridae] gi|229280412|gb|EEN51180.1| hypothetical protein BRAFLDRAFT_139524 [Branchiostoma floridae] Length = 138 Score = 43.2 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 59/158 (37%), Gaps = 22/158 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S S+ A + +++ E L P Q + + + + Sbjct: 1 DLVFLLDGSDSVTP------QSFWTAKAFLRSVIGEFSLGP---QHTQVSVFQYGGDVRQ 51 Query: 233 FFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F L H L + +++ + L+YA F + N N K+ Sbjct: 52 EFTLGTYQDHRALSVGLAVITQLSGPRRTGAALQYAVRNGFSV------ANGGGRNVGKV 105 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +V ++ G + + + + + G IVYA+G+ Sbjct: 106 LVLLSTGMSADSVQQAANNAA-----RAGIIVYALGVG 138 >gi|304393682|ref|ZP_07375610.1| magnesium-chelatase 60 kDa subunit [Ahrensia sp. R2A130] gi|303294689|gb|EFL89061.1| magnesium-chelatase 60 kDa subunit [Ahrensia sp. R2A130] Length = 595 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 45/135 (33%), Gaps = 22/135 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + ++ +D S S +++ ++ A ++ +L E GL++F Sbjct: 409 KEKSASSVIFAVDASGS------TALNRLAEAKGAVELLLGE-----SYARRDHVGLISF 457 Query: 227 SNKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + E + +R + L G T L+ A + R + Sbjct: 458 RGQTAEMLLAPTRSLLRAKRALAALP-GGGGTPLADALRLA------AEASRDEQDKGRT 510 Query: 286 NYKKIIVFMTDGENL 300 IV MTDG Sbjct: 511 P---TIVIMTDGSAN 522 >gi|168204310|ref|ZP_02630315.1| von Willebrand factor type A domain protein [Clostridium perfringens E str. JGS1987] gi|170664017|gb|EDT16700.1| von Willebrand factor type A domain protein [Clostridium perfringens E str. JGS1987] Length = 620 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 67/169 (39%), Gaps = 16/169 (9%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQEPDEYGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKESMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 + + + IP+ + ++ F ++EE F + K +Y S N Sbjct: 114 VDKV------IDNIPNS----RIAVIAFGTEVEEVFSFND-KNKFTSKEEYKKAIKKSYN 162 Query: 259 STPGLKYAYNQIFDMQGMRQHCNT--EDANYKKIIVFMTDGENLSTKED 305 + G+ + + N ++N KK ++F +DG ++++ Sbjct: 163 NIEGMGNTNIEGTWRRADEIFKNELNNNSNSKKDVIFFSDGYPNVSRDE 211 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 72 >gi|297261610|ref|XP_002808020.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor-like [Macaca mulatta] Length = 2704 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 59/186 (31%), Gaps = 26/186 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ LD++++LD S S + + D A + + + P + Q Sbjct: 1570 TLSPAPDCSQPLDVILLLDGSSSFPAAY------FDEMKSFAKAFISKANIGPH---LTQ 1620 Query: 221 SGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ + + W V +HL + + + G + L +A + Sbjct: 1621 VSVLQYGSITTIDVP--WNVAPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTS---- 1674 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 H A+ +I+ M + A+ V+ IGI L Sbjct: 1675 EMHGARPGASKAVVILVM-------DVSVDAVDAAADAARSNRVTVFPIGIGDRYDAAQL 1727 Query: 337 RACASP 342 R A P Sbjct: 1728 RILAGP 1733 >gi|118601973|ref|YP_908673.1| hypothetical protein P91278ORF_075 [Photobacterium damselae subsp. piscicida] gi|118614711|ref|YP_908494.1| hypothetical protein P99018ORF_083 [Photobacterium damselae subsp. piscicida] gi|134044562|ref|YP_001101800.1| von Willebrand factor type A domain-containing protein [Yersinia ruckeri] gi|134044805|ref|YP_001102190.1| von Willebrand factor type A domain-containing protein [Yersinia pestis biovar Orientalis str. IP275] gi|165938076|ref|ZP_02226636.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|229516181|ref|ZP_04405630.1| hypothetical protein VCC_000196 [Vibrio cholerae RC9] gi|118596802|dbj|BAF38106.1| hypothetical protein P99018ORF_083 [Photobacterium damselae subsp. piscicida] gi|118596982|dbj|BAF38285.1| hypothetical protein P91278ORF_075 [Photobacterium damselae subsp. piscicida] gi|133904925|gb|ABO40942.1| von Willebrand factor type A domain protein [Yersinia ruckeri] gi|133905339|gb|ABO42101.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165914099|gb|EDR32716.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|229346831|gb|EEO11800.1| hypothetical protein VCC_000196 [Vibrio cholerae RC9] gi|324008185|gb|EGB77404.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 598 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 22/177 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ ++ IP Sbjct: 403 VTGDTRVFLKPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIP 462 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF + F +++ G + R ++ K T + YA ++ Sbjct: 463 GVNP-----AVTFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFEL 517 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R K+++ +TDG+ S + + C + V IG+ Sbjct: 518 TKTREER-----------KMLIVVTDGQPQSAPACRSVIDLCE---RSDVEVIGIGV 560 >gi|158318118|ref|YP_001510626.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158113523|gb|ABW15720.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 946 Score = 43.2 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 33/221 (14%), Positives = 67/221 (30%), Gaps = 52/221 (23%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 F +S + P+ S +D +I++DVSRS++ I + + Sbjct: 55 FVGGARAAASSGSLSGPLPGSAGAA--DSGAMDAVILVDVSRSLDEPT------IALEAR 106 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNK-----------IEEFFLLEWGVSH------- 242 + V D++ + + F++ E L Sbjct: 107 AAG-----VIAATDLSTRTRIAVTAFASMGPSGSSTARPAAETLCPLAPVDDPDDRDRIS 161 Query: 243 --LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 L ++ + G T+ LK ++ + T +K+I +TDG Sbjct: 162 GCLSGIMRRTAGQGQDTDYVTALKSGFDTLLSA------PATGSGPARKVIFILTDGQLS 215 Query: 298 ----ENLSTKEDQQSL------YYCNEAKKRGAIVYAIGIR 328 ++ +S A+K G ++ G Sbjct: 216 TAGTDSSGGSSTDRSESGQIRDEILPAARKAGIEIWPFGFG 256 >gi|317125804|ref|YP_004099916.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589892|gb|ADU49189.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 568 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 34/193 (17%), Positives = 60/193 (31%), Gaps = 23/193 (11%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE--- 232 + LDVS SM D T++ + ++ ++ I + + GL +S I + Sbjct: 377 LALDVSGSMLRRTDQG-TRLAVMQEA------TLQAIAGMPGSSRLGLWAYSLHIGKQGD 429 Query: 233 ---FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYK 288 L V H + + G T S G + Y+ I Q R N Sbjct: 430 DFRPLLNAAPVGHSSHLLDLRKQVGGLTRSVGGGRGLYDTIVATYQRARATYTKGQLNS- 488 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRV-IRSHEFLRACA-SP 342 +V +TDG N S+ K + + +G + + Sbjct: 489 --VVIVTDGLNDDDYGASLSVALSRVKKLVDPRNPIRITIVGFGSEPDAKAMTPFAQLTG 546 Query: 343 NSFYLVENPHSMY 355 + P + Sbjct: 547 GRYVNAAEPKDLL 559 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 56/215 (26%), Gaps = 18/215 (8%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + PQ S P + + M +T + Sbjct: 5 SVAAFADDRQPAATADPQAATASA---GNVDAPQHTKRISKNDDGTYTLSMDVTGKSDES 61 Query: 166 SQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ LD+ +VLDVS SM + ++ + S+N + V+ + Sbjct: 62 TEQQVVPLDIALVLDVSGSMNE-LSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAI 120 Query: 225 TFSNKIEEFFLLEWGVSH-------------LQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 ++ G + + +++ LK A Sbjct: 121 SWGRCTTWQDQDSAGQKYTVTYNWIGGPSASVSPDVQFYKSKQSEETRLDALKDAVTYFL 180 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 D + + ++ + G+N + Sbjct: 181 DQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGND 215 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 54/177 (30%), Gaps = 22/177 (12%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + K + ++TA+ + I +LGM ++ + K L Sbjct: 6 LSRFVRDRKANIAVITALVMIPIIFLLGMTLDFTQALRKKQQL-----------DAAADA 54 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 R + + + N + + + T + I V Sbjct: 55 AAIAAVRPAMLMQTDAVAQNTAYAIFMST---ANRLASGLTSVPTPT-ITITDVGLQRTV 110 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 +S + P + ++ I+ + + + ++ +++D S SM Sbjct: 111 KVSYNAAS------LNNFPQLLMN-NVSWAISGASTAQASSAPNMNFYLLMDDSPSM 160 >gi|134047208|ref|YP_001101986.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|237810014|ref|YP_002894453.1| hypothetical protein pAR060302_0107 [Escherichia coli] gi|237810210|ref|YP_002894649.1| hypothetical protein pAM04528_0113 [Salmonella enterica] gi|133905127|gb|ABO41142.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|229561694|gb|ACQ77896.1| conserved hypothetical protein [Salmonella enterica] gi|229561869|gb|ACQ78070.1| conserved hypothetical protein [Escherichia coli] gi|327536555|gb|AEA95388.1| hypothetical protein pSD853_174_122 [Salmonella enterica subsp. enterica serovar Dublin] gi|332144451|dbj|BAK19671.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 598 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 22/177 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ ++ IP Sbjct: 403 VTGDTRVFLKPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIP 462 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF + F +++ G + R ++ K T + YA ++ Sbjct: 463 GVNP-----AVTFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFEL 517 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R K+++ +TDG+ S + + C + V IG+ Sbjct: 518 TKTREER-----------KMLIVVTDGQPQSAPACRSVIDLCE---RSDVEVIGIGV 560 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 36/310 (11%), Positives = 104/310 (33%), Gaps = 30/310 (9%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 ++ G++ K+ + G+ + + + G + ++ +PQ+ Sbjct: 24 SQETGSSGKQNENGEGESKPQKIATTIEEIVEQGAGTYSGDKYDQAKVEAELDKIPQDAS 83 Query: 127 ----YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 ++ + + + I + +VN + + ++ I+LD S Sbjct: 84 DEEVFNTIVSLIAEDYGPVKDAYDNFDPTFKISGDQPGT-EVNGPEEKQHNVTILLDASG 142 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE----- 237 SM + K+ +A +++ + ++ +V+ +V G ++K ++ + Sbjct: 143 SMAARVSGG-EKMQVAKEAVRSFTSQMPEGTNVSLIVY-GHKGSNSKADQAESCKGIEEI 200 Query: 238 -----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + S LQ K+ + G T + A ++ + +G ++ +I Sbjct: 201 VELGPYNESTLQSKLDPIRATGW-TPLAGAMNQAGQRLKETEGQAEN----------VIY 249 Query: 293 FMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVEN 350 ++DG E +++ K + + + A ++ + Sbjct: 250 VVSDGLETCGGDPVKEAKSLNQSNIKATVNIIGFDVGNKEHQALKKVAEAGGGKYFSATS 309 Query: 351 PHSMYDAFSH 360 + F + Sbjct: 310 KTELDLYFRN 319 >gi|303252178|ref|ZP_07338346.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302648961|gb|EFL79149.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 532 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 81/239 (33%), Gaps = 22/239 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRK--------------KLKGGDILCRIKNTWNMSFRNELRDNGF-VNDIDDIVRS 113 N+ K + G ++ + +F + + + + + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGKRDSQMATKFVKAFLPQTNEEKMHLTPVCKTINN 129 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 T+ ++S +K + + S + + +D Sbjct: 130 TNGKGHTSSSEVTCTVSGTIEHKSWF-PLKVGTVEVIPHEVNVASKSKAFKKNTFNIPID 188 Query: 174 MMIVLDVSRSME------SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +M+V D+S SM ++ +K+ + + + E+ L D N + + F Sbjct: 189 LMVVADLSGSMRYDITNKYETNNETSKLGILKDVLIELAEKTLLSEDANQHNRIYVTPF 247 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 7/43 (16%), Positives = 21/43 (48%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVL 50 R F + +G + ++ A+ L + ++I+V ++ + + Sbjct: 10 FRYLFKDERGNVLVIFAVALIALLGFAAIVIDVGGMYVERRSM 52 >gi|168213186|ref|ZP_02638811.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] gi|170715195|gb|EDT27377.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] Length = 1349 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 57/144 (39%), Gaps = 32/144 (22%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++VLD S+++ K K+ N + + + N + G+++++ K E Sbjct: 74 DVVLVLDTSQTVNE-------KQGDIEKAANDFVNKFLDSDNYKNF-KIGIISYNEKAEV 125 Query: 233 FFLLEWGVSHLQRKI---------------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L + ++ I +L+ + TN + A + + Sbjct: 126 VQELTMDKNKIKSSIADSYKDAIKDKRSYNNWLNGNHLGTNVGDAFRLAISML------- 178 Query: 278 QHCNTEDANYKKIIVFMTDGENLS 301 + N +KI++FM++G+ + Sbjct: 179 --GKDSNPNKEKIVIFMSNGKPNA 200 >gi|163842175|ref|YP_001626580.1| von Willebrand factor type A domain-containing protein [Renibacterium salmoninarum ATCC 33209] gi|162955651|gb|ABY25166.1| von Willebrand factor type A domain, putative [Renibacterium salmoninarum ATCC 33209] Length = 267 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 55/179 (30%), Gaps = 26/179 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + N D+M+ LDVS SM D+ +D+ E Sbjct: 15 LAASRPAEQNANRPEQNNRDIMLCLDVSGSM---VDTDAKIVDVFANLAQEFHGE----- 66 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK-YLSKFGVSTNSTPGLKYAY----N 268 + GLV F + + F L +++ ++ L ++ T Y + Sbjct: 67 ------RLGLVIFDSTAVQVFPLTEDYGYIKDELNVALKAMTDQSDDTGFFGGTYSGRGS 120 Query: 269 QIFDMQGM---RQHCNTEDANYKKIIVFMTD----GENLSTKEDQQSLYYCNEAKKRGA 320 + + IVF TD G+ L + D L N + G Sbjct: 121 SLIGDGLATCVNSFPKLGAEQRSRSIVFATDNVLLGKPLFSLTDAAGLATKNSIRVYGI 179 >gi|17567601|ref|NP_508552.1| hypothetical protein F40F4.6 [Caenorhabditis elegans] gi|1065514|gb|AAA81430.1| Hypothetical protein F40F4.6 [Caenorhabditis elegans] Length = 2214 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 76/235 (32%), Gaps = 23/235 (9%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 T ++ P+ ++ N T A+ D++ ++D S+S +S FDS K Sbjct: 1983 PVTQQTQVPGTQPTQGPVATT--QNPYTSAQFDVVFMIDGSQSAQSSFDS-------LTK 2033 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI----KYLSKFGVS 256 + + + + GL+ I ++ L + G Sbjct: 2034 FVQTFMVSFNVGQSGA---RVGLIVVGGDITNPIPPAANLNSLSSQAMLNSNLAQLSGGY 2090 Query: 257 TNSTPG---LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 T+ L Y + M + +++++T T + Sbjct: 2091 TDFEDAGQILNYTLQIVSSPDFMAANNGYRSGISNHVLIYLTTTTAFDTDPTPAAQTILA 2150 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNS-FYLVENPHSMYDAFSHIGKDIVT 367 + K+ G I IG + L+ + ++ + + S+ + I + I+ Sbjct: 2151 Q-KQYGII--TIGYGGATDNNKLQTISGGSACSFTAPDFASLNNQIKTIQQLILN 2202 >gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016] gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016] Length = 688 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 77/220 (35%), Gaps = 34/220 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + I + T +++ R D + VLD S SM T + + + + Sbjct: 284 PQQSERGTIKLTFTPGDDLSAIQQGR-DWVFVLDKSGSMS---GKHATLTEGVKRGLGKL 339 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKF--GVSTNSTP 261 + + + F N+++E + +++ + I+ +++ G TN Sbjct: 340 --------PSGDRFRI--LMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYD 389 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L+ A + + D++ I+ +TDG ++ + + ++ Sbjct: 390 ALERAVSGL-------------DSDRTTGIILVTDG--VANVGVTEKKQFLKLMQRYDVR 434 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +Y + + L ++ + + + D HI Sbjct: 435 LYTFIMGNSANTPLLEPMTQVSNGF-ATSISNSDDILGHI 473 >gi|320159019|ref|YP_004191397.1| hypothetical protein VVM_02412 [Vibrio vulnificus MO6-24/O] gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O] Length = 688 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 77/220 (35%), Gaps = 34/220 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + I + T +++ R D + VLD S SM T + + + + Sbjct: 284 PQQSERGTIKLTFTPGDDLSAIQQGR-DWVFVLDKSGSMS---GKHATLTEGVKRGLGKL 339 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKF--GVSTNSTP 261 + + + F N+++E + +++ + I+ +++ G TN Sbjct: 340 --------PSGDRFRI--LMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYD 389 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L+ A + + D++ I+ +TDG ++ + + ++ Sbjct: 390 ALERAVSGL-------------DSDRTTGIILVTDG--VANVGVTEKKQFLKLMQRYDVR 434 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +Y + + L ++ + + + D HI Sbjct: 435 LYTFIMGNSANTPLLEPMTQVSNGF-ATSISNSDDILGHI 473 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 53/156 (33%), Gaps = 25/156 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V D+S SM + T+I S+ + ++ + GLV++S+K+ Sbjct: 387 VFVTDISGSM------NGTRIKSLKNSLLSTMQYIDSSS------YIGLVSYSDKVYINL 434 Query: 235 LLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + K G T + + + + DAN + Sbjct: 435 PIAQFDNKQRAYFSGAVKDLDVGGQTATYDAVLVGMQMLMEKS-----KEVPDANMMLFV 489 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 ++DG + E ++ G VY IG Sbjct: 490 --LSDGAQNAGFELKRITPIVGGL---GISVYTIGY 520 >gi|198415004|ref|XP_002120336.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 652 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 81/232 (34%), Gaps = 32/232 (13%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLE 207 + I + + K + ++++DVS SM+ D S T +D K I Sbjct: 29 PTLKEIEVVMELEAKGKPEAGIFNRFVLLIDVSGSMDHTADGQSCTLLDRMKKFIELF-- 86 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLK 264 I + ++ G+VTFS+ L+ + + K + TN + GL Sbjct: 87 ----IDNASDTSWIGIVTFSSTANVIMELKQMTAEAKVFAKTTVLSLTTESRTNISAGLF 142 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-----YYCNEAKKRG 319 A I + I+ TDG D +L ++++ Sbjct: 143 MALEVI--------QKLKPSRDC---IIVFTDGVANEGIVDSGTLIQEYKRISLQSRETN 191 Query: 320 AIVYAIGIRVIRSHEFLRACAS---PNSFY--LVENPHSMYDAFSHIGKDIV 366 + AI + + +FL A A+ ++FY + + I K++ Sbjct: 192 IPISAITVEGYK-PDFLYAIATELGSDAFYCLNINSAFEADMMIPQITKEMT 242 >gi|118346119|ref|XP_977009.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288305|gb|EAR86293.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 685 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 49/293 (16%), Positives = 108/293 (36%), Gaps = 31/293 (10%) Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP---LKFCTFIPWYTN 150 N + + DI S + I+ Q ++ +Y + ++ T Sbjct: 4 LSENKETNYKLIEATDIPYSQKMKIMDNLQINSKFLTQYPKYHLQGNKVERKTNAVIQLE 63 Query: 151 SRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 ++ ++ ++ +N Q + ++ I++D S+SM S K+ + + ++E+ Sbjct: 64 FSKLLENVSQNMTLNQQENIQILNKYCILIDRSQSMMSE-----NKLQNVKQYLCNLIEK 118 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLK 264 Q L++F + + F + ++ +I + G TN L+ Sbjct: 119 ANTNS------QFALISFGSSQKLIFNFTQVTHENLESIKGQINNIISTG-DTNIIQALE 171 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIV-----FMTDGENLSTKEDQQSLYYCNEAKKRG 319 A+N I + +Q N ++ KK IV +TDG++ ++ + K Sbjct: 172 VAHNII---KQDQQLENQKEEQTKKRIVRYSAFLLTDGQDNMKEKAIFKFRENFKNKDMD 228 Query: 320 AIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + +G + L A S FY ++ S++ F K+ + Sbjct: 229 YSINCLGFGIDHDPLLLGAITSYTGGKFYYIKPEESVFSVFQDYIKNQQETLV 281 >gi|5305316|gb|AAD41583.1|AF057703_1 structural toxin protein RtxA [Legionella pneumophila 130b] Length = 1208 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 98/293 (33%), Gaps = 37/293 (12%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 ++ + +++ N N+ L+ + R++ S + + ++ + DI + Sbjct: 9 AVNASGGYLISGLNLNDTITLRSANGYDRVEIENARSGAHGVSNSSLNGETFDIGLFSYN 68 Query: 117 DIVVVPQNEGYSI----------SAISRYKIPLKFCTFIPWY-------------TNSRH 153 I P ++ S +I L F Sbjct: 69 TIKTTPSEININMGLSLTDSDGDKITSSIEINLAPSVFKVGENVDDTSSSNVLHRVGGDT 128 Query: 154 IVMPITSSVKVNSQTDARLDM-------MIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 V+ + + +++ VLD S SM F T++++ +++ +L Sbjct: 129 GVVDGSGGADILVGDVGGVEIVGTTARIAFVLDESGSMGQNFGG-TTRLEVLKQTMTDIL 187 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 E+ P+ + V LV F++ + E LQ+ + ++S G+ Sbjct: 188 TELSNTPNASITVH--LVKFASVVNGTGTFEITGGELQQALDFIS--GLQIQQGLLAGTN 243 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 Y + + A+ ++ + F TDG D S Y N A+ G Sbjct: 244 YEAALGQTLQWYNSQSGTADVQQTL-FFTDGAPT-FYMDGNSTEYTNIARVYG 294 >gi|6225079|sp|Q9ZGE6|BCHD_HELMO RecName: Full=Magnesium-chelatase 67 kDa subunit; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase gi|3820559|gb|AAC84032.1| Mg chelatase subunit D BchD [Heliobacillus mobilis] Length = 666 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 60/183 (32%), Gaps = 23/183 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 +N ++ A R P + R +++ + + ++ V+D S Sbjct: 424 GKNSRVAVDATLRAAAPYQRQRRESGQYGDRQVIVTNSDIRAKQFVRKSGALIIFVVDAS 483 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGV 240 SM ++ A +++ +L E + N + L+ F + E V Sbjct: 484 GSMAFN------RMSSAKGAVSVLLNEAYV-----NRDKVALIIFRGQQAETLVPPTRSV 532 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV-FMTDGEN 299 +++ + G + + A N+ ++ ++I+ +TDG Sbjct: 533 ELAKKRFDQVP-VGGGSPLAGAIAQAIE---------VGVNSIGSDVGQVIITLITDGRG 582 Query: 300 LST 302 Sbjct: 583 NVP 585 >gi|313204752|ref|YP_004043409.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312444068|gb|ADQ80424.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 626 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 23/200 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ S+ + + +LDVS SM+ K+++ SI + ++ V VV +G Sbjct: 252 EIPSENLPASNFVFLLDVSGSMD-----VPNKLELVKSSIKLLTNNLRKTDRVAIVVYAG 306 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + + L G + G++ AY +I + + N Sbjct: 307 AAG----VVLESTEGTDKQKIMEAVDGLHAGGST-AGGAGIQLAY-KIAEKNFIENGNNR 360 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACA 340 IV TDG+ N+ + + +K G + +G + + L+ A Sbjct: 361 --------IVLCTDGDFNVGVSSNNELESLIESKRKTGVYLTVLGYGMGNYKDNKLQILA 412 Query: 341 S--PNSFYLVENPHSMYDAF 358 + ++N Sbjct: 413 EKGNGNHAYIDNIQEANKVL 432 >gi|163850829|ref|YP_001638872.1| vault protein inter-alpha-trypsin subunit [Methylobacterium extorquens PA1] gi|163662434|gb|ABY29801.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium extorquens PA1] Length = 732 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 65/202 (32%), Gaps = 32/202 (15%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F + ++ +T +S D++ V+D S SM + A S+ Sbjct: 315 FRERVAGAEAVLAVVTPPESASSAASVPRDVVFVIDNSGSMGG------ASMRQAKASLL 368 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTN 258 L+ + + ++ F + + F + + + L G + Sbjct: 369 IGLDRLGAHD------RFNVIRFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEM 422 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 P +Q + E+ + +VF+TDG + + S + R Sbjct: 423 LAP-----------LQAALRGATPEETGRLRQVVFLTDGA-IGNEAQIFSAIATERGRSR 470 Query: 319 GAIVYAIGIRVIRSHEFLRACA 340 ++ +GI + +R A Sbjct: 471 ---LFMVGIGSAPNGYLMRHAA 489 >gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6] gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6] Length = 688 Score = 43.2 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 77/220 (35%), Gaps = 34/220 (15%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + I + T +++ R D + VLD S SM T + + + + Sbjct: 284 PQQSERGTIKLTFTPGDDLSAIQQGR-DWVFVLDKSGSMS---GKHATLTEGVKRGLGKL 339 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKF--GVSTNSTP 261 + + + F N+++E + +++ + I+ +++ G TN Sbjct: 340 --------PSGDRFRI--LMFDNRVQEITNGFIAVNQNNVTQAIETINQIATGGGTNLYD 389 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 L+ A + + D++ I+ +TDG ++ + + ++ Sbjct: 390 ALERAVSGL-------------DSDRTTGIILVTDG--VANVGVTEKKQFLKLMQRYDVR 434 Query: 322 VYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 +Y + + L ++ + + + D HI Sbjct: 435 LYTFIMGNSANTPLLEPMTQVSNGF-ATSISNSDDILGHI 473 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 54/176 (30%), Gaps = 13/176 (7%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R+FF + G +T+L ++ P+ V + I++S + +K L + A + Sbjct: 1 MRHFFRDRAGQITVLASLLSPVGLGVAALAIDLSTLQMVKQRLK--VTADAASLAAVAVL 58 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 G + +++ +++ + + P N Sbjct: 59 PDTGTALSRALAIAADNAGAGAGSVTTAADVQFGSYDSATRTFTAGA------TPANA-V 111 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 ++A T + ++ S +VLD S S Sbjct: 112 QVTASRSEARGNPVITGFARALGWATPDLSASAVAVRFS----PAYCFLVLDPSAS 163 >gi|307176122|gb|EFN65820.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Camponotus floridanus] Length = 1261 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 56/355 (15%), Positives = 122/355 (34%), Gaps = 49/355 (13%) Query: 32 LVLGMIIEVSHIFFMK-TVLHSMIDRSLVHAATQI-MNEGNGNNRKKLKGGDILCRIKNT 89 LV+ + ++ + K + + ++D + A + + ++ K I Sbjct: 70 LVIEIAKDIKRMMESKISAIKRIMDTAEQSALSSMDVDPPESYKFINAKNNSIELEYSAH 129 Query: 90 WNMSFRN-----ELRDNGFVNDIDDIVRSTSLDIVVVPQN-EGYSISAISRYKIPLKFCT 143 + + + N + D++R+ + P + Y+ ++ Sbjct: 130 FGGNVNTTKSTVHVPTNVYER-ASDVIRAIEWSEKLDPIFVDNYNRDPSLSWQYFGSASG 188 Query: 144 FIPWYT----NSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMA 198 F+ Y + + + + + D++I++D S SM + ++A Sbjct: 189 FMRQYPAMSWYMDPVDLFDCRTRSWYIEAATSPKDILILMDTSGSMTGM------RREIA 242 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL---------LEWGVSHLQRKIKY 249 IN +L+ + N+ V +TFSN +E V L+ I+ Sbjct: 243 RHVINNILDTL----GNNDFVNI--ITFSNVTKEVVPCFSDTLVQANLANVRELKLAIEK 296 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 L + N + L A+ + + R+ A I+ +TDG + KE ++ Sbjct: 297 LVTEKI-ANFSLALTTAFELLETFRTEREGAKCNQA-----IMLITDGVPYNFKEIFEAY 350 Query: 310 YYCNE----AKKR-GAIVYA--IGIRVIRSHEF-LRACASPNSFYLVENPHSMYD 356 + + K V+ IG V E ACA+ + + P + + Sbjct: 351 NWKDNPDEPLKADMPVRVFTYLIGREVADVREVQWMACANRGYYVHLCTPAEVRE 405 >gi|239978028|ref|ZP_04700552.1| hypothetical protein SalbJ_01235 [Streptomyces albus J1074] gi|291449931|ref|ZP_06589321.1| predicted protein [Streptomyces albus J1074] gi|291352880|gb|EFE79782.1| predicted protein [Streptomyces albus J1074] Length = 521 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 45/140 (32%), Gaps = 22/140 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D +D +++D S SM + + D ++ L + V L+TF Sbjct: 53 ADQPVDYAVLVDTSGSMRTKG-----RYDTVRSTLRGFLGGLSRSDHVA------LITFD 101 Query: 228 NKIEEFFLLEWG-VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ E ++ G + ++ T+ L A ++ Sbjct: 102 DRPEARYVGSAGDPGKIVGRLPKSPDPDGGTDIGAALDLALRELERSDAANVAS------ 155 Query: 287 YKKIIVFMTDGENLSTKEDQ 306 +V +TDG + + Sbjct: 156 ----VVMLTDGRHEPPRSTA 171 >gi|148657455|ref|YP_001277660.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569565|gb|ABQ91710.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 429 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 84/226 (37%), Gaps = 26/226 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P T + +V+P T V++ S A ++ I+LD S SM + D + + +A +++ Sbjct: 141 PTATPAASVVLPGTLGVRMASGERLPGATRNLAIILDASGSMLARIDGAPKTV-IARQAL 199 Query: 203 NAMLEEVKLIPDVNNVV----QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 A++E + +V ++ + + ++ ++ + L +I + Sbjct: 200 IALVERLPATTNVALRTYGHRRADDCSDTELVQAPAPIQ--RADLINRINAIRPVNGG-R 256 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + + DM + E IV ++DG+ + + + A R Sbjct: 257 TPIA-----QSLEDMARDLAGVDGEVL-----IVLVSDGDETCGGDPVATAAALHTANPR 306 Query: 319 GAIVYAIGIRVIRSH--EFLRACAS--PNSFYLVENPHSMYDAFSH 360 V IG + + L A+ +++ N + DA Sbjct: 307 -LRVSVIGFNIEQEEWRRRLEGIAAYGGGAYFDAANAVQLADALEQ 351 >gi|289621347|emb|CBI52130.1| unnamed protein product [Sordaria macrospora] Length = 755 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 61/157 (38%), Gaps = 13/157 (8%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSI-NAMLEEVK-LIPDVNNVVQSG--LVTFS 227 D++ +LD + SM SS + +S + M++ ++ ++ V + + G + F Sbjct: 522 FDVVFLLDDTGSMMEAVPSSTDHCETRERSKWDEMIDSLRYIVDIVTHYDRDGVDVHFFC 581 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG-----LKYAYNQIFDMQGMRQHCNT 282 N+ ++ F + G L +S + + + AY F+ R Sbjct: 582 NEDKDEFSIRDGQRVLDLLTNEVSPDEIGGGTFIAERLWTILTAYIDRFENYRRRMGERA 641 Query: 283 EDANYKKI--IVFMTDGENLSTKEDQQSLYYCNEAKK 317 K+ ++ +TDG ++ + + A++ Sbjct: 642 PPPEKPKMLNLIVITDGAADDKDGVEEVIV--DAARR 676 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 75/226 (33%), Gaps = 26/226 (11%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARL---DMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 P T+ V+P T V++ ++ ++LD S SM + D + + +A +++ Sbjct: 141 PTATSLPDAVLPGTLGVRLAPGERTTRITENIAVILDASGSMLARLDGTPKTV-IARQAL 199 Query: 203 NAMLEEVKLIPDVNNVV----QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 A++ + +V ++ + + I+ L+ L +I + Sbjct: 200 IALINRLPETTNVALRTYGHRRADDCSDTELIQALAPLQ--RDALIARINAIRPVNGG-R 256 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + + DM IV ++DG+ + + A Sbjct: 257 TPIA-----QSLADMAQDLAGIEGNVL-----IVLVSDGDETCGGDPVATASMLRAA-NS 305 Query: 319 GAIVYAIGIRVIRSHEFLR----ACASPNSFYLVENPHSMYDAFSH 360 + IG V + R A A +++ N + DA Sbjct: 306 QLRISVIGFDVEQEEWRRRLEGIAVAGGGAYFDASNAEQLADALDQ 351 >gi|319790717|ref|YP_004152357.1| nad(+) ADP-ribosyltransferase [Variovorax paradoxus EPS] gi|315593180|gb|ADU34246.1| NAD(+) ADP-ribosyltransferase [Variovorax paradoxus EPS] Length = 689 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 48/134 (35%), Gaps = 20/134 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V + + D + V+D+S SM F +D A + ++ ++ N ++ SG Sbjct: 314 VAASAISPRDYIFVVDISGSMHGFP------LDTAKTVLERLIGGLRPSDTFNVLLFSGS 367 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + IK S G ST P LK Y + Sbjct: 368 NKMLSPKSVPAT-RANIEQALATIKNYSGSG-STELIPALKRVY------------AEPK 413 Query: 284 DANYKKIIVFMTDG 297 + N + +V +TDG Sbjct: 414 EENVSRTVVVVTDG 427 >gi|289425109|ref|ZP_06426886.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154087|gb|EFD02775.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] Length = 322 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 53/160 (33%), Gaps = 17/160 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ +D S SM++ D S T++ A + + +V +V+ S E Sbjct: 96 IVVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLPTGFNVA------VVSISEHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + R + + + T + + + G ++ IV Sbjct: 149 MLPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAVKMAPGGSKNPAPAA------IVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++DG+N + AK VY I + Sbjct: 202 LSDGDNTQGGSPLVAANRAAAAK---VPVYTIAFGTETGY 238 >gi|167752255|ref|ZP_02424382.1| hypothetical protein ALIPUT_00498 [Alistipes putredinis DSM 17216] gi|167660496|gb|EDS04626.1| hypothetical protein ALIPUT_00498 [Alistipes putredinis DSM 17216] Length = 294 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 11/103 (10%) Query: 167 QTDARLDMMIVLDVSRS-MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + L MM+++DVS S M D + I ++ A + N+ V G + Sbjct: 74 EEERELTMMLLVDVSGSRMFGTTDRLKKNLQTEIAAVLAF-----SASENNDKV--GCIF 126 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKY 265 FS++IE+F + G SH+ I+ L F T + L++ Sbjct: 127 FSDRIEKFIPPKKGRSHILAIIRELVGFEPQSRGTRISEALRF 169 >gi|159898662|ref|YP_001544909.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159891701|gb|ABX04781.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 610 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 63/204 (30%), Gaps = 27/204 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + V+D S SM + +++M ++ + +++ Sbjct: 257 SIEVADRKPAALTFVIDTSGSM-----AQDNRLEMVKNALIYLAGQLEPDDS------LA 305 Query: 223 LVTFSNKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +V F++ + G + + I L G STN+ GL + + Sbjct: 306 IVAFNDGMRVVLNPTSGENQMDIITAINSLEPAG-STNAEAGLYKGFELAWQAFKPEGIN 364 Query: 281 NTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRA 338 I+ +DG N E Q L + G + G+ + ++ L Sbjct: 365 R---------ILLCSDGVANSGMTEPSQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQ 415 Query: 339 CAS--PNSFYLVENPHSMYDAFSH 360 A ++ ++ F Sbjct: 416 LADKGDGNYAYFDSADEAQRLFGE 439 >gi|154497289|ref|ZP_02035985.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] gi|150273688|gb|EDN00816.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] Length = 234 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 65/174 (37%), Gaps = 14/174 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 R+ + + LDVS SM I+ + L+E++ ++ +VTF Sbjct: 16 NPTPRVPVGLCLDVSSSMYG------QPIEELNAGLRQFLDELRKDELTCTSAETAVVTF 69 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + Q +++ L G+ T GL A + + Q ++ Sbjct: 70 GS---SAQCVADFATADQIQVEPLEANGL-TYMGEGLTMALDLL--EQRKERYKAAGVDY 123 Query: 287 YKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 Y+ I+V M+DG + + C + R V A+GI E LR Sbjct: 124 YQPILVVMSDGCPNGDPRVLREAAQRICQMVEARRLTVVAVGIGEGADMEQLRR 177 >gi|332828898|gb|EGK01581.1| hypothetical protein HMPREF9455_02113 [Dysgonomonas gadei ATCC BAA-286] Length = 609 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 75/192 (39%), Gaps = 25/192 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW + +R + + + + + S + + ++DVS SM S T++D+ S+ + Sbjct: 216 PWNSQNRLVKVGLKAK-SLASDNLPASNFVFLIDVSGSM-----SGPTRLDLVKSSLKLL 269 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGL 263 + ++ + +V +++ E G ++ + LS G + G+ Sbjct: 270 VNNLRKKD------RVAIVVYASSTGEVLPSTSGENKQKIKEALNNLSAGGST-AGGAGI 322 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIV 322 + AY +I ++ N I+ TDG+ N+ ++ +K G + Sbjct: 323 QLAY-KIAKQNFIKGGNNR--------IILCTDGDFNVGVSSNEGLQRLIENERKTGVFL 373 Query: 323 YAIGIRVIRSHE 334 +G + + Sbjct: 374 SILGYGMGNYKD 385 >gi|212635869|ref|YP_002312394.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] Length = 710 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 67/195 (34%), Gaps = 35/195 (17%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +VM + K+ A ++++V+D S SM S I+ A SI L + Sbjct: 329 YSLVMLLPPQDKMRLSALAPRELILVIDTSGSM------SGEAIEQAKASIIYALAGLSA 382 Query: 212 IPDVNNVVQSGLVTFSNKIE--EFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 N ++ F++ + L + Q ++ L G + S Sbjct: 383 QDSFN------ILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMS------- 429 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + ++ ++F+TDG + Q +++ ++ I Sbjct: 430 ----LALDKALSQQDANRERLRQ-VLFITDGAVGNEPQLFTQIRNQLQQSR-----LFTI 479 Query: 326 GIRVIRSHEFLRACA 340 GI + F++ A Sbjct: 480 GIGDAPNAHFMQRAA 494 >gi|195999034|ref|XP_002109385.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] gi|190587509|gb|EDV27551.1| hypothetical protein TRIADDRAFT_53363 [Trichoplax adhaerens] Length = 356 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 60/184 (32%), Gaps = 33/184 (17%) Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 Y+ P YT ++ +S+ +++VLD S SM Sbjct: 183 NIFYQYPTTQRYCKTNYTTETKFKQWYVNAASPSSKR-----LVLVLDRSGSMS---GDR 234 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGVSHLQRKI- 247 K+ A ++ + + + G++ F ++I + + Q I Sbjct: 235 FLKVKEAATAV------LDSLGPNDE---IGVIAFDDEIRIHGGCKVTTVSPATPQSIIF 285 Query: 248 ------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + ST P LK+A++ + + T +IVF+TDG Sbjct: 286 LKDFINNKIQPEFGSTGYVPALKHAFDMLSTNMTSKAKTKTN------LIVFLTDGHPDE 339 Query: 302 TKED 305 + Sbjct: 340 PESQ 343 >gi|154492263|ref|ZP_02031889.1| hypothetical protein PARMER_01897 [Parabacteroides merdae ATCC 43184] gi|154087488|gb|EDN86533.1| hypothetical protein PARMER_01897 [Parabacteroides merdae ATCC 43184] Length = 289 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S +++ K + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGS------RDFGSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S+KIE+F + G H+ I+ L TN LKY N I Sbjct: 126 SDKIEKFIPPQKGKKHILYVIRELIDFQPDKKQTNIAQALKYLTNAI 172 >gi|307305135|ref|ZP_07584884.1| von Willebrand factor type A [Sinorhizobium meliloti BL225C] gi|306902475|gb|EFN33070.1| von Willebrand factor type A [Sinorhizobium meliloti BL225C] Length = 631 Score = 43.2 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 92/223 (41%), Gaps = 35/223 (15%) Query: 160 SSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S +++ + + L + I++DVS S +++FD + +D+ +++ + + D + Sbjct: 427 GSDRIHMMSRPQAHDLAVTILVDVSLSTDAWFD-DLRVLDVEKQALQVLAHGLSACGDAH 485 Query: 217 NVVQSGLVTFSNK------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 ++ TF+++ IE + +S ++ +I L G T +++A Sbjct: 486 EIL-----TFTSRRRDWVRIETVKAFDEAMSATIEARIAALKP-GYYTRIGAAIRHAAAG 539 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVY 323 + E N +K+++ +TDG+ + S EA++ G V+ Sbjct: 540 LV-----------ERPNRRKLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSGISVF 588 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + + +L N + +V N + A I + +V Sbjct: 589 GVTVDR-EAKSYLPVIFGQNGYAVVSNIGRLPAALPAIYRGLV 630 >gi|119501475|ref|XP_001267494.1| von Willebrand factor type A domain protein [Neosartorya fischeri NRRL 181] gi|119415660|gb|EAW25597.1| von Willebrand factor type A domain protein [Neosartorya fischeri NRRL 181] Length = 1074 Score = 43.2 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 60/192 (31%), Gaps = 29/192 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ + D S SM+S KI I +N L + N + +F +++ Sbjct: 341 IIFMADRSGSMKS-------KIPSLINVMNIFLRSLPEACSFN------IASFGSQVTWL 387 Query: 233 -FFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + + ++L + + T L + D R T Sbjct: 388 WPFSMRYSQTNLDVAAKHVDSFKANYGGTEIYGALH----SVLDHYNERNDVPTN----- 438 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++ +TDGE Q ++ +++GI SH + Y Sbjct: 439 --VILLTDGEVWDVDNVIQLVHRTASNGNSNIRFFSLGIGDQVSHRLVEGIGQQGGGYAE 496 Query: 349 ENPHSMYDAFSH 360 P S ++ Sbjct: 497 VVPESSMGSWQE 508 >gi|46403702|gb|AAS92908.1| putative TrkA-like nucleotide-binding protein [Sinorhizobium meliloti] Length = 631 Score = 43.2 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 92/223 (41%), Gaps = 35/223 (15%) Query: 160 SSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S +++ + + L + I++DVS S +++FD + +D+ +++ + + D + Sbjct: 427 GSDRIHMMSRPQAHDLAVTILVDVSLSTDAWFD-DLRVLDVEKQALQVLAHGLSACGDAH 485 Query: 217 NVVQSGLVTFSNK------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 ++ TF+++ IE + +S ++ +I L G T +++A Sbjct: 486 EIL-----TFTSRRRDWVRIETVKAFDEAMSATIEARIAALKP-GYYTRIGAAIRHAAAG 539 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVY 323 + E N +K+++ +TDG+ + S EA++ G V+ Sbjct: 540 LV-----------ERPNRRKLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSGISVF 588 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + + +L N + +V N + A I + +V Sbjct: 589 GVTVDR-EAKSYLPVIFGQNGYAVVSNIGRLPAALPAIYRGLV 630 >gi|16263145|ref|NP_435938.1| NorD accessory protein for nitric oxide reductase [Sinorhizobium meliloti 1021] gi|14523809|gb|AAK65350.1| NorD accessory protein for nitric oxide reductase [Sinorhizobium meliloti 1021] Length = 631 Score = 43.2 bits (100), Expect = 0.075, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 92/223 (41%), Gaps = 35/223 (15%) Query: 160 SSVKVNSQTDAR---LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S +++ + + L + I++DVS S +++FD + +D+ +++ + + D + Sbjct: 427 GSDRIHMMSRPQAHDLAVTILVDVSLSTDAWFD-DLRVLDVEKQALQVLAHGLSACGDAH 485 Query: 217 NVVQSGLVTFSNK------IEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 ++ TF+++ IE + +S ++ +I L G T +++A Sbjct: 486 EIL-----TFTSRRRDWVRIETVKAFDEAMSATIEARIAALKP-GYYTRIGAAIRHAAAG 539 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVY 323 + E N +K+++ +TDG+ + S EA++ G V+ Sbjct: 540 LV-----------ERPNRRKLLIVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSGISVF 588 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 + + + +L N + +V N + A I + +V Sbjct: 589 GVTVDR-EAKSYLPVIFGQNGYAVVSNIGRLPAALPAIYRGLV 630 >gi|229582324|ref|YP_002840723.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013040|gb|ACP48801.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 356 Score = 42.8 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 71/207 (34%), Gaps = 34/207 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V + S + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFVIYITPQQSSVTSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLL------YN 69 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +N L+ FSN E + + I + G +T + + N Sbjct: 70 LNEGNYVTLILFSNHPEIKYQ-----GPAKGIITFDVGKGYTTRLHEAVSFTIN------ 118 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 119 -----LAKQSQVPTK-IIMLTDGKPTDKR-NVKDYEKLDIPPNT--QIITIGIGNNYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFS 359 L+ A S FY +++ + + F Sbjct: 170 ILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|145540134|ref|XP_001455757.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423565|emb|CAK88360.1| unnamed protein product [Paramecium tetraurelia] Length = 522 Score = 42.8 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 45/306 (14%), Positives = 96/306 (31%), Gaps = 41/306 (13%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL-RDNGFVNDIDDIVRSTSLDIVVVPQNE 125 N+ + I + R + +++G ND D I + I Sbjct: 3 NQCTSGDNDSYANTTKAIVINEQFIEGERPAICQEHGKFNDDDAI----DVVITNESNYG 58 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-------LDMMIVL 178 S+S + + + + T +V ++ QT + +D++ ++ Sbjct: 59 RKSLSQNYMKQANYVLQDNVELKLSYSGLPTQGTQAVLLSVQTKNQAITIRQGIDLICLI 118 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S SM S K+ + KS+ +L+ ++ + L+ F ++ L Sbjct: 119 DHSGSM------SGEKMHLVKKSLKHLLKMLQPND------RLCLIEFDDQNYRLTRLMR 166 Query: 239 GVS----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 I + G T+ +K A + + R+ N + I + Sbjct: 167 ATQENMYKFLIAIDTIEANGA-TDIGNAMKMALSIL----KHRRFKNPIAS-----IFLL 216 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPH 352 +DGE+ + K+ + G + + A FY + Sbjct: 217 SDGEDEGAAGRVWNDIQSKNIKEPF-TINTFGFGRDCCPKIMSEIAHFKEGQFYYISEIS 275 Query: 353 SMYDAF 358 + + F Sbjct: 276 KIDECF 281 >gi|306820467|ref|ZP_07454103.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551542|gb|EFM39497.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 538 Score = 42.8 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 37/220 (16%), Positives = 68/220 (30%), Gaps = 24/220 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++++ IVLD S SM + + +++A SI +L+ + + GL Sbjct: 184 KVNENKQVNVEIVLDASGSMAKQINGQ-SMMNIAKNSITEVLKHLPK------NAKVGLR 236 Query: 225 TFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM----QGMR 277 F +K + + L I+ L+ +S + + I D Sbjct: 237 VFGHKGNNTDSGKTESCSANELIHPIETLNTSAISKALSSVEATGWTSIADSIKNGGEDL 296 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSHE- 334 E A IV TDG + K G V IG V + + Sbjct: 297 SKFKEEGAVNILYIV--TDG---IETCGGDPIEAAQTLKNSGTNVVLGIIGFNVNATQDA 351 Query: 335 -FLRACASPNSFYL-VENPHSMYDAFSHIGKDIVTKRIWY 372 + + Y + ++ I + W Sbjct: 352 VLKKIAEAGGGHYAIANDAGTLTSELYKITEASNNTYNWE 391 >gi|305665953|ref|YP_003862240.1| hypothetical protein FB2170_06715 [Maribacter sp. HTCC2170] gi|88710728|gb|EAR02960.1| hypothetical protein FB2170_06715 [Maribacter sp. HTCC2170] Length = 288 Score = 42.8 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 38/159 (23%), Positives = 67/159 (42%), Gaps = 24/159 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S + K I I+A L L + GL+ F Sbjct: 72 EEERELTMMLMVDVSGSELFGTTNQFKK--GVITEISATLAFSALQNNDK----VGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S+++E F + G +H+ R I+ L +F +N T + A + ++ Sbjct: 126 SDEVELFIPPKKGKTHVLRIIRELLEFKPKSNKTD-IAEALKYLTNVMK----------- 173 Query: 287 YKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D ++ +++L G VY Sbjct: 174 -KKAIVFVLSD---FIAEDYERTLKITGNKHDVTGIRVY 208 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 42.8 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 36/236 (15%), Positives = 81/236 (34%), Gaps = 31/236 (13%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 L + + ++M +T + D ++LD+S SM+ K Sbjct: 319 LDVLAYKESANKTGTLMMTLTPGDDLQPIQRGT-DWTLLLDISGSMQG-------KFQTL 370 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVS 256 I+ + L+ V V+ +++ + FL ++ + RK+ + G Sbjct: 371 IEGVKKGLKRFNPQDRVRVVL---FNDYASNLTGGFLPATQKNIAEIIRKLDLVLPNG-G 426 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK 316 T+ G+++A + + DA+ I +TDG + + + + K Sbjct: 427 THLMDGVRFALSGL-------------DADRTSAIWLVTDGVTN--VGETKQRKFVDLLK 471 Query: 317 KRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 ++ V+ + + L+A AS V N + + + + Sbjct: 472 QKDIRVFTFIMGNGANRPLLKAITKASNGFAINVSNSDDIIGQLEKAASKVTHEAL 527 >gi|71280467|ref|YP_270055.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146207|gb|AAZ26680.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 786 Score = 42.8 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 76/206 (36%), Gaps = 32/206 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S SM ++ A S+ L ++ N ++ F N E Sbjct: 395 DIIFIIDTSGSM------QAGSMEQAKSSLQLALLQLNNKDSFN------IIAFDNDTEL 442 Query: 233 FFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F + +S Q+ I LS G + Y + + M++ Sbjct: 443 LFPVTHMASAHNISKAQQFIDGLSANGGTEM--------YRPLSNALMMKKDKTQSSKAI 494 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSF 345 ++ IVF+TDG + + + + N A+ +Y +GI + F++ A S+ Sbjct: 495 RQ-IVFITDG---AVANEFELMQLLNTAQGDF-RLYTVGIGAAPNGYFMKKAAQFGRGSY 549 Query: 346 YLVENPHSMYDAFSHIGKDIVTKRIW 371 ++N + SH I + Sbjct: 550 VFIQNKSEVQRKMSHFMTKISQPALT 575 >gi|242348028|ref|YP_002995589.1| von Willebrand factor type A domain protein [Aeromonas hydrophila] gi|224831847|gb|ACN66978.1| von Willebrand factor type A domain protein [Aeromonas hydrophila] Length = 593 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 22/177 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ ++ IP Sbjct: 398 VTGDTRVFLKPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIP 457 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF + F +++ G + R ++ K T + YA ++ Sbjct: 458 GVNP-----AVTFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFEL 512 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R K+++ +TDG+ S + + C + V IG+ Sbjct: 513 TKTREER-----------KMLIVVTDGQPQSAPACRSVIDLCE---RSDVEVIGIGV 555 >gi|323474472|gb|ADX85078.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477209|gb|ADX82447.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 356 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 71/208 (34%), Gaps = 34/208 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V + S + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFVIYITPQQSSVTSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLL------YN 69 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +N L+ FSN E + + I + G +T + + N Sbjct: 70 LNEGNYVTLILFSNHPEIKYQ-----GPAKGIITFDVGKGYTTRLHEAVSFTIN------ 118 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 119 -----LAKQSQVPTK-IIMLTDGKPTDKR-NVKDYEKLDIPPNT--QIITIGIGNDYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFSH 360 L+ A S FY +++ + + F Sbjct: 170 ILKKLADRSSGKFYHIKDISELPNIFES 197 >gi|297475370|ref|XP_002707867.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] gi|296487070|gb|DAA29183.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] Length = 1111 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 62/171 (36%), Gaps = 31/171 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++IV+D S SM+ ++ +A +++ +L+ + N+ V + ++ Sbjct: 264 ATSPKDIVIVVDTSGSMKGL------RMTIAKHTVSTILDTL----GENDFVNI--IAYN 311 Query: 228 ---NKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + IE F H ++ + L GV L+ A+ + Q Q Sbjct: 312 DYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVD-RALREAFQILQQFQEAGQ 370 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 371 GSLCNQA-----IMLITDGAVEDYEPVLEKYNWPD----RKVRVFTYLIGR 412 >gi|198436180|ref|XP_002124514.1| PREDICTED: similar to von Willebrand factor A domain containing 3A [Ciona intestinalis] Length = 1107 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 62/171 (36%), Gaps = 29/171 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAML-EEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I++DVS SM D K I +++ E++ N+V + ++ + Sbjct: 867 VVILIDVSGSMSYNMD-------ELKKEITSLIWEQLNGNKTAFNIV---AFSNTSTKWQ 916 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + E S ++++S + + A + I Sbjct: 917 DSITESNQSACHDAVQWVSAL--TAHGGSATLKAIQVALADEEAEA------------IY 962 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACAS 341 +TDG+ + +L + K+ ++ I +++FL++ +S Sbjct: 963 LLTDGKP--DSSIKLTLSEASNLNKKNIPIHTISFNCDNREANDFLKSLSS 1011 >gi|194667489|ref|XP_001787557.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit [Bos taurus] Length = 1111 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 62/171 (36%), Gaps = 31/171 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++IV+D S SM+ ++ +A +++ +L+ + N+ V + ++ Sbjct: 264 ATSPKDIVIVVDTSGSMKGL------RMTIAKHTVSTILDTL----GENDFVNI--IAYN 311 Query: 228 ---NKIEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + IE F H ++ + L GV L+ A+ + Q Q Sbjct: 312 DYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVD-RALREAFQILQQFQEAGQ 370 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 371 GSLCNQA-----IMLITDGAVEDYEPVLEKYNWPD----RKVRVFTYLIGR 412 >gi|169831370|ref|YP_001717352.1| magnesium chelatase [Candidatus Desulforudis audaxviator MP104C] gi|169638214|gb|ACA59720.1| Magnesium chelatase [Candidatus Desulforudis audaxviator MP104C] Length = 670 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 35/234 (14%), Positives = 70/234 (29%), Gaps = 26/234 (11%) Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 N+ + G + + ++ + D S P G + A Sbjct: 375 NSNSSPEHGAPAVVAETVFAADEPFRVKRIEYERDRVLRKGSGRRSRTRTPTKAGRYVRA 434 Query: 132 ISR-------YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSR 182 R + L+ + + + S + R+ ++ V+D S Sbjct: 435 TLRRERDDLAFDATLRAAAPFQKQRARDGVAVAVESQDIREKVREKRIGNFLVFVVDASG 494 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM + ++ A ++ ++L + + G+V F + E L Sbjct: 495 SM-----GAQQRMVAAKGAVLSLL-----LDAYQKRDRVGMVAFKGEHAEVLLPPTNSVE 544 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 L + G T GL AY R H D N +++ ++D Sbjct: 545 LAERRLAELPTGGRTPLAAGLLKAYEV------ARAHLFK-DPNLSPLLIVISD 591 >gi|116329599|ref|YP_799318.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332488|ref|YP_802205.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122492|gb|ABJ80385.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127355|gb|ABJ77447.1| BatB [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 347 Score = 42.8 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 61/142 (42%), Gaps = 15/142 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + ++ +D++ ++DVS SM++ DS T++ + + ML + + G Sbjct: 82 EKKEESFKGVDILFLVDVSLSMQA-IDSPPTRLARFKEVLLRMLPALSGN-------RFG 133 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ F+ + + VS ++ L V T L A+++ + G + Sbjct: 134 MIVFAGSPFLYCPMTSDVSAFSDYVRGLDVDMVGDRGTD-LDGAFSKADALLGSEK---- 188 Query: 283 EDANYKKIIVFMTDGENLSTKE 304 +I++ +TDGE+ + + Sbjct: 189 --VFRNRILILVTDGEDQNDPD 208 >gi|224922718|dbj|BAH28825.1| NorD protein [Ochrobactrum anthropi] Length = 633 Score = 42.8 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 41/240 (17%), Positives = 92/240 (38%), Gaps = 41/240 (17%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 C F S H+ + ++ L + I++DVS S +++ D+ +D+ ++ Sbjct: 421 CDFAAGGQGSDHVHL-------MSRPKANDLAVTILVDVSLSTDAWIDNRRV-LDVEKEA 472 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNK--------IEEFFLLEWGVSHLQRKIKYLSKF 253 + + + D + + TF+++ + F +G S ++ +I L Sbjct: 473 LLVLANGIAACGD-----RCSIQTFTSRRRSWVRVETVKDFDETFG-SAVEHRIAALKP- 525 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQ 307 G T +++A ++ E N KK+++ +TDG+ + Sbjct: 526 GFYTRMGAAIRHATAKL-----------AEQPNRKKLLLVLTDGKPNDVDHYEGRFALED 574 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 S EA+ G V+A+ + ++ +L ++ LV + A I + + Sbjct: 575 SRRAVAEARTTGVNVFAVTVDR-EANAYLPTLFGRRNYALVAKLSKLPVALPAIYRMMTG 633 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 42.8 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 36/175 (20%), Positives = 72/175 (41%), Gaps = 22/175 (12%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + V+ +T + ++ ++LD + S +SS K+ + +++ ML++ V+ Sbjct: 1243 ISVSPETKSNTNVQLILD---TSGSMSNSSNGKLAIMKAAVSKMLDQ----YHDMGDVRV 1295 Query: 222 GLVTFSNKIE--EFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L+ F+++ EF W S + + L+ G T+ +K A Q +D Q Sbjct: 1296 QLIDFNSRSTRLEFNGRAWMTVSEAKYLVNRLTA-GGGTDYDDAVKKA-RQSWDHDEHLQ 1353 Query: 279 HCNTEDANYKKIIVFMTDGENLSTK-----EDQQSLYYCNEAKKRGAIVYAIGIR 328 N + +Y F++DG+ D + + N G +IGI Sbjct: 1354 LDNANNVSY-----FISDGKPQDGHDDATISDNEETKWANHLISNGITSQSIGIN 1403 >gi|126729349|ref|ZP_01745163.1| hypothetical protein SSE37_24154 [Sagittula stellata E-37] gi|126710339|gb|EBA09391.1| hypothetical protein SSE37_24154 [Sagittula stellata E-37] Length = 248 Score = 42.8 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 40/186 (21%), Positives = 67/186 (36%), Gaps = 23/186 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 IVLD S SM S ++ A +++ L ++ + +V + + F Sbjct: 67 AIVLDDSGSMGS-------DMEAAKQAVVDALSAMQDTD------RVAVVALNAGVVLPF 113 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 V+ +R + T STP L A M ++ ++ Sbjct: 114 A---SVADARRTLPAALAPIRDTGSTP-LTRAILDTQAMLEAEASSVRGFGTFR--MIVT 167 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 TDG + Q+++ + A K + IGI IR + LR A SF V N ++ Sbjct: 168 TDGAADDGEALQRAIE--DLAAKTPIQLTTIGIG-IRGNHVLRR-ADLGSFVDVANVAAL 223 Query: 355 YDAFSH 360 A Sbjct: 224 EGALQA 229 >gi|48477361|ref|YP_023067.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] gi|48430009|gb|AAT42874.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] Length = 379 Score = 42.8 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 40/212 (18%), Positives = 76/212 (35%), Gaps = 32/212 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +I +DVS SM K+D+A + ++E++ +N+V L+ Sbjct: 31 KTVKASGFHYIIAIDVSNSMRK------GKLDLAKEGAMNLIEKI----PRDNIV--SLI 78 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F + + + L+ I L G + + A + + ++ Sbjct: 79 AFGDTAKVIVEGKEPTFALEA-IPSLKVAGNT-----AMYTAL--LTATKLADKYNMPGR 130 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ +TDG ++ Y N G + IGI + L+ A + Sbjct: 131 ------IILLTDGMPTDVSMNES---YENLQVPEGFTIDCIGIGDNYRDDLLKLLADKGN 181 Query: 345 --FYLVENPHSMYDAFSH-IGKDIVTKRIWYD 373 FY +ENP + + DI K + D Sbjct: 182 SIFYHLENPEELPKVMESTVSSDISAKNVQVD 213 >gi|2982053|pdb|1ATZ|A Chain A, Human Von Willebrand Factor A3 Domain gi|2982054|pdb|1ATZ|B Chain B, Human Von Willebrand Factor A3 Domain Length = 189 Score = 42.8 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 61/177 (34%), Gaps = 26/177 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++++LD S S F S ++ K+ + + + P + Q ++ + + Sbjct: 4 QPLDVILLLDGSSS---FPASYFDEMKSFAKA---FISKANIGP---RLTQVSVLQYGSI 54 Query: 230 IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W V +HL + + + G + L +A + H A Sbjct: 55 TTIDVP--WNVVPEKAHLLSLVDVMQREGGPSQIGDALGFAVRYLTS----EMHGARPGA 108 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + K +++ +T + A+ V+ IGI LR A P Sbjct: 109 S-KAVVILVT------DVSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 158 >gi|315045021|ref|XP_003171886.1| von Willebrand RING finger domain-containing protein [Arthroderma gypseum CBS 118893] gi|311344229|gb|EFR03432.1| von Willebrand RING finger domain-containing protein [Arthroderma gypseum CBS 118893] Length = 1035 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 84/267 (31%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N + + W + N +++ ++ Y I Sbjct: 410 SVNELPRFHLRFPTLHQLDIWKRAL------GDLQNGGVATRSVHEVEVPQSEED-DYRI 462 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK---VNSQTDARLDMMIVLDVSRSMES 186 S + + + +S + TS + S LD+++V+ VS SM+ Sbjct: 463 SKTPSKRESSGYSSSYGAGKSSNTAMTDYTSPGREPMPISTFHVPLDIVVVIPVSSSMQG 522 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + ++ +++ + + GLVTF + + W G Sbjct: 523 L------KISLLRDTLRFLVQSLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 570 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 571 GKIFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 621 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 622 DPESGDFVVSRAEAAK-VGIHTFGLGL 647 >gi|193734230|gb|ACF19883.1| PilC1 [Kingella kingae] Length = 1362 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 60/145 (41%), Gaps = 17/145 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++ V A+ ++M++LD S SM + S T+ + S++ ++ + N Sbjct: 71 TNTTVRGMQGAKPNIMLLLDDSGSMGAQVPGSGRTRQQILQSSLSQVVAK------YGNQ 124 Query: 219 VQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + GL+ F+++ F L S + IK G++ T +K A + + R Sbjct: 125 INWGLIAFNDQSSAFNLPLGTNYSTVVNSIKRFPANGLTPTITSYIK-AVDTLNKGIQYR 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLST 302 K +V ++DG++ Sbjct: 184 CQ--------KSYLVMLSDGDSNYP 200 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 28/50 (56%), Gaps = 1/50 (2%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 1 RGSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 49 >gi|159897014|ref|YP_001543261.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890053|gb|ABX03133.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 222 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 69/171 (40%), Gaps = 14/171 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++V+D S SM+ ID + + +++ ++ LV F+++ + Sbjct: 18 ILVVDTSGSMQGRP------IDELNQGLQVFHQDISNSFSTAQRLEICLVEFNSQADCIV 71 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + + + +T G++ A +++ + + +T Y+ I+ M Sbjct: 72 EP----SLVDQFHMPILAVAGTTKLVDGVRLAIHKV--QERKSWYRSTGQPYYRPWIILM 125 Query: 295 TDGENLSTKEDQQ-SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 TDGE S ++ + + + + + IG++ L+ ++P+ Sbjct: 126 TDGEPDSDQDVAGLAREIQHGVNNKQFVFFPIGVQGAD-MRMLQQISTPDR 175 >gi|326911070|ref|XP_003201885.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Meleagris gallopavo] Length = 948 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 67/178 (37%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++ V+DVS SM K+ I+++ A+L E++ Q L+ F++ Sbjct: 313 ILFVIDVSGSMWGL------KMKQTIEAMKAILSELRAAD------QFSLIDFNHNVRCW 360 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V ++ I+ + G TN L A I + + + Sbjct: 361 RDNLVSATPAQVEDAKKYIQTIHPNG-GTNINEALLRA-TFILNEAQNLGMLDPNSVSM- 417 Query: 289 KIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 IV ++DG+ + ++ K ++ +GI ++FL+ A+ N Sbjct: 418 --IVLVSDGDPTVGELKLTTIQKNVKQSIKDEF-SLFCLGIGFDVDYDFLQRIATDNR 472 >gi|256374467|ref|YP_003098127.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918770|gb|ACU34281.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 564 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 63/212 (29%), Gaps = 26/212 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M+ VLDVS SM + T++ A ++ L + PD + GL FS + Sbjct: 360 MLAVLDVSGSMTELMGNGQTRMAAASEAALTALGML---PDTSE---IGLWAFST--NKR 411 Query: 234 FLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +W + + ++ L + D + Sbjct: 412 PPNDWVELVPLGPLGEVLGSAPRRTRLQQGAKGLAALVGGGTALNDTTLAAFRRMQSTYD 471 Query: 287 YKKI--IVFMTDGENLSTKEDQQSLYY----CNEAKKRGAIVYAIGIRVIRSHEFLR--A 338 +KI +V +TDG N + R + +G+ E L+ + Sbjct: 472 PEKINSVVLITDGRNDDYASITTAQLLQALESESDPARPIPLIMVGLGQEADMEALQEIS 531 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 A+ Y + I +R Sbjct: 532 SATGGKAYQALEAADIRSVLL---DAISQRRC 560 >gi|224004082|ref|XP_002295692.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209585724|gb|ACI64409.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 1007 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 35/265 (13%), Positives = 77/265 (29%), Gaps = 57/265 (21%) Query: 143 TFIPWYTNSRHIV-MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS----------- 190 T +P + P++S S + +D+S S+ + Sbjct: 740 TNMPGVVQEVTLTESPVSSEACPLSGD---FSLCFAIDMSGSVCNNGRGECLGCSPLDTC 796 Query: 191 ---------SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 + ++ + + ++ +V F+ K+ L Sbjct: 797 NSVGVKKTMCCSNFLSLMEFTKDVTTSFNEVATDHDF---SVVHFATKVNTAIGLVSSRQ 853 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL- 300 L+ + L G TN G+ G + N +I+ +TDG+ Sbjct: 854 ALK-VLNQLIYTGGMTNLASGIASGQETFSSTSGTSKRKN--------MILLITDGKLFI 904 Query: 301 ---------------STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE----FLRACAS 341 S + + AK +G ++ + + + F++ +S Sbjct: 905 VYNGKRCSMYQPSLPSYNPEGNAHEAATNAKSKGTVIVPVLMEAASDNAQSLMFMQKISS 964 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIV 366 + Y V + + D I +I Sbjct: 965 NGNVY-VSDFKGLNDLKDTILSEIS 988 >gi|163751746|ref|ZP_02158964.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] gi|161328398|gb|EDP99557.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] Length = 447 Score = 42.8 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 60/157 (38%), Gaps = 23/157 (14%) Query: 1 MFSLNLNIRNF----FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDR 56 MF ++ + F G + ++ I L + V + ++ H+ K L + +D Sbjct: 6 MFPSQVSFKQFSLAKLGKQGGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDS 65 Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 S ++AA + N G+ + ++ ++ + +N +N TS+ Sbjct: 66 SALYAAKILQNGGSLFDAREAATLILMQNL----------GFEENSELN--------TSV 107 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH 153 D+ N ++A + L FIP + +S Sbjct: 108 DLSSPDYNAT-QVTANIFIEFSLWPDPFIPVFDDSSK 143 >gi|239813041|ref|YP_002941951.1| Vault protein inter-alpha-trypsin domain protein [Variovorax paradoxus S110] gi|239799618|gb|ACS16685.1| Vault protein inter-alpha-trypsin domain protein [Variovorax paradoxus S110] Length = 691 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 55/136 (40%), Gaps = 24/136 (17%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V + + D + V+D+S SM F +D A + ++ ++ N ++ SG Sbjct: 315 VAASAISPRDYIFVVDISGSMHGFP------LDTAKTVLERLIGGLRPSDTFNVLLFSG- 367 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 SNK+ + +++++ + + +G ST P LK Y Sbjct: 368 ---SNKMLSPRSVPATRANIEQALATIQNYGGSGSTELIPALKRVY------------AE 412 Query: 282 TEDANYKKIIVFMTDG 297 ++ + +V +TDG Sbjct: 413 PKEEKVSRTVVLVTDG 428 >gi|170743237|ref|YP_001771892.1| cell wall anchor domain-containing protein [Methylobacterium sp. 4-46] gi|168197511|gb|ACA19458.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium sp. 4-46] Length = 761 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 65/190 (34%), Gaps = 32/190 (16%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + + + + ++ V+D S SM + A S+ L+ + Sbjct: 357 LAVVTPPEGRAPARRPREVTFVIDNSGSMAG------ASMRQAKASLLVALDRLGPAD-- 408 Query: 216 NNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + ++ F + ++ F E +R + L G T P L+ A Sbjct: 409 ----RFNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEARG-GTEMLPPLRAAL--- 460 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 H D + IVF+TDG +++ ++ A + + ++ IGI Sbjct: 461 -----ADPHPEEGDRVRQ--IVFLTDG----AIGNEEQIFSAISAGRGRSRLFMIGIGSA 509 Query: 331 RSHEFLRACA 340 + + A Sbjct: 510 PNGHLMTHAA 519 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 78/293 (26%), Gaps = 77/293 (26%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 PL Y + V + + ++++VLDVS SM I Sbjct: 104 PLLPHFNYERYDQNVTATGGGFKGVVESKHSAIPTELVLVLDVSGSMSPN-------IQS 156 Query: 198 AIKSINAMLEEVKLIPDVNNV---VQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSK 252 ++ L ++ + N V +V F + + W I LS Sbjct: 157 LKSILSNALNTIQSQSNNANDLDSVSISIVPFDSGVATHRPP-WLSEETAGIYCIDGLSY 215 Query: 253 FGVS----------------------------------------TNSTPGLKYAYNQIFD 272 TN ++ + N + Sbjct: 216 RNGDFSASLTVDNLATLHSERPVKFTPPSKWLSDCNQESPMLPLTNVFSRVQNSINSLTA 275 Query: 273 MQGMRQH--------------------CNTEDANYKKIIVFMTDGENLSTKEDQQS-LYY 311 G R + + ++ +V TDG + +Q + Sbjct: 276 NGGTRSYQGLVWGVRQLIPSWQQAWGMKVSSVPETRRKLVLFTDGADEGDAFNQLVNAGF 335 Query: 312 CNEA-KKRGAIVYAIGIRV-IRSHEFLRACA-SPNSFYLVENPHSMYDAFSHI 361 C A K+ G + IG V CA +P + N + + FS I Sbjct: 336 CTTAIKQYGIEMNFIGYGVSPSRITQFENCAGNPLRVFSATNTTQLNEYFSDI 388 >gi|126459566|ref|YP_001055844.1| von Willebrand factor, type A [Pyrobaculum calidifontis JCM 11548] gi|126249287|gb|ABO08378.1| von Willebrand factor, type A [Pyrobaculum calidifontis JCM 11548] Length = 356 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 21/128 (16%), Positives = 46/128 (35%), Gaps = 21/128 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I LDVS SM+ + + K+ +A ++ L ++ + + L+ F+ ++ Sbjct: 200 VYIALDVSGSMKEYMWGDV-KLRVAKNAVARYLRQMASLRG-----RVSLLLFN--VDAD 251 Query: 234 FLLEWGVSHLQRKIKYL------SKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNT 282 F+ W + + ++ + G T L+ Y + G Sbjct: 252 FM--WTPYEVHKYLREMLEILEYVYAGGGTELASALEVLYSYGVREAVLITDGRTADVEK 309 Query: 283 EDANYKKI 290 + KK Sbjct: 310 TWSLVKKF 317 >gi|387437|gb|AAA39549.1| MHC factor B [Mus musculus] Length = 477 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 57/189 (30%), Gaps = 25/189 (13%) Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKI--------EEFFLLEWGVSHLQRKIKYLSKFGVS 256 + ++ + + GL+T++ E +W L + K Sbjct: 6 LTNLIEKVASYGVRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSG 65 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY---- 311 TN+ L+ Y+ + G + +I+ MTDG N+ Sbjct: 66 TNTKRALQAVYS-MMSWAGDAPPEGWNRTRH--VIIIMTDGLHNMGGNPVTVIQDIRALL 122 Query: 312 -----CNEAKKRGAIVYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 ++ VY G+ + + + + + V++ + + F + Sbjct: 123 DIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMI 182 Query: 363 KDIVTKRIW 371 + + + Sbjct: 183 DETKSLSLC 191 >gi|118349478|ref|XP_001008020.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289787|gb|EAR87775.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 642 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 80/202 (39%), Gaps = 29/202 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +D++ V++ S SM KI ++ +LE + + LV Sbjct: 193 NSRPSIDLVCVINNSESMHG------EKILNVKNTLLYLLEMLNSND------RLSLVLS 240 Query: 227 SNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +N F L++ L+R I +S +TN T + A+N + RQ N Sbjct: 241 NNNPTTLFDLKYLDEKNKQDLKRIINNIS-ITQNTNITKSMIKAFNIL----QFRQSQNK 295 Query: 283 EDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 + I ++DG ++ + K+ Q + + + +++ G + E + S Sbjct: 296 VSS-----IFLLSDGVDSSAEKQIQNYISSQQSLQNKNFAIHSFGYGFDQDAEMINKICS 350 Query: 342 --PNSFYLVENPHSMYDAFSHI 361 +FY ++N + + F+ + Sbjct: 351 LKNGNFYYIQNMNQVDQYFADV 372 >gi|327266004|ref|XP_003217797.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Anolis carolinensis] Length = 1098 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 70/192 (36%), Gaps = 32/192 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A ++++++L+ + N ++ ++ Sbjct: 258 ATSPKDVVILVDVSGSMKGL------RLTIAKQTVSSILDTLGDDDFFN------IIAYN 305 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ G H + + L G+ L A+N + D Q Sbjct: 306 EELHYVEPCLNGTLVQADRANKEHFREHLDKLFAKGIG-MLDIALVEAFNMLSDFNHTGQ 364 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH-EFLR 337 A I+ +TDG + + + + R ++ I + + L+ Sbjct: 365 GSICSQA-----IMLVTDGAVDTYDAVFEKYNWPD----RKVRIFTYLIGREAAFADNLK 415 Query: 338 ACASPNSFYLVE 349 A N + + Sbjct: 416 WMACANKGFFTQ 427 >gi|313902369|ref|ZP_07835772.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313467300|gb|EFR62811.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 895 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 49/133 (36%), Gaps = 27/133 (20%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+ +VLD S SM +I A +L V+ + ++TF ++ Sbjct: 711 PVDVCLVLDASASMAGN------RIRAAKDLAQQLL--------VSTRDRVAVITFQERV 756 Query: 231 EEFF-LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L S ++R + + +G+ T GL+ A + + Sbjct: 757 VQVQVPLTRNTSRVERGLSQIQPYGL-TPLAQGLEVALLYLAQSRARNP----------- 804 Query: 290 IIVFMTDGENLST 302 ++V +TDG Sbjct: 805 LLVLVTDGIPTVP 817 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 24/57 (42%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 N G + I+ + + L +G I+ + + L S +D +++ AA + Sbjct: 8 FSRLIDNRDGAVAIIVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLAAAAK 64 >gi|120603653|ref|YP_968053.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120563882|gb|ABM29626.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 533 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 58/184 (31%), Gaps = 36/184 (19%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 I+LD S SM + T ++ A V GL F Sbjct: 365 ILLDSSGSMSGCMTLASTACHATASALAACG------------VNVGLTAFPGHYLTAQA 412 Query: 236 LEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 VS L R + + G T L + + + R K+ Sbjct: 413 NWASVSPLIRHGQRVHPKVNVTSGGGTPLAESLWWTMQTMLPLPESR-----------KL 461 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I+ +TDG+ S + +++L A G VY IGI L P + ++ + Sbjct: 462 ILIITDGDPDSGVQAEEALVGAARA---GFEVYGIGIISTAILSLL-----PGNSLVISS 513 Query: 351 PHSM 354 + Sbjct: 514 MGEL 517 >gi|46580450|ref|YP_011258.1| hypothetical protein DVU2043 [Desulfovibrio vulgaris str. Hildenborough] gi|46449869|gb|AAS96518.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234196|gb|ADP87050.1| von Willebrand factor type A [Desulfovibrio vulgaris RCH1] Length = 533 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 36/184 (19%), Positives = 58/184 (31%), Gaps = 36/184 (19%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 I+LD S SM + T ++ A V GL F Sbjct: 365 ILLDSSGSMSGCMTLASTACHATASALAACG------------VNVGLTAFPGHYLTAQA 412 Query: 236 LEWGVSHLQRKIKYLSK-----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 VS L R + + G T L + + + R K+ Sbjct: 413 NWASVSPLIRHGQRVHPKVNVTSGGGTPLAESLWWTMQTMLPLPESR-----------KL 461 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I+ +TDG+ S + +++L A G VY IGI L P + ++ + Sbjct: 462 ILIITDGDPDSGVQAEEALVGAARA---GFEVYGIGIISTAILSLL-----PGNSLVISS 513 Query: 351 PHSM 354 + Sbjct: 514 MGEL 517 >gi|255940832|ref|XP_002561185.1| Pc16g08660 [Penicillium chrysogenum Wisconsin 54-1255] gi|211585808|emb|CAP93536.1| Pc16g08660 [Penicillium chrysogenum Wisconsin 54-1255] Length = 944 Score = 42.8 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 66/176 (37%), Gaps = 29/176 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF---SNKI 230 ++ V+D S SMES KI ++ L+ + + N +F ++++ Sbjct: 290 VVFVIDRSGSMES-------KIPTLKSALQVFLKSLPVGICFNICSFGSYYSFMWPTSQV 342 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L + + + T +K A + ++ N ED + Sbjct: 343 YDASSL----NQALAFVDTVYADMGGTE----MKQAV-----VATVQNRLNFEDLD---- 385 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 ++ +TDG+ +DQ + +A A +++GI SH + A + + Sbjct: 386 VLILTDGQIFD--QDQLFNFVREKAADNTARFFSLGIGEAASHSLIEGIARAGNGF 439 >gi|213963762|ref|ZP_03392012.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] gi|213953642|gb|EEB64974.1| conserved hypothetical protein [Capnocytophaga sputigena Capno] Length = 288 Score = 42.8 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 35/161 (21%), Positives = 62/161 (38%), Gaps = 28/161 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S + I I A L L + +GL+ F Sbjct: 72 EEERELTLMLMIDVSGSELFGTEQQFKS--ELITEIAATLAFSALQNNDK----TGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE + + G SH+ R I+ L + TN + L++ Sbjct: 126 SDQIELYIPPKKGKSHVLRIIRELIEFQPKSLKTNISEALQF----------------LS 169 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 + KK IVFM + K ++ + + G +Y Sbjct: 170 RVSKKKAIVFML--SDFMDKGYEKPIQIAAKKHDITGIRIY 208 >gi|282854078|ref|ZP_06263415.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583531|gb|EFB88911.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|314981158|gb|EFT25252.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091981|gb|EFT63957.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 322 Score = 42.8 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 57/183 (31%), Gaps = 19/183 (10%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + ++ + A + ++ +D S SM++ D S T++ A + + Sbjct: 75 TTTVAWARPMGIEQVPRNHATI--VVAIDSSLSMKAD-DVSPTRLAAAKAKAKDFINSLP 131 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 +V +++ S E + R + + + T + + + Sbjct: 132 TGFNVA------VMSISEHPEIRMPPSTDRPTVLRAVDGI-ELQDGTALGGAIDKSLEAV 184 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 G ++ IV ++DG N + AK VY I Sbjct: 185 KMAPGGSKNPAPAA------IVMLSDGNNTQGGSPLVAANRAAAAK---VSVYTIAFGTE 235 Query: 331 RSH 333 + Sbjct: 236 TGY 238 >gi|121582838|ref|YP_973280.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596100|gb|ABM39538.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 240 Score = 42.8 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 14/159 (8%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + + P + +K + A L ++++ DVS SM KID ++ Sbjct: 1 MKSAPSDSAKTTPAATRLKAFAAPQARPLPVIVLADVSGSMSENG-----KIDALNVALK 55 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 M+ + +Q GL+TF + E L + + ++ G T Sbjct: 56 EMILSFGKESGLRAEIQVGLITFGGR-EAHEHLPLVAAKVIGGVEAFKANG-GTPMGSAF 113 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 A + D + Y+ +++ ++DG Sbjct: 114 ALARKLLEDKEQ------IPSRAYRPVLILVSDGAPTDA 146 >gi|83749631|ref|ZP_00946614.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|207728099|ref|YP_002256493.1| hypothetical protein RSMK04469 [Ralstonia solanacearum MolK2] gi|207744165|ref|YP_002260557.1| hypothetical protein RSIPO_02350 [Ralstonia solanacearum IPO1609] gi|83723692|gb|EAP70887.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|206591344|emb|CAQ56956.1| conserved hypothetical protein [Ralstonia solanacearum MolK2] gi|206595569|emb|CAQ62496.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609] Length = 345 Score = 42.8 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 64/183 (34%), Gaps = 10/183 (5%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M ++ L IR +G + +L+ I L I + M ++++H+F ++ L + D + + Sbjct: 1 MTAMTLRIRP--RRQRGAVGVLSPILLIIFLAIGAMAVDIAHLFVVRNELQNAADAAALA 58 Query: 61 AATQIM-NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 A + N + G ++ + N G+ N S I Sbjct: 59 GAAGLYPANPKPNWSNGVAQGTSAVKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSIT 118 Query: 120 VVPQN-EGYSISAI---SRYKIPLKFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDM 174 + G ++ P+K ++ W N + T+ +V + + Sbjct: 119 PGSNDVPGVQVTVTRSPGNNGGPVK--GWLTWVFNGGAASIQATAVAVIAAPGSANPGSL 176 Query: 175 MIV 177 V Sbjct: 177 FPV 179 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 42.8 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 46/139 (33%), Gaps = 36/139 (25%) Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVN----NVVQSGLV---TFSNKIEEFFL------- 235 S TK++ K+ N ++++K +PD+ N + ++ + Sbjct: 159 SKSTKMEELKKAANNFIDKMKDVPDLKICIVNYSSEATINPCGYNGDKNSASVEEDRHHT 218 Query: 236 -----------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L + L I L G TN+ GL+ A D Sbjct: 219 IPNYKSLGTKFLNSNDNTLHSMINGLKALG-GTNTGEGLRKA----------EYMLEQGD 267 Query: 285 ANYKKIIVFMTDGENLSTK 303 + KK IVFM+DG Sbjct: 268 KDAKKTIVFMSDGLPTYYS 286 >gi|260826343|ref|XP_002608125.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] gi|229293475|gb|EEN64135.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] Length = 1803 Score = 42.8 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 60/168 (35%), Gaps = 31/168 (18%) Query: 170 ARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ LD S S+ S +D I + A+L + P + +V+F Sbjct: 43 GAVDIIFALDRSGSVGRSNYDK-------IIDFVKAVLNHFSVSPT---TTRVAVVSFGT 92 Query: 229 KIEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 F L S+ L+ + LS G +TN+ + N Sbjct: 93 SARVEFDLLRSSSNDNNKCELLRTHLPKLSYTGGATNTVG---------ALRLALALLKN 143 Query: 282 TEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +Y +VF +TDG Y E + RG I++A GI Sbjct: 144 PGVRSYSTKVVFTITDG---YWNRGGDPAYVVRELQSRGVIMFAFGIG 188 >gi|310830437|ref|YP_003965538.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] gi|309249904|gb|ADO59470.1| ABC-type cobalt transport system, permease component CbiQ and transporter [Paenibacillus polymyxa SC2] Length = 338 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 66/171 (38%), Gaps = 10/171 (5%) Query: 13 YNYKG--GMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + KG + ++ A+ + F + IE+++++ MK + D + A Q+ + Sbjct: 9 RDEKGFVSVFVIFAVSFILPF-FIFHTIEMTYLYGMKDKFQNFNDAAASAGAMQLEKKVV 67 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 K ++ +K + ++ + N N V + + IV + + Sbjct: 68 SKGDLKFHEDEVKNVVKRILSENYGLDAHLNPQENSYITDVPNVKVSIVNQESDLPKELL 127 Query: 131 AISRYKIPLKFCTFI-------PWYTNSRHIVMPITSSVKVNSQTDARLDM 174 +K +K T I ++ I + S+++V+ +T R ++ Sbjct: 128 TQEGFKFEIKHPTIIVYTEVKPKGIFFNKFITIKSISALEVSFKTGGRTEL 178 >gi|291452540|ref|ZP_06591930.1| predicted protein [Streptomyces albus J1074] gi|291355489|gb|EFE82391.1| predicted protein [Streptomyces albus J1074] Length = 591 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 26/180 (14%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM ++ D S + + ++ L+ P VQ +V FS I+ Sbjct: 404 VYLVLDRSGSMRGYYRDGSAQCLGEQVLALATHLD-----PAGAPAVQ--VVFFSTDIDG 456 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + +I L A ++ + A +++ Sbjct: 457 KGELT--PQAYEGRIDALHA-ECGRMGRTSYHRAIEEVTALHEAS------GATEPALVI 507 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSHEF--LRACASPNSFYLV 348 F TDG + Q+L GA ++ +G F LR +P++ Y Sbjct: 508 FQTDGPPDAKTAATQALKAV-----SGAPLFWQFVGFGESEHKNFEYLRKLNTPHTGYFP 562 >gi|240137956|ref|YP_002962428.1| hypothetical protein MexAM1_META1p1287 [Methylobacterium extorquens AM1] gi|240007925|gb|ACS39151.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 735 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 64/202 (31%), Gaps = 32/202 (15%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F + ++ +T + D++ V+D S SM + A S+ Sbjct: 320 FRERVAGAEAVLAVVTPPETASPAASVPRDVVFVIDNSGSMGG------ASMRQAKASLL 373 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTN 258 L+ + + ++ F + + F + + + L G + Sbjct: 374 IGLDRLGAGD------RFNVIRFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEM 427 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 P L+ A E+ + +VF+TDG + + S + R Sbjct: 428 LAP-LQAALRDAT----------PEETGRLRQVVFLTDGA-IGNEAQIFSAIATERGRSR 475 Query: 319 GAIVYAIGIRVIRSHEFLRACA 340 ++ +GI + +R A Sbjct: 476 ---LFMVGIGSAPNGYLMRHAA 494 >gi|239980680|ref|ZP_04703204.1| hypothetical protein SalbJ_14645 [Streptomyces albus J1074] Length = 530 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 62/180 (34%), Gaps = 26/180 (14%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM ++ D S + + ++ L+ P VQ +V FS I+ Sbjct: 343 VYLVLDRSGSMRGYYRDGSAQCLGEQVLALATHLD-----PAGAPAVQ--VVFFSTDIDG 395 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + +I L A ++ + A +++ Sbjct: 396 KGELT--PQAYEGRIDALHA-ECGRMGRTSYHRAIEEVTALHEAS------GATEPALVI 446 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVY--AIGIRVIRSHEF--LRACASPNSFYLV 348 F TDG + Q+L GA ++ +G F LR +P++ Y Sbjct: 447 FQTDGPPDAKTAATQALKAV-----SGAPLFWQFVGFGESEHKNFEYLRKLNTPHTGYFP 501 >gi|156742635|ref|YP_001432764.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233963|gb|ABU58746.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 777 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 19/130 (14%), Positives = 44/130 (33%), Gaps = 16/130 (12%) Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + Q I L+ G T+ GL+ + + + + IV ++DG+ Sbjct: 373 ARAQAAIDTLNSRGA-TSIGGGLQRSQHLLTSAN----------PAIPRAIVLLSDGQEN 421 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACA-SPNSFYLVENPHSMYDAF 358 ++ L + V+ IG+ L A + ++ P + + Sbjct: 422 TSPYVADVLPP---IRDAQTTVHTIGVGQDADQRLMLSIAAQTGGTYNYAPTPDQLARIY 478 Query: 359 SHIGKDIVTK 368 + I ++ + Sbjct: 479 NTISGNVSNR 488 >gi|84494861|ref|ZP_00993980.1| hypothetical protein JNB_08684 [Janibacter sp. HTCC2649] gi|84384354|gb|EAQ00234.1| hypothetical protein JNB_08684 [Janibacter sp. HTCC2649] Length = 562 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 80/238 (33%), Gaps = 17/238 (7%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 I +P++ + V + + V+DVS S D+S T + ++ + + V Sbjct: 35 GDTTIDVPLSGTSSVATGA-PSTSWIYVVDVSGSTSLDCDASRTILTCEKVAVKGLNDLV 93 Query: 210 KLIPD--VNNVVQSGLVTFSNKIEEFFLLEWGVS----HLQRKIKYLSKFGVSTNSTPGL 263 + V G + ++ + S + + LS + + + Sbjct: 94 VTDGSALESAVAAFGEGADTADMQGTAGDQKFTSPSGTDVNTVVDSLSAGSIGQFTGKNV 153 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 + + + + +VF++DGE+ + + + GA +Y Sbjct: 154 GADFTNYAAGLQAASALVSAASGTQVNVVFLSDGESNAGAIGDFTTAL--TSVSAGATIY 211 Query: 324 AIGIRVIRSHEFL-------RACASPNSFYLVENPHSMYDAFSHI-GKDIVTKRIWYD 373 + S A AS + + V +P + D +++ G + T + D Sbjct: 212 PFAVGSGSSCGGGTAGTLNDMATASGTTCFAVPDPADLPDVITNVTGTTLKTLSVTLD 269 >gi|46143335|ref|ZP_00135441.2| COG4961: Flp pilus assembly protein TadG [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 520 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 78/242 (32%), Gaps = 27/242 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVND--------IDDIVRSTSL 116 N+ K K D R+ FV + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDNQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S ++ ++ P + + + + S + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +D+M+V D+S SM D + +KI + + + + + D NN + + Sbjct: 187 IDLMVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAVA 246 Query: 225 TF 226 F Sbjct: 247 PF 248 >gi|298527988|ref|ZP_07015392.1| von Willebrand factor type A [Desulfonatronospira thiodismutans ASO3-1] gi|298511640|gb|EFI35542.1| von Willebrand factor type A [Desulfonatronospira thiodismutans ASO3-1] Length = 743 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 88/259 (33%), Gaps = 44/259 (16%) Query: 139 LKFCTFIPWYTNSRHIV-MPITSSVKVNSQTDAR-------LDMMIVLDVSRSMESFFDS 190 L+ +P + + +S ++ A +D++ +D +RSM+ F Sbjct: 281 LQLAAAVPEQRGADTLDDEEYSSQARIQRGDQAGAAMEDLNVDIVFTIDTTRSMQPF--- 337 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL------- 243 IDM +++ M+++ D + GLV F + I+ LE+ V + Sbjct: 338 ----IDMTREAVVKMVQDF--TDDTAARFRFGLVGFRDCIDTVPWLEYDVLNFTPELVDG 391 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 R I L T G YA + + N + +VF+TD + Sbjct: 392 DRLIHLLEHELEETEVGSG-DYAEEGFAGVDEALRSDW--RDNSLRFMVFITDASSNPKG 448 Query: 304 EDQQSL-----YYCNEAKKRGAIVYAIGIRVIRSHE-----------FLRACASPNSFYL 347 + Q + EA + AI + R+ E + S S Y Sbjct: 449 DPQNTTGKDEHDLRREADDAQVHILAIHLLDPRATEDHPIATPQYMKLSQVRGSSESAYT 508 Query: 348 VENPHSMYDAFSHIGKDIV 366 + + + + DIV Sbjct: 509 SVDAFE-QEEYQALVDDIV 526 >gi|224092755|ref|XP_002190101.1| PREDICTED: inter-alpha (globulin) inhibitor H2 [Taeniopygia guttata] Length = 948 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 67/178 (37%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++ V+DVS SM K+ I+++ A+L E++ Q L+ F++ Sbjct: 313 ILFVIDVSGSMWGL------KMKQTIEAMKAILSELRAAD------QFSLIDFNHNVRCW 360 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V ++ I+ + G TN L A I + + + Sbjct: 361 RDNLVSATPSQVEDAKKYIQTIHPNG-GTNINEALLRA-TFILNEAKSLGMLDPNSVSM- 417 Query: 289 KIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 IV ++DG+ + ++ K ++ +GI ++FL+ A+ N Sbjct: 418 --IVLVSDGDPTVGELKLTTIQKNVKQSIKDEF-SLFCLGIGFDVDYDFLQRIATDNR 472 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 9/84 (10%), Positives = 27/84 (32%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 + G + + + ++ + G I+ + + ++ L + D ++ A + Sbjct: 7 RAVTDRSGSVALTFGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGNYLKLAV 66 Query: 70 NGNNRKKLKGGDILCRIKNTWNMS 93 + K D + S Sbjct: 67 ATSAAAKSIVVDTVHAQAAPRPES 90 >gi|302342832|ref|YP_003807361.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] gi|301639445|gb|ADK84767.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] Length = 680 Score = 42.8 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 42/129 (32%), Gaps = 17/129 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M +D S SM + ++ ++ +++ +L E + GLV F Sbjct: 473 IMFCVDASGSMNA-----AARMRVSKQAVLGLLTEA-----YQKRDRVGLVAFGGNAARL 522 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L G + RK+ G T GL + R+ ++V Sbjct: 523 LLPPTGSVEVARKLLAELPTGGKTPLAAGL-----AVTAQAVSRELARDPKLTP--LVVV 575 Query: 294 MTDGENLST 302 TDG Sbjct: 576 FTDGRPNVP 584 >gi|227830111|ref|YP_002831890.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|227456558|gb|ACP35245.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] Length = 356 Score = 42.8 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 71/207 (34%), Gaps = 34/207 (16%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V + S + + +I++D S SM K++ A++S +L + Sbjct: 22 VGFVIYITPQQSSVTSSIHYIIMIDNSPSMRG------EKLNTAVQSAQKLL------YN 69 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +N L+ FSN E + + I + G +T + + N Sbjct: 70 LNEGNYVTLILFSNHPEIKYQ-----GPAKGIITFDVGKGYTTRLHEAVSFTIN------ 118 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K I+ +TDG+ + + + + + IGI + Sbjct: 119 -----LAKQSQVPTK-IIMLTDGKPTDKR-NVKDYEKLDIPPNT--QIITIGIGNDYNER 169 Query: 335 FLRACA--SPNSFYLVENPHSMYDAFS 359 L+ A S FY +++ + + F Sbjct: 170 ILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|291520528|emb|CBK75749.1| Gram positive anchor./von Willebrand factor type A domain [Butyrivibrio fibrisolvens 16/4] Length = 605 Score = 42.8 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 49/113 (43%), Gaps = 3/113 (2%) Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 T+I + +++ M++ + DV+ + ++ F+ + S++++ + Sbjct: 6 GDKTRIQLLKSAVDNMIDNIAEKEDVDAKWE--VIDFATRAAVRGGGWLNTSNVKQYVTT 63 Query: 250 LSKFGVSTNSTPGLKY-AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + + G Y A + + ++ ++ N KKI++F+TDG+ Sbjct: 64 AINEDNNVDIGRGTNYQAGMDLAQKEFEKKQPESDRPNAKKIVLFLTDGQPTY 116 >gi|196233185|ref|ZP_03132032.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] gi|196222829|gb|EDY17352.1| hypothetical protein CfE428DRAFT_5199 [Chthoniobacter flavus Ellin428] Length = 252 Score = 42.8 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 50/143 (34%), Gaps = 17/143 (11%) Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGVSHLQR 245 D++ + K+I + +V + + Q G+ TFS I+ + S + Sbjct: 83 NKDTNEKAREKLWKTIRDTMSKVSAVAETAPY-QVGVATFSAGNIDLMAPMGSKKSIVDS 141 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKE 304 I+ ++ GVS G K A R K IV ++DG N Sbjct: 142 AIEKVTFNGVSPELYLGAKRAIEWFSGTPADR-----------KYIVLISDGISNDKVVS 190 Query: 305 DQQSLYYCNEAKKRGAIVYAIGI 327 Q + +AK + IG Sbjct: 191 QQDVVQAALKAK---VHICTIGF 210 >gi|319442698|ref|ZP_07991854.1| hypothetical protein CvarD4_13132 [Corynebacterium variabile DSM 44702] Length = 916 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 67/220 (30%), Gaps = 41/220 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAI-KSINAMLEEVKL-IPDVNNVVQSGLVTFSNKI 230 D++++LD S S++ F T D + +++++ D+ + L F Sbjct: 72 DILLILDQSASLKGFDGKPPTDPDNIRVDATRDLVKQLGTHAEDLGADINVKLAGFGEGY 131 Query: 231 EEFFLL--EW-----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +W L + + + + Y R+ Sbjct: 132 HNSTGDYGDWVSVGANSGDLDKAVDGFGRRNTD------MYTNYEDALA-GASREFAGAP 184 Query: 284 DANYK----KIIVFMTDGENLST--KEDQQSLYYCN------EAKKRGAIVYAIGIRVIR 331 + K + ++F +DG+ +Q + C + G + +G+ Sbjct: 185 TTDGKSSDCQAVLFFSDGKVTHPEKSNEQAAADVCRPNSPLVSLRNSGVRFFTVGLIPED 244 Query: 332 S----HEFLRACA--------SPNSFYLV-ENPHSMYDAF 358 E L A + +++ +P + AF Sbjct: 245 ETDSPRELLTEMAEGPCGGGEANGAYFDAGTDPAGLLSAF 284 >gi|212702918|ref|ZP_03311046.1| hypothetical protein DESPIG_00954 [Desulfovibrio piger ATCC 29098] gi|212673780|gb|EEB34263.1| hypothetical protein DESPIG_00954 [Desulfovibrio piger ATCC 29098] Length = 643 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 33/262 (12%), Positives = 81/262 (30%), Gaps = 52/262 (19%) Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSI 192 Y +P+ N + S + + + +V+D S SM+ + Sbjct: 173 FYLMPILGMNRAFEGVNLLQVASIDPGSSQSGAVAGPPKTGIALVMDTSISMKPY----- 227 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY--- 249 ID + I + + ++ + + V +V F + E+ L + + Sbjct: 228 --IDQSRDIIRQLYDRLEK-DKMTDNVGFAVVAFRSSTEKTPKLGYTSQVISDFATAKDR 284 Query: 250 --------------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 +S + +S G+ A + + ++I+ +T Sbjct: 285 KALESRLAKVEQATVSSHDFNEDSLAGIYTAIDSLNW-----------GPYSTRLILLVT 333 Query: 296 D-----GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----------HEFLRACA 340 D G++ + + + A+++G + + ++ + L + Sbjct: 334 DAGPLRGDDPYASQRLGAAEMNDLARQKGIWITTMHVKTPGGSKNHAYAEQAYRALSRLS 393 Query: 341 SPNSFYLVENPHSMYDAFSHIG 362 + Y N S +A G Sbjct: 394 GDQANYQAVNASSHKEAARQFG 415 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 57/187 (30%), Gaps = 5/187 (2%) Query: 32 LVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN 91 V I+++H KT L + +D + + A + N + + N Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVNQAETAVRTTLGGISTEPGN 89 Query: 92 MS--FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT 149 F + F +D+ V + S + + + Y A++ + Sbjct: 90 SELTFSDSNTAVTFSHDMQTFVSAASFTP-PLGEYDIYVRVAVTDISLTQYLSGLFGIDK 148 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + S+ + + M D + + + D + +++ E+ Sbjct: 149 EVSASAVAGRSAAIAYTCNLTPIAMCG--DPAGDVSDAWGYRPPGYDPNVDMDPSLVHEL 206 Query: 210 KLIPDVN 216 K+ N Sbjct: 207 KVGDQNN 213 >gi|158319804|ref|YP_001512311.1| von Willebrand factor type A [Alkaliphilus oremlandii OhILAs] gi|158140003|gb|ABW18315.1| von Willebrand factor type A [Alkaliphilus oremlandii OhILAs] Length = 645 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 70/205 (34%), Gaps = 38/205 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 N +++ + ++ + +++ V+D S SM KID K++ L Sbjct: 412 NYKNLPIFNRYLLESQTNVLPEKIEISFVIDNSGSMNE------RKIDATRKALAVTLLS 465 Query: 209 VKLIPDVNNV-------------VQSGLVTFSNKIEEFFLLEWGV------SHLQRKIKY 249 I D N VQ+ F ++ + S + R I Sbjct: 466 ---IDDFNRYLEKNIEKLKQKIEVQTETWFFGSRCYNVKEFNYRSSAEKEKSDIIRSIVR 522 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 L +T+ LK N I Q R + KI+ +TDG + + Sbjct: 523 LDGNDGATDDGACLKEISNTITPKQENRLKKGKQ----IKIVFQITDGASSFP---GMAK 575 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHE 334 E + +VY G ++ +++E Sbjct: 576 EAVEELLSKNVMVY--GFQIGKNNE 598 >gi|89096722|ref|ZP_01169614.1| tellurium resistance protein, putative [Bacillus sp. NRRL B-14911] gi|89088737|gb|EAR67846.1| tellurium resistance protein, putative [Bacillus sp. NRRL B-14911] Length = 466 Score = 42.8 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 48/158 (30%), Gaps = 11/158 (6%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S++ D + + LD+S SM + S KI + I A+ Sbjct: 225 LSKKVKVSLEKAGLQDHSAKVALCLDISGSMAGLYRSG--KIQEFAERILALGTRFDDDG 282 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 ++ + F L +++ T +K F Sbjct: 283 SIDIFL------FGQNAHHADELTIDNFDGFVKRLNQAYPLEGGTYYGKVMKEIRQHYFG 336 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 G R + ++F+TDG +E + + Sbjct: 337 SAGKRSSPEARNMPV--YVMFVTDGATFDERETRNHIQ 372 >gi|322388234|ref|ZP_08061838.1| peptidoglycan binding domain protein [Streptococcus infantis ATCC 700779] gi|321140906|gb|EFX36407.1| peptidoglycan binding domain protein [Streptococcus infantis ATCC 700779] Length = 459 Score = 42.8 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 42/275 (15%), Positives = 77/275 (28%), Gaps = 33/275 (12%) Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG--FVNDIDDIVRSTSLDIVVV 121 +E + K GD++ + + N +V V T + Sbjct: 73 AAESEKAVVKESEAKPGDLIDVSTKLSPIDVKESQEGNQKVYVETATAEVSKTEVIASKD 132 Query: 122 PQNEGYSISAISRYKIP--LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + K TF + + + IT + D++ V+D Sbjct: 133 ETIPAPATTTTEGTVTGKDEKGNTFTQYERVDKTTTVEIT-KTPPTVKKAGAADIVFVVD 191 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM I+ K++N + + GL T+S+++ L G Sbjct: 192 RSGSMGGT-------INTVRKNVNEFARNLAKDGVAA---RFGLATYSDEVYGRRL---G 238 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTE-------------DA 285 + + + +T+ L+ A +I G T Sbjct: 239 KTDEDTILTKFGETYFTTDPVE-LEKALEKIKIAHGGDSPETATPALTKIVSTYDWSKSP 297 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA 320 KK +V +TD KK G Sbjct: 298 KNKKFVVLLTDARMKEDPSIPSIDETLTTLKKAGI 332 >gi|3929909|dbj|BAA34706.1| complement factor B/C2A [Cyprinus carpio] Length = 762 Score = 42.8 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 79/226 (34%), Gaps = 26/226 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ +LD+ I +D S S+E D A +I +LE++ P N Sbjct: 246 KITLDQGGKLDIYIAVDASDSVEE------KDFDYAKTTIKLLLEKISYYPVSPNYE--- 296 Query: 223 LVTFSNKIEEFFLLEWGV------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ F+ + + + + +++ + + + Y I + + Sbjct: 297 ILMFATDVTPIIKMNQFKMQKPTLTDIFKEMDEFTYDKKGDKTGTNIAKVYTVIEESMKI 356 Query: 277 RQHCNTED-ANYKKIIVFMTDGE-NLSTKEDQQSLYY------CNEAKKRGAIVYAIGIR 328 + N + + II+ +DG N+ + + +++ +Y G+ Sbjct: 357 EELNNATVFSETQHIIILFSDGHANMGGNPKPKVEQIKRLVTKNDPKREKKLDLYVFGVG 416 Query: 329 VIRSHE---FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + + F+ +++ + + F ++ + + + Sbjct: 417 DVNQDDVNGLVSQRDQEKHFFKLQDLQEVQEMFDNMIDESTSVGLC 462 >gi|226360795|ref|YP_002778573.1| hypothetical protein ROP_13810 [Rhodococcus opacus B4] gi|226239280|dbj|BAH49628.1| hypothetical protein [Rhodococcus opacus B4] Length = 891 Score = 42.8 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 40/242 (16%), Positives = 83/242 (34%), Gaps = 63/242 (26%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIE 231 D+M+++D S S++ + + N +LE++ V + + F++ Sbjct: 45 DLMLMVDQSGSLQGSDPD-----AARVSAANYLLEQLNTFGGSAGVELNVAIAGFADDFT 99 Query: 232 EFFL---LEWGV-SHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L+ G LQ +++ G+ T+ L A + +H + +A Sbjct: 100 VHAPWTRLDNGSLPALQGEVEKFRTRTDGIDTDYWNALDGARRTLA------EHDSQSEA 153 Query: 286 NYKKIIVFMTDG--------------ENLSTKEDQQSLY--------------YCNEAKK 317 N + + + +DG + +S + DQ ++ + Sbjct: 154 NRCQAVAWFSDGKLDFTVRDAEKPYAQGISLRSDQGVQQVVAAARESICRPAGIADQLRS 213 Query: 318 RGAIVYAIGIRV----IRSHEFLR--------ACA-----SPNSFYLVENPHSMYDAFSH 360 G + +A+G+ + +R AC SP FYL +N + AF Sbjct: 214 SGIVTFAVGLAAGTAQPSDFDLMRSIATGGDGACGKTTSPSPGDFYLAQNIDDLLFAFDA 273 Query: 361 IG 362 Sbjct: 274 FS 275 >gi|162448822|ref|YP_001611189.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161159404|emb|CAN90709.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 563 Score = 42.8 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 75/204 (36%), Gaps = 32/204 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSIN---AMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++D S SM ++++A K+ L + + G+ ++S+ Sbjct: 38 VVIIDRSGSMYG------DRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVASYSDTSS 91 Query: 232 EFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L S L + L G ST+ GL+ A + + + + ++ Sbjct: 92 VTYPLTALPASGLDTAVDALVADG-STSIGAGLEEALDMLIS--------ESPTKSAREC 142 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-----SPNSF 345 ++ ++DG++ + +Y + + V++I + + A SP + Sbjct: 143 VILLSDGQHNTPPAPSD--FYADYFSRVD-EVHSIALGSGADEAMMSDIAANYGFSPGLY 199 Query: 346 YLVE-----NPHSMYDAFSHIGKD 364 + + + AF+ + D Sbjct: 200 LRADSDTAIDQLELIGAFNRMAND 223 >gi|149919074|ref|ZP_01907558.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149820004|gb|EDM79425.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 520 Score = 42.8 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 52/169 (30%), Gaps = 21/169 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--- 226 +RLD+ +V+D + SM +++ I + E + + LV + Sbjct: 281 SRLDVALVIDATGSMGD-------ELEYLKVEIRDIAEAINHHFPGVEQ-RFALVVYRDK 332 Query: 227 -SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + F + QR + S G + + Sbjct: 333 GDSYVTRSFDFTTNLDSFQRDLSEQSAGGGGDYP--------EAMDAAMAKAAKLSWSKQ 384 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + ++ + D K D +L ++ + +G VY + + Sbjct: 385 DAARVTFLVADAPPHQDKAD-DTLAAVDKLRAKGVAVYPVASSGVAGEA 432 >gi|149709409|ref|XP_001496268.1| PREDICTED: chloride channel, calcium activated, family member 2 [Equus caballus] Length = 943 Score = 42.8 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 73/212 (34%), Gaps = 37/212 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS M + ++ ++ L ++ I G+ +F++K E Sbjct: 312 VCLVLDVSSKM-----AEADRLLRLQQAAEFYLMQIVEIHTF-----VGIASFNSKGEIR 361 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ GLK + + + G Sbjct: 362 AQLHQINNDDDRKLLVSYLPATVSAEAETSICSGLKKGFEVVEKLNGKAHGS-------- 413 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 ++V +T G+ D+ + G+ ++ I + L F+ Sbjct: 414 -VMVLVTSGD------DEHVANCLLTVQSSGSTIHTIALGSSAVENLEELSHLTGGLKFF 466 Query: 347 LVE--NPHSMYDAFSHIGK---DIVTKRIWYD 373 + + N +SM DAFS I DI + I + Sbjct: 467 VPDKSNSNSMIDAFSRISSGTGDIFQQHIQLE 498 >gi|295093271|emb|CBK82362.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Coprococcus sp. ART55/1] Length = 612 Score = 42.8 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 33/298 (11%), Positives = 88/298 (29%), Gaps = 41/298 (13%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGD---------ILCRIKNTWNMSFRNELRDN 101 D + V T ++ + + + ++ + +F + Sbjct: 91 EEACDVATVAGDTAMVTDTSNSMYTEIAYDTREYDSVAENGFVSTADRPLSTFAADRDTA 150 Query: 102 GFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR--HI 154 + N + + ++ I + Y T ++ + Sbjct: 151 SYSNVRSYIESGCLPPDGAVRIEEMLNYFTYDYRRKPEDGEKFSIYTEYSDCPWNKATKL 210 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +M ++ +++ +++ ++D S SM K+ + +S + E + Sbjct: 211 MMVGINTDEIDFGDKKPSNLVFLIDTSGSMYED-----NKLPLVQQSFAMLAENLDEND- 264 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + +VT++ + G + + ++ G TN + AY + + Sbjct: 265 -----KVSIVTYAGEDTVVLSGTSGSEQYTINEALSSMTAEGC-TNGGDAIITAYE-LAE 317 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N ++ TDG+ T E E K+ + +G Sbjct: 318 KNFIEGGNNR--------VILATDGDLNVGLTSESDLVDLITEEKKENNIFLSVLGFG 367 >gi|307609442|emb|CBW98936.1| structural toxin protein RtxA [Legionella pneumophila 130b] Length = 1557 Score = 42.8 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 98/293 (33%), Gaps = 37/293 (12%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 ++ + +++ N N+ L+ + R++ S + + ++ + DI + Sbjct: 358 AVNASGGYLISGLNLNDTITLRSANGYDRVEIENARSGAHGVSNSSLNGETFDIGLFSYN 417 Query: 117 DIVVVPQNEGYSI----------SAISRYKIPLKFCTFIPWY-------------TNSRH 153 I P ++ S +I L F Sbjct: 418 TIKTTPSEININMGLSLTDSDGDKITSSIEINLAPSVFKVGENVDDTSSSNVLHRVGGDT 477 Query: 154 IVMPITSSVKVNSQTDARLDM-------MIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 V+ + + +++ VLD S SM F T++++ +++ +L Sbjct: 478 GVVDGSGGADILVGDVGGVEIVGTTARIAFVLDESGSMGQNFGG-TTRLEVLKQTMTDIL 536 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 E+ P+ + V LV F++ + E LQ+ + ++S G+ Sbjct: 537 TELSNTPNASITVH--LVKFASVVNGTGTFEITGGELQQALDFIS--GLQIQQGLLAGTN 592 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 Y + + A+ ++ + F TDG D S Y N A+ G Sbjct: 593 YEAALGQTLQWYNSQSGTADVQQTL-FFTDGAPT-FYMDGNSTEYTNIARVYG 643 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 42.8 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 41/281 (14%), Positives = 95/281 (33%), Gaps = 28/281 (9%) Query: 86 IKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ---NEGYSISAISRYKIPLKFC 142 + + M+ + ND + I + NE S + YK+ Sbjct: 79 QPSLFGMNTVKNQDIHTNFNDDEKIEPKKEEAKQNANKYNLNEKLSFEVKALYKMGKLLN 138 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 + + I + ++ +D++ ++D S SM + KI+M +++ Sbjct: 139 SRTQYLPGIVSIKAQDQAVIQNQENQRVGVDLICLIDKSGSM------NGQKIEMVKQTL 192 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +L+ + + L+TF ++ + L+ ++ K + + S T Sbjct: 193 ALLLDFLNEND------RYQLITFESQAQRLTPLKRVTDGNKQYFKQVIQQINSGGGTTI 246 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAI 321 +Q + N I ++DG + + + Q+ + NE Sbjct: 247 GTATEIAFKQLQERKYRNNVTS------IFLLSDGQDGQANQRIQEQIKTVNEV----FT 296 Query: 322 VYAIGIRV-IRSHEFLRACA-SPNSFYLVENPHSMYDAFSH 360 ++ G + + C SFY V++ + + F+ Sbjct: 297 LHTFGFGEDHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 337 >gi|54038464|gb|AAH84380.1| LOC495168 protein [Xenopus laevis] Length = 554 Score = 42.8 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 73/198 (36%), Gaps = 32/198 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ + ++ ++LE++ Q G++ F++ I Sbjct: 306 ILFVIDVSGSMWGL------KMKQTVDAMKSILEDLNSDD------QFGIIDFNHNIRCW 353 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + + ++ + G TN L A IF ++ + N Sbjct: 354 KDELVYASSVEKGDASKYVQRIQPNG-GTNINDALLRA---IFILKEASN-KGLLEQNSV 408 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR---GAIVYAIGIRVIRSHEFLRACASPNS- 344 +IV ++DG+ + + + ++++GI ++FL A N Sbjct: 409 SLIVLVSDGDPT--VGELKLPKIQKNVRTNIQDDIALHSLGIGFDVDYDFLERLAQENHG 466 Query: 345 ----FYLVENPHSMYDAF 358 Y ++ + F Sbjct: 467 MAQRIYGNQDTAAQLKEF 484 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 42.8 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 46/307 (14%), Positives = 100/307 (32%), Gaps = 25/307 (8%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 +N + + ++ D + V + + +RST + + Sbjct: 232 LNRDHHTDLTVIRPQDGVVTADYPLTALASAPQDARDSVRALTEYLRSTEAQRTLTERTF 291 Query: 126 GYSISAISRYKIPLKF-CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + R PL + +R + + ++ + ++ + VLD S SM Sbjct: 292 RRPVVTSVRPAAPLSAAKRRELPFPGTRSVADGLLAAYE--NELRRPSRTVYVLDTSGSM 349 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-------LE 237 E + ++ A+ + + + + L+ F ++++ Sbjct: 350 E---GDRLDRLKAALTQLAG-ADGAATGERFRDREEVTLMPFGSEVKAVRTHTVPEDDPG 405 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 ++ ++ K L+ G T L+ AY + + + + IV MTDG Sbjct: 406 KALAAIRADAKALTADGE-TAIFSSLRAAYRHLA------ERASALGDDRFTSIVLMTDG 458 Query: 298 ENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVENPHSM 354 EN + +D +S Y +R V+ I E + + + S+ Sbjct: 459 ENTAGDSADDFESFYRRLPGAQRTTPVFPILFGDSDRGELENIASLTGGRLFDATK-GSL 517 Query: 355 YDAFSHI 361 AF I Sbjct: 518 DQAFEEI 524 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 42.8 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 1/86 (1%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 RN +G + + TAI L +I + L + E+ ++F+MK L D + + A +I Sbjct: 11 RNAGRQIRGSILVNTAIALSLIVITL-IGTELGYLFYMKRELQKATDLAALAGAKEISYA 69 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSF 94 G+ + K N + Sbjct: 70 GSCPSAKTAAKLSANGTGSTDRNRNL 95 >gi|150024647|ref|YP_001295473.1| outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] gi|149771188|emb|CAL42655.1| Probable outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] Length = 631 Score = 42.8 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 74/233 (31%), Gaps = 35/233 (15%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT--NSRHIVMPITSSVKVNSQT 168 V ++ + + Y + P T + R +V + + Sbjct: 210 VPKDAVRVEEMINFFKYQYP-QPKDNHPFSITTQYSDCPWNSKRKLVQIGLQGKNIPTND 268 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ ++DVS SM + K+ + I+S+ ++E+++ V VV +G Sbjct: 269 LPASNLVFLIDVSGSMNAD-----NKLPLLIESLKILVEQLRKQDKVAIVVYAGAAGL-- 321 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP------GLKYAYNQIFDMQGMRQHCNT 282 G + Q I L K ++ K A R Sbjct: 322 ----VLPPTAG-NEKQTIINALEKLNAGGSTAGGAGIELAYKTAQENFIKDGNNR----- 371 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ TDG+ N+ + D + ++ G + +G + + Sbjct: 372 --------VILATDGDFNVGSTSDSAMQTLIEDKRESGVFLTCLGYGMGNYKD 416 >gi|34525892|emb|CAE46626.1| trombospondin-related protein [Plasmodium falciparum] Length = 331 Score = 42.8 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 63/183 (34%), Gaps = 29/183 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++D+ +++D S S+ + ++ ++ ++ + N + L FSN Sbjct: 46 QVDLYLLMDCSGSIRRH--------NWVKHAVPLAMKLIQQLNLNENAIHLYLNIFSNNA 97 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +E L IK L TN T L + D Sbjct: 98 KEIIRLHSDASKNKEKALIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDR--------IN 149 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR---VIRSHEFLRACA 340 N ++IV +TDG S Q SL + RG + +GI + + FL C Sbjct: 150 RENASQLIVILTDGIPDSI---QDSLKESRKLNDRGVKIAVVGIGQGINVAFNRFLVGCH 206 Query: 341 SPN 343 + Sbjct: 207 PSD 209 >gi|326912337|ref|XP_003202509.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Meleagris gallopavo] Length = 1086 Score = 42.8 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 63/171 (36%), Gaps = 31/171 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A +I +L+ + N+ V + ++ Sbjct: 240 ATSPKDIVIIVDVSGSMKGL------RMTIAKHTIVTILDTL----GENDFVNI--IAYN 287 Query: 228 NK---IEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + IE F H ++ + L GV T L ++ + + + Q Sbjct: 288 DYVHFIEPCFKGILVQADRDNREHFKQLVDELQAKGVGT-VNKALTESFKILREFRDAGQ 346 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG + + + + R V+ I Sbjct: 347 GGLCNQA-----IMLITDGAVEDYEAVFEKYNWPD----RKVRVFTYLIGR 388 >gi|260827134|ref|XP_002608520.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] gi|229293871|gb|EEN64530.1| hypothetical protein BRAFLDRAFT_92398 [Branchiostoma floridae] Length = 1236 Score = 42.8 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 69/206 (33%), Gaps = 30/206 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S++ + L + V+D S SM+ +T I M ++ K Sbjct: 619 SLEASGHQRTPLRFVAVIDESYSMDDRIGRDKLTLIQRMQIFAELMAKDFKDED------ 672 Query: 220 QSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 Q G+VTF+N + + G KI+ +S G TN + GL A + G Sbjct: 673 QMGIVTFANDAKVVLPMTRMDSSGRDSALEKIQNISTRGQ-TNLSDGLLSAISMFKGSSG 731 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG--------AIVYAIGI 327 H I+ TDG+ D L +K G + IG Sbjct: 732 SDFHNG---------IILFTDGQANQGIIDAAELVQEYNSKMAGLGEGVCLPISTFTIGD 782 Query: 328 RVIRSH-EFLRACASPNSFYLVENPH 352 + E + S F+L ++ Sbjct: 783 YRPKLLCEVAQNLGSDAFFWLSDDTD 808 >gi|209545335|ref|YP_002277564.1| hypothetical protein Gdia_3219 [Gluconacetobacter diazotrophicus PAl 5] gi|209533012|gb|ACI52949.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 632 Score = 42.8 bits (99), Expect = 0.098, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 88/246 (35%), Gaps = 31/246 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 ++ VV + E Y A + Y +P+ T + ++ + S+ ++ T Sbjct: 145 AQAGKAGPVVAVEPENYVDIAKNFYLLPILDATVTDRDSGPSVHMLHVISAPATSTGTRP 204 Query: 171 RLD----------MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ V+D + SM+ + ID +I ++ V+ V + Sbjct: 205 PSSPEALRQFRAGLVFVIDTTASMQPY-------IDQTRAAIGDIVHTVQNTA-VAQNFR 256 Query: 221 SGLVTFSNKIEEFFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 GLV + + + + LE+ L + I + + ++++ + G Sbjct: 257 FGLVAYRDSLADTPGLEYATKVYATPDLGQPIDAVLPAISGVHDASVSSSSFDE-DAVAG 315 Query: 276 MRQHCNTEDANYK--KIIVFMTD-GENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIR 328 +R ++ + + + +V +TD G + +EA+ +G ++ + + Sbjct: 316 VRAAIDSVNWDQFGGRYMVLITDAGARGGDDPHSLTHLGIDEARQLAAAKGIALFVVHLL 375 Query: 329 VIRSHE 334 Sbjct: 376 TPAGDT 381 >gi|326484489|gb|EGE08499.1| von Willebrand RING finger domain-containing protein [Trichophyton equinum CBS 127.97] Length = 919 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 84/267 (31%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N + + W + N+ +++ ++ Y I Sbjct: 410 SVNELPRFHLRFSTLHQLDIWKRAL------GDLQNEGVVTRPDPEVEVPASEED-DYRI 462 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITS---SVKVNSQTDARLDMMIVLDVSRSMES 186 S + + + +S + TS S LD+++V+ VS SM+ Sbjct: 463 SKTPSKRESSGYSSSYGAGKSSNTAMTDYTSPGREPIPVSAFHVPLDIVVVIPVSSSMQG 522 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + ++ +++ + + GLVTF + + W G Sbjct: 523 L------KISLLRDTLRFLVQNLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 570 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 571 GKIFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 621 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 622 DPESGDFVVSRAEAAK-VGIHTFGLGL 647 >gi|145503661|ref|XP_001437805.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404961|emb|CAK70408.1| unnamed protein product [Paramecium tetraurelia] Length = 265 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 74/219 (33%), Gaps = 31/219 (14%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 + +L +I D ++V N Y + ++I +++ Sbjct: 65 KYSQQLPQTAISQEIFDDDDQVQTNLVQAKPNM-YDLEKELIFEIKTLQKMIKLSKISTQ 123 Query: 153 HIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + I+ K +D++ ++D S SM + +KI+ +S+ +L Sbjct: 124 QLPGIISIKTKDQLNDQDLNRVGVDLICLIDKSSSM------NGSKIETVKQSLKVLLTF 177 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + + L+ F+ + L+ + + I + G T + + Sbjct: 178 LSNQD------RLQLIIFNTHAKRLTPLKRITEDNKLYFTQMIDQIKSDG-GTQISSATQ 230 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 A +Q+ +G + N + ++DG++ Sbjct: 231 IAISQL---KGRKYRNNVSS------VFLLSDGQDNDAT 260 >gi|120601101|ref|YP_965501.1| hypothetical protein Dvul_0050 [Desulfovibrio vulgaris DP4] gi|120561330|gb|ABM27074.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 389 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 9/122 (7%), Positives = 41/122 (33%), Gaps = 6/122 (4%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ +G + ++ A+ + + M +++ ++ ++ + D + + A ++N Sbjct: 4 LKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQALIN 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + I + ++ D + + +++ Sbjct: 64 SSGNTDMARTQA------ITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDV 117 Query: 128 SI 129 I Sbjct: 118 HI 119 >gi|308472851|ref|XP_003098652.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] gi|308268252|gb|EFP12205.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] Length = 396 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 34/191 (17%), Positives = 69/191 (36%), Gaps = 21/191 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 LD++ V+D S+ M + + + I S+ + + + GLVT+++ Sbjct: 39 LDVVAVVDNSQGMT---NPGLINVASDIYSVFSSGTRIGSNSSEPRTTRVGLVTYNSNAT 95 Query: 229 ---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ ++ ++ + I + S +T GL+ A + D NT A Sbjct: 96 QKADLNKYQSIDDVLNEIYDDISTVVNTADSYLAT-GLQLAEKMLIDQSE-----NTNRA 149 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---LRACASP 342 +YK++++ + + L N K + + L ASP Sbjct: 150 HYKRVVIVY--ASEYKGEGELDPLNVANRLKLSDINIITVAYEQKGDDGLFHDLSQIASP 207 Query: 343 NSFYLVENPHS 353 F V N Sbjct: 208 G-FSFVNNASD 217 >gi|260813592|ref|XP_002601501.1| hypothetical protein BRAFLDRAFT_248587 [Branchiostoma floridae] gi|229286798|gb|EEN57513.1| hypothetical protein BRAFLDRAFT_248587 [Branchiostoma floridae] Length = 91 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 39/97 (40%), Gaps = 11/97 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++++LD S S+ ++ + + ++E + P Q G++ +S Sbjct: 4 RTPLDIILLLDGSGSVGAY------NFEKVKQFSQKLVETFDIGPSG---TQIGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 ++ + F + S L I ++ T + + Sbjct: 55 RVRQEFSMNSFQSKEALSNAIDDIAYMRGGTLTGRAI 91 >gi|170584326|ref|XP_001896952.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158595641|gb|EDP34180.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 945 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 65/170 (38%), Gaps = 21/170 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S++ +++D+ I+LD S S + F+ + + L E I V Sbjct: 235 SLESKKCKYSKMDLEIILDASSSRQQVFEH--------QRELALSLIERLPIDADETHVA 286 Query: 221 SGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G+ +F++ + L ++R I+ + G ST + ++ + + Sbjct: 287 VGINSFTSVPTLRQTLGLGRDKKMVRRAIEDIKYNGGSTFTAQAVELSVQDL-------- 338 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ +++V M DG + ++ L K GA ++ + + Sbjct: 339 -QRGRRSDAIQVVVLMNDGMSQDP--WEKVLEASKLLKSTGAELFGVALG 385 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 85/270 (31%), Gaps = 33/270 (12%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL-RDNGFVNDIDDIVRSTSLDIVVV 121 T + + KLK I +++ N+L + N + + L + Sbjct: 701 TTVASIEISRPTTKLKKSTTKRNITKEKSIAVLNKLEKKNETLLLDGKNSKEKQLKEISE 760 Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + ++ +K TF P T + +P+ +D+++++D S Sbjct: 761 NLHRSXQTTSNFNFKQHSLSTTFKPDGTTTTTTTIPLPGC---------EIDVIMLIDSS 811 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLI---PDVNNVVQSGLVTFSNKIEEFFLLEW 238 S+E F+ I + + V++ + +F E+ Sbjct: 812 GSVEKTFNREKELAAEIINRLRIGANNAHVAIVKFAAKEKVKT-VWSFDKPQEK------ 864 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 + + + +S +T L A + +G R +++ TDG Sbjct: 865 --EKVLKALHEISFSSGTTAIHTALLQAITEYSTKKGARPEQAIP------LLIIFTDGF 916 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + SL + ++AI I Sbjct: 917 GQKDTTEAASL-----LRNLIPNIFAIAIG 941 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 10/127 (7%), Positives = 37/127 (29%), Gaps = 8/127 (6%) Query: 8 IRNFFYN--YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +R ++ +G + +TA+ + + + ++V+ ++ L + D S + A Sbjct: 1 MRRLKHDDRERGAVAPMTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQAC 60 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + + N ++ + + + Sbjct: 61 AQNAASADCA------APTSAATSLAGLNALDGVSNAPQASVNLGTGTVDVTTQSRNTSG 114 Query: 126 GYSISAI 132 + + Sbjct: 115 DNHFTLV 121 >gi|119578125|gb|EAW57721.1| hCG2042833 [Homo sapiens] Length = 638 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 41/275 (14%), Positives = 86/275 (31%), Gaps = 31/275 (11%) Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKI 137 ++ + + + + N + I + + + V N + I Sbjct: 277 HSFLVSQGDREQVPNLVSMVTRNPASDKIKRLPGDIQVVPIGVGPNANVQELERIGWPNA 336 Query: 138 PLKFCTFIPWYTNSRHIVMPITSS--------VKVNSQTDARLDMMIVLDVSRSMESFFD 189 P+ F + +V+ S + LD++++LD S S Sbjct: 337 PILIQDFETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSSF----- 391 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNN--VVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + D A + + + P + V+Q G +T I+ + + +HL + Sbjct: 392 -PASSFDEMKSFAKAFISKANIGPHLTQGSVLQYGSIT---TIDVPWNVAPEKAHLLSLV 447 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + G + L +A + H A+ K +++ +T Sbjct: 448 DVMQREGGPSQIGDALGFAVRHLTS----EMHGARLGAS-KAVVILVT------DVSVDS 496 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + A+ V+ IGI LR A P Sbjct: 497 VDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 531 >gi|29788808|gb|AAP03354.1| hypothetical protein [Oryza sativa Japonica Group] gi|108708692|gb|ABF96487.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 540 Score = 42.4 bits (98), Expect = 0.099, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 29/259 (11%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 ND + + + + G + + + ++ P N + +++ + + Sbjct: 4 NDDEQSAPVANNNGGTTTRPPGITGVPVGQVELTKYHNAEAPLDPNDQEVLLELKGT--S 61 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + A LD++ VLDVS SM K+D ++ ++ ++ + + +V Sbjct: 62 SATSRAALDLIAVLDVSTSMAGN------KLDRMKAALLFVIRKLADVD------RLSIV 109 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TFSN L + R G++ + ++ + R+ Sbjct: 110 TFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRA 169 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--- 341 N ++ M+DG+ + D L G V+ G+ L+A A Sbjct: 170 VN----VMLMSDGQQN--RGDATRLDP------GGVPVHTFGLGADHDPAVLQAIAGKSR 217 Query: 342 PNSFYLVENPHSMYDAFSH 360 F+ V + ++ FS Sbjct: 218 EGMFHYVADGVNLTAPFSQ 236 >gi|168701952|ref|ZP_02734229.1| hypothetical protein GobsU_20668 [Gemmata obscuriglobus UQM 2246] Length = 864 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 66/212 (31%), Gaps = 21/212 (9%) Query: 138 PLKFCTF-IPWYTNSRHIVMPITSSVKVNSQTDARLD-MMIVLDVSRSMESFFDSSITKI 195 P KF F + ++V P + +IV+D S SM S TK Sbjct: 44 PSKFTLFNLLGQPRGYYLVEPHELDRGDAPGDAKSVAHSVIVVDRSGSMYSALPE--TKE 101 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 + + L+ + + G V + + S Q+ IK + + Sbjct: 102 TLLKILTLDEYAQYNLLVTLISYSSQGDVICHFERVPIREIMKKDSRYQKDIKSI-QTSC 160 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS---LYYC 312 +T + GLK A ++ + I TDG ++ L C Sbjct: 161 ATCISQGLKLASEKVMAGELT-------------AITIHTDGYANDPSSTSEAATLLKLC 207 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 ++ K + V + L A+ S Sbjct: 208 DDMKGKDVFVNTLAYGDYTDFRLLSRIANAGS 239 >gi|46581745|ref|YP_012553.1| hypothetical protein DVU3344 [Desulfovibrio vulgaris str. Hildenborough] gi|46451168|gb|AAS97813.1| hypothetical protein DVU_3344 [Desulfovibrio vulgaris str. Hildenborough] gi|311235373|gb|ADP88227.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 389 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 9/122 (7%), Positives = 41/122 (33%), Gaps = 6/122 (4%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ +G + ++ A+ + + M +++ ++ ++ + D + + A ++N Sbjct: 4 LKRLLNEERGNVALIVALSSFALLGLGTMAVDLGVVYTKRSQMQKAADIAALAGAQALIN 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + + I + ++ D + + +++ Sbjct: 64 SSGNTDMARTQA------ITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDV 117 Query: 128 SI 129 I Sbjct: 118 HI 119 >gi|261822923|ref|YP_003261029.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261606936|gb|ACX89422.1| von Willebrand factor type A [Pectobacterium wasabiae WPP163] Length = 212 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 38/132 (28%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM ++ + ++ ++ P ++TF + Sbjct: 3 RLPVYLLLDTSGSMHG------EPMEAVKNGVQTLITTLRQDPYALETAYVSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L ++ + +T L I D Sbjct: 57 RQVVPLTDLINFKSPDLVA----NGTTALGEALSLVAQAIEREVQKTTAETKGDWRPLVF 112 Query: 291 IVFMTDGENLST 302 I MTDG Sbjct: 113 I--MTDGAPTDD 122 >gi|167626844|ref|YP_001677344.1| hypothetical protein Fphi_0624 [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596845|gb|ABZ86843.1| conserved hypothetical membrane protein with von Willebrand factor type A domain [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 332 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 60/162 (37%), Gaps = 21/162 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +I LDVS+SM++ D S T+++ A +L +K Q G++ FS++ Sbjct: 91 VSRVIALDVSQSMDT-TDVSPTRLERAKYKTLDILRRIKEG-------QVGMIVFSSEPF 142 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + ++ + ++ + + N I+ T+ K I Sbjct: 143 VVSPLTSDANTVENLVPVIN--------SDIVPVQGNNIYKAIEKSAQLITQAGAKKGQI 194 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +TD + ++ + ++G I + Sbjct: 195 ILITD-----STPSADAIAKAKQLAEQGIDTDVYAIGTPKGG 231 >gi|302550736|ref|ZP_07303078.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] gi|302468354|gb|EFL31447.1| toxic cation resistance protein [Streptomyces viridochromogenes DSM 40736] Length = 248 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 68/212 (32%), Gaps = 37/212 (17%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM ++ + + + + D V Sbjct: 32 SLTKHGLDGLRAAVYLVIDHSGSMRPYYKDG--SVQALADRVLGLSAHLD--DDGTVPV- 86 Query: 221 SGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V FS ++ L + R + L G T+ + + D Sbjct: 87 ---VFFSTDVDAETEIALADHQGRIDRIVAGLGHMGR-TSYHLAMDAVIDHYLDSGSKD- 141 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--- 335 ++VF TDG +S ++ Y C A+ + IG S +F Sbjct: 142 ---------PALVVFQTDGGPVSKLAAER--YLCKAARLPLFWQF-IGFGDPGSKQFDFL 189 Query: 336 --LRACASPNS-------FYLV-ENPHSMYDA 357 L A P F+ E+P + DA Sbjct: 190 RRLDELAVPEKRVVDNAGFFHAGEDPREVPDA 221 >gi|291236633|ref|XP_002738245.1| PREDICTED: calcium channel-like protein [Saccoglossus kowalevskii] Length = 1069 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 55/138 (39%), Gaps = 21/138 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + DMMI++DVS S+ +++ S ++++ + N+ V +F+ Sbjct: 229 AASPKDMMILIDVSGSVHGLV------LELIKASAVSLIDTL----GENDFVNI--ASFN 276 Query: 228 NKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + E + ++ KI+ L G+ + G +A+ Q + Sbjct: 277 ERARFVSCFETFVQANERNKNVMKDKIRLLVDNGI-ASFDIGFTFAFEQFKKFKETSAFQ 335 Query: 281 NTE-DANYKKIIVFMTDG 297 AN ++I+ +TDG Sbjct: 336 EENQGANCSQVIMLLTDG 353 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 37/96 (38%), Gaps = 1/96 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + G + +L AI L + + + I+V H++ ++ L + D + A ++ E + Sbjct: 10 KDQNGAVIVLVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESGAS 69 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID 108 I+N + E+ N N D Sbjct: 70 VNPNANVIANNTAIENI-SEDVPVEVNYNAAANTGD 104 >gi|156378647|ref|XP_001631253.1| predicted protein [Nematostella vectensis] gi|156218290|gb|EDO39190.1| predicted protein [Nematostella vectensis] Length = 592 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 67/176 (38%), Gaps = 23/176 (13%) Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSNKIEEFFLLE 237 S S++ + +S T++ A+ S+ + E ++++P V NV + F+ + + Sbjct: 370 SISIDEY--NSKTRLKCAVGSL--LFESIQILPSVLARFNVYRISEREFNERSKH----- 420 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + L + ++K G T++ LK+ + +F+ N I+V +TDG Sbjct: 421 ---NGLVLSVLTVAKKGGVTHTGKALKFVQDNVFEPIQNSSESKGNAMN---ILVLLTDG 474 Query: 298 ENLSTKED---QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 ++ + K + ++G+ E P+ N Sbjct: 475 KSNDPDANLLANSVRNIKRTIKD--LTIISVGVADYDLKELEFIATDPSKHVFTSN 528 >gi|242078369|ref|XP_002443953.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] gi|241940303|gb|EES13448.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] Length = 567 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 84/233 (36%), Gaps = 42/233 (18%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 +T+ + + + + + A LD++ VLDVS SM K+ + +++ ++ Sbjct: 56 RFTSRDRFAVLVHAKAPSDV-SRAPLDLVTVLDVSDSM------KGEKLALLKQAMCFVI 108 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPG 262 +++ + +VTFSN L G + + ++ L+ G TN G Sbjct: 109 DQLGPAD------RLSVVTFSNDASRLTRLARMSDAGKASAKIAVESLAVQGF-TNIKQG 161 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---------ENLSTKEDQQ----SL 309 + A + + ++ ++DG TK SL Sbjct: 162 IHVAAEVLAGRREKNVVAG---------MILLSDGHDNCGGTSVRPDGTKSYVNLVPPSL 212 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSH 360 + + A ++ G + A A+ +F V + ++ D+F+ Sbjct: 213 TVAAGSSRPAAPIHTFGFGTSHDAGAMHAVAEATGGTFSFVGDEAAIQDSFAR 265 >gi|54293628|ref|YP_126043.1| structural toxin protein RtxA [Legionella pneumophila str. Lens] gi|53753460|emb|CAH14915.1| structural toxin protein RtxA [Legionella pneumophila str. Lens] Length = 7919 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 48/293 (16%), Positives = 101/293 (34%), Gaps = 37/293 (12%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL 116 ++ + +++ N N+ L+ + R++ S + + ++ + DI + Sbjct: 6719 AVNASGGYLISGLNLNDTITLRSANGYDRVEIENARSGAHGVSNSSLNGETFDIGLFSYN 6778 Query: 117 DIVVVPQNEGYSI----------SAISRYKIPLKFCTFIPWY-------TNSRHIVMPIT 159 I P ++ S +I L F +N H V T Sbjct: 6779 TIKTTPSEININMGLSLTDSDGDKVTSSIEINLSPSVFKVGENVDDTSSSNVSHRVGGDT 6838 Query: 160 SSVKVNSQTDARLD-------------MMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + D + + +LD S SM+ F T++++ +++ +L Sbjct: 6839 GVIDSSGGADILVGDVGGVEIIGTTARIAFILDESGSMDQNFGG-TTRLEVLKQTMTDIL 6897 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 E+ P+ + V LV F++ + E LQ+ + ++S G+ Sbjct: 6898 TELSNTPNASITVH--LVKFASVVNGTGTFEITGGELQQALDFIS--GLQIQQGLLAGTN 6953 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 Y + + + ++ + F TDG D S Y N A+ G Sbjct: 6954 YEAALGQTLQWYNSQSGTVDVQQTL-FFTDGAPT-FYMDGNSTEYTNIARVYG 7004 >gi|282892469|ref|ZP_06300803.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497751|gb|EFB40115.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 364 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 64/186 (34%), Gaps = 21/186 (11%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P T + M +LD S SM+ + T+ + + ++ + ++ Sbjct: 85 PNTVLSTGQEEGIPTHQMQFILDTSASMQVKDIRNQTRSEYGKEIVDELARQLD----GK 140 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + G + ++ + + + R ++ T+ +K +I ++ Sbjct: 141 SASLWGFAGQATRLSPATMDALFLRLMIRDLQINEGNVTGTSLINAVKAIQKEISELPQD 200 Query: 277 RQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLY---YCNEAK---KRGAIVYAIGIRV 329 R K + V ++DGE+ + ++++ ++ K +Y IGI Sbjct: 201 R----------KLVAVLLSDGEDTENISAEEKAKNLRVLLDDLKTKFNDRLTIYTIGIGS 250 Query: 330 IRSHEF 335 E Sbjct: 251 REGGEI 256 >gi|225012024|ref|ZP_03702461.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003579|gb|EEG41552.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 288 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 61/164 (37%), Gaps = 34/164 (20%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+V+DVS S + + + + + ++ + GLV F Sbjct: 72 EEERELTLMLVVDVSGSEKFGTKTQFKREILTEIAATLAFSALQNND------KVGLVLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE F + G SH+ R I+ L TN + L Sbjct: 126 SDQIELFIPPKKGRSHILRIIRELLEFKPKSTETNISAAL-----------EFLSGVLK- 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK---RGAIVY 323 KK IVF ++D +++L AKK G VY Sbjct: 174 ----KKAIVFILSD---FMDSGYEKTLRI--TAKKHDLTGIRVY 208 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 17/110 (15%), Positives = 34/110 (30%), Gaps = 11/110 (10%) Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 G TN PGL + + + + ++V ++DG ++ Sbjct: 47 GGGTNMAPGL------FISREILERPIFPSQIYLRPVVVVLSDGLTSHP---AKTSEIAT 97 Query: 314 EAKKRGAIVYAIGIRV-IRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + KK A + + + S FY + F+ +G Sbjct: 98 QLKKD-ADIVTVAFGDDADEPYLISLATSSEHFYHCRTGTDLRAFFASVG 146 >gi|298491820|ref|YP_003721997.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233738|gb|ADI64874.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 224 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 62/166 (37%), Gaps = 19/166 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ +I+ + + +E+ V+ ++TF + + Sbjct: 24 VLLLDTSGSMQG------ERIEALNQGLLTFKDELVKNSLAARRVEVAIITFDSHVNVV- 76 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + V+ Q L+ G++T G+ + I Q + T Y + VFM Sbjct: 77 --QDFVTADQFNPPILTAQGLTTM-GAGINKSLEII---QERKSQYRTNGIAYYRPWVFM 130 Query: 295 -TDGENLSTKEDQQSLYY----CNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE ++ +EA KR + +G+ Sbjct: 131 ITDGEPQGELDNVIEQAVQRLQGDEANKR-VAFFTVGVENANMTRL 175 >gi|251795506|ref|YP_003010237.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247543132|gb|ACT00151.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 276 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 53/164 (32%), Gaps = 43/164 (26%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTF--- 226 ++ ++D S SM F +S TK + +IN +L+ + + + G++ + Sbjct: 14 ILFIIDQSGSMSDPFGAS-TKSESVSDAINRLLQNLVIKCAKSEGIRDYYHVGVIGYGAG 72 Query: 227 ----------SNKIEEFFLLEWGVSHLQRKIKYLSK-------------------FGVST 257 ++ + S L +++K + T Sbjct: 73 VGPAFGGVLAGKEVVPISEIADNPSKLDKRLKKVPDGAGGLVDQLVKFPVWFEPVANGGT 132 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 ++ A + HC +++ +TDGE+ Sbjct: 133 PMCEAMRTAQTVLTQWISEYTHCFPP------VVIHITDGESTD 170 >gi|116670917|ref|YP_831850.1| von Willebrand factor, type A [Arthrobacter sp. FB24] gi|116611026|gb|ABK03750.1| von Willebrand factor, type A [Arthrobacter sp. FB24] Length = 340 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 75/227 (33%), Gaps = 56/227 (24%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++ LD S SM S A ++ + + D + GL F + + Sbjct: 95 DIILCLDASGSMSS-----------ADAAVVEVFARLAAGFDGE---RLGLTVFDSSAVQ 140 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-----KYAYNQIFDMQ------------- 274 F L +Q +++ K + A + + Sbjct: 141 VFPLTDDYDVVQGQLEAARKAFDGAPGSAAFLDGTWNGAGSSLIGDGLASCVQGFPSNGG 200 Query: 275 --GMRQHCNTEDANYKKIIVFMTD----GENLSTKEDQQSLYYCNEAKKRGAIVYAI--- 325 G + + + +V TD GE + T ++ +L AKK+ VYA+ Sbjct: 201 DTGTGEQAGSGREERSRSVVLATDNFISGEPIFTLQEAAAL-----AKKQDVRVYALNPG 255 Query: 326 ----GIRVIRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIV 366 G + LR + ++Y +++P +A I + + Sbjct: 256 DFDYGTDPDQPGVQLRTAVEGTGGAYYPLDSP----EAVGEIIRRVQ 298 >gi|313896128|ref|ZP_07829682.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312975553|gb|EFR41014.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 243 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/161 (16%), Positives = 63/161 (39%), Gaps = 22/161 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ R+++++V+D S SM + +I V+ ++ V+ Sbjct: 30 TTGAEKPKRIELVLVIDKSGSMG----------GLESDTIGGFNSMVEKQRALHIPVRVT 79 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 V F+++++ + ++ I + T G + + + + Sbjct: 80 AVLFNDEVQ--------TLYDRKAIWRVPPLTDHDYVTGGTTALLDAVGTTIRRMERADG 131 Query: 283 EDANYKKII-VFMTDG-ENLSTKEDQQSLY--YCNEAKKRG 319 A K++ V +TDG EN ST+ + ++ ++ +K G Sbjct: 132 ITAPGTKVVFVIITDGLENASTEFTRAAVKRMISDKQEKAG 172 >gi|223558083|gb|ACM91087.1| hypothetical protein BACUNI_01985 [uncultured bacterium Rlip1] Length = 292 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/133 (21%), Positives = 53/133 (39%), Gaps = 20/133 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +M+++DVS S + + + + A L V + N + G++ F Sbjct: 72 EEEREMTVMLLIDVSGS------NDFGSQSQSKRDLTAELAAVLAFSAIQNNDKVGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+KIE+F + G SH+ R I+ + T+ GL++ Sbjct: 126 SSKIEKFIPPKKGSSHILRIIREIVDFKPVERGTDIGEGLRF-----------LTSAIKR 174 Query: 284 DANYKKIIVFMTD 296 I FMTD Sbjct: 175 RTTAFLISDFMTD 187 >gi|301058328|ref|ZP_07199361.1| conserved hypothetical protein [delta proteobacterium NaphS2] gi|300447564|gb|EFK11296.1| conserved hypothetical protein [delta proteobacterium NaphS2] Length = 292 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 6/90 (6%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+VLD+S S + + + ML + + + G + FS+K+E Sbjct: 77 LTVMLVLDLSGSHRFGTRKKFKR--ELLAEVAGMLAFLAIRTNDK----VGAILFSSKVE 130 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 ++ + G SH+ R IK + + ST Sbjct: 131 KYIPPKKGASHVWRLIKEVFTYQPEDLSTD 160 >gi|162455472|ref|YP_001617839.1| hypothetical protein sce7190 [Sorangium cellulosum 'So ce 56'] gi|161166054|emb|CAN97359.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 378 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 23/151 (15%) Query: 171 RLDMMIVLDVSRSM-ESFFDS---------SITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 L++ I++D S SM + S + + ++ +P ++ Sbjct: 85 PLNLYIMMDKSSSMAGDKWTSAKAGLTAFVNSARFAGVRVALRFFPRVADAVPACDHE-- 142 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + + F L + I+ S G ST P L A + + + H Sbjct: 143 ----AYREPLVPFGPLPDNAGAIVAAIEAESPNGASTPVYPALGGAL--LQGIDVAKNHP 196 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 A ++ +TDG+ + Sbjct: 197 GEASA-----VLLVTDGQPQGPAAQCAGVNP 222 >gi|284050388|ref|ZP_06380598.1| hypothetical protein AplaP_02827 [Arthrospira platensis str. Paraca] gi|291570280|dbj|BAI92552.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 220 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 61/170 (35%), Gaps = 15/170 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D R ++++LD S SM S ID + + +E+ VQ +VTF Sbjct: 10 NPDPRCPVVLLLDTSGSM------SGQPIDQLNQGLVTFQQELSQDSLAARRVQVAIVTF 63 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + +T G+ A N I + +Q + Sbjct: 64 DSYVNVVQDFV----DFDQFNPPHLSTTGTTAMGEGIGTALNLI---ESHKQLLRANGID 116 Query: 287 YKK-IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Y + ++ +TDG + ++ + + A + + +AI ++ + Sbjct: 117 YYRPWLLMITDGAPTDSWQNAAQMLHQFYA-NKKVVFFAIAVQGADMNTL 165 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 75/205 (36%), Gaps = 16/205 (7%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 V + L++ I+LD S SM + + K++ A K+I L IP+ NV+ Sbjct: 142 VHSKKPKEKSLNVEILLDASGSMAGKVNGQV-KMEAAKKAIYNYL---DKIPNNANVM-- 195 Query: 222 GLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L + +K E L G S + ++ K + + + + Sbjct: 196 -LRVYGHKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVN 254 Query: 279 HCNTE--DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEF 335 + E ++ ++DGE + + N++ A+V IG V S + Sbjct: 255 NDFKEYTGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQ 313 Query: 336 LRACASP--NSFYLVENPHSMYDAF 358 L+ A ++ V N +Y Sbjct: 314 LKNTAEAGKGNYATVSNAEELYQTL 338 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 60/184 (32%), Gaps = 28/184 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + +++ V+D S SM I A S+ L + + Sbjct: 306 ASQRAAETVPREIVFVVDTSGSMGG------VSIKQAKGSLTRALRHLGPND------RF 353 Query: 222 GLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ F++ F + +++L G T P L+ A + Sbjct: 354 NVIEFNSSHRALFQHAVPASHHNLQLASEYVRHLEASG-GTEMMPALQLALKLPGAQDEL 412 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 R + ++F+TDG + + + + G+ ++ +GI + F+ Sbjct: 413 RP------EPALRQVIFITDGAVGNESALFE--HIVDSL--GGSRLFTVGIGSAPNAWFM 462 Query: 337 RACA 340 R A Sbjct: 463 RKAA 466 >gi|260813590|ref|XP_002601500.1| hypothetical protein BRAFLDRAFT_152915 [Branchiostoma floridae] gi|229286797|gb|EEN57512.1| hypothetical protein BRAFLDRAFT_152915 [Branchiostoma floridae] Length = 91 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 39/97 (40%), Gaps = 11/97 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++++LD S S+ ++ + + ++E + P Q G++ +S Sbjct: 4 RTPLDIILLLDGSGSVGAY------NFEKVKQFSQKLVETFDIGPSA---TQIGVIQYST 54 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 ++ + F + S L I ++ T + + Sbjct: 55 RVRQEFSMNSFQSKETLSNAIDDIAYMRGGTLTGRAI 91 >gi|125586596|gb|EAZ27260.1| hypothetical protein OsJ_11197 [Oryza sativa Japonica Group] Length = 540 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 29/259 (11%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 ND + + + + G + + + ++ P N + +++ + + Sbjct: 4 NDDEQSAPVANNNGGTTTRPPGITGVPVGQVELTKYHNAEAPLDPNDQEVLLELKGT--S 61 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ + A LD++ VLDVS SM K+D ++ ++ ++ + + +V Sbjct: 62 SATSRAALDLIAVLDVSTSMAGN------KLDRMKAALLFVIRKLADVD------RLSIV 109 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 TFSN L + R G++ + ++ + R+ Sbjct: 110 TFSNDAARLCPLRFVAGDAARADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRA 169 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--- 341 N ++ M+DG+ + D L G V+ G+ L+A A Sbjct: 170 VN----VMLMSDGQQN--RGDATRLDP------GGVPVHTFGLGADHDPAVLQAIAGKSR 217 Query: 342 PNSFYLVENPHSMYDAFSH 360 F+ V + ++ FS Sbjct: 218 EGMFHYVADGVNLTAPFSQ 236 >gi|47226325|emb|CAG09293.1| unnamed protein product [Tetraodon nigroviridis] Length = 173 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 50/163 (30%), Gaps = 15/163 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS----GLVTF 226 + + +D S ++ ++ + + + + +Q G + F Sbjct: 7 PIKVFFTIDTSETIALQESPPGALVESIKQFTRTFVRAL-SDEEYRGHIQISWSVGGLNF 65 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S F + R + + G T + +K + R +T+ Sbjct: 66 SQDHWVFSQFT-NKQNFIRNLGQVRYKGKGTFTDCAIKN-----MTYEMTRHDSDTKAVP 119 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 V +TDG + +A+++G ++++ Sbjct: 120 ---FAVVITDGHVTGSPCGGI-KAMAEKAREQGIRIFSVAASR 158 >gi|317507526|ref|ZP_07965248.1| hypothetical protein HMPREF9336_01620 [Segniliparus rugosus ATCC BAA-974] gi|316254173|gb|EFV13521.1| hypothetical protein HMPREF9336_01620 [Segniliparus rugosus ATCC BAA-974] Length = 320 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 27/130 (20%), Positives = 54/130 (41%), Gaps = 23/130 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAM-LEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+DVSRSM + S TK D+A+ ++ M L + +++ LV + Sbjct: 91 VLLVVDVSRSMSALAPSGETKRDVAVNAMGMMGLVSIAKGDEIS------LVYGDARGSA 144 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 G +H++R + ++ + + ++ AY E K+ Sbjct: 145 RMPPGRGETHVERILARVAGHDLDHAARDSLAVQLAY--------------VERNFRKRY 190 Query: 291 IVFMTDGENL 300 +VF+ GE Sbjct: 191 VVFLVSGEPS 200 >gi|220912876|ref|YP_002488185.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859754|gb|ACL40096.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 333 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 62/209 (29%), Gaps = 45/209 (21%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ LDVS SM S A ++ + E+ D + GL F + + Sbjct: 91 DIMLCLDVSGSMSS-----------ADAAVVDVFAELAREFDGE---RIGLTIFDSTGSQ 136 Query: 233 FFLLE-----------------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 F L G + S+ GL N Q Sbjct: 137 VFPLTDDYGYARDQLLLARDAFDGKPGSSGFLDGTWGGRGSSLIGDGLASCLNGFPRTQD 196 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE--AKKRGAIVYAI-------G 326 + + +V TD + AK R VYA+ G Sbjct: 197 SDAAADNPAVKRSRSVVLATD---NYVSGNPILTLPQASALAKDRTVRVYALNPGDLDYG 253 Query: 327 IRVIRSHEFLRACA--SPNSFYLVENPHS 353 + LR A + S+Y ++NP + Sbjct: 254 SDAGQPGAQLRVAAESTGGSYYALDNPGA 282 >gi|254780135|ref|YP_003064548.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] gi|254039812|gb|ACT56608.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 54/398 (13%), Positives = 131/398 (32%), Gaps = 47/398 (11%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 L + +G ++TAI L ++ ++++ + +L ++D + + + Sbjct: 9 LGVLRLKKCTRGVFLVITAILLSSFVAIVDVVVDQVTVMQKTALLQEVLDHVIYRTSPKN 68 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + R+ + I +I+ N +L + G + I ++ + + Sbjct: 69 LYDL----REAGRDNFIRHQIEKALNTYNSRDLSNIGSIESIV----KDAVILTKNVNSL 120 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM-IVLDVS--R 182 + ++ + + + I+ KV + + L +M D Sbjct: 121 PLQFTVDIALSTTVQLRGSLLQMFSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLA 180 Query: 183 SMESFFDSSITKIDMAIKS--------INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 S DS + + S N +++ N + + K ++ Sbjct: 181 SRGKVADSQVHRPKYLEYSHYYQQYLNRNTLVKNFLSQIPYKNFC---MAPYHYKSIFYW 237 Query: 235 LLEWGVSHLQRKI------KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ------HCNT 282 E H + + + +K ++ + G Q HC Sbjct: 238 AFENFQYHQDNNMLIFYSHRDFYRITWDHFPYSFIKNVFDMTSNQFGDGQVLTNTNHCFP 297 Query: 283 EDANYKKIIVFMTDGENLSTKED-----QQSLYYCNEAKK-----RG-AIVYAIGIRV-I 330 A++ K ++ G LS + L C K RG ++++G Sbjct: 298 HGASHSKYMLMFAIGNQLSRSSVGKETIDRILQDCYYMHKHHRTGRGAITIFSVGFSPDQ 357 Query: 331 RSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVT 367 + LR CAS P+ +Y + + ++ + ++++T Sbjct: 358 DTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVIT 395 >gi|320104270|ref|YP_004179861.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319751552|gb|ADV63312.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 706 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 54/133 (40%), Gaps = 16/133 (12%) Query: 174 MMIVLDVSRSM-------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++VLD S SM E S T++D A + ++ E N++V ++ F Sbjct: 99 LILVLDHSASMTATDVAGEPILGSKPTRLDEAKRQALRVVRE----KSANDLVM--VIAF 152 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ--GMRQHCNTED 284 +N+ + L ++I+ L +T+ L+ A + ++ + Sbjct: 153 ANRAQVVSSYTADEYELVKRIEALEPTQATTSLRDALEVAAGLANPSKLIEPQEGEVATE 212 Query: 285 ANYKKIIVFMTDG 297 ++ +F TDG Sbjct: 213 VVPPEMFIF-TDG 224 >gi|327405283|ref|YP_004346121.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327320791|gb|AEA45283.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 451 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 74/208 (35%), Gaps = 35/208 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S SM + + + T+I+ A + + ++ + ++ ++ + + I Sbjct: 26 ILFILDASNSMNANWGAQ-TRIEAAKELLVKTIDSLDGSANLQIALR--VYGHQSPITAT 82 Query: 234 FLLEWGVS-----------HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + ++ KI+ + G +T L+ A + Sbjct: 83 YQDCNDTKLEVPFGPDNFLKVRNKIRTIIAKG-TTPIARSLEAAASDF------------ 129 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV--YAIGIRVIRSHEFLRACA 340 D N + II+ +TDG D + +G V + IG+ + ++ C Sbjct: 130 PDTNARNIIILITDG---LEACDNDPCVIAKKLHDKGVKVTPFVIGLGLDLTYLDQFKCI 186 Query: 341 SPNSFYLVENPHSMYDAFSH-IGKDIVT 367 S+ E + + I K ++ Sbjct: 187 --GSYSEAETKEAFNNVLKTVISKALIN 212 >gi|228931399|ref|ZP_04094324.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228828205|gb|EEM73915.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 448 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/209 (19%), Positives = 78/209 (37%), Gaps = 16/209 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L++ I+LD S SM + + K++ A K+I L IPD +NV+ L + Sbjct: 144 KEKSLNVEILLDASGSMAGKVNGQV-KMEAAKKAIYNYL---DKIPDNSNVM---LRVYG 196 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ +K + + + + E Sbjct: 197 HKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKEY 256 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACAS 341 ++ ++DGE E + N++ A+V IG V S + L+ A Sbjct: 257 TGEENLNVVYIVSDGEETCGGEPVNAAKNLNQS-STHAVVNIIGFDVKNSEQLQLKNTAE 315 Query: 342 P--NSFYLVENPHSMYDAFSHIGKDIVTK 368 ++ V +Y + + + + Sbjct: 316 AGKGNYATVSTADELYQTLNKEYEKLYKQ 344 >gi|124262931|ref|YP_001023401.1| hypothetical protein Mpe_B0391 [Methylibium petroleiphilum PM1] gi|124262177|gb|ABM97166.1| hypothetical protein Mpe_B0391 [Methylibium petroleiphilum PM1] Length = 601 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 69/184 (37%), Gaps = 26/184 (14%) Query: 175 MIVLD-VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++VLD S SM F + + D A + E + N V LV ++ + ++ Sbjct: 433 VMVLDDESGSMNEPF-GDMRREDAASRVCVGAGEVLN-----NAEVPFALVGYNTSLHQY 486 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + STN+ + +A ++ + + R KI+ Sbjct: 487 KGFDDSWAETLKDFG--PHSASSTNTHLAVVWALRELINRKERR-----------KILKV 533 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHS 353 +TDG+ ++ EAK G V + I +++ R+ P + + + Sbjct: 534 VTDGDPGDQTVLAAAI---EEAKAFGVEVRFVLISSREEYKY-RSMGVP--YGVANDAPE 587 Query: 354 MYDA 357 + +A Sbjct: 588 LANA 591 >gi|222056730|ref|YP_002539092.1| Vault protein inter-alpha-trypsin domain protein [Geobacter sp. FRC-32] gi|221566019|gb|ACM21991.1| Vault protein inter-alpha-trypsin domain protein [Geobacter sp. FRC-32] Length = 665 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 46/125 (36%), Gaps = 24/125 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + VLDVS SM F +D A I ++ ++ N V+ +G +++ + Sbjct: 309 IFVLDVSGSMHGFP------LDTAKTLIRDLIGNLRPTDTFNLVLFAG----GSQVMDPS 358 Query: 235 LLEWGVSHLQRKIKYL--SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + ++ + I+ + + G T L A + + R V Sbjct: 359 SIPATSENITKAIRLIDSQQGGGGTELAAALNKALSLPREKGKAR------------TAV 406 Query: 293 FMTDG 297 +TDG Sbjct: 407 IITDG 411 >gi|113476846|ref|YP_722907.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167894|gb|ABG52434.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 379 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 71/219 (32%), Gaps = 14/219 (6%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I + P F + + ++ I S ++++LD S SM+ Sbjct: 11 IKVTEDGERPAMFLEEQDFQAIVDNDLVDIISWKNPEESVPPPAWIIVLLDFSGSMKEKD 70 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL-------VTFSNKIE----EFFLLE 237 S TK++ AIK+ LE V ++ + E FF + Sbjct: 71 SSGTTKLEGAIKATREFLETTSARGSNTRVAIFPFGEGGGRCNSYKVRRENIKSRFFPAD 130 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC--NTEDANYKKIIVFMT 295 STN LK A + D + + ++ + + ++ ++ Sbjct: 131 DFKHKNLLDNLAKKTPCASTNIYDPLKEAIRLLSDQEDTDFYVPEDSIEPEPRLSVILLS 190 Query: 296 DGENLSTKEDQQ-SLYYCNEAKKRGAIVYAIGIRVIRSH 333 DG + E++ + +V+ +G + + Sbjct: 191 DGYHNKKYENRDFRRLIALLERHDHIVVHTLGYGLTQEQ 229 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 55/163 (33%), Gaps = 16/163 (9%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 +G + AI++ + LG ++++ ++FF++ L + D + + A ++ Sbjct: 23 ERGSFAVAAAIWMLVAIAALG-VVDIGNVFFVRRDLQRVADMAALAGAQRM--------- 72 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR 134 D C N + + + + V D +S ++ Sbjct: 73 ------DDQCSQPNAAAAANASSNGFDRAASGNTLTVSCGRWDTQSNAAPSYFSTTSTPL 126 Query: 135 YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + + +P++ + TS+ K + + + Sbjct: 127 NAVQVTATQNVPYFFLGPSRTVSATSTAKATNVDQFTIGTTLA 169 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/97 (18%), Positives = 39/97 (40%), Gaps = 4/97 (4%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 + +GG IL A+ LP++ + ++++ I + L + D + + A + ++ G Sbjct: 13 LQSERGGAAILFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSL-SDSGG 71 Query: 72 NNRKKLKGGDILCRIKN---TWNMSFRNELRDNGFVN 105 N G I ++ L + G+ N Sbjct: 72 NPYNWSAAGSAALDIARRNVANGAGIQDALIETGYWN 108 >gi|70606980|ref|YP_255850.1| hypothetical protein Saci_1211 [Sulfolobus acidocaldarius DSM 639] gi|68567628|gb|AAY80557.1| conserved protein [Sulfolobus acidocaldarius DSM 639] Length = 380 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 39/233 (16%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +IS + F + + + V T +I+LD S SM Sbjct: 2 TISVKAELSHKYSFTSPLKG--------VFRLIIVPEKVSTARGFHYIILLDTSGSMYG- 52 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 KI+ A + +L IP+ N + +TFSN + S +Q+ I Sbjct: 53 -----VKIETAKQGAMELLSR---IPEGN---KISFLTFSNNVNILSEYADAPSLVQQ-I 100 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 K + G T L+ A ++ ++H + I+ +TDG+ E Sbjct: 101 KQIRS-GGQTVLYRALERA------IEIAKKH------DLPGYIILLTDGQPTDVPE-TD 146 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 + N + I + GI + L+ + Y VE+ + + Sbjct: 147 AYEKLNYPEAYKVIAF--GIGDDYNERLLKVITDKTAGILYHVEDAKEIAEML 197 >gi|326431731|gb|EGD77301.1| hypothetical protein PTSG_08395 [Salpingoeca sp. ATCC 50818] Length = 2026 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 69/191 (36%), Gaps = 19/191 (9%) Query: 171 RLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD + +LD S S+ + +D + ++ + V LIP + ++ G+ Sbjct: 137 PLDAIFILDGSGSVTQPNWD----------RQVDFLTSVVSLIPVSLDDIRVGVCVLLGN 186 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L G + + ++ + + A NQ+ M N + Sbjct: 187 AQTLIDL-NGATSPGAVVAAVNAYTYENFNPSATGEALNQVHTS--MLLQANGRRTDVPA 243 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRACASPNSFY 346 ++V +TDG++ T D+ + + +Y+IG+ + E L S + Y Sbjct: 244 VVVVITDGDSQET--DEYVQEQADIVRSGVDRMYSIGLDTTDPEHAAEQLVVAGSASRVY 301 Query: 347 LVENPHSMYDA 357 + + Sbjct: 302 ELSTLEGQLAS 312 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 55/165 (33%), Gaps = 22/165 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSI----NAMLEEVKLIPDVNNVVQS--GLV 224 + LD S SM + T I+ A + A ++ P V G V Sbjct: 443 PVAAYYCLDGSGSMGD--NDGWTGIEAAATQVFDPEQAAQNLLQTHPQDVTTVAIFNGGV 500 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 T + + + L R + G TN L A ++ Q + Sbjct: 501 TGGSPWQVRGNDGDALRDLARSVADYEPEG-GTNMYACLLRATTELTGQQ---------N 550 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + K+++V MTDG++ + + D + + V AI Sbjct: 551 GDRKRLVVLMTDGQSGAEQRDDAL----DALRSADVPVVAIAFGR 591 >gi|226329024|ref|ZP_03804542.1| hypothetical protein PROPEN_02926 [Proteus penneri ATCC 35198] gi|225202210|gb|EEG84564.1| hypothetical protein PROPEN_02926 [Proteus penneri ATCC 35198] Length = 227 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 H+++P + V + Q +++VLD S SM I + + + +E+K Sbjct: 3 EHLMIPDVALVDNSEQRTP---LILVLDSSGSMYGQP------IQQLNEGLKLLEQELKN 53 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 V+ ++ + + +W + + +T + A +I Sbjct: 54 DVIAAKRVRILVIEYGGYDQCTIHGDW--KDAMDFTAPVLEANGTTPMGQAITLALEEI- 110 Query: 272 DMQGMRQHCNTEDANYKK-IIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 + +Q Y + + M+DG + +Q+ C +A++ Sbjct: 111 --EAEKQRFKQAGVAYTRPWLFLMSDGVPTD--QWEQAAQLCRQAEES 154 >gi|194220392|ref|XP_001916340.1| PREDICTED: von Willebrand factor A domain containing 3B [Equus caballus] Length = 1301 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 62/171 (36%), Gaps = 32/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM+ K+D+ I ++E N VQ + + + Sbjct: 509 VYILIDTSHSMKG-------KLDLVKDKIIQFIQEQLKYKRKFNFVQFDAQAIAWREKLV 561 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + E + Q ++ + G STN+ L+ A+ + E I Sbjct: 562 EIDEDNLKRAQSWVRDIK-IGSSTNTLNALQIAF------------ADKETQA----IYL 604 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIR--SHEFLRACAS 341 +TDG E ++ K + +YAI ++ FL+ A+ Sbjct: 605 LTDGRPDQPLEMVI-----DQVKVFQKIPIYAISFNYNDEIANGFLKELAA 650 >gi|15897956|ref|NP_342561.1| hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|284173931|ref|ZP_06387900.1| hypothetical protein Ssol98_04630 [Sulfolobus solfataricus 98/2] gi|13814281|gb|AAK41351.1| Hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|261602668|gb|ACX92271.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 380 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 38/211 (18%), Positives = 71/211 (33%), Gaps = 34/211 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ LD S SM + KI++A + +E K IP+ N V +TF Sbjct: 33 GSATGFHYIVALDTSGSMTGY------KIELAKQGA---IELFKRIPNGNKV---SFITF 80 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDA 285 S+ V+ ++ + L T G + A Y I + + Sbjct: 81 SS----------NVNVIKEFVDPLDLTNEILQITAGGQTALYTAILTANSLAKKYQMPT- 129 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG L K VY+ GI + + L++ + + Sbjct: 130 ----YLLLLTDGNPTDETNIGNYLKLPYYEK---IQVYSFGIGDDYNEQLLQSVSDKTGG 182 Query: 344 SFYLVENPHSMYDAF-SHIGKDIVTKRIWYD 373 Y + + + + I K + D Sbjct: 183 VMYHISDANEIPQKLPQKAVTQIAAKNVTVD 213 >gi|47522638|ref|NP_999089.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Sus scrofa] gi|3024032|sp|Q29052|ITIH1_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|565283|dbj|BAA07632.1| inter-alpha-trypsin inhibitor heavy-chain H1 [Sus scrofa] Length = 902 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 28/136 (20%), Positives = 53/136 (38%), Gaps = 12/136 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SME K+ +++ +L ++K ++V G S + Sbjct: 285 VVFVIDISSSMEG------QKVKQTKEALLKILSDLKPGD-YFDLVLFGSAVQSWRGSLV 337 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + ++ S STN GL + QG E +N I++ Sbjct: 338 QASTANLDAARSYVRQFS-LAGSTNLNGGLLRGIEILNKAQGSL----PEFSNRASILIM 392 Query: 294 MTDGENLSTKEDQQSL 309 +TDGE D+ + Sbjct: 393 LTDGEPTEGVTDRSQI 408 >gi|67538388|ref|XP_662968.1| hypothetical protein AN5364.2 [Aspergillus nidulans FGSC A4] gi|40743334|gb|EAA62524.1| hypothetical protein AN5364.2 [Aspergillus nidulans FGSC A4] gi|259485192|tpe|CBF82031.1| TPA: von Willebrand domain protein (AFU_orthologue; AFUA_4G01160) [Aspergillus nidulans FGSC A4] Length = 1109 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 54/168 (32%), Gaps = 29/168 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ + D S SMES KI I +N + + N + +F +++ Sbjct: 342 IIFMADRSGSMES-------KISSLINVMNIFIRSLPEACSFN------IASFGSEVTWL 388 Query: 234 FLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + + TN L+ + H N +D + Sbjct: 389 WPCSKRYSQENLDVASKHVDSFRANYGGTNIYCALESVLD----------HFNKQD-DVP 437 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ +TDGE Q + +++GI SH + Sbjct: 438 TNVILLTDGEVWDVDNVIQLVRRTVSMNGSNIRFFSLGIGDRVSHRLV 485 >gi|332877591|ref|ZP_08445335.1| hypothetical protein HMPREF9074_01069 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684440|gb|EGJ57293.1| hypothetical protein HMPREF9074_01069 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 287 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/158 (22%), Positives = 59/158 (37%), Gaps = 22/158 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S F + I I A L L + +GL+ F Sbjct: 72 EEERELTLMLMVDVSGS--ELFGTQQQFKSEIITEIAATLAFSALQNNDK----TGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++IE F + G SH+ R I+ L +F + F Sbjct: 126 SDQIELFIPPKKGKSHVLRIIRELIEFEP-------------KSFKTNIGEAFQFLSRVM 172 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVFM + K ++ + + G +Y Sbjct: 173 KKKAIVFML--SDFIDKGYEKPIQIAAKKHDITGIRIY 208 >gi|241667422|ref|ZP_04755000.1| hypothetical protein FphipA2_01540 [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254875971|ref|ZP_05248681.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841992|gb|EET20406.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 332 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 60/162 (37%), Gaps = 21/162 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +I LDVS+SM++ D S T+++ A +L +K Q G++ FS++ Sbjct: 91 VSRVIALDVSQSMDT-TDVSPTRLERAKYKTLDILRRIKEG-------QVGMIVFSSEPF 142 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + ++ + ++ + + N I+ T+ K I Sbjct: 143 VVSPLTSDANTVENLVPVIN--------SDIVPVQGNNIYKAIEKSAQLITQAGAKKGQI 194 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +TD + ++ + ++G I + Sbjct: 195 ILITD-----STPSADAIAKAKQLAEQGIDTDVYAIGTPKGG 231 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 10/49 (20%), Positives = 27/49 (55%), Gaps = 1/49 (2%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 G ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 1 GSFALVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 48 >gi|167516596|ref|XP_001742639.1| hypothetical protein [Monosiga brevicollis MX1] gi|163779263|gb|EDQ92877.1| predicted protein [Monosiga brevicollis MX1] Length = 7261 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 68/213 (31%), Gaps = 11/213 (5%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 S D + H I + + ++L I+ L + + + Sbjct: 6919 SAKDYAAQHGVQVI--RLKLGTSPRHREAEVLNVIEAPSVFELTGGLDALLSQFNANTPI 6976 Query: 112 RSTSL--DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV-MPITSSVKVNSQT 168 ST D V+ P + P + + + + + + +S Sbjct: 6977 SSTQSSHDSVLPPAPDDLFEKVERVLSQPSPMTATLAAQAEANQLTEIHVELTRGADSTF 7036 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +D+ + +D + SM S+ + +++ ++ + + E + + ++ G V + + Sbjct: 7037 NVDVDLALAMDCTGSMGSWIAAGKSQLAQIVQHVQNQVRE-RFGELASPRIRVGFVAYRD 7095 Query: 229 KIEEF-----FLLEWGVSHLQRKIKYLSKFGVS 256 + L V + KI G Sbjct: 7096 YSDGPNLVDSIDLTTNVQQVIAKINAQVATGGG 7128 >gi|108803481|ref|YP_643418.1| protoporphyrin IX magnesium-chelatase [Rubrobacter xylanophilus DSM 9941] gi|108764724|gb|ABG03606.1| protoporphyrin IX magnesium-chelatase [Rubrobacter xylanophilus DSM 9941] Length = 616 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 53/148 (35%), Gaps = 22/148 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 +++V+D S SM ++ +++ ++ A+LE ++ +++F + Sbjct: 450 LVLVVDSSGSM-----AARSRMSAVKGAVRALLE-----DAYRRRDRAAVISFRGEEARL 499 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 GV +++ L G T GL+ A + E +V Sbjct: 500 LVPPASGVEAAAARLEELPT-GGRTPLAAGLELAAETVL----REASREPERRPL---LV 551 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGA 320 +TDG + L ++RG Sbjct: 552 VITDGR---ATAGEDPLAAARRLRERGV 576 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 29/60 (48%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVH 60 M+ +N + F N KG +T+ I I+ + +G+ I+++ I + L + + + Sbjct: 1 MYIINSVLERFLKNEKGTVTVYLVIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACS 60 >gi|119476218|ref|ZP_01616569.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] gi|119450082|gb|EAW31317.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] Length = 750 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 61/196 (31%), Gaps = 27/196 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +V+D+S SM+ + + +L ++ V G + Sbjct: 41 DVRVVIDISGSMKKNDPNY------LRRPALELLIQLFPEGSAAGVWTFGQWVNNLVPSN 94 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 W ++ + + +S + TN L+ A + ++ +Y ++ Sbjct: 95 TVTTAW-RANASAQAEKISSVALRTNIPAALEKAMADV----------KSKGTDYSIHLI 143 Query: 293 FMTDGE--------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASP 342 +TDG + T + K G ++ + + E L A + Sbjct: 144 LLTDGMVDVSLSSIDNETARQRIVAEILPALKGAGVTIHTVALSQNADLELMELLAAETG 203 Query: 343 NSFYLVENPHSMYDAF 358 + E + F Sbjct: 204 GLSAVAETAEDLSRVF 219 >gi|146284567|ref|YP_001165520.1| von Willebrand factor, type A [Enterobacter sp. 638] gi|145320700|gb|ABP62846.1| von Willebrand factor, type A [Enterobacter sp. 638] Length = 401 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 75/240 (31%), Gaps = 17/240 (7%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 +P +PI +M+++ D S SM D + ++ + Sbjct: 169 MPTAKLPVQGAPAPAPLPIKQICPAKRTKQQAPEMVLIFDASGSMSISMDITPDELRRLM 228 Query: 200 KS--INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 + + E + I + +++ + + + +++ + Sbjct: 229 QDRPVKNFDREPRRISLAHR----SAKQLIDEVPKDMDISLVSAATCQQVSVTPALSFAQ 284 Query: 258 NSTPGLKYAYNQIFDMQGM-------RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 LKYA + I + + + + IIV +TDGE + Sbjct: 285 R--DELKYAIDNIQPVGKTALAEALEKAGKLVDGVDRDAIIVLITDGEETCGGDPCVVAQ 342 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + K R V + I + + A + S Y V N H + + K+ + + Sbjct: 343 QLKQQKPR-LQVNVVDIMNTGAGNCI-ASQTGGSVYAVNNTHEFNEMMNQAIKEYIPEAC 400 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 77/219 (35%), Gaps = 38/219 (17%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++ IT + T D + VLD+S SM K+ + L +++ Sbjct: 300 TFMLSITPGDDLPPITTGS-DWVFVLDISGSMN-------AKLATLGDGVRQALGKLR-- 349 Query: 213 PDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYN 268 N+ + V F ++ EE ++ +++++ + + + TN GL A Sbjct: 350 --GNDRFRI--VLFDDRAEELTSGFVDATPNNIRQYTQKIMQLQSRGGTNLFGGLSLALT 405 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + DA+ IV +TDG ++ + + + + ++ + Sbjct: 406 PL-------------DADRPTGIVLVTDG--VANVGKTRQKDFIDLLENHDVRLFTFVMG 450 Query: 329 VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L A ++ + + +S I I+ Sbjct: 451 NSANRPMLTAMTDASNGFAISVSNS-----DDIAGQILN 484 >gi|302832626|ref|XP_002947877.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] gi|300266679|gb|EFJ50865.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] Length = 733 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 67/223 (30%), Gaps = 32/223 (14%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ + AR+ + VLD S SM S I + ++ + +++++ Sbjct: 212 VSIKAIADVPERARVALTCVLDRSGSM------SGGPIRLVRETCHFLIDQLTSDD---- 261 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQ 274 GL+++++ + E L + + + G ST GL Q + Sbjct: 262 --FLGLISYAHDVREDLPLLRMTPASRTLAHAVVEELVAGGSTALYDGLVAGLRQQMAAE 319 Query: 275 GMRQHCNTEDANYKKII-----------VFMTDGENLSTKEDQQSL----YYCNEAKKRG 319 N TDG+ + S+ + Sbjct: 320 RDLGGGNGASGGASDSSSPSSLSLVHSCFLFTDGQATDGPSNPASIIEGLQAAQAPSGQH 379 Query: 320 AIVYAIGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAFSH 360 V+ G S E L+ A A +Y + + F Sbjct: 380 VTVHTFGFGNGHSVELLQQVAEAQSGVYYYISCEEDIACGFGD 422 >gi|165928916|gb|ABY74499.1| integrin beta 2 [Acropora millepora] Length = 771 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 62/217 (28%), Gaps = 46/217 (21%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS 160 N N I +S I + ++ + + P + + + + Sbjct: 63 NHTRNGCQKIANPSSEAITHQNKPLDTNVKVKPQN----ITLSLRPGQPATVKVDVKMPG 118 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + +D+ ++D+S SM + + SI K I ++ + Sbjct: 119 --------NYPVDLYYLMDLSGSMVE----DLKRFPTLGDSI------AKEITNITRNFR 160 Query: 221 SGLVTFSNKIEEFF---------------------LLEWGVSHLQRKIKYLSKFGVSTNS 259 G F +K + L +S + I ++ G N Sbjct: 161 LGFGAFVDKPLAPYIETQPGSNPNMVPAFGFVNQLSLTENISEFESVINRVNISGNIDNP 220 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 L + + + ++I++ +TD Sbjct: 221 EGTLDALMQIAVCDKEIGWRDK---QSARRIVIVITD 254 >gi|127511247|ref|YP_001092444.1| putative outer membrane adhesin like proteiin [Shewanella loihica PV-4] gi|126636542|gb|ABO22185.1| putative outer membrane adhesin like proteiin [Shewanella loihica PV-4] Length = 4836 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 73/247 (29%), Gaps = 19/247 (7%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + + G + N+S + + NG + I S+ + Sbjct: 4179 SGVSLSAGTLNADGSWTVAVNQLGNLSIKADDSYNGDLEFTLTIKASSVEQSNSDSASSQ 4238 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 ++ R + ++ + + ++ +LD S SM S Sbjct: 4239 TTLDVSLRNYHYDNGTDGDNVINGGEDNDVIVSDTTGIQVVQGENYNVAFILDSSGSMGS 4298 Query: 187 FFDSSITKIDMAIKSINAMLEEVKL--IPDVNNVVQSGLVTFSNKIEE----FFLLEWGV 240 +I+ A + + +K + V LV F++ + + Sbjct: 4299 N------RIESAKDQLLQVFNTLKASVGGATSGTVNVLLVDFNSGTKAHVAVNLADSDAI 4352 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 S L+ + +S TN + + + G + Y F+TDGE Sbjct: 4353 SKLESVLNEISSDNGRTNYEAAFETVID--WFSHGSAASNTGTNLTY-----FITDGETN 4405 Query: 301 STKEDQQ 307 + D Sbjct: 4406 NYNVDAD 4412 >gi|313884738|ref|ZP_07818493.1| von Willebrand factor type A domain protein [Eremococcus coleocola ACS-139-V-Col8] gi|312619999|gb|EFR31433.1| von Willebrand factor type A domain protein [Eremococcus coleocola ACS-139-V-Col8] Length = 549 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 67/199 (33%), Gaps = 13/199 (6%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +Q VLD S SME + +T + + + + L+ + ++ Sbjct: 358 QTQFKKPALTAYVLDYSGSMEGEGNKQMT--EAMAQVLLPENAKANLLQGTQQDISL-VI 414 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS+ ++ + + L +Y+ + G T G+ A + + G Sbjct: 415 PFSDMVDAATVAKGNGQELVDLNQYVQDYVVGGDTAMYEGIIAALDTMVSDYGQDL---- 470 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 +Y IV +TDG+ K + +A+ +++I + Sbjct: 471 --EDYSPAIVILTDGQPNGAKTFKDLSQRYQQAQVD-IPMFSILFGEAEEGKMKEIADLT 527 Query: 343 NSFYLVENPHSMYDAFSHI 361 + + + +AF + Sbjct: 528 KARVF-DGRKDLINAFKQV 545 >gi|221113899|ref|XP_002160633.1| PREDICTED: similar to Collagen alpha-5(VI) chain [Hydra magnipapillata] Length = 9981 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 75/202 (37%), Gaps = 20/202 (9%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + D++ ++DVS M S + +I I+ +++ N + G++TFS+ Sbjct: 3128 HGQFDILYIIDVSVQMLS-----VDRIKEFIR------KQLPTFNISFNETRIGIITFSD 3176 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L G S Q+ + + +S N P A+ ++ +N Sbjct: 3177 VANTLLTLSQGTS-TQQVLNAVKNIQLSLN-NPQFSVAFEKLSSTLSDSF-----RSNVI 3229 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF-YL 347 KI+ +T + + ++ + K IGI + L + A+ S+ Sbjct: 3230 KIVTLITSSSSNGF-DIERINSVTKKLKSSNLKFIVIGIGENVKKDELLSIATDESYAVF 3288 Query: 348 VENPHSMYDAFSHIGKDIVTKR 369 +E+ + M S I I + Sbjct: 3289 IEDDYKMDYQISKINTAIANSK 3310 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 63/184 (34%), Gaps = 12/184 (6%) Query: 188 FDSSITKIDMAIKSINAM-LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQ 244 + + +T+ + + ++ ++L VQ G++ + ++ + L VS+ + Sbjct: 8590 WGTPLTETNQIFDAEKDFTIKLLRLFNISPAYVQIGIIQYGREVHTEYRLSDLVSNQQAE 8649 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 I + N L+ A ++F++ + N N K ++ + K Sbjct: 8650 SAINKIVLRTPGNNLIAALRDARIKLFNLDNEEKLKNGVRRNAPKSLLLF-----NTKKS 8704 Query: 305 DQQSLYYCNE---AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHI 361 + E + G + + + + +S++ V+N + Sbjct: 8705 AVEVKELEAEFIILRSAGTKIVTVTFDSSN-KKIFQQSNDDHSYFYVQNFLQFNALLQPV 8763 Query: 362 GKDI 365 + + Sbjct: 8764 FQQL 8767 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 28/151 (18%), Positives = 60/151 (39%), Gaps = 25/151 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++ ++D+S S+ + I+KI ++ + + + G++ +S + + Sbjct: 2060 FDIVYLVDISNSVSNTKLFEISKI---------IVSQATSFMKNHEERRVGVIGYSLEPQ 2110 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 E L +S + R ++YL + S+N L++ +F +KI+ Sbjct: 2111 EIVSLTSDLSKVYRVLQYLKQQKESSNLKTALEFVIKNVFSTPTT-----------EKIL 2159 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 V + L T ++ N K G V Sbjct: 2160 VIIAGSPPLKTPSEEL-----NNLKDSGVKV 2185 >gi|156314124|ref|XP_001617882.1| hypothetical protein NEMVEDRAFT_v1g225712 [Nematostella vectensis] gi|156196293|gb|EDO25782.1| predicted protein [Nematostella vectensis] Length = 188 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 65/198 (32%), Gaps = 31/198 (15%) Query: 156 MPITSSVKVNSQT--------DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 MP+T V ++D+ + D ++ ++ + + +S++ Sbjct: 1 MPVTGFQSVRPCDLLQKLAECSKKVDIGFLFDGTKDVDMSYKGN------FQQSLSLATT 54 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLS-KFGVSTNSTPGLK 264 +L N + + +SN + F + ++H L+ I+ + G N L Sbjct: 55 LSQLFDISNASARIAVAVYSNYTDSSFSFDSHLTHASLRNAIEKIVYPNGDPRNIGAALN 114 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + Q++ N ++++ + + ++ G I+Y Sbjct: 115 NSMRQLY---------NEPRPRVPRVLIV-----TAHDRSSDGVYVASDMMRRNGVIIYG 160 Query: 325 IGIRVIRSHEFLRACASP 342 I + L AS Sbjct: 161 ISTGGGSDKKHLETIASD 178 >gi|302671561|ref|YP_003831521.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302396034|gb|ADL34939.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 319 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 73/202 (36%), Gaps = 22/202 (10%) Query: 168 TDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + LD++ V+D + SM + ++ +++ +N +L E+ GLVTF Sbjct: 68 STKNLDVLFVIDGTLSMWALDYNGKNERMEGVKADVNYILSELAGSN-------FGLVTF 120 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + ++ + ++ +S L Y + + + N Sbjct: 121 DDTAHVLSPFTQDLKYISDMVDIMATPETYYSSGSNLATPYKDM----EALLQSSNKKEN 176 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY 346 K I+ F++DGE ++ + + GA +G + + Sbjct: 177 RKTIVFFISDGEVTNSNMEMSYAEFAQYI-DSGA---VLGYGSEAGGKM------KENHN 226 Query: 347 LVENPHSMYDAFSHIGKDIVTK 368 V + + DA S I ++ + + Sbjct: 227 YVYDYDTHDDAISRIDEENLKR 248 >gi|257069049|ref|YP_003155304.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] gi|256559867|gb|ACU85714.1| von Willebrand factor type A-like protein [Brachybacterium faecium DSM 4810] Length = 331 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 50/155 (32%), Gaps = 11/155 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLI 212 + + + ++ ++ V+D + SM + +++ + ++E Sbjct: 50 VALRPVTPIESEQTERMNANVFFVVDRTGSMNAEDYAGDRPRLEGVRADMTRVMEM---- 105 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++ F + L I L+ + ++ + A N + Sbjct: 106 ---TEGARYSILAFDSTATRQLPLTTDAGAAAAWIDTLTTEPTAYSTGSNVDRALNPLLV 162 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 ED + ++ + DGEN ++ + Sbjct: 163 ---EISEAKREDPDSSVLVYVLADGENTDGQDAES 194 >gi|302336991|ref|YP_003802197.1| protein of unknown function DUF58 [Spirochaeta smaragdinae DSM 11293] gi|301634176|gb|ADK79603.1| protein of unknown function DUF58 [Spirochaeta smaragdinae DSM 11293] Length = 290 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 9/100 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+DVS S+ F + ++K + L + V+N + G FS++IE++ Sbjct: 79 LFLVIDVSASLG-FGNGKLSKQETVAT-----LAALLAFSAVHNNDRVGAAFFSDRIEKW 132 Query: 234 FLLEWGVSHLQR---KIKYLSKFGVSTNSTPGLKYAYNQI 270 G + + R I + G ++ L+ + + Sbjct: 133 VPPRKGRNQIFRLAGDIMEVEPKGKGSDLALALRGVHESV 172 >gi|226356525|ref|YP_002786265.1| von Willebrand factor type A domain-containing protein [Deinococcus deserti VCD115] gi|226318515|gb|ACO46511.1| putative von Willebrand factor type A domain protein, precursor [Deinococcus deserti VCD115] Length = 708 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 40/129 (31%), Gaps = 18/129 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + VLD S SM D + S++A + ++ V V V + ++ Sbjct: 69 VFVLDTSGSMRGTGDGQADIFERVKASVDAYVRRLRP-DRVEMVSFDSGVRYQRGVDRPA 127 Query: 235 L-LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 EW + L G +T L A +++ I Sbjct: 128 ENPEW-----SALLTGLKADGKNTYLYRSLHTALSKLGGQGEYLT-----------TIFV 171 Query: 294 MTDGENLST 302 +TDG + Sbjct: 172 LTDGIDNDP 180 >gi|86143677|ref|ZP_01062053.1| hypothetical protein MED217_00250 [Leeuwenhoekiella blandensis MED217] gi|85829720|gb|EAQ48182.1| hypothetical protein MED217_00250 [Leeuwenhoekiella blandensis MED217] Length = 288 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 59/162 (36%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+V DVS S FF + + I A L L + GL+ F Sbjct: 72 EEERELTMMLVADVSGS--EFFGTDQQLKSEVVTEIAATLAFSALQNNDK----IGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+ IE + + G SH+ R I+ L T+ LK+ Sbjct: 126 SDGIELYIPPKKGKSHVLRIIRELLEFKPKSKKTDVAQALKF-----------LSGVMK- 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D + Q ++ + G +Y Sbjct: 174 ----KKAIVFVLSD---FIADDYQDTMKIAAKRHDITGIRIY 208 >gi|320105057|ref|YP_004180648.1| hypothetical protein Isop_3542 [Isosphaera pallida ATCC 43644] gi|319752339|gb|ADV64099.1| conserved hypothetical protein [Isosphaera pallida ATCC 43644] Length = 757 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 55/169 (32%), Gaps = 32/169 (18%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 P S + R +++ LD S SM + + T+ D A + +L+E + V Sbjct: 100 PFLESSGLPLVAGQRTHLILALDGSMSMT-YQPAESTRFDQAKAAARQLLKEARPGD-VA 157 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN---STPGLKYAYNQIFDM 273 +VV G + + + R+I L T+ S + A + Sbjct: 158 SVVLLGA----SPRIVVGEPSPRLDEVAREIDDLQATHGGTDLAASFDAIARAMDA---- 209 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 +K ++ +TD Q+ + A G V Sbjct: 210 ----------STISQKQLIILTD---------LQASSWRQPAAAEGVKV 239 >gi|145597693|ref|YP_001161769.1| hypothetical protein YPDSF_0381 [Yersinia pestis Pestoides F] gi|145209389|gb|ABP38796.1| conserved hypothetical protein [Yersinia pestis Pestoides F] Length = 126 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 44/128 (34%), Gaps = 12/128 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + VLD S SM ++ K++ + ++ ++K P ++ F+ Sbjct: 3 RLPIFFVLDCSESM---IGENLKKMN---DGLQMIINDLKKDPHALETAWISVIAFAGVA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L V + L G T+ L+ QI D + Sbjct: 57 KTIVPL---VEVVSFYPPRLP-IGGGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--V 110 Query: 291 IVFMTDGE 298 + +TDG Sbjct: 111 VYLLTDGR 118 >gi|115930656|ref|XP_001179084.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 245 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 61/171 (35%), Gaps = 28/171 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML-EEVKLIPDVNNVVQSGLVTFSN 228 + + +++++D S SM +T ++ K + A++ ++ K N+V+ Sbjct: 2 SSIQVIVLVDTSGSM-------VTHMEDLKKDLVALIWDQFKRENISFNIVRFSADIEPW 54 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + R + G +T + L A+ + +D + Sbjct: 55 RPHIVEPTDANCNDAVRWVSSFVPAG-NTCTLEALSEAFRE-------------KDVDA- 99 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR--SHEFLR 337 I +TDG+ S+ RG V+ I ++ FLR Sbjct: 100 --IYLLTDGKPDSSTSKVF-REIAQVNTVRGVKVHTISFNCNDESANTFLR 147 >gi|120437732|ref|YP_863418.1| hypothetical protein GFO_3411 [Gramella forsetii KT0803] gi|117579882|emb|CAL68351.1| conserved hypothetical protein [Gramella forsetii KT0803] Length = 288 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 51/131 (38%), Gaps = 25/131 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S K + + N + GL+ F Sbjct: 72 EEERELTMMLMVDVSGSE------MFGTQQQFKKDVITEIAATLAFSATKNNDKIGLMLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +++IE + + G SH+ R I+ L G ST+ LKY N + Sbjct: 126 TDQIELYIPPKKGRSHVLRIIRELLEFKPVGKSTDIKGALKYLTNVMK------------ 173 Query: 284 DANYKKIIVFM 294 KK I+F+ Sbjct: 174 ----KKAIIFL 180 >gi|326911153|ref|XP_003201926.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Meleagris gallopavo] Length = 1128 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 73/190 (38%), Gaps = 29/190 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + DM+I++D S S+ + + S+ MLE + VN +V+F Sbjct: 311 GAASPKDMLILVDASGSVSGLT------LKLIRTSVIEMLETLSDDDFVN------VVSF 358 Query: 227 SNKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +N + L+ + +S G+ T+ G YA+ Q+ + R + Sbjct: 359 NNNAQNVSCFNHLVQANVRNKKKLKEAVDKISAKGI-TDYKKGFSYAFEQLLNHSVSRAN 417 Query: 280 CNTEDANYKK---II---VF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 CN + ++ I VF + G++ K Q + C K + +IG I + Sbjct: 418 CNKIIISLERLYHFICVRVFTFSVGQHNYDKGPIQWMA-CEN-KGYYYEIPSIGAIRINT 475 Query: 333 HEFLRACASP 342 E+L P Sbjct: 476 QEYLDVLGRP 485 >gi|325066441|ref|ZP_08125114.1| von Willebrand factor type A [Actinomyces oris K20] Length = 367 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 20/193 (10%), Positives = 65/193 (33%), Gaps = 13/193 (6%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKID-MAIKSINAMLEEVK 210 + + + + + + +++ +V+D + SM E + +D A ++ + +++ Sbjct: 64 LALAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGPDGVDQSASTRLDGVRADMR 123 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 I + + ++ N L + + I + + L+ A + Sbjct: 124 AIREAFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVSGHATGSSLEVA---L 180 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + D +++ +DGE ++ + + A G ++ ++ Sbjct: 181 PMLGLTLAQARQSDPKDIRLVYIFSDGE---ATDNGRGAQAADNA---GISWQSLA-GLV 233 Query: 331 RSHEFLRACASPN 343 L ++ Sbjct: 234 DGGAVLGYGSTEG 246 >gi|302338923|ref|YP_003804129.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301636108|gb|ADK81535.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 390 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 74/212 (34%), Gaps = 39/212 (18%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ ++LD S SM + +ID + + L LIP V L+ F Sbjct: 26 KGNIELFVLLDKSLSM-------VEEIDSVKEYVADELVGRLLIPGDLFV----LINFYG 74 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 K + FF G + I+ L +++ + G D K Sbjct: 75 KTDRFFS---GEVESRADIEALK-DSLTSITADGRFTDIGSALDTLEKTVEAIPVRQGRK 130 Query: 289 KIIVFMTDGENLSTKE----------DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + ++ +TDG+ + + + + L + E K+G ++ +GI + E L Sbjct: 131 RYLLLLTDGKQEAPPDSPYYSPDGSFNHRLLEHTKEIAKKGWKIHILGIGTETAAEEL-- 188 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + AFS + + + I Sbjct: 189 ------------AKELSAAFSSVREKPEEREI 208 >gi|126208028|ref|YP_001053253.1| tight adherence protein G [Actinobacillus pleuropneumoniae L20] gi|126096820|gb|ABN73648.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 520 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 78/242 (32%), Gaps = 27/242 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + + Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNSG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVND--------IDDIVRSTSL 116 N+ K K D R+ FV + + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDTQMVTTFVQAFLPQTNEKAMRLTPTCKT 126 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S ++ ++ P + + + + S + + Sbjct: 127 VTTDNKKGHTSSSEVTCTVSGTVEHKSWFPLKVGNLEVIPQQVDVASKSRAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSMESFFD-------SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +D+M+V D+S SM D + +KI + + + + + D NN + + Sbjct: 187 IDLMVVADLSGSMNFDLDNKKIINNAKPSKIRILKEVLEELAAKSLFNQDSNNNNRIAVA 246 Query: 225 TF 226 F Sbjct: 247 PF 248 >gi|260837260|ref|XP_002613623.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] gi|229299009|gb|EEN69632.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] Length = 655 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 40/214 (18%), Positives = 76/214 (35%), Gaps = 29/214 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + DM+I+LD S S+E + + + +L+ + ++ V +G F+ Sbjct: 298 SPKDMLILLDTSGSVEG------RSLSLMKHTTWFLLDRL----TEDDYVATG--YFNAY 345 Query: 230 IEEFFLL-------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + L + + + L N GL+YA+ +IF+ M Sbjct: 346 AQAVSCLSSFVQATTHNKEVIHKSLDNLEAADQ-ANYYAGLEYAF-KIFNNFEMEDRFEN 403 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACAS 341 + A K+IV +T+ L + Q R V+ I + I L+ A Sbjct: 404 QGAECNKVIVLVTENAELYPEAVFQKYNP-----DRNIRVFVIVVGEPIHDWSVLQKMAC 458 Query: 342 PNSFYL--VENPHSMYDAFSHIGKDIVTKRIWYD 373 N Y V + + +A G+ + + Sbjct: 459 DNRGYFSTVRSDGAAREASGDFGQVLSRPVALEN 492 >gi|195539501|ref|NP_001124213.1| inter-alpha (globulin) inhibitor H2 [Gallus gallus] gi|190576833|gb|ACE79193.1| inter-alpha inhibitor heavy chain 2 precursor [Gallus gallus] Length = 948 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 67/178 (37%), Gaps = 25/178 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++ V+DVS SM K+ I+++ A+L E++ Q L+ F++ Sbjct: 313 ILFVIDVSGSMWGL------KMKQTIEAMKAILSELRAAD------QFSLIDFNHNVRCW 360 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + V ++ I+ + G TN L A I + + + Sbjct: 361 RDNLVSATPAQVEDAKKYIQTIHPNG-GTNINEALLRA-TFILNEAQNLGMLDPNSVSM- 417 Query: 289 KIIVFMTDGENLSTKEDQQSLY--YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 IV ++DG+ + ++ K ++ +GI ++FL+ A+ N Sbjct: 418 --IVLVSDGDPTVGELKLTTIQKNVKQSIKDE-YSLFCLGIGFDVDYDFLQRIATDNR 472 >gi|168032073|ref|XP_001768544.1| predicted protein [Physcomitrella patens subsp. patens] gi|162680257|gb|EDQ66695.1| predicted protein [Physcomitrella patens subsp. patens] Length = 778 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 82/263 (31%), Gaps = 45/263 (17%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN---EGYSISAISRYKIPLKFCTFIPWYT 149 SF N+L++ I ++ D Y + + + L F P + Sbjct: 265 SFGNQLKETFREPGIVAFENESNGDATDGKNQDFVASYQVWSDGIFP-NLIFQDPEPGES 323 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 ++R S + T + ++ +LD S SM ++ A++++ + LE + Sbjct: 324 DNRGSFCLSISPPDPSKITVFQRAVVFLLDRSGSMYG------DPLNDALQALYSGLESL 377 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 K N ++ F ++ F + + K T+ L+ Sbjct: 378 KPEDSFN------IIAFDHETALFSSQMERANSASILRAREWATEKCKARGGTDILSPLQ 431 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--- 321 A+ + + G + +TDG C + R Sbjct: 432 QAFKLVENFPGAVP-----------YVFLITDG------AVDNEKNICLTMQSRIVELGA 474 Query: 322 ----VYAIGIRVIRSHEFLRACA 340 + GI ++ FL+ A Sbjct: 475 RAPRISTFGIGHYCNYYFLKMLA 497 >gi|123428709|ref|XP_001307556.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121889192|gb|EAX94626.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 667 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 65/194 (33%), Gaps = 32/194 (16%) Query: 150 NSRHIVMPITSSVKVNSQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + +I + + L D V+D S SME ID A+K + ML+ Sbjct: 216 SDDYIAVSVYKEFISKRNDYECLSDYFFVIDCSGSMEGKL------IDKAVKCMRLMLQS 269 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK--IKYLSKFGVSTNSTPGLKYA 266 + + ++ + + + L IK + TN LK Sbjct: 270 L----PMKCRFSIYCFGYNFRQLLPIVEYNNENVLLAMNLIKNIKANMGGTNIYNPLK-- 323 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 IF GM K I +TDGE ++ ++ + + K G +Y +G Sbjct: 324 --DIFSQDGML-----------KKIFLLTDGEVDNS---EEIINLVEKNKAFG-NIYTVG 366 Query: 327 IRVIRSHEFLRACA 340 I +R A Sbjct: 367 IGSGADPGLIRNLA 380 >gi|118098040|ref|XP_414917.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 858 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 64/177 (36%), Gaps = 40/177 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +++D S SM+ + ++ I E+++ N V+ L+ F+ E Sbjct: 680 VCVLIDTSGSMDPYLPHITKELTSLI------WEQLR-----KNEVRFNLLRFAENTE-- 726 Query: 234 FLLEWGVSHLQR-------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 W ++ ++++SKF N+ + A + Q + Sbjct: 727 ---SWREHLVEATDKTCHDAVQWVSKFHAHGNTHILM--ALQKALSFQDVEA-------- 773 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRACAS 341 + +TDG+ T + K++ ++ I ++EFL+ AS Sbjct: 774 ----LYILTDGKP-DTSCNLILKEIERLRKQQDIKIHTISFSYVDREANEFLKKLAS 825 >gi|30686226|ref|NP_173345.2| inter-alpha-trypsin inhibitor heavy chain-related [Arabidopsis thaliana] gi|22531102|gb|AAM97055.1| unknown protein [Arabidopsis thaliana] gi|23197960|gb|AAN15507.1| unknown protein [Arabidopsis thaliana] gi|332191682|gb|AEE29803.1| inter-alpha-trypsin inhibitor heavy chain-like protein [Arabidopsis thaliana] Length = 754 Score = 42.4 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 74/200 (37%), Gaps = 41/200 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIEE 232 ++ V+D+S+SM ++ +I+ L ++ N + S FS +E Sbjct: 329 VVFVVDISKSMTG------KPLEDVKNAISTALSKLDPGDSFNIITFSNDTALFSTSMES 382 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 ++R I++++K TN P L+ A + + +G Sbjct: 383 VTSDA-----VERGIEWMNKNFVVADGTNMLPPLEKAVEMLSNTRGSIP----------- 426 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-------IVYAIGIRVIRSHEFLRACA-- 340 +I F+TDG + + C+ KK A ++ G+ V +H FL+ A Sbjct: 427 MIFFVTDG------SVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHYFLQMLANI 480 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S V N + + Sbjct: 481 SCGQHESVYNTDHIEERMDK 500 >gi|262165797|ref|ZP_06033534.1| RTX toxin related protein [Vibrio mimicus VM223] gi|262025513|gb|EEY44181.1| RTX toxin related protein [Vibrio mimicus VM223] Length = 1505 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 65/192 (33%), Gaps = 20/192 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+T ++ ++LDVS SM++ + +++ + +S +LE+ + I VQ + Sbjct: 889 ESETKQDANVQLILDVSGSMDNSAGNGKSRLQVMKESAIQLLEQYQAIGQTK--VQLIIF 946 Query: 225 TFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ W + I L G + L+ A D T Sbjct: 947 AGTASVQLHEKALWMTVDEAKSYIDALKAGGQTDYDH-ALQLA-----DEYWSGNRNGTP 1000 Query: 284 DANYKKIIVFMTDG-----------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + F++DG EN +T E + + + + A G+ Sbjct: 1001 LTEATNVSYFLSDGKPEGFDKQNGVENDNTIEPNELASWISHLESNEITALAYGMGNSVP 1060 Query: 333 HEFLRACASPNS 344 L A Sbjct: 1061 QSELDKIAFDGH 1072 >gi|257127006|ref|YP_003165120.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] gi|257050945|gb|ACV40129.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] Length = 509 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 52/315 (16%), Positives = 111/315 (35%), Gaps = 32/315 (10%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 +V+ I++ G + + + + W + +NEL+ ++ + Sbjct: 55 IVYTYEGIVDAAPGIYQFPDTYEKNIVKKSDIWRENVQNELKKIKPALG----EDASDEE 110 Query: 118 IVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 I + + Y I+ Y P++ + + P T K + + +++ I Sbjct: 111 IERLFKKFLY----IAGYDYEPIETLDRFSYVIFKDDMANPFT---KQKIEENMNVNLEI 163 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 +LD S SM+ S T +++A +SI ++ E+ N + GL F +K + Sbjct: 164 ILDASGSMKQKI-SDKTMMEIAKESIEKVVSEM----PAN--TKVGLRVFGHKGDNTASK 216 Query: 237 ---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKII 291 + L I+ L K + ++ P + I G + I+ Sbjct: 217 KQESCSANELISPIETLDKDKLKSSLAPIQPTGWTSIAKSIENGTNDLKALKGEKTLNIL 276 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKR--GAIVYAIGIRV-IRSHEFLR--ACASPNSFY 346 +TDG D + + K ++ IG V ++ L+ A A+ + Sbjct: 277 YIITDG---IETCDGNPVETAKKFKNENTDIVLGIIGFNVDAHQNKVLKEIANAANGYYS 333 Query: 347 LVENPHSMYDAFSHI 361 + + + I Sbjct: 334 SANDAAKLTEELQRI 348 >gi|307353172|ref|YP_003894223.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156405|gb|ADN35785.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 231 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 59/171 (34%), Gaps = 18/171 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM KI + I +E+ V +++F K+ Sbjct: 20 ILILDTSGSMMVN-----DKIAKLNEGIALFKDEIGKDELARKRVDLAVLSFGQKVN--- 71 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK-IIVF 293 +++ S + + + L G+ T +K A + + E +Y + I Sbjct: 72 VIQDFTSIEEFEPEELVADGL-TPMGEAIKKAVEML---GSRKDEYKKEGIDYYRPWIFL 127 Query: 294 MTDGENLSTKEDQQ-----SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 +TDGE E + + K + +A+G+ + Sbjct: 128 ITDGEPTDMYEGDEMWKEVTNLVHEGEKAGKFLFFAVGVEDADLETLAKIA 178 >gi|256788362|ref|ZP_05526793.1| hypothetical protein SlivT_28069 [Streptomyces lividans TK24] Length = 241 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 61/157 (38%), Gaps = 23/157 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM+ ++ D +++++ + + D + +V Sbjct: 26 SLTKHGMAGTRAAVYLVVDYSGSMKPYYK------DGSVQALADRVLSLSAHLDDDGIVP 79 Query: 221 SGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS ++ L ++R + L G T+ + + D Sbjct: 80 --VVFFSTDVDAVTEIALADHQGRIERIVAGLGHMGK-TSYHLAMDAVIDHYLDSG---- 132 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 A + ++VF TDG ++ ++ Y C A Sbjct: 133 ------ARHPALVVFQTDGGPINRLAAER--YLCKAA 161 >gi|255316498|ref|ZP_05358081.1| hypothetical protein CdifQCD-7_19217 [Clostridium difficile QCD-76w55] Length = 256 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 77/201 (38%), Gaps = 23/201 (11%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+++VLD S S F++ I+ I L + L D + ++ L F+ + Sbjct: 60 FDVVLVLDCSLSTSDLFENGT------IQDIFERLLPISLSFDNDGMLDVWL--FNEEAY 111 Query: 232 EFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + ++ ++++ + + T P +K + ++ ++ Sbjct: 112 QLTSIDMNNLFNYVKNEKLFKKYVRGGTKYAPVIKEIVKEKTELNEAKEPV--------- 162 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I+F+TDG+N KE + + EA + IGI + + Sbjct: 163 YIIFITDGDNSDKKEAELVIR---EASNKPIFFQFIGIGKQKFKFLEQLDVMEGRVVDNA 219 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 N ++ D ++I + + +R+ Sbjct: 220 NLFALNDI-NNISDEELYERV 239 >gi|115689614|ref|XP_787130.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] gi|115969501|ref|XP_001184100.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] Length = 902 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 63/168 (37%), Gaps = 11/168 (6%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++DVS SM K+ ++ +L ++ N + S V F ++ + Sbjct: 347 VIFIIDVSGSMAG------VKLRQVKDALTTILNDMPETDKFNIIPFSDDVNFLDRNKML 400 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 F V +R +K L + +TN + + D D N +++ Sbjct: 401 FSTSSNVRRAKRFVKSLQER-DNTNLHKAIIAGVRMLRD---ESDQNVRPDENVVSMLIV 456 Query: 294 MTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++DG + D++ + EA + ++ +G FL A Sbjct: 457 LSDGNPNHGEIDKEIIERNVEEAIRGDFSLFNLGFGEDLDFPFLERMA 504 >gi|268532304|ref|XP_002631280.1| C. briggsae CBR-CLEC-61.1 protein [Caenorhabditis briggsae] gi|268532308|ref|XP_002631282.1| C. briggsae CBR-CLEC-61.2 protein [Caenorhabditis briggsae] gi|187036878|emb|CAP23543.1| CBR-CLEC-61.2 protein [Caenorhabditis briggsae AF16] gi|187036880|emb|CAP23545.1| CBR-CLEC-61.1 protein [Caenorhabditis briggsae AF16] Length = 403 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 13/195 (6%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S M ++ + + +I S+ +++ + P+ + G+VT++ + + Sbjct: 46 LDIVCVVDNSVGMT---NAGLASVAASISSLFVDGQQLGIQPNNPRTTRIGIVTYNQEAQ 102 Query: 232 EFFLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S + + VST+ L D+ + K Sbjct: 103 VVADLNNFTS-VDELTNKVYSILNKVSTSQDSYLHSGLEAANDLLEQQSFNTARGHYQKL 161 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---LRACASPNSFY 346 +IV+ ++ + Q L K+ + + F L A+P + Sbjct: 162 VIVYTSEYKGTGP---QDPLPLAMRMKQT-VSIATVAYGQDDILGFLAELTKIATPGYNF 217 Query: 347 LVENPHSMYDAFSHI 361 EN + Sbjct: 218 TNENGENTISELRST 232 >gi|21220504|ref|NP_626283.1| hypothetical protein SCO2022 [Streptomyces coelicolor A3(2)] gi|289772254|ref|ZP_06531632.1| toxic cation resistance protein [Streptomyces lividans TK24] gi|5689915|emb|CAB52078.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)] gi|289702453|gb|EFD69882.1| toxic cation resistance protein [Streptomyces lividans TK24] Length = 252 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 61/157 (38%), Gaps = 23/157 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S+ + R + +V+D S SM+ ++ D +++++ + + D + +V Sbjct: 37 SLTKHGMAGTRAAVYLVVDYSGSMKPYYK------DGSVQALADRVLSLSAHLDDDGIVP 90 Query: 221 SGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS ++ L ++R + L G T+ + + D Sbjct: 91 --VVFFSTDVDAVTEIALADHQGRIERIVAGLGHMGK-TSYHLAMDAVIDHYLDSG---- 143 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 A + ++VF TDG ++ ++ Y C A Sbjct: 144 ------ARHPALVVFQTDGGPINRLAAER--YLCKAA 172 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 78/211 (36%), Gaps = 20/211 (9%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K ++T + ++I+ D S SM S KID +++ +++ +N + Sbjct: 3 KFAARTARPIPVIILADTSGSM-----SVDGKIDAMNQALRDLIDTFSGESRLNAEIHLS 57 Query: 223 LVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++TF + +E L +H L G+ T + A I D + Sbjct: 58 VITFGGDGAKEHLPLT--CAHTISGFSDLQAHGM-TPMGGAFRIAKELIEDKE------K 108 Query: 282 TEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRA 338 Y+ +IV ++DG N + SL A+K + AIG + +F+ Sbjct: 109 IPSRAYRPVIVLVSDGYPNDDWEAAFSSLRGSERAQKATRMAMAIGSDADENMLKDFIND 168 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 +P + + F + + ++ Sbjct: 169 PETP--VFRANGARDIIRFFRAVSMSVTSRS 197 >gi|157692632|ref|YP_001487094.1| regulator of nitric oxide reductase [Bacillus pumilus SAFR-032] gi|157681390|gb|ABV62534.1| possible regulator of nitric oxide reductase [Bacillus pumilus SAFR-032] Length = 638 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 71/210 (33%), Gaps = 21/210 (10%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR---LD--MMIVLDVSRS 183 I +K + + I S+ ++ + A +D +++D S S Sbjct: 395 IQKTLEHKKTWPKTDLHAGRLSKKLIRYFTESNPRLFYKKQAPSTEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D K I E +K + + +V G +N E + + + Sbjct: 455 M-------FDKMDETKKGIVLFHEALKSVQVPHQIV--GFWEDTNDASETSQPNYF-NTV 504 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 L G S S + + ++ M + +K ++ +DGE + Sbjct: 505 VSFKDSLFDAGPSIMSLEP-EEDNRDGYAIRQMTKMILKRREE-QKFLIVFSDGEPAAFS 562 Query: 304 EDQ----QSLYYCNEAKKRGAIVYAIGIRV 329 +Q + EA+K+G V + + Sbjct: 563 YEQNGIVDTHEAVLEARKKGIEVINVFLSN 592 >gi|159899615|ref|YP_001545862.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892654|gb|ABX05734.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 265 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 34/254 (13%), Positives = 77/254 (30%), Gaps = 32/254 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + A R P + P R +V+ + + R + V+D S Sbjct: 29 GELTDVAFDATLRVAAPFQRQRRQPHI--KRAVVLHRSDLREKVRVRRTRNAVCFVVDAS 86 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS-NKIEEFFLLEWGV 240 SM + T++ ++ ++L GLV+F+ ++ L V Sbjct: 87 WSMAAE-----TRMQATKAAVLSLLR-----DAYQRRDLVGLVSFARDRATVLLPLTNSV 136 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG--- 297 Q++++ + G + S + + D ++V +TDG Sbjct: 137 EMAQQRLRTMPTGGKTPLSRGMM--------AGFELLARAKLRDREILPLMVLLTDGHAN 188 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACAS--PNSFYLVENPH- 352 Q++ + I + + + A S+Y +E Sbjct: 189 VSMTGQPPQQEAYQLAEFIASQQIPTVVIDTELPNFYRGLAKELADRLNGSYYQLEELSV 248 Query: 353 ---SMYDAFSHIGK 363 + + + + Sbjct: 249 GLAELVRSHQELSR 262 >gi|290980233|ref|XP_002672837.1| vWFA domain-containing protein [Naegleria gruberi] gi|284086416|gb|EFC40093.1| vWFA domain-containing protein [Naegleria gruberi] Length = 340 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 58/146 (39%), Gaps = 22/146 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +D +D++IV+D + SM +I++A ++ ++ + ++ V++ Sbjct: 23 SDKIVDLVIVMDCTGSMSG-------EINVAKNTVATIITTLH--EHFKTDLRFTAVSYR 73 Query: 228 NKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +++ F +++ + I +S G + L A + +M ++ Sbjct: 74 DHTDDYAVKEFPFTKDINNAKEYINTMSAQGGG-DYPEALASALKVVNEMPFNKKG---- 128 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL 309 KKI+V++ D S Sbjct: 129 ----KKIVVWVADAPPHGMNASGDSY 150 >gi|290976237|ref|XP_002670847.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084410|gb|EFC38103.1| vWFA domain-containing protein [Naegleria gruberi] Length = 389 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 60/146 (41%), Gaps = 22/146 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ +D++IV+D + SM +I++A +++ ++ + + ++ V++ Sbjct: 75 SENIVDLVIVMDCTGSMSG-------EIEVAKRTVTTIISTLHEKFQSD--LRFSAVSYR 125 Query: 228 NKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +++ F ++ + I +S G + L A I +M ++ Sbjct: 126 DHTDDYAVKEFPFTKDLNKAKGYINTMSAQGGG-DHPEALASALYVINEMPFNKKG---- 180 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL 309 KKI+V++ D K S Sbjct: 181 ----KKIVVWVADAPPHGMKTSSDSY 202 >gi|261194779|ref|XP_002623794.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588332|gb|EEQ70975.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 756 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 71/202 (35%), Gaps = 18/202 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 D+++ +D+S SM S T K + S+ + + +I +N+ + G+V Sbjct: 73 PCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVV 132 Query: 225 TFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS E + + + + +K + +++ ++ + + Sbjct: 133 AFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNL-----WHGLKLSLEALEEVTPI 187 Query: 284 DANYKKIIVFMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N + + +TDG + L + K R +++ G L+A + Sbjct: 188 PQNVQ-ALYILTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHTFGFGYYIRSGLLQAIS 246 Query: 341 S--PNSFYLVENPHSMYDAFSH 360 ++ + + + F H Sbjct: 247 EVGGGTYSFIPDAGMIGTVFVH 268 >gi|41615188|ref|NP_963686.1| hypothetical protein NEQ403 [Nanoarchaeum equitans Kin4-M] gi|40068912|gb|AAR39247.1| NEQ403 [Nanoarchaeum equitans Kin4-M] Length = 216 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 52/159 (32%), Gaps = 33/159 (20%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM K + K++ L + + GL+ F+NKI + Sbjct: 74 IVLLDCSGSM---------KGEKFEKALAIALSIIYK------YKRVGLILFNNKIIKSI 118 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L + + + TN + L+ A A K I + Sbjct: 119 PPTENKTLLVNSLFVIPR--EKTNISIALEEAMKY---------------AKPKSEIFII 161 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 TD D+ A K V+ IGI + Sbjct: 162 TDAVPTDETVDELIETVRKLALKN-IKVHVIGINLKEGK 199 >gi|46138379|ref|XP_390880.1| hypothetical protein FG10704.1 [Gibberella zeae PH-1] Length = 266 Score = 42.1 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 49/125 (39%), Gaps = 27/125 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S SM+ + S KID+ +++ +L + VN ++++ ++I Sbjct: 155 ILFLLDQSGSMDGCWQWSW-KIDVLRAAMHLVLAGLPKTCSVN------VISWGSEI--- 204 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 W + + I + T+ LK + D Q IV Sbjct: 205 ----WAI-EAKEYISEIMADMGGTDLPRALKSTVQRRLDNSKSTQ------------IVI 247 Query: 294 MTDGE 298 +TDGE Sbjct: 248 LTDGE 252 >gi|31873561|emb|CAD97767.1| hypothetical protein [Homo sapiens] Length = 363 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 17/120 (14%), Positives = 45/120 (37%), Gaps = 11/120 (9%) Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL------STKED 305 + G TN+ L Y + + + + I+ +TDG++ + + Sbjct: 3 ENGTGTNTYAALNSVYLMMNNQMRLLGMETMAWQEIRHAIILLTDGKSNMGGSPKTAVDH 62 Query: 306 QQSLYYCNEAKKRGAIVYAIGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 + + N+ + +YAIG+ + +E +++++ +++ F H Sbjct: 63 IREILNINQKRNDYLDIYAIGVGKLDVDWRELNELGSKKDGERHAFILQDTKALHQVFEH 122 >gi|189501233|ref|YP_001960703.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496674|gb|ACE05222.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 339 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 58/161 (36%), Gaps = 21/161 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LDM+ +LDVS SM + D S +++ A + I + + + GLV F+ Sbjct: 91 ALDMVYLLDVSNSMLA-RDISPDRLERAREEIVRISRGI-------ERGRRGLVAFAGSG 142 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 L + + S T+ + + A ++F + Sbjct: 143 VVQCPLTTDQQAFETMLGIASPDLIEAQGTDISAAMDVA-QKMFSGSKTEEKVKAGG--- 198 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + V ++DGE K + + K++ + +G+ Sbjct: 199 --VAVLVSDGE-AHEKGFSAA---ARKLKEKDVRLIVVGVG 233 >gi|124265333|ref|YP_001019337.1| hypothetical protein Mpe_A0140 [Methylibium petroleiphilum PM1] gi|124258108|gb|ABM93102.1| conserved hypothetical protein [Methylibium petroleiphilum PM1] Length = 711 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 59/159 (37%), Gaps = 25/159 (15%) Query: 142 CTFIPWYTNSRHIVMP-ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 T P + + + + + D + V+D+S SM + +D A Sbjct: 311 LTLFPGAPGEENFFLAMVEPPRAIATTQINPRDYVFVVDISGSMHGYP------LDTAKT 364 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTN 258 + ++ ++ N ++ SG + + E + +++ + ++ +++ G ST Sbjct: 365 LLRHLIGGLRPSDTFNVLLFSGS---NRMLNETSVPAT-QANVAQALRTIAQMGGSGSTE 420 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 P LK R + + + ++ +TDG Sbjct: 421 IVPALK------------RVAALPKSPDVSRSVIVVTDG 447 >gi|193786519|dbj|BAG51302.1| unnamed protein product [Homo sapiens] Length = 164 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 30/87 (34%), Gaps = 8/87 (9%) Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 T + L++ F Q N ++ + TDG + + A Sbjct: 4 GTMTGLALRHMVEHSFSEA---QGARPRALNVPRVGLVFTDGRSQDDIS-----VWAARA 55 Query: 316 KKRGAIVYAIGIRVIRSHEFLRACASP 342 K+ G ++YA+G+ E + P Sbjct: 56 KEEGIVMYAVGVGKAVEAELREIASEP 82 >gi|194335383|ref|YP_002017177.1| protein of unknown function DUF58 [Pelodictyon phaeoclathratiforme BU-1] gi|194307860|gb|ACF42560.1| protein of unknown function DUF58 [Pelodictyon phaeoclathratiforme BU-1] Length = 302 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 65/174 (37%), Gaps = 24/174 (13%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 + +G S + Y+ I W T++R + + K+ ++ R +++V+D S Sbjct: 45 KGKGIEFSNVREYQYGDD-VRSIDWNTSARKHELYV----KLFTEERER-SLLLVVDASA 98 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM + + K + + V + N + GL+ F++++E + G H Sbjct: 99 SM------LFGSREHSKKEVALEVSAVLAFSALQNNDKVGLLVFTDRVETYIPPRKGRHH 152 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + ++ L + N ++ ++II +TD Sbjct: 153 VLVILEELIRMKPG-----------NSATNINAALSFVRYTRQR-QEIIFLLTD 194 >gi|327401134|ref|YP_004341973.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327316642|gb|AEA47258.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 440 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 102/306 (33%), Gaps = 46/306 (15%) Query: 56 RSLVHAATQIMNEGNGNNR--KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +L+ A + N ++L GG R T+ N + FV D+ IV Sbjct: 153 SALLEGAVEEAENVTKNAEEIRELMGGKSAGREAGTFQK-ILNLAKQMMFVEDLKKIVDM 211 Query: 114 TSLDI------VVVPQNEGYSISAISRYKIPLKFCTFIPWYTN-SRHIVMPITSSVKVNS 166 + + + + +G + +S Y I + +P + M +S + Sbjct: 212 SKRMMDFVPKATRISKVKGKTGDELSGYMITKQVERALPRELALPDELFMRRLASEGFLA 271 Query: 167 QTDARLD---MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + ++ I++D S SM + + + +S+ + + ++ Sbjct: 272 REKLKVSEGAYYILIDKSGSM-------VGEKTVWARSVAMAIYRMSKTKRRKYFLR--- 321 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F K+ + + +K + T+ T + A + + + + Sbjct: 322 -FFDTKVHPDKPISEPKEIVDAILK--VQSNGGTDITNAISTAIDDLVEGRFA------- 371 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 Y I+ +TDGE E KK A + I + + +++ L+ + Sbjct: 372 --EYTNTIIVITDGE-------DVVDDLSKELKKTKASL--ISVMIQGNNDTLKNI--SD 418 Query: 344 SFYLVE 349 + E Sbjct: 419 HYMHAE 424 >gi|292669554|ref|ZP_06602980.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292648763|gb|EFF66735.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 244 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 55/148 (37%), Gaps = 20/148 (13%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 T + R+++++V+D S SM+ + +I ++ + Sbjct: 26 GTDDAPKAMEQPKRIELVLVIDKSGSMQ----------GLEKDTIGGFNSMIEKQKKLGI 75 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V+ V F++K + + + +R I+ + T G + + Sbjct: 76 NVRVTAVLFNDKTD--------LLYARRYIQNVRPLTEREYETGGTTALLDAVGGTILKM 127 Query: 278 QHCNTEDANYKKII-VFMTDG-ENLSTK 303 + + K+I V +TDG EN STK Sbjct: 128 ERSKATECGKTKVIFVIITDGLENASTK 155 >gi|288942397|ref|YP_003444637.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897769|gb|ADC63605.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 357 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 60/164 (36%), Gaps = 27/164 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++LDVSRSM + D S ++ A + + ++ + N V+ GL+ F+ Sbjct: 99 LLVLLDVSRSMLAE-DVSPNRLGRARQELQDLIVQ-------NRQVRLGLIVFATVPHVL 150 Query: 234 FLLEWGVSHLQRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L + L S ++ T L A + + Sbjct: 151 APITEDTTSLLNALPALSADLASPGLQGSSLTRALMRAETLLAGLPEDSA---------- 200 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ ++DG+ +Q + G ++ +G+ Sbjct: 201 RAVLLISDGDFDEPGLREQVARLAEQ----GVRLHVLGVGTPGG 240 >gi|168019616|ref|XP_001762340.1| predicted protein [Physcomitrella patens subsp. patens] gi|162686418|gb|EDQ72807.1| predicted protein [Physcomitrella patens subsp. patens] Length = 621 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 52/304 (17%), Positives = 96/304 (31%), Gaps = 57/304 (18%) Query: 85 RIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKI------- 137 ++K + N + F N +++I+R I N+ + I+ YK Sbjct: 210 KVKCSINTGTNGGVELGAFGNPMEEILREHGKVIFKKGGNDWKNQDFIASYKTWSDGIFP 269 Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 L F P +++R S N + ++ +LD S SM I+ Sbjct: 270 NLIFQDPEPGESDNRGSFCLSISPPDPNKIKVFQRAVVFLLDRSGSM------KGKPIEA 323 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL---EWGVSHLQRKIKYLS--K 252 A +++ LE +K N ++ F + + F S + ++ Sbjct: 324 ARQALYFGLESLKPEDSFN------IIAFDHDLTLFSPQMERSTTTSIARACEWSMTNCT 377 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T+ L+ A+ + + G + +TDG C Sbjct: 378 ARGGTDILSPLQQAFQLLENFPGAIP-----------YVFLITDG------AVSAEQNIC 420 Query: 313 NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 + R + A+G R R F C +FY + IG+ + Sbjct: 421 LTMQSR---IVALGARAPRISTFGIGHYC----NFYF-------LKMLAVIGRGMNEVAF 466 Query: 371 WYDK 374 DK Sbjct: 467 KSDK 470 >gi|327351891|gb|EGE80748.1| U-box domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 756 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 71/202 (35%), Gaps = 18/202 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 D+++ +D+S SM S T K + SI + + +I +N+ + G+V Sbjct: 73 PCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSILDLTKHAARTIIETLNDNDRLGVV 132 Query: 225 TFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS E + + + + +K + +++ ++ + + Sbjct: 133 AFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNL-----WHGLKLSLEALEEVTPI 187 Query: 284 DANYKKIIVFMTDGENLS---TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 N + + +TDG + L + K R +++ G L+A + Sbjct: 188 PQNVQ-ALYILTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHTFGFGYYIRSGLLQAIS 246 Query: 341 S--PNSFYLVENPHSMYDAFSH 360 ++ + + + F H Sbjct: 247 EVGGGTYSFIPDAGMIGTVFVH 268 >gi|290989696|ref|XP_002677473.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284091081|gb|EFC44729.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 2296 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 51/134 (38%), Gaps = 16/134 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A++D + + D S S+ ++ K+ K I ++ + + ++ +V F + Sbjct: 2012 AAKIDFVFLYDSSGSIG---GTNFNKMRDLGKRIIGRMQLAE------DAIKVSIVRFHS 2062 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + + ++ +T + G++ A + + + N A+ Sbjct: 2063 SGVRALSLTSNGATVNNTFTSMTYDAGNTATIAGMRMAVDAL-------RPANGGRADAD 2115 Query: 289 KIIVFMTDGENLST 302 K+I +TDG Sbjct: 2116 KVIYVLTDGLGNVP 2129 >gi|291439602|ref|ZP_06578992.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] gi|291342497|gb|EFE69453.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC 14672] Length = 455 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 55/175 (31%), Gaps = 32/175 (18%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T I + + + +++++D S SM D TK+ A + Sbjct: 44 ATGGGTIGSAVAAPHLYSPGRGPSAAVVLMVDCSGSM----DYPPTKMRNARDA------ 93 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNS 259 I + + V +V ++ +E + ++ ++ LS G T Sbjct: 94 TAAAIDALRDGVHFAVVGGTHVAKEVYPGAGRLAVADATTRDQAKQALRRLSA-GGGTAI 152 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYC 312 L+ A + +H + +TDG N + +L C Sbjct: 153 GTWLRLADRLLASADVAIRHG-----------ILLTDGRNEHESPEDLRAALDAC 196 >gi|239931228|ref|ZP_04688181.1| hypothetical protein SghaA1_23611 [Streptomyces ghanaensis ATCC 14672] Length = 448 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 55/175 (31%), Gaps = 32/175 (18%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T I + + + +++++D S SM D TK+ A + Sbjct: 37 ATGGGTIGSAVAAPHLYSPGRGPSAAVVLMVDCSGSM----DYPPTKMRNARDA------ 86 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNS 259 I + + V +V ++ +E + ++ ++ LS G T Sbjct: 87 TAAAIDALRDGVHFAVVGGTHVAKEVYPGAGRLAVADATTRDQAKQALRRLSA-GGGTAI 145 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYC 312 L+ A + +H + +TDG N + +L C Sbjct: 146 GTWLRLADRLLASADVAIRHG-----------ILLTDGRNEHESPEDLRAALDAC 189 >gi|198283923|ref|YP_002220244.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218667403|ref|YP_002426556.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] gi|198248444|gb|ACH84037.1| von Willebrand factor type A [Acidithiobacillus ferrooxidans ATCC 53993] gi|218519616|gb|ACK80202.1| von Willebrand factor type A domain protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 759 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 28/224 (12%), Positives = 72/224 (32%), Gaps = 42/224 (18%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS----ITKIDMAIKSINAMLEEVK 210 P + + + ++LD+S S+ + T + ++ ++++ + ++ Sbjct: 549 STPDPRINMSHRTAGRDIAVTLLLDLSESLNESVKTGGGDGQTVLQLSQEAVSLLAWSIE 608 Query: 211 LIPDVNNVVQSGLVTFSNKIE--------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 + D + F++ + F WG ++ ++ L G ST Sbjct: 609 QLGDP-----LAIAGFNSNTRHEVRYQHIKGFSEPWG-DVVKGRLAALQA-GYSTRMGAA 661 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 +++A + + + KK+++ +TDG L +A++ Sbjct: 662 MRHAGHYLATRKAD-----------KKLMLVLTDGRPSDVDVQDDRLLI-EDARQ----- 704 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSM---YDAFSHIGK 363 + + C S + Y HI + Sbjct: 705 ---AVNELDRDGIFTYCISLDPHADAYVADIFGRQYTVIDHIAR 745 >gi|152990153|ref|YP_001355875.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422014|dbj|BAF69518.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 549 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 33/160 (20%), Positives = 59/160 (36%), Gaps = 29/160 (18%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 I LD+S SM++ D ++ A K I ++ + + L+ F+++ Sbjct: 89 IALDISNSMKA-TDYYPDRLQFAKKKIEEFIK-------ASKNLNIALLAFADEAYIVSP 140 Query: 236 LEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 ++ + + S TN L A + + +G +K +V Sbjct: 141 PSSDKEALLYMLKHLDTESLALQGTNFLAALMSA-DMLLGKEG------------QKSVV 187 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 TDG N + + AKKRG V+ IGI + Sbjct: 188 LFTDGGNKEDFSKEIAF-----AKKRGIQVHIIGIGTQKG 222 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 56/169 (33%), Gaps = 23/169 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++ V+D S SM K+ +++ +L E+ ++ + FS Sbjct: 257 VVFVIDNSYSMYGN------KMAQTKEALGTILGEL----PEDDYFAI--IVFSTTFVVW 304 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + E V Q +K + G T + ++ Q T N Sbjct: 305 RPYLSKATEENVKEAQEYVKTIEVIG-GTELHDATIHGVEMLYAA----QRNGTAPKNMV 359 Query: 289 KIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 +++ +TDG+ N + + +A ++ + + FL Sbjct: 360 LMMILLTDGQPNQYPRSLPEIQESIRKAIDGNITLFGLAFGNDADYGFL 408 >gi|28898407|ref|NP_798012.1| putative RTX toxin [Vibrio parahaemolyticus RIMD 2210633] gi|28806624|dbj|BAC59896.1| putative RTX toxin [Vibrio parahaemolyticus RIMD 2210633] Length = 3240 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 79/258 (30%), Gaps = 29/258 (11%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 T +N L N + + N D + + V Sbjct: 2639 SGLTLEVNGKTVTENSDGSYTVDLSSEPNQTSHQLSGTITVPADYNGSLDFDVTATAGSV 2698 Query: 120 VVPQNEGYSISAISRYKIPLKFC------TFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 V N S +F I ++ IV + V Q + Sbjct: 2699 EVNNNTQMGADTASVSVRDYEFVSGTHGDNNIVGSDDNDVIVGDVQGLQIVEGQD---YN 2755 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM ++T++ +N ++E + V L TFS + ++ Sbjct: 2756 IAFMLDTSGSMGYDVGRAVTELKTV---LNTLIESASG--PHSGKVNVLLTTFSTESKQV 2810 Query: 234 FLL----EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + S ++ + + K G +TN G + A N ++ ++ N Sbjct: 2811 LELDLSSDNAKSQVESILDAIVKLGDGNTNYEAGFQSALNWF-------ENADSGATNLS 2863 Query: 289 KIIVFMTDGENLSTKEDQ 306 I +DG ++ Sbjct: 2864 YFI---SDGRPNQATDNN 2878 >gi|314927767|gb|EFS91598.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] Length = 322 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 33/249 (13%), Positives = 77/249 (30%), Gaps = 36/249 (14%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 +S ++K + T + + + ++ + A + ++ LD S SM++ Sbjct: 54 MSKQPQWKRHIAVATALL-CLATTTVAWARPMGIEQVPRDRATI--VVALDSSLSMKAD- 109 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 D+S ++ A + + +V +V+ S E + R + Sbjct: 110 DASPNRLAAAKAKAKDFVNSLPTGFNVA------VVSISAHPEIRMPPSTDRPTVLRALD 163 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 G+ L A ++ M + + IV ++DG N + Sbjct: 164 -----GIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPA--AIVMLSDGGNTQGGSPLVA 216 Query: 309 LYYCNEAKKRGAIVYAIGIRVIRSH--------------EFLRACA--SPNSFYLVENPH 352 + AK VY I + + L A + + ++ Sbjct: 217 ATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDVADRTDAKSWTADSAD 273 Query: 353 SMYDAFSHI 361 + + + + Sbjct: 274 KLQEVYKQV 282 >gi|197118197|ref|YP_002138624.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087557|gb|ACH38828.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 318 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 59/154 (38%), Gaps = 20/154 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD S+SM + D +++ A +++ + +K + GLV F+ Sbjct: 82 DILFLLDTSKSMLT-RDLGQSRLAAAKEAVRQAMAGLKGE-------RVGLVVFAGSAFL 133 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + + L + G T PG + + + + K++V Sbjct: 134 VCPL---TTDYALFDQVLKEAGEETLPLPG-----TSLAAALKEARRALQGEGDEPKVVV 185 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 ++DGE+ + + + G +YA+ Sbjct: 186 LLSDGEDH----EGEYVAAARALNAAGVKLYAVA 215 >gi|8778285|gb|AAF79294.1|AC068602_17 F14D16.26 [Arabidopsis thaliana] Length = 736 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 39/200 (19%), Positives = 74/200 (37%), Gaps = 41/200 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIEE 232 ++ V+D+S+SM ++ +I+ L ++ N + S FS +E Sbjct: 311 VVFVVDISKSMTG------KPLEDVKNAISTALSKLDPGDSFNIITFSNDTALFSTSMES 364 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 ++R I++++K TN P L+ A + + +G Sbjct: 365 VTSDA-----VERGIEWMNKNFVVADGTNMLPPLEKAVEMLSNTRGSIP----------- 408 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-------IVYAIGIRVIRSHEFLRACA-- 340 +I F+TDG + + C+ KK A ++ G+ V +H FL+ A Sbjct: 409 MIFFVTDG------SVEDERHICDVMKKHLASAGSVFPRIHTFGLGVFCNHYFLQMLANI 462 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S V N + + Sbjct: 463 SCGQHESVYNTDHIEERMDK 482 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 55/170 (32%), Gaps = 27/170 (15%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + + V D S SM ++ S+ + + + NN V G Sbjct: 364 KKTKDNGKDIIAVFVADCSGSM------DGDPMNQLKNSLTNGAQYI----NDNNYV--G 411 Query: 223 LVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV++S+ + + S+ Q + L G T S + A I D Sbjct: 412 LVSYSSSVTVEVPIAQFDLNQRSYFQGSVNNLMASG-GTASYDAVVVAMKMITD------ 464 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 E + K ++ ++DG + ++ VY IG Sbjct: 465 -AKAEHPDAKCMLFLLSDG---YANVGYSMDEITSALRQSNIPVYTIGYG 510 >gi|111018687|ref|YP_701659.1| hypothetical protein RHA1_ro01688 [Rhodococcus jostii RHA1] gi|110818217|gb|ABG93501.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 904 Score = 42.1 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 40/247 (16%), Positives = 82/247 (33%), Gaps = 63/247 (25%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTF 226 + + D+++++D S S++ + + N +LE++ V + + F Sbjct: 40 SQQQGDLLLMVDQSGSLQGSDPD-----AARVSAANYLLEQLNTFGGSAGVELNVAIAGF 94 Query: 227 SNKIEEFFL---LEWGV-SHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 S K L+ G LQ +++ G+ T+ L A + + G + Sbjct: 95 SEKFIVHAPWTRLDNGSLPALQGEVERFRTRTDGIDTDYWNALDGARRTLAERDGQSE-- 152 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQ----------------------------QSLYYC 312 AN + + + +DG+ T D + Sbjct: 153 ----ANRCQAVAWFSDGKLDFTVRDAEKPYAQGVSLGSDQGVQQVVAAARESICRPAGIA 208 Query: 313 NEAKKRGAIVYAIGIRV----IRSHEFLR--------ACA-----SPNSFYLVENPHSMY 355 ++ + G + +A+G+ + +R AC SP FYL +N + Sbjct: 209 DQLRSSGIVTFAVGLAAGTAQPSDFDLMRSIATGGDGACGKTTSPSPGDFYLAQNIDDLL 268 Query: 356 DAFSHIG 362 AF Sbjct: 269 FAFDAFS 275 >gi|297281590|ref|XP_001082306.2| PREDICTED: calcium-activated chloride channel regulator 4-like, partial [Macaca mulatta] Length = 166 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 66/168 (39%), Gaps = 32/168 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---I 230 + +VLD S SM S+ +++ K+ L ++ V N G+V F++ + Sbjct: 20 VCLVLDKSESMASY-----DRLNQMNKAAKYFLLQI-----VENGSWVGMVHFNSTATIV 69 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + L + T+ G+K A+ I + H + + Sbjct: 70 NKPIQII-SSDERNTLLAKLPTYAWGGTSICSGIKSAFQVI---GELSSHLDGSE----- 120 Query: 290 IIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEFL 336 +V +TDGE+ + C +E K+ GAIV+ I + + Sbjct: 121 -VVLLTDGEDYTASS-------CIDEVKRSGAIVHFIALGTAADKAVI 160 >gi|163816348|ref|ZP_02207714.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] gi|158448345|gb|EDP25340.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] Length = 612 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 33/298 (11%), Positives = 85/298 (28%), Gaps = 41/298 (13%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN---------MSFRNELRDN 101 D V T ++ + + + ++ +F + Sbjct: 91 EEACDVETVAGDTAMVTDTSNSMYSEVAYDTREYDSMTENGFVSTVDRPLSTFAADRDTA 150 Query: 102 GFVN-----DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR--HI 154 + N + + ++ I + Y T ++ + Sbjct: 151 SYSNVRSYIESGSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKL 210 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +M ++ +++ +++ ++D S SM K+ + +S + E + Sbjct: 211 MMVGINTDEIDFGDKKPSNLVFLIDTSGSMY-----DDNKLPLVQQSFAMLAENLDEND- 264 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + +VT++ + G + + ++ G TN + AY Sbjct: 265 -----RVSIVTYAGEDTVVLSGTPGSEQYTISEALSNMTAEGC-TNGGDAIITAYEL--- 315 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLS--TKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + N + ++ TDG+ T E E K+ + +G Sbjct: 316 --AEKNFINGGNNR----VILATDGDLNVGLTSESDLVDLITEEKKENNIFLSVLGFG 367 >gi|78357960|ref|YP_389409.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220365|gb|ABB39714.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 547 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 64/169 (37%), Gaps = 34/169 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM +I +A ++ A+ + + + + G+ F ++ Sbjct: 378 VHILMDCSGSMR-------KRIHLASQATYAVAKALD-----HTGINIGVTAFPATYQK- 424 Query: 234 FLLEWGV------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + W + ++ G ST P L + Q+ +Q R Sbjct: 425 -QIGWASVRPLIRHGERMHTEFSLASGGSTPLGPALWWVMQQMLPLQESR---------- 473 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 KI++ +TDG+ + ++++ + + +Y IGI E + Sbjct: 474 -KIVLIITDGDPDCSSAAKEAIAAAQKTE---IELYGIGIMSSSITELI 518 >gi|168208905|ref|ZP_02634530.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712845|gb|EDT25027.1| von Willebrand factor type A domain protein [Clostridium perfringens B str. ATCC 3626] Length = 620 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/175 (17%), Positives = 67/175 (38%), Gaps = 28/175 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQEPDKDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKASMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-----GVSHLQRKIK---YL 250 + + + IP+ + +V F ++EE F + IK Y Sbjct: 114 VDKV------IDNIPNS----RIAVVAFGTEVEEVFSFSNKNNFTSKEEYKNAIKDSYYY 163 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 +TN + A + ++ ++N KK ++F +DG ++++ Sbjct: 164 ITGRGNTNIEGTWRVA-------DEIFKNELNNNSNSKKDVIFFSDGYPNVSRDE 211 >gi|185132132|ref|NP_001118067.1| complement factor B [Oncorhynchus mykiss] gi|3982897|gb|AAC83699.1| complement factor Bf-1 [Oncorhynchus mykiss gairdneri] Length = 743 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 76/231 (32%), Gaps = 34/231 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ +L++ I +D+S S+ + A ++ ++ +V N Sbjct: 222 KITIDKGGKLNIYIAMDISDSIAE------EDFNSARNAVKKLITKVSSFSVSPNYE--- 272 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKI----------KYLSKFGVSTNSTPGLKYAYNQIFD 272 ++ F++ + E + ++ + KY ++ V TN K ++ Sbjct: 273 IIFFASDVLEVVNIIDFSGDKRKPLVDVLAELNNFKYDARDNVGTNLNLAFKTILERMAI 332 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDG--------ENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + + +++F TDG EN K + +++ VY Sbjct: 333 QKKRNEMLF---MEIHHVLIFFTDGAYNMGGSPENTMAKIRESVYMNNKTKREKYLDVYV 389 Query: 325 IGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 G+ E + + ++ ++N + F I + + Sbjct: 390 FGVGSDIFDEDIMPLVTKRNGERHYFKLKNVIDLERTFDDIIDESEVVGVC 440 >gi|301764623|ref|XP_002917727.1| PREDICTED: integrator complex subunit 6-like isoform 2 [Ailuropoda melanoleuca] Length = 850 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|300688886|gb|ADK31557.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 509 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 64/170 (37%), Gaps = 26/170 (15%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +D+++VLD + SM + S +I K + +++ + G + + + Sbjct: 353 SAVDLVLVLDTTESMHPYLTSIKEEIKSISKQV--FKKDINS--------RIGFLLYRDV 402 Query: 230 IEEFFL----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + + ++ + R + Y G + P + A + + + Sbjct: 403 KDTYLTKKIDFDNNINKIYRDVNYFYASGGGDKAEP-MYEAIQKALEDFDYKNDN----- 456 Query: 286 NYKKIIVFMTDGENLST-KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 K+++ +TD K + + AK++ + I ++ I+ + Sbjct: 457 ---KVVIVITDAPAKVIGKANADTNK--KTAKEKNIKIEYILVKEIKRNS 501 >gi|297694110|ref|XP_002824337.1| PREDICTED: integrator complex subunit 6-like isoform 2 [Pongo abelii] Length = 887 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|297274484|ref|XP_001105975.2| PREDICTED: integrator complex subunit 6-like isoform 4 [Macaca mulatta] Length = 883 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|301764621|ref|XP_002917726.1| PREDICTED: integrator complex subunit 6-like isoform 1 [Ailuropoda melanoleuca] gi|281346272|gb|EFB21856.1| hypothetical protein PANDA_006082 [Ailuropoda melanoleuca] Length = 887 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|194384430|dbj|BAG59375.1| unnamed protein product [Homo sapiens] Length = 289 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|194384068|dbj|BAG64807.1| unnamed protein product [Homo sapiens] Length = 170 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|148704134|gb|EDL36081.1| integrator complex subunit 6 [Mus musculus] Length = 884 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|126327494|ref|XP_001368527.1| PREDICTED: similar to candidate tumor suppressor protein DICE1 isoform 1 [Monodelphis domestica] Length = 889 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|126327496|ref|XP_001368564.1| PREDICTED: similar to candidate tumor suppressor protein DICE1 isoform 2 [Monodelphis domestica] Length = 852 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|33872145|gb|AAH13358.1| INTS6 protein [Homo sapiens] Length = 827 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|35193175|gb|AAH58637.1| Ints6 protein [Mus musculus] Length = 874 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|51467749|ref|NP_032741.2| integrator complex subunit 6 [Mus musculus] gi|81885556|sp|Q6PCM2|INT6_MOUSE RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=DBI-1; AltName: Full=Protein DDX26 gi|37589266|gb|AAH59263.1| Integrator complex subunit 6 [Mus musculus] Length = 883 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|78358474|ref|YP_389923.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220879|gb|ABB40228.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 547 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 64/169 (37%), Gaps = 34/169 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I++D S SM +I +A ++ A+ + + + + G+ F ++ Sbjct: 378 VHILMDCSGSMR-------KRIHLASQATYAVAKALD-----HTGINIGVTAFPATYQK- 424 Query: 234 FLLEWGV------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + W + ++ G ST P L + Q+ +Q R Sbjct: 425 -QIGWASVRPLIRHGERMHTEFSLASGGSTPLGPALWWVMQQMLPLQESR---------- 473 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 KI++ +TDG+ + ++++ + + +Y IGI E + Sbjct: 474 -KIVLIITDGDPDCSSAAKEAIAAAQKTE---IELYGIGIMSSSITELI 518 >gi|11024694|ref|NP_036273.1| integrator complex subunit 6 isoform a [Homo sapiens] gi|74753376|sp|Q9UL03|INT6_HUMAN RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=DBI-1; AltName: Full=Protein DDX26; AltName: Full=Protein deleted in cancer 1; Short=DICE1 gi|6062874|gb|AAF03046.1| candidate tumor suppressor protein DICE1 [Homo sapiens] gi|24980821|gb|AAH39829.1| Integrator complex subunit 6 [Homo sapiens] gi|55957338|emb|CAI12905.1| integrator complex subunit 6 [Homo sapiens] gi|55958605|emb|CAI15586.1| integrator complex subunit 6 [Homo sapiens] gi|78100165|tpg|DAA05730.1| TPA_exp: integrator complex subunit 6 [Homo sapiens] gi|119629281|gb|EAX08876.1| integrator complex subunit 6, isoform CRA_d [Homo sapiens] gi|119629282|gb|EAX08877.1| integrator complex subunit 6, isoform CRA_d [Homo sapiens] Length = 887 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 121 EPA-IIITITDGSKLTTTSGVQ 141 >gi|86132310|ref|ZP_01050905.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817229|gb|EAQ38412.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 351 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 60/177 (33%), Gaps = 6/177 (3%) Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKF 253 ++ + + V + + F + E L + LQ I ++ Sbjct: 130 QELKQASTSFITNVMPAVP-TESFKMAIYWFDGEDVLHELQPLTTSATQLQEAIDGVTDD 188 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK-EDQQSLYYC 312 + ST + + + + ED +V TDG + + + +Q++L Sbjct: 189 ISNDPSTDLYGAVIKAATNAENIVETLENEDLFAAASVVIFTDGTDQAARYSEQEALDAV 248 Query: 313 NEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + A + + IG+ E L A S + EN + + F+ I + + Sbjct: 249 SNAGEE-ISFFTIGLGSEIDEEVLTAIGKTGSAF-AENSNELEAVFNDISNGVAGQA 303 >gi|297182077|gb|ADI18250.1| hypothetical protein [uncultured Chromatiales bacterium HF0200_41F04] Length = 365 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 46/140 (32%), Gaps = 23/140 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D + SM ID I ++ V+ + + G V F + + Sbjct: 150 LDVVFVVDATGSMGWA-------IDEIKDRIYDIVSTVRTLVPAA---RFGFVAFRDHND 199 Query: 232 EFF-----LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F L + + L R + L G + ++ + Sbjct: 200 PEFLVRSEPLTFSTAKLHRFLDPLQAAGGGD--------IWEEVNAGIAAGIDDSGWRVG 251 Query: 287 YKKIIVFMTDGENLSTKEDQ 306 ++II+ + D D+ Sbjct: 252 ARRIIILVGDAPPREESFDE 271 >gi|198473170|ref|XP_001356194.2| GA11645 [Drosophila pseudoobscura pseudoobscura] gi|198139334|gb|EAL33254.2| GA11645 [Drosophila pseudoobscura pseudoobscura] Length = 1213 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 79/214 (36%), Gaps = 23/214 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQSGLVTFS 227 D++I+LD S SM F + +A +I ++L+ I + + V + F+ Sbjct: 277 DIVILLDHSGSMTGF------RHHVAKFTIRSILDTFSNNDFFTIFNYSAGVDDIIPCFN 330 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + I L N T + A+ + R HCN Sbjct: 331 GALVQATPE--NIEVFNEAIANLPDPEGYANLTLAYEKAFQLLRTYYEKR-HCNATST-C 386 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCN----EAKKRGAIVYA--IGIRVIRSHEF-LRACA 340 + I+ +TDG +T + Q + N +K ++ +G V + E AC Sbjct: 387 NQAIMLVTDGVAGNTTDVFQKYNWGNGENGTSKMN-VRIFTYLLGKEVTKVREIQWMACL 445 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + V++ +++ I T + ++ Sbjct: 446 NRGYYSHVQSLDEVHEEVLKYVDVIATPLVLQNE 479 >gi|195164890|ref|XP_002023279.1| GL21046 [Drosophila persimilis] gi|194105364|gb|EDW27407.1| GL21046 [Drosophila persimilis] Length = 1221 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 79/214 (36%), Gaps = 23/214 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQSGLVTFS 227 D++I+LD S SM F + +A +I ++L+ I + + V + F+ Sbjct: 285 DIVILLDHSGSMTGF------RHHVAKFTIRSILDTFSNNDFFTIFNYSAGVDDIIPCFN 338 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + I L N T + A+ + R HCN Sbjct: 339 GALVQATPE--NIEVFNEAIANLPDPEGYANLTLAYEKAFQLLRTYYEKR-HCNATST-C 394 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCN----EAKKRGAIVYA--IGIRVIRSHEF-LRACA 340 + I+ +TDG +T + Q + N +K ++ +G V + E AC Sbjct: 395 NQAIMLVTDGVAGNTTDVFQKYNWGNGENGTSKMN-VRIFTYLLGKEVTKVREIQWMACL 453 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + + V++ +++ I T + ++ Sbjct: 454 NRGYYSHVQSLDEVHEEVLKYVDVIATPLVLQNE 487 >gi|156404065|ref|XP_001640228.1| predicted protein [Nematostella vectensis] gi|156227361|gb|EDO48165.1| predicted protein [Nematostella vectensis] Length = 230 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 58/168 (34%), Gaps = 12/168 (7%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + ++ + + D ++D+ IVLD S SM +A ++ I Sbjct: 22 VKAAKTEHVKCDKKVDLAIVLDASASMGE------DSYKLAKTLTKEIISRF-TISPDKT 74 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 V + ++ I + +S L + + + L+ + ++ +G Sbjct: 75 RVSLNFFSANHVIMSKLSDNFSISKLFSLTDRMMYEKSFSILSTSLETVHFEVLAKKGGA 134 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + K +V +TDG T ++S+ + ++ + Sbjct: 135 RPKQKG---VKMAVVLVTDG--FGTSGYEESIAQAKSLQNYHVEMFTV 177 >gi|256420218|ref|YP_003120871.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035126|gb|ACU58670.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 352 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 62/168 (36%), Gaps = 16/168 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++DVS SM +I + + A+++E+K P ++ F+ + Sbjct: 3 RLPIYFLIDVSESM------VGEQIQFVEEGLAAIIKELKSDPYALETAWVSIIVFAGQA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L+ + G T+ + GL + +++M+ H K Sbjct: 57 KTIVPLQ----EVISFYPPKFPIGAGTSLSNGLGH---LMYEMRKNTIHTTATQKGDWKP 109 Query: 291 IVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 IVF+ TDG + + + ++ AI + L+ Sbjct: 110 IVFLFTDGTPTDDTS-AAVREWKQNWQNKSNLI-AISFGDENNLSALK 155 >gi|297622079|ref|YP_003710216.1| hypothetical protein wcw_1872 [Waddlia chondrophila WSU 86-1044] gi|297377380|gb|ADI39210.1| conserved hypothetical protein [Waddlia chondrophila WSU 86-1044] Length = 293 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 44/102 (43%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L + +V+DVS S S + + A + V + N + GLV F Sbjct: 74 GEERELTVFLVVDVSAS------SRFGGKHQLKQDLIAEVGAVLAFSAIKNNDKIGLVLF 127 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKY 265 S++IE++ + G H+ R I+ L F T+ L++ Sbjct: 128 SSEIEKYIPPKKGTRHVLRCIRELLAFKPKHKGTDLEKALEF 169 >gi|164687484|ref|ZP_02211512.1| hypothetical protein CLOBAR_01125 [Clostridium bartlettii DSM 16795] gi|164603258|gb|EDQ96723.1| hypothetical protein CLOBAR_01125 [Clostridium bartlettii DSM 16795] Length = 261 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 69/202 (34%), Gaps = 14/202 (6%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ T + + V+D S SM KID + + ++ ++ + N ++ Sbjct: 15 LEAKPITKKSMVIFFVIDTSESM------KGKKIDDLNRVMREIIPQLAGVGGCNTELKY 68 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +++FS+ + V K LS G++ L A+ ++ M ++ Sbjct: 69 AVLSFSSGCKWITSEPMIVEDNNSW-KDLSANGITD-----LGMAFEELTSMLSRKKFLK 122 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACA 340 + +Y +I MTDG + N K G + A+GI L Sbjct: 123 SPSLSYAPVIFLMTDGYPTDNYKKGLEKLKNNNWYKYGMKI-ALGIGDYFDESVLEEFTE 181 Query: 341 SPNSFYLVENPHSMYDAFSHIG 362 + N + I Sbjct: 182 NKELVVKAANSSQLAKLIKTIA 203 >gi|21672903|ref|NP_660968.1| hypothetical protein CT0062 [Chlorobium tepidum TLS] gi|21645957|gb|AAM71310.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 307 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 61/146 (41%), Gaps = 16/146 (10%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 + G S + Y+ T I W T++ + + K+ ++ R+ +M+VLD S Sbjct: 51 KGRGIEFSQVREYQYGDDVRT-IDWNTSAHKNDLYV----KIFTEERERI-LMLVLDGSG 104 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM ++ + +I A N++V GL+ FS+ +E + G +H Sbjct: 105 SMLFGSGRLKKELAAEVSAILAF-----SAVQNNDMV--GLLVFSDTVETYIPPRKGRAH 157 Query: 243 ---LQRKIKYLSKFGVSTNSTPGLKY 265 + +I + + G T+ L + Sbjct: 158 ALVILNEIFSMRQCGRKTDIDAALSF 183 >gi|260890922|ref|ZP_05902185.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] gi|260859475|gb|EEX73975.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] Length = 516 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 53/315 (16%), Positives = 115/315 (36%), Gaps = 32/315 (10%) Query: 58 LVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLD 117 +V+ I++ G + + + + + W + +NEL+ ++ + Sbjct: 61 IVYTYEGIVDAAPGIYQFPDTYEENIVKKSDIWRENVQNELKKIKPALG----EDASDEE 116 Query: 118 IVVVPQNEGYSISAISRYKI-PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 I + + Y I+ Y P++ + +V P T K + + +++ I Sbjct: 117 IERLFKKFLY----IAGYDYEPIETLDRFSYVIFKDDMVNPFT---KQKIEENMNVNLEI 169 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 +LD S SM+ T +++A +SI ++ E+ N ++GL F +K + Sbjct: 170 ILDASGSMKQKI-GDKTMMEIAKESIEKVVSEM----PAN--TKAGLRVFGHKGDNTASK 222 Query: 237 ---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM--QGMRQHCNTEDANYKKII 291 + L I+ L+K + ++ P + I G + I+ Sbjct: 223 KQESCSANELISPIETLNKDKLKSSLAPIQPTGWTSIAKSIENGTNDLKALKGEKTLNIL 282 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKR--GAIVYAIGIRV-IRSHEFLR--ACASPNSFY 346 +TDG D + + K ++ IG V ++ L+ A A+ + Sbjct: 283 YIITDG---IETCDGNPVETAKKFKNENTDIVLGIIGFNVDAHQNKVLKEIANAANGYYS 339 Query: 347 LVENPHSMYDAFSHI 361 V + + + I Sbjct: 340 SVNDAAKLTEELQRI 354 >gi|262201266|ref|YP_003272474.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262084613|gb|ACY20581.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 594 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 35/216 (16%), Positives = 71/216 (32%), Gaps = 42/216 (19%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE-------------VKLIPDV 215 + ++V D S SME+ + T+ + I + LE + Sbjct: 382 GVPIRTLVVEDTSGSMETPV-GNTTRAGLLIDASMTGLEMFPRNAMVGGWAFGIDKGGPD 440 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIK---YLSKFGVSTNSTPGLKYAYNQI 270 + + + +++ G L R ++ ++ G T A+ ++ Sbjct: 441 QDWTEMAPI---RRLDAPSG-SGGTHREALARAVREGLAPARLGGGTGLYDTTLAAFKKV 496 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE------DQQSLYYCNEAKKRGAIVYA 324 +T D NY ++ MTDG+N + + A+ ++ Sbjct: 497 ---------QSTYDPNYSNSVIIMTDGQNEDPGSITLTELLAELKELEDPARP--VLILT 545 Query: 325 IGIRVIRSHEFLR--ACASPNSFYLVENPHSMYDAF 358 IGI LR A A+ + Y+ + + F Sbjct: 546 IGISEDADTNALRQIAQATGGTTYVAKTAADIKQVF 581 >gi|168698747|ref|ZP_02731024.1| hypothetical protein GobsU_04449 [Gemmata obscuriglobus UQM 2246] Length = 627 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 12/79 (15%), Positives = 28/79 (35%), Gaps = 6/79 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF--DSSITKIDMAIKSINAMLEEV----KLIP 213 S ++ ++D S SM F +K ++N +L+ V Sbjct: 346 GSYDAQISRANPACLLFLIDQSGSMGDPFIGAGGTSKASTVADAVNRLLQNVVLRSAKAD 405 Query: 214 DVNNVVQSGLVTFSNKIEE 232 V + + G++ + ++ Sbjct: 406 GVRDYFRVGVIGYGTTVKA 424 >gi|29828406|ref|NP_823040.1| hypothetical protein SAV_1864 [Streptomyces avermitilis MA-4680] gi|29605509|dbj|BAC69575.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 462 Score = 42.1 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 53/153 (34%), Gaps = 25/153 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ + V+D S SM TK+D ++ + E++ G++TF + + Sbjct: 43 PVNFVFVVDTSGSMTG------TKLDTVKSALQTIYRELRPADC------LGIITFDHNV 90 Query: 231 EEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L+ G T+ G++Y + + +H + Sbjct: 91 RTVLPAVAKQDLPPERFAEVVSALTTQG-GTDIDLGVQYG------IDEISRHSVSGRTV 143 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 +DG+ S + D + AK RG Sbjct: 144 NCLY--LFSDGDPTSGERDWIKVRANVAAKLRG 174 >gi|328881544|emb|CCA54783.1| ChlI component of cobalt chelatase involved in B12 biosynthesis or ChlD component of cobalt chelatase involved in B12 biosynthesis [Streptomyces venezuelae ATCC 10712] Length = 689 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 56/138 (40%), Gaps = 19/138 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D S SM ++ ++ ++ ++L + + GL+TF + E Sbjct: 499 VLFVVDASGSM-----AARQRMSAVKGAVLSLL-----LDAYQRRDKVGLITFRGRDAEV 548 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + S LK + + ++ +R DA+ + ++V Sbjct: 549 ALPPTSSVDAAAARLEKLPTGGRTPLSAGLLKA--HDVLRVERLR------DASRRPLLV 600 Query: 293 FMTDGENLSTKEDQQSLY 310 +TDG D +L Sbjct: 601 VVTDGRATGGGADPVALA 618 >gi|315644678|ref|ZP_07897809.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315279937|gb|EFU43236.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 562 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 51/165 (30%), Gaps = 31/165 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + V DVS SM + +S+ + + G V++S + Sbjct: 387 PVAAVFVADVSGSM------DGEPLHRLKESLLTGQKYLGRDNS------IGFVSYSTDV 434 Query: 231 EEFFLLEWGVSHLQRK------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G L ++ I L G +T + G+ A + D + Sbjct: 435 AINLPI--GKYDLNQQSMFVGAINSLEASG-NTATFDGIVVAMKMLHDE-------MAVN 484 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + K +I ++DGE K VY IG Sbjct: 485 PDVKPLIFVLSDGETNVGHSLDDIREL---IKAFKIPVYTIGYNA 526 >gi|308509664|ref|XP_003117015.1| CRE-CLEC-60 protein [Caenorhabditis remanei] gi|308241929|gb|EFO85881.1| CRE-CLEC-60 protein [Caenorhabditis remanei] Length = 399 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 56/169 (33%), Gaps = 11/169 (6%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++ V+D S+ M + + I I S+ + + P+ + GLVT++ Sbjct: 39 LEVVAVVDNSKGMT---NGGLISIAANIASVFSNNTRIGTNPNEPRTTRLGLVTYNAVAN 95 Query: 232 EFFLLEWGVS--HLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ S + I L++ S S A +I G ++Y+ Sbjct: 96 TVADLDQFQSLDDVYDGIFGGLAQVSSSDESYLAHGLAQAEIILEAG---QTAVNRSHYE 152 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 ++++ + + K G + + L Sbjct: 153 RVVIVY--ASTYKGSGSLDPIPVADRLKTAGVTIITVAYDQDGDGALLH 199 >gi|290988612|ref|XP_002676992.1| predicted protein [Naegleria gruberi] gi|284090597|gb|EFC44248.1| predicted protein [Naegleria gruberi] Length = 2366 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 68/194 (35%), Gaps = 18/194 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 T L+ ++ + ++ ++F I + +T + + Sbjct: 1758 GTPTILNPNDPANSQVIRVKPVANSCDSIQFEYTIVSDQDGTSSSATVTVQYTSCTCSL- 1816 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ VLD S S+ D + K+ + ++ N G++ F++ I Sbjct: 1817 DMDIIFVLDGSGSIS---DENWIKMKSFMGNVTRGFGSNISPTGTN----IGIIQFASSI 1869 Query: 231 EEFFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + G S + KI S+ T S G+ A +Q+ Sbjct: 1870 TTHLQVNSGTSVDVVLSKISSASQLKSGTGSIAGINAAIDQMVSYGRKSI--------VN 1921 Query: 289 KIIVFMTDGENLST 302 K+I+ +TDGE+ Sbjct: 1922 KMIIHVTDGESTEP 1935 >gi|195953700|ref|YP_002121990.1| von Willebrand factor type A [Hydrogenobaculum sp. Y04AAS1] gi|195933312|gb|ACG58012.1| von Willebrand factor type A [Hydrogenobaculum sp. Y04AAS1] Length = 600 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 87/205 (42%), Gaps = 27/205 (13%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + +LD+S SM K D I +I +++ +++ D+ L Sbjct: 417 AGNEIPVFFSFLLDISSSMR--------KEDKLINAIKSLILISEVLSDLKLDFSISL-- 466 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT-ED 284 F+ EE+F+L+ +H ++ I + ++ GL ++ R+H + + Sbjct: 467 FN---EEYFILKDYKTHYKQVIADILNISPQKSTNLGL--------AIEEERRHIDDFQR 515 Query: 285 ANYKK-IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 +K+ + V +DGE + ++ Y N K + + I V ++ E ++ Sbjct: 516 KTHKRGVFVLFSDGEPTKGMKKEELSSYVNLIK---LEIPIVAISVGKAGESIKEIF-GK 571 Query: 344 SFYLVENPHSMYDAFSHIGKDIVTK 368 + +V + +S+ AF I + + K Sbjct: 572 NTLIVNSINSLPMAFGRIVQQQLNK 596 >gi|322374631|ref|ZP_08049145.1| putative von Willebrand factor type A domain protein [Streptococcus sp. C300] gi|321280131|gb|EFX57170.1| putative von Willebrand factor type A domain protein [Streptococcus sp. C300] Length = 459 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 40/287 (13%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 ++ A +I + N+ K G+++ K + + + +++ Sbjct: 55 EANKSAVEAAVAEIKADEAKPVVNQSDAKPGELVDFSKKVSPIDVKERQEGDQKIHEETA 114 Query: 110 IV--RSTSLDIVVVPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V T + V + + S +S T + + + + Sbjct: 115 TVEVTKTEVAPVKSEELPAPKTETTESTVSGKDEDGNSYTQYERVDKTTTVEIKKATRTI 174 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---LIPDVNNVVQ 220 S + D++ V+D S SM D+ I+ +++ + + Sbjct: 175 SKSNSA---DIVFVVDRSASMSRTIDTVRNNINEFARNLAKDGVAARFGLATFSDEVYGR 231 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVS--TNSTPGLKYAYNQIFDMQ 274 G + I F + S K L+ G STP L I Sbjct: 232 IGGLADEGTILTKFNESYFTSDPAELEKALAGIKLAHGGDFPETSTPALT---QIISTYD 288 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + N KK +V +TD K Sbjct: 289 WSKSPKN------KKFVVLLTDAPMKEDPSIPTMANTLASLKAAKIE 329 >gi|218529650|ref|YP_002420466.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] gi|218521953|gb|ACK82538.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] Length = 738 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 71/215 (33%), Gaps = 34/215 (15%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 + ++ +T + + D++ V+D S SM + A S+ L+ + Sbjct: 327 GAEAVLAVVTPPESASPASPVPRDVVFVIDNSGSMGG------ASMRQAKASLLIGLDRL 380 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 + ++ F + + F + + + L G + P L+ Sbjct: 381 GAGD------RFNVIRFDHSFDTLFPDLVPADAGHLMRAKSFVAGLQASGGTEMLAP-LQ 433 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 A E+ + +VF+TDG + + S + R ++ Sbjct: 434 AALRDAT----------PEETGRLRQVVFLTDGA-IGNEAQIFSAIATERGRSR---LFM 479 Query: 325 IGIRVIRSHEFLRACAS--PNSFYLVENPHSMYDA 357 +GI + +R A SF ++ P + + Sbjct: 480 VGIGSAPNGYLMRHAAEVGQGSFTQIDTPDQVTER 514 >gi|123428998|ref|XP_001307617.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121889256|gb|EAX94687.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 642 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 88/258 (34%), Gaps = 36/258 (13%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 VN +++ S + V +I ++ K K + + Sbjct: 151 SVNGTTNVIDSHNATFVTNDTPTKDAIIIETQIKDKDKNIAISSDGYIAITTYPSFEGPI 210 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 NS+ ++D S SM ++I AI+ + ++ + + + +++ G Sbjct: 211 TSNSE------FYFIVDCSGSMTG------SRILKAIECMRMFIQSLPVGCRFS-IIKFG 257 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F ++ + +++ + + ++ T+ LKY Sbjct: 258 -SNFHTILKSCDYTDKNLANAMQLLYTINSDMGGTDIYSPLKYV------------SDIK 304 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFLRACA 340 + K I +TDGE Q S C A K + +++IG+ ++ A Sbjct: 305 PKKGFVKQIFLLTDGE------VQDSDEICAMAYKNRSNNRIFSIGLGSGADPGLIKGIA 358 Query: 341 --SPNSFYLVENPHSMYD 356 S ++ ++ + ++ Sbjct: 359 RKSGGNYTIIGDNDNLNQ 376 >gi|121594238|ref|YP_986134.1| hypothetical protein Ajs_1876 [Acidovorax sp. JS42] gi|120606318|gb|ABM42058.1| hypothetical protein Ajs_1876 [Acidovorax sp. JS42] Length = 280 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 69/192 (35%), Gaps = 30/192 (15%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 ++DVS S E + T + ++ + + ++ S ++ + Sbjct: 53 IIDVSGSFEHEHEEGTTSL--LLERLVPYCVVLDPDRKMDVFTFSSGEPSAHYVGSVTPE 110 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN---------------------QIFDMQG 275 + + ++ + + T + ++ A ++F+ Sbjct: 111 DAQGYVTRNIVERVPGWNGGTTYSFVIERALEHFGWKECEEAHQTPQGKGFLWRLFNSPT 170 Query: 276 MRQHCNTEDANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 T + KK +++F+TDGEN ++ ++ E+++RG VY + I Sbjct: 171 HEHGHGTPHTHEKKRSLVLFITDGENE-KNDEDRTTRVLEESQRRGDQVYFLFIGACEDR 229 Query: 334 ----EFLRACAS 341 EF++ AS Sbjct: 230 NVKFEFVKTIAS 241 >gi|326484295|gb|EGE08305.1| hypothetical protein TEQG_07279 [Trichophyton equinum CBS 127.97] Length = 378 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/215 (11%), Positives = 56/215 (26%), Gaps = 16/215 (7%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 F++ + + Y P + + + +S Sbjct: 74 SFLSRNKQSPTAQKSTYSNNKSTGSSHLDLPPAYTPVSATSATAPSFPAASSEGSSVPAS 133 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 A+ D + ++D S SM + K+ L ++ I + Sbjct: 134 PDRPDSFLAQFDTVFLIDDSGSMR----------GASWKATENALAKIAPICTAQDKDGV 183 Query: 222 GLVTFSNKIEE---FFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQG 275 + +++ + + H++ +S G T LK + Sbjct: 184 EVYFLNHRSDSSNGAYSNVRDTGHVREVFTAVSPCGGTPTGTRLNHILKPYLKGFEEASK 243 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 ++A I+ +TDG E Sbjct: 244 SMSPDERDNAVRPLNIIVITDGVPTDDVESVIVRA 278 >gi|256071156|ref|XP_002571907.1| dihydropyridine-sensitive l-type calcium channel [Schistosoma mansoni] gi|238657056|emb|CAZ28137.1| dihydropyridine-sensitive l-type calcium channel, putative [Schistosoma mansoni] Length = 421 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 75/194 (38%), Gaps = 25/194 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL--------IPDVNNV 218 + D+ I+LD S SM + +A S ++E + + P + Sbjct: 216 GSSVPKDLFILLDTSGSMTG------QSLKLANLSAQKLIEALDVDDYFTVAHFPGAKDH 269 Query: 219 VQSGLVTFSNKIEE----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPG-LKYAYNQIFDM 273 V +VT +N+ E F+ + L+ + + P LK+AY ++ Sbjct: 270 VAPMIVTANNESEPICFNSFVQATRRNKLRLFYDLSTLKARGYSDFPASLKFAYEMFRNL 329 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI-VYAIGIRVIRS 332 + KI+V +TD + D+ L + K +Y++G V + Sbjct: 330 TE-SARGDRGKELRNKILVLLTD---NAFVFDESVLSQLKQQKSNITTFIYSLGEPVGAA 385 Query: 333 HE-FLRACASPNSF 345 +E ++ACA+ + + Sbjct: 386 YEHKMKACATNDYY 399 >gi|153011704|ref|YP_001372918.1| von Willebrand factor type A [Ochrobactrum anthropi ATCC 49188] gi|151563592|gb|ABS17089.1| von Willebrand factor type A [Ochrobactrum anthropi ATCC 49188] Length = 633 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 40/240 (16%), Positives = 92/240 (38%), Gaps = 41/240 (17%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 C F S H+ + ++ L + I++DVS S +++ D+ +D+ ++ Sbjct: 421 CDFAAGGQGSDHVHL-------MSRPKANDLAVTILVDVSLSTDAWIDNRRV-LDVEKEA 472 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNK--------IEEFFLLEWGVSHLQRKIKYLSKF 253 + + + D + + TF+++ + F +G + ++ +I L Sbjct: 473 LLVLANGIAACGD-----RCSIQTFTSRRRSWVRVETVKDFDETFGPA-VEHRIAALKP- 525 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK------EDQQ 307 G T +++A ++ E N KK+++ +TDG+ + Sbjct: 526 GFYTRMGAAIRHATAKL-----------AEQPNRKKLLLVLTDGKPNDVDHYEGRFALED 574 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 S EA+ G V+A+ + ++ +L ++ LV + A I + + Sbjct: 575 SRRAVAEARTTGVNVFAVTVDR-EANAYLPTLFGRRNYALVAKLSKLPVALPAIYRMMTG 633 >gi|149188854|ref|ZP_01867144.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] gi|148837274|gb|EDL54221.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] Length = 505 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 59/180 (32%), Gaps = 18/180 (10%) Query: 31 FLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTW 90 +E + + L + + + T NN + + IK+T Sbjct: 1 MGFTFWAVEGTRYVQETSRLRDASEAAALA-VTIEDQPTLANNLATKYIENYVRDIKSTA 59 Query: 91 NMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTN 150 + R +N ++ I + + +FIP + + Sbjct: 60 LTAQRFHQAENQNAGVLEYIQYTVNAKTTHDSWFAS----------------SFIPSFGD 103 Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT-KIDMAIKSINAMLEEV 209 + + + D +D++ V D S SM S + ++ KID +IN + +++ Sbjct: 104 QQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMNSRWGTNRNRKIDDLKTAINQISDKI 163 >gi|313203642|ref|YP_004042299.1| hypothetical protein Palpr_1167 [Paludibacter propionicigenes WB4] gi|312442958|gb|ADQ79314.1| protein of unknown function DUF58 [Paludibacter propionicigenes WB4] Length = 288 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 52/126 (41%), Gaps = 9/126 (7%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +++++DVS S E S K I + + N + G++ F Sbjct: 72 EEERELTVVLLIDVSGSREFGTVSQ------LKKDIFTEVAATLAFSTIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+KIE+F + G H+ + I+ L T+ L+Y N I + Sbjct: 126 SDKIEKFIPPKKGKKHVLQIIRELIDFEPESNKTDIAGALRYLTNAIKKSSTAFIISDFI 185 Query: 284 DANYKK 289 D N+++ Sbjct: 186 DTNFER 191 >gi|156383253|ref|XP_001632749.1| predicted protein [Nematostella vectensis] gi|156219809|gb|EDO40686.1| predicted protein [Nematostella vectensis] Length = 184 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 69/204 (33%), Gaps = 25/204 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ ++D S + + +I +K++ + ++ V + + +I Sbjct: 4 DIAFLIDGST-IWYDGQQNFNRILDFVKALAKSFVVSESRSHISVTVYTD----NPRIHF 58 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + +S ++ I ++ G S N+ L +FD T AN ++++ Sbjct: 59 TLTQHYDISTIETAINSITYTGGSANTGKALSSVKTSVFD--------ATGRANVPRVLI 110 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 + + G ++++GI L AS + H Sbjct: 111 LL-----AHAVSLDGVSSASQALRSAGVTIFSVGIGNYIKIFQLNTIAS-----DPDVDH 160 Query: 353 SMYDAFSHIG--KDIVTKRIWYDK 374 +FS + + V KRI K Sbjct: 161 VFTASFSELHSLEGSVRKRICLGK 184 >gi|296815358|ref|XP_002848016.1| U-box domain containing protein [Arthroderma otae CBS 113480] gi|238841041|gb|EEQ30703.1| U-box domain containing protein [Arthroderma otae CBS 113480] Length = 748 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 64/187 (34%), Gaps = 31/187 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFF 188 + I + L + N +V+ I +K D+++V+D+S SM S Sbjct: 27 TTIVEENMTLDPILSVHSIPNKNSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAA 86 Query: 189 ----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D+ ++ +D+ + +++ + + +VTF +I F LE+ Sbjct: 87 PIPTGESGGEDTGLSILDLTKHAAKTIIQTLNEND------RLAVVTFCTEIRVAFELEF 140 Query: 239 ----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + I L STN G+K + ++ + Sbjct: 141 MSEENKSKVLAAIDCLHGI-SSTNLWHGIKEGLKVLATNSTQGNVQA---------LLVL 190 Query: 295 TDGENLS 301 TDG Sbjct: 191 TDGAPNH 197 >gi|150024247|ref|YP_001295073.1| hypothetical protein FP0135 [Flavobacterium psychrophilum JIP02/86] gi|149770788|emb|CAL42253.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86] Length = 288 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 66/160 (41%), Gaps = 21/160 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++D+S S ESF + I + N + GL+ F Sbjct: 72 EEERELTMMLMVDISGS-ESFGTKNQ-----LKSEIVTEIAATMAFSATQNNDKIGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +++IE + + G H+ R I+ L +F +N T + A + Q Sbjct: 126 TDEIELYIPPKKGKPHVLRIIRELIEFKPKSNKTD-ISQALKFLSGTQK----------- 173 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAI 325 KK IVFM + ++ + +++L + G VY I Sbjct: 174 -KKAIVFMI-SDFITEDDYEKTLKIAGKKHDITGIRVYDI 211 >gi|123414681|ref|XP_001304538.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121885997|gb|EAX91608.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 709 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 85/231 (36%), Gaps = 43/231 (18%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD----M 174 +V N + ++ I + + T I + + S+ Q + +++ Sbjct: 185 IVDGHNATF-VTKIPPKEDAIIIETLIKDEDKNIAVSSDGYISISTYPQFEGKVEQKSEF 243 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEF 233 ++D S SM S ++I A +N ++ + + + ++ F + E Sbjct: 244 YFIIDCSGSM------SGSRIGNAKFCLNILIHSLPIG------CRFSIIQFGDSYKETV 291 Query: 234 FLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++ +++ I ++ T+ L+Y + + + +R+ Sbjct: 292 SICDYSNRNVRNAMSAIAGINADMGGTDILSPLEYVFKKKLEKGFIRK------------ 339 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEA---KKRGAIVYAIGIRVIRSHEFLRA 338 I +TDGE + S C++A ++ ++AIG+ ++ Sbjct: 340 IFLLTDGEVNN------SDEICSKAQLERENN-RIFAIGLGSGADPGLIKN 383 >gi|299139930|ref|ZP_07033101.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598077|gb|EFI54244.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 484 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 72/214 (33%), Gaps = 30/214 (14%) Query: 137 IPLKFCTFIPWYT-----NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 +PL + T + T+S + T M+++DVS SM + Sbjct: 66 VPLPPLRELAPRTEIEVDGQSTVPFYATASTAQSETTPLPQVTMLLIDVSGSMLGNDMAG 125 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKI 247 T+ D A ++ A LE + D + + FS + + + S Q ++ Sbjct: 126 ETRFDAARQAAAAFLEGFRDGQD-----RVAIAGFSGRNVQAGIDGARFVSSRSEAQAEL 180 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK---- 303 L+ N+ L A + R D++ + ++ +TDG N Sbjct: 181 DALAAPERRNNT--ALYSAV----SIAANRLAKEARDSHSEVRLLVLTDGANDVQPQAGD 234 Query: 304 ------EDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++ E +K G V IG+ Sbjct: 235 DANLLVGNEGLEQAAKEVEKDGVSVLPIGLGSEN 268 >gi|84687480|ref|ZP_01015357.1| putative RTX toxin [Maritimibacter alkaliphilus HTCC2654] gi|84664505|gb|EAQ10992.1| putative RTX toxin [Rhodobacterales bacterium HTCC2654] Length = 1512 Score = 42.1 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 69/200 (34%), Gaps = 13/200 (6%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQ 220 + V+ +++ LD S S+ S T + ++ L + + V Sbjct: 1058 IGVSEVQSNTINLSFALDASGSVGS------TGWNHMRAAVGNALSALSDQFAGSETTVN 1111 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + +++ + + ++ K+ + + G ++ + + +G Sbjct: 1112 VQFIRYASDVSYNQTFD----LVEDKVALIDAITTLGFTGGGTQWD-DALDAAKGFFDAS 1166 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ AN I+ F+TDG+ S + SL A + A GI + L Sbjct: 1167 ESDAANATNILYFVTDGQPTSNSPWETSLQNIENA-PYDVDIQAFGIGSGYNTTQLARVD 1225 Query: 341 SPNSFYLVENPHSMYDAFSH 360 S + V P ++ A Sbjct: 1226 SDDEPTAVSEPGALLAAIQQ 1245 >gi|311070192|ref|YP_003975115.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] gi|310870709|gb|ADP34184.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] Length = 227 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 69/217 (31%), Gaps = 21/217 (9%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + P ++ + ++ ++LD S SM D ++K ++A K I + + Sbjct: 16 SVASPAFAAEQKVPAAKPDTNVAVLLDASGSMAKRIDG-VSKFNLAKKEIFQFAKSLPTD 74 Query: 213 PDVNNVV--QSGLVTFSNKIEEFFLLE--WGVSHL--QRKIKYLSKFGVSTNSTPGLKYA 266 V V G S K++ + +G Q L+ G + + Sbjct: 75 SQVKMSVFGSEGNNKNSGKVQSCEAIRNVYGFQGFDEQSFRNSLNTIGPTGWTPIA---- 130 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + + D + ++ +TDGE + E K V IG Sbjct: 131 ----KALNEAKSSFDQLDKKGENVVYLLTDGE---ETCGGNPIKTAKELHKHNITVNVIG 183 Query: 327 IRVIRSHE-FLRACAS--PNSFYLVENPHSMYDAFSH 360 ++ L A A ++ + + F Sbjct: 184 FDFKEGYKGQLNAIAKVGGGEYFPASSQSDIKQIFKA 220 >gi|90578363|ref|ZP_01234174.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] gi|90441449|gb|EAS66629.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] Length = 259 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 68/189 (35%), Gaps = 20/189 (10%) Query: 176 IVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF- 233 ++ D S SM+ + + KI A ++I + + IP+ NV GL F NK Sbjct: 75 LIFDGSGSMDNTNCGNGQKKIVAAKQAIQTFIND---IPNSANV---GLYVFDNKDASLR 128 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IV 292 L ++ Y G +T L +Y+ + + ++ Y + +V Sbjct: 129 VPLGNNNRATLKQAIYDVTAGGATPLKSSLDSSYSAL-------ERQASKQLGYGEYNVV 181 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG+ +++ + + ++ IG + H + NP Sbjct: 182 IVTDGD--ASQGENPQPAINRIYRDSPVTIHTIGFCIGEQHAL--NAKGITYYQSANNPE 237 Query: 353 SMYDAFSHI 361 + ++ Sbjct: 238 KLLAGLKNV 246 >gi|308126310|ref|ZP_05909176.2| Ig domain protein [Vibrio parahaemolyticus AQ4037] gi|308108016|gb|EFO45556.1| Ig domain protein [Vibrio parahaemolyticus AQ4037] Length = 3199 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 79/258 (30%), Gaps = 29/258 (11%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 T +N L N + + N D + + V Sbjct: 2598 SGLTLEVNGKTVTENSDGSYTVDLSSEPNQTSHQLSGTITVPADYNGSLDFDVTATAGSV 2657 Query: 120 VVPQNEGYSISAISRYKIPLKFC------TFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 V N S +F I ++ IV + V Q + Sbjct: 2658 EVNNNTQMGADTASVSVRDYEFVSGTHGDNNIVGSDDNDVIVGDVQGLQIVEGQD---YN 2714 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM ++T++ +N ++E + V L TFS + ++ Sbjct: 2715 IAFMLDTSGSMGYDVGRAVTELKTV---LNTLIESASG--PHSGKVNVLLTTFSTESKQV 2769 Query: 234 FLL----EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + S ++ + + K G +TN G + A N ++ ++ N Sbjct: 2770 LELDLSSDNAKSQVESILDAIVKLGDGNTNYEAGFQSALNWF-------ENADSGATNLS 2822 Query: 289 KIIVFMTDGENLSTKEDQ 306 I +DG ++ Sbjct: 2823 YFI---SDGRPNQATDNN 2837 >gi|260363438|ref|ZP_05776284.1| Ig domain protein [Vibrio parahaemolyticus K5030] gi|260876569|ref|ZP_05888924.1| putative RTX toxin [Vibrio parahaemolyticus AN-5034] gi|260895071|ref|ZP_05903567.1| putative RTX toxin [Vibrio parahaemolyticus Peru-466] gi|308085969|gb|EFO35664.1| putative RTX toxin [Vibrio parahaemolyticus Peru-466] gi|308093615|gb|EFO43310.1| putative RTX toxin [Vibrio parahaemolyticus AN-5034] gi|308113743|gb|EFO51283.1| Ig domain protein [Vibrio parahaemolyticus K5030] Length = 3199 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 42/258 (16%), Positives = 79/258 (30%), Gaps = 29/258 (11%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 T +N L N + + N D + + V Sbjct: 2598 SGLTLEVNGKTVTENSDGSYTVDLSSEPNQTSHQLSGTITVPADYNGSLDFDVTATAGSV 2657 Query: 120 VVPQNEGYSISAISRYKIPLKFC------TFIPWYTNSRHIVMPITSSVKVNSQTDARLD 173 V N S +F I ++ IV + V Q + Sbjct: 2658 EVNNNTQMGADTASVSVRDYEFVSGTHGDNNIVGSDDNDVIVGDVQGLQIVEGQD---YN 2714 Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM ++T++ +N ++E + V L TFS + ++ Sbjct: 2715 IAFMLDTSGSMGYDVGRAVTELKTV---LNTLIESASG--PHSGKVNVLLTTFSTESKQV 2769 Query: 234 FLL----EWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + S ++ + + K G +TN G + A N ++ ++ N Sbjct: 2770 LELDLSSDNAKSQVESILDAIVKLGDGNTNYEAGFQSALNWF-------ENADSGATNLS 2822 Query: 289 KIIVFMTDGENLSTKEDQ 306 I +DG ++ Sbjct: 2823 YFI---SDGRPNQATDNN 2837 >gi|225860988|ref|YP_002742497.1| collagen adhesion protein [Streptococcus pneumoniae Taiwan19F-14] gi|225726444|gb|ACO22295.1| collagen adhesion protein [Streptococcus pneumoniae Taiwan19F-14] Length = 167 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 25/100 (25%), Positives = 48/100 (48%), Gaps = 9/100 (9%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME----SFFDSS--IT 193 K T I + + + ITS + +Q++ LD+++V D S SME + S I+ Sbjct: 47 KTITPISGQEDQYELSLDITSKLGTETQSE-PLDVVLVADFSGSMEERDVWSYSSRRYIS 105 Query: 194 KIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE 231 +I+ ++ + LI + N+ + +V F+ KI+ Sbjct: 106 RIEALKHTLKGVNGRQGLIDTILSNSQNRLSIVGFAGKID 145 >gi|194686862|dbj|BAG66069.1| complement factor B-2 [Lethenteron japonicum] Length = 749 Score = 42.1 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 68/206 (33%), Gaps = 19/206 (9%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 +F + + S + T+ +L++ +++D S+S+ D + Sbjct: 219 SFDTAEDIEEALNNTMLSLLHRQGNTEEKLNVYLLIDASQSVGK---------DNFQLGV 269 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-HLQRKIKYLSKFG------- 254 ++ + + + + +V F+++ + L + + I L + Sbjct: 270 GFLMNLILKMSNYAIQAKYAIVVFASQAKTKLALTEKANQDPKAVINILKELSYSEFEES 329 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 TN+ L I + +QH K I +TDG + + + + Sbjct: 330 PGTNTGEALLQVVALINSYKQQQQHLPIPWNQNKHISFIVTDGRSNMGPPPKIVMN--EK 387 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACA 340 +Y GI + E + + Sbjct: 388 FADIPMDIYTFGIGNVYKEELVSIAS 413 >gi|330508119|ref|YP_004384547.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] gi|328928927|gb|AEB68729.1| magnesium-chelatase subunit ChlD (Mg-protoporphyrin IXchelatase) (Mg-chelatase subunit D) [Methanosaeta concilii GP-6] Length = 661 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 69/209 (33%), Gaps = 29/209 (13%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + +I A R P + P I++ + ++ V+D S Sbjct: 415 QEGKDIAIDATIRAAAPYQRIRSGPNA-----IIVKGEDIREKERVGKTSAVVLFVVDAS 469 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM + +++ A ++ ++L + + G++ F K E L + Sbjct: 470 GSMGAN-----QRMESAKGAVLSLL-----MDSYQKRDKIGMIAFKGKEAEIILPP--CT 517 Query: 242 HLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-- 297 + + L G T + GL + Q +D K ++V ++DG Sbjct: 518 SVDLALGRLRELPTGGKTPLSAGLSRGLQLL-------QGELRKDEESKLMMVLISDGRA 570 Query: 298 -ENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 E + K + + K G I Sbjct: 571 NEGMGGKIKDELMAISERIKHLGIHTIVI 599 >gi|311898833|dbj|BAJ31241.1| hypothetical protein KSE_54660 [Kitasatospora setae KM-6054] Length = 442 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 44/141 (31%), Gaps = 19/141 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---LIPDVNNVVQSGLVT 225 +++V+D S SME K+ A ++ A ++E++ V Q+G V Sbjct: 52 GPGAAVVLVIDCSGSMEH----PAGKMRHAREATAAAIDELRDGVAFAVVAGTHQAGDV- 106 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + ++ L+ G T + A Sbjct: 107 YPGNNRLAVADRATREQAKEALRRLNPSG-GTAIGTWIAKAQQLFASHPEASI------- 158 Query: 286 NYKKIIVFMTDGENLSTKEDQ 306 + + +TDG N K Sbjct: 159 ---RHAILLTDGRNEHEKPAD 176 >gi|296236487|ref|XP_002763348.1| PREDICTED: protein DDX26B [Callithrix jacchus] Length = 861 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 52/131 (39%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLVDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 + W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKVGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|261819502|ref|YP_003257608.1| von Willebrand factor A [Pectobacterium wasabiae WPP163] gi|261603515|gb|ACX86001.1| VWA containing CoxE family protein [Pectobacterium wasabiae WPP163] Length = 379 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 80/259 (30%), Gaps = 50/259 (19%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 E++H+F L +M + + G N + + DI+ ++ + + R Sbjct: 100 EITHLFNDPQALQAMEPTPALAKV-LLSLRGRMNEQTREAVRDIIRKVVDEILRTLRP-- 156 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 F N + + QN + + + + ++ Sbjct: 157 ---TFTNALTGRRNRFRRSPIASSQN----------FDWRATIAANLKHFDREKQRLVIE 203 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T Q D+++ +D S SM S ++ + +P Sbjct: 204 TPHFNSRMQRHMPWDVILCVDQSASMSS-----------SVMYAAVCASILATLP----A 248 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V+ L+ F ++ + L + + + G TN ++Y ++ Sbjct: 249 VRVSLIVFDTQVVDLSHLAHDPVEVLMTV----QLGGGTNIAKAMQYCEQRV-------- 296 Query: 279 HCNTEDANYKKIIV-FMTD 296 N K+ IV ++D Sbjct: 297 ------QNPKRTIVALISD 309 >gi|91228647|ref|ZP_01262563.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] gi|91187798|gb|EAS74114.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] Length = 2510 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 62/172 (36%), Gaps = 24/172 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 V ++T ++ +++DVS SM + ++ + S +L + + I + Sbjct: 2058 DVVAETKESANVQLIMDVSGSMRTD-----NRLQIMKDSATQLLNQYESI----GQTRVQ 2108 Query: 223 LVTFSNKIEEFF--LLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 ++T+S+ + W + I+ L+ G +TN L A D+ + Sbjct: 2109 IITYSSTASTYAIGAATWLTVEEAKAYIETLTA-GGATNYNNALNEAKQSWDDVGKLTSA 2167 Query: 280 CNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIR 328 N + F++DG+ D + + + + A+ Sbjct: 2168 SN--------VSYFLSDGQPNPASSFINDAREQSWIDHLTDPDNQITALAYG 2211 >gi|298375542|ref|ZP_06985499.1| BatB protein [Bacteroides sp. 3_1_19] gi|298268042|gb|EFI09698.1| BatB protein [Bacteroides sp. 3_1_19] Length = 574 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 16/139 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++D A + ++ + + N+ V Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPNRLDKAKQMLSRLTDGF-----TNDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N + Sbjct: 132 -GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQ-GTAIGAAIN-------LAARS 182 Query: 281 NTEDANYKKIIVFMTDGEN 299 T D K I+ +TDGEN Sbjct: 183 FTPDETTDKAIILITDGEN 201 >gi|295093844|emb|CBK82935.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 343 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 73/223 (32%), Gaps = 49/223 (21%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++ +D+S S++ + +L+++K D + G+V F+ Sbjct: 97 DIILCIDISTSVDYLNE--------------NLLDKLKKTVDELQGERFGIVIFNTSPVL 142 Query: 233 FFLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT--- 282 L +++ ++ +++ + + G + Y Q + G Sbjct: 143 LTPLTDDYEYVKDQLDLIAQCLKSRNEVNLDDAFSSGYDWIYYQAYISSGTLIGNEQRGS 202 Query: 283 ----------------EDANYKKIIVFMTDGENLSTKEDQ--QSLYYCNEAKKRGAIVYA 324 D K+++F TD + T ++ C VY Sbjct: 203 SLIGDGLAAAAIDFSDADKERTKVVIFSTDNDIQGTPVATLDEAADICV---SNNVTVYG 259 Query: 325 IGI--RVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGK 363 +G + E ++ + FYL E S + S I K Sbjct: 260 VGTKEMTPENKESMKNAVEKTGGKFYLEEESGSFGEIVSSIEK 302 >gi|256820509|ref|YP_003141788.1| hypothetical protein Coch_1682 [Capnocytophaga ochracea DSM 7271] gi|256582092|gb|ACU93227.1| conserved hypothetical protein [Capnocytophaga ochracea DSM 7271] Length = 287 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 62/161 (38%), Gaps = 28/161 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S F + I I A L L + +GL+ F Sbjct: 72 EEERELTLMLMIDVSGS--ELFGTKQEFKSEVITEIAATLAFSALQNNDK----TGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE + + G SH+ R I+ L +F TN + L++ Sbjct: 126 SDQIELYIPPKKGKSHVLRIIRELIEFQPKSFKTNISEALQF----------------LS 169 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVFM + K ++ + G +Y Sbjct: 170 RVTKKKAIVFML--SDFMDKGYEKPIQIAARKHDITGIRIY 208 >gi|194334089|ref|YP_002015949.1| Magnesium chelatase [Prosthecochloris aestuarii DSM 271] gi|194311907|gb|ACF46302.1| Magnesium chelatase [Prosthecochloris aestuarii DSM 271] Length = 650 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 44/127 (34%), Gaps = 17/127 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM + ++ + ++ ++L I + LV+F Sbjct: 467 LIFVVDASGSM-----GARGRMAASKGAVMSLL-----IDAYQKRDKLALVSFRKNEAVV 516 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L L K+ G T GL Y R H E +++F Sbjct: 517 NLPVTSSVELAGKLLREMPVGGRTPFAAGLVKGYEI------ARSHLMKEPDARP-MVIF 569 Query: 294 MTDGENL 300 +TDG+ Sbjct: 570 VTDGKAN 576 >gi|221135718|ref|XP_002171014.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 132 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 34/105 (32%), Gaps = 7/105 (6%) Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 +T L+ A +F N K+I+ +TDG + + Sbjct: 1 MNFTTRIDRALRLAQKDMFTS------ANGGRVGVSKLIILLTDGSQTPGGDAEDPERIA 54 Query: 313 NEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSFYLVENPHSMYD 356 +E + G ++ +GI + E + Y S+ D Sbjct: 55 DELRNDGVVILGVGIGSAVNETELSHITGGKKNAYTAATFDSLTD 99 >gi|29349873|ref|NP_813376.1| putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341784|gb|AAO79570.1| conserved hypothetical protein, putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] Length = 621 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 35/275 (12%), Positives = 94/275 (34%), Gaps = 28/275 (10%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST-----SLDIVVVP 122 + K + ++ + +F ++ + N I T ++ + Sbjct: 142 DAVNTEEYKSISENGFKQVGESPLSTFSIDVDAASYSNMRRMINSGTLPVPDAIRTEELV 201 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y + + P+K + +V + ++ + +++ ++DV Sbjct: 202 NYFSYDYAKPTGSD-PVKITMEAGVCPWNADHRLVRIGLKAREIPTDKLPESNLVFLIDV 260 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM T++D+ S+ ++ ++ V VV +G + ++ Sbjct: 261 SGSMW-----GPTRLDLVKSSLKLLVNNLREKDKVAIVVYAG----NASVKLESTPGSDK 311 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-N 299 ++ I L+ G + G++ AY ++ + + N I+ +DG+ N Sbjct: 312 QKIRDAIDELTSGGST-AGGAGIQLAY-KVAKQNFLPKGNNR--------IILCSDGDFN 361 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + + +K G + +G + + Sbjct: 362 VGVSSVEGLEQLIEKERKSGVFLSVLGYGMGNYKD 396 >gi|288942349|ref|YP_003444589.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897721|gb|ADC63557.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 721 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 80/218 (36%), Gaps = 36/218 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-------RLDMMIVLDVSRSMESFFDSS 191 L+ + +P + + P + A +D++ V+D +RSM+ F D Sbjct: 251 LQLASAVPGARGADTLQTPEYREQAQTGRDTADQGIEDIAVDIVFVIDTTRSMQPFID-- 308 Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH-------LQ 244 +T+ +A + E + + + GLVT+ + +E L++ + + Sbjct: 309 MTRDAVARMA-RNFSERTQ------DRFRFGLVTYRDSLEAVPALDYLTRNHTPTLVTAE 361 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD------GE 298 R + L +T L Y +F + K ++ + D G+ Sbjct: 362 RLAEILDTDAKATRVGS-LGYD-EDVFAGVDAALRQSPWREKSIKFVILIGDASSHLKGD 419 Query: 299 --NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N + K++ ++A + AI ++ ++ E Sbjct: 420 AHNSTRKDETDLRREYDDA---NVHLLAIHLQNPKAAE 454 >gi|271967167|ref|YP_003341363.1| chelatase protein [Streptosporangium roseum DSM 43021] gi|270510342|gb|ACZ88620.1| chelatase protein [Streptosporangium roseum DSM 43021] Length = 713 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 29/243 (11%), Positives = 72/243 (29%), Gaps = 24/243 (9%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 G R+ R + + + R+T Sbjct: 430 AGTAAGPRPEAGTTRRDQAFDADPARPARLFTVPGIGAGAPGRRSRARTPLGRTTGAR-T 488 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + A P + + + +++ + + ++ V+D Sbjct: 489 PEGRPASLHLKATVGAAAPYQ----LGRGRSGPGLLLRREDLREAVREGREGNLVLFVVD 544 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEW 238 S SM ++ ++ ++ +L + + GLVTF + + Sbjct: 545 ASGSM-----AARQRMRAVKTAVLGLL-----LDAYQRRDKVGLVTFRGTRADLVLPPTS 594 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 V +++ L G + + L+ A ++ ++ +R ++V +TDG Sbjct: 595 SVEAGAARLRDLPTGGRTPLAAGLLRAA--EVLRVERLRDPARRP------LVVLVTDGR 646 Query: 299 NLS 301 S Sbjct: 647 ATS 649 >gi|296810344|ref|XP_002845510.1| von Willebrand RING finger domain-containing protein [Arthroderma otae CBS 113480] gi|238842898|gb|EEQ32560.1| von Willebrand RING finger domain-containing protein [Arthroderma otae CBS 113480] Length = 1033 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 37/267 (13%), Positives = 85/267 (31%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N +TW + N + V + + Y Sbjct: 408 SVNELPSFHLRFSTLHQLDTWKRAL------GDLQNGGVVAHPDHEV-EVPTAEEDDYRN 460 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN---SQTDARLDMMIVLDVSRSMES 186 S + + + +S ++ TS + + S LD+++V+ VS SM+ Sbjct: 461 SKTPSKRESSGYSSSYGAGKSSNTVMTDYTSPSRESVPPSTFHVPLDIVVVIPVSSSMQG 520 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + +++ +++ + + GLVTF + + W G Sbjct: 521 L------KISLLRDTLHFLVQNLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 568 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 569 GKVFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 619 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 620 DPESGDFVVSRAEAAK-VGIHTFGLGL 645 >gi|301605569|ref|XP_002932342.1| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor-like [Xenopus (Silurana) tropicalis] Length = 2869 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 62/186 (33%), Gaps = 33/186 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN--VVQSG-- 222 + +D++ VLD S S+ ++ + + A +++V + V+Q G Sbjct: 1737 NCEKPMDVIFVLDGSSSVR------ASQFEEMKTFVKAFIKKVNIGVGATQVSVLQYGWR 1790 Query: 223 ---LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQ 278 + +++ E+ LL + + + K L + Sbjct: 1791 NILEIAWTDPQEKESLL--------KAVDNIQKAEDGPAKIGEALLFGVRTAMS----EV 1838 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H + K ++ +TD Q N A V+ IGI + L+ Sbjct: 1839 HGARPGST-KIAVIVITD------VSVDQVAEAANSAVINRVSVFPIGIGTRYDSQQLKV 1891 Query: 339 CASPNS 344 A P++ Sbjct: 1892 LAGPSA 1897 >gi|300704937|ref|YP_003746540.1| hypothetical protein RCFBP_20766 [Ralstonia solanacearum CFBP2957] gi|299072601|emb|CBJ43951.1| conserved exported protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 340 Score = 41.7 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 43/124 (34%), Gaps = 2/124 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM-NEGNG 71 +G + +L+ I L I + M I+++H+F ++ L + D + + A + Sbjct: 6 RRQRGAVGVLSPILLIIFLAIGAMAIDIAHLFVVRNELQNAADAAALAGAAGLYPANPKP 65 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN-EGYSIS 130 N + G ++ + N G+ N S I + G ++ Sbjct: 66 NWSNGVAQGTSAIKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVAGVQVT 125 Query: 131 AISR 134 Sbjct: 126 VTRS 129 >gi|260813594|ref|XP_002601502.1| hypothetical protein BRAFLDRAFT_152929 [Branchiostoma floridae] gi|229286799|gb|EEN57514.1| hypothetical protein BRAFLDRAFT_152929 [Branchiostoma floridae] Length = 96 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 39/97 (40%), Gaps = 11/97 (11%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD++++LD S S+ ++ + + ++E + P Q G++ +S Sbjct: 9 RTPLDIILLLDGSGSVGAY------NFEKVKQFSQKLVETFDIGPSG---TQIGVIQYST 59 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGL 263 ++ + F + S L + ++ T + + Sbjct: 60 RVRQEFSMNSFQSKETLSNAVDDIAYMRGGTLTGRAI 96 >gi|149911640|ref|ZP_01900251.1| hypothetical protein PE36_02984 [Moritella sp. PE36] gi|149805310|gb|EDM65324.1| hypothetical protein PE36_02984 [Moritella sp. PE36] Length = 2370 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 41/294 (13%), Positives = 96/294 (32%), Gaps = 33/294 (11%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 + + + V + + N+ + + + I + S ++ ++D Sbjct: 1669 NSGNSATVVSGNLLDNDSGIGSTTSITA--VGNTIVTPGDNSITIDMDQGTLTVYVEDDG 1726 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR 171 + D + + + I + + + + I V + + Sbjct: 1727 SHRAGDYTYTLETAATTNAESIVDSIDYQLTDSNSGNQTTAQLAITIVDDAPVATNIEQT 1786 Query: 172 LD---------MMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 L+ ++IVLD S SM+ + + +++D+A ++++++ E + +VN Sbjct: 1787 LEAAAGGYTTNLIIVLDKSGSMDYSANDGTGRSRMDIAKEALSSLFNEYDNVGNVN---- 1842 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + F W + YL+ V + A N++ D Sbjct: 1843 ---IQFVAFDYYATKSSWYEDDINAAQDYLN--DVDADGGTSYDVALNKVMDDYQA---- 1893 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQS---LYYCNEAKKRGA-IVYAIGIRVI 330 K ++ F++DGE + S + N +AIGI Sbjct: 1894 ---PVADKSVVYFISDGEPSYQEAGINSNEQATWENFLVDNNIDKSFAIGIGNG 1944 >gi|15807991|ref|NP_285654.1| hypothetical protein DR_A0331 [Deinococcus radiodurans R1] gi|6460745|gb|AAF12450.1|AE001863_75 conserved hypothetical protein [Deinococcus radiodurans R1] Length = 494 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 61/188 (32%), Gaps = 40/188 (21%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLDVSRSMESFFDSSIT----- 193 ++ + + + + +T + Q L ++ V+D S SM Sbjct: 42 PHREYLLAQSAGQKLFLTLTVKPTAEARQARPDLSVVFVVDTSGSMREVVTEPTERTGRT 101 Query: 194 ----------------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KIEEFFL 235 K+ + I+++ ++ + P + LV F + ++ F+ Sbjct: 102 TQVDGQVYEVVKGGKNKMQLMIEALRGIVTSTLIRPGD----RLALVKFDDTAEVLVPFI 157 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + L + L + T+ G++ + +G R+ +V ++ Sbjct: 158 TASNQAQLAAAVDKLDWYSGGTHMGAGMRAGAGLLTGERGSRR------------MVLIS 205 Query: 296 DGENLSTK 303 DG+ Sbjct: 206 DGQTFDAP 213 >gi|113478293|ref|YP_724354.1| protoporphyrin IX magnesium-chelatase [Trichodesmium erythraeum IMS101] gi|110169341|gb|ABG53881.1| protoporphyrin IX magnesium-chelatase [Trichodesmium erythraeum IMS101] Length = 672 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 79/253 (31%), Gaps = 44/253 (17%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + ++ A R P + + N R IV K ++ L ++ V+D S Sbjct: 427 GKVRRIAVDATLRAAAPYQ-KSRRERQPNRRVIVEQGDMRAKRLARKAGAL-VVFVVDAS 484 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGV 240 SM ++ ++ A ++ +L E D Q L+ F + E + Sbjct: 485 GSM------ALNRMQSAKGAVMQLLTEAYQSRD-----QVSLIPFRGEQAEVLLPPTRSI 533 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + +R+++ + G S + + + + + +IV +TDG Sbjct: 534 TSARRRLEKMPCGGGSPLAHG--------LTQAVRVGVNAKQSGDVGQVVIVAITDGRGN 585 Query: 301 ---------------STKEDQQSLYYCNEAKKRGAIVYAI-----GIRVIRSHEFLRACA 340 ++ L + + G + I + + E + Sbjct: 586 IPLARSLGEPMLDAEKPNIKEELLEIAGKIRGLGIQLLMIDTENKFVSTGFAKELAKQ-- 643 Query: 341 SPNSFYLVENPHS 353 S +Y + Sbjct: 644 SGGKYYHLPKASD 656 >gi|118353091|ref|XP_001009816.1| hypothetical protein TTHERM_00160860 [Tetrahymena thermophila] gi|89291583|gb|EAR89571.1| hypothetical protein TTHERM_00160860 [Tetrahymena thermophila SB210] Length = 1187 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 39/266 (14%), Positives = 90/266 (33%), Gaps = 34/266 (12%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSM 184 I Y ++ + + I + + K+ +++++++D S+ M Sbjct: 935 TRIIEEGQVYDSQIRKYAMMLLKNIMEKFNLEIPTLIPKLQKFNLRIIELIMLIDYSKDM 994 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH-- 242 + +I + ++ +++ + G V F+ + E F L+ + Sbjct: 995 T------VDQIKYSHARCRSIFDKIDKKD------KIGFVQFNELVHENFPLQEKELYND 1042 Query: 243 -LQRKIKYLS-KFGVSTNSTPGLK---------YAYNQIFDMQGMRQHCNTEDANYKKII 291 L++KI+ + G +N L N I D + + E+ + I Sbjct: 1043 LLEKKIRSIPMSTGGKSNLYKALDVTTGLFDKCQQMNNIED-NQKKSTLDQENNERLRFI 1101 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASP----NSFY 346 T+G N ++ +++ + KK+ + I I ++ LR A F+ Sbjct: 1102 CLFTEGNNQIKEDQLEAIK--EKMKKKQVNLIIINICNQNANMLYLRYLAKEISELGQFF 1159 Query: 347 LVENPHSMYDAFSHIGKDIVTKRIWY 372 + K I + I Sbjct: 1160 QQGDDDVEEYLNKQREKQIKRQLICE 1185 >gi|315223610|ref|ZP_07865464.1| von Willebrand factor [Capnocytophaga ochracea F0287] gi|314946391|gb|EFS98386.1| von Willebrand factor [Capnocytophaga ochracea F0287] Length = 287 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 62/161 (38%), Gaps = 28/161 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S F + I I A L L + +GL+ F Sbjct: 72 EEERELTLMLMIDVSGS--ELFGTKQEFKSEVITEIAATLAFSALQNNDK----TGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE + + G SH+ R I+ L +F TN + L++ Sbjct: 126 SDQIELYIPPKKGKSHVLRIIRELIEFQPKSFKTNISEALQF----------------LS 169 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVFM + K ++ + G +Y Sbjct: 170 RVRKKKAIVFML--SDFMDKGYEKPIQIAARKHDITGIRIY 208 >gi|307609426|emb|CBW98915.1| hypothetical protein LPW_07021 [Legionella pneumophila 130b] Length = 1169 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 12/87 (13%) Query: 293 FMTDGENLSTKEDQ--QSLYYCNEAKKRGAIVYAIGIRV-------IRSHEFLRACA--- 340 F DG ST ++ K G +++ IG+ + LRA A Sbjct: 442 FNADGSLNSTNSQALSDAINEIKALKNDGVLIFIIGMGAGVDPAVNPEAAATLRAMAVAG 501 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVT 367 ++Y +P ++ + + I +I Sbjct: 502 GTENYYPATSPETLVSSLNSILSNIQN 528 >gi|290979266|ref|XP_002672355.1| predicted protein [Naegleria gruberi] gi|284085931|gb|EFC39611.1| predicted protein [Naegleria gruberi] Length = 1515 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/128 (21%), Positives = 51/128 (39%), Gaps = 16/128 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A+LD + +LD S+S+ + ++ + +KL D V+ +V F + Sbjct: 1232 AKLDFVFLLDSSQSIG------VPNFNLIRDLCQRITGRMKLADDA---VKVSIVRFHSS 1282 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S + + + + T PGL+ A N + + N A+ K Sbjct: 1283 ATLTLALSTDGSLINKTLSTMKYDAGFTAQVPGLRAAVNAL-------RPENGGRADADK 1335 Query: 290 IIVFMTDG 297 +I +TDG Sbjct: 1336 VIYILTDG 1343 >gi|258619934|ref|ZP_05714975.1| RTX protein, putative [Vibrio mimicus VM573] gi|258587804|gb|EEW12512.1| RTX protein, putative [Vibrio mimicus VM573] Length = 1108 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 65/192 (33%), Gaps = 20/192 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+ ++ ++LDVS SM++ + +++ + +S +LE+ + I VQ + Sbjct: 492 ESEAKQGANVQLILDVSGSMDNSAGNGKSRLQVMKESAIQLLEQYQAIGQTK--VQLIIF 549 Query: 225 TFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ W + I+ L G + L+ A D H Sbjct: 550 AGTASVQLHEKALWMTVEQAKSYIETLKAGGQTDYDH-ALQLA-----DENWSGNHNGIP 603 Query: 284 DANYKKIIVFMTDG-----------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + F++DG EN +T E + + + + A G+ Sbjct: 604 LIGATNVSYFLSDGRPEGFDKQNGVENQNTIEPSELASWISHLQDNEITALAYGMGNSVP 663 Query: 333 HEFLRACASPNS 344 L A Sbjct: 664 QSELDKIAFDGH 675 >gi|327402315|ref|YP_004343153.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327317823|gb|AEA42315.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 639 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 72/217 (33%), Gaps = 38/217 (17%) Query: 115 SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDM 174 L I + + +I+ ++ + L +K++ Q +D+ Sbjct: 273 KLSIQISTNGKTVTINKVTAFDKGLNH------------------QKLKISCQEVENVDV 314 Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 V+D + SM +I+ +N ++ E + I + N + V + + + + Sbjct: 315 AFVVDATGSMGD-------EINFLKAEMNDVIFESRKISNKLNF-RFANVFYRDMTDSYI 366 Query: 235 LLEWGVSHL----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + I + G + + + + +H +I Sbjct: 367 TKSMNFNRILTESTAFIDEQNAGGGG-DFPEAMDVGLDSAINNLSWSEHARA------RI 419 Query: 291 IVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + D S + +++ +A ++G + IG Sbjct: 420 LFLILDAPPHQSPQVNERLQKLILQAAEKGIRIVPIG 456 >gi|309358783|emb|CAP33694.2| hypothetical protein CBG_15368 [Caenorhabditis briggsae AF16] Length = 391 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 62/170 (36%), Gaps = 18/170 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ V+D S+ M + +T I I S+ + ++ + + GLVT+S+K Sbjct: 35 LDVVAVVDNSQGMT---NDGLTAIAANIASVFSEGTKIGTNSNDPRTTRLGLVTYSSKAT 91 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-----GLKYAYNQIFDMQGMRQHCNTEDAN 286 + L+ S + + + GL+ A + N + Sbjct: 92 KNAYLDKFQSIDDLYDNIFTDLATVSQTDDSNLETGLEAAEEIL------EAGKNEKRKF 145 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 YKK+I+ + + +++ N K G + + E L Sbjct: 146 YKKLILIYA----STFNRNGEAISIANRLKSAGTKLVTVAYDQGGGDEQL 191 >gi|307294184|ref|ZP_07574028.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] gi|306880335|gb|EFN11552.1| Protein of unknown function DUF2134, membrane [Sphingobium chlorophenolicum L-1] Length = 417 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 7/53 (13%), Positives = 18/53 (33%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDR 56 ++ + G + ++ A +PI+ G+ + K + D Sbjct: 2 FGGSMGRLLRDRTGNVLMMAAACMPILIGAAGLATDTVQWTLWKRQVQRQADS 54 >gi|257876502|ref|ZP_05656155.1| predicted protein [Enterococcus casseliflavus EC20] gi|257810668|gb|EEV39488.1| predicted protein [Enterococcus casseliflavus EC20] Length = 799 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 44/104 (42%), Gaps = 11/104 (10%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD- 214 + + +QT +D+++V+D S SM + ++ + + + D Sbjct: 96 LFDVTLAVKGNQTSTPVDLVMVIDYSSSMTG---------EKLTNALLGLQQFGNELADS 146 Query: 215 -VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 N ++ G+V ++ +I +++L++ ++ ++ T Sbjct: 147 LANGSIRIGIVAYNREIYTTNGFSTDIAYLEQFLRETAETHSGT 190 >gi|225012027|ref|ZP_03702464.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003582|gb|EEG41555.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 346 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 67/172 (38%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ + +D++ +DVS+SM + D + +++ + + ++A+L ++ + Sbjct: 79 GTQLETVKREGVDIVFAIDVSKSMLAE-DIAPNRLEKSKRLVSAILNQLASD-------R 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G++ ++ + + S + ++ L +T A + D+ G Sbjct: 131 VGIIAYAAQAVPQLPITTDYSAAKMFLQAL-----NTEMLSSQGTALDSAIDLSGT---F 182 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ ++I ++DGE+ S + G ++ G+ Sbjct: 183 FDDEDQTNRVIFLISDGEDHSEDASNAASRAAAM----GIKIFTFGVGTEAG 230 >gi|260822965|ref|XP_002602288.1| hypothetical protein BRAFLDRAFT_121493 [Branchiostoma floridae] gi|229287595|gb|EEN58300.1| hypothetical protein BRAFLDRAFT_121493 [Branchiostoma floridae] Length = 830 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 60/166 (36%), Gaps = 33/166 (19%) Query: 171 RLDMMIVLDVSR--SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 LD+++V+D+S SM +FF ID+ I ++ V G++T+ Sbjct: 576 SLDLILVVDLSSSISMSAFFSIKRLIIDLIYCFIEFCVD-----------VGIGIITYDC 624 Query: 229 KIEEFFLLEWGVSHLQRKIKYLS----KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + + I ++ G T++ L+Y + D R Sbjct: 625 VLTWYLPVCTYPMENPTLIDTVNCLMYAAGGETHTGGALRY----MADTADFRDG----- 675 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + V +TDG+ +D + A+ G ++ + Sbjct: 676 --FPRAAVVITDGQ----SDDDYTAEVA-AARSAGIGLWGVAAGHP 714 >gi|149728587|ref|XP_001492576.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H1 [Equus caballus] Length = 908 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 70/192 (36%), Gaps = 20/192 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ-SGLVTFSNKIEE 232 ++ V+D+S SM+ K+ +++ +L +++ + V+ SG+ ++ + Sbjct: 292 LVFVIDISTSMQG------QKVQQTKEALLKILGDMRPGDYFDLVLFGSGVQSWKGSLVP 345 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ + STN GL + QG +N+ I++ Sbjct: 346 AS--AANLQAARDFVQRFT-LEGSTNLNGGLLQGIEILNKAQGSLPEV----SNHASILI 398 Query: 293 FMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-----FY 346 +TDGE + Q L A + +Y +G FL + N+ Y Sbjct: 399 MLTDGEPTEGVTDRSQILKNVRNAIRGKFPLYNLGFGQNVDFNFLEVMSMENNGRAQRIY 458 Query: 347 LVENPHSMYDAF 358 + F Sbjct: 459 EDHDATQQLQGF 470 >gi|41054964|ref|NP_956728.1| integrator complex subunit 6 [Danio rerio] gi|82187780|sp|Q7SYD9|INT6_DANRE RecName: Full=Integrator complex subunit 6; Short=Int6; AltName: Full=Protein DDX26B gi|32766576|gb|AAH54905.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Danio rerio] Length = 892 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 54/132 (40%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ ++ ++ + + LVTF + Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFMK-LRARDPASRGDRYMLVTFDDPPYGV 62 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ A+ + ++ + + + Sbjct: 63 KA-GWKENHATFMSELKNLQASGLTTL-GHALRAAF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDG 297 +I+ +TDG Sbjct: 120 LEPSVIITITDG 131 >gi|291408281|ref|XP_002720452.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Oryctolagus cuniculus] Length = 862 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 53/132 (40%), Gaps = 10/132 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVST-----NSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T S+ L I + Q N + Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTLGQALRSSFDLLNLNRLISGIDNYGQGRNPFFLD 121 Query: 287 YKKIIVFMTDGE 298 I++ +TDG+ Sbjct: 122 PS-ILITITDGK 132 >gi|320010772|gb|ADW05622.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 453 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 51/156 (32%), Gaps = 30/156 (19%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +++++D S SM D TK+ A + + I + + + +V Sbjct: 59 PGQAPAAAVVLMVDCSGSM----DYPPTKMRHARDA------TAEAIDTLRDGTRFAVVA 108 Query: 226 FSNKIEEFFL-----LEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ ++ + G + + L + G T L+ A + + + Sbjct: 109 GTHVAKDVYPGNGRLATAGPQTKAQAKEALRRLSAGGGTAIGTWLRLADRLLGAAEVDIR 168 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYC 312 H + +TDG N + +L C Sbjct: 169 HG-----------ILLTDGRNEHESPEDLRAALESC 193 >gi|16329379|ref|NP_440107.1| Mg chelatase subunit ChlD [Synechocystis sp. PCC 6803] gi|3913253|sp|P72772|CHLD_SYNY3 RecName: Full=Magnesium-chelatase subunit ChlD; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase gi|1651860|dbj|BAA16787.1| Mg chelatase subunit; ChlD [Synechocystis sp. PCC 6803] Length = 676 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 67/201 (33%), Gaps = 42/201 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEE 232 ++ ++D S SM ++ ++ A ++ +L E N Q L+ F + E Sbjct: 482 IVFLVDASGSM------ALNRMQAAKGAVMQLLTEA-----YENRDQVSLIPFQGENAEV 530 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 ++ +++++ L G S S ++ A N + ++ + +IV Sbjct: 531 LLPPTRSIAMAKKRLETLPCGGGSPLSHGLMQ-AVNVGMN---AKRSGDIGQV----VIV 582 Query: 293 FMTDGENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRS 332 +TDG K D ++ L + + G + I + Sbjct: 583 AITDGRGNIPLARSLGDEIPEGEKPDIKAELLEIAAKIRGLGMQLLVINTEKKFVSTGFG 642 Query: 333 HEFLRACASPNSFYLVENPHS 353 E + +Y + Sbjct: 643 KELAQKAG--GKYYQLPKATD 661 >gi|1477448|emb|CAA65418.1| magnesium chelatase [Synechocystis sp. PCC 6803] Length = 676 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 67/201 (33%), Gaps = 42/201 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEE 232 ++ ++D S SM ++ ++ A ++ +L E N Q L+ F + E Sbjct: 482 IVFLVDASGSM------ALNRMQAAKGAVMQLLTEA-----YENRDQVSLIPFQGENAEV 530 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 ++ +++++ L G S S ++ A N + ++ + +IV Sbjct: 531 LLPPTRSIAMAKKRLETLPCGGGSPLSHGLMQ-AVNVGMN---AKRSGDIGQV----VIV 582 Query: 293 FMTDGENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRS 332 +TDG K D ++ L + + G + I + Sbjct: 583 AITDGRGNIPLARSLGDEIPEGEKPDIKAELLEIAAKIRGLGMQLLVINTEKKFVSTGFG 642 Query: 333 HEFLRACASPNSFYLVENPHS 353 E + +Y + Sbjct: 643 KELAQKAG--GKYYQLPKATD 661 >gi|296122919|ref|YP_003630697.1| hypothetical protein Plim_2675 [Planctomyces limnophilus DSM 3776] gi|296015259|gb|ADG68498.1| hypothetical protein Plim_2675 [Planctomyces limnophilus DSM 3776] Length = 645 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 61/174 (35%), Gaps = 18/174 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + I + S + ++ +LDVS SM + +++ I A L+ +KL Sbjct: 180 AVDILTEEIAQSLQEGPTRVVWILDVSPSMTEYRRVIADRMEK----IYAQLKSMKL--P 233 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFD 272 + +++GL+ F +K ++ I L N+ + A N Sbjct: 234 IETSLETGLMAFDDKPHALSTKTVKDPLKIKDLINRLPSTERGIENTYLAVMNARN---- 289 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 R E ++ + + +TD E + + +K + IG Sbjct: 290 ----RYGPAKERVPFRVMFIVVTDEEGSDVSRNLD--HCIELMQKDLIRCFVIG 337 >gi|221068121|ref|ZP_03544226.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] gi|220713144|gb|EED68512.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] Length = 1268 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 52/140 (37%), Gaps = 19/140 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSI---------TKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + ++MIVLD+S SM + + +++ +A K++ A++ + + D V Sbjct: 848 QTNVMIVLDLSGSMAWDSNGKVLPGGGSNANSRLSLAKKALEALINKYEEYGD----VAV 903 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 LVTF+ + I L+ G + K A N G + Sbjct: 904 KLVTFNGSTANAHATWMSAATAIAIINGLTATGGTPY-----KAALNAAMGTNGFADNVG 958 Query: 282 T-EDANYKKIIVFMTDGENL 300 + + F+TDG Sbjct: 959 KLTGEGVQNVSYFITDGVPT 978 >gi|74136087|ref|NP_001027973.1| complement factor B-1 [Ciona intestinalis] gi|58531104|dbj|BAD89299.1| complement factor B-1 [Ciona intestinalis] Length = 999 Score = 41.7 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 58/184 (31%), Gaps = 22/184 (11%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + ++ ++D SRS+ +D ++K ++ Sbjct: 413 PSSGVGGRSITADHEGGNEIYFLIDFSRSVTD------EALDHSLKFAQKLVTRFSGG-- 464 Query: 215 VNNVVQSGLVTF------------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 N G++ F + K+ ++ + + K G TN+ Sbjct: 465 TNKTAHYGVIIFASHSKVVLDSRKNQKLNSTKIIAYFKGIFKDKKAMRDTVGGGTNTGAA 524 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAI 321 L+ ++ + N N ++ +TDG N + N+ K+ Sbjct: 525 LQ-DLRKMLGISYRADRINNGGNNRQRHCFILTDGKSNEGENPVKMVKRMENDFKQNPPQ 583 Query: 322 VYAI 325 Y+I Sbjct: 584 FYSI 587 >gi|315612844|ref|ZP_07887755.1| peptidoglycan binding domain protein [Streptococcus sanguinis ATCC 49296] gi|315314954|gb|EFU62995.1| peptidoglycan binding domain protein [Streptococcus sanguinis ATCC 49296] Length = 459 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 38/287 (13%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 ++ A +I + N+ K G+++ K + + + +++ Sbjct: 55 EANKSAVEAAVAEIKADEAKPVVNQSDAKPGELVDVSKKVSPIDVKERQEGDQKIHEETA 114 Query: 110 IV--RSTSLDIVVVPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V T + V + + S +S T + + + + Sbjct: 115 TVEVTKTEVAPVKSEELPAPKTETTESTVSGKDEDGNSYTQYERVDKTTTVEIKKATRTI 174 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---LIPDVNNVVQ 220 S + D++ V+D S SM D+ I+ +++ + + Sbjct: 175 SKSNSA---DIVFVVDRSASMSRTIDTVRNNINEFARNLAKDGVAARFGLATFSDEVYGR 231 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVS--TNSTPGLKYAYNQIFDMQ 274 G + I F + S K L+ G STP L + + Sbjct: 232 IGGLADEGTILTKFNESYFTSDPAELEKALAGIKLAHGGDFPETSTPALTQIVSTYDWSK 291 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + KK +V +TD K Sbjct: 292 SPKN---------KKFVVLLTDAPMKEDPSIPTMANTLASLKAAKIE 329 >gi|239627294|ref|ZP_04670325.1| von Willebrand factor [Clostridiales bacterium 1_7_47_FAA] gi|239517440|gb|EEQ57306.1| von Willebrand factor [Clostridiales bacterium 1_7_47FAA] Length = 681 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 54/146 (36%), Gaps = 25/146 (17%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +V ++ + + VLDVS SM + +D A + I M+ ++ N Sbjct: 304 RVPAEAIPPREYIFVLDVSGSMFGYP------LDTAKELIRNMVSNLRETDTFN------ 351 Query: 223 LVTFSNKI--EEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L+ FSN L +++R I ++ K G T P L+ A D Sbjct: 352 LILFSNDAIRMSARSLPATDENVERAINLINRQKGGGGTELAPALEKAVGIPMDSGAG-- 409 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKE 304 + + +V +TDG + Sbjct: 410 -------SVSRSVVVITDGYMSDEQA 428 >gi|224096382|ref|XP_002192789.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4 [Taeniopygia guttata] Length = 1068 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 57/139 (41%), Gaps = 27/139 (19%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++DVS SM+ ++ +A +I +L+ + N+ V + ++ Sbjct: 240 ATSPKDIVIIVDVSGSMKGL------RMTIAKHTIVTILDTL----GENDFVNI--IAYN 287 Query: 228 NK---IEEFFL------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + IE F H ++ ++ L GV T S L ++ + + + Q Sbjct: 288 DYVHFIEPCFKGILVQADRDNREHFKQLVEELQAKGVGTVS-KALTESFKILREFRDAGQ 346 Query: 279 HCNTEDANYKKIIVFMTDG 297 A I+ +TDG Sbjct: 347 GGLCNQA-----IMLITDG 360 >gi|316973947|gb|EFV57489.1| putative midasin [Trichinella spiralis] Length = 4042 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 58/167 (34%), Gaps = 14/167 (8%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ +T +++ +D S+SME ++ L + + Sbjct: 3822 LRRTRKTKRDYQVVLAIDNSQSME---------LNNVKTMALQSLAMLNEAFYTLEIGHV 3872 Query: 222 GLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G+ F + L + V + + I+ L+ T+ L+ A + Sbjct: 3873 GVCKFGTEPILLQPLTKTFDVHYGAKIIEQLNCNEQRTDIFKLLQCAASLFTPQPASSDR 3932 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAI 325 ++ K+++ ++DG N+ + +Q G ++Y I Sbjct: 3933 TDSG--MPTKLMIILSDGRNVYSVGEQALKDAVRSVFSAGIFVIYII 3977 >gi|307354367|ref|YP_003895418.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] gi|307157600|gb|ADN36980.1| Magnesium chelatase [Methanoplanus petrolearius DSM 11571] Length = 642 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 62/210 (29%), Gaps = 25/210 (11%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 +I A R P + N I + + + ++ ++D Sbjct: 402 RDRNKNDIAIDATLRAAAPFQ----TARDRNGLAIKVDREDLREKVREKRTGDIILFLVD 457 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM ++ ++ ++L + + V GL+ F K L Sbjct: 458 ASGSM-----GVKKRMVAVKGAVLSLLND---AYQKRDTV--GLMIFRRKEATLLLPPTR 507 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN 299 + L K+ G T + G+ A + + + K IV +TDG Sbjct: 508 STDLAHKLLKEIPTGGRTPLSEGVASAVRLLSQGRYSKS-------TDSKTIVILTDGRA 560 Query: 300 LST----KEDQQSLYYCNEAKKRGAIVYAI 325 + ++ A ++ + Sbjct: 561 NYSGSGRNPYEEMRMTALAASEKKIRFVVV 590 >gi|261855692|ref|YP_003262975.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836161|gb|ACX95928.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 756 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 70/204 (34%), Gaps = 32/204 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKI-DMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +M++LD+S S+ S I +++ ++++ + + + D Sbjct: 554 SHRTNGRSIAVMLLLDLSESLNEKVAGSEQSILELSQEAVSLLAWAIDKLGDP-----FA 608 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSK------FGVSTNSTPGLKYAYNQIFDMQGM 276 + F + + + +R + G ST +++A + + + Sbjct: 609 IAGFHSNTRHEVRYQHIKGYSERWDDEVKARLAKIEAGYSTRMGAAMRHAAHYLSAQKAD 668 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 KK+++ +TDG+ L +A++ + +GI Sbjct: 669 -----------KKLMLILTDGKPSDVDTSDDRLLI-EDARQAVKELDQLGIFP------- 709 Query: 337 RACASPNSFYLVENPHSMYDAFSH 360 C S ++ + + + F Sbjct: 710 -YCISLDASLKAGDDDYVREIFGQ 732 >gi|260791673|ref|XP_002590853.1| hypothetical protein BRAFLDRAFT_125711 [Branchiostoma floridae] gi|229276050|gb|EEN46864.1| hypothetical protein BRAFLDRAFT_125711 [Branchiostoma floridae] Length = 3797 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 54/166 (32%), Gaps = 25/166 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T+ D++ +LD S S+ + + +A K I ++ ++ + V+ G + F Sbjct: 3522 CTECSFDIVFILDRSSSIGPYG------MYIAEKYIAHIIRCLRGLD-----VEVGYIVF 3570 Query: 227 SNKIEEFFLLE-WG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + L + + L ++K G + + + G Sbjct: 3571 DCISKWLISLGLYNVDTTALIPEVKAAEFTGGESRVGNAIYHLMCTSTYRNG-------- 3622 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + +TDG N A+ G +YA+ I Sbjct: 3623 ---IPSAAIVLTDGAAYEEYPSNLYEIQSNAARAMGIELYAVAIGR 3665 >gi|118399120|ref|XP_001031886.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila] gi|89286221|gb|EAR84223.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila SB210] Length = 994 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 41/247 (16%), Positives = 92/247 (37%), Gaps = 31/247 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + K K +F + + + + V + + + + IV+D S SM Sbjct: 4 QQSVQAKAQFKVESFEGNEFIDFTLKLKLDETALVQNNSRQK-NYQIVIDNSGSM----- 57 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIK 248 T I + + N +++ V + + L+TF+ I+ L + +++ I Sbjct: 58 -DGTNIQLTKQLCNELVQFVIKTQPHS---KISLMTFNTSIDHVENLHLKSLKQVEQFIS 113 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE----NLSTKE 304 ++ G + ++++ D+ + N + +IV++TDG+ ST Sbjct: 114 NINANGGT-----IFHITFDKLRDIC-----QKFTNQNEELVIVYLTDGQVQSGQDSTNL 163 Query: 305 DQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDAF 358 ++ KK V+A+G+ L S +++ ++ + AF Sbjct: 164 KDSFIFLQQVLKKFVNNVEVHALGMGTSHDPVILDKIISLQTTQSTYQFIKESSEIEGAF 223 Query: 359 SHIGKDI 365 +I I Sbjct: 224 KNIVDII 230 >gi|332201476|gb|EGJ15546.1| collagen adhesion family protein [Streptococcus pneumoniae GA47368] Length = 146 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 11/103 (10%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSME----SFFDSS- 191 K T I + + + ITS + +QT+ LD+++V D S SME + S Sbjct: 22 KTITPISGQEDQYELSLDITSKLGTETQTETQSEPLDVVLVADFSGSMEERDVWSYSSRR 81 Query: 192 -ITKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE 231 I++I+ ++ + LI + N+ + +V F+ KI+ Sbjct: 82 YISRIEALKHTLKGVNGRQGLIDTILSNSQNRLSIVGFAGKID 124 >gi|227112734|ref|ZP_03826390.1| putative hemagglutinin/hemolysin-related protein [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 1159 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 71/202 (35%), Gaps = 13/202 (6%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + ++TS +I GYS + +Y + + + + + I + Sbjct: 574 VAQATSTEIANHDTATGYSAGGVEQYGMSIGTIGD-DTMSGTHSHDVMIADVSGLQIIEG 632 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV--KLIPDVNNVVQSGLVTFS 227 ++ ++D S SM S T ID A S++ + + + + V L F Sbjct: 633 QNYNIAFMVDSSGSMSS------TDIDNARTSLSNVFKSLINSAGGANSGTVNVFLADFD 686 Query: 228 NKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++ + + + L + L+ +++ G Y +F + N Sbjct: 687 TQVGKTVSVNLNDTNALNKLTAVLNSMVGGSSAGGGTN--YEDVFKTTANWFQSDVVKKN 744 Query: 287 YKKIIV-FMTDGENLSTKEDQQ 307 + F+TDGE + ++ Sbjct: 745 VGNNLTYFITDGEPTYYQTNEA 766 >gi|1813523|gb|AAB63302.1| PbTRAP [Plasmodium berghei] Length = 606 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 72/181 (39%), Gaps = 31/181 (17%) Query: 171 RLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++D+ I+LD S S+ S + S + + + +++ + + D N+ + T++ + Sbjct: 41 QIDLHILLDGSGSIGHSNWISHVIPM------LTTLVDNLNISRDEINISMTLFSTYARE 94 Query: 230 IEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + L +G + + + S G +TN T L N I Sbjct: 95 LVR--LKRYGSTSKASLRFIIAQLQNNYSPHG-TTNLTSALLNVDNLI--------QKKM 143 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRAC 339 N ++++ +TDG + K +S N+ KK+ V IG+ + + L C Sbjct: 144 NRPNAIQLVIILTDGIPNNLK---KSTTVVNQLKKKDVNVAIIGVGAGVNNMFNRILVGC 200 Query: 340 A 340 Sbjct: 201 G 201 >gi|68070931|ref|XP_677379.1| sporozoite surface protein 2 [Plasmodium berghei strain ANKA] gi|56497475|emb|CAH99602.1| sporozoite surface protein 2, putative [Plasmodium berghei] Length = 421 Score = 41.7 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 72/181 (39%), Gaps = 31/181 (17%) Query: 171 RLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++D+ I+LD S S+ S + S + + + +++ + + D N+ + T++ + Sbjct: 41 QIDLHILLDGSGSIGHSNWISHVIPM------LTTLVDNLNISRDEINISMTLFSTYARE 94 Query: 230 IEEFFLLEWGVSH-------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + L +G + + + S G +TN T L N I Sbjct: 95 LVR--LKRYGSTSKASLRFIIAQLQNNYSPHG-TTNLTSALLNVDNLI--------QKKM 143 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRAC 339 N ++++ +TDG + K +S N+ KK+ V IG+ + + L C Sbjct: 144 NRPNAIQLVIILTDGIPNNLK---KSTTVVNQLKKKDVNVAIIGVGAGVNNMFNRILVGC 200 Query: 340 A 340 Sbjct: 201 G 201 >gi|268324283|emb|CBH37871.1| hypothetical secreted protein, CARDB family [uncultured archaeon] Length = 1149 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 71/217 (32%), Gaps = 38/217 (17%) Query: 154 IVMPITSSVKVNSQ--TDARLDMMIVLDVSRSMESFFD--SSITK-----IDMAIKSINA 204 + +P+ ++++ T D++ V D++ SM I+K + A Sbjct: 66 VTIPVCKELQMHLDCVTTKNADIVFVFDITGSMGEEISEMKDISKNFADGLAAAGIDYRF 125 Query: 205 MLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 L E + P + G F+ K+ +L S + I L+ G L Sbjct: 126 GLTEFRDFPVTCDGTVCGDANDFAYKVYNGGVLSNSSSMFKSWIDSLNPSGGGDLPESTL 185 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK------ 317 + + D + + KIIV ++D D+ + CN+ K Sbjct: 186 AALMHTVKDQKWR-------GGDASKIIVLISD---AYPHSDE---HCCNQEKNTFDGVI 232 Query: 318 -----RGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLV 348 G VY +G ++ F+ + Sbjct: 233 SALTRSGMTVYVVG---PDEASMEMIANSTGGKFFHI 266 >gi|261883870|ref|ZP_06007909.1| hypothetical protein CfetvA_01086 [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 216 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 40/228 (17%), Positives = 80/228 (35%), Gaps = 25/228 (10%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 + CT P + + Q L++M++LD S SM + + T+++ A Sbjct: 3 FQLCTAHPLES-----AIGSPDMEDSRFQFKESLNIMVILDASGSM-ANIEGDQTRMEAA 56 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---IEEFFLLEWGVSHLQRKIKYLSKFGV 255 +IN+ +E + P NV GL + ++ + S L I Sbjct: 57 KSAINSFMESL---PQEANV---GLRIYGHEGSGSKTDKARSCSSSELVYSIDKYDPAKF 110 Query: 256 STNSTPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKEDQQSLYYCN 313 + + I F ++ ++ + IV++ +DG + D + Sbjct: 111 NQALVKAKPAGWTPIGFALKEAQKDLAAFQGDANTNIVYLVSDG---ISTCDDDPVGSAK 167 Query: 314 EAKKRGA--IVYAIGIRV-IRSHEFLRACA--SPNSFYLVENPHSMYD 356 I+ IG V + L+ A + ++ V+ +YD Sbjct: 168 ALYDSDITPIINVIGFNVDQEGQKQLQEVAKVTEGTYQNVQMAEGLYD 215 >gi|156400040|ref|XP_001638808.1| predicted protein [Nematostella vectensis] gi|156225932|gb|EDO46745.1| predicted protein [Nematostella vectensis] Length = 798 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 60/175 (34%), Gaps = 31/175 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM S ++I A +++ L+ + PD G + S K Sbjct: 289 FIFVVDRSGSM------SGSRIKDAARTLQLFLKSL---PDGCYFNIVGFGS-SYKTLFS 338 Query: 234 FLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L+ +L+ T L++ Y+Q RQ Sbjct: 339 KSKTYNDETLKTATNHAAHLAADLGGTEILEPLRWVYSQSLIEGAPRQ------------ 386 Query: 291 IVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + +TDGE + + + A+ V++ GI S E ++ A Sbjct: 387 LFLLTDGEVGNTAQVISLVAENASTAR-----VFSFGIGDGASTELIKGVARAGH 436 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 44/142 (30%), Gaps = 5/142 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + ++TA+ + + + + I++ ++ + L + D + + A + Sbjct: 7 QKQRGSVAVITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLYQRAQCG 66 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL--DIVVVPQNEGYSIS 130 N + K SF N + + ++ S +I P Sbjct: 67 NTTATEPDWTTATQK---ASSFATASTSNTVQGSVIKVAQTGSGYWNITGSPGTLETVPF 123 Query: 131 AISRYKIPLKFCTFIPWYTNSR 152 +P T N+ Sbjct: 124 TPGTNDLPAIQVTITKSAANAN 145 >gi|91776389|ref|YP_546145.1| von Willebrand factor, type A [Methylobacillus flagellatus KT] gi|91710376|gb|ABE50304.1| MxaC, protein involved in Ca2+ insertion into methanol dehydrogenase [Methylobacillus flagellatus KT] Length = 333 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 40/198 (20%), Positives = 74/198 (37%), Gaps = 33/198 (16%) Query: 174 MMIVLDVSRSM-ESFFDS----SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +MIVLD S SM +SF DS S +K+ A + + + + + + G+VTFS Sbjct: 91 VMIVLDRSASMNDSFADSAKHDSESKMAAARRVLQSFVRQSRED-------LLGMVTFST 143 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQHCNTEDA 285 L + ++ G+ + G+ Y + + G R Sbjct: 144 SPILAAPLGGDREAVLAALRATEAGGMGFTAVARGLGMALDYFEGRPVTGARA------- 196 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I+ ++DG D ++ E ++GA +Y + +R A + Sbjct: 197 -----ILLVSDG---GAHLDVKTQDLLREMFHRQGASLYWVYLRSANGVSI--KNAPEDE 246 Query: 345 FYLVENPHSMYDAFSHIG 362 H ++D F+ +G Sbjct: 247 DLDAYPEHQLHDYFNSLG 264 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 79/219 (36%), Gaps = 46/219 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P+ + + S + +VLDVS SM S +I ++ + ++ Sbjct: 284 NLPVPTFTLLQSSDRV---VTLVLDVSGSMASDG-----RIGRLYQAAEVFVMQI----- 330 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR-KIKYLSKFG--VSTNSTPGLKYAYNQIF 271 V G+V+FS L + QR +K+L TN G++ Sbjct: 331 VEEGSHVGIVSFSTSTTVLSKLVQVIDDTQRNHLKFLLPKTAVGGTNICAGIR------- 383 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVI 330 +G++ + + ++Y IV +TDGE+ C + G IV+ I + Sbjct: 384 --EGIKVNNQHDGSSYGTEIVLLTDGEDNYNTS------LCFPDISNSGIIVHFIALGPN 435 Query: 331 RSHEF---------LRACASPNSFYLVENPHSMYDAFSH 360 + LR A+ + + DAFS Sbjct: 436 PNPNLETIVDMTGGLRFLATDK-----VDAQGLIDAFSS 469 >gi|290997816|ref|XP_002681477.1| predicted protein [Naegleria gruberi] gi|284095101|gb|EFC48733.1| predicted protein [Naegleria gruberi] Length = 452 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 27/127 (21%), Positives = 55/127 (43%), Gaps = 14/127 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I+LDVS SM ++ +++ AI + +++ + D +V F+ K + Sbjct: 79 SIIIILDVSSSMGAYH-----RLENAIYATRSVINYL-TEKDYVGIVLFNAGAFTCKKQT 132 Query: 233 FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 FLL+ + + I + STN +N +FD + ++ Sbjct: 133 EFLLKATAQNKKTLIDCIENMMPFGSTNFEAAFNETFN-LFDRSE-----EIASSTCDRV 186 Query: 291 IVFMTDG 297 ++F+TDG Sbjct: 187 VLFLTDG 193 >gi|257866903|ref|ZP_05646556.1| predicted protein [Enterococcus casseliflavus EC30] gi|257872580|ref|ZP_05652233.1| predicted protein [Enterococcus casseliflavus EC10] gi|257800861|gb|EEV29889.1| predicted protein [Enterococcus casseliflavus EC30] gi|257806744|gb|EEV35566.1| predicted protein [Enterococcus casseliflavus EC10] Length = 779 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 16/105 (15%), Positives = 45/105 (42%), Gaps = 11/105 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 V+ + +QT +D+++V+D S SM + ++ + + + D Sbjct: 95 VLFDVTLAVKGNQTSTPVDLVMVIDYSSSMTG---------EKLTNALLGLQQFGNELAD 145 Query: 215 --VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 N ++ G+V ++ +I +++L++ ++ ++ T Sbjct: 146 SLANGSIRIGIVAYNREIYTTNGFSTDIAYLEQFLRETAETHSGT 190 >gi|220933244|ref|YP_002512143.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994554|gb|ACL71156.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 589 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 62/174 (35%), Gaps = 19/174 (10%) Query: 158 ITSSVKVNSQTD-ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 + + V+V + + +M+VLDVS +M++ D + ++ A+ ++ MLE ++ Sbjct: 81 VPAPVQVAGEGHRGDVSLMLVLDVSATMQAQ-DLAPRRLTRALLEVDGMLEGLRGE---- 135 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + GLV F+ + L + + + + Sbjct: 136 ---RVGLVAFAGRALMLAPPTHDRRLLSHYLSRGPEALADPAGLSASRAVAEGLRLAGEA 192 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIR 328 + +V +TDG+ + + + + G +Y +G+ Sbjct: 193 LEGSGA--------VVLITDGDARAFAGARLAAMQTQARALRDAGHTLYVLGVG 238 >gi|160940244|ref|ZP_02087589.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC BAA-613] gi|158436824|gb|EDP14591.1| hypothetical protein CLOBOL_05133 [Clostridium bolteae ATCC BAA-613] Length = 683 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 28/146 (19%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +V ++ + + VLDVS SM + +D A + I ++ ++ N Sbjct: 301 RVRAEEIPPREYIFVLDVSGSMFGYP------LDTAKELIGNLVGNLRDSDQFN------ 348 Query: 223 LVTFSNKIEEFFLLE--WGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L+ FS+ ++++ I + + G T P L+ A + Sbjct: 349 LILFSDTAVSMAPKSVPATAENIRQAIDLIERQDGGGGTELAPALEQAVS---------- 398 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKE 304 D + IV +TDG Sbjct: 399 --LPRDPRMARSIVTITDGYMSDESS 422 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 63/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKQETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A +G+ Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLATRSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + + V +G+ + Sbjct: 191 -----------RAIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPEG 230 >gi|326205015|ref|ZP_08194865.1| hypothetical protein Cpap_0022 [Clostridium papyrosolvens DSM 2782] gi|325984820|gb|EGD45666.1| hypothetical protein Cpap_0022 [Clostridium papyrosolvens DSM 2782] Length = 198 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 55/168 (32%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 N KG T+ ++ ++FL++ + + I + + S+ D + A +NE Sbjct: 1 MKNTKGSFTVEASLVFSVVFLMVSAFVHLFVIMYQYVNVQSIADEAATKGAYYYVNETAE 60 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 N + D + + + +I+ Sbjct: 61 NYSSYKINELYWRIYDTDKGKKINAISNYVNKLLDKSVFPAENKVYVSTKNKILMKNINI 120 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + + PL + + S + + S+ ++ T+ +M IV D Sbjct: 121 EIQEEYPLTVGNMLDIFGLSPVLSLRAVSTSPMDDNTEFIRNMDIVKD 168 >gi|152994336|ref|YP_001339171.1| hypothetical protein Mmwyl1_0295 [Marinomonas sp. MWYL1] gi|150835260|gb|ABR69236.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 528 Score = 41.7 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 71/226 (31%), Gaps = 25/226 (11%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 I F+ + G + + + + + ++ + + L D + + Sbjct: 23 ITAFYQDEAGAVLPFAVLIIVGGVIAVSFAVDTTRMVNSSAQLKRATDVAALA------- 75 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 GN + D + K N D+G +N I+ ++ V + Y Sbjct: 76 --IGNIQLTNGNDDDVDLQKIASGYVLSNLGMDSGLINQIE--AEQVTVTKGEVDGSPTY 131 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +S + + + ++ V ++ + ++L S + + Sbjct: 132 KVSVSLIAQ------SDLLKAGGQEQVIFSTVEVVSRPTE------VALILPNSGTEDRG 179 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ K ++ + +L + V LV FS + + Sbjct: 180 ELAALRK--VSKEFARNLLGDESAGVSPTQKVWLSLVPFSQSVNVY 223 >gi|311277028|ref|XP_003135467.1| PREDICTED: protein DDX26B-like isoform 2 [Sus scrofa] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|311277026|ref|XP_003135466.1| PREDICTED: protein DDX26B-like isoform 1 [Sus scrofa] Length = 898 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|301791219|ref|XP_002930578.1| PREDICTED: protein DDX26B-like isoform 1 [Ailuropoda melanoleuca] Length = 898 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|297711112|ref|XP_002832196.1| PREDICTED: protein DDX26B-like [Pongo abelii] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|293351261|ref|XP_002727737.1| PREDICTED: RIKEN cDNA 6330505F04-like [Rattus norvegicus] Length = 856 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|288556553|ref|YP_003428488.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] gi|288547713|gb|ADC51596.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] Length = 459 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 74/216 (34%), Gaps = 13/216 (6%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + + L++ I+LD S SM D + K+ +A ++I +EE+ Sbjct: 141 EQLSFLAPNEEGELGPEEKELNVEILLDASGSMRDEVDG-VDKMTLAREAIQGFVEELPD 199 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 +V V + + E G+ + ++ + + + + Sbjct: 200 QANVALRV------YGHVGESPEKSCEGIDRVY-DLQPYDESSFQDAIDGVMANGWTPLA 252 Query: 272 DMQGMRQHCNTE-DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + +++ ++DG + + Q++ +E ++ IG V Sbjct: 253 KAIEVTSDDYRNASKDATNMVLVVSDGMDTCGGDPVQAVKDLSEL-DVTPLISIIGFDVP 311 Query: 331 -RSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGK 363 + LR AS ++F V + + + Sbjct: 312 ANEQQQLREMAQASGSAFATVNDQAQLSAELDRSAE 347 >gi|301791221|ref|XP_002930579.1| PREDICTED: protein DDX26B-like isoform 2 [Ailuropoda melanoleuca] gi|281340117|gb|EFB15701.1| hypothetical protein PANDA_021070 [Ailuropoda melanoleuca] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|297587450|ref|ZP_06946095.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] gi|297575431|gb|EFH94150.1| conserved hypothetical protein [Finegoldia magna ATCC 53516] Length = 249 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 72/192 (37%), Gaps = 18/192 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV--VQSGLVTFSNKIE 231 + V+D S SM TKI +I +L E+ I + N V+ +++F+++I+ Sbjct: 17 LFFVIDTSGSM------KGTKIGEVNSAIEEILPELSDISNSNPDAEVKMAILSFNSEIQ 70 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + N T + A+ ++ + + ++Y +I Sbjct: 71 WITPK---TGPVDPGVYLWRDL--NANGTTRMGAAFEELESKLHGDKFMKSATSSYAPVI 125 Query: 292 VFMTDGENLSTKEDQQS---LYYCNEAKKRGAIVYAIGIRVIRSHEFLRA-CASPNSFYL 347 M+DG T+E QS N+ K G V A+GI + L A + + Sbjct: 126 FLMSDGMPTETEEQFQSGLNKLKANKWFKSGIKV-ALGIGQDADLDVLEAFTGTKEAVLQ 184 Query: 348 VENPHSMYDAFS 359 N + Sbjct: 185 TNNVKKLKAMIQ 196 >gi|223461292|gb|AAI40716.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|253701050|ref|YP_003022239.1| von Willebrand factor A [Geobacter sp. M21] gi|251775900|gb|ACT18481.1| von Willebrand factor type A [Geobacter sp. M21] Length = 315 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/126 (23%), Positives = 52/126 (41%), Gaps = 16/126 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LD SRSM + D +++D A +++ +K + GLV F+ Sbjct: 81 DILFLLDTSRSMLT-RDLGQSRLDAAKEAVRKATAGLKGE-------RVGLVAFAGSAFL 132 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + + L + G T PG A ++ R+ E K++V Sbjct: 133 VCPL---TTDYALFDQVLKEAGEQTLPLPGTSLA----AALKEARRALQGEGGEP-KVVV 184 Query: 293 FMTDGE 298 ++DGE Sbjct: 185 LLSDGE 190 >gi|119632131|gb|EAX11726.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_b [Homo sapiens] Length = 898 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|119632130|gb|EAX11725.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_a [Homo sapiens] Length = 450 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|114690272|ref|XP_001136594.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B isoform 1 [Pan troglodytes] Length = 894 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|114690274|ref|XP_001136749.1| PREDICTED: DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B isoform 2 [Pan troglodytes] Length = 897 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|114690278|ref|XP_521277.2| PREDICTED: protein DDX26B isoform 5 [Pan troglodytes] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|114690276|ref|XP_001137050.1| PREDICTED: protein DDX26B isoform 4 [Pan troglodytes] Length = 898 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|109132369|ref|XP_001100789.1| PREDICTED: protein DDX26B-like isoform 4 [Macaca mulatta] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|332861720|ref|XP_003317760.1| PREDICTED: protein DDX26B [Pan troglodytes] gi|21756064|dbj|BAC04813.1| unnamed protein product [Homo sapiens] gi|119632132|gb|EAX11727.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B, isoform CRA_c [Homo sapiens] Length = 365 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|94536743|ref|NP_872346.3| protein DDX26B [Homo sapiens] gi|74742010|sp|Q5JSJ4|DX26B_HUMAN RecName: Full=Protein DDX26B gi|57209096|emb|CAI40867.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] gi|57209211|emb|CAI40723.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Homo sapiens] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|298484180|ref|ZP_07002346.1| BatB protein [Bacteroides sp. D22] gi|298269684|gb|EFI11279.1| BatB protein [Bacteroides sp. D22] Length = 342 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKLETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ ++ T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + I+ +TDGEN + + + V +G+ + Sbjct: 180 ARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPDG 230 >gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas haloplanktis TAC125] gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 664 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 62/171 (36%), Gaps = 34/171 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--EE 232 + V+D S SM ++ A ++ L + N ++ F N + Sbjct: 325 VFVVDTSGSMHG------QSMEQAKNALFYALSLLDSNDSFN------IIGFDNVVTLMS 372 Query: 233 FFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + +R I L G + ++ A + + D + Sbjct: 373 DKPLVASGFNLRRAERFIYGLQADGGTE-----IQGALDAVLDGSQFDGFV--------R 419 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++F+TDG ++ +L+ +AK + ++ +GI + F+R A Sbjct: 420 QVIFLTDGS----VSNEDALFKSIQAKLGDSRLFTVGIGSAPNSFFMRRAA 466 >gi|332358821|gb|EGJ36643.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1056] Length = 434 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 32/145 (22%), Positives = 59/145 (40%), Gaps = 23/145 (15%) Query: 174 MMIVLDVSRSMESFF-------DSSITKIDMAIKSINAMLEEVKLIPDVN-NVVQ-SGLV 224 + V D S SM S T++++ N +++++K I +V+ N+V+ SG Sbjct: 199 VTFVFDTSGSMGYGLWNQKLEPTDSRTRMNILKTKANLLVDDLKEIGNVSVNLVRFSGDA 258 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ E+F L+ ++ KIK L + TN GL+Y + Sbjct: 259 SY--IQEDFVELDKDTDTIKTKIKALPTSWI-TNPGDGLRYGLVSLQRNPAQL------- 308 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSL 309 K +V +TDG + + Sbjct: 309 ----KYVVLLTDGIPNAYTASPDGI 329 >gi|328675376|gb|AEB28051.1| hypothetical protein FN3523_0194 [Francisella cf. novicida 3523] Length = 332 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 67/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G +Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGAQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|291484531|dbj|BAI85606.1| hypothetical protein BSNT_03154 [Bacillus subtilis subsp. natto BEST195] Length = 638 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 37/219 (16%), Positives = 69/219 (31%), Gaps = 28/219 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVM-----PITSSVKVNSQTDARLDMMIVLDVSRS 183 I +K L N++ + P K T+ +++D S S Sbjct: 395 IQKTLEHKKTLPRTDLHAGRLNNKLLRYFTERNPRLFYKKQEPSTEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D + I E +K + + +V G +N E + + + Sbjct: 455 M-------FDKMDETKRGIVLFHEALKSVAVPHQIV--GFWEDTNDATETSQPNYF-NTV 504 Query: 244 QRKIKYLSKFGVSTNSTPGLKY--AYNQIFDMQGMRQHCNT--EDANYKKIIVFMTDGEN 299 L + +S P + D +RQ + +K ++ +DGE Sbjct: 505 IPFQSSLRQ-----DSGPAIMQLEPEEDNRDGYAIRQMTKKMLHRSEAQKFLIVFSDGEP 559 Query: 300 LST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + EA+KRG V + + E Sbjct: 560 AAFGYEQNGIVDTSEAVIEARKRGIEVINVFLSNSEIEE 598 >gi|168216419|ref|ZP_02642044.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381279|gb|EDT78758.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 620 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 76/180 (42%), Gaps = 38/180 (21%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM +D Sbjct: 57 LKEVSEKPDEDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSM-------WNDMDAL 106 Query: 199 IKSINAMLEEV-KLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVST 257 +S+N+++++V IP+ + ++ F ++EE F +KF Sbjct: 107 KESMNSLVDKVIDNIPNS----RIAVIAFGTEVEEVFSF-----------NNKNKFTSKE 151 Query: 258 NSTPGLKYAYNQI-----FDMQGMRQHCNT-------EDANYKKIIVFMTDGENLSTKED 305 +K +YN I +++G + + ++N KK ++F +DG ++++ Sbjct: 152 EYKKAIKKSYNNIERRGNTNIEGTWRRADEIFKNELNNNSNSKKDVIFFSDGYPNVSRDE 211 >gi|160882769|ref|ZP_02063772.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|237720675|ref|ZP_04551156.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|260170238|ref|ZP_05756650.1| hypothetical protein BacD2_00060 [Bacteroides sp. D2] gi|293373991|ref|ZP_06620332.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145609|ref|ZP_07038677.1| BatB protein [Bacteroides sp. 3_1_23] gi|315918601|ref|ZP_07914841.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111793|gb|EDO13538.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|229449510|gb|EEO55301.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292631067|gb|EFF49704.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516100|gb|EFI39981.1| BatB protein [Bacteroides sp. 3_1_23] gi|313692476|gb|EFS29311.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 342 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKLETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + I+ +TDGEN + + + V +G+ + Sbjct: 180 ARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPDG 230 >gi|297559736|ref|YP_003678710.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296844184|gb|ADH66204.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 547 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 60/200 (30%), Gaps = 26/200 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSI-----NAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + LDVS SME + ++ A+ ++ ++ + + V T+ Sbjct: 357 VYALDVSGSME---GGRLAELQSALGALTGADGGSLARSTQAFQEREVVTLLPFSTWPAD 413 Query: 230 ----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + E ++ + L ++ L G T + L AY + G D Sbjct: 414 PRTFVVEPGSVDEVNADLSAAVEGLEAEG-DTAAYDALVRAYELLESDTGS-------DG 465 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVYAIGIRVIRSHEFLRACA-S 341 + +V MTDGE + A+ V+ + E + Sbjct: 466 DPLMSVVLMTDGEVNRGVGLEGFRESL-AARSEPVARVPVFTVLFGESDVPEMTELAELT 524 Query: 342 PNSFYLVENPHSMYDAFSHI 361 + + F I Sbjct: 525 GGRVFDARE-QDLEQVFREI 543 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 54/177 (30%), Gaps = 22/177 (12%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 + F + K + ++ A+ + I +LGM ++ + K L + Sbjct: 6 LSRFVRDRKANIAVIAALVMIPIIFLLGMTLDFTQALRKKQQLDAAA-----------DA 54 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 R + + + N + + + T + I V Sbjct: 55 AAIAAVRPAMLMQTDAVAQNTAYAIFMST---ANRLASGLTSVPTPT-ITITDVGLQRTV 110 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 +S + P + ++ I+ + + + ++ +++D S SM Sbjct: 111 KVSYNAAS------LNNFPQLLMN-NVSWAISGASTAQASSAPNMNFYLLMDDSPSM 160 >gi|298207019|ref|YP_003715198.1| hypothetical protein CA2559_02160 [Croceibacter atlanticus HTCC2559] gi|83849653|gb|EAP87521.1| hypothetical protein CA2559_02160 [Croceibacter atlanticus HTCC2559] Length = 288 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 38/139 (27%), Positives = 58/139 (41%), Gaps = 30/139 (21%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L M++++D+S S FF ++ K I + N + GL+ F Sbjct: 72 EEERELTMVLMVDISGS--EFFGTN----QAFKKDIIIEISATLAFSATQNNDKIGLLLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE F + G H+ R I+ L +F T+ T LKY N + Sbjct: 126 SDEIELFIPPKKGRLHVLRIIRELIEFKPQSKKTDITKALKYLSNMLK------------ 173 Query: 284 DANYKKIIV-----FMTDG 297 KK IV FMTDG Sbjct: 174 ----KKAIVFVLSDFMTDG 188 >gi|255583954|ref|XP_002532724.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] gi|223527532|gb|EEF29655.1| inter-alpha-trypsin inhibitor heavy chain, putative [Ricinus communis] Length = 752 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 65/195 (33%), Gaps = 41/195 (21%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 S+ R D++ ++D+S SM ++ A ++ + L ++ N ++ Sbjct: 317 QSRKAFRKDVIFIIDISGSM------KGGPLENAKNALMSSLSKLNSEDSFN------II 364 Query: 225 TFSNKIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 F+++ F L + +S + G TN LK A + Sbjct: 365 AFNDETYLFSSLMEPATKEALSKASLWLNDNLTAGGGTNIMVPLKQAMKLL--------- 415 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR-------GAIVYAIGIRVIRS 332 + + +I +TDG Q CN K + + GI + Sbjct: 416 --AQTTDSIPLIFLITDG------AVQDEREICNFVKGSLTSGGPISPRICSFGIGAYCN 467 Query: 333 HEFLRACASPNSFYL 347 H FL+ A Y Sbjct: 468 HYFLQMLAQIGRGYF 482 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 73/199 (36%), Gaps = 16/199 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 148 KEKSLNVEILLDASGSMAGKVNGQV-KMEAAKKAIYNYL---DKIPDNANVM---LRVYG 200 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ K + + + + E Sbjct: 201 HKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNDDFKEY 260 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 ++ ++DGE + + N++ A+V IG V S + L+ A Sbjct: 261 TGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQLKNTAE 319 Query: 342 P--NSFYLVENPHSMYDAF 358 ++ V N +Y Sbjct: 320 AGKGNYATVSNADELYQTL 338 >gi|194228324|ref|XP_001490675.2| PREDICTED: similar to Protein DDX26B [Equus caballus] Length = 861 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPYS- 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|159122051|gb|EDP47174.1| von Willebrand domain protein [Aspergillus fumigatus A1163] Length = 1062 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 29/192 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ + D S SM S KI I +N L + N + +F +++ Sbjct: 341 IIFMADRSGSMSS-------KIPSLINVMNIFLRSLPEACSFN------IASFGSQVTWL 387 Query: 233 -FFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++L + ++ T L + D R T Sbjct: 388 WPSSMRYSQTNLNVAAKHVESFQANYGGTEIYDALH----SVLDHYNERNDVPTN----- 438 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++ +TDGE Q ++ +++GI SH + Y Sbjct: 439 --VILLTDGEVWDVDNVIQLVHRTASNGNSNIRFFSLGIGDQVSHRLVEGIGQQGGGYAE 496 Query: 349 ENPHSMYDAFSH 360 P S ++ Sbjct: 497 VVPESSMGSWQE 508 >gi|153838884|ref|ZP_01991551.1| putative RTX toxin [Vibrio parahaemolyticus AQ3810] gi|149747675|gb|EDM58589.1| putative RTX toxin [Vibrio parahaemolyticus AQ3810] Length = 3823 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 80/270 (29%), Gaps = 33/270 (12%) Query: 53 MIDRSLV----HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID 108 + D ++ T +N L N + + N Sbjct: 3211 INDSVILTGIPSGLTLEVNGKTVTENSDGSYTVDLSSEPNQTSHQLSGTITVPADYNGSL 3270 Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC------TFIPWYTNSRHIVMPITSSV 162 D + + V V N S +F I ++ IV + Sbjct: 3271 DFDVTATAGSVEVNNNTQMGADTASVSVRDYEFVSGTHGDNNIVGSDDNDVIVGDVQGLQ 3330 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 V Q ++ +LD S SM ++T++ +N ++E + V Sbjct: 3331 IVEGQD---YNIAFMLDTSGSMGYDVGRAVTELKTV---LNTLIESASG--PHSGKVNVL 3382 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKY-----LSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L TFS + ++ L+ + + +++ + + +TN G + A N Sbjct: 3383 LTTFSTESKQVLELDLSSDNAKSQVESILDAIVKRGDGNTNYEAGFQSALNWF------- 3435 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 ++ N I +DG + Sbjct: 3436 DSADSGATNLSYFI---SDGRPNQATVNDA 3462 >gi|85859136|ref|YP_461338.1| hypothetical protein SYN_01495 [Syntrophus aciditrophicus SB] gi|85722227|gb|ABC77170.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 364 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 10/41 (24%), Positives = 21/41 (51%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS 52 KG + ++ A+ + + + M ++V HI +K LH+ Sbjct: 12 LRERKGAVAVIVALAMTVFLGIAAMAVDVGHIMVVKNELHN 52 >gi|47229648|emb|CAG06844.1| unnamed protein product [Tetraodon nigroviridis] Length = 1117 Score = 41.7 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 49/127 (38%), Gaps = 21/127 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++++LD S S+ + S + + +++ P ++ GLV + Sbjct: 433 DILLLLDSSGSVSNNEFSRFLRFSAGV---------LRIFPLGRGRLRVGLVQVGTNPKL 483 Query: 233 FFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F LE LQ ++ + + TN+ L A + D KI Sbjct: 484 EFGLEVHSDQESLQEALRGVQQLQGDTNTKDALVLAQKLLQAA----------DRGAPKI 533 Query: 291 IVFMTDG 297 ++++TDG Sbjct: 534 LLWLTDG 540 >gi|294054128|ref|YP_003547786.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613461|gb|ADE53616.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 702 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 32/139 (23%), Positives = 57/139 (41%), Gaps = 23/139 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K+ S + D++ LDVSRSM + D +++++A ++I L DV + G Sbjct: 73 KMESVSRTGRDVVFALDVSRSMLAE-DVLPSRLEVAKQAIRDAL-------DVMGNERVG 124 Query: 223 LVTFSNKIEEFFLLEWG----VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 LV ++ L + L++ FG +T L+ A + D + Sbjct: 125 LVAYAGSASILCPLTYDYDFVRYSLEQAHPRTVDFGGTT-----LQSAVEKTVDQVFLDG 179 Query: 279 HCNTEDANYKKIIVFMTDG 297 +D +V +TDG Sbjct: 180 RAGVQD------LVVLTDG 192 >gi|237716506|ref|ZP_04546987.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408104|ref|ZP_06084652.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645098|ref|ZP_06722824.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809499|ref|ZP_06768202.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|229444153|gb|EEO49944.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354912|gb|EEZ04004.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639604|gb|EFF57896.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443317|gb|EFG12081.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 342 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKLETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ ++ T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + I+ +TDGEN + + + V +G+ + Sbjct: 180 ARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPDG 230 >gi|206890389|ref|YP_002248900.1| magnesium-chelatase subunit ChlD [Thermodesulfovibrio yellowstonii DSM 11347] gi|206742327|gb|ACI21384.1| magnesium-chelatase subunit ChlD [Thermodesulfovibrio yellowstonii DSM 11347] Length = 614 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 73/203 (35%), Gaps = 30/203 (14%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + + +I A R P + + ++V+ + +++ V+D Sbjct: 382 MQKRPDIAIDATLRAAAPFQKLRAMK-----DNVVIFDDDLRYKEKERRMSHNVIFVVDG 436 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM ++ ++ ++L I + ++ F + ++ +L Sbjct: 437 SGSM-----GVEQRMKATKGAVLSLL-----IDCYKKRDKVAMIVF--RKDKAEILLPLT 484 Query: 241 SHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 S ++ +K L G T + GL AY ++ + + N + +I+ +TDG+ Sbjct: 485 SSVELALKRLREIPTGGKTPLSAGLMEAY-KLMKITHFKYPEN------RLLILIITDGK 537 Query: 299 NLSTKEDQQSLY----YCNEAKK 317 + D+ L C K Sbjct: 538 PNVSLSDKPVLEELKSVCFMLKD 560 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 14/106 (13%), Positives = 38/106 (35%), Gaps = 1/106 (0%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI-MNE 68 F + G +T++ A+ + ++G I+ + + + + D ++ + + Sbjct: 29 TFHQDRGGTVTVIVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKLVV 88 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 N + L I IK++ ++ + +V T Sbjct: 89 SNTASVIGLTTQTIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQT 134 >gi|126306098|ref|XP_001362158.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1, partial [Monodelphis domestica] Length = 873 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 65/170 (38%), Gaps = 37/170 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D SRSM +++ +++ L ++ + +G+VTF + Sbjct: 307 LVLVIDTSRSM-----KVGNRLNRLRQALQFFLLQI-----IEKGSWTGIVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S +QRK I L G T+ GL+ A+ + Sbjct: 357 SELMQIESDVQRKTLISRLPTVTVAGGGTHICSGLRTAFMVVKKK--------------- 401 Query: 289 KIIVFMTDGENLS--TKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF 335 F+TDG ++ T + + C E K+ GAI++ I + Sbjct: 402 ----FLTDGSEMALLTDGEDNTTNTCFEEVKQSGAIIHTIVLGPSTEKGL 447 >gi|302343943|ref|YP_003808472.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] gi|301640556|gb|ADK85878.1| Magnesium chelatase [Desulfarculus baarsii DSM 2075] Length = 697 Score = 41.7 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 49/128 (38%), Gaps = 17/128 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM + ++ ++ ++L + + L+TF + + Sbjct: 515 LLFLVDASGSM-----GAQARMSATKGAVLSLL-----LDAYQKRDRVALITFRGREAQL 564 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L L K+ G T GL A+ Q+ +H D + + I++ Sbjct: 565 ALPPTSSIDLAAKLLAELPVGGRTPLAAGLLRAHEQLT------RHLRK-DPDGRPIVLV 617 Query: 294 MTDGENLS 301 +TDG + Sbjct: 618 LTDGRANA 625 >gi|253569758|ref|ZP_04847167.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840139|gb|EES68221.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 621 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 36/275 (13%), Positives = 93/275 (33%), Gaps = 28/275 (10%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST-----SLDIVVVP 122 + K + ++ + +F ++ + N I T ++ + Sbjct: 142 DAVNTEEYKSISENGFKQVGESPLSTFSIDVDAASYSNMRRMINSGTLPVADAIRTEELV 201 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNS--RHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y + + P+K + +V + ++ + +++ ++DV Sbjct: 202 NYFSYDYAKPTGSD-PVKITMEAGVCPWNADHRLVRIGLKAREIPTDKLPESNLVFLIDV 260 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV 240 S SM T++D+ S+ ++ ++ V VV +G + ++ Sbjct: 261 SGSMW-----GPTRLDLVKSSLKLLVNNLREKDKVAIVVYAG----NASVKLESTPGSDK 311 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-N 299 ++ I L+ G + G++ AY +H N + I+ +DG+ N Sbjct: 312 QKIRDAIDELTSGGST-AGGAGIQLAYKVA-------KHNFLPKGNNR--IILCSDGDFN 361 Query: 300 LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + + +K G + +G + + Sbjct: 362 VGVSSVEGLEQLIEKERKSGVFLSVLGYGMGNYKD 396 >gi|1575515|gb|AAC47461.1| thrombospondin-related anonymous protein [Plasmodium gallinaceum] Length = 614 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 66/171 (38%), Gaps = 25/171 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D+ +++D S S+ + + ++ + E V+ + + L F++ ++ Sbjct: 43 VDLYLLMDGSGSIGYY--------NWVTYAVPLVEEIVQNLNISKQGIHLYLSVFTHILK 94 Query: 232 EFFLL----EWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 E+ L I+ L STN T L F +G R+ Sbjct: 95 EYIPLNSIFSTNRDFALNVIRSLRTKYSQNGSTNLTLALSRVLKNYFLTKGSRE------ 148 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 + ++++ TDG + + ++ N+ KK A IG+ + + EF Sbjct: 149 -DAVQLVIIFTDGSPDNK---ESAMKEVNKLKKMKAKFAVIGVGMGINKEF 195 >gi|27379052|ref|NP_770581.1| hypothetical protein blr3941 [Bradyrhizobium japonicum USDA 110] gi|27352202|dbj|BAC49206.1| blr3941 [Bradyrhizobium japonicum USDA 110] Length = 472 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 59/175 (33%), Gaps = 24/175 (13%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + K + I+ A+ + +LGM ++ + + L++ D + + A M Sbjct: 11 RFARDRKANVAIIFALMMVPTIFLLGMALDYTLALRKREQLNAAADAAAIAAVRPAMLTQ 70 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + K + N +S V + ++ IV ++ Sbjct: 71 SDTTVVKATAEAVFAAKANLPGLS----------------AVPTPTVTIVDSGLARTITV 114 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 S + P + + +++ + +S ++ +++D S SM Sbjct: 115 S------YTAQSTNNFPGVLGKQTWQVAGSATARASS--APNMNFYLLMDDSPSM 161 >gi|70981624|ref|XP_746341.1| von Willebrand domain protein [Aspergillus fumigatus Af293] gi|66843963|gb|EAL84303.1| von Willebrand domain protein [Aspergillus fumigatus Af293] Length = 1062 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 59/192 (30%), Gaps = 29/192 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ + D S SM S KI I +N L + N + +F +++ Sbjct: 341 IIFMADRSGSMSS-------KIPSLINVMNIFLRSLPEACSFN------IASFGSQVTWL 387 Query: 233 -FFLLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + ++L + ++ T L + D R T Sbjct: 388 WPSSMRYSQTNLNVAAKHVESFQANYGGTEIYDALH----SVLDHYNERNDVPTN----- 438 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 ++ +TDGE Q ++ +++GI SH + Y Sbjct: 439 --VILLTDGEVWDVDNVIQLVHRTASNGNSNIRFFSLGIGDQVSHRLVEGIGQQGGGYAE 496 Query: 349 ENPHSMYDAFSH 360 P S ++ Sbjct: 497 VVPESSMGSWQE 508 >gi|46137831|ref|XP_390607.1| hypothetical protein FG10431.1 [Gibberella zeae PH-1] Length = 851 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 56/175 (32%), Gaps = 27/175 (15%) Query: 174 MMIVLDVSRSMESFFDSSIT---KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++ +LD S SM S KID+ +++ ++ + N ++++ ++ Sbjct: 383 ILFLLDQSGSMRGGCGSGFNGLRKIDVLREAMLLVISGLPKTCSFN------IISWGSET 436 Query: 231 EEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + ++ + I + T+ K + D Q Sbjct: 437 RAIWEQSRKHSPDNINEARDYISQIDSNLGGTDLLRAFKSTVQRRRDESNPTQ------- 489 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 IV +TDG+ + K + +A+GI H + A Sbjct: 490 -----IVVLTDGQLNADKPMEFVWKTRQVL-LNKIRFFALGIGRNVPHRLIEGIA 538 >gi|302754788|ref|XP_002960818.1| hypothetical protein SELMODRAFT_402202 [Selaginella moellendorffii] gi|300171757|gb|EFJ38357.1| hypothetical protein SELMODRAFT_402202 [Selaginella moellendorffii] Length = 545 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 64/180 (35%), Gaps = 9/180 (5%) Query: 173 DMMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + IVLD S SM SS +++ +A ++ +L+ + V +G F K Sbjct: 148 SLYIVLDTSSSMSISIGPLSSQSRLAVAKGILDELLDTLTNGDQVIVSDMNGGKPFGGKP 207 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S I L + L+ + F + + ++ + N + Sbjct: 208 VSVSLEGLETSFDHAGISALKNAISNAR-ADKLQTDIKKAF-VGALDFFNSSSNLN---V 262 Query: 291 IVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+ +TDG+ N D ++ K VY IG + F R S N + Sbjct: 263 ILLLTDGQFANHVNLTDLDPIFKQLNEKNVVVFVYRIGFYISNDETFQRMQFSLNISFEA 322 >gi|225181947|ref|ZP_03735381.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225167387|gb|EEG76204.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 272 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 71/194 (36%), Gaps = 31/194 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + L++ +++D S SM + +I + ++ ++ P Q G Sbjct: 84 IKDGGAPPLEVCLLVDTSGSM------NGKRIREVKTLADNLVRQMHE-PLSLITFQEGD 136 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V + L+ ++R + +S G+ T G++ A N + +G + Sbjct: 137 VGVKVRSTRNDLM------VRRGLAAMSAAGL-TPMGEGIRTAVNYLCGRRGKKH----- 184 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----KKRGAIVYAIGIRVIRSHEFLRAC 339 +++ +TDG D+ EA KK + IG+ + + Sbjct: 185 ------LVILITDGLPTWASGDKDPYLDAIEAGALIKKHKMHLICIGLE-PQRKFLEKLA 237 Query: 340 ASPN-SFYLVENPH 352 S + S Y+V++ Sbjct: 238 ESADASLYIVDDLD 251 >gi|291242943|ref|XP_002741339.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 958 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 68/202 (33%), Gaps = 38/202 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVS SM S ++I +++ + + +++ + G VTFS + Sbjct: 315 IVLVLDVSGSM-----SLKSRITKLRQAVYTFI-----MDEISLGIDVGCVTFS---DTA 361 Query: 234 FLLEWGV--------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ W + +T GL + Q + Sbjct: 362 TIISWLTPINSDEDREEFLALVMPTLNADGNTAIGSGLLTGLQVLSQNQTESVEGS---- 417 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS---P 342 I+ +TDG+ + + G +V + V + L AS Sbjct: 418 ----IMFLVTDGQENVPPYIDDVT---DNIIESGVVVDTLAWGVFAEEK-LETIASGTKG 469 Query: 343 NSFYLVENPHS--MYDAFSHIG 362 +S+Y E S +AF + Sbjct: 470 SSYYYSEQTQSNAHVEAFMEVA 491 >gi|194335402|ref|YP_002017196.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307879|gb|ACF42579.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 343 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 55/162 (33%), Gaps = 18/162 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ +LDVSRSM + D+ + +E+ I + ++ F+ Sbjct: 90 DIVFMLDVSRSMRAR--------DVLPDRLGQAKQEITSISRAVTGGRMSILLFAASPLV 141 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + S T+ + A + R +K Sbjct: 142 QCPLTTDRDAFDALLGMASPDLIEEQGTSFRAAFELAGRLLEPTLEDR---MASGVKGEK 198 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 I+V ++DGE+ + + KK ++ IG+ + + Sbjct: 199 IVVLLSDGEDH----TGEVRSAVQQLKKANVHLFVIGVGMRQ 236 >gi|319641093|ref|ZP_07995797.1| tellurium resistance protein [Bacteroides sp. 3_1_40A] gi|317387337|gb|EFV68212.1| tellurium resistance protein [Bacteroides sp. 3_1_40A] Length = 206 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 58/181 (32%), Gaps = 16/181 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M I++D S SM I+ ++ M+ ++L P ++T+ + + Sbjct: 1 MYILIDTSGSM------KGEPIESVKVGLSDMIASLRLDPYALETACISIITYDRDVRQI 54 Query: 234 FLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L ++ + T++ L + M D I Sbjct: 55 LPL---TELEDLQLPEIECPDSGPTHTGAALNMLCDCYDSEVNMGSKEQKGDWMPLLFI- 110 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG+ + +++ + + + A E L+ ++ + ++ Sbjct: 111 -LTDGKPSDLQVYDEAIQKVKQ--HQFTNIVACAAGPKAKTEPLKKLT--DNVFTLDTMD 165 Query: 353 S 353 S Sbjct: 166 S 166 >gi|149921114|ref|ZP_01909572.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] gi|149818001|gb|EDM77460.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] Length = 389 Score = 41.3 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 77/254 (30%), Gaps = 51/254 (20%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEEVKLI 212 MP + ++++VLD SRSM D + D+ +++ + V + Sbjct: 72 MPTCETTTATVNNIPP-NVVLVLDKSRSMVVNAWDDDGNPDTEDV--TRWHSLHDTVDTV 128 Query: 213 -PDVNNVVQSGLVTFSN-----------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 + + GL F + + E + G+ + + + + + Sbjct: 129 GHQYQDGMSLGLTLFPSVDAESSFDGACPVNEVPEVGVGLGNAEALLAAIPAADDTD--- 185 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQS----------L 309 L A + H + ++ +TDG N S + + L Sbjct: 186 --LHGATPAAAGIATALAHLEALEDGRPAAMILVTDGAANCSAGANDITKFSQYDEDLPL 243 Query: 310 YYCNEAKKRGAIVYAIGIRVIRS--------HEFLRACA-------SPNS--FYLVENPH 352 + + G Y +GI + S E L A S FY + Sbjct: 244 VVADAWDRAGIPTYVVGIDIQESSEHPFTNPREKLHEVAEAGGVARSDGEVGFYDAGDAQ 303 Query: 353 SMYDAFSHIGKDIV 366 ++ A I + Sbjct: 304 ALTAALDEIAASVS 317 >gi|149176863|ref|ZP_01855473.1| hypothetical protein PM8797T_13972 [Planctomyces maris DSM 8797] gi|148844300|gb|EDL58653.1| hypothetical protein PM8797T_13972 [Planctomyces maris DSM 8797] Length = 291 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+++D+S S +SF + TK ++ + + + N + GL F++ IE Sbjct: 77 LSVMLLIDLSGS-QSFGTNQQTKREVVTEVGATLAMSA-----IKNNDKVGLTLFTDHIE 130 Query: 232 EFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 + G H+ R I+ + G T+ L++ Sbjct: 131 KSVPARKGSRHVLRLIREMLYCEPMGTGTDIRQALEH 167 >gi|114652509|ref|XP_001170886.1| PREDICTED: coagulation factor C homolog, cochlin isoform 1 [Pan troglodytes] Length = 514 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 59/187 (31%), Gaps = 30/187 (16%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 T + + + +F +R+ N +V +TDG+ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQ-----SYDDVQGPA 486 Query: 313 NEAKKRG 319 A G Sbjct: 487 AAAHDAG 493 >gi|13786657|pdb|1FE8|A Chain A, Crystal Structure Of The Von Willebrand Factor A3 Domain In Complex With A Fab Fragment Of Igg Ru5 That Inhibits Collagen Binding gi|13786660|pdb|1FE8|B Chain B, Crystal Structure Of The Von Willebrand Factor A3 Domain In Complex With A Fab Fragment Of Igg Ru5 That Inhibits Collagen Binding gi|13786663|pdb|1FE8|C Chain C, Crystal Structure Of The Von Willebrand Factor A3 Domain In Complex With A Fab Fragment Of Igg Ru5 That Inhibits Collagen Binding Length = 196 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 58/177 (32%), Gaps = 26/177 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++++LD S S + + D A + + + P + Q ++ + + Sbjct: 10 QPLDVILLLDGSSSFPASY------FDEXKSFAKAFISKANIGP---RLTQVSVLQYGSI 60 Query: 230 IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W V +HL + + G + L +A + H A Sbjct: 61 TTIDVP--WNVVPEKAHLLSLVDVXQREGGPSQIGDALGFAVRYLTS----EXHGARPGA 114 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 + K +++ +T + A+ V+ IGI LR A P Sbjct: 115 S-KAVVILVT------DVSVDSVDAAADAARSNRVTVFPIGIGDRYDAAQLRILAGP 164 >gi|262381908|ref|ZP_06075046.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|262297085|gb|EEY85015.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] Length = 289 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 48/107 (44%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S +++ K + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGS------RDFGSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 S+KIE+F + G H+ I+ L F T+ + LKY N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFHPEETRTDISQVLKYLTNAI 172 >gi|238915987|ref|YP_002929504.1| hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] gi|238871347|gb|ACR71057.1| Hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] Length = 745 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 66/416 (15%), Positives = 126/416 (30%), Gaps = 85/416 (20%) Query: 26 FLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA---ATQIMNEGNGNNRKKLKGGDI 82 L II +V G++ +++ + S D ++ + +G K+ Sbjct: 18 MLTIILIVTGILPFMTNKALAAENVLS--DNIILVTENEVSDTAKDGYSKVTFKIGDHTK 75 Query: 83 LCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC 142 IKN + ++ +D G + I ++ V + YS + ++ Sbjct: 76 DVYIKNGNTVKLKSSNKDYGVI--AGGIENGEAITENTVIYAKSYSYQGTLSLEKSIQSE 133 Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---------- 192 T + + + S+ +++VLD S SM D + Sbjct: 134 TPDENGYYTLQFSAKSQNLPSIKSENQN---VILVLDRSFSMACSVDEDVDSDAMAPTYE 190 Query: 193 -TKIDMAIKSINAMLEEVKLIPDVNNV--------VQSGLVTFSNKIEEFFLLE--WGVS 241 T+ + I ++ L E N V ++ + S+K + L + + Sbjct: 191 KTRWSVTINAVEKFLNEFLPEGTSNKVSVISYCGSARTEITNESSKDKIMSKLNSIYNRN 250 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG---- 297 K SK T GL A N QG++Q ++ TDG Sbjct: 251 MYNEDYKNSSKRYNVTQIGRGLGSATNI---QQGLKQVSEIAGDTTGASVILFTDGGANR 307 Query: 298 ------------ENLS-------TKEDQQSLYYCNEA----KKRGAIVYAIGIRVIR--- 331 +N + +++ YY +A K GA +Y I + Sbjct: 308 YDNGEIGGYYYIKNNNETNRYNKPRDEVNGSYYAGKAGEELKAAGADIYTIVLMSKESDI 367 Query: 332 --------SHEFLRACAS-------------PNSFYLVENPHSMYDAFSHIGKDIV 366 ++ L S FY N + + F I ++ Sbjct: 368 TDLVKVSLGNKSLTYEKSWEKTYFTFSDGGYAKEFYTAANAEQLNNRFKQIMTEMT 423 >gi|307209671|gb|EFN86529.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Harpegnathos saltator] Length = 1200 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 79/205 (38%), Gaps = 33/205 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++D S SM + ++A IN +L+ + N+ V +TFS Sbjct: 219 ATSPKDILILMDTSGSMTGI------RREIARHVINNILDTL----GNNDFVNI--ITFS 266 Query: 228 NKIEEFFL------LEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQH 279 N +E ++ +++++ + ++ N + L A+ + Sbjct: 267 NVTKEVVPCFNDTLVQANLANVRELKRAIANLETERIANFSLALTTAFELL-----ESFR 321 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE----AKKR-GAIVYA--IGIRVIRS 332 E A + I+ +TDG + KE ++ + + K ++ IG V Sbjct: 322 AEKEGARCNQAIMLITDGVPYNYKEIFEAYNWKDNPDEPLKADMPVRIFTYLIGREVADV 381 Query: 333 HEF-LRACASPNSFYLVENPHSMYD 356 E ACA+ F + P + + Sbjct: 382 REVQWMACANRGYFVHLCTPAEVRE 406 >gi|295838849|ref|ZP_06825782.1| von Willebrand factor type A domain-containing protein [Streptomyces sp. SPB74] gi|197695399|gb|EDY42332.1| von Willebrand factor type A domain-containing protein [Streptomyces sp. SPB74] Length = 440 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 65/202 (32%), Gaps = 33/202 (16%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 P+ + + Y+ P + + + + +++++D S Sbjct: 7 PEGPRFEVEV---YQNPYLPEGSGEVHAVVTVTATGGGTGALSATGSGQDAAVVLMVDCS 63 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL------ 235 SM+ +K+ A ++ A L+ ++ + + ++ ++ E + Sbjct: 64 GSMQY----PPSKLHHAKEATGAALDTLR------DGTRFAVIEGTHVAREVYPRGGALA 113 Query: 236 --LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + ++ ++ L G T L+ A + +H + Sbjct: 114 VADDRTRAEAKQALRALRASG-GTAVGRWLQLAGRLLSQAPVPIRHG-----------IL 161 Query: 294 MTDGENLSTKEDQQSLYYCNEA 315 +TDG N ++ + A Sbjct: 162 LTDGRNEHETPEELRAALDDCA 183 >gi|146170189|ref|XP_001017437.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146145085|gb|EAR97192.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 865 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 61/174 (35%), Gaps = 19/174 (10%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + D S + I +N + I + F + + I+MPI S K Sbjct: 86 QIVQDKASSDEIQIQQDSENTA-NKKQIIKGMTLEDFKSSLVELNQDLKIIMPIIQSEKD 144 Query: 165 NS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 S + LD+ ++D++ SM + + ++ + L+ + + ++ Sbjct: 145 QSKKWNSSCSLDLCFIVDITGSMSPWMNQ-------VVQYVGNTLKTIYEDFPAID-LRV 196 Query: 222 GLVTFSNKIEE----FFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYN 268 + + ++ + V + ++ ++ G + T GL A N Sbjct: 197 AFSGYRDHCDQHKYVTYDFTRNVDSFRSQLAQVTCLGGGDCPEDVTGGLHNALN 250 >gi|330506881|ref|YP_004383309.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] gi|328927689|gb|AEB67491.1| magnesium-chelatase subunit ChlD [Methanosaeta concilii GP-6] Length = 686 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 40/261 (15%), Positives = 83/261 (31%), Gaps = 37/261 (14%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + +I A R P + P + + + ++ V+D S Sbjct: 447 KEGKDIAIDATIRAAAPYQKARSGPNAIKVKSEDIR-----EKERARKTSAMLLFVVDGS 501 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGV 240 SM ++ +++ A ++ ++L + + G+V F + E V Sbjct: 502 GSM-----GAMQRMESAKGAVLSLL-----MESYQKRDKIGMVAFRGTEAELILPPSSSV 551 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++K L G + S + + Q +DA K ++V ++DG Sbjct: 552 DLALSRLKELPTGGKTPLSAG--------LSRGLQLLQGEMRKDAETKLMMVLVSDGRAN 603 Query: 301 ---STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRAC-----ASPNSFYLVE 349 K + + K+ G I V+ S L C + +Y + Sbjct: 604 VGMGGKIKDELMEISERTKQLGVHTIVIDTEVVDSSFMEMRLGYCREIAEMTGGKYYPIS 663 Query: 350 NPHSMYDAFSHIGKDIVTKRI 370 S +A I + + Sbjct: 664 GLSS--EALYSIVDEEQKLLL 682 >gi|218548279|ref|YP_002382070.1| hypothetical protein EFER_0898 [Escherichia fergusonii ATCC 35469] gi|218355820|emb|CAQ88433.1| conserved hypothetical protein; putative exported protein [Escherichia fergusonii ATCC 35469] Length = 543 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 30/235 (12%), Positives = 78/235 (33%), Gaps = 24/235 (10%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T PW + + I ++ + +++ ++D S SM S ++ + S Sbjct: 153 LTPAPWNEKHTLLRLDI-AANDIARSKLPPTNLVFLIDTSGSMNSD-----ERLPLIKSS 206 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 + ++ E++ + +VT++ LL S ++ + + + Sbjct: 207 LKLLVNELRDQD------RISIVTYAG---SARLLLSSTSGAEK--NTILNAIANLQAGG 255 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 G + + N I+ TDG+ + + ++ G Sbjct: 256 GTNGGAGVAMAYEQAQAGYIKGGVNR---ILLATDGDFNIGDDPSSVEDLVKKQRESGIT 312 Query: 322 VYAIGIRVIRSHE--FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +G+ +E ++ + N Y S+ +A + ++ + K Sbjct: 313 LSTLGVGDNNYNEAMMVKIADTGNGNYSY--LDSLSEAQKVLSNEMNQTLVTVAK 365 >gi|260794937|ref|XP_002592463.1| hypothetical protein BRAFLDRAFT_68949 [Branchiostoma floridae] gi|229277683|gb|EEN48474.1| hypothetical protein BRAFLDRAFT_68949 [Branchiostoma floridae] Length = 1402 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 48/136 (35%), Gaps = 26/136 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D + +LD S S+ T ++A + I + + + G++ + Sbjct: 457 AIDTVFLLDRSSSI------PRTGFNLAKQYIWGFAKCF-----TDQEIGVGVIAYDCVP 505 Query: 231 EEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + L + L+ I + G T + +KY + R+ + Sbjct: 506 KTSIKLGIFTASNSADLRYAIGDVEFTGGLTRTAAAIKY----MTQTADFRE-------D 554 Query: 287 YKKIIVFMTDGENLST 302 ++++ +TDG + Sbjct: 555 VPRVMIIITDGNDEGD 570 >gi|186681556|ref|YP_001864752.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186464008|gb|ACC79809.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 224 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 64/166 (38%), Gaps = 19/166 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ I+ + + ++ +E+ V+ +VTF + + Sbjct: 24 VLLLDTSGSMQG------DPIEALNQGLLSLKDELVKNSLAARRVEVAIVTFDSNVNVV- 76 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + V+ Q L+ G++T G+ A + I D + T Y + VFM Sbjct: 77 --QDFVTADQFNPPILTAQGLTTM-GAGIHKALDIIQDR---KSQYRTNGIAYYRPWVFM 130 Query: 295 -TDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE E+ +EA KR + +G+ Sbjct: 131 ITDGEPQGELENVVEQASVRLQGDEANKR-VAFFTVGVENANMTRL 175 >gi|307720604|ref|YP_003891744.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978697|gb|ADN08732.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 631 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 72/179 (40%), Gaps = 24/179 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P+ K+ + + D+MI LD+S SM + D ++ +A + LE ++L P Sbjct: 69 LAGPVIKDGKIEIKAKSA-DIMIALDISDSMLAE-DVYPNRLKLAKQKA---LELLRLAP 123 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + + G++ F+ L + + +K L ++ T + Sbjct: 124 NE----RIGVIAFAKNSYLVSPLSFDHEAVAFLLKKLDTNSITEQGTDLM---------- 169 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 ++ + + KK ++ ++DG + + + AK++ V+ +G+ + Sbjct: 170 SMLQVVDKSIKKDSKKYLLILSDGGDKKNFSKEITF-----AKEKDIAVFVLGVGTPQG 223 >gi|254497958|ref|ZP_05110722.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] gi|254352852|gb|EET11623.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] Length = 607 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 37/197 (18%), Positives = 70/197 (35%), Gaps = 38/197 (19%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 H ++ + ++ + +++DVS SM+ + + + + Sbjct: 15 HFLIVFSGYAASTTKDLPEI--HLLIDVSGSMKHTDPQN------LRVTAVKLFNYL--- 63 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNSTPGLKYAYN 268 VNN + TFSN +E+ + + Q +K K + G TN L N Sbjct: 64 --VNNRAVVSVSTFSNDLEQIIPPQIVTAKFQESFLKKKKQIKSDGAWTNIDAALN-GVN 120 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDG--ENLSTKEDQQSLYYCNE-----AKKRGAI 321 + + KK+I+ +TDG + S +++S NE ++ Sbjct: 121 KSWSKN-------------KKVIILLTDGMLDLGSDALNKKSTQQLNETTIPILQREHVQ 167 Query: 322 VYAIGIRVIRSHEFLRA 338 VY IG+ L Sbjct: 168 VYTIGLSNEADSTLLSN 184 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 9/58 (15%), Positives = 20/58 (34%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +RN + +G T I L + + + E + + D + A ++ Sbjct: 1 MRNLLRSRQGSAAFATVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRL 58 >gi|221309837|ref|ZP_03591684.1| hypothetical protein Bsubs1_10701 [Bacillus subtilis subsp. subtilis str. 168] gi|221314159|ref|ZP_03595964.1| hypothetical protein BsubsN3_10622 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221319082|ref|ZP_03600376.1| hypothetical protein BsubsJ_10553 [Bacillus subtilis subsp. subtilis str. JH642] gi|221323355|ref|ZP_03604649.1| hypothetical protein BsubsS_10667 [Bacillus subtilis subsp. subtilis str. SMY] gi|255767454|ref|NP_389820.2| activator of nitric oxide reductase [Bacillus subtilis subsp. subtilis str. 168] gi|264675898|sp|O31849|YOJO_BACSU RecName: Full=Uncharacterized protein yojO; Flags: Precursor gi|225185081|emb|CAB13830.2| putative activator of nitric oxide reductase [Bacillus subtilis subsp. subtilis str. 168] Length = 638 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 67/219 (30%), Gaps = 28/219 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRS 183 I +K L N++ + + ++ + +++D S S Sbjct: 395 IQKTLEHKKTLPRTDLHAGRLNNKLLRYFTERNPRLFYKKQEPSSEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D + I E +K + + +V F + ++ Sbjct: 455 M-------FDKMDETKRGIVLFHEALKSVAVPHQIV-----GFWEDTNDATEKS-QPNYF 501 Query: 244 QRKIKYLSKFGVSTNSTPGLKY--AYNQIFDMQGMRQHCNT--EDANYKKIIVFMTDGEN 299 I + S +S P + D +RQ + +K ++ +DGE Sbjct: 502 NTVIPFQSSLRQ--DSGPAIMQLEPEEDNRDGYAIRQMTKKMLHRSEAQKFLIVFSDGEP 559 Query: 300 LST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + EA+KRG V + + E Sbjct: 560 AAFGYEQNGIVDTSEAVIEARKRGIEVINVFLSNSEIEE 598 >gi|170751541|ref|YP_001757801.1| vault protein inter-alpha-trypsin subunit [Methylobacterium radiotolerans JCM 2831] gi|170658063|gb|ACB27118.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium radiotolerans JCM 2831] Length = 729 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 39/308 (12%), Positives = 88/308 (28%), Gaps = 48/308 (15%) Query: 52 SMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIV 111 + +D + + + + T ++ LR V + Sbjct: 199 TSVDAVSLDGPAAAARDPVPDRARISPPVLDPALSAPTNPVAVTVRLRAGFAVGAVQSAT 258 Query: 112 RSTSLDIVVVPQNEGYSI--------------SAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + + I P+ ++ A ++P + + Sbjct: 259 HAIRV-IETGPEARTVTLAEGVVPADRDLELTWAPVPSRMPGLGLFRETVAGRTYLLAAV 317 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 +++ ++ D+ V+D S SM S + A + A L + Sbjct: 318 SPPAIEADAAARPARDVTFVIDNSGSM------SGASMRQAKAGLLAGLGRLSPRD---- 367 Query: 218 VVQSGLVTFSNK-----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++ F + E ++ + + L G + P LK A Sbjct: 368 --RFNVIRFDDTWDALHPEPVPATRAALAEAEAFVAALEARGGTEMLAP-LKAAL----- 419 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H + +VF+TDG D++ ++ A ++ +GI + Sbjct: 420 ---ADPHPEDGRV---RQVVFLTDG----AVGDEERIFAAIHADLGRTRLFMVGIGSAPN 469 Query: 333 HEFLRACA 340 +R A Sbjct: 470 GHLMRHAA 477 >gi|302804196|ref|XP_002983850.1| hypothetical protein SELMODRAFT_423098 [Selaginella moellendorffii] gi|300148202|gb|EFJ14862.1| hypothetical protein SELMODRAFT_423098 [Selaginella moellendorffii] Length = 544 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 64/180 (35%), Gaps = 9/180 (5%) Query: 173 DMMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + IVLD S SM SS +++ +A ++ +L+ + V +G F K Sbjct: 147 SLYIVLDTSSSMSISIGPLSSQSRLAVAKGILDELLDTLTNGDQVIVSDMNGGKPFGGKP 206 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S I L + L+ + F + + ++ + N + Sbjct: 207 VSVSLEGLETSFDHAGISALKNAISNAR-ADKLQTDIKKAF-VGALDFFNSSSNLN---V 261 Query: 291 IVFMTDGE--NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 I+ +TDG+ N D ++ K VY IG + F R S N + Sbjct: 262 ILLLTDGQFANHVNLTDLDPIFKQLNEKNVVVFVYRIGFYISNDETFQRMQFSLNISFEA 321 >gi|162456658|ref|YP_001619025.1| hypothetical protein sce8375 [Sorangium cellulosum 'So ce 56'] gi|161167240|emb|CAN98545.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 535 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 24/195 (12%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + VLD S SM+ + + ++++ A L E ++ V Q+ ++ + + Sbjct: 202 PIAVTFVLDTSGSMD---GEPMAREKATVRAVAASLSEGDVVNMVTWNTQNSVILSGHVV 258 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + T+ GL+ Y ++H N Sbjct: 259 DGPDD------PALLAAADALSASGGTDLESGLRVGYQL------AQEHFEEGRINR--- 303 Query: 291 IVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACASP---NSF 345 ++ ++D G N+ ++ + +A + + +G ++ L + ++ Sbjct: 304 VILVSDGGANVGVTSEELIALHAEDADQEAIYLVGVGTGPALGYNDVLMDAVTDKGRGAY 363 Query: 346 YLVENPHSMYDAFSH 360 +++ + F Sbjct: 364 VYLDDEDEAFHMFRD 378 >gi|150007593|ref|YP_001302336.1| hypothetical protein BDI_0946 [Parabacteroides distasonis ATCC 8503] gi|255013878|ref|ZP_05286004.1| hypothetical protein B2_08217 [Bacteroides sp. 2_1_7] gi|256839780|ref|ZP_05545289.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|298375539|ref|ZP_06985496.1| von Willebrand factor, type A [Bacteroides sp. 3_1_19] gi|301310441|ref|ZP_07216380.1| von Willebrand factor, type A [Bacteroides sp. 20_3] gi|149936017|gb|ABR42714.1| conserved hypothetical protein [Parabacteroides distasonis ATCC 8503] gi|256738710|gb|EEU52035.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|298268039|gb|EFI09695.1| von Willebrand factor, type A [Bacteroides sp. 3_1_19] gi|300832015|gb|EFK62646.1| von Willebrand factor, type A [Bacteroides sp. 20_3] Length = 289 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 27/107 (25%), Positives = 48/107 (44%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S +++ K + + + N + G+V F Sbjct: 72 EEERELTVMLLIDVSGS------RDFGSVNVMKKEVITEIAATLAFSAIQNNDKIGVVFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQI 270 S+KIE+F + G H+ I+ L F T+ + LKY N I Sbjct: 126 SDKIEKFIPPQKGKKHILYIIRELIDFHPEETRTDISQVLKYLTNAI 172 >gi|194211592|ref|XP_001915502.1| PREDICTED: similar to von Willebrand factor precursor (vWF) [Equus caballus] Length = 2779 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 47/129 (36%), Gaps = 21/129 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 LD++++LD S S F S K+ K+ + + + P + + ++ + + Sbjct: 1655 QPLDVVLLLDGSSS---FPASDFDKMKSFAKA---FISKANIGP---QLTRVSVLQYGSI 1705 Query: 230 IEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 W V + L + + + G + L +A + H A Sbjct: 1706 TTIDVP--WNVPQEKASLLSLVDLMQREGGPSQVGDALAFAVRYVTS----EIHGARPGA 1759 Query: 286 NYKKIIVFM 294 + K++V + Sbjct: 1760 S--KVVVIL 1766 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 75/212 (35%), Gaps = 34/212 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 QT+ +L +++++D S S + D + + + +L E L TF Sbjct: 79 QTEQKLSVVLLVDTSGSTAKELKY---ESDSSSRFFHVLLGEGNPEDMAA------LYTF 129 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + +I E + ++K + T + A ++ E + Sbjct: 130 NWEIREQQPFSRDLRAFDNRLK-MMHGEAGTAMYDAVYLAAQRL------------EPRD 176 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--------RVIRSHEFLRA 338 +K+IV +TDG T A+ A++YAI + R I L Sbjct: 177 GRKVIVVVTDG--GDTVSRLSVQKALEAAQLADAVIYAIVVVPITNDAGRNIGGEHALDF 234 Query: 339 CA--SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A + ++ + AF+ I ++ T+ Sbjct: 235 MAKGTGGRIFMPTLGAELDKAFADIITELRTQ 266 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 58/153 (37%), Gaps = 10/153 (6%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G + ++ A+ L + + + ++++ ++ + L + D + + AAT+ N G G Sbjct: 14 RGAIGLMAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATRGGNCGAG-ATA 72 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 + R + R G +N + +++L + V +I + + Sbjct: 73 NAYAQASVVRNNFPIPSAGRTLAVACGTLN-----LDASNLRVFAVNAASTEAIRVVVSH 127 Query: 136 KIPLKFCTFIPWYTN----SRHIVMPITSSVKV 164 +P F I + I + T+ V Sbjct: 128 TVPQSFAGAIGGLFGGAGRNATINLSATAVAAV 160 >gi|221044450|dbj|BAH13902.1| unnamed protein product [Homo sapiens] Length = 623 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 5 VVFVIDISGSMRG------QKVKQIKEALLKILGDMQPGDYFD------LVLFGTRVQSW 52 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 53 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 107 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 108 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 167 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 168 QRIYEDHDATQQLQGF 183 >gi|255948704|ref|XP_002565119.1| Pc22g11730 [Penicillium chrysogenum Wisconsin 54-1255] gi|211592136|emb|CAP98461.1| Pc22g11730 [Penicillium chrysogenum Wisconsin 54-1255] Length = 1029 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 70/213 (32%), Gaps = 32/213 (15%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + S S Y T I YTN + LD+++V+ VS Sbjct: 466 SRGIQRQASINSSYGAGKSINTAITDYTNPDYASPDAEFPAINTV--HIPLDLVVVIPVS 523 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLL 236 SM+ KI + ++ +++ + + GLVTF + + Sbjct: 524 SSMQGL------KITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGVPLVGMTTK 571 Query: 237 EW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 W G S + I+ + + + + G A + + R+ N I+ ++ Sbjct: 572 SWAGWSKILESIRPVGQKSLRADVVEGANVAMDLLM----QRKFNNPVST-----ILLIS 622 Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 D + + AK +++ G+ Sbjct: 623 DSSISDPESVDFVVSRAEAAK---VTIHSFGLG 652 >gi|163797398|ref|ZP_02191350.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] gi|159177317|gb|EDP61874.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] Length = 683 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 73/204 (35%), Gaps = 43/204 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM + A S+ + +E + N +V F+NK F Sbjct: 330 VIFVIDVSGSM------KGEPLRAAKASLTSGIEGLGRNDTFN------VVAFNNKAAAF 377 Query: 234 F--LLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + +K + K G T + A D + + Sbjct: 378 YDAPVRASGKFHRAALKVIDGLKAGGGTEMAAAFELALQ------------MPGDPDRLQ 425 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +VF+TDG + + A+ ++ +GI ++PN+F++ E Sbjct: 426 QVVFITDGAVSNEAALFNQIKGELGAR----RLFTVGIG-----------SAPNTFFMEE 470 Query: 350 NPHSMYDAFSHIGKDIVTKRIWYD 373 +++IG +R+ D Sbjct: 471 AARFGRGTYTYIGDTSSAERVMRD 494 >gi|91974615|ref|YP_567274.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] gi|91681071|gb|ABE37373.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisB5] Length = 372 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 29/181 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ ++ +++ VLD + SM + + KI +I V P + ++ Sbjct: 20 AIPSAARARPAVEVAFVLDTTGSMSGLIEGAKRKIWSIATAI------VDSNPGAD--IR 71 Query: 221 SGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQI-FD 272 GLV + + + L + L ++ L G + L A N++ + Sbjct: 72 MGLVAYRDIGDDYVTRNVELTPDIQDLYARLLELQARGGGDWPESVNEALDVAVNKLRWS 131 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN--EAKKRGAIVYAIGIRVI 330 G + IVF+ Q + Y A+++ IV A+ Sbjct: 132 KDGDTRR-----------IVFLVGDAPPHMDYAQDTKYPTTLSVARQKDIIVNAVQAGAA 180 Query: 331 R 331 R Sbjct: 181 R 181 >gi|3169331|gb|AAC17863.1| YojO [Bacillus subtilis subsp. subtilis str. 168] Length = 661 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 67/219 (30%), Gaps = 28/219 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRS 183 I +K L N++ + + ++ + +++D S S Sbjct: 418 IQKTLEHKKTLPRTDLHAGRLNNKLLRYFTERNPRLFYKKQEPSSEIDAVFTLLVDCSAS 477 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D + I E +K + + +V F + ++ Sbjct: 478 M-------FDKMDETKRGIVLFHEALKSVAVPHQIV-----GFWEDTNDATEKS-QPNYF 524 Query: 244 QRKIKYLSKFGVSTNSTPGLKY--AYNQIFDMQGMRQHCNT--EDANYKKIIVFMTDGEN 299 I + S +S P + D +RQ + +K ++ +DGE Sbjct: 525 NTVIPFQSSLRQ--DSGPAIMQLEPEEDNRDGYAIRQMTKKMLHRSEAQKFLIVFSDGEP 582 Query: 300 LST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + EA+KRG V + + E Sbjct: 583 AAFGYEQNGIVDTSEAVIEARKRGIEVINVFLSNSEIEE 621 >gi|170091408|ref|XP_001876926.1| predicted protein [Laccaria bicolor S238N-H82] gi|164648419|gb|EDR12662.1| predicted protein [Laccaria bicolor S238N-H82] Length = 886 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 22/181 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV-KLIPDVNN 217 T+ K + T ++LD+M ++D + SM+ + I A SI + + V + + Sbjct: 546 TAWSKAPTTTKSKLDIMFLIDATGSMDPY-------IKQASASIGTIYDNVLRNGSWSKD 598 Query: 218 VVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ GLV F + ++ + +S + + G Y + + Sbjct: 599 DIRVGLVAFRDHPQKKATTFLTQKYDFTSDMSKVSDNLDALEAKDG--EDYPEASEDALE 656 Query: 277 RQHCNTEDANYKKIIVFMTD------GEN-----LSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + + V +TD GE+ + + + G ++Y + Sbjct: 657 DALEADWNDDAVMVTVLITDSTPHATGESHDYFKDGCPDQNDPVDIADRMADLGIVLYVL 716 Query: 326 G 326 G Sbjct: 717 G 717 >gi|53713709|ref|YP_099701.1| hypothetical protein BF2418 [Bacteroides fragilis YCH46] gi|253565657|ref|ZP_04843112.1| BatB [Bacteroides sp. 3_2_5] gi|265764033|ref|ZP_06092601.1| BatB [Bacteroides sp. 2_1_16] gi|4838139|gb|AAD30859.1|AF116251_2 BatB [Bacteroides fragilis] gi|52216574|dbj|BAD49167.1| conserved hypothetical protein BatB [Bacteroides fragilis YCH46] gi|251945936|gb|EES86343.1| BatB [Bacteroides sp. 3_2_5] gi|263256641|gb|EEZ27987.1| BatB [Bacteroides sp. 2_1_16] gi|301163418|emb|CBW22969.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 341 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 64/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + I+ +++ ++ + Sbjct: 79 GSKLETVKRKGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLISKLVDGMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + IV +TDGEN + + K+G V +G+ + Sbjct: 180 ARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDG 230 >gi|134118676|ref|XP_771841.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] gi|50254445|gb|EAL17194.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] Length = 502 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 59/184 (32%), Gaps = 35/184 (19%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN--NVVQ 220 S +D++ +LD + SM+ + +S I + + ++ +N + ++ Sbjct: 46 GSGSSHGKCIDLVFILDCTGSMQKYINS-------VRDHIIGICDMIRGEEGLNGPDDLR 98 Query: 221 SGLVTF-------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +V + S + +F + +Q +K L+ G + + A Sbjct: 99 VAVVNYRDHPPQDSTYVYKFHPFTSDIPEVQNYLKGLTASGGG-DGPEAVTAAMAATLTE 157 Query: 274 QGMRQHCNTEDANYKKIIVFMTD------GEN-----LSTKEDQQSLYYCNEAKKRGAIV 322 R+ ++ V + D GE + L + G + Sbjct: 158 LEWRREAA-------RMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVVARMMAQNGITM 210 Query: 323 YAIG 326 + + Sbjct: 211 FMVA 214 >gi|327266510|ref|XP_003218048.1| PREDICTED: complement C2-like [Anolis carolinensis] Length = 735 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 28/215 (13%), Positives = 72/215 (33%), Gaps = 32/215 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D L + ++ D S S+ + K + ++ V V+ +++F Sbjct: 252 SKDNFLYIYMLADASHSVTE------ANFRIFKKCLEIIINRVASFDIA---VKFSILSF 302 Query: 227 SNKIEE--------FFLLEWGVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMR 277 +++ + E + ++ + Y TN+ L Y + + R Sbjct: 303 ASQPKTIVDIHEDIAEDPELVLEKMKSDMNYKDHGNATGTNTYAALDAIYRMMIN---DR 359 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA------KKRGAIVYAIGIRVIR 331 + + + I+ +TDG++ + + ++ +K +Y G+ + Sbjct: 360 ANVLEKWDKVRHAIILLTDGKSNLGRPPKDAVRSIEGLVNVGENRKDYLDIYVFGVGNLD 419 Query: 332 S-----HEFLRACASPNSFYLVENPHSMYDAFSHI 361 +E + +E ++ +AF I Sbjct: 420 VDWSAMNEIASKKPGEKHAFKLEKLETLKEAFEDI 454 >gi|321311581|ref|YP_004203868.1| von Willebrand factor type A [Bacillus subtilis BSn5] gi|320017855|gb|ADV92841.1| von Willebrand factor type A [Bacillus subtilis BSn5] Length = 638 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 30/219 (13%), Positives = 66/219 (30%), Gaps = 28/219 (12%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRS 183 I +K L N++ + + ++ + +++D S S Sbjct: 395 IQKTLEHKKTLPRTDLHAGRLNNKLLRYFTERNPRLFYKKQEPSSEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D + I E +K + + +V F + ++ Sbjct: 455 M-------FDKMDETKRGIVLFHEALKSVAVPHQIV-----GFWEDTNDATEKS-QPNYF 501 Query: 244 QRKIKYLSKFGVSTNSTPG-LKYAYNQIFDMQGMRQHCNT---EDANYKKIIVFMTDGEN 299 I + S +S P ++ + + + +K ++ +DGE Sbjct: 502 NTVIPFQSSLRQ--DSGPAIMQLEPEEDNRDGYAIRQMTKKMLHRSEAQKFLIVFSDGEP 559 Query: 300 LST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + + EA+KRG V + + E Sbjct: 560 AAFGYEQNGIVDTSEAVIEARKRGIEVINVFLSNSEIEE 598 >gi|299132141|ref|ZP_07025336.1| von Willebrand factor type A [Afipia sp. 1NLS2] gi|298592278|gb|EFI52478.1| von Willebrand factor type A [Afipia sp. 1NLS2] Length = 634 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 75/208 (36%), Gaps = 28/208 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV-----QSGLVT 225 L + +++DVS S + +D+ ++ A+ + D + + + VT Sbjct: 444 DLSLAVLMDVSLS-TDAWMQDRRVLDVEKGALLALTHGLTACGDEHAIYTFTSRRRTSVT 502 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 S E L+ + R+I+ L+ G T +++ ++ + Sbjct: 503 VSTIKEFDEPLD---RRIIRRIEALTP-GQYTRIGAAVRHVTTEL-----------AQRP 547 Query: 286 NYKKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 K+++ +TDG+ + + EA+K G V+ + I F Sbjct: 548 QRHKLLLLLTDGKPNDIDHYEGRYGIEDTRMAIREARKAGLRVFGVTIDENARDYFPHIF 607 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 ++ +V + + A I + I T Sbjct: 608 GR-GAYSIVRDIARLPAALPAIYRQITT 634 >gi|300936621|ref|ZP_07151525.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] gi|300458202|gb|EFK21695.1| von Willebrand factor type A domain protein [Escherichia coli MS 21-1] Length = 219 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A N + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAHG-DTPMGAAITKALNMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADMKTLAQ 171 >gi|224160089|ref|XP_002189262.1| PREDICTED: similar to matrilin 4, partial [Taeniopygia guttata] Length = 219 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 35/71 (49%), Gaps = 9/71 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +D+++V+D S+S+ ++ + +N +++ +++ P + GLV Sbjct: 158 SKCGAGHVDLVMVIDGSKSVRP------QNFELVKQFVNRIVDLLEVSPHG---TRVGLV 208 Query: 225 TFSNKIEEFFL 235 +S+++ F Sbjct: 209 QYSSRVRTEFP 219 >gi|13242646|ref|NP_077661.1| EsV-1-176 [Ectocarpus siliculosus virus 1] gi|13177446|gb|AAK14590.1|AF204951_175 EsV-1-176 [Ectocarpus siliculosus virus 1] Length = 254 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 60/190 (31%), Gaps = 39/190 (20%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++++D S SM D S T+ + A K++ + V+ + + + F +K+E Sbjct: 59 VLLIDRSGSMGWA-DGSTTRWESARKAVEKL---VEAAFIYDTDHRVPVYLFDDKVEFVG 114 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + Y + +T+ L+ A + + IV + Sbjct: 115 ECTSSSQVVDVFKNYQPR--GTTDLAQCLEVAMEEYAGRKRPNYEVCPGTT----FIVVL 168 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSM 354 DG C + K LR + P S Y+ N + Sbjct: 169 -DG--------------CADDKDA-------------VKRVLRKFSDPVSGYVA-NHTQL 199 Query: 355 YDAFSHIGKD 364 +F I D Sbjct: 200 AISFLQIADD 209 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++++I LD+S SM + D ++++ A + I+ +++E+ + Sbjct: 79 GSKLETVKRKGVEVIIALDISNSMLAQ-DVQPSRLEKAKRLISRLVDELDND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ ++ T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + I+ +TDGEN + + + V +G+ + Sbjct: 180 ARSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAAEKG----IQVNVLGVGMPDG 230 >gi|116327069|ref|YP_796789.1| hypothetical protein LBL_0242 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116119813|gb|ABJ77856.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] Length = 379 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 74/220 (33%), Gaps = 38/220 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + S+ R + IV D S SM + KI +A K ++ + + Sbjct: 54 TGTHIASENHERSKLFIV-DASGSMNEYL-GIYQKIHLAKKHVSRYISTLPTE------T 105 Query: 220 QSGLVTFSNKIEEFF------LLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G + + N+I L+ + ++ L+ G T ++ A N I Sbjct: 106 EIGFIAYGNRIPGCSSSRLYEPLQRENHGTFKNRLFSLTPSGA-TPLAESIRIAGNLISQ 164 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIR 328 + + I+ +TDG K++G + +G++ Sbjct: 165 RKKETE------------IILITDG---VESCYGDPKKELQALKQQGIYFKFHILGLGLK 209 Query: 329 VIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIV 366 ++ A + ++ +E+ S Y A + V Sbjct: 210 -PDEERKMKILAEEGNGKYFGIEDDSSFYTALDSLKNQTV 248 >gi|317055297|ref|YP_004103764.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447566|gb|ADU21130.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 271 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 64/201 (31%), Gaps = 15/201 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + L + ++D S SM K+ + ++ E++ + + + V+ Sbjct: 28 GATAVSKKSLVIFFLIDTSGSM------KGKKMGQLNTVMEELIPEIRKVGEADTDVKVA 81 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++TF + +W S I+ + + A+ ++ N+ Sbjct: 82 VLTFDTDV------KWMYSAPIS-IEDFEWARLGAQGVTSMGAAFTELAARMSRNSFLNS 134 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACAS 341 ++ ++ MTDG + N K G A+GI + E L S Sbjct: 135 PSLSFAPVMFLMTDGYPSDDYKSGLKALQTNSWYKFGLKA-ALGIGDEANDEMLAEFTGS 193 Query: 342 PNSFYLVENPHSMYDAFSHIG 362 + + I Sbjct: 194 KETVVHAYTGGQLASMIKIIA 214 >gi|302495833|ref|XP_003009930.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] gi|291173452|gb|EFE29285.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] Length = 705 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 67/191 (35%), Gaps = 39/191 (20%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFF 188 + + + I N +++ I +K + D+++V+D+S SM S Sbjct: 27 TVTMEENLTQEPILSIHSIPNKDSMIVSIQPPLKPENDVPHVPCDIVLVIDISGSMNSAA 86 Query: 189 ----------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE------- 231 D+ ++ +D+ + ++E + N + +VTF ++ Sbjct: 87 PIPTGERGGEDTGLSILDLTKHAAKTIIETL------NEKDRLAVVTFCTEVNVRTIESS 140 Query: 232 -EFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 F L++ S + I L STN G+K N + Sbjct: 141 PVAFELDYMNKENKSTVLSAIDKL-YGKSSTNLWHGIKKGLNVLTTN---------PAQG 190 Query: 287 YKKIIVFMTDG 297 + ++ +TDG Sbjct: 191 KIQSLLVLTDG 201 >gi|254373669|ref|ZP_04989153.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] gi|151571391|gb|EDN37045.1| hypothetical protein FTDG_01677 [Francisella novicida GA99-3548] Length = 332 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|86748912|ref|YP_485408.1| TadE-like [Rhodopseudomonas palustris HaA2] gi|86571940|gb|ABD06497.1| TadE-like [Rhodopseudomonas palustris HaA2] Length = 181 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 40/128 (31%), Gaps = 12/128 (9%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI-DRSLVHAATQIMN 67 R F N +G I A+ PI +L IIE + +F VL + + D + + Sbjct: 14 RRFSRNRRGSAAIEFAMIAPIFIALLFAIIETAFVFLASQVLETAVQDSARL-------- 65 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + K + +G D+ S + P + Sbjct: 66 ---ILTGQAQAASYTQSQFKTDLCNRLKALFSCDGVYVDVQSYGSDFSTVSITTPIDSSK 122 Query: 128 SISAISRY 135 + +Y Sbjct: 123 NFVNTMKY 130 >gi|168487521|ref|ZP_02712029.1| collagen adhesion protein [Streptococcus pneumoniae CDC1087-00] gi|298230475|ref|ZP_06964156.1| collagen adhesion protein [Streptococcus pneumoniae str. Canada MDR_19F] gi|298502975|ref|YP_003724915.1| collagen adhesion protein family protein [Streptococcus pneumoniae TCH8431/19A] gi|183569664|gb|EDT90192.1| collagen adhesion protein [Streptococcus pneumoniae CDC1087-00] gi|298238570|gb|ADI69701.1| collagen adhesion protein family protein [Streptococcus pneumoniae TCH8431/19A] gi|327389411|gb|EGE87756.1| collagen adhesion family protein [Streptococcus pneumoniae GA04375] Length = 171 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 11/103 (10%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDA---RLDMMIVLDVSRSME----SFFDSS- 191 K T I + + + ITS + +QT+ LD+++V D S SME + S Sbjct: 47 KTITPISGQEDQYELSLDITSKLGTETQTETQSEPLDVVLVADFSGSMEERDVWSYSSRR 106 Query: 192 -ITKIDMAIKSINAMLEEVKLIPDV--NNVVQSGLVTFSNKIE 231 I++I+ ++ + LI + N+ + +V F+ KI+ Sbjct: 107 YISRIEALKHTLKGVNGRQGLIDTILSNSQNRLSIVGFAGKID 149 >gi|150007596|ref|YP_001302339.1| hypothetical protein BDI_0949 [Parabacteroides distasonis ATCC 8503] gi|255013875|ref|ZP_05286001.1| hypothetical protein B2_08202 [Bacteroides sp. 2_1_7] gi|149936020|gb|ABR42717.1| conserved hypothetical protein BatB [Parabacteroides distasonis ATCC 8503] Length = 339 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 16/139 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++D A + ++ + + N+ V Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPNRLDKAKQMLSRLTDGF-----TNDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N + Sbjct: 132 -GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQ-GTAIGAAIN-------LAARS 182 Query: 281 NTEDANYKKIIVFMTDGEN 299 T D K I+ +TDGEN Sbjct: 183 FTPDETTDKAIILITDGEN 201 >gi|60551291|gb|AAH91051.1| Clca1 protein [Xenopus (Silurana) tropicalis] Length = 937 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 39/196 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM +I ++ + ++ + G+V F + Sbjct: 308 VTLVLDVSGSM-----GGGNRIGRLYQAAEVFVMQIVEMGS-----YVGIVQFESTASVR 357 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYKKIIV 292 L V QR L T + N + +G++ + + ++Y +V Sbjct: 358 SSLLQIVDDTQR--NRLKSLLPKTATGGT-----NICAGIREGIKVNKKYDGSSYSTELV 410 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF---------LRACASP 342 +TDGE+ C + G+I++ I + + E LR A+ Sbjct: 411 LLTDGEDNYATS------LCFPDVTNSGSIIHVIALGPNAAKELETIVDMTGGLRFLATD 464 Query: 343 NSFYLVENPHSMYDAF 358 + + DAF Sbjct: 465 K-----VDAQGLIDAF 475 >gi|60681980|ref|YP_212124.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|60493414|emb|CAH08200.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] Length = 341 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 64/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + I+ +++ ++ + Sbjct: 79 GSKLETVKRKGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLISKLVDGMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ +S T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + IV +TDGEN + + K+G V +G+ + Sbjct: 180 ARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDG 230 >gi|89094360|ref|ZP_01167301.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] gi|89081419|gb|EAR60650.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Oceanospirillum sp. MED92] Length = 707 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 66/190 (34%), Gaps = 36/190 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D + VLD+S SM+ K ++ + L + N+ + V F+N+ Sbjct: 313 DWVFVLDISGSMKG-------KFAALVEGVREGLSNLSP----NDRFRI--VLFNNQARS 359 Query: 233 FFL--LEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F L + ++ + L + G TN GL+ NQ+ D++ Sbjct: 360 FTQGYLPADKTTVENTLNQLDQIQPGQGTNLYAGLQTGINQL-------------DSDRS 406 Query: 289 KIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSF 345 IV +TDG N+ T + L + + ++ + + L S Sbjct: 407 TAIVLVTDGVANVGTTHKSKFLNLLEQ---KDVRLFTFIMGNSANRPLLEEMTRVSNGFA 463 Query: 346 YLVENPHSMY 355 + N + Sbjct: 464 MSISNSDDIV 473 >gi|86137907|ref|ZP_01056483.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] gi|85825499|gb|EAQ45698.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] Length = 181 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 10/51 (19%), Positives = 25/51 (49%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL 58 +R F + +T+ AI +P+ + L +E+ + ++ L +D ++ Sbjct: 10 LRRFRKDTDATVTVEFAILMPLFLMFLFSTVELGMLILRQSQLERALDIAV 60 >gi|329946212|ref|ZP_08293825.1| hypothetical protein HMPREF9056_01718 [Actinomyces sp. oral taxon 170 str. F0386] gi|328527810|gb|EGF54801.1| hypothetical protein HMPREF9056_01718 [Actinomyces sp. oral taxon 170 str. F0386] Length = 367 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 66/188 (35%), Gaps = 13/188 (6%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDM-AIKSINAMLEEVK 210 + M + + + + +++ +V+D + SM E + +D A ++ + +++ Sbjct: 64 LAMAGPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGQGPEGVDQPASTRLDGVRADMR 123 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + + + ++ + L + + I L + + + L+ A + Sbjct: 124 AVREAFPDSRFSIIALDSAAATELPLTHDTNAVDSWIGSLKQEVSAHATGSSLEVALPLL 183 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + +D +++ +DGE +D + + A G ++ ++ Sbjct: 184 GQTLAQSRQSEPKDI---RLVYIFSDGE---ATDDGRGAQAADNA---GISWQSLA-GLV 233 Query: 331 RSHEFLRA 338 L Sbjct: 234 DGGAVLGY 241 >gi|295840162|ref|ZP_06827095.1| von Willebrand factor, type A [Streptomyces sp. SPB74] gi|197698094|gb|EDY45027.1| von Willebrand factor, type A [Streptomyces sp. SPB74] Length = 588 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 42/218 (19%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++V+D S SM +++D+ S+ L + D GL F+ +++ Sbjct: 382 LLVVVDASPSMAEPVPGRGQSRMDVTKASLLQALAQFTPADD------IGLWEFATRLD- 434 Query: 233 FFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAY----------NQIFDMQGMRQH 279 G + + + G S L A+ ++D Sbjct: 435 ------GDRDYRELVPTARLGARKGSSGTQREALTAAFGALQPQPGGATGLYDTTLAAYR 488 Query: 280 CNTEDANYKKI--IVFMTDGENLSTKEDQQS------LYYCNEAKKRGAIVYAIGIRVIR 331 +ED K +V +TDG N + + A + AI + Sbjct: 489 EASEDYAADKFNAVVLLTDGTNEEPGSMTRGELLTRLRDLADPAHP--LPLVAIAVGPEA 546 Query: 332 SHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + + A A+ S + V++P +++ I IV Sbjct: 547 AEDDMEAIGGATGGSGFKVDDPAQIHEV---INNAIVE 581 >gi|149928031|ref|ZP_01916279.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] gi|149823219|gb|EDM82455.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] Length = 543 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 47/118 (39%), Gaps = 4/118 (3%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 N + +G +ILT L + LG ++ V H+ + KT L + D + AA Q+ N Sbjct: 5 PNSRQSQRGAYSILTVFVLIMSLGALG-VLAVGHMAWEKTRLQGVADLVALTAARQMSNG 63 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD---IVRSTSLDIVVVPQ 123 + + + + + ++ N N I IVR + +V +P Sbjct: 64 PEFAEAQAIALANGVSEDDDLTIECIIDDAPTNDCDNSITSRVTIVRPVNGLLVFLPN 121 >gi|45659178|ref|YP_003264.1| hypothetical protein LIC13359 [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602424|gb|AAS71901.1| conserved hypothetical protein [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 377 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 73/235 (31%), Gaps = 36/235 (15%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + F + + ++ ++V +QT ++L ++D S SM + KI Sbjct: 31 LHFMFLQRLLNFIGFTFFILVTGTNVSSENQTPSQL---FIIDASGSMNEYL-GIYQKIH 86 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF------LLEWGVSHLQRKIKYL 250 +A K ++ + + + G + + N++ LE G + L Sbjct: 87 LAKKHVSHYISTLP------QETEIGFLAYGNRLPGCSSSRLYQPLEMGNRDTFK--NRL 138 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 S + I + E I+ +TDG Sbjct: 139 FSLTPSGATPLA-----ESIRIAGTLISQRKKETE-----IILVTDG---IESCYGDPKK 185 Query: 311 YCNEAKKRGA--IVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSH 360 K++G + +G+ + E + ++ +E+ S Y A Sbjct: 186 ELQALKQKGIPFQFHVLGLGLKSHEELQMKILTEEGNGKYFSIEDDSSFYTALDS 240 >gi|313247474|emb|CBY15696.1| unnamed protein product [Oikopleura dioica] Length = 534 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 40/95 (42%), Gaps = 11/95 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--KI 230 D++ ++D S S+ + + K +KS+ V + + ++ +S+ +I Sbjct: 448 DLVFIIDGSWSVG---NVNFRKAKDFMKSL------VNPFEIGWDYTRVSVLQYSDDPRI 498 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 E + + L I ++ G +T + ++Y Sbjct: 499 EFYLKDYQDKTTLLNAIDAITYKGGNTRTGEAIRY 533 >gi|307825380|ref|ZP_07655599.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733555|gb|EFO04413.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 600 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 55/140 (39%), Gaps = 23/140 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LD+SRSM++ D +++ MA I +L++ K Q+ L+ ++ Sbjct: 95 LVIALDLSRSMDAA-DIKPSRLIMARYKIADILKQRKDG-------QTALLVYAGDAFTV 146 Query: 234 FLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + ++ L+ +N+ L+ A + + Sbjct: 147 TPLTDDTETIDSQLSALNTDIMPSQGSNTASALEKAVELL-----KQAGLQKGR------ 195 Query: 291 IVFMTDG-ENLSTKEDQQSL 309 I+ +TDG + T ++L Sbjct: 196 ILLVTDGVDMDKTLAAVKTL 215 >gi|256839783|ref|ZP_05545292.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256738713|gb|EEU52038.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 339 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 16/139 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++D A + ++ + + N+ V Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPNRLDKAKQMLSRLTDGF-----TNDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N + Sbjct: 132 -GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQ-GTAIGAAIN-------LAARS 182 Query: 281 NTEDANYKKIIVFMTDGEN 299 T D K I+ +TDGEN Sbjct: 183 FTPDETTDKAIILITDGEN 201 >gi|255671683|gb|ACU26442.1| uncharacterized protein [uncultured bacterium HF186_25m_18N5] gi|255671756|gb|ACU26513.1| uncharacterized protein [uncultured bacterium HF186_25m_27D22] Length = 836 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 93/290 (32%), Gaps = 32/290 (11%) Query: 62 ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDN---GFVNDIDDIVRSTSLDI 118 ++ M N + + + R+ +++ + +ID ++ Sbjct: 193 GSEAMTTQAAVNPRYIADKETRHRVSLELHLNESATVFQAHSPSHALEIDAGPDKSTYVS 252 Query: 119 VVVPQ---NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + P N + + P+ + + + + +KV+ ++ Sbjct: 253 LADPDALPNRTIEVRYMLASTQPVARVSAHRKPGQTGTFALTLEPPLKVDPDQVTPKELF 312 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 V+D S SM + K A++ + ++ SG+ + + + Sbjct: 313 FVVDTSGSM---MGEPLDKARAAMRYALERMGPDDTFQIID--FASGVASLAPRPLPNTP 367 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-KIIVFM 294 + I+ ++ G + + G+R + + +I+ FM Sbjct: 368 E--NLRKGLAFIEAMTSQGGTEM--------------LAGIRAALDGPTPPGRLRIVAFM 411 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 TDG D L Y +++ + A +++ G+ + L A+ Sbjct: 412 TDG---YIGNDGDILDYIDQSVGQ-ARLFSFGVGEDVNRYLLEEMATRGR 457 >gi|300776964|ref|ZP_07086822.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502474|gb|EFK33614.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 800 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 87/258 (33%), Gaps = 28/258 (10%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF---IPWYTNSRHIVMPITSSVKVNS 166 +V ++ I + Y + + P T PW + + + + + Sbjct: 375 VVDKNAVRIEEMVNYFKYDYP-QPKNENPFSINTEYSDAPWNPKHKLLKIGLQGK-NLPM 432 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +++ ++DVS SM S K+ + S +L +++ V VV +G Sbjct: 433 DKLPASNLVFLIDVSGSM-----SDENKLPLLKSSFKVLLNQLRPKDKVGIVVYAGSAGM 487 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + L + G ST G++ AY ++ +++ N Sbjct: 488 VLPPTSAGEKD----KIIEALDRL-QAGGSTAGGAGIELAY-KLAQENFVKEGNNR---- 537 Query: 287 YKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE-FLRACAS--P 342 ++ TDG+ N+ T + +K G + +G + + L A Sbjct: 538 ----VIIATDGDFNVGTSSISDLKTLIEDRRKSGVFLTCLGFGMGNYKDNTLETLADKGN 593 Query: 343 NSFYLVENPHSMYDAFSH 360 ++ ++N Sbjct: 594 GNYAYIDNMQEANKFLGK 611 >gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 843 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 67/211 (31%), Gaps = 34/211 (16%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 + T + H + S++ D++ ++D S S + Sbjct: 306 DETTQLKTLLQADDRGGHFATYLIPSLEYKQDEIVAKDVVFLIDTSGSQRG------EPL 359 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKF 253 + + + ++ + + ++ FS+ L V++ Q+ I Y+++ Sbjct: 360 AKSKQLMRRFIQSLNPDDTFS------IIDFSDTTTALSATPLTNTVTNQQKAIAYINQL 413 Query: 254 --GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI--IVFMTDGENLSTKEDQQSL 309 T G++ + K++ IV +TDG + + L Sbjct: 414 EANGGTELLNGIQTVM-------------DFPSPPVKRLRSIVLITDG---YIGNENEVL 457 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 K G +Y+ G+ + L A Sbjct: 458 SVVKNQLKSGNRLYSFGVGSSVNRFLLNRLA 488 >gi|38489998|gb|AAR21601.1| complement factor B precursor [Naja kaouthia] Length = 787 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 55/143 (38%), Gaps = 21/143 (14%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ V+D S+S + A +++ +++++ + G+VTF Sbjct: 242 NKSDSLNIYFVIDGSKSFG------RKRFIKAKEALLKLIQKISSYDIHPSY---GIVTF 292 Query: 227 SNKIE--------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF--DMQGM 276 + + + +W L + + TN GL++ Y + D++ Sbjct: 293 ATENRMVLHTTHPQSSDADWVSERLGSISIDVHERKPGTNIANGLRFVYEMMIMQDVEEK 352 Query: 277 RQHCNTE--DANYKKIIVFMTDG 297 ++ N + II+ +TDG Sbjct: 353 QRGLNPTPVSTSVHHIIIVLTDG 375 >gi|83643773|ref|YP_432208.1| hypothetical protein HCH_00891 [Hahella chejuensis KCTC 2396] gi|83631816|gb|ABC27783.1| uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [Hahella chejuensis KCTC 2396] Length = 223 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 10/153 (6%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLD S SM I + + + +K VQ ++ N + Sbjct: 22 VLVLDGSSSMFG------EPIRQLNEGLKLLERALKEDASTAMRVQLLVIRAGNHDQAEV 75 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L +W V + + G +T + A ++I D + I+ + Sbjct: 76 LTDW-VDAMDFNAPEVFANG-TTPLGGAMNLALDKIEDQKAAYDANGISSTRP--WIILI 131 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +DG + + + + R +++ IG+ Sbjct: 132 SDGAPTDFNWEAVADRCRHAEQNRKVVIFPIGV 164 >gi|304437199|ref|ZP_07397160.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369861|gb|EFM23525.1| von Willebrand factor type A domain protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 248 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 56/144 (38%), Gaps = 19/144 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +R++++++LD S SM + +I ++ + V+ V F Sbjct: 40 EKPSRIELVMILDRSGSMH----------GLEADTIGGFNAMIEKEKKLGIGVRVTTVLF 89 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDA 285 ++KI+ + R I+ + T T G + + + M Q + Sbjct: 90 NDKIDRLYE--------HRAIQSVRPLTERTYETGGTTALLDAVGETILHMEQSGAADRQ 141 Query: 286 NYKKIIVFMTDGENLSTKEDQQSL 309 K I+V +TDG ++ E ++ Sbjct: 142 GTKVIVVIITDGMENASTEFTKAK 165 >gi|298676133|ref|YP_003727882.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] gi|298289121|gb|ADI75086.1| von Willebrand factor type A [Methanohalobium evestigatum Z-7303] Length = 569 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 84/235 (35%), Gaps = 33/235 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + S + ++D+ +++ + S + ++ ++ + + I Sbjct: 3 LALLFVVTTGSASAANEKVDLYLLV------DGSGSISSSDFELQLEGMANTINN-SSIV 55 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + V ++ FSN + L S ++ I +++ G STN + + + Sbjct: 56 PQDGSVSISVIQFSNIAQVEIPLTTINSQSDADNISSDIIAINQVGGSTNISGAINLSVQ 115 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGI 327 ++ +D + K+II TDG Q S + A G + +G+ Sbjct: 116 KL-----------PDDLSGKQIIDLSTDGVPN-VNGIQASYDARDNANNSGFEELNTLGV 163 Query: 328 RVIRSHEFLRAC-------ASPNSFYLVENPHSMYDAF-SHIGKDIVTKRIWYDK 374 + + FL+ P + + + F GK++ K I +K Sbjct: 164 GIGVNETFLKELVFPQPSDEYPGFYTFAGDFDTFQKEFEKKAGKEVGQKPIKLEK 218 >gi|296491866|ref|YP_003662331.1| hypothetical protein XNC1_p0046 [Xenorhabdus nematophila ATCC 19061] gi|289176751|emb|CBJ92914.1| conserved hypothetical protein; putative exported protein [Xenorhabdus nematophila ATCC 19061] Length = 598 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 68/177 (38%), Gaps = 22/177 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ ++ IP Sbjct: 403 VTGDTRVFLKPEAKKRPNTAVHILVDMSSSMAYKAANGKERQDIAREASLAISMALEAIP 462 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF + F +++ G + R ++ K T + YA ++ Sbjct: 463 GVNP-----AVTFFGGNRNQPVFSVVKHGDTVQNRAGRFGFKATGGTPMAEAMWYAAFEL 517 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R K+++ +TDG S + + C + V IG+ Sbjct: 518 TKTREER-----------KMLIVVTDGLPQSAPACRSVIDLCE---RSDVEVIGIGV 560 >gi|262381905|ref|ZP_06075043.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|301310438|ref|ZP_07216377.1| BatB protein [Bacteroides sp. 20_3] gi|262297082|gb|EEY85012.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|300832012|gb|EFK62643.1| BatB protein [Bacteroides sp. 20_3] Length = 339 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 57/139 (41%), Gaps = 16/139 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++M+ LDVS SM + D S ++D A + ++ + + N+ V Sbjct: 79 GSKLETVKRQGVEIMVCLDVSNSMLAE-DVSPNRLDKAKQMLSRLTDGF-----TNDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GL+ F+ + + + ++ VST + A N + Sbjct: 132 -GLIVFAGDAFTQLPITSDYISAKMFLSSINPSMVSTQ-GTAIGAAIN-------LAARS 182 Query: 281 NTEDANYKKIIVFMTDGEN 299 T D K I+ +TDGEN Sbjct: 183 FTPDETTDKAIILITDGEN 201 >gi|149925200|ref|ZP_01913495.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] gi|149813928|gb|EDM73579.1| hypothetical protein PPSIR1_06648 [Plesiocystis pacifica SIR-1] Length = 353 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 76/233 (32%), Gaps = 53/233 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM T+ + +++ ++ EV + + +Q GL F Sbjct: 66 VVLVVDRSGSMSDNPLGDETRWE----ALHGVISEVVSGQEAS--LQLGLTMF------- 112 Query: 234 FLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAY---NQIFDMQGMRQ--------HC 280 + G + Q ++ V ++ + A N I H Sbjct: 113 PAADAGTTWEQGACLAPTQLDVAVGADTGAAILGALPGPNAITQGGTPAAAAVQLAADHL 172 Query: 281 NTEDANYKKIIVFMTDGENL--STKEDQQSLYYCN--------EAKKRGAIVYAIGI--- 327 D K++V +TDG + D Q+ + A G + +GI Sbjct: 173 RARDTQDPKLLVLVTDGAANCAADSADWQASLVYDEALQDAVANASMDGITTHVVGIQID 232 Query: 328 -------RVIRSHEF-----LRACASPNSF--YLVENPHSMYDAFSHIGKDIV 366 V+ + + L + Y VE+ + A S I DI Sbjct: 233 TELDAQAGVVPAEQLHEVAQLGGAGLDGEYAFYQVEDQAMLSAALSSITADIS 285 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 92/259 (35%), Gaps = 45/259 (17%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT------------- 168 + Y++ + F + S+ I + ++ ++ Sbjct: 58 QNKQKYNLDKGLSLDVKTLHKHFQFNSSASQSIPVMVSVKTLDKTEDSPKTDLEAAKQDR 117 Query: 169 ---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++ V+D S SM KI+ K++ +LE + + L+ Sbjct: 118 LENRPNLDLVCVIDRSGSMSGN------KIENVKKTLEYLLELLGEND------RLCLIA 165 Query: 226 FSNKI-EEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 F + + L++ S+ IK +++ TN G++ A+ + R++ N Sbjct: 166 FDSCVSRRCHLMKTNSSNKPNLIKIINEIHCHGGTNINSGMELAFRVL----KERKYYNP 221 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNE-AKKRGAIVYAIGIRVIRSHEFL-RACA 340 + I ++DG++ D + + + +++ G + + C+ Sbjct: 222 VSS-----IFLLSDGQDGG--ADLRVRQSLEKHLSQECFTIHSFGFGSDHDGPLMNKICS 274 Query: 341 -SPNSFYLVENPHSMYDAF 358 +FY VE + + + F Sbjct: 275 LKDGNFYYVEKINQVDEFF 293 >gi|218549490|ref|YP_002383281.1| hypothetical protein EFER_2160 [Escherichia fergusonii ATCC 35469] gi|218357031|emb|CAQ89663.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469] gi|325497894|gb|EGC95753.1| hypothetical protein ECD227_1991 [Escherichia fergusonii ECD227] Length = 219 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 62/179 (34%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM S I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------SGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A N + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKAMNMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + + IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFTIGVQGADMKTLAQ 171 >gi|281417929|ref|ZP_06248949.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] gi|281409331|gb|EFB39589.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] Length = 1050 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 33/239 (13%), Positives = 75/239 (31%), Gaps = 61/239 (25%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-- 241 +E + T + + + +++ + + ++ +S+ + ++ +S Sbjct: 715 VEEKYIDKSTYLKLVKDAAKNFVDKF-----AGSKTKMAVIQYSDSANDNDFKKYDLSLP 769 Query: 242 ----HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT- 295 L+ I + +N G++ AY+ + D K IV +T Sbjct: 770 DKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNDP--------PPKGQISKYIVVITG 821 Query: 296 -------------------DGENLSTKEDQQSLYYCNEAKKRGAIVYA-------IGIRV 329 +G K D +S + AK G I+ + I Sbjct: 822 SVPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSE 881 Query: 330 IRSHEFLRACA----------SPNSFYLVENPHSMYDAFSH----IGKDIVTKRIWYDK 374 + L A + +Y N + D ++ I D+V ++ Y++ Sbjct: 882 EDIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVVLDKVLYEE 940 >gi|116332274|ref|YP_801992.1| hypothetical protein LBJ_2829 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116125963|gb|ABJ77234.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 379 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 74/220 (33%), Gaps = 38/220 (17%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + S+ R + IV D S SM + KI +A K ++ + + Sbjct: 54 TGTHIASENHERSKLFIV-DASGSMNEYL-GIYQKIHLAKKHVSRYISTLPTE------T 105 Query: 220 QSGLVTFSNKIEEFF------LLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + G + + N+I L+ + ++ L+ G T ++ A N I Sbjct: 106 EIGFIAYGNRIPGCSSSRLYEPLQRENHGTFKNRLFSLTPSGA-TPLAESIRIAGNLISQ 164 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIR 328 + + I+ +TDG K++G + +G++ Sbjct: 165 RKKETE------------IILITDG---VESCYGDPKKELQALKQQGIYFKFHILGLGLK 209 Query: 329 VIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIV 366 ++ A + ++ +E+ S Y A + V Sbjct: 210 -PDEERKMKILAEEGNGKYFGIEDDSSFYTALDSLKNQTV 248 >gi|37654264|gb|AAQ96237.1| LRRGT00024 [Rattus norvegicus] Length = 344 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 55/142 (38%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 153 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 210 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 211 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 269 Query: 286 NYKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 270 EPA-IIITITDGSKLTTTSGVQ 290 >gi|308238185|ref|NP_001184129.1| chloride channel accessory 1 [Xenopus (Silurana) tropicalis] Length = 933 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 68/196 (34%), Gaps = 39/196 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM +I ++ + ++ + G+V F + Sbjct: 304 VTLVLDVSGSM-----GGGNRIGRLYQAAEVFVMQIVEMGS-----YVGIVQFESTASVR 353 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANYKKIIV 292 L V QR L T + N + +G++ + + ++Y +V Sbjct: 354 SSLLQIVDDTQR--NRLKSLLPKTATGGT-----NICAGIREGIKVNKKYDGSSYSTELV 406 Query: 293 FMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF---------LRACASP 342 +TDGE+ C + G+I++ I + + E LR A+ Sbjct: 407 LLTDGEDNYATS------LCFPDVTNSGSIIHVIALGPNAAKELETIVDMTGGLRFLATD 460 Query: 343 NSFYLVENPHSMYDAF 358 + + DAF Sbjct: 461 K-----VDAQGLIDAF 471 >gi|42522349|ref|NP_967729.1| leucyl-rRNA synthetase [Bdellovibrio bacteriovorus HD100] gi|39574881|emb|CAE78722.1| leucyl-rRNA synthetase [Bdellovibrio bacteriovorus HD100] Length = 544 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 29/227 (12%), Positives = 75/227 (33%), Gaps = 20/227 (8%) Query: 124 NEGYSISAISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 Y++ + K+ K +F + + V ++D++ V+D Sbjct: 225 GGDYTLKVSATDKLGHASEKSISFKVSSMYKQMVN-------NVKVNEYQKVDILFVIDN 277 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQSG-LVT-FSNKIEEFFLL 236 S SME S +++ + + + ++ V+ + G LV ++ + Sbjct: 278 SGSMEYEKKSMASRVRNFLDVVKGLDWQIAVTTTDPVHKTLGDGRLVQLYNKPGQYILTS 337 Query: 237 EWGVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIVFM 294 + + + L + + S G+ AY I + + N + + +V + Sbjct: 338 SMNDTDARNTLSSTLQRSETGSGSEQGIFAAYRAIERSLSASGGNANFIRNDSQLAVVVI 397 Query: 295 TDGENLSTKEDQQ----SLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 +D + + + + + A+ + I L+ Sbjct: 398 SDEDESANGAKNDPANFVKFVADSFGSQKAMSFHSIIARPDDQACLK 444 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 34/249 (13%), Positives = 79/249 (31%), Gaps = 42/249 (16%) Query: 17 GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKK 76 G ++I+ LP + +L + I + T D S + A Sbjct: 10 GSISIVFIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACA-------------- 55 Query: 77 LKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYK 136 + + ++ + ++ + Y Sbjct: 56 -YSQRANVSLTQGFAQYYKPNFI--------------SHVNAQSTFLSGQKQCKIQIGYA 100 Query: 137 IPLKFCTFIPWYTNSR---HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 +P + ++ + + TS++ V+S+ +D+ +VLD+S SM Sbjct: 101 FTPLLKDLLPASSQNKVHASVQIQSTSTLTVHSEIK-PMDLSLVLDISGSMSG------- 152 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIE-EFFLLEWGVSHLQRKIKYLS 251 +I + + IN ++ ++ NN ++ +V FS+ + + +S Sbjct: 153 RIGLLKRIINQAIQNIEQQNTKNNTQIRFSIVPFSSGVSISNAPWLAKSKGKALCVDAMS 212 Query: 252 KFGVSTNST 260 G N+ Sbjct: 213 YPGNVLNTA 221 >gi|118388916|ref|XP_001027552.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila] gi|89309322|gb|EAS07310.1| Ubiquitin-conjugating enzyme family protein [Tetrahymena thermophila SB210] Length = 601 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 33/139 (23%), Positives = 61/139 (43%), Gaps = 13/139 (9%) Query: 167 QTDARLDMMIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D ++++ DVS SM+S F D I++I +A ++ L + N++VQ L Sbjct: 131 EEDPAEAIVVIYDVSGSMKSGFFNDPLISRIGAVNAFFSAFADKT-LAYEYNHIVQ--LY 187 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F ++IE+ ++H + + + G ST L N + + Sbjct: 188 WFDDRIEKKCDFTKDMNHFIKLVDDANPRG-STRLYDALMEGINSLLL-------IKKKY 239 Query: 285 ANYKKIIVFMTDGENLSTK 303 N ++ MTDGE+ +K Sbjct: 240 PNIILRLIAMTDGEDNQSK 258 >gi|78222748|ref|YP_384495.1| hypothetical protein Gmet_1536 [Geobacter metallireducens GS-15] gi|78194003|gb|ABB31770.1| conserved hypothetical protein [Geobacter metallireducens GS-15] Length = 569 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 70/206 (33%), Gaps = 31/206 (15%) Query: 157 PITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P T S+ +T L D+++++D + SM + +K ++ +++ I D Sbjct: 388 PTTGSIFNLKKTKYELHNDIVLLVDCTGSMSE--PTKWSKAELIYQTL------FTAIHD 439 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N + ++ + E + E + ++ G T S + I Sbjct: 440 YNANARI--FAYNERNETCRITEIFKQG---QFFSVTPHGR-TASGEAI------IATAI 487 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++ I+ +TDG + + ++ YC KK+ + +G+ ++ Sbjct: 488 SLKPGARKP------FIIHITDGASNWGCGVRDAIAYC---KKKKINLLTLGLECDPMNK 538 Query: 335 FLRACASPNSFYLVENPHSMYDAFSH 360 + V + F Sbjct: 539 TALSEEYGEMIQFVNTMEQFPNVFKK 564 >gi|253998722|ref|YP_003050785.1| outer membrane adhesin-like protein [Methylovorus sp. SIP3-4] gi|253985401|gb|ACT50258.1| outer membrane adhesin like proteiin [Methylovorus sp. SIP3-4] Length = 1536 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 56/140 (40%), Gaps = 15/140 (10%) Query: 174 MMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I LD+S SM + T++ AI+SI +L++ + D + LV FS + Sbjct: 1026 LLITLDISGSMRTQDGVGGTTRLASAIQSIKTLLDKYDALGD----TRISLVVFSTTAAQ 1081 Query: 233 FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + ++ + G +TN L A + D + N + + Sbjct: 1082 VGTDWMTIDQAKAQLDQILVNGPKGNTNYDSALANAMDAFDDAGKL--------TNAQNV 1133 Query: 291 IVFMTDGENLSTKEDQQSLY 310 F++DGE + SL Sbjct: 1134 AYFISDGEPNTGSGSNTSLT 1153 >gi|285808482|gb|ADC36006.1| von Willebrand factor type A domain protein [uncultured bacterium 259] Length = 311 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 68/172 (39%), Gaps = 32/172 (18%) Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A++ + P+ T + + ++++ +D+ +V+D S SM D Sbjct: 37 AVTSHGSPVSGLTLADFELADN----GVRQTIEMVDAETLPIDLTVVVDTSGSMRRSVDD 92 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 + ++I AML + L+TF+ +I E + L+ + L + L Sbjct: 93 ----LKADAQAIAAMLRPND---------RIRLLTFAGQIRETYALQAPSADL--ALDRL 137 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + G ST+ + A + R+H +V +TDG++ S+ Sbjct: 138 AATG-STSLFDAVAVALAGVT--GTDRRHL----------VVVLTDGQDTSS 176 >gi|220913381|ref|YP_002488690.1| hypothetical protein Achl_2636 [Arthrobacter chlorophenolicus A6] gi|219860259|gb|ACL40601.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 310 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 9/85 (10%), Positives = 22/85 (25%) Query: 29 IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKN 88 ++ + ++ I+ + L + D S + A G D L + Sbjct: 3 VLIGAGALAVDTGQIYAERAQLQNGADASALAGADLCSANGGCTQAAATSVADALANSNS 62 Query: 89 TWNMSFRNELRDNGFVNDIDDIVRS 113 + S + + Sbjct: 63 NDSKSTVQSVDLSVPGQVTVTTSTK 87 >gi|301609308|ref|XP_002934204.1| PREDICTED: epithelial chloride channel protein [Xenopus (Silurana) tropicalis] Length = 906 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 69/201 (34%), Gaps = 44/201 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD+S SM ++ +I ++ + ++ V G+VTFSN E Sbjct: 309 VSLVLDISGSM-----TNANRITRLYQASEVYIMQI-----VEQGAYVGIVTFSNVAEIK 358 Query: 234 FLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L QR+ L TN G++ ++ Sbjct: 359 SQLVKITDTFQRESLKLKLPTVATGGTNICAGVQQGLQVNRNLDQSTHGTE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF---------LRACA 340 IV +TDGE+ C + K GAI++ I + L+ A Sbjct: 410 IVLLTDGEDSGISS-------CFPDITKSGAIIHTIALGNNADPGLEKLADLTGGLKLYA 462 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 S + + + D+FS I Sbjct: 463 SDK-----VDANGLIDSFSGI 478 >gi|302656364|ref|XP_003019936.1| hypothetical protein TRV_06038 [Trichophyton verrucosum HKI 0517] gi|291183712|gb|EFE39312.1| hypothetical protein TRV_06038 [Trichophyton verrucosum HKI 0517] Length = 1055 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 36/267 (13%), Positives = 84/267 (31%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N + + W + N +++ ++ Y I Sbjct: 430 SVNELPRFHLRFSTLHQLDIWKRAL------GDLRNGAVVTRPDPGVEVPASEED-DYRI 482 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK---VNSQTDARLDMMIVLDVSRSMES 186 S + + + +S + TS + S LD+++V+ VS SM+ Sbjct: 483 SKTPSKRESSGYSSSYGAGKSSNTAMTDYTSPGREPMPVSAFHVPLDIVVVIPVSSSMQG 542 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + ++ +++ + + GLVTF + + W G Sbjct: 543 L------KISLLRDTLRFLVQNLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 590 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 591 GKIFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 641 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 642 DPESGDFVVSRAEAAK-VGIHTFGLGL 667 >gi|281420097|ref|ZP_06251096.1| von Willebrand factor, type A [Prevotella copri DSM 18205] gi|281405897|gb|EFB36577.1| von Willebrand factor, type A [Prevotella copri DSM 18205] Length = 289 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 43/102 (42%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + + + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSL------DFGTMKQMKRDLATEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I+ + T+ L+Y Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLNFQPQSQRTDIGCALEY 167 >gi|145480079|ref|XP_001426062.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124393134|emb|CAK58664.1| unnamed protein product [Paramecium tetraurelia] Length = 2123 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 65/186 (34%), Gaps = 23/186 (12%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + + + N++ + + +LD S SME K D ++S+ L+ +K Sbjct: 1920 TKLSQTFNTKIYYNTREQQEIFYLFILDDSFSMEG------KKGDEMMESLRQQLKFLKS 1973 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N + +++F+ + L + IK ++ G TN P LK +QI Sbjct: 1974 ----NKYAKVSVISFN---YKASLQIEFKKPKAKLIKQITLVGGITNFDPPLKLCLDQIL 2026 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + I+ +DGE ++ ++ + + I + Sbjct: 2027 KYEKKIDQA---------YILLYSDGEGSYPQQSLAEYITLSQELRNKIS-FLICTAGSK 2076 Query: 332 SHEFLR 337 L+ Sbjct: 2077 PQTLLK 2082 >gi|218778169|ref|YP_002429487.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218759553|gb|ACL02019.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 381 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 69/213 (32%), Gaps = 26/213 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + S++ +++LD S SM D K +A +++ M+ + Sbjct: 30 VPSDLNGSIRTGHLVQKADHFLVILDCSGSMAEMVDGE-KKFVLAREAVRKMIAGIPEAE 88 Query: 214 -DVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + ++ +T + +E L + + + ++ +N + + + Sbjct: 89 MNFEAGLRVFGLTMNPFVESTTALVDMEPLDKAAYDKALDKVTFATGKSNLALAIAQSSD 148 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + G ++ +TDG+ + + N + R VY I + Sbjct: 149 DLDKTTGEIS------------LIIVTDGKETDGEAARALEVVKNTYRDR-LCVYTIQVG 195 Query: 329 V-IRSHEFLRA------CASPNSFYLVENPHSM 354 + L C + V +P +M Sbjct: 196 HDPAGEKLLERLSRKGQCGYSENLDNVNSPKAM 228 >gi|114652505|ref|XP_001170901.1| PREDICTED: coagulation factor C homolog, cochlin isoform 2 [Pan troglodytes] gi|13938219|gb|AAH07230.1| COCH protein [Homo sapiens] gi|119586368|gb|EAW65964.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_b [Homo sapiens] Length = 494 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 20/170 (11%), Positives = 58/170 (34%), Gaps = 25/170 (14%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTD------ARLDMMIVLDVSRSMESFFDSSITK 194 F +P + + V P+ + + Q +++ ++D S S+ + Sbjct: 329 FSYHMPNWFGTTKYVKPLVQKLCTHEQMMCSKTCYNSVNIAFLIDGSSSVGD------SN 382 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSK 252 + ++ ++ + + ++ + V F+ F + + I+ + Sbjct: 383 FRLMLEFVSNIAKTFEISDIGA---KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRY 439 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T + + + +F +R+ N +V +TDG++ Sbjct: 440 MSGGTATGDAISFTVRNVFGP--IRESPNKN------FLVIVTDGQSYDD 481 >gi|258624851|ref|ZP_05719779.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582849|gb|EEW07670.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 189 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 5/108 (4%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+V+D+S SM + + + A+ + + + GL+ F++ Sbjct: 85 DLMLVVDLSYSMSQE---DMQSGQQMVDRLTAVKQVLSEFITKREGDRMGLILFADHAYL 141 Query: 233 FFLLEWGVSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L + ++ L G T G+ A D ++ Sbjct: 142 QTPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFIDSDAPKE 189 >gi|114570700|ref|YP_757380.1| vault protein inter-alpha-trypsin subunit [Maricaulis maris MCS10] gi|114341162|gb|ABI66442.1| Vault protein inter-alpha-trypsin domain protein [Maricaulis maris MCS10] Length = 740 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 61/171 (35%), Gaps = 32/171 (18%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V+D S SM + A ++ L+ ++ + ++ F N +E+ F Sbjct: 347 IFVIDNSGSMGG------ASMRQARAALITALQRLEPGD------RFNVIRFDNTMEQVF 394 Query: 235 LLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 V+ + L G T P L A + +D + + Sbjct: 395 PQAVDASPDNVATALTFARRLEAQG-GTVMLPALNAALRD----------TSPDDDSRVR 443 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 IVF+TDG ++ L+ EA + ++ +GI + F+ A Sbjct: 444 QIVFLTDG----AIGNEAELFAAIEAGLGRSRLFPVGIGSAPNGYFMSRAA 490 >gi|157120592|ref|XP_001659677.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] gi|108874883|gb|EAT39108.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] Length = 1122 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 64/192 (33%), Gaps = 22/192 (11%) Query: 168 TDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSIN--AMLEEVKLIPDVNNVVQSGLV 224 + D+MI+LD S SM + ++ + ++ V +V Sbjct: 149 ASSPKDVMILLDSSGSMSGKEYQLAVATASAIMDTLGDDDFFNLVSFSDQPKVIVPC--- 205 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F +K+ + ++ I+ + + + N + L+ A+ + Q Sbjct: 206 -FQDKMVRATP--DNIKEVKTAIQSV-ECENTANFSAALESAFELLRRYNQSSQGSQCNQ 261 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEFLR-ACAS 341 A I+ +TDG + + + + + + ++ IG + AC + Sbjct: 262 A-----IMLITDGPSDTFADVIKHYNHPHMP----VRIFTYLIGTDKSSGKNLYQMACDN 312 Query: 342 PNSFYLVENPHS 353 + + + Sbjct: 313 KGFYVQINSVEE 324 >gi|149573051|ref|XP_001520334.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 581 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 13/102 (12%) Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--- 330 + + + + KI+V +TDG+ EAK G I YAIG+ Sbjct: 1 EELFNPISGAREDANKILVVITDGQKYGDSLKYSEAIPLAEAK--GVIRYAIGVGDAFDF 58 Query: 331 --RSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 + E + +SP + + V+N + A +I + + K Sbjct: 59 SSTAEELITIASSPEQDHVFRVDN----FGALDNIQQQLQDK 96 >gi|118496822|ref|YP_897872.1| hypothetical protein FTN_0208 [Francisella tularensis subsp. novicida U112] gi|194324497|ref|ZP_03058269.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|254372186|ref|ZP_04987678.1| TPR domain protein [Francisella tularensis subsp. novicida GA99-3549] gi|118422728|gb|ABK89118.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella novicida U112] gi|151569916|gb|EDN35570.1| TPR domain protein [Francisella novicida GA99-3549] gi|194321332|gb|EDX18818.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 332 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|300727566|ref|ZP_07060956.1| von Willebrand factor, type A [Prevotella bryantii B14] gi|299775155|gb|EFI71757.1| von Willebrand factor, type A [Prevotella bryantii B14] Length = 245 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 57/190 (30%), Gaps = 13/190 (6%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM KI ++ +L + I N + + +E Sbjct: 11 LFFVIDTSGSMIGN------KIGAVNDAVENVLPMLDEISASNPDAEIKVAA----LEFS 60 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 W + + V+ + L A ++ T ++ I+ Sbjct: 61 SGCNW-LYDEPKLASEFVWQDVTASGLTSLGAACLELNSKLSRSGFMQTPSGSFAPAIIL 119 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RACASPNSFYLVENPH 352 ++DG S N K + AI I + L + + + + V N Sbjct: 120 LSDGGPTDDFHGGLSKLKANNWFKNAIKI-AIAIGDDADKDVLTQFTGTNEAVFTVHNID 178 Query: 353 SMYDAFSHIG 362 ++ + Sbjct: 179 ALKQIIRVVA 188 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 8/57 (14%), Positives = 25/57 (43%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 R G +L A+ L + ++ + +F ++ + + +D +++ A ++ Sbjct: 5 RKLIGGEGGTAAVLFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQEL 61 >gi|257457979|ref|ZP_05623138.1| conserved hypothetical protein [Treponema vincentii ATCC 35580] gi|257444692|gb|EEV19776.1| conserved hypothetical protein [Treponema vincentii ATCC 35580] Length = 379 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 60/168 (35%), Gaps = 17/168 (10%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 V + + +LD + V+D + SM+ I KI + + + + + + Sbjct: 218 EVILTPKKHKKLDAVFVIDATESMKD----DIQKIREL------LADSLNKVLPKYDSYR 267 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQ 278 GLV + + ++F L + I S G S + A + D + Sbjct: 268 IGLVLYKDYYDDF--LTKTATDFTSDINRFSAALRGFSVFGGRDIPEAVYEGIDAGLDLE 325 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAI 325 +D + K ++ + D + + + + AK + ++ I Sbjct: 326 WRTDKDTDRK--LILIGDAPPHARPQGSITKEAVMDAAKSKDVKLFPI 371 >gi|168701746|ref|ZP_02734023.1| von Willebrand factor, type A [Gemmata obscuriglobus UQM 2246] Length = 249 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 62/198 (31%), Gaps = 30/198 (15%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMES---------------------FFDSSITKIDMA 198 S V + + + R ++++D S SM T+ID+ Sbjct: 8 SDVALATNPEPRCPCVLLIDTSGSMAEVVSGTGRDLGRTAQVDGKTYRVVSGGTTRIDLV 67 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 + + +V P V+ +VTF + + + + T Sbjct: 68 NEGLRVYQADVTNDPLAAQRVEVSVVTFGDTVRTVTPFVT----TSQFTPPVLTANGETP 123 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKK-IIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + A + + ++ ++ + I +TDGE E + E KK Sbjct: 124 MGAAILKAIDAVT---ERKREYRQNGLHFYRPWIFLITDGEPTDAWEAAAARVREGEEKK 180 Query: 318 RGAIVYAIGIRVIRSHEF 335 + A +A+G+ Sbjct: 181 QFA-FFAVGVEGANMDRL 197 >gi|260904335|ref|ZP_05912657.1| von Willebrand factor type A domain-containing protein [Brevibacterium linens BL2] Length = 324 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 83/250 (33%), Gaps = 44/250 (17%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV-MPITSSVKV 164 + + S I +P A+ + + T + + + V+ Sbjct: 29 TASSLAVAHSARITTLPSFR----KAMRTRLLTTVAFLAVIALTGLSTLAGISRPAWVET 84 Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D+M+ LDVS SM + +LE + + D + + G+ Sbjct: 85 VNPEKKLRDVMLCLDVSGSM--------------LGYDADLLEAYQELVDRFDGERIGMT 130 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC---- 280 F++ F L +Q ++ + G T + G Y+++ G Sbjct: 131 VFNSTAVSAFPLTDDYEMVQNYLEEAEE-GFRTWGSEGTDYSWSTSPPNIGGSSLIGDGL 189 Query: 281 -------NTEDANYKKIIVFMTD----GENLSTKEDQQSLYYCNEAKKRGAIVYAIG--- 326 + +D + + IVF TD G+ L +D + A + VY++ Sbjct: 190 VSCVDNFDRQDEDRSRSIVFATDNMLAGDPLFDLDDAT-----DIAVESDVRVYSLSPPS 244 Query: 327 -IRVIRSHEF 335 + ++ E Sbjct: 245 VLTNPQTKEL 254 >gi|239624983|ref|ZP_04668014.1| predicted protein [Clostridiales bacterium 1_7_47_FAA] gi|239521369|gb|EEQ61235.1| predicted protein [Clostridiales bacterium 1_7_47FAA] Length = 878 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 35/239 (14%), Positives = 79/239 (33%), Gaps = 28/239 (11%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + + +V+ + +D D++ +D SRSM D + + Sbjct: 21 MMFGTLMAAWGEENSSVAAMDAVEAVNVSDTGYDVVFCIDNSRSMWKQQDIRDQAVRVLA 80 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + ++ G V F++ + + L K + F T+ Sbjct: 81 NLAVG------------SDIRIGGVYFADHVYQRCSLTSLTGE--EDTKKVMSFLNFTDK 126 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS--------TKEDQQSLYY 311 G N + + +D + K+IIV +DG N + + + Sbjct: 127 DDG-NRDTNIGSALSEALKLFENQDISRKRIIVLFSDGINEDYEGTGSYTARANNMTSQA 185 Query: 312 CNEAKKRGAIVYAIGIRVIRSHE-FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E + +Y + + R+ E +LR + Y E+ + F + ++ + + Sbjct: 186 AGEINEAQIALYCVFLEKDRADEAYLRNLVN----YFKEDGQYDQERFFPVAENEIDRL 240 >gi|203287631|ref|YP_002222646.1| hypothetical protein BRE_170 [Borrelia recurrentis A1] gi|201084851|gb|ACH94425.1| uncharacterized conserved protein [Borrelia recurrentis A1] Length = 329 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 26/205 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIP 213 ++ ++ K L + V+D+SRSM +F + I +++ A I+ +L Sbjct: 71 ILDVSWGQKATEDERVNLRISFVVDISRSMLTFDEGKSINRLESAKNLISLILNSF---- 126 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V+ L F K S L + + Y+ + S+PG Sbjct: 127 ---ENVEYSLTIFKGKSLLVLPFSKDKSSLYKMLNYIEPNLI---SSPG------SFLGD 174 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +D++Y +V +TDG+ + R + + IGI R Sbjct: 175 GVLSAIQGIKDSSYYNFLVVLTDGDEWGENNYYTFSKLIDV---RNIVSFVIGIGSNRPL 231 Query: 334 EFLRACASPNSFYLVENPHSMYDAF 358 + ++ V++ + F Sbjct: 232 PLV------DNHVNVKDKNDSVMTF 250 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 72/199 (36%), Gaps = 16/199 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 148 KEKSLNVEILLDASGSMAGKVNGQV-KMEAAKKAIYNYL---DKIPDNANVM---LRVYG 200 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ K + + + + E Sbjct: 201 HKGSNNENDKSLSCGSSEVMYPLQPYKKEQFNAALSKFGPKGWTPLASAIESVNDDFKEY 260 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRAC- 339 ++ ++DGE + + N++ A+V IG V S + + Sbjct: 261 TGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQLMNTAE 319 Query: 340 ASPNSFYLVENPHSMYDAF 358 A ++ V N +Y Sbjct: 320 AGKGNYATVSNADELYQTL 338 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 40/104 (38%), Gaps = 2/104 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN--EGN 70 G + I I L + V + ++++H+ KT L + +D + + AAT + N + Sbjct: 10 KKQGGLVVIFVTIALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQA 69 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST 114 + + + + +F + + ND +T Sbjct: 70 AVSAEVTSTLNAMAGASGNHEFNFSTAVVSVEYSNDPQSFAGTT 113 >gi|328676286|gb|AEB27156.1| hypothetical protein FNFX1_0208 [Francisella cf. novicida Fx1] Length = 332 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|260808171|ref|XP_002598881.1| hypothetical protein BRAFLDRAFT_90093 [Branchiostoma floridae] gi|229284156|gb|EEN54893.1| hypothetical protein BRAFLDRAFT_90093 [Branchiostoma floridae] Length = 597 Score = 40.9 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 16/96 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNK 229 +D++ LD S S+ S S+ KI +K++ L + P + +V+F ++ Sbjct: 43 AVDIIFALDRSGSVGS---SNYNKIIDFVKAV---LSHFSVSPT---TTRVAVVSFGTDA 93 Query: 230 IEEFFLLEWGVSH------LQRKIKYLSKFGVSTNS 259 EF LL W ++ L+ I L G TN+ Sbjct: 94 KVEFDLLRWTSNNNNKCELLRTYIPKLKYTGGGTNT 129 >gi|290996510|ref|XP_002680825.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094447|gb|EFC48081.1| vWFA domain-containing protein [Naegleria gruberi] Length = 395 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 60/146 (41%), Gaps = 22/146 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ +D++IV+D + SM +I++A +++ ++ + + ++ V++ Sbjct: 75 SENFVDLVIVMDCTGSMSG-------EIEVAKRTVTTIISTLHEKFQSD--LRFSAVSYR 125 Query: 228 NKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +++ F ++ + I +S G + L A I +M ++ Sbjct: 126 DHTDDYAVKEFPFTKDLNKAKGYINTMSAQGGG-DHPEALASALYVINEMPFNKKG---- 180 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSL 309 KKI+V++ D K S Sbjct: 181 ----KKIVVWVADAPPHGMKTSSDSY 202 >gi|270016374|gb|EFA12820.1| hypothetical protein TcasGA2_TC001887 [Tribolium castaneum] Length = 1264 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 80/221 (36%), Gaps = 20/221 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 SS S + D++I++D S SM K ++A + ++L + VN Sbjct: 247 DFRSSAWYVSAATSPKDIVILIDNSGSMSGH------KSNLARATTESILNTLGDNDFVN 300 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQ 274 S + + + L++ +++ + LS F N T L + + Sbjct: 301 VFKFSDITEETVPCFKDMLVQANNENVRWLKESLSTFKSENIANFTAALVTGFEILHKYN 360 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRS 332 Q C A I+ +TDG S +E + + + ++ +G + Sbjct: 361 RTGQGCQCNQA-----IMLITDGPPSSYQEIFKMYNFPHYP----VRIFTYLVGKDSSSA 411 Query: 333 HEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 HE ACA+ + +EN + H + + + Y Sbjct: 412 HEMRWMACANKGYYTRIENFDEINQKVLHYIEVLARPMVMY 452 >gi|268316012|ref|YP_003289731.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333546|gb|ACY47343.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 339 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 65/180 (36%), Gaps = 23/180 (12%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P ++ LD++I LDVS SM + D + +++ A + +LE ++ Sbjct: 75 GPRLGLQPRQAERRG-LDLLIALDVSNSMLAE-DVAPSRLARARYELYRLLEHLEGD--- 129 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFD 272 + GL+ F+ ++ + T+ ++ A Sbjct: 130 ----RVGLILFAGDAFLQCPFTTDYGAVRLFLDVADPSLIPTPGTDYVRMIQVALQAF-- 183 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ ++++ ++DGEN + +Q + ++ G A+G+ Sbjct: 184 -----EAPQPDEVPRSRVLLVVSDGENHAEGFEQALR----QLQEAGIERLAVGVGETAG 234 >gi|189242452|ref|XP_969952.2| PREDICTED: similar to voltage-gated calcium channel alpha2-delta subunit 1 [Tribolium castaneum] Length = 1217 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 41/221 (18%), Positives = 80/221 (36%), Gaps = 20/221 (9%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 SS S + D++I++D S SM K ++A + ++L + VN Sbjct: 247 DFRSSAWYVSAATSPKDIVILIDNSGSMSGH------KSNLARATTESILNTLGDNDFVN 300 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQ 274 S + + + L++ +++ + LS F N T L + + Sbjct: 301 VFKFSDITEETVPCFKDMLVQANNENVRWLKESLSTFKSENIANFTAALVTGFEILHKYN 360 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRS 332 Q C A I+ +TDG S +E + + + ++ +G + Sbjct: 361 RTGQGCQCNQA-----IMLITDGPPSSYQEIFKMYNFPHYP----VRIFTYLVGKDSSSA 411 Query: 333 HEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 HE ACA+ + +EN + H + + + Y Sbjct: 412 HEMRWMACANKGYYTRIENFDEINQKVLHYIEVLARPMVMY 452 >gi|87199929|ref|YP_497186.1| hypothetical protein Saro_1912 [Novosphingobium aromaticivorans DSM 12444] gi|87135610|gb|ABD26352.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 435 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 47/166 (28%), Gaps = 8/166 (4%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R G T+L A P + G ++ + + K + D++ + AA Sbjct: 15 FRRLLRARAGNATLLMAFGAPALIATAGFAVDTAQWYLWKREMQYAADQAALSAAYSKSK 74 Query: 68 EGNGNNRKKL------KGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVV 121 + + ++ N V R + +V+ Sbjct: 75 GISTTAYETHAVQEYNANLQLVTFSDTPTVSLAAYNGGTNNSVVVRASATRELAFSGIVL 134 Query: 122 PQNEGYSISAISRY--KIPLKFCTFIPWYTNSRHIVMPITSSVKVN 165 + S+SA + Y C T I + +S +K Sbjct: 135 GKPTTVSVSAQATYTAGATYTSCIIATNATADGAITIGGSSILKSG 180 >gi|198420988|ref|XP_002120145.1| PREDICTED: similar to integrin beta Hr1 [Ciona intestinalis] Length = 859 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 57/168 (33%), Gaps = 17/168 (10%) Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 R+ + + + +N + + T ++ + + + + I L+ Sbjct: 60 NRVPTAIRKNCKLDRENNQTCSVGVN-SPQTLINKTINEEFTAEN-KTNVTHPIQLRPQE 117 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 + P ++ + D LD+ V+D+S SM + ++ + S+ Sbjct: 118 ISLKLRIGQ----PTKVNITFHKLIDFPLDLYYVMDLSLSMRD----DLAQLKILGSSLI 169 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 +L +V + G TF +K+ F ++ +S Sbjct: 170 DVLR------NVTVNTRLGFGTFVDKVIAPFA-SDNKYIVKDACDSVS 210 >gi|156408618|ref|XP_001641953.1| predicted protein [Nematostella vectensis] gi|156229094|gb|EDO49890.1| predicted protein [Nematostella vectensis] Length = 507 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 68/239 (28%), Gaps = 38/239 (15%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRY--KIPLKFCTFIPWYTNSRHIVMPITSSVK 163 D+ + I + + A+ F + S Sbjct: 65 DVATVKAVKDAGISNREEMAAFLAQAVGASCPGEGEGPQCFAGGRRGVCKLSSSCLGSRI 124 Query: 164 VNSQTDA-----------RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + +D++ VLD S S+ A I + + + Sbjct: 125 SHVCDGPTDVVCCVECQNPIDLIFVLDSSGSVGP------ANFQSAKDFIATLCQRFNVS 178 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK----YAYN 268 + + L+ +++ + F + +S V S +K +YN Sbjct: 179 QSSESGARIALIRYASTPKIIF-------DFNAHNRRISLKEVFVISFRDMKVHHNKSYN 231 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN-EAKK-RGAIVYAI 325 I+ +Q +H + + KI G +++ D+ S +AK + I Sbjct: 232 SIWRIQNGVEHASCHEPALLKI------GNSVTLSVDENSKKLFRYKAKDLEKISILDI 284 >gi|330834066|ref|YP_004408794.1| protoporphyrin IX magnesium-chelatase [Metallosphaera cuprina Ar-4] gi|329566205|gb|AEB94310.1| protoporphyrin IX magnesium-chelatase [Metallosphaera cuprina Ar-4] Length = 607 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 53/138 (38%), Gaps = 21/138 (15%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 ++ L ++++LD S+SME +I +A + +L + I + GLV Sbjct: 428 GVESTGALPILLLLDSSKSMEF-----SKRISLAKSILKGLLIKAYQIRS-----KVGLV 477 Query: 225 TFSN-KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 TFS E + ++ I+ + G T + L A I R+ Sbjct: 478 TFSGFSSEYVVPITKNFKKVESSIEAVRPSGK-TPISSALALAIQIINRETRSRRGVLP- 535 Query: 284 DANYKKIIVFM-TDGENL 300 IVF+ +DG+ Sbjct: 536 -------IVFLISDGKAN 546 >gi|312194365|ref|YP_004014426.1| magnesium chelatase [Frankia sp. EuI1c] gi|311225701|gb|ADP78556.1| Magnesium chelatase [Frankia sp. EuI1c] Length = 726 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 46/128 (35%), Gaps = 17/128 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + Sbjct: 522 VLFVVDASGSM-----AARARMGAVKAAVLSLL-----LDAYQRRDKIGLVTFRGEDARL 571 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + G T GL A+ + R+ ++V Sbjct: 572 ALPPTSSVDVAATLLADLPSGGRTPLADGLLLAHRVL-----ARERLRDPSRRP--LLVV 624 Query: 294 MTDGENLS 301 +TDG + + Sbjct: 625 LTDGRHTA 632 >gi|242051338|ref|XP_002463413.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] gi|241926790|gb|EER99934.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] Length = 491 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 51/138 (36%), Gaps = 26/138 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A LD++ VLDVS SM K+++ +++ ++ ++ + +VTFS Sbjct: 70 ARAPLDLVTVLDVSLSM------KGQKLELLKQAMCFVIHQLGPAD------RLSIVTFS 117 Query: 228 NKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L G + + + L TN GL+ + + Sbjct: 118 RHATRQIRLARMSDVGKASAKFAVGALCAV-RGTNIGQGLRVGAQVLAGRRERNAVAG-- 174 Query: 284 DANYKKIIVFMTDGENLS 301 ++ ++DG++ S Sbjct: 175 -------MILLSDGQDTS 185 >gi|32394566|gb|AAM93981.1| alpha 1 type VII collagen precursor [Griffithsia japonica] Length = 194 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 57/185 (30%), Gaps = 16/185 (8%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +D S S+ + +++ A+ I V SGL Sbjct: 26 PPAKICYAIDESESIS---GTEFAQLNAALVEITNTFSTAAPGSSFAAVDFSGLQNGGGV 82 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E L V+ + K G T+S GL ++ + K Sbjct: 83 GEVVSPLTSDVTTFLAALAANPKRGGFTSSGTGLNLCDQEL------------DGQAQPK 130 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +I+ +TDG + +A V IG + +SP+ + V+ Sbjct: 131 VILLITDGVDNRNPIGVDV-EAAIKANGNTVAVVGIGDNLNDDDLRDTISSSPDLYTFVD 189 Query: 350 NPHSM 354 + ++ Sbjct: 190 DFSAL 194 >gi|4454062|emb|CAA10812.1| NorD protein [Bradyrhizobium japonicum] Length = 642 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 80/213 (37%), Gaps = 32/213 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + ++ DVS S +++ D +D+ +++ + + D ++++ TF+ Sbjct: 449 QGHDLAVTLLADVSLSTDAWVDGYRV-LDVEKEALLVLAHGLSACGDHHSIL-----TFT 502 Query: 228 NKIEEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++ + LE ++R+I L G T +++A + RQ Sbjct: 503 SRRRSWVRLETIKAFGEPMSGLVERRIGALKP-GYYTRIGAAVRHA-----SAELARQ-- 554 Query: 281 NTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KK+++ +TDG+ + + EA++ G V+ + + + Sbjct: 555 ----PQRKKLLLVLTDGKPNDVDHYEGRFAVEDTRKSVQEARRLGIAVFGVTVDA-TAQS 609 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + +V N + A I + + Sbjct: 610 YFPTLFGRGGYAIVGNIKRLPAALPAIYRQVAH 642 >gi|27378328|ref|NP_769857.1| NorD protein [Bradyrhizobium japonicum USDA 110] gi|27351475|dbj|BAC48482.1| NorD protein [Bradyrhizobium japonicum USDA 110] Length = 639 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 80/213 (37%), Gaps = 32/213 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + ++ DVS S +++ D +D+ +++ + + D ++++ TF+ Sbjct: 446 QGHDLAVTLLADVSLSTDAWVDGYRV-LDVEKEALLVLAHGLSACGDHHSIL-----TFT 499 Query: 228 NKIEEFFLLEW-------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 ++ + LE ++R+I L G T +++A + RQ Sbjct: 500 SRRRSWVRLETIKAFGEPMSGLVERRIGALKP-GYYTRIGAAVRHA-----SAELARQ-- 551 Query: 281 NTEDANYKKIIVFMTDGENLSTK------EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KK+++ +TDG+ + + EA++ G V+ + + + Sbjct: 552 ----PQRKKLLLVLTDGKPNDVDHYEGRFAVEDTRKSVQEARRLGIAVFGVTVDA-TAQS 606 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + + +V N + A I + + Sbjct: 607 YFPTLFGRGGYAIVGNIKRLPAALPAIYRQVAH 639 >gi|302877281|ref|YP_003845845.1| von Willebrand factor type A [Gallionella capsiferriformans ES-2] gi|302580070|gb|ADL54081.1| von Willebrand factor type A [Gallionella capsiferriformans ES-2] Length = 757 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 63/181 (34%), Gaps = 23/181 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI-TKIDMAIKSINAMLEEVKLIP 213 P + + + +VLD+S S+ S T ++++ ++++ + ++ + Sbjct: 550 SSPDPRINMSHRNDGRNIAVSVVLDLSESLNEKAAGSTQTILELSQEAVSLLAWAIEKLG 609 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQI 270 D SG + + F ++ ++K + ST +++A Sbjct: 610 DP--FAISGFHSNTRHDVRFMHIKGFSEKWDDEVKGRLAAMQASYSTRMGAAMRHA---- 663 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK------RGAIVYA 324 H + KK+++ +TDGE + + K +G Y Sbjct: 664 -------SHYLEQQQADKKLMLILTDGEPADIDSRDGRILIEDARKAVTELDQKGIYAYC 716 Query: 325 I 325 I Sbjct: 717 I 717 >gi|225420217|ref|ZP_03762520.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] gi|225041138|gb|EEG51384.1| hypothetical protein CLOSTASPAR_06560 [Clostridium asparagiforme DSM 15981] Length = 231 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 63/188 (33%), Gaps = 18/188 (9%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++++D S SM I+ + + + V +++F++ +E Sbjct: 26 VLLVDTSGSMAG------ASINELNQGLLEFGNALDQDEHARGVADVCVISFNSNVETVV 79 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH--CNTEDANYKKIIV 292 ++ + G T+ + A D R+ + Y+ + Sbjct: 80 PFCPAANYSAPTLSA----GGLTS----MNEAVIAGLDAIEERKQLYRQLGCSYYRPWMF 131 Query: 293 FMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASPNS-FYLVEN 350 +TDGE + ++ +A + + +GI ++ L++ + L + Sbjct: 132 LLTDGEPTDQNMEGEAKNRLQQALNDKKVNFFPMGIGSGANYAHLKSYTKGGNGAVLKAS 191 Query: 351 PHSMYDAF 358 +AF Sbjct: 192 ASQFKEAF 199 >gi|268578717|ref|XP_002644341.1| Hypothetical protein CBG14163 [Caenorhabditis briggsae] gi|187027709|emb|CAP32779.1| hypothetical protein CBG_14163 [Caenorhabditis briggsae AF16] Length = 2222 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 35/215 (16%), Positives = 74/215 (34%), Gaps = 21/215 (9%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S ++ + D++ ++D S+S + T D K I + + +V Sbjct: 2009 SPSPSTAPGQQFDIVFMIDGSQS-------AQTSFDTFTKFIQTFMINYDVGISGAHV-- 2059 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-----YN-QIFDMQ 274 GLV + IE+ ++ + + +S G+ ++ +A YN Q+ Sbjct: 2060 -GLVVVAPDIEDQAPPAANLNSITSQASLISNLGLLRDNYADFDHAGQVLTYNLQVVSSA 2118 Query: 275 GMRQHCNTEDANYKKI-IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRS 332 N +V++T + T + + K+ G I IG + + Sbjct: 2119 EYMSPTAGYRTNINNHLLVYITTTTSFYTDPTPSAQTIIAQ-KQYGII--TIGYGPTVDA 2175 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + L + + S+ + I + I T Sbjct: 2176 GKLLTISGGAACSFTANDVASLNNQIKPIQQLIAT 2210 >gi|323650180|gb|ADX97176.1| inter-alpha (globulin) inhibitor h4 isoform 1 [Perca flavescens] Length = 354 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 62/179 (34%), Gaps = 20/179 (11%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 +S + +++ V+D S SM S K+ ++ +L ++ GL Sbjct: 72 PSSLSRIPKNVVFVIDQSGSM------SGRKMQQTRIALIHILNDLAEDDHF------GL 119 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +TF + I W +Q K L + ++G R Sbjct: 120 ITFDSSIFH-----WKRELVQANKKNLESAKTFARNIQDRGTTDINAAVLEGARMLNAHP 174 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIRVIRSHEFLRACA 340 II+ +TDG+ S + + + + N + +Y +G + EFL + Sbjct: 175 REGSASIIILLTDGDPTSGETNLERIQS-NVRRDIADKFPLYCLGFGHDVNFEFLEKMS 232 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 66/153 (43%), Gaps = 24/153 (15%) Query: 174 MMIVLDVSRSME-----SFFDSSITKIDMAIKSINAMLE--------EVKLIPDVNNVV- 219 + +VLD S SM + ++I++ +L+ +V+L+P NNV+ Sbjct: 251 ISLVLDNSGSMHTRDLKDSHGNKESRINILKVETGKLLKLLSTNKAADVELVPFDNNVLV 310 Query: 220 ---QSGLV---TFSNKIEEFFLLEWGVSHLQRKIK-YLSKFGV--STNSTPGLKYAYNQI 270 + G TF + +E+ G + + K+K + G TN+ GL+YA+ I Sbjct: 311 RSDRKGGYIKPTFYSASKEYREKIIGSNVYEGKLKESMDSLGAYSGTNTGEGLRYAFYSI 370 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + + +++ ++ + DGE+ + Sbjct: 371 DEKNNDLL-RERPEEHFRDYLIILVDGESNAAT 402 >gi|148710198|gb|EDL42144.1| RIKEN cDNA 6330505F04, isoform CRA_a [Mus musculus] gi|148710199|gb|EDL42145.1| RIKEN cDNA 6330505F04, isoform CRA_a [Mus musculus] Length = 898 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|148710200|gb|EDL42146.1| RIKEN cDNA 6330505F04, isoform CRA_b [Mus musculus] Length = 863 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 6 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 63 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 64 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 122 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 123 EPSILITITDG 133 >gi|26327743|dbj|BAC27615.1| unnamed protein product [Mus musculus] gi|29881543|gb|AAH51161.1| Ddx26b protein [Mus musculus] Length = 444 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|73985485|ref|XP_533794.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H1 precursor (ITI heavy chain H1) (Inter-alpha-inhibitor heavy chain 1) (Inter-alpha-trypsin inhibitor complex component III) (Serum-derived hyaluronan-associated protein) (SHAP) [Canis familiaris] Length = 910 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 74/195 (37%), Gaps = 26/195 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-- 231 ++ V+D+S SME K+ +++ +L +++ + LV F ++++ Sbjct: 292 VVFVIDISTSMEG------QKVKQTKEALLKILGDMRPGDYFD------LVLFGSEVQSW 339 Query: 232 EFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L++ ++L+ ++ F +TN GL + Q +N+ Sbjct: 340 KGSLVQASPANLRAAQDFVKHFFLAGATNLNGGLLRGIEILNQAQKNLPKL----SNHAS 395 Query: 290 IIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS---- 344 +++ +TDGE + Q L A + +Y +G FL + N+ Sbjct: 396 VLIMLTDGEPTEGVTDRSQILKNVRNAIRGKFPLYNLGFGDNVDFNFLDVMSMENNGRAQ 455 Query: 345 -FYLVENPHSMYDAF 358 Y + F Sbjct: 456 RIYEDHDAAQQLQGF 470 >gi|32476038|ref|NP_869032.1| hypothetical protein RB9710 [Rhodopirellula baltica SH 1] gi|32446582|emb|CAD76417.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 1631 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 40/282 (14%), Positives = 83/282 (29%), Gaps = 65/282 (23%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAIS--RYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 D ++ S+ P N +++ + F + +S Sbjct: 1000 TSDAQQAVSIHATDAPTNVELTLAVDQKRPSLAEVNFRGNRYQSSVMSSGTAFGVTSRGQ 1059 Query: 165 NSQTDARLDM----------MIVLDVSRSMESFFDSSI----------TKIDMAIKSINA 204 Q A + + V+D S SM + +K + A ++ Sbjct: 1060 TVQDGAEITIRDAMDAGRAITFVMDCSASMNDPLGEEMGRSALGAQRASKFEAARSAVYE 1119 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKI-----EEFFLLEWG---VSHLQRKIKYLSKFGVS 256 M+ ++ P Q GLV + +++ + G + LQ++ F + Sbjct: 1120 MMRRLQPGPS-----QVGLVLYGHRMAIRAGDPAKDSGDGSGQTTLLQKRYHKRFPFPPT 1174 Query: 257 ----TNSTPGLKYAYNQIFDMQGMRQHCNTE----------------------DANYKKI 290 + L +++ RQH + +K Sbjct: 1175 IQPFEDVEVALPTGRFDTAELELARQHFDAAVPWGQTPLYLSIWKAMEDISRTGDGVRKD 1234 Query: 291 IVFMTDGENLSTKEDQQSL----YYCNEAKKRGAIVYAIGIR 328 +V ++DG N +++ +AK G V+ IG Sbjct: 1235 VVVISDGRNYQFNPTPEAIFSIGQLVTQAKTLGVQVHVIGYG 1276 >gi|27370148|ref|NP_766367.1| protein DDX26B [Mus musculus] gi|81896823|sp|Q8BND4|DX26B_MOUSE RecName: Full=Protein DDX26B gi|26350879|dbj|BAC39076.1| unnamed protein product [Mus musculus] gi|44890643|gb|AAH66792.1| Ddx26b protein [Mus musculus] Length = 861 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 51/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|294502680|ref|YP_003566742.1| hypothetical protein YPZ3_0570 [Yersinia pestis Z176003] gi|262360710|gb|ACY57431.1| hypothetical protein YPD4_0522 [Yersinia pestis D106004] gi|262364657|gb|ACY61214.1| hypothetical protein YPD8_0524 [Yersinia pestis D182038] gi|294353139|gb|ADE63480.1| hypothetical protein YPZ3_0570 [Yersinia pestis Z176003] Length = 205 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 56/167 (33%), Gaps = 14/167 (8%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +++D S SM I I AM+ ++ P V ++T+ N+ E+ Sbjct: 1 MLIDTSGSMRG------ESIHAVNVGIQAMMSALRQDPYALESVHLSIITYDNQAREYIP 54 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF-M 294 L + + T + L+ I + Q + + + +VF M Sbjct: 55 LTALENF--QFTDITVPSAGGTFTGAALEC---LIHCVDRDIQRSDGDQKGDWRPLVFLM 109 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 TD +++ K+ + A + HE L+ S Sbjct: 110 TDSTPSDVYAYGEAIK--EVKKRAFGSIIACAVGAKAKHEHLKQLTS 154 >gi|257094221|ref|YP_003167862.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046745|gb|ACV35933.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 234 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 66/181 (36%), Gaps = 28/181 (15%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF-- 233 + +D+S SM + + + ++++ +L D + TF+ K Sbjct: 29 LCIDISGSMHNEYQDG-----LVQETVDRLLAVAGTFDDNGE---MEVWTFNQKYSTPPV 80 Query: 234 FLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQ-----------GMRQH 279 +++++ I + K+G TN P + + F + + Sbjct: 81 ARAADFGNYVKKAILDNPDVDKWG-GTNYAPVMNAVLDTYFRGERVTSGGFLGMFKKTEQ 139 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 D + + +F+TDG+N E ++ L EA++ IGI F++ Sbjct: 140 RAPRDIHIPALCLFITDGDNTDEAEAEKILR---EARQHQIYWQLIGIGTDTDFRFIKKM 196 Query: 340 A 340 A Sbjct: 197 A 197 >gi|238064373|ref|ZP_04609082.1| conserved hypothetical protein [Micromonospora sp. ATCC 39149] gi|237886184|gb|EEP75012.1| conserved hypothetical protein [Micromonospora sp. ATCC 39149] Length = 483 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 34/199 (17%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ----TDARLDMMIVLDVSRSM 184 +S+ + + + F+ H ++ +T+ + + +R ++V+D S SM Sbjct: 7 VSSALDFDLRIDHDRFLSTRDRQLHAILTVTARAREDGGPAQADGSRYAEVLVVDCSGSM 66 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGV---- 240 TK+ A ++ A + + + V+ +V +++ + G+ Sbjct: 67 AQ----PPTKLGAARRATAAAVGMLP------DGVRFAVVEGTHEARMVYPRHRGLATAS 116 Query: 241 ----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 +R++ L+ G T L A + + +H + +TD Sbjct: 117 PATREEARRELGRLAAAG-GTAIGTWLALARELLAEHPEAIRHT-----------LLLTD 164 Query: 297 GENLSTKEDQQSLYYCNEA 315 G N + + + A Sbjct: 165 GRNEHETPQRLAEVLADCA 183 >gi|260799770|ref|XP_002594857.1| hypothetical protein BRAFLDRAFT_124444 [Branchiostoma floridae] gi|229280094|gb|EEN50868.1| hypothetical protein BRAFLDRAFT_124444 [Branchiostoma floridae] Length = 2037 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 68/222 (30%), Gaps = 35/222 (15%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + + A ++ + L+ S++M + + + S ++E+V ++ Sbjct: 227 VCTRNGIVCTESATQEIGLFLNASKTMVMHMNPPVDS-QVLKASDGTVIEQV------SD 279 Query: 218 VVQSGLVTFSNK-IEEFFLLEWGVSHLQRKIKYLS-------------KFGVSTNSTPGL 263 G + + + L WG SH RK+ Y+ +T P + Sbjct: 280 FKYLGSFSNTAYDVSHRIGLAWGASHSLRKVGYIVFDCISKWLISLGLYNVDTTALIPEI 339 Query: 264 KYAYNQIFDMQGMRQHCNTEDA--------NYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 K A F R + +TDG + N A Sbjct: 340 KAAE---FTGGESRVGNAIYHLMCTANYRNGIPSAAIILTDGVAYEEHPNNLYELQSNAA 396 Query: 316 KKRGAIVYAIGIRVI---RSHEFLRACASPNSFYLVENPHSM 354 + G +YA+ I + + + V + ++ Sbjct: 397 RAMGIELYAVAIGREFLFNLNALANIANGADRVFDVYSCCAL 438 >gi|300727147|ref|ZP_07060566.1| von Willebrand factor type A domain protein [Prevotella bryantii B14] gi|299775691|gb|EFI72282.1| von Willebrand factor type A domain protein [Prevotella bryantii B14] Length = 289 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 45/102 (44%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLEFGTSRQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I+ + + T+ ++Y Sbjct: 126 SDRIEKYIPPQKGKKHILYLIREMLDFHPQSLRTDIGAAMEY 167 >gi|148692918|gb|EDL24865.1| RIKEN cDNA 1700112N15 [Mus musculus] Length = 492 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 37/83 (44%), Gaps = 4/83 (4%) Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 +I+ +TDG L K ++ +A++ GAIVY +G+ + + + P+ + V+ Sbjct: 25 VIIALTDGL-LLLKPYLDTMEEAKKARRMGAIVYTVGVFMYSKQQLVNIAGDPDRCFGVD 83 Query: 350 NPHSMYDAFSHIGKDIVTKRIWY 372 + A + + +K Sbjct: 84 ---EGFSALEGVVDPLTSKSCTE 103 >gi|167522495|ref|XP_001745585.1| hypothetical protein [Monosiga brevicollis MX1] gi|163775934|gb|EDQ89556.1| predicted protein [Monosiga brevicollis MX1] Length = 5349 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 70/172 (40%), Gaps = 13/172 (7%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +M+ +D S+SM + ++ A++++ + +K + G+V F Sbjct: 5134 SKRDYQIMVAIDNSKSMRLYHSRTM-----ALEALCGLTNAMKTLEAGE----LGVVGFG 5184 Query: 228 NKIEEFFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ E +S + + ++ L +T+ L Q+ + R+ ++ Sbjct: 5185 RRVRSVHDFEAPLSDMAGAQWLQQLDFHDETTDMGALLSECCEQL-EQAATRKQRTQQNM 5243 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFL 336 +++ ++D +N+ +Q E ++RG +V+ I + H L Sbjct: 5244 AVSQLLFILSDSDNIYQGGKEQVERRLQELRQRGVFVVFVIFDSPEKEHSVL 5295 >gi|22298956|ref|NP_682203.1| magnesium-chelatase subunit [Thermosynechococcus elongatus BP-1] gi|22295137|dbj|BAC08965.1| magnesium-chelatase subunit [Thermosynechococcus elongatus BP-1] Length = 668 Score = 40.9 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 46/308 (14%), Positives = 93/308 (30%), Gaps = 55/308 (17%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 +G + L + R + + S ++ I+ + Sbjct: 382 ADGTVLDESVLYFAQLTQRQGKSGSRSLIYSQDRGRYIRP-----------ILPRGRGGR 430 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 ++ A R P + + R IV +K + L ++ V+D S SM Sbjct: 431 IAVDATLRSAAPYQ-KSRRLRQPQRRVIVENSDLRIKRLVRKAGAL-IIFVVDASGSM-- 486 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHLQR 245 ++ +++ A ++ +L + N L+ F + E + +R Sbjct: 487 ----ALNRMNSAKGAVLRLLTQA-----YQNRDMVALIPFRGEQAEVLLPPTRSIEAARR 537 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL----- 300 +++ L G S + GL A + RQ +IV +TDG Sbjct: 538 RLETLPCGGGSPLAH-GLTQAVRVGLN---ARQSGEIGQV----MIVAITDGRGNVPLSR 589 Query: 301 --------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRSHEFLRACASPNSF 345 TK D ++ L + + G + I + E + + Sbjct: 590 SLGETLPPQTKPDVKAEVLEIAAKIQALGLKLLVIDTENKFVSTGMGKELAQQAG--GKY 647 Query: 346 YLVENPHS 353 Y + Sbjct: 648 YHLPKATD 655 >gi|208780563|ref|ZP_03247902.1| TPR domain protein [Francisella novicida FTG] gi|208743538|gb|EDZ89843.1| TPR domain protein [Francisella novicida FTG] Length = 332 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPSPQAISQAKQLAQQGIKTDVYAIG 226 >gi|156546338|ref|XP_001606550.1| PREDICTED: similar to ENSANGP00000021218 [Nasonia vitripennis] Length = 1230 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 44/358 (12%), Positives = 117/358 (32%), Gaps = 44/358 (12%) Query: 32 LVLGMIIEVSHIFFMKT----VLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK 87 LV + + + K + + + S + A+T +++ + Sbjct: 70 LVAQIAQNIEGMMNSKVDAIKRVMDVAESSALTASTSEAPPEPPDDKYFYRDAKNYTHFN 129 Query: 88 NTWNMSFRNELRDNGF------------------VNDIDDIVRSTSLDIVVVPQNEGYSI 129 + + F ++ N + +++ + + + P Sbjct: 130 TSLSEHFGGDVNLNFSAVHVPTTVYGRAKEVLRAIRWSEELDNTFKNNYLQDPSLSWQYF 189 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSMESFF 188 + + + + P ++ + + + + D++I++D S SM Sbjct: 190 GSSTGFMRQYPAINWKPNGSDPHDPDLFDCRTRSWYIEAATSPKDVLILVDTSGSMTGM- 248 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 + ++A +N +L+ + VN V S + + L++ +++++ Sbjct: 249 -----RKEIARHVVNNILDTLGNNDYVNIVKFSNVTELAVPCFGDTLVQANLANIRELKN 303 Query: 249 YLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 +S+ N + L YA+ + + + MR+ A I+ +TDG + KE Sbjct: 304 GISEMNTERIANFSMILTYAFELLEEFREMRRGACCNQA-----IMLVTDGVPDNYKEIF 358 Query: 307 QSLYYCNEAKKRG-----AIVYAIGIRVIRS---HEFLRACASPNSFYLVENPHSMYD 356 Q + + ++ I + ACA+ F + + + Sbjct: 359 QRYNWASNPDNPDQADMPVRIFTYLIGREVADVRDSRWMACANRGYFVHLSTLAEVRE 416 >gi|302510673|ref|XP_003017288.1| hypothetical protein ARB_04167 [Arthroderma benhamiae CBS 112371] gi|291180859|gb|EFE36643.1| hypothetical protein ARB_04167 [Arthroderma benhamiae CBS 112371] Length = 1055 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 88/267 (32%), Gaps = 38/267 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N + + W + +LR+ G + R V + + Y I Sbjct: 430 SVNELPRFHLRFSTLHQLDIWKRAL-GDLRNGGV------VTRPDPGVEVPASEEDDYRI 482 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK---VNSQTDARLDMMIVLDVSRSMES 186 S + + + +S + TS + S LD+++V+ VS SM+ Sbjct: 483 SKTPSKRESSGYSSSYGAGKSSNTAMTDYTSPGREPMPVSAFHVPLDIVVVIPVSSSMQG 542 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEW-GV 240 KI + ++ +++ + + GLVTF + + W G Sbjct: 543 L------KISLLRDTLRFLVQNLGPRD------RMGLVTFGSSGGGVPVVGMTSKTWNGW 590 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + I+ + + + + G A + + + N I+ ++D + Sbjct: 591 GKIFNSIRPVGQKSLRADVVEGANVAMDLLMQRK---------SFNPVSTILLISDSQTT 641 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + AK G + +G+ Sbjct: 642 DPESGDFVVSRAEAAK-VGIHTFGLGL 667 >gi|261878616|ref|NP_001159907.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] gi|261878618|ref|NP_001159908.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform c [Homo sapiens] Length = 623 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 5 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 52 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 53 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 107 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 108 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 167 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 168 QRIYEDHDATQQLQGF 183 >gi|261878614|ref|NP_001159906.1| inter-alpha-trypsin inhibitor heavy chain H1 isoform b [Homo sapiens] Length = 769 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 151 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 198 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 199 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 253 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 254 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 314 QRIYEDHDATQQLQGF 329 >gi|221044458|dbj|BAH13906.1| unnamed protein product [Homo sapiens] Length = 623 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 5 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 52 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 53 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 107 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 108 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 167 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 168 QRIYEDHDATQQLQGF 183 >gi|221042234|dbj|BAH12794.1| unnamed protein product [Homo sapiens] Length = 769 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 151 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 198 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 199 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 253 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 254 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 314 QRIYEDHDATQQLQGF 329 >gi|203284093|ref|YP_002221833.1| hypothetical protein BDU_171 [Borrelia duttonii Ly] gi|201083536|gb|ACH93127.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 329 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 26/205 (12%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIP 213 ++ ++ K L + V+D+SRSM +F + I +++ A I+ +L Sbjct: 71 ILDVSWGQKATEDERVNLRISFVVDISRSMLTFDEGKSINRLESAKNLISLILNSF---- 126 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V+ L F K S L + + Y+ + S+PG Sbjct: 127 ---ENVEYSLTIFKGKSLLVLPFSKDKSSLYKMLNYIEPNLI---SSPG------SFLGD 174 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + +D++Y +V +TDG+ + R + + IGI R Sbjct: 175 GVLSAIQGIKDSSYYNFLVVLTDGDEWGENNYYTFSKLIDV---RNIVSFVIGIGSNRPL 231 Query: 334 EFLRACASPNSFYLVENPHSMYDAF 358 + ++ V++ + F Sbjct: 232 PLV------DNHVNVKDKNDSVMTF 250 >gi|156373979|ref|XP_001629587.1| predicted protein [Nematostella vectensis] gi|156216590|gb|EDO37524.1| predicted protein [Nematostella vectensis] Length = 417 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 52/159 (32%), Gaps = 26/159 (16%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIK 200 + +T I++ T S+ +SQ + ++D S SM + ++ Sbjct: 263 PTNLVEEFTKKPSIMINYTPSLTSSSQIAEFI---FLVDRSGSMSGKHIFQVKEM----- 314 Query: 201 SINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGVSTN 258 ++ +K +P G ++ + + + H ++ + TN Sbjct: 315 ----LILFLKSLPANCYFNLIGFGSYYRSVYQETQIYDEETAEHACNYVQKMRADLGGTN 370 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 L++ +NQ + + +TDG Sbjct: 371 LILPLEFIFNQ------------PPKKGIPRFVFMLTDG 397 >gi|332817004|ref|XP_003339088.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] gi|332817006|ref|XP_516520.3| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 6 [Pan troglodytes] Length = 623 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 5 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 52 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 53 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 107 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 108 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 167 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 168 QRIYEDRDATQQLQGF 183 >gi|332817002|ref|XP_003309880.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 [Pan troglodytes] Length = 769 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 151 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 198 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 199 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 253 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 254 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 313 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 314 QRIYEDRDATQQLQGF 329 >gi|316932039|ref|YP_004107021.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] gi|315599753|gb|ADU42288.1| von Willebrand factor type A [Rhodopseudomonas palustris DX-1] Length = 372 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 61/172 (35%), Gaps = 25/172 (14%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 +++ VLD + SM + + KI +I L + + ++ GLV + Sbjct: 27 ARPTVEVAFVLDTTGSMSGLIEGAKRKIWSIATAI--------LDDNPDAEIRMGLVAYR 78 Query: 227 ---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHC 280 + + L + L ++ L G + L A N++ QG Sbjct: 79 DIGDDYVVRSVDLTTDIQDLYGQLLQLQARGGGDWPESVNEALDTAINKLQWRQGG---- 134 Query: 281 NTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + ++I+ + D + +D + A+++ IV A+ R Sbjct: 135 -----DSRRIVFVVGDAPPHMDYAQDTKYPETLAVARQKDIIVNAVQAGSAR 181 >gi|219126324|ref|XP_002183410.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405166|gb|EEC45110.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 969 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 60/206 (29%), Gaps = 23/206 (11%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 +K W+ +V + S + A +D++ ++D S SM +I Sbjct: 1 MKVFRPATWFLYGAALVANLVS-----VKGQAPVDLVFIIDESGSMGD----DQAQIANR 51 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-- 256 I A L+ +V G F + L S + L G + Sbjct: 52 ANQITAALDSATAGNFRVGLVGYGASAFGGFPRKVGTLTDDASMFGAAVASLETSGGTEP 111 Query: 257 -------TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 T L + + G + V +TD + + +L Sbjct: 112 GFVATELTAEDSLLYTTTSDSTKLDGSSAGTSFPGPAGF-CAVLITDEPSNG--DGATTL 168 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEF 335 A + ++ V+ S++ Sbjct: 169 ADAKTALDNASN--SVFFGVVPSNQL 192 >gi|146303120|ref|YP_001190436.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145701370|gb|ABP94512.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 394 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 36/242 (14%), Positives = 78/242 (32%), Gaps = 61/242 (25%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM----------------ESFFDSSI-- 192 + + +++ LD+ IVLDVS SM + D + Sbjct: 16 NGELKFAFRATIVPERVKPVPLDLFIVLDVSGSMGIIDNPPEVDDSLIAGTAEVDGHVVR 75 Query: 193 ---------TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE---EFFLLEWGV 240 ++++A+++I +LE + ++TFS+ + + Sbjct: 76 YLKDDIGVNNRLEVALEAIRNLLENADTS------TRVTIITFSDHVNVLCRRVTPSTAL 129 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 HL+ + +T +K A + I +H ++ +TDG Sbjct: 130 EHLEEIVP-----DGNTALYSAVKKAISLI------DEHPAR--------VLLITDGYPT 170 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 +++ + + + G+ + LR+ A S FY V + + Sbjct: 171 DVEDETEYSKLEVPRFSQFIPI---GVGEYNAK-ILRSLADLSNGRFYHVNDVSEISRIM 226 Query: 359 SH 360 Sbjct: 227 EE 228 >gi|2944425|gb|AAC05284.1| complement factor C2 [Mus musculus] Length = 470 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 52/161 (32%), Gaps = 19/161 (11%) Query: 219 VQSGLVTFSNKIEEF-FLLEWGVSHLQRKIKYLSK-------FGVSTNSTPGLKYAYNQI 270 V ++TF+++ + +L + I L TN+ L Y+ + Sbjct: 8 VSVAIITFASQPKTIMSILSERSQDVTEVITSLDSASYKDHENATGTNTYEVLIRVYSMM 67 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY-----CNEAKKRGAIVYA 324 + + I+ +TDG N+ + + + +YA Sbjct: 68 QSQMDRLGMETSAWKEIRHTIILLTDGKSNMGDSPKKAVTRIRELLSIEQNRDDYLDIYA 127 Query: 325 IGIRVIRS-----HEFLRACASPNSFYLVENPHSMYDAFSH 360 IG+ + +E +++++ ++ F H Sbjct: 128 IGVGKLDVDWKELNELGSKKDGERHAFILQDAKALQQIFEH 168 >gi|302392065|ref|YP_003827885.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] gi|302204142|gb|ADL12820.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] Length = 230 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 58/168 (34%), Gaps = 30/168 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + I+ DVS SM + K +++ A+ + V +IPD V GL TF + Sbjct: 50 VEIIWDVSGSMW----GKVEKNKKYLRAKKALKDIVMMIPD---HVNIGLRTFGKGEDSD 102 Query: 234 FLL-------EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S L KI L G S L A + ++ G Sbjct: 103 RSTNLVVNIATNNKSTLLAKINNLKPAGKS-PIGKALSQAGVDLINLDGNNH-------- 153 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ +TDG + + + K G ++ + I + + + Sbjct: 154 ----ILLVTDGRDTG---NIVPSRVADRLSKNGIYIHVLKIGAVGNQQ 194 >gi|312886234|ref|ZP_07745848.1| protein of unknown function DUF58 [Mucilaginibacter paludis DSM 18603] gi|311301259|gb|EFQ78314.1| protein of unknown function DUF58 [Mucilaginibacter paludis DSM 18603] Length = 291 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 45/97 (46%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +MI++DVS S + I+ + + L V + N + G++ FS+KIE Sbjct: 77 LTVMILMDVSGSE------NFGTINQQKQDVATELCAVLAFSAIQNNDKVGVIFFSDKIE 130 Query: 232 EFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 +F + G SH+ I+ L + TN LKY Sbjct: 131 KFIPPKKGRSHILMIIRELINFTPENKGTNVAEALKY 167 >gi|297559546|ref|YP_003678520.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843994|gb|ADH66014.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 587 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 78/239 (32%), Gaps = 38/239 (15%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDV 215 IT ++ +Q ++ ++D+S SM + +T++ + + LE + Sbjct: 359 SITGLIRTWNQLKMDSRVLAIVDISGSMLAEVPGTGMTRMQVTSAAATQGLEMFTPSSE- 417 Query: 216 NNVVQSGLVTFSNKIEEF-------------FLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 GL FS + + G +H L+ Sbjct: 418 -----LGLWEFSTNVNNELHYQEIAPIRELQAAADDGTAHRDVLAGALASLQPLPQGDTA 472 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAK-KR 318 L Y + Q M + + N I+ +TDG+N + D+ + A R Sbjct: 473 LYETY--LAAYQEMSRTYQPDRTNV---ILMLTDGDNDNPGGLGLDELMSQIESLASPSR 527 Query: 319 GAIVYAIGIRVIRSHEF--LR--ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + I L+ A A+ + Y+ E+P + + F + RI D Sbjct: 528 PIPIITIAFG-PDVQNLEPLQEIAAATGGAAYMTEDPTEIGEIFL----QAFSLRISED 581 >gi|146307338|ref|YP_001187803.1| protein kinase [Pseudomonas mendocina ymp] gi|145575539|gb|ABP85071.1| protein kinase [Pseudomonas mendocina ymp] Length = 1018 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 66/181 (36%), Gaps = 25/181 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 R +++V+D S SM+ + ID ++ ++ + +++ D++N V GLV F Sbjct: 595 DKFRTGIVLVVDTSVSMQPY-------IDRVVRVVDELHKQIASRGDLDN-VSFGLVGFR 646 Query: 228 NKIEEFFLLEWGV---------SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + +++ LE+ S QR + + ++ S+ F Sbjct: 647 SNVDKTPGLEYVTRTLVTLEEGSDPQRFVDLARQVKATSVSSHAFN---EDAFAGVMEAV 703 Query: 279 HCNTEDANYKKIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 ++I+ +TD + A ++ +YA+ +R Sbjct: 704 EGMDWTPYSGRLILLVTDAGALRKNDPLSRTQMNEAEVREAALRKQIKIYALHLRTPIGR 763 Query: 334 E 334 + Sbjct: 764 D 764 >gi|7496323|pir||T33112 hypothetical protein C18H7.1 - Caenorhabditis elegans Length = 240 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 14/92 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L+++IVLD S S++ F+ S K + + EE+++ P + L+ +S Sbjct: 143 LNVLIVLDRSDSVKGGFNKS-------RKFVTDVSEELQIGP---EKHRVALIVYSGLSY 192 Query: 232 EFFLLEWGVSHLQ----RKIKYLSKFGVSTNS 259 ++ W + RK+ L G +TN+ Sbjct: 193 RREVMPWNFAKSNEEFVRKVNALRAIGGTTNT 224 >gi|125973851|ref|YP_001037761.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|256004304|ref|ZP_05429286.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281418012|ref|ZP_06249032.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|125714076|gb|ABN52568.1| hypothetical protein Cthe_1336 [Clostridium thermocellum ATCC 27405] gi|255991738|gb|EEU01838.1| hypothetical protein ClothDRAFT_1147 [Clostridium thermocellum DSM 2360] gi|281409414|gb|EFB39672.1| hypothetical protein Cther_2696 [Clostridium thermocellum JW20] gi|316939952|gb|ADU73986.1| hypothetical protein Clo1313_0918 [Clostridium thermocellum DSM 1313] Length = 233 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 28/56 (50%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 N + + KG T+ AI +P + L + +I VS + K+ + ++ DR+ A Sbjct: 9 NCKGLVRDRKGSFTVEAAIIIPAVILTMFALILVSEFLYQKSCIQAIADRTAQRGA 64 >gi|313220910|emb|CBY31745.1| unnamed protein product [Oikopleura dioica] Length = 1264 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 51/172 (29%), Gaps = 24/172 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ LD S SM ++ M ++ + V G+++F+ Sbjct: 296 RPPRRIVLCLDASTSMLEG-----NRLSMVRNAVRQFIYTVSSG------TYVGIISFNR 344 Query: 229 KIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L E + L ++ T+ + N + Sbjct: 345 MSTRRQDLIEIIDEDERAFLVSRLPKDESVASGTSIGKAILRGINMLQQSGPEYLRKMDG 404 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 I+ ++DG +T ++++ K G V I + ++ Sbjct: 405 GE-----ILLLSDGLEWNTPSVEETMEL---VKLSGVRVNTISLGNSDTNVL 448 >gi|296136111|ref|YP_003643353.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295796233|gb|ADG31023.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 753 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 66/204 (32%), Gaps = 30/204 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + + ++LD+S S+ S I + ++L I + + + Sbjct: 555 SHRTNGRDIAVTLLLDLSASLNDKVKGSEQTILELSQEAVSLLAWA--IERLGDPLAIAG 612 Query: 224 VTFSNKIE------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + + E + F WG ++ ++ + ST L++A Sbjct: 613 FQSNTRHEVRYLHIKGFGEPWG-DEVKARLAAMQAN-YSTRMGAALRHA----------- 659 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 H + KK+++ +TDGE + L +A++ +R + Sbjct: 660 AHSLRARKSDKKLLLVLTDGEPADIDVQDERLLI-EDARQ--------AVRELERDGLFT 710 Query: 338 ACASPNSFYLVENPHSMYDAFSHI 361 C S + ++ I Sbjct: 711 YCISLDPRADAYVSDIFGRRYTVI 734 >gi|237724723|ref|ZP_04555204.1| conserved hypothetical protein [Bacteroides sp. D4] gi|229436918|gb|EEO46995.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] Length = 516 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 41/249 (16%), Positives = 91/249 (36%), Gaps = 34/249 (13%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS------SVKVNSQTD 169 ++ Y+ ++ R P++ + Y S ++ +T+ S N Sbjct: 287 VEFSNTGNKSSYTYPSVWRIGDPIEELDLMLTYMTSPRLIPGMTTKKWEYVSNDDNGTES 346 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++D+++V+D S SM S S + A+ + +L + + + FS+K Sbjct: 347 KQMDLLLVVDTSGSMGSAMKESAN-MHQAVLASYGILSYFESTKS-----KVAFIGFSDK 400 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 I+ + ++ ++ TN G K+ ++I +R Sbjct: 401 IDAYVDWSDKYDDVRERL--------LTNGHGGTKFPISRIKSTLDVRSR--------DL 444 Query: 290 IIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVY-AIGIRVIRSHEFLRACASPNSFYL 347 + V +TDG+ + Y N+ ++ +G + + S+E L+ + Y Sbjct: 445 VTVLITDGDLGNINESVSYFRDYLND--DNKLYIFLLVGSKSLHSYEPLKNIGA--KIYN 500 Query: 348 VENPHSMYD 356 N + D Sbjct: 501 ANNANEFCD 509 >gi|6664313|gb|AAF22895.1|AC006932_12 T27G7.20 [Arabidopsis thaliana] Length = 778 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 59/179 (32%), Gaps = 18/179 (10%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + + +R + + T A ++ V+D S SM Sbjct: 526 KRLAVDATLRAAAPYQKLRREKDISGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 585 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ + A L+ + + V+ ++ F + E ++ Sbjct: 586 A---------LNRMQNAKGAALKLLAESYTSRDQVKVSIIPFRGDAAEVLLPPSRSIAMA 636 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + +++ L G S + + + + + +IV +TDG T Sbjct: 637 RNRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANIT 687 >gi|326680043|ref|XP_003201437.1| PREDICTED: von Willebrand factor A domain-containing protein 5A-like [Danio rerio] Length = 787 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 32/179 (17%), Positives = 62/179 (34%), Gaps = 30/179 (16%) Query: 174 MMIVLDVSRSMESFFDSSIT---KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + V+D S SM+ +I+ A ++ +L+ + + N F ++ Sbjct: 283 FVFVMDRSGSMDWTMHRGKEAQHRIESAKDTLLLLLKSLPMGCYFNIY------GFGSRF 336 Query: 231 EEFFLLE--WGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 E FF + + +K+K + T L + Y+Q Sbjct: 337 ESFFPKSVEYNQDKMDQALKKVKEMRADMGGTEILQPLAHIYSQ------------PCIP 384 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + + + TDGE +TKE + + A+ ++ GI S + A S Sbjct: 385 EHPRQLFIFTDGEVGNTKEVLDLVK--SHARSH--RCFSFGIGEGASTALITGLAREGS 439 >gi|254881904|ref|ZP_05254614.1| tellurium resistance protein [Bacteroides sp. 4_3_47FAA] gi|254834697|gb|EET15006.1| tellurium resistance protein [Bacteroides sp. 4_3_47FAA] Length = 212 Score = 40.9 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 60/184 (32%), Gaps = 16/184 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + I++D S SM I+ ++ M+ ++L P ++T+ + Sbjct: 4 RLPVYILIDTSGSM------KGEPIESVKVGLSDMIASLRLDPYALETACISIITYDRDV 57 Query: 231 EEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L ++ + T++ L + M D Sbjct: 58 RQILPL---TELEDLQLPEIECPDSGPTHTGAALNMLCDCYDSEVNMGSKEQKGDWMPLL 114 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 I +TDG+ + +++ + + + A E L+ ++ + ++ Sbjct: 115 FI--LTDGKPSDLQVYDEAIQKVKQ--HQFTNIVACAAGPKAKTEPLKKLT--DNVFTLD 168 Query: 350 NPHS 353 S Sbjct: 169 TMDS 172 >gi|311693026|gb|ADP95899.1| von Willebrand factor, type A-like protein [marine bacterium HP15] Length = 590 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 66/202 (32%), Gaps = 33/202 (16%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ + + ++LD S SM + ++A ++ ++ + IP + Sbjct: 413 IQRDEHKRPETSVHVLLDTSGSMSQ-------RQEIANQATVSLALAISTIPKCD----I 461 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + F + ++ + L G T + YA ++ RQ Sbjct: 462 AVSMFPGCGGSVSPMIHRGQPVRPNLGRFLVSSGGGTPLAEAMLYAARELSASHKPRQ-- 519 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +++ +TDG + ++ Y + K YAIGI + Sbjct: 520 ---------VLIVITDGSPN----NGHAVNYLLDLMKHQIDTYAIGIGSNAVKSYF---- 562 Query: 341 SPNSFYLVENPHSMYDAFSHIG 362 ++ ++ + + A I Sbjct: 563 --GNWTVINDVRELQSALFRIA 582 >gi|322436269|ref|YP_004218481.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163996|gb|ADW69701.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 313 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 53/167 (31%), Gaps = 29/167 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L + +V+D S S E F I I L + L D + V F Sbjct: 74 DQLNLSIGLVIDASGSQEKF-------IKEHEHDIADFLHQTLLPGD-----RVFAVCFG 121 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL--------KYAYNQIFDMQGMRQH 279 N + +W I L + P L A N + Sbjct: 122 NHLRLVS--DW-TGDQAAIIDNLRSYDKGHRDGPELGPIEEREFGTALNDAIFYSATEKM 178 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + +K++V +DG EN S + ++ A+ +VYAI Sbjct: 179 ADVHQ--RRKVLVVFSDGEENSSEHDLVDAI---EAAQNNDVLVYAI 220 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 23/56 (41%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 + G + ++ + + ++G IE + + +T L S +D ++ + Sbjct: 22 RLRHESDGAVAVIFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNAL 77 >gi|119585664|gb|EAW65260.1| inter-alpha (globulin) inhibitor H1, isoform CRA_b [Homo sapiens] Length = 893 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|320105050|ref|YP_004180641.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752332|gb|ADV64092.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 382 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 60/176 (34%), Gaps = 7/176 (3%) Query: 137 IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID 196 + + + I + + ++ R ++++D S SM + + TK Sbjct: 132 LHETLRNAMARRPSGPPIRLERDDLMIHETERQVRCATVVLIDQSGSMGRYGKYAQTKRI 191 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 + V+ +++ G +F++++ E LL + + ++ Sbjct: 192 ALALA-----GLVQARFPQDSLEFVGFSSFAHRLSEIDLLNSAPKRVSIFDPRV-HLRIN 245 Query: 257 TNSTPGL-KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 ++ P + I + + K I+ +TDGE + E + L Sbjct: 246 LDAPPAFVPQHFTNIHAGLKLARSILARQTAATKQIIVITDGEPTAHVEGRDVLLI 301 >gi|298492778|ref|YP_003722955.1| magnesium chelatase ATPase subunit D ['Nostoc azollae' 0708] gi|298234696|gb|ADI65832.1| magnesium chelatase ATPase subunit D ['Nostoc azollae' 0708] Length = 668 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 39/257 (15%), Positives = 82/257 (31%), Gaps = 44/257 (17%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 ++ + + ++ A R P + + + + IV K + L ++ V Sbjct: 418 MIPKGKVKRIAVDATLRAAAPYQ-KARLARQPDKKVIVEQGDMRSKRLVRKAGAL-VVFV 475 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLL 236 +D S SM ++ ++ A ++ +L E N Q L+ F + E Sbjct: 476 VDASGSM------ALNRMQSAKGAVMQLLTEA-----YQNRDQVSLIPFRGEQAEVLLPP 524 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 ++ + +++ L G S + GL A + Q + +IV +TD Sbjct: 525 TRSIALAKNRLERLPCGGGSPLAH-GLTQAVRVGINAQ-------MGGDIGQVVIVAITD 576 Query: 297 GENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRSHEFL 336 G K D ++ L + G + I + + E Sbjct: 577 GRGNIPLARSLGEPQEPGEKPDIKAELLDIAGRIRAAGMQLLVIDTESKFVSTGFAKELA 636 Query: 337 RACASPNSFYLVENPHS 353 + +Y + Sbjct: 637 KTAG--GKYYHLPKATD 651 >gi|119945233|ref|YP_942913.1| hypothetical protein Ping_1500 [Psychromonas ingrahamii 37] gi|119863837|gb|ABM03314.1| hypothetical protein DUF1555 [Psychromonas ingrahamii 37] Length = 348 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 33/183 (18%), Positives = 64/183 (34%), Gaps = 22/183 (12%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + T ++ S T A LD+M + D + SM T+I+ A + N++L + Sbjct: 40 GDSVTIRKTVTITKESPTTAPLDVMFIFDTTGSMG-------TEINQAKAAANSILTGLA 92 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-Y 267 ++QSG +++ + L G + + I + S TP + YA Sbjct: 93 GF----GLLQSGSGWYNDPTFDGVRVDLNSGNTDVTSGISTVPFISGSGGDTPEMGYAGI 148 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIV-YAIG 326 + D R N + I+ D + + ++ A + Y Sbjct: 149 SDAADTASWRPGSN-------RFIIAFGDANFKTPPTEAATIASLTAANANLIGISYDGA 201 Query: 327 IRV 329 Sbjct: 202 FSA 204 >gi|225874357|ref|YP_002755816.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] gi|225791485|gb|ACO31575.1| hypothetical protein ACP_2798 [Acidobacterium capsulatum ATCC 51196] Length = 333 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 51/154 (33%), Gaps = 10/154 (6%) Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + + + + L + + + +V F+ ++ S Q+ Sbjct: 50 ETGPGQSSALHEKVHASKQFLNSLLSAHASGSAPRVFVVQFNRDVDLLEDPSASASKAQQ 109 Query: 246 KIKYL--------SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + + S + N K + ++D + H +K+IV ++DG Sbjct: 110 ALSQVGVAQFHGDSNADSTANKGRHAKASGAALYDAIYLATHNVLNTTAGRKVIVVLSDG 169 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +K A++ G VYAI + R Sbjct: 170 IDQGSKTTLNGAV--EAAQRAGVAVYAIYFKGGR 201 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 13/99 (13%), Positives = 30/99 (30%), Gaps = 14/99 (14%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R G + +AI ++ + I+ F + + + + + +E Sbjct: 8 RRVAQPVAGQSLVFSAILFFVMIAFAALAIDTGEAFSRQRQQQAASTAASIAGLESMNSE 67 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 +G + + R+ L NG N + Sbjct: 68 IDGTDG--------------AVQQAIRDALAANGITNAV 92 >gi|260892924|ref|YP_003239021.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] gi|260865065|gb|ACX52171.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] Length = 298 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 8/65 (12%), Positives = 28/65 (43%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G + +L A L + + ++++ + + L + +D + + + + +G Sbjct: 4 KSERGTVAVLVAAALTFLLGLAALVVDGGGLLLARERLVNAVDAAALAGVQFLPGDPSGA 63 Query: 73 NRKKL 77 + L Sbjct: 64 VQTAL 68 >gi|326929209|ref|XP_003210761.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like, partial [Meleagris gallopavo] Length = 1180 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 65/177 (36%), Gaps = 40/177 (22%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +++D S SM+ + ++ I E+++ N V+ L+ F+ E Sbjct: 880 VCVLIDTSGSMDPYLPHITKELTSLI------WEQLR-----KNEVRFNLLRFAENTE-- 926 Query: 234 FLLEWGVSHLQR-------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 W ++ ++++SKF N+ + A + QG+ Sbjct: 927 ---SWKEHLVEATDKTCHDAVQWVSKFHAHGNTCILM--ALQKALSFQGVEA-------- 973 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV--IRSHEFLRACAS 341 + +TDG+ T + K++ ++ I ++EFL+ AS Sbjct: 974 ----LYILTDGKP-DTSCNLILKEIERLRKQQDIKIHTISFSCVDREANEFLKKLAS 1025 >gi|302551500|ref|ZP_07303842.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] gi|302469118|gb|EFL32211.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] Length = 448 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 50/158 (31%), Gaps = 32/158 (20%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + I++D S SM D TK+ A + I + + V ++ Sbjct: 54 SPGQGPSAAVAIMVDCSGSM----DYPPTKMRNARDA------TAAAIDTLRDGVHFAVI 103 Query: 225 TFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ +E + + ++ ++ LS G T LK A + Sbjct: 104 GGTHVAKEVYPGGGSLAVADATTRAQAKQALRSLSA-GGGTAIGTWLKLADRLLSSADVA 162 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQ--QSLYYC 312 +H + +TDG N +L C Sbjct: 163 IRHG-----------ILLTDGRNEHESPQDLKAALDAC 189 >gi|226228509|ref|YP_002762615.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226091700|dbj|BAH40145.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 699 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 37/107 (34%), Gaps = 11/107 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I+LD S SM A+ S A+++ ++ + LV + + E Sbjct: 95 VVILLDRSMSMGYTGVWPR-----ALDSARAVIDRLEGKD------RVALVAYDDAAEVM 143 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L + ++ + + T P L+ A + D Sbjct: 144 QRLTDDRAAVRGSLGAVQPMRRGTRLAPALRTARQLLLDAPFAAAEV 190 >gi|145301356|ref|YP_001144196.1| hypothetical protein ASA_P4G149 [Aeromonas salmonicida subsp. salmonicida A449] gi|142856133|gb|ABO92448.1| conserved hypothetical protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 598 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 66/177 (37%), Gaps = 22/177 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + +K ++ + I++D+S SM + + D+A ++ A+ + IP Sbjct: 403 VTGDTRVFLKPEAKQRPNTAVHILVDMSSSMSYRAANGKERQDIAREASLAIAMALDAIP 462 Query: 214 DVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VN VTF K F ++ G + R ++ K +T + YA ++ Sbjct: 463 GVNP-----AVTFFGGDEKQPVFSAVKHGETVQNRAGRFGFKARGNTPMAEAIWYAAFEL 517 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + R K+++ +TDG + + C + V IG+ Sbjct: 518 TKTREER-----------KMLIVVTDGAPQNAPACSSVINLCE---RSNVEVIGIGV 560 >gi|111226678|ref|XP_642029.2| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] gi|122056765|sp|Q54Z23|INT6_DICDI RecName: Full=Integrator complex subunit 6 homolog gi|90970713|gb|EAL68149.2| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] Length = 1107 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 50/139 (35%), Gaps = 11/139 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM + +T +D + +I ++ + N + L+T Sbjct: 3 ITFVVDTSGSMSQKTTNGMTLLDCSKAAIEHFIKIRSKDASMRND-RFFLITSEENPITA 61 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIF------DMQGMRQHCNTEDA 285 + W + I+ + S G L+ +++ + + Q N Sbjct: 62 VKIGW-KDNFNSFIQEVKNLQTKDMSNLGFSLQKSFDFLNQFRVQSSIDNYGQGRNPWFI 120 Query: 286 NYKKIIVFMTDGENLSTKE 304 II+ +TDG +L+ Sbjct: 121 EPA-IIILLTDGSSLTNSS 138 >gi|32474856|ref|NP_867850.1| hypothetical protein RB7502 [Rhodopirellula baltica SH 1] gi|32445396|emb|CAD75397.1| hypothetical protein RB7502 [Rhodopirellula baltica SH 1] Length = 640 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 52/131 (39%), Gaps = 20/131 (15%) Query: 167 QTDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 Q D+ ++++V+D S SM++ ++ T++ +A+ ++ ++ Q LVT Sbjct: 83 QADSGQEVILVVDNSASMQAVEPETGQTRLQLAVDRAASLARGLRQGD------QMALVT 136 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + ++ I + T ++ A R+ + D Sbjct: 137 AGTTVRVVVGMSDFAPAIEEAIHSIQPTDGPTQIGAAIEAA----------RRLADKND- 185 Query: 286 NYKKIIVFMTD 296 ++ IV +TD Sbjct: 186 --RRRIVVITD 194 >gi|224043354|ref|XP_002197844.1| PREDICTED: integrator complex subunit 6 [Taeniopygia guttata] Length = 887 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 55/139 (39%), Gaps = 12/139 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRTHLGTTYLDIAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKE 304 II+ +TDG L+T Sbjct: 121 EPA-IIITVTDGSKLTTTS 138 >gi|118084887|ref|XP_417071.2| PREDICTED: similar to candidate tumor suppressor protein DICE1 [Gallus gallus] Length = 888 Score = 40.9 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 28/139 (20%), Positives = 55/139 (39%), Gaps = 12/139 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRTHLGTTYLDIAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAEGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGENLSTKE 304 II+ +TDG L+T Sbjct: 121 EPA-IIITVTDGSKLTTTS 138 >gi|301767172|ref|XP_002919036.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1-like [Ailuropoda melanoleuca] Length = 910 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 76/195 (38%), Gaps = 26/195 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SME K+ +++ +L +++ + LV F ++++ + Sbjct: 292 VVFVIDISTSMEG------QKVKQTKEALLKILGDIRPGDYFD------LVLFGSEVQSW 339 Query: 234 --FLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ ++L+ ++ +F +TN GL + QG +N+ Sbjct: 340 RGSLVQASPANLRAAQDFVRRFFLAGATNLNGGLLRGIEILNQAQGSLPEL----SNHAS 395 Query: 290 IIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS---- 344 I++ +TDGE + Q L A + +Y +G FL + N+ Sbjct: 396 ILIMLTDGEPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNGRAQ 455 Query: 345 -FYLVENPHSMYDAF 358 Y + F Sbjct: 456 RIYEDHDATQQLQGF 470 >gi|170031185|ref|XP_001843467.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] gi|167869243|gb|EDS32626.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] Length = 1165 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 75/190 (39%), Gaps = 14/190 (7%) Query: 173 DMMIVLDVSRSMESFFD--SSITKIDMAIKSINA-MLEEVKLIPDVNNVVQSGLVTFSNK 229 D++I+LD S SM + + + +T + N + K DV+ +V FS+ Sbjct: 221 DIVILLDNSGSMTGYRNYIAQLTVKSVLDTFSNNDFINIYKYSNDVDPLVPC----FSDI 276 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + L +K L G N A+ + + + +R+ CN + + Sbjct: 277 LVQATPE--NIRFLNEYVKELQPEGY-ANVGKAFVKAFELLQNYREIRR-CNESISGCNQ 332 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEF-LRACASPNSFY 346 I+ +TDG + E + + K V+ +G V + E AC + + Sbjct: 333 AIMLITDGVPSNITEVFEQYNWFENGTKIPVRVFTYLLGREVTKVREIQWMACLNRGHYS 392 Query: 347 LVENPHSMYD 356 +++ + + Sbjct: 393 HIQSLDEVQE 402 >gi|290971985|ref|XP_002668746.1| vWFA domain-containing protein [Naegleria gruberi] gi|284082259|gb|EFC36002.1| vWFA domain-containing protein [Naegleria gruberi] Length = 198 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 59/140 (42%), Gaps = 22/140 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++ +D++IV+D + SM +I++A +++ ++ + + ++ V++ Sbjct: 75 SENFVDLVIVMDCTGSMSG-------EIEVAKRTVTTIISTLHEKFQSD--LRFSAVSYR 125 Query: 228 NKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + +++ F ++ + I +S G + L A I +M ++ Sbjct: 126 DHTDDYAVKEFPFTKDLNKAKGYINTMSAQGGG-DHPEALASALYVINEMPFNKKG---- 180 Query: 284 DANYKKIIVFMTDGENLSTK 303 KKI+V++ D K Sbjct: 181 ----KKIVVWVADAPPHGMK 196 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 33/91 (36%), Gaps = 4/91 (4%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 N N KG ++ + + + + I+VSH KT L + +D + AT Sbjct: 16 PNRLRNQKGLTLVVMTMSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGATVA--- 72 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELR 99 N N K I+ K +E+ Sbjct: 73 -NRTNEKGDTDTAIIESYKKVIESPGNDEIE 102 >gi|118368459|ref|XP_001017436.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89299203|gb|EAR97191.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 862 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 60/208 (28%), Gaps = 33/208 (15%) Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 N + R K K + + E++ N ++ S + + + + Sbjct: 66 ENCSYRHKSKEDTTQKSCQEQITCANVEEIQKNDSS---LELKNSQVIKGLTMEDFKSSL 122 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 + KI L H + LD+ V+D+S SM S+ Sbjct: 123 VELNQDLKILLPIILEEKEKAKQMH--------------SSCSLDLCFVVDISGSMSSWM 168 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----KIEEFFLLEWGVSHLQ 244 + ++ + L + ++ + + + + + Sbjct: 169 NH-------VVQYVGNTLNTIYQQFP-AITLRVAFSGYRDINDFEQYVQVDFTRNIEDFK 220 Query: 245 RKIKYLSKFGVS----TNSTPGLKYAYN 268 ++ +S G + T GL A N Sbjct: 221 TRLSKVSCGGGGGDWPEDVTGGLNNALN 248 >gi|13541413|ref|NP_111101.1| hypothetical protein TVN0582 [Thermoplasma volcanium GSS1] gi|14324796|dbj|BAB59723.1| TVG0570508 [Thermoplasma volcanium GSS1] Length = 277 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 60/185 (32%), Gaps = 34/185 (18%) Query: 166 SQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQ 220 S+ + L + +V D SRSM S +K A +IN + E+ V + Sbjct: 7 SRRNPGLFLFLV-DQSRSMVRKIAGGSESKAKEAADAINRQIGELIMRCTKNDGVRDYFY 65 Query: 221 SGLVTF-----------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVST------------ 257 G++ + + + L ++++ + T Sbjct: 66 VGVIGYGGEKGKASYLLDDDLVPVSKLAENPIRTEKRVMKIPDGSGGTADVEYEFGVWFE 125 Query: 258 ---NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 NS + A N + + + + ++Y II+ +TDG++ + Sbjct: 126 AVANSDTPMVKAMN--MAKEAVEKWVSEHPSSYPPIIINVTDGQSTDGDPYPVARNIMEM 183 Query: 315 AKKRG 319 G Sbjct: 184 ETDDG 188 >gi|221116649|ref|XP_002154434.1| PREDICTED: similar to Ints6 protein [Hydra magnipapillata] Length = 854 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 79/236 (33%), Gaps = 39/236 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ + L+ + + + LVTF ++ + Sbjct: 4 LVFLVDTSASMNQKTCLGTTYLDLAKGAVESFLKIRARDINASRGDRYMLVTF-DEHNKS 62 Query: 234 FLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI- 290 + W ++ +++K L +G+ T+ LK A+ + +M ++ + Sbjct: 63 VKVGWRENLAQFLKELKNLEAYGL-TDLGGALKQAF-DLLNMTRLQSGIDNYGLGRNPYY 120 Query: 291 -----IVFMTDG----------------ENLSTKEDQQSLY---YCNEAKKRGAIVYAIG 326 ++ +DG N + + + + Sbjct: 121 LEPALVMLFSDGCDLINVNGITGEIIVPSNNQLPGAELTKEIYRWDQRLFALNLKIPGFA 180 Query: 327 IRVIRSHEFLRA--------CA-SPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 V L+A C + Y + + ++ + I + I + + Sbjct: 181 ASVNEKLSTLQAEDMALSNLCEDTGGKLYSIGSYKTLLQSLESIAQKIAVPGVIIN 236 >gi|118431306|ref|NP_147675.2| hypothetical protein APE_1031.1 [Aeropyrum pernix K1] gi|116062628|dbj|BAA80016.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 463 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 53/160 (33%), Gaps = 30/160 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++LD S SM KID A A+ + + F + Sbjct: 298 IYVLLDKSGSM------VGAKIDWARAVAVALFRR-----SLAENRRFSARFFDSVTYPA 346 Query: 234 FLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L L + + + G T+ T +K A + I Q + Sbjct: 347 IHLRPRSKPRDFLELVKYLAAVKA-GGGTDITAAIKTAADDISRTPRGEQRISD------ 399 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 IV +TDGE + + + + K+ A ++ + I+ Sbjct: 400 --IVLITDGE---DRLNIDVVE--DSLKRSDARLHTVIIQ 432 >gi|328869266|gb|EGG17644.1| hypothetical protein DFA_08640 [Dictyostelium fasciculatum] Length = 1236 Score = 40.5 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 77/230 (33%), Gaps = 32/230 (13%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVL-DVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + +S+ V T R M+IVL D S S+ + + +L ++ Sbjct: 122 TFDGILRTSIHVPVNTSDRKQMLIVLVDKSASIRYVWPQIQS----------TLLSLFQI 171 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS------TPGLKY 265 + + + V ++ F+ + ++W + + I+ G + + L+ Sbjct: 172 VFNHHQEVILEIIFFN---NTAYQMKWTKDNYKELIQNERPTGKTCFAATFNLCGDILRG 228 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR----GAI 321 Y + D + + D + I+F+TDG + + ++ + + K Sbjct: 229 YYKKGTDFLSDKYYDPKTDTS----IIFLTDGSHTTVRDQKAAFTSLKSLIKSITCAAIT 284 Query: 322 VYAIGIRVIRSHEFL----RACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 V+ +G L R + Y +P A + I Sbjct: 285 VHTMGFTENSRFTDLDGIRRMLGTKEGVYQYCDPSDGPHALQDKLRFIFG 334 >gi|189201073|ref|XP_001936873.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187983972|gb|EDU49460.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 933 Score = 40.5 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 73/213 (34%), Gaps = 28/213 (13%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 IP P N R ++ + + ++ +++ ++D S SM + ++ + Sbjct: 259 GIPQAILKTHPTLPNQRALMTTLVPKFNLKTEKP---EIIFIVDRSGSMSHQIPTLVSAL 315 Query: 196 DMAIKSI-NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + +KSI + + ++ + S ++ + + + Sbjct: 316 KVFLKSIPVGCMFNICSFGSSHSCLWSESKGYNQET---------LEEAINCVDAFQADM 366 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T + A F M+ +HC+TE +V +TDG + Q Y E Sbjct: 367 GGTET----LAAVQSCFKMRN--KHCSTE-------MVLLTDG--NIWAQQQLFNYIIEE 411 Query: 315 AKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 K R V+ IGI S + A + Sbjct: 412 TKSRDVRVFPIGIGGQVSSALIEGVARAGGGFA 444 >gi|91788413|ref|YP_549365.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] gi|91697638|gb|ABE44467.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] Length = 533 Score = 40.5 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 56/165 (33%), Gaps = 5/165 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 + +G + I+ + + ++F + G+++++ H++ K L + D + + A + G Sbjct: 9 KSERGAVAIIFGLTVVVLFAMGGVVLDLGHLYIAKAELQNAADAAALAGAKDLNETTPGI 68 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + I + K ++ L N D S+ PQ + I Sbjct: 69 DAAVATAQTISAKNKYNFSTDVTLAL-ANIEFGPSPDGPWSSVATARAAPQGMTF-IKVD 126 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + K+ T++ + + + + V Sbjct: 127 TGLKV---LGTYLMRVAGVDTVSTFGLAVAGRFVNNVTPIGVCAV 168 >gi|78484767|ref|YP_390692.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78363053|gb|ABB41018.1| CbbO-m protein [Thiomicrospira crunogena XCL-2] Length = 757 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 81/216 (37%), Gaps = 33/216 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +M+++D S+S++ ++ T ++++ +++ ++ + + +G Sbjct: 559 SHTTDSRNIAVMLLVDTSQSLKERNQETGQTLLELSEEALAITAWTIEQL--GDKFAIAG 616 Query: 223 LVTFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + + + ++ H ++ +I + ST +++A Sbjct: 617 FCSDTRHEVRYQHIKGYSEHYGDEVKSRIAAMEAS-YSTRMGAAMRHA-----------A 664 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H KK+++ +TDGE Q + E K +G Y I + + Sbjct: 665 HYLEAQQAEKKLMLILTDGEPADIDTKDPQVLIQDTHKAVEELKSKGIYSYCITLD-PNA 723 Query: 333 HEFLRACASPNSFYLVEN-----PHSMYDAFSHIGK 363 E++ ++ Y V + P + F I + Sbjct: 724 DEYVETIF--DNHYTVIDHVDKLPEKLPQVFMKITQ 757 >gi|224476539|ref|YP_002634145.1| hypothetical protein Sca_1052 [Staphylococcus carnosus subsp. carnosus TM300] gi|222421146|emb|CAL27960.1| conserved hypothetical protein [Staphylococcus carnosus subsp. carnosus TM300] Length = 629 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 60/166 (36%), Gaps = 29/166 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 +++D S SM K++ IK + E +K + + ++ F E+ Sbjct: 437 FTLLIDASASM-------FDKMEETIKGVVLFHETLKALNVKHEILAFNEDAFKADDEKQ 489 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 ++++ S + + + ++ A ++ + + Sbjct: 490 PNIIDEIIDYDYSTISKDSPRIMALKPQDDNRDGVAIRVASERLL-----------QRPH 538 Query: 287 YKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 +++ ++ +DGE + DQ + EA+K G V+ + + Sbjct: 539 HQRFLIIFSDGEPSAYNYDQDGIIDTYEAVEEARKYGLEVFNVFLS 584 >gi|281338027|gb|EFB13611.1| hypothetical protein PANDA_007566 [Ailuropoda melanoleuca] Length = 868 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 76/195 (38%), Gaps = 26/195 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D+S SME K+ +++ +L +++ + LV F ++++ + Sbjct: 247 VVFVIDISTSMEG------QKVKQTKEALLKILGDIRPGDYFD------LVLFGSEVQSW 294 Query: 234 --FLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L++ ++L+ ++ +F +TN GL + QG +N+ Sbjct: 295 RGSLVQASPANLRAAQDFVRRFFLAGATNLNGGLLRGIEILNQAQGSLPEL----SNHAS 350 Query: 290 IIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS---- 344 I++ +TDGE + Q L A + +Y +G FL + N+ Sbjct: 351 ILIMLTDGEPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHDVDLNFLEVMSMENNGRAQ 410 Query: 345 -FYLVENPHSMYDAF 358 Y + F Sbjct: 411 RIYEDHDATQQLQGF 425 >gi|290960313|ref|YP_003491495.1| hypothetical protein SCAB_59371 [Streptomyces scabiei 87.22] gi|260649839|emb|CBG72955.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 449 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 52/164 (31%), Gaps = 24/164 (14%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 T I + + + + +++D S SM D TK+ A + A ++ Sbjct: 37 ATGGGTIGTAVAAPHLYSPGQGPSAAVALMVDCSGSM----DYPPTKMRNARDATAAAID 92 Query: 208 EVK-----LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 ++ + D +V + + + ++ ++ LS G T Sbjct: 93 TLRDGVHFAVIDGTHVARE---VYPGGGRLAVADSATRAQAKQALRRLSA-GGGTAIGTW 148 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L+ A + +H + +TDG N + Sbjct: 149 LRLADRLLASEDVAIRHG-----------ILLTDGRNEHESPED 181 >gi|149005573|ref|ZP_01829312.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|168482703|ref|ZP_02707655.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|307126680|ref|YP_003878711.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|147762513|gb|EDK69473.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|154432930|gb|ABS82113.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432938|gb|ABS82120.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432946|gb|ABS82127.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432954|gb|ABS82134.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432970|gb|ABS82148.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|172043667|gb|EDT51713.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|306483742|gb|ADM90611.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|332076908|gb|EGI87370.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17545] gi|332203636|gb|EGJ17703.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA47368] Length = 883 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 59/139 (42%), Gaps = 10/139 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ V ++ + LD++I+LD S SM + + + + A ++ A+++ Sbjct: 194 DNQYGIELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVD 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPG 262 ++ PD + LVT+ + I +E GV+ KI L F +T + Sbjct: 254 KITSNPDN----RVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTFDRTTFTAKT 309 Query: 263 LKYAYNQIFDMQGMRQHCN 281 Y++ + Q Sbjct: 310 YNYSFLNLTSDPTDIQTIK 328 >gi|114587328|ref|XP_001172432.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 2 [Pan troglodytes] Length = 893 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDRDATQQLQGF 471 >gi|242129222|gb|ACS83542.1| factor B/C2B [Larimichthys crocea] Length = 651 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 42/220 (19%), Positives = 74/220 (33%), Gaps = 33/220 (15%) Query: 168 TDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVK-----LIPDVNNVVQS 221 +R+++ I+LD S S+ + FD S I +++ ++K + ++V Sbjct: 153 EGSRMNVYILLDTSGSITKEAFDKSREATIALITKLDSYEVQLKFHVLSFASEAVDIVDI 212 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQGMRQH 279 S I++ W + I TN L F H Sbjct: 213 TESQISGSIDDVI---WNLMEFNYSIHGRK---TGTNLYAALHRVNELISFFKENSATNH 266 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYY----------CNEAKKRGAIVYAIGIR 328 N + II+ TDG N+ TK + + VY GI Sbjct: 267 FN----ETQNIIIIETDGYSNMGTKPQIALARIRNMLGYFDTSQDHTDETMLDVYVFGIG 322 Query: 329 VIRSHEFLRACAS----PNSFYLVENPHSMYDAFSHIGKD 364 + L A AS ++++N ++ + F+ I D Sbjct: 323 EQLKKDQLNALASKKRDEQHVFILKNYETLGEVFNSIISD 362 >gi|290243156|ref|YP_003494826.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] gi|288945661|gb|ADC73359.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] Length = 615 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 77/226 (34%), Gaps = 38/226 (16%) Query: 148 YTNSRHIVMPITSSVKVNSQTD----ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 +SRH+ +T +V + + I++D S SM I+ A+ + Sbjct: 415 KLSSRHLSRVVTGDHRVFGKRQESGTPNTAVQILVDRSGSMAG------DPIETAMTAAL 468 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 A+ + +N Q S+ + +G + + + +T + + Sbjct: 469 AIQLATDSLRGIN--TQVSAFPASSSGGLVPITSFGENGRMKADNFGVGSTGATPMSNAI 526 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 +F R K+++ +TDG ++ +++ A+ +Y Sbjct: 527 LGVLPSMFARSESR-----------KVMLVITDGAPNDSESAMEAIRM---ARDVNVEMY 572 Query: 324 AIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 AIGI P+ + VEN + + + ++I Sbjct: 573 AIGIE-----------TDPSHLFGVENTT-VIQSVGELAENIFGLL 606 >gi|189913184|ref|YP_001964413.1| Hypothetical BatB protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167781252|gb|ABZ99549.1| Hypothetical BatB protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 547 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 25/152 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVN 216 I S +++ D++ V+DVS SM + ID+ + + ++L+PD+ Sbjct: 74 IKSIDIESTKEFQSTDILFVVDVSLSMNA--------IDIRPNRLKRFQDLSLRLLPDLK 125 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDM 273 + G++ F+ + F + +S + I+ LS V TN L+ A Sbjct: 126 GN-RVGIIVFAGQSFSFCPMTSDLSAVSDYIQSLSVEMVGSKGTNLAVALERA------- 177 Query: 274 QGMRQHCNTEDANYKKII-VFMTDGENLSTKE 304 + + N + I V +TDGE+ + Sbjct: 178 ----EKIRKKSQNIQSNITVIVTDGEDHEKQS 205 >gi|189912859|ref|YP_001964748.1| BatBC [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777535|gb|ABZ95835.1| BatBC [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] Length = 547 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 63/152 (41%), Gaps = 25/152 (16%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVN 216 I S +++ D++ V+DVS SM + ID+ + + ++L+PD+ Sbjct: 74 IKSIDIESTKEFQSTDILFVVDVSLSMNA--------IDIRPNRLKRFQDLSLRLLPDLK 125 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDM 273 + G++ F+ + F + +S + I+ LS V TN L+ A Sbjct: 126 GN-RVGIIVFAGQSFSFCPMTSDLSAVSDYIQSLSVEMVGSKGTNLAVALERA------- 177 Query: 274 QGMRQHCNTEDANYKKII-VFMTDGENLSTKE 304 + + N + I V +TDGE+ + Sbjct: 178 ----EKIRKKSQNIQSNITVIVTDGEDHEKQS 205 >gi|291240911|ref|XP_002740358.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1126 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 56/160 (35%), Gaps = 34/160 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 +++++D S SM S + ++ + ++ + ++ + N+++ FS +E+ Sbjct: 873 VVVLVDTSGSMVSHME------ELVRELVSLIWDQFQRENIKFNIIR-----FSGNVEKW 921 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE R + G +T + L A+N N + Sbjct: 922 RHQIVDPLEENCHDAVRWVSTFVASG-NTCTLEALYEAFND----------RNIDG---- 966 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + +TDG+ S+ RG ++ I Sbjct: 967 --VYLLTDGKPDSSTS-LVLKEIARLNTTRGVKIHTISFN 1003 >gi|261880540|ref|ZP_06006967.1| BatB protein [Prevotella bergensis DSM 17361] gi|270332763|gb|EFA43549.1| BatB protein [Prevotella bergensis DSM 17361] Length = 342 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 70/172 (40%), Gaps = 20/172 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K++ + ++ +I +DVS SM++ D + +++D + + +M E + Sbjct: 81 GTKISHEKRDGIEAIICMDVSNSMKAE-DVAPSRLDKSKMLVESMTEHFTND-------K 132 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 GLV F+ + + ++ + VST T I + Sbjct: 133 LGLVVFAGDAFVQLPITSDYVSAKMFLQNIDPSLVSTQGTD--------IARAITVGMRS 184 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T+ + I+ +TDGE+ + ++ EA+KRG V+ +G+ + Sbjct: 185 FTQQEKVGRAIIVITDGEDH----EGGAMEAAKEARKRGINVFILGVGSTKG 232 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 19/67 (28%), Positives = 33/67 (49%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R G I+ A+ LPI+ +LG I++S ++ K L S+ D + + AA + Sbjct: 16 LRPGRRAESGAFAIMAALVLPIMIAMLGFAIDLSRVYNRKVELQSVADAAALAAANALDG 75 Query: 68 EGNGNNR 74 G +R Sbjct: 76 TPEGIDR 82 >gi|171684709|ref|XP_001907296.1| hypothetical protein [Podospora anserina S mat+] gi|170942315|emb|CAP67967.1| unnamed protein product [Podospora anserina S mat+] Length = 1075 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 57/190 (30%), Gaps = 38/190 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V D S SM +I +++ L+ + Sbjct: 301 IVFVCDRSGSM-----GDGKRIPNLQTALHLFLKSL-----------------------P 332 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA---YNQIFDMQGMRQHCNTEDANYKK- 289 +++ + ++ G T L +A N I G + +K+ Sbjct: 333 LGVKFNICSFGSHWDFMFPEGSRTYDASSLAHATQYVNSISANYGGTEMRMPLQDTFKRR 392 Query: 290 -----IIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + VFM TDGE ++ + + V+++GI SH + A Sbjct: 393 YKDMDLEVFMLTDGEIWDQQQVFGMINTHVAESEGAIRVFSLGIGNDVSHALIEGIAQAG 452 Query: 344 SFYLVENPHS 353 + + Sbjct: 453 NGFSQSVADD 462 >gi|169832403|ref|YP_001693966.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] gi|154432882|gb|ABS82071.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432906|gb|ABS82092.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|168994905|gb|ACA35517.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] Length = 883 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 29/139 (20%), Positives = 59/139 (42%), Gaps = 10/139 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ V ++ + LD++I+LD S SM + + + + A ++ A+++ Sbjct: 194 DNQYGIELTVSGKTTVETKEASTPLDVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVD 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPG 262 ++ PD + LVT+ + I +E GV+ KI L F +T + Sbjct: 254 KITSNPDN----RVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTFDRTTFTAKT 309 Query: 263 LKYAYNQIFDMQGMRQHCN 281 Y++ + Q Sbjct: 310 YNYSFLNLTSDPTDIQTIK 328 >gi|149059384|gb|EDM10391.1| integrin alpha 1, isoform CRA_a [Rattus norvegicus] Length = 934 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 42/129 (32%), Gaps = 21/129 (16%) Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 + G+ A + F KK++V +TDGE+ +Q + C Sbjct: 1 MTALGIDTARKEAFT------EARGARRGVKKVMVIVTDGESHDNYRLKQVIQDCE---D 51 Query: 318 RGAIVYAIGI-RVIRSHEF--------LRACAS---PNSFYLVENPHSMYDAFSHIGKDI 365 ++I I +++ AS F+ V + ++ +G+ I Sbjct: 52 ENIQRFSIAILGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSDELALVTIVKALGERI 111 Query: 366 VTKRIWYDK 374 D+ Sbjct: 112 FALEATADQ 120 >gi|325496693|gb|EGC94552.1| hypothetical protein ECD227_0790 [Escherichia fergusonii ECD227] Length = 496 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 80/237 (33%), Gaps = 28/237 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T PW + + I ++ + +++ ++D S SM S ++ + S Sbjct: 106 LTPAPWNEKHTLLRLDI-AANDIARSKLPPANLVFLIDTSGSMNSD-----ERLPLIKSS 159 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNS 259 + ++ E++ + +VT++ G S + ++ + G TN Sbjct: 160 LKLLVNELRDQD------RISIVTYAGSARLLLSSTSG-SEKNTILNAIANLQAGGGTNG 212 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G+ AY Q + N I+ TDG+ + + ++ G Sbjct: 213 GAGVAMAYEQ------AQAGYIKGGVNR---ILLATDGDFNIGDDPASVEDLVKKQRESG 263 Query: 320 AIVYAIGIR--VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +G+ ++ + N Y S+ +A + ++ + K Sbjct: 264 ITLSTLGVGDNNYNEEMMVKIADTGNGNYSY--LDSLSEAQKVLSNEMNQTLVTVAK 318 >gi|303327225|ref|ZP_07357667.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] gi|302863213|gb|EFL86145.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] Length = 1149 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 60/185 (32%), Gaps = 4/185 (2%) Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVM 156 EL+ +D D S + + VV Q +IS + +V Sbjct: 402 ELKFETRDSDGDIASTSAKVPLEVVEQTTTEGGDSISNSD---DVINIAGGDGVAGTLVA 458 Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 T V Q + ++ VLD S SM+ T++ +A +SI ++ D Sbjct: 459 GDTGGVTEGQQVGSNYNVCFVLDTSGSMDGAVSGHETRLGVATQSIENFIKNSIHEGDFV 518 Query: 217 NVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V +V F+++ + + +R + + + + + + G Sbjct: 519 GTVNLAVVPFASEAGSVIKVSITKTAQGERYTFGEEVYDNYADFSKAFETSLGNLNANGG 578 Query: 276 MRQHC 280 Sbjct: 579 TNYEA 583 >gi|156349150|ref|XP_001621939.1| predicted protein [Nematostella vectensis] gi|156208297|gb|EDO29839.1| predicted protein [Nematostella vectensis] Length = 147 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 44/126 (34%), Gaps = 16/126 (12%) Query: 219 VQSGLVTFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + TF N K+E F ++ + + + T LK +F Sbjct: 12 TRISVATFENFPKMEFSFQKYSDLNSAKFAVDAIQISNGGTRIGEALKLVKTDMF----- 66 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 NT +N ++++ MTDG + + + G + +G+ + L Sbjct: 67 ----NTARSNVPRMLLVMTDGRSSD-----DVVAPSRALRDIGVTILTLGLGSDYDLDQL 117 Query: 337 RACASP 342 + AS Sbjct: 118 KMIASG 123 >gi|255009406|ref|ZP_05281532.1| hypothetical protein Bfra3_09717 [Bacteroides fragilis 3_1_12] gi|313147165|ref|ZP_07809358.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135932|gb|EFR53292.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 341 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 64/175 (36%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + I+ +++ ++ + Sbjct: 79 GSKLETVKRKGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLISKLVDGMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ ++ T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQGTAIGAAISLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + IV +TDGEN + + K+G V +G+ + Sbjct: 180 ARSFTPQEGVGRAIVVITDGENHEGGAVEAAKEA----AKKGIQVNVLGVGLPDG 230 >gi|225685336|gb|EEH23620.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03] Length = 844 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 IP F P N R ++ + + + +++ ++D S SM+ +I Sbjct: 261 DIPSGFLETHPSIPNQRALMATLVPKFNIPPISP---EIVFIIDRSGSMDG-------RI 310 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 ++ L+ + + N +F K + + + + ++ Sbjct: 311 GTLQAALRVFLKSLPVGVTFNVCSFGSSYSFMWKKSRSYDDS-SLRAALKYVDSVAADYG 369 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCN 313 T +K F + I+ +TDG DQ++L Y Sbjct: 370 GTEMLKPVKATVENRF-------------KDLDLEILLLTDGAIW----DQETLFKYIHK 412 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENPHSM 354 A + + +GI SH + A + + V + + Sbjct: 413 AASDKPIRFFTLGIGDGASHSLIEGIAKAGNGFAQFVNDNEPL 455 >gi|254503616|ref|ZP_05115767.1| von Willebrand factor type A domain protein [Labrenzia alexandrii DFL-11] gi|222439687|gb|EEE46366.1| von Willebrand factor type A domain protein [Labrenzia alexandrii DFL-11] Length = 609 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 52/321 (16%), Positives = 101/321 (31%), Gaps = 46/321 (14%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 SL+ A + G G+ + L+G I+ + + R + F Sbjct: 28 QQHQIDAVASLIFNAGSLRANGPGSWQALLEGNFDTFLIEASEIRNGRKDGELVPFSGLE 87 Query: 108 DDIVRSTSLDIVVVPQNEGYSISAISRYK-IPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 D + + + + K P ++ +N + I + +V+ Sbjct: 88 DRRADEAEMFVKADYSRTFDNSAGNPGSKGNPYAPGNYLGNKSNPQ----GIPADWQVSD 143 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +V+D+S SM + S + I +I P + + G+VTF Sbjct: 144 CQGQ--SVALVIDLSGSMGNDLASVKSSALELINAIFGT----DAAPVAS---RLGIVTF 194 Query: 227 SN----KIEEFFLLEWGVSHLQR----KIKYLSKFGVSTNS-TPGLKYAYNQIFDMQGMR 277 +N +I F + +S + I L+ G L A +R Sbjct: 195 NNTDSIQIVLPFTEQANISDRKSAAIDAINGLAILGGGAEPLNGALLTA---------LR 245 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF-- 335 + KIIVF TD + + N+ +G+++ + Sbjct: 246 GDIGSWRTGTNKIIVF-TDEPAADPELRDDVITLAND----------LGVQIPQQQALPI 294 Query: 336 -LRACASPNSFYLVENPHSMY 355 L A+ S V+ + Sbjct: 295 DLAPAANDGSVVSVQILPVLI 315 >gi|157818937|ref|NP_001101458.1| von Willebrand factor A domain-containing protein 5B1 [Rattus norvegicus] gi|149024385|gb|EDL80882.1| similar to hypothetical protein FLJ32784 (predicted) [Rattus norvegicus] Length = 1216 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 55/173 (31%), Gaps = 33/173 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++D S SM T I +++ L+ + N ++ F + + Sbjct: 354 FIFLIDRSSSMSK------TNIQCIKEAMLVALKSLMPACLFN------VIGFGSTFKTV 401 Query: 234 FLLE--WGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F + +L I+ + TN LK+ Q + Sbjct: 402 FASSQIYNEENLALACECIQRMRADMGGTNIFSPLKWVLRQ------------PLRQGHP 449 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +++ +TDG +T + L Y+ GI + ++ AS Sbjct: 450 RLLFLITDGSVNNT---GKVLELVRN-HASSIRCYSFGIGPTVCYRLVKGLAS 498 >gi|299743781|ref|XP_001835978.2| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] gi|298405813|gb|EAU85754.2| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] Length = 454 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 58/169 (34%), Gaps = 16/169 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 +LD++ + D + S S+ S+ I+ + E KL + ++ GLV F Sbjct: 29 NRKQLDLVFIQDCTGSQGSYISSATRNIEQI---CAHIFESGKLQSPED--LRVGLVAFR 83 Query: 227 ------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMR 277 + + F +S + R + L G T L A N + + Sbjct: 84 DHPPQDHTYVTKNFGFSSDISQVHRDLSSLYASGGGDGPEAVTAALAEALNMDWREYASK 143 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA I + DG + + + L + +G ++ + Sbjct: 144 MVVLIADAPPHGIGEY-GDGFDDGSPDGYDPLQLARQMASKGITLFFVA 191 >gi|261415601|ref|YP_003249284.1| protein of unknown function DUF58 [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372057|gb|ACX74802.1| protein of unknown function DUF58 [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325950|gb|ADL25151.1| conserved hypothetical protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 301 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 50/126 (39%), Gaps = 24/126 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M++++D S S S + A L + + N + GL+ +++++E F Sbjct: 84 MLLMVDASSS------SEFGSGKQMKGEVMATLTALLAFAAIKNNDKVGLLIYTDQVELF 137 Query: 234 FLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E G H+ R I+ Y TN+ L+YA + N K + Sbjct: 138 IPPEKGRKHVLRLIREILYFKPQHHGTNTQVALEYAGKIL---------------NRKAV 182 Query: 291 IVFMTD 296 +V M+D Sbjct: 183 VVVMSD 188 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 282 TEDANYKKIIVFMTDGENL----STKEDQQ-----------SLYYCNEAKKRGAIVYAIG 326 TE N K+++ ++DG++ S+ DQ+ + C + K G + IG Sbjct: 142 TESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLITEGMCQKIKDNGIKMVFIG 201 Query: 327 IRVIRSHEFL---RACASPNSFYLVENPHSM 354 I + + + + C +FYL +N H + Sbjct: 202 IGYVPDNNIIDWEKDCVGTGNFYLAKNAHEL 232 >gi|167535479|ref|XP_001749413.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772041|gb|EDQ85698.1| predicted protein [Monosiga brevicollis MX1] Length = 2014 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 14/73 (19%), Positives = 30/73 (41%), Gaps = 5/73 (6%) Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + + ++++ +TDG + + Y K +G +YAIG+ S E + P Sbjct: 446 NNSIPRVLIVLTDGRSN---PGFEPDEYSTALKDKGIEIYAIGVGDYYSIEVQEMASEPK 502 Query: 344 --SFYLVENPHSM 354 + + N + Sbjct: 503 DRHAFELSNQDDL 515 >gi|315452880|ref|YP_004073150.1| von Willebrand factor type A domain-containing protein [Helicobacter felis ATCC 49179] gi|315131932|emb|CBY82560.1| von Willebrand factor type A domain protein [Helicobacter felis ATCC 49179] Length = 224 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 59/171 (34%), Gaps = 16/171 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 R+ + + LD S SM S I + ++ + V P +VTF Sbjct: 15 NPTKRVPVCLCLDTSSSM------SGAPIGEVNQGVDLFYQAVNDHPIAKQAADVCIVTF 68 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR--QHCNTED 284 + L++ S + + G + + A N+ D R ++ ++ Sbjct: 69 GDS--GVSLVQDFESIRDKSAPSFNAGGYT-----PMGAAVNEALDRLEERKKEYKDSGT 121 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 ++ +V +TDGE S + A + ++ I + ++ Sbjct: 122 EYFQPWLVIITDGEPTDDIS-SASRRSSDLANNKKLSIFPIIVEGGDANTL 171 >gi|257052791|ref|YP_003130624.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691554|gb|ACV11891.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 211 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 36/155 (23%), Positives = 52/155 (33%), Gaps = 17/155 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVVQSGLVTFSNKIEE 232 + VLD S SME D + + +K E + D N+ V+ LV I+E Sbjct: 5 ITFVLDSSGSMEKIADDTRGGFNRFLKDQRDEDGEATVTLYDFNDTVE--LVYHDRAIDE 62 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + I G T + A ++ D N D I+V Sbjct: 63 ADKLT-----EENYIP-----GGRTALHDAITRAIDETADRIEAMAPANQPDNV---IVV 109 Query: 293 FMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +TDG EN S ++ G IG Sbjct: 110 TLTDGKENASETPQDAVRDRVETRQEEGWEFLFIG 144 >gi|196230654|ref|ZP_03129516.1| conserved hypothetical membrane protein [Chthoniobacter flavus Ellin428] gi|196225584|gb|EDY20092.1| conserved hypothetical membrane protein [Chthoniobacter flavus Ellin428] Length = 332 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 52/143 (36%), Gaps = 17/143 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +I +D S SM D + +K + A K+ +L+ + V + S +V Sbjct: 94 IIAIDNSASMSQN-DGATSKFEQAQKAAEDVLDSLPSGASVAVWLVSDVV-----KNVIP 147 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 ++ ++ I+ + T L+ + RQ + K + + Sbjct: 148 EPTHDLALARKSIREAQRSDRGTEMPSALRRMVEVL-----QRQPVSA------KQLFLI 196 Query: 295 TDGENLSTKEDQQSLYYCNEAKK 317 TDG+ K+ ++ + AK Sbjct: 197 TDGQANGWKQLAETRALLDAAKD 219 >gi|320533437|ref|ZP_08034120.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] gi|320134345|gb|EFW26610.1| von Willebrand factor type A domain protein [Actinomyces sp. oral taxon 171 str. F0337] Length = 367 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 61/192 (31%), Gaps = 10/192 (5%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKID-MAIKSINAMLEEVKLIPD 214 + + + + +++ +V+D + SM E + +D A ++ + +++ I D Sbjct: 68 GPAIRGSEAISVSNVEIYMVVDRTGSMAAEDYQGKGPEGVDQSASTRLDGVRSDMRAIRD 127 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + ++ N L + + I + + L+ A + Sbjct: 128 AFPDSRFSIIALDNTAARELPLTHDTNAVDAWIGSFKQEVTGHATGSSLEVALPLLGQSL 187 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY----AIGIRVI 330 + + + +++ +DGE + + + A + +G Sbjct: 188 AQSRQSDPNNI---RLVYIFSDGEATDEGRGALAAESAGVSWQSLADIVDGGAVLGYGST 244 Query: 331 RSHEFLRACASP 342 + SP Sbjct: 245 EGGKMRSYDGSP 256 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 44/156 (28%), Gaps = 19/156 (12%) Query: 223 LVTFSNKIEEFFLLEWG------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + + L W S L + T + GL +++ Sbjct: 873 VYPWKDTTYSSTELNWKEWRATVTSSLSNNSLVHLSNSIDTITADGLTAIDEGLYEANNE 932 Query: 277 RQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +V M+DG +N + AK+ ++Y +G+ Sbjct: 933 LSAITGNST-----VVLMSDGLDNAGHHS---LIEEALRAKEHNTVIYTVGLGNNEDEVD 984 Query: 336 LRACA----SPNSFYLVENPHSMYDAFSHIGKDIVT 367 C + +Y N + D F I +I Sbjct: 985 PILCEIANITGGKYYFAPNSTVLEDIFIGIASEITN 1020 >gi|313835899|gb|EFS73613.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927103|gb|EFS90934.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314970746|gb|EFT14844.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] gi|328906009|gb|EGG25784.1| von Willebrand factor type A domain-containing protein [Propionibacterium sp. P08] Length = 323 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 53/156 (33%), Gaps = 18/156 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+M+ LDVS SME + + IN L+ I D + G V F + Sbjct: 95 DVMLCLDVSGSME----------GVDRQVINTYLQLADHISDD----RIGFVMFDSSAVT 140 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM---RQHCNTEDANYKK 289 F L ++ +K + PG++Y G+ + D + Sbjct: 141 VFPLTHDRDSVKAGLKQAGERLGRAGLDPGVRYGPGGSLVGDGLASCLSRFDQLDQPRSR 200 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 +V TD + A KR +V+ I Sbjct: 201 SVVLATDNMVAGPSVYTVPQAV-DLAVKRHIMVFGI 235 >gi|295662952|ref|XP_002792029.1| von Willebrand domain-containing protein [Paracoccidioides brasiliensis Pb01] gi|226279204|gb|EEH34770.1| von Willebrand domain-containing protein [Paracoccidioides brasiliensis Pb01] Length = 694 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 IP F P N R ++ + + + +++ ++D S SM+ KI Sbjct: 206 DIPSGFLETHPSIPNQRALMATLVPKFNIPPISP---EIVFIIDRSGSMDG-------KI 255 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 ++ L+ + + N +F K + + + + ++ Sbjct: 256 GTLQAALRVFLKSLPVGVTFNICSFGSSYSFMWKKSRSYDDS-SLKAALKYVDSVAADFG 314 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCN 313 T +K F + ++ +TDG DQ++L Y Sbjct: 315 GTEMLKPVKATVENRF-------------KDLDLEVLLLTDGAIW----DQETLFKYIHK 357 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENPHSM 354 A + + +GI SH + A + + V + + Sbjct: 358 AASDKPIRFFTLGIGDGASHSLIEGIAKAGNGFAQFVNDNEPL 400 >gi|89255638|ref|YP_512999.1| TPR repeat-containing protein [Francisella tularensis subsp. holarctica LVS] gi|115314142|ref|YP_762865.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|167009920|ref|ZP_02274851.1| hypothetical protein Ftulh_04147 [Francisella tularensis subsp. holarctica FSC200] gi|169656499|ref|YP_001427653.2| hypothetical protein FTA_0220 [Francisella tularensis subsp. holarctica FTNF002-00] gi|254367032|ref|ZP_04983068.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|254368551|ref|ZP_04984567.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|290953464|ref|ZP_06558085.1| hypothetical protein FtulhU_03740 [Francisella tularensis subsp. holarctica URFT1] gi|295313262|ref|ZP_06803899.1| hypothetical protein FtulhU_03725 [Francisella tularensis subsp. holarctica URFT1] gi|89143469|emb|CAJ78645.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica LVS] gi|115129041|gb|ABI82228.1| hypothetical protein FTH_0199 [Francisella tularensis subsp. holarctica OSU18] gi|134252858|gb|EBA51952.1| TPR (tetratricopeptide repeat) domain protein [Francisella tularensis subsp. holarctica 257] gi|157121454|gb|EDO65645.1| hypothetical protein FTAG_01535 [Francisella tularensis subsp. holarctica FSC022] gi|164551567|gb|ABU60697.2| hypothetical protein with von Willebrand factor type A domain [Francisella tularensis subsp. holarctica FTNF002-00] Length = 332 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG 226 >gi|323490190|ref|ZP_08095408.1| von Willebrand factor type A [Planococcus donghaensis MPA1U2] gi|323396119|gb|EGA88947.1| von Willebrand factor type A [Planococcus donghaensis MPA1U2] Length = 617 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 54/160 (33%), Gaps = 30/160 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 +++D S SM + K++ +++ + ++ + + +V + ++ E+ Sbjct: 423 FCLLIDGSASM-------LDKLEETKQAVLLFHDVLRGLNVPHEIVLFYEDAYEASDAEQ 475 Query: 233 FFLLEW------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 EW I L + +++ N++ Sbjct: 476 PNYFEWMHKFEDRNKDHAETIASLDAHEDN-RDGFAIRWMNNRL-----------KRRPE 523 Query: 287 YKKIIVFMTDGE----NLSTKEDQQSLYYCNEAKKRGAIV 322 + ++ +DGE N + + +EAKK G V Sbjct: 524 KHRFLLVFSDGEPSAYNYAENGVVDTANAVSEAKKMGIEV 563 >gi|269795767|ref|YP_003315222.1| hypothetical protein Sked_24760 [Sanguibacter keddieii DSM 10542] gi|269097952|gb|ACZ22388.1| conserved repeat protein [Sanguibacter keddieii DSM 10542] Length = 1516 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 67/222 (30%), Gaps = 38/222 (17%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +S L + +V+D+S S+ + I + + ++ + P Sbjct: 323 SSLANPGLPPQCGLRVALVVDLSGSVGRY-------IAAMRSAASGFVDSLTGTPSS--- 372 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 L TF++ G++ + ++ Y FD R Sbjct: 373 --VALFTFADNAPAATGANLGLTPVSTT--------AGADTVKNRIATYTAGFDTNWDRG 422 Query: 279 -HCNTEDANYKKIIVFMTDGEN------------LSTKEDQQSLYYCNEAKKRGAIVYAI 325 + A + V +TDG +E + ++ N K G V A+ Sbjct: 423 LYQVAASATPFDVAVVLTDGNPTVYAAHEGPGDLTRFREVENGIFSANAVKAEGTRVIAV 482 Query: 326 GI-----RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 G+ + + + + +Y ++ +A + Sbjct: 483 GVGDGIGGAPDNLRAISGPSGGSDYYQTDDYAEAGEALRALA 524 >gi|254560515|ref|YP_003067610.1| hypothetical protein METDI2058 [Methylobacterium extorquens DM4] gi|254267793|emb|CAX23640.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 733 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 64/202 (31%), Gaps = 32/202 (15%) Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 F + ++ +T + D++ V+D S SM + A S+ Sbjct: 316 FRERVAGAEAVLAVVTPPETASPAASVPRDVVFVIDNSGSMGG------ASMRQAKASLL 369 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTN 258 L+ + + ++ F + + F + + + L G + Sbjct: 370 IGLDRLGAHD------RFNVIRFDHSFDTLFPDLVPADAHHLMRAKSFVAGLQASGGTEM 423 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 P L+ A E+ + +VF+TDG + + S + R Sbjct: 424 LAP-LQAALRDAT----------PEETGRLRQVVFLTDGA-IGNEAQIFSAIATERGRSR 471 Query: 319 GAIVYAIGIRVIRSHEFLRACA 340 ++ +GI + +R A Sbjct: 472 ---LFMVGIGSAPNGYLMRHTA 490 >gi|254414405|ref|ZP_05028171.1| von Willebrand factor type A domain, putative [Microcoleus chthonoplastes PCC 7420] gi|196178635|gb|EDX73633.1| von Willebrand factor type A domain, putative [Microcoleus chthonoplastes PCC 7420] Length = 200 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 49/138 (35%), Gaps = 13/138 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM T+ A +S A+ + + V FS++ + Sbjct: 5 DYTLIIDKSGSMSTPDQPGGQTRWYTAQESTLALARKCEQFDPDGITVYL----FSSRFK 60 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S +I + T+ L+ A N F + Q + I Sbjct: 61 RY---DNVTSDKVGQIFQENDPMGRTDLASVLQDALNSYFQRKSAGQTKPEGET-----I 112 Query: 292 VFMTDGENLSTKEDQQSL 309 + +TDGE K + + Sbjct: 113 LVITDGEPDDRKAVMRVI 130 >gi|94498563|ref|ZP_01305118.1| TadE-like protein [Sphingomonas sp. SKA58] gi|94422006|gb|EAT07052.1| TadE-like protein [Sphingomonas sp. SKA58] Length = 193 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 13/58 (22%), Positives = 25/58 (43%), Gaps = 1/58 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL-VHAATQIMNEG 69 + G T+ AI LP + ++ IE+ H+ + VL + + + A+ E Sbjct: 9 KDRTGVSTVEFAIILPALLTLMCGAIELGHMLLARVVLEGAMTEAARISTASLETAEA 66 >gi|91215374|ref|ZP_01252345.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] gi|91186326|gb|EAS72698.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Psychroflexus torquis ATCC 700755] Length = 689 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 105/296 (35%), Gaps = 43/296 (14%) Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI-S 130 ++ ++ I+ +N+ + D N++ + ++ V++ +NE SI Sbjct: 164 SDYSAIQSDVIVFAQNLNFNLFSERTIDDVELFNNVGTMTNDGNVATVLISENEAISIND 223 Query: 131 AISRYKIPLKFCTFIPWYTNSRHI--------VMPITSSVKVNSQTDARL---DMMIVLD 179 + +Y++ IP+ T V+ S + + + ++++D Sbjct: 224 ILIKYQLASDELGVIPFSTLLEEGVNECDDFGNGFFGLVVEPESNANTEVIEKNFVLIID 283 Query: 180 VSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEEFFLL 236 S SM + + I + + + +I NN+V Q LV ++ + Sbjct: 284 SSGSMRGGNKMAQAKEASEFIVNNLNIGDNFNVIDFDNNIVLFQPELVEYNIQ------- 336 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + I+ + G TN + L A NQ + + IIVF TD Sbjct: 337 --NSNAALDFIENIVALGA-TNISESLVTAINQFEAGAEDKAN----------IIVFFTD 383 Query: 297 GENLSTKEDQQSLYYCNEAKK------RGAIVYAIGIRVIRSHEFLRACASPNSFY 346 G +T+ + + A+ ++ GI + + L A N+ + Sbjct: 384 G--GATEGETNTQNILQLAEDTVNQIETEIFLFTFGIGEDVTTDLLTLLAVQNNGF 437 >gi|282879638|ref|ZP_06288369.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306586|gb|EFA98615.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 346 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 66/177 (37%), Gaps = 30/177 (16%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 K++++ ++ +I LD+S SM E S + K + ++ L+ + N Sbjct: 84 GSKISNEKRNGIETIIALDISNSMLAEDVVPSRLAKSKLLVE---------NLVDNFTND 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQG 275 + GLV F+ + + ++ T+ ++ + + Sbjct: 135 -KIGLVIFAGDAFIQLPITSDYVSAKMFLQNTDPSLITTQGTDIARAIRLSMSSFTQQDK 193 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + I+ +TDGE+ + +L +A K+G V+ +G+ + Sbjct: 194 VG-----------RAIILITDGEDH----EGGALEAAADANKKGINVFILGVGNTQG 235 >gi|170089927|ref|XP_001876186.1| predicted protein [Laccaria bicolor S238N-H82] gi|164649446|gb|EDR13688.1| predicted protein [Laccaria bicolor S238N-H82] Length = 465 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 32/168 (19%), Positives = 58/168 (34%), Gaps = 16/168 (9%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-- 226 +LD++ V D + S S+ S+ I+ + E KL + ++ GLV F Sbjct: 39 RKQLDLVFVQDCTGSQGSYISSATKNIEQI---CAHIFESGKLQSQED--LRVGLVAFRD 93 Query: 227 -----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQ 278 I + F +S + + + L G T L A N + + Sbjct: 94 HPPQDHTYITKNFGFSSDISKVHKDLSGLYASGGGDGPEAVTAALAEALNMDWREYASKM 153 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA I + DG + + + L + RG ++ + Sbjct: 154 VVLIADAPPHGIGEY-GDGFDEGSPDGFDPLQLARQMAGRGITLFFVA 200 >gi|167519555|ref|XP_001744117.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777203|gb|EDQ90820.1| predicted protein [Monosiga brevicollis MX1] Length = 3700 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 60/166 (36%), Gaps = 26/166 (15%) Query: 176 IVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 LD S SM +D+ + ++S + +E+ L + +V +S++ Sbjct: 3396 FGLDESGSMGGDNWDALLNSYSSFMQS--RVHDELNLAD------RVTVVQYSDRARTTL 3447 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + ++ G T+ ++ +R T ++ +++FM Sbjct: 3448 AKA-SMREAAAFVPQMN--GGGTDFNVAIQ----------ELRGQGKTMSNIFRPVLIFM 3494 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +DG+ + + + + +A+ + + + L+ A Sbjct: 3495 SDGQAYDPRTELERM----KAELPHMSSFMVALGPNAQVAVLQGMA 3536 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 41/273 (15%), Positives = 92/273 (33%), Gaps = 32/273 (11%) Query: 85 RIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKF--- 141 ++ ++ + D N I S+ +V N + + +P +F Sbjct: 2654 QLTQSFTYVVSDGDGDRASANLDICIRGDRSVLVV--GSNADDEQGSNVLHHVPSQFDDG 2711 Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS---ITKIDMA 198 I + +V + + + ++ ++LD S SM D S +++ + Sbjct: 2712 KGAIEGTFGNDVLVGDLGGAADPVVKPAENYNIALILDRSGSMADDPDGSGGYGSRLALL 2771 Query: 199 IKSINAMLEEVKLIPDVNNVV--QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 ++NA + ++ N+ ++ ++ + L+ G Sbjct: 2772 KDAVNAFIGKLGTHTGQINIALISFSSSASLLLSGTLAQIQTALAAPNNVLMALTASGA- 2830 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYC 312 TN ++ A N F + + N + F+TDG+ + D ++ + Sbjct: 2831 TNYEAAMQQA-NAWFGGVEVNGYNN--------LAYFLTDGDPTTYNGDNSNSGSTVNFN 2881 Query: 313 NEAKK--------RGAIVYAIGIRVIRSHEFLR 337 + + A V+AIGI + LR Sbjct: 2882 DVNRALDDATTLMARAEVHAIGIGTGVNSNVLR 2914 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 66/170 (38%), Gaps = 24/170 (14%) Query: 155 VMPITSSVKVN--SQTDARL-DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 +PIT++ K + Q+ L ++ +VLD+S SM K+ +++ ++ + Sbjct: 2343 DVPITANDKPDCIVQSAPPLVNLTLVLDISLSMAG------DKLTALKQAVISL---AQG 2393 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW------GVSHLQRKIKYLSKFGVSTNSTPGLKY 265 ++ V L+TF++ E + G + L + L+ G TN L Sbjct: 2394 YAGLSAPVHVNLITFNSGAAEIGDFTFSSVGDAGYTALLTAVNGLTASGF-TNYEQALSV 2452 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQSLYYCNE 314 A Q+ + +K F++DGE + + + N Sbjct: 2453 AKAQVLSD--ISAPGADPAQQHKLY--FISDGEPTVGAQGATLTTWIANN 2498 >gi|40062742|gb|AAR37636.1| CbbO protein/von Willebrand factor type A domain protein [uncultured marine bacterium 439] Length = 787 Score = 40.5 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 64/183 (34%), Gaps = 23/183 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGL 223 Q + + +++D+S S S I ++ ++L + I +N+ SG Sbjct: 589 KIQHTRDVAVNLLMDLSESTNDMVVGSEHSILELMQEATSLLSWAINKI--GDNLTISGF 646 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + S +++ + ++K K G+ST +++A + Sbjct: 647 ASDSRHDVQYYRFKPFHHSFNDEVKARLAGIKGGLSTRMGTAIRHAGVDLLTQSSA---- 702 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQ------QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 KKI++ +TDGE D + E + RG + + I + Sbjct: 703 -------KKILLVLTDGEPADIDVDDPQHLRMDAKKAVEELRSRGIVTFCISLDPYADEY 755 Query: 335 FLR 337 R Sbjct: 756 VAR 758 >gi|282879635|ref|ZP_06288366.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] gi|281306583|gb|EFA98612.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1] Length = 298 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 48/102 (47%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK +M + + I + + + G+V F Sbjct: 74 EEERELTVMLLIDVSGSLD-FGTKKQTKREMVTEIAATL--AFSAIQNND---KIGVVFF 127 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I+ + T+ ++Y Sbjct: 128 SDRIEKYIPPKKGRKHILYIIREMLDFEPQSRKTDVGMAIEY 169 >gi|229489135|ref|ZP_04383001.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229324639|gb|EEN90394.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 551 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 63/204 (30%), Gaps = 38/204 (18%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-------- 228 V+DVS SM+ + +T++ ++ + + N Q GL FS Sbjct: 355 VVDVSGSMD-YMQDGVTRMAATAQAGDIAIRMF----PAN--AQLGLWAFSIDLGEGTDY 407 Query: 229 -------KIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ S L +I LS G T + AY + Q Sbjct: 408 RELEPVARMDATEGDTDHRSKLLSRIDSLSSIVGGGTGLYDSVLAAYRSM---QQTYDPA 464 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG----AIVYAIGIRVIRSHEFL 336 + ++ +TDG N +++ + IG+ + L Sbjct: 465 SINS------VILLTDGANDDPSSISLQELLDTLTREQDPTRPVPIITIGVTDDADTDVL 518 Query: 337 R--ACASPNSFYLVENPHSMYDAF 358 + + + + P + F Sbjct: 519 EQISALTGGNSHFAPTPADIPKVF 542 >gi|327265809|ref|XP_003217700.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 885 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 68/169 (40%), Gaps = 15/169 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEE 232 + V+DVS SM +KI A +++ ++E++K N ++ V + + I + Sbjct: 284 VAFVIDVSGSMWG------SKIRQAKEAMIKIVEDLKEDDHFNIILFESEVRKWKDGIIK 337 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V + I +++ G+ TN GL + + ++ + + + Sbjct: 338 ATPE--NVQEAKYFIGNITESGL-TNFNGGLMAGIEMLNNAHKLKIVPERSAS----LTI 390 Query: 293 FMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++DGE N+ + + A + +Y++G + FL + Sbjct: 391 MLSDGEANVGETDQFRIQENAKNASQGKYPLYSLGFGYNLDYGFLERLS 439 >gi|329928399|ref|ZP_08282269.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937835|gb|EGG34241.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 562 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 54/165 (32%), Gaps = 31/165 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + V DVS SM+ + ++ ++ + L I G V++S + Sbjct: 387 PVAAVFVADVSGSMD---GEPLNRLKESLLTGQKYLGRDNSI---------GFVSYSTDV 434 Query: 231 EEFFLLEWGVSHLQRK------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G L ++ I L G +T + G+ A + D + Sbjct: 435 TINLPI--GKYDLNQQSMFVGAINSLEASG-NTATFDGIVVAMKMLQDE-------MAAN 484 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + K +I ++DGE K VY IG Sbjct: 485 PDVKPLIFVLSDGETNVGHSLDDIRGLIQAFK---IPVYTIGYNA 526 >gi|297620569|ref|YP_003708706.1| hypothetical protein wcw_0326 [Waddlia chondrophila WSU 86-1044] gi|297375870|gb|ADI37700.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 375 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 75/233 (32%), Gaps = 47/233 (20%) Query: 165 NSQTDARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + D++ ++D S SM + +++++ A ++++ ++ ++ L Sbjct: 104 ATAQRKAHDIIFLVDASASMGVRDTRTGVSRLEYAKETVDEIVSRLRGES-------VAL 156 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F++ ++ ++ + T N + + MR +E Sbjct: 157 YAFTSDTTRLSPPTMDYLFVRLVLRDMEINEGDLAGT-------NLVEALSDMRDAYFSE 209 Query: 284 DANYKKIIVFMTDG------ENLSTKEDQQS---LYYCNEAKKRGAIVYAIGIRVIRS-- 332 + K +V +TDG E + Q L +A+K ++ IG+ + Sbjct: 210 QSPKMKTLVLITDGGDTELEEMKGESRNSQIEMLLSLIPDAEKYQLRIFTIGMGTKKGEK 269 Query: 333 ----------------HEFLRACASPNS--FYLVEN---PHSMYDAFSHIGKD 364 E L+ + +Y D IG++ Sbjct: 270 IPGVEDNGKPVVSSLDEELLKKLSDKGRGAYYFANQWTVMDLAGDLMKKIGEE 322 >gi|88603963|ref|YP_504141.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88189425|gb|ABD42422.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 233 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 44/133 (33%), Gaps = 12/133 (9%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLD S SM KI + + + +E+K + ++TF +E Sbjct: 23 VLVLDTSASMSGN------KIAELNEGLRILTDELKEDDLAVKRIDLAVITFGKGVELVR 76 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + G T + A + + ++ Y+ I + Sbjct: 77 PFT----GISAFDPPELSAGGYTPMGQAILEAVRLV--EERKAEYRTIGTDYYRPWIFLI 130 Query: 295 TDGENLSTKEDQQ 307 TDG+ ++ + Sbjct: 131 TDGQPTDMRKGDE 143 >gi|320170832|gb|EFW47731.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 1062 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 42/243 (17%), Positives = 75/243 (30%), Gaps = 57/243 (23%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 PL+ CT + + + + + + ++ +IV+D S SM Sbjct: 75 PLERCTLTRFGPSKALLSIQFPVNTIGAPKVQSKC--IIVIDKSGSMS----------GS 122 Query: 198 AIKSINAMLEEVKL--IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 IK++ ++L V ++TF + E S L + L G Sbjct: 123 PIKAVASVLAYVAKQGADPT-------VITFDTRAE--------TSTLSNVLPNLQA-GG 166 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY----Y 311 T+ L N +G +VFMTDG++ + Sbjct: 167 GTSFCAALNAISNACMPHKGTIS------------VVFMTDGQDTDMSSLPAAKANFQRM 214 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACA----SPNSFYLVENP-------HSMYDAFSH 360 + R +V +G + FLR + +P E + D F Sbjct: 215 VKSSPDRSVVVSTVGFGGEANLAFLRELSEIGTAPGYARFAETGDNSAALYEKLVDLFDV 274 Query: 361 IGK 363 +G Sbjct: 275 MGN 277 >gi|256823199|ref|YP_003147162.1| TPR repeat-containing protein [Kangiella koreensis DSM 16069] gi|256796738|gb|ACV27394.1| TPR repeat-containing protein [Kangiella koreensis DSM 16069] Length = 578 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 55/140 (39%), Gaps = 22/140 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++VLD+S SM+S D +++ A + +L+ +K Q+G V ++ Sbjct: 100 SRVLVLDLSASMDS-PDVKPSRLARAKFKLTDILKAIKDG-------QTGFVVYAGDGFV 151 Query: 233 FFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + L+ V + + G++ A + D + Sbjct: 152 LSPLTTDTDTIDNMVGVLATNLMPLVGSQPSKGIEKAIE-LLDNGAAGEGD--------- 201 Query: 290 IIVFMTDGENLSTKEDQQSL 309 I+++TDG + + +SL Sbjct: 202 -IIWITDGASDRELSNIESL 220 >gi|198421549|ref|XP_002127942.1| PREDICTED: similar to calcium activated chloride channel 4 [Ciona intestinalis] Length = 1075 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 21/193 (10%), Positives = 55/193 (28%), Gaps = 23/193 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLD S SM S + + +++ + +V+ + Sbjct: 325 VLVLDTSGSM------SGSNYEYMMQAATDFIMTYIPKGAEAGIVEFSYTATTLSQLVSI 378 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + +L ++ ST G+ + + ++ + Sbjct: 379 ENKADREYLASRLPGQP--DGSTCIGCGILNGIEVLSNQGRDPAGGQ---------LIVL 427 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN---SFYLVENP 351 TDGE + + A + +V +I + + + V + Sbjct: 428 TDGEENYSPYVNDVR---DNAIEAHVVVDSIFFGASGNGALQQLTEDTKGTMYYNDVTDI 484 Query: 352 HSMYDAFSHIGKD 364 + + F + + Sbjct: 485 TGLKETFKQLAES 497 >gi|53713712|ref|YP_099704.1| hypothetical protein BF2421 [Bacteroides fragilis YCH46] gi|60681983|ref|YP_212127.1| hypothetical protein BF2503 [Bacteroides fragilis NCTC 9343] gi|253565660|ref|ZP_04843115.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|52216577|dbj|BAD49170.1| conserved hypothetical protein [Bacteroides fragilis YCH46] gi|60493417|emb|CAH08203.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343] gi|251945939|gb|EES86346.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 289 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTN-IRLALEYLTNVMKRR 175 >gi|226941651|ref|YP_002796725.1| outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] gi|226716578|gb|ACO75716.1| putative outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] Length = 2392 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 37/226 (16%), Positives = 76/226 (33%), Gaps = 26/226 (11%) Query: 105 NDIDDIVRSTSLDIVV---VPQNEGYSISAISRYKIP-LKFCTFIPWYTNSRHIVMPITS 160 N D V+ LD V + +++ + + ++ I ++ Sbjct: 1605 NGTVDYVKDLQLDYTVKDADGATDQATVTVSPVDTVTGTSHGDTLLGDSSGEIITGDVSG 1664 Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFD----SSITKIDMAIKSINAMLEEVKLIPDVN 216 S +++ ++ I+LD S SMES+ SS +ID+A +I +L+ + Sbjct: 1665 SGGIST-PGMNYNLCIILDSSGSMESYMSTGSYSSSARIDVAKNAIQTLLDNMADFD--- 1720 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQG 275 + +V F + + GV+ ++ + T++ A I G Sbjct: 1721 GTINLSIVDFDSGSKMALQ---GVTLRDLCVQDAQGNWHLDTSAGSAFMQAMGNITATGG 1777 Query: 276 MRQHCN----------TEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 Y F+TDG+ + E+ + Sbjct: 1778 TNYESAFMQAEQWFSQQPTDGYTNHTYFVTDGKPTARYENSFADKI 1823 >gi|47218486|emb|CAF97220.1| unnamed protein product [Tetraodon nigroviridis] Length = 900 Score = 40.5 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 61/169 (36%), Gaps = 27/169 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-F 226 + D++IV+DVS SM+ ++ +A +I +L+ + N+ V + + Sbjct: 261 ATSPKDLVIVVDVSGSMKGL------RLTIAKHTIKTILDTL----GENDFVNIIAYSDY 310 Query: 227 SNKIEEFFLLE------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +E F H + I+ L G +K ++ + + + Q Sbjct: 311 VRYVEPCFKGTLVQADLDNREHFKLLIEELHVKGEG-KVKKAMKESFKILNEAAALGQGS 369 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG ++ + + +R V+ I Sbjct: 370 LCNQA-----IMLITDGAMEDFQDVFEEFNW----PERRVRVFTYLIGR 409 >gi|294788348|ref|ZP_06753591.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] gi|294483779|gb|EFG31463.1| tellurium resistance protein [Simonsiella muelleri ATCC 29453] Length = 212 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 57/157 (36%), Gaps = 25/157 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + ++ ++ P ++TF ++ Sbjct: 4 RLPVYLLVDTSGSMHG------EAIEAVRNGLQVLVSALRQDPYALETAYLSVITFDSQA 57 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 ++ L + +I + G + + A + D + + A K Sbjct: 58 KQVTPL---TELMNFQIPNIEANGAT-----AMGGALTLLADC--INREVVKGSAEVKGD 107 Query: 289 -KIIVF-MTDGENLSTKEDQQSLYYCN-EAKKRGAIV 322 K +VF ++DG D S + +A K G V Sbjct: 108 WKPVVFLLSDGSPT----DSISKGIADIKAVKTGIFV 140 >gi|260592522|ref|ZP_05857980.1| von Willebrand factor, type A [Prevotella veroralis F0319] gi|260535568|gb|EEX18185.1| von Willebrand factor, type A [Prevotella veroralis F0319] Length = 290 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 19/102 (18%), Positives = 46/102 (45%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+V+DVS S++ + + + + ++ + G++ F Sbjct: 72 EEERELTVMLVIDVSGSLDFGTAGQLKRECVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKY 265 S+ +E++ + G H+ I+ + F + T+ GL+Y Sbjct: 126 SDHVEKYIPPKKGRKHILYLIREMLTFTPNSKKTDVGVGLEY 167 >gi|118388811|ref|XP_001027501.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89309271|gb|EAS07259.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1543 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 43/318 (13%), Positives = 95/318 (29%), Gaps = 59/318 (18%) Query: 38 IEVSHIFFMKTVLHSMIDRSLVH----AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMS 93 I++ HI + + + + + ++ + ++ C+ Sbjct: 792 IQILHISLEENKFITPNKQKSLQRDQNSLNNSSSDTKSLSPEQSSSDKKQCKQITVPKEL 851 Query: 94 FRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-R 152 E++ N V D++ N I K C + ++ Sbjct: 852 EDEEVKHNFQVQPPKDVLI-----------NPQLPIKIQDLIKSHYNHCQESANFISNFY 900 Query: 153 HIVMPITSSVKVNSQTDAR--LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + R LD+M ++D + SM + I I +++ +K Sbjct: 901 QLSEDMKQFLPIYQNILPRMELDLMFIMDCTGSMSGW-------IQAVKDEILSIIAAIK 953 Query: 211 LIPDVNNVVQSGLVTFSNK--IEEFFLLEWGVS--HLQRKIKYLSKFGVS---TNSTPGL 263 I N ++ + + + I+ F + ++ Q + + G + + G Sbjct: 954 DINKGNTSIRISFIGYRDYGSIQRFSIFDFSSEIDSFQNFLNQIQAEGGNDSEEDVAGGF 1013 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD----GE----------NLSTKEDQQSL 309 K+A Q + Q K VF+ D G + Sbjct: 1014 KHANLQQWKSQA-------------KYAVFIADCPAHGREYSDGHDDRYPDGDPDGVDLK 1060 Query: 310 YYCNEAKKRGAIVYAIGI 327 K+G +Y I I Sbjct: 1061 QEFKNLIKKGVQLYTIQI 1078 >gi|239636912|ref|ZP_04677911.1| von Willebrand factor type A domain protein [Staphylococcus warneri L37603] gi|239597586|gb|EEQ80084.1| von Willebrand factor type A domain protein [Staphylococcus warneri L37603] Length = 629 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 35/299 (11%), Positives = 101/299 (33%), Gaps = 36/299 (12%) Query: 39 EVSHIFFMKTV-LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSF-RN 96 +++ + K + +DR I + DI ++ + Sbjct: 313 DMTDMMTKKGKGSQNTLDR---EEGGFIGQNSAFALEGINQNVDIKWKVPDILPEYIQAY 369 Query: 97 ELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFC--TFIPWYTNSRHI 154 E N +I D+++ I Q+E ++++ K + + + + + Sbjct: 370 EDVKNDVQFEIKDLIQIIKKTIEREHQDERHNLTKGRLQKNLINWFIDDQYKLFYKKQDL 429 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 ++ +++D S SM D +I + + +++ + + +++ Sbjct: 430 SQSFDAT------------FTLLIDASASMHDKMDETIKGVVLFHETLKELNVKHEILAF 477 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR-KIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + S N I+E ++ +I L + ++ A +++ Sbjct: 478 NEDAFDSDDTKQPNIIDEIIHYDYSTLKKDGPRIMALEPQDDN-RDGVAIRIASDRL--- 533 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 +++++ ++ +DGE + Q + +A+K G V+ + + Sbjct: 534 --------IRRSHHQRFLIVFSDGEPSAYNYSQDGIIDTYEAVEDARKFGIEVFNVFLS 584 >gi|153872431|ref|ZP_02001326.1| PpkA [Beggiatoa sp. PS] gi|152071102|gb|EDN68673.1| PpkA [Beggiatoa sp. PS] Length = 523 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 69/181 (38%), Gaps = 16/181 (8%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T +V LD+ V+D++ SM F I + AI + M+ + +++ Sbjct: 83 TETVTDTQAKTLGLDIKFVMDMTGSMGPF----IKRTKEAIAKVATMIAK----ENLDIQ 134 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKY--AYNQIFDMQG 275 V+ GLV + + + L++ +++ + + +F V ++P Y + Sbjct: 135 VRFGLVGYRDDLVTVPGLKFVINNFTPTLVTVDEFDKVIAKASPATDKSGDYQEEAFAGI 194 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS-----LYYCNEAKKRGAIVYAIGIRVI 330 + + N K I+ + D + Q + + A + + +I ++ Sbjct: 195 NEALNSDWNDNTLKFIILVGDASSHPVSHQQNTTGLDAVQLRELANSQKINIISIHLKAS 254 Query: 331 R 331 R Sbjct: 255 R 255 >gi|149202861|ref|ZP_01879832.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] gi|149143407|gb|EDM31443.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] Length = 212 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 69/211 (32%), Gaps = 42/211 (19%) Query: 175 MIVLDVSRSMES--FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 M+V D S SM ++ T+I A ++ +++PDV + GL+++ Sbjct: 1 MLVFDGSGSMAEVGHDPTAATRIIEARAALR------RVMPDVAPYRRIGLLSYGAGGSH 54 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNST------PGLKYAYNQIFDMQGMRQHCNTEDAN 286 S + R + + GL + + ++ Sbjct: 55 P------CSGITRHFAPMPDAAAAVVVAVDALMPGGLTPIAASVAAAAEVLEYRTQPG-- 106 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---------IRSH---- 333 I+V +TDG N + +L A+ R V+ IG RV + Sbjct: 107 ---IVVLVTDG-NETCGGTPCALGAALTAEARDLTVHVIGFRVVHDPFSWNSPEAQGYDG 162 Query: 334 EFLRAC---ASPNSFYLVENPHSMYDAFSHI 361 + + C A+ F E + +A Sbjct: 163 QTVAKCLADATGGVFVSTETVDELVEALRET 193 >gi|113475558|ref|YP_721619.1| integrin, beta chain-like [Trichodesmium erythraeum IMS101] gi|110166606|gb|ABG51146.1| Integrin, beta chain-like [Trichodesmium erythraeum IMS101] Length = 663 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 79/278 (28%), Gaps = 59/278 (21%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDV 180 + + SI+ I K L ++ IT+ V S T LD++ + D+ Sbjct: 1 MAKKTKSSITNIVDRKGGLNPDVVDVTLVPGDNVTFDITAKVTKKSSTKLPLDLVFLSDL 60 Query: 181 SRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWG 239 S S + + ++ V+ I + GL ++ + F + Sbjct: 61 SGSYGDDLP-------VLQDLVPKLVSSVRDIQPNSQF---GLASYIDKPKDPFGGPKDF 110 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM---------------RQHCNTED 284 V ++ I T S + A + + G R+ Sbjct: 111 VYRMESAI---------TKSRTDFQKAMDDLKIGNGNDGPEAQLEALMQLALREKEIGFR 161 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL--RACASP 342 ++++V TD + + + KK G L + + Sbjct: 162 KKSRRVVVLSTD----------ANYHKAGDGKKAGIK------TPNNGDTVLDGKPAGTG 205 Query: 343 NSFYLVENPHSMYD------AFSHIGKDIVTKRIWYDK 374 + ++ F+ G + + DK Sbjct: 206 EDYPSIDQVRDALQEAGIVPIFAVTGNQVRNYKKLVDK 243 >gi|257389158|ref|YP_003178931.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257171465|gb|ACV49224.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 393 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 72/211 (34%), Gaps = 29/211 (13%) Query: 158 ITSSVKVNS---QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +T+ + V +TD R + + +D S SME +I + + +L + Sbjct: 20 VTAEIDVEPGEQETDVRRHIALCIDTSGSME---GDNIKRARDGAAWVFGLLADED---- 72 Query: 215 VNNVVQSGLVTFSNKIEEFFLLE-WGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIF 271 +V F + W Q + ++ + G T+ GLK A + Sbjct: 73 -----YVSIVAFDTEATVILPATRWSDLDRQTAMDHVEELTAGGGTDMYNGLKAAKETLS 127 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 T K+ ++ ++DG++ D+ G + + GI Sbjct: 128 SS-------ATGPDTVKR-LLLLSDGKDNERTPDEF-EGLAEAIDDAGIRIQSAGIGTDY 178 Query: 332 SHEFLRACASP--NSFYLVENPHSMYDAFSH 360 + +R + ++ +E P + D F Sbjct: 179 NEATIRTLGTAGRGTWTHLEAPGDIEDFFGE 209 >gi|145549918|ref|XP_001460638.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124428468|emb|CAK93241.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 35/191 (18%), Positives = 62/191 (32%), Gaps = 27/191 (14%) Query: 136 KIPLKFCTFIPWYTNSRHI-VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK 194 K LK W + I S + R+ + +LD S SM + Sbjct: 340 KKQLKDKLIKIWQEYGQDICNKLQKSYQNIQQLNHNRVHYIFILDSSESMNKDWTD---- 395 Query: 195 IDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIK---YL 250 K + ++++K V N L+ F+ + S R IK + Sbjct: 396 ---IKKGVREFIKKIKEKDQVENKEFWISLILFNKEQTTLIN-----SKRARDIKTKFKM 447 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + G TN +K A N + I+F TDG+ +++ + + Sbjct: 448 NFLGGGTNFGKPIKKAINLVKKDNTSDLFL----------ILFYTDGKAAIPEQELKKMQ 497 Query: 311 YCNEAKKRGAI 321 E K++ Sbjct: 498 NLEEEKRKKIH 508 >gi|118367272|ref|XP_001016851.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89298618|gb|EAR96606.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 1633 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 59/167 (35%), Gaps = 23/167 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM ++ A++ +K +P+ + +++F + E+ Sbjct: 1048 FIFILDRSGSMRG---------QPIRRACEAIILFLKSLPNDSYF---NVISFGSSFEKL 1095 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 F + ++ + + +S G YN + ++ + Y + I Sbjct: 1096 FPFS--TKYTSESLEKAVQIINNYDSDLGGTEIYNPLHNV-----FIMKRISGYNRQIFL 1148 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +TDGE +Q K V++IG ++ A Sbjct: 1149 LTDGE--VDSSEQVIELIKKNNKYN--RVHSIGFGFKADQYLIKESA 1191 >gi|54296668|ref|YP_123037.1| structural toxin protein RtxA [Legionella pneumophila str. Paris] gi|53750453|emb|CAH11847.1| structural toxin protein RtxA [Legionella pneumophila str. Paris] Length = 7679 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 46/257 (17%), Positives = 90/257 (35%), Gaps = 15/257 (5%) Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 ++ + NN G IK T + + +D D I S +++ Sbjct: 6505 VSNSSLNNETFDIGLFSYNTIKTT-PSEININMGLSLTDSDGDKINSSIEINLAPSVFKV 6563 Query: 126 GYSISAISRYKIPLKF---CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 G ++ S +P + I + +V + V + + +LD S Sbjct: 6564 GENVDDTSSSNVPHRVGGDTGVIDGSGGADILVGDVGGVEVVGTTAR----LAFILDESG 6619 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM F T++++ +++ +L E+ P+ + V LV F++ + E Sbjct: 6620 SMSQNFGG-TTRLEVLKQAMTDILTELSNTPNASITVH--LVKFASVVNGTGTFEITGGG 6676 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 LQ+ + ++S G+ Y + + ++ + F TDG Sbjct: 6677 LQQALDFIS--GLQIQQGLLAGTNYEAALGQTVQWFSSQSGTVDVQQTL-FFTDGVPT-F 6732 Query: 303 KEDQQSLYYCNEAKKRG 319 D S Y N A+ G Sbjct: 6733 YMDGNSTEYTNLARVYG 6749 >gi|321460551|gb|EFX71592.1| hypothetical protein DAPPUDRAFT_326968 [Daphnia pulex] Length = 950 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 67/192 (34%), Gaps = 34/192 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++V+DVS SM+ F + I K+ ++++ +K N Q G+V FS+ E Sbjct: 295 FVVVMDVSGSMKEF--NRIGKLGESVRA------WIKTDFPSGN--QLGMVQFSSNAEIL 344 Query: 234 FLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S + K + V+T GL+ A + D I Sbjct: 345 SDLRMIADEKSREEMMAKVPKEVFVATCIGCGLQLAMQMLKDGG---------------I 389 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFYLVE 349 IV +TDG+N D + + V I + Y ++ Sbjct: 390 IVLVTDGKNSPGYHDISDVK--KDIVDAKIRVITIAYGSEADKNVEHLADVTGGKSYFIK 447 Query: 350 N---PHSMYDAF 358 + ++ AF Sbjct: 448 DDDSSEALQQAF 459 >gi|296820384|ref|XP_002849932.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238837486|gb|EEQ27148.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 302 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 54/190 (28%), Gaps = 16/190 (8%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 PIT+S ++ D + ++D S SM T+ +A + + + Sbjct: 61 ESPITASSDQADSFLSQFDTVFLIDDSASMHGA-SWQATENALAKIAPICTARDKDGVEI 119 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 +SG + H++ ++ G + T + + + Sbjct: 120 YFLNHRSGSA------NGAYSNVRDTGHVREVFTAVTPSGGTPTGTRLMHILKPYLKGFE 173 Query: 275 GMRQHC---NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR------GAIVYAI 325 + + I+ +TDG E L K G + + Sbjct: 174 EASKSMTSDERDRVVRPLNIIVITDGVPTDDVESVIILAAKKLDKLNALPWQVGIQFFQV 233 Query: 326 GIRVIRSHEF 335 G + + Sbjct: 234 GDDPEAAEDL 243 >gi|114707528|ref|ZP_01440424.1| von Willebrand factor, type A [Fulvimarina pelagi HTCC2506] gi|114537087|gb|EAU40215.1| von Willebrand factor, type A [Fulvimarina pelagi HTCC2506] Length = 584 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 38/222 (17%), Positives = 70/222 (31%), Gaps = 31/222 (13%) Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-QTDARLDMMIVLD 179 P++ ISA +P + + + V+ S +++ Q + V+D Sbjct: 345 DPRHSSLDISATLTAALPWQRFRRQRFPRLTDRPVILTPSDIRIRRLQAKRETATIFVVD 404 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEW 238 S S +++ ++ A +I +L E + +V F E Sbjct: 405 ASGS------AALARLAEAKGAIERILAEC-----YRRRDRVAMVAFRGTSAETLLPETK 453 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 ++ +R + LS G T G+ A RQ E IVF+TDG+ Sbjct: 454 SLTRARRALAGLSA-GGGTPLASGIALA------GDLARQCERQERTPL---IVFLTDGK 503 Query: 299 NLST--------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + + +G I + Sbjct: 504 ANITLDGMAGRSSAREDVNTQATVLRSQGYASLVIDFSLRPG 545 >gi|553536|gb|AAA36226.1| MHC factor B [Homo sapiens] Length = 171 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 44/134 (32%), Gaps = 17/134 (12%) Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLY 310 K TN+ L+ Y+ + + +I+ MTDG N+ Sbjct: 10 KLKSGTNTKKALQAVYS-MMSWPDDVPPEGWNRTRH--VIILMTDGLHNMGGDPITVIDE 66 Query: 311 YCN---------EAKKRGAIVYAIGIRVIRSHEFLRACAS----PNSFYLVENPHSMYDA 357 + ++ VY G+ + + + A AS + V++ ++ D Sbjct: 67 IRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDV 126 Query: 358 FSHIGKDIVTKRIW 371 F + + + + Sbjct: 127 FYQMIDESQSLSLC 140 >gi|296330466|ref|ZP_06872945.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305674748|ref|YP_003866420.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii str. W23] gi|296152363|gb|EFG93233.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305412992|gb|ADM38111.1| putative activator of nitric oxide reductase [Bacillus subtilis subsp. spizizenii str. W23] Length = 638 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 34/222 (15%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRS 183 I +K L N++ + + ++ + +++D S S Sbjct: 395 IQKTLEHKKTLPRTDLHAGRLNNKLLRYFTERNPRLFYKKQEPSSEIDAVFTLLVDCSAS 454 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M K+D + I E +K + + +V G +N E + + + Sbjct: 455 M-------FDKMDETKRGIVLFHEALKSVAVPHQIV--GFWEDTNDATETSQPNYFNTVI 505 Query: 244 -------QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 Q + + ++ G YA Q+ + + +K ++ +D Sbjct: 506 PFQSSLRQEAGPAIMQLEPEEDNRDG--YAIRQMTK-------KMLQRSEAQKFLIVFSD 556 Query: 297 GENLST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 GE + + EA+KRG V + + E Sbjct: 557 GEPAAFGYEQNGIVDTSEAVIEARKRGIEVINVFLSNSEIEE 598 >gi|239787298|emb|CAX83775.1| conserved uncharacterized protein [uncultured bacterium] Length = 307 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 64/191 (33%), Gaps = 41/191 (21%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFF-----DSSITKIDMAIKSINAMLEEV---- 209 T S +++ +T + ++D SRSM F + +T+ + ++N L E+ Sbjct: 2 TYSQEISRETPGAF--LFLVDQSRSMNKEFGKSQDNGPVTRAAVVADALNNTLAELVNRC 59 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLLEWG-------------------VSHLQRKIKYL 250 V + + G++ + F E G S ++ ++ Sbjct: 60 TRDEGVTDYFEIGVIGYGRTSRPAFCWEDGLARRTMVPISEVAAHARTETSEVETIVRGN 119 Query: 251 SKFGVSTNS---------TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + T S + + A I + Q ++ I++ +TDG Sbjct: 120 AVKETVTISKWLSPVALESTPMNAAL--IMARATLEQWIYNHPKSFPPIVINITDGMAND 177 Query: 302 TKEDQQSLYYC 312 +++ L Sbjct: 178 VNSEEELLVTA 188 >gi|213966278|ref|ZP_03394461.1| Cna protein B-type domain protein [Corynebacterium amycolatum SK46] gi|213951051|gb|EEB62450.1| Cna protein B-type domain protein [Corynebacterium amycolatum SK46] Length = 1079 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 46/267 (17%), Positives = 92/267 (34%), Gaps = 35/267 (13%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN-DIDDIVRSTSLDIVVVPQNEG 126 E + K L +I+ + RN NGF+ ++ + +D VVP++ Sbjct: 354 ETDATFLDPQKYNLTLDQIEPGVTLQKRNVNIGNGFITMEVVPVKDGKPVDSAVVPKDAV 413 Query: 127 YSISAISR----YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-------LDMM 175 ++ + K+ L I+ P + V+ + L + Sbjct: 414 FTFTNNLSQNKNLKVTLDVYGEKKDEPKPPEIISPPDGAKWVHGRVPNPPMPQRCGLRIA 473 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +V D+S S+ + + + K+ NA ++ ++ P + G+ F++ Sbjct: 474 VVADLSTSLRY---ADTDGFEASKKAANAFIDSLQGTP-----AELGIYNFASD---GPA 522 Query: 236 LEWGVSHLQR--KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + G +H Q I LS+ GV ++YN N ++ F Sbjct: 523 NQAGTTHGQNPPYISMLSEQGVQRAKDVVKNWSYNANNSATNWEAGLKQVKDNDYDVVYF 582 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA 320 +TDG ++ K +G Sbjct: 583 ITDGMPTTSS----------TIKSKGI 599 >gi|212538105|ref|XP_002149208.1| von Willebrand and RING finger domain protein [Penicillium marneffei ATCC 18224] gi|210068950|gb|EEA23041.1| von Willebrand and RING finger domain protein [Penicillium marneffei ATCC 18224] Length = 1016 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 82/263 (31%), Gaps = 34/263 (12%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 ++ + W +F L + V D D D + Sbjct: 401 IVSDLPCFHLRFQNRGQLDLWRRAFAG-LHNPEPVRDFDADSAIREEDDYRTTKTNKRQS 459 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 SA S Y T + YTNS P S LD+++V+ VS SM+ Sbjct: 460 SATSSYGANKSNNTALTEYTNSMRDPRPAPSF-------HVPLDLVVVIPVSSSMQGL-- 510 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFFLLEWGVSHLQ 244 KI + ++ ++ + + GLVTF + + W Sbjct: 511 ----KITLLRDALRFLVLNLGPRD------RMGLVTFGSSGGGVPLVGMTTKAWN--GWG 558 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + + L G + ++ A N D+ R+ N I+ ++D + Sbjct: 559 KILNSLRPVGQKSLRADVVEGA-NVAMDLLMQRKSNNPIST-----ILLISDSSTSDPES 612 Query: 305 DQQSLYYCNEAKKRGAIVYAIGI 327 + AK G + +G+ Sbjct: 613 VDFVVSRAEAAK-VGIHSFGLGL 634 >gi|261404225|ref|YP_003240466.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261280688|gb|ACX62659.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 562 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 54/165 (32%), Gaps = 31/165 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + V DVS SM+ + ++ ++ + L I G V++S + Sbjct: 387 PVAAVFVADVSGSMD---GEPLNRLKESLLTGQKYLGRDNSI---------GFVSYSTDV 434 Query: 231 EEFFLLEWGVSHLQRK------IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + G L ++ I L G +T + G+ A + D + Sbjct: 435 TINLPI--GKYDLNQQSMFVGAINSLEASG-NTATFDGIVVAMKMLQDE-------MAAN 484 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + K +I ++DGE K VY IG Sbjct: 485 PDVKPLIFVLSDGETNVGHSLDDIRGLIQAFK---IPVYTIGYNA 526 >gi|265764036|ref|ZP_06092604.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263256644|gb|EEZ27990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 289 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTN-IRLALEYLTNVMKRR 175 >gi|212693199|ref|ZP_03301327.1| hypothetical protein BACDOR_02709 [Bacteroides dorei DSM 17855] gi|237709941|ref|ZP_04540422.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725392|ref|ZP_04555873.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753593|ref|ZP_06088948.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664304|gb|EEB24876.1| hypothetical protein BACDOR_02709 [Bacteroides dorei DSM 17855] gi|229436079|gb|EEO46156.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456034|gb|EEO61755.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235307|gb|EEZ20831.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 289 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + K + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSL------DFGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T +K A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTD-IKTAVEYLTNVIKKR 175 >gi|260790216|ref|XP_002590139.1| hypothetical protein BRAFLDRAFT_90872 [Branchiostoma floridae] gi|229275328|gb|EEN46150.1| hypothetical protein BRAFLDRAFT_90872 [Branchiostoma floridae] Length = 1143 Score = 40.5 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 55/172 (31%), Gaps = 28/172 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAML-EEVKLIPDVNNVVQSGLVTFSNKIEE 232 + I++D S SM K + +++ +++ L + N+++ + +++ Sbjct: 934 VAILVDTSGSMGPHLP-------ELKKELASLVWDQLCLNTEKFNLIRFSSDVDTWQVQL 986 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + G +TN+ L+ A+ + Sbjct: 987 VEPTDEFCHDAVQWQATFVAEG-NTNTLGALEEAFQDSAVDG----------------VY 1029 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRACASP 342 +TDG+ + RG V I S + FL+ AS Sbjct: 1030 LLTDGKP-DQSASMVLKEVAKMNEGRGVHVNTISFNCDDSTANSFLKQLASE 1080 >gi|301163421|emb|CBW22972.1| conserved hypothetical protein [Bacteroides fragilis 638R] Length = 289 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTN-IRLALEYLTNVMKRR 175 >gi|168212598|ref|ZP_02638223.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] gi|170715692|gb|EDT27874.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] Length = 620 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 67/175 (38%), Gaps = 28/175 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQEPDKDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKASMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-----GVSHLQRKIK---YL 250 + + + IP+ + +V F +++E F + IK Y Sbjct: 114 VDKV------IDNIPNS----RIAVVAFGTEVKEVFSFNNKNKFTSKEEYKNAIKGSYYY 163 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 +TN + A + ++ ++N KK ++F +DG ++++ Sbjct: 164 ITRKGNTNIEGTWRVA-------DEIFKNELNNNSNSKKDVIFFSDGYPNVSRDE 211 >gi|158523143|ref|YP_001531013.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] gi|158511969|gb|ABW68936.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] Length = 1006 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 86/227 (37%), Gaps = 46/227 (20%) Query: 172 LDMMIVLDVSRSME------SFFDSSITK-----------IDMAIKSINAMLEEVKLIPD 214 + ++++D+S S E S DS K ID+ +S+ M E + + Sbjct: 795 MSTLLLIDMSASTEETAPEVSAEDSQDKKGGKSSRDDKRVIDIEKESLIVMSEALDAL-- 852 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G + ++++++ ++ +I + ST +++A +++ Sbjct: 853 GDQYAMYGFSGHGREHVDYYVIKSFDESNTEKVKMRICGIEPR-QSTRMGTAIRHAVSKL 911 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDG----------ENLSTKEDQQSLYYCNEAKKRGA 320 + + ++++ ++DG N ++ EAK+ G Sbjct: 912 SNREADH-----------RLLILLSDGFPQDLDYGEDRNSREYGLNDTMMAFIEAKRLGI 960 Query: 321 IVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + I I +++L+ +P + ++++ + + I + ++ Sbjct: 961 KPFCITID-QSGNDYLKKMCAPEEYLIIKDIAMLPELLPGIVESLMG 1006 >gi|825681|emb|CAA34346.1| inter-alpha-trypsin inhibitor C-terminal [Homo sapiens] Length = 837 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 219 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 266 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 267 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 321 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 322 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 381 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 382 QRIYEDHDATQQLQGF 397 >gi|116622485|ref|YP_824641.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225647|gb|ABJ84356.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 66/229 (28%), Gaps = 46/229 (20%) Query: 151 SRHIVMPITSSVKVNSQTDARLD----MMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + S DA + ++ D S M + + +I + Sbjct: 65 PQKLASFAEGSTVFRLVDDAPGSPGTAVFVLFDTSNRMYNSYP-------YVRDAIAEFI 117 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 ++ + TFS + L ++ N + G A Sbjct: 118 RRLEPVDSAAIY------TFSRNLYRAAPLTR---------DHVRAGAGLDNISAGDDSA 162 Query: 267 -YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK---KRGAIV 322 +N + + + +K IV + ++ ++ L + + G + Sbjct: 163 LFNALL-----LTLRDAAEVPGRKAIVVFS-----NSSDNASVLSPYDVGRLAVNEGVPI 212 Query: 323 YAI----GIRVIRSHEFLRA--CASPNSFYLVENPHSMYDAFSHIGKDI 365 Y I G++ ++ L A+ Y AF I +DI Sbjct: 213 YVISTRDGVQDPVTNSALHYLTAATGGKLYSARTWQKQAGAFQAIREDI 261 >gi|282877525|ref|ZP_06286343.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] gi|281300349|gb|EFA92700.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310] Length = 296 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 49/102 (48%), Gaps = 7/102 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK +MA + + I + + + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLD-FGTKKQTKREMATEIAATL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-GLKYAY 267 S++IE++ + G H+ I+ + F + T + Y Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLDFDPESKKTDVAMAIEY 167 >gi|114587330|ref|XP_001172443.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 3 [Pan troglodytes] Length = 898 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDRDATQQLQGF 471 >gi|328788303|ref|XP_003251102.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Apis mellifera] Length = 1234 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 76/206 (36%), Gaps = 35/206 (16%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I++D S SM + ++A +N +L+ + N+ V +TFS Sbjct: 218 ATSPKDILILIDTSGSMTGI------RREIARHVVNNILDTL----GNNDFVNI--ITFS 265 Query: 228 NKIEEFFL---------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 N +E V L+R I L + N + L A+ + + R+ Sbjct: 266 NVTKEVVPCFNDTLVQANLANVRELKRAILNLDTEKI-ANFSLALTTAFELLETYRTERE 324 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK---KRGAIV----YAIGIRVIR 331 A I+ +TDG + KE ++ + + K V Y IG V Sbjct: 325 GARCNQA-----IMLITDGVPYNYKEIFETYNWRDNPDEPFKADMPVRMFTYLIGREVAD 379 Query: 332 SHEF-LRACASPNSFYLVENPHSMYD 356 E ACA+ F + + + Sbjct: 380 VKEVQWMACANRGYFVHLCTLAEVRE 405 >gi|297199739|ref|ZP_06917136.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|297147478|gb|EFH28643.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length = 448 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 51/166 (30%), Gaps = 30/166 (18%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T I + + + + I++D S SM D TK+ A + Sbjct: 38 TGGGTIGSAVGAPHLWTAGQGPSAAVAIMVDCSGSM----DYPPTKMRNARDA------T 87 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNST 260 I + + V ++ ++ +E + ++ ++ LS G T Sbjct: 88 AAAIDTLRDGVHFAVIGGTHVAKEVYPGGGSLAVADATTRDQAKQALRRLSA-GGGTAIG 146 Query: 261 PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L+ A + +H + +TDG N + Sbjct: 147 TWLRLADRLLSSADVAIRHG-----------ILLTDGRNEHESPED 181 >gi|172035989|ref|YP_001802490.1| magnesium chelatase, ATPase subunit D [Cyanothece sp. ATCC 51142] gi|171697443|gb|ACB50424.1| magnesium chelatase, ATPase subunit D [Cyanothece sp. ATCC 51142] Length = 682 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 39/279 (13%), Positives = 84/279 (30%), Gaps = 57/279 (20%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGY-------------SISAISRYKIPLKFCTFIPWY 148 F + +S S +I+ Y ++ A R P + + + Sbjct: 405 YFAQMAQNQGKSGSRNIIFSDDRGRYVKPMLPKGKVRRIAVDATLRAAAPYQ-KSRRLRH 463 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 IV K ++ L ++ V+D S SM ++ ++ A ++ +L E Sbjct: 464 PGRNVIVEQGDIRSKRLARKAGAL-IVFVVDASGSM------ALNRMQSAKGAVMRLLTE 516 Query: 209 VKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 N Q L+ F + + +S +++++ L G S + + Sbjct: 517 A-----YENRDQVALIPFRGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQA-- 569 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE---------------DQQSLYYC 312 + + + +IV +TDG Q+ L Sbjct: 570 --VHVGMNAKMSGDIGQV----VIVAITDGRGNIPLAKSLGEPIPEGEKPDIKQELLDIA 623 Query: 313 NEAKKRGAIVYAI-----GIRVIRSHEFLRACASPNSFY 346 + + G + I + E + ++Y Sbjct: 624 GKIRGLGIKLLMIDTEKKFVSTGFGKELAQTAG--GTYY 660 >gi|146186309|ref|XP_001033348.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146143017|gb|EAR85685.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1149 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 52/135 (38%), Gaps = 22/135 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----SN 228 D++ ++D + SM S+ + T+ + ++E+VK + ++ G V F N Sbjct: 148 DLLFIMDCTGSMSSYINMCRTQ-------LFLIVEQVKKQFQKS-TLKIGFVGFRDFGDN 199 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + L+ I +S G + ++ + A+ + +M + Sbjct: 200 NQFEIYQFTTEYDKLKSFINSVSATGGN-DTAEDVAGAFEKALEMNWTCE---------A 249 Query: 289 KIIVFMTDGENLSTK 303 K + +TD K Sbjct: 250 KYAILITDAYPHGNK 264 >gi|300789363|ref|YP_003769654.1| hypothetical protein AMED_7539 [Amycolatopsis mediterranei U32] gi|299798877|gb|ADJ49252.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 526 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 67/205 (32%), Gaps = 37/205 (18%) Query: 174 MMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + ++LDVS SM + +++ + +++ L K + GL FS K++ Sbjct: 323 VQVLLDVSGSMAATVPGTGRSRMALTLEAATQGLGLFKP------TTEIGLWLFSTKLDG 376 Query: 233 F-------------FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L G + +K K G +T + AY RQ Sbjct: 377 PKDYKELLPMRPISQQLAAGGVATLQAVK--PKPGGATGLYDSILAAYQ------NARQS 428 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHEF 335 N +V +TDG N + ++ V IGI Sbjct: 429 WQLGRINV---VVVLTDGRNEDDDSIGLPGLLAELGRLQDPRKPLPVIGIGIGPDIDASE 485 Query: 336 LR--ACASPNSFYLVENPHSMYDAF 358 LR + A+ + +P + D F Sbjct: 486 LRQVSAATGGESFTTPDPRKISDVF 510 >gi|290979035|ref|XP_002672240.1| predicted protein [Naegleria gruberi] gi|284085815|gb|EFC39496.1| predicted protein [Naegleria gruberi] Length = 2816 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 66/196 (33%), Gaps = 18/196 (9%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT 168 T L+ + I KF I ++S +T S + + Sbjct: 2204 TAGTPTILNPSDPANTRIIRVQPIKGSCNSQKFVYGIKSDSDSTTATAIVTVSYTKCTCS 2263 Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D++ VLD S S+++ ++ + N + I + G++ FS Sbjct: 2264 LN-MDIVFVLDGSSSIDAGNWINMQNF-----ASNLTVGFGSNISPTA--TKIGIIQFST 2315 Query: 229 KIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L+ G S + I + STNS G+ A + + Sbjct: 2316 TTVTHLDLKDGTSKTAVLNAISSAGQLKSSTNSILGINNAVSMVLSQGRKDI-------- 2367 Query: 287 YKKIIVFMTDGENLST 302 +K+++ +TDG + Sbjct: 2368 TRKLLIHITDGMSNLP 2383 >gi|229512589|ref|ZP_04402059.1| IncF plasmid conjugative transfer protein TraN [Vibrio cholerae TMA 21] gi|229350371|gb|EEO15321.1| IncF plasmid conjugative transfer protein TraN [Vibrio cholerae TMA 21] Length = 933 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 58/169 (34%), Gaps = 29/169 (17%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 IT ++++ ++ D++ ++D S SM I ++ A + + D ++ Sbjct: 218 ITHTIEIAAE---PADIVFLIDNSGSMGGV-------IHDLRNNVGAFAQLL--TQDGSD 265 Query: 218 VVQSGLVTF--SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 ++ G + S+ I L + + + ++ T + +A N Sbjct: 266 DLRLGGAVYRHSDYIFNRVELTPDFNAFKTWLDGVNAASAPTYPFDAITWAAN------- 318 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 H + +K++V + +S A +++ Sbjct: 319 ---HYQWRGGDIRKVLVLV----GNDDSAGNKSAAISALA-SNQVELFS 359 >gi|145500364|ref|XP_001436165.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403304|emb|CAK68768.1| unnamed protein product [Paramecium tetraurelia] Length = 604 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 72/203 (35%), Gaps = 31/203 (15%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + +D++ V+D S SM S KI+M +++N +L + + Sbjct: 175 MDQQQHSKVGVDLLCVIDRSGSM------SGEKIEMVKQTLNILLNFLGPKD------RL 222 Query: 222 GLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L+ F + + L ++ I + G T G + A QI + Sbjct: 223 CLIQFDDTCQRLTNLRRVTDENKTYYSDIISKIYANG-GTVIGLGTQMALKQI------K 275 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + I ++DG+ + SL K+ +++ G + + Sbjct: 276 YRKSVNNVTA---IFVLSDGQ---DEAAISSLQKQLAYYKQTLTIHSFGFGSDHDAKLMT 329 Query: 338 ACAS--PNSFYLVENPHSMYDAF 358 ++ SFY V N + + F Sbjct: 330 KISNLGKGSFYFVNNISLLDEFF 352 >gi|119585665|gb|EAW65261.1| inter-alpha (globulin) inhibitor H1, isoform CRA_c [Homo sapiens] Length = 911 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|33989|emb|CAA45188.1| inter-alpha-trypsin inhibitor heavy chain ITIH1 [Homo sapiens] Length = 911 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|321265474|ref|XP_003197453.1| hypothetical protein CGB_N0210C [Cryptococcus gattii WM276] gi|317463933|gb|ADV25666.1| Hypothetical Protein CGB_N0210C [Cryptococcus gattii WM276] Length = 445 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 64/200 (32%), Gaps = 38/200 (19%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN--NVVQ 220 S +D++ +LD + SM+ + +S I + + ++ +N + ++ Sbjct: 47 GSGSSRGKCIDLVFILDCTGSMQKYINS-------VRDHIIGICDMIRGEEGLNGPDDLR 99 Query: 221 SGLVTF-------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +V + S + +F + +Q +K L+ G + + A Sbjct: 100 VAVVNYRDHPPQDSTYVYKFHPFTSDIPDVQNYLKGLTASGGG-DGPEAVTAAMAATLTE 158 Query: 274 QGMRQHCNTEDANYKKIIVFMTD------GEN-----LSTKEDQQSLYYCNEAKKRGAIV 322 R+ ++ V + D GE + L + G + Sbjct: 159 LEWRREAA-------RMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVIARMMAQNGITM 211 Query: 323 YAIGIRVIRSHEFLRACASP 342 + +R S L +P Sbjct: 212 F---MRRGSSSVSLYNLETP 228 >gi|189465624|ref|ZP_03014409.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224539998|ref|ZP_03680537.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] gi|189437898|gb|EDV06883.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224518388|gb|EEF87493.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] Length = 348 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 66/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D ++++ A + + ++++++ N+ V Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLEKAKRLVAQLVDKME-----NDKV- 131 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ ++ T + A +G+ Sbjct: 132 -GMIVFAGDAFTQLPITSDYISAKMFLESINPSLISKQGTAIGAAINLATRSFTPQEGVG 190 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++ +TDGEN + + + V +G+ + Sbjct: 191 -----------RAVIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGMPDG 230 >gi|224456528|ref|ZP_03665001.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis MA00-2987] gi|254370859|ref|ZP_04986864.1| hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874285|ref|ZP_05246995.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|151569102|gb|EDN34756.1| hypothetical protein FTBG_00674 [Francisella tularensis subsp. tularensis FSC033] gi|254840284|gb|EET18720.1| TPR repeat domain-containing protein [Francisella tularensis subsp. tularensis MA00-2987] gi|282158590|gb|ADA77981.1| hypothetical protein NE061598_01655 [Francisella tularensis subsp. tularensis NE061598] Length = 332 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVINSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG 226 >gi|317486088|ref|ZP_07944938.1| hypothetical protein HMPREF0179_02293 [Bilophila wadsworthia 3_1_6] gi|316922661|gb|EFV43897.1| hypothetical protein HMPREF0179_02293 [Bilophila wadsworthia 3_1_6] Length = 709 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 40/312 (12%), Positives = 89/312 (28%), Gaps = 33/312 (10%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDIDDIV 111 D + + A ++ + L D + L Sbjct: 146 ATDTTAWNQALTLLFTDRSQRQPVLFFKDHKAIMDVCQAEDLPGSLSKLRAEAKAAQQGD 205 Query: 112 RSTSLDIV-VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 L I+ V P + ++S Y +P+ T + S ++A Sbjct: 206 APADLPILAVEPDDAQGAVSRNRFYLMPILKVDTPFEGTKLLEVASIDPGSAGDKPDSNA 265 Query: 171 RLD----------MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 D + +V+D + SM+ + ID ++ + A+ + V +++ V Sbjct: 266 PKDAKDDGVPRTGIAMVIDTTISMKPY-------IDQSLNVVRAIFDSVAK-DKLDDKVS 317 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G+V F N + LE+ + + + + GL+ A Sbjct: 318 FGIVAFRNSTKASPKLEYVTKVVSDFRDATKRDEL-EKALSGLEEAKASSHSFNEDSLAG 376 Query: 281 NTEDAN-------YKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + ++++ +TD + + + A R + +R Sbjct: 377 VKAALDQLNWQPYANRVLLLITDAGPLPLSDANASTSLDVQELADLAASRNIRLVVAHVR 436 Query: 329 VIRSHEFLRACA 340 + A Sbjct: 437 TPAGKGNIDYAA 448 >gi|317123666|ref|YP_004097778.1| type II secretion system F domain [Intrasporangium calvum DSM 43043] gi|315587754|gb|ADU47051.1| Type II secretion system F domain [Intrasporangium calvum DSM 43043] Length = 652 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 38/200 (19%), Positives = 66/200 (33%), Gaps = 31/200 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M+V+D S SM + + A L+E V G+VTF+N Sbjct: 87 MLVIDTSGSMG------TDGMATVRAATRAYLKEAP------EDVLIGVVTFANTAGVDL 134 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + QR + L G + ++ D + + +V + Sbjct: 135 KPTVDRAAAQRVVNGLDARGDT------------SLYAAVRSAARAMPGDGD--RSMVLL 180 Query: 295 TDGENLSTKEDQQSLYYCN-EAKKRGAIVYAIGIRVIRSHEFLR----ACASPNSFYLVE 349 +DG + + + Q L N E K+RG V + + A AS S Sbjct: 181 SDGADTVSDDRQGDLAEANRELKRRGVRVDVVRFNTDDPDAVVALRSFASASGGSVIPAT 240 Query: 350 NPHSMYDAFSHIGKDIVTKR 369 N + AF + + ++ Sbjct: 241 NASDVGAAFKSAARALRSQA 260 >gi|156119625|ref|NP_002206.2| inter-alpha-trypsin inhibitor heavy chain H1 isoform a [Homo sapiens] gi|2851501|sp|P19827|ITIH1_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; AltName: Full=Inter-alpha-trypsin inhibitor complex component III; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|825630|emb|CAA49279.1| inter-alpha-trypsin inhibitor heavy chain H1 [Homo sapiens] gi|62550746|gb|AAH69464.1| Inter-alpha (globulin) inhibitor H1 [Homo sapiens] gi|119585663|gb|EAW65259.1| inter-alpha (globulin) inhibitor H1, isoform CRA_a [Homo sapiens] gi|158258937|dbj|BAF85439.1| unnamed protein product [Homo sapiens] Length = 911 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|229588184|ref|YP_002870303.1| hypothetical protein PFLU0636 [Pseudomonas fluorescens SBW25] gi|229360050|emb|CAY46904.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 651 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 25/54 (46%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 +G + ++ A L + + + ++++ ++ K L S+ D S + AA Sbjct: 3 PRLRSRQRGAIGLMAAGTLAVALVFMLLVVDSGRLYMEKRKLQSIADTSALEAA 56 >gi|189054356|dbj|BAG36876.1| unnamed protein product [Homo sapiens] Length = 911 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|260061453|ref|YP_003194533.1| hypothetical protein RB2501_07630 [Robiginitalea biformata HTCC2501] gi|88785585|gb|EAR16754.1| hypothetical protein RB2501_07630 [Robiginitalea biformata HTCC2501] Length = 288 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 68/162 (41%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++D+S S FF + + + + + + N + GL+ F Sbjct: 72 EEERELTLMLLVDISGS--EFFGT----VGQFKREVVTEVSATLAFSALQNNDKVGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+++E + + G SH+ R I+ L +F T+ + LK+ N I Sbjct: 126 SDEVELYIPPKKGKSHVLRIIRELLEFEPKSHRTDLSEALKFLSNVIK------------ 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D ++ +++L G +Y Sbjct: 174 ----KKAIVFVLSD---FMAEDYEKTLRVTGNKHDLTGIRIY 208 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 59/157 (37%), Gaps = 31/157 (19%) Query: 28 PIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK 87 + + +G + + ++ + S +D +L+ A QI N G+ + K +++ Sbjct: 1 MPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTDALKLKVTDWFHAQVE 60 Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW 147 N++ + + ++I+A + +P F Sbjct: 61 NSYTLGE-------------------------IDIDTTNHNITATASGTVPTTFMKIA-- 93 Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 N + + + S+VK + L++ IV+D S SM Sbjct: 94 --NIDTVPVSVASAVK--GPATSYLNVYIVIDTSPSM 126 >gi|114587326|ref|XP_001172455.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 4 [Pan troglodytes] Length = 911 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDRDATQQLQGF 471 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 26/55 (47%), Gaps = 1/55 (1%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +G ++ A+FL +I + G +++V + + + L + D + ++ N Sbjct: 22 RRERGSFAVMAAVFLVVIAAIFG-VLDVGNTYLQRRDLQQIADMAAAAGVQRVDN 75 >gi|189423337|ref|YP_001950514.1| von Willebrand factor A [Geobacter lovleyi SZ] gi|189419596|gb|ACD93994.1| von Willebrand factor type A [Geobacter lovleyi SZ] Length = 572 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 63/154 (40%), Gaps = 24/154 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +MI++D S SM+ KI++A K+ + E + IP V + V S+ + Sbjct: 408 IMILVDRSGSMQH------QKIEVASKTAYVVAEALDSIPGCFAAVAAFPVGNSDGV--A 459 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L+ +G K G T L +A ++ + R KI++ Sbjct: 460 PLVRFGERPCSSKFG--MSAGGGTPLAQALYWAGVELLKREEPR-----------KILLS 506 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +TDGE + ++++ + ++G +GI Sbjct: 507 VTDGEPDDLRTTKRAIRW---LLEQGIEPMGLGI 537 >gi|320159949|ref|YP_004173173.1| putative magnesium-chelatase [Anaerolinea thermophila UNI-1] gi|319993802|dbj|BAJ62573.1| putative magnesium-chelatase [Anaerolinea thermophila UNI-1] Length = 291 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 53/175 (30%), Gaps = 17/175 (9%) Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME 185 + A R P + N + + ++ A ++ ++D S SM Sbjct: 51 DLAFDATLRTAAPFQRQREELKKRNRVAFAVRPSDYMRKVRVRRAANLVLFLVDASWSM- 109 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR 245 + ++ +I ++L + GL+TF L L Sbjct: 110 ----AVAERMQATKGAILSLLN-----DAYQRRDRVGLITFQRDRATLILPPTNSVQLAE 160 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + G T + GL AY + + H + ++ +TDG Sbjct: 161 QALRDIPVGGKTPLSAGLDMAYGIL--KREKMLHPDVLPL-----LIVLTDGAGN 208 >gi|284053419|ref|ZP_06383629.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 250 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 70/193 (36%), Gaps = 17/193 (8%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 W ++ M + +V+ + R +++LD S SM+ +D + Sbjct: 22 WMVRVKNQNMRLEEAVEFAENPEPRCPCVLLLDTSASMQG------EPLDGLNVGLMTFR 75 Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 E + V+ ++TF N+++ +++ V+ + + L+ G T + A Sbjct: 76 ENLIKDELAKKRVEIAVITFDNQVK---IIQDFVTADRFEPPMLTAQGQ-TYMGTAIGEA 131 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKE---DQQSLYYCNEAKKRGAIV 322 + I + Y + VFM TDGE + +Q +E + Sbjct: 132 LDMIASR---KAEYRNNGITYYRPWVFMITDGEPQGESDRITEQAIKRIRDEEANKQVAF 188 Query: 323 YAIGIRVIRSHEF 335 +A+G+ Sbjct: 189 FAVGVEGANMERL 201 >gi|110801927|ref|YP_697812.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens SM101] gi|110682428|gb|ABG85798.1| von Willebrand factor type A domain protein [Clostridium perfringens SM101] Length = 622 Score = 40.1 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 68/181 (37%), Gaps = 28/181 (15%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQEPDKDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKASMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-----GVSHLQRKIK---YL 250 + + + IP+ + +V F ++EE F + IK Y Sbjct: 114 VDKV------IDNIPNS----RIAVVAFGTEVEEVFSFNNKNKFTSKEEYKNAIKDSYYY 163 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 +TN + A + ++ ++N KK ++F +DG + + S+ Sbjct: 164 ITRKGNTNIEGTWRVA-------DEIFKNELNNNSNSKKDVIFFSDGYPNISVDYLYSIG 216 Query: 311 Y 311 Y Sbjct: 217 Y 217 >gi|313229403|emb|CBY23990.1| unnamed protein product [Oikopleura dioica] Length = 625 Score = 40.1 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 26/241 (10%), Positives = 71/241 (29%), Gaps = 22/241 (9%) Query: 146 PWYTNSRHIVMPITSSVKV--NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 P + T + + ++D++ ++D S S+ S + K + Sbjct: 333 PDCGGGNSVNSAGTGGGISIKALECEEQVDVVFMVDGSDSVSSRDWPKVLKW------VT 386 Query: 204 AMLEEVKLIPDVNNVV----------QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 +++++ + +G + + V + + + Sbjct: 387 NLVDQISPADREKSSTVVFQDFSMNPTTGAIPKEITGRFDPGDQGSVDDFKAAVLAAKQS 446 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 TN+ L + D + ++ + + + ++DGE+ + + Sbjct: 447 AQGTNTYQTLSI---LLKDDGIFGRLPTVQNKDGATVFIILSDGESRDRAKYYD-KKTFD 502 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 K + A+G+ E + ++ + +A I I D Sbjct: 503 YFKDITRMRIAVGVADADKKELMDFAQDDEHLLHYDDFGQLMEAGQEIVDLIQAGCKADD 562 Query: 374 K 374 K Sbjct: 563 K 563 >gi|50085105|ref|YP_046615.1| putative tellurium resistance protein (TerY-like) [Acinetobacter sp. ADP1] gi|49531081|emb|CAG68793.1| putative tellurium resistance protein (TerY-like) [Acinetobacter sp. ADP1] Length = 211 Score = 40.1 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 57/172 (33%), Gaps = 24/172 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + I+LD S SM I + +ML ++ P V ++TF + Sbjct: 3 RLPVYILLDTSGSMRG------EPIHSVNVGLQSMLSALRQDPYALESVHLSIITFDLEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIFDMQGMRQHCNTEDANYK 288 + + L + + + S + A + + + QH K Sbjct: 57 KVYLPLT--------PLDQVQLANIDVPSAGATFMGAALELL--AEQVAQHLQKSTDEVK 106 Query: 289 ---KIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + ++F MTDG QQ++ + A + A E L Sbjct: 107 GDWRPLLFVMTDGSPSDVYAYQQAIPVIQQL--NFASIVACAAGPKAKQEHL 156 >gi|330802397|ref|XP_003289204.1| hypothetical protein DICPUDRAFT_79965 [Dictyostelium purpureum] gi|325080732|gb|EGC34275.1| hypothetical protein DICPUDRAFT_79965 [Dictyostelium purpureum] Length = 810 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 26/138 (18%), Positives = 49/138 (35%), Gaps = 11/138 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM + +T +D +I ++ + N + L+T Sbjct: 3 ITFVVDTSGSMSQKTTNGMTLLDCCKAAIEHFIKIRSKDASMRND-RFFLITSEENSLTA 61 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPG--LKYAYNQIF------DMQGMRQHCNTEDA 285 + W + I+ + S G L+ A++ + + Q N Sbjct: 62 VKIGW-KDNFNTFIQEVKNLVCKDMSNLGFSLQKAFDNLNLFRIQSSIDNYGQGRNPWFI 120 Query: 286 NYKKIIVFMTDGENLSTK 303 IIV +TDG +L+ Sbjct: 121 EPA-IIVLLTDGSSLTNN 137 >gi|260797467|ref|XP_002593724.1| hypothetical protein BRAFLDRAFT_199486 [Branchiostoma floridae] gi|229278952|gb|EEN49735.1| hypothetical protein BRAFLDRAFT_199486 [Branchiostoma floridae] Length = 105 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 36/123 (29%), Gaps = 21/123 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ VLD S S+ + + + + V + G+V +S+ Sbjct: 2 DLFFVLDGSGSVGP------DNFETVKQFVVNV---VSAFTISLTDTRVGVVQYSHFNTL 52 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L I + T + + +A RQ N DA ++ Sbjct: 53 ACNLGDHPDEASFVTAINTMQHQDGGTTTGDAMDFA----------RQEANWRDAPTPRV 102 Query: 291 IVF 293 + Sbjct: 103 SIL 105 >gi|183230381|ref|XP_648239.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|169802953|gb|EAL42853.2| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 703 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 70/221 (31%), Gaps = 40/221 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + SS +N + + ++++ + D S SM I + L ++ +N+ Sbjct: 198 VISSTFINKEKEGNINIVFICDRSGSMYGEG------IKALRNMLQLFLRQL----PLNS 247 Query: 218 VVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q ++F + + F E + + TN + + D Sbjct: 248 KFQI--ISFGSTYDFMFKEMTEYNEDTLKFASETVSQFEANYGGTN----MHAPLKALID 301 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIR 331 + H I+ +TDG ++ Y KK +G+ Sbjct: 302 NNTEKCH-----------IILLTDG---YVNNKTSTIEYIRTLSKKNSLH--GVGLGRSC 345 Query: 332 SHEFLRACASPNSFYLV--ENPHSMYDAFSHIGKDIVTKRI 370 E +R + +NP+ + S I + I+ I Sbjct: 346 DVELIRNIGRIGNGISAISKNPNVLKKEVSKITERILIPSI 386 >gi|197118194|ref|YP_002138621.1| hypothetical protein Gbem_1809 [Geobacter bemidjiensis Bem] gi|197087554|gb|ACH38825.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 294 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 23/97 (23%), Positives = 37/97 (38%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L++M+++D S S+ A + L K + GL+T S+++E Sbjct: 78 LNLMLLVDRSASLACPTPRGAKSRLAAEAAALLALAAAKSND------RVGLITCSDRVE 131 Query: 232 EFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKY 265 F G QR + LS G T+ L Y Sbjct: 132 SFIPPAKGARQAQRIVASLSCNASSGGGTDLAAALDY 168 >gi|124006282|ref|ZP_01691117.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123988206|gb|EAY27864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 224 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 26/130 (20%), Positives = 52/130 (40%), Gaps = 12/130 (9%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLDVS SM I + + ++V + ++ ++TFS ++ Sbjct: 21 ILVLDVSGSMIGNP------IQQLNEGLQRFKQQVMNDEIASQRLEICIITFSGRVACIQ 74 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 S + K G ST GL+ A ++ + + T Y+ ++ + Sbjct: 75 EP----SLIHNFEMPTLKAGGSTALVDGLRRAILKVTTRKN--WYKQTGQPYYRPFVIMI 128 Query: 295 TDGENLSTKE 304 TDGE + ++ Sbjct: 129 TDGEPDADQD 138 >gi|222055527|ref|YP_002537889.1| von Willebrand factor A [Geobacter sp. FRC-32] gi|221564816|gb|ACM20788.1| von Willebrand factor type A [Geobacter sp. FRC-32] Length = 569 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 61/188 (32%), Gaps = 29/188 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++++D + SM + TK ++ +++ L N + ++ + + Sbjct: 403 DIVLLVDCTGSMAE--PNKWTKTEVIYQTLFMALHA------YNRNARI--FAYNERNDT 452 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + E + + + G T S + I ++Q I+ Sbjct: 453 CRITE---IYKNGQFYSVMPHGR-TASGEAI------IATALNLKQSSRKP------FII 496 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPH 352 +TDG ++ YC K R + IG+ ++ A V + Sbjct: 497 HLTDGAANWGCGVSDAIRYC---KARNVRLLTIGLECDPMNKTALAEEHGEMIQFVNDVD 553 Query: 353 SMYDAFSH 360 + F Sbjct: 554 QLPTVFKQ 561 >gi|320103720|ref|YP_004179311.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] gi|319751002|gb|ADV62762.1| hypothetical protein Isop_2184 [Isosphaera pallida ATCC 43644] Length = 738 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 42/344 (12%), Positives = 101/344 (29%), Gaps = 50/344 (14%) Query: 17 GGMTILTAIFLPIIFLVLGM--IIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG--NGN 72 G M + + + ++ L I++S + V + + E Sbjct: 138 GSMAVHAGVLVILLLLPGAAEEAIDLSKRL-----------NAAVDTSVASVEEPLKIMA 186 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 + +++ + N + + +T +PQ +SA Sbjct: 187 DPSQVRSEYLTDPNANP-----VGGVLNPSPSATPVVAAAATVSSKSAMPQ--AVELSAS 239 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ-------------TDARLDMMIVLD 179 + + I +T +V Q +L ++ + D Sbjct: 240 IASPLASMPSMPAVDFAGGGGIAGDVTQAVDNVDQALDQLVREILRELQRTKLTVVWMFD 299 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM+ + + + +++ + + +V F +I Sbjct: 300 ESNSMKDDQQAIRDRFERISSE---LIDNTSSEQRRSAALLHAVVGFGQQIHFDLEPTPD 356 Query: 240 VSHLQRKIKYLSKFGVST-NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 V+ ++ I L T N+ ++ ++ ++ +IV +TD Sbjct: 357 VTRIKEAITNLRIDMTGTENTMQSVQAVIARLGSKYITKERRM--------LIVLVTDES 408 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 ++++ C AK VY IG + + + L + Sbjct: 409 GDDGARVEETIQACLAAK---VSVYVIGRQSLFGTDKLHIAYTD 449 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 51/392 (13%), Positives = 128/392 (32%), Gaps = 38/392 (9%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNR 74 G I++A+ L ++ ++ + + + ++ +L S ID + +H ++ G R Sbjct: 18 TNGSFLIVSAVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYA-LHNNPNELSVGTIKQR 76 Query: 75 KKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI----VVVPQNEGYSIS 130 + L I + + + + E + VN + S + + ++ + Sbjct: 77 EMLIKKRIGYFLDSNYKGTLLTEEQIKLIVNQSTVSITERSFYPQQFHINIELHKNIQLK 136 Query: 131 AISRYKI--PLKFCTFIPWYTNSRH-----IVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 ++ + P K ++ +V+P T + + + + D+ S Sbjct: 137 SLILHMAMNPKKDFNISQRKSSLYKKNVALMVVPFTWTGEWIPPSLFTTQFTVSQDLLPS 196 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSNKIEEFFLLEWGV 240 K + K + I + N + + ++ + L + Sbjct: 197 --DLKTEHFKKTEYFNKRNQFFKMFLSKIKENNLCIAPYHYSAIVYWSEGIFSYKLPFST 254 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY------KKIIVFM 294 + L K + Y + + D + KK ++ + Sbjct: 255 TFLYSFRDIYVKQYSTIWDMKPSNYILDLFAGAELHSNRLTPADPCFRRGVIQKKFMLII 314 Query: 295 TDGENLSTKEDQ----QSLYYCNEA-KKRG------AIVYAIGIRV-IRSHEFLRACAS- 341 G +S +++ + + C K G VY++GI + L C Sbjct: 315 AAGNQISDRKNSAEYFKMKHGCTLMGKNMGKNPQEEITVYSLGISPDPDTKRDLIQCTRH 374 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 P+ +Y +++ + + ++I +W D Sbjct: 375 PDRYYEIQSYKDIAPVIDRLERNIS--SVWND 404 >gi|239941198|ref|ZP_04693135.1| hypothetical protein SrosN15_09409 [Streptomyces roseosporus NRRL 15998] gi|239987673|ref|ZP_04708337.1| hypothetical protein SrosN1_10243 [Streptomyces roseosporus NRRL 11379] gi|291444639|ref|ZP_06584029.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] gi|291347586|gb|EFE74490.1| conserved hypothetical protein [Streptomyces roseosporus NRRL 15998] Length = 453 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 22/153 (14%), Positives = 48/153 (31%), Gaps = 30/153 (19%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 S + +++++D S SM D TK+ A + I + + Sbjct: 54 SSAPGAGQGPGAAVVLMVDCSGSM----DYPPTKMRNARDA------TAAAIDTLREGTR 103 Query: 221 SGLVTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +V ++ ++ + + + ++ LS G T L+ A + Sbjct: 104 FAVVAGTHVAKDVYPGNGRLAVADARTKAQAKEALRKLSA-GGGTAIGTWLRLADRLLNS 162 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 +H + +TDG N + Sbjct: 163 ADVAIRHG-----------ILLTDGRNEHESPE 184 >gi|260782835|ref|XP_002586486.1| hypothetical protein BRAFLDRAFT_75124 [Branchiostoma floridae] gi|229271600|gb|EEN42497.1| hypothetical protein BRAFLDRAFT_75124 [Branchiostoma floridae] Length = 224 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 41/122 (33%), Gaps = 17/122 (13%) Query: 213 PDVNNVVQSGLVTFSNKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 +V+ G++ FS K L G + L I+ L + G T++ ++Y Sbjct: 63 EEVHGSFTVGVIQFSEKANATIPLGTYSAGDAGLTDDIESLKRTGGLTHTGHAIRYMKET 122 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 + + + + +TDG + + + G +Y++G Sbjct: 123 -----------SKFRDDVPRTAIVITDGRAQTMDDYAAQAEAARD---AGITLYSVGNGF 168 Query: 330 IR 331 Sbjct: 169 PS 170 >gi|307154849|ref|YP_003890233.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306985077|gb|ADN16958.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 251 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 71/211 (33%), Gaps = 18/211 (8%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKID-MAIKSINAMLEEVK 210 ++ P S++ + L + D S SME D I ++ +SI M + Sbjct: 2 NYVERPRKSALPGGALATRPLHFFWICDCSGSME--IDGKIEALNYAIRESIPEMRKVAN 59 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 P+ +V+ V FS+ + W V+ + F S + G+ + Sbjct: 60 ENPNAQLLVR--AVKFSDGAQ------WHVADATP----IDNFTWSDLTAQGMTDMGKAL 107 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + D ++V ++DG+ ++ ++ R A+ AI I Sbjct: 108 SLVAEQLKMPPMTDRALPPVLVLISDGQPTDD-FNRGLKDLLDQPWGRKAVRIAISIGRD 166 Query: 331 RSHEFLRACASPNSF--YLVENPHSMYDAFS 359 E L S F N ++ Sbjct: 167 ADKEVLEKFISHPEFQPLEANNAAALIKFIK 197 >gi|50119743|ref|YP_048910.1| hypothetical protein ECA0798 [Pectobacterium atrosepticum SCRI1043] gi|49610269|emb|CAG73712.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 543 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 61/223 (27%), Gaps = 26/223 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + KG T A+ + + I++ S T + D + + Q G Sbjct: 28 FVHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATING- 86 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 +N L N +++ + + + Y+++ Sbjct: 87 --------EEYSQEDTNKLAYEYVKNNLGMNKALSEKLVASDVSVTEGRNSATRKTYTVT 138 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 A K L S V+ + + +V+ V+ M Sbjct: 139 AAFETKPSLLSLGAKKQEVYSTSEVINRPTEI------------ALVMPVTGDMSE---G 183 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I + +S + + + LV +S + + Sbjct: 184 DIRSLKSVSRSFVE--RMLSSADGKRDNLWLSLVPYSQSVNVY 224 >gi|310800880|gb|EFQ35773.1| vault protein inter-alpha-trypsin [Glomerella graminicola M1.001] Length = 1046 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 59/176 (33%), Gaps = 24/176 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ + D S SM+ KID ++ L+ + + + N+ G ++S ++ Sbjct: 300 IVFICDRSGSMQGN------KIDDLKNALKVFLKSL-PVGAMFNICSFG-SSYSFLFDKS 351 Query: 234 FLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + T ++ + + + Sbjct: 352 MTYDKNTLETAMKHVDTFDANYGGTQFHGPIQKTFE-------------LRHTDLDLEVF 398 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDGE + E ++ + K +G ++ +G+ SH + A + + Sbjct: 399 VLTDGEIWNE-ERLFTMVNDHVQKSQGAIRLFTLGVGRDVSHSLIEGLARAGNGFS 453 >gi|114587324|ref|XP_001172464.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H1 isoform 5 [Pan troglodytes] Length = 911 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDRDATQQLQGF 471 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 32/147 (21%), Positives = 59/147 (40%), Gaps = 33/147 (22%) Query: 174 MMIVLDVSRSMESFF-----------------DSSITKIDMAIKSINAMLEEVKLIPDVN 216 + V D S SM D+ ++++ K N +++++K I +V+ Sbjct: 199 VSFVFDTSGSMAYGLRNEGKRNSQGKWGPLDADNPRARMNILKKKANLLVDDLKEIGNVS 258 Query: 217 -NVVQ-SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N+V+ SG ++ E+F L+ ++ KIK L + TN GL+Y + Sbjct: 259 VNLVRFSGSASY--IQEDFVELDKDTGKIKEKIKSLPTSWI-TNPGDGLRYGLVSLQRNP 315 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLS 301 K +V +TDG + Sbjct: 316 AQL-----------KYVVLLTDGIPNA 331 >gi|119593589|gb|EAW73183.1| hCG25234 [Homo sapiens] Length = 195 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 62/187 (33%), Gaps = 35/187 (18%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I +P + + S+ + +VLD S SM + + +++ A E+ LI Sbjct: 26 INLPRPTFSLLKSKQRV---VCLVLDKSGSM--NAEDRLFRMNQAA--------ELYLIQ 72 Query: 214 DVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + GLVTF + + L + + T+ GLK + Sbjct: 73 IIEKGSLVGLVTFDSFAKIQSKLIKIIDDNTYQKITA-NLPQEADGGTSICRGLKAGFQA 131 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIR 328 I I+ +TDGE + C E K+ G +++ I + Sbjct: 132 IPQSNQSTFGSE---------IILLTDGE-------DYQISLCFGEVKQSGTVIHTIALG 175 Query: 329 VIRSHEF 335 E Sbjct: 176 PSADEEL 182 >gi|315605763|ref|ZP_07880795.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] gi|315312461|gb|EFU60546.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] Length = 344 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 17/149 (11%), Positives = 53/149 (35%), Gaps = 11/149 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLI 212 ++ S + + L++ V+D + SM +D + +ID + ++ + Sbjct: 50 VMGAGPSIPGEAEEVTSSLEVYFVIDRTGSMAAEDWDGTKPRIDGVRNDVAILMSVLTGS 109 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + +V++ + L S +Q + + S++ + + Sbjct: 110 -------RFSIVSWDSNAHTDMPLTTDASAVQAYMDSFDRELSSSSQGSSVNRPAKDLAA 162 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLS 301 + + + ++ + + +DGE + Sbjct: 163 LLAKNKERHPQNL---RAVFVFSDGETSN 188 >gi|313226492|emb|CBY21637.1| unnamed protein product [Oikopleura dioica] Length = 1234 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 51/172 (29%), Gaps = 24/172 (13%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +++ LD S SM ++ M ++ + V G+++F+ Sbjct: 296 RPPRRIVLCLDASTSMLEG-----NRLSMVRNAVRQFIYTVSSG------TYVGIISFNR 344 Query: 229 KIEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 L E + L ++ T+ + N + Sbjct: 345 MSTRRQDLIEIIDEDERAFLVSRLPKDESVASGTSIGKAILRGINMLQQSGPEYLRTMDG 404 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 I+ ++DG +T ++++ K G V I + ++ Sbjct: 405 GE-----ILLLSDGLEWNTPSVEETMEL---VKLSGVRVNTISLGNSDTNVL 448 >gi|301166645|emb|CBW26221.1| putative membrane protein [Bacteriovorax marinus SJ] Length = 287 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 60/165 (36%), Gaps = 29/165 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV---QSGLVTFSNKI 230 ++ ++DVS SM++ + K+ + ++ + ++ + + G +S Sbjct: 117 IVFLIDVSGSMKT-----LDKMGQVKAGLKMLITSMPSDYQIDVIHFPGKRGARYYSLWS 171 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L E + R + L+ G T + LKYA + D+ Sbjct: 172 YTQKLGERQKKDVYRFLNRLNPKGA-TPTRSALKYALTKYPDLTD--------------- 215 Query: 291 IVFMTDGENL--STKEDQQSLYYCNEAKK---RGAIVYAIGIRVI 330 +V ++DG ++ E +E KK + + IG+ Sbjct: 216 VVLLSDGAPTKMNSSEYDDIKDILSEVKKDNFKNIQINTIGVGAA 260 >gi|297527228|ref|YP_003669252.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256144|gb|ADI32353.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 313 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 53/123 (43%), Gaps = 20/123 (16%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++++D S+SM+ ++IT+ ++++N + + + + ++ G + KI Sbjct: 98 VLLIDESKSMQYTDGTNITR---FVRALNFTYKYMNYLSPKDKLLIIGFSEYPRKICLG- 153 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + K+ L T+ + L YAY+ + Q Y IIV + Sbjct: 154 ----NTTFCRPKLLELEPNKKYTDLSAALAYAYSYVTASQ------------YPAIIVIV 197 Query: 295 TDG 297 +DG Sbjct: 198 SDG 200 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 74/202 (36%), Gaps = 16/202 (7%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 120 SKAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYL---DKIPDNANVM---LRVY 172 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +K E L G S + ++ +K + + + + E Sbjct: 173 GHKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKE 232 Query: 284 --DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACA 340 ++ ++DGE + + N++ A+V IG V S + L+ A Sbjct: 233 YTGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-NTHAVVNIIGFDVKNSEQQQLKYTA 291 Query: 341 SP--NSFYLVENPHSMYDAFSH 360 ++ V + ++ + Sbjct: 292 EAGKGNYATVSSADELHQTLNK 313 >gi|157377271|ref|YP_001475871.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] gi|157319645|gb|ABV38743.1| putative outer membrane adhesin like proteiin [Shewanella sediminis HAW-EB3] Length = 4122 Score = 40.1 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 66/198 (33%), Gaps = 16/198 (8%) Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW-YTNSRHIVMPITSSVKVNSQTDARL 172 T D + + + T + + + ++ + + + Sbjct: 3522 TISDPSGLTDTATVTFDVVPSRGYGQNTGTDGNNDIQGTSNHDVIVSDTTGLQIVSGENY 3581 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ +LD S SM ++++T + + L+ + VV LV F+N + Sbjct: 3582 NIAFILDSSGSMG---NTNVTSAKEQLLEVFNTLK-ASATSVHSGVVNVLLVDFNNGTKA 3637 Query: 233 FFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +S+L + ++ G TN + N + G + Y Sbjct: 3638 NVSVNLADNNAISNLTDVLSTIASDGGRTNYESAFETTTN--WFNSGTASENTGTNLTY- 3694 Query: 289 KIIVFMTDGENLSTKEDQ 306 F+TDGE + DQ Sbjct: 3695 ----FITDGEPNNYNVDQ 3708 >gi|183985704|gb|AAI66222.1| LOC100158554 protein [Xenopus (Silurana) tropicalis] Length = 899 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 69/201 (34%), Gaps = 44/201 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLD+S SM ++ +I ++ + ++ V G+VTFSN E Sbjct: 309 VSLVLDISGSM-----TNANRITRLYQASEVYIMQI-----VEQGAYVGIVTFSNVAEIK 358 Query: 234 FLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L QR+ L TN G++ ++ Sbjct: 359 SQLVKITDTFQRESLKLKLPTVATGGTNICAGVQQGLQVNRNLDQSTHGTE--------- 409 Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF---------LRACA 340 IV +TDGE+ C + K GAI++ I + L+ A Sbjct: 410 IVLLTDGEDSGISS-------CFPDITKSGAIIHTIALGNNADPGLEKLADLTGGLKLYA 462 Query: 341 SPNSFYLVENPHSMYDAFSHI 361 S + + + D+FS I Sbjct: 463 SDK-----VDVNGLIDSFSGI 478 >gi|153873139|ref|ZP_02001819.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152070396|gb|EDN68182.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 367 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 70/220 (31%), Gaps = 29/220 (13%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 D+ V+D++ SM + + I K + + + V+ V GL+ + Sbjct: 51 QQLGFDIKFVMDMTGSMGPYIKGTKQVITDLAKDLASTYQAVE--------VNYGLIGYR 102 Query: 228 NKIE----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + ++ + F+++ L + + L K T L Y + + Sbjct: 103 DDLKYEPHQGFVVKNFTPQLVK-VDQLIKVLAQAKPTKNLSGDYQEEVFAGIKEAITSNW 161 Query: 284 DANYKKIIVFMTDGENLS-----TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH----- 333 N +K I+ + D + + A + +I + V R Sbjct: 162 RNNSRKFIIVIGDASSHDLSHVQNTSGMNAEQIRELANTNEINILSIYLTVPRYQQDNII 221 Query: 334 ------EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 + S + ++ S + F I K+IV Sbjct: 222 AEPQFSQLATNPGSESLPLYMDVAASNHAEFKAIVKNIVD 261 >gi|242373708|ref|ZP_04819282.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W1] gi|242348676|gb|EES40278.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W1] Length = 629 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 39/310 (12%), Positives = 99/310 (31%), Gaps = 58/310 (18%) Query: 39 EVSHIFFMKTV-LHSMIDR--SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFR 95 +++ + K S +D + Q N K++ + ++ N Sbjct: 313 DMTDMMTKKGKGSQSTLDHDEGGLIGQNQAFALEGINKNVKIEWKVPDIQPQDIIN---- 368 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA------ISRYKIPLKFCTFIPWYT 149 + N +I D+++ I ++E +++ + + I +F F Sbjct: 369 YQSSKNDVQYEIKDLIQIIKKTIDREHEDERRNLTKGRLQKDLINWFIDDQFKLFYKKQD 428 Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 S+ T +++D S SM K+D IK + E + Sbjct: 429 LSKTFDATFT----------------LLVDASASMHD-------KMDETIKGVVLFHETL 465 Query: 210 KLIPDVNNVVQSGLVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPG 262 K + + ++ F E ++ + S +++ + Sbjct: 466 KTLNIKHEILAFNEDAFEADDREQPNIIDEIINYNYSIFEKEGPRIMSLEPQDDNRDGVA 525 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKR 318 ++ A ++ + ++ ++ ++ +DGE + Q + A+K Sbjct: 526 IRVASERLL-----------QRSHQQRFLIVFSDGEPSAFNYSQDGIIDTYEAVETARKF 574 Query: 319 GAIVYAIGIR 328 G V+ + + Sbjct: 575 GIEVFNVFLS 584 >gi|269125560|ref|YP_003298930.1| cobaltochelatase subunit [Thermomonospora curvata DSM 43183] gi|268310518|gb|ACY96892.1| cobaltochelatase subunit [Thermomonospora curvata DSM 43183] Length = 657 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 55/139 (39%), Gaps = 19/139 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEE 232 ++ V+D S SM ++ ++ ++ ++L + + GL+TF + E Sbjct: 482 VLFVVDASGSM-----AARRRMGAVKGAVLSLL-----LDAYQRRDKVGLITFRGRGAEL 531 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V R+++ L G + + L+ A ++ ++ +R ++V Sbjct: 532 ALPPTSSVEAGARRLERLPTGGRTPLAAGLLRAA--EVLRVERLRDPARRP------LLV 583 Query: 293 FMTDGENLSTKEDQQSLYY 311 +TDG + + Q+ Sbjct: 584 IVTDGRATTGPDPAQAAAL 602 >gi|227831991|ref|YP_002833698.1| hypothetical protein cauri_0161 [Corynebacterium aurimucosum ATCC 700975] gi|262183247|ref|ZP_06042668.1| hypothetical protein CaurA7_04584 [Corynebacterium aurimucosum ATCC 700975] gi|227453007|gb|ACP31760.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 951 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 36/230 (15%), Positives = 76/230 (33%), Gaps = 47/230 (20%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAI-KSINAMLEEVK-LIPDVNNVVQSGLVT 225 ++ D++ VLD S S+ + S T D + +++++ L D + L Sbjct: 64 SEKSADLLFVLDQSASLVGYEGSQPTDADAFRVDATADIVKQLAHLGEDNGADINVKLAG 123 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYL-----------SKFGVSTNSTPGLKYAYNQIFDMQ 274 F + G +++ L + TN K A ++ Q Sbjct: 124 FGSDYYSDPAEYGGWTNVSGNADALQGELDKFRKEDRVADLETNYGAAYKGALKELASHQ 183 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA----------KKRGAIVYA 324 G + IVF TDG+ + E ++ + ++ Sbjct: 184 GSSC----------RGIVFFTDGKYYTDAEANDLQGAQDKLCEVNSSVTAFRNSNIRLFN 233 Query: 325 IG-IRVIRSHEFLRAC------------ASPNSFYLV-ENPHSMYDAFSH 360 +G + + + +R A +F+ E+P +++ AF + Sbjct: 234 VGLVPAAEAQDQVRQLTRMAEGEDCGQGAPNGAFFDAGEDPAALFAAFRN 283 >gi|126723120|ref|NP_001075478.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] gi|11041722|dbj|BAB17303.1| inter-alpha-trypsin inhibitor heavy chain4 [Oryctolagus cuniculus] Length = 951 Score = 40.1 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 54/142 (38%), Gaps = 34/142 (23%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM KI +++ +L+++ + L+ FS+ Sbjct: 276 VIFIVDQSGSM------LGRKIQQTREALLKILDDLNPRD------RFNLILFSS----- 318 Query: 234 FLLEWGVSHLQRKIKYLS---------KFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTE 283 W S +Q ++ +S + T+ L A + + + + +R + Sbjct: 319 SATPWKTSLVQASLETVSEARSYAGAIQAAGGTDINEALLLAVSLLDHEQEELRAGSVS- 377 Query: 284 DANYKKIIVFMTDGENLSTKED 305 +++ +TDGE K + Sbjct: 378 ------LLILLTDGEPTQGKTN 393 >gi|330470624|ref|YP_004408367.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] gi|328813595|gb|AEB47767.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] Length = 555 Score = 40.1 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 74/241 (30%), Gaps = 35/241 (14%) Query: 138 PLKFCTFIPWYTNSRHIVMP----ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 P R + +P +T + S ++ +LDVS SM + T Sbjct: 314 PAMPSDDRLRADERRPVPLPAEETVTDVLTGWSGVQRSARILTLLDVSGSMAAQVPGGGT 373 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF---------SNKIEEFFLLEWGVSHLQ 244 ++D +++ E L+ D + G+ F +I L L Sbjct: 374 RLDATVRAAE---EGAALLLDNSE---LGVWAFATNLDGERDHQEILPVAPLGAQRDRLA 427 Query: 245 RKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + + + T AY R++ N ++ MTDG N Sbjct: 428 EALAAVRVEPQGGTGLYDSTLAAYRD------ARRNWTPGRINL---VLVMTDGRNEDDD 478 Query: 304 EDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDA 357 ++ + +R + IG+ E L A A + ++ P M Sbjct: 479 SIGRAALLAELKRLQEPRRPLPIIFIGLGGAIDPEELEAIAKVTGGRVFVTAQPSGMRRI 538 Query: 358 F 358 F Sbjct: 539 F 539 >gi|288918658|ref|ZP_06413006.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349956|gb|EFC84185.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 238 Score = 40.1 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 45/131 (34%), Gaps = 17/131 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM ++ ++ ++ ++L + + GL+TF + Sbjct: 20 VLFVVDASGSM-----AARARMGAVKAAVLSLL-----LDAYQRRDKIGLITFRGEAAAL 69 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + G T GL A+ + R+ ++V Sbjct: 70 ALPPTSSVDVAAARLADLPSGGRTPLADGLLLAHRTL-----AREQLRDPSRRP--LLVV 122 Query: 294 MTDGENLSTKE 304 +TDG + + Sbjct: 123 LTDGRHTAGPA 133 >gi|327538530|gb|EGF25193.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1460 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 40/282 (14%), Positives = 82/282 (29%), Gaps = 65/282 (23%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAIS--RYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 D ++ S+ P N +++ + F + +S Sbjct: 829 TSDAQQAVSIQATDAPTNVELTLAVDQKRPSLAEVNFRGNRYQSSVMSSGTAFGVTSRGQ 888 Query: 165 NSQTDARLDM----------MIVLDVSRSMESFFDSSI----------TKIDMAIKSINA 204 Q A + + V+D S SM + +K + A ++ Sbjct: 889 TVQKGAGITIRDAMDAGRAITFVMDCSASMNDPLGEEMGRSALGAQRASKFEAARSAVYE 948 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKI-----EEFFLLEWG---VSHLQRKIKYLSKFGVS 256 M+ ++ P Q GLV + +++ + G + LQ++ F + Sbjct: 949 MMRRLQPGPS-----QIGLVLYGHRMAIRAGDPAKDSGDGSGQTTLLQKRYHKRFPFPPT 1003 Query: 257 ----TNSTPGLKYAYNQIFDMQGMRQHCNTE----------------------DANYKKI 290 + L +++ RQH + +K Sbjct: 1004 IQPFEDVEVALPTGRFDTAELELARQHFDAAVPWGQTPLYLSIWKAMEDISRTGDGVRKD 1063 Query: 291 IVFMTDGENLSTKEDQQSL----YYCNEAKKRGAIVYAIGIR 328 +V ++DG N +++ AK G V+ IG Sbjct: 1064 VVVISDGRNYQFNPTPEAIFSIGQLVTRAKTLGVQVHVIGYG 1105 >gi|92113590|ref|YP_573518.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] gi|91796680|gb|ABE58819.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] Length = 596 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 75/215 (34%), Gaps = 36/215 (16%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 +Q D+ ++ DVS SM++ +++ + + + + + + T Sbjct: 22 TQDATPPDVRMIFDVSGSMKANDPANLRASALQLAAALLPSQ-----------ARGSVWT 70 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 F ++ + +R+ + LS V Y Q D++ + ++ A Sbjct: 71 FGTQVRNPLPDGKVDAEWRRRARSLSPQLVD----------YQQFTDIEQALREA-SQAA 119 Query: 286 NYKKIIVFMTDG------------ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 K+ ++ +TDG + + + E + +V+ I + Sbjct: 120 GGKRHVILLTDGMVDLPGSGEVKRKRDAASRETLIASLAPELATQDVVVHTIALSRNVDR 179 Query: 334 EFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIV 366 + L ++ + E P + AF + + IV Sbjct: 180 DLLERVSQSTDGLAAVAETPEELLRAFLDVLERIV 214 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 40/193 (20%), Positives = 72/193 (37%), Gaps = 12/193 (6%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 RL+++++LDVS SM F S + + EE + DV V +G+V Sbjct: 122 RPRLNLLVLLDVSGSMGESFSSYYYDQLGQQVAQPSGSEEQRSKMDVAKEVLAGVVGKLG 181 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + F ++ L R++ TN GL A ++ + R + A + Sbjct: 182 PDDSFGVV------LFRQMGRDIADTSGTNMQAGLDAATGEM---RACRTCMEADRAATE 232 Query: 289 KIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSF 345 IV +TD + N D+ L G IG+ + + E + + ++ Sbjct: 233 NRIVLITDAQPNQGDISDEGLLARLKANAADGIHTTIIGVGLDFNTELVEGISKVRGANY 292 Query: 346 YLVENPHSMYDAF 358 + V +P Sbjct: 293 FSVHSPGEFRRRL 305 >gi|226294679|gb|EEH50099.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 788 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 71/223 (31%), Gaps = 32/223 (14%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 IP F P N R ++ + + + +++ ++D S SM+ KI Sbjct: 261 DIPSGFLETHPSIPNQRALMATLVPKFNIPPISP---EIVFIIDRSGSMDG-------KI 310 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGV 255 ++ L+ + + N +F K + + + + ++ Sbjct: 311 GTLQAALRVFLKSLPVGVTFNVCSFGSSYSFMWKKSRSYDDS-SLRAALKFVDSVAADFG 369 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCN 313 T +K F + I+ +TDG DQ++L Y Sbjct: 370 GTEMLKPVKATVENRF-------------KDLDLEILLLTDGAIW----DQETLFKYIHK 412 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFY--LVENPHSM 354 A + + +GI SH + A + + V + + Sbjct: 413 AASDKPIRFFTLGIGDGASHSLIEGIAKAGNGFAQFVNDNEPL 455 >gi|197322496|ref|YP_002154769.1| putative von Willebrand factor [Feldmannia species virus] gi|197130563|gb|ACH46899.1| putative von Willebrand factor [Feldmannia species virus] Length = 253 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 51/168 (30%), Gaps = 11/168 (6%) Query: 150 NSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I S+ + L ++++D S SM T+ + A + A++ Sbjct: 31 NPEQIPQMAESAAETKMLEKEALVRQFVLLIDRSGSMGWPDGDDKTRWERAKEVTKALVP 90 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + L F +++ +Y G +TN + L+ A Sbjct: 91 SLFKYDVDKS---IPLFLFDSEVSFVGECTNASQIETVFTEYQP--GTTTNLSGALEQAM 145 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA 315 + + IV + G + + Q Y + A Sbjct: 146 EMYLGSKRVNYEVVPGTT----FIVLLDGGADNPDEVVQVLQKYADPA 189 >gi|168702547|ref|ZP_02734824.1| hypothetical protein GobsU_23667 [Gemmata obscuriglobus UQM 2246] Length = 279 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 11/65 (16%), Positives = 27/65 (41%), Gaps = 6/65 (9%) Query: 174 MMIVLDVSRSMESFF--DSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTFS 227 ++ ++D S SM F +K ++N +L+ V V + + G++ + Sbjct: 15 LLFLIDQSGSMADPFVGTGGASKAATVADAVNRLLQNVVLRSAKADGVRDYFRVGVIGYG 74 Query: 228 NKIEE 232 ++ Sbjct: 75 TAVKA 79 >gi|146185719|ref|XP_001032387.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146143099|gb|EAR84724.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 796 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 77/235 (32%), Gaps = 26/235 (11%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 ++NE N KL + F + + F N ++ D V Sbjct: 179 VLNEFNIYQFNKLVKFTQVTAYHTEILAQFSKDTLKSIFQNGLNLEFLKIKYDKQNVTLL 238 Query: 125 EGYSISA-ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRS 183 Y + IS Y + L F I + + ++ L +++D S S Sbjct: 239 SSYQNDSEISPYCVLLNFVPNIAQVKKQ----LFRQAQNEIELMKAEFL---LLIDRSGS 291 Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 M + I+ A +++ L+ + N ++ G + + + + Sbjct: 292 M------VGSNIETAKQALIFFLKSLPEGSIYN-IISFGTNYTVMYPQSVQVNDQNLQDS 344 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 KI+ TN + LKY + D G+R K I +TDGE Sbjct: 345 IDKIEKFQANMGGTNISQALKYLMYNLQDQYGLR-----------KKIYIITDGE 388 >gi|91791025|ref|YP_551976.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91700905|gb|ABE47078.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 753 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 65/227 (28%), Gaps = 49/227 (21%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES---FFDSSITKIDMAIKS 201 TN + + + I+LD S SM + A++ Sbjct: 559 AMGDTNVFKVTSSVCGIDAAAT---------ILLDRSGSMSRCIVEAAGAALSCSQALER 609 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 I+ + +++ P V + R++ + G T Sbjct: 610 ISKVKTSIEMFPGYAKCV--------GNTVALQAFGQSARQVARRVNEVDAEG-GTPLAE 660 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGA 320 L+ ++ KK IVF+ TDG + +L +A+K G Sbjct: 661 ALQEVMPRLLA------------QRVKKRIVFLVTDGIPNNRPG---ALEEIGKAEKLGV 705 Query: 321 IVYAIGIR----VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 IGI I C + + + DAF + + Sbjct: 706 EFVGIGIGVHGRAIEGLTPFSIC--------INDASELPDAFEKLFR 744 >gi|284997525|ref|YP_003419292.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|284445420|gb|ADB86922.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 380 Score = 40.1 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 65/211 (30%), Gaps = 34/211 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ LD S SM + KI++A + + + + + +TF Sbjct: 33 GSATGFHYIVALDTSGSMSGY------KIELAKQGAIELFKRIPKGN------KVSFITF 80 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDA 285 S+ V+ ++ + L G + A Y I + + Sbjct: 81 SS----------NVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT- 129 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG L K VY+ GI + + L+ + + Sbjct: 130 ----YLLLLTDGNPTDETNVGNYLKLPYFEK---MQVYSFGIGDDYNEQLLQNISDKTSG 182 Query: 344 SFYLVENPHSMYDAF-SHIGKDIVTKRIWYD 373 Y + + + + I K + D Sbjct: 183 VMYHISDANEIPQKLPQKAVTQIAAKNVTVD 213 >gi|118580309|ref|YP_901559.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118503019|gb|ABK99501.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 518 Score = 40.1 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 34/198 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++LD+S SM S ++ + ++ + V V FS K +E Sbjct: 353 VILLLDLSGSMTSKYEIANAATFALHTTLFGLKG-----------VAVCSVEFSGKDKEP 401 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT--EDANYKKII 291 ++ L FG S ++ + + +KI+ Sbjct: 402 ------------EVNILVDFGRKPQSENFNHRPFDSTPTHNAIWAGRAMLLQRPEPRKIM 449 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 + +TDG + E + + K G V AIGI F ++ ++E Sbjct: 450 LLLTDGCPDNKPETRGATQ--RTMKD-GIEVAAIGIMDKNVRRFW------DNHKIIETI 500 Query: 352 HSMYDAFSHIGKDIVTKR 369 + A I + ++ KR Sbjct: 501 QELPVAMFGIMEGLLIKR 518 >gi|297473022|ref|XP_002686353.1| PREDICTED: chloride channel accessory 2 [Bos taurus] gi|296489247|gb|DAA31360.1| chloride channel accessory 2 [Bos taurus] Length = 908 Score = 40.1 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 37/209 (17%), Positives = 72/209 (34%), Gaps = 37/209 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS M + ++ ++ L ++ I G+ +F +K E Sbjct: 312 VCLVLDVSSKM-----AEADRLLQLQQAAEFYLMQIVEIHTF-----VGIASFHSKGEIR 361 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ GLK + + + G + Sbjct: 362 AQLHQINNDDDRKLLVSYLPVAVSAEAETSVCSGLKKGFEVVEKLNGK---------AFG 412 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 +++ +T G++ + A G+ ++ I + L F+ Sbjct: 413 SVMILVTSGDDEHISNCFLT------ALSSGSTIHTIALGSSTVKNLEELSHLTGGLRFF 466 Query: 347 LVE--NPHSMYDAFSHIGK---DIVTKRI 370 + + N +SM DAFS I DI + I Sbjct: 467 VPDKSNANSMIDAFSRISSGTGDIFQQHI 495 >gi|313902415|ref|ZP_07835818.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] gi|313467346|gb|EFR62857.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] Length = 170 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 11/78 (14%), Positives = 28/78 (35%), Gaps = 1/78 (1%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + + LP+I LG++++ S + + ++ D + + + ++E + Sbjct: 18 GGAVGAAFLLLLPVILAALGLVLDGSRLVLTRAHAQAVADFASLAGVQE-VDEEALARGE 76 Query: 76 KLKGGDILCRIKNTWNMS 93 W S Sbjct: 77 PALRAGAAAATARLWAES 94 >gi|241767270|ref|ZP_04765006.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241362052|gb|EER58187.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 374 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 73/197 (37%), Gaps = 23/197 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-K 229 L +++ D+S S +++ S+ ID+ ++ E + D ++ V + + Sbjct: 183 SLATLLLADLSLSTDAYATSNARVIDVIRDALYVFGEALSGTGDAFEMLGFSSVRRQHVR 242 Query: 230 IEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 I+ F W + ++ ++ L G T L+ A ++ Sbjct: 243 IQHIKGFGERWNDT-VRSRVGALKP-GFYTRMGAALRDATRRL-----------GARPER 289 Query: 288 KKIIVFMTDGENLSTKED------QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +++++ +TDG+ + + + EA+ G + + I +HE+L Sbjct: 290 QRLLLVLTDGKPNDLDIYEGRYGLEDTRHAVQEARDAGLTPFCVTID-HEAHEYLPMLFG 348 Query: 342 PNSFYLVENPHSMYDAF 358 + LV P + Sbjct: 349 SQGYALVRRPQDLVQRL 365 >gi|255598999|ref|XP_002537129.1| conserved hypothetical protein [Ricinus communis] gi|223517386|gb|EEF25254.1| conserved hypothetical protein [Ricinus communis] Length = 397 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 23/124 (18%), Positives = 41/124 (33%), Gaps = 11/124 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 + V+D S SM+ F T A A + D G+V F+N Sbjct: 159 VFVIDRSVSMDHPFAGQATGGHAAEIKSAAARRLITAFIDSRPDDMMGVVGFTNSALYGM 218 Query: 235 LLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + I+ + ++ TN G+ A ++ + IV Sbjct: 219 KITTNREGIHAAIQAATGPALNQTNIGAGITNAVTLFENIASTGS----------RAIVL 268 Query: 294 MTDG 297 ++DG Sbjct: 269 LSDG 272 >gi|294140776|ref|YP_003556754.1| hypothetical protein SVI_2005 [Shewanella violacea DSS12] gi|293327245|dbj|BAJ01976.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 438 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 31/63 (49%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + ++ I L + V + ++ H+ K L + +D S+++AA + N+G+ + Sbjct: 20 GGAILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSVLYAAKILQNDGSLFQAR 79 Query: 76 KLK 78 + Sbjct: 80 EAA 82 >gi|282898869|ref|ZP_06306855.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] gi|281196242|gb|EFA71153.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] Length = 1499 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 51/345 (14%), Positives = 111/345 (32%), Gaps = 47/345 (13%) Query: 48 TVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDI 107 + + + + G+ + F L Sbjct: 714 NAIQTEGNIGTKAFTFTVTRSGDTTSSSSANWAVTGSSTNQADVTDFGGTLPTGTVNFTA 773 Query: 108 DDIVRSTSLDI---VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS---- 160 + ++ +D+ V +EG++++ + + T + TN I Sbjct: 774 GETSKTIIVDVLGDTTVEPDEGFTVTLSNPTNATITTATAVGTITNDDDNTGSIQGFKWK 833 Query: 161 -----SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL---I 212 V+ + +++ V+DVS S F D+ I + + ++ I Sbjct: 834 DINGNGVREDLIQGDSPNIVFVIDVSGSTRGPFQGIPVG-DVNKDGIQNTILDAEIAGFI 892 Query: 213 PDVNNVVQSGL--------VTFSNKIE-------EFFLLEWGVSHLQRKIKYLSKFGVST 257 N++V+ G V+F++ + E + G ++ K+ L G T Sbjct: 893 ALNNSLVRKGFGSRAKVSIVSFASDAKTLLTTNPETDSNKNGTKDVEEKLISLKS-GGET 951 Query: 258 NSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 N L+ A + D+ + N ++FM+DG+ + L + +K Sbjct: 952 NFEIALQEAAKTLRDIGTTAGNGN---------VIFMSDGQPNQGNYTDEVL----DLQK 998 Query: 318 RGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 G + A G+ S + L+ S + + + F +G Sbjct: 999 AGVKLSAFGVGTGASIDSLKLINPNASIF--TSTDQLLGVFDGLG 1041 >gi|72168566|ref|XP_796840.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115961659|ref|XP_001187264.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 958 Score = 40.1 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 36/190 (18%), Positives = 60/190 (31%), Gaps = 24/190 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS SM + +TK++ A I D + V + +S IE Sbjct: 320 VALVLDVSGSMG--GNDRLTKLNQAATQYLRY-----TIDDGSFVGIAHFSDYSRIIENL 372 Query: 234 FLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + L T G+ + GM I+ Sbjct: 373 TEITDNSRE--DLVMGLPSIANGPTCIGCGVLDGIKILKGETGMEDPAGG-------YIL 423 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN--SFYLVEN 350 ++DG+ + +E ++ G IV I L N F+ + Sbjct: 424 LISDGQQNRQPYIDEVF---DEVEEAGVIVDTIAFSDAADPNLLELSVRTNGLGFFYPDT 480 Query: 351 PHS--MYDAF 358 S + DAF Sbjct: 481 ATSTALNDAF 490 >gi|313159753|gb|EFR59110.1| conserved hypothetical protein [Alistipes sp. HGB5] Length = 294 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S D K+I + V N + G + F Sbjct: 74 EEERELTMMLLVDVSAS------RMFGSTDRLKKNIITEIAAVLAFSAAQNNDKVGCIFF 127 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S+K+E+F + G SH+ I+ L F ++ L + ++ R Sbjct: 128 SDKVEKFIPPKKGRSHILMIIRELVGF-RPESTGTKLSEPVRFLTNVNKKR 177 >gi|310823569|ref|YP_003955927.1| hypothetical protein STAUR_6343 [Stigmatella aurantiaca DW4/3-1] gi|309396641|gb|ADO74100.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 293 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 26/97 (26%), Positives = 42/97 (43%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+++DVS S E S + I + A I + + + GL+ FS+++E Sbjct: 77 LTVMLLVDVSASKEF---GSHERSKSEIAAEAAAQIAFSAIANND---RVGLILFSDRVE 130 Query: 232 EFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKY 265 + G SH+ R I G T+ GL Y Sbjct: 131 KVVPPRKGRSHVLRLISDILTFKPQGKGTDLGAGLMY 167 >gi|195475402|ref|XP_002089973.1| GE21365 [Drosophila yakuba] gi|194176074|gb|EDW89685.1| GE21365 [Drosophila yakuba] Length = 1171 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 41/218 (18%), Positives = 80/218 (36%), Gaps = 33/218 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++I+LD S SM F + +A +I ++L+ N+ + +S+++ + Sbjct: 236 DIVILLDHSGSMTGF------RHHVAKFTIRSILDTF----SNNDFFTI--LRYSSEVND 283 Query: 233 FFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 G + ++I+ L N T A+ Q+ +HC T Sbjct: 284 IIPCFNGALVQATPENIEVFNQQIELLDDPEGYANLTLAYDTAF-QLLRKYYDSRHCATN 342 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-------AIVYAIGIRVIRSHEF- 335 + I+ +TDG +T E Q + N + G Y +G V + E Sbjct: 343 ST-CNQAIMLVTDGVAGNTTEVFQKYNWGNG--ENGTSQMDTRVFTYLLGKEVTKVREIQ 399 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 AC + + V+ +++ I T + + Sbjct: 400 WMACLNRGYYSHVQTLDEVHEEVLKYVDVIATPLVLQN 437 >gi|159041653|ref|YP_001540905.1| hypothetical protein Cmaq_1087 [Caldivirga maquilingensis IC-167] gi|157920488|gb|ABW01915.1| conserved hypothetical protein [Caldivirga maquilingensis IC-167] Length = 448 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 30/125 (24%), Positives = 50/125 (40%), Gaps = 17/125 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+D S SM ++I + K A + ++ + VV F +I + Sbjct: 285 IYLVIDKSGSMFYTVMNNIFEYSSVSKITWAAALAIVMVMKGHEVV---ARFFDQQIYQ- 340 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + + L G TN T ++ AY+ H N NYK +V Sbjct: 341 --LMTNKKDIIKTLLSLVPLG-GTNITSAIRVAYDD--------AHRNPALRNYK--LVL 387 Query: 294 MTDGE 298 +TDGE Sbjct: 388 ITDGE 392 >gi|6752871|gb|AAF27914.1|AF218037_1 RTX protein [Aeromonas salmonicida] Length = 2747 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 31/271 (11%), Positives = 82/271 (30%), Gaps = 22/271 (8%) Query: 52 SMIDRSLVHA---ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID 108 ++D ++ ++ + + G L W+ S + + Sbjct: 2072 EVLDSLVLSGMPKGAELYSGNQLLGTVGVDGKLTLTAGSGLWSASALDVKLSGLTLRVPG 2131 Query: 109 DIVRSTSLDIVVVPQ--NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 L + + + + SA I L + +++ ++ V Sbjct: 2132 SSAGQVDLKVEAIAKEVGTDQTSSASDEDSIRLSYFNATEGEPGNQNRTFGSEHNIVVGD 2191 Query: 167 QTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL--IPDVNNV 218 + + ++ ++D S S+ S + + + +K D Sbjct: 2192 LDGSVILPGQNYNIAFMVDSSGSIGS------DAMKTIKSQLAQVFASLKSSAGTDGAGK 2245 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF--DMQGM 276 V LV F +++ + ++ +++ + K + G Y +F Sbjct: 2246 VNIFLVDFDSQVNKSISVDLKDPDALNQLQNIIKTMDGSEKNGG-ATXYEDVFKTTANWF 2304 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + N K + F+TDG+ + + Sbjct: 2305 KSDFAKLNTNAKNVAYFITDGQPNTFNSESN 2335 >gi|296386556|ref|ZP_06876055.1| ppkA-related protein [Pseudomonas aeruginosa PAb1] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|291232109|ref|XP_002735987.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 340 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 31/227 (13%), Positives = 75/227 (33%), Gaps = 38/227 (16%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I + + + + + LD+ +D + SM ++ I MA +SIN +++E+ Sbjct: 2 SIRVDAVAKPEAMNHDSSILDLAFAMDCTGSMGTY-------ISMAQQSINHIVDEIVSK 54 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYA 266 + ++ LV + + E V+ + + + ++ + A Sbjct: 55 ESSD--IRLALVEYRDHPPEDRTFVTRVNDFTPYVSTMKSCLNAARADGGGDTPEAVADA 112 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTD---------GE--NLSTKEDQQSLYYCNEA 315 + ++ ++ KI VF+ D G+ + + Sbjct: 113 LYDVLNLNW--------RSSATKICVFIADAPPHGLSRGGDRFPDGCPSGLDPMKTARQL 164 Query: 316 KKRGAIVYAIG----IRVIRSHEFLRACASPNSFYLVENPHSMYDAF 358 +R VY +G I + A + + + + S+ Sbjct: 165 AERDITVYMVGCEPSITPYKHFFMAVAHMTGGQYVPLTDASSLAKVI 211 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 9/48 (18%), Positives = 26/48 (54%), Gaps = 1/48 (2%) Query: 18 GMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 ++ AI++ + LG +++ ++FF++ L + D + + A ++ Sbjct: 1 SFAVVAAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRM 47 >gi|116054435|ref|YP_788230.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa UCBPP-PA14] gi|115589656|gb|ABJ15671.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa UCBPP-PA14] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|113474645|ref|YP_720706.1| hypothetical protein Tery_0817 [Trichodesmium erythraeum IMS101] gi|110165693|gb|ABG50233.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101] Length = 460 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 59/165 (35%), Gaps = 23/165 (13%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVK 210 + + + + ++++LD S SM+ TK++ AI +I +++ Sbjct: 95 QTLNQADINLKIPQPKELPPAWVIVLLDFSGSMKQLDSSGEKTKLEGAIAAIRKFNQDLA 154 Query: 211 LIPDVNNVVQSGLVTFS------------NKIEEFFLLEWGVSHLQRKIKYLSK----FG 254 + +V F K + F+L G +++ + L Sbjct: 155 K---KGENTRISIVPFGKGGKNCPGNQVRKKQLDNFVLA-GKKKVEQTLNKLESKLDNLC 210 Query: 255 VSTNSTPGLKYAYNQIFDMQGMR--QHCNTEDANYKKIIVFMTDG 297 +T+ L+ A + + R N+ ++ I+ ++DG Sbjct: 211 AATDIYEPLRQAVQFFGNPEDTRFNLPINSNLPQPRRSIILLSDG 255 >gi|107099066|ref|ZP_01362984.1| hypothetical protein PaerPA_01000075 [Pseudomonas aeruginosa PACS2] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|152984686|ref|YP_001345541.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa PA7] gi|150959844|gb|ABR81869.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa PA7] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|146298483|ref|YP_001193074.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152901|gb|ABQ03755.1| BatB-like protein [Flavobacterium johnsoniae UW101] Length = 344 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 69/175 (39%), Gaps = 20/175 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +D++ +DVS+SM + D + +++D + + ++ ++ + + Sbjct: 78 GTKMETVKREGIDIVFAVDVSKSMLAE-DVAPSRLDKSKQLVSQIINSLGND-------R 129 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 G+V ++ + S + ++ +S VS+ T + + + Sbjct: 130 IGIVAYAGSAFPVLPITSDYSVAKMFLQSMSPDMVSSQGT--------SLDEAIRLSATY 181 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 E + K+++ ++DGE+ S + K G + +G+ + Sbjct: 182 FDEKSKTSKLLILISDGEDHSEGASAAAEEA----NKIGMKIITVGVGTEKGGTI 232 >gi|5787973|gb|AAD03499.2| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|15595272|ref|NP_248764.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa PAO1] gi|9945898|gb|AAG03464.1|AE004446_12 serine/threonine protein kinase PpkA [Pseudomonas aeruginosa PAO1] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|74195997|dbj|BAE30555.1| unnamed protein product [Mus musculus] Length = 436 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 45/149 (30%), Gaps = 17/149 (11%) Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 +W L + K TN+ L+ Y+ + G + +I+ MTD Sbjct: 5 DWVTEKLNQISYEDHKLKSGTNTKRALQAVYS-MMSWAGDAPPEGWNRTRH--VIIIMTD 61 Query: 297 G-ENLSTKEDQQSLYY---------CNEAKKRGAIVYAIGIRV----IRSHEFLRACASP 342 G N+ ++ VY G+ + + + Sbjct: 62 GLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNE 121 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + + V++ + + F + + + + Sbjct: 122 HHVFKVKDMEDLENVFYQMIDETKSLSLC 150 >gi|32471442|ref|NP_864435.1| hypothetical protein RB1478 [Rhodopirellula baltica SH 1] gi|32443283|emb|CAD72114.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 666 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 64/189 (33%), Gaps = 11/189 (5%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 + + + + I I ++++ + L ++ +LD S S++ D Sbjct: 133 TVGAPAERIVSTTQQIGSADGIESATQSIENAIRGELKQGDTL-VVWLLDASISLQLNRD 191 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 +I I A+ + + S +V F I E + Sbjct: 192 RMARRIREFYDEIGALSRPDTSDGIERHRLFSSVVAFGKGINEVSPPS------LVGVNA 245 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-IVFMTDGENLSTKEDQQS 308 + + G++ N + + + H + +++ +V +TD ++ + + Sbjct: 246 IDAMTRVPVDSSGIE---NTMTAVDNVLNHYRYKAKRKERLMVVLLTDESGDDVQKLEYT 302 Query: 309 LYYCNEAKK 317 + C EAK Sbjct: 303 IANCREAKA 311 >gi|218888819|ref|YP_002437683.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa LESB58] gi|254243183|ref|ZP_04936505.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa 2192] gi|126196561|gb|EAZ60624.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa 2192] gi|218769042|emb|CAW24802.1| serine/threonine protein kinase PpkA [Pseudomonas aeruginosa LESB58] Length = 1032 Score = 40.1 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM+ + D ID + I + + V GLV F N ++ Sbjct: 616 IVLVVDTSVSMQPYIDRVREVIDGLQRQIGERGDL--------DKVSFGLVGFRNNTDKT 667 Query: 234 FLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 LE+ L + ++ +T +++N+ M+ + + Y Sbjct: 668 PGLEYVSKTLVPLQPGNDAQRFAELSSQVRATDVSSHSFNEDAFAGIMQAVDEMDWSPYA 727 Query: 289 -KIIVFMTDG---ENLST--KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 ++++ +TD + A ++G +YA+ +R Sbjct: 728 GRVVLLVTDAGALRKNDPLGRTQMNEAEVRQAALRKGVKIYALHLRTPAGKN 779 >gi|298370126|ref|ZP_06981442.1| von Willebrand factor [Neisseria sp. oral taxon 014 str. F0314] gi|298281586|gb|EFI23075.1| von Willebrand factor [Neisseria sp. oral taxon 014 str. F0314] Length = 865 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 80/221 (36%), Gaps = 29/221 (13%) Query: 96 NELRDNGFVNDIDDI---VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 E + F + D + +LDI + + ++ I I T + Sbjct: 362 GETLKDSFTYTVTDADGDKSTATLDITINGTDADKAV-------IGKNGGDNITGGTGND 414 Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKL 211 ++ ++ + ++ +VLD S SM +F DS I+MA KS+ + ++ Sbjct: 415 ALIGDA-GGYRLIIKPGQNYNVALVLDTSESMNTFRTDSGEAYIEMARKSLLKLAKDFAH 473 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 NV L F++ ++ + V L +I L GV TN + A Sbjct: 474 HDGHLNVT---LFAFNSTAKQVVTINDLTEQNVDKLLNQIMSLRAHGV-TNYDDAFRDAT 529 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + H T F+TDG+ + + ++ Sbjct: 530 DWFSKVSSNGYHNVT---------YFLTDGQPTTYGDTGET 561 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 27/238 (11%), Positives = 73/238 (30%), Gaps = 40/238 (16%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 ++ + ++ L + +D + + + Q+ ++ + + Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQLPDDPDVSTGAVTAAAT----------------- 59 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 N ND D +T + + +SA ++ ++ L I + + Sbjct: 60 -QNLLANDAD----ATGILVESGGATRSVCVSAEAKVEMTLSQVIGI----GDQTVTAEA 110 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL-IPDVNN 217 + +++++VLD + SM T I ++ ++ + + Sbjct: 111 CAGY-------NDIELVMVLDATGSM------KGTPIANVKEAATNLVNLIMPSSSSTST 157 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GLV F K+ + + T + L+ Y++ Sbjct: 158 RSKIGLVPFQGKVRIDGNDPVTAEANPDGVGPGCRNADGTLNNGKLRTEYSKTTTKTN 215 >gi|227818616|ref|YP_002822587.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] gi|36958872|gb|AAQ87297.1| Hypothetical protein RNGR00523 [Sinorhizobium fredii NGR234] gi|227337615|gb|ACP21834.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] Length = 140 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%) Query: 3 SLNLNIRNFFYNYKGGMTILTAIF-LPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSL 58 +L+L +R + + G + A+ P++ LVLG +IE F+++ + D + Sbjct: 2 TLSLFLRRAWRSQSGATAVEFALVCFPLLLLVLG-VIEFGRAFYVRNDMSYAADVAA 57 >gi|13475442|ref|NP_107006.1| hypothetical protein mlr6511 [Mesorhizobium loti MAFF303099] gi|14026194|dbj|BAB52792.1| mlr6511 [Mesorhizobium loti MAFF303099] Length = 537 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 63/191 (32%), Gaps = 22/191 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I+LD S SM + D K+++A +S+ +L+ V + G + + ++ + Sbjct: 11 VIIILDASGSMWAQIDGKP-KLEIARESLRTVLQSVPADDE------IGFMAYGHREKGS 63 Query: 234 FLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +Q + + T++ LK+ + + K +V Sbjct: 64 C------DDIQLIVPPQPGSASAITDAADSLKFLGKTPLTAAVKQAAEALKYTEDKATVV 117 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIRVIRSHEFLRACA-SPNSFYL 347 +TDG E K G V G+ + + + Sbjct: 118 LITDG---LETCGGDPCALGKELKASGVDFTADVVGFGLTADEGKQIACLAENTGGKYIQ 174 Query: 348 VENPHSMYDAF 358 + ++ +A Sbjct: 175 ASDEKALQEAL 185 >gi|305663839|ref|YP_003860127.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304378408|gb|ADM28247.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 443 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 67/172 (38%), Gaps = 31/172 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + ++LD S SM+ + ++ ++ I ++ F ++ + Sbjct: 276 IYVLLDKSGSMDGN-----KILWAKATALALFMKS--RIERRPYYIR----FFDSEPYDL 324 Query: 234 FLLEWG--VSHLQRKIKY--LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 ++ G S + + I+Y + + G T+ + + A N I + +R + Sbjct: 325 IKVKSGAKPSEVMKLIEYIAMVRNGGGTDISKAIITACNDILRNEVVRDVSD-------- 376 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 I+ +TDGE+ K + +AK I + V+ ++ L+ +S Sbjct: 377 -IIIITDGEDRIAKSLVRKSLQHAKAK-------LISVMVMGENDDLKQISS 420 >gi|187932171|ref|YP_001892156.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713080|gb|ACD31377.1| hypothetical membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 332 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 68/163 (41%), Gaps = 29/163 (17%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + +I LDVS+SM++ D S ++++ A I +L +K Q G++ FS+ Sbjct: 88 QKNISRVIALDVSQSMDT-TDVSPSRLERAKYKIFDILRRIKEG-------QVGMIVFSS 139 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L + ++ + ++ N LK + Q+ + G++Q Sbjct: 140 EPFVVSPLTSDANTIENLVTVITSDIVPVQGHNIYKALKKS-AQLIEQAGVQQGQ----- 193 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIG 326 I+ +TD + Q++ + ++G VYAIG Sbjct: 194 -----IILITD-----SSPLPQAISQAKQLAQQGIKTDVYAIG 226 >gi|126661250|ref|ZP_01732323.1| Mg chelatase subunit; ChlD [Cyanothece sp. CCY0110] gi|126617479|gb|EAZ88275.1| Mg chelatase subunit; ChlD [Cyanothece sp. CCY0110] Length = 672 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 39/279 (13%), Positives = 82/279 (29%), Gaps = 57/279 (20%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGY-------------SISAISRYKIPLKFCTFIPWY 148 F + +S S I+ Y ++ A R P + + Sbjct: 395 YFAQMAQNQGKSGSRSIIFSDDRGRYVKPMLPKGKVRRIAVDATLRAAAPYQKSRRLKH- 453 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 IV K ++ L ++ V+D S SM ++ ++ A ++ +L E Sbjct: 454 PGRNVIVEQGDIRSKRLARKAGAL-IVFVVDASGSM------ALNRMQSAKGAVMRLLTE 506 Query: 209 VKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 N Q L+ F + + +S +++++ L G S + + Sbjct: 507 A-----YENRDQVALIPFRGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQA-- 559 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE---------------DQQSLYYC 312 + + + +IV +TDG Q+ L Sbjct: 560 --VHVGMNAKMSGDIGQV----VIVAITDGRGNIPLAKSLGEPIPEGEKPDIKQELLDIA 613 Query: 313 NEAKKRGAIVYAI-----GIRVIRSHEFLRACASPNSFY 346 + + G + I + E + ++Y Sbjct: 614 KKIRGLGIKLLMIDTEKKFVSTGFGKELAQTAG--GTYY 650 >gi|268556154|ref|XP_002636066.1| Hypothetical protein CBG01306 [Caenorhabditis briggsae] gi|187038428|emb|CAP22594.1| CBR-CLEC-218 protein [Caenorhabditis briggsae AF16] Length = 381 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 60/181 (33%), Gaps = 11/181 (6%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD+ +V+D S M + S + + V + ++T++ Sbjct: 36 LDVYVVIDDSTKMGTSGLSQV----ASSVFSTLANSRVGSDYSDKRGARVAVITYNENAY 91 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L S+ + +S VS + ++ I +M G + ++ II Sbjct: 92 IRSNLSDLTSNQDLE-NVISSLQVSKSDISNIQTPLKLINEMMGYKDGNGPKNNTMSVII 150 Query: 292 VFMTDGENLSTKEDQQSL-YYCNEAKKRGAIVYAIGIRVIRSHEF-LRACASPNSFYLVE 349 ++ D DQ + K+ G + + + L+A AS + + Sbjct: 151 MYAAD----YIDYDQPTANQLAYYLKENGVTIITVANMDDSNKIMKLKALASEGYGFSLS 206 Query: 350 N 350 + Sbjct: 207 D 207 >gi|303325524|ref|ZP_07355967.1| OmpA family protein [Desulfovibrio sp. 3_1_syn3] gi|302863440|gb|EFL86371.1| OmpA family protein [Desulfovibrio sp. 3_1_syn3] Length = 332 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 55/174 (31%), Gaps = 22/174 (12%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 V+D S SM + K D + + N + IP ++ ++ + I Sbjct: 34 SFDFVVDYSGSM--MMQNKQLKQDKIVVAKNVLQRVNAAIPALDYNGGLHTISPNGMIIA 91 Query: 233 FFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 W + + I L FG T+ GL+ + Sbjct: 92 QGP--WDRNAMSVGIDKLRSGFQIFGRMTSMGNGLQ------------KYEPFISSMKRD 137 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACAS 341 ++ +TDG+N + + A +R +++ I ++ A+ Sbjct: 138 AALILVTDGDNNRGTDIVEVARQL-YASQRNMVIHIISFADTPHGEAVIKEIAA 190 >gi|189518251|ref|XP_696635.3| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4 [Danio rerio] Length = 1094 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 63/165 (38%), Gaps = 19/165 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I +D+S SM+ ++ +A +IN +L+ + VN + S V + Sbjct: 238 ATSPKDIIIAVDISGSMKGL------RLTIAKHTINTILDTLGENDFVNVIAYSDYVQYV 291 Query: 228 NKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L++ H + ++ L G +K ++ + ++ + Sbjct: 292 EPCFKGTLVQADLDNREHFKLLVQELQVKGEG-KVKKAMKESFKILNEVTAEGRGSLCNQ 350 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG ++ + + + R V+ I Sbjct: 351 A-----IMLITDGAMEDFQQVFEEFNWPD----RKVRVFTYLIGR 386 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 24/56 (42%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 NF N G ++ + +F ++ ++ + + L + +D + + A ++ Sbjct: 13 NFLNNQNGLAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAAALAGAQEL 68 >gi|17533685|ref|NP_496744.1| C-type LECtin family member (clec-64) [Caenorhabditis elegans] gi|3876682|emb|CAB03056.1| C. elegans protein F35C5.7, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 385 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 65/189 (34%), Gaps = 19/189 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++IV+D S+ M + IT++ I + A + + ++T++++ Sbjct: 32 LDIVIVVDNSKGMT---NEGITEVAANIATTFASGPRIGTDYSDPRSTRLSILTYNSEAT 88 Query: 232 EFFLLEWGVSH---LQRKIKYLSKFGVSTNS--TPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L S Q +L++ S +S GL A + +++ + N Sbjct: 89 VVADLNQFQSADDVYQTLFSFLNEVSDSDDSFLAKGLGMAESVLYNGRMNGVRENYNRLV 148 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC---ASPN 343 + DGE L + G + + L ASPN Sbjct: 149 VVYASAYREDGE--------DPLNIAKRLRSSGVAIATVAFDPDGDGLLLSKLSKIASPN 200 Query: 344 SFYLVENPH 352 + +P+ Sbjct: 201 MSFKSRDPN 209 >gi|18309499|ref|NP_561433.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens str. 13] gi|18144176|dbj|BAB80223.1| hypothetical protein [Clostridium perfringens str. 13] Length = 614 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 59/164 (35%), Gaps = 16/164 (9%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSQEPDEDGNYEITLTVKGKPKKVTK---PVDILLIIDASNSMYYNMDELKASMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKFGVS 256 + + + IPD + +V F EE F + + + K + K + Sbjct: 114 VDKV------IDNIPDS----RIAVVAFGTYSEEVFSFNNKNNFTSKEEYKKAIKKSYNN 163 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + + N + N KK ++F +DG Sbjct: 164 IEGRGNTNIESSWRLADEIFKNELN-NNFNSKKDVIFFSDGYPN 206 >gi|298249212|ref|ZP_06973016.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297547216|gb|EFH81083.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 420 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 42/198 (21%), Positives = 71/198 (35%), Gaps = 29/198 (14%) Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S SM K+ A +++ + +Q +VTF++ F Sbjct: 45 DRSGSM------DGAKMRAARDGAVKVVQAIDAS------MQFMVVTFNDNARIIFGPAA 92 Query: 239 GVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 G+ R I + V S + A N I D G Q T I+F+TDG Sbjct: 93 GIEENKNRAIAAIQT--VYAASGTRMSTALNTIVDKFGNNQSRATR-------ILFLTDG 143 Query: 298 ENLSTK--EDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRAC-ASPNSFYLVENPHS 353 +N +++ C+ A + A G+ + E L A+ S ++ P+ Sbjct: 144 KNEGEPRVALDRAVARCSAA---NISISAWGVGTDWDAAELLHMAEATRGSADIIPTPNQ 200 Query: 354 MYDAFSHIGKDIVTKRIW 371 + AFS ++ I Sbjct: 201 VEAAFSSSFSEMRRTAIT 218 >gi|291452853|ref|ZP_06592243.1| predicted protein [Streptomyces albus J1074] gi|291355802|gb|EFE82704.1| predicted protein [Streptomyces albus J1074] Length = 923 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 35/268 (13%), Positives = 69/268 (25%), Gaps = 55/268 (20%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT- 159 N + + YS +A F+ S T Sbjct: 118 NDTFRTGGTVTTPYRFQTPRLAAATTYSSTATG-------PGGFMVGSGGSNRTASTGTW 170 Query: 160 --SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V L + +++D+S SM + + + ++ ++ P ++ Sbjct: 171 QSSRVNPALPERCGLRVALIMDLSGSMSG-------SVPALKTAADTFVDALQGTP--SS 221 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG-- 275 V + ++S G ++ + + K Y + Sbjct: 222 VARFTFSSYS-------PATRGGANAPGLVSV-----STLADAAAFKRTYASWTNATAEG 269 Query: 276 ----MRQHCNTEDANYK-KIIVFMTDGENLSTKED-----------------QQSLYYCN 313 R A + + V +TDG + + + N Sbjct: 270 STNWDRALYEPASATSQYDVAVVITDGMPTNYSIPGGPSGGASGSVTRFRELESGIASAN 329 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K G V A+G+ S AS Sbjct: 330 ALKNEGTRVLAVGVGEGTSGNAALNLAS 357 >gi|115379097|ref|ZP_01466221.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|115363880|gb|EAU62991.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 270 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 26/97 (26%), Positives = 42/97 (43%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +M+++DVS S E S + I + A I + + + GL+ FS+++E Sbjct: 54 LTVMLLVDVSASKEF---GSHERSKSEIAAEAAAQIAFSAIANND---RVGLILFSDRVE 107 Query: 232 EFFLLEWGVSHLQRKIK---YLSKFGVSTNSTPGLKY 265 + G SH+ R I G T+ GL Y Sbjct: 108 KVVPPRKGRSHVLRLISDILTFKPQGKGTDLGAGLMY 144 >gi|326429223|gb|EGD74793.1| hypothetical protein PTSG_07026 [Salpingoeca sp. ATCC 50818] Length = 8642 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 49/131 (37%), Gaps = 6/131 (4%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 ++++IV+D + +M + ++ + ++ + ++ + V+ G V + + Sbjct: 8418 IEVVIVMDCTGTMRQWMRAARDHVQEMVRFVR---DQAEKTYTGEAKVRLGFVAYRDYDC 8474 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + L I ++ F +T G A + ++ T + + K++ Sbjct: 8475 REVTQQ---CDLTEDIDKVTAFIARQKATGGKDKAEEVLSGLRAAASMKWTVNPSAMKMV 8531 Query: 292 VFMTDGENLST 302 V + D Sbjct: 8532 VVVADAPQHGP 8542 >gi|320334761|ref|YP_004171472.1| stress protein [Deinococcus maricopensis DSM 21211] gi|319756050|gb|ADV67807.1| stress protein [Deinococcus maricopensis DSM 21211] Length = 428 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 46/159 (28%), Gaps = 15/159 (9%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 +V S+K + + +VLD+S SM + + + + + Sbjct: 199 THAPQLVSLVKAAEVSLKKRGLDEHTARVALVLDISGSMGTLY-----RHGVVQRVAEKA 253 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPG 262 L D + + F + + G+ + + + T Sbjct: 254 LALASRFDDDG---RLDVFLFGEHAHDVGEI--GIPDITGAVDRVMRRHALEGGTQYGRA 308 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 ++ + + + ++F+TDGE Sbjct: 309 MQAV--RRHYLGEHKHRRAPVQQALPVYVLFVTDGETQG 345 >gi|229051629|ref|ZP_04195099.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] gi|228721740|gb|EEL73214.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] Length = 452 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 72/199 (36%), Gaps = 16/199 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 148 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYL---DKIPDNANVM---LRVYG 200 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ +K + + + + E Sbjct: 201 HKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNADFKEY 260 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 ++ ++DGE + + N++ A+V IG V S + L+ A Sbjct: 261 TGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQLKNTAE 319 Query: 342 P--NSFYLVENPHSMYDAF 358 ++ V +Y Sbjct: 320 AGKGNYATVSTADELYQTL 338 >gi|333030671|ref|ZP_08458732.1| protein of unknown function DUF58 [Bacteroides coprosuis DSM 18011] gi|332741268|gb|EGJ71750.1| protein of unknown function DUF58 [Bacteroides coprosuis DSM 18011] Length = 289 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 52/111 (46%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E F TK DM + + ++N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLE-FGTKKQTKKDMLTEIAATL-----AFSAIHNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T +K + ++ R Sbjct: 126 SDQIEKFIPPKKGRKHILFIIRELLNFNAQSRKTD-IKLGIEYLTNVMKRR 175 >gi|261289465|ref|XP_002604708.1| hypothetical protein BRAFLDRAFT_80321 [Branchiostoma floridae] gi|229290037|gb|EEN60719.1| hypothetical protein BRAFLDRAFT_80321 [Branchiostoma floridae] Length = 395 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 42/125 (33%), Gaps = 13/125 (10%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 I S + +K SQ+ DM VLD S S+ + Sbjct: 14 ILLLVGSDTSTLVGALDIKPKSQSCRGAFDMYFVLDRSASVTP---------KNFREETV 64 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTP 261 +E++ + V+ +TFS+ L ++ + +L + T + Sbjct: 65 DFVEKIVNSFH-SPKVRFSFITFSDDASVVMELSNDPDVIKHGLSHLRQIDTYGGTFMSH 123 Query: 262 GLKYA 266 GL+ A Sbjct: 124 GLRKA 128 >gi|227830108|ref|YP_002831887.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|229578921|ref|YP_002837319.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|229582327|ref|YP_002840726.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|227456555|gb|ACP35242.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] gi|228009635|gb|ACP45397.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|228013043|gb|ACP48804.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 380 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 65/211 (30%), Gaps = 34/211 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ LD S SM + KI++A + + + + + +TF Sbjct: 33 GSATGFHYIVALDTSGSMTGY------KIELAKQGAIELFKRIPKGN------KVSFITF 80 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDA 285 S+ V+ ++ + L G + A Y I + + Sbjct: 81 SS----------NVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT- 129 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG L K VY+ GI + + L+ + + Sbjct: 130 ----YLLLLTDGNPTDETNVGNYLKLPYFEK---MQVYSFGIGDDYNEQLLQNISDKTSG 182 Query: 344 SFYLVENPHSMYDAF-SHIGKDIVTKRIWYD 373 Y + + + + I K + D Sbjct: 183 VMYHISDANEIPQKLPQKAVTQIAAKNVTVD 213 >gi|78355641|ref|YP_387090.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78218046|gb|ABB37395.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 686 Score = 40.1 bits (92), Expect = 0.63, Method: Composition-based stats. Identities = 31/221 (14%), Positives = 73/221 (33%), Gaps = 15/221 (6%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 L + G D + ++V NEG +++ + + T +P + + Sbjct: 148 LDEEGLDPDDGGTGAGGTPSTLIVGSNEGDNLTGTQPHTDRVDDGTQVPVDSGTITGTGG 207 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFD----SSITKIDMAIKSINAMLEEVKLIP 213 V ++++V+D+S SM + + + A K+++ + + Sbjct: 208 NDVLVGDAQGQLENANVLVVIDMSYSMSESISDGNGGTQARYEAAAKALSDFISKYAGHD 267 Query: 214 DVNNVVQS-GLVTFSNKIEEFFLLEWGVSHLQRKIKYL-----SKFGVSTNSTPGLKYAY 267 ++N V + G T + + + + + L TN ++ Sbjct: 268 NINLAVTTFGYTTLTGPVTIDLTNPDMATVISNALHALGIDQPPVNPQGTNFDAAMETGR 327 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + A YK + +F+TDG + + Sbjct: 328 EWFDAAN-----QDPGHAGYKNVAIFVTDGAPTVHNDPGST 363 >gi|153214688|ref|ZP_01949548.1| RTX protein [Vibrio cholerae 1587] gi|124115210|gb|EAY34030.1| RTX protein [Vibrio cholerae 1587] Length = 2093 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 56/188 (29%), Gaps = 18/188 (9%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 S+ ++ ++LD S SM + T++ + + +L E + + L Sbjct: 1483 AESELKQGANVQLILDTSGSMGEPAGNGQTRLKVMQTAALQLLSEYSAL----GETRVQL 1538 Query: 224 VTFSNKIEEFFLLEWGVS-----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + F + + G I L G T+ K A I+D Sbjct: 1539 IEFYSDSRYYVSEINGSKWMTVDEASEHIDRL-YAGGGTDYDDATKMA-ADIWDDNDGDM 1596 Query: 279 HCNTEDANYKKIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + +Y F++DG+ + L + + A G+ L Sbjct: 1597 IAGGSNISY-----FLSDGQPNQGEELSNNDRLDWEKHLRDHNVTALAYGMGNDVPQGEL 1651 Query: 337 RACASPNS 344 A Sbjct: 1652 NKVAYDGH 1659 >gi|125973772|ref|YP_001037682.1| carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] gi|125713997|gb|ABN52489.1| Carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] Length = 1050 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 74/239 (30%), Gaps = 61/239 (25%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-- 241 +E + T + + + +++ + + ++ +S+ + ++ +S Sbjct: 715 VEEKYIDKSTYLKLVKDAAKNFVDKF-----AGSKTKMAVIQYSDSANDNDFKKYDLSLP 769 Query: 242 ----HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT- 295 L+ I + +N G++ AY+ + K IV +T Sbjct: 770 DKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGP--------PPKGQISKYIVVITG 821 Query: 296 -------------------DGENLSTKEDQQSLYYCNEAKKRGAIVYA-------IGIRV 329 +G K D +S + AK G I+ + I Sbjct: 822 SVPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSE 881 Query: 330 IRSHEFLRACA----------SPNSFYLVENPHSMYDAFSH----IGKDIVTKRIWYDK 374 + L A + +Y N + D ++ I D+V ++ Y++ Sbjct: 882 EDIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVVLDKVLYEE 940 >gi|256004503|ref|ZP_05429482.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|255991508|gb|EEU01611.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|316940035|gb|ADU74069.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 1313] Length = 1050 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 74/239 (30%), Gaps = 61/239 (25%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS-- 241 +E + T + + + +++ + + ++ +S+ + ++ +S Sbjct: 715 VEEKYIDKSTYLKLVKDAAKNFVDKF-----AGSKTKMAVIQYSDSANDNDFKKYDLSLP 769 Query: 242 ----HLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT- 295 L+ I + +N G++ AY+ + K IV +T Sbjct: 770 DKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGP--------PPKGQISKYIVVITG 821 Query: 296 -------------------DGENLSTKEDQQSLYYCNEAKKRGAIVYA-------IGIRV 329 +G K D +S + AK G I+ + I Sbjct: 822 SVPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSE 881 Query: 330 IRSHEFLRACA----------SPNSFYLVENPHSMYDAFSH----IGKDIVTKRIWYDK 374 + L A + +Y N + D ++ I D+V ++ Y++ Sbjct: 882 EDIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVVLDKVLYEE 940 >gi|196231437|ref|ZP_03130295.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224290|gb|EDY18802.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 725 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 54/160 (33%), Gaps = 21/160 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++I +D SRSM S D + +++ A + ++ ++ + GL+ F+ Sbjct: 90 DILIAIDCSRSMLS-TDLAPSRLGRAKLATQDLISQLTGD-------RVGLIAFAGTAFL 141 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + + L T + I + + + ++ Sbjct: 142 QAPLTIDYGAVLDSVSEL--------DTNIIPRGGTNIAAAITEADAAFGKGESDNRCLI 193 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 TDGE L + + + + ++ +G+ Sbjct: 194 IFTDGEELESDAVAAAAAEKDHMR-----IFTVGLGSADG 228 >gi|58616382|ref|YP_195512.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315844|emb|CAI10488.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 212 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 49/132 (37%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + M ++ P V ++TFS + Sbjct: 3 RLPVYLLIDCSGSM------MGEPIEAVKVGLQTMTSALRTDPYALETVHLSVITFSQQA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L +Q ++ + G T + Q + + +++ ++K + Sbjct: 57 TQSVPL---TELVQFQVPDIHASG--TTALGEALALLAQCIERDVKKTTPDSKG-DWKPV 110 Query: 291 IVFMTDGENLST 302 +TDG+ Sbjct: 111 CFLLTDGQATDD 122 >gi|186683453|ref|YP_001866649.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186465905|gb|ACC81706.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 201 Score = 39.8 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 65/171 (38%), Gaps = 18/171 (10%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ ++A +S A+ + + V FS + + Sbjct: 6 DYTLIIDKSGSMSTPDQVGGRSRWEIAQESTLALARKAEQFDPDGITVYL----FSGRFK 61 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S +I + +TN L+ A N F + + + I Sbjct: 62 RY---DDVTSAKVAQIFLENDPAGTTNLAGVLQDAINNYFQRKAAGKSKPNGET-----I 113 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKR----GAIVYAIGIRVIRSHEFLRA 338 + +TDGE K + + + +R G + +G ++ +FL+A Sbjct: 114 LVITDGEPDDRKAVFEVIIHATRQMERDEELGISIIQVG-SDAQATKFLKA 163 >gi|288925757|ref|ZP_06419688.1| BatB protein [Prevotella buccae D17] gi|288337412|gb|EFC75767.1| BatB protein [Prevotella buccae D17] Length = 342 Score = 39.8 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 64/174 (36%), Gaps = 24/174 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +V ++ +I LD+S SM + S + K + ++S+ V + Sbjct: 79 GAEVQRDKRNGIEAIICLDISNSMLAQDVAPSRLDKSKLLVESL------VDRFTNDK-- 130 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F+ + + ++ + + T T + A N + Sbjct: 131 --IGLIVFAGDAYVQLPITSDYVSAKMFLQNIDPSLIQTQGTD-IAQAIN-------LGL 180 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H T+ + I+ +TDGE+ + + +K+G V+ +G+ + Sbjct: 181 HSFTQADKIGRAIIVITDGEDHEGGAVEAAAEA----RKKGVNVFILGVGDTKG 230 >gi|54296651|ref|YP_123020.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] gi|53750436|emb|CAH11830.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] Length = 1169 Score = 39.8 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 54/179 (30%), Gaps = 51/179 (28%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ-------HCNTEDANYKKIIV 292 + + + +L ST++T A R + + K+ I+ Sbjct: 350 TNAINAFLPHLRPETNSTSTTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYII 409 Query: 293 FMTDG----------------------------------ENLSTKEDQQSLYYCNEAKKR 318 ++DG + +++ ++ N K Sbjct: 410 LISDGLPTQDLQARYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKND 469 Query: 319 GAIVYAIGIRV-------IRSHEFLRACA---SPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + + IG+ + LRA A ++Y +P ++ ++ + I +I Sbjct: 470 GVLTFVIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 528 >gi|85117086|ref|XP_965177.1| hypothetical protein NCU00984 [Neurospora crassa OR74A] gi|28926981|gb|EAA35941.1| predicted protein [Neurospora crassa OR74A] gi|38567055|emb|CAE76353.1| hypothetical protein [Neurospora crassa] Length = 1086 Score = 39.8 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 59/191 (30%), Gaps = 33/191 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V D S SM +I+ ++ L+ + + N + +F + E Sbjct: 296 IVFVCDRSGSMGG------ARIEGLKSALRIFLKSIPVGAKFN------ICSFGSTFEFL 343 Query: 234 FLLEWGVSHLQR------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F + + + T L+ A+ + ++ + Sbjct: 344 FSDGSRSYDHESLRLAMDYVSRMDADLGGTEMYQPLEAAFEKRYNDMDLE---------- 393 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 + +TDGE + + + + ++ +GI SH + A + + Sbjct: 394 ---VFLLTDGEIWNQEHLFTMINKKVSESQGAIRLFTLGIGNDVSHALIEGAARAGNGFA 450 Query: 348 --VENPHSMYD 356 V + M Sbjct: 451 QSVTDSEKMNA 461 >gi|329851995|ref|ZP_08266676.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] gi|328839844|gb|EGF89417.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] Length = 399 Score = 39.8 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 17/156 (10%), Positives = 41/156 (26%), Gaps = 2/156 (1%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM--NEGNGNN 73 G + +++A+ + V+ I+ S + + L + D + A ++ ++ Sbjct: 13 GGNVAVISALCAAPLLYVITATIDHSAMVKDRFSLQAAADAGALMGAAKLALGSDELVFG 72 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 + + ++N + + P +A Sbjct: 73 AAEAAAHQQIGDLRNPVTFEVVVDRSTGAVTVTGRSQHAPLIGFMSDGPTPISARATAEG 132 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 K PL V+S D Sbjct: 133 LLKTPLCILQIGSSEMKIDDQATVRAGGCLVHSNKD 168 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 39.8 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 53/172 (30%), Gaps = 27/172 (15%) Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 V + + L+ G ++L + S + P YAY+ F Q + + Sbjct: 162 VEYRCQKAFVVLMSDGDANLSQACGRYSSSRNRSFVYPA-GYAYDDYFGRQHAGACRDID 220 Query: 284 DANYKKI----------------IVFMTDG------ENLSTKEDQQSLYYCNEAKKRGAI 321 +Y I F T G +D ++ N + + Sbjct: 221 GGSYDTFWDRNTGLSFFSRKLSSIDFKTSGLDKAGKSWQGDPKDPKTAKNPNGFRDQLVQ 280 Query: 322 VYAIGIRV---IRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + IG +L AS + ++ E+ +Y AF I I + Sbjct: 281 TFTIGFGRDISSAGRNYLTNGASRDDYYFSAESEDDLYRAFDTITDSIKDES 332 >gi|226310168|ref|YP_002770062.1| hypothetical protein BBR47_05810 [Brevibacillus brevis NBRC 100599] gi|226093116|dbj|BAH41558.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 437 Score = 39.8 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 69/216 (31%), Gaps = 43/216 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-- 229 ++ I+LD S SM TK+ +A ++I E + + L + +K Sbjct: 136 FNVEIILDASGSMAGKI-GDKTKMQLAKEAIQEFAEALP------EDARISLRVYGHKGS 188 Query: 230 -IEEFFLLEWGVSHL------------QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 +E L G S + ++ + G T+ L+ A + + Sbjct: 189 NADEHKQLSCGSSEMVYPLQAYDAKRLEQALDMFEPTGW-TSIAHSLRLAQEDLAGFEAD 247 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRV-IRSH 333 + +I ++DG D + E + + + IG V Sbjct: 248 KNTN---------VIYLVSDG---IETCDGNPVAVAKELSQSKIMPLLNVIGFDVNAEGQ 295 Query: 334 EFLR--ACASPNSFYLVENPHSM---YDAFSHIGKD 364 + L+ A AS + V N + I + Sbjct: 296 KQLKEIAHASEGLYANVTNREQFKQELERAKEIAQK 331 >gi|326925129|ref|XP_003208773.1| PREDICTED: hypothetical protein LOC100547685, partial [Meleagris gallopavo] Length = 721 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 35/88 (39%), Gaps = 15/88 (17%) Query: 291 IVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY- 346 IV +TDGE S+ C ++ GAI++ I + + E Y Sbjct: 14 IVLLTDGE-------DSSMSVCRERVRESGAIIHTIALGPAAAKELEEFSNITGGLQLYA 66 Query: 347 -LVENPHSMYDAFSHIGK---DIVTKRI 370 V+ P + +AFS I DI + I Sbjct: 67 VDVDVPSKLVEAFSEITTGSGDISEQSI 94 >gi|300934008|ref|ZP_07149264.1| hypothetical protein CresD4_08057 [Corynebacterium resistens DSM 45100] Length = 410 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 53/160 (33%), Gaps = 25/160 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + + D+ VLD S SM + S T++D K ++ + +V + Sbjct: 207 GSDMKPRAASPRDVTFVLDTSGSMT-LVEGSATRLDNIRKPLSDAMRKVGE--------R 257 Query: 221 SGLVT---FSNKIEEF--FLLEWGVSHLQR-----KIKYLSKFGVSTNSTPGLKYAYNQI 270 G V +S+ + + V R I L++ G + + Y I Sbjct: 258 GGAVGLWNYSSPLSDSARTPFRNNVDITNRDNGTIAISILNQLGAKGAT-----HTYESI 312 Query: 271 FDMQGMRQHCNT-EDANYKKIIVFMTDGENLSTKEDQQSL 309 A +V +TDG N + +S Sbjct: 313 AAAYASAVAGAGMPGAQSPARVVLITDGPNDGGRVTLESA 352 >gi|298484177|ref|ZP_07002343.1| conserved hypothetical protein [Bacteroides sp. D22] gi|298269681|gb|EFI11276.1| conserved hypothetical protein [Bacteroides sp. D22] Length = 289 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|291398585|ref|XP_002715927.1| PREDICTED: chloride channel accessory 2 [Oryctolagus cuniculus] Length = 940 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 76/212 (35%), Gaps = 37/212 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS M + ++ ++ L ++ I G+ +FS+K E Sbjct: 312 VCLVLDVSSKM-----AEADRLLRLQQAAEFFLMQIVEIHTF-----VGIASFSSKGEIR 361 Query: 234 FLLE--WGVSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L+ ++ + +L + GLK + + + G Sbjct: 362 APLQQINSRDDRKQLVSHLPTTVSTEAEISVCSGLKKGFQVVEKLNGKAYGA-------- 413 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 +I+ +T G + S+ G+ V++I + + L F+ Sbjct: 414 -VIILVTSGADEHVGSCLHSVLT------SGSTVHSIALGSSAARTLEELSRRTGGLKFF 466 Query: 347 LVE--NPHSMYDAFSHIGK---DIVTKRIWYD 373 + + N +S+ DAFS I DI + I + Sbjct: 467 VPDQSNSNSLIDAFSRISSGTGDIFQQSIQLE 498 >gi|74182490|dbj|BAE42868.1| unnamed protein product [Mus musculus] Length = 219 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 27/141 (19%), Positives = 50/141 (35%), Gaps = 10/141 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D A ++ ++ ++ + + LVTF Sbjct: 4 LLFLIDTSASMNQRSHLGTTYLDTAKGAVETFMK-LRARDPASRGDRYMLVTFEEPPY-A 61 Query: 234 FLLEWGVSHL-------QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 W +H + + L+ G S + L + + Q N Sbjct: 62 IKGGWKENHATFMNELKNLQAEGLTTLGQSLRTGFDLLNLNRLVTGIDNYGQGRNPFFLE 121 Query: 287 YKKIIVFMTDGENLSTKEDQQ 307 II+ +TDG L+T Q Sbjct: 122 PA-IIITITDGSKLTTTSGVQ 141 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 36/219 (16%), Positives = 73/219 (33%), Gaps = 15/219 (6%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S I P+ + + + + + I+LD S SM + K D+A + + + Sbjct: 16 TLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVEGER-KFDLAKQEVFKFAQS 74 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + ++ L F ++ + + R + + + + + N Sbjct: 75 L----PKDAKIRMSL--FGSEGNNKNSGKAQSCEVIRGVYGVQPYEKESFENSLNELGPN 128 Query: 269 QIFDMQGMRQHCNTEDANY---KKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + +H D K IV++ TDGE + + + +K V Sbjct: 129 GWTPIARALEHAKQADEQLNNGTKHIVYLITDGEETCGGDPVKVAKELHNSKGSTV-VNV 187 Query: 325 IGIRVIRSHE-FLRACASP--NSFYLVENPHSMYDAFSH 360 IG+ +E L+ A +Y M S Sbjct: 188 IGLDFNDGYEGQLKQVAKAGKGHYYQASTGKEMGSILSA 226 >gi|160882772|ref|ZP_02063775.1| hypothetical protein BACOVA_00733 [Bacteroides ovatus ATCC 8483] gi|237720678|ref|ZP_04551159.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|156111796|gb|EDO13541.1| hypothetical protein BACOVA_00733 [Bacteroides ovatus ATCC 8483] gi|229449513|gb|EEO55304.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] Length = 289 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|52840865|ref|YP_094664.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52627976|gb|AAU26717.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 1172 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 23/179 (12%), Positives = 54/179 (30%), Gaps = 51/179 (28%) Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ-------HCNTEDANYKKIIV 292 + + + +L ST++T A R + + K+ I+ Sbjct: 353 TNAINAFLPHLRPETNSTSTTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYII 412 Query: 293 FMTDG----------------------------------ENLSTKEDQQSLYYCNEAKKR 318 ++DG + +++ ++ N K Sbjct: 413 LISDGLPTQDLQARYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKND 472 Query: 319 GAIVYAIGIRV-------IRSHEFLRACA---SPNSFYLVENPHSMYDAFSHIGKDIVT 367 G + + IG+ + LRA A ++Y +P ++ ++ + I +I Sbjct: 473 GVLTFVIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 531 >gi|42524019|ref|NP_969399.1| norD protein [Bdellovibrio bacteriovorus HD100] gi|39576227|emb|CAE80392.1| norD protein [Bdellovibrio bacteriovorus HD100] Length = 609 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 76/229 (33%), Gaps = 29/229 (12%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV-KVNSQTDARLDMMIVLDVSRSMESFF 188 + P + + +P + + + + L ++IVLD+S S +S+ Sbjct: 380 WRKGQLDGPELDLDALVRHVGDVQNKIPSPGRLYQTQVKRERDLQIVIVLDLSLSTDSYV 439 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIK 248 S +D ++++ + ++ N +V +G + + F +L+ Sbjct: 440 -SDRRVLDTELEAVG-LWGLLQPSGPDNTLV-AGAFSETRHKCAFEILKDQGEDWSAYFS 496 Query: 249 YLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED- 305 + T P L++A + + +K+++ +TDG+ Sbjct: 497 RAQQIVPRGYTRLGPALRHATRILRECSA-----------RQKVLIILTDGKPTDYDGYE 545 Query: 306 -----QQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVE 349 + C EA+ A I H F P FY E Sbjct: 546 GRYGIEDMRKACMEAESAQISTRAFAIEKAAKHYF------PQMFYSFE 588 >gi|84499825|ref|ZP_00998113.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] gi|84392969|gb|EAQ05180.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] Length = 244 Score = 39.8 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 76/231 (32%), Gaps = 47/231 (20%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFF--DSSITKIDMAIKSINAMLEEVKLIPDV 215 T + ++N+ + D M+V D S SM F +T+I A ++I +P V Sbjct: 17 TTPAPRLNAGSGCATDAMLVFDGSGSMVEFGYDPRQVTRIREAREAIRH------AMPLV 70 Query: 216 NNVVQSGLVTFS-------NKIEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 V + GL+ + + I+ F + R + LS G++ + Sbjct: 71 APVRRIGLLIYGPNDGDSCSGIDLRFPPRPDAADPVIRAVDALSPGGLTPLA-------- 122 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + + H IIV +TDG N + + A+ R ++ IG Sbjct: 123 RSVGVAARVLDHREKAG-----IIVVVTDG-NETCGGRPCATGAALAAEARDLTIHVIGF 176 Query: 328 RV---------IR-----SHEFLRAC---ASPNSFYLVENPHSMYDAFSHI 361 R + + C + + E + +A Sbjct: 177 RALVDYWTWDNPEQEAHVGEDTVARCLAEKTGGMYVRTETVGELVEALQAT 227 >gi|293373988|ref|ZP_06620329.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] gi|292631064|gb|EFF49701.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f] Length = 270 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 53 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 106 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 107 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 156 >gi|148234334|ref|NP_001089951.1| integrator complex subunit 6-A [Xenopus laevis] gi|126258172|sp|Q2TAF4|INT6A_XENLA RecName: Full=Integrator complex subunit 6-A; Short=Int6-A gi|83405213|gb|AAI10953.1| MGC132177 protein [Xenopus laevis] Length = 883 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S SM T +D+A ++ ++ ++ + + LVT Sbjct: 4 LLFLLDTSASMNQRSHLGTTYLDIAKGAVETFMK-LRSRDPASRGDRYMLVTVEEPPYG- 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAVGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGE 298 II+ +TDG Sbjct: 121 EPS-IIIVITDGS 132 >gi|307720885|ref|YP_003892025.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978978|gb|ADN09013.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 515 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 33/203 (16%), Positives = 70/203 (34%), Gaps = 34/203 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +I LDVS+SM + D +++ +A + +++ N + G++ F+ Sbjct: 87 SAVIALDVSKSMHAS-DIYPSRLGLARLKLLKFIQKA-------NNLHVGILIFAKNSYM 138 Query: 233 FFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + L ++ + K +TN L+ + + K Sbjct: 139 LYPLSEDTQALAYMLKNADIKQKLEPNTNLFGVLESGKKML-------------EKEKTK 185 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV- 348 I+ +TDG ++ D+ S K +YAI ++ +Y+ Sbjct: 186 NIILLTDGGEDVSRADEISYIL-----KNHLKLYAIDFGPKPNNSLKNMTQKSEGYYMKY 240 Query: 349 ----ENPHSMYDAFSHIGKDIVT 367 + + DA + I+T Sbjct: 241 QWTQSDIEGILDAIQKSSQKIIT 263 >gi|226941831|ref|YP_002796905.1| VCBS [Laribacter hongkongensis HLHK9] gi|226716758|gb|ACO75896.1| VCBS [Laribacter hongkongensis HLHK9] Length = 1087 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 32/156 (20%), Positives = 60/156 (38%), Gaps = 22/156 (14%) Query: 174 MMIVLDVSRSM------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++ LDVS SM + ++++ +A ++I+ +L++ + D V+ LV FS Sbjct: 264 LLLTLDVSGSMNDPSGVKDANGKDLSRLALAKQAISQLLDQYDALGD----VKVQLVKFS 319 Query: 228 NKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 W + + + ++K TN L A Q F G Sbjct: 320 EG-GSVQSNNWMTVAEAKAALAGITKGDGGTNYDEALDLA-RQAFAKPGQLDGA------ 371 Query: 287 YKKIIVFMTDGENLSTKEDQQ--SLYYCNEAKKRGA 320 K + F +DG+ + Q+ S + K G Sbjct: 372 -KNVSYFFSDGDPTLSNSGQKNNSGATVDPDKGDGI 406 >gi|297191393|ref|ZP_06908791.1| von Willebrand factor type A [Streptomyces pristinaespiralis ATCC 25486] gi|197722324|gb|EDY66232.1| von Willebrand factor type A [Streptomyces pristinaespiralis ATCC 25486] Length = 800 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 25/239 (10%), Positives = 66/239 (27%), Gaps = 51/239 (21%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITK 194 +P+ C + + + +D +V+D S S+ + + Sbjct: 8 GVPVLICALLILPAAAPGQAAGPLGRAAAPASDGPEPIDFAVVVDQSESLSEKDLAREVE 67 Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-----------FFLLEWGVSHL 243 I +++ ++ ++ F + + G L Sbjct: 68 AASLI-----------TQGEISARSRAAVIGFGSSEKPGQSPVSEVCALTTADTAGRQRL 116 Query: 244 QRKIKYL-----SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 ++ L K G T+ ++ A +++ + D + K++ +TDG Sbjct: 117 SDCVQQLAHRDDRKTGPGTDFPAAVRQAVDRLTEN---------GDKDVPKVVFLLTDGR 167 Query: 299 --------------NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + ++ A++ ++ +G L A Sbjct: 168 LDVSDSPEYGTDEASRQANGAKRLTEELARARREFVQIWPLGFGARIDRAALTRMAEGG 226 >gi|154174230|ref|YP_001408797.1| von Willebrand factor type A domain-containing protein [Campylobacter curvus 525.92] gi|112803709|gb|EAU01053.1| putative von Willebrand factor type A domain [Campylobacter curvus 525.92] Length = 231 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 57/148 (38%), Gaps = 17/148 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 + + RL+M+++LD S SM D +I + D+N + Sbjct: 19 EIAPETSAPNRLEMVLILDKSGSMSGLED----------DTIGGFNSMIDKQKDLNITTK 68 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 V F K + ++Q+ + G +T + N+I ++ G Sbjct: 69 VTTVLFDTKFN-VIHDREDIKNVQKLTSNEYRAGGNTALLDAIGSTINKIENVSG----- 122 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQS 308 D N + + V +TDG+ S+KE ++ Sbjct: 123 -IYDKNSRVLFVIITDGQENSSKEYTKA 149 >gi|113460773|ref|YP_718840.1| hypothetical protein HS_0628 [Haemophilus somnus 129PT] gi|112822816|gb|ABI24905.1| conserved hypothetical protein, with von Willebrand factor (vWF) domain [Haemophilus somnus 129PT] Length = 212 Score = 39.8 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 49/136 (36%), Gaps = 20/136 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + ++ ++ P ++TF + Sbjct: 3 RLPVYLLVDTSGSM------MGEPIESVRSGLQTLVSALRQDPYALETAYLSVITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 + L + ++ + G++ + A + D + + N A K Sbjct: 57 RQVTPL---TDLMSFQLPSIEASGLT-----AMGEALGLLADC--VNREVNKGSAEVKGD 106 Query: 289 -KIIVF-MTDGENLST 302 K +VF +TDG Sbjct: 107 WKPVVFLLTDGIPTDD 122 >gi|225010239|ref|ZP_03700711.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C] gi|225005718|gb|EEG43668.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C] Length = 289 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 37/162 (22%), Positives = 63/162 (38%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+V+DVS S K + + + N + G++ F Sbjct: 72 EEERELTLMMVVDVSGS------GLFGTTTAFKKDLLIEITATLAFSALQNNDKVGVLLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +++IE F + G SH+ R I+ L T+ + LKY N + Sbjct: 126 TDQIELFIPPKKGKSHILRIIRELLEFKPKSQKTDLSFALKYLGNVLK------------ 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D ++E QSL + G +Y Sbjct: 174 ----KKAIVFVLSD---FISEEYLQSLRIVGKKHDLTGIRIY 208 >gi|115496418|ref|NP_001068821.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] gi|122142424|sp|Q0VCM5|ITIH1_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H1; Short=ITI heavy chain H1; Short=ITI-HC1; Short=Inter-alpha-inhibitor heavy chain 1; Flags: Precursor gi|111308523|gb|AAI20097.1| Inter-alpha (globulin) inhibitor H1 [Bos taurus] Length = 906 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 69/195 (35%), Gaps = 26/195 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEE 232 ++ V+D+S SME K+ ++++ +L +++ + V+ V ++ + + Sbjct: 288 VVFVIDISSSMEG------QKLKQTKEALHKILGDMRPGDYFDLVLFGSAVQSWKGSLVQ 341 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ S +TN GL + Q E +N+ I++ Sbjct: 342 ASP--ANLEAARNFVQQFS-LAGATNLNGGLLRGIEIL----NKAQQSLPELSNHASILI 394 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAI----VYAIGIRVIRSHEFLRACASPNS---- 344 +TDGE D+ + G +Y +G FL + N+ Sbjct: 395 MLTDGEPTEGVMDRT--QILKNVRD-GIKGRFPLYNLGFGHDVDLNFLEVMSLENNGRVQ 451 Query: 345 -FYLVENPHSMYDAF 358 Y + F Sbjct: 452 RIYEDHDATQQLQGF 466 >gi|296474790|gb|DAA16905.1| inter-alpha-trypsin inhibitor heavy chain H1 precursor [Bos taurus] Length = 906 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 69/195 (35%), Gaps = 26/195 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEE 232 ++ V+D+S SME K+ ++++ +L +++ + V+ V ++ + + Sbjct: 288 VVFVIDISSSMEG------QKLKQTKEALHKILGDMRPGDYFDLVLFGSAVQSWKGSLVQ 341 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + ++ S +TN GL + Q E +N+ I++ Sbjct: 342 ASP--ANLEAARNFVQQFS-LAGATNLNGGLLRGIEIL----NKAQQSLPELSNHASILI 394 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAI----VYAIGIRVIRSHEFLRACASPNS---- 344 +TDGE D+ + G +Y +G FL + N+ Sbjct: 395 MLTDGEPTEGVMDRT--QILKNVRD-GIKGRFPLYNLGFGHDVDLNFLEVMSLENNGRVQ 451 Query: 345 -FYLVENPHSMYDAF 358 Y + F Sbjct: 452 RIYEDHDATQQLQGF 466 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 39/219 (17%), Positives = 76/219 (34%), Gaps = 15/219 (6%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S I P+ + + + + + I+LD S SM + K D+A + + + Sbjct: 16 TLSMSISSPVFAKASTVKKHNKDVRVTILLDASGSMARKVEGER-KFDLAKQEVFKFAQS 74 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS--TNSTPGLK-Y 265 + + ++ L F ++ + + R + + + NS GL Sbjct: 75 L----PKDAKIRMSL--FGSEGNNKNSGKAQSCEVIRGVYGVQPYEKESFENSLNGLGPN 128 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + I + + + N K IV++ TDGE + + + +K V Sbjct: 129 GWTPIARALEHAKQTDEQLNNGTKHIVYLITDGEETCGGDPVKVAKELHNSKGSTV-VNV 187 Query: 325 IGIRVIRSHE-FLRACASP--NSFYLVENPHSMYDAFSH 360 IG+ +E L+ A +Y M S Sbjct: 188 IGLDFNDGYEGQLKQVAKAGKGHYYQASTGKEMGSILSA 226 >gi|123491786|ref|XP_001325914.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908821|gb|EAY13691.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 753 Score = 39.8 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 36/257 (14%), Positives = 83/257 (32%), Gaps = 41/257 (15%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 +++ ++ V + Y + +K ++ +I + + Sbjct: 179 SAKGTMNVIDPHNVTFVT----KTYPNDRSIIIEARIKDKDKSIAISSDGYISISTYTFF 234 Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + Q + ++D S SM ++I A +N +L + + + Sbjct: 235 EGKVQANTEF--YFIIDCSGSMYG------SRIKNAKSCLNVLLHSLPIG------CRFS 280 Query: 223 LVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ F K E +S ++ + + LKY + + Sbjct: 281 IIKFGTKFEVALEPCDYTDENMSKAMHQLDLIDADMCGNDMISPLKY----------ISE 330 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 H +D Y K + +TDGE +D+ S+ +A + V+ IGI + Sbjct: 331 HPQKKD--YIKQVFLLTDGE-----DDRISICAMVQANRDNFRVFTIGIGSDADRNLIID 383 Query: 339 CA--SPNSFYLVENPHS 353 A + +++ Sbjct: 384 VARNGSGRYIFIDDEDE 400 >gi|291232483|ref|XP_002736186.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii] Length = 1797 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 64/183 (34%), Gaps = 33/183 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + I++DVS SME +D+ + + ++EE N+++ G + + Sbjct: 523 QVYILIDVSNSMEPH-------LDLVKEKLIRLMEEQLRHKMKFNLIKFGTRAMTWRDRM 575 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + E + ++ L+ G STN+ LK A + + + Sbjct: 576 VDVNEANLHSAWSWVRGLTVTG-STNTLSALKLALSD----------------PNTQAVY 618 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI--RSHEFLRACASP---NSFYL 347 +TDG ++ + + ++ I +++FL A+ Y Sbjct: 619 LLTDGRPDMPQKTVLAQVQLQQ----KVPIHTISFNCADTEANQFLAQLAADTGGRYHYF 674 Query: 348 VEN 350 E Sbjct: 675 SEQ 677 >gi|270014558|gb|EFA11006.1| hypothetical protein TcasGA2_TC004591 [Tribolium castaneum] Length = 1235 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 84/223 (37%), Gaps = 23/223 (10%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 ++ + T D++I++D S SM+ +A ++N +L+ +N + S Sbjct: 226 IEAATCTK---DVIILVDNSGSMDGMGR------HIASLTVNTILDTFSNNDYINILYYS 276 Query: 222 GLVT-FSNKIEEFFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 T ++ L++ ++ + I++L G T+ L+ A++ + + + +R Sbjct: 277 NQTTNYTIPCFRNLLVQATPENIVLFKEAIRHLGPSGK-TDFPQALQMAFDILENYREIR 335 Query: 278 QHCNTEDANYKK------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRV 329 N E K I+ +TDG + + + + K ++ IG V Sbjct: 336 GCNNEEIDEEGKSKACNQAIMLITDGISRNFSDIVMRNNQLDGGKTIPVRIFTYLIGKEV 395 Query: 330 IRSHEF-LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E ACA+ + V+ + A I + Sbjct: 396 TNVEEIRWMACANRGFYTQVQTLEQVTSAVLQYINVIARPLVL 438 >gi|239980987|ref|ZP_04703511.1| putative surface-anchored fimbrial subunit [Streptomyces albus J1074] Length = 893 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 35/268 (13%), Positives = 69/268 (25%), Gaps = 55/268 (20%) Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT- 159 N + + YS +A F+ S T Sbjct: 88 NDTFRTGGTVTTPYRFQTPRLAAATTYSSTATG-------PGGFMVGSGGSNRTASTGTW 140 Query: 160 --SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S V L + +++D+S SM + + + ++ ++ P ++ Sbjct: 141 QSSRVNPALPERCGLRVALIMDLSGSMSG-------SVPALKTAADTFVDALQGTP--SS 191 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG-- 275 V + ++S G ++ + + K Y + Sbjct: 192 VARFTFSSYS-------PATRGGANAPGLVSV-----STLADAAAFKRTYASWTNATAEG 239 Query: 276 ----MRQHCNTEDANYK-KIIVFMTDGENLSTKED-----------------QQSLYYCN 313 R A + + V +TDG + + + N Sbjct: 240 STNWDRALYEPASATSQYDVAVVITDGMPTNYSIPGGPSGGASGSVTRFRELESGIASAN 299 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACAS 341 K G V A+G+ S AS Sbjct: 300 ALKNEGTRVLAVGVGEGTSGNAALNLAS 327 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 16/199 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 121 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYL---DKIPDNANVM---LRVYG 173 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ +K + + + + E Sbjct: 174 HKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKEY 233 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 ++ ++DGE + + N++ A+V IG V S + L+ A Sbjct: 234 TGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQLKNTAE 292 Query: 342 P--NSFYLVENPHSMYDAF 358 ++ V + ++ Sbjct: 293 AGKGNYATVSSADELHQTL 311 >gi|198420230|ref|XP_002121268.1| PREDICTED: similar to Fc fragment of IgG binding protein [Ciona intestinalis] Length = 2313 Score = 39.8 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 43/138 (31%), Gaps = 23/138 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +D + SM +I+ A ++A++ E + + L F++ Sbjct: 418 LAFAIDTTGSMSG-------EINAAKSRVSAIINE--TTGTTDQPLFFVLSEFNDPRVGP 468 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + + L G L YN + K+ +VF Sbjct: 469 TLATTSPTAILTGLNALRASGGGDEPELALAGIYNALIAST-------------KRSVVF 515 Query: 294 M-TDGENLSTKEDQQSLY 310 M TD + + + Sbjct: 516 MITDATAKDVNLETRVIA 533 >gi|327458562|gb|EGF04912.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1] Length = 463 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 59/158 (37%), Gaps = 31/158 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSS--------ITKIDMAIKSINAMLEEVKLIPDVNNV 218 Q + + V D S SM+ +++ + K M++++K I +++ Sbjct: 193 QGQMNIAVSFVFDTSGSMDWDLQGRNVNPNSGTESRMTILRKKAEIMIKDLKGIGNIS-- 250 Query: 219 VQSGLVTFSNK----IEEFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIF 271 LV FS+ +EF L+ G + I K L GV TN GL+Y + Sbjct: 251 --VNLVGFSSSGKYIQKEFSNLDNGADTIIGTIKDPKKLVPDGV-TNPGDGLRYGLISLQ 307 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 K +V +TDG + D +L Sbjct: 308 SQPAQL-----------KYVVLLTDGIPNTYIVDPSAL 334 >gi|299145606|ref|ZP_07038674.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] gi|298516097|gb|EFI39978.1| conserved hypothetical protein [Bacteroides sp. 3_1_23] Length = 289 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|254430946|ref|ZP_05044649.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] gi|197625399|gb|EDY37958.1| structural toxin protein RtxA [Cyanobium sp. PCC 7001] Length = 2003 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 38/276 (13%), Positives = 81/276 (29%), Gaps = 37/276 (13%) Query: 29 IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKN 88 I+ +G +I ++ HS +D ++ + N+ + G I N Sbjct: 794 ILAGAVGQVIYLA-------QDHSAVDNAVTASLASGANDRA-TLQVNADGTYTFTLIDN 845 Query: 89 TWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY 148 + ++ + + D N Sbjct: 846 FLLSDPGDTTEQTESISSLVGGINILVEDGDGDAANGT---------------TGIALSL 890 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLE 207 I + + S ++ +V+DVS SM + +T++ + I S +++ Sbjct: 891 LVKDDIPTAVPITESGESFPTGT-NLFLVIDVSGSMANASGVDGMTRMQLQINSALELID 949 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYA 266 + + + + +VTF+ F W + ++ I+ L TN L Sbjct: 950 QYEALGPLK----VNVVTFATDASAPFSTTWQDADAVKTFIQTLVPT-SRTNYDAALNLT 1004 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 N T ++ F +DG Sbjct: 1005 INTFNGGTASDIDGATN------VLYFFSDGVPNEN 1034 >gi|168047962|ref|XP_001776437.1| predicted protein [Physcomitrella patens subsp. patens] gi|162672163|gb|EDQ58704.1| predicted protein [Physcomitrella patens subsp. patens] Length = 795 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 73/205 (35%), Gaps = 20/205 (9%) Query: 173 DMMIVLDVSRSMES------FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 D++I+LD SM F +TK D +I I A+L+ + + +V F Sbjct: 371 DLVILLDGGNSMGDDLPVDIFISKGVTKFDTSINIIKALLDTLTYGD------RVSVVLF 424 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE-DA 285 S+ E + + + K + S + + N + + + Q E ++ Sbjct: 425 SSSTEPYLVYNTTTATYNTTFMNPLKDELDKLSVDAKQGSSNFLSGILKVNQTFLPESNS 484 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACASPN 343 N KI+V +TDG + Y + + R + S L AC Sbjct: 485 NALKIMVVLTDGHFADDTTLTSTSPYLTQLRSRRVLPLFYSFDRDLSKTVLPQVACNVSG 544 Query: 344 SFYLVENPH-----SMYDAFSHIGK 363 ++ +E ++ F I + Sbjct: 545 TYERIEKTVLNPLWTLRSYFGIIAR 569 >gi|332970145|gb|EGK09139.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 485 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 40/210 (19%), Positives = 75/210 (35%), Gaps = 36/210 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++MI+LD S SM + K+D+A K++ ++ LV + Sbjct: 171 ESQEVNIMILLDSSGSMADKVKGGV-KMDLAKKAVKEF------ASNMPEGANVSLVVYG 223 Query: 228 NK--------------IEEFFLL-EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +K IEE + L + Q + G T +K A ++ Sbjct: 224 HKGSNAGADKKVSCESIEEIYPLAAYDGKTFQGSLDKFQPTGW-TPLAGSMKLAQEKLAS 282 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IR 331 G N + I+ ++DG + ++ NE+ A+V IG V Sbjct: 283 HTGS---------NVQNIVYVVSDGVETCDGDPVKAAEELNES-NMKAVVNIIGFDVDNE 332 Query: 332 SHEFLR--ACASPNSFYLVENPHSMYDAFS 359 + L+ A A ++ V + + + F Sbjct: 333 GQKALKEVADAGGGTYKTVGSKVGLQEYFE 362 >gi|330684715|gb|EGG96413.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis VCU121] Length = 629 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 34/301 (11%), Positives = 91/301 (30%), Gaps = 40/301 (13%) Query: 39 EVSHIFFMKTV-LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 +++ + K + +DR E G + + Sbjct: 313 DMTDMMTKKGKGSQNTLDR-----------EEGGFIGQNSAFALDGINQNVDIKWKVPDI 361 Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 L + ++ +D+ +I + Q I + + I Sbjct: 362 LPE--YIQAYEDVKNDVQFEIKDLIQ----IIKKTIEREHQDERHNLTKGRLQKNLINWF 415 Query: 158 ITSSVK-----VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 I K + +++D S SM D +I + + +++ + + +++ Sbjct: 416 IDDQYKLFYKKQDLSQSFDATFTLLIDASASMHDKMDETIKGVVLFHETLKELNVKHEIL 475 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQR-KIKYLSKFGVSTNSTPGLKYAYNQIF 271 + S N I+E ++ +I L + ++ A +++ Sbjct: 476 AFNEDAFDSDDTKQPNIIDEIIHYDYSTLKKDGPRIMALEPQDDN-RDGVAIRIASDRL- 533 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGI 327 +++++ ++ +DGE + Q + A+K G V+ + + Sbjct: 534 ----------IRRSHHQRFLIVFSDGEPSAYNYSQDGIIDTYEAVENARKFGIEVFNVFL 583 Query: 328 R 328 Sbjct: 584 S 584 >gi|260170241|ref|ZP_05756653.1| hypothetical protein BacD2_00075 [Bacteroides sp. D2] gi|315918604|ref|ZP_07914844.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313692479|gb|EFS29314.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 289 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|237716503|ref|ZP_04546984.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408101|ref|ZP_06084649.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645095|ref|ZP_06722821.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294809496|ref|ZP_06768199.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|229444150|gb|EEO49941.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262354909|gb|EEZ04001.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639601|gb|EFF57893.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a] gi|294443314|gb|EFG12078.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b] gi|295084187|emb|CBK65710.1| Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [Bacteroides xylanisolvens XB1A] Length = 289 Score = 39.8 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTIKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|253687077|ref|YP_003016267.1| hypothetical protein PC1_0676 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753655|gb|ACT11731.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 543 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 60/223 (26%), Gaps = 26/223 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F KG T + + + I++ S T + D + + Q G Sbjct: 28 FIQQEKGAGTGFYVLGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATING- 86 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 +N L N +++ + + + Y+++ Sbjct: 87 --------EEYSQEETNKLAYEYVKNNLGMNKALSEKLVASDVSVAEGRNSATRKTYTVT 138 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 K L S V+ + + +V+ V+ M D+ Sbjct: 139 VAFETKPSLMSLGARKQEVYSTSEVINRPTEI------------ALVMPVTGDMS---DA 183 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I + +S + + + LV +S + + Sbjct: 184 DIRSLKSVSRSFVE--RMLSSADGKRDNLWLSLVPYSQSVNVY 224 >gi|296109872|ref|YP_003616821.1| von Willebrand factor type A [Methanocaldococcus infernus ME] gi|295434686|gb|ADG13857.1| von Willebrand factor type A [Methanocaldococcus infernus ME] Length = 370 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 64/182 (35%), Gaps = 35/182 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I LDVS SM + + +AI ++ +N + ++ F + + E Sbjct: 214 LVICLDVSGSMRGKKEEWAKALALAITNL-----------SINEGKKVHIIFFDDGVREV 262 Query: 234 FLLEWGVSHLQRKIKYLSK--FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 I Y++ +G TN LK A D I Sbjct: 263 KEF--NKKLSMNDILYIASVFYGGGTNFEKPLKKAMEYNGD------------------I 302 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENP 351 +F+TDGE + + + E KKRG +Y+ I ++ + + Y + Sbjct: 303 IFITDGE--AEISKNFTNKFIEEKKKRGIKLYSFFINTKSTYSLKKLSDLSLTIYEINER 360 Query: 352 HS 353 + Sbjct: 361 SA 362 >gi|145497443|ref|XP_001434710.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401838|emb|CAK67313.1| unnamed protein product [Paramecium tetraurelia] Length = 1279 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 30/202 (14%), Positives = 58/202 (28%), Gaps = 30/202 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++V+D S SM K + +++I E++ P+ + L+ FS+ Sbjct: 1100 FILVIDESGSMAG------QKWKILMEAIQQCFIELRKYPNN----RISLIQFSDDARFV 1149 Query: 234 FLLEWGVSHL--QRKIKYLSKFGVSTNSTPGLK---YAYNQIFDMQGMRQHCNTEDANYK 288 + Q ++K TN YA + Sbjct: 1150 VG-SNQPEEIPQQEQVKQFQMMSGGTNFENAFLLVFYAIQRCISQFD------------F 1196 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLV 348 + +VF TDG+ N+ ++ + I + S + C Sbjct: 1197 QTVVFYTDGDADYPNVSMDLFAQVNQELRQKIDIL-ICTEIKESKSLQKVCQVFQQKMGK 1255 Query: 349 ENPHSMYDAFSHIGKDIVTKRI 370 IG+ + K Sbjct: 1256 GTLKENVKI-DQIGQLLQEKVC 1276 >gi|331266729|ref|YP_004326359.1| putative peptidoglycan-binding domain-containing protein [Streptococcus oralis Uo5] gi|326683401|emb|CBZ01019.1| putative peptidoglycan-binding domain-containing protein [Streptococcus oralis Uo5] Length = 459 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 38/287 (13%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 ++ A +I + N+ K G+++ K + + + +++ Sbjct: 55 EANKSAVEAAVAEIKADEAKPVVNQSDAKPGELVDVSKKVSPIDVKERQEGDQKIHEETA 114 Query: 110 IV--RSTSLDIVVVPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V T + V + + S +S T + + + + Sbjct: 115 TVEVTKTEVAPVKSEELPAPKTETTESTVSGKDEDGNSYTQYERVDKTTTVEIKKATRTI 174 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---LIPDVNNVVQ 220 S + D++ V+D S SM D+ I+ +++ + + Sbjct: 175 SKSNSA---DIVFVVDRSASMSRTIDTVRNNINEFARNLAKDGVAARFGLATFSDEVYGR 231 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVS--TNSTPGLKYAYNQIFDMQ 274 G + I F + S K L+ G STP L + + Sbjct: 232 IGGLADEGTILTKFNESYFTSDPAELEKALAGIKLAHGGDFPETSTPTLTQIVSTYDWSK 291 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + KK +V +TD K Sbjct: 292 SPKN---------KKFVVLLTDAPMKEDPSIPTMANTLASLKAAKIE 329 >gi|313681553|ref|YP_004059291.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154413|gb|ADR33091.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 507 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 74/206 (35%), Gaps = 27/206 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +A D++ +D+S SM D + +++++A K + ++ + + G Sbjct: 52 KPVTAEEAGSDVIFAVDLSYSMRG-TDLAPSRLEVAKKLLYDVVRS-------DQKDRFG 103 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ F+ L L+ L + + T T + ++ H Sbjct: 104 VIGFTTSAIVLSPLTKDTEMLEHLFSSLDESQIITKGTNVM-----SALELSRKMSHSAR 158 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 I++ +TDG + ++ E + S + + + + S Sbjct: 159 P------IVILLTDGGDEASYEKESSFV-----RDNNLALSVVMLATNNGSTLP---TSE 204 Query: 343 NSFYLVENPHSMYDAFSHIGKDIVTK 368 S EN H + A + ++IV K Sbjct: 205 GSSLKDENGHIVVSARNDAIEEIVNK 230 >gi|295395240|ref|ZP_06805448.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] gi|294972002|gb|EFG47869.1| von Willebrand factor type A (vWA) domain protein [Brevibacterium mcbrellneri ATCC 49030] Length = 328 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 60/155 (38%), Gaps = 11/155 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLI 212 I P + V + +A + +V+DV+ SM +D S +++ K + +++ + Sbjct: 51 IARPGIPTEVVVLKQEASAQVYLVVDVTASMIAEDWDGSEPRLEGLKKDLTELVDAMPGA 110 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + L+TF ++ L + ++ I L+ ++ +S +A + Sbjct: 111 -------KFSLITFGSESHVRVPLTTDDAAVKSAISILAP-EITRSSAGTSPFAPAETVS 162 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + + + + + DGE S K Sbjct: 163 SRLAKGQEAHPGED--QYVFYFGDGEKTSEKGADS 195 >gi|86608199|ref|YP_476961.1| magnesium chelatase, ATPase subunit D [Synechococcus sp. JA-2-3B'a(2-13)] gi|86556741|gb|ABD01698.1| magnesium chelatase, ATPase subunit D [Synechococcus sp. JA-2-3B'a(2-13)] Length = 672 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I+ N ++ A R P + + IV P K + L ++ V Sbjct: 417 ILPQGSNPRVAVDATLRAAAPYQ-KSRRQRQPGRSVIVEPSDIRAKKLVRKAGAL-VIFV 474 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLL 236 +D S SM ++ ++ A + +L E N + L+ F + E Sbjct: 475 VDASGSM------ALNRMQAAKGAALRLLTEA-----YQNRDKVALIAFRGETAEVLLPP 523 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ-GMRQHCNTEDANYKKIIVFMT 295 + +R+++ L G S L +A Q M R + ++V +T Sbjct: 524 TRSIELARRRLESLPCGGGS-----PLAHALTQAVRMGVNARAAGDVGQV----MLVALT 574 Query: 296 DGENLST 302 DG Sbjct: 575 DGRANIP 581 >gi|115522260|ref|YP_779171.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] gi|115516207|gb|ABJ04191.1| von Willebrand factor, type A [Rhodopseudomonas palustris BisA53] Length = 372 Score = 39.8 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 25/187 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + + +++ VLD + SM + + KI SI + Sbjct: 14 TLAVPLAFASPALAKPVVEVAFVLDTTGSMSGLIEGAKRKIWSIATSIVD--------EN 65 Query: 215 VNNVVQSGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAY 267 + V+ GLV + + + F L + L + + G + L A Sbjct: 66 PDAEVKLGLVAYRDIGDDYVTRTFDLTTDIQDLYANLLEMKARGGGDWPESVNEALDIAV 125 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N++ QG IVF+ Q + Y A R + + Sbjct: 126 NKLHWSQGKESCR----------IVFLVGDAPPHMDYAQDTKYPKTLALARQKDIIVNAV 175 Query: 328 RVIRSHE 334 + + + Sbjct: 176 QAGDARD 182 >gi|290979049|ref|XP_002672247.1| predicted protein [Naegleria gruberi] gi|284085822|gb|EFC39503.1| predicted protein [Naegleria gruberi] Length = 619 Score = 39.8 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 50/133 (37%), Gaps = 17/133 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +D++ VLD S S+++ ++ + N + I + G++ FS Sbjct: 1 MDIVFVLDGSSSIDAGNWINMQNF-----ASNLTVGFGSNISPTA--TKIGIIQFSTTTV 53 Query: 232 EFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L+ G S + I + STNS G+ A + + +K Sbjct: 54 THLDLKDGTSKTAVLNAISSAGQLKSSTNSILGINNAVSMVLSQGRKDI--------TRK 105 Query: 290 IIVFMTDGENLST 302 +++ +TDG + Sbjct: 106 LLIHITDGMSNLP 118 >gi|239979671|ref|ZP_04702195.1| hypothetical protein SalbJ_09552 [Streptomyces albus J1074] Length = 248 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 55/146 (37%), Gaps = 27/146 (18%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM ++ D ++ + ++ A L++ +P +V F Sbjct: 37 VYLVLDRSGSMRPYYRDGTVQHLAEQALALAAHLDDDGTVP---------VVFFDTDAHP 87 Query: 233 FFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + +++ L + +TN + + +H A Sbjct: 88 ATEVS--LDAYEGRVQELHERYGHMGTTNYAAAML----------EVIEHYTATGATAPA 135 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEA 315 ++F TDG + +E ++ L C A Sbjct: 136 FVIFQTDGGPDAKREAERVL--CRAA 159 >gi|226227556|ref|YP_002761662.1| hypothetical protein GAU_2150 [Gemmatimonas aurantiaca T-27] gi|226090747|dbj|BAH39192.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 302 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 56/164 (34%), Gaps = 27/164 (16%) Query: 175 MIVLDVSRSMESFFD-SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 M+VLDVS SM+ D S +TK+ A + +AM + D GLV F + + Sbjct: 87 MLVLDVSASMQWTGDPSRLTKLAYAERLASAMALLLLRQRDA-----VGLVRFDAGLRDV 141 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS--TPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + +R + S+ G S P L A + + Sbjct: 142 VPPRSHRTQWRRLMAAFSEPGGGAESQVAPALLQAGKLV---------------RRPGFV 186 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 V ++D + + + RG V + + +F Sbjct: 187 VLLSD----LLTDSEPVADAARTLRARGHEVLVLHVMDPAERDF 226 >gi|66818054|ref|XP_642720.1| hypothetical protein DDB_G0277329 [Dictyostelium discoideum AX4] gi|60470879|gb|EAL68851.1| hypothetical protein DDB_G0277329 [Dictyostelium discoideum AX4] Length = 798 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 46/167 (27%), Gaps = 21/167 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V S A ++++ D + SM T I I + + + G+ Sbjct: 269 VQSSMSADVEIVFCFDTTGSM-------ATIIKSVRTQIVQTVNRLMKDIPNIKIGIMGM 321 Query: 224 VTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQ 278 + + + + L + + I + L A + + Sbjct: 322 GDYCDGVNVLKVLDLSSNIDEIVSFINMVPNTSGGDEPEAYEYALYRAKELSWSEHTSKA 381 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 D+N E T + C++ RG +Y + Sbjct: 382 FVMIGDSNPH---------EPTYTNLNINWFKECDDLFDRGIKIYGV 419 >gi|240172543|ref|ZP_04751202.1| hypothetical protein MkanA1_24733 [Mycobacterium kansasii ATCC 12478] Length = 963 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 76/202 (37%), Gaps = 32/202 (15%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D++++LD S SM + KI A + +++ + + ++TF ++I Sbjct: 294 PRDVVLLLDRSHSMAGW------KIVAARRVSTRIVDTLSSTD------RFAVLTFGDRI 341 Query: 231 EEFFLLEWGVSHLQ-----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + L G+ R I++L++ ++ ++ Sbjct: 342 DRAGGLADGLVEASDRHRYRAIEHLARVDARGDT---------ELLAPLRQGLSLLRGSQ 392 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PN 343 ++V +TDG+ EDQ +E + V+ +G+ + FL A+ Sbjct: 393 GRDAVLVLITDGQ--VGNEDQLLRELSDELQH--VRVHTVGVDQAVNAGFLDRLANIGGG 448 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 LVE+ + +A + + I Sbjct: 449 HCELVESEDRLDEAMHAMQRRI 470 >gi|58616384|ref|YP_195514.1| tellurium resistance protein [Azoarcus sp. EbN1] gi|56315846|emb|CAI10490.1| tellurium resistance protein [Aromatoleum aromaticum EbN1] Length = 214 Score = 39.8 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 54/138 (39%), Gaps = 12/138 (8%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + RL + +++D S SM ++ + AM ++ P V + T Sbjct: 1 MSSSRRLPVYLLIDTSGSMRG------EPVESVNVGLRAMQTSLRQNPYAIETVHLSVTT 54 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLS-KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 F ++I++ L + I + +T L++ ++ + +RQ + Sbjct: 55 FDSQIKDVLPL---TALEDATIPEIVCPASGATLLGEALEHILDR--AKKEVRQSSAEQK 109 Query: 285 ANYKKIIVFMTDGENLST 302 ++ ++ MTDG+ T Sbjct: 110 GDWAPLLFIMTDGKPTDT 127 >gi|167646242|ref|YP_001683905.1| hypothetical protein Caul_2280 [Caulobacter sp. K31] gi|167348672|gb|ABZ71407.1| conserved hypothetical protein [Caulobacter sp. K31] Length = 249 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 63/174 (36%), Gaps = 27/174 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++ +DVS S E + T I + E+ Q + TFS+ Sbjct: 31 LIFDMDVSGSFEHEHEEGTTS--KLIARLVPYGMELDPDG------QMDVFTFSDGRHTV 82 Query: 230 --IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ---IFDMQGMRQH----- 279 + + +++ + + + T+ + L+ + Sbjct: 83 THVGTVTPDDCEDYIVRKVVGRVIGWNGGTDYSYVLERNLQHFGWLPASPEAGGFLSRFL 142 Query: 280 --CNTEDANYKK--IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 T + KK I++F+TDGEN ST + ++ E+++RG VY + + Sbjct: 143 GRPKTPEVRAKKRSIVIFVTDGEN-STSDHVRTTRILEESEQRGDQVYFLFVGA 195 >gi|159184836|ref|NP_354603.2| hypothetical protein Atu1609 [Agrobacterium tumefaciens str. C58] gi|159140120|gb|AAK87388.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 579 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 7/46 (15%), Positives = 18/46 (39%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSM 53 + F + G + I + P+ +L + ++ ++ K L Sbjct: 8 LPRFLKDKTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQT 53 >gi|75907289|ref|YP_321585.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701014|gb|ABA20690.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 201 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 66/171 (38%), Gaps = 18/171 (10%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ ++A +S A+ + + V FS + + Sbjct: 6 DYTLIIDKSGSMSTPDQAGGRSRWEIAQESTLALARKCEQFDPDGITVYL----FSGRFK 61 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S +I + +TN L+ A N F + + + I Sbjct: 62 RY---DDVTSAKVAQIFLENDPAGTTNLAGVLQDALNNYFQRKAAGKTKPNGET-----I 113 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKR----GAIVYAIGIRVIRSHEFLRA 338 + +TDGE K +++ + +R G + +G ++ +FL+A Sbjct: 114 LVITDGEPDDRKAVFETIIHATRQMERDEELGISIIQVG-SDAQATKFLKA 163 >gi|320009835|gb|ADW04685.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 254 Score = 39.8 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 54/143 (37%), Gaps = 19/143 (13%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +++K + T R + +VLD S SM F+ D S + ++ A L+ +P Sbjct: 50 AALKAHELTGLRATVYLVLDRSGSMRPFYKDGSAQHLGDRTLALAAHLDADATVP----- 104 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V FS I+ ++ +I L+ T N ++ + Sbjct: 105 ----VVFFSTDIDGTGAIDLTTHE--GRIDELNAGLGRLGRT-------NYHRAVEEVVA 151 Query: 279 HCNTEDANYKKIIVFMTDGENLS 301 H A +++F TDG + Sbjct: 152 HYEKSGATGPALVIFQTDGAPDA 174 >gi|328867084|gb|EGG15467.1| type A von Willebrand factor domain-containing protein [Dictyostelium fasciculatum] Length = 854 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 40/258 (15%), Positives = 74/258 (28%), Gaps = 38/258 (14%) Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRST---SLDIVVVPQNEGYSISAISRYKIP 138 I + + + E V +T S+ Q Y + K Sbjct: 214 IQNILVQPYKHNTVIEGNKATITVTKTTSVLATFLLSITPSSNQQKPLYHMEYNPTTKST 273 Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 TF P + H+ + + QT + ++D S SM F I+ A Sbjct: 274 ALGITFYPQF---DHLSID-------DIQTKGEF--IFLIDCSGSMAGFT------IEKA 315 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN 258 K++ ++ + N G S + ++ I + T Sbjct: 316 KKTLQILMRSLPTNSYFNIYCFGGTYK-SLFQTSVLYDDDNLAAASEYISSIDANMGGTQ 374 Query: 259 STPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 LK Y+ RQ I +TDGE D + + K+ Sbjct: 375 LISPLKRIYDFETFPNHPRQ------------IFIITDGE--IPNRDYVIDFVAS--KRD 418 Query: 319 GAIVYAIGIRVIRSHEFL 336 + ++ +G+ + Sbjct: 419 VSRIFTVGLGSDVDKALV 436 >gi|315608293|ref|ZP_07883283.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] gi|315250074|gb|EFU30073.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] Length = 342 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 64/174 (36%), Gaps = 24/174 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +V ++ +I LD+S SM + S + K + ++S+ V + Sbjct: 79 GAEVQRDKRNGIEAIICLDISNSMLAQDVAPSRLDKSKLLVESL------VDRFTNDK-- 130 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 GL+ F+ + + ++ + + T T + A N + Sbjct: 131 --IGLIVFAGDAYVQLPITSDYVSAKMFLQNIDPSLIQTQGTD-IAQAIN-------LGL 180 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 H T+ + I+ +TDGE+ + + +K+G V+ +G+ + Sbjct: 181 HSFTQADKIGRAIIVITDGEDHEGGAVEAAAEA----RKKGVNVFILGVGDTKG 230 >gi|189233997|ref|XP_971945.2| PREDICTED: similar to AGAP009579-PA [Tribolium castaneum] Length = 1056 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 80/212 (37%), Gaps = 20/212 (9%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-FSNKIE 231 D++I++D S SM+ +A ++N +L+ +N + S T ++ Sbjct: 234 DVIILVDNSGSMDGMGR------HIASLTVNTILDTFSNNDYINILYYSNQTTNYTIPCF 287 Query: 232 EFFLLEWGVSHL---QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L++ ++ + I++L G T+ L+ A++ + + + +R N E Sbjct: 288 RNLLVQATPENIVLFKEAIRHLGPSGK-TDFPQALQMAFDILENYREIRGCNNEEIDEEG 346 Query: 289 K------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRVIRSHEF-LRAC 339 K I+ +TDG + + + + K ++ IG V E AC Sbjct: 347 KSKACNQAIMLITDGISRNFSDIVMRNNQLDGGKTIPVRIFTYLIGKEVTNVEEIRWMAC 406 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 A+ + V+ + A I + Sbjct: 407 ANRGFYTQVQTLEQVTSAVLQYINVIARPLVL 438 >gi|145514483|ref|XP_001443148.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124410521|emb|CAK75751.1| unnamed protein product [Paramecium tetraurelia] Length = 625 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 64/182 (35%), Gaps = 35/182 (19%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV----TFSN--- 228 ++D S SM S ++I+ A +++ L+ + N++ G+ F++ Sbjct: 403 FIIDRSGSM------SGSRIEKAKQALILFLKSL-PQDSEFNIISFGIADIFLFFNHQSV 455 Query: 229 KIEEFFLLEWGVSHLQ----RKIKYLSKFGVSTNS-TPGLKYAYNQIFDMQGMRQHCNTE 283 + + + +Q + ++ ++ T TP + YN + Sbjct: 456 PLNNVSSQQQFLGIVQNEAIQHVEEMAANMGGTEILTPLQQMVYNASYGTSKNTTLN--- 512 Query: 284 DANYKKIIVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP 342 VFM TDGE + Q + + +Y +GI S ++ A Sbjct: 513 --------VFMLTDGE----TDADQIIQLVQSNNQAQTRIYTLGIGQGCSQYLIQRVAEV 560 Query: 343 NS 344 + Sbjct: 561 GN 562 >gi|116249090|ref|YP_764931.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253740|emb|CAK12133.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 176 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 31/131 (23%), Positives = 53/131 (40%), Gaps = 10/131 (7%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++R + KG I AI +F+++ IIEVS +FF+ +S +D S+ + I Sbjct: 9 SLRRLLGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFV----NSALDASVHKISRMIR 64 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + + D I N +SF VN + D+ + S D + N Sbjct: 65 TGEVAS--SNITLADFKAGICNDMLLSFSCSSGLLVKVNVLSDLSSAASADPIDDSGN-- 120 Query: 127 YSISAISRYKI 137 ++ Y I Sbjct: 121 --LTVTETYDI 129 >gi|114775649|ref|ZP_01451217.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] gi|114553760|gb|EAU56141.1| rubisco activation protein cbbO [Mariprofundus ferrooxydans PV-1] Length = 745 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 56/153 (36%), Gaps = 28/153 (18%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDV------NNVVQSGLVTFSNKIEEFFLLEW 238 E T ++++ ++++ + ++ + D ++ + + + K F W Sbjct: 569 EKASGCDQTILELSQEAVSLLSWAIEQVGDSFAIAGFHSNTRHDVRYYHIK---GFSERW 625 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 G ++ ++ + G ST +++A H KK+++ +TDGE Sbjct: 626 G-DEVKSRLAEMEA-GFSTRMGGAMRHA-----------AHYLKAQQADKKLMLILTDGE 672 Query: 299 NLSTKEDQQSLYYCN------EAKKRGAIVYAI 325 D + L + E + G Y I Sbjct: 673 PSDIDVDDEQLLIADARQAVNELDRDGIYSYCI 705 >gi|21233895|ref|NP_640193.1| hypothetical protein Rts1_232 [Proteus vulgaris] gi|21203079|dbj|BAB93795.1| hypothetical protein [Proteus vulgaris] Length = 270 Score = 39.8 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 53/150 (35%), Gaps = 16/150 (10%) Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKF 253 I A ++ ++L + +P V + F L+ L++ I Sbjct: 111 ITEAEAAVVSLLYALDNLPGVT----TSAYHFPHTTRNSVGKLKDRKQTLRQAIAANHFG 166 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 +T STP + I D+ + ++I++ TDGE S + + N Sbjct: 167 IHTTGSTPLSGALWPAIVDLAVEKAD--------QRILIVCTDGEPDSK---EDVIQMIN 215 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 +AK G +V IG + R S Sbjct: 216 DAKSDGMVVIGIGFGSVSQSSMTRIFGSTG 245 >gi|242807550|ref|XP_002484979.1| von Willebrand and RING finger domain protein [Talaromyces stipitatus ATCC 10500] gi|218715604|gb|EED15026.1| von Willebrand and RING finger domain protein [Talaromyces stipitatus ATCC 10500] Length = 1044 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 72/213 (33%), Gaps = 34/213 (15%) Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + SA S Y T + YTNS P + V LD+++V+ Sbjct: 476 RTTKTNKRQSSATSSYGANKSNNTALTEYTNSMRDPGPASFHV--------PLDLVVVIP 527 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-----KIEEFF 234 VS SM+ KI + ++ +++ + + GLVTF + + Sbjct: 528 VSSSMQGL------KITLLRDALKFLVQNLGPRD------RMGLVTFGSSGGGVPLVGMT 575 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 W + + L G + ++ A N D+ R+ N I+ + Sbjct: 576 TKAWN--GWGKILNSLRPVGQKSLRADVVEGA-NVAMDLLMQRKSNNPIST-----ILLI 627 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +D + + AK G + +G+ Sbjct: 628 SDSSTSDPESVDFVVSRAEAAK-VGIHSFGLGL 659 >gi|118593079|ref|ZP_01550466.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118434386|gb|EAV41040.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 858 Score = 39.8 bits (91), Expect = 0.77, Method: Composition-based stats. Identities = 31/237 (13%), Positives = 69/237 (29%), Gaps = 29/237 (12%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + + +V S+ +S A +++LD S SM + ++I++A Sbjct: 1 MPLSLPRLAVATSLLVASSAFSLVPSSAQAADRATILILDASGSMWAQLPEGRSRIEVAR 60 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 + L G++ + + + I+ +S GV + Sbjct: 61 DVLGDYLRSRDGSRP------LGVIAYGH----------NRKGDCQDIETISPVGVQDPA 104 Query: 260 T-----PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 + GL R + + IV +TDG Sbjct: 105 SLGSRLNGLSPRGKTPLAGSLRRAATLIPKTSEEADIVLVTDG---LETCGLDPCAVAAS 161 Query: 315 AKKRGAIV--YAIGIRVIRSHEFLRAC---ASPNSFYLVENPHSMYDAFSHIGKDIV 366 + G V + +G + +C + ++ + DA + + + Sbjct: 162 LAQEGIPVRAHVVGFGLTEGEVRQISCIAETTGGMVLAPQSGAELADALVRVTEPVT 218 >gi|325286049|ref|YP_004261839.1| hypothetical protein Celly_1140 [Cellulophaga lytica DSM 7489] gi|324321503|gb|ADY28968.1| protein of unknown function DUF58 [Cellulophaga lytica DSM 7489] Length = 288 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 64/162 (39%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S + + K I + + N + GL+ F Sbjct: 72 EEERELTMMLMVDVSGSE------NFGTTNQFKKDIITEISATLAFSALQNNDKVGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+++E F + G SH R I+ L + TN + LKY N + Sbjct: 126 SDEVELFIPPKKGKSHALRIIRELLEFTPKSTKTNLSEALKYLTNVMK------------ 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D T + Q+L + G ++ Sbjct: 174 ----KKAIVFVLSD---FITDDYLQTLRITGKKHDVTGIRIF 208 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 14/133 (10%) Query: 174 MMIVLDVSRSMESF-----FDSSITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFS 227 +M +D S+SM + T++ + ++N +LE + +G + + Sbjct: 45 IMFFIDDSQSMNRNAVTGEYTPGPTRMQVTKNALNGILENHKDKFNWGLQTLYNGGSSDT 104 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 E F + + K+ + G+ TP Y + N + Sbjct: 105 TPEETFDPEKASWQKMIDKVNKMKPTGL----TPATSRYYEVVTQT----VMPNIKYRCQ 156 Query: 288 KKIIVFMTDGENL 300 K +V M+DG+ Sbjct: 157 KSYVVMMSDGDAN 169 >gi|171914252|ref|ZP_02929722.1| von Willebrand factor, type A [Verrucomicrobium spinosum DSM 4136] Length = 212 Score = 39.8 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 40/132 (30%), Gaps = 12/132 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + ++ ++ P ++TF + Sbjct: 3 RLPVFLLLDTSGSMFG------EPIEQVKNGVQMLVSALRQDPYALETAFLSVITFDSNA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E L ++ L+ G T+ L + D Sbjct: 57 TEKVPL---TELTAFQVPALTAQGA-TSLGDALTLVADVASKQLAKTTATTKGDWKPMVF 112 Query: 291 IVFMTDGENLST 302 I MTDG Sbjct: 113 I--MTDGAPTDD 122 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 52/148 (35%), Gaps = 2/148 (1%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 +R + G + ++ A + L++G ++ S ++ L +D ++ AA + Sbjct: 13 GDRLRTLLRDRNGAVAVIFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVK 72 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 N +G + L + G V + D S + + + + Sbjct: 73 GGNPDSGQLARYLNSNMSPGGAATNVT--LTRSVATGGAVTFVGDADFSVATNFLKMAGL 130 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSR 152 + + S +P + T T+++ Sbjct: 131 GAIKLHSHSEATLPAQIVTATFKPTSAQ 158 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 65/188 (34%), Gaps = 17/188 (9%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 S +D LD+++V+D S+ + S T+ +IN + E IP + G Sbjct: 33 GGSISDIWLDVVVVVDNSQRVNKRSFVSSTR-----DTINNIFREA-SIP----RTRVGF 82 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ--HCN 281 VT++++ L S+ + + + S Y + + Q Sbjct: 83 VTYNSQATTNADLNKFKSYGDLQQGVYNSYNDMNLSPEKTPYIGTGLIAAGELLQIQGSA 142 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRA 338 N+ K+I+ N L N K G + I + + + L Sbjct: 143 DGHVNHPKVIIAYATALNG--TGLLDPLSVANTLKSAGITIITIAVDTDDNGVIEKQLAP 200 Query: 339 CASPNSFY 346 ASP + + Sbjct: 201 LASPGAAF 208 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 60/151 (39%), Gaps = 14/151 (9%) Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-----FFLLEWGVSHLQRKIKYL 250 ++ A L+ + +P+ ++ G++ F +K+++ + + ++ + + Sbjct: 4 QKIKQTYEAFLKILADLPEEDHF---GILIFDDKVDKWQNTLVKAVPDNIIKAKQFVSKI 60 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED-QQSL 309 S G T+ L A + + + + II+F++DGE S + + + Sbjct: 61 SARG-GTDINKALLAAVKMLKNTSRNKLLPKISTS----IILFLSDGEPTSGVTNHNEII 115 Query: 310 YYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 +A +R +Y +G FL A Sbjct: 116 NNVKKANERQTTLYCLGFGNDVDFNFLEKMA 146 >gi|281208413|gb|EFA82589.1| type A von Willebrand factor domain-containing protein [Polysphondylium pallidum PN500] Length = 904 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 25/164 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + VLD S SM I K ++ I L E N+V+ G Sbjct: 341 FIFVLDCSGSMS---GRRIQKAKAVLQLIMRSLNENIKF----NIVRFGSSFVKVYPRST 393 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + +S + +I++ T P ++ I + D Y + + Sbjct: 394 VYNDETLSSVSTQIQFFDADLGGTELFPAIR----DILSTEA--------DPAYPRQVFV 441 Query: 294 MTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFL 336 +TDGE + AK+ ++ +GI E + Sbjct: 442 LTDGE-----IYDRKQLISYVAKESSSTRIFTVGIGGQVDRELV 480 >gi|148725344|emb|CAI12059.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta 3 subunit (CACNA2D3) [Danio rerio] Length = 742 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 62/165 (37%), Gaps = 19/165 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + D++I +D+S SM+ ++ +A +IN +L+ + VN + S V + Sbjct: 210 ATSPKDIIIAVDISGSMKGL------RLTIAKHTINTILDTLGENDFVNVIAYSDYVQYV 263 Query: 228 NKIEEFFLLE---WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + L++ H + ++ L G +K ++ + + + Sbjct: 264 EPCFKGTLVQADLDNREHFKLLVQELQVKGEG-KVKKAMKESFKILNEAAAEGRGSLCNQ 322 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A I+ +TDG ++ + + + R V+ I Sbjct: 323 A-----IMLITDGAMEDFQQVFEEFNWPD----RKVRVFTYLIGR 358 >gi|332711435|ref|ZP_08431366.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] gi|332349413|gb|EGJ29022.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] Length = 579 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 42/277 (15%), Positives = 97/277 (35%), Gaps = 39/277 (14%) Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 + N++ T++ + R L + +V++ + ++ + P+ + ++ Sbjct: 284 SANSQGNNTVNYQAVYPSATFSSNIRAILPNAPWVSEKEKAAGEKVIEFLRQPETQKIAV 343 Query: 130 SAISRYKIPLKFCT--FIPWY-----TNSRHIVMPITSSVKV-----NSQTDARLDMMIV 177 R IP F P + P V+ S + +V Sbjct: 344 DLGLRPGIPGVPLGSKFSPQFGVQANPQYDSYRPPKPEVVEAMLQSWQSYAKKPSQVALV 403 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM K+ ++ ++ + + ++ FSN+I++ F++E Sbjct: 404 VDVSGSMRG------EKLSGVQNTLLNYVQNLGSRE------KIAIIPFSNEIKQPFMVE 451 Query: 238 WGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 IK++ G T + IF ++Q+ T+ N ++ +T Sbjct: 452 GTPQGKAEGIKFIGSLKAGGGTRLYD------SAIFARNWLKQNFKTDAINA---VLILT 502 Query: 296 DGENLSTKED----QQSLYYCNEAKKRGAIVYAIGIR 328 DG++ ++ + L N + + +G Sbjct: 503 DGQDSGSEITLDNLSKQLQSSNFEAEESIAFFTVGYG 539 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 73/199 (36%), Gaps = 16/199 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L++ I+LD S SM + + K++ A K+I L IPD NV+ L + Sbjct: 148 KAKSLNVEILLDASGSMAGKVNGEV-KMEAAKKAIYNYL---DKIPDNANVM---LRVYG 200 Query: 228 NK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE- 283 +K E L G S + ++ +K + + + + E Sbjct: 201 HKGSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESINDDFKEY 260 Query: 284 -DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFLRACAS 341 ++ ++DGE + + N++ A+V IG V S + L+ A Sbjct: 261 TGEENLNVVYIVSDGEETCGGDPVNAAKNLNQS-STHAVVNIIGFDVKNSEQQQLKNTAE 319 Query: 342 P--NSFYLVENPHSMYDAF 358 ++ V + ++ Sbjct: 320 AGKGNYATVSSADELHQTL 338 >gi|196011371|ref|XP_002115549.1| hypothetical protein TRIADDRAFT_59612 [Trichoplax adhaerens] gi|190581837|gb|EDV21912.1| hypothetical protein TRIADDRAFT_59612 [Trichoplax adhaerens] Length = 795 Score = 39.8 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 52/132 (39%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLD S SM + +D+ ++ L+ ++ + L+T ++ Sbjct: 3 LIFVLDTSASMNQQSYLGTSYLDITKGAVEYFLK-LRSRDPACRTDKCMLIT-TDDPPAM 60 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT------EDA 285 + W SH ++K L G S L+ A+N + ++ M + + Sbjct: 61 IKVGWRESHYSFLNELKNLQATGYSQMD-LALREAFN-LLNVNRMVSGIDKYGQGRDPHS 118 Query: 286 NYKKIIVFMTDG 297 +I+ +TDG Sbjct: 119 IEPAVIITLTDG 130 >gi|116620444|ref|YP_822600.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223606|gb|ABJ82315.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 354 Score = 39.8 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 49/151 (32%), Gaps = 19/151 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 ++ +D++++LDVSRSME A++++ ++ D Sbjct: 72 PQEIRNFQNEKMPVDVVLLLDVSRSMEPNIRRVAAASHEALRALGEQDRIATMVFD---- 127 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 FS + F + + F T+ T GL + Sbjct: 128 ------RFSRLLMPFRPSRRDAETVLENVLERETFDGGTDITRGL---------LDTASY 172 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + ++ IV +TD + + + L Sbjct: 173 VEQNARRDARRAIVIVTDDQTERNRNEAAVL 203 >gi|171695786|ref|XP_001912817.1| hypothetical protein [Podospora anserina S mat+] gi|170948135|emb|CAP60299.1| unnamed protein product [Podospora anserina S mat+] Length = 345 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 59/181 (32%), Gaps = 22/181 (12%) Query: 184 MESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 ME S++ ++ A + ++ Q +V F+ E + L S Sbjct: 1 MEDTDPSNL-RLQAAKLLVGKLIPSSGTTTGQTAD-QVTIVQFNETAEVLYPLGD-PSGA 57 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I + G T G+ A +++ N DA I +TDG + Sbjct: 58 GATIDGIPANG-GTFIGGGISAAVDELTRSG------NKPDAAG---IFVLTDGADDPPS 107 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE--------FLRACASPNSFYLVENPHSMY 355 ++ N A++ G V + G + + E L + + + E+ + Sbjct: 108 LISDTIDSINRAQQAGIRV-SFGFLSVDAEEQDSRITSAILSSGGTFTTVNTAEDTSKLV 166 Query: 356 D 356 Sbjct: 167 A 167 >gi|114587332|ref|XP_001172399.1| PREDICTED: inter-alpha (globulin) inhibitor H1 isoform 1 [Pan troglodytes] Length = 774 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDRDATQQLQGF 471 >gi|281201173|gb|EFA75387.1| hypothetical protein PPL_11464 [Polysphondylium pallidum PN500] Length = 675 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 62/197 (31%), Gaps = 27/197 (13%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + ++DVS S+ + + K+I + IP +++F + + Sbjct: 35 SLTFLVDVSSSISN------EAFILTKKTIEGFISAFPPIP-------MKIISFGKEAQI 81 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE------DAN 286 E + L ++ ++ +T S + + F + D N Sbjct: 82 VISEETDKTKLHTAVENFARKIENTYSDGFINK-ITRSFKKEETNMIPAINLCIEGIDIN 140 Query: 287 YKKI--IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 K ++ +TDG+ ++ + K V IGI F Sbjct: 141 NPKTGLVLLITDGKPTDDPTEKITRLL-----KEKVEVITIGIGNQTMKTFFSNHFKNQK 195 Query: 345 FYLVENPHSMYDAFSHI 361 Y +E+ + A I Sbjct: 196 KYYIESFSMVNTAIKSI 212 >gi|282899257|ref|ZP_06307228.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] gi|281195826|gb|EFA70752.1| hypothetical protein CRC_00707 [Cylindrospermopsis raciborskii CS-505] Length = 431 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 35/181 (19%), Positives = 67/181 (37%), Gaps = 19/181 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN----- 228 ++++LD S SM KI AI +I K+ + Q +V F Sbjct: 110 IIVLLDFSGSMNQTDSGGSKKITGAINAIGQF---AKVSSERGGDTQISIVPFGEAGKNC 166 Query: 229 ---KIEEFFL---LEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMR--Q 278 +++ L + LQ ++YLS STN LK + + + + R Sbjct: 167 PEYPVDKDTLDKFISASDFKLQNSLEYLSTLNPCGSTNLYQPLKKSLQFLGNPEDPRFTL 226 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFLR 337 N+ D + I+ ++DG + + E + + K V+ +G + ++ Sbjct: 227 PENSSDPQPRLSIILLSDGYHNAMNEYRDFSELKSLLKDYENITVHTLGYGLTPEQLGVK 286 Query: 338 A 338 Sbjct: 287 Y 287 >gi|239613390|gb|EEQ90377.1| U-box domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 766 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 30/210 (14%), Positives = 75/210 (35%), Gaps = 26/210 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSIT----KIDMAIKSINAMLEEVK--LIPDVNNVVQSGLV 224 D+++ +D+S SM S T K + S+ + + +I +N+ + G+V Sbjct: 73 PCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLGVV 132 Query: 225 TFSNKIEEFFLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 FS E + + + + +K + +++ ++ + + Sbjct: 133 AFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNL-----WHGLKLSLEALEEVTPI 187 Query: 284 DANYKKIIVFMTDG-----ENLSTKEDQQ----SLYYCNEA--KKRGAIVYAIGIRVIRS 332 N + + +TDG + + + Y ++A K R +++ G Sbjct: 188 PQNVQ-ALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGYYIR 246 Query: 333 HEFLRACAS--PNSFYLVENPHSMYDAFSH 360 L+A + ++ + + + F H Sbjct: 247 SGLLQAISEVGGGTYSFIPDAGMIGTVFVH 276 >gi|220927960|ref|YP_002504869.1| hypothetical protein Ccel_0507 [Clostridium cellulolyticum H10] gi|219998288|gb|ACL74889.1| hypothetical protein Ccel_0507 [Clostridium cellulolyticum H10] Length = 198 Score = 39.8 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 54/168 (32%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 N KG T+ ++ ++FL++ + + I + + S+ + + A +N+ Sbjct: 1 MKNNKGSFTVEASLVFSVVFLMVTAFVYLFVIMYQYVNMQSVANEAATKGAYFYVNQTGN 60 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 N N + + + ++ + + ++ Sbjct: 61 NYGSNKLDEMYWRIYDTNNNKKASGIIDYANKLLNKSLFPAENNISVNTYNKILIKNLKI 120 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 K + + S + + + ++ ++ + +M IV D Sbjct: 121 EIVDKYTIPVSNMFDIFGLSPVLSLKVVTNSPLDDNAEFIRNMDIVTD 168 >gi|330984300|gb|EGH82403.1| hypothetical protein PLA107_04644 [Pseudomonas syringae pv. lachrymans str. M301315] Length = 235 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 62/158 (39%), Gaps = 23/158 (14%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + ++V +++VLD S+SM + + +I ++ + + + + Sbjct: 34 THIQVAGLNLNSAQVVVVLDASKSMHAQYKDG--RIQSLLERLTGLAMALDKDQSFELYL 91 Query: 220 QSGLVTFSNKIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 F +++ + L+ ++ R+I + STN P + ++ F + Sbjct: 92 ------FGSEVAKLPALDGESVAGYVDREIIGKYRINQSTNYAPAIARIHSDYFGSKD-- 143 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSL-YYCNE 314 +++F+TDG+ K+ ++ + C + Sbjct: 144 ----------PVLVIFITDGDATDKKQTEKVITQVCTQ 171 >gi|297539403|ref|YP_003675172.1| von Willebrand factor type A [Methylotenera sp. 301] gi|297258750|gb|ADI30595.1| von Willebrand factor type A [Methylotenera sp. 301] Length = 323 Score = 39.8 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 46/139 (33%), Gaps = 11/139 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + + + ++V+D S SM+ F T A A + D Sbjct: 68 GASRKEQKVGKGAQTVLVIDRSVSMDHPFAGQTTSGRAAEIKSAAARRLITDFIDSRPDD 127 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQ 278 G+V F+N + + I + ++ TN G+ A N ++Q Sbjct: 128 MMGVVGFTNSALYGMKITTNRDAIHAAINAATGSALNQTNIGAGITEAVNLFDNIQSSGS 187 Query: 279 HCNTEDANYKKIIVFMTDG 297 + ++ ++DG Sbjct: 188 ----------RAVILLSDG 196 >gi|328721893|ref|XP_001947406.2| PREDICTED: midasin [Acyrthosiphon pisum] Length = 4961 Score = 39.8 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 81/270 (30%), Gaps = 42/270 (15%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 N+ + K D + + R L + D +++ Sbjct: 4677 NKEIQTDEAWNKCMDATSSLAQELSEQLRLVLEPSKASRLQGDFRTGRRINMRR------ 4730 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + Y I +F W ++ + + +++ +D S SM Sbjct: 4731 -----VIPY-IASQFKKDKIW--------------LRRTRPSKRQYQIIMAVDNSSSMMD 4770 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQ-- 244 S KI L V + + G+++F + + L S Sbjct: 4771 ----SHAKILAFES-----LALVSKALTLVEAGELGVMSFGEQTKIVHPLGEPFSDQTGC 4821 Query: 245 RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + ++ + T ++ A + R +T D + ++V M+DG + ++ Sbjct: 4822 KLMQNFTFDEQKTKVGELIELA----TAVFEDRGQLDTGDRPAQ-LLVIMSDGRGIKSEG 4876 Query: 305 DQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 AK G + I I +S E Sbjct: 4877 PDVVKNAIRAAKVNGIFIVFIVIDNPKSKE 4906 >gi|220906001|ref|YP_002481312.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219862612|gb|ACL42951.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 230 Score = 39.8 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 53/163 (32%), Gaps = 14/163 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++LD S SM I+ + A + V V +VTF Sbjct: 27 VILLLDTSGSMAG------APIEALNNGLLAFKDAVIQDEQAALRVDLAIVTFG-----P 75 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + + I K T + YA + + + N Y+ + Sbjct: 76 VQLVQDFITIDQFIPPQLKAQGLTPMGEAINYALDLL--ENRKVAYRNNGIQYYRPWLFL 133 Query: 294 MTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEF 335 +TDG QS A ++R +A+G++ + Sbjct: 134 ITDGAPNRDSPWIQSADRIRNAERERKLSFFAVGVQGADMNTL 176 >gi|157273515|gb|ABV27414.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 327 Score = 39.8 bits (91), Expect = 0.83, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 94/293 (32%), Gaps = 44/293 (15%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 + + I T + + Q + + R P+ T + I S Sbjct: 18 AWAQERAAIPDDT--TVTLRAQQVVVPFTVVDRLNRPVTTITAADVKLYEDGVEQDIVSL 75 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + +++VLD S SM ++++ +A ++ LE + +P ++ Sbjct: 76 ---GRAPELPTTVVVVLDCSGSM-------VSRLPLAKRATLMFLERILRLPQD----RA 121 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRK-----------IKYLSKFGVSTNSTPGLKYAYNQI 270 L+ + L + L++ + + F + PG Sbjct: 122 ALLACQQDLLLVQPLTNSLDALRQALATLDEQLPSPLGRVVPFEPKQATPPGTALYAALY 181 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 ++ + + D ++++V ++DG ++ + A + +YA+GI Sbjct: 182 AAIELLPEER---DDARRRVVVAISDG--FDSEGLIRVGEVIEHAWRNQVSLYALGIGQP 236 Query: 331 -----------RSHEFLRAC-ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 + R C ++ + + AF+ + D+ + Sbjct: 237 ELTATDTNQMVNRADLERLCNSTGGQAFFPRLDREFFTAFAQVDADLRHSFVL 289 >gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] Length = 699 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 49/313 (15%), Positives = 103/313 (32%), Gaps = 54/313 (17%) Query: 75 KKLKGGDILCRIKNTWNMSFRNE--LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 K I N W+ S N+ L + G ++ + + + G S+ + Sbjct: 209 PKHPNATIQQLSPNEWSASLVNQANLEEAGNTQNVITLSQDIVVYWRHKEGLPG-SVDLV 267 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI 192 + + TF+ T + T S D + VLD+S SM + Sbjct: 268 TYKPEANQRGTFMLTLTPGDDLGKVTTGS-----------DWIFVLDISGSMSGKY---- 312 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVV-QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS 251 ++ + LE+++ V+ +G V ++ + V+++ +++ Sbjct: 313 ---ATLVEGVRQGLEKLRQQDRFKVVLFNNGSVDLTSGFLTVS--QANVANVLQQLDNYK 367 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 G TN GL+ I DA+ I+ +TDG + ++++ Sbjct: 368 -VGGGTNLYAGLQKGLTGI-------------DADRPAGIILVTDGVANVGQTEKKTFL- 412 Query: 312 CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL--VENPHSMY------------DA 357 ++ + + L+ A ++ + V N + A Sbjct: 413 -KLLNDHDVRLFTFIMGNSANRPLLKEMADVSNGFAMSVSNADDIVGHIMLATSKLTHQA 471 Query: 358 FSHIGKDIVTKRI 370 F I DI ++ Sbjct: 472 FRDIDLDIDGVKV 484 >gi|307294185|ref|ZP_07574029.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] gi|306880336|gb|EFN11553.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] Length = 157 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 51/160 (31%), Gaps = 6/160 (3%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R FF + G T+ A+ +PI+ L+ M +V+ F K L +R Sbjct: 3 RRFFADPMGTSTVELALIMPILVLLACMAGDVAMAFKAKIALQRAAER----TGQLATAG 58 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI-VVVPQNEGY 127 G N+ K + D V++ S ++ Q Sbjct: 59 GYTNDTSKTQAAYNNLAADAAAAAGVSTNNVTVTPTLLCDATVQTASPEVPCADGQQTKR 118 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 ++ F +P +N +PIT S V Q Sbjct: 119 YVAITISGSYTPMFAKLMPG-SNWSTQGIPITGSASVRLQ 157 >gi|302688515|ref|XP_003033937.1| hypothetical protein SCHCODRAFT_81758 [Schizophyllum commune H4-8] gi|300107632|gb|EFI99034.1| hypothetical protein SCHCODRAFT_81758 [Schizophyllum commune H4-8] Length = 475 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 16/169 (9%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 LD++ V D + S S+ S+ I+ + I + + ++ GLV F Sbjct: 33 DRKMLDLVFVQDCTGSQGSYIASATKNIEAICEHIFQSGKLLS-----REDLRVGLVAFR 87 Query: 227 ------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMR 277 I + F ++ + + L G T L A + + Sbjct: 88 DHPPQDHTYITKNFGFSSDITKVHHDLSGLYASGGGDGPEAVTAALAEALQMDWRPTASK 147 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 DA I + DG + + + L E K G ++ + Sbjct: 148 MIVLIADAPPHGIGEY-GDGFDDGSPDGHDPLQLAREMAKHGITLFFVA 195 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 22/179 (12%), Positives = 49/179 (27%), Gaps = 11/179 (6%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN-G 71 +G + I A+ + I+ + I++ ++ + L + D + + A ++ Sbjct: 11 KKQEGAVAITVALCMVILLGFAALAIDIGNLLIARNELQNSADAAAMAGAGCLIRRTACS 70 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 N + ++ S V ST Y + + Sbjct: 71 NTSASQPDWPTADATASAFSTSATTNQVQGTSVQT------STVATGYWNTTGTPYGLES 124 Query: 132 IS----RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 + +P T +N+ V + A VL S+ Sbjct: 125 LPFTPGANDLPAVQVTIRKDGSNANGAVPIFLGRIFGARILKASAVATAVLSTPGSVGP 183 >gi|91791751|ref|YP_561402.1| hypothetical protein Sden_0384 [Shewanella denitrificans OS217] gi|91713753|gb|ABE53679.1| conserved hypothetical protein [Shewanella denitrificans OS217] Length = 4285 Score = 39.4 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 37/253 (14%), Positives = 86/253 (33%), Gaps = 22/253 (8%) Query: 113 STSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW---YTNSRHIVMPITSSVKVNSQTD 169 + + N S + + ++ ++ + + + ++ + Sbjct: 3788 TVDATQTDLSNNTSISATDTAEVRMTYEYASGTHGDNTIVGTESNDIIVSDQTGIQVVDG 3847 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 ++ ++D S S+ + S++ + + S+ L+ I + VV+ LV F Sbjct: 3848 QNYNIAFIVDTSGSVGT---SALNTMKTQLTSVFNTLK-ASAIGVHSGVVKILLVDFDTG 3903 Query: 230 IEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++ + +S L + +S G +TN G A + + H Sbjct: 3904 LKFAVSVNLAEANALSTLTTALGTMSS-GGNTNYKAGFDAATSWFTNGSASTNHGTNLTY 3962 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSF 345 F+TDG+ + + + AK V AIGI ++ L A + ++ Sbjct: 3963 -------FITDGQPNTDANKAPAAFASLNAKS---TVEAIGIGNGINYTTLAAYDTDHTP 4012 Query: 346 YLVENPHSMYDAF 358 N + + Sbjct: 4013 VTGVNANQLAAVI 4025 >gi|3764061|gb|AAC94975.1| integrin alpha 2 [Homo sapiens] Length = 156 Score = 39.4 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 56/153 (36%), Gaps = 24/153 (15%) Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKI---KYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 Q GL+ ++N F L + + + + G TN+ ++YA + Sbjct: 10 TQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGGDLTNTFGAIQYARKYAYSA-- 67 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI------RV 329 + + K++V +TDGE+ + + CN + + I + Sbjct: 68 ----ASGGRRSATKVMVVVTDGESHDGSMLKAVIDQCNH---DNILRFGIAVLGYLNRNA 120 Query: 330 IRSHEF---LRACAS---PNSFYLVENPHSMYD 356 + + ++A AS F+ V + ++ + Sbjct: 121 LDTKNLIKEIKAIASIPTERYFFNVSDEAALLE 153 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 39.4 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 10/75 (13%), Positives = 35/75 (46%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + +LTAI + ++ + + +++ +++ ++ L + D + A ++++ NG Sbjct: 12 KGEEGAVAVLTAILMAVLIMFAALAVDLGYLYGVRNELQNGADAGALAGAHELLDVENGI 71 Query: 73 NRKKLKGGDILCRIK 87 + + + Sbjct: 72 LTRDDAIAEAERVVS 86 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 39.4 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 12/50 (24%), Positives = 26/50 (52%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAA 62 + G +++L F +I +I+++S +F + VL + D + + AA Sbjct: 27 RDDGGQISLLIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGAALTAA 76 >gi|290978651|ref|XP_002672049.1| predicted protein [Naegleria gruberi] gi|284085622|gb|EFC39305.1| predicted protein [Naegleria gruberi] Length = 1682 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 51/128 (39%), Gaps = 16/128 (12%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A++D + +LD S S+ + ++ + K I + ++ + +V F + Sbjct: 1399 AKIDFVFLLDASGSIGT---TNYNSMRTLCKQITGRMSI------SSDSILVSIVRFHSN 1449 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + + ++ ST + PGL+ A + + + N + K Sbjct: 1450 ATLALKLSNDGTTINNTLTNMAYEASSTGTIPGLRVAVDAL-------RPANGGRDDADK 1502 Query: 290 IIVFMTDG 297 +I +TDG Sbjct: 1503 VIFILTDG 1510 >gi|254460998|ref|ZP_05074414.1| von Willebrand factor, type A [Rhodobacterales bacterium HTCC2083] gi|206677587|gb|EDZ42074.1| von Willebrand factor, type A [Rhodobacteraceae bacterium HTCC2083] Length = 627 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 45/301 (14%), Positives = 100/301 (33%), Gaps = 35/301 (11%) Query: 52 SMIDRSLVHA-----ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND 106 S+ D S+V A +E ++ + R+ L + Sbjct: 157 SIADASVVAVEVDTEAFASASENTLKVTRETSVSTFSVDVDTASYAIVRSSLMNGQ---- 212 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI-PWYTNSRHIVMPITSSVKVN 165 + + ++ I + Y +A ++ PW N++ + + + + Sbjct: 213 ---LPPAAAVRIEEMVNYFPYDYAAPTQGVFATSLAMMATPWNENTQLLRIALQGQM-PE 268 Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + L+++ ++D S SM K+ + +S+ +L +++ V V +G Sbjct: 269 VEARPPLNLVFLIDTSGSMN-----QPNKLPLLKQSLRLLLGQLRAEDQVAIVAYAGSAG 323 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + + L G TN GL+ AY MQ + Sbjct: 324 QVLEPTAAHERDTILGALDNL-----AAGGETNGQAGLQQAYALAKQMQEEGEVSR---- 374 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRVIR-SHEFLRACASPN 343 ++ TDG+ + ++L Y + + G + +G E ++A A Sbjct: 375 -----VLLATDGDFNVGLSNAEALKGYIADKRDSGTFLSVLGFGRGNLDDETMQALAQNG 429 Query: 344 S 344 + Sbjct: 430 N 430 >gi|194758339|ref|XP_001961419.1| GF14941 [Drosophila ananassae] gi|190615116|gb|EDV30640.1| GF14941 [Drosophila ananassae] Length = 1177 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 38/214 (17%), Positives = 76/214 (35%), Gaps = 25/214 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT-----FS 227 D++I+LD S SM F + +A +I ++L+ + V F+ Sbjct: 243 DIVILLDHSGSMTGF------RHHVAKFTIRSILDTFSNNDFFTIFRYAADVEDIIPCFN 296 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + I+ L N T A+ ++ +HCN + Sbjct: 297 GALVQATPE--NIEVFNEAIEKLDDPEGYANLTLAYDKAF-RLLRTYYDSRHCN-NSSTC 352 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-----AIVYA--IGIRVIRSHEF-LRAC 339 + I+ +TDG +T E Q + + + G V+ +G V + E AC Sbjct: 353 NQAIMLVTDGVAGNTTEVFQKYNWGDG--ENGTSEMNVRVFTYLLGKEVTKVREIQWMAC 410 Query: 340 ASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYD 373 + + V+ +++ I T + + Sbjct: 411 LNRGYYSHVQTLDEVHEEVLKYVDVIATPLVLQN 444 >gi|149916833|ref|ZP_01905335.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] gi|149822550|gb|EDM81939.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] Length = 447 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 65/198 (32%), Gaps = 40/198 (20%) Query: 164 VNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDV-NNVVQS 221 + ++M+VLD S SM + +D I N++ + V I ++ + Sbjct: 136 SATLEPVPPNVMMVLDKSGSMFTNTWDHDNNGGTPQITRWNSLYDVVDNITTTFDDSINF 195 Query: 222 GLVTF---------------SNKIEEFFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLK 264 G F ++ E + E + + I ++ T +T G+ Sbjct: 196 GANLFPSTLAQNIYGPQACTTSNFPEVTVGENNSAQILATIPGPGVTASYGGTPATLGVT 255 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN----------- 313 AYN + + D + ++ +TDG + + + Sbjct: 256 TAYNHLTSL----------DPELPRAMILVTDGAANCDQNAANNFQLFDVYDDGLPVIVG 305 Query: 314 EAKKRGAIVYAIGIRVIR 331 A G Y +GI +I Sbjct: 306 TAAANGVPTYVVGIDIIN 323 >gi|60219503|emb|CAI56763.1| hypothetical protein [Homo sapiens] Length = 860 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 50/131 (38%), Gaps = 10/131 (7%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ L+ ++ + + VT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVELFLK-LRARDPASRGDRYMPVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAY-----NQIFDMQGMRQHCNTEDAN 286 W +H ++K L G++T L+ ++ N++ Sbjct: 62 IKAGWKENHATFMSELKNLQASGLTTL-GQALRSSFDLLNLNRLISGIDNYGQGRNPFFL 120 Query: 287 YKKIIVFMTDG 297 I++ +TDG Sbjct: 121 EPSILITITDG 131 >gi|167467474|ref|ZP_02332178.1| hypothetical protein YpesF_06164 [Yersinia pestis FV-1] gi|262360709|gb|ACY57430.1| hypothetical protein YPD4_0521 [Yersinia pestis D106004] gi|262364656|gb|ACY61213.1| hypothetical protein YPD8_0523 [Yersinia pestis D182038] Length = 312 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 45/143 (31%), Gaps = 8/143 (5%) Query: 195 IDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 + + ++ ++K P ++ F+ + L V + L G Sbjct: 6 LKKMNDGLQMIINDLKKDPHALETAWISVIAFAGVAKTIVPL---VEVVSFYPPRLP-IG 61 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T+ L+ QI D ++ +TDG + + + + Sbjct: 62 GGTSLGAALQELTRQIDTQVRKTTEERKGDWKP--VVYLLTDGRPTDDTTAEITRWKTHY 119 Query: 315 AKKRGAIVYAIGIRVIRSHEFLR 337 A+K + AIG+ LR Sbjct: 120 ARK--VNLIAIGLGPSADLNILR 140 >gi|113931618|ref|NP_001039260.1| inter-alpha (globulin) inhibitor H2 [Xenopus (Silurana) tropicalis] gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis] Length = 942 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 72/197 (36%), Gaps = 30/197 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI--- 230 ++ V+DVS SM K+ ++++ ++LE+++ Q ++ F++ I Sbjct: 306 ILFVIDVSGSMWGL------KMKQTVEAMKSILEDLRPDD------QFSIIDFNHNIRCW 353 Query: 231 --EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + + ++ + G TN L A + + + + Sbjct: 354 KDELVYASSVEKQDASKYVQSIQPNG-GTNINEALLRAIFILKEASN-KGMLEPNSVSL- 410 Query: 289 KIIVFMTDGENL--STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS-- 344 IV ++DG+ K + + A ++++GI ++FL A N Sbjct: 411 --IVLVSDGDPTVGEIKLSKIQKNVRTNIRDDFA-LHSLGIGFDVDYDFLERLAQENHGM 467 Query: 345 ---FYLVENPHSMYDAF 358 Y ++ + F Sbjct: 468 AQRIYGKQDTAAQLKEF 484 >gi|23309005|ref|NP_602209.2| hypothetical protein NCgl2911 [Corynebacterium glutamicum ATCC 13032] gi|21325787|dbj|BAC00408.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] Length = 110 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 7/74 (9%) Query: 154 IVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 IV+ + S+ +++ +R + M+VLD S SM + ++ D A ++ N ++E+ Sbjct: 28 IVLSVPSTANAQTESTSRGVPTMLVLDSSGSMVTPDAGGQSRSDAAKEATNQFIDEL--- 84 Query: 213 PDVNNVVQSGLVTF 226 + GLVT+ Sbjct: 85 ---AGTLDLGLVTY 95 >gi|307591306|ref|YP_003900105.1| Na-Ca exchanger/integrin-beta4 [Cyanothece sp. PCC 7822] gi|306986160|gb|ADN18039.1| Na-Ca exchanger/integrin-beta4 [Cyanothece sp. PCC 7822] Length = 1547 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 35/278 (12%), Positives = 71/278 (25%), Gaps = 24/278 (8%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISR-YKIPLKFCTFIPWYTNSRHI 154 ++ G ND D T + Y ++ +K + Sbjct: 319 DDTLSGGAGNDTIDGGDGTD-TAIYSGNYADYKLTLSKADFKTVTITHDGGTQEDGIDTL 377 Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + K LD+ V+D + SM + ID S + ++ + Sbjct: 378 TNVEFAQFKDKKVPLKGLDLAFVIDTTGSM-------LDDIDAVKASTSEIINNI--FDG 428 Query: 215 VNNVVQSGLVTFSNKIEEFF-------LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + +V +++ E +E + I +S G A Sbjct: 429 SFPFARVAVVGYNDPSTETILSFTDQDKIEDRKNAALNAINSISVGGGGD-----FPEAV 483 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 N + I+ D T+ + Q + + G IG Sbjct: 484 NAGLIRALSGGAGQWNKQAVSRSIILFGDAPAKDTELESQVIELAEQVGISGLNGVIIGE 543 Query: 328 RVIRSHEFLRACA-SPNSFYLVENPHSMYDAFSHIGKD 364 S +F + + ++ + D Sbjct: 544 DSSTSADFESLATQTGGKTFNAKDASQVVSVILDTIDD 581 >gi|311114534|ref|YP_003985755.1| hypothetical protein HMPREF0421_20650 [Gardnerella vaginalis ATCC 14019] gi|310946028|gb|ADP38732.1| conserved hypothetical protein [Gardnerella vaginalis ATCC 14019] Length = 368 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 47/125 (37%), Gaps = 18/125 (14%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 I+ S V ++ + D+++ LDVS S + I+ +K + E Sbjct: 89 TTILSSRPSRVLNANEQSSSRDIVLCLDVSGS---TLPYDLEVINAYLKFVEHFQGE--- 142 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL----KYAY 267 + GL F++ F L S ++++++Y SK + + + Y Sbjct: 143 --------RIGLSIFNSTSRTVFPLTDDYSLVKKQLQYASKLLDGVQTQDKIDNMQQKQY 194 Query: 268 NQIFD 272 QI Sbjct: 195 QQIST 199 >gi|134102411|ref|YP_001108072.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] gi|291004204|ref|ZP_06562177.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] gi|133915034|emb|CAM05147.1| putative magnesium-chelatase subunit [Saccharopolyspora erythraea NRRL 2338] Length = 716 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 50/150 (33%), Gaps = 21/150 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM ++ ++ ++ ++L + G+VTF E Sbjct: 523 VLFVVDASGSM-----AARERMSAVSGAVLSLLR-----DAYQRRDKVGVVTFRGDSAEV 572 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L + G T GL A N++ + R ++V Sbjct: 573 ALPPTSSVDTAAVRMRRLRTGGRTPLADGLLKA-NKVVGTERTRDPRRRP------LLVL 625 Query: 294 MTDGENL----STKEDQQSLYYCNEAKKRG 319 +TDG+ S + + + + G Sbjct: 626 LTDGKATVPLKSDVDGRARRAVDDALRAAG 655 >gi|126306104|ref|XP_001362407.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 870 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 64/168 (38%), Gaps = 33/168 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D SRSM +++ ++ L ++ + +G+VTF + Sbjct: 307 LVLVIDTSRSM-----KVGNRLNRLRQASQFFLLQI-----IEKGSWTGIVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S +QRK I L G + GL+ A+ + ++ Sbjct: 357 SELIQIESDVQRKTLISRLPNVTVAGGGAHICSGLRTAFMVV-----KKKFLTDGSE--- 408 Query: 289 KIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF 335 +V +TDGE+ + C E K+ GAI++ I + Sbjct: 409 --MVLLTDGEDN-------TTNTCFEEVKQSGAIIHTIVLGPSTEKGL 447 >gi|62391862|ref|YP_227264.1| hypothetical protein cg3342 [Corynebacterium glutamicum ATCC 13032] gi|41223009|emb|CAF18954.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 107 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 38/74 (51%), Gaps = 7/74 (9%) Query: 154 IVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 IV+ + S+ +++ +R + M+VLD S SM + ++ D A ++ N ++E+ Sbjct: 25 IVLSVPSTANAQTESTSRGVPTMLVLDSSGSMVTPDAGGQSRSDAAKEATNQFIDEL--- 81 Query: 213 PDVNNVVQSGLVTF 226 + GLVT+ Sbjct: 82 ---AGTLDLGLVTY 92 >gi|85705522|ref|ZP_01036620.1| Nitric oxide reductase activation protein [Roseovarius sp. 217] gi|85669947|gb|EAQ24810.1| Nitric oxide reductase activation protein [Roseovarius sp. 217] Length = 747 Score = 39.4 bits (90), Expect = 0.85, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 55/173 (31%), Gaps = 22/173 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + +VLD+S+S I+ +DM + + ++ +++ + + Sbjct: 556 SIAVHLVLDISQSTADLAAPGISILDMERDAAAILAGT--MLQLGDDLAITAFSSAGRHD 613 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 ++ S L G ST L++A ++ D+ Y Sbjct: 614 VRVIPIKTFGSPLDEATGRALAGLRPGFSTRIGAALRFASQRMQDVS-----------RY 662 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCN------EAKKRGAIVYAIGIRVIRSHE 334 +K+++ +TDG + + +G + + Sbjct: 663 RKLVLLVTDGAPSDIDVADPEYLVADARRAVQSMRSQGIDAVCVALGPDAGQR 715 >gi|256811138|ref|YP_003128507.1| Magnesium chelatase [Methanocaldococcus fervens AG86] gi|256794338|gb|ACV25007.1| Magnesium chelatase [Methanocaldococcus fervens AG86] Length = 246 Score = 39.4 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 64/175 (36%), Gaps = 20/175 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 I + V+ Q ++ V+D S SM ++ +++ A +I ++L + Sbjct: 44 IYLEREDIVEKVRQRKISSHILFVVDASGSM-----GAMKRMEAAKGAIISLL-----LD 93 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + G++ F E L L K+ G T +Y +FD Sbjct: 94 AYQKRNKIGMIAFRKDRAELILPFTSSVELGEKLLKDLPTGGKTPLADAFIKSYE-VFDR 152 Query: 274 QGMRQHCNTEDANYKKIIVFMTD-GENLSTKED--QQSLYYCNEAKKRGAIVYAI 325 + + I++ ++D N++ K D ++ C + ++G V I Sbjct: 153 EIRKNPNIIP------IMIVISDFKPNVAVKGDYIKEVFDACEKIVEKGINVVLI 201 >gi|78484444|ref|YP_390369.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362730|gb|ABB40695.1| Conserved hypothetical protein; predicted membrane protein with vWA domains and a TPR motif [Thiomicrospira crunogena XCL-2] Length = 651 Score = 39.4 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 18/99 (18%), Positives = 39/99 (39%), Gaps = 7/99 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + P T T A +D+++ LD SRSM D S + +A + ++ ++ Sbjct: 106 LAGPRTLVPAPQESTRAGVDILVALDTSRSM-LVQDVSPNRFLLAKSLVESLANRLEPSD 164 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 + GL+ ++ + L + + Q + + Sbjct: 165 ------RLGLMVYAGRPHLVSPLSFDRTLFQHYLDLMRP 197 >gi|16081608|ref|NP_393968.1| hypothetical protein Ta0490 [Thermoplasma acidophilum DSM 1728] gi|10639660|emb|CAC11632.1| hypothetical protein [Thermoplasma acidophilum] Length = 277 Score = 39.4 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 57/177 (32%), Gaps = 33/177 (18%) Query: 174 MMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEV----KLIPDVNNVVQSGLVTF-- 226 + ++D SRSM S +K A +IN + E+ V + G++ + Sbjct: 14 FIFLIDQSRSMIRKMAGGSESKAKEAADAINRQIGELIMRCTKNDGVRDYFYVGVIGYGG 73 Query: 227 ---------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVST---------------NSTPG 262 ++ L ++++ + T NS Sbjct: 74 EKGRASYLLDGEVVPVSRLAENPLRTEKRMMKVPDGSGGTVDVEYEFGVWFEPVANSDTP 133 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 + A N + + + + ++Y I++ +TDG++ + + G Sbjct: 134 MVRAMN--MAKEAIERWIDEHPSSYPPILINITDGQSTDGDPYPVAQSIMEMETEDG 188 >gi|86134837|ref|ZP_01053419.1| conserved hypothetical protein [Polaribacter sp. MED152] gi|85821700|gb|EAQ42847.1| conserved hypothetical protein [Polaribacter sp. MED152] Length = 289 Score = 39.4 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 64/161 (39%), Gaps = 28/161 (17%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L M++++DVS S E F ++ K D + + N+ V GLV F Sbjct: 72 EEERELTMLLMVDVSGS-EFFGTTNQFKKDTITEIAATL---AFSATQNNDKV--GLVLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S+ IE + + G SH+ R I+ L TN + LK+ + + Sbjct: 126 SDDIELYIPPKKGKSHVLRIIRELIEFKPKSKKTNISVALKFLSSVLK------------ 173 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 K+ IVFM + + +++ + G VY Sbjct: 174 ----KRAIVFML--SDFMDDDYEKTAKIAAKKHDLTGIRVY 208 >gi|291575288|gb|ADE10213.1| hypothetical protein [Actinoplanes liguriensis] Length = 432 Score = 39.4 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 15/148 (10%), Positives = 43/148 (29%), Gaps = 7/148 (4%) Query: 16 KGGMTILTAIFLP-----IIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 +G ++TA+ ++ + ++I++ ++ + L S D + A Sbjct: 12 RGEHGVITALVAVLAGAGVLLGMAALVIDIGALYAEREQLQSGADAASWKVAQACAGTAG 71 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF--VNDIDDIVRSTSLDIVVVPQNEGYS 128 + + +L + + + +T+ N + Sbjct: 72 RDLTSATCTVAAQRDNAQRYADRNAKDLVSDVQFCITTVSAAGVTTADAGCPSSWNTPVT 131 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVM 156 A P ++ NS + + Sbjct: 132 CPAPPSASGPYRYVEVRTSTRNSDNTSV 159 >gi|89100228|ref|ZP_01173095.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085078|gb|EAR64212.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 469 Score = 39.4 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 41/211 (19%), Positives = 69/211 (32%), Gaps = 29/211 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L++ I+LD S SM + D T +D+A +SI E++ +V V T S++ + Sbjct: 164 LNVEILLDASGSMANQIDGK-TMMDIAKESIGEFAEDLPEGANVALRVYGHKGTGSDQDK 222 Query: 232 EFFLLE---------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 E + L ++ + G T ++ A N + +G Sbjct: 223 ELSCKSNELIYEMGDYNGEKLNSTLQTVKPAGW-TPLAAAIEEAKNDLLSYKGDTNTN-- 279 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRV-IRSHEFLRAC 339 II ++DG D + IV IG + L+ Sbjct: 280 -------IIYLVSDG---VETCDGNPVEQAKSLGDSDIQPIVNVIGFDLDAEGQNQLKEV 329 Query: 340 ASP--NSFYLVENPHSMYDAFSHIGKDIVTK 368 A F N + + K I K Sbjct: 330 AKSAKGLFSNARNQSELKEELDK-AKQIAKK 359 >gi|282897345|ref|ZP_06305347.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] gi|281197997|gb|EFA72891.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] Length = 449 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 64/182 (35%), Gaps = 21/182 (11%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++LD S SM KI AI +I K+ D Q +V F + Sbjct: 128 IIVLLDFSGSMNQIDSGGSKKIAGAINAIREFT---KVSSDRGGDTQISVVPFGEAGKNC 184 Query: 234 FLLEWGVSHLQR--------------KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR-- 277 L + + L+ G STN LK A + + + R Sbjct: 185 PEYTVNKDTLDKFLSASDFKLQNSLEYLSGLNPCG-STNLYQPLKKALEFLGNPEDPRFT 243 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFL 336 N+ + N + I+ ++DG + + E Q + + V+ +G + S + Sbjct: 244 LPENSSEPNPRLSIILLSDGYHNAMNEFQDFNELKSLLQSYENITVHTLGYGLTPSQLGI 303 Query: 337 RA 338 + Sbjct: 304 KY 305 >gi|322418525|ref|YP_004197748.1| hypothetical protein GM18_0996 [Geobacter sp. M18] gi|320124912|gb|ADW12472.1| hypothetical protein GM18_0996 [Geobacter sp. M18] Length = 389 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 8/41 (19%), Positives = 20/41 (48%) Query: 25 IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 I L + +V G+ I++ +++ + L + + + A I Sbjct: 15 IMLVVFLVVTGLAIDIGYMYVSEEDLQHSAEMAALTGAQTI 55 >gi|126342421|ref|XP_001375707.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 895 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM + T +D+A ++ L+ ++ + + LVT + Sbjct: 4 LLFLIDTSASMNQRTELGTTYLDIAKNAVEIFLK-LRARDPASRGDRYMLVTSEDPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIF------DMQGMRQHCNTEDA 285 W +H ++K L G++T LK A++ + + Q N Sbjct: 62 IKAGWKENHATFMTELKNLQASGLTTL-GQALKSAFDLLNVNRLVSGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDG 297 II F TDG Sbjct: 121 EPALIIAF-TDG 131 >gi|300868463|ref|ZP_07113083.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300333545|emb|CBN58271.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 220 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 17/167 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ +D + + +++ V+ +VTF N+I+ Sbjct: 20 VLLLDTSGSMQG------APLDALNEGLQTFRDDLTRDELAKKRVEVAIVTFDNQIKVVQ 73 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + L T+ G+ YA + I + Y + VFM Sbjct: 74 DFVT----ADQFESPLLTAQGQTHMGAGISYALDMIAAR---KSEYRNNGITYYRPWVFM 126 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 TDGE E +Q + E + +A+G+ + Sbjct: 127 ITDGEPQGESEQLVEQAAQRIREEEANKRVAFFAVGVEGANISRLAQ 173 >gi|255014577|ref|ZP_05286703.1| von Willebrand factor type A [Bacteroides sp. 2_1_7] Length = 250 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 61/207 (29%), Gaps = 16/207 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + S ++ + V+D S SM KI +I +L + I D N Sbjct: 2 GLLDNESIPRRKMILFFVIDTSGSM------IGKKIGSVNDAIENVLPMIGEISDENPDA 55 Query: 220 QSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + +E W + I + T+ + N+ G Sbjct: 56 EINVAA----LEFSTGTRWLYDEPKEAKDFIWQKVEANGLTSLGEACEE-LNKKLSRSGG 110 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ + I+ ++DG E N K + AI I E L Sbjct: 111 FMPTSSGSGYFSPAIILLSDGGPTDNFEGGLKTLQGNSWFKHAIKI-AIAIGDDADKEVL 169 Query: 337 R-ACASPNSFYLVENPHSMYDAFSHIG 362 + S + V N ++ I Sbjct: 170 KQFTGSSEAVITVHNIEALKKMIRIIA 196 >gi|213511927|ref|NP_001135173.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar] gi|197632439|gb|ACH70943.1| general transcription factor IIE, polypeptide 2-2 [Salmo salar] Length = 393 Score = 39.4 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 35/157 (22%), Positives = 62/157 (39%), Gaps = 15/157 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+D SR+ME D ++ +K + +EE D N + Q G++ + K + Sbjct: 61 LYVVIDTSRTMEDQ-DLKPNRLTSTLKLVEYFVEEY---FDQNPISQVGII--TTKNKRA 114 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNST-PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + ++ I L K ST P L YN I +H ++ + +I+ Sbjct: 115 EKLTDLAGNPKKHIDALKKAKDSTCGGEPSL---YNSISLAMQTLKH-MPGHSSREVLII 170 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 F +L+T + K V IG+ Sbjct: 171 F----SSLTTCDPANIYELVKTLKALKIRVSVIGLSA 203 >gi|319952927|ref|YP_004164194.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421587|gb|ADV48696.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 212 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 51/167 (30%), Gaps = 16/167 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + I+LD S SM I+ + M+ ++ P ++TF + Sbjct: 3 RLPVYILLDTSGSM------MGEPIEAVKNGVQVMISSLRQNPQAIESAFLSIITFDSSA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L S I+ +T+ L+ I + + D Sbjct: 57 RQIVPLTDLASFQMPDIQA----TGTTSLGEALELVSTCIDNEVASTTSESKGDWKPLVF 112 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 I MTDG D K+R A + A L+ Sbjct: 113 I--MTDG----IPTDDMQKGLSELKKRRTAYIVACAAGNGADSTLLK 153 >gi|298372682|ref|ZP_06982672.1| von Willebrand factor, type A [Bacteroidetes oral taxon 274 str. F0058] gi|298275586|gb|EFI17137.1| von Willebrand factor, type A [Bacteroidetes oral taxon 274 str. F0058] Length = 293 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 44/95 (46%), Gaps = 6/95 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS+S + + + I + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSKS------NLFGSGEQLKRDIINEIAATLAFSTIQNNDKVGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 S+KIE+F + G H+ I+ L F +NST Sbjct: 126 SDKIEKFIPPQKGRKHVLYIIRELIDFKPESNSTD 160 >gi|118351474|ref|XP_001009012.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila] gi|89290779|gb|EAR88767.1| hypothetical protein TTHERM_00259660 [Tetrahymena thermophila SB210] Length = 2382 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 54/146 (36%), Gaps = 17/146 (11%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 S +K + +IV D S SME K K + ++ + + Sbjct: 2223 SIIKKCKSQINPVHFIIVFDESGSMEG------DKWMSLRKELLDFIDN-RSRYTAQDF- 2274 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 L+ F++ I+ + LE +++K+ + G +TN + L+ +I + Sbjct: 2275 -ITLIGFNDSIKLYTKLEKLNEQIKQKVPEKNMNG-NTNFSAPLQQVL-KILSQDNCKTF 2331 Query: 280 CNTEDANYKKIIVFMTDGENLSTKED 305 +I F++DGE D Sbjct: 2332 NKNN------VIFFLSDGEANKPDTD 2351 >gi|52082207|ref|YP_080998.1| YwmC protein [Bacillus licheniformis ATCC 14580] gi|52787598|ref|YP_093427.1| YwmC [Bacillus licheniformis ATCC 14580] gi|319648080|ref|ZP_08002297.1| YwmC protein [Bacillus sp. BT1B_CT2] gi|52005418|gb|AAU25360.1| YwmC [Bacillus licheniformis ATCC 14580] gi|52350100|gb|AAU42734.1| YwmC [Bacillus licheniformis ATCC 14580] gi|317389715|gb|EFV70525.1| YwmC protein [Bacillus sp. BT1B_CT2] Length = 228 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 27/233 (11%) Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKI 195 K+ L T + S +V DA ++ I+LD S SM ++K Sbjct: 4 KMTLGILTAMVLSLGSPAFA--AEKKQEVTVAEDAP-NVAIMLDASGSMAKKIGG-VSKY 59 Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQS-GLVTFSNKIEEFFLLE--WG--VSHLQRKIKYL 250 ++A + +++ + V S G S K++ + +G Q L Sbjct: 60 ELAKNEAFSFGSKLENANVLMRVFGSEGNNKNSGKVQSCNAIRGVYGFQTYDEQSFRNSL 119 Query: 251 SKFGVS--TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + G + T L+ A N + + D N K ++ +TDGE Sbjct: 120 NGIGPTGWTPIANALQDAKNAL----------DQLDNNGKNVVYLLTDGE---ETCGGNP 166 Query: 309 LYYCNEAKKRGAIVYAIGIRVIR---SHEFLRACASPNSFYLVENPHSMYDAF 358 + E +K A+V IG A A ++ + + + F Sbjct: 167 VKVATELRKSNAVVNVIGFDYEGDFHGQLTSIAAAGGGEYFQAKTKNDIKRIF 219 >gi|84387242|ref|ZP_00990263.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] gi|84377889|gb|EAP94751.1| hypothetical protein V12B01_22471 [Vibrio splendidus 12B01] Length = 142 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 54/156 (34%), Gaps = 18/156 (11%) Query: 11 FFYNY--KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 N +G + AI P++ LG++IE+ ++ V+ + +A +++ + Sbjct: 2 MLSNKHQRGFAAVEMAIATPVLLFFLGLVIELGNVLIHYNVISKSVQNGARYAVSEVYDT 61 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS 128 G + +N + + T+ D+ V P + Sbjct: 62 KGGTIAP---------------TLEIQNVVVYGQSSVGTAVLSTLTTADVTVTPPSIDSY 106 Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKV 164 + Y + IP S I + +TS ++V Sbjct: 107 VRVSVTYDY-VPLFLSIPLSATSFSIPLSVTSVMRV 141 >gi|66819075|ref|XP_643197.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] gi|60471310|gb|EAL69272.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] Length = 2026 Score = 39.4 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 62/199 (31%), Gaps = 29/199 (14%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + Q LD++ +D + SM I ++ I SI +E+ K+ V Sbjct: 1681 MSRDHQVSKSLDLVFCIDSTGSMS----GEIKEVKEKITSIVDRIEKAKV------NVNV 1730 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHC 280 GLV +++ + + + + + + + + + Sbjct: 1731 GLVFYNDHEIIYLPTDKSPTIVYEFTDDIPLVKKNIGTIKAFGGNDHPEAVADGLYEVSK 1790 Query: 281 NTEDANYKKIIVFMTDGE-----------------NLSTKEDQQSLYYCNEAKKRGAIVY 323 +N KI + +TD S+ +RG Y Sbjct: 1791 LKFRSNASKICILITDAPAHGFDSKLNETYSEDYHPNGCPCGHDSIELVRSLVRRGITFY 1850 Query: 324 AIGIRV-IRSHEFLRACAS 341 + R S ++L A AS Sbjct: 1851 TVSCRPTPNSSDYLNAIAS 1869 >gi|312868536|ref|ZP_07728736.1| LPXTG-motif cell wall anchor domain protein [Streptococcus parasanguinis F0405] gi|311096281|gb|EFQ54525.1| LPXTG-motif cell wall anchor domain protein [Streptococcus parasanguinis F0405] Length = 474 Score = 39.4 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 57/171 (33%), Gaps = 17/171 (9%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + V + D++ V+D S SM+ ++++ ++ +++++A N Sbjct: 158 EADVVNKKEVQGTSDIVFVIDKSFSMDDHINNTMKNVETFVRNLSA----------KNIQ 207 Query: 219 VQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GLV + F + + I L + + A + + Sbjct: 208 ARLGLVEYGRSDRTKYHDFNGSKFTTDPESFISALKTI----KTFGADENATVPLHHIAT 263 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + AN + +TD + TK+ + G + +G Sbjct: 264 SADYNWGTGANNHRFAFLITDEDIDLTKDTPTKEATLKALQDAGISLTVVG 314 >gi|47202242|emb|CAF87956.1| unnamed protein product [Tetraodon nigroviridis] Length = 207 Score = 39.4 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 45/122 (36%), Gaps = 14/122 (11%) Query: 207 EEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + V L+ + ++ F + ++ + ++ + G +T+ G +A Sbjct: 33 DMVILVDVLTRKLRLWFPAFKHLVQANV---RNKKIFKDAVQQMQAKG-TTDYKSGFHFA 88 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 +NQ+ + N AN KII+ TDG ++ + N+ V+ Sbjct: 89 FNQLLNKT------NVPRANCNKIIMLFTDGGEDRAQDVFMQYNWPNKT----VRVFTFS 138 Query: 327 IR 328 + Sbjct: 139 VG 140 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 7/80 (8%), Positives = 29/80 (36%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R +G + + A + ++ + ++ + + L + +D + A ++ N+ Sbjct: 6 RKELKRSRGAVLVWVAASMVVLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAWELGNK 65 Query: 69 GNGNNRKKLKGGDILCRIKN 88 ++ + + + Sbjct: 66 SASQALREASAAQVAGSVAS 85 >gi|187608071|ref|NP_001119900.1| integrator complex subunit 6 [Danio rerio] gi|169158663|emb|CAQ14208.1| novel protein similar to vertebrate integrator complex subunit 6 (INTS6) [Danio rerio] Length = 854 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 56/142 (39%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM + +D+A ++ L+ ++ + + LV+F Sbjct: 4 LLFLIDTSASMNQRSHLGTSYLDIAKGAVETFLK-LRSRDPASRGDRYMLVSFEEAP-AG 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W SH +++ L G+ T+ L+ A+ + ++ + + + Sbjct: 62 IKAGWKDSHATFMTELRNLQAVGL-TSFGQALRTAF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDGENLSTKEDQQ 307 IIV +TDG L+ Q Sbjct: 120 LEPAIIVAITDGSKLTGSSGVQ 141 >gi|47226572|emb|CAG08588.1| unnamed protein product [Tetraodon nigroviridis] Length = 296 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 61/177 (34%), Gaps = 12/177 (6%) Query: 61 AATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 + +++ + KG C + + + L+D+ + D+ ++ +TSL + + Sbjct: 65 SCRLLVSAPWSGYSRNRKGDLYKCPVSESRTSCDKLNLQDSLSIPDVKNVNDNTSLGLTL 124 Query: 121 VPQNEGYSISAISRYKIPLKFCTFIPW-YTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 + G + + F P + P + +D++IVLD Sbjct: 125 AQNDGGDLLMCGPLWGQQCSSQRFYPGICARLSPLFQPQPAFSPAVQVCGGPMDVVIVLD 184 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 S S+ + + I ++ + + P Q ++ + + F L Sbjct: 185 GSNSIYPW--------EPMTAFIQKLIPSLDIGP---QATQLSIIQYGVDPKFEFRL 230 >gi|324112804|gb|EGC06780.1| von Willebrand protein type A [Escherichia fergusonii B253] Length = 530 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 34/237 (14%), Positives = 81/237 (34%), Gaps = 28/237 (11%) Query: 142 CTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKS 201 T PW + + I ++ + +++ ++D S SM S ++ + Sbjct: 140 LTPAPWNEKHTLLRLDI-AANDIARSKLPPANLVFLIDTSGSMNSD-----ERLPLIKSL 193 Query: 202 INAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNS 259 + ++ E++ + +VT++ G S + ++ + G TN Sbjct: 194 LKLLVNELRDQD------RISIVTYAGSARLLLSSTSG-SEKNTILNAIANLQAGGGTNG 246 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 G+ AY Q + N I+ TDG+ + + ++ G Sbjct: 247 GAGVAMAYEQ------AQAGYIKGGVNR---ILLATDGDFNIGDDPSSVEDLVKKQRESG 297 Query: 320 AIVYAIGIRVIRSHE--FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + +G+ +E ++ + N Y S+ +A + ++ + K Sbjct: 298 ITLSTLGVGDNNYNEAMMVKIADTGNGNYSY--LDSLSEAQKVLSNEMNQTLVTVAK 352 >gi|294665358|ref|ZP_06730648.1| hypothetical protein XAUC_13870 [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292604861|gb|EFF48222.1| hypothetical protein XAUC_13870 [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 277 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 24/205 (11%), Positives = 63/205 (30%), Gaps = 32/205 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK----IDMAIKSINAMLEEVKLIPDVN 216 S + + ++D S SM+ + +K D+ +++ ++ Sbjct: 2 SYTAEISRTSPTAFLFLVDQSGSMQEVMTNGKSKAQFVADVLNRTLATLITRCTKSEGTR 61 Query: 217 NVVQSGLVTFSNKIEEF--------------FLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 N + G++ +SN + +E ++ ++K + S Sbjct: 62 NYFEVGVLGYSNSDAQNGLAGALSHTVLHPISQIEANPLRVEDRVKRVDDGAGGLVSQSI 121 Query: 263 LKYAY------------NQIFD-MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + N I + C+ +Y ++ +TDGE+ + + Sbjct: 122 KFPVWFEARASGGTPMCNAITSAAAELVAWCDAHPDSYPPTVLHVTDGESTDGDPEGLAQ 181 Query: 310 YYCNEAKKRG-AIVYAIGIRVIRSH 333 A G +++ + + Sbjct: 182 QLQQIATNDGNVLLFNLHVSASGGD 206 >gi|171057436|ref|YP_001789785.1| outer membrane adhesin-like protein [Leptothrix cholodnii SP-6] gi|170774881|gb|ACB33020.1| outer membrane adhesin like proteiin [Leptothrix cholodnii SP-6] Length = 1598 Score = 39.4 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 35/139 (25%), Positives = 64/139 (46%), Gaps = 14/139 (10%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 S T AR +++I LDVS SM++ + T++ AI++I +L+ + + DV V+ Sbjct: 809 DSVSATAARTNLLISLDVSGSMDTADGVAGATRLASAIEAIGQLLDRHEAMGDVA--VR- 865 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 LVTFS+ + + V+ ++ + L G T+ L A + + Sbjct: 866 -LVTFSSTAQAIGEVWTDVATARQLLASLQADG-GTHYDSALAAAQSAFLSAGRL----- 918 Query: 282 TEDANYKKIIVFMTDGENL 300 A + + F++DGE Sbjct: 919 ---AGAQNVSYFLSDGEPN 934 >gi|332884780|gb|EGK05036.1| hypothetical protein HMPREF9456_03189 [Dysgonomonas mossii DSM 22836] Length = 342 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 65/175 (37%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 KV ++++I +DVS SM + D S +++ A + + +++E K + Sbjct: 81 GTKVEKVDKKGIELVIAIDVSNSMMAE-DISPSRLVKAKQILTRIIDERKND-------K 132 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 +V F+ + L + ++ ++ T + + + Sbjct: 133 VAIVVFAGEAFIQLPLTPDNQSAKLFLETINPSLVPVQGTAIGSAIDMSMS--------- 183 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 C + DA+ K IV +TDGE ++ + +G V +GI Sbjct: 184 --CFSNDADIDKAIVLITDGEGHEGNAEEAAARA----ASKGVHVNVVGIGTAEG 232 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 12/96 (12%), Positives = 34/96 (35%), Gaps = 22/96 (22%) Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR----------- 331 E ++++ +TDG N + + + L A+++ ++ IG+ Sbjct: 8 ERPAESRLLILLTDGANTA--GEIEPLKAAELAQQQQIKIHTIGVGANEMLVPGLFSSRR 65 Query: 332 -------SHEFLRACAS--PNSFYLVENPHSMYDAF 358 + L+ AS ++ ++ + + Sbjct: 66 VNPSADLDEDTLKKIASQTGGQYFRAQDTEQLQQIY 101 >gi|326476599|gb|EGE00609.1| hypothetical protein TESG_07910 [Trichophyton tonsurans CBS 112818] Length = 309 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 59/222 (26%), Gaps = 20/222 (9%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSS 161 F++ + + Y P + + + +S Sbjct: 15 SFLSRNKQSPTAQKSTYSNNKSTGSSHLDLPPAYTPVSATSATAPSFPAASSEGSSVPAS 74 Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 A+ D + ++D S SM + K+ L ++ I + Sbjct: 75 PDRPDSFLAQFDTVFLIDDSGSMR----------GASWKATENALAKIAPICTAQDKDGV 124 Query: 222 GLVTFSNKIEE---FFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQG 275 + +++ + + H++ +S G T LK + Sbjct: 125 EVYFLNHRSDSSNGTYSNVRDTGHVREVFTAVSPCGGTPTGTRLNHILKPYLKGFEEASK 184 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++A I+ +TDG D AKK Sbjct: 185 SMSPDERDNAVRPLNIIVITDG----VPTDDVESVIVQAAKK 222 >gi|221042196|dbj|BAH12775.1| unnamed protein product [Homo sapiens] Length = 645 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|145297003|ref|YP_001139824.1| hypothetical protein cgR_2902 [Corynebacterium glutamicum R] gi|140846923|dbj|BAF55922.1| hypothetical protein [Corynebacterium glutamicum R] Length = 230 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 63/206 (30%), Gaps = 46/206 (22%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 M+VLD S SM + ++ D A + I+ + L GLVT+ E Sbjct: 1 MLVLDSSGSMVTPDAGGQSRSDAANQFIDELAGTFDL----------GLVTYGGNTGETP 50 Query: 235 L--------------LEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 G + L++ I L G T L+ A ++ + Sbjct: 51 EDYEAGCQDITVVRGPTNGQAEQLKQHIDGLQPRGY-TPIGESLRKAVAELPEGGSGT-- 107 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA--IVYAIGIRVIRSHEFLR 337 I+ ++DG T E +G ++ +G + S + Sbjct: 108 -----------ILLVSDGIATCTPP--PVCEVAAELADQGVDLVINTVGFTLDESARAVL 154 Query: 338 AC---ASPNSFYLVENPHSMYDAFSH 360 C A ++ + S+ Sbjct: 155 ECIAQAGNGTYADASDADSLVAELKQ 180 >gi|47209360|emb|CAF94677.1| unnamed protein product [Tetraodon nigroviridis] Length = 580 Score = 39.4 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 54/132 (40%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S SM T +D+A ++ ++ ++ + + LVTF + Sbjct: 4 LLFLLDTSASMNQRTYLGTTFLDVAKGAVEIFMK-LRARDPASRGDRYMLVTFDDSPYGV 62 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ A+ + ++ + + + Sbjct: 63 KA-GWKENHATFMCELKNLQASGLTTL-GYALRAAF-DLLNLNRLVSGIDNYGQGRNPFY 119 Query: 288 --KKIIVFMTDG 297 +I+ +TDG Sbjct: 120 LEPAVIITITDG 131 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 72/193 (37%), Gaps = 30/193 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++V+DVS SM +ID +S+ A ++ DV Q G+V FS+ Sbjct: 326 FVVVMDVSDSM-----KQCNRIDKLGESVRAWIKN-----DVPTGSQLGMVMFSSTAHIV 375 Query: 234 FLLEWGVSHLQRKIKYLSKFGVS----TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L+ +S ++ + + + K T GL A + + + Sbjct: 376 SELQ-VISDMKIRQEMMKKVPKDLYSITCIGCGLDLAVQMLQEKGNNKTGG--------- 425 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR--VIRSHEFLRACASPNSFY- 346 IIV +TDG N + D + + K G V + + E L S+Y Sbjct: 426 IIVLVTDGRNSAGYLDISDVE--EDIVKAGIRVVTVAFGSEADSNIERLADVTGGKSYYI 483 Query: 347 -LVENPHSMYDAF 358 ++ ++ AF Sbjct: 484 KDGDSSEALQRAF 496 >gi|282896738|ref|ZP_06304746.1| Magnesium chelatase ATPase subunit D [Raphidiopsis brookii D9] gi|281198456|gb|EFA73344.1| Magnesium chelatase ATPase subunit D [Raphidiopsis brookii D9] Length = 669 Score = 39.4 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 60/182 (32%), Gaps = 22/182 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + ++ A R P + R IV K + L ++ V+D S Sbjct: 424 GKVKRIAVDATLRAAAPYQ-KARRLRQPGKRVIVEQGDMRTKRLVRKAGAL-VVFVVDAS 481 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGV 240 SM ++ ++ A ++ +L E N Q L+ F + E + Sbjct: 482 GSM------ALNRMQSAKGAVMQLLTE-----SYQNRDQIALIPFRGEQAEVLLPPTRSI 530 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + +++ L G S + + + + + +IV +TDG Sbjct: 531 ALAKNRLEKLPCGGGSPLAHG--------LTQAVRVGVNAQMGGDIGQVVIVAITDGRGN 582 Query: 301 ST 302 Sbjct: 583 IP 584 >gi|299138185|ref|ZP_07031365.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600115|gb|EFI56273.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 382 Score = 39.4 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 65/166 (39%), Gaps = 14/166 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE-EVKLIPDVNNVVQSGLVTF 226 TD L + +++D S+S T ID + L+ + ++ +V F Sbjct: 80 TDVPLTLGLLVDTSQS-------QRTVIDSERSASGTFLDKMLAPASANRESDKAFVVQF 132 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY----NQIFDMQGMRQHCNT 282 + ++E S L++ +K L ST+S+ G + ++D + T Sbjct: 133 AREVELLQDPTDSRSKLKQALKELDTTAPSTSSSTGDDSGHAHGGTTLYDAVFLSADEVT 192 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ ++ +TDG + + A++ I+YAI + Sbjct: 193 SKQKGRRALILLTDG--VDRNSKESIAEAIEAAQRADTIIYAIYFK 236 >gi|326804623|ref|YP_004327494.1| putative uncharacterised protein [Salmonella phage Vi01] gi|301795273|emb|CBW37991.1| putative uncharacterised protein [Salmonella phage Vi01] Length = 738 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 61/198 (30%), Gaps = 17/198 (8%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +IV+D+S SM + + S+ + V ++ ++ G V +I Sbjct: 20 VIVVDISGSMYRSLPEVRKHLKENLPSLVKPEDTVSILY-FSSRGDFGTVFAGRQINSAT 78 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 L S + + I + T LK A + N + FM Sbjct: 79 DL----SEINKLIDRFLQPSGCTGFVEPLKLAAETALSLNKPGYVNN---------LAFM 125 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPH 352 TDG + + ++ L K + I + E L + S + E Sbjct: 126 TDGYDNCWRSNEI-LAAAEVLPKAFDNITFIEYGWYCNRELLALMSERSGATHVFAEGQT 184 Query: 353 SMYDAFSHIGKDIVTKRI 370 ++ K K + Sbjct: 185 EYQTELENVLKSSTPKVV 202 >gi|196007642|ref|XP_002113687.1| hypothetical protein TRIADDRAFT_57368 [Trichoplax adhaerens] gi|190584091|gb|EDV24161.1| hypothetical protein TRIADDRAFT_57368 [Trichoplax adhaerens] Length = 921 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 60/196 (30%), Gaps = 27/196 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++VLD S SM ++ + + + G+V+F+++ Sbjct: 306 VLVLDTSGSM------QGVRLQQMRLATTNFI-----LNSAVEGEFLGIVSFNSRTTIMS 354 Query: 235 LLEW--GVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + I + V T+ GL A N + C + Sbjct: 355 SLTKIVDQNVKNNLIAQIPSAAVGLTSVGGGLLSALNMLKSSVNQSFPCGGR-------M 407 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE 349 + ++DGE + N+ IV+ + + S + + Y Sbjct: 408 IVLSDGEENVGPYISSVI---NDLVSNQIIVHTVSLGSSASERLQNVSYSTGGKAIYAPS 464 Query: 350 -NPHSMYDAFSHIGKD 364 + ++ AF I + Sbjct: 465 GDIATLNSAFLSISQQ 480 >gi|114566000|ref|YP_753154.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114336935|gb|ABI67783.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 592 Score = 39.4 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 54/166 (32%), Gaps = 31/166 (18%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK- 229 +D+ +++D S SM + + + N +L + + +VTF + Sbjct: 409 PIDVCLLIDASGSM-----AGDKRQAACFLAQNLLL---------SGKEKVAVVTFQERS 454 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 E + L + + +S G+ T G+ A N I + Sbjct: 455 SEVVVPFTRNQNILNKGLSTISPAGL-TPMADGIMTAVNLI-----------KNNRVRNP 502 Query: 290 IIVFMTDGENLST----KEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 ++V ++DG +L ++ IG+ R Sbjct: 503 LLVLISDGIPNIPLWTLDAQADALEAATHIRENKIHFICIGLESNR 548 >gi|332708512|ref|ZP_08428486.1| protoporphyrin IX magnesium-chelatase [Lyngbya majuscula 3L] gi|332352609|gb|EGJ32175.1| protoporphyrin IX magnesium-chelatase [Lyngbya majuscula 3L] Length = 669 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 37/257 (14%), Positives = 80/257 (31%), Gaps = 44/257 (17%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 I+ + ++ A R P + + N R I+ K ++ L ++ V Sbjct: 419 ILPKGKVRRIAVDATLRAAAPYQ-KARRQRHPNRRVIIEQGDLRSKRLARKAGSL-VVFV 476 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLL 236 +D S SM ++ ++ A ++ +L E N Q L+ F + + Sbjct: 477 VDASGSM------ALNRMQSAKGAVMQLLTEA-----YENRDQVSLIPFRGEQADVLLPP 525 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 ++ +R+++ L G S + + + + + +IV +TD Sbjct: 526 TRSIALAKRRLERLPCGGGSPLAHGLTQA----VHVGMNAQLSGDIGQV----VIVAITD 577 Query: 297 GENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRSHEFL 336 G K D + L + G + I + + E Sbjct: 578 GRGNIPLARSLGEPLPEGEKPDIKGELLEIAARIRANGVQLLVIDTESKFVSTGFAKELA 637 Query: 337 RACASPNSFYLVENPHS 353 + +Y + Sbjct: 638 KQAG--GKYYHLPKATD 652 >gi|327390873|gb|EGE89213.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA04375] Length = 886 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|319783910|ref|YP_004143386.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169798|gb|ADV13336.1| hypothetical protein Mesci_4225 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 393 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 48/183 (26%), Gaps = 27/183 (14%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + + + A+ LP++ G +E S+ ++ L + Sbjct: 1 MRRSTGANVATIFALTLPVVVGAAGFGVETSYWYYNSLRLQATA---------------- 44 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 L +I + + + N + + P ++ Sbjct: 45 ----DAAAYAGALEQISGSDKPTIVAAATQSAASNGLGSGTIVVNTPPASGPNTAKKAVE 100 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 I K+ F + V TDA M+ LD S S F Sbjct: 101 VIVGQKLDRMFTLIFTQDKVPEQ-------ARAVAVITDASSACMLALDPSASQAVLFSG 153 Query: 191 SIT 193 S + Sbjct: 154 STS 156 >gi|198434104|ref|XP_002126242.1| PREDICTED: similar to calcium activated chloride channel [Ciona intestinalis] Length = 624 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 30/130 (23%), Positives = 44/130 (33%), Gaps = 23/130 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 ++V+DVS SM+S T + + + G+V FSN Sbjct: 53 VLVIDVSGSMDS-VAGGQTLMQRMKTFARLFINKAAT------YSWLGIVAFSNDAHVVL 105 Query: 235 LLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L G R ++ L G+ TN + GL A + I D R Sbjct: 106 RLTQMNKQGKEKATRAVQTLRTEGL-TNISAGLFLALDLIKDRSHSRDS----------- 153 Query: 291 IVFMTDGENL 300 I+ TDG Sbjct: 154 IILFTDGAAN 163 >gi|170077105|ref|YP_001733743.1| hypothetical protein SYNPCC7002_A0479 [Synechococcus sp. PCC 7002] gi|169884774|gb|ACA98487.1| conserved hypothetical protein [Synechococcus sp. PCC 7002] Length = 210 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 60/171 (35%), Gaps = 17/171 (9%) Query: 173 DMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 D +++D S SM S T+ D A S A+ + + I V FS + Sbjct: 14 DYTLIIDKSGSMYTCDLPTSDKTRWDAAQDSTLALAQTCEKIDPDGITVYL----FSGRF 69 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + + + ++ ++ T+ L+ A + F + R + Sbjct: 70 RRY---DNVTADKVTQVYAENEPMGRTDLAAVLQDALDNYFQRKE-RGEAKPNGETF--- 122 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRV-IRSHEFLRA 338 + +TDGE +K + + + + I + ++ +L+A Sbjct: 123 -LVITDGEPDDSKAVMRVIIDAANKIERDEEIGISLIQVGQDRKAKAYLKA 172 >gi|154432962|gb|ABS82141.1| ancillary pilus subunit [Streptococcus pneumoniae] Length = 886 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|149410925|ref|XP_001511539.1| PREDICTED: similar to Leucine-rich repeat and calponin homology domain-containing protein 3 isoform 1 [Ornithorhynchus anatinus] Length = 898 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 57/142 (40%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ ++ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVEIFMK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ ++ + ++ + + + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDGENLSTKEDQQ 307 I++ +TDG L+ Q Sbjct: 120 LEPSILITITDGNKLTNTAGVQ 141 >gi|149410927|ref|XP_001511563.1| PREDICTED: similar to Leucine-rich repeat and calponin homology domain-containing protein 3 isoform 2 [Ornithorhynchus anatinus] Length = 861 Score = 39.4 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 57/142 (40%), Gaps = 12/142 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM D + +D+A ++ ++ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTDLGTSYLDIAKGAVEIFMK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ ++ + ++ + + + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDGENLSTKEDQQ 307 I++ +TDG L+ Q Sbjct: 120 LEPSILITITDGNKLTNTAGVQ 141 >gi|329898072|ref|ZP_08272282.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] gi|328920971|gb|EGG28395.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] Length = 490 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 69/201 (34%), Gaps = 37/201 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +++DVS SM+ ++ ++ V+L+P + ++G+ F + + Sbjct: 28 DVRLLIDVSGSMKQSDPNN------LRAPALELI--VRLLPPGS---KAGVWLFGDGVRT 76 Query: 233 FFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 W ++ ++ + G TN L A + N D +Y Sbjct: 77 LLPHVVIDEAWRDKAVES-VRVIDNSGQRTNIPAALDAALYDL----------NRLDPDY 125 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKR--------GAIVYAIGIRVIRSHEFLRAC 339 + +V +TDG+ + + + G V+ I + FLR Sbjct: 126 RTSVVLLTDGKVDVSSSAVANANAARMLLEEQAPILGQTGVPVHTIALSNEADWAFLRKL 185 Query: 340 ASPNSFYL--VENPHSMYDAF 358 A E+ S+ F Sbjct: 186 AESTQGLAEKAESADSLTRVF 206 >gi|307943680|ref|ZP_07659024.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] gi|307773310|gb|EFO32527.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] Length = 746 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 60/171 (35%), Gaps = 24/171 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ VLD S SM +D + ++A L+ ++ +++ T + Sbjct: 344 LVFVLDTSGSMGGQP------MDASKSFMHAALDGLRENDQF-RILRFANNTSAFAKSAM 396 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + + LS G + + A N FD+ +I+VF Sbjct: 397 PATRANIKAGKNFVTGLSARGGTE-----MNNAINAAFDL--------PPVPGTMRIVVF 443 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 +TDG D++ + + + A ++A GI + L A Sbjct: 444 LTDG---YIGGDREVIQTVYD-RIGNARIHAFGIGKAINRYLLEGLAREGR 490 >gi|303257915|ref|ZP_07343924.1| putative von Willebrand factor type A domain protein [Burkholderiales bacterium 1_1_47] gi|302859258|gb|EFL82340.1| putative von Willebrand factor type A domain protein [Burkholderiales bacterium 1_1_47] Length = 742 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 68/202 (33%), Gaps = 36/202 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD+ I+ D S SM ++I+ +++ + + + Q L F + + Sbjct: 250 PLDLSILADCSGSMGG------SRINWMKQALKKLNSKFESAD------QVSLTAFGSSV 297 Query: 231 E-----EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 E E + + YL T + N Sbjct: 298 ELKLLPNRMKFEEFQGNFADAVSYLDADLGGTELEQAIVSCMNLF------------PGR 345 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 N + ++ +TDGE + + EAK+R ++ +GI + L A + Sbjct: 346 NKQSALLLVTDGEVWD--SCKPMIMAALEAKRR---IFILGIGMAPYSNLLVELAEKTGG 400 Query: 344 SFYLVENPHSMYDAFSHIGKDI 365 ++ V N + + A + K I Sbjct: 401 AYEAVYNWYDIEGAVERMLKRI 422 >gi|288932694|ref|YP_003436754.1| hypothetical protein Ferp_2362 [Ferroglobus placidus DSM 10642] gi|288894942|gb|ADC66479.1| conserved hypothetical protein [Ferroglobus placidus DSM 10642] Length = 403 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 43/321 (13%), Positives = 93/321 (28%), Gaps = 57/321 (17%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNEL 98 E+ FF + L +I+ ++ T+ ++ + +K + Sbjct: 115 ELLDYFFEEA-LKELIEMGIIEGVTKRFFRRKVKFSRQAERIIAQKVMKEVSKEA----- 168 Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 + + + +V +S I IP + Sbjct: 169 -KGYYAESEGETLSYIPGYELVEYDEYLHSYDLI---DIPETMIRAAKNED----FEIRE 220 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 V N + + ++++DVS SM KI AI++ A+ ++ D V Sbjct: 221 KDIVSRNPKKVGKRHFVMLIDVSDSMRG------KKIVGAIEAALALKMSIRKGFDDLEV 274 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 F+++ ++ T+ LK A N + G Sbjct: 275 F-----VFNHR----------TEKIREGDIVNVDVEGRTDIALALKTARNALRGKDGA-- 317 Query: 279 HCNTEDANYKKIIVFMTDGENLST-----KEDQQSLYYCNEAKKRGAIVYAIGIRVIRS- 332 K ++ +TDGE ++ + ++ + K + + + Sbjct: 318 ----------KYVILITDGEPTASYNPLIPPWKMAVIESAKLKDEDINLNVLMLNDDSRF 367 Query: 333 ----HEFLRACASPNSFYLVE 349 L+A + FY Sbjct: 368 YALCERMLKAAGKGSIFYFPN 388 >gi|291612496|ref|YP_003522653.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291582608|gb|ADE10266.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 754 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 56/158 (35%), Gaps = 23/158 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + +M++LD+S S+ E S T ++++ ++++ + V+ + D Sbjct: 556 SHRNDGRNIAVMLLLDLSESLNEKAAGSDQTILELSQEAVSLLGWAVEKLGDP-----FA 610 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLS------KFGVSTNSTPGLKYAYNQIFDMQGM 276 + F + + ++ + + ST +++A Sbjct: 611 IAGFHSNTRHDVRFLHIKGYSEKWDDQVKGRLAAMEASYSTRMGAAMRHA---------- 660 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 H + KK+++ +TDG+ L + Sbjct: 661 -AHYLEKQQADKKLMLILTDGQPSDVDSKDGELLIADT 697 >gi|189193017|ref|XP_001932847.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187978411|gb|EDU45037.1| von Willebrand domain containing protein [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 906 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 59/174 (33%), Gaps = 21/174 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V D S SM+ T ID+A +++ L+ + + ++ Sbjct: 295 IVFVCDRSGSMQ-------TAIDLAKQALQVFLKSL----PIGVKFNICSFGNTHSFLWP 343 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + L I +++ + T L+ + R D I+ Sbjct: 344 KSVTYSQETLDLAINHVNSMTANYGGTEMLQ----PLQATIENRYKDMALD------IML 393 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDGE ++ L K V+ +G+ + SH + A + + Sbjct: 394 LTDGEIWRQQQLFSYLNQSVLESKDPIRVFTLGVGMGVSHALIEGIAKAGNGFS 447 >gi|183983524|ref|YP_001851815.1| hypothetical protein MMAR_3543 [Mycobacterium marinum M] gi|183176850|gb|ACC41960.1| conserved membrane protein [Mycobacterium marinum M] Length = 983 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 79/195 (40%), Gaps = 35/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD SRSM + K+ A ++ + +++ + + ++TF + IE Sbjct: 301 LVLVLDRSRSMAGW------KMTAARRAASRIVDALTSDD------RFAVLTFDDGIEYP 348 Query: 234 FLLEWGVSHLQ--------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L G++ + + G + P L+ A + G Q +T+DA Sbjct: 349 VGLPAGLTEASDRHRYRAVEHLARVEARGDTEMLAP-LRRA----LALLGREQVADTDDA 403 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 +++ ++DG+ EDQ + + ++ IG+ + FLR A Sbjct: 404 ----VLILISDGQ--VGNEDQLLQELSGDLGR--VRLHTIGVDEAVNAGFLRRLAGVGGG 455 Query: 344 SFYLVENPHSMYDAF 358 LV+N + +A Sbjct: 456 RCVLVDNEDRLDEAL 470 >gi|118618215|ref|YP_906547.1| hypothetical protein MUL_2774 [Mycobacterium ulcerans Agy99] gi|118570325|gb|ABL05076.1| conserved membrane protein [Mycobacterium ulcerans Agy99] Length = 981 Score = 39.4 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 39/195 (20%), Positives = 79/195 (40%), Gaps = 35/195 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD SRSM + K+ A ++ + +++ + + ++TF + IE Sbjct: 299 LVLVLDRSRSMAGW------KMTAARRAASRIVDALTSDD------RFAVLTFDDGIEYP 346 Query: 234 FLLEWGVSHLQ--------RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L G++ + + G + P L+ A + G Q +T+DA Sbjct: 347 VGLPAGLTEASDRHRYRAVEHLARVEARGDTEMLAP-LRRA----LALLGREQVADTDDA 401 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 +++ ++DG+ EDQ + + ++ IG+ + FLR A Sbjct: 402 ----VLILISDGQ--VGNEDQLLQELSGDLGR--VRLHTIGVDEAVNAGFLRRLAGVGGG 453 Query: 344 SFYLVENPHSMYDAF 358 LV+N + +A Sbjct: 454 RCVLVDNEDRLDEAL 468 >gi|326428371|gb|EGD73941.1| hypothetical protein PTSG_05636 [Salpingoeca sp. ATCC 50818] Length = 1446 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 51/145 (35%), Gaps = 13/145 (8%) Query: 170 ARLDMMIVLDVSRSME-SFFDSSITKIDMAIKSI-------NAMLEEVKLIPDVNNVVQS 221 D++IVLD SRSM ++ ++ + S+ + N V + Sbjct: 484 GPKDIVIVLDCSRSMRGDKWNDAVAMTKFLVNSLSRDDRYNVVCFSSSHKDYNDNFVYRR 543 Query: 222 GLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN----QIFDMQGMR 277 V K E ++ + G T+ G++ + + + Sbjct: 544 TEVLSCRKHELLRGTSSNKEDTFTRLDGYTPAG-GTDPLTGIQVGFRLLRGECNGLDTDC 602 Query: 278 QHCNTEDANYKKIIVFMTDGENLST 302 + + ++++VF++DG++ Sbjct: 603 PMRDPPRTDCQRLMVFLSDGKDRDN 627 >gi|294506858|ref|YP_003570916.1| hypothetical protein SRM_01043 [Salinibacter ruber M8] gi|294343186|emb|CBH23964.1| conserved hypothetical protein [Salinibacter ruber M8] Length = 288 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 67/213 (31%), Gaps = 45/213 (21%) Query: 163 KVNSQTDARLDMMIVLDVSRSM-------ESFFDSSITKIDMAIKSINAMLEEV----KL 211 ++ +LD S SM E D++ ++ + ++N +L+ + Sbjct: 4 SAEISRQQPTAILFLLDQSASMQEPFGGAEKTGDAAPSRARVLADTVNDLLQNLVLRCAK 63 Query: 212 IPDVNNVVQSGLVTFSNKIE----------------EFFLLEWGVSHLQRKIKYLSKFGV 255 V + G++ + ++ L +++++K + Sbjct: 64 EEGVRDYFHVGVIGYGQRVRRLVQPTEEYDPGTGLVPISHLADRPQRMEQRVKEVDDGDG 123 Query: 256 STNST---------------PGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 T T + A + Q +R + ++ I++ +TDGE Sbjct: 124 GTTETRVKTPIWFDPQAKNGTPMCQALD--LAAQSVRDWIDQHPHSFPPIVLNVTDGEAT 181 Query: 301 STKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRS 332 + + + A G +++ I + Sbjct: 182 DGDPLRYAQELRSFATDDGEVLLFNIHLSASEE 214 >gi|168036243|ref|XP_001770617.1| predicted protein [Physcomitrella patens subsp. patens] gi|162678138|gb|EDQ64600.1| predicted protein [Physcomitrella patens subsp. patens] Length = 634 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 67/204 (32%), Gaps = 42/204 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEE 232 ++ +LD S SM I+ A +++ L+ +K N V L FS+++E Sbjct: 315 VVFLLDRSGSMYG------KPIEDARQALFFALDSLKPEDSFNIVAFDHELTLFSSQMER 368 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 G + ++ T+ L+ A+N + + + Sbjct: 369 ATPNAIGWAREWAMTNCTAR--GGTDILGPLQQAFNFLENFPWAVP-----------YVF 415 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR--ACASPNSFYLVEN 350 +TDG C + R + A+G+R R F C +FY Sbjct: 416 LITDG------AVSDEQNICLAMQSR---IAALGVRSPRISTFGIGFYC----NFYF--- 459 Query: 351 PHSMYDAFSHIGKDIVTKRIWYDK 374 + IG+ + DK Sbjct: 460 ----LKMLAVIGRGMSDVAFTSDK 479 >gi|158316708|ref|YP_001509216.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112113|gb|ABW14310.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 431 Score = 39.4 bits (90), Expect = 0.98, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 72/213 (33%), Gaps = 34/213 (15%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL--VTFSNKIEE 232 +I+LD S SM D +KI A ++ A ++ + V +G + + + E Sbjct: 54 VILLDCSGSM----DYPHSKIIEARRAAQAAIDTLHDGVAFAVVAGTGQAEMVYPTRQEL 109 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + +K L G T L A + + H + Sbjct: 110 VEASPRTREAAKAAVKRLQPHG-GTAMGRWLLLARDLMATRPDAIHHA-----------I 157 Query: 293 FMTDGEN-LSTKEDQQSLYYCN-----EAKKRG-----AIVYAIGIRVIRSHEFLRACAS 341 +TDG+N S +L C + + G A + + ++ LR A Sbjct: 158 LLTDGQNGESEAVFAAALAACEGRFQCDCRGVGADWKVAELRRVASTLLGGVALLREPAE 217 Query: 342 PNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 + E+ S+ + G D V R+W K Sbjct: 218 -----MAEDFRSLIERAQARGIDRVGLRVWTPK 245 >gi|269126092|ref|YP_003299462.1| type II secretion system protein [Thermomonospora curvata DSM 43183] gi|268311050|gb|ACY97424.1| type II secretion system protein [Thermomonospora curvata DSM 43183] Length = 660 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 51/171 (29%), Gaps = 27/171 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +M+VLD S SM + + + L+P V +G T Sbjct: 95 VMLVLDTSGSM---------SAADLAAAKAGAVAFLDLLPAD---VPAGFTTTGTPTRPV 142 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L++ + + G T + A +++ +V Sbjct: 143 IDPTTDRRSLRKALGRMRT-GGETALYDAMSAAVDRLARANAAEGR-----------LVV 190 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPN 343 ++DG++ ++ K+ + + S LR A+ + Sbjct: 191 LSDGKDSASTSTLA--QVLARLKRTRIAADVVAFKTAATSEGTLRQLAADS 239 >gi|15603693|ref|NP_246767.1| hypothetical protein PM1828 [Pasteurella multocida subsp. multocida str. Pm70] gi|12722252|gb|AAK03912.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 486 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 78/248 (31%), Gaps = 37/248 (14%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM-------------------------- 184 + P ++++ M+IV D SRSM Sbjct: 248 TEKTPAPPEKVCSISTKPGENPQMVIVFDNSRSMLYSLLESKQSLADFEYRWVRYLTSAE 307 Query: 185 -ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL 243 + S ++++A +S +++ + D+ V G T +N L Sbjct: 308 ETAHIRRSPNRLEVAKQSSASIINSIAKTVDIGLVELKGCPTANNHGYYSPAKR---KQL 364 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + KI + S ++ G Y+ + M + I+ ++DGE+ Sbjct: 365 RAKINAMQPAPDSDENSGGTPL-YSGLKQAAAMVDGKKRDG-----FILLISDGEDSCET 418 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 D +L ++ + + I ++ + A+ + + + + + K Sbjct: 419 GDVCALAQQIAKRQPRLKINVVDIGGAKAANCVAK-ATKGKVFTANSQKQLVNMINQAVK 477 Query: 364 DIVTKRIW 371 I + Sbjct: 478 PITEVEVC 485 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 79/228 (34%), Gaps = 40/228 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + +++ IVLD S SM T +++A +SI +L E+ N V G+ Sbjct: 9 KIEENMNVNVEIVLDASGSMVKKI-GDKTMMEIAKESIKKVLSEM----PANAKV--GIR 61 Query: 225 TFSNKIEEFFLLEW---------------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 F +K + + V +++ ++ + G T+ ++Y Sbjct: 62 VFGHKGDNTASKKDESCGANELIYPIGDLNVEGIEKALEPIQPTGW-TSIAKSIEYGVED 120 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--KRGAIVYAIGI 327 + + G + I+ +TDG + + K ++ IG Sbjct: 121 LKALDGEKTLN---------ILYIITDG---IETCGGNPVEIAKQLKGENTNIVLGIIGF 168 Query: 328 RV-IRSHEFLR--ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWY 372 V + L+ A A+ + V + + + I + + W Sbjct: 169 NVDANQNRLLKQIADAAGGYYSSVNDANKLTGELYRINELAFSDYKWE 216 >gi|119385724|ref|YP_916779.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376319|gb|ABL71083.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 335 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 39/220 (17%), Positives = 78/220 (35%), Gaps = 49/220 (22%) Query: 174 MMIVLDVSRSMESFF------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +++++D S SM+ F +K A + +L+ + PD + G+ FS Sbjct: 81 LVLLIDRSSSMDDTFAGRSPQGGDESKAAAARR---ILLDFIARRPDD----RIGIAAFS 133 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + + L++ G+S T+ GL A + Sbjct: 134 TAPMLVLPMTESRTAIAAAVAALAEPGLSQTDVGRGLTLAMGMAHEASASDS-------- 185 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY----------AIGIRVIRSHE-- 334 + +V ++DG + E Q +L N A +R +Y I Sbjct: 186 --RAVVLVSDGAAVIAPEVQTALR--NLAARRQVNIYWLYLRTKGAKGIFEVPEPGQADT 241 Query: 335 -----------FLRACASPNSFYLVENPHSMYDAFSHIGK 363 FL++ P + ++P ++ DA + IG+ Sbjct: 242 PHLRPERHLHIFLQSLGLPYHAFEADSPGAVEDAVTEIGR 281 >gi|301648002|ref|ZP_07247775.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|301073903|gb|EFK88709.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] Length = 219 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 62/179 (34%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADSLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A N + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAHG-DTPMGAAITKALNMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADMKTLAQ 171 >gi|298502261|ref|YP_003724201.1| cell wall surface anchor family protein [Streptococcus pneumoniae TCH8431/19A] gi|298237856|gb|ADI68987.1| cell wall surface anchor family protein [Streptococcus pneumoniae TCH8431/19A] Length = 893 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 201 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 260 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 261 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 316 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 317 KDYSYLKLTNDKND 330 >gi|153872374|ref|ZP_02001287.1| hypothetical protein BGP_0163 [Beggiatoa sp. PS] gi|152071159|gb|EDN68712.1| hypothetical protein BGP_0163 [Beggiatoa sp. PS] Length = 656 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 78/244 (31%), Gaps = 42/244 (17%) Query: 85 RIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTF 144 R+++ + EL +G N + + SLD N G + ++ Sbjct: 429 RLESVQSEQASCELETSGKSNTVTCQIVDLSLD------NPGDISQVTIDMAVAVEDAGL 482 Query: 145 IPWYTNSRHIVMPITSSVKVNSQ-----TDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 + ++ + S D ++D+ +V+DV+ SM+ + I Sbjct: 483 LLLIHEAKVMSNEFPSHTDRERTAIFIPEDIQVDLALVIDVTGSMQEEMNG-------II 535 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 K++ ++ E+ L+TF ++++ ++ L+ T S Sbjct: 536 KALKDVINEIDASTAP----LMALLTFGDEVKVAA----FTQDMEVLRGALADL---TAS 584 Query: 260 TPGL--KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 GL + A + H I+F TD S D + Sbjct: 585 GGGLCEEAAVEALLV---AIPHTKAGGE-----ILFATD---ASPYADADVEKVIELLRG 633 Query: 318 RGAI 321 +G Sbjct: 634 KGIR 637 >gi|86747842|ref|YP_484338.1| von Willebrand factor, type A [Rhodopseudomonas palustris HaA2] gi|86570870|gb|ABD05427.1| von Willebrand factor, type A [Rhodopseudomonas palustris HaA2] Length = 372 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 63/179 (35%), Gaps = 25/179 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + ++ + +++ VLD + SM + + KI +I + PD Sbjct: 14 ALLAPLALPSTASARPTVEVAFVLDTTGSMSGLIEGAKRKIWSIATAI------LDSNPD 67 Query: 215 VNNVVQSGLVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAY 267 + ++ GLV + + + L + L ++ L G + L A Sbjct: 68 AD--IRMGLVMYRDIGDDYVTRRVELTSDIQDLYARLLELQARGGGDWPESVNEALDVAV 125 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAI 325 N++ + + +I+ + D + +D + A+++ IV A+ Sbjct: 126 NKLHWTKEG---------DICRIVFLVGDAPPHMDYAQDTKYPTTLAVARQKDIIVNAV 175 >gi|54302734|ref|YP_132727.1| dinitrification protein NorD [Photobacterium profundum SS9] gi|46916158|emb|CAG22927.1| hypothetical dinitrification protein NorD [Photobacterium profundum SS9] Length = 613 Score = 39.4 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 74/213 (34%), Gaps = 23/213 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ D+S S +++ ++ ID+ S+ E + + D V G + Sbjct: 415 NCQRDISTLLLSDLSMSTDAYINNEYRVIDVIKDSMLLFSEALAAVGDPFAVY--GFSSV 472 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F LL+ I+ + G T ++ A N + + R Sbjct: 473 KRHHVRFTLLKNFAESYNDHIRGRILSLRPGFYTRMGAAIRQASNILAEQNQHR------ 526 Query: 284 DANYKKIIVFMTDGENLST------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K+++ +TDG+ + + A++ G + + I I ++ ++L Sbjct: 527 -----KLLLILTDGKPNDIDNYDGRHGIEDTRQAIIAARRLGLVPFCITID-QKADQYLP 580 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 N F ++ +P + + + Sbjct: 581 YIFGSNGFTVIFDPSQLPTKLPQLYHQLTQSSC 613 >gi|293373675|ref|ZP_06620022.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|292631330|gb|EFF49961.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] Length = 250 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 61/207 (29%), Gaps = 16/207 (7%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + S ++ + V+D S SM KI +I +L + I D N Sbjct: 2 GLLDNESIPRRKMILFFVIDTSGSM------IGKKIGSVNDAIENVLPMIGEISDENPDA 55 Query: 220 QSGLVTFSNKIEEFFLLEW---GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 + + +E W + I + T+ + N+ G Sbjct: 56 EINVAA----LEFSTGTRWLYDEPKEAKDFIWQKVEADGLTSLGEACEE-LNKKLSRSGG 110 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 ++ + I+ ++DG E N K + AI I E L Sbjct: 111 FMPTSSGSGYFSPAIILLSDGGPTDNFEGGLKTLQGNSWFKHAIKI-AIAIGDDADKEVL 169 Query: 337 R-ACASPNSFYLVENPHSMYDAFSHIG 362 + S + V N ++ I Sbjct: 170 KQFTGSSEAVITVHNIEALKKMIRIIA 196 >gi|282882696|ref|ZP_06291306.1| putative collagen adhesion protein [Peptoniphilus lacrimalis 315-B] gi|281297509|gb|EFA89995.1| putative collagen adhesion protein [Peptoniphilus lacrimalis 315-B] Length = 1309 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 31/245 (12%), Positives = 71/245 (28%), Gaps = 37/245 (15%) Query: 93 SFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR 152 F+ + + +I +ST ++ + Y I+ P K P ++ Sbjct: 775 DFKYSISEVKVSGYTTNIEKSTDINPENDLNKDIYHITNTKEGYNPGKPEPLNPKVKVNK 834 Query: 153 HIVMPITSSVKVNSQ---------------------------TDARLDMMIVLDVSRSME 185 I ++ +D++ VLD S SM Sbjct: 835 QIDYLGDGVENKDTNVQRDQRYKNHLEDIYRLYLDVEGTRVKKQEAVDLLFVLDGSSSMG 894 Query: 186 SFFDSSITKIDMAIKSINAMLEEVKLIPD---VNNVVQSGLVTFSNKI-EEFFLLEWGVS 241 + S + ++ M+ L+ + N+ + + F E E G + Sbjct: 895 TKDMSWNGEAISRKDAMIGMINNTDLVENFLKQNDQNRVAFLYFEGYAGNENTKREDGYT 954 Query: 242 H------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 + ++ + + Y M + E +++++F++ Sbjct: 955 YHEDAQIIKNWSHSFDRNIDFDFKARYMGTNYQAGLMMAEELLRKSEEQKGRRQVMIFIS 1014 Query: 296 DGENL 300 DG Sbjct: 1015 DGVPT 1019 >gi|225860485|ref|YP_002741994.1| cell wall surface anchor family protein [Streptococcus pneumoniae Taiwan19F-14] gi|298229065|ref|ZP_06962746.1| cell wall surface anchor family protein [Streptococcus pneumoniae str. Canada MDR_19F] gi|298254848|ref|ZP_06978434.1| cell wall surface anchor family protein [Streptococcus pneumoniae str. Canada MDR_19A] gi|154432898|gb|ABS82085.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|225726608|gb|ACO22459.1| cell wall surface anchor family protein [Streptococcus pneumoniae Taiwan19F-14] Length = 886 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|194367003|ref|YP_002029613.1| TPR repeat-containing protein [Stenotrophomonas maltophilia R551-3] gi|194349807|gb|ACF52930.1| TPR repeat-containing protein [Stenotrophomonas maltophilia R551-3] Length = 612 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 58/167 (34%), Gaps = 28/167 (16%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 A +++VLD+S S + D +++ + +L Q GLV +++ Sbjct: 97 QASAPLLVVLDLS-SRITATDLPPSRLLQVRAKVGELLR-------ARQGGQVGLVVYAD 148 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 L S++ + LS + G+ +A + + +R Sbjct: 149 DAYTVAPLTDDGSNVALYLDALSPEVMPRDGQRADRGIDWATRLMRQIGALRGQ------ 202 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 I+ +TD + D ++ +A+ G V +G+ Sbjct: 203 -----ILLVTD------QADGEAGLAAAQARSLGLQVSVLGLGTPAG 238 >gi|293365071|ref|ZP_06611788.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] gi|307702220|ref|ZP_07639180.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] gi|291316521|gb|EFE56957.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] gi|307624233|gb|EFO03210.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] Length = 459 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 38/287 (13%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 52 SMIDRSLVHAATQIMNEGN--GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDD 109 ++ A +I + N+ K G+++ K + + + +++ Sbjct: 55 EANKSAVEAAVAEIKADEAKPVVNKSDAKPGELVDVSKKVSPIDVKERQEGDQKIHEETA 114 Query: 110 IV--RSTSLDIVVVPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 V T + V + + S +S T + + + + Sbjct: 115 TVEVTKTEVAPVKSEELPAPKTETTESTVSGKDEDGNSYTQYERVDKTTTVEIKKATRTI 174 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK---LIPDVNNVVQ 220 S + D++ V+D S SM D+ I+ +++ + + Sbjct: 175 SKSNSA---DIVFVVDRSASMSRTIDTVRNNINEFARNLAKDGVAARFGLATFSDEVYGR 231 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS----KFGVS--TNSTPGLKYAYNQIFDMQ 274 G + I F + S K L+ G STP L + + Sbjct: 232 IGGLADEGTILTKFNESYFTSDPAELEKALAGIKLAHGGDFPETSTPTLTQIVSTYDWSK 291 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 + KK +V +TD K Sbjct: 292 SPKN---------KKFVVLLTDAPMKEDPSIPTMANTLASLKAAKIE 329 >gi|253996475|ref|YP_003048539.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253983154|gb|ACT48012.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 321 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 32/225 (14%), Positives = 71/225 (31%), Gaps = 27/225 (12%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQT------DAR 171 I+ Q + YS + + + T S I+ + + ++ Sbjct: 20 ILKSHQGQFYSWIEMVPADKHSEIANLLLKATISLLIICIVLALASPQGESKQEQKIGKG 79 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMA----IKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 ++V+D S SM+ F T A + + + + + PD G+V F+ Sbjct: 80 AQTVLVIDRSVSMDHPFAGQATGGRAAEIKSMAARRLITDFIDSRPDD----MMGVVGFT 135 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 N + + IK + ++ TN G+ ++Q Sbjct: 136 NSALYGMKITANRDAIHAAIKAATGPALNQTNIGAGITQGVTLFDNIQSSGS-------- 187 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + ++ ++DG + + + +Y I +R Sbjct: 188 --RAVILLSDG--AGKLSPRVKYKISQQLTGKKINLYWIVLREPD 228 >gi|168488150|ref|ZP_02712349.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP195] gi|154432978|gb|ABS82155.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|183572925|gb|EDT93453.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP195] gi|332075307|gb|EGI85777.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17570] Length = 886 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|148993547|ref|ZP_01823038.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP9-BS68] gi|147927916|gb|EDK78937.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP9-BS68] Length = 886 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYERKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|221115491|ref|XP_002157815.1| PREDICTED: similar to nematocyst outer wall antigen [Hydra magnipapillata] Length = 1356 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 76/214 (35%), Gaps = 21/214 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 P + + K ++ A+LD+++++D S S+E+ + I +++ ++ D Sbjct: 623 PTPTSPTGKTSAICTAKLDLVLLIDNSGSIEAR--GREGYFEQIKMFIKNLMKLFRVGTD 680 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 N V + +I F + I L+ + L+ A +F Sbjct: 681 AN--VAIIEASSDARIIAKFGDVKDKVVFDQYINNLNYKKERSFIGKALELARTDVFSRI 738 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-- 332 +Q K+++ ++DG +D E + G I+ IG Sbjct: 739 LCKQSA--------KLMMVISDG---YASDDVTLSGV--ELRDLGVIMMGIGYGPETQLM 785 Query: 333 HEFLRACASP--NSFYLVENPHSMYDAFSHIGKD 364 L + +S + + V + ++ I K Sbjct: 786 RYTLESLSSDPKDQYTFVFGYDRLIESVQLIVKK 819 >gi|119486494|ref|ZP_01620552.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119456396|gb|EAW37527.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 220 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 65/166 (39%), Gaps = 19/166 (11%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ ID +N ++ V+ +++F N+++ Sbjct: 20 VLLLDTSASMQGVP------IDALNDGLNTFRHDLARDELAKKRVEVAIISFDNQVK--- 70 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR--QHCNTEDANYKKIIV 292 +++ V+ Q + L+ G + + A NQ D+ R ++ N A Y+ + Sbjct: 71 IVQDFVTADQFEPPLLTAQGQT-----FMGTAINQALDLVAARKAEYRNNGIAYYRPWVF 125 Query: 293 FMTDGENLSTK---EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 +TDGE +Q + E + +A+G+ Sbjct: 126 LITDGEPQGESERMAEQAAERIKQEEGNKQVAFFAVGVEGANMERL 171 >gi|58262250|ref|XP_568535.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|57230709|gb|AAW47018.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21] Length = 430 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 58/181 (32%), Gaps = 35/181 (19%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN--NVVQ 220 S +D++ +LD + SM+ + +S I + + ++ +N + ++ Sbjct: 46 GSGSSHGKCIDLVFILDCTGSMQKYINS-------VRDHIIGICDMIRGEEGLNGPDDLR 98 Query: 221 SGLVTF-------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 +V + S + +F + +Q +K L+ G + + A Sbjct: 99 VAVVNYRDHPPQDSTYVYKFHPFTSDIPEVQNYLKGLTASGGG-DGPEAVTAAMAATLTE 157 Query: 274 QGMRQHCNTEDANYKKIIVFMTD------GEN-----LSTKEDQQSLYYCNEAKKRGAIV 322 R+ ++ V + D GE + L + G + Sbjct: 158 LEWRREAA-------RMAVLVADAPPHGIGEGGDQFKQGDPDGHDPLVVARMMAQNGITM 210 Query: 323 Y 323 + Sbjct: 211 F 211 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 38/313 (12%), Positives = 94/313 (30%), Gaps = 80/313 (25%) Query: 28 PIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK 87 + L +G + + ++ + S +D +L+ A K +K Sbjct: 1 MPMLLAVGASFDYIRAYNVRQSMQSDLDAALIAAV-------------KNVDAGDTDALK 47 Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPW 147 + F + + + DI+ + + + Sbjct: 48 QKVSDWFHAQTESSYSLGDIEIDTTNHRITATASGTVPTTLMK----------------- 90 Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM----------------------- 184 N + + + S+VK + + L++ IV+D S SM Sbjct: 91 LANINTVPVSVASAVKGPASS--YLNVYIVIDKSPSMLLAATTAGQQAMYNGIGCQFACH 148 Query: 185 --ESFFDSSIT---------------KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +S + T + D+A+ +++ +++ + + ++ GL + Sbjct: 149 TGDSHTIGTATYSNNYAYSTEKKIKLRADVAVDAVHEVIDMISASDTNHERIKVGLYSLG 208 Query: 228 NKIEEFFLLEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + I E + +++ + T L N++ Sbjct: 209 DTITEVLAPTLDTTAAGKRVDSDLTSATSTTYTYFDVSLAALKNKVGTGGDGSS-----S 263 Query: 285 ANYKKIIVFMTDG 297 A K+++ +TDG Sbjct: 264 ATPLKLVLLLTDG 276 >gi|218131125|ref|ZP_03459929.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|317476997|ref|ZP_07936239.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986645|gb|EEC52979.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|316906790|gb|EFV28502.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 342 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 60/175 (34%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D +++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLQKAKRLVAQLVDKMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ + T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + I+ +TDGEN + + + V +G+ + Sbjct: 180 SRSFTPQEGVGRAIIVITDGENHEGGAVEAAKAAAEKG----IQVNVLGVGMPEG 230 >gi|170724217|ref|YP_001751905.1| Na-Ca exchanger/integrin-beta4 [Pseudomonas putida W619] gi|169762220|gb|ACA75536.1| Na-Ca exchanger/integrin-beta4 [Pseudomonas putida W619] Length = 5962 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 27/214 (12%), Positives = 65/214 (30%), Gaps = 18/214 (8%) Query: 99 RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI 158 F + + + ++ + + T + + + Sbjct: 5375 YSGQFDVKVTSTATESVGGSTATSEG-SIKVTVYPQSYTTSNLSSDSDSITGTDGNDIVV 5433 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN- 217 ++ ++ ++D S SM S + +D A KS+ ++ + + Sbjct: 5434 ADVSGLHVVPGQDYNIAFIVDTSGSMGS------SGVDAAKKSLESVFKTLAASVKGAQS 5487 Query: 218 -VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQ 274 V LV F+ +++ + + L+ + L TN K N + Sbjct: 5488 GTVNILLVDFATQVKSSVSVTLNDAGLKTLLSALGTLNSNGGTNYEDAFKTTAN--WFAN 5545 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 + + F+TDGE + +Q+ Sbjct: 5546 LKAAGSTGSNQTF-----FITDGEPTYYQTSEQA 5574 >gi|91216723|ref|ZP_01253688.1| hypothetical protein P700755_08674 [Psychroflexus torquis ATCC 700755] gi|91185192|gb|EAS71570.1| hypothetical protein P700755_08674 [Psychroflexus torquis ATCC 700755] Length = 287 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 36/159 (22%), Positives = 66/159 (41%), Gaps = 24/159 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++D+S S ++ + K I + N + GL+ F Sbjct: 72 EEERELTMMLLVDISES------TAFGTRETFKKDIITEMSATLAFSATQNNDKIGLMLF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++IE + + G H+ R I+ L +F S +T L A+ + DM Sbjct: 126 SDQIELYIPPKKGRMHVLRIIRELIEFKPSHKTTD-LTKAFKYLIDMMK----------- 173 Query: 287 YKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF M+D + + + +L + G ++ Sbjct: 174 -KKAIVFVMSD---FLSDDYEHTLKILGKKHDVTGIRIF 208 >gi|118347184|ref|XP_001007069.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288836|gb|EAR86824.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 821 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 69/200 (34%), Gaps = 35/200 (17%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 N++ ++ ++D S SM + K+ + ++E ++ +PD + ++ Sbjct: 330 NAEKHSKAQFFFLIDRSGSM----------CTIFQKARDTLIEFLQRLPDDSYF---NVI 376 Query: 225 TFSNKIEEFFLLEWGVSHLQ-----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +F + + F + +I S T L+ + Sbjct: 377 SFGSGYQFLFEEAKKKNKQSMKSALEQISKFSADMGGTEIYQPLEKIFQ----------- 425 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC 339 C + Y+ I +TDG+ ++ D N A V+ IG+ + LR C Sbjct: 426 CKNVNDLYQMQIFLLTDGQ--VSQPDMVVQLIRNN--SHKARVHCIGLGSGVDKQLLRRC 481 Query: 340 ASPNSFYL--VENPHSMYDA 357 + V+N + + Sbjct: 482 SESGRGANRQVDNASELKEV 501 >gi|321471060|gb|EFX82034.1| hypothetical protein DAPPUDRAFT_317041 [Daphnia pulex] Length = 1102 Score = 39.4 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 36/216 (16%), Positives = 72/216 (33%), Gaps = 28/216 (12%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + DM+I++D S SM + ++A + +L+ + +N FS Sbjct: 179 AASPKDMIILVDTSGSMTG------VRKEIAKHVVLTLLDTLSENDFINIY------KFS 226 Query: 228 NKIEEFFL------LEWGVSHLQRKIKYLSKFGVS--TNSTPGLKYAYNQIFDMQGMRQH 279 ++ + +L+ + + S N T A+ + Q Sbjct: 227 EVPVPVVPCFKDKVVQANLENLRELRNGMIETETSDIANFTSAFTTAFEILQKYNRTMQG 286 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL-RA 338 C A I+ +TDG + KE + + + Y IG +I + + A Sbjct: 287 CQCNQA-----IMLVTDGAPENYKEHLEKYNLPH--RPVRIFTYVIGREIIDTSATINMA 339 Query: 339 CASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 C + F V + + + + + Y K Sbjct: 340 CENKGYFARVTSLAEVREQVLKYIPVLSRPMVMYQK 375 >gi|325288987|ref|YP_004265168.1| hypothetical protein Sgly_0814 [Syntrophobotulus glycolicus DSM 8271] gi|324964388|gb|ADY55167.1| hypothetical protein Sgly_0814 [Syntrophobotulus glycolicus DSM 8271] Length = 629 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 28/169 (16%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +++++D S SME ++ + A+ S + E +K + +V+ + K Sbjct: 437 PDIAVLLLIDGSGSMEG------SRRESAMVSSVILHEVLKKQGINHAIVEHRAIYNEPK 490 Query: 230 IEEFFLLE-WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +E L++ G + I L+ T L +A I Q Sbjct: 491 VEHNILIDFDGNDEEKYNILALNA-DEGTREGLSLFWAERYITSKTTAEQ---------- 539 Query: 289 KIIVFMTDGENLSTKEDQ----------QSLYYCNEAKKRGAIVYAIGI 327 ++I+ ++DG + D + + KRG + A+ + Sbjct: 540 RLIIVLSDGVPAHSVNDDSCYVPPVSIKDTANTAVKIIKRGTDIIAVAL 588 >gi|308235232|ref|ZP_07665969.1| von Willebrand factor type A domain protein [Gardnerella vaginalis ATCC 14018] Length = 396 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 47/125 (37%), Gaps = 18/125 (14%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 I+ S V ++ + D+++ LDVS S + I+ +K + E Sbjct: 117 TTILSSRPSRVLNANEQSSSRDIVLCLDVSGS---TLPYDLEVINAYLKFVEHFQGE--- 170 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL----KYAY 267 + GL F++ F L S ++++++Y SK + + + Y Sbjct: 171 --------RIGLSIFNSTSRTVFPLTDDYSLVKKQLQYASKLLDGVQTQDKIDNMQQKQY 222 Query: 268 NQIFD 272 QI Sbjct: 223 QQIST 227 >gi|308176071|ref|YP_003915477.1| hypothetical protein AARI_02710 [Arthrobacter arilaitensis Re117] gi|307743534|emb|CBT74506.1| hypothetical membrane protein [Arthrobacter arilaitensis Re117] Length = 1066 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 36/253 (14%), Positives = 80/253 (31%), Gaps = 50/253 (19%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI--------PWYTNSRHIV 155 N + + Q ++++ S +P + W + ++ Sbjct: 256 YNVLSKFNTGANGSTESTYQYRTKTLTSNSTISMPADHDGALRTADASSGTWANSIKNPT 315 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI--TKIDMAIKSINAMLEEVKLIP 213 +P+T + + + +V D+S S+++ + + +++ V L Sbjct: 316 LPVTCELGIK--------VALVFDLSGSVKNAGAADTLGNAGKGFVDALSGTNSSVALFS 367 Query: 214 DVNNVVQSGLVTFS--NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 N ++G F I+ + +Q I S G ++N T Sbjct: 368 FGNTSPRAGTPNFPELRNIDSGTNSTRIKNDIQDYINSFSGEGYTSNGT----------- 416 Query: 272 DMQGMRQHCNTEDANYKKI--IVFMTDGEN--------------LSTKEDQQSLYYCNEA 315 N K+ +V +TDG +E + +++ N Sbjct: 417 ---NWDAGLWATAQNAKQYDLVVVLTDGNPTFSGIESTVGPGTSTYFRELEAAVFSANAV 473 Query: 316 KKRGAIVYAIGIR 328 K +GA + +GI Sbjct: 474 KAQGARIINVGIG 486 >gi|290976446|ref|XP_002670951.1| predicted protein [Naegleria gruberi] gi|284084515|gb|EFC38207.1| predicted protein [Naegleria gruberi] Length = 1082 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 39/254 (15%), Positives = 83/254 (32%), Gaps = 31/254 (12%) Query: 95 RNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY---SISAISRYKIPLKFCTFIPWYTNS 151 R L + + ++ ++ +D V+ P N + S ++I ++ + + Sbjct: 52 RGNLDEFDVESAPKNVNLASLIDQVLNPNNSNQKVEQVEEKSLFRIEVEHLIDLDDHDIG 111 Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVK 210 + + L + + DVS SM+ ++ T + + + Sbjct: 112 --VAELTIHVNDPSLNPKPTLFIALA-DVSGSMQGRPWEQVCTSLKHFAQ---------Q 159 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYAYN 268 + + + V + + +E + LQ I+ + G T+ + A Sbjct: 160 SFNNPAIICRM--VAYESSAKEIDM----KGTLQSIIRNIETAFTGGGTDFASAFQLACT 213 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR----GAIVYA 324 I G + I F+TDGE+ S L Y +E R ++ Sbjct: 214 IITRESGQDRENLPFGNVV---ITFLTDGEDFSKVGKPGGLQYLSEEINRVYRGDITIHT 270 Query: 325 IGIRVIRSHEFLRA 338 +G + E L Sbjct: 271 VGFGSHHNLELLDN 284 >gi|183222300|ref|YP_001840296.1| NorD protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912350|ref|YP_001963905.1| nitric oxide reductase activation protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777026|gb|ABZ95327.1| Nitric oxide reductase activation protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167780722|gb|ABZ99020.1| NorD protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 602 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 68/202 (33%), Gaps = 30/202 (14%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + + + ++D+S S +S + +D+ +S+ E + + + G + + Sbjct: 411 SDIALYFLMDLSLSTDS-WIHDKRVLDVERESLLLFSECLDELKIPFGIA--GFYSRTRN 467 Query: 230 IEEFFLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 +F L W ++ ++ LS G T P L++ N++ G +Q Sbjct: 468 HNQFIHLKQMNESW--MGVRDRLGPLSPIGY-TRVGPSLRHT-NELLKNCGYKQ------ 517 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK------RGAIVYAIGIRVIRSHEF--L 336 K I+ +TD + + + K G VY + I + Sbjct: 518 ----KWIILITDARPNDYDQYEGKYGIEDVNKAVGECLLNGVQVYTLAIGKEEKPTIPAM 573 Query: 337 RACASPNSFYLVENPHSMYDAF 358 AS + E F Sbjct: 574 MRNASYQMLFHPERLLDSLQEF 595 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 22/56 (39%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 G + ++ A+ + ++G I+ + +T L S +D ++ + Sbjct: 22 RLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNAL 77 >gi|332197210|gb|AEE35331.1| uncharacterized protein [Arabidopsis thaliana] Length = 756 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 68/197 (34%), Gaps = 43/197 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D+S SM+ ++ K++ L +++ N ++ F+++I E Sbjct: 327 VVFVIDISASMKW------KPLEDVKKALLECLAKLQAEDVFN------IIAFNDEILEF 374 Query: 233 FFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +E+ S + + TN LK A + + N Sbjct: 375 STSMEFATDETISAVTEWLDSNLIANGGTNMLLPLKQAMKLL-------EGSNIGVP--- 424 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR--------GAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG + C+ K+ + GI +H FL+ A Sbjct: 425 -LVYLVTDG------SVENEREICHAMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLA 477 Query: 341 SPNS-FYLVENPHSMYD 356 + +Y N ++ Sbjct: 478 RIGNGYYDGTNNTDSFE 494 >gi|289621292|emb|CBI52075.1| unnamed protein product [Sordaria macrospora] Length = 914 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 33/181 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V D S SM T+I+ ++ L+ + + N + +F ++ E Sbjct: 299 IVFVCDRSGSMGG------TRIEGLKSALRIFLKSIPVGAKFN------ICSFGSRHEFL 346 Query: 234 FL---LEWGVSHLQRKIKY---LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 F + LQR ++Y + T L+ A+ + + + Sbjct: 347 FPEGSRSYDQETLQRAMEYTNLMRADFGGTEMYRPLEAAFEKRY-------------KDM 393 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-AIVYAIGIRVIRSHEFLRACASPNSFY 346 + +TDGE + ++ ++ +G ++ +GI SH + A + + Sbjct: 394 DLEVFLLTDGEIWD-QGQLFTMTNEKVSESKGAIRLFTLGIGNDVSHALIEGVARAGNGF 452 Query: 347 L 347 Sbjct: 453 A 453 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 22/56 (39%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 G + ++ A+ + ++G I+ + +T L S +D ++ + Sbjct: 22 RLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNAL 77 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 8/56 (14%), Positives = 22/56 (39%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 G + ++ A+ + ++G I+ + +T L S +D ++ + Sbjct: 22 RLCRQNDGSVAVIFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNAL 77 >gi|145482457|ref|XP_001427251.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394331|emb|CAK59853.1| unnamed protein product [Paramecium tetraurelia] Length = 568 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 70/191 (36%), Gaps = 28/191 (14%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 +I I I + + + + +++ + + A +D++ V+DVS SME Sbjct: 87 TIEKIHINLIARQDVVTLSQFAQTLPVILQVQALKSQLKHAKANIDLVCVVDVSGSMEG- 145 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 KI + S+ + + + + LVTF ++KI Sbjct: 146 -----EKISLVKDSLRYIQKILSPND------RIALVTFGTYSGINLPWTINKPENKQKI 194 Query: 248 KYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 K K STN G+K G+R + N + +TDG++ + Sbjct: 195 KDAIIGMKIRDSTNIADGVKL---------GLRMIKERKQKNPVTCMFVLTDGQDDNKGA 245 Query: 305 DQQSLYYCNEA 315 D++ C +A Sbjct: 246 DER----CQQA 252 >gi|15218501|ref|NP_177394.1| inter-alpha-trypsin inhibitor heavy chain-related [Arabidopsis thaliana] gi|12325279|gb|AAG52586.1|AC016529_17 hypothetical protein; 14673-17893 [Arabidopsis thaliana] Length = 758 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 68/197 (34%), Gaps = 43/197 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D+S SM+ ++ K++ L +++ N ++ F+++I E Sbjct: 329 VVFVIDISASMKW------KPLEDVKKALLECLAKLQAEDVFN------IIAFNDEILEF 376 Query: 233 FFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +E+ S + + TN LK A + + N Sbjct: 377 STSMEFATDETISAVTEWLDSNLIANGGTNMLLPLKQAMKLL-------EGSNIGVP--- 426 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR--------GAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG + C+ K+ + GI +H FL+ A Sbjct: 427 -LVYLVTDG------SVENEREICHAMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLA 479 Query: 341 SPNS-FYLVENPHSMYD 356 + +Y N ++ Sbjct: 480 RIGNGYYDGTNNTDSFE 496 >gi|58616210|ref|YP_195339.1| hypothetical protein p1B95 [Azoarcus sp. EbN1] gi|56315671|emb|CAI10315.1| hypothetical protein p1B95 [Aromatoleum aromaticum EbN1] Length = 637 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 50/169 (29%), Gaps = 29/169 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + +++D S SM T + + S + + S Sbjct: 451 EGTNTAVALLVDYSGSM-------WTTMPLTDGSSRTTAGQTTAQEAAYETHFALASVLS 503 Query: 228 NKIEEF--FLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAYNQIFDMQGMRQH 279 F V L+ + LS T + ++ A +I + Sbjct: 504 TYEVPFLSIAFANHVCVLKEFDEPLSAHRWDDGSSGDTATGDAVEEALIRITAREEE--- 560 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 K+I+V +TDGE + + C A++ G + + I Sbjct: 561 --------KRILVVVTDGEPN---DHDSLMAGCRFARQNGIQIAVVFIG 598 >gi|313837215|gb|EFS74929.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314971986|gb|EFT16084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 322 Score = 39.4 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 63/204 (30%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ LD S SM++ D+S ++ A + + +V +V+ S E Sbjct: 96 IVVALDSSLSMKAD-DASPNRLAAAKAKAKDFVNSLPTGFNVA------VVSISAHPEIR 148 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + G+ L A ++ M + + IV Sbjct: 149 MPPSTDRPTVLRALD-----GIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPA--AIVM 201 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFLRAC 339 ++DG N + + AK VY I + + L Sbjct: 202 LSDGGNTQGGSPLVAATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDV 258 Query: 340 A--SPNSFYLVENPHSMYDAFSHI 361 A + + ++ + + + + Sbjct: 259 ADRTDAKSWTADSADKLQEVYKQV 282 >gi|329729355|gb|EGG65761.1| hypothetical protein SEVCU144_0289 [Staphylococcus epidermidis VCU144] Length = 629 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + ++++ S +++ + ++ A ++ Sbjct: 479 EDAFEADQRQQPNIIDEIIDYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|301606955|ref|XP_002933090.1| PREDICTED: integrin alpha-1-like [Xenopus (Silurana) tropicalis] Length = 1147 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 57/163 (34%), Gaps = 26/163 (15%) Query: 221 SGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFD-MQGM 276 G+V + + F L + +++ + + G S T + G+ A + F +G Sbjct: 175 VGIVQYGENVTHEFNLNTYDSTAKVRQASLGIKQRGGSQTMTALGIDTARKEAFTEARGA 234 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI--RVIRSH- 333 R+ +TDGE+ + C ++I I R + Sbjct: 235 RKGVKKVMVV-------VTDGESHDNYRLNSVIKECET---DDIQRFSIAILGSYNRGNL 284 Query: 334 ------EFLRACAS---PNSFYLVENPHSMYDAFSHIGKDIVT 367 E +++ AS F+ V + ++ + +G+ I Sbjct: 285 STETLVEEIKSIASEPKEKHFFNVSDELALLNIVESLGERIFA 327 >gi|218672134|ref|ZP_03521803.1| hypothetical protein RetlG_11055 [Rhizobium etli GR56] Length = 125 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 21/109 (19%), Positives = 37/109 (33%), Gaps = 30/109 (27%) Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYY-------------CNEAKKRGAIV---YAI 325 T + + +KI+ F++DG S K + C K RG + Y Sbjct: 4 TSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTT 63 Query: 326 GIRVIRSH--------------EFLRACASPNSFYLVENPHSMYDAFSH 360 + + + ++ACASP ++ V + DA Sbjct: 64 YLPLPSNSWYNTWIKPFQSEIPTKMQACASPGFYFEVSPTEGITDAMKA 112 >gi|160889564|ref|ZP_02070567.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270296688|ref|ZP_06202887.1| conserved hypothetical protein [Bacteroides sp. D20] gi|317480056|ref|ZP_07939168.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861081|gb|EDO54512.1| hypothetical protein BACUNI_01988 [Bacteroides uniformis ATCC 8492] gi|270272675|gb|EFA18538.1| conserved hypothetical protein [Bacteroides sp. D20] gi|316903798|gb|EFV25640.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 342 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 60/175 (34%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D +++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLQKAKRLVAQLVDKMQND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ + T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + ++ +TDGEN + + + V +G+ + Sbjct: 180 ARSFTPQEGVGRTVIVITDGENHEGGAVEAAKAAAEKG----IQVNVLGVGMPEG 230 >gi|150377243|ref|YP_001313838.1| hypothetical protein Smed_5128 [Sinorhizobium medicae WSM419] gi|150031790|gb|ABR63905.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 436 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 61/188 (32%), Gaps = 28/188 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI------------- 54 +R F G + ++ A+ P++ +G+ E + + K L Sbjct: 21 LRRFLKAEGGAVAVIAAVAFPVLVGAMGLGAETGYWYLEKRKLQHAADVSAYAAAVRHRA 80 Query: 55 --DRSLVHAATQIMNEGNGNNRKKLK----------GGDILCRIKNTWNMSFRNELRDNG 102 +S + AA + + G+G + L ++ + T F + Sbjct: 81 GDQQSALEAAARRVAGGSGFSPGDLTLSTASAAAGGSNNVTVELTETHPRLFSSVFGTGT 140 Query: 103 FV---NDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPIT 159 + + + ++ + + +++ ++ L C+ + + S +M Sbjct: 141 ITIKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVQLSGCSVVSNSSASDAFLMRNG 200 Query: 160 SSVKVNSQ 167 S++ Sbjct: 201 SALMSTDC 208 >gi|88801579|ref|ZP_01117107.1| hypothetical protein PI23P_02932 [Polaribacter irgensii 23-P] gi|88782237|gb|EAR13414.1| hypothetical protein PI23P_02932 [Polaribacter irgensii 23-P] Length = 289 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 60/158 (37%), Gaps = 22/158 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S + K + + N+ V GL+ F Sbjct: 72 EEERELTMMLLVDVSGSELFGTATQFKKDTVTEIAATLAF----SATQNNDKV--GLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S+ IE F + G SH+ R I+ L +F + T + A + + Sbjct: 126 SDDIELFIPPKKGKSHVLRIIRELIEFQPKSKKTD-IAAALKFLSSVLK----------- 173 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 K+ IVFM + +++L + G ++ Sbjct: 174 -KRAIVFML--SDFMDDAYEKTLKIAAKKHDLTGIRIF 208 >gi|322390210|ref|ZP_08063741.1| peptidoglycan binding domain protein [Streptococcus parasanguinis ATCC 903] gi|321143072|gb|EFX38519.1| peptidoglycan binding domain protein [Streptococcus parasanguinis ATCC 903] Length = 484 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 58/171 (33%), Gaps = 17/171 (9%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + V + A D++ V+D S SM+S ++ ++ +++++A N Sbjct: 170 EADVVNKKEVQATSDIVFVIDKSTSMDSHIRDTMKNVETFVRNLSA----------KNIQ 219 Query: 219 VQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GLV + S F S + I L + + A + + Sbjct: 220 ARLGLVEYERASEVKYHDFNGSKFTSDPESFISALKTI----KTKGYYENATVPLHHIAT 275 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + AN + +TD + TK+ + G + +G Sbjct: 276 SGDYNWGAGANNHRFAFLITDEDIDLTKDTPTKEATLKALQDAGISLTVVG 326 >gi|146298479|ref|YP_001193070.1| hypothetical protein Fjoh_0716 [Flavobacterium johnsoniae UW101] gi|146152897|gb|ABQ03751.1| conserved hypothetical protein [Flavobacterium johnsoniae UW101] Length = 287 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 60/160 (37%), Gaps = 22/160 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L M++++D+S S S K I + N + GL+ F Sbjct: 72 EEERELTMVLMVDISGSEGFGSKSQFKK------DIVTEIAATMAFSATQNNDKIGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S+ +E + + G SH+ R I+ L +F + T + Sbjct: 126 SDNVELYIPPKKGRSHVLRIIRELIEFEPKSQKTDV-------------AQALKFLSGTQ 172 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAI 325 KK IVFM + ++ +Q+L ++ G VY I Sbjct: 173 KKKAIVFMI--SDFMSENYEQTLKIASKKHDLTGVRVYDI 210 >gi|291451535|ref|ZP_06590925.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291354484|gb|EFE81386.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 444 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 55/146 (37%), Gaps = 27/146 (18%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +VLD S SM ++ D ++ + ++ A L++ +P +V F Sbjct: 233 VYLVLDRSGSMRPYYRDGTVQHLAEQALALAAHLDDDGTVP---------VVFFDTDAHP 283 Query: 233 FFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + +++ L + +TN + + +H A Sbjct: 284 ATEVS--LDAYEGRVQELHERYGHMGTTNYAAAML----------EVIEHYTATGATAPA 331 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEA 315 ++F TDG + +E ++ L C A Sbjct: 332 FVIFQTDGGPDAKREAERVL--CRAA 355 >gi|260642112|ref|ZP_05414581.2| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] gi|260623548|gb|EEX46419.1| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] Length = 615 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 40/297 (13%), Positives = 100/297 (33%), Gaps = 28/297 (9%) Query: 46 MKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVN 105 MKT + ++ A M+ + + +T +F ++ + N Sbjct: 116 MKTKAMTGAYVAVCPTAMYDMDTRMNTEEYDRIQENGFKSVADTPLSTFSIDVDPASYSN 175 Query: 106 DI-----DDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSR--HIVMPI 158 ++ + ++ + Y + P+K T + +V Sbjct: 176 MRRFINRGELPPADAIRTEELVNYFSYDYPKPTGND-PVKITVEAGTCTWNTAHRLVRIG 234 Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + ++ ++ +++ ++D+S SM ++D+ S+ ++ ++ V V Sbjct: 235 LKAKEIPTEQLPASNLVFLIDISGSMW-----GANRLDLVKSSLKLLVNNLRNKDKVAIV 289 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +G K+E + ++ I L+ G + G+ AY QI + Sbjct: 290 TYAGSAG--VKLEATSGGD--KQKIREAIDELTAGGST-AGGAGIHLAY-QIAKKNFISD 343 Query: 279 HCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 N I+ +DG+ N+ + + +K G + +G + + Sbjct: 344 GNNR--------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVHLTVLGYGMGNYKD 392 >gi|189465621|ref|ZP_03014406.1| hypothetical protein BACINT_01979 [Bacteroides intestinalis DSM 17393] gi|224540001|ref|ZP_03680540.1| hypothetical protein BACCELL_04913 [Bacteroides cellulosilyticus DSM 14838] gi|189437895|gb|EDV06880.1| hypothetical protein BACINT_01979 [Bacteroides intestinalis DSM 17393] gi|224518391|gb|EEF87496.1| hypothetical protein BACCELL_04913 [Bacteroides cellulosilyticus DSM 14838] Length = 289 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 42/95 (44%), Gaps = 6/95 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEFGTVKQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP 261 S++IE+F + G H+ I+ L F + T Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKAESRRTD 160 >gi|157961836|ref|YP_001501870.1| hypothetical protein Spea_2015 [Shewanella pealeana ATCC 700345] gi|157846836|gb|ABV87335.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345] Length = 445 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 40/111 (36%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 N G + ++ I L I + ++ HI K L + +D + ++AA ++ Sbjct: 16 RNQSGAILVMFTIGLFSILAFAALALDGGHILLSKGRLQNAVDAAALNAAKELQQGATLF 75 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 ++ +L + + N N + + + + ++ P Sbjct: 76 EARQAAYTILLQNLGFSENGELNNSISLSSPDFNTTQVTARLHVEFSEQPD 126 >gi|148657117|ref|YP_001277322.1| peptidase M23B [Roseiflexus sp. RS-1] gi|148569227|gb|ABQ91372.1| peptidase M23B [Roseiflexus sp. RS-1] Length = 982 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 39/116 (33%), Gaps = 15/116 (12%) Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 ST+ GL+ + + R + IV ++DG+ + L + Sbjct: 574 GSTSIGGGLQRSQQLLSASAPGR----------TRAIVLLSDGQENTAPYVSDVLP---Q 620 Query: 315 AKKRGAIVYAIGIRVIRSHEF-LRACA-SPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + V+ IG+ + L A + ++ P + ++ I + + Sbjct: 621 IRASQITVHTIGLGTDADQQLMLSIAAQTGGTYNYAPRPDQLAGIYNTISGAVSNR 676 >gi|228982995|ref|ZP_04143254.1| tellurium resistance protein [Bacillus thuringiensis Bt407] gi|228777178|gb|EEM25486.1| tellurium resistance protein [Bacillus thuringiensis Bt407] Length = 256 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 60/152 (39%), Gaps = 15/152 (9%) Query: 173 DMMIVLDVSRSMESFF-DSSITKIDMAIKSINAMLEEVKLIP------DVNNVVQSGLVT 225 ++ VLD S SM + ++++ + +I ++ +++ I N+ Sbjct: 27 QVVFVLDGSTSMTHLYKNNTVQDVTESILAVALNIDDNNSIDVYGFGKSHRNIGDVSRAN 86 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF------GVSTNSTPGLKYAY-NQIFDMQGM-R 277 + N I+ FL ++ + +++ +T + K + +++ + + Sbjct: 87 YRNYIQNVFLKKFDLEPYTAYAGVMNEIVNEYGTKTTTKTIEQPKSGFLSKLLGKTEVVK 146 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + +VF+TDG+N E ++ + Sbjct: 147 VDKEQGPVKHPTFVVFITDGDNFDKPEAERII 178 >gi|118357564|ref|XP_001012031.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293798|gb|EAR91786.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 896 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 60/171 (35%), Gaps = 31/171 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM S I A +++ L+ + I NVV G S++ Sbjct: 314 FIFLLDRSGSM------SGQPIQKACEALILFLKSL-PIDSYFNVVSFGS---SHEKLFQ 363 Query: 234 FLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +++ + L++ IK + + T L+ + + +Q Sbjct: 364 TSIKYDTNSLEKAIKIIKNYTADLGGTEIYKPLQSVFKETKIDGYNKQ------------ 411 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACA 340 I +TDGE S KE + + K + +IG + A Sbjct: 412 IFLLTDGEVESPKEVVKLIK-----KNNKYNRINSIGFGSGADKYLIEESA 457 >gi|329963579|ref|ZP_08301058.1| hypothetical protein HMPREF9446_02652 [Bacteroides fluxus YIT 12057] gi|328528568|gb|EGF55539.1| hypothetical protein HMPREF9446_02652 [Bacteroides fluxus YIT 12057] Length = 289 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 6/87 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLE------FGTIKQMKKDMATEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF 253 S++IE+F + G H+ I+ L F Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDF 152 >gi|297843636|ref|XP_002889699.1| T27G7.20 [Arabidopsis lyrata subsp. lyrata] gi|297335541|gb|EFH65958.1| T27G7.20 [Arabidopsis lyrata subsp. lyrata] Length = 772 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 59/179 (32%), Gaps = 20/179 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + T +R + + T A ++ V+D S SM Sbjct: 522 KRLAVDATLRAAAPYQKLRREKDITGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 581 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 582 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDAAEVLLPPSRSIAMA 630 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + +++ L G S + + + + + +IV +TDG T Sbjct: 631 RNRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANIT 681 >gi|221042216|dbj|BAH12785.1| unnamed protein product [Homo sapiens] Length = 677 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 68/196 (34%), Gaps = 28/196 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE- 232 ++ V+D+S SM K+ +++ +L +++ + LV F +++ Sbjct: 293 VVFVIDISGSMRG------QKVKQTKEALLKILGDMQPGDYFD------LVLFGTRVQSW 340 Query: 233 ----FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E + Q ++ S +TN GL + +Q +N+ Sbjct: 341 KGSLVQASEANLQAAQDFVRGFS-LDEATNLNGGLLRGIEILNQVQESLPEL----SNHA 395 Query: 289 KIIVFMTDGENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS--- 344 I++ +TDG+ + Q L A + +Y +G FL + N+ Sbjct: 396 SILIMLTDGDPTEGVTDRSQILKNVRNAIRGRFPLYNLGFGHNVDFNFLEVMSMENNGRA 455 Query: 345 --FYLVENPHSMYDAF 358 Y + F Sbjct: 456 QRIYEDHDATQQLQGF 471 >gi|115345667|ref|YP_771848.1| von Willebrand factor type A domain-containing protein [Roseobacter denitrificans OCh 114] gi|115292988|gb|ABI93440.1| von Willebrand factor type A domain protein, putative [Roseobacter denitrificans OCh 114] Length = 349 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 54/169 (31%), Gaps = 13/169 (7%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + LD+++VLD S SM + ++ + A++ + Sbjct: 60 TGVASGVGSSGDGLDLVLVLDSSGSMADINNGK-SRQQWQRDAAIALVNSLPAA-----T 113 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 +V F + L I + N++ G A + Sbjct: 114 TSVSIVEFDSNANVVTGLTSLTP--ASNIPAIIAAINGVNASGGTNIASGIAAAAGELTG 171 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 T + K +V ++DG+ + ++ +L ++++ I Sbjct: 172 ANATTGRS--KQMVVISDGDPTAGDQNAAALAAVAAGVNN---IHSVAI 215 >gi|319902107|ref|YP_004161835.1| hypothetical protein Bache_2281 [Bacteroides helcogenes P 36-108] gi|319417138|gb|ADV44249.1| protein of unknown function DUF58 [Bacteroides helcogenes P 36-108] Length = 289 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 6/87 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E I K + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLE------FGTIKQMKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF 253 S++IE+F + G H+ I+ L F Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDF 152 >gi|227827415|ref|YP_002829194.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584630|ref|YP_002843131.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619571|ref|YP_002914396.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459210|gb|ACP37896.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019679|gb|ACP55086.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380640|gb|ACR41728.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 380 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 66/211 (31%), Gaps = 34/211 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ LD S SM + KI++A + + + + + +TF Sbjct: 33 GSATGFHYIVALDTSGSMTGY------KIELAKQGAIELFKRIPKGN------KVSFITF 80 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDA 285 S+ V+ ++ + L G + A Y I + + Sbjct: 81 SS----------NVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT- 129 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG L K VY+ GI + + L++ + + Sbjct: 130 ----YLLLLTDGNPTDETNVGNYLKLPYFEK---MQVYSFGIGDDYNEQLLQSISDKTSG 182 Query: 344 SFYLVENPHSMYDAF-SHIGKDIVTKRIWYD 373 Y + + + + I K + D Sbjct: 183 VMYHISDANEIPQKLPQKAVTQIAAKNVTVD 213 >gi|156974653|ref|YP_001445560.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] gi|156526247|gb|ABU71333.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] Length = 2127 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 63/182 (34%), Gaps = 26/182 (14%) Query: 155 VMPITSSV--KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P ++S+ V ++T ++ +++DVS SM + ++ + S +L + + I Sbjct: 1665 DSPYSTSIVHDVVAETKESANVQLIMDVSGSMRTG-----NRLQIMKDSATQLLNQYESI 1719 Query: 213 PDVNNVVQSGLVTFSN--KIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 + ++ +S W + I L+ G +TN + A + Sbjct: 1720 ----GQTRVQIIKYSGSATTYAIAGATWLTVDEAKAYIDTLTA-GGATNYNRAINEAKDS 1774 Query: 270 IFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIG 326 D + N + F++DG+ D + + + + A+ Sbjct: 1775 WDDAGKLPSASN--------VSYFLSDGQPNPASSFINDARENSWIDHLTDSDNQITALA 1826 Query: 327 IR 328 Sbjct: 1827 YG 1828 >gi|119775138|ref|YP_927878.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella amazonensis SB2B] gi|119767638|gb|ABM00209.1| inter-alpha-trypsin inhibitor domain protein [Shewanella amazonensis SB2B] Length = 753 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 68/205 (33%), Gaps = 38/205 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 +++V+D S SM + A ++ L + N ++ FS+ Sbjct: 399 LVLVIDTSGSMAG------DSMVQARSALIHALGGLGPQDSFN------IIAFSSDARPL 446 Query: 230 -IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + + + Q+ ++ L G + ++ L+ A + + Sbjct: 447 WPDAKPATAFNLGAAQQFVRSLEADGGTEMAS-ALELALKTPSVVDEDTKRL-------- 497 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACASP--NSF 345 + ++F+TDG + ++ G ++ + I + F+ A+ SF Sbjct: 498 RQVLFITDGAVNGEDALFNLIE-----RRLGTSRLFPVAIGAAPNGYFMSRAAAAGRGSF 552 Query: 346 YLV----ENPHSMYDAFSHIGKDIV 366 + E M S I +V Sbjct: 553 TFIGHGGEVAEKMNQLLSRIEHPVV 577 >gi|325297737|ref|YP_004257654.1| hypothetical protein Bacsa_0585 [Bacteroides salanitronis DSM 18170] gi|324317290|gb|ADY35181.1| protein of unknown function DUF58 [Bacteroides salanitronis DSM 18170] Length = 289 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + K + + + N + G++ F Sbjct: 72 EEERELTVMLLVDVSNSL------DFGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F T++ ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDF-QPTSTRTDIQCAIEYLTNVLKKR 175 >gi|308473978|ref|XP_003099212.1| CRE-CLEC-66 protein [Caenorhabditis remanei] gi|308267685|gb|EFP11638.1| CRE-CLEC-66 protein [Caenorhabditis remanei] Length = 375 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 33/157 (21%), Positives = 62/157 (39%), Gaps = 16/157 (10%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---N 228 LD++ V+D S++M ++ K+ I S+ + ++ D + G +T++ Sbjct: 35 LDIVFVVDNSKNM------NMYKVFDTISSLFSPYVQIGTGYDDPRSTRVGFITYNWNAT 88 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + +F+ L+ S L +I+ LS +S + M Q NYK Sbjct: 89 DVADFYKLQ-SYSDLSNQIQQLSTTPLSRRDESYIDTGLAAAIRMVNATQGL---RDNYK 144 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 K++VF T N + K GA + + Sbjct: 145 KVVVFFTSQYNYYNTYPEDQAKL---LKSWGATLITV 178 >gi|291569186|dbj|BAI91458.1| magnesium chelatase subunit D [Arthrospira platensis NIES-39] Length = 663 Score = 39.0 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 66/201 (32%), Gaps = 42/201 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D S SM ++ ++ A ++ +L E D Q L+ F + E Sbjct: 468 VVFVVDASGSM------ALNRMQSAKGAVMQLLTEAYQSRD-----QVSLIPFRGEQAEV 516 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 ++ +R+++ + G S + GL A + +Q + +IV Sbjct: 517 LLPPTRSIAAAKRRLERMPCGGGSPLAH-GLTQAVRVGMN---AKQSGDIGQV----VIV 568 Query: 293 FMTDGENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRS 332 +TDG K D + L + + G + I + + Sbjct: 569 AITDGRGNIPLARSLGEPILEGEKPDIKGELLEIAAKIRGLGIQLLVIDTENKFVSTGFA 628 Query: 333 HEFLRACASPNSFYLVENPHS 353 E + +Y + Sbjct: 629 KELAKTAG--GKYYHLPKATD 647 >gi|296131764|ref|YP_003639011.1| von Willebrand factor type A [Thermincola sp. JR] gi|296030342|gb|ADG81110.1| von Willebrand factor type A [Thermincola potens JR] Length = 584 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 32/213 (15%), Positives = 70/213 (32%), Gaps = 39/213 (18%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 RH+ + + ++ A +D + +D S SM + I+++ + + Sbjct: 384 GQRHMSIKREDIRQYGRKSFAPIDTCLAIDCSGSM----------VGEKIRAVAYLAQHF 433 Query: 210 KLIPDVNNVVQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 L +VTF S+K+ F + L ++ + G++ + L+ Sbjct: 434 LLTSREK----VSVVTFQETSSKVVIPFTKSY--QKLVEGLRSIQPEGMTPLAKGILEAV 487 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST----KEDQQSLYYCNEAKKRGAIV 322 ++ + R +V +TDG +L + + Sbjct: 488 --ELIKKKRARNPL----------LVLITDGIPNYPLWTTDAQADALKAAEMIAENKIRL 535 Query: 323 YAIGIRVIRSHEFLRACAS--PNSFYLVENPHS 353 IG+ FL+ A + Y+V+ Sbjct: 536 VCIGVLPNE--SFLKELAKIGKGNLYIVDELDK 566 >gi|168011582|ref|XP_001758482.1| predicted protein [Physcomitrella patens subsp. patens] gi|162690517|gb|EDQ76884.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1042 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 56/148 (37%), Gaps = 20/148 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + + ++ +LD S SM ++ ++ +D A ++ +V+ V + Sbjct: 42 ITSRKGGGGAMLVVFLLDTSASMNQRCNNGLSLLDCAKSAVEHF-HKVRSRDASCRVDRY 100 Query: 222 GLVT------FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI----- 270 LVT + ++++ +H R +K + + TN L +N + Sbjct: 101 MLVTCDEGSDYIKVVDKYP-----FTHFMRALKSVQARDM-TNLGGALHRIFNFLHVQRL 154 Query: 271 -FDMQGMRQHCNTEDANYKKIIVFMTDG 297 D+ Q N + I+ +TDG Sbjct: 155 VLDIDRYGQGRNPCYNDLT-TILLLTDG 181 >gi|154489100|ref|ZP_02029949.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] gi|154083237|gb|EDN82282.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] Length = 560 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 54/159 (33%), Gaps = 15/159 (9%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 V+D S SM S + +++ I ++ V + F++ + + Sbjct: 387 VVDYSGSMSGKGKSGA--VAGLQAALDTDQARASHIEPGDDDVNV-FIPFNSSAKVAQVA 443 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + + + + L+ A + + +Y I +TD Sbjct: 444 QGKQTATLLAASENQVANGNADIYNALEVALKNLPS----------DRDDYTVAIALLTD 493 Query: 297 GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 G++ + K D+ Y ++ K G +++I S + Sbjct: 494 GQSDTAKLDEFKQQYASDGK--GVPIFSIMFGDADSQQL 530 >gi|322435085|ref|YP_004217297.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] gi|321162812|gb|ADW68517.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX9] Length = 794 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 70/226 (30%), Gaps = 34/226 (15%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPL---------KFCTFIPWYTNSRHIVMPIT 159 + + P+ I + R P K P + ++ ++ P Sbjct: 233 TFTDDLAATWTLDPKAADTLIVSTFRNPHPPLPSATERSPKPPQPEPGFFLAQTLIAPEP 292 Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + S+ +++++LD S SM+ K++ + ++ +L ++ Sbjct: 293 PATTHASE---PRNVILLLDTSLSMQW------DKLERSYAALETVLRSLQPTD------ 337 Query: 220 QSGLVTFSNKIE--EFFLLEWGVSHLQRKIK--YLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + L+ F+ + + +Q ++ S+ T+ L A Q Sbjct: 338 RFSLMLFNQDLSWFRPDPTQATPESVQEALQFIRASRLRGGTDLGKALAAALTQAKSPNQ 397 Query: 276 MRQHCNTEDANY------KKIIVFMTDGENLSTKEDQQSLYYCNEA 315 +++ KI T N ST ++A Sbjct: 398 SLYLFTDGNSDRGVTILDNKIAAAYTQQWNHSTHPRTNVFAIGDDA 443 >gi|260803057|ref|XP_002596408.1| hypothetical protein BRAFLDRAFT_76225 [Branchiostoma floridae] gi|229281663|gb|EEN52420.1| hypothetical protein BRAFLDRAFT_76225 [Branchiostoma floridae] Length = 1258 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 48/141 (34%), Gaps = 21/141 (14%) Query: 165 NSQTDARLDMMIVLDVSRSMESF-FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ LD+ V D S S+ F++S+ I + V +P + G Sbjct: 206 DTNNPNGLDLYFVFDASESVGDHQFNASVDFAKALIGKVG-----VSTLPGG---TRVGA 257 Query: 224 VTFSNKIEEFFL--------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + F++K + F G + TN+ LK+ + Sbjct: 258 IAFASKTKVLFSKQEVLSVQDTLGRLDNVDWSNLSKEINSGTNTREALKFLRENMIV--- 314 Query: 276 MRQHCNTEDANYKKIIVFMTD 296 Q E A K+ ++ +TD Sbjct: 315 ASQQVFPERA-AKQAVILITD 334 >gi|188581386|ref|YP_001924831.1| hypothetical protein Mpop_2134 [Methylobacterium populi BJ001] gi|179344884|gb|ACB80296.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 543 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 72/220 (32%), Gaps = 8/220 (3%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 F + +G ++I+TA+ L + +++S ++ + + D + + AA+ + Sbjct: 1 MRFRSDQRGAVSIVTALGLTTLLGAAAFGLDLSRLYGTQRRVQGAADLAALSAASDLSTA 60 Query: 69 GNGNNRKKLKGGD-ILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 R G RI R+ NG +T+ + V V Sbjct: 61 DAAARRALADNGFGDGTRIGVQAGAYLRSAAVANGSRFT----PGATAPNAVRVSLTAPV 116 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD-ARLDMMIVLDVSRSME- 185 ++ +P + + S A + +L S S++ Sbjct: 117 PLTFGRYLGLPTHYDVSAVGTAANTRFAAFSVGSGVAALDAGLANAVLGALLGTSLSLDL 176 Query: 186 -SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + T+ID+ L+ N++V +G+ Sbjct: 177 MDYEALLSTRIDVFRFLDALALDLGLQAVSYNDIVTAGVT 216 >gi|304382528|ref|ZP_07365023.1| von Willebrand factor [Prevotella marshii DSM 16973] gi|304336359|gb|EFM02600.1| von Willebrand factor [Prevotella marshii DSM 16973] Length = 289 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 73/193 (37%), Gaps = 22/193 (11%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + +M+++DVS S++ F + TK ++ + + I + + + G++ F Sbjct: 72 EEEREMTVMLLIDVSGSLD-FGTTQRTKAELVTEIAATL--SFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 S++IE++ + G H+ I+ + F + T + A + + R Sbjct: 126 SDQIEKYIPPQKGRKHILYIIREMLDFHAESRRTD-IGMAVEFLTRVMKRRCTAFVLSDF 184 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSHEFLRAC-ASPNS 344 Y + ++ ++ L CN VY I + + L+ C A Sbjct: 185 YTR-------------QDFERQLQICNSKHDVVAIQVYDIRAKELPDVGLLKVCDAETGH 231 Query: 345 FYLVENPHSMYDA 357 ++ A Sbjct: 232 EMYIDTSSKKLRA 244 >gi|241113142|ref|YP_002972977.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861350|gb|ACS59016.1| TadE family protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 176 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 33/131 (25%), Positives = 55/131 (41%), Gaps = 10/131 (7%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 ++R F + KG I AI +F+++ IIEVS +FF+ +S +D S+ + I Sbjct: 9 SLRRLFGDRKGVAAIEFAILALPLFIMIFGIIEVSLMFFV----NSALDASVHKISRMIR 64 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 + K+ D RI N +SF V + D+ + S D + N Sbjct: 65 TGEVAS--SKITLADFKARICNDMLLSFSCSSGLLVKVIVLSDLSSAASTDPIDDSGN-- 120 Query: 127 YSISAISRYKI 137 ++ Y I Sbjct: 121 --LTVTETYDI 129 >gi|269963802|ref|ZP_06178117.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269831464|gb|EEZ85608.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 1919 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 46/297 (15%), Positives = 91/297 (30%), Gaps = 51/297 (17%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRD------------ 100 +D S V A + + EG+ G + +E + Sbjct: 758 TLDESAVGANSGVNAEGDLVANLGSAPGAEFRFDSEQNLENVTSEGKPVVFEVSDDRGTL 817 Query: 101 NGFVNDIDDIVRSTSLDIVVVPQNEGYSISA----------ISRYKIPLKFCTFIPWYTN 150 G+V DD V+ I + YS+ IP+ Sbjct: 818 TGYVGSGDDKVKVIEASI--DGKTGHYSVEQYEPVDHSELGADTLDIPVSVQVDAGGRVE 875 Query: 151 SRHIVMPITSSVKVNSQTDARL--------DMMIVLDVSRSMESFFDSS-----ITKIDM 197 + + + + S+ + + ++I LD S SM + +T+ D+ Sbjct: 876 NATLHLGVVDSLPSVNSEHHEIMDVDPQNNSVVIALDASGSMWDDMVKNEQGVEVTRWDL 935 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRKIKYLSKFGV 255 A +I M E+ + D V+ + T S + EW V + + + + G Sbjct: 936 ARSAIKTMFEKYDDLGD----VRFKIATHSGYPDGQTS-EWLESVDDIDAYLDSIGRGGW 990 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 + S A +Q+ + +N++ F++DG + Sbjct: 991 TPYS-----QAIDQVNAALTDLSQETDDTSNHQLY--FISDGAPSDFNSWTNTSNPA 1040 >gi|222481276|ref|YP_002567512.1| hypothetical protein Hlac_3401 [Halorubrum lacusprofundi ATCC 49239] gi|222454652|gb|ACM58915.1| conserved hypothetical protein [Halorubrum lacusprofundi ATCC 49239] Length = 209 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 46/136 (33%), Gaps = 14/136 (10%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + VLD S SM++ D + +K + L F+ +++ Sbjct: 5 ITFVLDSSGSMDAIAD----------DTRGGFNTFLKDQRNEEGTATVTLYDFNTTVDQI 54 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + V+ K T + A ++ + + IIV Sbjct: 55 YE-TYPVADAPELTDENYKSRGRTALHDAIARAVDE---TAEDIAAVDQAEQPDNVIIVV 110 Query: 294 MTDGENLSTKEDQQSL 309 +TDG+ +++ + ++ Sbjct: 111 LTDGKENASETPKDAV 126 >gi|148360718|ref|YP_001251925.1| type IV fimbrial biogenesis PilY1-like protein [Legionella pneumophila str. Corby] gi|296106216|ref|YP_003617916.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] gi|148282491|gb|ABQ56579.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila str. Corby] gi|295648117|gb|ADG23964.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] Length = 1169 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 31/87 (35%), Gaps = 12/87 (13%) Query: 293 FMTDGENLSTKEDQ--QSLYYCNEAKKRGAIVYAIGIRV-------IRSHEFLRACA--- 340 F DG ST ++ K G + + IG+ + LRA A Sbjct: 442 FNADGSLNSTNSQALSDAINEIKALKNDGVLTFIIGMGAGVDPAVNPEAAATLRAMAVAG 501 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVT 367 ++Y +P ++ + + I +I Sbjct: 502 GTENYYPATSPEALVSSLNSILSNIQN 528 >gi|124027145|ref|YP_001012465.1| von Willebrand factor type A protein (vWA) [Hyperthermus butylicus DSM 5456] gi|123977839|gb|ABM80120.1| Von Willebrand factor type A protein (vWA) [Hyperthermus butylicus DSM 5456] Length = 263 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 63/187 (33%), Gaps = 22/187 (11%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++ +LD S SM +K + ++ + +L + ++ + G V FS+ ++ Sbjct: 22 VIFMLDRSLSMSELKTYDHRSKWEHLVELVERLLARLAR-SNMAPAFRVGFVWFSDDVKV 80 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNST-----PGLKYAYNQIFDMQGMRQHCNTEDANY 287 + L F ST + A + ++ Sbjct: 81 VERGGAVYFPVTEHNPALRVFQESTEENRPYGMTAMADALE--HAAMIIERYIEDTSIPS 138 Query: 288 KKII-VFM-TDG-------ENLSTKEDQQSLYYCNEAKK----RGAIVYAIGIRVIRSHE 334 +K + VFM TDG E++ D+ + + + + I + + E Sbjct: 139 EKYVTVFMFTDGKETKRTPEDVVEVADRITGELQEKLRALNQRNRIGLATIALGLDADRE 198 Query: 335 FLRACAS 341 LR AS Sbjct: 199 TLRTIAS 205 >gi|296448099|ref|ZP_06890001.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296254413|gb|EFH01538.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 618 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 46/150 (30%), Gaps = 4/150 (2%) Query: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 + A LP++ V +++E + +K + D + A + Sbjct: 1 MFAFVLPLVIGVCSLVVEFGNALLIKARYQRVADIASFSGALAYSGTSSTTTMTNASVAV 60 Query: 82 ILCRIKNTWNMSFRNELRDNGFVNDI--DDIVRSTSLDIVVVPQN-EGYSISAISRYKIP 138 +S + N I + L + V I+A + +I Sbjct: 61 AALNQVPAAAVSANLTVSPASSSNSAVLVAISAAHPLFLTPVLNGPRSLQIAASAYSQIA 120 Query: 139 LKFCTFIPWYTNSRH-IVMPITSSVKVNSQ 167 + + + +S+ + + +SV S Sbjct: 121 VSASSCLLALDSSKSGVTLSGGTSVTATSC 150 >gi|254295356|ref|YP_003061379.1| hypothetical protein Hbal_3014 [Hirschia baltica ATCC 49814] gi|254043887|gb|ACT60682.1| hypothetical protein Hbal_3014 [Hirschia baltica ATCC 49814] Length = 264 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 77/213 (36%), Gaps = 35/213 (16%) Query: 129 ISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS-------QTDARLDMMIVLDVS 181 I+ K + C + + I + ++ ++ + D+ + I+LD + Sbjct: 2 ITNTKTSKTLFQKC--VSGFLAITAISLSACANAEIAPDPDISRIEGDSPMHAYILLDRT 59 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL------VTFSNKIEEFFL 235 SM S +D ++ ++ S+ ++ DV+ V + + F + Sbjct: 60 GSMSSIWDEALNSVNTYAVSLGT--KQAGEKGDVDADVTLAVFDAQDGLQFDVLRKGASA 117 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 EW + S G+ +FD G D K ++V MT Sbjct: 118 KEWN------------EVSNDEASPRGM----TPLFDAIGRIVSIAETDKPEKAVLVIMT 161 Query: 296 DGENLSTKEDQQ--SLYYCNEAKKRGAIVYAIG 326 DG+ S++E + + + +++G V +G Sbjct: 162 DGQENSSREITKHGAKAALDRVREKGWEVVFLG 194 >gi|227819050|ref|YP_002823021.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] gi|227338049|gb|ACP22268.1| hypothetical protein NGR_b08120 [Sinorhizobium fredii NGR234] Length = 155 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 20/43 (46%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLH 51 R + + G + ++ A+ LP++ +G+ E + + L Sbjct: 18 RGWLKDESGTVAVIAAVTLPVLVGAMGLGAETGYWYLKDRKLQ 60 >gi|281202791|gb|EFA76993.1| hypothetical protein PPL_09745 [Polysphondylium pallidum PN500] Length = 717 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 52/167 (31%), Gaps = 28/167 (16%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 + +LD S S+ + A I + +++ ++ + FS+ + Sbjct: 29 ISCTFILDSSGSVGNEG------FKYAKMVITSFCDKLDPQ----MYIRI--INFSSDSK 76 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + I+ L T ++ +F + + +I Sbjct: 77 VHLEFTKIRNEISKSIENLVFMSGGTEMKKAFIQTHDGLFSVSYGKN-----------LI 125 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 V +TDGE K+ + IG+ I+ F + Sbjct: 126 VLITDGEPTDDPSS----AIRTLLKEN-VDILLIGVGSIKRQFFEKN 167 >gi|282900088|ref|ZP_06308045.1| Mg chelatase subunit [Cylindrospermopsis raciborskii CS-505] gi|281194970|gb|EFA69910.1| Mg chelatase subunit [Cylindrospermopsis raciborskii CS-505] Length = 671 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 60/182 (32%), Gaps = 22/182 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + ++ A R P + R IV K + L ++ V+D S Sbjct: 426 GKVKRIAVDATLRAAAPYQ-KARRLRQPGKRVIVEQGDMRTKRLVRKAGAL-VVFVVDAS 483 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGV 240 SM ++ ++ A ++ +L E N Q L+ F + E + Sbjct: 484 GSM------ALNRMQSAKGAVMQLLTE-----SYQNRDQIALIPFRGEQAEVLLPPTRSI 532 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + +++ L G S + + + + + +IV +TDG Sbjct: 533 ALAKTRLEKLPCGGGSPLAHG--------LTQAVRVGVNAQMGGDIGQVVIVAITDGRGN 584 Query: 301 ST 302 Sbjct: 585 IP 586 >gi|319640967|ref|ZP_07995675.1| hypothetical protein HMPREF9011_01272 [Bacteroides sp. 3_1_40A] gi|317387412|gb|EFV68283.1| hypothetical protein HMPREF9011_01272 [Bacteroides sp. 3_1_40A] Length = 289 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + K + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSL------DFGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T +K A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTD-IKMAVEYLTNVIKKR 175 >gi|297841999|ref|XP_002888881.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] gi|297334722|gb|EFH65140.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] Length = 757 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 68/197 (34%), Gaps = 43/197 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D+S SM+ ++ K++ L +++ N ++ F+++I E Sbjct: 327 VVFVIDISASMKW------KPLEDVKKALLECLAKLQAEDVFN------IIAFNDEILEF 374 Query: 233 FFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 +E+ S + + TN LK A + + N Sbjct: 375 STSMEFATDETISAVTEWLDTNLIANGGTNMLLPLKQAIKLL-------EGSNIGVP--- 424 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKR--------GAIVYAIGIRVIRSHEFLRACA 340 ++ +TDG + CN K+ + GI +H FL+ A Sbjct: 425 -LVYLVTDG------SVENEREICNAMKESCSRNGKSISPRISTFGIGSFCNHYFLQMLA 477 Query: 341 SPNS-FYLVENPHSMYD 356 + +Y N ++ Sbjct: 478 RIGNGYYDGTNNTDSFE 494 >gi|21325496|dbj|BAC00118.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 925 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 76/249 (30%), Gaps = 53/249 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSM------------ESFFDSSITKIDMAIKSI 202 S++ LD++I++D + S+ E D+ ++ A + Sbjct: 82 ASGSLSNLGACIADKGTLDVIIMIDETESLIHEARDGVVNANEPGADAQHHRVPAAQSFV 141 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VSHLQRKIKYL--S 251 + +L + + D+N ++ + K +G V +Q +I Sbjct: 142 DELLAK-QSDGDLNTRIRVAGFGQTYKSGATDPDNYGAWTQLDASTVGGVQDEISRFADR 200 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS-TKEDQQSLY 310 TN ++ AY + +++V TDG + D Sbjct: 201 TQEQYTNYASAIEGAYQDFTRS---------GSEDACRMLVTFTDGALTAQEGADVAEAA 251 Query: 311 YC------NEAKKRGAIVYAIGIRVIRS---HEFLRACA----------SPNSFYLVENP 351 C + + G IG+ + LR + +F+ +N Sbjct: 252 LCAPGGVTDRLRSAGITHIGIGLSAPTNPSDFSLLRGTTAGGGTCGVEPANGAFFPADNV 311 Query: 352 HSMYDAFSH 360 ++ AF Sbjct: 312 GGLFAAFRE 320 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 47/150 (31%), Gaps = 10/150 (6%) Query: 192 ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIK 248 +T++D I ++ + + + GLV+++ + L+ S+ Sbjct: 70 LTQVDGNIATVFGYDTRIGVRDYEPKTTRVGLVSYNADAKILAGLDTYQSYDDLANGVFD 129 Query: 249 YLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 L+ + S A ++F+ YKK+++ + + Sbjct: 130 SLNSVSATDESYLAKGLSAAEKVFEEGKSTA----NRTQYKKVVIVY--ASSYKGTGELN 183 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + K G + + L+ Sbjct: 184 PVPVADRMKTAGVKIITVAFSQNNDDGLLK 213 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 7/61 (11%), Positives = 29/61 (47%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 + + KG + ++ A+ + + L ++I++ ++ + L + +D + + + ++ Sbjct: 1 MSLINSQKGTVIVVVALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDD 60 Query: 69 G 69 Sbjct: 61 S 61 >gi|255693882|ref|ZP_05417557.1| von Willebrand factor, type A [Bacteroides finegoldii DSM 17565] gi|260620311|gb|EEX43182.1| von Willebrand factor, type A [Bacteroides finegoldii DSM 17565] Length = 289 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|241205402|ref|YP_002976498.1| hypothetical protein Rleg_2696 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859292|gb|ACS56959.1| protein of unknown function DUF1194 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 258 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 68/187 (36%), Gaps = 16/187 (8%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVL--DVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++M ++ V + + +D+ +VL D SRSM+ + + ++ VK Sbjct: 7 VLMGLSGLVPIAQAGGSEVDVTLVLAVDTSRSMDFEEIGIQREGYVEALKHKEFIDAVK- 65 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGV----SHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + + ++ + + +++W V L ++T + A Sbjct: 66 -DGLTGRIAISYFEWAGYVVQDSVIDWQVIETEEDAIAFADKLEARPIATQRRTSISTAI 124 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 QG ++ + +++I DG N S + ++A + G I+ + I Sbjct: 125 -----AQGASMIVSSPFQSRRQVIDVSGDGPNNSGNPVTPAR---DKAVEAGMIINGLAI 176 Query: 328 RVIRSHE 334 + S Sbjct: 177 MLRPSDA 183 >gi|39936736|ref|NP_949012.1| hypothetical protein RPA3674 [Rhodopseudomonas palustris CGA009] gi|192292562|ref|YP_001993167.1| TadE family protein [Rhodopseudomonas palustris TIE-1] gi|39650592|emb|CAE29115.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286311|gb|ACF02692.1| TadE family protein [Rhodopseudomonas palustris TIE-1] Length = 177 Score = 39.0 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 22/52 (42%), Positives = 28/52 (53%), Gaps = 1/52 (1%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI-DRSLV 59 R F N KG + AI PI F +L IIEV+ IFF VL + + D S + Sbjct: 13 RKFRRNRKGSAAVEFAIVAPIFFALLFAIIEVAMIFFASQVLETAVQDSSRL 64 >gi|299136257|ref|ZP_07029441.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298602381|gb|EFI58535.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 806 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 61/179 (34%), Gaps = 34/179 (18%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 K P + ++ ++ P ++ + A D++++LD S SM+ ++ + Sbjct: 284 KPRQPEPGFFLAQTLIAPPVAAAPTQT---APRDVVLLLDTSLSMQW------DNLERSF 334 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFG--V 255 ++ A+L +K + L+ F+ + + L +Q+ + ++ Sbjct: 335 AALQAVLLSLKPADHFS------LLLFNQDVTLFKPEPLAATPEAVQQALDFVRASKLRG 388 Query: 256 STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNE 314 T+ L G+ Q + + +TDG N + Sbjct: 389 GTDLGKAL---------TAGLAQAKLPNSS-----LFLLTDG-NSDRGTTVITGKIATA 432 >gi|311900457|dbj|BAJ32865.1| hypothetical protein KSE_71090 [Kitasatospora setae KM-6054] Length = 717 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 82/261 (31%), Gaps = 13/261 (4%) Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVP-QNEGYSISAISRYKIPLKFCTFIPWYTN 150 + F N + + D V P +++A + Y+ F Y N Sbjct: 107 RQLDGGVPAGWFTNPVLRTGPGSGSDSVASPYAFTTPALAAGNTYRSTADF-MISDDYRN 165 Query: 151 SRHIVMPITSSVKVNSQ--TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + N LD+ +VLD+S S+ S T D ++ Sbjct: 166 LPTASGGVWQDSRDNPPLPARCGLDVALVLDLSASVGSELPFLKTAADRFTDALTGTPSR 225 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + + + + V+ ++ G + + + G TN L ++ Sbjct: 226 L-AVFSFDQASPATSVSANHPELHPVSTPAGAAEFKA-LYAGWTLGKGTNWDTAL---WS 280 Query: 269 QIFDMQGMRQHCNTEDANYKKII-VFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 D N + DG ++ + ++ N K G+ + A+G+ Sbjct: 281 VANAAPRYDAVVVLTDGNPTRFADDAQGDGSRTHFRDVENGIFSANAVKAEGSRLIALGV 340 Query: 328 R---VIRSHEFLRACASPNSF 345 S LRA + P ++ Sbjct: 341 GKGVAGDSGLNLRAVSGPTAY 361 >gi|302540662|ref|ZP_07293004.1| von Willebrand factor, type A [Streptomyces hygroscopicus ATCC 53653] gi|302458280|gb|EFL21373.1| von Willebrand factor, type A [Streptomyces himastatinicus ATCC 53653] Length = 340 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 67/206 (32%), Gaps = 31/206 (15%) Query: 177 VLDVSRSMESFFDSSI--TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 V+D S SM +++D+ S+ L + GL FS +++ Sbjct: 136 VVDASASMSDPVPGRPGESRMDVTKASLRQALSRFNAGDE------IGLWEFSTELDGDR 189 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC-NTEDANYKK---- 289 ++ R++ + G + L A++ + + +T A YKK Sbjct: 190 --DYRELVATRRLGARTPDGTGQRA--ELAAAFDALKPLPEGSTGLYDTTLAAYKKAQET 245 Query: 290 -------IIVFMTDGENLSTKEDQQSLYYCNEAK----KRGAIVYAIGIRVIRSHEFLR- 337 +V +TDG N + + R + AI + R Sbjct: 246 FVRGKFNAVVMLTDGANQDPGSISRGALVKELKRLVDPDRPVPLIAIAVGPDADQAACRE 305 Query: 338 -ACASPNSFYLVENPHSMYDA-FSHI 361 A A+ S V +P + A I Sbjct: 306 IAQATGGSAQQVNDPAQINTAMLKAI 331 >gi|167537384|ref|XP_001750361.1| hypothetical protein [Monosiga brevicollis MX1] gi|163771189|gb|EDQ84860.1| predicted protein [Monosiga brevicollis MX1] Length = 1327 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 56/147 (38%), Gaps = 11/147 (7%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + + Q + ++++ V+D S SM +++ +D A +I ++ + Sbjct: 315 VKQLMARGEQPTSAMNVIFVIDTSASMHKQTPHNMSCLDTARIAILNFVK-ARSQTSHGR 373 Query: 218 VVQSGLVTFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + L+++ + L W + L+ ++ LS T+ T L A+ + + Sbjct: 374 YERYSLLSY--EQGSQALKCWNQPLPVLELCLQELSYTDC-TSPTEALFTAFRLLHVTRM 430 Query: 276 MRQ-----HCNTEDANYKKIIVFMTDG 297 + +I+ ++DG Sbjct: 431 PKDCDTYGQGRQPWNLLPAVIIHVSDG 457 >gi|119513236|ref|ZP_01632280.1| hypothetical protein N9414_00925 [Nodularia spumigena CCY9414] gi|119462103|gb|EAW43096.1| hypothetical protein N9414_00925 [Nodularia spumigena CCY9414] Length = 224 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 65/165 (39%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ I+ + + ++ +E+ V+ +VTF + + Sbjct: 24 VLLLDTSGSMQG------DAIEALNQGLLSLKDELVKNSLAARRVEVAIVTFDSNVN--- 74 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 +++ V+ Q L+ G++T G+ A + + Q + T Y + VFM Sbjct: 75 IVQDFVTADQFNPPILTAQGLTTM-GAGIHKALDMV---QERKSLYRTNGIAYYRPWVFM 130 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + +Q + E + + +G+ Sbjct: 131 ITDGEPQGELDHLIEQAAQRLQGEEANKRVAFFTVGVENANMMRL 175 >gi|221134029|ref|ZP_03560334.1| vault protein inter-alpha-trypsin [Glaciecola sp. HTCC2999] Length = 757 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 59/190 (31%), Gaps = 39/190 (20%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 SS+ + + + VLD S SM T + AI +I + + N Sbjct: 377 PSSMASTVTPSIQQNTIFVLDSSGSMHG------TALTQAIDAIREGVSYLTEHDTFN-- 428 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF--------GVSTNSTPGLKYAYNQI 270 +V F ++ W S ++ TN L + Q+ Sbjct: 429 ----IVDFDSEARAL----WRQSQFADEVSKAEAMRFLRHVDSDGGTNMQDALALSLTQL 480 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 D ++F+TDG + +E + + + ++ +GI Sbjct: 481 LDSSTGLTQ-----------VIFVTDGSINNERELLK--QIAEQLGDK--RLFTVGIGAA 525 Query: 331 RSHEFLRACA 340 + F+ A Sbjct: 526 PNSHFMEYAA 535 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 10/78 (12%), Positives = 36/78 (46%), Gaps = 4/78 (5%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 +G ++++ A+ + I + + ++V +F+ + L ++D + + A Q+ + + Sbjct: 14 RGTVSLMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQL----KSADDQ 69 Query: 76 KLKGGDILCRIKNTWNMS 93 ++L + + + Sbjct: 70 ATTNANVLSSVTSAAAQN 87 >gi|320169902|gb|EFW46801.1| von Willebrand domain-containing protein [Capsaspora owczarzaki ATCC 30864] Length = 966 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 58/176 (32%), Gaps = 33/176 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM ++I A ++ L+ + + +V F + + Sbjct: 293 VIFVVDRSGSMGG------SQILQARNALTLFLKSLPTNS------RFNIVGFGDSYQTL 340 Query: 234 FLLEWGVSHLQ-----RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F G S + + + T L+ A++ Sbjct: 341 FPEAVGYSQANLNTALKHVSTMDANLGGTELLAPLRA------------IERMKTRADFP 388 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 + + +TDG+ + DQ +++K +A+GI SH + A Sbjct: 389 RQVFVLTDGQ--VSNTDQVIAQVNSDSKNA--RYFALGIGSGVSHHLVEGIARAGK 440 >gi|228477464|ref|ZP_04062100.1| putative peptidoglycan-binding domain-containing protein [Streptococcus salivarius SK126] gi|228250899|gb|EEK10087.1| putative peptidoglycan-binding domain-containing protein [Streptococcus salivarius SK126] Length = 463 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 49/163 (30%), Gaps = 17/163 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D++ ++D S SM+S I+ ++ + N + GLV + Sbjct: 177 EVKGTTDIVFIVDKSGSMDSHITDVANNIEKLVRDL----------ESKNVQARLGLVDY 226 Query: 227 SN---KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + F + I L +T S A + I Sbjct: 227 ESSKNTTYHDFGGSKFTTDTNAFIGKLKAI-RTTGSDEEATTALSHIATSPDYTWGTGKN 285 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + ++ + +TD + T + + G + +G Sbjct: 286 N---RRFALLVTDEDIDLTLKTPTKDATLKALQNAGISLTVVG 325 >gi|300778617|ref|ZP_07088475.1| von Willebrand factor [Chryseobacterium gleum ATCC 35910] gi|300504127|gb|EFK35267.1| von Willebrand factor [Chryseobacterium gleum ATCC 35910] Length = 379 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 89/271 (32%), Gaps = 45/271 (16%) Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAIS 133 K + + ++ +L+ NG N T + +N Y Sbjct: 94 GKNILTPKLEAALREYALDQIFGKLKKNGAGNHRT---SKTGIGDERDGENRSYQYGDDL 150 Query: 134 RYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSIT 193 + + + + ++ A++ ++++D+S SM + + IT Sbjct: 151 AAVNMTESLKNAQINNGIADLRLTEDDLIVEETKHKAQMSTVLMIDISHSMILYGEDRIT 210 Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 ++ +++ ++ +V G + KI++ L+ G H Sbjct: 211 PAKKVAMALVELIKRKYPKDSIDIIV-FGNEAWPIKIKDLPYLKVGPYH----------- 258 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN------------LS 301 TN+ GL+ A + + R+ NT K I +TDG+ S Sbjct: 259 ---TNTVAGLELAMDIL------RRKRNTN-----KQIFMITDGKPSCIQLPSGEFYMNS 304 Query: 302 TKEDQQSLYYC----NEAKKRGAIVYAIGIR 328 D++ + C +A+K + I Sbjct: 305 FGLDERIVNQCLNRAAQARKLKIPITTFMIA 335 >gi|145297882|ref|YP_001140723.1| RTX protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142850654|gb|ABO88975.1| RTX protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 3195 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 32/276 (11%), Positives = 83/276 (30%), Gaps = 32/276 (11%) Query: 52 SMIDRSLVHA---ATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID 108 ++D ++ ++ + + G L W+ S + + Sbjct: 2520 EVLDSLVLSGMPKGAELYSGNQLLGTVGVDGKLTLTAGSGLWSASALDVKLSGLTLRVPG 2579 Query: 109 DIVRSTSLDIVVVPQ--NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 L + + + + SA I L + +++ ++ V Sbjct: 2580 SSAGQVDLKVEAIAKEVGTDQTSSASDEDSIRLSYFNATEGEPGNQNRTFGSEHNIVVGD 2639 Query: 167 QTDARL------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL--IPDVNNV 218 + + ++ ++D S S+ S + + + +K D Sbjct: 2640 LDGSVILPGQNYNIAFMVDSSGSIGS------DAMKTIKSQLAQVFASLKSSAGTDGAGK 2693 Query: 219 VQSGLVTFSNKIEEFFLLE-------WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 V LV F +++ + ++ + ++ + + K G +TN K Sbjct: 2694 VNIFLVDFDSQVNKSISVDLKDPDALNQLQNIIKTMDGSEKNGGATNYEDVFKT------ 2747 Query: 272 DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + N K + F+TDG+ + + Sbjct: 2748 TANWFKSDFAKLNTNAKNVAYFITDGQPNTFNSESN 2783 >gi|108762572|ref|YP_632172.1| putative lipoprotein [Myxococcus xanthus DK 1622] gi|108466452|gb|ABF91637.1| putative lipoprotein [Myxococcus xanthus DK 1622] Length = 426 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 30/71 (42%), Gaps = 7/71 (9%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 +P + + D++ V+D S SME ++ T++ LE +K Sbjct: 52 NLPGKCESESPVIAPQKTDILFVIDNSGSMEEEQNAIATELPA-------FLEALKEGSG 104 Query: 215 VNNVVQSGLVT 225 V+ + G++T Sbjct: 105 VSQDFRVGVIT 115 >gi|312129894|ref|YP_003997234.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906440|gb|ADQ16881.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 329 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 79/255 (30%), Gaps = 42/255 (16%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 ++ N Y+I I Y+ ++ +QT +M++VL Sbjct: 82 ILDRDNNNYTIQNIKAYEYRGDNVWKEDVEFKVQY------------TQTTDVGEMVLVL 129 Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S S+ T + + +E+ V+ G+V FS + Sbjct: 130 DRSESLG-------TDFERIKQYAAEFVEQTFASHPE---VKIGVVDFSAYP-SSLPITN 178 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG- 297 + IK L + T + + + A+ +I+V TDG Sbjct: 179 NKEVILNYIKNL-EMENFTALYDAMDMGVDMLL-----------RSASQSRILVTFTDGT 226 Query: 298 ENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRSH---EFLRACASPNSFYLV-ENPH 352 +N S L N K + IG+ + +S V ++ Sbjct: 227 DNFSRATLNDVLSKINSDKNLNKIRGFFIGL-AGKGDLDTSVPTLLSSKGWIVSVPQSAT 285 Query: 353 SMYDAFSHIGKDIVT 367 + + F+ G+ I Sbjct: 286 QVKEVFNKFGRLISN 300 >gi|119872363|ref|YP_930370.1| von Willebrand factor, type A [Pyrobaculum islandicum DSM 4184] gi|119673771|gb|ABL88027.1| von Willebrand factor, type A [Pyrobaculum islandicum DSM 4184] Length = 358 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 87/268 (32%), Gaps = 42/268 (15%) Query: 104 VNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 +N + V+ I + Y SA+ K + + + + + Sbjct: 121 INLPQEAVKEVEAKITARSEGFAYIKSAVKSIKQYYPYVSISDVDLIRTAMSIARKALQN 180 Query: 164 VNSQTDARL----------DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + I LDVS SM+ + ++TK+ +A +I L ++ + Sbjct: 181 KPISDTDIYIREYIHIVDKPIYIALDVSGSMKEYI-GALTKLKVAKNAIARYLHQMAHLR 239 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 + LV F+ + W + ++ + + LKY Y + Sbjct: 240 GL-----VSLVLFNTDAD----FMWTPHPVNIYLRDMIEI---------LKYIY-AMGGT 280 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + + IV +TDG T + + L K+ + I Sbjct: 281 ELASALELLQSHEISRDIVIITDGR---THDPDKVLNLAKRFKR-------LHIVATEKS 330 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFS 359 +FL++ A + + + ++ + S Sbjct: 331 QFLKSLAKTTGGKYRELTPTLNILEVLS 358 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 23/53 (43%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSM 53 M R + G + + A+ +P++ +G+ I+ ++ + L S+ Sbjct: 1 MHGATTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSV 53 >gi|323474469|gb|ADX85075.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477206|gb|ADX82444.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 380 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 66/211 (31%), Gaps = 34/211 (16%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + ++ LD S SM + KI++A + + + + + +TF Sbjct: 33 GSATGFHYIVALDTSGSMTGY------KIELAKQGAIELFKRIPKGN------KVSFITF 80 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDA 285 S+ V+ ++ + L G + A Y I + + Sbjct: 81 SS----------NVTVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT- 129 Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPN 343 ++ +TDG L K VY+ GI + + L++ + + Sbjct: 130 ----YLLLLTDGNPTDETNVGNYLKLPYFEK---MQVYSFGIGDDYNEQLLQSISDKTSG 182 Query: 344 SFYLVENPHSMYDAF-SHIGKDIVTKRIWYD 373 Y + + + + I K + D Sbjct: 183 VMYHISDANEIPQKLPQKAVTQIAAKNVTVD 213 >gi|326434662|gb|EGD80232.1| hypothetical protein PTSG_10910 [Salpingoeca sp. ATCC 50818] Length = 5100 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 38/186 (20%), Positives = 67/186 (36%), Gaps = 26/186 (13%) Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 +D++ ++D+S SM +F ++ M+E ++ + + ++ F Sbjct: 4902 PGVDLVCIVDLSGSMSGYF-------QQLCDFLHQMMETFEVGVGASRN-RMAIIGFGTH 4953 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L S L R I+ L G T P L+ A + + Q + ++ Sbjct: 4954 SRIDTWLTNDQSTLSRCIRSLDCRG-GTLFQPPLRQALDALRRDQSDYA------THNRR 5006 Query: 290 IIVFMTDGEN--LSTKEDQQSLYY---CNEAKKRGAIVYAIGIRVIRSHEFLRACASPNS 344 I++F TDG N Q S CN Y I + + + +R P Sbjct: 5007 ILMFQTDGMNGDAGPGVGQLSRQIVDGCNTTS------YGIVVGDRNAMDRVRDICGPAG 5060 Query: 345 FYLVEN 350 Y EN Sbjct: 5061 RYDSEN 5066 >gi|254882025|ref|ZP_05254735.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|294776172|ref|ZP_06741661.1| conserved hypothetical protein [Bacteroides vulgatus PC510] gi|254834818|gb|EET15127.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|294449995|gb|EFG18506.1| conserved hypothetical protein [Bacteroides vulgatus PC510] Length = 301 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + K + + + N + G++ F Sbjct: 84 EEERELTVMLLIDVSNSL------DFGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 137 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T +K A + ++ R Sbjct: 138 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTD-IKMAVEYLTNVIKKR 187 >gi|332978757|gb|EGK15449.1| hypothetical protein HMPREF9373_0075 [Psychrobacter sp. 1501(2011)] Length = 2843 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 68/210 (32%), Gaps = 29/210 (13%) Query: 106 DIDDIVRSTSLDIVVVPQNEGYSISAISRYK----IPLKFCTFIPWYTNSRHIVMPITSS 161 D+ I S V + ++F + T S +T Sbjct: 2505 DVGPIATDGSGGFVADYNTTLLHAIDHPSNDDEDQLDVQFEASVSDGTQSASTNFAVTVE 2564 Query: 162 VKVNSQTDARLDMM---------IVLDVSRSM--ESFFDSSITKIDMAIKSINAMLEEVK 210 +Q+++ +D+ I+LD S SM E T++D+A +I +L+ Sbjct: 2565 DDRPAQSESDVDIAAATSDSNITIILDTSGSMSVEVASGGGETRLDLAKAAIVKLLD--- 2621 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 DV V+ L+TF+ E +W I L N+ G Y+ Sbjct: 2622 -GYDVAGDVKVQLITFA--KEAQAQPQWMT------IDELKDLLPGINARGGTN--YDAA 2670 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 D+ F+TDGE Sbjct: 2671 LDVAMTSYDNAGGIFGANNYSYFLTDGEPT 2700 >gi|328907234|gb|EGG27000.1| aerotolerance protein BatA [Propionibacterium sp. P08] Length = 307 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 63/204 (30%), Gaps = 33/204 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++ LD S SM++ D+S ++ A + + +V +V+ S E Sbjct: 81 IVVALDSSLSMKAD-DASPNRLAAAKAKAKDFVNSLPTGFNVA------VVSISAHPEIR 133 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + R + G+ L A ++ M + + IV Sbjct: 134 MPPSTDRPTVLRALD-----GIELQDGTALGEAIDKSLQAVKMAPGGSKDRVPA--AIVM 186 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH--------------EFLRAC 339 ++DG N + + AK VY I + + L Sbjct: 187 LSDGGNTQGGSPLVAATHAAAAK---VPVYTIAFGTETGYVDLDGQRERVAPDTKLLSDV 243 Query: 340 A--SPNSFYLVENPHSMYDAFSHI 361 A + + ++ + + + + Sbjct: 244 ADRTDAKSWTADSADKLQEVYKQV 267 >gi|154507988|ref|ZP_02043630.1| hypothetical protein ACTODO_00474 [Actinomyces odontolyticus ATCC 17982] gi|153797622|gb|EDN80042.1| hypothetical protein ACTODO_00474 [Actinomyces odontolyticus ATCC 17982] Length = 344 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 52/151 (34%), Gaps = 11/151 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLI 212 I+ S + + L++ V+D + SM +D S +ID I+ M+ + Sbjct: 50 IMGAGPSIPGEAQEVTSTLEVYFVIDRTGSMAAEDWDGSKPRIDGVRNDISIMMNILTGS 109 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++++ ++ L S I Y+ F +S+ D Sbjct: 110 -------RFSIMSWDSRAHTDLPL---TSDASAVISYMDSFDRELSSSSQGSSVNRPAKD 159 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + + + + + +DGE + Sbjct: 160 LAKLLKKNKERHPQNVRALFVFSDGETSNQN 190 >gi|15921062|ref|NP_376731.1| hypothetical protein ST0828 [Sulfolobus tokodaii str. 7] gi|15621846|dbj|BAB65840.1| 360aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 360 Score = 39.0 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 39/188 (20%), Positives = 70/188 (37%), Gaps = 32/188 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +I++D S SM K+D+AI + N + E IP N + FSN +E+ + Sbjct: 42 VILIDNSPSMR-----KDNKLDVAIAAANKLSYE---IPPGN---YISIYLFSNDLEKIY 90 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 G + +K G +TN L Q+ +Q K ++ + Sbjct: 91 EGPSGNPIALQNVKK----GYTTNLHKALSKILQQLAYVQ----------LPVK--LIIL 134 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS--PNSFYLVENPH 352 +DG+ + Y + +IGI + L+ A +Y +E+P Sbjct: 135 SDGKPTDKR---NVKEYEGLQVPPNVQIISIGIGRDYNEVILKRMADKGSGVYYHIEDPT 191 Query: 353 SMYDAFSH 360 + F+ Sbjct: 192 QLPSVFAQ 199 >gi|312882153|ref|ZP_07741902.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309370199|gb|EFP97702.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 498 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 40/274 (14%), Positives = 88/274 (32%), Gaps = 46/274 (16%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 +G + + L I V +E S + L + + + T +G Sbjct: 11 LKKQQGVAAVWMGLLLVPIMGVTFWAVEGSRYVQESSRLRDSAEAAAIA-VTIEDRKGRA 69 Query: 72 NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA 131 ++ + + I++T ++ + R I + + Y++ A Sbjct: 70 DSMAEEYVRSYVRDIEST-ALTTSRDYRARDDAKGIREYIE--------------YTVDA 114 Query: 132 ISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS 191 + +K FIP + + + + + + +D++ V D S SM Sbjct: 115 RTTHKSWF-VSNFIPSFDRKQELAGRSVAGKEPIQPGEDTIDIVFVSDFSTSMYG----- 168 Query: 192 ITKIDMAIK-----SINAMLEEVKLIPDVNNVV----------QSGLVTFSNKIEEFFLL 236 +K+ + SI + E + +++ + + G V F+ + E Sbjct: 169 -SKLRVLKSSIDQISIKILCESPRYDQQISSYICDDKRATGVNRIGFVPFNIRTREMIWS 227 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 G R + L + + Y YN I Sbjct: 228 RDG-----RAVSQLVYLNGDASVS---TYTYNDI 253 >gi|301614604|ref|XP_002936771.1| PREDICTED: integrator complex subunit 6-A-like [Xenopus (Silurana) tropicalis] Length = 877 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 51/133 (38%), Gaps = 12/133 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LD S SM T +D+A ++ ++ ++ + + LVT Sbjct: 4 LLFLLDTSASMNQRTHLGTTYLDIAKGAVETFMK-LRSRDPASRGDRYMLVTLEEPPYG- 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQI------FDMQGMRQHCNTEDA 285 W +H ++K L G++T L+ A++ + + Q N Sbjct: 62 IKAGWKENHATFMNELKNLQAVGLTTL-GQSLRTAFDLLNLNRLVTGIDNYGQGRNPFFL 120 Query: 286 NYKKIIVFMTDGE 298 II+ +TDG Sbjct: 121 EPS-IIIAITDGS 132 >gi|269959182|ref|NP_001161755.1| integrin beta subunit 2 [Bombyx mori] gi|264981086|gb|ACY74432.1| integrin beta subunit 2 [Bombyx mori] Length = 747 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 49/155 (31%), Gaps = 27/155 (17%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +P+ + LD+ ++D S +M F + +L+E+ Sbjct: 110 VPVDFKMLYKPIEHFPLDVYFLMDNSYTMRQFQN-------ELKSQAINILKELSAF--- 159 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 V+ G TF +E+ + + Q+ I + + ++ + + F Sbjct: 160 TKNVRLGFGTF---VEKPVYPYYDKNRYQKSIPFENVLSLTADISKLNNTVRQIDFGSNF 216 Query: 276 MRQHCNTE--------------DANYKKIIVFMTD 296 Q + ++IIV TD Sbjct: 217 DDQEAGLDALMQVMTCTKEIGWRTEARRIIVLFTD 251 >gi|291514856|emb|CBK64066.1| Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [Alistipes shahii WAL 8301] Length = 294 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S D K+I + V N + G + F Sbjct: 74 EEERELTMMLLVDVSAS------RMFGSTDRLKKNIITEIAAVLAFSAAQNNDKVGCIFF 127 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G SH+ I+ L F ++ L + ++ R Sbjct: 128 SDRIEKFIPPKKGRSHILMIIRELIGF-RPESAGTKLSEPVRFLTNVNKKR 177 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 30/155 (19%), Positives = 56/155 (36%), Gaps = 23/155 (14%) Query: 174 MMIVLDVSRSME-------SFFDSSITKIDMAIKSINAMLEEVK---------LIPDVNN 217 +M +D S SM F +++D+ ++N +L++ + L+ NN Sbjct: 45 IMFFIDNSGSMSRNAITGGYNFPGQESRMDVTKTALNKVLDKYQDKFNWGLQALVNTENN 104 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 + K + S ++K L ++ TP + Y + Sbjct: 105 FSNPDPTKHTKKNYTNSEETFNTSW--EQMKTLVDNMGASGYTPATRRYYEVVASA---- 158 Query: 278 QHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYY 311 N + K +V M+DG+ N S D S +Y Sbjct: 159 VMPNIKYRCQKSYVVMMSDGDANFSCNHDSTSFHY 193 >gi|154432890|gb|ABS82078.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432914|gb|ABS82099.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432922|gb|ABS82106.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|332077748|gb|EGI88209.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA41301] Length = 886 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 59/134 (44%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 194 DNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 253 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ P+ + LVT+++ I + + K L+ + ++ +T Sbjct: 254 KITSDPEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 309 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 310 KDYSYLKLTNDKND 323 >gi|145509270|ref|XP_001440579.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124407796|emb|CAK73182.1| unnamed protein product [Paramecium tetraurelia] Length = 855 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 46/116 (39%), Gaps = 15/116 (12%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ + ++D++ ++D + SME + + A +I +++E V++ V+ Sbjct: 216 TMVNQIELRPQVDIIFIIDCTISMEDW-------LIAAKHNIKFIIKEFTKKISVSSCVR 268 Query: 221 SGLVTFSNKIEEFFLLEW-----GVSHLQRKIKYLSKFGVS---TNSTPGLKYAYN 268 V + + + +++ +++ I G + L AYN Sbjct: 269 IAAVCYRDFTDGPNHIQYHDFTVQPEEIEKFIDKFQPKGGEDIPEDLIGALDVAYN 324 >gi|29346319|ref|NP_809822.1| hypothetical protein BT_0909 [Bacteroides thetaiotaomicron VPI-5482] gi|253568260|ref|ZP_04845671.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385673|ref|ZP_06995231.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|29338214|gb|AAO76016.1| von Willebrand factor-like, type A [Bacteroides thetaiotaomicron VPI-5482] gi|251842333|gb|EES70413.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261814|gb|EFI04680.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 289 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T ++ A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESRRTN-IRLALEYLTNVMKRR 175 >gi|332519336|ref|ZP_08395803.1| protein of unknown function DUF58 [Lacinutrix algicola 5H-3-7-4] gi|332045184|gb|EGI81377.1| protein of unknown function DUF58 [Lacinutrix algicola 5H-3-7-4] Length = 287 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 63/162 (38%), Gaps = 30/162 (18%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L M++++D+S S F + + I I+A L + GL+ F Sbjct: 72 EEERELTMLLMVDISGS--ELFGTDQQFKNEIITEISATLAFSATQNNDK----IGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE + + G SH+ R I+ L +F TN LK+ N + Sbjct: 126 SDEIELYIPPKKGRSHVLRIIRELIEFEPKSKGTNVAEALKFMRNVMK------------ 173 Query: 284 DANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 KK IVF ++D + +L G VY Sbjct: 174 ----KKAIVFVLSD---FIADDYSHTLKIAAGKHDVTGIRVY 208 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 23/53 (43%) Query: 1 MFSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSM 53 M R + G + + A+ +P++ +G+ I+ ++ + L S+ Sbjct: 1 MHGATTAFRRCLKSRTGNIGVSAALVMPLVVASMGLGIDYGYLTLQRRELQSV 53 >gi|168700937|ref|ZP_02733214.1| hypothetical protein GobsU_15533 [Gemmata obscuriglobus UQM 2246] Length = 350 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 48/129 (37%), Gaps = 17/129 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLDVSRSME K + ++ + + + + ++ ++ +++ + Sbjct: 105 LVLVLDVSRSMEVEDAGGGGKQKRSERAADLIQSFFERVQ--AERYKTTVIAVASEAKPV 162 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN----QIFDMQGMRQHCNTEDANYKK 289 L ++ + L +++A+ +F Sbjct: 163 VLDTTDREVIRNILTELP-----------MRHAFKPGETNMFAGLEEAARVAKPWPPGSA 211 Query: 290 IIVFMTDGE 298 +++ +TDG+ Sbjct: 212 VLMVVTDGD 220 >gi|145296722|ref|YP_001139543.1| hypothetical protein cgR_2627 [Corynebacterium glutamicum R] gi|140846642|dbj|BAF55641.1| hypothetical protein [Corynebacterium glutamicum R] Length = 893 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 76/249 (30%), Gaps = 53/249 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSM------------ESFFDSSITKIDMAIKSI 202 S++ LD++I++D + S+ E D+ ++ A + Sbjct: 50 ASGSLSNLGACIADKGALDVIIMIDETESLIHEARDGVVNANEPGADAQHHRVPAAQSFV 109 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VSHLQRKIKYL--S 251 + +L + + D+N ++ + K +G V +Q +I Sbjct: 110 DELLAK-QSDGDLNTRIRVAGFGQTYKSGATDPDNYGAWTQLDASTVGGVQDEISRFADR 168 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS-TKEDQQSLY 310 TN ++ AY + +++V TDG + D Sbjct: 169 TQEQYTNYASAIEGAYQDFTRS---------GSEDACRMLVTFTDGALTAQEGADVAEAA 219 Query: 311 YC------NEAKKRGAIVYAIGIRVIRS---HEFLRACA----------SPNSFYLVENP 351 C + + G IG+ + LR + +F+ +N Sbjct: 220 LCAPGGVTDRLRSAGITHIGIGLSAPTNPSDFSLLRGTTAGGGTCGVEPANGAFFPADNV 279 Query: 352 HSMYDAFSH 360 ++ AF Sbjct: 280 GGLFAAFRE 288 >gi|307151097|ref|YP_003886481.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 7822] gi|306981325|gb|ADN13206.1| magnesium chelatase ATPase subunit D [Cyanothece sp. PCC 7822] Length = 672 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 62/182 (34%), Gaps = 22/182 (12%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + ++ A R P + + + + S ++ + A + + V+D S Sbjct: 428 GKVRRIAVDATLRAAAPYQKARRLRHPERNVIVEQGDIRSKRLARKAGALI--VFVVDAS 485 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGV 240 SM ++ ++ A ++ +L E N Q L+ F K + + Sbjct: 486 GSM------ALNRMQSAKGAVMRLLTEA-----YENRDQVALIPFRGEKADVLLPPTRSI 534 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + +R+++ L G S S + + + + +IV +TDG Sbjct: 535 ALAKRRLESLPCGGGSPLSHGLTQA----VHVGMNAKMSGDIGQV----VIVAITDGRGN 586 Query: 301 ST 302 Sbjct: 587 IP 588 >gi|182435392|ref|YP_001823111.1| hypothetical protein SGR_1599 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463908|dbj|BAG18428.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 278 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 61/209 (29%), Gaps = 45/209 (21%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSI--TKIDMAIKSINAMLEEVK----LIPDVN 216 ++ ++D S SM + K D+ +IN +L E+ V Sbjct: 4 TAEISRTNPTSIIFLVDQSTSMSDPIGGEVPQQKADVVADAINRLLTELSVKCAKEEGVR 63 Query: 217 NVVQSGLVTFSNKIEEFF-------------LLEWGVSHLQRKIKYLSKFGVS------- 256 + +V + + F + + + + K + Sbjct: 64 DYFHVAVVGYGATVGSAFAGSLGGRDIVPLSEVADNPARVDERSKKVPDGAGGLVETSVS 123 Query: 257 ------------TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 T L+YA + R A++ I++ +TDGE+ Sbjct: 124 FPLWMDPVAHGGTPMNRALQYA-ESLVSGWTERYP-----ASFPPIVINLTDGESTDGDP 177 Query: 305 DQQSLYYCNEAKKRG-AIVYAIGIRVIRS 332 + + A G A+++ + + Sbjct: 178 TAAAEAITSHATADGAALLFNLHVSAAGG 206 >gi|239908147|ref|YP_002954888.1| hypothetical protein DMR_35110 [Desulfovibrio magneticus RS-1] gi|239798013|dbj|BAH77002.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 301 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 9/106 (8%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L +M+ +DVS SM + A+ ++ + + GLV F+ Sbjct: 76 EERSLTLMLAVDVSPSMTCLCPREPLYLTAAVAAVTLAVSAAVSRD------RVGLVLFT 129 Query: 228 NKIEEFFLLEWGVS---HLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++IE F G + + R + G T P L A + Sbjct: 130 DRIEAFLPPRAGPAVPVAVARLLAETRPQGRGTAPGPALTLAETAL 175 >gi|53803884|ref|YP_114279.1| hypothetical protein MCA1842 [Methylococcus capsulatus str. Bath] gi|53757645|gb|AAU91936.1| conserved hypothetical protein [Methylococcus capsulatus str. Bath] Length = 653 Score = 39.0 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 21/173 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D + SM + D + A++ + +E+ L V+ GLV F + + Sbjct: 224 VVFVIDSTISMGPYID----RTREAVRRVYTRIEKAGLADQ----VRFGLVAFRSSTQAV 275 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYK---- 288 LE+ VS + + L A + + T K Sbjct: 276 PGLEY-VSKVYADPSEVKTGKDFLAKVASLSPAKVSSSTFDEDSYAGIMTALQKIKWSGF 334 Query: 289 --KIIVFMTD-GENLSTKEDQQSLYYCNE----AKKRGAIVYAIGIRVIRSHE 334 + +V +TD G Q+ ++ A++ G ++ + ++ Sbjct: 335 GGRYVVLITDAGAIDGNDPLSQTKLGADQVRIEAEQLGVALFGLHLKTPAGKA 387 >gi|119504634|ref|ZP_01626713.1| TPR domain protein [marine gamma proteobacterium HTCC2080] gi|119459656|gb|EAW40752.1| TPR domain protein [marine gamma proteobacterium HTCC2080] Length = 611 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 37/193 (19%), Positives = 65/193 (33%), Gaps = 24/193 (12%) Query: 109 DIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQ 167 + S S V+ P Y +S+ ++ L + W I P + V Sbjct: 30 SVKTSASWAKVIDPHLLPYLMSSNTKRGKKLGHWWTLLWLVMAVITIAGPSLKKIDVPVF 89 Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A ++IVLD+S SM S D ++I A + I +L +GLV ++ Sbjct: 90 QRAD-ALVIVLDLSASM-SAADIQPSRIQRAKQKILDLLARRDEG-------VTGLVVYA 140 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 L ++ + L+ +N T L+ A + Sbjct: 141 GDAHVVAPLTDDHRTIENLLSALTPDIMPLPGSNVTAALEVATGLLQTAGVANGQ----- 195 Query: 285 ANYKKIIVFMTDG 297 ++ +TDG Sbjct: 196 ------VLLITDG 202 >gi|115666324|ref|XP_798926.2| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] gi|115975269|ref|XP_001180487.1| PREDICTED: similar to LOC594926 protein [Strongylocentrotus purpuratus] Length = 870 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 73/225 (32%), Gaps = 27/225 (12%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 G + I RY + + + I ++ + ++ VLD S Sbjct: 283 NTPGINGDFIIRYDVAHPPGVGEVQTSGTNFIHRFAPANFGAVKKR-----VVFVLDFSA 337 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIP-----DVNNVVQSGLVTFSNKIEEFFLLE 237 SM KI +++ +L+E+ ++ V SG +++ Sbjct: 338 SMYGN------KIKQTKEAMYTILDEMNDSDRFNVLPFSDYVYSG---WNSGQMVDVNP- 387 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF-MTD 296 + + + I+ L TN L A + N+ +N I+F +TD Sbjct: 388 YNIRDAKDFIRQLD-IQRGTN----LNDALLGGLSLLESTGSMNSTSSNPMVCILFVLTD 442 Query: 297 GENL-STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 G+ + A + + +G + ++ FL A Sbjct: 443 GKPSEGVTSLSEIERNVRNANNQRCSIVTLGFGRLVNYNFLVRLA 487 >gi|171913222|ref|ZP_02928692.1| hypothetical protein VspiD_18620 [Verrucomicrobium spinosum DSM 4136] Length = 624 Score = 38.6 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 18/102 (17%), Positives = 34/102 (33%), Gaps = 6/102 (5%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++I+LD S SM + T+++ A + L + V + L+ Sbjct: 81 EGGGETPKSLVILLDRSASMAATDGKGRTRMEEAKTQLKDRLSALP------ENVVTSLI 134 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 + K E L R I+ + + L A Sbjct: 135 VYDAKAEVTQSRSTNRRELLRMIQQTQPVPMEDRADEALLVA 176 >gi|282898937|ref|ZP_06306919.1| hypothetical protein CRC_00215 [Cylindrospermopsis raciborskii CS-505] gi|281196077|gb|EFA70992.1| hypothetical protein CRC_00215 [Cylindrospermopsis raciborskii CS-505] Length = 468 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 33/179 (18%), Positives = 56/179 (31%), Gaps = 26/179 (14%) Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 P+ ++ S L + I LD S SM + S T ++SI + E +P Sbjct: 36 TPVENTNTSKSNN---LSVRIYLDGSGSMLGYVKSGETNYGKTLRSIRNVFELSDKLPV- 91 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + G + I + G S NQ ++ Sbjct: 92 -EYYRIG---------SPMQKITSSEYYNSGISSVFYDGSS-----------NQFPEVSS 130 Query: 276 M-RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + KK+ V +TD + S + + + YA+GI I+S Sbjct: 131 PIDAAIVPPEKEQKKMTVIITDLQQNSGDVTKLNKVINDTYYNIDNRDYAVGIWAIKSE 189 >gi|297734963|emb|CBI17197.3| unnamed protein product [Vitis vinifera] Length = 605 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 57/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + R + + T A ++ V+D S SM Sbjct: 354 KRLAVDATLRAAAPYQKLRREKNAQKGRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 413 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 414 ------ALNRMQNAKGAALKLL-----AESYTSRDQVAIIPFRGDSAEVLLPPSRSIAMA 462 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +++++ L G S + + + + + +IV +TDG Sbjct: 463 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRVMIVAITDGRAN 511 >gi|150005793|ref|YP_001300537.1| hypothetical protein BVU_3286 [Bacteroides vulgatus ATCC 8482] gi|149934217|gb|ABR40915.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482] Length = 289 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 48/111 (43%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+ + K + + + N + G++ F Sbjct: 72 EEERELTVMLLIDVSNSL------DFGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE+F + G H+ I+ L F + T +K A + ++ R Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESKRTD-IKIAVEYLTNVIKKR 175 >gi|76253810|ref|NP_001029011.1| complement factor B-2 [Ciona intestinalis] gi|58531106|dbj|BAD89300.1| complement factor B-2 [Ciona intestinalis] Length = 998 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 20/159 (12%), Positives = 53/159 (33%), Gaps = 21/159 (13%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPD 214 + + + ++ ++D SRS+ +D ++K ++ Sbjct: 412 PSSGVGGRSITADHEGGNEIYFLIDFSRSVTD------EALDHSLKFAQKLVTRF--AGG 463 Query: 215 VNNVVQSGLVTF------------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 N G++ F + ++ ++++ + Q+K + G TN+ Sbjct: 464 TNKTAHYGVIIFASEPKVELNSRINRRLNSTQIIQYFIRISQKKQEIRQTVGGGTNTGAA 523 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS 301 L + + N N ++ +TDG++ Sbjct: 524 LN-VLRGMLGISYQTDRINNGGNNRQRHCFILTDGKSNH 561 >gi|293190490|ref|ZP_06608877.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] gi|292820901|gb|EFF79857.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] Length = 344 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 54/151 (35%), Gaps = 11/151 (7%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLI 212 I+ S + + L++ V+D + SM +D S +ID I+ M+ + Sbjct: 50 IMGAGPSIPGEAQEVTSTLEVYFVIDRTGSMAAEDWDGSKPRIDGVRNDISIMMNILTGS 109 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + ++++ ++ L S + + + S++ + + Sbjct: 110 -------RFSIMSWDSRAHTDLPLTSDASAVISYMDSFDRELSSSSQGSSVNRPAQDLAK 162 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 + + + ++ + + +DGE + Sbjct: 163 LLKKNKERHPQNV---RALFVFSDGETSNQN 190 >gi|253996911|ref|YP_003048975.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253983590|gb|ACT48448.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 329 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 41/256 (16%), Positives = 80/256 (31%), Gaps = 53/256 (20%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF------DSSI 192 LK + + P T+ KV + +VLD S SM+ F + + Sbjct: 48 LKILAVLTLIFIILGLAAPHTTEQKVERIGVGA-QIALVLDRSASMDDPFSGTADHNGNT 106 Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 T + + ++ E + G++TFSN L + ++ + Sbjct: 107 TVGETKSVAAARLITEFVKSRQQDMF---GMITFSNSAMYVLPLSENKKAVIAAVQATAG 163 Query: 253 FGV-STNSTPGL---KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-STKEDQQ 307 + TN GL ++++ D + ++ ++DG Q+ Sbjct: 164 NALFQTNIGSGLTSSAALFDKVTDSG-------------SRAVILLSDGAGRVDANTQQK 210 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIRS----------------------HEFLRACASPNSF 345 + +Y I +R HE+ + +P Sbjct: 211 IRDWFQRL---NISLYWIVLRQPGGLSIFDPNFVPVEDQPLPAQIELHEYFQTFKTPFKA 267 Query: 346 YLVENPHSMYDAFSHI 361 Y E+P S+ A + I Sbjct: 268 YEAEDPRSLQLAMNDI 283 >gi|320355194|ref|YP_004196533.1| protoporphyrin IX magnesium-chelatase [Desulfobulbus propionicus DSM 2032] gi|320123696|gb|ADW19242.1| protoporphyrin IX magnesium-chelatase [Desulfobulbus propionicus DSM 2032] Length = 681 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 60/179 (33%), Gaps = 20/179 (11%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVN-SQTDARLDMMIVLDVS 181 N +++ A R P + S V ++ + ++ V+D S Sbjct: 448 NNGDFALDATLRAAAPYQQQRAGKN--GSDLAVQLRPQDIRSKVREKRIGNFLLFVVDAS 505 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 SM + ++ + ++ ++L + + ++TF L Sbjct: 506 GSM-----GARGRMAASKGAVMSLL-----LDAYQKRDKVSMITFRRDAAFINLPPTTSV 555 Query: 242 HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + ++ G T + GL ++ Q+ R + I++F+TDG+ Sbjct: 556 DMAGRLLAEMPVGGRTPLSAGLAKSFEQV------RNYLIKNPTAQP-IVLFITDGKCN 607 >gi|288925754|ref|ZP_06419685.1| von Willebrand factor, type A [Prevotella buccae D17] gi|288337409|gb|EFC75764.1| von Willebrand factor, type A [Prevotella buccae D17] Length = 281 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 51/111 (45%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK DM + + I + + + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLD-FGTVRQTKRDMVTEIAATL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE++ + G H+ I+ + F + T + A + + R Sbjct: 126 SDRIEKYIPPKKGRRHILYIIREMLDFEAQSKKTD-IGAAVEYLTRVMKRR 175 >gi|301768028|ref|XP_002919433.1| PREDICTED: calcium-activated chloride channel regulator 2-like [Ailuropoda melanoleuca] gi|281352225|gb|EFB27809.1| hypothetical protein PANDA_008061 [Ailuropoda melanoleuca] Length = 943 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 76/212 (35%), Gaps = 37/212 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS M + ++ ++ L ++ I G+V+F++K E Sbjct: 312 VCLVLDVSSKM-----AEADRLLQLQQAAEFYLMQIVEIHTF-----MGIVSFNSKGEIR 361 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ GLK + + + G Sbjct: 362 AQLHQINSDDDRKLLVSHLPMTVSAEAETSICSGLKKGFEVVEKLNGKAYGS-------- 413 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 +++ +T G++ ++ G+ +++I + L F+ Sbjct: 414 -VMILVTSGDDEHMGNCLLTVL------SSGSTIHSIALGSSVVENLEELSRRTGGLKFF 466 Query: 347 LVE--NPHSMYDAFSHIGK---DIVTKRIWYD 373 + + N +SM +AFS I DI +RI + Sbjct: 467 VPDKSNSNSMTEAFSRISSGTGDIFQQRIQLE 498 >gi|320158848|ref|YP_004191226.1| cell surface protein [Vibrio vulnificus MO6-24/O] gi|319934160|gb|ADV89023.1| cell surface protein precursor [Vibrio vulnificus MO6-24/O] Length = 723 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 48/275 (17%), Positives = 93/275 (33%), Gaps = 34/275 (12%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV--VP 122 I N + + + +L I T + + D ++ + + DIV+ Sbjct: 356 ISARYNISLSETRQTTALLQLILQTIGTNKVGGVVDQLVADNNEKNTNAFQFDIVLSFDE 415 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-------DMM 175 Q + +++ K + T S + P+ +++KV + T + D + Sbjct: 416 QKDNVVMTSTLIAKELFSKYETLIDTTTSESVRAPVNATIKVQNDTITAIEQTVSKADFL 475 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSI--NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 V+D S SM D T + I + + +V I N+ ++ F+++ E+ Sbjct: 476 FVIDNSGSMGDEQDEISTLTQTFLDEISASGLDYKVGTITTDNDSLRGN--GFTHEPEQI 533 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L F TN + G+ +A + G A Sbjct: 534 ALD----------------FKPGTNGSGYERGIYFAEMALAPTTGTVAVAGYPRAGASLS 577 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 ++ M+D E+ D N G VY+I Sbjct: 578 VIIMSDEESQYPTRDFDVNN--NLFVDNGYRVYSI 610 >gi|294828490|ref|NP_714387.2| hypothetical protein LA_4207 [Leptospira interrogans serovar Lai str. 56601] gi|293386343|gb|AAN51405.2| hypothetical protein LA_4207 [Leptospira interrogans serovar Lai str. 56601] Length = 344 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 72/232 (31%), Gaps = 36/232 (15%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 F + + ++ ++V +QT ++L ++D S SM + KI +A Sbjct: 1 MFLQRLLNFIGFTFFILVTGTNVSSENQTPSQL---FIIDASGSMNEYL-GIYQKIHLAK 56 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF------LLEWGVSHLQRKIKYLSKF 253 K ++ + + + G + + N++ LE G + L Sbjct: 57 KHVSHYISTLP------QETEIGFLAYGNRLPGCSSSRLYQPLEMGNRDTFK--NRLFSL 108 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 S + I + E I+ +TDG Sbjct: 109 TPSGATPLA-----ESIRIAGTLISQRKKETE-----IILVTDG---IESCYGDPKKELQ 155 Query: 314 EAKKRGA--IVYAIGIRVIRSHEFLRACASP---NSFYLVENPHSMYDAFSH 360 K++G + +G+ + E + ++ +E+ S Y A Sbjct: 156 ALKQKGIPFQFHVLGLGLKSHEELQMKILTEEGNGKYFSIEDDSSFYTALDS 207 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 28/208 (13%), Positives = 61/208 (29%), Gaps = 13/208 (6%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + L + I+ D S SM + + A+ V + + + +V Sbjct: 255 TTNAPALAVEIIFDASGSMAARLQGQTK----LSLARRALAAAVPGLENPSILVGMRAYG 310 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHCNTED 284 F + + + L + + L Y Y I D + H Sbjct: 311 FDQSLNKTPDASCPNTELVLPFTANRQATAINRTADALSAYGYTPIADSLTLAGHDLLAI 370 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA--IGIRV-IRSHEFLRACAS 341 K +I+ ++DGE + G + IG + + + ++A AS Sbjct: 371 DAQKHMIILISDGE---ETCGGFPAAVAANLRSLGIDLQTHVIGFDLDATAQQQMQAIAS 427 Query: 342 --PNSFYLVENPHSMYDAFSHIGKDIVT 367 ++ + + + + Sbjct: 428 AGGGQYFDAADGDELGASLMRVIDLAQE 455 >gi|183231160|ref|XP_001913530.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|169802601|gb|EDS89694.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 565 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 70/219 (31%), Gaps = 40/219 (18%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 SS ++ + + ++++ + D S SM I + L ++ +N+ Sbjct: 71 SSTFISKEKEGNINIVFICDRSGSMYGEG------ITALRNMLQLFLRQL----PLNSKF 120 Query: 220 QSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 Q ++F + + F E + + + TN + + D Sbjct: 121 QI--ISFGSTYDFIFKEMTEYNEDTLKFASETVSHFEANYGGTN----MHAPLKALIDNN 174 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSH 333 + H I+ +TDG ++ Y KK +G+ Sbjct: 175 TEKCH-----------IILLTDG---YVDNKTSTIEYIRTLSKKNSLH--GVGLGRSCDV 218 Query: 334 EFLRACASPNSFYLV--ENPHSMYDAFSHIGKDIVTKRI 370 E +R + +NP+ + S I + I+ I Sbjct: 219 ELIRNIGRIGNGISAISKNPNVLKKEVSKITERILIPSI 257 >gi|328947244|ref|YP_004364581.1| von Willebrand factor A [Treponema succinifaciens DSM 2489] gi|328447568|gb|AEB13284.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 241 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 34/174 (19%), Positives = 57/174 (32%), Gaps = 14/174 (8%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 V ++ + V+D S SM TKI +I +L E+K + ++ Sbjct: 6 DVEGIVKRQMVLFFVVDTSGSM------QGTKIGAVNTAIREVLPELKDAGGSDVDLKVA 59 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +TFS+ +W S V + L A ++ + Sbjct: 60 CLTFSS------GCKWMYSSPIAS-DSFQWNNVDADGVTDLGSACRELSEKLSKNGFLKA 112 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + I M+DGE E +L N K V A+ I + + L Sbjct: 113 PSGSVAPAIFLMSDGEPTDDFESGLNLLQQNNWFKHAIKV-AVAIGDDANKDVL 165 >gi|330805542|ref|XP_003290740.1| hypothetical protein DICPUDRAFT_155275 [Dictyostelium purpureum] gi|325079126|gb|EGC32742.1| hypothetical protein DICPUDRAFT_155275 [Dictyostelium purpureum] Length = 2350 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 82/252 (32%), Gaps = 35/252 (13%) Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY---SISAISRYKIPLKFCTFIPWYTNSR 152 N ++ ++I T I + P ++ Y + + + P ++ Sbjct: 877 NIVKLTSPTHNIKIKRTHTKATIELQPGDKSYLDKNFQLLIGLEEPYSPRMWVEVDDKGH 936 Query: 153 HIVM-PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 H M + ++S ++ + +++D+S SM D+ + ++ + KL Sbjct: 937 HASMLAFYPKLDIDSTQESFTHLTLLIDLSASM--AGDTFEDLLRAVRLTVRNLRGVPKL 994 Query: 212 IPDVNNVVQSG------LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + ++ + G V EE + W I L T L+ Sbjct: 995 LFNICDF---GDNYDWLFVEDVLPTEENLQIAWN------HINKLKPISGGTKLHSPLES 1045 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 Y + H I+ TDG + ++ SL +A ++ Sbjct: 1046 IYLLSEKAKPTNPHN----------ILLFTDG---NVSNEELSLMLAKKA-SNFCRLFTF 1091 Query: 326 GIRVIRSHEFLR 337 GI S F++ Sbjct: 1092 GIGEHCSRHFIK 1103 >gi|323341955|ref|ZP_08082188.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322464380|gb|EFY09573.1| hypothetical protein HMPREF0357_10368 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 162 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 16/45 (35%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI 54 NF G + I+ A L + + + +V K L + Sbjct: 3 NFKNEEDGNILIMFAFILMFLLGFVALSTDVVLAMNKKDKLTEVA 47 >gi|260806121|ref|XP_002597933.1| hypothetical protein BRAFLDRAFT_79825 [Branchiostoma floridae] gi|229283203|gb|EEN53945.1| hypothetical protein BRAFLDRAFT_79825 [Branchiostoma floridae] Length = 176 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 43/119 (36%), Gaps = 17/119 (14%) Query: 216 NNVVQSGLVTFSNKIEEFFLL---EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + V+ G++ FS K L G L I+ L + G T++ ++Y Sbjct: 34 EDDVKVGVIQFSEKANATIPLGTYSAGDPGLTDDIESLKRNGGLTHTGHAIRY------- 86 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + + + + + +TDG + + A+ G +Y++G + Sbjct: 87 ----MKATSKFRDDVPRAAIVITDGR---AQTMDDYAAQADAARDAGITLYSVGNGIPS 138 >gi|255009409|ref|ZP_05281535.1| hypothetical protein Bfra3_09732 [Bacteroides fragilis 3_1_12] gi|313147168|ref|ZP_07809361.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135935|gb|EFR53295.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 289 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 46/107 (42%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMVTEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S++IE+F + G H+ I+ L TN L+Y N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFKPESRRTNVRLALEYLTNVM 172 >gi|307155035|ref|YP_003890419.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306985263|gb|ADN17144.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 206 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 49/133 (36%), Gaps = 13/133 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM S ++ ++A +S A+ + + I V FS + + Sbjct: 11 DYTLIIDKSGSMSISDQPGGKSRWEIAQESTLALARKCEQIDPDGITVYL----FSGRFK 66 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S +I + T+ L+ A F + Q + + Sbjct: 67 RY---DNVTSDKVSQIYAENDPMGHTDLASVLQDALENYFQRKAAGQTKANGET-----V 118 Query: 292 VFMTDGENLSTKE 304 + +TDGE K Sbjct: 119 LVVTDGEPDDRKA 131 >gi|316972929|gb|EFV56575.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 562 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 64/181 (35%), Gaps = 17/181 (9%) Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW-------- 238 F S + M K NA++ K+ + V+ ++ +S K ++ +L Sbjct: 42 FDSSKVISEKMFEKEKNAIISLAKVFSEDKYNVRLAVIQYSMKSDQQYLPANVVYNLDYI 101 Query: 239 -GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + + +I + + N + L+ +F+ + N + ++ T G Sbjct: 102 KDIKNFSNQIIKIKQSHWLVNISDALRVCKTNVFESED----INGGRPTRRHFVILFTYG 157 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDA 357 + ED Y+ K G ++ + R + + ++ +L +N S Sbjct: 158 --TTFVEDDLPHYWAQIMKNAGHHIFVVSTRTYNRATYKK--STQQHHFLQKNVASDPTL 213 Query: 358 F 358 F Sbjct: 214 F 214 >gi|116748933|ref|YP_845620.1| hypothetical protein Sfum_1496 [Syntrophobacter fumaroxidans MPOB] gi|116697997|gb|ABK17185.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 427 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 12/130 (9%), Positives = 38/130 (29%), Gaps = 8/130 (6%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA--------AT 63 G I+ A+ L ++ +++ +++ ++ L + D + T Sbjct: 12 LSERDGATVIIVALALVMLLGFGAFAVDIGYLYVVRNELQNAADAGALAGAAALYNNDGT 71 Query: 64 QIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + N ++ + +++ + + SL V + Sbjct: 72 AVQPTANVIGQEAAMRNTAVRTAVEVTLNGNSGDVQRGHYSFATGTFTPNASLLPVSLWN 131 Query: 124 NEGYSISAIS 133 + A + Sbjct: 132 VSTEELDANT 141 >gi|291220858|ref|XP_002730442.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 937 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 67/201 (33%), Gaps = 35/201 (17%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L +++VLD+S SM+ + ++ I++ + V G+V F Sbjct: 307 QETELRIVLVLDISGSMDRN-----DRFELMIQASTKYIG-----YTVPRGTWIGIVEFD 356 Query: 228 NKIEEFFLLEW-----GVSHLQRKIKYLSK-FGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 +L + + Q I L G T+ GL+ + + Sbjct: 357 ---NTASILSYLIQIDDMETRQELIDLLPDGTGSGTSIGSGLEAGIEVLEEGWETPAGG- 412 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 I+ +TDGE + + +E ++ +V + + A Sbjct: 413 --------ILFLITDGEENTPPFIDDVI---DELVEKEIVVDTLALSDEADPGLAELSAE 461 Query: 342 PNS----FYLVENPHSMYDAF 358 + ++ +++DAF Sbjct: 462 TGGTAYWYSESDDSTALHDAF 482 >gi|229825155|ref|ZP_04451224.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] gi|229790527|gb|EEP26641.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] Length = 1109 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 58/146 (39%), Gaps = 19/146 (13%) Query: 163 KVNSQTDARLDMMIVLDVS-RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + LD+++V+D S +M ++ A +++ + NN ++ Sbjct: 78 GSTTIEQKNLDVVLVVDRSYDNMRKNG-----RMAAAKAEAAKLVDYF--LQSGNNKIRV 130 Query: 222 GLVTFS-------NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 GLV+F+ + + L L+ I+ + G N++P L+ A+ Q ++ Sbjct: 131 GLVSFAGNNGGSPSPVLGVTQLTQDADELKNAIRGYNTAGW--NNSPVLREAFTQAGLIK 188 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENL 300 + KKIIV ++ G Sbjct: 189 ANEMFGASN--TNKKIIVLISGGAPT 212 >gi|319782123|ref|YP_004141599.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168011|gb|ADV11549.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 554 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 49/131 (37%), Gaps = 26/131 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++I+LD S SM + D K+++A +S+ +L+ V + G + + ++ Sbjct: 27 VIIILDASGSMWAQIDGKP-KLEIARESLRTVLQSVPTDDE------IGFMAYGHREKGS 79 Query: 230 ---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 I+ + G KF T T +K A + + Sbjct: 80 CDDIQLIVPPQAGSGSAISAAADSLKFLGKTPLTAAVKQAAEALRYTED----------- 128 Query: 287 YKKIIVFMTDG 297 K +V +TDG Sbjct: 129 -KATVVLITDG 138 >gi|160889561|ref|ZP_02070564.1| hypothetical protein BACUNI_01985 [Bacteroides uniformis ATCC 8492] gi|270296685|ref|ZP_06202884.1| conserved hypothetical protein [Bacteroides sp. D20] gi|317480053|ref|ZP_07939165.1| hypothetical protein HMPREF1007_02282 [Bacteroides sp. 4_1_36] gi|156861078|gb|EDO54509.1| hypothetical protein BACUNI_01985 [Bacteroides uniformis ATCC 8492] gi|270272672|gb|EFA18535.1| conserved hypothetical protein [Bacteroides sp. D20] gi|316903795|gb|EFV25637.1| hypothetical protein HMPREF1007_02282 [Bacteroides sp. 4_1_36] Length = 289 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 40/87 (45%), Gaps = 6/87 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEFGTVKQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF 253 S++IE+F + G H+ I+ L F Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDF 152 >gi|315608296|ref|ZP_07883286.1| von Willebrand factor [Prevotella buccae ATCC 33574] gi|315250077|gb|EFU30076.1| von Willebrand factor [Prevotella buccae ATCC 33574] Length = 289 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 51/111 (45%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK DM + + I + + + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLD-FGTVRQTKRDMVTEIAATL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE++ + G H+ I+ + F + T + A + + R Sbjct: 126 SDRIEKYIPPKKGRRHILYIIREMLDFEAQSKKTD-IGAAVEYLTRVMKRR 175 >gi|145536452|ref|XP_001453948.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124421692|emb|CAK86551.1| unnamed protein product [Paramecium tetraurelia] Length = 886 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 47/112 (41%), Gaps = 15/112 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 Q + D++ +LD ++SM+ + I+++ KSI++++ E K +N ++ V Sbjct: 231 QIQQRPQADIIFILDCTKSMDKW-------IEVSKKSISSIIAEFKKKVQINAAIRMAAV 283 Query: 225 TFSNKIEEFFLLEW-----GVSHLQRKIKYLSKFGVS---TNSTPGLKYAYN 268 + + + +++ +++ L G + L AY Sbjct: 284 CYKDFSDGPNHIQYHNFTVRPEEIEKFFGNLKIEGGDDIPEDLQGALDMAYK 335 >gi|118372013|ref|XP_001019204.1| hypothetical protein TTHERM_00997640 [Tetrahymena thermophila] gi|89300971|gb|EAR98959.1| hypothetical protein TTHERM_00997640 [Tetrahymena thermophila SB210] Length = 996 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 60/152 (39%), Gaps = 17/152 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ + ++S SM KI + + I + ++ + G F+ ++E Sbjct: 790 DVLFLSNISSSM-----GCREKILFSSQIIYTIFQK----YIGKKFDRIGYYLFNESVKE 840 Query: 233 FFLLEWGVSH---LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ---GMRQHCNTEDAN 286 F L+ + L+ ++ + + + L A+ +I+D H D N Sbjct: 841 VFGLQRADKNWCFLENMLQNMPTVSGKASISNALNEAF-KIYDKNQYYAHTLHMKKVDLN 899 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKR 318 YKK ++ TD E +E +Q + K+ Sbjct: 900 YKKFVILFTD-EADILQEGKQIKKQWQKIKEN 930 >gi|307595146|ref|YP_003901463.1| hypothetical protein Vdis_1023 [Vulcanisaeta distributa DSM 14429] gi|307550347|gb|ADN50412.1| conserved hypothetical protein [Vulcanisaeta distributa DSM 14429] Length = 441 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 58/156 (37%), Gaps = 26/156 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDM-AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 + +++D S SM + +I ++ + + L + + N V+ ++ Sbjct: 279 IYMLIDKSGSMFYTVNINIFEVGAISKITWATALAVILAMKGGNLAVRF-------FDQQ 331 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + + + + + L G T+ T ++ A D +R + +V Sbjct: 332 VYPMLTNKNDIIKMLLSLIPLG-GTDITNAVRAAVQDAVDKPSLRDYK----------LV 380 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIG 326 +TDGE +D N+AK R V IG Sbjct: 381 VITDGE-----DDNVDPTVFNKAKSIFRSVKVLLIG 411 >gi|167763118|ref|ZP_02435245.1| hypothetical protein BACSTE_01487 [Bacteroides stercoris ATCC 43183] gi|167699458|gb|EDS16037.1| hypothetical protein BACSTE_01487 [Bacteroides stercoris ATCC 43183] Length = 289 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 47/107 (43%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEFGTVKQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S++IE+F + G H+ I+ L TN GL+Y N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFRPESRRTNIRLGLEYLTNVM 172 >gi|169342974|ref|ZP_02864002.1| von Willebrand factor type A domain protein [Clostridium perfringens C str. JGS1495] gi|169298883|gb|EDS80957.1| von Willebrand factor type A domain protein [Clostridium perfringens C str. JGS1495] Length = 614 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 62/170 (36%), Gaps = 28/170 (16%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 LK + P + I + + K ++ +D+++++D S SM D ++ Sbjct: 57 LKEVSEKPDEDGNYEITLTVKGKPKKVTK---PVDILLIMDASNSMYYNMDELKESMNSL 113 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL--EWGVSHLQRKIKYLSKF--- 253 + + + IPD + +V F EE F + + + K + K Sbjct: 114 VDKV------IDNIPDS----RIAVVAFGTYSEEVFSFNDKNNFTSKEEYKKAIKKSYNN 163 Query: 254 ---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +TN + A + + ++N KK ++F +DG Sbjct: 164 IEGMGNTNIEGTWRLA-------DEIFNNELNNNSNSKKDVIFFSDGYPN 206 >gi|160880107|ref|YP_001559075.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160428773|gb|ABX42336.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 513 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 41/233 (17%), Positives = 76/233 (32%), Gaps = 31/233 (13%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + T + + + Y + K I I P +S +K Sbjct: 276 LSSDTQQKLESMGRRTTYG--GLVSNDEVFKESYGIDKNAYLSPIKYPASSVIKNALNLY 333 Query: 170 ARLD-----MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 L ++ LD S SM F + + + K ++ L +I + + + ++ Sbjct: 334 QDLFRKPSVVVFCLDYSGSM--FGEGNEQLVAAMEKILDHKLASEDMIQ-FSKMDKIFVI 390 Query: 225 TFSNKIEEFFLLEWG--VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 FS++++ G ++L +IK G TN +++A + D Sbjct: 391 PFSSELKWVDSAISGIDTANLISRIKDTEAHGK-TNIYAPVEHAIEILKDFDAD------ 443 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 Y K IV MTDGE+ S V++I + Sbjct: 444 ---VYTKSIVLMTDGESAGNFRKGTSY---------DIPVFSIMFGEANPKQL 484 >gi|47200533|emb|CAF88599.1| unnamed protein product [Tetraodon nigroviridis] Length = 108 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 45/105 (42%), Gaps = 11/105 (10%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++ ++D S S+ S+ ++ + ++S+ + L + N ++ GL +S++ + Sbjct: 8 DIVFLVDGSSSIG---PSNFQEVRLFLRSLASGLN----VSPDN--IRIGLAQYSDEPHQ 58 Query: 233 FFLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 FLL+ + ++ T + + + Q F + Sbjct: 59 EFLLKDHMEKRNLLAALESFPYRNGGTETGKAINFLRKQYFTKKA 103 >gi|158314291|ref|YP_001506799.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158109696|gb|ABW11893.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 432 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 65/203 (32%), Gaps = 28/203 (13%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL--VTFSNKIEE 232 +IV+D S SM + TKI A ++ A ++ ++ + +G V + Sbjct: 45 VIVIDCSGSMS----TPATKIYAARRAACAAVDGLRPGTLFAVIRGTGTAQVVYPPGGGL 100 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +R I +S G T L A +F + + + Sbjct: 101 ARAAVDTRDQARRTINRMSAAG-GTAIGSWLLAA-RDLFA----------GHPDAVRHAI 148 Query: 293 FMTDGENLSTKED--QQSLYYCN---EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDG N +L C + RG + + + L A Sbjct: 149 LLTDGRNEHESAADFTDALAACRGHFQCDSRGVGRGWVAAELTAVSDTLLGTAR-----D 203 Query: 348 VENPHSMYDAFSHIGKDIVTKRI 370 + +P + F + K + + + Sbjct: 204 IADPADLVADFQAMTKAAMARAL 226 >gi|284929627|ref|YP_003422149.1| protoporphyrin IX magnesium-chelatase [cyanobacterium UCYN-A] gi|284810071|gb|ADB95768.1| protoporphyrin IX magnesium-chelatase [cyanobacterium UCYN-A] Length = 680 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 72/216 (33%), Gaps = 44/216 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D S SM ++ ++ A ++ +L E N Q L+ F + + Sbjct: 486 IIFVVDASGSM------ALNRMQSAKGAVMRLLTEA-----YENRDQIALIPFRGEQADV 534 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 +S +++++ L G S + + + + + +IV Sbjct: 535 LLPPTRSISLARKRLETLPCGGGSPLAHG--------LTQAVKIGVNAKMSGDIGQVVIV 586 Query: 293 FMTDGENL---------------STKEDQQSLYYCNEAKKRGAIVYAI-----GIRVIRS 332 +TDG ++ L + K G + I + Sbjct: 587 AITDGRGNISLSRSLGEQVLKEDKPNIKEELLNIASRIKMLGIKLLMIDTEKKFVSTGFG 646 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 E + A+ ++Y + A + + K+ +++ Sbjct: 647 KELAK--AAMGTYYQLPTATD--KAIAQMTKEAISQ 678 >gi|145547813|ref|XP_001459588.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427413|emb|CAK92191.1| unnamed protein product [Paramecium tetraurelia] Length = 2087 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 60/178 (33%), Gaps = 22/178 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDAR-LDMMIVLDVSRSMESFFDSSITKID 196 P FC W+T I S ++ + + + +++ V+D + SM+ I I Sbjct: 1822 PKSFCNQAAWHTQKDDIKSHSFSCMERHVENQIQGINVAFVIDSTGSMQHLIKMCINTIK 1881 Query: 197 MAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN------KIEEFFLLEWGVSH--LQRKIK 248 ++ NA K + ++ V++ + K + ++ S + + + Sbjct: 1882 DIMEQANAK----KTVYRDKLEIKFAAVSYKDHKFPYNKDQNIIEVQRFTSDEVILQFLD 1937 Query: 249 YLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 L G A + D D NY+K++ + D + Sbjct: 1938 KLQVDGGGD--------APEAVLDGLNASSGL-KWDQNYEKLLFLIADAPPHGKAYNN 1986 >gi|90412304|ref|ZP_01220309.1| hypothetical dinitrification protein NorD [Photobacterium profundum 3TCK] gi|90326795|gb|EAS43188.1| hypothetical dinitrification protein NorD [Photobacterium profundum 3TCK] Length = 613 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 73/213 (34%), Gaps = 23/213 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ D+S S +++ ++ ID+ S+ E + + D G + Sbjct: 415 NCQRDISTLLLSDLSMSTDAYINNEYRVIDVIKDSMLLFSEALAAVEDA--FAIYGFSSV 472 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLS---KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 F LL+ I+ + G T ++ A N + + R Sbjct: 473 KRHHVRFTLLKNFAESYDDHIRGRILSLRPGFYTRMGAAIRQASNILVEQTQHR------ 526 Query: 284 DANYKKIIVFMTDGENLST------KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 K+++ +TDG+ + + A++ G I + I I ++ ++L Sbjct: 527 -----KLLLILTDGKPNDIDNYDGRHGVEDTRQAIIAARRLGLIPFCITID-QKADQYLP 580 Query: 338 ACASPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 N F ++ +P + + + Sbjct: 581 YIFGNNGFTVIFDPSQLPTKLPQLYHQLTQSSC 613 >gi|221136524|ref|XP_002171216.1| PREDICTED: similar to proximal thread matrix protein 1, partial [Hydra magnipapillata] Length = 126 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 26/71 (36%), Gaps = 1/71 (1%) Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACASPNSF 345 K+I+ +TDG + + +E + G ++ +GI + E + Sbjct: 23 VSKLIILLTDGSQTPGGDAEDPERIADELRNDGVVILGVGIGSAVNETELSHITGGKKNA 82 Query: 346 YLVENPHSMYD 356 Y S+ D Sbjct: 83 YTAATFDSLTD 93 >gi|30249870|ref|NP_841940.1| von Willebrand factor type A domain-containing protein [Nitrosomonas europaea ATCC 19718] gi|30180907|emb|CAD85829.1| von Willebrand factor type A domain [Nitrosomonas europaea ATCC 19718] Length = 775 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 60/174 (34%), Gaps = 25/174 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + +M++LD+S S + + T +D+ ++ + + + I D + G + Sbjct: 583 DISVMVLLDLSESTNNQVNGQEFTVLDLTRQATVLLADAINRIGDPFAIH--GFCSDGRH 640 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVS----TNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 E++ + + + G++ T +++A H Sbjct: 641 DVEYYRFKDFDQPYNE-VPKMRLAGMTGQLSTRMGAAIRHA-----------SHFLKLQR 688 Query: 286 NYKKIIVFMTDGENLSTKEDQ------QSLYYCNEAKKRGAIVYAIGIRVIRSH 333 KK+++ +TDGE + EA + G + Y + + Sbjct: 689 AAKKLLLVITDGEPADVDVRDPQYLRFDTKKAVEEAGRSGILTYCMSLDPRADQ 742 >gi|84686516|ref|ZP_01014409.1| hypothetical protein 1099457000254_RB2654_07486 [Maritimibacter alkaliphilus HTCC2654] gi|84665429|gb|EAQ11906.1| hypothetical protein RB2654_07486 [Rhodobacterales bacterium HTCC2654] Length = 2260 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 28/218 (12%), Positives = 75/218 (34%), Gaps = 23/218 (10%) Query: 150 NSRHIVMPITSSVKVNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML 206 + + + T + V+ + +++M V+D S + ++ +++ ++ Sbjct: 1837 QTAEVEVFDTGQISVDLSSLSGSPNVNVMFVMD------SSGSVGSSGWELTKDAVSGLM 1890 Query: 207 EEVKLIPDVNNV-VQSGLVTFSNKI--EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 E+ V + ++ FSN + E F + VS + + LS +T + + Sbjct: 1891 TELDTQFTGTGVNLDFAVINFSNTVRREIEFTDTFPVSTMTALVDTLSWISSTTATGDAM 1950 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIV 322 A + +D + I +TDG + + + A + G Sbjct: 1951 LAAASMFYDQTASSDNN---------FIYLITDGVPYPLAAQETRIDQADTALRAYGVTT 2001 Query: 323 YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 + + ++ + + V + + D+ + Sbjct: 2002 EVVAVGNFQTGNV-EQLDTDGTILAVSSFSDLLDSVGN 2038 >gi|288928456|ref|ZP_06422303.1| von Willebrand factor, type A [Prevotella sp. oral taxon 317 str. F0108] gi|288331290|gb|EFC69874.1| von Willebrand factor, type A [Prevotella sp. oral taxon 317 str. F0108] Length = 292 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 49/102 (48%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F + TK +MA + + I + + + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLD-FGTTQRTKREMATEMAAIL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I + TN ++Y Sbjct: 126 SDRIEKYIPPKKGRKHILYIIHEMLDFKPESKRTNVAAAIEY 167 >gi|257063141|ref|YP_003142813.1| hypothetical protein Shel_04030 [Slackia heliotrinireducens DSM 20476] gi|256790794|gb|ACV21464.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 268 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 35/189 (18%), Positives = 63/189 (33%), Gaps = 25/189 (13%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 S+ L ++ VLD S SM + I+ ++ A++ +L++V ++ G Sbjct: 6 DTQSKARKLLPIIYVLDTSGSM--NVEGRISAVNEAMRETMDVLKDVAAKNPTAE-LKIG 62 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYL----------SKFGVSTNSTPGLKYAYNQIFD 272 ++ FS+ W L G T+ + NQ+ Sbjct: 63 VLAFSSGA------SWVTKDPSTGAPALLDLDDFYWNDMTAGGVTDLGAAMTELDNQL-- 114 Query: 273 MQGMRQHCNTEDANYK-KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R D +K +++FM+DG E + N + A AI + Sbjct: 115 ---NRSAMLVSDTGFKVPVLIFMSDGGPTDDWESAYNKAVANNRWVKSATKIAIAVGDGA 171 Query: 332 SHEFLRACA 340 L A Sbjct: 172 DRGVLTRVA 180 >gi|182439436|ref|YP_001827155.1| putative magnesium-chelatase subunit [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467952|dbj|BAG22472.1| putative magnesium-chelatase subunit [Streptomyces griseus subsp. griseus NBRC 13350] Length = 679 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 50/132 (37%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 496 VLFVVDASGSM-----AARQRMSAVKGAVLSLL-----LDAYQRRDKVGLVTFRGREAEV 545 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R ++V Sbjct: 546 ALPPTSSVDAAASRLESLPTGGRTPLAAGLLKA--HDVLRVERLRDPSRRP------LLV 597 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 598 VVTDGRATGGPE 609 >gi|171912850|ref|ZP_02928320.1| hypothetical protein VspiD_16760 [Verrucomicrobium spinosum DSM 4136] Length = 298 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 5/95 (5%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + L I+LD S SM K + A K A+ + D GLVTF Sbjct: 79 EETNLRATILLDSSGSMGYRGTKGTAKYEHARKLAAALAYVLTGQQDA-----VGLVTFD 133 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +KI +F SHL + L + ++ Sbjct: 134 DKIRDFIPCRTKTSHLHHILDTLQRSQPGGETSLA 168 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 67/177 (37%), Gaps = 22/177 (12%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHL--QRKIKYL 250 +++ +A+ ++ M+ ++ + V FS++ + + + + + + ++ L Sbjct: 210 SRLKIAVSAVKNMVGTLREQLGGKLTAK--FVVFSSEGYKNGVDKRASAKVITEAQLDQL 267 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 + G T+ + G+ +Q ++++V + DG++ ++ Sbjct: 268 TAVG-GTDLSAGVALGVDQFKSSSA------------RQVLVVVADGDSDDGYPNR---- 310 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVT 367 N K + I+Y +G S F + L + + +A I DI Sbjct: 311 TANNFKNKDGIIYTVGFTF-SSDSFNNLATDADHALLANSDTELGEAMEDISTDITA 366 >gi|260868782|ref|YP_003235184.1| hypothetical protein ECO111_2793 [Escherichia coli O111:H- str. 11128] gi|257765138|dbj|BAI36633.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|323177030|gb|EFZ62620.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 219 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A + + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKALDMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADMKTLAQ 171 >gi|241842676|ref|XP_002415420.1| CUB and sushi multiple domains (CSMD) protein, putative [Ixodes scapularis] gi|215509632|gb|EEC19085.1| CUB and sushi multiple domains (CSMD) protein, putative [Ixodes scapularis] Length = 609 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 38/124 (30%), Gaps = 10/124 (8%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + D S S+ ++ I + I ++ + G V F+N + Sbjct: 441 VYFAFDASASIG---------LNNLKTGIGLAKAITRKINVTDDGHRVGAVRFNNTAKLM 491 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI-FDMQGMRQHCNTEDANYKKIIV 292 S + + T ++ A I ++ + H + K I Sbjct: 492 LSPMNVKSTDAALLALDAVATEKTAGGTAIREAVKAIRTSLEIVDGHLKKDGFKLKFSIF 551 Query: 293 FMTD 296 F+TD Sbjct: 552 FITD 555 >gi|170679715|ref|YP_001743066.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218699334|ref|YP_002406963.1| hypothetical protein ECIAI39_0941 [Escherichia coli IAI39] gi|170517433|gb|ACB15611.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218369320|emb|CAR17078.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 219 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A + + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKALDMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADMKTLAQ 171 >gi|194436063|ref|ZP_03068165.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|253772996|ref|YP_003035827.1| von Willebrand factor A [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|254162064|ref|YP_003045172.1| hypothetical protein ECB_01978 [Escherichia coli B str. REL606] gi|291283314|ref|YP_003500132.1| hypothetical protein G2583_2597 [Escherichia coli O55:H7 str. CB9615] gi|293415361|ref|ZP_06658004.1| yegL protein [Escherichia coli B185] gi|297520674|ref|ZP_06939060.1| von Willebrand factor type A [Escherichia coli OP50] gi|300899770|ref|ZP_07117990.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] gi|300928180|ref|ZP_07143721.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] gi|301026364|ref|ZP_07189811.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|168986340|dbj|BAG11880.1| predicted protein [Escherichia coli O55:H7] gi|194424791|gb|EDX40776.1| von Willebrand factor type A domain protein [Escherichia coli 101-1] gi|242377723|emb|CAQ32484.1| conserved protein [Escherichia coli BL21(DE3)] gi|253324040|gb|ACT28642.1| von Willebrand factor type A [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|253973965|gb|ACT39636.1| hypothetical protein ECB_01978 [Escherichia coli B str. REL606] gi|253978159|gb|ACT43829.1| hypothetical protein ECD_01978 [Escherichia coli BL21(DE3)] gi|290763187|gb|ADD57148.1| Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain protein [Escherichia coli O55:H7 str. CB9615] gi|291433009|gb|EFF05988.1| yegL protein [Escherichia coli B185] gi|299879720|gb|EFI87931.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|300356649|gb|EFJ72519.1| von Willebrand factor type A domain protein [Escherichia coli MS 198-1] gi|300463775|gb|EFK27268.1| von Willebrand factor type A domain protein [Escherichia coli MS 187-1] gi|320657496|gb|EFX25294.1| von Willebrand factor type A [Escherichia coli O55:H7 str. 3256-97 TW 07815] gi|320663312|gb|EFX30617.1| von Willebrand factor type A [Escherichia coli O55:H7 str. USDA 5905] gi|323961761|gb|EGB57363.1| von Willebrand protein type A [Escherichia coli H489] gi|323972778|gb|EGB67977.1| von Willebrand protein type A [Escherichia coli TA007] Length = 219 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A + + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKALDMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADMKTLAQ 171 >gi|325292999|ref|YP_004278863.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] gi|325060852|gb|ADY64543.1| hypothetical protein AGROH133_06427 [Agrobacterium sp. H13-3] Length = 579 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 7/46 (15%), Positives = 18/46 (39%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSM 53 + F + G + I + P+ +L + ++ ++ K L Sbjct: 8 LPRFLKDSTGNIAISAGLTAPLFIGILALGVDYGYLTLQKRQLQQT 53 >gi|218131128|ref|ZP_03459932.1| hypothetical protein BACEGG_02733 [Bacteroides eggerthii DSM 20697] gi|317476994|ref|ZP_07936236.1| hypothetical protein HMPREF1016_03220 [Bacteroides eggerthii 1_2_48FAA] gi|217986648|gb|EEC52982.1| hypothetical protein BACEGG_02733 [Bacteroides eggerthii DSM 20697] gi|316906787|gb|EFV28499.1| hypothetical protein HMPREF1016_03220 [Bacteroides eggerthii 1_2_48FAA] Length = 289 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 6/84 (7%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLEFGTVKQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL 250 S++IE+F + G H+ I+ L Sbjct: 126 SDRIEKFIPPKKGRKHILYIIREL 149 >gi|254481586|ref|ZP_05094830.1| tetratricopeptide repeat domain protein [marine gamma proteobacterium HTCC2148] gi|214038214|gb|EEB78877.1| tetratricopeptide repeat domain protein [marine gamma proteobacterium HTCC2148] Length = 603 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 27/165 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++VLD+S SM + D + ++ D A + + +L + K Q+GL+ ++ Sbjct: 87 LVLVLDLSYSMMAA-DLAPSRNDRARQKLLDLLSQRKEG-------QTGLIAYAGDAHIV 138 Query: 234 FLLEWGVSHLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + L+ + L A ++ R Sbjct: 139 TPLTDDNPTIANLLPALNPGMMPLAGSEPAAALSQAVELMYSAGVQRGR----------- 187 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 I+ +TDG +E + G + +GI Sbjct: 188 IMLVTDGITEQDRE-----EISELLQGSGMGLVIMGIGTATGAPL 227 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 36/259 (13%), Positives = 80/259 (30%), Gaps = 61/259 (23%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 + + + +++V+D S S+ +K+ A + +++ K + + G+V F Sbjct: 71 SRSPVSVVLVIDS-----SGSMSASSKMTAAKNAAKNLIDSFKNSAKSGD--KLGIVDFD 123 Query: 228 NKIEEFFLL----------EWGVSHLQRKIKYLSKFGVSTNS------TPGLKYAYNQI- 270 + + W + S T+S T + A N I Sbjct: 124 TFVNDNSNFYVKGFYLQNGSWQKGNSSTIYGPYSLPNTCTSSLLDLTNTSAINSAKNLID 183 Query: 271 -----------FDMQGMRQHCNTEDANYKKIIVFMTDGENL------------------- 300 + + N + K ++F+TDG Sbjct: 184 NMNASGGTNMEAALNKAKNLLNASPSGNDKYVIFITDGMPTFYLNGTHNGYPLVDGPGLQ 243 Query: 301 -STKEDQQSLYYCNEAKKRGAIVYAIGI---RVIRSHEFLRACASP--NSFYLVENPHSM 354 + ++L + G ++ +G+ F+ AS Y + + +++ Sbjct: 244 PNNTTKSETLSAVQSLSQSGTKLFVVGVDTTGADVDKTFIELMASTANGKSYYISSTNAL 303 Query: 355 YDAFSHIGKDIVTKRIWYD 373 I K I+ + YD Sbjct: 304 NSILQDIFK-IINLAVTYD 321 >gi|163754428|ref|ZP_02161550.1| hypothetical protein KAOT1_16073 [Kordia algicida OT-1] gi|161325369|gb|EDP96696.1| hypothetical protein KAOT1_16073 [Kordia algicida OT-1] Length = 288 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 66/163 (40%), Gaps = 32/163 (19%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM+++DVS S F ++ + I I+A L + + GL+ F Sbjct: 72 EEERELTMMLMVDVSGS--ELFGTNTQFKNETITEISATLAFSAMQNNDK----VGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 S++IE F + G SH+ R I+ L T+ + L++ Sbjct: 126 SDEIELFIPPKKGKSHVLRIIRELLEHKPKSTKTDVSQALRF----------------LS 169 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK---RGAIVY 323 KK IVF+ + E +Q+L AKK G +Y Sbjct: 170 SVMKKKAIVFIL--SDFIADEYEQTLKI--SAKKHDLTGIRIY 208 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 62/168 (36%), Gaps = 6/168 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS-LDIVVVPQNEGYSISA 131 G + F +++ N V D + + + P + Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 132 ISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 SR I ++ +P T + V + +QT + + I Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFI 182 >gi|323498500|ref|ZP_08103493.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] gi|323316389|gb|EGA69407.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio sinaloensis DSM 21326] Length = 697 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 68/223 (30%), Gaps = 35/223 (15%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE---- 208 + + T + T R D + VLD S SM + T ++ + + + E Sbjct: 294 TVKLTFTPGDDLTRVTQGR-DWVFVLDKSGSMSGKYS---TLVEGVRQGLGKLPSEDRFR 349 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 V + + G V + V+ + ++ + TN G+ A Sbjct: 350 VVMFDSNTYDLTGGFVAVN---------AANVTKALQSVEQVEPSN-GTNLYEGMAAATR 399 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 ++ D + IV +TDG ++ + + + +K ++ + Sbjct: 400 KL-------------DDDRPTGIVLVTDG--VANVGVTEKRRFFDLMEKHDVRLFTFIMG 444 Query: 329 VIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + L S V N + I + + Sbjct: 445 NSANTPLLVPMTKLSNGIATSVSNADDIIGHLMSITSKLTYQA 487 >gi|221107135|ref|XP_002168442.1| PREDICTED: similar to predicted protein, partial [Hydra magnipapillata] Length = 603 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 440 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 492 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 493 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 545 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 546 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 601 >gi|307177175|gb|EFN66408.1| NUAK family SNF1-like kinase 1 [Camponotus floridanus] Length = 2617 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 94/294 (31%), Gaps = 33/294 (11%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 V A +++ E + K + L ++ MS N F N S Sbjct: 466 VAACLELIEESKERSPSKERSKTPLMSEQSCEPMSVENVQMYEKFYNQKPSNEEDKSQQK 525 Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 ++ A + K +P S + + + N + +A L Sbjct: 526 HTKSPSKSLHEDADTASK-SANKTNEMPLDKKSDALESKVPNENNTNIRNEAYL------ 578 Query: 179 DVSRSMESFFDS--SITKIDMAIKSINAML--------------EEVKLIPDVNNVVQSG 222 S+ M+ F + K+ S+++ L K+ Q Sbjct: 579 --SQDMQHFEPTENDFKKLKEKALSLDSELSNEVITALAKPVERRRSKIFETAEKFNQLA 636 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + K ++ F+ V +R + + T +P A I D+ ++ Sbjct: 637 STAENEKPKKIFIPGVNVGGAKRAFERKASLSSITTPSPTKVSASKVIIDVPTDKKGEK- 695 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + K+ I +G+ +T ED+ + +EAKKR + + I + L Sbjct: 696 -ETEQKQTI----EGQETATTEDRLNKQ--DEAKKRAIDIISGAIGKPPMQKKL 742 >gi|291440428|ref|ZP_06579818.1| chelatase [Streptomyces ghanaensis ATCC 14672] gi|291343323|gb|EFE70279.1| chelatase [Streptomyces ghanaensis ATCC 14672] Length = 230 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 50/132 (37%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + + Sbjct: 44 VLFVVDASGSM-----AARQRMSAVKGAVLSLL-----LDAYQRRDKVGLVTFRGSAADV 93 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R ++V Sbjct: 94 ALPPTSSVDAAAARLESLPTGGRTPLAAGLLKA--HDVLRVERLRDPARRP------LLV 145 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 146 VVTDGRATGGPE 157 >gi|239932061|ref|ZP_04689014.1| chelatase [Streptomyces ghanaensis ATCC 14672] Length = 236 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 50/132 (37%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + + Sbjct: 50 VLFVVDASGSM-----AARQRMSAVKGAVLSLL-----LDAYQRRDKVGLVTFRGSAADV 99 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R ++V Sbjct: 100 ALPPTSSVDAAAARLESLPTGGRTPLAAGLLKA--HDVLRVERLRDPARRP------LLV 151 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 152 VVTDGRATGGPE 163 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|156546336|ref|XP_001606541.1| PREDICTED: similar to ENSANGP00000020925 [Nasonia vitripennis] Length = 2053 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 58/137 (42%), Gaps = 11/137 (8%) Query: 162 VKVNSQTDARLDMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ ++ + DM+I++D S SM ++ + T + + +++ + + + ++ Q Sbjct: 1101 IEASTCSK---DMVILVDNSGSMTGMSNAIAKTTVSTIMSTLSN--NDFVAVFNFSDSTQ 1155 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + F +K+ + + + I + GV N T A+ + + Sbjct: 1156 QVVSCFQDKLVQATPE--NIRRINDDILTMKPEGV-ANITEAFLAAFTIL--ENYRNESR 1210 Query: 281 NTEDANYKKIIVFMTDG 297 D + ++I+ +TDG Sbjct: 1211 CGSDLSCNQMIMLVTDG 1227 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 62/188 (32%), Gaps = 14/188 (7%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + IL A+ L + ++G+ ++ + D++ + A K Sbjct: 22 DGSVNILFALSLIPVLGLVGLAVDYG---------LAAADKTTLDHAADTAALAAVVTAK 72 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRY 135 + ++ N F + + + G +++A Y Sbjct: 73 SYIAANQGQANLTANAIAAGLAQAANVFAVNAGSVPFAQVTLQPPQLVRSGQTLTATVSY 132 Query: 136 KIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSME-SFFDSSITK 194 ++ T + + +T+S + S LD +++DVS SM +T+ Sbjct: 133 GATIQNSFGKLLGTPTTLLGNSVTASADLPS----YLDFYLLVDVSGSMGLPATPGGMTQ 188 Query: 195 IDMAIKSI 202 + K + Sbjct: 189 LASVNKDM 196 >gi|118594678|ref|ZP_01552025.1| hypothetical protein MB2181_03380 [Methylophilales bacterium HTCC2181] gi|118440456|gb|EAV47083.1| hypothetical protein MB2181_03380 [Methylophilales bacterium HTCC2181] Length = 286 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 41/121 (33%), Gaps = 9/121 (7%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 + I P + ++I D+S SM D S K I SI K Sbjct: 54 QTIRDPFEQVQVRIFNQRSATPIVIASDLSASMNFGTDKSKHKFASEIASIITNSVMAKS 113 Query: 212 IPDVNNVVQSGLVTFSNKIEEF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQ 269 G + F++KI+ L + + I+ LS + ++ G+ Y Sbjct: 114 DA-------IGFIGFNDKIDPSWVSPLSYRPHRTEALIQRLSSYHAKSSGQKGMGNLYQH 166 Query: 270 I 270 I Sbjct: 167 I 167 >gi|162453507|ref|YP_001615874.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] gi|161164089|emb|CAN95394.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] Length = 490 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 23/64 (35%), Gaps = 5/64 (7%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 + + LD+ I+LD S SM + TK ++ A + + G Sbjct: 89 ETKTAELVPLDLYIMLDSSGSMRDRLATGGTKWTAVTDALEAFFT-----DPKSEGLGVG 143 Query: 223 LVTF 226 L F Sbjct: 144 LQYF 147 >gi|251810839|ref|ZP_04825312.1| regulator of nitric oxide reductase [Staphylococcus epidermidis BCM-HMP0060] gi|282876154|ref|ZP_06285021.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis SK135] gi|293366625|ref|ZP_06613302.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W2(grey)] gi|251805674|gb|EES58331.1| regulator of nitric oxide reductase [Staphylococcus epidermidis BCM-HMP0060] gi|281295179|gb|EFA87706.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis SK135] gi|291319394|gb|EFE59763.1| regulator of nitric oxide reductase [Staphylococcus epidermidis M23864:W2(grey)] gi|329732904|gb|EGG69249.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis VCU028] gi|329735355|gb|EGG71647.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis VCU045] Length = 629 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + ++ + S +++ + ++ A ++ Sbjct: 479 EDAFEADQRQQPNIIDEIINYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|27468009|ref|NP_764646.1| hypothetical protein SE1091 [Staphylococcus epidermidis ATCC 12228] gi|27315554|gb|AAO04688.1|AE016747_185 conserved hypothetical protein [Staphylococcus epidermidis ATCC 12228] Length = 629 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 59/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + ++ + S +++ + ++ A ++ Sbjct: 479 EDAFEADQRQQPNIIDEIINYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|223043129|ref|ZP_03613176.1| conserved hypothetical protein [Staphylococcus capitis SK14] gi|222443340|gb|EEE49438.1| conserved hypothetical protein [Staphylococcus capitis SK14] Length = 629 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 37/309 (11%), Positives = 95/309 (30%), Gaps = 56/309 (18%) Query: 39 EVSHIFFMKTV-LHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 +++ + K + +D I K + ++ N + Sbjct: 313 DMTDMMTKKGKGSQNTLDH---DEGGFIGQNQAFALEGINKNVKVEWKVPNIQPQHILDY 369 Query: 98 L-RDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISA------ISRYKIPLKFCTFIPWYTN 150 N +I D+++ I Q+E ++++ + + I ++ F Sbjct: 370 QHSKNDVQFEIKDLIQIIKKTIDREHQDERHNLTKGRLQKDLINWFIDDQYKLFYKKQDL 429 Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 S+ T +++D S SM K+D IK + E +K Sbjct: 430 SKTFDATFT----------------LLVDASASMHD-------KMDETIKGVVLFHETLK 466 Query: 211 LIPDVNNVVQSGLVTFSNKIEE-----FFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGL 263 + + ++ F E ++ + S +++ + + Sbjct: 467 SLNIKHEILAFNEDAFEADDREQPNIIDEIINYNYSIFEKEGPRIMSLEPQDDNRDGVAI 526 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRG 319 + A ++ + + ++ ++ +DGE + Q + A+K G Sbjct: 527 RIASERLL-----------QRIHQQRFLIVFSDGEPSAFNYSQDGIIDTYEAVETARKFG 575 Query: 320 AIVYAIGIR 328 V+ + + Sbjct: 576 IEVFNVFLS 584 >gi|220914886|ref|YP_002490194.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219952637|gb|ACL63027.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 353 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 54/154 (35%), Gaps = 27/154 (17%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSI------TKIDMAIKSINAMLEEVKLIPDVNNVV 219 + T A++ +++D S SM F +K + + + + Sbjct: 82 TGTGAQVS--LLIDRSGSMNESFAGRQPSGAEESKAAASRRLLAEFVGR-------RPHD 132 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQ 278 Q + FS + ++ I+ + + G+ TN GL A +Q Sbjct: 133 QVAVSVFSTAPIAVLPMTDRSEAVRAAIRAIDRPGLDATNVARGLGLALSQF-------- 184 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 A+ ++++ ++DG + + Q +L Sbjct: 185 ---GAGASTGRVLLLVSDGAAVIDRRTQDTLRAA 215 >gi|237728320|ref|ZP_04558801.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|226909798|gb|EEH95716.1| conserved hypothetical protein [Citrobacter sp. 30_2] Length = 637 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 51/159 (32%), Gaps = 23/159 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I+LD S SM + A ++ +ML + I V + F + Sbjct: 466 FTILLDSSGSMS-------RSVKEAEAAVVSMLYALDGIQGVT----TSAYHFPHAAHNS 514 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNS-TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L G R +FG+ T TP + + + D+ + + ++V Sbjct: 515 VGLLKGREQTLRTAIGTHQFGIGTQGCTPLCESLWPALADLTSAKADRH--------VLV 566 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 TDG+ + AK IV IG Sbjct: 567 IATDGQPDDMAS---ARAMIQSAKDDDIIVIGIGFGDAN 602 >gi|320652019|gb|EFX20372.1| von Willebrand factor type A [Escherichia coli O157:H- str. H 2687] Length = 219 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A + + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKALDMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADIKTLAQ 171 >gi|37499120|gb|AAQ91617.1| unknown [Antonospora locustae] Length = 824 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 36/177 (20%), Positives = 69/177 (38%), Gaps = 28/177 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D S SM ++ + ++I + + D+N + +V F ++ E Sbjct: 290 IILVVDKSGSMGWKCGGTVPH-KLVSEAIEVFM---STVCDLN--IHVNVVFFDHECCES 343 Query: 234 FLLEWGVSH----------LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +L G S + ++ + T GL+ A + E Sbjct: 344 DVLFRGCSERLTSEQGLLDKKEWVREKAGPRGGTCIVAGLQRAVD---------LKPAAE 394 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 D + ++ I+ +TDG S + SL AK G +AIGI S++ + A Sbjct: 395 DGSIRRNIILLTDG-GDSNLREITSLVQREAAK--GTRFFAIGIGNGVSYDTVMEVA 448 >gi|15802553|ref|NP_288580.1| hypothetical protein Z3241 [Escherichia coli O157:H7 EDL933] gi|15832135|ref|NP_310908.1| hypothetical protein ECs2881 [Escherichia coli O157:H7 str. Sakai] gi|168750415|ref|ZP_02775437.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|168757974|ref|ZP_02782981.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|168761818|ref|ZP_02786825.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|168769829|ref|ZP_02794836.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|168775476|ref|ZP_02800483.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|168782359|ref|ZP_02807366.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|168788745|ref|ZP_02813752.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|168799042|ref|ZP_02824049.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|195938267|ref|ZP_03083649.1| hypothetical protein EscherichcoliO157_17851 [Escherichia coli O157:H7 str. EC4024] gi|208809428|ref|ZP_03251765.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208817158|ref|ZP_03258250.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208821068|ref|ZP_03261388.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209395812|ref|YP_002271316.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|217329554|ref|ZP_03445633.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|254793857|ref|YP_003078694.1| hypothetical protein ECSP_2829 [Escherichia coli O157:H7 str. TW14359] gi|261225077|ref|ZP_05939358.1| hypothetical protein EscherichiacoliO157_10843 [Escherichia coli O157:H7 str. FRIK2000] gi|261257577|ref|ZP_05950110.1| hypothetical protein EscherichiacoliO157EcO_17392 [Escherichia coli O157:H7 str. FRIK966] gi|12516275|gb|AAG57135.1|AE005433_7 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933] gi|13362350|dbj|BAB36304.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai] gi|187768972|gb|EDU32816.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4196] gi|188015377|gb|EDU53499.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4113] gi|189000214|gb|EDU69200.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4076] gi|189355130|gb|EDU73549.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4401] gi|189361283|gb|EDU79702.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4486] gi|189367794|gb|EDU86210.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4501] gi|189371544|gb|EDU89960.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC869] gi|189378529|gb|EDU96945.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC508] gi|208729229|gb|EDZ78830.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4206] gi|208730777|gb|EDZ79467.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4045] gi|208741191|gb|EDZ88873.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4042] gi|209157212|gb|ACI34645.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. EC4115] gi|209747972|gb|ACI72293.1| hypothetical protein ECs2881 [Escherichia coli] gi|209747974|gb|ACI72294.1| hypothetical protein ECs2881 [Escherichia coli] gi|209747976|gb|ACI72295.1| hypothetical protein ECs2881 [Escherichia coli] gi|217317322|gb|EEC25751.1| von Willebrand factor type A domain protein [Escherichia coli O157:H7 str. TW14588] gi|254593257|gb|ACT72618.1| conserved protein [Escherichia coli O157:H7 str. TW14359] gi|320191760|gb|EFW66408.1| hypothetical protein ECoD_01435 [Escherichia coli O157:H7 str. EC1212] gi|320641372|gb|EFX10827.1| von Willebrand factor type A [Escherichia coli O157:H7 str. G5101] gi|320646742|gb|EFX15629.1| von Willebrand factor type A [Escherichia coli O157:H- str. 493-89] gi|320668101|gb|EFX34980.1| von Willebrand factor type A [Escherichia coli O157:H7 str. LSU-61] gi|326338430|gb|EGD62258.1| hypothetical protein ECoA_05044 [Escherichia coli O157:H7 str. 1044] gi|326347047|gb|EGD70780.1| hypothetical protein ECF_00219 [Escherichia coli O157:H7 str. 1125] Length = 219 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 63/179 (35%), Gaps = 16/179 (8%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 ++ S + R +++LDVS SM + I+ + +E+ P V Sbjct: 8 ATSDFASNPEPRCPCILLLDVSGSM------NGRPINELNAGLVTFRDELLADPLALKRV 61 Query: 220 QSGLVTFS-NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G+VTF +E+ F S L G T + A + + + R+ Sbjct: 62 ELGIVTFGPVHVEQPF-----TSAANFFPPILFAQG-DTPMGAAITKALDMV--EERKRE 113 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + Y+ I +TDG + + + ++IG++ + Sbjct: 114 YRANGISYYRPWIFLITDGAPTDE-WQAAANKVFQGEEDKKFAFFSIGVQGADIKTLAQ 171 >gi|320108233|ref|YP_004183823.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319926754|gb|ADV83829.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 342 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 38/226 (16%), Positives = 74/226 (32%), Gaps = 30/226 (13%) Query: 158 ITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I +K +Q L + I+LD S S + + D + + +L+ Sbjct: 88 IPQKIKNFTQEKTLPLTIGILLDTSGS---QMNVLPLEKDAGAQFLKDVLQPKDEAF--- 141 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDM 273 L+ F ++ L+R + + G + + G ++D Sbjct: 142 ------LIKFDINVDMLADYTNSPRELKRALDKAEINTGAGTGSVTGNGTPRG-TLLYDA 194 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + H +K++V +TDG + + A+K AI+Y I I Sbjct: 195 VYLGAHDKLRQEAGRKVLVVLTDGGDQG--SQLKLQDAIEAAQKANAIIYVILIADRGGF 252 Query: 334 -----------EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + L + N + DAF+ I ++ T+ Sbjct: 253 GNFSFGGSGDMDRLSRETGGRMINVGNNGKKLEDAFAQISDELRTQ 298 >gi|269125512|ref|YP_003298882.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310470|gb|ACY96844.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 814 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 70/230 (30%), Gaps = 39/230 (16%) Query: 154 IVMPITSSVKVNSQTDARLDMMI----VLDVSRSMESFFDSSITKIDMAIKSINA----- 204 + P ++V+ Q ++L + I ++D+S SM + +T++ + Sbjct: 589 LPTPQPATVQELVQAWSKLSLSIRMLSIIDISGSMLAPVGGGLTRMQATARVAQGGLSLL 648 Query: 205 ----------MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG 254 ++ D +V G + E + L + K Sbjct: 649 PNDSELGQWVFSTKLDGDKDYKEIVPMGPLG-----ERVGSVTRRQLLLSSLSRIEPKPT 703 Query: 255 VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST--KEDQQSLYYC 312 T + AY + + I+ TDG+N +Q+L Sbjct: 704 GDTGLYDTILAAYRYMSKTYKPEFGNS---------ILLFTDGKNEDDDGPTLRQTLREL 754 Query: 313 NEAKK--RGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAF 358 R V +G L+ A + Y+ +NP+ + F Sbjct: 755 ESMIDPTRPIQVIMLGFGPGVDVNELKQIAKVTRGDVYVTQNPNEIQKIF 804 >gi|166367319|ref|YP_001659592.1| magnesium protoporphyrin IX chelatase subunit D [Microcystis aeruginosa NIES-843] gi|166089692|dbj|BAG04400.1| magnesium protoporphyrin IX chelatase subunit D [Microcystis aeruginosa NIES-843] Length = 665 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 73/220 (33%), Gaps = 37/220 (16%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + ++ A R P + + RH+++ A ++ V+D S Sbjct: 422 GKVKRVAVDATLRAAAPYQKARRLRNP--ERHVIIEEGDLRSKRLARKAGSLIVFVVDAS 479 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGV 240 SM ++ ++ A ++ +L E N Q L+ F + + + Sbjct: 480 GSM------ALNRMQSAKGAVMRLLTEA-----YENRDQIALIPFRGERADVLLPPTKSI 528 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + +++++ L G S + + + R + +IV +TDG + Sbjct: 529 TLAKKRLETLPCGGGSPLAHGLTQA----VHVGMNARLSGDIGQV----VIVAITDGRSN 580 Query: 301 -------------STKEDQQS--LYYCNEAKKRGAIVYAI 325 K D ++ L + + G + I Sbjct: 581 IPLAKSLGEPIPEGEKPDIKAELLDIAAKIRSLGIKLLVI 620 >gi|159027822|emb|CAO87035.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 704 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 73/220 (33%), Gaps = 37/220 (16%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 + + ++ A R P + + RH+++ A ++ V+D S Sbjct: 461 GKVKRVAVDATLRAAAPYQKARRLRNP--ERHVIIEEGDLRSKRLARKAGSLIVFVVDAS 518 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGV 240 SM ++ ++ A ++ +L E N Q L+ F + + + Sbjct: 519 GSM------ALNRMQSAKGAVMRLLTEA-----YENRDQIALIPFRGERADVLLPPTKSI 567 Query: 241 SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + +++++ L G S + + + R + +IV +TDG + Sbjct: 568 TLAKKRLETLPCGGGSPLAHGLTQA----VHVGMNARLSGDIGQV----VIVAITDGRSN 619 Query: 301 -------------STKEDQQS--LYYCNEAKKRGAIVYAI 325 K D ++ L + + G + I Sbjct: 620 IPLAKSLGEPIPEGEKPDIKAELLDIAAKIRSLGIKLLVI 659 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|290990289|ref|XP_002677769.1| predicted protein [Naegleria gruberi] gi|284091378|gb|EFC45025.1| predicted protein [Naegleria gruberi] Length = 754 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 74/195 (37%), Gaps = 31/195 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I LDVS SM ID A +I+ + E+V IPDV L+ + E Sbjct: 40 QIVIALDVSGSMRGQG------IDQAKIAISNLFEQVVDIPDVV------LIAYDTSAE- 86 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + L + ++ + G T+ T + ++ + I+ Sbjct: 87 LYDLRKKPAETRQSTLEQIQAGGGTDFTCVF-----------EAISKLDMFNSQSEVAIL 135 Query: 293 FMTDGENLSTKEDQQSLYYCNEA---KKRGAIVYAIGIRVIRSHEFL----RACASPNSF 345 F TDG++ S+ + ++++ + K + + IG L + + +F Sbjct: 136 FFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQGTF 195 Query: 346 YLVENPHSMYDAFSH 360 V++ + + + + Sbjct: 196 QYVKDANEINQSMEN 210 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 54/411 (13%), Positives = 131/411 (31%), Gaps = 64/411 (15%) Query: 15 YKGGMTILTAIFLPIIFLVLGMIIEVSHI-FFMKTVLHSMIDRSLVHAATQIMNEGNGNN 73 G + + + L + + +G+ ++ + + + + L M+ + Sbjct: 24 RSGSVGFVFGLLLLPMMVAMGVSVDYARVSAARSDLAAAADAAVLSVTNKAAMSLDMLSA 83 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN--------E 125 + +++ +KN M + + + V D+ + +T L + Sbjct: 84 QARVRDAF----LKNIQTMPDISGVSADAVVIDLLGVRAAT-LSYTASYRTAFSGILGMR 138 Query: 126 GYSISAISRYKIPLKFCT-FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL-DVSRS 183 S+S + K + F NS + + TS+ + D+S Sbjct: 139 TLSVSGNAASKSAVPIYMDFYLLLDNSPSMGVGATSADISTMVSRTPDKCAFACHDLSAG 198 Query: 184 MESFFD-----SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 ++ +ID+ ++ +++ V ++ L ++ + + Sbjct: 199 NSDYYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTAL--YTMGADCASVGLT 256 Query: 239 GVSHLQRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQGMRQHCN---------TEDANY 287 VS L + + + K YN Q D G+ Q N + Sbjct: 257 TVSPLSSDLAAAKTNAQAIDLMTIQKPGYNNDQCTDFDGVFQSLNGKIDVAGDGSTALTP 316 Query: 288 KKIIVFMTDG--ENLSTKEDQQ-----------SLYYCNEAKKRGAIV---YAIGIRVIR 331 +K++ ++DG + + +L C K RG + Y + + Sbjct: 317 QKVVFLVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLKNRGIKIAVLYTTYLPLPT 376 Query: 332 SH--------------EFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 + ++ CASP ++ V + DA + + + V++ Sbjct: 377 NDWYNTWIAPFQATLPSAMQGCASPGLYFEVSPTQGIADAMTTLFQRTVSQ 427 >gi|126730247|ref|ZP_01746058.1| hypothetical protein SSE37_10844 [Sagittula stellata E-37] gi|126708980|gb|EBA08035.1| hypothetical protein SSE37_10844 [Sagittula stellata E-37] Length = 181 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 17/151 (11%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMK--------------TVLHS 52 +R F + G +TI T I+LP++ VL + + F K + S Sbjct: 4 KLRRFAGDETGNVTIETLIWLPLLLTVLASMFSLHDAFRQKSLNTKAAYTISDAISRETS 63 Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND--IDDI 110 ID + + ++ + + N R+ + G +D +D+ Sbjct: 64 AIDAAYLDGMLDLLEFLTSSEGPYSLRVTQVRYDAN-QGAYIRDWSQTRGLFSDLRTEDL 122 Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKF 141 V T ++ + ++Y P + Sbjct: 123 VSLTDRLPTLLHNERVIMVETETQYVPPFEL 153 >gi|75676719|ref|YP_319140.1| hypothetical protein Nwi_2535 [Nitrobacter winogradskyi Nb-255] gi|74421589|gb|ABA05788.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255] Length = 554 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 8/45 (17%), Positives = 20/45 (44%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS 52 +R F + +G I+ A + ++ + ++V I+ + S Sbjct: 5 LRRFISDTRGNFAIMGAGCMVLVLGCTALGVDVGAIYADRRKAQS 49 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 50/160 (31%), Gaps = 22/160 (13%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 NEG G N ++ + + N LR G V D+V+ Sbjct: 75 NEGEGQNDGSEADEAETGQVADPSSSRSVNVLRAGGLVRGDTGGTSGGEGDLVLAK---- 130 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S+ + + + S+ + + D+++VLD S SM+ Sbjct: 131 -SLETDGQGD-------------KVITLEAYASGSMSI-VEQGIPCDIVLVLDQSGSMDD 175 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 F S N L L + N V+ G ++ Sbjct: 176 RF-GSQGSYHALSGYSNKRLG--DLAENGNLYVRGGDGSY 212 >gi|281207766|gb|EFA81946.1| hypothetical protein PPL_05180 [Polysphondylium pallidum PN500] Length = 1990 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 71/211 (33%), Gaps = 43/211 (20%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN--- 228 LD++ V+D + SM S +I + I +++++ I + V+ +V +++ Sbjct: 1662 LDLVFVVDDTGSMGS-------EIAKVKQEIQNIVDDIVSIGSIE--VRVAMVFYNDHTP 1712 Query: 229 ------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + + F + L+R + + G + + + ++ + + Sbjct: 1713 NSDHSKSVCKVFKFTSDIPELRRGLDSVVVHG-GADHPEAMADGFYEVTKLDFAKSS--- 1768 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQS-----------LYYCNEAKKRGAIVYAIGIRVIR 331 K+ + + D S + + + G Y + + Sbjct: 1769 -----TKVCIVIGDAPPHGFSGSGDSFPQGCPCGHDLIASVRQLVQGGVTFYTV-MCRGD 1822 Query: 332 SH--EFLRACA--SPNSFYLVENPHSMYDAF 358 S E L A A S F L+ N + + Sbjct: 1823 SQTYETLNAIADLSEGRFVLLNNASELTEII 1853 >gi|17230314|ref|NP_486862.1| hypothetical protein alr2822 [Nostoc sp. PCC 7120] gi|17131916|dbj|BAB74521.1| alr2822 [Nostoc sp. PCC 7120] Length = 224 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 65/165 (39%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ I+ + + ++ +E+ V+ +VTF + I Sbjct: 24 VLLLDTSGSMQG------AAIEALNQGLLSLKDELMKNSIAARRVEIAIVTFDSHINV-- 75 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 ++ V+ Q L+ G+ T+ G+ A + + Q + Y + VFM Sbjct: 76 -IQDFVTADQFNPPILTAQGL-TSMGAGIHKALDMV---QERKSLYRANGVAYYRPWVFM 130 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + +Q +L + + +++G+ Sbjct: 131 ITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENANMTRL 175 >gi|302561365|ref|ZP_07313707.1| magnesium chelatase [Streptomyces griseoflavus Tu4000] gi|302478983|gb|EFL42076.1| magnesium chelatase [Streptomyces griseoflavus Tu4000] Length = 201 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 47/133 (35%), Gaps = 21/133 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 20 VLFVVDASGSM-----AARQRMGAVKGAVLSLL-----LDAYQRRDKVGLVTFRGSATEV 69 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI-I 291 V +++ L G T GL A+ + ++ + Sbjct: 70 ALPPTSSVDAAAARLESLPT-GGRTPLAAGLLRAHEVL--------RVERLRDPARRALV 120 Query: 292 VFMTDGENLSTKE 304 V +TDG E Sbjct: 121 VVVTDGRATGGPE 133 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 27/158 (17%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R + +G + + A+FLP+ LV+ IIE+ ++ K +L + Sbjct: 4 FRRLAADGRGSVAVEFALFLPVFLLVIFSIIELGAAWYQKQMLVNA-------------- 49 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 +L + R + +V++ +D + Sbjct: 50 ---SREGARLGALFSTSGGLTAQEVQERVNQYLSDSGFPSQAVVQAVGVD---GASGDPV 103 Query: 128 SISAISRYKIPL--KFCTFIPWYTNSRHIVMPITSSVK 163 +++ + Y+ P+ F +P I + T+ ++ Sbjct: 104 TVNVSADYEFPVLSAFIGAVPG-----TISLSATTVMR 136 >gi|291515302|emb|CBK64512.1| hypothetical protein AL1_22410 [Alistipes shahii WAL 8301] Length = 285 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 7/78 (8%) Query: 174 MMIVLDVSRSMES---FFDSSITKIDMAIKSINAML----EEVKLIPDVNNVVQSGLVTF 226 +++ +D S SM F + +TK + N ML E + V + ++ + Sbjct: 14 IILAIDGSGSMAEKIRFRNRQMTKAEAVATITNGMLFELIERARRSDGVRDYYDIAVIGY 73 Query: 227 SNKIEEFFLLEWGVSHLQ 244 S E + LL G + Sbjct: 74 SGDDEVYSLLPGGRELIS 91 >gi|221108027|ref|XP_002158065.1| PREDICTED: similar to Brain-specific angiogenesis inhibitor 3 [Hydra magnipapillata] Length = 335 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 55/164 (33%), Gaps = 23/164 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK-LIPDVNNVVQS 221 V S + VLD S S+E + +L VK VN V Sbjct: 132 TVKSCNHVNFTFVFVLDSSSSVED---------QEWMDEKGLVLSFVKSSAFGVNPNVDV 182 Query: 222 GLVTFSNKIEEFFLLEWGVSH--LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 +V + + + S+ + + L T GL A + + Q Sbjct: 183 AVVNYGSTAQVEADCGTFKSYSTFETFMNNLKMVTGGTAIHRGLTTA-------ETVFQR 235 Query: 280 CNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIV 322 C + N +IV +TDG EN+ T D + N K +V Sbjct: 236 CQKLNMNP--VIVLLTDGFENIDTNIDSN-IAIENRIKSEALLV 276 >gi|260834997|ref|XP_002612496.1| hypothetical protein BRAFLDRAFT_75382 [Branchiostoma floridae] gi|229297873|gb|EEN68505.1| hypothetical protein BRAFLDRAFT_75382 [Branchiostoma floridae] Length = 370 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 42/140 (30%), Gaps = 35/140 (25%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ-SGLVTFSNK 229 LD ++ LD S SM + K++ + V+ +N+ + G+V F K Sbjct: 2 PLDTVLCLDTSGSMAGRG------MRELKKAVREFILGVQETASKHNLRENVGVVEFGAK 55 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L + + R + + TP Sbjct: 56 TRIVQPLTNNYAAVLRAVGGVLVLSGQKKMTPR--------------------------- 88 Query: 290 IIVFMTDGENLSTKEDQQSL 309 ++ MTDG + ++ Sbjct: 89 -VILMTDGHPDDKQNVLKAA 107 >gi|189500596|ref|YP_001960066.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496037|gb|ACE04585.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 1232 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 65/174 (37%), Gaps = 21/174 (12%) Query: 159 TSSVKVNSQTDARLD--MMIVLDVSRSMESF--FDSSITKIDMAIKSINAMLEEVKLIPD 214 T+ + T +D ++I+LD S SM S T++ +++ + ++++ + + Sbjct: 697 TAIPVKETATATPIDTNLLIILDTSGSMGSNPGVSGYSTRMAVSVDAAKELIDQYDALGE 756 Query: 215 VNNVVQSGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM 273 V V+ V F + + W + + + + V+ ++ PG Y+Q + Sbjct: 757 VK--VRI--VDFDSTGTKKGS-SWQTATAAKSTLDTM----VNAHTDPGQLTNYDQALNY 807 Query: 274 QGMRQHCNTEDANYKKIIVFMTDGE-------NLSTKEDQQSLYYCNEAKKRGA 320 + + + + F++DG N S + + G Sbjct: 808 GMDAYDDSGALTDAQGVSYFISDGVPTTNTNWNTYPNLVSGSDNWPDWGTNNGI 861 >gi|326336011|ref|ZP_08202187.1| von Willebrand factor [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691808|gb|EGD33771.1| von Willebrand factor [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 288 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 6/87 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S FF S + I I A L L + +GL+ F Sbjct: 72 EEERELTLMLLVDVSGS--EFFGSLQQFKNEIITEIAATLAFAALQNNDK----TGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKF 253 S++IE + + G SH+ R I+ L +F Sbjct: 126 SDQIELYIPPKKGKSHILRIIRELIEF 152 >gi|291570879|dbj|BAI93151.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 220 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 60/165 (36%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ +D + E + V+ ++TF N+++ Sbjct: 20 VLLLDTSASMQG------EPLDGLNVGLMTFRENLIKDELAKKRVEIAVITFDNQVK--- 70 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 +++ V+ + + L+ G T + A + I + Y + VFM Sbjct: 71 IIQDFVTADRFEPPMLTAQGQ-TYMGTAIGEALDMIASR---KAEYRNNGITYYRPWVFM 126 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + +Q +E + +A+G+ Sbjct: 127 ITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGVEGANMERL 171 >gi|261337937|ref|ZP_05965821.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270277430|gb|EFA23284.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 351 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 15/94 (15%), Positives = 33/94 (35%), Gaps = 14/94 (14%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +++ +V ++ D+++ LDVS S ID + + E Sbjct: 79 VLVGRPGTVDSQTERQGNRDIVLCLDVSGS---TLPYDHEVIDTYRELVKHFQGE----- 130 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + GL F++ F L + +++ Sbjct: 131 ------RIGLSIFNSTSRTVFPLTDDYELVSQQL 158 >gi|209525932|ref|ZP_03274466.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209493609|gb|EDZ93930.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 228 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 69/188 (36%), Gaps = 17/188 (9%) Query: 152 RHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++ M + +V+ + R +++LD S SM+ +D + E + Sbjct: 5 KNQNMRLEEAVEFAENPEPRCPCVLLLDTSASMQG------EPLDGLNAGLMTFRENLIK 58 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 V+ ++TF N+++ +++ V+ + + L+ G T + A + I Sbjct: 59 DELAKKRVEIAIITFDNQVK---IIQDFVTADRFEPPLLNAQGQ-TYMGTAIGEALDMIA 114 Query: 272 DMQGMRQHCNTEDANYKKIIVFM-TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGI 327 + Y + VFM TDGE + +Q +E + +A+G+ Sbjct: 115 SR---KAEYRNNGITYYRPWVFMITDGEPQGESDRITEQAIKRIRDEEANKQVAFFAVGV 171 Query: 328 RVIRSHEF 335 Sbjct: 172 EGANMERL 179 >gi|253701053|ref|YP_003022242.1| hypothetical protein GM21_2441 [Geobacter sp. M21] gi|251775903|gb|ACT18484.1| protein of unknown function DUF58 [Geobacter sp. M21] Length = 294 Score = 38.2 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 20/97 (20%), Positives = 38/97 (39%), Gaps = 9/97 (9%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L++M+++D S S+ A + V+ + GL+T S+++E Sbjct: 78 LNLMLLVDRSASLACPTPRGAKSRLAAEAASLLAFAAVRSND------RVGLITCSDRVE 131 Query: 232 EFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 F G QR + L + G ++ L+Y Sbjct: 132 SFIPPAKGARQAQRIVASLCGGASSGRGSDLAGALEY 168 >gi|291450680|ref|ZP_06590070.1| magnesium-chelatase subunit [Streptomyces albus J1074] gi|291353629|gb|EFE80531.1| magnesium-chelatase subunit [Streptomyces albus J1074] Length = 277 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 53/132 (40%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 91 VLFVVDASGSM-----AARRRMGAVKGAVLSLL-----LDAYQRRDKVGLVTFRGREAEV 140 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R DA + ++V Sbjct: 141 ALPPTSSVDAAAARLQALPTGGRTPLAAGLLKA--HDVLRVERLR------DATRRPLLV 192 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 193 VVTDGRATGGPE 204 >gi|239978792|ref|ZP_04701316.1| putative magnesium-chelatase subunit [Streptomyces albus J1074] Length = 336 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 53/132 (40%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 150 VLFVVDASGSM-----AARRRMGAVKGAVLSLL-----LDAYQRRDKVGLVTFRGREAEV 199 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R DA + ++V Sbjct: 200 ALPPTSSVDAAAARLQALPTGGRTPLAAGLLKA--HDVLRVERLR------DATRRPLLV 251 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 252 VVTDGRATGGPE 263 >gi|213962893|ref|ZP_03391153.1| tellurium resistance protein [Capnocytophaga sputigena Capno] gi|213954550|gb|EEB65872.1| tellurium resistance protein [Capnocytophaga sputigena Capno] Length = 212 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 59/195 (30%), Gaps = 12/195 (6%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL +LD S SM S I ++ ++ P + + ++TF ++ Sbjct: 4 RLLAYFLLDTSGSM------SGEPIQALNNGFYGLINMLRNDPQAMDSLHLSVITFDREV 57 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S + I+ T++ L+ + + D Sbjct: 58 RNIIPLIDLASF--QPIEIKCPESGPTHTGEALEMVCDLVKTELIRGSAQQKGDWKPLLF 115 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVEN 350 I TDG+ ++ ++ + +A G + +L+ + Sbjct: 116 I--FTDGKPSDIQKYREYI-PKVKALDFGV-IVGCAAGPKAEVSYLQELTPDVVKLDTTD 171 Query: 351 PHSMYDAFSHIGKDI 365 + + F + I Sbjct: 172 ANMLTTFFKWVSSSI 186 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 62/168 (36%), Gaps = 6/168 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS-LDIVVVPQNEGYSISA 131 G + F +++ N V D + + + P + Y Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCT 134 Query: 132 ISRYKIP---LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 SR I ++ +P T + V + +QT + + I Sbjct: 135 TSRTGIVNWFIQTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFI 182 >gi|329954836|ref|ZP_08295853.1| hypothetical protein HMPREF9445_00688 [Bacteroides clarus YIT 12056] gi|328526940|gb|EGF53951.1| hypothetical protein HMPREF9445_00688 [Bacteroides clarus YIT 12056] Length = 303 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 6/84 (7%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + ++ + G++ F Sbjct: 86 EEERELTVMLLVDVSGSLEFGTVKQMKKDMVTEIAATLAFSAIQNND------KIGVIFF 139 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL 250 S++IE+F + G H+ I+ L Sbjct: 140 SDRIEKFIPPKKGRKHILYIIREL 163 >gi|323491534|ref|ZP_08096713.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] gi|323314110|gb|EGA67195.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] Length = 411 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 42/130 (32%), Gaps = 6/130 (4%) Query: 12 FYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNG 71 + KG + ++ + + + V + I+VSH K L + +D + + AA + +EG Sbjct: 11 YRRQKGLVLVMVTVAMLALVGVAALAIDVSHAVLNKARLQNSVDAAALAAAIVMDSEGTN 70 Query: 72 NNRKKLKGGDILCRIKNT------WNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + T ++ S N LD V Sbjct: 71 AEATTAANTTLTNLASATGNSEMDFSSSTVTVQYSNDPTIFPQTSGYDADLDTYVRVTVS 130 Query: 126 GYSISAISRY 135 YS+ Y Sbjct: 131 AYSLDNFFAY 140 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 57/147 (38%), Gaps = 8/147 (5%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G ++IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 17 RRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAIN 74 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAI 132 G + F +++ N V D + P+N S S+I Sbjct: 75 LSVPEAAGITAGHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSI 128 Query: 133 SRYKIPLKFCTFIPWYTNSRHIVMPIT 159 K + W+ + ++V +T Sbjct: 129 KYVKCTTSRTGIVNWFIQTLNLVPGVT 155 >gi|189461335|ref|ZP_03010120.1| hypothetical protein BACCOP_01985 [Bacteroides coprocola DSM 17136] gi|189431864|gb|EDV00849.1| hypothetical protein BACCOP_01985 [Bacteroides coprocola DSM 17136] Length = 289 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 47/107 (43%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ + K +A + ++ + G++ F Sbjct: 72 EEERELTVMLLVDVSNSLDFGTVKQLKKNMVAEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S++IE+F + G H+ I+ L TN ++Y N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELLDFKPESTRTNIQCAIEYLTNVL 172 >gi|220912875|ref|YP_002488184.1| von Willebrand factor A [Arthrobacter chlorophenolicus A6] gi|219859753|gb|ACL40095.1| von Willebrand factor type A [Arthrobacter chlorophenolicus A6] Length = 317 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 49/141 (34%), Gaps = 11/141 (7%) Query: 168 TDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 L++ V+D + SM + +S ++D + I + +E+ + L+TF Sbjct: 39 ATTDLNVFFVVDTTTSMVAEDYGTSAPRMDGVRQDIMDIAKELPGA-------RFSLITF 91 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 K L LQ + L + + A + Sbjct: 92 DTKAFVRMPLTTDTLALQTMVTVLEPQVTAFAKGSSITAARQVL---SERLDAARESHPE 148 Query: 287 YKKIIVFMTDGENLSTKEDQQ 307 +++ ++ DGE S KE ++ Sbjct: 149 RPRLVYYLGDGEQTSGKEPER 169 >gi|261820223|ref|YP_003258329.1| hypothetical protein Pecwa_0904 [Pectobacterium wasabiae WPP163] gi|261604236|gb|ACX86722.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 539 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 62/223 (27%), Gaps = 26/223 (11%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F + KG T A+ + + I++ S T + D + + Q G Sbjct: 28 FIHQEKGAGTAFYALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATING- 86 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 I ++ L N +++ + + + Y+++ Sbjct: 87 --------EEYSQEDINTLAYEYVKSNLGMNKALSEKLVAGDVSVTEGRNSATRKTYTVT 138 Query: 131 AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDS 190 K L S V+ + + +V+ V+ M + Sbjct: 139 VAFATKPSLLSLGARAQEVYSTSEVINRPTE------------VALVMPVTGDMSA---G 183 Query: 191 SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I + S L + + + LV +S + + Sbjct: 184 DIRSLKSVSHSFVERL--LGSADTKRDNLWLSLVPYSQSVNVY 224 >gi|261416578|ref|YP_003250261.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261373034|gb|ACX75779.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325972|gb|ADL25173.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 236 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 65/200 (32%), Gaps = 25/200 (12%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAML--EEVKLIPDVNNVVQSGLVTFSN 228 L ++I+ DVS SM I K+D ++N M+ + + + ++TF N Sbjct: 14 PLPVIILADVSGSMNE-----IGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGN 68 Query: 229 -------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + + + S + I + G + L A + D+ R+ Sbjct: 69 QAANIILEPQSASEIANDPSKMN-VINKMQAIGNT-----PLGKALTSLVDLLENRE--I 120 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 Y+ IV +DG + N + + A A+ I L+ + Sbjct: 121 YPSRAYRPFIVLASDGMPNDLWQQPLDRLL-NSERSKKANRLALAIGADADESMLKKFVN 179 Query: 342 PNS--FYLVENPHSMYDAFS 359 + N + F Sbjct: 180 NEEMPIFKANNAIEIQKFFK 199 >gi|169334752|ref|ZP_02861945.1| hypothetical protein ANASTE_01158 [Anaerofustis stercorihominis DSM 17244] gi|169257490|gb|EDS71456.1| hypothetical protein ANASTE_01158 [Anaerofustis stercorihominis DSM 17244] Length = 375 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 17/102 (16%), Positives = 33/102 (32%), Gaps = 12/102 (11%) Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKII--VFMTDGENLSTKEDQQSLYYCNEAKKRGAIV 322 A +++ + + D K + + +TD N+ D L C K Sbjct: 168 KAVEVVWECPNSVDYASK-DIEAAKYMRSLLVTDKYNVGISSDVIGLEICAAMKNA---- 222 Query: 323 YAIGIRVIRSHEFL-----RACASPNSFYLVENPHSMYDAFS 359 Y+IG+ + + L C + + V + A Sbjct: 223 YSIGLGMAEGFKELDGKLNNNCKAALFTFAVAEMGVLTKAMG 264 >gi|324516166|gb|ADY46443.1| C-type lectin domain-containing protein 160 [Ascaris suum] Length = 397 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 67/225 (29%), Gaps = 24/225 (10%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSS---VKVNSQTDAR---LDMMIVLDVSRSMESFFDSS 191 PL + + + + +P TSS + LD+++ +D S SM Sbjct: 7 PLTILMGLLSTSFAANATVPSTSSSYYAERACACQPSKLYLDIVVAIDSSLSMTKEGL-- 64 Query: 192 ITKIDMAIKSINAMLEE-VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL 250 I +A L V ++ LV F++ L + ++ L Sbjct: 65 ---IQVAADLATLFLPMNVSSESAQGQFIRVALVAFADNAVIVGDL-NKYHNYASLVEGL 120 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 T ++ + +H +I+ + + D + Sbjct: 121 FTIDYHGGKTLNIEAGLKAASTVLESSRHYAKT------VILLYSSAYSAGGFADPNA-- 172 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACA---SPNSFYLVENPH 352 N+ K+ G + I R ++ SPN + + Sbjct: 173 IANQIKESGTKIITIAFRQQPEGTLVKKLGHLSSPNFAFGSMDTS 217 >gi|325089698|gb|EGC43008.1| U-box domain-containing protein [Ajellomyces capsulatus H88] Length = 759 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 68/205 (33%), Gaps = 23/205 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL------IPDVNNVVQSGLV 224 D+++ +DVS SM+S T + + ++ I +N + G+V Sbjct: 73 PCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIV 132 Query: 225 TFSNKIEEFFLLE-WGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS + E + + S+ + +K + STN GLK ++ Sbjct: 133 AFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLK------LGLKAFENERR 186 Query: 282 TEDANYKKIIVFMTDGENLS----TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 T + + +TDG + R +++ G L+ Sbjct: 187 TPQSVQALYV--LTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRSGLLQ 244 Query: 338 ACAS--PNSFYLVENPHSMYDAFSH 360 A A +F + + + F H Sbjct: 245 AIAEVGGGTFAFIPDAGMIGTVFVH 269 >gi|260912476|ref|ZP_05919012.1| von Willebrand factor [Prevotella sp. oral taxon 472 str. F0295] gi|260633395|gb|EEX51549.1| von Willebrand factor [Prevotella sp. oral taxon 472 str. F0295] Length = 292 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 49/102 (48%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F + TK +MA + + I + + + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLD-FGTTERTKREMATEMAAIL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I + TN ++Y Sbjct: 126 SDRIEKYIPPKKGRKHILYIIHEMLDFKPESKRTNVAAAIEY 167 >gi|189350352|ref|YP_001945980.1| hypothetical protein BMULJ_01515 [Burkholderia multivorans ATCC 17616] gi|189334374|dbj|BAG43444.1| putative membrane protein [Burkholderia multivorans ATCC 17616] Length = 599 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 52/151 (34%), Gaps = 9/151 (5%) Query: 28 PIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIK 87 I +VLG I+V +++F + L + D + + + + ++ + N ++ Sbjct: 1 MIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSP-----AMMAAQS 54 Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYS-ISAISRYKIPLKFCTFIP 146 N F + R +G I+ TS + +S ++P F Sbjct: 55 NALANGF--DYRASGQTLSIECGRWDTSATPYFNSTFTPLNAVSVSVTQQVPYIFLGRFF 112 Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + S + S+ K + + + Sbjct: 113 GKSGSTGATVAAFSTAKAINIDSFTIGTTLA 143 >gi|153806289|ref|ZP_01958957.1| hypothetical protein BACCAC_00545 [Bacteroides caccae ATCC 43185] gi|149130966|gb|EDM22172.1| hypothetical protein BACCAC_00545 [Bacteroides caccae ATCC 43185] Length = 289 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 24/107 (22%), Positives = 46/107 (42%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S+E + K + + + N + G++ F Sbjct: 72 EEERELTVMLMVDVSGSLE------FGTVKQLKKDMITEIAATLAFSAIQNNDKIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S++IE+F + G H+ I+ L TN L+Y N + Sbjct: 126 SDRIEKFIPPKKGRKHILYIIRELIDFQPESHRTNIRLALEYLTNVM 172 >gi|302804200|ref|XP_002983852.1| hypothetical protein SELMODRAFT_423100 [Selaginella moellendorffii] gi|300148204|gb|EFJ14864.1| hypothetical protein SELMODRAFT_423100 [Selaginella moellendorffii] Length = 532 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 40/187 (21%), Positives = 66/187 (35%), Gaps = 23/187 (12%) Query: 173 DMMIVLDVSRSMESFFD--SSITKIDMAIKSINAMLEEVKLIPDVNNVV----QSGL-VT 225 + I+LD S SM + SS T+ ++A I +L + V + G V+ Sbjct: 196 SLYILLDTSTSMSNPTGVLSSQTRFNVASNIITQLLNTLTNGDQVAVSTIGGEKIGAPVS 255 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 ++E L G+S L+ I S ++N GL+ A + + Sbjct: 256 VVLDVQETSLYLAGISSLKDSISNTSVTNSASNIKNGLQAALDFFNTSSNLN-------- 307 Query: 286 NYKKIIVFMTDGE----NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACAS 341 +I+ TDG+ + A+ VY IG F + +S Sbjct: 308 ----VIILFTDGQFVTPGNFNFTQLSPVLAQLNARNVVVFVYRIGSFTSNDATFQQMQSS 363 Query: 342 PNSFYLV 348 N Y V Sbjct: 364 LNMSYEV 370 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 19/105 (18%), Positives = 42/105 (40%), Gaps = 4/105 (3%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 +G + +L L + V + ++V+H+ KT L + +D + + AAT I++ Sbjct: 10 RKQQGLVVVLVTAALLVFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAAT-ILDNSKDK 68 Query: 73 NRKKLKGGDILCRIKNTWNMSFRNELRDN---GFVNDIDDIVRST 114 + + G L + + + + + ND D + Sbjct: 69 DAVDAEIGTALNAMAASTGNHEIDFTTASINIDYSNDPQDFTGTA 113 >gi|148545445|ref|YP_001265547.1| hypothetical protein Pput_0188 [Pseudomonas putida F1] gi|148509503|gb|ABQ76363.1| conserved repeat domain [Pseudomonas putida F1] Length = 9030 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 62/213 (29%), Gaps = 12/213 (5%) Query: 99 RDNGFVNDIDDIVRST-SLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 F + + + +A + + + + IV Sbjct: 8435 YKGSFDITVTSTATESLGGSAITTGNIPVTVYAATYKASVGTSGNDTMTGSEGNDIIVAD 8494 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDV 215 ++ V + ++ ++D S SM I A + ++ +K L D Sbjct: 8495 VSGLNVVQGKN---YNIAFMVDSSGSMSD------KSIADAKTQLASVFNTLKASLGSDT 8545 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + V LV F ++ + + K++ + V G Y Sbjct: 8546 SGTVNIFLVDFDTQVNKNVAVNLADPDALSKLQAVLNSMVGGYYGGGTNYEDAFKTTSNF 8605 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T + + + F+TDG+ + + + Sbjct: 8606 FNSTMATGNKGAENLTYFITDGKPTYYQSGEST 8638 >gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens] Length = 550 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 58/142 (40%), Gaps = 22/142 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+DVS SM K+ ++++ A+L+++ + + ++ F++ + + Sbjct: 264 IVFVIDVSGSMWG------VKMKQTVEAMQAILDDLTIDDHFS------IIDFNHNVRCW 311 Query: 234 F-LLEWGVS----HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L G S ++ I+ + G TN L A + + Q + Sbjct: 312 SEELVAGSSIQIADAKKYIQNIKPNG-GTNINEALMRAVQMLL--KASNQGLIDPRSVS- 367 Query: 289 KIIVFMTDGENLSTKEDQQSLY 310 +I+ ++DG+ + ++ Sbjct: 368 -MIILVSDGDPTVGEIKLSTIQ 388 >gi|229596471|ref|XP_001009013.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565280|gb|EAR88768.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2269 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 70/188 (37%), Gaps = 17/188 (9%) Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 QN+ + ++ KI I N ++ + S +K + +IV D S Sbjct: 2050 QNQKLQLQRLTYQKIWSIVGQKICNKNNIQYQEICDASIIKKCKSQINPVHFIIVFDESG 2109 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SME K K + ++ + + L+ F++ ++ + +E Sbjct: 2110 SMEG------EKWITLRKELLNFIDNRSRATAQDFIT---LIGFAHTVKLYTKVEKLNEQ 2160 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 +++K+ TN + L+ A N + Q Q + +I F++DG+ Sbjct: 2161 IKQKVPQ-EFMDGGTNYSAPLQQALNIL--SQEQCQTFKKNN-----VIFFLSDGDAKEP 2212 Query: 303 KEDQQSLY 310 K + Q L Sbjct: 2213 KTEIQQLQ 2220 >gi|168216759|ref|ZP_02642384.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381207|gb|EDT78686.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 1341 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 53/136 (38%), Gaps = 25/136 (18%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++VLD S+++ + T A + +L + ++ G+V ++ K E Sbjct: 74 DVVLVLDTSQTVNESYGDIST---AAQTFVTNILGK------QYKNLKVGVVCYNEKAEI 124 Query: 233 FFLLEWGVSHLQRKIKY-------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + IK S + TN + A + + + Sbjct: 125 LHGFDNSTYSINNDIKKCYQNYNIKSGNHLGTNVGDAFRLAISML---------GKDSNP 175 Query: 286 NYKKIIVFMTDGENLS 301 N +KI++FM++G+ + Sbjct: 176 NKEKIVIFMSNGKPNA 191 >gi|108761783|ref|YP_632825.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] gi|108465663|gb|ABF90848.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] Length = 694 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 10/57 (17%), Positives = 23/57 (40%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 +R F +G +L A+ + I+ + + + + H + L + D + A Sbjct: 6 LRQSFRRQEGQALVLAALMVLIMSIAVLTTVNIGHTVHERIRLQNTADSASYSMAAM 62 >gi|75907326|ref|YP_321622.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701051|gb|ABA20727.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 224 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 65/165 (39%), Gaps = 17/165 (10%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +++LD S SM+ I+ + + ++ +E+ V+ ++TF + I Sbjct: 24 VLLLDTSGSMQG------AAIEALNQGLLSLKDELMKNSIAARRVEIAIITFDSHINV-- 75 Query: 235 LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 ++ V+ Q L+ G+ T+ G+ A + + Q + Y + VFM Sbjct: 76 -IQDFVTADQFNPPILTAQGL-TSMGAGIHKALDMV---QERKSLYRANGVAYYRPWVFM 130 Query: 295 -TDGENLSTKE---DQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 TDGE + +Q +L + + +++G+ Sbjct: 131 ITDGEPQGELDHLVEQAALRLQGDEVNKRVAFFSVGVENANMTRL 175 >gi|83312056|ref|YP_422320.1| hypothetical protein amb2957 [Magnetospirillum magneticum AMB-1] gi|82946897|dbj|BAE51761.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 752 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 32/71 (45%), Gaps = 9/71 (12%) Query: 172 LDMMIVLDVSRSMESFF------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 LD++IV+D S SM D +++ +A ++ + K + + V + ++ Sbjct: 32 LDVVIVIDQSGSMSGNKAHPAANDPEASRLQVAKLILSRL---AKSVENTALVHRVSIIE 88 Query: 226 FSNKIEEFFLL 236 F + +E L Sbjct: 89 FGSTVEVPVSL 99 >gi|114567581|ref|YP_754735.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338516|gb|ABI69364.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 776 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 32/143 (22%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + + ++D+SRSME KI+ A +I L + N L+ F Sbjct: 272 EQRQPKEYIFLIDISRSMEG------KKIEHAADAIQICLRNLDEGDSFN------LLAF 319 Query: 227 SNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 ++ F + +K L G TN P ++ A Sbjct: 320 ESENHAFAPKSLPYNQENLDKASAWVKNLHAMG-GTNILPAVQLAL-------------- 364 Query: 282 TEDANYKKIIVFMTDGENLSTKE 304 E + +K+++ TDG+ + E Sbjct: 365 KEAGDQQKVVILATDGQVGNENE 387 >gi|110632964|ref|YP_673172.1| TadE-like [Mesorhizobium sp. BNC1] gi|110283948|gb|ABG62007.1| TadE-like protein [Chelativorans sp. BNC1] Length = 140 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 35/91 (38%), Gaps = 9/91 (9%) Query: 8 IRNFFYNYKGGMTILTAIF-LPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 R F + G + AI +P++ + LG+ +E FF++ L D + A +++ Sbjct: 8 FRRFLESGSGAAAVEFAIVCMPLLLICLGI-VEFGRAFFVRNDLSYAADVA----ARKVL 62 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNE 97 G D ++ +F + Sbjct: 63 ---IGQIPAGAPSSDAASGLETAVREAFVGD 90 >gi|226327208|ref|ZP_03802726.1| hypothetical protein PROPEN_01074 [Proteus penneri ATCC 35198] gi|225204426|gb|EEG86780.1| hypothetical protein PROPEN_01074 [Proteus penneri ATCC 35198] Length = 109 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 20/100 (20%), Positives = 37/100 (37%), Gaps = 10/100 (10%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I + +L ++ P ++TF++ Sbjct: 3 RLPVYLLLDTSGSMYG------EPIAAVKNGVEMLLSTLRQDPYALETAYISIITFNSTA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 ++ L + K+ L G +T L N+I Sbjct: 57 QQIVPL---TDLINFKVPDLVASG-TTALGSALTLVSNRI 92 >gi|225436577|ref|XP_002274874.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 747 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 57/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + R + + T A ++ V+D S SM Sbjct: 496 KRLAVDATLRAAAPYQKLRREKNAQKGRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 555 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 556 ------ALNRMQNAKGAALKLL-----AESYTSRDQVAIIPFRGDSAEVLLPPSRSIAMA 604 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +++++ L G S + + + + + +IV +TDG Sbjct: 605 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRVMIVAITDGRAN 653 >gi|323492793|ref|ZP_08097935.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] gi|323312864|gb|EGA65986.1| von Willebrand factor type A (vWA) domain-containing protein [Vibrio brasiliensis LMG 20546] Length = 694 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 68/221 (30%), Gaps = 31/221 (14%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 + + T + T R D + VLD S SM + T ++ + + + Sbjct: 294 TVKLTFTPGDDLTRVTQGR-DWVFVLDKSGSMSGKYS---TLVEGVRQGLGKL------- 342 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQI 270 + + + + + +++ + ++ + + TN G+ A ++ Sbjct: 343 -PAEDRFRVVMFDNNTYDLTGGFVAVNPTNVSKALQSVEQVEPSNGTNLYEGMSAAIRRL 401 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 D + IV +TDG ++ + + +K ++ + Sbjct: 402 -------------DDDRPTGIVLVTDG--VANVGVTEKRRFFELMEKHDVRLFTFIMGNS 446 Query: 331 RSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + L S V N + +I + + Sbjct: 447 ANTPLLVPMTKLSNGVATSVSNADDIIGHLMNITSKLTYQA 487 >gi|282861923|ref|ZP_06270986.1| cobaltochelatase subunit [Streptomyces sp. ACTE] gi|282562948|gb|EFB68487.1| cobaltochelatase subunit [Streptomyces sp. ACTE] Length = 677 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 50/132 (37%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF K E Sbjct: 494 VLFVVDASGSM-----AARRRMGAVKGAVLSLL-----LDAYQRRDKVGLVTFRGKDAEV 543 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R ++V Sbjct: 544 ALPPTSSVDAAAARLESLPTGGRTPLAAGLLKA--HDVLRVERLRDPSRRP------LLV 595 Query: 293 FMTDGENLSTKE 304 +TDG + Sbjct: 596 VVTDGRATGGPD 607 >gi|329954839|ref|ZP_08295856.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526943|gb|EGF53954.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 342 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 60/175 (34%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + +++MI LD+S SM + D +++ A + + ++++++ + Sbjct: 79 GSKLETVKRQGVEVMIALDISNSMLAQ-DVQPSRLQKAKRLVAQLVDKMEND-------K 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 G++ F+ + + ++ + T + A Sbjct: 131 VGMIVFAGDAFTQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLA----------- 179 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 T + ++ +TDGEN + + + V +G+ + Sbjct: 180 SRSFTPQEGVGRAVIVITDGENHEGGAVEAAKAAAEKG----IQVNVLGVGMPEG 230 >gi|187918197|ref|YP_001883760.1| hypothetical membrane associated protein [Borrelia hermsii DAH] gi|119861045|gb|AAX16840.1| hypothetical membrane associated protein [Borrelia hermsii DAH] Length = 373 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 69/200 (34%), Gaps = 24/200 (12%) Query: 132 ISRYKIPLKFCTFIPWYTNSRH---IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFF 188 I + +F F ++ IV I +K + + A LD++ V+DV+ SM++ Sbjct: 189 IENNGLREEFARFADDVIVAKKGLDIVDKIKDILKKSEEPLADLDLVFVIDVTDSMKNH- 247 Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF--LLEWGVSHLQRK 246 I++ + +L+ ++ + + G V + + +E+F ++ Sbjct: 248 ------IEILRE---HLLDMIEPQLNQFRSYRVGFVFYKDYLEDFLTRSFDFNSREYLSN 298 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGEN-LSTKED 305 + G + + N R + I+ + + + Sbjct: 299 VFEGINVGGGGDYPEAVFEGINSAVTQFDWRADS--------RFIIVLGNAPPHEYPRGP 350 Query: 306 QQSLYYCNEAKKRGAIVYAI 325 AK++ I+Y I Sbjct: 351 IVYEDVIRAAKEKDIIIYGI 370 >gi|145487105|ref|XP_001429558.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124396651|emb|CAK62160.1| unnamed protein product [Paramecium tetraurelia] Length = 891 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 47/105 (44%), Gaps = 12/105 (11%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL-IPDVNN 217 T+ +K Q +D++ ++D++ SM+ + S A +I+ ++EE + I N Sbjct: 224 TNQIKKMVQKAQVIDVLFIIDITGSMQRWLTS-------AKSNIHKIIEEFQTQIDKKKN 276 Query: 218 VVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTN 258 +V++ +V + + +E +L + + L G + Sbjct: 277 IVRTAIVAYRDFGDEDNMLYREFTSDTKIIFEFLNQLQAKGGGDD 321 >gi|313230976|emb|CBY18974.1| unnamed protein product [Oikopleura dioica] Length = 522 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 62/206 (30%), Gaps = 40/206 (19%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN------NVVQSGLV 224 RLD+ +V+D S S K +L+E V + G V Sbjct: 280 RLDISVVIDTSGS------QVGDKQSAVKDFFKDLLKEFDTQTAVKISITDIGDGREGQV 333 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 N + S + + L+ +G +T G+ +Q+ Sbjct: 334 ---NTVLGPTQFVD-TSDINSALNSLTWYGTTTAIADGITEGASQMDTTD---------- 379 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK-RGAIVYAIGIRVIRS--HEFLRACAS 341 N +++ +TDG D SL + A G AIG L A+ Sbjct: 380 -NVNDVMIVITDG----FDGDLSSLQTASAAVATAGITAIAIGYDENGGIIGSTLEDIAN 434 Query: 342 --PNSFYLVENPHSM----YDAFSHI 361 ++ + + F+ I Sbjct: 435 GVSSNVIEATSTSELDGLALSVFNSI 460 >gi|294811900|ref|ZP_06770543.1| Putative magnesium-chelatase subunit [Streptomyces clavuligerus ATCC 27064] gi|326440364|ref|ZP_08215098.1| magnesium-chelatase subunit [Streptomyces clavuligerus ATCC 27064] gi|294324499|gb|EFG06142.1| Putative magnesium-chelatase subunit [Streptomyces clavuligerus ATCC 27064] Length = 737 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 53/132 (40%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 517 VLFVVDASGSM-----AARRRMSAVKGAVLSLL-----LDAYQRRDKVGLVTFRGREAEV 566 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ MR D + + ++V Sbjct: 567 ALPPTSSVDAAAARLERLPTGGRTPLAAGLLKA--HDVLRVERMR------DPSRRALLV 618 Query: 293 FMTDGENLSTKE 304 +TDG + Sbjct: 619 VVTDGRATGGPD 630 >gi|296086006|emb|CBI31447.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 67/174 (38%), Gaps = 31/174 (17%) Query: 105 NDIDDIVRSTSLDIVVVPQNEGY-SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 N + + T ++ P+++ Y + + + K + ++I +++S Sbjct: 134 NPVRTVEIKTYPEVSAAPRSKSYDNFTVLVHLKAAV--------ANTGQNIQRNMSNSPL 185 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI-----PDVNNV 218 + A +D++ VLD+S SM TK+ + +++ L+ V + ++ Sbjct: 186 NSHNPRAPVDLVTVLDISGSMAG------TKLALLKRAMGFALQAVNSLVANGGTNIAEG 239 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG-LKYAYNQIF 271 ++ G ++ E + I L G T +T ++ A+ Q Sbjct: 240 LRKGAKVMEDRKERNP--------VSSII--LLSDGQDTYTTESVIQDAFAQCI 283 >gi|218693385|gb|ACL01196.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693373|gb|ACL01190.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693363|gb|ACL01185.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693343|gb|ACL01175.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|7741585|gb|AAF69021.1| calcium-binding outer membrane-like protein [Pseudomonas putida] Length = 1800 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 53/162 (32%), Gaps = 8/162 (4%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S + + +N ++ ++D S SM I A + ++ Sbjct: 1253 TGSEGNDIFVADVSGLNVVQGKNYNIAFMVDSSGSMSD------KSIADAKTQLASVFNT 1306 Query: 209 VK--LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +K L D + V LV F ++ + + K++ + V G Y Sbjct: 1307 LKASLGSDTSGTVNIFLVDFDTQVNKNVAVNLADPDALSKLQAVLNSMVGGYYGGGTNYE 1366 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T + + + F+TDG+ + ++ + Sbjct: 1367 DAFKTTSNFFNSTMATSNKGAENLTYFITDGKPTYYQSNEST 1408 >gi|87308721|ref|ZP_01090860.1| hypothetical protein DSM3645_10797 [Blastopirellula marina DSM 3645] gi|87288432|gb|EAQ80327.1| hypothetical protein DSM3645_10797 [Blastopirellula marina DSM 3645] Length = 329 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 57/204 (27%), Gaps = 27/204 (13%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 L + + + + +V V D S SM + Sbjct: 118 ETNGSDMSGLPSASGLTGGGGGSKAGIDGAGTTRVFGIEGTGSKFAYVFDRSGSMG---N 174 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKY 249 S+ + A + A L E+ + Q ++ ++ + F L Sbjct: 175 SAGRPLAAAKSELLASLRELDK------LHQFQIIFYNEEPRVFNPL-----------GG 217 Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY---KKIIVFMTDGENLSTKEDQ 306 + + L + G QH +I F+TD + ++E+ Sbjct: 218 TPRLMFADEQGKALAEQFVGSIVATGGTQHLKALYMALAMSPDVIFFLTDADGSPSEEEM 277 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVI 330 + + R A++ I V Sbjct: 278 ADIRR----RNRSAVINTIEFGVG 297 >gi|22536818|ref|NP_687669.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|25010692|ref|NP_735087.1| hypothetical protein gbs0632 [Streptococcus agalactiae NEM316] gi|76788219|ref|YP_329407.1| Cna B domain-containing protein [Streptococcus agalactiae A909] gi|77411235|ref|ZP_00787585.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|22533664|gb|AAM99541.1|AE014220_13 cell wall surface anchor family protein, putative [Streptococcus agalactiae 2603V/R] gi|23095046|emb|CAD46276.1| Unknown [Streptococcus agalactiae NEM316] gi|76563276|gb|ABA45860.1| cna B-type domain protein [Streptococcus agalactiae A909] gi|77162661|gb|EAO73622.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|218693223|gb|ACL01115.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693225|gb|ACL01116.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693227|gb|ACL01117.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693229|gb|ACL01118.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693231|gb|ACL01119.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693233|gb|ACL01120.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693237|gb|ACL01122.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693239|gb|ACL01123.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693243|gb|ACL01125.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693245|gb|ACL01126.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693249|gb|ACL01128.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693251|gb|ACL01129.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693253|gb|ACL01130.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693255|gb|ACL01131.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693257|gb|ACL01132.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693259|gb|ACL01133.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693261|gb|ACL01134.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693263|gb|ACL01135.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693265|gb|ACL01136.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693267|gb|ACL01137.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693269|gb|ACL01138.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693271|gb|ACL01139.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693273|gb|ACL01140.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693275|gb|ACL01141.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693277|gb|ACL01142.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693279|gb|ACL01143.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693281|gb|ACL01144.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693283|gb|ACL01145.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693285|gb|ACL01146.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693289|gb|ACL01148.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693291|gb|ACL01149.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693293|gb|ACL01150.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693295|gb|ACL01151.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693297|gb|ACL01152.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693299|gb|ACL01153.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693301|gb|ACL01154.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693303|gb|ACL01155.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693305|gb|ACL01156.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693307|gb|ACL01157.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693309|gb|ACL01158.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693311|gb|ACL01159.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693313|gb|ACL01160.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693315|gb|ACL01161.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693317|gb|ACL01162.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693319|gb|ACL01163.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693321|gb|ACL01164.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693323|gb|ACL01165.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693325|gb|ACL01166.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693327|gb|ACL01167.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693331|gb|ACL01169.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693333|gb|ACL01170.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693337|gb|ACL01172.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693339|gb|ACL01173.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693341|gb|ACL01174.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693345|gb|ACL01176.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693347|gb|ACL01177.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693349|gb|ACL01178.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693355|gb|ACL01181.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693359|gb|ACL01183.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693369|gb|ACL01188.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693371|gb|ACL01189.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693377|gb|ACL01192.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693379|gb|ACL01193.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693381|gb|ACL01194.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693383|gb|ACL01195.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693387|gb|ACL01197.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693389|gb|ACL01198.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693391|gb|ACL01199.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693393|gb|ACL01200.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693395|gb|ACL01201.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693399|gb|ACL01203.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693401|gb|ACL01204.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693403|gb|ACL01205.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693405|gb|ACL01206.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693407|gb|ACL01207.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693411|gb|ACL01209.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693415|gb|ACL01211.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693419|gb|ACL01213.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693421|gb|ACL01214.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693433|gb|ACL01220.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693435|gb|ACL01221.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693439|gb|ACL01223.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693447|gb|ACL01227.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693453|gb|ACL01230.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693455|gb|ACL01231.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693457|gb|ACL01232.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693459|gb|ACL01233.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693461|gb|ACL01234.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693463|gb|ACL01235.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693465|gb|ACL01236.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693467|gb|ACL01237.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693471|gb|ACL01239.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693473|gb|ACL01240.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693475|gb|ACL01241.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|332828717|gb|EGK01409.1| hypothetical protein HMPREF9455_02242 [Dysgonomonas gadei ATCC BAA-286] Length = 340 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 61/175 (34%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 KV ++++I +DVS SM + D +++ A + + +++E + Sbjct: 79 GTKVEKVDKKGIELVIAIDVSNSMMAE-DIKPSRLAKAKQMLTRIIDE-------RRDDK 130 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 +V F+ + L + ++ + T + + + Sbjct: 131 VAIVVFAGEAFIQLPLTPDNQSAKLFLETIDPSLVPVQGTAIGSAIDISMS--------- 181 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 C + D + K IV +TDGE + + ++ G V +GI Sbjct: 182 --CFSNDTDIDKAIVLITDGEGHEGDAEGAAARAADQ----GVHVNVVGIGTTEG 230 >gi|299140482|ref|ZP_07033620.1| von Willebrand factor, type A [Prevotella oris C735] gi|298577448|gb|EFI49316.1| von Willebrand factor, type A [Prevotella oris C735] Length = 289 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 50/102 (49%), Gaps = 7/102 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK +M + + I + + + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLD-FGTVRQTKREMVTEIAATL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAY 267 S+++E++ + G H+ I+ L F S + G+ +Y Sbjct: 126 SDRVEKYIPPKKGRRHILYIIRELLDFKPASVRTDIGMAVSY 167 >gi|321460550|gb|EFX71591.1| hypothetical protein DAPPUDRAFT_255495 [Daphnia pulex] Length = 930 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 40/208 (19%), Positives = 74/208 (35%), Gaps = 34/208 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIE 231 ++V D S SM + +I +S ++ D+++ + G+V F + +I Sbjct: 319 FVLVSDTSGSMNDY-----NRIVRLYESSRRWIK-----YDISDGSKLGMVQFANNARIL 368 Query: 232 EFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + G + + I L T GL+ A + G + Sbjct: 369 SPIVEINGDASREALIARLPVTAVGGTCIGCGLQKAL-DLLRPGGPGG-----------V 416 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLV 348 I+ +TDGE + + K GA V +I + L + S+++ Sbjct: 417 ILLLTDGEETDRPFINDVIS---DVIKSGARVVSIAFGRKAEDKIEDLATKTNGKSYFID 473 Query: 349 ENPHS--MYDAFSHIGKDIVTKRIWYDK 374 +N S + DAF IG + D+ Sbjct: 474 DNDSSQGLNDAF--IGSLTYQPAVSADQ 499 >gi|221197788|ref|ZP_03570834.1| membrane protein [Burkholderia multivorans CGD2M] gi|221181720|gb|EEE14121.1| membrane protein [Burkholderia multivorans CGD2M] Length = 634 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 65/179 (36%), Gaps = 15/179 (8%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R +G + ++ AI++ + +VLG I+V +++F + L + D + + A + Sbjct: 16 RAGSRAQRGSVAVVAAIWIAVALIVLG-SIDVGNLYFQRRDLQRVADMTALAAVQSV--- 71 Query: 69 GNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSL-----DIVVVPQ 123 N+ + N + NGF D S S+ D+ Sbjct: 72 ---NDLCPQTDTTVTASGSNAVVTAAYRGAALNGF--DAQASGNSMSIACGRWDVSDYGA 126 Query: 124 NEGYSISAISR-YKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 GY +A ++ + + +P + + S+ K ++ + + + S Sbjct: 127 AAGYFGTATNQLNAVRVVATKTVPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGS 185 >gi|218693409|gb|ACL01208.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693397|gb|ACL01202.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693367|gb|ACL01187.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693469|gb|ACL01238.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693353|gb|ACL01180.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|218693235|gb|ACL01121.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|194857753|ref|XP_001969024.1| GG25194 [Drosophila erecta] gi|190660891|gb|EDV58083.1| GG25194 [Drosophila erecta] Length = 2154 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 41/219 (18%), Positives = 82/219 (37%), Gaps = 33/219 (15%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D++I+LD S SM F + +A +I ++L+ N+ + +S+++ + Sbjct: 236 DIVILLDHSGSMTGF------RHHVAKFTIRSILDTF----SNNDFFTI--LRYSSEVND 283 Query: 233 FFLLEWG---------VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 G + ++I+ L N T + A+ Q+ +HC T Sbjct: 284 IIPCFNGALVQATPENIEVFNQQIELLDDPEGYANLTLAYETAF-QLLRKYYDSRHCATN 342 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG-------AIVYAIGIRVIRSHEF- 335 + I+ +TDG +T E Q + N + G Y +G V + E Sbjct: 343 ST-CNQAIMLVTDGVAGNTTEVFQKYNWGNG--ENGTSQMDTRVFTYLLGKEVTKVREIQ 399 Query: 336 LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 AC + + V+ +++ I T + ++ Sbjct: 400 WMACLNRGYYSHVQTLDEVHEEVLKYVDVIATPLVLQNE 438 >gi|163786713|ref|ZP_02181161.1| hypothetical protein FBALC1_16047 [Flavobacteriales bacterium ALC-1] gi|159878573|gb|EDP72629.1| hypothetical protein FBALC1_16047 [Flavobacteriales bacterium ALC-1] Length = 288 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 9/107 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L MM++ DVS S F + D + I A L + GL+ F Sbjct: 72 EEERELTMMLMADVSGS--KLFGTKNQFKDEIVTEIAATLAFSATQNNDK----IGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQI 270 S++IE + + G SH+ R I+ L +TN LK+ N + Sbjct: 126 SDEIELYIPPKKGRSHVLRIIRELLEFKPKSKNTNIAEALKFLSNVM 172 >gi|146185721|ref|XP_001471501.1| conserved hypothetical protein [Tetrahymena thermophila] gi|146143100|gb|EDK31249.1| conserved hypothetical protein [Tetrahymena thermophila SB210] Length = 792 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 49/128 (38%), Gaps = 24/128 (18%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++++D S SM + I+ A +++ L+ + ++ + + Sbjct: 282 FLLLIDRSGSMAG------SNIETAKQALIFFLKSL----PEGSIYNIISFGYDYTVMYT 331 Query: 234 FLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 +++ +LQ I + KF T + LKY + D G+R K Sbjct: 332 QSVQFNDQNLQDSIDKIEKFQADMGGTKISQALKYLMYNLQDQYGLR-----------KK 380 Query: 291 IVFMTDGE 298 I +TDGE Sbjct: 381 IYIITDGE 388 >gi|115374651|ref|ZP_01461929.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|115368319|gb|EAU67276.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] Length = 403 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 24/170 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +++++ VLD + SM S + + KI SI + + K P ++ G Sbjct: 50 SQQKTAQPQIEVVFVLDTTGSMSSLLEGAKQKI----FSIASRIATGKPTP----RLKIG 101 Query: 223 LVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN---STPGLKYAYNQIFDMQG 275 LV + + + F L + + ++ G + GL A +++ Sbjct: 102 LVAYRDVGDEYVTKRFDLSEDLDTVFTHLRQFRADGGGDSPEHVGRGLGEAVSKMSWSA- 160 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 K+I + D S +E + A++R +V + Sbjct: 161 --------QRETMKVIFLVGDAPPASREEAWNFKQWSKRAQERHIVVNTV 202 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 41/205 (20%), Positives = 73/205 (35%), Gaps = 34/205 (16%) Query: 174 MMIVLDVSRSMESFF------------DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 + V+DVS SM S + S++ +D+ SI ++ + Sbjct: 35 ICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTLGSED------YL 88 Query: 222 GLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 +VTFS+ F L + + KI+ LS G T GL A N + + Sbjct: 89 SIVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEG-GTELWKGLNSALNILLN----- 142 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 T + N + I +TDG+ + D + + K + G + E + Sbjct: 143 --NKTPNTN--QSIFLLTDGQPTDSGIDTNLVKFKQAYPKLNCTINTFGFSSSSNSELMN 198 Query: 338 ACASP--NSFYLVENPHSMYDAFSH 360 A F + + + AF++ Sbjct: 199 KIAMEYNGMFSFIPDASFIATAFAN 223 >gi|27366971|ref|NP_762498.1| cell surface protein [Vibrio vulnificus CMCP6] gi|27358538|gb|AAO07488.1| Cell surface protein precursor [Vibrio vulnificus CMCP6] Length = 635 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 48/275 (17%), Positives = 93/275 (33%), Gaps = 34/275 (12%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV--VP 122 I N + + + +L I T + + D ++ + + DIV+ Sbjct: 268 ISARYNISLSETRQTTALLQLILQTIGTNKVGGVVDQLVADNNEKNTNAFQFDIVLSFDE 327 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL-------DMM 175 Q + +++ K + T S + P+ +++KV + T + D + Sbjct: 328 QKDNVVMTSTLIAKELFSKYETLIDTTTSESVRAPVNATIKVQNDTITAIEQTVSKADFL 387 Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSI--NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 V+D S SM D T + I + + +V I N+ ++ F+++ E+ Sbjct: 388 FVIDNSGSMGDEQDEISTLTQTFLDEISASGLDYKVGTITTDNDSLRGN--GFTHEPEQI 445 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L F TN + G+ +A + G A Sbjct: 446 ALD----------------FKPGTNGSGYERGIYFAEMALAPTTGTVAVAGYPRAGASLS 489 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 ++ M+D E+ D N G VY+I Sbjct: 490 VIIMSDEESQYPTRDFDVNN--NLFVDNGYRVYSI 522 >gi|218693287|gb|ACL01147.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|254413290|ref|ZP_05027061.1| magnesium chelatase ATPase subunit D [Microcoleus chthonoplastes PCC 7420] gi|196179910|gb|EDX74903.1| magnesium chelatase ATPase subunit D [Microcoleus chthonoplastes PCC 7420] Length = 676 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 39/250 (15%), Positives = 75/250 (30%), Gaps = 44/250 (17%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + N + IV K ++ L + +V D S SM Sbjct: 433 KRIAVDATLRAAAPYQ-KARRERQPNRKVIVESGDMRSKRLARKAGALVVFLV-DASGSM 490 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHL 243 ++ ++ A ++ +L E N Q L+ F + E ++ Sbjct: 491 ------ALNRMQSAKGAVMQLLTEA-----YQNRDQVSLIPFRGEQAEVLLPPTRSIALA 539 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL--- 300 + ++ L G S + GL A + + +IV +TDG Sbjct: 540 RGRLDRLPCGGGSPLAH-GLTQAVRVGTNAKMSGDIGQI-------VIVAITDGRGNIPL 591 Query: 301 ----------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRSHEFLRACASPN 343 K D + L + G + I + + E + Sbjct: 592 ARSLGEPIPEGEKPDIKGELLEIAARIRAMGIQLLVIDTENKFVSTGFAKELAKQAG--G 649 Query: 344 SFYLVENPHS 353 +Y + Sbjct: 650 KYYHLPKATD 659 >gi|159027030|emb|CAO89216.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 266 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 55/172 (31%), Gaps = 32/172 (18%) Query: 174 MMIVLDVSRSMESFF-DSSITKIDMAI--KSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 ++I++D S SM + +S+ + + I ++ ++ + +V + I Sbjct: 14 ILIMIDQSASMSDKYANSNKAEFAALAVNRVIGEIITACSSGNEIKDRCFVAVVGYGASI 73 Query: 231 EEFFLLEW-------GVSHLQRKIK----------------YLSKFGVSTNSTPGLKYAY 267 FL + + L+RK+ T K AY Sbjct: 74 NILFLDKASELAKNPNTTTLKRKVSDGAGGLVEVDEVMRVFVKPTASNGTPMAEAFKQAY 133 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 N G+ + + ++ +++ +TDGE + K Sbjct: 134 N------GVEKFISNHPDSFPPVVINITDGEPNDFNAATTEAKKLAQLKTSD 179 >gi|47219205|emb|CAG11223.1| unnamed protein product [Tetraodon nigroviridis] Length = 950 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 35/251 (13%), Positives = 87/251 (34%), Gaps = 29/251 (11%) Query: 103 FVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSV 162 F + D V + L + + + S S + + + ++ Sbjct: 75 FAVGVQDAV-DSELREMASQPYDTHVFSVDSFLALRDIIQDLVVGICGAVTRSGGAPVAI 133 Query: 163 KVNSQTDA----RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + D+++++D S+++ + + +L+ V+ + +N Sbjct: 134 EAPEAQRGTAPDSADLVLLIDGSQNVGETNFPFVRDL---------VLKIVEALDVRSNA 184 Query: 219 VQSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKY--AYNQIFDMQ 274 V+ L +++ ++ F L + + +K L+ G ++ A N + + Sbjct: 185 VRVALALYTSDLQTKFYLNSYDSKAAVLDAVKGLTYPGGDESNLGAALEDVAMNLLNERA 244 Query: 275 GMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 G R + +++V +T G D + K + + +GI + + Sbjct: 245 GGRA-----EDGVPQMLVVITAG-----SSDDDTAAGDRALKGASVVTFGLGIGDTAAAD 294 Query: 335 FLRACASPNSF 345 L A A+ SF Sbjct: 295 -LEAVATDKSF 304 >gi|281423278|ref|ZP_06254191.1| von Willebrand factor, type A [Prevotella oris F0302] gi|281402614|gb|EFB33445.1| von Willebrand factor, type A [Prevotella oris F0302] Length = 289 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 50/102 (49%), Gaps = 7/102 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ F TK +M + + I + + + G++ F Sbjct: 72 EEERELTVMLLVDVSGSLD-FGTVRQTKREMVTEIAATL--AFSAIQNND---KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTPGLKYAY 267 S+++E++ + G H+ I+ L F S + G+ +Y Sbjct: 126 SDRVEKYIPPKKGRRHILYIIRELLDFKPASVRTDIGMAVSY 167 >gi|113460775|ref|YP_718842.1| hypothetical protein HS_0630 [Haemophilus somnus 129PT] gi|112822818|gb|ABI24907.1| conserved hypothetical protein, with von Willebrand factor (vWF) domain [Haemophilus somnus 129PT] Length = 343 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 42/134 (31%), Gaps = 12/134 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +V+DVS SM ++IN +++ ++ P V ++ F+ Sbjct: 3 RLPIFLVVDVSESMAG------DSHRQMQEAINRLVQRLRCDPYALESVYISVIAFAGAA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + L G T+ L ++I D Sbjct: 57 GVIAPL---TELMSFYAPRLP-MGSGTSLGAALNLTMDEIQRNVVRSSGDQKGDFKPLVY 112 Query: 291 IVFMTDGENLSTKE 304 I ++DG Sbjct: 113 I--LSDGVATDDPT 124 >gi|242242686|ref|ZP_04797131.1| regulator of nitric oxide reductase [Staphylococcus epidermidis W23144] gi|242233822|gb|EES36134.1| regulator of nitric oxide reductase [Staphylococcus epidermidis W23144] Length = 629 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTF--SNKIEEFF---LLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + + ++ + S +++ + ++ A ++ Sbjct: 479 EDAFEADERQQPNIIDEIINYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|57866872|ref|YP_188556.1| hypothetical protein SERP0978 [Staphylococcus epidermidis RP62A] gi|57637530|gb|AAW54318.1| conserved hypothetical protein [Staphylococcus epidermidis RP62A] Length = 629 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTF--SNKIEEFF---LLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + + ++ + S +++ + ++ A ++ Sbjct: 479 EDAFEADERQQPNIIDEIINYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|239940333|ref|ZP_04692270.1| putative magnesium-chelatase subunit [Streptomyces roseosporus NRRL 15998] gi|291443766|ref|ZP_06583156.1| chelatase [Streptomyces roseosporus NRRL 15998] gi|291346713|gb|EFE73617.1| chelatase [Streptomyces roseosporus NRRL 15998] Length = 678 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 50/132 (37%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + + GLVTF + E Sbjct: 495 VLFVVDASGSM-----AARQRMSAVKGAVLSLL-----LDAYQRRDKVGLVTFRGREAEV 544 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK + + ++ +R ++V Sbjct: 545 ALPPTSSVDAAAARLESLPTGGRTPLAAGLLKA--HDVLRVERLRDPSRRP------LLV 596 Query: 293 FMTDGENLSTKE 304 +TDG E Sbjct: 597 VVTDGRATGGPE 608 >gi|229095272|ref|ZP_04226264.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-29] gi|228688131|gb|EEL42017.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-29] Length = 610 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 406 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 452 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 453 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 510 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 511 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 570 >gi|19553918|ref|NP_601920.1| hypothetical protein NCgl2629 [Corynebacterium glutamicum ATCC 13032] gi|62391561|ref|YP_226963.1| hypothetical protein cg3019 [Corynebacterium glutamicum ATCC 13032] gi|41326903|emb|CAF20747.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 893 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 76/249 (30%), Gaps = 53/249 (21%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSM------------ESFFDSSITKIDMAIKSI 202 S++ LD++I++D + S+ E D+ ++ A + Sbjct: 50 ASGSLSNLGACIADKGTLDVIIMIDETESLIHEARDGVVNANEPGADAQHHRVPAAQSFV 109 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG---------VSHLQRKIKYL--S 251 + +L + + D+N ++ + K +G V +Q +I Sbjct: 110 DELLAK-QSDGDLNTRIRVAGFGQTYKSGATDPDNYGAWTQLDASTVGGVQDEISRFADR 168 Query: 252 KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS-TKEDQQSLY 310 TN ++ AY + +++V TDG + D Sbjct: 169 TQEQYTNYASAIEGAYQDFTRS---------GSEDACRMLVTFTDGALTAQEGADVAEAA 219 Query: 311 YC------NEAKKRGAIVYAIGIRVIRS---HEFLRACA----------SPNSFYLVENP 351 C + + G IG+ + LR + +F+ +N Sbjct: 220 LCAPGGVTDRLRSAGITHIGIGLSAPTNPSDFSLLRGTTAGGGTCGVEPANGAFFPADNV 279 Query: 352 HSMYDAFSH 360 ++ AF Sbjct: 280 GGLFAAFRE 288 >gi|319400762|gb|EFV88981.1| von Willebrand factor type A domain protein [Staphylococcus epidermidis FRI909] Length = 629 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 60/177 (33%), Gaps = 29/177 (16%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + +++D S SM K+D IK + E +K + + ++ Sbjct: 426 KQDLSKTFDATFTLLIDASASMHD-------KMDETIKGVVLFHETLKSLNIKHEILAFN 478 Query: 223 LVTF--SNKIEEFF---LLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQG 275 F + + ++ + S +++ + ++ A ++ Sbjct: 479 EDAFEADERQQPNIIDEIINYNYSIFEKEGPRIMTLEPQDDNRDGVAIRIASERLL---- 534 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIR 328 + ++ ++ ++ +DGE + Q + A+K G V+ + + Sbjct: 535 -------QRSHEQRFLIVFSDGEPSAFNYSQDGILDTYEAVERARKFGIEVFNVFLS 584 >gi|313496540|gb|ADR57906.1| Surface adhesion protein, putative [Pseudomonas putida BIRD-1] Length = 8682 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 53/162 (32%), Gaps = 8/162 (4%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S + + +N ++ ++D S SM I A + ++ Sbjct: 8135 TGSEGNDIFVADVSGLNVVQGKNYNIAFMVDSSGSMSD------KSIADAKTQLASVFNT 8188 Query: 209 VK--LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +K L D + V LV F ++ + + K++ + V G Y Sbjct: 8189 LKASLGSDTSGTVNIFLVDFDTQVNKNVAVNLADPDALSKLQAVLNSMVGGYYGGGTNYE 8248 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T + + + F+TDG+ + ++ + Sbjct: 8249 DAFKTTSNFFNSTMATSNKGAENLTYFITDGKPTYYQSNEST 8290 >gi|310820813|ref|YP_003953171.1| hypothetical protein STAUR_3554 [Stigmatella aurantiaca DW4/3-1] gi|309393885|gb|ADO71344.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 403 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 60/170 (35%), Gaps = 24/170 (14%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 +++++ VLD + SM S + + KI SI + + K P ++ G Sbjct: 50 SQQKTAQPQIEVVFVLDTTGSMSSLLEGAKQKI----FSIASRIATGKPTP----RLKIG 101 Query: 223 LVTF----SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTN---STPGLKYAYNQIFDMQG 275 LV + + + F L + + ++ G + GL A +++ Sbjct: 102 LVAYRDVGDEYVTKRFDLSEDLDTVFTHLRQFRADGGGDSPEHVGRGLGEAVSKMSWSA- 160 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 K+I + D S +E + A++R +V + Sbjct: 161 --------QRETMKVIFLVGDAPPASREEAWNFKQWSKRAQERHIVVNTV 202 >gi|300781223|ref|ZP_07091077.1| conserved hypothetical protein [Corynebacterium genitalium ATCC 33030] gi|300532930|gb|EFK53991.1| conserved hypothetical protein [Corynebacterium genitalium ATCC 33030] Length = 465 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 65/254 (25%), Gaps = 35/254 (13%) Query: 98 LRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + +N D + + S+ + + I Sbjct: 224 VYSAIPLNTTDAVTEDQTRAAQAFADFASEQYGNDSQLPVVAESVWAAARPNGGERISTA 283 Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 ++ +D + +LD S M F + I S+ +V Sbjct: 284 AQEHEPAQTEIGEPIDTLFLLDTSDQMAHFAEPVAQSIGAVATSLTDAGRQV-------- 335 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 L +S+ + +R +++ TN + A + R Sbjct: 336 ----ALWNYSSPLNPGV-----TQSFRRNVEF-------TNDGAQIGAAAQGFINAGAPR 379 Query: 278 QHCN-------TEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 E ++ +T G T +D A+ G + + + Sbjct: 380 TREAVAAALKYAESVETPIRVMLITSG----TVDDADVTPAIAHARAAGVELSVVHVGAA 435 Query: 331 RSHEFLRACASPNS 344 + L A+ ++ Sbjct: 436 PKDQVLIDAATSST 449 >gi|268553381|ref|XP_002634676.1| Hypothetical protein CBG19659 [Caenorhabditis briggsae] Length = 346 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 59/169 (34%), Gaps = 14/169 (8%) Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH-- 242 F S+ ++ I S+ +++ + P+ + G+VT++ L S Sbjct: 12 RCFVVLSVIEVAALISSLFVDGQQLGIQPNNPRTTRVGIVTYNQGAHVIADLNNFTSVDE 71 Query: 243 -LQRKIKYLSKFGVSTNSTP-GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + + + L++ S++S A N + + Q NT +Y+K+++ T Sbjct: 72 LIDKTFRVLNQVSTSSDSYLHAGLEAANDLLEQ----QSFNTARGHYQKLVIVYT--SEY 125 Query: 301 STKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR---ACASPNSFY 346 Q L K+ + + L A+P + Sbjct: 126 KGTGSQDPLPLATRMKQT-VSIATVAYNREDDLGVLSELTKIATPGYNF 173 >gi|26986912|ref|NP_742337.1| surface adhesion protein, putative [Pseudomonas putida KT2440] gi|24981520|gb|AAN65801.1|AE016208_1 surface adhesion protein, putative [Pseudomonas putida KT2440] Length = 8682 Score = 38.2 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 53/162 (32%), Gaps = 8/162 (4%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 T S + + +N ++ ++D S SM I A + ++ Sbjct: 8135 TGSEGNDIFVADVSGLNVVQGKNYNIAFMVDSSGSMSD------KSIADAKTQLASVFNT 8188 Query: 209 VK--LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 +K L D + V LV F ++ + + K++ + V G Y Sbjct: 8189 LKASLGSDTSGTVNIFLVDFDTQVNKNVAVNLADPDALSKLQAVLNSMVGGYYGGGTNYE 8248 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 T + + + F+TDG+ + ++ + Sbjct: 8249 DAFKTTSNFFNSTMATSNKGAENLTYFITDGKPTYYQSNEST 8290 >gi|260837272|ref|XP_002613629.1| hypothetical protein BRAFLDRAFT_93670 [Branchiostoma floridae] gi|229299015|gb|EEN69638.1| hypothetical protein BRAFLDRAFT_93670 [Branchiostoma floridae] Length = 532 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 12/58 (20%), Positives = 24/58 (41%) Query: 286 NYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 + +I+F+TDG+ K+ + R ++ G+ + + CA PN Sbjct: 6 DSDNVIIFLTDGKPSDPKDVLDQITEGQYLMDRTVHIFTYGLGNALPNAYSGWCADPN 63 >gi|116626778|ref|YP_828934.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] gi|116229940|gb|ABJ88649.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] Length = 466 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 56/169 (33%), Gaps = 17/169 (10%) Query: 15 YKGGMTI-LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNN 73 +G ++I L I +P++F ++G +++ ++ ++ L+ + + + AA+ ++ + Sbjct: 33 REGAISIQLAVIMVPVLFGMMGFALDLGRLYLVRGELNHAAEAAAIAAASHLIGTTGALD 92 Query: 74 RKKLKGGDILCRIKNTWNMSFRNELRDN-GFVNDIDDIVRSTSLDIVVVPQ------NEG 126 L K + E N + + P Sbjct: 93 AATTAAQQTLTLNKYNFGSLTPGEGSGNLTSTITDPAYFSTVAGATGNDPNGSQADGTTA 152 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRH---------IVMPITSSVKVNS 166 + PL F + + + + I +P+ ++ + Sbjct: 153 RHVQVSLTADAPLLFWSLLSAGQSRKTPIAAQALAGISVPLCTACGIEP 201 >gi|17229602|ref|NP_486150.1| hypothetical protein all2110 [Nostoc sp. PCC 7120] gi|17131201|dbj|BAB73809.1| all2110 [Nostoc sp. PCC 7120] Length = 201 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 48/138 (34%), Gaps = 13/138 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ ++A +S A+ + + V F + Sbjct: 6 DYTLIIDKSGSMSTPDQAGGRSRWEIAQESTLALARKCEQFDPDGITVYLFSGKFKRYDD 65 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 S +I + +TN L+ A N F + + + I Sbjct: 66 V-------TSAKVAQIFLENDPAGTTNLAGVLQDALNNYFQRKAAGKTKPNGET-----I 113 Query: 292 VFMTDGENLSTKEDQQSL 309 + +TDGE K +++ Sbjct: 114 LVITDGEPDDRKAVFETI 131 >gi|291230030|ref|XP_002734973.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 454 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 70/205 (34%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 A LD+ V+D + SM S+ I+ A ++I+ +++E+ + ++ LV + Sbjct: 20 DAAVLDLAFVIDCTSSMGSW-------INEAKENIHNIVDEIVAKEMSD--IRLALVEYR 70 Query: 228 NKIEEFFLLEWGV-------SHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMR 277 + + V +Q+++ ++ G GL +N + + Sbjct: 71 DHPPQDSTFVTRVLDFTPSLKDMQKQMDSMAAHGGGDGPEAVADGLHETFNLNWRPLATK 130 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEF 335 DA + DG + + ++ +Y++ +F Sbjct: 131 VCVLIADAPPHG-LGDSGDGFPKGCPAGHDPMKIARQMAEQNITLYSVVCGSYAEGFKDF 189 Query: 336 LRACA--SPNSFYLVENPHSMYDAF 358 A A + + +++ + Sbjct: 190 FMAIAHVTGGQYVSLKDAKLLSKVI 214 >gi|302665488|ref|XP_003024354.1| conserved hypothetical protein [Trichophyton verrucosum HKI 0517] gi|291188405|gb|EFE43743.1| conserved hypothetical protein [Trichophyton verrucosum HKI 0517] Length = 308 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 58/213 (27%), Gaps = 20/213 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + + A Y T P + + + +S A Sbjct: 24 SPAQKSTYSSNKSTGSSHLDAPPAYTPVSTTSTTAPSFPAASSEGGSLAASPDRPDSFLA 83 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D + ++D S SM + K+ L +V I + + +++ Sbjct: 84 QFDTVFLIDDSGSMR----------GASWKATENALAKVAPICTAQDKDGVEIYFLNHRS 133 Query: 231 EE---FFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + H++ +S G T LK + ++ Sbjct: 134 DSSNGAYSNIRDTGHVREVFTAVSPCGGTPTGTRLNHILKPYLKGFEEASKSMSPDERDN 193 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 A I+ +TDG D AKK Sbjct: 194 AVRPLNIIVITDG----VPTDDVESVIIQAAKK 222 >gi|221091172|ref|XP_002170765.1| PREDICTED: similar to Pfemp3-like protein, putative, partial [Hydra magnipapillata] Length = 475 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 92 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 144 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 145 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 197 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 198 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 253 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 276 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 328 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 329 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 381 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 382 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 437 >gi|218438267|ref|YP_002376596.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218170995|gb|ACK69728.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 206 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 49/133 (36%), Gaps = 13/133 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM S ++ ++A +S A+ + + I V FS + Sbjct: 11 DYTLIIDKSGSMSISDQPGGKSRWEIAQESTLALARKCEQIDPDGITVYL----FSGRFR 66 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S +I + T+ L+ A N F + + + I Sbjct: 67 RY---DNVTSDKVSQIYAENDPMGHTDLASVLQDALNNYFQRKTAGNTKSNGET-----I 118 Query: 292 VFMTDGENLSTKE 304 + +TDGE K Sbjct: 119 LVVTDGEPDDRKA 131 >gi|156404067|ref|XP_001640229.1| predicted protein [Nematostella vectensis] gi|156227362|gb|EDO48166.1| predicted protein [Nematostella vectensis] Length = 218 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 70/218 (32%), Gaps = 15/218 (6%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T++ + D ++D+ IVLD S SM D + E + + Sbjct: 11 TAAKTEQVKCDKKVDLAIVLDASASMGE---------DNYKLAKTLTKEIISRFTISPDK 61 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + L FS L S + K+ L+ + S L + +++ Sbjct: 62 TRVSLNFFSAHHVIVSKLSDNFS--KSKLMSLTDQMMYEKSFSKLATTLEAVHYELLVKK 119 Query: 279 HCNTEDAN-YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS-HEFL 336 K V +TDG T +++ KK ++ + R+ ++ + Sbjct: 120 GGARPKQKGVKMATVLVTDG--YGTAGFEETSDEAKSMKKYNVEMFTVYPEKGRTYNKIM 177 Query: 337 RACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIWYDK 374 AS + + A + IV + +K Sbjct: 178 MHLASKPTKSHLFKLTKDGGARRKVVDQIVKQICSDNK 215 >gi|319782171|ref|YP_004141647.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168059|gb|ADV11597.1| hypothetical protein Mesci_2452 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 509 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 51/139 (36%), Gaps = 13/139 (9%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 IR F+ + +G IL +I LP I + I++S I + L D + AA ++ + Sbjct: 5 IRAFWNDQRGIALILVSITLPAIIGFSLLAIDMSRINNLHNDLQKGADSFALAAAAEL-D 63 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGF-----VNDIDDI-------VRSTS 115 +G + + D+L R ++ ++ + L N I + + Sbjct: 64 GSSGAWVRAERAMDVLVRNESQFSTAGPRILLAGQPGGTQRCNSAGSISWCFLKAIPAAD 123 Query: 116 LDIVVVPQNEGYSISAISR 134 + V Y A Sbjct: 124 GIQITVANQATYLADADPA 142 >gi|300870310|ref|YP_003785181.1| hypothetical protein BP951000_0681 [Brachyspira pilosicoli 95/1000] gi|300688009|gb|ADK30680.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 315 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 49/124 (39%), Gaps = 17/124 (13%) Query: 157 PITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 I + + LD+++V D++ SM ++ + +K K++ +++ + Sbjct: 44 SIEKQMNDEIKRKKGLDLVLVGDLTGSMSAYREKLRSKFMELSKTLFQIIDNL------- 96 Query: 217 NVVQSGLVTF------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + G++ + I + L V L I + G ++ ++ A N + Sbjct: 97 ---RIGIIFYLDHGSGDPYITKVHELSVNVDSLLNFINNVP-NGHGGDANEAVEDALNDV 152 Query: 271 FDMQ 274 +++ Sbjct: 153 YNIN 156 >gi|284052435|ref|ZP_06382645.1| protoporphyrin IX magnesium-chelatase [Arthrospira platensis str. Paraca] Length = 374 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 66/201 (32%), Gaps = 42/201 (20%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI-EE 232 ++ V+D S SM ++ ++ A ++ +L E D Q L+ F + E Sbjct: 179 VVFVVDASGSM------ALNRMQSAKGAVMQLLTEAYQSRD-----QVSLIPFRGEQAEV 227 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 ++ +R+++ + G S + GL A + +Q + +IV Sbjct: 228 LLPPTRSIAAAKRRLERMPCGGGSPLAH-GLTQAVRVGMN---AKQSGDIGQV----VIV 279 Query: 293 FMTDGENL-------------STKEDQQS--LYYCNEAKKRGAIVYAI-----GIRVIRS 332 +TDG K D + L + + G + I + + Sbjct: 280 AITDGRGNIPLARSLGEPILEGEKPDIKGELLEIAAKIRGLGIQLLVIDTENKFVSTGFA 339 Query: 333 HEFLRACASPNSFYLVENPHS 353 E + +Y + Sbjct: 340 KELAKTAG--GKYYHLPKATD 358 >gi|145516504|ref|XP_001444144.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411548|emb|CAK76747.1| unnamed protein product [Paramecium tetraurelia] Length = 829 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 69/201 (34%), Gaps = 26/201 (12%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVV 219 + ++Q R + + ++D S SM S ++I A +++ L+ + N++ Sbjct: 313 GLIIADNQKIKRGNYLFIIDRSGSM------SGSRIKKAKEALILFLKSL-PQDSEYNII 365 Query: 220 QSGLVTFSNKIEEFFLLEWGV-SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 G F+ + ++ + T LK M Sbjct: 366 SFG-TNFTKLWNVSQNYSQNTLETAIKHVEEMDADMGGTCIIAPLKQ----------MIY 414 Query: 279 HCNTEDANYKKIIVF-MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 H N + + VF +TDG++ + + + +Y +GI S +R Sbjct: 415 HKNYGASKNTTLNVFLLTDGQDTADPI----IDLVQKNNLAQTRIYTLGIGRECSQYLIR 470 Query: 338 ACASPNS--FYLVENPHSMYD 356 A + + +V + + + Sbjct: 471 RVAEVGNGKYQIVSDKEDINE 491 >gi|17231904|ref|NP_488452.1| hypothetical protein alr4412 [Nostoc sp. PCC 7120] gi|17133548|dbj|BAB76111.1| alr4412 [Nostoc sp. PCC 7120] Length = 820 Score = 37.8 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 27/211 (12%), Positives = 63/211 (29%), Gaps = 34/211 (16%) Query: 153 HIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 H + + +++ D++ ++D S S + + + ++ L Sbjct: 280 HFALYLIPAIQYRQDQVVPKDVVFLIDTSGS---QMGAPLMQCQELMRRFINGLNPDDTF 336 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 V+ + + + L + R I Y+++ S N + + + Sbjct: 337 SIVD-------FSDTTRQLSPVPLANNAQNRTRAINYINQL--SANGGTEMLRGIRAVLN 387 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + IV +TDG E+Q K G +Y+ G Sbjct: 388 FPVTDPGRL-------RSIVLLTDG--YIGNENQILAEVQQHLKS-GNRLYSFG------ 431 Query: 333 HEFLRACASPNSFYLVENPHSMYDAFSHIGK 363 S + +L+ + + I + Sbjct: 432 ------AGSSVNRFLLNRIAELGRGIAQIIR 456 >gi|324503613|gb|ADY41566.1| Collagen alpha-1(XIV) chain [Ascaris suum] Length = 1024 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 63/169 (37%), Gaps = 23/169 (13%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN-NVV 219 ++ + +LD++ +LD S S+ +++ +++ VK +V + + Sbjct: 846 AIPNDKSDTTKLDVLFLLDTSSSIGE---------AQFQRALKLIVDTVKEFRNVGPDGI 896 Query: 220 QSGLVTFSNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 Q LV F+++ F L I + +N + +++ + Sbjct: 897 QVSLVQFNHEPYLEFSFRKHNCKPELLEDIADTAYMKGVSNLGNAI----DKVAKFGFNK 952 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIG 326 + + DA+ ++V +TDG D + +K G V I Sbjct: 953 RRGDRPDAD--NVLVILTDG-----MSDDKIQQPLQLVRKNGTAVLVIA 994 >gi|300781256|ref|ZP_07091110.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] gi|300532963|gb|EFK54024.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] Length = 528 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 38/242 (15%), Positives = 77/242 (31%), Gaps = 26/242 (10%) Query: 134 RYKIPLKFCTFIPWYTNSRHIV--------MPITSSVKVNSQT-DARLDMMIVLDVSRSM 184 Y+ P+ +P +++ ++ + + + + R VLD S SM Sbjct: 295 SYRRPVTNADLMPAEMSNQTVIELAFPASYGTVEALLDRYNNDYRQRGTATFVLDTSGSM 354 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG-LVTFSNKIEEFFLLEWGVSHL 243 E +I + + ++++ + ++ G LVTF + G Sbjct: 355 EG------ERIASLQQIMTSLIDGSAATATGDVALRDGELVTFQSFSTAPHEPLLGEFLR 408 Query: 244 QRKIKYLSKFG-VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 +I G V+ G Y+ +FD + IV ++DGE Sbjct: 409 DDRITKAKYQGYVNDLVADGQTAIYDTLFDALRSSDPNAGISS-----IVLLSDGEVTHG 463 Query: 303 KEDQQSLYYCNEA--KKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVENPHSMYDAFS 359 + ++R V+ I + E + + + N + AF Sbjct: 464 MDYYAFEKQYQGLSPEQRSIPVFVILYGEANASEMNNLAELTGGAVFDALN-GDLDAAFK 522 Query: 360 HI 361 I Sbjct: 523 EI 524 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 42/132 (31%), Gaps = 4/132 (3%) Query: 32 LVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWN 91 +G+ ++ +F ++T L + +D + AA ++ + + +R G N N Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPDAISRAVSAGAAAGNL--NGVN 100 Query: 92 MSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS 151 + N ++ L YS ++ I + + Sbjct: 101 LQSANWSGQGQITAADISFRDASYLS-TTSATAAVYSQCTHTQSNIGMWLLK-ALGAFSG 158 Query: 152 RHIVMPITSSVK 163 P T +V Sbjct: 159 NSASFPATGNVG 170 >gi|163857684|ref|YP_001631982.1| hypothetical protein Bpet3372 [Bordetella petrii DSM 12804] gi|163261412|emb|CAP43714.1| putative membrane protein [Bordetella petrii] Length = 706 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 32/77 (41%), Gaps = 1/77 (1%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 +G + + TA F +I LV+ + ++ ++F+MK L D + + A + G Sbjct: 10 MHDRQRGSIAVSTA-FAVLIGLVMLLSAQIGYLFYMKRELQKAADLAALSAVQVLAPTGA 68 Query: 71 GNNRKKLKGGDILCRIK 87 ++ + + Sbjct: 69 ASDCAAGSPVAVAAQTS 85 >gi|126306102|ref|XP_001362319.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 911 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 34/170 (20%), Positives = 63/170 (37%), Gaps = 37/170 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 +++V+D SRSM +++ ++ L ++ + +G+VTF + Sbjct: 307 LVLVIDTSRSM-----KVGNRLNRLRQASQFFLLQI-----IEKGSWTGVVTFDSSATIQ 356 Query: 234 FLLEWGVSHLQRK--IKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L S +QRK I L G + GL+ A+ + Sbjct: 357 SELIQIESDVQRKTLISRLPTVTVAGGGAHICSGLRTAFMVVKKK--------------- 401 Query: 289 KIIVFMTDGENLS--TKEDQQSLYYC-NEAKKRGAIVYAIGIRVIRSHEF 335 F+TDG ++ T + + C E K+ GAI++ I + Sbjct: 402 ----FLTDGSEMALLTDGEDNTTNTCFEEVKQSGAIIHTIVLGPSTEKGL 447 >gi|303237431|ref|ZP_07323997.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302482381|gb|EFL45410.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 625 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 64/186 (34%), Gaps = 28/186 (15%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITS------SVKVNSQTDARL 172 YS R P++ I Y S ++ IT+ + + + Sbjct: 399 TNTGNPINYSYPNTWRIGDPIESLDLILSYITSPKLIPGITTKKWERITTESHGSEKKLR 458 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+++V+D S SM+ S + D +++ A + + L+ FS+ + Sbjct: 459 DLLLVVDTSGSMK----SVTREADNMHQAVLASFGILNYFESKK--CKVALIEFSDSVRV 512 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + W V + + + K L+ T R E + + + V Sbjct: 513 D--ITWTVQYDEIREKLLTNGSGGTQF--------------PIHRIQSTLEKSKNELVTV 556 Query: 293 FMTDGE 298 +TDGE Sbjct: 557 VITDGE 562 >gi|322433071|ref|YP_004210320.1| hypothetical protein AciX9_4225 [Acidobacterium sp. MP5ACTX9] gi|321165298|gb|ADW71002.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 457 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 13/42 (30%), Positives = 19/42 (45%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS 52 N G + + A+ L +I LG+ I+V H K L S Sbjct: 7 LVSNESGQVLVAAAVSLLVILGFLGISIDVGHRQLSKLRLQS 48 >gi|225559690|gb|EEH07972.1| U-box domain-containing protein [Ajellomyces capsulatus G186AR] Length = 759 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 68/205 (33%), Gaps = 23/205 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL------IPDVNNVVQSGLV 224 D+++ +DVS SM+S T + + ++ I +N + G+V Sbjct: 73 PCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIV 132 Query: 225 TFSNKIEEFFLLE-WGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS + E + + S+ + +K + STN GLK + + Q Sbjct: 133 AFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERHTLQSV- 191 Query: 282 TEDANYKKIIVFMTDGENLS----TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + +TDG + R +++ G L+ Sbjct: 192 -------QALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRSGLLQ 244 Query: 338 ACAS--PNSFYLVENPHSMYDAFSH 360 A A +F + + + F H Sbjct: 245 AIAEVGGGTFAFIPDAGMIGTVFVH 269 >gi|224106223|ref|XP_002314091.1| predicted protein [Populus trichocarpa] gi|222850499|gb|EEE88046.1| predicted protein [Populus trichocarpa] Length = 786 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + SR + + T A ++ V+D S SM Sbjct: 536 KRLAVDATLRAAAPYQKLRKEKDTQKSRKVYVEKTDMRAKRMARKAGALVIFVVDASGSM 595 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E +S Sbjct: 596 ------ALNRMQNAKGAALKLL-----AESYTSRDQVAIIPFRGDAAEVLLPPSRSISMA 644 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +++++ L G S + + + + + +IV +TDG Sbjct: 645 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRAN 693 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 64/188 (34%), Gaps = 27/188 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++IV D S+ + + SI+ + E V IP + G VT+++ Sbjct: 31 LDIVIVADNSQRVNQN-----NSVVDIQNSISNIFEIV-PIPIN----RVGFVTYNSLAT 80 Query: 232 EFFLL----EWG--VSHLQRKIKYLSKFGVSTNS-TPGLKYAYNQIFDMQGMRQHCNTED 284 L WG + ++ +T+ GL A + Q Sbjct: 81 INADLNKFKSWGDLSQGVNDSYNNMNLSSENTSFIGTGLITAGELL-----QVQGSAIGR 135 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS---HEFLRACAS 341 Y K+I+ + L N K G + + + + + L + AS Sbjct: 136 VYYPKVIIVY--ASAFNGTGLLDPLSVANTLKSAGITIITVAVDTDNNGVIQKQLASIAS 193 Query: 342 PNSFYLVE 349 P S + ++ Sbjct: 194 PGSAFSLD 201 >gi|239979500|ref|ZP_04702024.1| hypothetical protein SalbJ_08693 [Streptomyces albus J1074] gi|291451369|ref|ZP_06590759.1| conserved hypothetical protein [Streptomyces albus J1074] gi|291354318|gb|EFE81220.1| conserved hypothetical protein [Streptomyces albus J1074] Length = 452 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 44/147 (29%), Gaps = 32/147 (21%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +++D S SM+ D ++ A+ + + ++ ++ E + Sbjct: 61 LMVDCSGSMDHPSSKMRHARDATRTAVAAL----------RDGTRFAVIAGTHVAREVYP 110 Query: 236 --------LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + + L G T L+ A + +H Sbjct: 111 GGGRLATADDRTREEAREAVLRLRA-GGGTAIGSWLRLAEQLLTSEPAGVRHG------- 162 Query: 288 KKIIVFMTDGENLSTKEDQ--QSLYYC 312 + +TDG N ++ +L C Sbjct: 163 ----ILLTDGRNEHETPEELRAALDLC 185 >gi|229114220|ref|ZP_04243641.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-3] gi|228669240|gb|EEL24661.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-3] Length = 627 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|87118460|ref|ZP_01074359.1| Magnesium chelatase, ChlI subunit [Marinomonas sp. MED121] gi|86166094|gb|EAQ67360.1| Magnesium chelatase, ChlI subunit [Marinomonas sp. MED121] Length = 602 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 82/272 (30%), Gaps = 22/272 (8%) Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEG 126 +E + + + S + E + VN + + V +N Sbjct: 345 SEDKDTPQADDDDWGSMPAQSQALDKSIQFESQTRVKVNKVFSTPFALQDFFVSKGENAQ 404 Query: 127 YSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S + + I ++ P + ++T + +I+LD S SM Sbjct: 405 GRSQTKSTLSKSVNWHQTITQSIDTEGFQWPPEKLHRHRAKTGQSILHLILLDCSASMLG 464 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRK 246 + K+ A+L+ + + + +N++++ + Sbjct: 465 H--------AVFAKAKGAILDITQQAYLKREQIAM-FIFGNNEVKQIQAKVRAPKDMSEF 515 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQ 306 + + G T GL + R + + + I TDG + D Sbjct: 516 LNNIQA-GGGTPFRDGLLQVATWLT-----RYYKQSPTSQSHTYI--FTDGRTQADVTDI 567 Query: 307 QSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRA 338 + C+ + A I+ ++ + +A Sbjct: 568 ELKGNCSL-----IDIEASVIKRGKAQQLAQA 594 >gi|302337458|ref|YP_003802664.1| hypothetical protein Spirs_0936 [Spirochaeta smaragdinae DSM 11293] gi|301634643|gb|ADK80070.1| conserved hypothetical protein [Spirochaeta smaragdinae DSM 11293] Length = 369 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 65/167 (38%), Gaps = 24/167 (14%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 ++ LD+++ LD ++SME+ T + +K ++ + GL Sbjct: 218 IDKAEGDSLDLVLCLDTTKSMENDIPHLKTSLIPMLKEVSGHFARF----------RFGL 267 Query: 224 VTFSNKIEEF----FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 V + + EE+ F + S LQR + + FG + Y ++ G+ ++ Sbjct: 268 VLYRDYYEEYLTKTFPFQDDFSSLQRVLSGIRVFGGR----DIPEAVYEALYA--GIHEY 321 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC-NEAKKRGAIVYAI 325 + DA K+I+ + D + + +A+ G + I Sbjct: 322 PWSGDA---KMIILVGDAPPHPMPRGRVTKELVYTDAEAAGIELNVI 365 >gi|171847247|gb|AAI61503.1| LOC548952 protein [Xenopus (Silurana) tropicalis] gi|213624339|gb|AAI70959.1| hypothetical protein LOC548952 [Xenopus (Silurana) tropicalis] Length = 896 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 53/132 (40%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ ++ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFMK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ ++ + ++ + + + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDG 297 I++ +TDG Sbjct: 120 LEPSILITITDG 131 >gi|62859103|ref|NP_001016198.1| DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 26B [Xenopus (Silurana) tropicalis] Length = 896 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 53/132 (40%), Gaps = 12/132 (9%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ ++D S SM T +D+A ++ ++ ++ + + LVT+ Sbjct: 4 LLFLIDTSASMNQRTYLGTTYLDIAKGAVEIFMK-LRARDPASRGDRYMLVTYDEPPY-C 61 Query: 234 FLLEWGVSHLQ--RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN---TEDANY- 287 W +H ++K L G++T L+ ++ + ++ + + + Sbjct: 62 IKAGWKENHATFMNELKNLQASGLTTL-GQALRSSF-DLLNLNRLVSGIDNYGQGRNPFF 119 Query: 288 --KKIIVFMTDG 297 I++ +TDG Sbjct: 120 LEPSILITITDG 131 >gi|290971865|ref|XP_002668693.1| predicted protein [Naegleria gruberi] gi|284082192|gb|EFC35949.1| predicted protein [Naegleria gruberi] Length = 454 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 76/196 (38%), Gaps = 33/196 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++I LDVS SM ID A +I+ + E+V PDV L+T+ E Sbjct: 40 QIVIALDVSGSMRGQG------IDQAKIAISNLFEQVVDTPDVV------LITYDTSAE- 86 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN-QIFDMQGMRQHCNTEDANYKKII 291 + L + ++ + G T+ T + N +F+ Q I Sbjct: 87 LYDLRKKPAETRQSTLEQIQAGGGTDFTCVFEAISNLDMFNRQSEVA------------I 134 Query: 292 VFMTDGENLSTKEDQQSLYYCNEA---KKRGAIVYAIGIRVIRSHEFL----RACASPNS 344 +F TDG++ S+ + ++++ + K + + IG L + + + Sbjct: 135 LFFTDGQDGSSHKREKAIEQMKKVLETKTQSFEFHTIGFTSSHDVALLTQITQLGSVQGT 194 Query: 345 FYLVENPHSMYDAFSH 360 F V++ + + + + Sbjct: 195 FQYVKDANEINQSMEN 210 >gi|229101379|ref|ZP_04232123.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-28] gi|228682084|gb|EEL36217.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock3-28] Length = 627 Score = 37.8 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|254508395|ref|ZP_05120516.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] gi|219548708|gb|EED25712.1| protein contAining a von Willebrand factor type A domain [Vibrio parahaemolyticus 16] Length = 696 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 69/228 (30%), Gaps = 31/228 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 P + + + T + T R D + VLD S SM + T ++ + + + Sbjct: 287 PESSKRGTVKLTFTPGDDLTRVTQGR-DWVFVLDKSGSMSGKYS---TLVEGVRQGLGKL 342 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGL 263 + + + + + S++ + ++ + + TN G+ Sbjct: 343 --------PSEDRFRVVMFDSNTYDLTGGFVAVNQSNISKALQAVEQVEPSNGTNLYEGM 394 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVY 323 A ++ D + IV +TDG ++ + + +K ++ Sbjct: 395 AAAIRKL-------------DDDRPTGIVLVTDG--VANVGVTEKRRFFELMEKHDVRLF 439 Query: 324 AIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 + + L S V N + + + + Sbjct: 440 TFIMGNSANTPLLVPMTKLSNGVATSVSNADDIIGHLMSMTSKLTHQA 487 >gi|294673500|ref|YP_003574116.1| hypothetical protein PRU_0757 [Prevotella ruminicola 23] gi|294473545|gb|ADE82934.1| conserved hypothetical protein [Prevotella ruminicola 23] Length = 289 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ + + + + ++ + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDFGTQKQMKRDMVTEIAATLAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S+KIE++ + G H+ I+ + F + T +K A + +Q R Sbjct: 126 SDKIEKYIPPKKGRKHILYIIREMLDFKPESTRTD-IKQAIEFLSSVQKRR 175 >gi|294011132|ref|YP_003544592.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] gi|292674462|dbj|BAI95980.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] Length = 157 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 1/69 (1%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDR-SLVHAATQIM 66 ++ + G T+ AI +P++ L+ M +V+ F K L +R + + AA Sbjct: 2 MKRLLSDRYGNSTVELAIIMPVLVLLTCMAGDVAMAFKAKIGLQRAAERTAQLAAAGGYT 61 Query: 67 NEGNGNNRK 75 N+ ++ Sbjct: 62 NDTTDTSKA 70 >gi|30680676|ref|NP_563821.2| CHLD; ATP binding / magnesium chelatase/ nucleoside-triphosphatase/ nucleotide binding [Arabidopsis thaliana] gi|148886934|sp|Q9SJE1|CHLD_ARATH RecName: Full=Magnesium-chelatase subunit chlD, chloroplastic; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase; Flags: Precursor gi|17380694|gb|AAL36177.1| putative Mg-chelatase [Arabidopsis thaliana] gi|20259647|gb|AAM14341.1| putative Mg-chelatase [Arabidopsis thaliana] gi|110741163|dbj|BAE98674.1| Mg-chelatase like protein [Arabidopsis thaliana] gi|332190180|gb|AEE28301.1| magnesium-chelatase subunit chlD [Arabidopsis thaliana] Length = 760 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 59/179 (32%), Gaps = 20/179 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + + +R + + T A ++ V+D S SM Sbjct: 510 KRLAVDATLRAAAPYQKLRREKDISGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 569 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 570 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDAAEVLLPPSRSIAMA 618 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + +++ L G S + + + + + +IV +TDG T Sbjct: 619 RNRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANIT 669 >gi|312221289|emb|CBY01229.1| hypothetical protein [Leptosphaeria maculans] Length = 1122 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 58/175 (33%), Gaps = 22/175 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN-KIEE 232 ++ V D S SM + ++A++++ L + + V F+ + Sbjct: 294 IVFVCDQSGSMRGDRN------NLAVQALKVFLRSLPVG-----------VKFNICRFGS 336 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + W S + +T+ Y ++ ++ + I+ Sbjct: 337 NYSFLWPQSVTYSQ----KSLDEATSHIATFCADYGGTEMLKPLKATIEQRYKDIPLEIM 392 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYL 347 +TDGE + ++ L + V+ +G+ SH + A + + Sbjct: 393 LLTDGEIWNQQDLFTYLNESIGQSRAPIRVFTLGMGNGVSHALIEGIAKAGNGFS 447 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 48/121 (39%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 +++ F + +G ++++ A+F+ ++ V+ I++S + L S++D + + AA ++ Sbjct: 1 MSLYKFCKSTQGNVSVMAALFIGLLIAVIAGTIDISQKSSLNRELQSVVDAAALAAAREM 60 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 R + + + L + + + T I+ Q E Sbjct: 61 AVSSADQTRVQSVASSYVDANWTGEQATTHAVLDVSKGIITVSSTAPKTIASILKKDQKE 120 Query: 126 G 126 Sbjct: 121 T 121 >gi|197302285|ref|ZP_03167344.1| hypothetical protein RUMLAC_01012 [Ruminococcus lactaris ATCC 29176] gi|197298716|gb|EDY33257.1| hypothetical protein RUMLAC_01012 [Ruminococcus lactaris ATCC 29176] Length = 273 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 7/98 (7%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 + ++ YN KG +TI A+ +P+ + +I + + ++ VLH+ + Sbjct: 20 GIKVKK--YNIKGIITIEAALAIPLFLFAVICLIYLIEVRNIRLVLHNASQSAAKETVED 77 Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG 102 + N+ K + IKN L D G Sbjct: 78 LAVLPVFNSVKMQQKI-----IKNIGEERLERSLLDGG 110 >gi|213626825|gb|AAI70192.1| Complement factor B [Xenopus laevis] Length = 745 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 72/234 (30%), Gaps = 33/234 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S V D +++ IVLD S+S+ + + Sbjct: 218 NVDTTNLEDRSDRSVQILKDGLMNIFIVLDTSKSVGEEKFEEAKEASKL---------FI 268 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFG----VSTNSTP 261 + + D + + ++++++ L + + ++ TN+ Sbjct: 269 EKMADYDIKPRYCIISYASVAIAVVSLRDPDSNDAEAVTKHLEDFQYNNHADKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L Y + + + ++ + +I+ MTDG+ + ++ + G Sbjct: 329 ALHSIYEHLIEQELAYEKEGKKADFMKIHNVILLMTDGKFNMGGDPREEMKLIIRFLDIG 388 Query: 320 AI----------VYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFS 359 VY G+ +E + +EN + M + F Sbjct: 389 IRTENPRLEYLDVYVFGLGSDIDQPEINELASKKDKEVHTFHLENVNKMKEFFE 442 >gi|153809330|ref|ZP_01961998.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] gi|149128100|gb|EDM19321.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] Length = 611 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 95/292 (32%), Gaps = 31/292 (10%) Query: 53 MIDRSLVHAATQIMNEGNGNNRK-KLKGGDILCRIKNTWNMSFRNELRDNGFVND----- 106 I + + + N + + + + +F ++ + N Sbjct: 120 AITSAYMAICPTPAYDMCVNTEEYGSFQENGFKEVADAALSTFSIDVDAASYSNMRRFVN 179 Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIVMPITSSVK 163 ++ ++ + Y + P+K PW +R + + + + + Sbjct: 180 KGELPPVDAVRTEELVNYFSYDYPKPTGND-PVKITMEAGACPWNPANRLVRIGLKAK-E 237 Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + + +++ ++DVS SM ++D+ S+ ++ ++ V V SG Sbjct: 238 IPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYSGS 292 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 K+E + ++ I L+ G + + AY +I + N Sbjct: 293 AG--VKLESTSGSD--KQKIREAIDELTAGGSTAGGAGIML-AY-KIAKKNFISNGNNR- 345 Query: 284 DANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ +DG+ N+ + +K G + +G + + Sbjct: 346 -------IILCSDGDFNVGVSSAEGLEQLIERERKSGVFLTVLGYGMGNYKD 390 >gi|148237705|ref|NP_001079047.1| complement factor B [Xenopus laevis] gi|1030718|dbj|BAA08371.1| Xenopus Bf B [Xenopus laevis] Length = 745 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 30/234 (12%), Positives = 72/234 (30%), Gaps = 33/234 (14%) Query: 150 NSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEV 209 N + S V D +++ IVLD S+S+ + + Sbjct: 218 NVDTTNLEDRSDRSVQILKDGLMNIFIVLDTSKSVGEEKFEEAKEASKL---------FI 268 Query: 210 KLIPDVNNVVQSGLVTFSNKIEEFFLL----EWGVSHLQRKIKYLSKFG----VSTNSTP 261 + + D + + ++++++ L + + ++ TN+ Sbjct: 269 EKMADYDIKPRYCIISYASVAIAVVSLRDPDSNDAEAVTKHLEDFQYNNHADKQGTNTRA 328 Query: 262 GLKYAYNQIFDMQGM--RQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 L Y + + + ++ + +I+ MTDG+ + ++ + G Sbjct: 329 ALHSIYEHLIEQELAYEKEGKKADFMKIHNVILLMTDGKFNMGGDPREEMKLIIRFLDIG 388 Query: 320 AI----------VYAIGIRV----IRSHEFLRACASPNSFYLVENPHSMYDAFS 359 VY G+ +E + +EN + M + F Sbjct: 389 IRTENPRLEYLDVYVFGLGSDIDQPEINELASKKDKEVHTFHLENVNKMKEFFE 442 >gi|84385695|ref|ZP_00988726.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] gi|84379675|gb|EAP96527.1| hypothetical protein V12B01_26214 [Vibrio splendidus 12B01] Length = 436 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 44/98 (44%), Gaps = 8/98 (8%) Query: 170 ARLDMMIVLDVSRSMESFFDS---SITKIDMAIKSINAMLEEVKLIPDVNN-VVQSGLVT 225 +D+ +LD S+SM + + + T++D+ ++I ++E++K + Q LVT Sbjct: 125 QPVDIYFILDTSQSMSNRWSGEKNNKTQMDVVKETIVKVVEDLKTFKTGDEKKSQISLVT 184 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL 263 ++ +F V Y F +T + G+ Sbjct: 185 YNAYNAKFDKKSNQVKLY----DYTRDFKHTTETFEGI 218 >gi|260837445|ref|XP_002613714.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] gi|229299103|gb|EEN69723.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] Length = 1875 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 15/76 (19%), Positives = 33/76 (43%), Gaps = 6/76 (7%) Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYLVE 349 ++ +TDG K N+ +G VYAIG+ + + L A + N+ + Sbjct: 1 MIVVTDG-----KSGDSVASSANDLASQGVDVYAIGVGNYDATQLLEIAAGNQNNVIELT 55 Query: 350 NPHSMYDAFSHIGKDI 365 + +++ + I + + Sbjct: 56 DFNALSAEINQIAQTV 71 >gi|163846015|ref|YP_001634059.1| magnesium chelatase ATPase subunit D [Chloroflexus aurantiacus J-10-fl] gi|222523743|ref|YP_002568213.1| magnesium chelatase ATPase subunit D [Chloroflexus sp. Y-400-fl] gi|163667304|gb|ABY33670.1| magnesium chelatase ATPase subunit D [Chloroflexus aurantiacus J-10-fl] gi|222447622|gb|ACM51888.1| magnesium chelatase ATPase subunit D [Chloroflexus sp. Y-400-fl] Length = 634 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 24/151 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 + A ++D S SM ++ A ++NA+L++ + D Q L+ F Sbjct: 437 SKAGTLFCFLVDASGSMALH------RMRQAKGAVNALLQQAYVHRD-----QVALLAFR 485 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + V +R + L G T L AY Sbjct: 486 GERADLLLPPSQSVELAKRALDVLPT-GGGTPLAAALLAAYQI--------SEQARSRGI 536 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++ IV +TDG + ++ ++A++ Sbjct: 537 FRTTIVLITDGRPNVPLKADPTM---DKARR 564 >gi|307109997|gb|EFN58234.1| hypothetical protein CHLNCDRAFT_50640 [Chlorella variabilis] Length = 1183 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 49/148 (33%), Gaps = 21/148 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + +VLD S SM + +D A +++ + V LV F+ Sbjct: 134 AVRLYLVLDNSGSMAG------SPLDTAKRAVIRFAQAAPAAAGVMEACW--LVEFNTST 185 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH--------CNT 282 LL G + ++ + + G + A + + D+ Sbjct: 186 RAHSLLGLGGAQIEEAVGTIPAGGST-----CFHRALSTLADLAAPSAPLGATAAAAAAA 240 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLY 310 A + +VF +DG + + +L Sbjct: 241 AVAAARHFVVFFSDGCDSGYGGVEAALE 268 >gi|297558997|ref|YP_003677971.1| peptidase S1 and S6 chymotrypsin/Hap [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843445|gb|ADH65465.1| peptidase S1 and S6 chymotrypsin/Hap [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 851 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 64/203 (31%), Gaps = 29/203 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSI---TKIDMAIKSINAMLEE----VKL 211 S ++ ++ A +++++V+D S SM T +A ++ + E+ + Sbjct: 635 ESGLRQQERSRAPVEVLLVVDRSDSMSGPGTGGTRLATAQQLARTAVGLLGEQDTVGIWT 694 Query: 212 IP---DVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA 266 P N+V +G ++ + I LS +T ++ Sbjct: 695 FPEGGPGNDV--TG----QERVLPPEPGMDARREESAREAIDALSAEFPATPLADAVRDG 748 Query: 267 YNQIFDMQGMRQHCNTEDANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAI 321 + + + +V +TDG + D + + ++ Sbjct: 749 ------AEELESCADRPGEPGACALVVLTDGVALPEPRGGARADDVAAVLEDLDERVRVH 802 Query: 322 VYAIGIRVIRSHEFLRACASPNS 344 V ++G L A + Sbjct: 803 VVSVGDEGCGGDGLLSRLAGAGA 825 >gi|198419820|ref|XP_002120348.1| PREDICTED: similar to type A von Willebrand factor (VWFA) domain-containing protein [Ciona intestinalis] Length = 863 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 40/231 (17%), Positives = 73/231 (31%), Gaps = 41/231 (17%) Query: 174 MMIVLDVSRSM-------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 ++ +DVS SM + +T++ ++I + ++++ + GLVTF Sbjct: 355 LIFCIDVSGSMCVSSQVKSGNGNVFVTRLQGVQQAIVDQINKIEVTRPN---TRIGLVTF 411 Query: 227 SNKI--------EEFFLLEWGVSHLQRKI----KYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 S+ + E+ LL + I K G +N+ L A N + Sbjct: 412 SSLVNGIGDGSPEQEILLSDFQLYDFDFIFSTGKTAPIPGTISNTRVQLSRALNSLNTSG 471 Query: 275 GM------RQHCNTEDANYKKIIVFMTDGENL---STKEDQQSLYYC--------NEAKK 317 G ++ TDG + E + C A Sbjct: 472 GTALGPALLYSVALASQRPGSQVIVCTDGRANMGIGSLETEDDYQICHHFYDSTTEMALS 531 Query: 318 RGAIV--YAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIV 366 G IV +I R E R + + + ++ F I + V Sbjct: 532 NGVIVSMMSIVGTDCRLVELGRVASKTAGKVSIVDLENISKTFEDILSESV 582 >gi|183235310|ref|XP_653311.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|169800623|gb|EAL47926.2| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 690 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 40/221 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + SS +N + + ++++ + D S SM I + L ++ +N+ Sbjct: 198 VISSTFINKEKEGNINIVFICDRSGSMYGEG------IKALRNMLQLFLRQL----PLNS 247 Query: 218 VVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q ++F + + F E + + + TN + + D Sbjct: 248 KFQI--ISFGSTYDFMFKEMTEYNEDTLKFASETVSHFEANYGGTN----MHAPLKALID 301 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIR 331 + H I+ +TDG ++ Y KK +G+ Sbjct: 302 NNTEKCH-----------IILLTDG---YVDNKTSTIEYIRTLSKKNSLH--GVGLGSSC 345 Query: 332 SHEFLRACASPNSFYLV--ENPHSMYDAFSHIGKDIVTKRI 370 E +R + +NP+ + S I + I+ I Sbjct: 346 DVELIRNIGRIGNGISAISKNPNVLKKEVSKITERILIPSI 386 >gi|94989622|ref|YP_597722.1| collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|94543130|gb|ABF33178.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|198417325|gb|ACH87880.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417327|gb|ACH87881.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417329|gb|ACH87882.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417331|gb|ACH87883.1| ancillary protein 1 [Streptococcus pyogenes] Length = 890 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 40/83 (48%), Gaps = 5/83 (6%) Query: 149 TNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + + V ++ + LD++++LD S SM + + + A +++ +++ Sbjct: 198 KNKYKIELTVEGKTIVETKELNQPLDVVLLLDNSNSMNNERAHNSQRALKAGEAVEKLID 257 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI 230 ++ N + LVT+++ I Sbjct: 258 KITS----NKDNRVALVTYASTI 276 >gi|10198152|gb|AAG15217.1|AF288461_3 BchD [Chloroflexus aurantiacus] Length = 634 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 24/151 (15%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 + A ++D S SM ++ A ++NA+L++ + D Q L+ F Sbjct: 437 SKAGTLFCFLVDASGSMALH------RMRQAKGAVNALLQQAYVHRD-----QVALLAFR 485 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + V +R + L G T L AY Sbjct: 486 GERADLLLPPSQSVELAKRALDVLPT-GGGTPLAAALLAAYQI--------SEQARSRGI 536 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 ++ IV +TDG + ++ ++A++ Sbjct: 537 FRTTIVLITDGRPNVPLKADPTM---DKARR 564 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 34/269 (12%), Positives = 90/269 (33%), Gaps = 32/269 (11%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R F + G T++ + I ++ + +E + I + L ++++++ + N Sbjct: 10 RRFIQDESGVYTVMGGLLALPILALIFVSLESAGIIQDQARLSDSLEQAVLSLTAENNNG 69 Query: 69 GNGNNRK----KLKGGDILCRIKNTWNMSFRNELRDNGFVN--------DIDDIVRSTSL 116 N+ K K D R+ FV + ++ + Sbjct: 70 RKDNDYKLSGSSNKENDSFDISSEVGK---RDSQMVTTFVKAFLPQTNEEKMHLIPTCKT 126 Query: 117 DIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNS-----RHIVMPITSSVKVNSQTDAR 171 + S ++ ++ P S + + S + + Sbjct: 127 KTDTNKKGHTSSSEVTCTVSGTIEHKSWFPLKVGSVEVIPHEVNVASKSKAFKKNTFNIP 186 Query: 172 LDMMIVLDVSRSM-------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 +D+M+V D+S SM S ++K+ + ++ + E+ L + N+ + + Sbjct: 187 IDLMVVADLSGSMNYDLSNKNEIVGSPMSKLGILQDVLSELAEKTLLSEEANHNNRIYVT 246 Query: 225 TFS-----NKIEEFFLLEWGVSHLQRKIK 248 F+ + W ++ ++++ Sbjct: 247 PFALGAEISSSNCAIPYSWDMNKNNQELE 275 >gi|303243772|ref|ZP_07330113.1| Magnesium chelatase [Methanothermococcus okinawensis IH1] gi|302486014|gb|EFL48937.1| Magnesium chelatase [Methanothermococcus okinawensis IH1] Length = 670 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 27/217 (12%), Positives = 72/217 (33%), Gaps = 23/217 (10%) Query: 88 NTWNMSFRNELRDNGFVNDIDDIVRSTSL--DIVVVPQNEGYSISAISRYKIPLKFCTFI 145 N + ++ + + I + + + + A R + Sbjct: 458 NIIKIQLKDNISRYSSGRHVKSISKKGRYIKYRIPNGKTTDIAFDATLRASALYQKDRKE 517 Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 + + + + + +++ +D S SM ++ +++ A ++ ++ Sbjct: 518 KSDNKNVALYLKKEDLREKIRERKISTNILFSVDASGSM-----GAMKRMEAAKGAVVSL 572 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGL 263 L + + ++TF + + LL S ++ K LS G T + Sbjct: 573 L-----MDAYQKRNKVSMITF--RKDSAELLLPFTSSVELAEKCLSETPTGGKTPLSKAF 625 Query: 264 KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +Y I + ++ N I+VF++D + Sbjct: 626 LKSYEVIKN-------NMRKNPNIIPIVVFISDFKPN 655 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 26/76 (34%), Gaps = 12/76 (15%) Query: 296 DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV---IRSHEFLRACAS-PNSFYLVENP 351 DG+ + K + Y +G +LR AS N+++ + Sbjct: 283 DGDYSDPRG--------TNFKNQLVQTYTVGFGSGVSATGEAYLRNGASGSNNYFNARDE 334 Query: 352 HSMYDAFSHIGKDIVT 367 + ++ AF I I Sbjct: 335 NGLFAAFDAITDSIAN 350 >gi|228919479|ref|ZP_04082843.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|229068309|ref|ZP_04201612.1| Von Willebrand factor type A domain protein [Bacillus cereus F65185] gi|228714770|gb|EEL66642.1| Von Willebrand factor type A domain protein [Bacillus cereus F65185] gi|228840122|gb|EEM85399.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 610 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 406 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 452 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 453 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 510 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 511 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 570 >gi|225621507|ref|YP_002722766.1| hypothetical protein BHWA1_02609 [Brachyspira hyodysenteriae WA1] gi|225216328|gb|ACN85062.1| putative membrane protein containing von Willebrand factor (vWA) type A domain [Brachyspira hyodysenteriae WA1] Length = 324 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 72/210 (34%), Gaps = 29/210 (13%) Query: 122 PQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVS 181 ++ Y IS +I + + ++ P ++ +TD + + I LD+S Sbjct: 20 QNDKNY--KRISNLRIFSIIFMILASASLVFALMQPKWGIIEQKIKTDNYM-VTIALDLS 76 Query: 182 RSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVS 241 RSM++ D ++++ A I +++ + LV F+ + Sbjct: 77 RSMDAD-DVWPSRLERAKLEIEKFVKKTDNLS-------VALVGFAGTSFIASPFTQDME 128 Query: 242 HLQRKIKYLSKFG---VSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 + L+ T L A N DA KK I+ +TDGE Sbjct: 129 TFTYILDNLTTKSVTLQGTRIADALVTAKNTFNV-----------DAVSKKSIILITDGE 177 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + D + + VY +G+ Sbjct: 178 DHGGYFDD----ILKQLNEMNVSVYTVGVG 203 >gi|153869156|ref|ZP_01998833.1| peptidase S8 and S53, subtilisin, kexin, sedolisin [Beggiatoa sp. PS] gi|152074295|gb|EDN71164.1| peptidase S8 and S53, subtilisin, kexin, sedolisin [Beggiatoa sp. PS] Length = 2023 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 49/157 (31%), Gaps = 33/157 (21%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+ V+D++ SM+ + I +L+ + I N + LVTF + Sbjct: 1877 GIEVDIAFVIDITGSMQGEMNG----------VIKGLLDVIAEIGANNAPL-IALVTFRD 1925 Query: 229 KIE-EFFLLEWGVSHLQRKIKYLSKFGVST---NSTPGLKYAYNQIFDMQGMRQHCNTED 284 + E E L + L IK L G T S + H Sbjct: 1926 EDEVELQALTSDLEALVGVIKGLKASGGGTCNEASAEAISLVI----------PHVKKGG 1975 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI 321 I+F TD S D + + G Sbjct: 1976 T-----ILFATD---ASPYPDADIEALAQQLTENGIR 2004 >gi|156355315|ref|XP_001623615.1| predicted protein [Nematostella vectensis] gi|156210333|gb|EDO31515.1| predicted protein [Nematostella vectensis] Length = 1190 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 57/169 (33%), Gaps = 23/169 (13%) Query: 179 DVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW 238 D S S+ I ++I+ E K + N + + G + FS + E L Sbjct: 590 DASGSVG---------ISNWERTISFTGELGKKLLQNNPMSRVGAIDFSEQANEAIPLTS 640 Query: 239 GVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTD 296 + L+ ++ L T + GL+ AY+ ++ +TD Sbjct: 641 DQAALETGVQALKYQYQNGMTRTDKGLELAYDLFKTNVRADAGQM---------LIVVTD 691 Query: 297 GE--NLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRSHEFLRACASP 342 G K + + ++A + G GI + E L AS Sbjct: 692 GHTTPWDGKTGIELIKPPSDAIRSLGVHTIVAGIDERVNEEELYYMASD 740 >gi|183234776|ref|XP_001914078.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|169800919|gb|EDS89147.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 508 Score = 37.8 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 71/221 (32%), Gaps = 40/221 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + SS +N + + ++++ + D S SM I + L ++ +N+ Sbjct: 198 VISSTFINKEKEGNINIVFICDRSGSMYGEG------IKALRNMLQLFLRQL----PLNS 247 Query: 218 VVQSGLVTFSNKIEEFFL-----LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 Q ++F + + F E + + + TN + + D Sbjct: 248 KFQI--ISFGSTYDFIFKEMTEYNEDTLKFASETVSHFEANYGGTN----MHAPLKALID 301 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIR 331 + H I+ +TDG ++ Y KK +G+ Sbjct: 302 NNTEKCH-----------IILLTDG---YVDNKTSTIEYIRTLSKKNSLH--GVGLGRSC 345 Query: 332 SHEFLRACASPNSFYLV--ENPHSMYDAFSHIGKDIVTKRI 370 E +R + +NP+ + S I + I+ I Sbjct: 346 DVELIRNIGRIGNGISAISKNPNVLKKEVSKITERILIPSI 386 >gi|270292390|ref|ZP_06198601.1| conserved hypothetical protein [Streptococcus sp. M143] gi|270278369|gb|EFA24215.1| conserved hypothetical protein [Streptococcus sp. M143] Length = 454 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 62/178 (34%), Gaps = 18/178 (10%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + V + Q D++ +D S SM+S + + I+ ++ + K + Sbjct: 175 EADVIITKQAAGSADIVFTIDKSVSMDSSIQNVVQNIENFVRDLAN-----KKVD----- 224 Query: 219 VQSGLVTF---SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + GL+ + N F +++ IK L + G + + + Sbjct: 225 ARLGLIAYEAADNVQYFDFNGSKFTKNVEEYIKALRSI----RTYGGTEEPTVPLHHIAT 280 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEA-KKRGAIVYAIGIRVIRS 332 + + + +N ++ +TD ++ +A K + +G+R S Sbjct: 281 SKDYDWSTASNNRRFAFLITDESIDFRTPGIPTVEETLKALKAADISLSVVGMRYTES 338 >gi|229188830|ref|ZP_04315864.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 10876] gi|228594643|gb|EEK52428.1| Von Willebrand factor type A domain protein [Bacillus cereus ATCC 10876] Length = 610 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 406 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 452 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 453 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 510 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 511 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 570 >gi|5813829|gb|AAD52031.1|AF083555_1 magnesium-chelatase subunit CHLD precursor [Arabidopsis thaliana] Length = 726 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 59/179 (32%), Gaps = 20/179 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + + +R + + T A ++ V+D S SM Sbjct: 476 KRLAVDATLRAAAPYQKLRREKDISGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 535 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 536 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDAAEVLLPPSRSIAMA 584 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + +++ L G S + + + + + +IV +TDG T Sbjct: 585 RNRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANIT 635 >gi|228963719|ref|ZP_04124861.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar sotto str. T04001] gi|228795955|gb|EEM43421.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar sotto str. T04001] Length = 610 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 406 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 452 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 453 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 510 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 511 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 570 >gi|229159703|ref|ZP_04287713.1| Von Willebrand factor type A domain protein [Bacillus cereus R309803] gi|228623782|gb|EEK80598.1| Von Willebrand factor type A domain protein [Bacillus cereus R309803] Length = 627 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|206967848|ref|ZP_03228804.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|218231623|ref|YP_002365420.1| hypothetical protein BCB4264_A0660 [Bacillus cereus B4264] gi|228951116|ref|ZP_04113232.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar kurstaki str. T03a001] gi|229077936|ref|ZP_04210546.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock4-2] gi|229177150|ref|ZP_04304539.1| Von Willebrand factor type A domain protein [Bacillus cereus 172560W] gi|206736768|gb|EDZ53915.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|218159580|gb|ACK59572.1| conserved hypothetical protein [Bacillus cereus B4264] gi|228606331|gb|EEK63763.1| Von Willebrand factor type A domain protein [Bacillus cereus 172560W] gi|228705394|gb|EEL57770.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock4-2] gi|228808526|gb|EEM55029.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar kurstaki str. T03a001] Length = 627 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 49/139 (35%), Gaps = 14/139 (10%) Query: 11 FFYNYK----GGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 N K G + + + L + L + + ++ ++ K L ++ D + + A Q+ Sbjct: 1 MHKNTKKKQQGAIGLFGVLTLLMAVLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVG 60 Query: 67 NEGNGNNRKKLKGGDILCRIKNTWNMSFR---NELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 N+ + K N + + N ++ G+ D D I Sbjct: 61 GCAQNNSSEAYKAAAQAAAAANGYQGNLLAAPNAVQLGGYHTDSDGIR-------TFAAN 113 Query: 124 NEGYSISAISRYKIPLKFC 142 NE ++ ++ ++P Sbjct: 114 NERSAVRVLATQEVPSSLF 132 >gi|268532310|ref|XP_002631283.1| C. briggsae CBR-CLEC-60 protein [Caenorhabditis briggsae] gi|187036877|emb|CAP23542.1| CBR-CLEC-60 protein [Caenorhabditis briggsae AF16] Length = 408 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 57/181 (31%), Gaps = 17/181 (9%) Query: 189 DSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW--GVSHLQRK 246 + + I +I S+ + + P + GLVT++ L+ + + Sbjct: 56 NGGLINIAASIASVFSNGTRIGTDPYEPRTTRLGLVTYNAVANTVANLDTYQSLDDVYDG 115 Query: 247 I-KYLSKFGVSTNSTP--GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK 303 I LS+ S S GL A + + D + N +Y+++++ Sbjct: 116 IFTALSQVSSSDESYIVHGLAQAEDILEDGK-----SNKNRTHYQRVVIVY--ASTYKGT 168 Query: 304 EDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF---LRACASPNSFYLVENPHSMYDAFSH 360 + + K G + + L+ A+P Y N + Sbjct: 169 GPLDPIPVADRLKTAGVTIVTVAYDQDGDGALLADLQLIATPP--YNFSNTDQAGNEIGE 226 Query: 361 I 361 I Sbjct: 227 I 227 >gi|317123952|ref|YP_004098064.1| protoporphyrin IX magnesium-chelatase [Intrasporangium calvum DSM 43043] gi|315588040|gb|ADU47337.1| protoporphyrin IX magnesium-chelatase [Intrasporangium calvum DSM 43043] Length = 723 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 65/252 (25%), Gaps = 17/252 (6%) Query: 53 MIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVR 112 D + G+ + I Sbjct: 407 ATDSRDLPEEGAATGAPPGSAGRPTTAAAGSPYRARVLTAKGVGHGESGRRSRAITSTGH 466 Query: 113 STSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL 172 + + I +V+ ++ + Q Sbjct: 467 AIGATADRGSADRSTGRGLHLPATIRAAARHQQRRGRTDGPLVLDVSDLRRPVRQGRESN 526 Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++ +D S SM ++ +++ +I ++L + + GLVTF E Sbjct: 527 LVLLCVDASGSM-----AARRRMEAVKAAILSLL-----LDAYQRRDKVGLVTFRGAGAE 576 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L + + G T GL AY+ + + D + ++V Sbjct: 577 LVLPPTSSVDIAARRLADLAHGGRTPLAEGLLTAYSALRTER-------IRDPRRRPLLV 629 Query: 293 FMTDGENLSTKE 304 +TDG + + Sbjct: 630 VVTDGRATAGPD 641 >gi|226308012|ref|YP_002767972.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] gi|226187129|dbj|BAH35233.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] Length = 551 Score = 37.8 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 65/204 (31%), Gaps = 38/204 (18%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS--------- 227 V+DVS SM+ + +T++ ++ + + N Q GL FS Sbjct: 355 VVDVSGSMD-YMQDGVTRMAATAQAGDIAIRMF----PAN--AQLGLWAFSVDLGDGTDY 407 Query: 228 ------NKIEEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ S L +I LS G T + AY + +Q Sbjct: 408 RELEPVARMDATEGGTDHRSKLLSRIDNLSSIVGGGTGLYDSVLAAYRSM------QQSY 461 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG----AIVYAIGIRVIRSHEFL 336 + N ++ +TDG N +++ + IG+ + L Sbjct: 462 DPASINS---VILLTDGANDDPSGIALQELLDTLTREQDPMRPVPIITIGVTDDADTDVL 518 Query: 337 RACA--SPNSFYLVENPHSMYDAF 358 + + + + P + F Sbjct: 519 EQISTLTGGNSHFAPTPADIPKVF 542 >gi|67924730|ref|ZP_00518134.1| Magnesium chelatase, ChlI subunit [Crocosphaera watsonii WH 8501] gi|67853417|gb|EAM48772.1| Magnesium chelatase, ChlI subunit [Crocosphaera watsonii WH 8501] Length = 682 Score = 37.8 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 70/215 (32%), Gaps = 35/215 (16%) Query: 102 GFVNDIDDIVRSTSLDIVVVPQNEGY-------------SISAISRYKIPLKFCTFIPWY 148 F + +S S +++ Y ++ A R P + + + Sbjct: 405 YFAQMAQNQGKSGSRNLIFSDDRGRYVKPMLPKGKVRRIAVDATLRAAAPYQKYRRL-RH 463 Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 IV K ++ L ++ V+D S SM ++ ++ A ++ +L E Sbjct: 464 PGRSVIVEQGDIRSKRLARKAGAL-IVFVVDASGSM------ALNRMQSAKGAVMRLLTE 516 Query: 209 VKLIPDVNNVVQSGLVTFSNKI-EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 N Q L+ F + + +S +++++ L G S + + Sbjct: 517 A-----YENRDQVSLIPFRGEQADVLLPPTRSISLARKRLETLPCGGGSPLAHGLTQA-- 569 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + + + +IV +TDG Sbjct: 570 --VHVGMNAKMSGDIGQV----VIVAITDGRGNIP 598 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 37.8 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 27/54 (50%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID 55 FS + +G ++IL A+ L ++ +G+ +++ ++ ++ L + D Sbjct: 6 FSSGAALHGGRRRQRGVVSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSAD 59 >gi|226307664|ref|YP_002767624.1| hypothetical protein RER_41770 [Rhodococcus erythropolis PR4] gi|226186781|dbj|BAH34885.1| conserved hypothetical protein [Rhodococcus erythropolis PR4] Length = 569 Score = 37.8 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + ++ +DVS SM+ T++ + + L ++ PD V +GL FS Sbjct: 364 AKPIRTLVAIDVSASMDYPAAGGKTRMQLTTAAA---LAGNQVFPDS---VAAGLWAFSQ 417 Query: 229 KIE--EFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + F + + +++ N LK ++D E Sbjct: 418 GMNGTQDFQELVPIRRYDTIVDGVTQRKLMADQANRLTELKRGATGLYDTTLAAFRKVQE 477 Query: 284 DANYKKI--IVFMTDGENLSTKEDQQSLYYCNEAKKRG----AIVYAIGIRVIRSHEFLR 337 + + + ++ +TDG N + A+++ I+ IGI L Sbjct: 478 SYDPRAVNSVIILTDGANEDPDSISREQLLDVLAREQDPARPVIIVTIGITDDADAAALA 537 Query: 338 ACA--SPNSFYLVENPHSMYDAF 358 + + S Y+ +P + + F Sbjct: 538 DISRVTGGSTYIARDPSEISEVF 560 >gi|73960097|ref|XP_855330.1| PREDICTED: similar to calcium activated chloride channel 2 [Canis familiaris] Length = 943 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 38/212 (17%), Positives = 76/212 (35%), Gaps = 37/212 (17%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +VLDVS M + ++ +++ L ++ I G+V+F++K E Sbjct: 312 VCLVLDVSSKM-----AEANRLLRLQQAVEFYLMQIVEIHTF-----VGIVSFNSKGEIR 361 Query: 234 FLL-----EWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 L + L + T+ GLK + + + G Sbjct: 362 AQLHQINSDDDRKLLVSHLPMTVSAEAETSVCSGLKKGFEVVEKLNGKAYGS-------- 413 Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 +++ +T G++ + A G+ +++I + L F+ Sbjct: 414 -VMILVTSGDDEHIDNCLLT------ALSSGSTIHSIAMGSSVVENLEELSRRTGGLKFF 466 Query: 347 LVE--NPHSMYDAFSHIGK---DIVTKRIWYD 373 + + N +SM DAFS I DI + I + Sbjct: 467 VPDESNSNSMIDAFSRISSGTGDIFQQCIQLE 498 >gi|71493349|gb|AAZ32779.1| chloroplast Mg-chelatase subunit XANTHA-G precursor [Hordeum vulgare] gi|326503464|dbj|BAJ86238.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326508943|dbj|BAJ86864.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 761 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ A R P + +R + + T A ++ V+D S SM Sbjct: 514 AVDATLRAAAPYQKLRREKSLDKTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM--- 570 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRK 246 ++ ++ A + +L + Q ++ F + E ++ +++ Sbjct: 571 ---ALNRMQNAKGAALKLL-----AESYTSRDQVAIIPFRGDYAEVLLPPSRSIAMARKR 622 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ L G S + GL A + + + +IV +TDG Sbjct: 623 LEKLPCGGGSPLAH-GLSTAVRVGLNAE-------KSGDVGRIMIVAITDGRAN 668 >gi|73539238|ref|YP_299605.1| transmembrane protein [Ralstonia eutropha JMP134] gi|72122575|gb|AAZ64761.1| probable transmembrane protein [Ralstonia eutropha JMP134] Length = 563 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 23/45 (51%), Gaps = 4/45 (8%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 ILTA + L I+V H+F + L +M+D S + A Q+ Sbjct: 22 ILTATVAFVALL----SIDVGHVFMRQRQLQNMVDLSALSGAQQL 62 >gi|321464007|gb|EFX75018.1| hypothetical protein DAPPUDRAFT_323732 [Daphnia pulex] Length = 923 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 33/193 (17%), Positives = 62/193 (32%), Gaps = 37/193 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK--LIPDVNNVVQSGLVTFSNKIEE 232 ++V DVS SM + I + + + + D+ G+V F + Sbjct: 304 VLVTDVSGSMVDY------------NRIGRLYDSAQRWIQYDIPEGTSLGIVKFHSYATV 351 Query: 233 FFLLEWGVSHLQRKI--KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + ++++ K +K ST L+ A + + + Sbjct: 352 LSNMTVVNDETRQQLMDKVPNKTEDSTCIGCVLQMAIDILHPGGNGG------------V 399 Query: 291 IVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY- 346 IV +TDG EN + + V +I +E L + S++ Sbjct: 400 IVLVTDGLENEYPFIRDVTPELID----AKIQVVSIAFGRDAENEIENLATKTNGKSYFI 455 Query: 347 -LVENPHSMYDAF 358 N + DAF Sbjct: 456 NDNGNNDELNDAF 468 >gi|298491748|ref|YP_003721925.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233666|gb|ADI64802.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 201 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 62/170 (36%), Gaps = 16/170 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ D+A +S A+ + + V FS + + Sbjct: 6 DYTLIIDKSGSMSTPDQAGGRSRWDIAQESTLALARKAEQFDPDGITVYV----FSGRFK 61 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + + + + G +TN L A + F + Q + I Sbjct: 62 RYENV--NSAKVAQIFQENDPAG-TTNLAGVLNNALDNYFQRKASGQTKPNGET-----I 113 Query: 292 VFMTDGENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRV-IRSHEFLRA 338 + +TDGE K + + + + I + ++ +FL+A Sbjct: 114 LVVTDGEPDDRKAVFEVIIQATRRMDRDEELGISMIQVGADPQATKFLKA 163 >gi|222478564|ref|YP_002564801.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451466|gb|ACM55731.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 571 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 41/274 (14%), Positives = 83/274 (30%), Gaps = 82/274 (29%) Query: 168 TDARLDMMIVLDVSRSM----------------ESFFDSSITKIDMAIKSINAMLEEVKL 211 +D+M+ LD S SM + TKID+ ++ ++ Sbjct: 220 DAGPIDIMLTLDFSGSMLYDQYGGVVSTDPIQVDGQSYGETTKIDLVELGTRQFIDYLQA 279 Query: 212 IPDVNNVVQSGLVTF--------SNKIEEFFLLEWGVSHLQRKIKYLSK----------- 252 N VQ G+V F + + L +S + + L + Sbjct: 280 Q---NADVQVGVVYFDGEGSGENTPRTGILEPLTTNLSAVDTALSNLRQKLANVVSDAAP 336 Query: 253 ------------------FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 T + GL A ++ A+ +K + + Sbjct: 337 STPFDNDGNPDPYSNADGIATGTYISEGLDDAQTELASN---------GRASAEKRNIVL 387 Query: 295 TDGENLSTKEDQQ-------SLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASP----- 342 +DGE+ + + + + VY I V S L+A A P Sbjct: 388 SDGESFNGDGNTNYAPPASAAANARAASPAPATDVYTI--NVNGSASTLQAMAGPAGGSG 445 Query: 343 -NSFYL--VENPHSMYDAFSHIGKDIVTKRIWYD 373 + + + +P ++ F ++ V +++ + Sbjct: 446 GDPVFFNDINDPLNIPTVFGNLAAQTVAEKVIME 479 >gi|30018809|ref|NP_830440.1| von Willebrand factor type A domain-containing protein [Bacillus cereus ATCC 14579] gi|229126055|ref|ZP_04255077.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-Cer4] gi|229143348|ref|ZP_04271779.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST24] gi|296501384|ref|YP_003663084.1| von Willebrand factor type A domain-containing protein [Bacillus thuringiensis BMB171] gi|29894351|gb|AAP07641.1| von Willebrand factor type A domain protein [Bacillus cereus ATCC 14579] gi|228640155|gb|EEK96554.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-ST24] gi|228657377|gb|EEL13193.1| Von Willebrand factor type A domain protein [Bacillus cereus BDRD-Cer4] gi|296322436|gb|ADH05364.1| von Willebrand factor type A domain-containing protein [Bacillus thuringiensis BMB171] Length = 627 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|228473096|ref|ZP_04057853.1| conserved hypothetical protein [Capnocytophaga gingivalis ATCC 33624] gi|228275678|gb|EEK14455.1| conserved hypothetical protein [Capnocytophaga gingivalis ATCC 33624] Length = 288 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 47/111 (42%), Gaps = 7/111 (6%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S + I + + N ++GL+ F Sbjct: 72 EEERELTLMLLVDVSGSE------LFGSLQQFKNEILTEIAATLAFAALQNNDKTGLILF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 S++IE + + G SH+ R I+ L +F T + A+ + + + Sbjct: 126 SDQIELYIPPKKGKSHILRIIRELIEFQPKNKQTD-IAKAFEFLNKITKKK 175 >gi|307153653|ref|YP_003889037.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983881|gb|ADN15762.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 201 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 53/138 (38%), Gaps = 13/138 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM + ++ + +S A+ + + + V FS K + Sbjct: 6 DYTLIIDKSGSMSITDQPGGKSRWAVMQESTLALASKCQELDPDGITVYV----FSGKFK 61 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + + ++I + T+ LK A N F+ + + + I Sbjct: 62 RYNNI---TGNKVKQIFQENDPNGRTDLASVLKDALNYYFERKE--EGKTKPNGE---TI 113 Query: 292 VFMTDGENLSTKEDQQSL 309 + +TDGE K +++ Sbjct: 114 LVVTDGEPDDRKAVMKTI 131 >gi|197117534|ref|YP_002137961.1| hypothetical protein Gbem_1146 [Geobacter bemidjiensis Bem] gi|197086894|gb|ACH38165.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 383 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 6/41 (14%), Positives = 19/41 (46%) Query: 25 IFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 I L + ++ G+ I++ +++ L + + + A + Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESL 48 >gi|325679996|ref|ZP_08159565.1| efflux ABC transporter, permease protein [Ruminococcus albus 8] gi|324108434|gb|EGC02681.1| efflux ABC transporter, permease protein [Ruminococcus albus 8] Length = 750 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 50/166 (30%), Gaps = 14/166 (8%) Query: 6 LNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIE-VSHIFFMKTVLHSMIDR-------- 56 +R FF N G + I+ I L L G++ E + F +TV + + D Sbjct: 411 FRLRIFFQNISGYIVIIFGIILGYSILFFGLMFEPMLDNFKEETVNNMIADSIYVLRMPA 470 Query: 57 ---SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNT--WNMSFRNELRDNGFVNDIDDIV 111 + + + + K + + +N+ + + + + + Sbjct: 471 ETDAASAEKAAMTSLETTFEKYKAEDVSVYGISENSRYVKIDHNGGVTISTAFMNKYGLK 530 Query: 112 RSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP 157 + S+ + + Y Y P F+ M Sbjct: 531 KGDSITLTEKYGEKDYEFEITGFYDYPSTIAVFMDIDDFCETFDMD 576 >gi|221120223|ref|XP_002154140.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha2/delta subunit 1 [Hydra magnipapillata] Length = 1450 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 12/180 (6%) Query: 116 LDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMP----ITSSVKVNSQTDAR 171 +D + Y+ L F P N + P + V + + Sbjct: 30 VDNSTMQTTSYYTGIEPIVKSYNLSFLRMFPGNLNGYNKTGPDLYDCRNRVWYQMSSASP 89 Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D++IV+D S SM T I +A + A+++ + N + T+ Sbjct: 90 KDVVIVIDTSGSMAG------TNIIIAGITAKALIDTLTENDYFNVLTVGKSTTYVTPCI 143 Query: 232 EF--FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 +F ++ ++ I +++ N + G+ A+ + Q + + K+ Sbjct: 144 KFLIQATKFNKERMKYNINKIAEPTGQLNLSDGISSAFKTLNSAQFYGMGKSADRQIMKR 203 >gi|319952923|ref|YP_004164190.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421583|gb|ADV48692.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 350 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 57/167 (34%), Gaps = 16/167 (9%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++DVS SM I I +++E+K P V ++ F+ + Sbjct: 3 RLPIYFLIDVSESM------VGDPISQVQDGIATIIQELKTDPQALETVWISIIVFAGEA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + L+ + G T+ + GL + + D Sbjct: 57 KTLVPLQ----DIISFYPPKFPIGSGTSLSKGLGHLMYDLRKNLVKTTEYKKGDWKP--- 109 Query: 291 IVFM-TDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 IVF+ TDG + ++ N + + A + AI + L Sbjct: 110 IVFLFTDGVPTD--DATSAINEWNNSWSKTANLIAISLGDGADLSLL 154 >gi|289642041|ref|ZP_06474194.1| Magnesium chelatase [Frankia symbiont of Datisca glomerata] gi|289508114|gb|EFD29060.1| Magnesium chelatase [Frankia symbiont of Datisca glomerata] Length = 744 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 61/181 (33%), Gaps = 21/181 (11%) Query: 118 IVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + P+ G + A R P + ++ P V + L ++ V Sbjct: 481 VTDSPEAAGLHLPATLRAAAPWQ---RARGRRGPGLLLRPSDRRGAVREGREGNL-VLFV 536 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D S SM ++ ++ ++ ++L + + G++TF E L Sbjct: 537 VDASGSM-----AARARMRTVTTAVLSLL-----LDAYQRRDRIGMITFRGASAEVVLPP 586 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + G T GL A Q+ + R D + + ++V +TDG Sbjct: 587 TSSVEVGAARLRSLPTGGRTPLAAGLTRA-GQVLAAERRR------DPDRRALLVVVTDG 639 Query: 298 E 298 Sbjct: 640 R 640 >gi|228957042|ref|ZP_04118817.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pakistani str. T13001] gi|228802673|gb|EEM49515.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar pakistani str. T13001] Length = 627 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|228937862|ref|ZP_04100490.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970743|ref|ZP_04131383.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977318|ref|ZP_04137713.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis Bt407] gi|229148962|ref|ZP_04277207.1| Von Willebrand factor type A domain protein [Bacillus cereus m1550] gi|228634502|gb|EEK91086.1| Von Willebrand factor type A domain protein [Bacillus cereus m1550] gi|228782295|gb|EEM30478.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis Bt407] gi|228788868|gb|EEM36807.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228821768|gb|EEM67768.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis serovar berliner ATCC 10792] gi|326938340|gb|AEA14236.1| von Willebrand factor type A [Bacillus thuringiensis serovar chinensis CT-43] Length = 627 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|221091098|ref|XP_002165909.1| PREDICTED: similar to Pfemp3-like protein, putative, partial [Hydra magnipapillata] Length = 427 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 44 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 96 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 97 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 149 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 150 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 205 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 19/180 (10%) Query: 149 TNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEE 208 + + + +DM+ +D+S S E K+I ++++ Sbjct: 228 DTANELKGFVQGVHDGAEGRAFNVDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY 280 Query: 209 VKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAY 267 +P +N + GL+T+S+ L S +I + G N + A Sbjct: 281 --HLPSKSN--ELGLITYSDIANVNSEL---TSKFNSEILDKIVNNGRRQNVASAITVAS 333 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 IF ++ + + ++++V G+ S Y ++ IG+ Sbjct: 334 ENIFKIRNNDEKLKKK----QRVLVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 389 >gi|256823631|ref|YP_003147594.1| Vault protein inter-alpha-trypsin domain-containing protein [Kangiella koreensis DSM 16069] gi|256797170|gb|ACV27826.1| Vault protein inter-alpha-trypsin domain protein [Kangiella koreensis DSM 16069] Length = 689 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 37/221 (16%), Positives = 74/221 (33%), Gaps = 42/221 (19%) Query: 151 SRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITK---IDMAIKSINAMLE 207 + H++M + + + Q +M+ V+D S SM + ++ SIN Sbjct: 308 NYHVLMMLPPTHDLVQQKTQPREMIFVIDSSGSMSGESMQQAKQGLYYALSQLSINDTFN 367 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + D N + +E + + + T + A Sbjct: 368 IIDFDNDANKLF-----------DEAVPATLSNLEMAKYFVATLEADGGTEIAKAINLA- 415 Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 + D++ + +VF+TDG E Q N+ ++ IGI Sbjct: 416 ------------LDKPDSSLLRQVVFLTDGS--IGNERQIFQMIENQLGNN--RLFTIGI 459 Query: 328 RVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 A+PNS+++ + + F++IGK + Sbjct: 460 G-----------AAPNSYFMSKAANYGRGTFTYIGKASEVQ 489 >gi|213966336|ref|ZP_03394518.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] gi|213951042|gb|EEB62442.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] Length = 543 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 53/195 (27%), Gaps = 25/195 (12%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEF 233 + VLD S SM I + ++ I D + ++G V + +I Sbjct: 362 VFVLDTSGSM-------------FGDRIADLQATMRAIVDGSARTETGSVGLRNREIATI 408 Query: 234 FLLEWGVSH-LQRKIKYLSKFGVSTNSTPGLKYA-----YNQIFDMQGMRQHCNTEDANY 287 V I T + GL Y+ + + + Sbjct: 409 LPFSTSVGEPTTTTIDGPESRAQLTAAVDGLYAEGETALYDALIQAFDLLGSSDKNSIPS 468 Query: 288 KKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFY 346 IV +TDG+ S K + + V+ I E + + Sbjct: 469 ---IVVLTDGQVTSGKTFSEFRDFYQSKGGNLPPVFVIRYGEADPGEMQELANLTGGKVF 525 Query: 347 LVENPHSMYDAFSHI 361 + D F I Sbjct: 526 ESRET-ELADVFKEI 539 >gi|154280917|ref|XP_001541271.1| predicted protein [Ajellomyces capsulatus NAm1] gi|150411450|gb|EDN06838.1| predicted protein [Ajellomyces capsulatus NAm1] Length = 759 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 68/205 (33%), Gaps = 23/205 (11%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL------IPDVNNVVQSGLV 224 D+++ +DVS SM+S T + + ++ I +N + G+V Sbjct: 73 PCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRLGIV 132 Query: 225 TFSNKIEEFFLLE-WGVSHLQRKIKYLSKFG--VSTNSTPGLKYAYNQIFDMQGMRQHCN 281 FS + E + + S + +K + STN GLK ++ + Sbjct: 133 AFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLK------LGLKAFENERH 186 Query: 282 TEDANYKKIIVFMTDGENLS----TKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 T + + +TDG + R +++ G L+ Sbjct: 187 TPQSVQALYV--LTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRSGLLQ 244 Query: 338 ACAS--PNSFYLVENPHSMYDAFSH 360 A A +F + + + F H Sbjct: 245 AIAEVGGGTFAFIPDAGMIGTVFVH 269 >gi|218895679|ref|YP_002444090.1| hypothetical protein BCG9842_B4677 [Bacillus cereus G9842] gi|218545918|gb|ACK98312.1| conserved hypothetical protein [Bacillus cereus G9842] Length = 627 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|153876590|ref|ZP_02003832.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152066946|gb|EDN66168.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 305 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 72/209 (34%), Gaps = 30/209 (14%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 + +K+ Q ++++VLD S M KI A +LE VK Sbjct: 114 EAIIKIWHQYKKPANIVLVLDTSGGMRG------EKILHARTMALQLLEIVKEAD----- 162 Query: 219 VQSGLVTFSNKIE---EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 L++F++ + + ++ L+R+ Y G T + AY + Sbjct: 163 -YFSLLSFNHSLNWIAKNIQVKSQQKWLKRQFNYQFP-GGGTALYDAIFNAYTFLQKNSF 220 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL--YYCNEAKKRGAIVYAIGIRVIRSH 333 + +++ ++DG + ++ + + L + ++A+G I Sbjct: 221 PDKIA---------VMIVLSDGGDSHSELNFKDLLSKIPFNSDTSPIRIFAVGYGSITDK 271 Query: 334 EFLRACA--SPNSFYLVENPHSMYDAFSH 360 + L A + FY + F Sbjct: 272 KRLNEIAKMTQGKFYD-GAMVDVDKIFKK 299 >gi|75759395|ref|ZP_00739490.1| von Willebrand factor type A domain protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228899309|ref|ZP_04063572.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 4222] gi|74493107|gb|EAO56228.1| von Willebrand factor type A domain protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228860340|gb|EEN04737.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 4222] Length = 627 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|320105608|ref|YP_004181198.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924129|gb|ADV81204.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 373 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 67/182 (36%), Gaps = 31/182 (17%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +L + +++DVS S D ++ ++ L+ + + L+ F + + Sbjct: 85 QLTLGLLVDVSGSQRQVLDEE-------REASSSFLDNMLVADRDKAF----LIQFGHTV 133 Query: 231 EEFFLLEWGVSHLQRKIKYL------SKFGVSTNSTP------------GLKYAYNQIFD 272 E + + LQ +K + +F + + G A ++D Sbjct: 134 ELLTDVTGSIPKLQSGLKQVDTQAARPQFSNNQDPNDNSGGSSQRGRRGGGGGAGTALYD 193 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + ++K ++ +TDGE+ +KE S A++ VY+I + Sbjct: 194 AIFLASDEVIHKQPFRKALILLTDGEDNGSKESLSSAI--EAAQRADTAVYSIYFKGEEH 251 Query: 333 HE 334 ++ Sbjct: 252 ND 253 >gi|307330249|ref|ZP_07609396.1| Magnesium chelatase [Streptomyces violaceusniger Tu 4113] gi|306884046|gb|EFN15085.1| Magnesium chelatase [Streptomyces violaceusniger Tu 4113] Length = 259 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 50/131 (38%), Gaps = 19/131 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ T++ ++ ++L + + GL+TF E Sbjct: 68 VLFVVDASGSM-----AARTRMSAIKGAVLSLL-----LDAYQRRDKVGLITFRGADAEL 117 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V +++ L G + + LK ++ ++ +R ++V Sbjct: 118 ALPPTSSVDAAAVRLERLPTGGRTPLAAGLLKA--REVLRVERLRDPSRRP------LLV 169 Query: 293 FMTDGENLSTK 303 +TDG K Sbjct: 170 VVTDGRATEAK 180 >gi|66801705|ref|XP_629777.1| hypothetical protein DDB_G0292028 [Dictyostelium discoideum AX4] gi|74851198|sp|Q54DU5|Y2028_DICDI RecName: Full=von Willebrand factor A domain-containing protein DDB_G0292028 gi|60463177|gb|EAL61370.1| hypothetical protein DDB_G0292028 [Dictyostelium discoideum AX4] Length = 932 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 44/269 (16%), Positives = 90/269 (33%), Gaps = 42/269 (15%) Query: 111 VRSTSLDIVVVPQNE-GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPI---TSSVKVNS 166 T+ + + +N +I AI K K +FI + + + I Sbjct: 276 ESKTNATVTISTENGVSNNIIAIIVPKYSEKPESFIEYSPIDKSYALAINFYPKFSVGLD 335 Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + D + + + VLD S SM S I+ + ++ + + N + +V F Sbjct: 336 EVDQKSEFIFVLDCSGSM------SGKPIEKSKMALEICMRSL------NENSKFNIVCF 383 Query: 227 SNKIEEFFLLE--WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + F + LQ+ +Y+++ + T L+ + + D Sbjct: 384 GSNFNKLFETSKHYNDETLQKASEYINRIDANLGGTELLEPIVDIL---------SKESD 434 Query: 285 ANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL----RAC 339 + + + +TDGE + K N + ++ GI E + +AC Sbjct: 435 PEFPRQVFILTDGEISNRDKLIDYVGKEANTTR-----IFTYGIGSYVDKELIVGVSKAC 489 Query: 340 ASPNSFYL--VENPHSMYDAFSHIGKDIV 366 +Y V+N I + Sbjct: 490 ---KGYYEMIVDNSDMEEKVMKLISIAMQ 515 >gi|332298718|ref|YP_004440640.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181821|gb|AEE17509.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 566 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 31/174 (17%), Positives = 54/174 (31%), Gaps = 30/174 (17%) Query: 173 DMMIVLDVSRSMESF---------FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ-SG 222 + V DVSRSM + ++++D A+L+ ++ + + SG Sbjct: 87 SVSFVFDVSRSMTAKDVSSGAGAAVSKPVSRLDAVKLYAAALLDRMEGTEVSAVIAKGSG 146 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 +V V L + ++ G++ A N Sbjct: 147 VVA-----VPLTSDLNAVRSLLPSLSPALLTSTGSSIGSGIEAAVNSF-----------P 190 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFL 336 + + IV TDGE + +A K G V +G E L Sbjct: 191 PLSAAARTIVVFTDGEETDGRMSGAVA----DALKFGIPVVFVGFGSDTESEIL 240 >gi|303241097|ref|ZP_07327606.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591357|gb|EFL61096.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 689 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 47/136 (34%), Gaps = 30/136 (22%) Query: 193 TKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE-----WGVSHLQRKI 247 T++ ++ + ++ ++ + G+V+++ K + L VS ++ I Sbjct: 337 TRLYKEKTAVKSFIDSFSS----DDRTKVGVVSYAEKAKIVSGLTSTLDTTNVSIMKANI 392 Query: 248 KYLSKFGV------------STNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMT 295 LS N ++ AY + N+ DA K I+ +T Sbjct: 393 DALSPSNAIVSNVDGSYKSSRRNIGDAMRRAYYML---------KNSTDAGASKYIIVLT 443 Query: 296 DGENLSTKEDQQSLYY 311 D + S D + Sbjct: 444 DEISNSWTWDSITSNP 459 >gi|256086242|ref|XP_002579311.1| hypothetical protein [Schistosoma mansoni] gi|238664738|emb|CAZ35550.1| expressed protein [Schistosoma mansoni] Length = 1678 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 54/140 (38%), Gaps = 30/140 (21%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++I++D S SM++ + T + + I E++ N V +V+FS+ Sbjct: 1387 IVILIDTSGSMDAHLNEIKTYLKLLI------WEQLH-----KNKVFFNMVSFSS----- 1430 Query: 234 FLLEWGVSHL-----------QRKIKYLSKFGVSTNSTPGLKYAYN--QIFDMQGMRQHC 280 LLEW S + I L G T + + N QI + +++ Sbjct: 1431 TLLEWQASGVVMADEVYCHDAIAWIDQLKATG-GTCTGEAILCGLNHLQIMNSSNIKEMY 1489 Query: 281 NTEDANYKKIIVFMTDGENL 300 + K I +TDG+ Sbjct: 1490 DVNFNEMSKGIYLITDGKPD 1509 >gi|146304149|ref|YP_001191465.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702399|gb|ABP95541.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 363 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 46/134 (34%), Gaps = 26/134 (19%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 L +I++D S SM K++MA + +++ + + L+ F+ Sbjct: 13 EAKNLHYVILIDRSYSM------KGEKLEMAKEGARLLVDNLPKDS------RFSLLAFN 60 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 K+ S + ++++ L G T L+ A+N Sbjct: 61 EKV-SIIKEHEHPSEMGKELESLK-VGSGTAMYKALQEAFNLARKYGEPT---------- 108 Query: 288 KKIIVFMTDGENLS 301 ++ +TDG Sbjct: 109 --YVILLTDGVPSD 120 >gi|76798092|ref|ZP_00780347.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|76586547|gb|EAO63050.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] Length = 364 Score = 37.4 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 42/92 (45%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +++ +++++ N + LVT+++ I Sbjct: 249 GEAVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|228906371|ref|ZP_04070255.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 200] gi|228853283|gb|EEM98056.1| Von Willebrand factor type A domain protein [Bacillus thuringiensis IBL 200] Length = 627 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 57/180 (31%), Gaps = 25/180 (13%) Query: 165 NSQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 423 KGQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------ 469 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQ 278 ++ + E + + + Y + N P + + + G + Sbjct: 470 AISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVS 527 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 528 EKLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|168705806|ref|ZP_02738083.1| hypothetical protein GobsU_40112 [Gemmata obscuriglobus UQM 2246] Length = 188 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 57/167 (34%), Gaps = 25/167 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 I+ D S SM+ T + + I + L Q + FS++ + Sbjct: 32 FAILADNSGSMQG------TPLTVLKAEITSTLAR------SRGSAQFYVTFFSSEADPQ 79 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L W + + ++ +SK+ + G QH ++ Sbjct: 80 PLKRW--TADRNEVAAVSKWVQGIQTANG--------TSPVVGFQHVLKLKPPPD-VVYL 128 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA 340 MTDGE + + + N+A + A+++ + + L+ A Sbjct: 129 MTDGE--FDPREVEQIRTLNQALRPPAVIHTVAFGGKQGEAELKMIA 173 >gi|262371395|ref|ZP_06064713.1| conserved hypothetical protein [Acinetobacter johnsonii SH046] gi|262313732|gb|EEY94781.1| conserved hypothetical protein [Acinetobacter johnsonii SH046] Length = 1327 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 57/150 (38%), Gaps = 27/150 (18%) Query: 174 MMIVLDVSRSM----------ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + I+LDVS SM + FD + ++ +A ++I ++++V + +VN + Sbjct: 568 LTIILDVSGSMAQDENGRGSATAGFDPNTIRLMLAKQAIAQLIDKVDTLGNVN----VKM 623 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 + FS+ E +W V I +++ + T+ + +H Sbjct: 624 INFSSANSEEVS-QWYVDSATGAINFVNTQQADGGTSYLSAINKTMTDWGTPPAADKH-- 680 Query: 282 TEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 + F+TDG + + + Sbjct: 681 --------FVYFITDGAPDTNQGIDATTGT 702 >gi|218693423|gb|ACL01215.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693429|gb|ACL01218.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693431|gb|ACL01219.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 5/83 (6%) Query: 149 TNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + + V ++ + LD++++LD S SM + ++ + A +++ +++ Sbjct: 198 KNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLID 257 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI 230 ++ N + LVT+++ I Sbjct: 258 KITS----NKDNRVALVTYASTI 276 >gi|149026148|gb|EDL82391.1| chloride channel, calcium activated, family member 2 (predicted), isoform CRA_b [Rattus norvegicus] Length = 674 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 74/208 (35%), Gaps = 29/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+DVSR M + ++ ++ L +V V G+VTF +K E Sbjct: 312 VCLVIDVSRKM-----AEGDRLLRLQQAAELFLMQV-----VEAHTFVGIVTFDSKGEIR 361 Query: 234 FLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L+ ++ + VST + + + F++ + +++ Sbjct: 362 APLQQINSDDDRKLLVSYLPTTVSTEAETNICAGVKKGFEVVEKQNGRADGS-----VMI 416 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE- 349 +T G D+ A G+ ++++ + + L F++ + Sbjct: 417 LVTSG------ADEHITNCLLTAMSSGSTIHSMALGSSAARRLGELSHVTGGLEFFIPDP 470 Query: 350 -NPHSMYDAFSHIGK---DIVTKRIWYD 373 M +AFS I DI + + + Sbjct: 471 FTSSRMTEAFSGISSGAGDIFQQSLQLE 498 >gi|157818603|ref|NP_001100920.1| calcium-activated chloride channel regulator 2 [Rattus norvegicus] gi|149026147|gb|EDL82390.1| chloride channel, calcium activated, family member 2 (predicted), isoform CRA_a [Rattus norvegicus] Length = 942 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 74/208 (35%), Gaps = 29/208 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +V+DVSR M + ++ ++ L +V V G+VTF +K E Sbjct: 312 VCLVIDVSRKM-----AEGDRLLRLQQAAELFLMQV-----VEAHTFVGIVTFDSKGEIR 361 Query: 234 FLLEW-GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 L+ ++ + VST + + + F++ + +++ Sbjct: 362 APLQQINSDDDRKLLVSYLPTTVSTEAETNICAGVKKGFEVVEKQNGRADGS-----VMI 416 Query: 293 FMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFYLVE- 349 +T G D+ A G+ ++++ + + L F++ + Sbjct: 417 LVTSG------ADEHITNCLLTAMSSGSTIHSMALGSSAARRLGELSHVTGGLEFFIPDP 470 Query: 350 -NPHSMYDAFSHIGK---DIVTKRIWYD 373 M +AFS I DI + + + Sbjct: 471 FTSSRMTEAFSGISSGAGDIFQQSLQLE 498 >gi|77407764|ref|ZP_00784518.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|77173630|gb|EAO76745.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|218693241|gb|ACL01124.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693247|gb|ACL01127.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693329|gb|ACL01168.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693351|gb|ACL01179.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693361|gb|ACL01184.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693365|gb|ACL01186.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693375|gb|ACL01191.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693413|gb|ACL01210.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693417|gb|ACL01212.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693437|gb|ACL01222.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693441|gb|ACL01224.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693443|gb|ACL01225.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693445|gb|ACL01226.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693449|gb|ACL01228.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693451|gb|ACL01229.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 5/83 (6%) Query: 149 TNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + + V ++ + LD++++LD S SM + ++ + A +++ +++ Sbjct: 198 KNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLID 257 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI 230 ++ N + LVT+++ I Sbjct: 258 KITS----NKDNRVALVTYASTI 276 >gi|260426558|ref|ZP_05780537.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] gi|260421050|gb|EEX14301.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] Length = 1088 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 62/214 (28%), Gaps = 37/214 (17%) Query: 173 DMMIVLDVSRSMESFFDSSITK--IDMAIKSINAMLEEVKLIPDVNNVVQ---------- 220 +M+++D S SM +S + ++ + ++E V+ Sbjct: 454 QIMMIVDRSGSMSWSSNSGQAEVCLNGLDDDNDGTVDEGDCADSRIEFVRAAGRAFVDLQ 513 Query: 221 ------SGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 GL+ F+ ++ + I LS G T Sbjct: 514 TSQGIDLGLLEFNEGNTLLRPIDTLNAGNAQDYKDAIDALSP-GGDTAIGDAFDA----- 567 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + +TDG N + + + ++ G ++AI Sbjct: 568 -STGEFTRVAEVGRV---RTAYLLTDGFNTAGGDPVAAAERLDDI---GVRIHAIPAGND 620 Query: 331 RSHEFLRACAS--PNSFYLVENPHSMYDAFSHIG 362 E L AS Y N +++ F+ + Sbjct: 621 VDREELTDIASGTGGQVYEARNVNALTGIFAELA 654 >gi|218693357|gb|ACL01182.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693425|gb|ACL01216.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693427|gb|ACL01217.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 5/83 (6%) Query: 149 TNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + + V ++ + LD++++LD S SM + ++ + A +++ +++ Sbjct: 198 KNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKAGEAVEKLID 257 Query: 208 EVKLIPDVNNVVQSGLVTFSNKI 230 ++ N + LVT+++ I Sbjct: 258 KITS----NKDNRVALVTYASTI 276 >gi|220905238|ref|YP_002480550.1| OmpA/MotB domain-containing protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219869537|gb|ACL49872.1| OmpA/MotB domain protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 331 Score = 37.4 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 35/180 (19%), Positives = 60/180 (33%), Gaps = 22/180 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 T V+D S SM ++ K D I + N + IP ++ ++ Sbjct: 27 CTKKINSFDFVVDYSGSM--MMKNAQLKQDKIIVAKNVLQRINAAIPALDYNGGLHTISP 84 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSK----FGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + I W + + IK L FG T+ GL+ AY Sbjct: 85 NGVIVAQGP--WDRNTMAAGIKKLRSDFQIFGRMTSMGTGLQ-AYEPFLSSMKRDA---- 137 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR-VIRSHEFLRACAS 341 ++ +TDG+N D + A +R +V+ I E ++ A+ Sbjct: 138 -------AVILVTDGDNN-RGADLVEVARQMYASQRNLVVHIISFADSKNGEETIKQVAA 189 >gi|156402363|ref|XP_001639560.1| predicted protein [Nematostella vectensis] gi|156226689|gb|EDO47497.1| predicted protein [Nematostella vectensis] Length = 260 Score = 37.4 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 33/237 (13%), Positives = 77/237 (32%), Gaps = 38/237 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARL--DMMIVLDVSRSMESF 187 ++ Y++ F +P K ++ + +++ VLD S S+ Sbjct: 51 KRVTAYELAHYFLNPMPNDLQPGRHD-------KRDAGDEFPFTYNVVYVLDSSTSITK- 102 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKI 247 + IK+I + + + N + L+ ++ + S LQ Sbjct: 103 -----VDFENGIKAIQLLTAKAR------NDTKFALILYAAEPVMVTNFTTQASMLQLLT 151 Query: 248 KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQ 307 + + TN+ L+ F G ++ +TDG + ++ + Sbjct: 152 N-VQRLYGKTNTFSALELCRKLFFVKSGSLNR-----------VLLVTDGLSNMNQD--K 197 Query: 308 SLYYCNEAKKRGAIVYAIGIRVIR-SHEFLRACAS--PNSFYLVENPHSMYDAFSHI 361 LY + K ++ + + + + + +S Y VE+ + I Sbjct: 198 MLYEAFQLKMSNIEIFVVAVGRYNYGADEISSLSSIPTTHIYRVESMRGLVRIIMWI 254 >gi|110638307|ref|YP_678516.1| hypothetical protein CHU_1908 [Cytophaga hutchinsonii ATCC 33406] gi|110280988|gb|ABG59174.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406] Length = 319 Score = 37.4 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 53/129 (41%), Gaps = 18/129 (13%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ +LDVS SM++ D S ++++ A I+ ++ + N Q GL+++++ Sbjct: 81 ILFILDVSLSMDA-RDVSPSRLEKAHTIIHNIVNQ-------NPTDQYGLISYASGATIQ 132 Query: 234 FLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L + + + TN+ L+ A N + E + Sbjct: 133 CPLTFDTQTFLAFSQTATTSLFDYTGTNTYDALRMANNYLTTY-------KKEGNLKPCV 185 Query: 291 IVFMTDGEN 299 ++ ++DGE Sbjct: 186 VIMLSDGEG 194 >gi|110833328|ref|YP_692187.1| pilin biogenesis-like protein [Alcanivorax borkumensis SK2] gi|110646439|emb|CAL15915.1| pilin biogenesis related protein [Alcanivorax borkumensis SK2] Length = 1009 Score = 37.4 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 23/112 (20%), Positives = 36/112 (32%), Gaps = 10/112 (8%) Query: 268 NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTK--EDQQSLYYCNEA------KKRG 319 +I D NT + IV TDGE S Y C A + Sbjct: 269 KKIVDNGNYVSPRNTANECESNHIVLFTDGEANDVNLPCGGGSSYDCQRAISDYLDRDFE 328 Query: 320 AIVYAIGIRVIRSHEFLRACASPNS--FYLVENPHSMYDAFSHIGKDIVTKR 369 Y +G+ + + + +S + Y + S+ AF I I + Sbjct: 329 IKTYNVGLHMEDNRADMETVSSDGADGTYTASDAESLASAFLDIFDLIEKES 380 >gi|320537257|ref|ZP_08037217.1| hypothetical protein HMPREF9554_01962 [Treponema phagedenis F0421] gi|320145885|gb|EFW37541.1| hypothetical protein HMPREF9554_01962 [Treponema phagedenis F0421] Length = 324 Score = 37.4 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 46/133 (34%), Gaps = 11/133 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L + I++D+S SMES FD K + + G + F Sbjct: 93 KEERELQIFIIVDLSFSMESCFDCVSAKSKALEAAGLLSFAALHTATPF------GGLIF 146 Query: 227 SNKIEEFFLLEWGVSHL-----QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 KI ++F G + + + + T L+ + + + + Sbjct: 147 DGKIGKYFEPRAGKNAVLCFLKECDVFANQPGSPGTALAEALQTSAKVLHTRSMVIVISD 206 Query: 282 TEDANYKKIIVFM 294 + +++ +F+ Sbjct: 207 FKVEGFEREFIFL 219 >gi|312217517|emb|CBX97465.1| hypothetical protein [Leptosphaeria maculans] Length = 1258 Score = 37.4 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 51/145 (35%), Gaps = 15/145 (10%) Query: 84 CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCT 143 W + + + N + D + S + S S Y T Sbjct: 651 RSQLELWRRALMDIRLEFPMSNKMADYDQDNSGTEDDEYRRPKRVSSVNSSYGAGRSIMT 710 Query: 144 FIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSIN 203 YTNSR ++ +D+++V+ VS SM+ KI++ ++ Sbjct: 711 APTEYTNSR---GGGVQEARMTQSIHVPIDIVVVIPVSSSMQGL------KINLLRDTLR 761 Query: 204 AMLEEVKLIPDVNNVVQSGLVTFSN 228 ++ ++ + + GLVTF + Sbjct: 762 FLVT------NLGDRDRMGLVTFGS 780 >gi|326680031|ref|XP_696917.5| PREDICTED: von Willebrand factor A domain-containing protein 5A-like [Danio rerio] gi|326680047|ref|XP_003201439.1| PREDICTED: von Willebrand factor A domain-containing protein 5A-like [Danio rerio] Length = 738 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 50/136 (36%), Gaps = 20/136 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + V+D S SM+ I + + +L +K +P + F ++ E F Sbjct: 281 FVFVMDRSGSMDCMMHRGIGAQKCIESAKDTLLLLLKSLPMGCYF---NIYGFGSRFESF 337 Query: 234 F--LLEWGVSHLQ---RKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 F +E+ + +K+K + T L + Y+Q + Sbjct: 338 FTKSVEYNQDKMDQALKKVKEMRADMGGTEILQPLAHIYSQ------------PCIPEHP 385 Query: 289 KIIVFMTDGENLSTKE 304 + + TDGE +TKE Sbjct: 386 RQLFIFTDGEVENTKE 401 >gi|308501124|ref|XP_003112747.1| CRE-ROP-1 protein [Caenorhabditis remanei] gi|308267315|gb|EFP11268.1| CRE-ROP-1 protein [Caenorhabditis remanei] Length = 644 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 57/164 (34%), Gaps = 37/164 (22%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE-F 233 + LDVS SM S SS A ++ + +++N + V F +K+ E Sbjct: 477 CLALDVSGSMCSPVSSSPLSCREAATGMSLI--------NLHNEAEVKCVAFCDKLTELP 528 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 F +W + + I LS +T+ + +A + + I++ Sbjct: 529 FTKDWKIGQVNDYIDKLSF--GNTDCGLPMTWA--------------TENNLKFDVFIIY 572 Query: 294 MTDGE----NLSTKEDQQSLYYCNEAKKRGAIV-------YAIG 326 TD + N+ E + + IV Y+I Sbjct: 573 -TDNDTWAGNVHPFEAIKRYREASGIHDAKVIVMAMHAYNYSIA 615 >gi|291230028|ref|XP_002734972.1| PREDICTED: membrane-bound transcription factor peptidase, site 1-like [Saccoglossus kowalevskii] Length = 893 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 69/205 (33%), Gaps = 24/205 (11%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF- 226 + LD+ V+D + SM S+ I+ A K+I ++++E+ + ++ LV + Sbjct: 20 DASVLDLAFVMDCTGSMGSW-------INEARKNIQSIVDEIVAKEMSD--IRLALVEYR 70 Query: 227 ------SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMR 277 S + + +Q+++ + G GL A N + + Sbjct: 71 DHPPQESTFVTRVLDFTPSLKDMQKQMDNMKANGGGDGPEAVADGLHEALNLNWRPLATK 130 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEF 335 DA + DG + + ++ +Y++ +F Sbjct: 131 VCVLIADAPPHG-LRDSGDGFPKGCPAGHDPMKIARQMAEQNITLYSVVCGSYAEGFKDF 189 Query: 336 LRACA--SPNSFYLVENPHSMYDAF 358 A A + + +++ + Sbjct: 190 FMAIAHVTGGQYVSLKDAKLLSKVI 214 >gi|212633291|ref|YP_002309816.1| VCBS [Shewanella piezotolerans WP3] gi|212554775|gb|ACJ27229.1| VCBS [Shewanella piezotolerans WP3] Length = 1477 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 40/269 (14%), Positives = 83/269 (30%), Gaps = 32/269 (11%) Query: 54 IDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRS 113 +D + A I ++ +L N +++F + + I Sbjct: 895 VDYEGLVAGNVITDDTGSGVDSDSNNDPLLLTQVNGIDLNFIGGEAEVILQGGLLTIQED 954 Query: 114 TSLDIVVVPQ-----NEGYSIS-----AISRYKIPLKFCTFIPWYTNSRHIVMPITSSVK 163 S N Y+I+ + + + + +I ++ Sbjct: 955 GSYTFEHDGNSSTPINFSYTINDGNGGTDTANVSIAVYDSETLNPGDDNYIGTDGNDTII 1014 Query: 164 VNSQT---DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 ++ A ++ ++D S SM DS++ I S+ A L V Sbjct: 1015 SDTPDILAGADYNLAFLIDSSGSMG---DSAVATAKAQILSVLATL-ITNANQPSAGTVN 1070 Query: 221 SGLVTFSNKIEEFFLLEWGVSH----LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 LV F + ++ + + ++ +S G +TN + AYN D Sbjct: 1071 VLLVDFDQTAKILIAIDLSSNDPLASITTALEAMSS-GGTTNYSAAFTAAYNWFND---- 1125 Query: 277 RQHCNTEDANYKKIIVFMTDGENLSTKED 305 + + + F+TDGE + Sbjct: 1126 -NYPQGNNRTF-----FITDGEPNTDNGQ 1148 >gi|311070191|ref|YP_003975114.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] gi|310870708|gb|ADP34183.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] Length = 226 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 78/227 (34%), Gaps = 25/227 (11%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + S + ++ ++ D S SM KID+A KS+ + E + Sbjct: 13 LTVSLASPSFAAEKKTPDTNVAVLFDGSGSMVQKTGGER-KIDVAKKSVKSFAELLP--E 69 Query: 214 DVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF-- 271 D N +++ F ++ + I L + S A +QI Sbjct: 70 DTNLMLRV----FGHEGNNKLSGKALSCSTTETIYGLHPYEGS-----LFNNALSQINPT 120 Query: 272 -------DMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + R+ +A K ++ +TDGE + + + EA IV Sbjct: 121 GWTPIAKALSDTREEFQKVNAEGKNVVYLITDGEETCGGDPAKEIEKLREA-NVDTIVNI 179 Query: 325 IGIRV-IRSHEFLRAC--ASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 IG ++ ++ ++ A + +N A+ + + + Sbjct: 180 IGFNFDVKGNQKMKQAAVAGGGEYISAKNAEEFEQAWENEAQKFTEE 226 >gi|239627038|ref|ZP_04670069.1| argininosuccinate synthase [Clostridiales bacterium 1_7_47_FAA] gi|239517184|gb|EEQ57050.1| argininosuccinate synthase [Clostridiales bacterium 1_7_47FAA] Length = 1453 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 73/219 (33%), Gaps = 17/219 (7%) Query: 57 SLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDID----DIVR 112 + +AA GN + K G + + + + L DN D D++ Sbjct: 899 ANYYAAVSAATPGNASIDYTPKTGAVALDLSDDRMADLIDGLTDNSSDQDAMTAGVDVLY 958 Query: 113 STSLDIVVVPQNEGYSISAISRYK---IPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 + +E ++ A +P + + + +P + Sbjct: 959 TVKFTQRAPKASESEAVKAEVGNDAVKVPWTLSSVLTRQVGGSNKDIP------PGVDKE 1012 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 + + VLD S M+ + D + ++ ++ PD + +G+V+F Sbjct: 1013 PDIRIYAVLDRS--MQGYTDYQLWNVEDLDGEYTCTQVPMEPDPDDADSGFTGVVSFDAN 1070 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFG--VSTNSTPGLKYA 266 +E ++L + +H R I + S L+ A Sbjct: 1071 VEGTYVLTYLKAHEVRIIDDKRSVEHIIKVRSGSALEDA 1109 >gi|3913240|sp|O24133|CHLD_TOBAC RecName: Full=Magnesium-chelatase subunit chlD, chloroplastic; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase; Flags: Precursor gi|2239151|emb|CAA71128.1| CHLD magnesium chelatase subunit [Nicotiana tabacum] Length = 758 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 57/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + +R + + T A ++ V+D S SM Sbjct: 508 KRLAVDATLRAAAPYQKLRRAKDIQKTRKVYVEKTDMRAKRMARKAGALVIFVVDASGSM 567 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E +S Sbjct: 568 ------ALNRMQNAKGAALKLL-----AESYTSRDQVCIIPFRGDAAEVLLPPSRSISMA 616 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 + +++ L G S + + + + + +IV +TDG Sbjct: 617 RNRLERLPCGGGSPLAHG--------LTTAVRVGMNAEKSGDVGRIMIVAITDGRAN 665 >gi|50913505|ref|YP_059477.1| collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|19108443|gb|AAL11466.1| FctX [Streptococcus pyogenes] gi|50902579|gb|AAT86294.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|198417345|gb|ACH87890.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417347|gb|ACH87891.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1036 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 54/167 (32%), Gaps = 38/167 (22%) Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 +N + L W + L + TN L A + +++ + Sbjct: 572 NNSRDAELLKGWSTNSLLDPNTLTALHNNGTNYHAALLKAKEILNEVKDDGRR------- 624 Query: 287 YKKIIVFMTDG--------------------ENLSTKEDQQSLYYCNEAKKR--GAIVYA 324 KI++F++DG N T+ + S +E K R +Y+ Sbjct: 625 --KIMIFISDGVPTFYFGEDGYRSGNGSSNDRNNVTRSQEGSKLAIDEFKARYPNLSIYS 682 Query: 325 IGIR-------VIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 +G+ S L+ + +Y + + + + I +D Sbjct: 683 LGVSKDINSDTASSSPVVLKYLSGEEHYYGITDTAELEKTLNKIVED 729 >gi|268530440|ref|XP_002630346.1| C. briggsae CBR-CLEC-62 protein [Caenorhabditis briggsae] Length = 386 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 51/155 (32%), Gaps = 17/155 (10%) Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPG--LKYA 266 V+ GLVT++N+ L S + L K S L A Sbjct: 68 AYSDKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAA 127 Query: 267 YNQIFDMQGMRQHCNTEDANYKK-IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + N +NYK+ ++++ +D + ++ K G + + Sbjct: 128 ESVLSAG-----RKNATRSNYKQLVLIYASDYRDDG---EEDPRPTAERMKSSGVSIATV 179 Query: 326 GIRVIRSHEFLRA---CASPNSFYLVENPHSMYDA 357 + ++A ASP + E+ + + Sbjct: 180 AFDQTGNEGVVKAIGEIASPGFNFTNEDADLVREI 214 >gi|229042482|ref|ZP_04190227.1| Von Willebrand factor type A domain protein [Bacillus cereus AH676] gi|228726835|gb|EEL78047.1| Von Willebrand factor type A domain protein [Bacillus cereus AH676] Length = 610 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 57/179 (31%), Gaps = 25/179 (13%) Query: 166 SQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 407 GQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------A 453 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQH 279 ++ + E + + + Y + N P + + + G + Sbjct: 454 ISGFWEDASSAKPEDKPNVIHEVVNYKNS--TLPNVGPEIMQLREEEDNRDGYIIRIVSE 511 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 512 KLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 570 >gi|326430497|gb|EGD76067.1| hypothetical protein PTSG_00776 [Salpingoeca sp. ATCC 50818] Length = 8235 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 53/177 (29%), Gaps = 35/177 (19%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK----I 230 +IV+D + SM + D++ +K + ++ + V+ G V + + + Sbjct: 8018 VIVMDCTGSMGHWIDAAKAHATSMMKFVR---DQAVSKFGRDVNVRLGFVAYRDYDFGTV 8074 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP-----GLKYAYNQIFDMQGM--------R 277 E L + H+ I P GL+ A + + Sbjct: 8075 SEQCPLTRDLRHVDAFISRQVACSGRHRDVPEEVLTGLRAAVSMAWTPTAAVKLVVLVGD 8134 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKED---------QQSLYYCNEAKKRGAI-VYA 324 + + ++K D Q+++ + K + ++ Sbjct: 8135 APQHGPEFVHEK-----RDNHPNGPPAGYPDHGKSHLQRAIEITDAMKDKNIELIFT 8186 >gi|289192722|ref|YP_003458663.1| Magnesium chelatase [Methanocaldococcus sp. FS406-22] gi|288939172|gb|ADC69927.1| Magnesium chelatase [Methanocaldococcus sp. FS406-22] Length = 317 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 63/184 (34%), Gaps = 22/184 (11%) Query: 147 WYTNSRHIVMPITSSVKVNSQTDARLD--MMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 +++ + + + V ++ ++ V+DVS SM ++ +++ A +I + Sbjct: 105 REKSNKKLAIYLEKEDIVEKVRQRKISSHILFVVDVSGSM-----GAMRRMEAAKGAIIS 159 Query: 205 MLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK 264 +L + + G++ F E L L K+ G T Sbjct: 160 LL-----LDAYQKRNKIGMIAFRKDRAELILPFTSSVELGEKLLKDLPTGGKTPLADAFI 214 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY---CNEAKKRGAI 321 +Y +FD + + I++ ++D + + C + ++ Sbjct: 215 KSYE-VFDRELRKNPNIIP------IMIVISDFKPNVAVKGDYVKEVFDACEKIAEKCIN 267 Query: 322 VYAI 325 V I Sbjct: 268 VILI 271 >gi|145548479|ref|XP_001459920.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427747|emb|CAK92523.1| unnamed protein product [Paramecium tetraurelia] Length = 973 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 78/244 (31%), Gaps = 31/244 (12%) Query: 67 NEGNGNNRKKLKGG--DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 E N N K+ + + + + + N+ F + + +I+ + Sbjct: 361 EEENINLLKQQLKDMEALFNQYQQNASNNKSNDQNKKQFEQQLKKLKDEAEYNIIKNNEE 420 Query: 125 -------EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIV 177 + + + K + +R I + + + + +LD+ V Sbjct: 421 VEKNRKLKELQMEESKKKKAMMDQKRMEDKKNENRMIKEIMMKKQQGSLEEYGKLDICYV 480 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF---- 233 +D ++SME + + + + + ++K I + + S + + + Sbjct: 481 IDCTKSMEPYVEQARSCVQESLKVI--------KQQTNRDTIISSVAYYDIEQRPAKGYY 532 Query: 234 -FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 F + Q+ I + + ++ A Q+ + N KI + Sbjct: 533 QFEFSNNIQAFQKFILEVP-IEGGRDIPEDVRGALEQMITKLKWK--------NKFKIAI 583 Query: 293 FMTD 296 +TD Sbjct: 584 LITD 587 >gi|115680671|ref|XP_001197987.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115719718|ref|XP_001196973.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 528 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 54/179 (30%), Gaps = 30/179 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 +++VLD+S SM+ + D I+S + + G+V F + E Sbjct: 107 SVVLVLDISNSMKEK-----NRFDRMIQSSTVYIMSAIPADS-----KLGIVVFESASRE 156 Query: 233 FFLLEWGVSHLQRK--IKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L R+ + L+ +T G+K + Sbjct: 157 IAELTDITDTASRQRLVNALNTSPKGTTCIGCGIKSGLKVLGSYAQGG------------ 204 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 I+ ++DG ++ G I+ I I + L S S + Sbjct: 205 YILLLSDGVENEDPSITDMY---DDINNSGVIIDTITISNEADQQMEDLSKNTSGKSSF 260 >gi|221107176|ref|XP_002169824.1| PREDICTED: similar to predicted protein, partial [Hydra magnipapillata] Length = 228 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 58/157 (36%), Gaps = 19/157 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 +DM+ +D+S S E K+I ++++ +P +N + GL+T+S+ Sbjct: 7 VDMVFAMDLSSSSEEILQKQ-------KKAIRSLIDY--HLPSKSN--ELGLITYSDIAN 55 Query: 232 EFFLLEWGVSHLQRKI-KYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 L S +I + G N + A IF ++ + + +++ Sbjct: 56 VNSEL---TSKFNSEILDKIVNNGRRQNVASAITVASENIFKIRNNDEKLKKK----QRV 108 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGI 327 +V G+ S Y ++ IG+ Sbjct: 109 LVLFVVGKPSSNYPPVVPERYGKLLEENNVKTIIIGL 145 >gi|167388489|ref|XP_001738586.1| hypothetical protein [Entamoeba dispar SAW760] gi|165898119|gb|EDR25073.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 720 Score = 37.4 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 53/142 (37%), Gaps = 26/142 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + SS +N + + ++++ + D S SM I+ + L ++ + + Sbjct: 198 VISSTFINKEKEGDINIIFICDRSGSMYGEG------INALRNMLQLFLRQL----PLKS 247 Query: 218 VVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + ++F +K + F +E+ L+ +S+F + + + D Sbjct: 248 KFEI--ISFGSKYDFMFKEMVEYNEDTLKNASNRISEFEAN-YGGTSMDAPLKALIDNNT 304 Query: 276 MRQHCNTEDANYKKIIVFMTDG 297 + H I+ +TDG Sbjct: 305 EKCH-----------IILLTDG 315 >gi|222082656|ref|YP_002542021.1| hypothetical protein Arad_9366 [Agrobacterium radiobacter K84] gi|221727335|gb|ACM30424.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 176 Score = 37.1 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 55/150 (36%), Gaps = 7/150 (4%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID--RSLVHAATQ 64 + R + KG I A+ +F+++ IIE+S +FF+ + + + + ++ Sbjct: 9 SFRRLLADRKGVAAIEFALLALPLFMLIFAIIEISVMFFVNSAMDASVQKISRMIRTGEV 68 Query: 65 IMNEGNGNNRKKLKGGDIL----CRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVV 120 ++ + + K D+L C + ++L N I+ + Sbjct: 69 ASSKISQADFKAKICDDMLLAFSCSSNLLVKVDILSDLSSATSANPINASGNLAVTETYN 128 Query: 121 VPQNEGYS-ISAISRYKIPLKFCTFIPWYT 149 + + + + A + + F TF Sbjct: 129 IGKGSDFVLVQAFLPWDAVVNFFTFSSNKM 158 >gi|268530436|ref|XP_002630344.1| C. briggsae CBR-CLEC-66 protein [Caenorhabditis briggsae] Length = 374 Score = 37.1 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 76/206 (36%), Gaps = 37/206 (17%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---- 227 LD++ V+D S++M + + I ++ + L ++ D + G +T++ Sbjct: 36 LDVVFVVDNSKNMNLYNVYNT------ISALFSPLIQIGTGYDDPRSTRVGFITYNYNAT 89 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST----PGLKYAYNQIFDMQGMRQHCNTE 283 + + + L + L ++I+ LS +S + GL A I QG+R Sbjct: 90 DVADLYMLQSYN--DLNQQIQLLSTTPLSRDDHSYIDSGLGAAIRMINATQGLR------ 141 Query: 284 DANYKKIIVFMT---DGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS--HEFLRA 338 NY+K++VF T D ++ + L K GA V + + Sbjct: 142 -DNYQKVVVFFTSQYDYYYNYPEDQAKVL------KSLGATVITVNTGGDDNVQENLHDK 194 Query: 339 CASPNSFY---LVENPHSMYDAFSHI 361 AS + + A I Sbjct: 195 IASKQMAFTMADGNTTQELTRALLAI 220 >gi|237715741|ref|ZP_04546222.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408749|ref|ZP_06085295.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|229444450|gb|EEO50241.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262353614|gb|EEZ02708.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] Length = 616 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 38/303 (12%), Positives = 94/303 (31%), Gaps = 43/303 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM-------SFRNELRDNGF 103 H ++ A + + G + +N + +F ++ + Sbjct: 115 HETRAAKVMSTAYRAVCPTPGIMYDAANAEEYGEFQENGFKSVSDAPLSTFSIDVDAASY 174 Query: 104 VND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIV 155 N ++ ++ + Y + P+K PW TN R + Sbjct: 175 SNMRRFINKGELPPVDAIRTEELVNYFSYDYPKPTGSD-PVKITMESGACPWNTNHRLVR 233 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + + ++ + +++ ++DVS SM ++D+ S+ ++ ++ V Sbjct: 234 IGLKAK-EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKV 287 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFD 272 V SG ++ ++ I L+ G + K A + Sbjct: 288 AIVTYSGSAG----VKLEATPGSDKQKIREAIDELTAGGSTAGGAGILLAYKIAKKNLIS 343 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R I+ +DG+ N+ + + +K G + +G + Sbjct: 344 NGNNR-------------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGN 390 Query: 332 SHE 334 + Sbjct: 391 YKD 393 >gi|300779171|ref|ZP_07089029.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300504681|gb|EFK35821.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 396 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 69/227 (30%), Gaps = 28/227 (12%) Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQ-TDARLDMMIVLDVSRSMESFFDSSITKIDMAI 199 H+ + T+ + V S D ++ + ++LD S SM+ D + +++ + Sbjct: 22 PENHTVNSNTGNHLTVQKTTLIPVTSTVKDNKIQVALLLDTSNSMDGLIDQAKSRLWNIV 81 Query: 200 KSINAMLEEVKLIPDVNNVVQSGLVTFS-------NKIEEFFLLEWGVSHLQRKIKYLSK 252 ++ + ++ L + N I + L + + K+ L Sbjct: 82 NTLTTL-----KYNGKAPEIEIALYEYGNDGIRDENYIRQVTPLTQDLDLVSEKLFALRT 136 Query: 253 FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYC 312 G S ++ A + + I G + Sbjct: 137 NGGSEYCGAVIRDAAANLNWDSNDKSMKLI-------YIA----GNEAFDQGKINYREVV 185 Query: 313 NEAKKRGAIVYAIGIRVIRS--HEFLRACAS--PNSFYLVENPHSMY 355 ++AK + + I F + AS ++ +++ + Sbjct: 186 SKAKNKNIYINTIFCGSREEGIQTFWQNGASLGGGKYFNIDSDRKVL 232 >gi|16330744|ref|NP_441472.1| hypothetical protein slr1338 [Synechocystis sp. PCC 6803] gi|1653237|dbj|BAA18152.1| slr1338 [Synechocystis sp. PCC 6803] Length = 200 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 46/133 (34%), Gaps = 14/133 (10%) Query: 173 DMMIVLDVSRSMESFFDS-SITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ ++ +S A+ + + V + F Sbjct: 6 DYTLIIDKSGSMSIIEPKFQKSRWELVQESTLALARKCDQLDANGITVYTFSGKFRRYDN 65 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 S +++ + G TN T L+ A N Q + A + I Sbjct: 66 V------NASKVEQIFQENEPVG-GTNLTAVLQDALNNFL------QRKKSNQAPTGETI 112 Query: 292 VFMTDGENLSTKE 304 + +TDGE + Sbjct: 113 LVITDGEPNDRRS 125 >gi|163849593|ref|YP_001637636.1| TadE family protein [Methylobacterium extorquens PA1] gi|163661198|gb|ABY28565.1| TadE family protein [Methylobacterium extorquens PA1] Length = 135 Score = 37.1 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 52/159 (32%), Gaps = 24/159 (15%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 +R+F + +G +++ A+ +P++ ++ +E I + + S + AT Sbjct: 1 MRSFRKDQRGVISVEFALVVPLLITLVFAAVEFGGILYTQAAAESATNNVARQLAT---- 56 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGY 127 RI L V ++ + + P++ Y Sbjct: 57 ----------------NRITQAQAKKAVTPLL---PVWARVNVDVAVTASNATKPESNLY 97 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 +++ F+ + R+I +T + S Sbjct: 98 TVATTIP-MAQAAPTGFLKFIYVDRNISATVTQQQEPTS 135 >gi|327293267|ref|XP_003231330.1| von Willebrand factor [Trichophyton rubrum CBS 118892] gi|326466446|gb|EGD91899.1| von Willebrand factor [Trichophyton rubrum CBS 118892] Length = 308 Score = 37.1 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 56/213 (26%), Gaps = 20/213 (9%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + Y T P + + + +S A Sbjct: 24 SPAQKSTYSNNKSTGSRHSDPPPAYTPVSATSTTAPSFPAASSEGGSLAASPDQPDSFLA 83 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + D + ++D S SM + K+ L ++ I + + +++ Sbjct: 84 QFDTVFLIDDSGSMR----------GASWKATENALAKIAPICTAQDKDGIEIYFLNHRS 133 Query: 231 EE---FFLLEWGVSHLQRKIKYLSKFGV---STNSTPGLKYAYNQIFDMQGMRQHCNTED 284 + + H++ +S G T LK + ++ Sbjct: 134 DSSNGAYSNIRDTGHVREVFTAVSPCGGTPTGTRLNHILKPYLKGFEEASKSMSPDERDN 193 Query: 285 ANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 A I+ +TDG D AKK Sbjct: 194 AVRPLNIIVITDG----VPTDDVDSVIIQTAKK 222 >gi|256379856|ref|YP_003103516.1| magnesium chelatase [Actinosynnema mirum DSM 43827] gi|255924159|gb|ACU39670.1| Magnesium chelatase [Actinosynnema mirum DSM 43827] Length = 680 Score = 37.1 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 47/135 (34%), Gaps = 19/135 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEE 232 ++ V+D S SM ++ ++ ++ ++L + G++TF ++ E Sbjct: 498 VLFVVDASGSM-----AARQRMSAVSGAVISLLR-----DAYQRRDKVGVITFRGSEAET 547 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V + G T GL A ++ + +R ++V Sbjct: 548 ALPPTTSV-DAAAARLRALRTGGRTPLADGLLRA-RRVLQSEKVRDPRRRP------LLV 599 Query: 293 FMTDGENLSTKEDQQ 307 +TDG +T Sbjct: 600 LLTDGRATATGAGGD 614 >gi|149371019|ref|ZP_01890614.1| hypothetical protein SCB49_05020 [unidentified eubacterium SCB49] gi|149355805|gb|EDM44363.1| hypothetical protein SCB49_05020 [unidentified eubacterium SCB49] Length = 210 Score = 37.1 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 56/155 (36%), Gaps = 30/155 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 MM++ D+S S FF + I I A L + GL+ FS+++E + Sbjct: 1 MMLMADISGS--EFFGTDQQFKSEIITEIAATLAFSATQNNDK----IGLILFSDEVELY 54 Query: 234 FLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 + G H+ R I+ L T+ LKY N + KK Sbjct: 55 IPPKKGKFHVLRIIRELIEFKPKSKKTDIANALKYMRNVMK----------------KKA 98 Query: 291 IVF-MTDGENLSTKEDQQSLYYCNEAKK-RGAIVY 323 IVF ++D + + +L G +Y Sbjct: 99 IVFVLSD---FIADDYKDTLKIAANKHDITGIRIY 130 >gi|170732854|ref|YP_001764801.1| membrane protein [Burkholderia cenocepacia MC0-3] gi|169816096|gb|ACA90679.1| membrane protein [Burkholderia cenocepacia MC0-3] Length = 608 Score = 37.1 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 26/61 (42%), Gaps = 2/61 (3%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 IR G +++ AI +I + I V ++FF + + + D + + A + Sbjct: 7 GGTIRRASRQR-GAFSVM-AIIATLIAITTLGAIGVGNLFFQRRDVQRIADMAALAAVQR 64 Query: 65 I 65 + Sbjct: 65 M 65 >gi|116671476|ref|YP_832409.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] gi|116611585|gb|ABK04309.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] Length = 353 Score = 37.1 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 58/163 (35%), Gaps = 18/163 (11%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 N +G ++++ AI L ++ + ++V ++ +T L + D + + A + Sbjct: 16 NKHDGERGAVSVIVAIMLVVLLGFGAVAVDVGMLYAERTQLRNGADAAALAVAQKCAKSA 75 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS--LDIVVVPQNEGY 127 ++ D C +T S N ++G N I+ +T+ + + Q +G Sbjct: 76 PSSS-------DADCSNTSTLAASLANSNANDGRSNIKSIILDTTNRKVTVTAGAQEKGK 128 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 S P + F + S+V S Sbjct: 129 S---------PNEVSLFFARVLGMNSAEVNAPSTVVWGSPEKG 162 >gi|310830511|ref|YP_003965612.1| hypothetical protein PPSC2_p0151 [Paenibacillus polymyxa SC2] gi|309249978|gb|ADO59544.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 187 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 18/109 (16%), Positives = 42/109 (38%), Gaps = 1/109 (0%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 ++ N KG +IL I + I +V+ +++ + + S++D S V A ++ Sbjct: 4 LKQLLKNNKGSFSILFVILILIGVMVVTAFVDILIESWSMQEVQSVMDTSGVSALRASVD 63 Query: 68 EGNG-NNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTS 115 E ++ + K +M ++ + + +I T Sbjct: 64 ETKLRIGEFDVEKSVAVSEYKKLVSMRLKDTKKIKSVHFNPIEIESFTE 112 >gi|238922693|ref|YP_002936206.1| Carbohydrate-Binding Module Family 2 candidate secreted modular protein with two N-terminal CBM2 domains [Eubacterium rectale ATCC 33656] gi|238874365|gb|ACR74072.1| Carbohydrate-Binding Module Family 2 candidate secreted modular protein with two N-terminal CBM2 domains [Eubacterium rectale ATCC 33656] Length = 1186 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 55/157 (35%), Gaps = 21/157 (13%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + D++ V+D + SM + + I+ + S+ + ++ GLV Sbjct: 564 SIIEAGKADVVFVIDTTGSMGNEIQNVKNNIETVVSSL----------EENKVDIRLGLV 613 Query: 225 T----FSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 +++ I +W S + L+ GVS + + ++ + M Sbjct: 614 EYRDIYADGIGSTKSYDWYTS-VSSFKSELATLGVS-GGGDTPESVVDALYCARNMEY-- 669 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK 317 KK ++ +TD + +E ++ Sbjct: 670 ---RTGVKKYVILLTDANYKNGTSVDSGATLTDEIRR 703 >gi|229108230|ref|ZP_04237852.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-15] gi|228675246|gb|EEL30468.1| Von Willebrand factor type A domain protein [Bacillus cereus Rock1-15] Length = 627 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 57/179 (31%), Gaps = 25/179 (13%) Query: 166 SQTDARLDMMI--VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 Q LD+ ++D S SM K++ KS+ E +K + + Sbjct: 424 GQESQELDVAFQLLVDCSGSMY-------NKMEETKKSVVLFHEALKSLKIPH------A 470 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG----MRQH 279 ++ + E + + + Y + N P + + + G + Sbjct: 471 ISGFWEDASSAKPEDKPNVIHEVVNYKNSTLP--NVGPEIMQLREEEDNRDGYIIRIVSE 528 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQ----QSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + K ++ TDGE + Q + A+K G V I I + E Sbjct: 529 KLAKRPEKHKFLLVFTDGEPSALDYQQDGILDTHEAVKLARKSGMEVIGIFIEEGEAKE 587 >gi|170578097|ref|XP_001894265.1| hypothetical protein [Brugia malayi] gi|158599218|gb|EDP36896.1| conserved hypothetical protein [Brugia malayi] Length = 309 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 43/109 (39%), Gaps = 9/109 (8%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + ++D++I LD S S+ + F+ K ++ ++ + + VQ ++ Sbjct: 196 AACKANKVDIVIALDSSGSVFNVFEDE-------RKLVHDLISSLIPAALEDGRVQVSVI 248 Query: 225 TFSN--KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIF 271 F++ + F + + + K+ + G ST + A + Sbjct: 249 RFASSADVVIPFKISQTPNEIMEKVNKIKFTGGSTRIAKVVNLAVTDLL 297 >gi|115757185|ref|XP_784676.2| PREDICTED: similar to parturition-related protein PRP3 [Strongylocentrotus purpuratus] Length = 768 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 66/180 (36%), Gaps = 29/180 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKI 230 +++VLD+S SM + D I+S + V + G+V F +++I Sbjct: 257 SVVLVLDISNSMLGNR-----RFDRMIQSSTVYIMNVIPADS-----KLGIVVFDSTSQI 306 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + QR +K L K +T G+ Y + +I ++ G Sbjct: 307 RANLTDITNTASRQRLVKALPKSPKGTTCIGCGILY-WVRIREVLGSYAQGG-------- 357 Query: 290 IIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 I+ ++DG EN + + ++ K G I I I + L S S + Sbjct: 358 YILLLSDGVENNAPYI----IDIYDDINKSGVIFDTITISNAADQQMEDLSKNTSGKSSF 413 >gi|323451137|gb|EGB07015.1| hypothetical protein AURANDRAFT_65249 [Aureococcus anophagefferens] Length = 264 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 56/153 (36%), Gaps = 16/153 (10%) Query: 155 VMPITSSVKVNSQTDARLDMMIVLDVSRSME--SFFDSSITKIDMAIKSINAMLEEVKLI 212 + + ++ + + + D++++ D S SM S F + T+ D ++ ++ + + ++ Sbjct: 34 ELDLPEAMVSDLLSVSGADIVVIADDSGSMNAISDFKAHTTRWDELKNTLTSLAKMLLVV 93 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 + V F N W H ++ L +TP + + D Sbjct: 94 DHADGFH----VQFLNDA------TWHEIHRPEEVVRLFTGRSPHGTTPLMGR-MQPVLD 142 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKED 305 H A +++ MTDG T D Sbjct: 143 GS---WHPKGHGAETDVLVLVMTDGVPSDTSFD 172 >gi|219849983|ref|YP_002464416.1| magnesium chelatase ATPase subunit D [Chloroflexus aggregans DSM 9485] gi|219544242|gb|ACL25980.1| magnesium chelatase ATPase subunit D [Chloroflexus aggregans DSM 9485] Length = 636 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 51/166 (30%), Gaps = 21/166 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 L+ P I + ++ A ++D S SM ++ Sbjct: 409 RLRASDHPPHQHRRGRIHLRAEDLHIKKYRSKAGTLFCFLVDASGSMALH------RMRQ 462 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A ++N++L++ + D Q L+ F + + V +R + L G Sbjct: 463 AKGAVNSLLQQAYVHRD-----QVALLAFRGERADLLLPPSQSVELAKRALDVLPT-GGG 516 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 T L AY ++ IV +TDG Sbjct: 517 TPLAAALLAAYQI--------SEQARARGIFRTTIVLITDGRPNVP 554 >gi|78066138|ref|YP_368907.1| membrane protein [Burkholderia sp. 383] gi|77966883|gb|ABB08263.1| membrane protein [Burkholderia sp. 383] Length = 603 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 10/56 (17%), Positives = 27/56 (48%), Gaps = 1/56 (1%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 F +G +++ AI +I + +I V ++F+ + + + D + + A ++ Sbjct: 11 RFSSRQRGAFSVM-AIVALLIAVTTLGVIGVGNLFYQRRDVQRIADMAALAAVQRM 65 >gi|332185631|ref|ZP_08387379.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] gi|332014609|gb|EGI56666.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] Length = 420 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 54/164 (32%), Gaps = 21/164 (12%) Query: 21 ILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGG 80 ++ A+ LP++ +GM ++ + S ++ AA ++ N + Sbjct: 1 MMFALALPVLTCSIGMGVD----YARAAKAQSKLNAIADAAALLAVS----KNAMRADDA 52 Query: 81 DILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLK 140 ++ +++ ++ +G + T + +A+ Y+ + Sbjct: 53 TAAYFARSFFSLQSAALVKSDGITLSNVTVQAPTDGNGRR---------TAVVNYRATSE 103 Query: 141 FCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + + S +D ++LDVS SM Sbjct: 104 --NVFARILGMSTLTISGKSETANAI--APDIDFYMLLDVSASM 143 >gi|317123703|ref|YP_004097815.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315587791|gb|ADU47088.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 428 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 40/241 (16%), Positives = 86/241 (35%), Gaps = 34/241 (14%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLD-----MMIVLDVSRSMESFFDSSITKIDM 197 F+P H ++ ITSS + + +D +I++D S SM + Sbjct: 12 EFLPDGGTDVHAIVRITSSGASAATSMGVVDPGSAAEVILIDTSGSMG--RRGVMEAGHA 69 Query: 198 AIKSINAMLE-EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS 256 A ++N +++ + + N Q + ++++ + + +R ++ L G + Sbjct: 70 AKAALNEIVDGTLFAVVSGNGTAQI-VYPYAHQGALAVMSSHTRAEARRAVEGLRAQGGT 128 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLS--TKEDQQSLYYCNE 314 + + H K+ + +TDG N Q ++ C Sbjct: 129 -----AMGTWLTLARQIFETVPHVQ------KRHAILLTDGVNEGETPASLQAAVQSC-- 175 Query: 315 AKKRGAIVY---AIGIRVIRSHEFLRACASP--NSFYLVENPHSMYDAFSHIGKDIVTKR 369 V+ A G+ E +RA AS + LV P + F + + +++ Sbjct: 176 -----VGVFQCDARGVGDQWRVEEVRAIASALLGTVGLVPEPEELAAEFQAMMRASMSRG 230 Query: 370 I 370 + Sbjct: 231 V 231 >gi|323345323|ref|ZP_08085546.1| von Willebrand factor [Prevotella oralis ATCC 33269] gi|323093437|gb|EFZ36015.1| von Willebrand factor [Prevotella oralis ATCC 33269] Length = 290 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 42/102 (41%), Gaps = 9/102 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +M+++DVS S++ + + ++ + G++ F Sbjct: 72 EEERELTVMLLIDVSGSLDFGTRRQFKRDMATEIAATIAFSAIQNND------KIGVIFF 125 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 S++IE++ + G H+ I+ + T+ ++Y Sbjct: 126 SDRIEKYIPPKKGRKHILYIIREMLDFHPQSKRTDIGAAIEY 167 >gi|158512138|gb|ABW69099.1| structural toxin protein [Legionella pneumophila] Length = 4669 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 15/152 (9%) Query: 155 VMPITSSVKVNS--QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P+ S++ + A ++M++LD S SM + I I S +LE + + Sbjct: 3402 DGPVASNITRSGLANEGADTNLMLILDTSGSMAG------SGIQTLINSTLELLERYEAL 3455 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +V V+ VTF+ + W + G +TN L A N Sbjct: 3456 GNVK--VRI--VTFNTSATAIGSV-WMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNS 3510 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 A F++DG ++ Sbjct: 3511 GTVGGADGRIGGAQNVSY--FISDGNPTVNQD 3540 >gi|118388807|ref|XP_001027499.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89309269|gb|EAS07257.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1029 Score = 37.1 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 47/334 (14%), Positives = 105/334 (31%), Gaps = 41/334 (12%) Query: 38 IEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK--KLKGGDILCRIKNTWNMSFR 95 I++ HI + + + + + ++ N+ +L+ + Sbjct: 47 IQILHISLEENKFITPNKQQSLQTDKKSLDNSVTKNQTPEQLQSDKKQNNLITVPKELED 106 Query: 96 NELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIV 155 E++ N V D+V ++ L + + + + ++ + F + + Sbjct: 107 EEVKQNLQVQPPKDVVINSQLPVKIQDLIKSHY----NQCQESANFISNFYQLSEDMKQF 162 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 +PI ++ + LD+M ++D + SM S+ I I ++ +K DV Sbjct: 163 LPIYQNILPRME----LDLMFIMDCTGSMSSW-------IQAVKDEILCIIATIK---DV 208 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 N + V+ I F + ++ + G K+A Q + Q Sbjct: 209 NKGYRDYGVSQRYSIFNFSDDMDEFEKFLINVNAIANSDTPEDVAGGFKHANLQQWKSQA 268 Query: 276 MRQ-HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 A+ K+ +D + K+G +YAI I Sbjct: 269 KYAVFIADCPAHGKEYNNDYSDRYPDGDPDGIDLKQEFKNLIKKGVKLYAIQIT------ 322 Query: 335 FLRACASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 M++ F K++ + Sbjct: 323 --------------NQTKQMFELFKQYNKEVCGQ 342 >gi|118594950|ref|ZP_01552297.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] gi|118440728|gb|EAV47355.1| von Willebrand factor, type A [Methylophilales bacterium HTCC2181] Length = 318 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 73/227 (32%), Gaps = 37/227 (16%) Query: 160 SSVKVNSQTDARLDMMIVLDVSRSM-ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S ++ ++ ++D S SM + F K ++ + +L++ + Sbjct: 68 SKSTSITEIGKGAQLVFLIDRSVSMAKPFIGDDDNKSEIKSLAARRILKDFIDQRPSD-- 125 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMR 277 G+V FSN + S+ I + ++ TN G+ +++ Sbjct: 126 -MIGIVGFSNSALYASKITKNRSYTYAAIDAATGSSINQTNIGSGITSGLFMFSEIETTG 184 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS----- 332 +V ++DG +K ++ ++ +Y I I+ Sbjct: 185 SQA----------LVLLSDGAGKISKRVKE--RIAEMLNEKKINLYWIIIKEPNDVSLFS 232 Query: 333 ---------------HEFLRACASPNSFYLVENPHSMYDAFSHIGKD 364 F ++ + Y ENP ++ A + I + Sbjct: 233 GNTYLEGREPTVIKLDNFFKSLKTEYKAYEAENPDALSSAIADIDQK 279 >gi|107022601|ref|YP_620928.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116689550|ref|YP_835173.1| membrane protein [Burkholderia cenocepacia HI2424] gi|105892790|gb|ABF75955.1| membrane protein [Burkholderia cenocepacia AU 1054] gi|116647639|gb|ABK08280.1| membrane protein [Burkholderia cenocepacia HI2424] Length = 603 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 26/61 (42%), Gaps = 2/61 (3%) Query: 5 NLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQ 64 IR G +++ AI +I + I V ++FF + + + D + + A + Sbjct: 7 GGTIRRASRQR-GAFSVM-AIIATLIAITTLGAIGVGNLFFQRRDVQRIADMAALAAVQR 64 Query: 65 I 65 + Sbjct: 65 M 65 >gi|326680027|ref|XP_001336637.4| PREDICTED: von Willebrand factor A domain-containing protein 5A-like [Danio rerio] Length = 729 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 58/196 (29%), Gaps = 35/196 (17%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSS--ITKIDMAIKSINAMLEEVKLIPDV 215 V + T + V+D S SM +I+ A ++ +L+ + + Sbjct: 265 FPEEVMSSMATQGEF--VFVMDRSGSMSGMMRGKEAQNRIESAKDTLLLLLKSLPMGCYF 322 Query: 216 NNVVQSGLVTFSNKIEEFFLLE--WGVSHLQRK---IKYLSKFGVSTNSTPGLKYAYNQI 270 N F ++ + FF + + + +K + T LK+ Y+Q Sbjct: 323 NIY------GFGSRFKSFFPKSVEYNQDKMDQALKEVKEMRADMGGTEILQPLKHIYSQ- 375 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKK--RGAIVYAIGIR 328 + + + TDG + + K ++ GI Sbjct: 376 -----------PCIPEHPRQLFIFTDG------AVGNTKEVLDLVKSHAHSHRCFSFGIG 418 Query: 329 VIRSHEFLRACASPNS 344 S + A S Sbjct: 419 EGASTALITGLAREGS 434 >gi|296106233|ref|YP_003617933.1| hypothetical protein lpa_01009 [Legionella pneumophila 2300/99 Alcoy] gi|295648134|gb|ADG23981.1| hypothetical protein lpa_01009 [Legionella pneumophila 2300/99 Alcoy] Length = 4669 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 15/152 (9%) Query: 155 VMPITSSVKVNS--QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P+ S++ + A ++M++LD S SM + I I S +LE + + Sbjct: 3402 DGPVASNITRSGLANEGADTNLMLILDTSGSMAG------SGIQTLINSTLELLERYEAL 3455 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +V V+ VTF+ + W + G +TN L A N Sbjct: 3456 GNVK--VRI--VTFNTSATAIGSV-WMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNS 3510 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 A F++DG ++ Sbjct: 3511 GTVGGADGRIGGAQNVSY--FISDGNPTVNQD 3540 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 85/256 (33%), Gaps = 32/256 (12%) Query: 111 VRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDA 170 + NE S YK+ + + I + + Sbjct: 99 KEDAKQNTNKYDLNEKLYFEVRSLYKMGKLLNSRTQYLPGIVSIKALDQAVTQNQKNQRV 158 Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D++ ++D+S SM KI+M S+ +L+ + + L+TF N Sbjct: 159 GVDLICLIDISGSM------IGVKIEMVKASLIVLLQFLGDND------RLQLITFDNDA 206 Query: 231 EEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 L+ S+ + IK + G + + K A+ Q+ + + Sbjct: 207 HRLTPLKTVTNQNKSYFTQIIKQIKANGGN-RISEATKMAFYQL------KSRKYINNVT 259 Query: 287 YKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV-IRSHEFLRACA-SPNS 344 + ++DG + + E + + NE ++ G + + C S Sbjct: 260 S---VFLLSDGVDYTYPEVKNQIQTVNEV----FTLHTFGFGEDHDAQMMTQLCNLKSGS 312 Query: 345 FYLVENPHSMYDAFSH 360 FY V++ + + F+ Sbjct: 313 FYFVQDVTLLDEFFAD 328 >gi|300866150|ref|ZP_07110871.1| stress protein (modular protein) [Oscillatoria sp. PCC 6506] gi|300335862|emb|CBN56031.1| stress protein (modular protein) [Oscillatoria sp. PCC 6506] Length = 481 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 48/138 (34%), Gaps = 7/138 (5%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + + LD+S SM S ++S KI + I A+ ++ G + Sbjct: 259 VALCLDISSSMTSLYNSG--KIQRLAEKILALGCRFD-DDGSIDIFLFGANAHNPGEMTV 315 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + ++ + T +K N F + + A+ ++F Sbjct: 316 DNFPNFIHNILHQ----YPLEGWTYYGKVMKEIRNFYFPDGHGQIRNSPIAADRPVYVMF 371 Query: 294 MTDGENLSTKEDQQSLYY 311 +TDG ++ +Q L + Sbjct: 372 VTDGATADEQDTKQHLKW 389 >gi|145500000|ref|XP_001435984.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403121|emb|CAK68587.1| unnamed protein product [Paramecium tetraurelia] Length = 981 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 32/200 (16%), Positives = 59/200 (29%), Gaps = 34/200 (17%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVN---NVVQSGLVTFSNKIEE 232 +D S SM S +I A +S+ L + N N++ G + S + Sbjct: 352 FFIDRSGSM------SGGRIKKAKQSLILFLRSL----PDNCRFNIISFGTMFRSLWSDS 401 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA-YNQIFDMQGMRQHCNTEDANYKKII 291 + + + + + T + YN + I Sbjct: 402 KQYSQDTLDEAIKHVNAMEANMQGTEIFKPFQDVIYNNQYGKSKTTTLN----------I 451 Query: 292 VFMTDGENLSTKEDQQSLYYCNEAKKRGAI--VYAIGIRVIRSHEFLRACASPNS--FYL 347 +TDGE + A VY +GI S ++ A + F Sbjct: 452 FLLTDGEVDVFPIIDLVK------RNNKAETRVYTLGIGEGCSQYLIKNLADVGNGKFQF 505 Query: 348 VENPHSMYDAFSHIGKDIVT 367 V + + + +D +T Sbjct: 506 VADDEDINAKVIDLLEDSMT 525 >gi|115964593|ref|XP_001196489.1| PREDICTED: similar to calcium-activated chloride channel family member 1 [Strongylocentrotus purpuratus] Length = 843 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 37/180 (20%), Positives = 66/180 (36%), Gaps = 29/180 (16%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKI 230 +++VLD+S SM + D I+S + V + G+V F +++I Sbjct: 332 SVVLVLDISNSMLGNR-----RFDRMIQSSTVYIMNVIPADS-----KLGIVVFDSTSQI 381 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + QR +K L K +T G+ Y + +I ++ G Sbjct: 382 RANLTDITNTASRQRLVKALPKSPKGTTCIGCGILY-WVRIREVLGSYAQGG-------- 432 Query: 290 IIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF--LRACASPNSFY 346 I+ ++DG EN + + ++ K G I I I + L S S + Sbjct: 433 YILLLSDGVENNAPYI----IDIYDDINKSGVIFDTITISNAADQQMEDLSKNTSGKSSF 488 >gi|85859137|ref|YP_461339.1| flp pilus assembly protein family protein [Syntrophus aciditrophicus SB] gi|85722228|gb|ABC77171.1| flp pilus assembly protein family [Syntrophus aciditrophicus SB] Length = 146 Score = 37.1 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 53/143 (37%), Gaps = 6/143 (4%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI--- 65 +KG + AI LPI+ +++ IIE + + K VL + I Sbjct: 7 MRKLREHKGAAVVEFAIVLPILLVLVFGIIEFGILIYNKQVLTNASREGARAGIVYIDGT 66 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 GN I+ N + ++F + + N D++ S D ++V + Sbjct: 67 SRVPAGNASNSDTIKGIVNDYANDYLITFGSSIPLNT---DVEFPEGQDSGDPLIVTVSY 123 Query: 126 GYSISAISRYKIPLKFCTFIPWY 148 GYS P+K T + + Sbjct: 124 GYSFILFPVTDYPIKARTVMKYE 146 >gi|332288899|ref|YP_004419751.1| hypothetical protein UMN179_00824 [Gallibacterium anatis UMN179] gi|330431795|gb|AEC16854.1| conserved hypothetical protein [Gallibacterium anatis UMN179] Length = 212 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 49/136 (36%), Gaps = 20/136 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + ++ ++ P ++TF Sbjct: 3 RLPVYLLVDTSGSM------MGEAIEAVRNGLQMLVSALRQDPYALETAYLSVITFDTDA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 ++ L + ++ + GV+ + A + + D + + A K Sbjct: 57 KQVTPL---TELMSFQMPDIQASGVT-----AMGEALSLLVDC--INREVQKGSAEVKGD 106 Query: 289 -KIIVF-MTDGENLST 302 K +VF ++DG Sbjct: 107 WKPVVFLLSDGLPTDD 122 >gi|325673441|ref|ZP_08153132.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] gi|325555462|gb|EGD25133.1| hypothetical protein HMPREF0724_10914 [Rhodococcus equi ATCC 33707] Length = 321 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 5/45 (11%), Positives = 19/45 (42%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI 54 + +G + + AI + ++ + +++ + +K L + Sbjct: 7 RMSNDERGVVAVFVAILMVVLLGCAAISVDIGANYVVKRQLQNGA 51 >gi|156370957|ref|XP_001628533.1| predicted protein [Nematostella vectensis] gi|156215512|gb|EDO36470.1| predicted protein [Nematostella vectensis] Length = 154 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 43/136 (31%), Gaps = 22/136 (16%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 +D+++ LD S S+ + + I + + + + + GL+ F + Sbjct: 33 QKPVDLVLALDSSESVGAENWMKM------INATIKFVNQFDV-----RYTRFGLIQFHS 81 Query: 229 KIEEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 E +L L I + ST + + M+ + Sbjct: 82 SPEVPIVLSSFPDNKTLSETIAKVYYKAGSTRT---------DLAAMKVVEVFKAASHRK 132 Query: 287 YKKIIVFMTDGENLST 302 K+ VF+ G + Sbjct: 133 ASKVAVFILGGLSTDV 148 >gi|229828671|ref|ZP_04454740.1| hypothetical protein GCWU000342_00737 [Shuttleworthia satelles DSM 14600] gi|229793265|gb|EEP29379.1| hypothetical protein GCWU000342_00737 [Shuttleworthia satelles DSM 14600] Length = 625 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 52/161 (32%), Gaps = 24/161 (14%) Query: 139 LKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMA 198 L+ ++ + + Q D++ VLD S + + + Sbjct: 19 LQKTAEELDESDESKVTLSFPGK-----QDALASDVVFVLDKSGA---------SAQEDI 64 Query: 199 IKSINAMLEEVKLIPDVNNV-VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYL-SKFGVS 256 LE +K + ++ G+V F+ L V+ I + S+ + Sbjct: 65 YAQARKFLENIKSTAQEKGLNIKVGVVLFNRIGNVKQPLTDVVTGYTAIIDAMNSRVSMG 124 Query: 257 TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 TN GL A + D + K +V ++DG Sbjct: 125 TNMHAGLLAAQKMLDDDTEVLSSN--------KHVVLISDG 157 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 37.1 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 43/224 (19%), Positives = 73/224 (32%), Gaps = 29/224 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + + L + I+LD S S + + A K + +L + V L Sbjct: 105 TANANQPLTLGILLDTSGSQQDVLPL---EKQSASKFLRDVLRPKDQAFVLTFDVDVNLA 161 Query: 225 T-FSNKIE------EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMR 277 F+N I + + I L + V T P Y+ + + Sbjct: 162 QDFTNDIPLLDHALQQAQINTAGGGGSGGIPGLGQGPVPTVGDPKGTVLYDAVAQAANDK 221 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR------ 331 T K ++ +TDGE+L + L +A+K IVY I I Sbjct: 222 LREQTGR----KALILLTDGEDLG--SATKPLQAIADAQKANTIVYVILIADRGFYGGYT 275 Query: 332 ----SHEFLRACA--SPNSFYLV-ENPHSMYDAFSHIGKDIVTK 368 +R A + V N + AF I +++ T+ Sbjct: 276 FGYTGDAQMRRLAEATGGRMINVGNNGAKLTAAFKEIARELRTQ 319 >gi|154250681|ref|YP_001411505.1| TadE family protein [Parvibaculum lavamentivorans DS-1] gi|154154631|gb|ABS61848.1| TadE family protein [Parvibaculum lavamentivorans DS-1] Length = 187 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 15/145 (10%), Positives = 50/145 (34%), Gaps = 6/145 (4%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI-DRSLVHAATQI 65 ++ F N G + A+ P++ IE +++ + S+ + + A Sbjct: 15 GLKRFLRNCAGIAAVEFALIFPVMIAFYFGSIETTNMLTANRRVTSVAYTAADITAQATS 74 Query: 66 MNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV-----NDIDDIVRSTSLDIVV 120 ++ + + + + + + + + +D +I ++ V Sbjct: 75 ISNSDLADIFAASSAILAPFSTTPLKVRITSVVAYSSNIAKVAWSDGLNIAPRSTGSTVS 134 Query: 121 VPQNEGYSISAISRYKIPLKFCTFI 145 +P + S++ ++ + + I Sbjct: 135 LPSGLTTAGSSVIMAEVTYSYVSPI 159 >gi|227822736|ref|YP_002826708.1| hypothetical protein NGR_c21950 [Sinorhizobium fredii NGR234] gi|227341737|gb|ACP25955.1| hypothetical protein NGR_c21950 [Sinorhizobium fredii NGR234] Length = 264 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 59/183 (32%), Gaps = 22/183 (12%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVL--DVSRSMESFFDSSITKIDMAIKSINAMLEEVKL 211 ++M ++ S + +D+ +VL D+S SM+ + + VK Sbjct: 7 LIMALSGSPMQTAAAPPNVDVELVLAVDMSGSMDMEEARVQRSGYVQALRHPEFINAVKG 66 Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEW----GVSHLQRKIKYLSKFGVSTNSTPGLKYAY 267 + G ++ + E ++ W G + + + T + A Sbjct: 67 GYL--GRIAIGYFEWAGMVNEASVVTWQVIEGAEDAETFAARIEARPIGTRRGTSISNAI 124 Query: 268 ---NQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYA 324 + D +++I DG N + + NEA RG I+ Sbjct: 125 IFGTSLIDTNAFSG--------ARRVIDISGDGPNNTGPPVAPAR---NEAVSRGIIING 173 Query: 325 IGI 327 + I Sbjct: 174 LAI 176 >gi|309364360|emb|CAP25015.2| CBR-CLEC-66 protein [Caenorhabditis briggsae AF16] Length = 375 Score = 37.1 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 32/165 (19%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS---- 227 LD++ V+D S++M + + I ++ + L ++ D + G +T++ Sbjct: 34 LDVVFVVDNSKNMNLYNVYNT------ISALFSPLIQIGTGYDDPRSTRVGFITYNYNAT 87 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST----PGLKYAYNQIFDMQGMRQHCNTE 283 + + + L + L ++I+ LS +S + GL A I QG+R Sbjct: 88 DVADLYMLQSYN--DLNQQIQLLSTTPLSRDDHSYIDSGLGAAIRMINATQGLR------ 139 Query: 284 DANYKKIIVFMT---DGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 NY+K++VF T D ++ + L K GA V + Sbjct: 140 -DNYQKVVVFFTSQYDYYYNYPEDQAKVL------KSLGATVITV 177 >gi|313672125|ref|YP_004050236.1| von willebrand factor type a [Calditerrivibrio nitroreducens DSM 19672] gi|312938881|gb|ADR18073.1| von Willebrand factor type A [Calditerrivibrio nitroreducens DSM 19672] Length = 1174 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 48/148 (32%), Gaps = 12/148 (8%) Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 A+L ++ + + + G V FS+ + L + I G STN+ Sbjct: 215 GAVLGVLQKLSNEPRNPRVGAVLFSSNNISVIKPSYDYISLIKAINDTKA-GGSTNTKGA 273 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL-------STKEDQQSLYYCNEA 315 + N + Q + K + ++DGE + ++ ++ N Sbjct: 274 IDTISNY-YKSTEAYQFDSNVVPCAKNFAIVVSDGEWNVGGDPLPTIRDMWKTDLMGNLT 332 Query: 316 KKRGAIVYAIGI---RVIRSHEFLRACA 340 K+ Y + + L+ A Sbjct: 333 GKQNVKTYTLAMFMDSSSNGTRALKHMA 360 >gi|304404942|ref|ZP_07386602.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304345821|gb|EFM11655.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 433 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 55/188 (29%), Gaps = 23/188 (12%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF----S 227 +D+M+V+D + SM ++ + ++ V + ++ + Sbjct: 225 VDLMLVVDTTGSMSD-------ELRYLEAELKDVVTRVSEQNNGQLDIRVSSNFYRDEHD 277 Query: 228 NKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANY 287 + + F V +I FG A + E Sbjct: 278 DYVVRPFPFTRDVDKAVDQIADQEAFGGGD-----FPEAVELALSNAIEEHEWSKEALAR 332 Query: 288 KKIIVFMTDGENLSTKEDQQSLY-YCNEAKKRGAIVYAIGIRV--IRSHEFLRACA--SP 342 +V D + + L +A ++G + I I + +R A + Sbjct: 333 LMFVVL--DAPPHHEPQIMKKLQKLTAKAAEQGIRIIPIASSGVDIATEHLMRYIAVSTG 390 Query: 343 NSFYLVEN 350 ++ + + Sbjct: 391 GTYVFLTD 398 >gi|149064977|gb|EDM15053.1| rCG28280 [Rattus norvegicus] Length = 106 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 11/78 (14%) Query: 250 LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSL 309 + G T + L+ A N +F KK+ + +TDG+ +++ ++ Sbjct: 1 MQYLGEGTYTATALQAA-NDMF---------KEARPGVKKVALVITDGQ-TDSRDKRKLA 49 Query: 310 YYCNEAKKRGAIVYAIGI 327 +A ++ IG+ Sbjct: 50 DVVKDANDSKVEIFVIGV 67 >gi|309364363|emb|CAP25017.2| CBR-CLEC-62 protein [Caenorhabditis briggsae AF16] Length = 393 Score = 37.1 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 51/155 (32%), Gaps = 17/155 (10%) Query: 212 IPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH---LQRKIKYLSKFGVSTNSTPG--LKYA 266 V+ GLVT++N+ L S + L K S L A Sbjct: 75 AYSDKRTVRVGLVTYNNQATVQADLNRFQSADDLFNSVFQILPKLSASDEVYLAKGLDAA 134 Query: 267 YNQIFDMQGMRQHCNTEDANYKK-IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + N +NYK+ ++++ +D + ++ K G + + Sbjct: 135 ESVLSAG-----RKNATRSNYKQLVLIYASDYRDDG---EEDPRPTAERMKSSGVSIATV 186 Query: 326 GIRVIRSHEFLRA---CASPNSFYLVENPHSMYDA 357 + ++A ASP + E+ + + Sbjct: 187 AFDQTGNEGVVKAIGEIASPGFNFTNEDADLVREI 221 >gi|296270453|ref|YP_003653085.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296093240|gb|ADG89192.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 690 Score = 37.1 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 44/138 (31%), Gaps = 17/138 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D S SM + ++ ++ ++L + + GLVTF E Sbjct: 515 VLFVVDASGSM-----GARRRMTAVKTAVLSLL-----LDAYQRRDKVGLVTFRGTAAEV 564 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 L G + GL A ++ + +R ++V Sbjct: 565 PLPPTSSVEAGAARLRALATGGRSPLGAGLAKA-AEVLRAERLRDPARRP------LLVL 617 Query: 294 MTDGENLSTKEDQQSLYY 311 +TDG ++ Sbjct: 618 VTDGRATDGDAADRAARL 635 >gi|114705831|ref|ZP_01438734.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] gi|114538677|gb|EAU41798.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] Length = 548 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 8/40 (20%), Positives = 18/40 (45%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID 55 +G + + A+ LP+I G ++ + + L + D Sbjct: 14 RGNIAVSFALLLPLILGFGGFATDLGLAYLERRSLQAATD 53 >gi|15900378|ref|NP_344982.1| cell wall surface anchor family protein [Streptococcus pneumoniae TIGR4] gi|284055428|pdb|2WW8|A Chain A, Structure Of The Pilus Adhesin (Rrga) From Streptococcus Pneumoniae gi|14971932|gb|AAK74622.1| cell wall surface anchor family protein [Streptococcus pneumoniae TIGR4] Length = 893 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 58/134 (43%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 201 DNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 260 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ + + LVT+++ I + + K L+ + ++ +T Sbjct: 261 KITSDSEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 316 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 317 KDYSYLKLTNDKND 330 >gi|260845839|ref|YP_003223617.1| putative tellurium resistance protein TerF [Escherichia coli O103:H2 str. 12009] gi|257760986|dbj|BAI32483.1| putative tellurium resistance protein TerF [Escherichia coli O103:H2 str. 12009] Length = 413 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 126 GYSISAISRYKIPLKFCTFIPWY----TNSRHIVMPITSSVKVNSQTDARLD--MMIVLD 179 Y + P K + R + + +SV + L+ + VLD Sbjct: 160 NYGVDVAQPAPQPAKPARISLEKKLETRSPRLVSLAKKASVSLTKNKLDTLEAAVAFVLD 219 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM F ++S+ + + D + + +K L+ Sbjct: 220 ASGSMSGQFSKGN------VQSVLDRIAVLAAQFDDDGEMDVWGFGEKHKKYPNVTLDNL 273 Query: 240 VSHLQRKIKY--------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +++Q L G + N P ++ + D + + Sbjct: 274 DTYIQSIRGAGKRSSWENLPGLGGTNNEPPVMEEIVDYFKDSK------------IPVYV 321 Query: 292 VFMTDGENLSTKEDQQSL 309 VF+TDG T+ + ++ Sbjct: 322 VFITDGGISKTRAIKDAI 339 >gi|237721534|ref|ZP_04552015.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|293369166|ref|ZP_06615760.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|229449330|gb|EEO55121.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292635749|gb|EFF54247.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] Length = 616 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 62/173 (35%), Gaps = 20/173 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 ++ + +++ ++DVS SM ++D+ S+ ++ ++ V V +G Sbjct: 240 EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAG 294 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 ++ L ++ I L+ G + L AY +I + N Sbjct: 295 SAG----VKLEATLGSDKQKIREAIDELTAGGSTAGGAGILL-AY-KIAKKNFISNGNNR 348 Query: 283 EDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 I+ +DG+ N+ + + +K G + +G + + Sbjct: 349 --------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGNYKD 393 >gi|148360702|ref|YP_001251909.1| hypothetical protein LPC_2649 [Legionella pneumophila str. Corby] gi|148282475|gb|ABQ56563.1| hypothetical protein LPC_2649 [Legionella pneumophila str. Corby] Length = 6289 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 15/152 (9%) Query: 155 VMPITSSVKVNS--QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI 212 P+ S++ + A ++M++LD S SM + I I S +LE + + Sbjct: 5022 DGPVASNITRSGLANEGADTNLMLILDTSGSMAG------SGIQTLINSTLELLERYEAL 5075 Query: 213 PDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFD 272 +V V+ VTF+ + W + G +TN L A N Sbjct: 5076 GNVK--VRI--VTFNTSATAIGSV-WMTVDAAKNALLGLTAGGNTNFDAALITAMNAFNS 5130 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 A F++DG ++ Sbjct: 5131 GTVGGADGRIGGAQNVSY--FISDGNPTVNQD 5160 >gi|89094019|ref|ZP_01166963.1| putative calcium-binding outer membrane-like protein [Oceanospirillum sp. MED92] gi|89081693|gb|EAR60921.1| putative calcium-binding outer membrane-like protein [Oceanospirillum sp. MED92] Length = 2550 Score = 37.1 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 43/305 (14%), Positives = 91/305 (29%), Gaps = 49/305 (16%) Query: 59 VHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDI 118 + I ++G + +L N+ +F ++ + +V I + + Sbjct: 1534 LSGPAGISSDGEAVEWHYDQATGVLTGFTNSAGQAFS-DVPASDYVLTI-----ALDPVV 1587 Query: 119 VVVPQNE-GYSISAISRYKIPLKFCTFIPWYTNSRHIVMP----ITSSVKVNSQTDARLD 173 + GY++ ++ P+ + + + T+++ V + D + Sbjct: 1588 SNGGNHSIGYTLDLLAPVDHPVNSIEDVLTAQFTVSVTDESANTTTANLTVGIEDDKPIS 1647 Query: 174 ---------------MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 +M+VLD S SM ++ S+ MLE + Sbjct: 1648 GDISDTIIVPETKANVMLVLDFSGSMRG------AELTQMKASVIEMLE----AYENTGQ 1697 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 V +V+FS+ W +S T+S G Y+ + Sbjct: 1698 VAVQIVSFSSGATTRNDGGW-----ISASDAISYISGLTDSNMGGLTDYDAALAQAQLAF 1752 Query: 279 HCNTEDANYKKII-VFMTDGENLSTKE-------DQQSLYYCNEAKKRGAIVYAIGIRVI 330 +T I F++DG+ + + N YA+G Sbjct: 1753 AESTGKIAQGNNISYFLSDGQPTDDNGTGSDGIVGAEITAWENFVTSNDIDSYAVGFAGA 1812 Query: 331 RSHEF 335 + Sbjct: 1813 NINAL 1817 >gi|330811037|ref|YP_004355499.1| hypothetical protein PSEBR_a4091 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327379145|gb|AEA70495.1| Hypothetical protein; putative exported protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 426 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 18/54 (33%) Query: 39 EVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM 92 + H+ KT L + +D + + A + G N L + + Sbjct: 21 DGGHMLLNKTRLQNAVDAAALGGAKTLSQVTGGMNMASTTRAAALDTLSRNASA 74 >gi|154503613|ref|ZP_02040673.1| hypothetical protein RUMGNA_01437 [Ruminococcus gnavus ATCC 29149] gi|260589891|ref|ZP_05855804.1| putative von Willebrand factor type A domain protein [Blautia hansenii DSM 20583] gi|153795713|gb|EDN78133.1| hypothetical protein RUMGNA_01437 [Ruminococcus gnavus ATCC 29149] gi|260539698|gb|EEX20267.1| putative von Willebrand factor type A domain protein [Blautia hansenii DSM 20583] Length = 709 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 34/293 (11%), Positives = 84/293 (28%), Gaps = 51/293 (17%) Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG----FVNDIDDI-VRSTSLDIVVVP 122 E N+ ++ + +N + +L+ + + N I V ++ V Sbjct: 400 ENAANDLARILNDVATEKAQNDYEQELTEDLQKSADEIHYGNAHAGIHVTIHRINPVSDF 459 Query: 123 QNEGYSISAIS----RYKIPLKFCTFIP---------WYTNSRHIVMPITSSVKVN---- 165 + Y A ++ + + + M Sbjct: 460 FIKSYQTVASPLLRTSKRLQSSILPLLKEEAEGGKQKNLLFGKRLDMRALHRKDGGIFTR 519 Query: 166 ---SQTDARLDMMIVLDVSRSMES-FFDSSITKIDMAIKSINAMLEEVKLIP-DVNNVVQ 220 + +L + +++D S SM + K + + L I + Sbjct: 520 TRLPDEEQKLSVGLLVDESGSMGWGDRITHARKTAIVLYDFCTSLGIPITIYGHSTDSKG 579 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHC 280 L +++ + + + + L+Y +H Sbjct: 580 VALYSYAEFDSLDASDRYRLMDMSAR--------NGNRDGAALRYV----------AEHL 621 Query: 281 NTEDANYKKIIVFMTDGEN-----LSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +K+++ ++DG+ T+ + NE +KRG +++A I Sbjct: 622 AK-RPESQKLLIIISDGQPADCGYSGTEAEADLRGIKNEYRKRGIVIFAAAIG 673 >gi|299132281|ref|ZP_07025476.1| TadE family protein [Afipia sp. 1NLS2] gi|298592418|gb|EFI52618.1| TadE family protein [Afipia sp. 1NLS2] Length = 181 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 27/53 (50%), Gaps = 1/53 (1%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHS-MIDRSLV 59 R F N +G + A+ P+ F +L I+EV+ +FF VL + D + + Sbjct: 15 FRRFGMNRRGSAAVQFAMVAPLFFALLFAIVEVAMMFFATQVLETGTQDTARL 67 >gi|295087036|emb|CBK68559.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bacteroides xylanisolvens XB1A] Length = 614 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 38/303 (12%), Positives = 95/303 (31%), Gaps = 43/303 (14%) Query: 51 HSMIDRSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNM-------SFRNELRDNGF 103 H ++ A + + G + + +N + +F ++ + Sbjct: 115 HETRAAKVMSTAYKAVCPTPGIMYDAVNAEEYGEFQENGFKSVSDTPLSTFSIDVDAASY 174 Query: 104 VND-----IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFI---PWYTNSRHIV 155 N ++ ++ + Y + P+K PW TN R + Sbjct: 175 SNMRRFINKGELPPVDAIRTEELVNYFSYDYPKPTGSD-PVKITMESGVCPWNTNHRLVR 233 Query: 156 MPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDV 215 + + + ++ + +++ ++DVS SM ++D+ S+ ++ ++ V Sbjct: 234 IGLKAK-EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKV 287 Query: 216 NNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY---AYNQIFD 272 V SG ++ ++ I L+ G + L A + Sbjct: 288 AIVTYSGSAG----VKLEATPGSDKQKIREAIDELTAGGSTAGGAGILLAYRIAKKNLIS 343 Query: 273 MQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 R I+ +DG+ N+ + + +K G + +G + Sbjct: 344 NGNNR-------------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGN 390 Query: 332 SHE 334 + Sbjct: 391 YKD 393 >gi|262189344|gb|ACY30364.1| von Willebrand factor A domain containing 5A [Salmo salar] Length = 791 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 81/267 (30%), Gaps = 55/267 (20%) Query: 107 IDDIVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNS 166 I + + LD + P E + + Y H I + + ++ Sbjct: 192 ISRVESNCPLDPLNYPNTEKTQATVSLAAGHQFDRDVELLLYYQDTHQPTAIVEAAQTSA 251 Query: 167 QTDA-------RLDMM------------------IVLDVSRSMES---FFDSSITKIDMA 198 + + L M V+D S SM + +I A Sbjct: 252 KPGSLMGDPMVMLSMYPEFPKDVMSLMTSHGEFLFVVDCSGSMSCPMHNGSGAQDRIGSA 311 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE--WGVSHLQ---RKIKYLSKF 253 ++ +L+ + + N ++ F ++ E FF + + +K+K + Sbjct: 312 RDTLLLLLKSLPMGCYFN------IIGFGSRYESFFPKSVEYSQKTMDEALQKVKAMRAD 365 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + LK+ Y+Q D + TDGE +TKE + Sbjct: 366 QGGSEILQPLKHIYSQ----------PCLPDQPRQLF--LFTDGEVRNTKEVLDLV---- 409 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACA 340 +A ++ GI S + A Sbjct: 410 KANAGSHRCFSFGIGEGASSALISGVA 436 >gi|167031206|ref|YP_001666437.1| hypothetical protein PputGB1_0186 [Pseudomonas putida GB-1] gi|166857694|gb|ABY96101.1| conserved repeat domain protein [Pseudomonas putida GB-1] Length = 6753 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 54/166 (32%), Gaps = 11/166 (6%) Query: 145 IPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINA 204 + + IV ++ V + ++ ++D S SM I A + + Sbjct: 6206 MIGSEGNDIIVADVSGLNVVQGKN---YNIAFMVDSSGSMSD------KSIADAKTQLAS 6256 Query: 205 MLEEVK--LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 + +K L D + V LV F ++ + + K++ + V G Sbjct: 6257 VFNSLKASLGSDTSGTVNIFLVDFDTQVNKNVAVNLADPDALSKLQAVLNSMVGGYYGGG 6316 Query: 263 LKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQS 308 Y T + + + F+TDG+ + + + Sbjct: 6317 TNYEDAFKTTSNFFNSTMATSNKGAENLTYFITDGKPTYYQSGEST 6362 >gi|156355185|ref|XP_001623553.1| predicted protein [Nematostella vectensis] gi|156210265|gb|EDO31453.1| predicted protein [Nematostella vectensis] Length = 191 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 51/132 (38%), Gaps = 19/132 (14%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D+M +LD S S+ + + A+++I M+ + + ++T + + + Sbjct: 70 FDIMYILDSSSSVSDW------EFQRAVQAIQTMVAK------SKRDNRHAVITIATRAK 117 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 F +++ ++ +S+ G TN+ L+ A+ + + Sbjct: 118 TFMNFSSRADAVRK-LRGISRSGGKTNTQDALELAFQMFTTS---KYGSTPGGLAR---V 170 Query: 292 VFMTDGENLSTK 303 + +TDG + Sbjct: 171 LVVTDGRSNMKN 182 >gi|119946441|ref|YP_944121.1| TPR repeat-containing protein [Psychromonas ingrahamii 37] gi|119865045|gb|ABM04522.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas ingrahamii 37] Length = 657 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 46/126 (36%), Gaps = 22/126 (17%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +I LD+S SM + D+ ++ A + +++ + GL+ ++ Sbjct: 97 VIALDLSYSMYA-TDAKPDRLSQARYKVIDLVKSWEEGEK-------GLIAYAGDAFTIS 148 Query: 235 LLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 L + + I LS + + L+ A + + + H I Sbjct: 149 PLTTDGNAIINHIPSLSPTIMPVTGSRADLALEQAITLLKNAGYQQGH-----------I 197 Query: 292 VFMTDG 297 VF+TDG Sbjct: 198 VFITDG 203 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 53/171 (30%), Gaps = 8/171 (4%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 +G + ++ I + ++ V I+++H +T L + +D + + AA + +G Sbjct: 8 RGIRKQRGLVVVIVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELR-----DNGFVNDIDDIVRSTSLDIVVVPQN 124 + + N ++ + F ND S P Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVFPDS---GYSSSPDG 124 Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMM 175 + Y I++ + F + T S N+ + + Sbjct: 125 DRYVRVVINQLDLESFFFARVMGVTKRLTASAVAGPSPGGNACNIVPMAVC 175 >gi|229493940|ref|ZP_04387712.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229319217|gb|EEN85066.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 533 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 72/203 (35%), Gaps = 19/203 (9%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 + ++ +DVS SM+ T++ + + L ++ PD V +GL FS Sbjct: 328 AKPIRTLVAIDVSASMDYPAAGGKTRMQLTTAAA---LAGNQVFPDS---VAAGLWAFSQ 381 Query: 229 KIE--EFFLLEWGVSHLQRKIKYLSK---FGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 + + F + + +++ N LK ++D E Sbjct: 382 GMNGTQDFQELVPIRRYDTIVDGVTQRKLMADQANRLTELKRGATGLYDTTLAAFRKVQE 441 Query: 284 DANYKKI--IVFMTDGENLSTKEDQQSLYYCNEAKKRG----AIVYAIGIRVIRSHEFLR 337 + + + ++ +TDG N + A+++ I+ IGI L Sbjct: 442 SYDPRAVNSVIILTDGANEDPDSISREQLLDVLAREQDPARPVIIVTIGITDDADAAALA 501 Query: 338 ACA--SPNSFYLVENPHSMYDAF 358 + + S Y+ +P + + F Sbjct: 502 DISRVTGGSTYIARDPSEISEVF 524 >gi|260788550|ref|XP_002589312.1| hypothetical protein BRAFLDRAFT_218073 [Branchiostoma floridae] gi|229274489|gb|EEN45323.1| hypothetical protein BRAFLDRAFT_218073 [Branchiostoma floridae] Length = 103 Score = 37.1 bits (84), Expect = 5.1, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 12/104 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT--FSNKI 230 DM+ + D S S+ +F I K + ++ + + + G + +SN+ Sbjct: 4 DMVFITDGSASIGTFNFEEIKKF--MRQLVDGLTVSLTSF-------RVGAMQFAYSNRE 54 Query: 231 EEFFLLEWGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDM 273 E + + + I + G T + + +A + +F Sbjct: 55 EFGLEDNYNNAGVDAAICAIPYMDGPGTYTGEAIMFAKDYMFGK 98 >gi|313233721|emb|CBY09891.1| unnamed protein product [Oikopleura dioica] Length = 1259 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 61/178 (34%), Gaps = 28/178 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLI----PDVNNVV 219 N D D++ ++D S I + K+ N +++ VK I Sbjct: 1064 PNVCRDQAFDLVFLIDGS-------------IRIGRKNYNMVIDWVKSIVAQLNISQERS 1110 Query: 220 QSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQH 279 + GLV FS K E F L + + ++ L + YN G+R Sbjct: 1111 RVGLVQFSTKTEIEFDLARF-NDKKNLLQGLEQSKSRFK-----NMGYN---AYAGLRNS 1161 Query: 280 CNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLR 337 + + ++ +T G ++ N A+ ++A+ + + + L Sbjct: 1162 IDLFHGENPRFLIMITAGRLYIHP--RRQKEVLNAAEHSKVEIFAVKLGKMSNEAELM 1217 >gi|306819945|ref|ZP_07453596.1| conserved hypothetical protein [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551981|gb|EFM39921.1| conserved hypothetical protein [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 1239 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 58/184 (31%), Gaps = 48/184 (26%) Query: 175 MIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFF 234 +IV+D S SM + I + L ++K + +V F+ KI + Sbjct: 80 VIVIDASGSMPYQTNH--------IYEVANKLVKIKEAYPYSK--NIAIVGFARKIVQST 129 Query: 235 LLEWGVSHLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 S L++ I + S T+ GL+ A + + + N +K I Sbjct: 130 SFTNDSSLLKQYISEVKLPDDSEVMTDLNLGLEQA-DILLSS-------MPDGKNIRKNI 181 Query: 292 VFMTDGENLSTK---------------------------EDQQSLYYCNEAKKRGAIVYA 324 + ++DG TK + KKRG + + Sbjct: 182 IVLSDGLPNLTKKIKGSKYNDPDSGPGLYQESEIFFTNEAANAAYDTAEVMKKRGYKITS 241 Query: 325 IGIR 328 I Sbjct: 242 IHFS 245 >gi|283780473|ref|YP_003371228.1| hypothetical protein Psta_2700 [Pirellula staleyi DSM 6068] gi|283438926|gb|ADB17368.1| protein of unknown function DUF58 [Pirellula staleyi DSM 6068] Length = 310 Score = 37.1 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 21/96 (21%), Positives = 39/96 (40%), Gaps = 7/96 (7%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L ++LD S+SM + +K D A + ++ ++ D GLVTF Sbjct: 85 EEETNLKCTLILDCSKSM--KYGEGPSKFDYAATATASLAYLMQQQQDA-----VGLVTF 137 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPG 262 +N+I + +HL+ + L S Sbjct: 138 NNRITKNLPASSHPNHLKLMLHELENTTPDETSDVA 173 >gi|213964787|ref|ZP_03392986.1| von Willebrand factor type A [Corynebacterium amycolatum SK46] gi|213952323|gb|EEB63706.1| von Willebrand factor type A [Corynebacterium amycolatum SK46] Length = 228 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 49/149 (32%), Gaps = 16/149 (10%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 L + V D S SM I + ++L E+++ P V+ ++ F N Sbjct: 11 GNILPIYFVADESGSMGP-------DIAELNSGLQSLLNEIRMAPFAAANVRFSVIGFDN 63 Query: 229 KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDM-QGMRQHCNTEDANY 287 + + R ++ + S A++ + E Sbjct: 64 EARLYLSNAD-----LRHVEQMPTL--SARFATYFSTAFDLLNAQIPDDVAQLKAEGYRV 116 Query: 288 KKIIVF-MTDGENLSTKEDQQSLYYCNEA 315 + VF +TDG +S Q++L A Sbjct: 117 NRPAVFLLTDGYPMSDDLWQEALSDLQNA 145 >gi|218693335|gb|ACL01171.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 17/92 (18%), Positives = 41/92 (44%), Gaps = 5/92 (5%) Query: 140 KFCTFIPWYTNSRHIVMPITSSVKVNSQT-DARLDMMIVLDVSRSMESFFDSSITKIDMA 198 K N I + + V ++ + LD++++LD S SM + ++ + A Sbjct: 189 KITGVNDLDKNKYKIELTVEGKTTVETKELNQPLDVVVLLDNSNSMNNERANNSQRALKA 248 Query: 199 IKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 + + +++++ N + LVT+++ I Sbjct: 249 GEEVEKLIDKITS----NKDNRVALVTYASTI 276 >gi|260824523|ref|XP_002607217.1| hypothetical protein BRAFLDRAFT_67985 [Branchiostoma floridae] gi|229292563|gb|EEN63227.1| hypothetical protein BRAFLDRAFT_67985 [Branchiostoma floridae] Length = 1764 Score = 37.1 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 23/53 (43%), Gaps = 5/53 (9%) Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEF 335 N ++II+ +TDG K Q K +GA V+ +G+ + E Sbjct: 1285 NQYNNREIIILITDG-----KATDQVREPAQYVKNKGAHVFTVGVAKYKMSEL 1332 >gi|281201200|gb|EFA75414.1| substrate adhesion molecule [Polysphondylium pallidum PN500] Length = 2386 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 72/220 (32%), Gaps = 28/220 (12%) Query: 110 IVRSTSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTD 169 I S SL P+ + + IS + + T + ++ + P+ S+K +T Sbjct: 972 IQCSISLTQRCDPKKPNFIVDQISNSDMIINPTTAQIYL--NKDVTTPL--SIKSTIETP 1027 Query: 170 ARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK 229 A LD+ ++DVS S F+ S+N V GL FS+ Sbjct: 1028 AALDLYFIVDVS-SERLFYYEFQEYYASFENSLNI------------ENVNFGLGYFSDT 1074 Query: 230 IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 + + + + + N P Y Y + Q + Sbjct: 1075 KNQPNTFQIQKLLSKSIVDDVEAL-TFKNGAPQPNYQYQALTLAASTVQWSANGAI---R 1130 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNE--AKKR---GAIVYA 324 +++ TD N D + AK+ G I Y+ Sbjct: 1131 MMILYTD--NNLAPTDNTLITTLQNELAKQNILLGVICYS 1168 >gi|82703475|ref|YP_413041.1| hypothetical protein Nmul_A2358 [Nitrosospira multiformis ATCC 25196] gi|82411540|gb|ABB75649.1| putative membrane protein [Nitrosospira multiformis ATCC 25196] Length = 437 Score = 37.1 bits (84), Expect = 5.4, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 54/162 (33%), Gaps = 8/162 (4%) Query: 13 YNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGN 72 KG + I+ A+ L ++ G+ +++ ++ K+ L + D + AA ++ N N N Sbjct: 22 RQEKGVVAIIVALSLVVLVGFAGLALDLGKLYVAKSELQNSADACALAAAREL-NGANTN 80 Query: 73 -------NRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNE 125 + + + + + +N + + V+ Q Sbjct: 81 QLVLAEAAGMTAGMRHDVLFQDEMIALGVDDSVTFSQTLNGVYQAKAAIGPAEAVLMQYA 140 Query: 126 GYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQ 167 ++ ++ +P T V + + SQ Sbjct: 141 RCTVQRTGIANWFIQVLNLLPGVTIGNQAVAATAVATRTPSQ 182 >gi|312139254|ref|YP_004006590.1| lipoprotein [Rhodococcus equi 103S] gi|311888593|emb|CBH47905.1| putative lipoprotein [Rhodococcus equi 103S] Length = 314 Score = 36.7 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 6/44 (13%), Positives = 20/44 (45%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMI 54 + +G + +L AI + ++ + +++ + +K L + Sbjct: 1 MSNDERGVVAVLVAILMVVLLGCAAISVDIGANYVVKRQLQNGA 44 >gi|111657697|ref|ZP_01408424.1| hypothetical protein SpneT_02001110 [Streptococcus pneumoniae TIGR4] Length = 866 Score = 36.7 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 58/134 (43%), Gaps = 10/134 (7%) Query: 149 TNSRHIVMPITSSVKVNSQTDA-RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 N I + ++ + + LD++I+LD S SM + + + + + A ++ ++++ Sbjct: 174 DNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNARRAERAGEATRSLID 233 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLS-----KFGVSTNSTPG 262 ++ + + LVT+++ I + + K L+ + ++ +T Sbjct: 234 KITSDSEN----RVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNT 289 Query: 263 LKYAYNQIFDMQGM 276 Y+Y ++ + + Sbjct: 290 KDYSYLKLTNDKND 303 >gi|237728595|ref|ZP_04559076.1| tellurite resistance [Citrobacter sp. 30_2] gi|226910073|gb|EEH95991.1| tellurite resistance [Citrobacter sp. 30_2] Length = 413 Score = 36.7 bits (83), Expect = 5.5, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 60/198 (30%), Gaps = 32/198 (16%) Query: 126 GYSISAISRYKIPLKFCTFIPWY----TNSRHIVMPITSSVKVNSQTDARLD--MMIVLD 179 Y + P K + R + + +SV + L+ + VLD Sbjct: 160 SYGVDVAQPAPQPAKPARISLEKKLETRSPRLVSLAKKASVSLTKNKLDTLEAAVAFVLD 219 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEW- 238 S SM F ++S+ + + D + + +K L+ Sbjct: 220 ASGSMSGQFSKGN------VQSVLDRIAVLAAQFDDDGEMDVWGFGEKHKKYPNVTLDNL 273 Query: 239 -------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 S + + L G + N P ++ + D + + Sbjct: 274 DTYIQSIRGSGKRSAWENLPGLGGTNNEPPVMEEIVDYFKDSK------------IPVYV 321 Query: 292 VFMTDGENLSTKEDQQSL 309 VF+TDG T+ + ++ Sbjct: 322 VFITDGGISKTRAIKDAI 339 >gi|119953122|ref|YP_945331.1| hypothetical membrane associated protein [Borrelia turicatae 91E135] gi|119861893|gb|AAX17661.1| hypothetical membrane associated protein [Borrelia turicatae 91E135] Length = 371 Score = 36.7 bits (83), Expect = 5.5, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 73/207 (35%), Gaps = 34/207 (16%) Query: 130 SAISRYKIPLKFCTFIPWYTNSRH---IVMPITSSVKVNSQTDARLDMMIVLDVSRSMES 186 S + +F F ++ +V I + ++ + A LD++ V+DV+ SM++ Sbjct: 185 STSENNDLKKEFSRFSDNVVVAKKGLDLVDKIKNILEESEDPLADLDLVFVIDVTDSMKN 244 Query: 187 FFDSSITKIDMAIKSINAMLEEVKLIPDVNNV--VQSGLVTFSNKIEEFFLLE---WGVS 241 I++ + + +M+E P ++ + G V + + +E+F Sbjct: 245 H-------IEILREHLFSMIE-----PQLSQYRSYRVGFVFYKDYLEDFLTRSFDFNNKE 292 Query: 242 HLQRKIKYLSKFGVS---TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE 298 +L ++ +S G G+ A Q R +A + Sbjct: 293 YLNNVLEDISVGGGGDYPEAVFEGINSAVTQFDWKADNRFIIVLGNAPPHEY-------- 344 Query: 299 NLSTKEDQQSLYYCNEAKKRGAIVYAI 325 + + AK++ +Y I Sbjct: 345 PRGPIVYEDVIR---SAKEKDITIYGI 368 >gi|146340337|ref|YP_001205385.1| hypothetical protein BRADO3364 [Bradyrhizobium sp. ORS278] gi|146193143|emb|CAL77155.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 470 Score = 36.7 bits (83), Expect = 5.5, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 58/208 (27%), Gaps = 33/208 (15%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 F + + ++ A+ + +LGM ++ + + L Sbjct: 8 RFARDRCANVAVIFALMMVPTIYLLGMALDYTQALRKQGQL-----------DAAADAAA 56 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 R + +KNT F + NG V + + + + Sbjct: 57 IAAVRPAMLSVTDTSVVKNTAAAVFATKAAMNGLT-----AVPTPDITVTDSGLQRTIQV 111 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFD 189 S ++R P S + S + + ++ ++LD S SM Sbjct: 112 SYVARS------INNFPSVLGSP--SWQVKGSATAQASSAPNMNFYLLLDDSPSMA---- 159 Query: 190 SSITKIDMAIKSINAMLEEVKLIPDVNN 217 I I+ ++ P + Sbjct: 160 -----IAATQTDIDNLIAATSKQPGGSK 182 >gi|302348849|ref|YP_003816487.1| von Willebrand factor, type A [Acidilobus saccharovorans 345-15] gi|302329261|gb|ADL19456.1| von Willebrand factor, type A [Acidilobus saccharovorans 345-15] Length = 456 Score = 36.7 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 24/126 (19%), Positives = 46/126 (36%), Gaps = 16/126 (12%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ-SGLVTFSNKIEE 232 + ++LD S SM +KID A A+ ++ + +V+ + +S+ + Sbjct: 290 IYVLLDKSGSM------VGSKIDWARAVAVALFKK-SMDEGRTFIVRFFDSIPYSSMVVR 342 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 V L + + G T+ T + A + I + I+ Sbjct: 343 SNSKPNDVLRLLSYLARVKA-GGGTDITRAVSSAVDDI-------DKMKLGGKDRPSDII 394 Query: 293 FMTDGE 298 +TDGE Sbjct: 395 LITDGE 400 >gi|260463531|ref|ZP_05811730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259030622|gb|EEW31899.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 552 Score = 36.7 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 50/131 (38%), Gaps = 26/131 (19%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK---- 229 ++I+LD S SM + D K+++A +S+ +L+ V + G + + ++ Sbjct: 27 VLIILDASGSMWAQIDGKP-KLEIARESLRTVLQSVPADDE------IGFMAYGHREKGS 79 Query: 230 ---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 I+ + G + KF T T +K A + + Sbjct: 80 CDDIQLIVPPQAGSASAITDAADSLKFLGKTPLTAAVKQAAEALKYTED----------- 128 Query: 287 YKKIIVFMTDG 297 K +V +TDG Sbjct: 129 -KATVVLITDG 138 >gi|154487036|ref|ZP_02028443.1| hypothetical protein BIFADO_00874 [Bifidobacterium adolescentis L2-32] gi|154084899|gb|EDN83944.1| hypothetical protein BIFADO_00874 [Bifidobacterium adolescentis L2-32] Length = 355 Score = 36.7 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 36/108 (33%), Gaps = 14/108 (12%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 S+V + + D+++ LDVS S ID + +++ E Sbjct: 85 RPSTVDEGEEKASSRDIVLCLDVSGS---ALPYDREVIDTYRQLVDSFKGE--------- 132 Query: 218 VVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 + GL F++ F L + +++ S S + Sbjct: 133 --RIGLSIFNSTSRTVFPLTDDYELVSKQLDKASSILAGVESQDDIDK 178 >gi|307947552|ref|ZP_07662881.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769249|gb|EFO28481.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 212 Score = 36.7 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 61/175 (34%), Gaps = 22/175 (12%) Query: 14 NYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAAT---QIMNEGN 70 N KG + I A+ +I + + I++S + + + D + A T + + +G Sbjct: 15 NIKGSIAIPFALLATLILAAISVGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGM 74 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRD-NGFVNDIDDIVRSTSLDIVVVPQNEGYSI 129 N+ + + + S + +N +D + ++ + ++E Y Sbjct: 75 SKNQARKRAETDARKFLTARTKSLDGTTEKFSIKINIVDREAKVVKANVNISGKHESY-- 132 Query: 130 SAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + +I S ++ + + + ++DVS SM Sbjct: 133 ---------------MTHAMGFDNIDYTADSESTISFG-QGKYEFIFLVDVSPSM 171 >gi|218512349|ref|ZP_03509189.1| hypothetical protein Retl8_00989 [Rhizobium etli 8C-3] Length = 222 Score = 36.7 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 42/108 (38%), Gaps = 7/108 (6%) Query: 194 KIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF 253 + D+A ++ +L+ + + ++ GL + + +E S+ ++ LS Sbjct: 75 RADVAGDAVREVLDMIDESDSNHERIKVGLYSLGDTTKEVLAPTLDTSNARK---RLSDD 131 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTED----ANYKKIIVFMTDG 297 S + Y Y + + D AN K+++ +TDG Sbjct: 132 SYGLTSATSMNYTYFDVALAALQKIVGTGGDGTSSANPLKLVLLLTDG 179 >gi|118384116|ref|XP_001025211.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89306978|gb|EAS04966.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 631 Score = 36.7 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 40/200 (20%), Positives = 76/200 (38%), Gaps = 31/200 (15%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 +D++ V+D S SM S K + KS+ +L+ + + L++F Sbjct: 138 SERVPMDLICVIDDSGSM------SGKKAQLVRKSLKYLLKIMNEND------RICLISF 185 Query: 227 SN--KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 + KI FL S L++ IK + G STN G++ G+ N Sbjct: 186 DSVEKILTPFLRNNLENKSELKKAIKNIVGRG-STNIEAGMEA---------GLWMIKNR 235 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCN--EAKKRGAIVYAIGIRVIRSHEFLRACA 340 ++ N + ++DG++ S + D + + + V G +R A Sbjct: 236 KEKNPITCMFLLSDGQDDSPQVDLRVQKLIQSYDIQDTFI-VNTYGYGADHDATQMRNIA 294 Query: 341 SP--NSFYLVENPHSMYDAF 358 +Y +E+ + + F Sbjct: 295 ETHKGGYYYIEDVKKVSEWF 314 >gi|145530760|ref|XP_001451152.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124418796|emb|CAK83755.1| unnamed protein product [Paramecium tetraurelia] Length = 919 Score = 36.7 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 58/187 (31%), Gaps = 30/187 (16%) Query: 176 IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL 235 +D S SM +I+ A +S+ L+ + N++ G S E Sbjct: 352 FFIDRSGSMTG------ARINKAKQSLLLFLKSL-PEDCNFNIISFGSTFRSLWSESKQY 404 Query: 236 LEWGVSHLQRKIKYLSKFGVSTNSTPGL-KYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + + + + T L + Y++ + + + Sbjct: 405 SQNTLEDAIKHVNNMEANMNGTEILKPLSQVVYSKYYGKSKSTTLN----------VFLL 454 Query: 295 TDGENLSTKEDQQSLYYCNEAKKRG---AIVYAIGIRVIRSHEFLRACASPNS--FYLVE 349 TDGE ++ + KK VY +GI S ++ A + F V Sbjct: 455 TDGE-------VEAQPIIDLVKKNNQAETRVYTLGIGEGCSQFLIKRLAEVGNGKFQFVS 507 Query: 350 NPHSMYD 356 + + Sbjct: 508 DNEDINA 514 >gi|291571720|dbj|BAI93992.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 449 Score = 36.7 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 57/182 (31%), Gaps = 33/182 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +LD++ ++D S SM +ID +S + +++ V+ GLV F Sbjct: 93 QQPKLDIVFLVDTSGSMSD-------EIDAVKRSCVSFADQIIKAGAN---VRLGLVGFD 142 Query: 228 NKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKY-------------AYNQIFD 272 + + V +L R + ++ A D Sbjct: 143 IGGHRGSSMGKAYKVYNLSRYTIGVWPLNTPERFKKNIQSLSLGLFGGGGCYLAQRDTVD 202 Query: 273 MQGMRQHCNTEDANYKKIIVFMTD--GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + + +I+V ++D G N + ++ K + +G+ Sbjct: 203 IFPHVINTFDNSSENTRILVIISDEMGGNEGLSD------IVSQLKSASITSHVLGVSGR 256 Query: 331 RS 332 Sbjct: 257 EG 258 >gi|282599302|ref|YP_003358614.1| hypothetical protein [Shigella phage phiSboM-AG3] gi|226973608|gb|ACO94361.1| conserved hypothetical phage protein [Shigella phage phiSboM-AG3] Length = 733 Score = 36.7 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 36/210 (17%), Positives = 67/210 (31%), Gaps = 17/210 (8%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K T A + +IV+D+S SM + + S+ + V ++ ++ G Sbjct: 8 KNAVATAAPSNHVIVVDISYSMYRALPEVRKHLKENLPSLVKPEDTVSVLY-FSSRGDFG 66 Query: 223 LVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNT 282 V ++ L S L + I + T L A + ++ N Sbjct: 67 TVFAGRQVSAASDL----SELNKLIDRFLQPSGCTGFVEPLGLAVDTARSLKKPGYVNN- 121 Query: 283 EDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-- 340 + FMTDG + + ++ L + V I + E L + Sbjct: 122 --------LAFMTDGYDNCWRSNEI-LETAEKLPSAFDNVTFIEYGWYCNRELLALMSER 172 Query: 341 SPNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 S + E ++ K V K + Sbjct: 173 SGATHVFAEGQTEYQTELENVLKSSVPKVV 202 >gi|260788548|ref|XP_002589311.1| hypothetical protein BRAFLDRAFT_218221 [Branchiostoma floridae] gi|229274488|gb|EEN45322.1| hypothetical protein BRAFLDRAFT_218221 [Branchiostoma floridae] Length = 103 Score = 36.7 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 41/104 (39%), Gaps = 12/104 (11%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 DM+ + D S S+ +F I K + ++ + + + G + F+ + E Sbjct: 4 DMVFITDGSASIGTFNFEEIKKF--MRQLVDGLTVSLTSF-------RVGAMQFAYENRE 54 Query: 233 FFLLE--WGVSHLQRKIKYLSKF-GVSTNSTPGLKYAYNQIFDM 273 F LE + + + I + G T + + +A + +F Sbjct: 55 EFGLEDNYNNAGVDAAICAIPYMDGPGTYTGEAIMFAKDYMFGK 98 >gi|305666869|ref|YP_003863156.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] gi|88709093|gb|EAR01327.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] Length = 365 Score = 36.7 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 56/176 (31%), Gaps = 4/176 (2%) Query: 196 DMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKF 253 + + + + V + ++ + + F + E L L L I ++ Sbjct: 144 EELKTASVSFVNNVMPLTTEDSY-KMAIYWFDGEDELHLLNDLTSSKQELVSAINGITDD 202 Query: 254 GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCN 313 + ST + + E+ +V TDG + +++ +Q+ Sbjct: 203 ISNDPSTDLYGAVIKSTEIAEELLADSIKEEIIGAASVVIFTDGTDQASRYTEQTALKKV 262 Query: 314 EAKKRGAIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSHIGKDIVTKR 369 E ++IG+ + L S + N + + F+ I I + Sbjct: 263 EDAAANISFFSIGLGAEIDTQVLTDIGKTFSVF-AGNAEELENTFNDISIKISERA 317 >gi|332827674|gb|EGK00413.1| hypothetical protein HMPREF9455_03261 [Dysgonomonas gadei ATCC BAA-286] Length = 402 Score = 36.7 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 65/172 (37%), Gaps = 19/172 (11%) Query: 163 KVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSG 222 K + T ++ ++ +LD + SM ++ KI SI + L + + +PD+ G Sbjct: 45 KADRTTRPKMQIVFLLDATGSMSGLIGAAKEKI----WSITSSLSQSEPVPDIE----VG 96 Query: 223 LVTFSNK----IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 ++ + ++ I L + +L ++ + G + Y + MQ Sbjct: 97 MLFYRDRGDDFITRIIPLGTDMDNLYEQLMAMDASGGGDGPESVNQALYEGVNKMQWD-- 154 Query: 279 HCNTEDANYKKIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 N + I + D + + D C+E K+G ++ I + Sbjct: 155 ----NLPNTYRAIFLVGDYPPHMDYRNDVHYPETCSEGIKKGIVINTILMGN 202 >gi|167924336|ref|ZP_02511427.1| hypothetical protein BpseBC_37618 [Burkholderia pseudomallei BCC215] Length = 396 Score = 36.7 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 56/141 (39%), Gaps = 8/141 (5%) Query: 19 MTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLK 78 M+IL A+ L ++ +G+ +++ ++ ++ L + D + AA + G N Sbjct: 1 MSILVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLT--GAINLSVPEA 58 Query: 79 GGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSISAISRYKIP 138 G + + F +++ N V D + P+N S S+I K Sbjct: 59 AGITAGHLNHALFEQFPVQMQTNSNVTFSDSLSNPFQ------PKNAIASPSSIKYVKCT 112 Query: 139 LKFCTFIPWYTNSRHIVMPIT 159 + W+ + ++V +T Sbjct: 113 TSRTGIVNWFIQTLNLVPGVT 133 >gi|319784615|ref|YP_004144091.1| hypothetical protein Mesci_4936 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170503|gb|ADV14041.1| Protein of unknown function DUF2134, membrane [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 593 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 12/59 (20%), Positives = 29/59 (49%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMN 67 R + K T++ A+ P+ + + I+ + I+ + +M+D + + AA+ + N Sbjct: 14 RLMLSDRKANFTVMAALSAPVALALAAVAIDEASIYTERREAQAMVDLAAITAASNMTN 72 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 41/299 (13%), Positives = 92/299 (30%), Gaps = 15/299 (5%) Query: 10 NFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEG 69 +G + ++ I + + V I+++H +T L + +D + + AA + +G Sbjct: 8 RGIRKQRGLVVVIVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDG 67 Query: 70 NGNNRKKLKGGDILCRIKNTWNMSFRNELRD--NGFVNDIDDIVRSTSLDIVVVPQNEGY 127 + + N ++ D N V +D P + Y Sbjct: 68 TEAQADTIARSTLTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVFPDSGYSSSPDGDRY 127 Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 I++ + F T S N+ + + D + Sbjct: 128 VRVVINQLDLESFFFARALGVTKRLTASAVAGPSPGGNACNIVPMAVCEGDDAGTNGYDS 187 Query: 188 FDSSITKIDMAIK-SINAMLEEVKLIPDVNNVVQSGLV-TFSNKIEEFFLLEWGVSHLQR 245 KI ++ + ++ + V++ L ++ I+ L+ G + + Sbjct: 188 GVVYALKISSQSDPTMGSGNFQLLDFGSGASTVRTSLAGGYAGCID----LDAGETVTTK 243 Query: 246 KIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKE 304 + G N+ G ++ G+ D K+ +TD + ST Sbjct: 244 PGNTVGPVGQGLNTRFG-------VYSGGGLSSSDYPPDLYVKEASPTLTDSDVGSTPA 295 >gi|300776749|ref|ZP_07086607.1| von Willebrand factor [Chryseobacterium gleum ATCC 35910] gi|300502259|gb|EFK33399.1| von Willebrand factor [Chryseobacterium gleum ATCC 35910] Length = 287 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 6/83 (7%) Query: 166 SQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVT 225 + + L MMI++D+S SM+ TK+ + + + + + NN + GL+ Sbjct: 71 MEEERELTMMILVDISASMD-----YGTKVQLKREYVAEIAASLGFSAAGNND-KVGLIL 124 Query: 226 FSNKIEEFFLLEWGVSHLQRKIK 248 F++K+ + + G H+ I Sbjct: 125 FADKVYKVIPPQKGRKHILSIIS 147 >gi|221112732|ref|XP_002163909.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 1516 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 12/69 (17%), Positives = 25/69 (36%), Gaps = 4/69 (5%) Query: 289 KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI-RSHEFLRACASPNSFYL 347 K + M+D + + + + + +G I+ AIGI + S + L Sbjct: 1077 KFGLTMSDKPHNDSN---KLASLALKMRNKGVIIVAIGIGPNVDQKKLTDIAGSSDKVLL 1133 Query: 348 VENPHSMYD 356 E + + Sbjct: 1134 AEKMQDLLE 1142 >gi|167562908|ref|ZP_02355824.1| hypothetical protein BoklE_10135 [Burkholderia oklahomensis EO147] Length = 602 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 42/257 (16%), Positives = 92/257 (35%), Gaps = 28/257 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R +G ++ AI++ + LG +++ ++FF + L + D + + AA ++ ++ Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADMAALAAAQKMDDQ 75 Query: 69 ----GNGNNRKKLKGGDILCRIKNTWNMSF-RNELRDN---GFVNDIDDIVRSTSLDIVV 120 N G + NT +S R + + N + N + + + Sbjct: 76 CTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTASTPLNAVQVTATQ 135 Query: 121 VPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 ++SA S K TN + T + + L+ ++ Sbjct: 136 SVPYFFLGPSRTVSATSTAK-----------ATNIDQFTVGTTLASLQGGLVNNVLNALL 184 Query: 177 VLDVSRSMESFFDSSITKIDM----AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++S S+ S+ + T+I + A ++ + E + Q L S Sbjct: 185 GANLSLSVLSYQALASTQIKLGDLMAAANVLTVNELLSTQVTAGQFAQLTLTALSRTQVV 244 Query: 233 FFLLEWGVSHLQRKIKY 249 L+ V+ LQ ++ Sbjct: 245 NANLQASVAALQTIVRA 261 >gi|156358481|ref|XP_001624547.1| predicted protein [Nematostella vectensis] gi|156211334|gb|EDO32447.1| predicted protein [Nematostella vectensis] Length = 545 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 59/173 (34%), Gaps = 25/173 (14%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D+ +LD S S+ + +A++ I + +++ + ++ G + F + + Sbjct: 266 DLGFMLDTSSSISGEAN-----FKLALEFIVTVFQQIGM----TATIRFGFMIFGSSSQV 316 Query: 233 FFLLEWGVS--HLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 F S ++ + +S G + + + +F ++ Sbjct: 317 VFDFTKFSSFAEVKTAVLGVSMVGGTCTAGAAITMCRTSMFASSSSSS---------ARV 367 Query: 291 IVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACASPN 343 +V M G K + K+ G + +G+ E L AS + Sbjct: 368 LVAMMAG-----KSTDSVTAPASAIKEIGIKMICVGMGGQYDKEQLTGMASSS 415 >gi|126666680|ref|ZP_01737657.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] gi|126628725|gb|EAZ99345.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] Length = 1318 Score = 36.7 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 50/178 (28%), Gaps = 24/178 (13%) Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKY 265 +++V + + FS+ + G++ L ++ SK+ T S Sbjct: 269 IDDVDAAHKTRLLNKLNTSQFSSSSDTPLRNA-GLTPLAGTLESASKYFGGTLSNGEAAS 327 Query: 266 AYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAK--------- 316 + + + +V +TDG K ++ + Sbjct: 328 GVSTLA--------PTPNFCGTQDYVVLVTDGLPSVNKSGSKTTDVTSAVDAVADEAAAL 379 Query: 317 -KRGAIVYAIGIRVIRS--HEFLRA---CASPNSFYLVENPHSMYDAFSHIGKDIVTK 368 +G Y +G + L + + ++ A I +I + Sbjct: 380 LSQGVRTYVVGFGLPTGVDKSLLNKIAIAGGTKATLFANDSTTLDAALGSIFLNIFNR 437 >gi|260581678|ref|ZP_05849475.1| tellurium resistance protein [Haemophilus influenzae NT127] gi|260095271|gb|EEW79162.1| tellurium resistance protein [Haemophilus influenzae NT127] Length = 212 Score = 36.7 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 52/136 (38%), Gaps = 20/136 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + ++ ++ P ++TF +K Sbjct: 3 RLPVYLLVDTSGSM------MGEAIESVRNGLQMLVSALRQDPYALETAYLSVITFDSKA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 ++ L + ++ + G+++ + A + + D + + A K Sbjct: 57 KQVMPL---TELMSFQLPTIEASGLTS-----MGEALSLLTDC--INREVQKGSAEVKGN 106 Query: 289 -KIIVF-MTDGENLST 302 K +VF ++DG Sbjct: 107 WKPVVFLLSDGVPTDD 122 >gi|16649073|gb|AAL24388.1| similar to magnesium-chelatase subunit ChlD precursor [Arabidopsis thaliana] gi|21389627|gb|AAM48012.1| magnesium-chelatase subunit ChlD-like protein [Arabidopsis thaliana] Length = 471 Score = 36.7 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 59/179 (32%), Gaps = 20/179 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + + +R + + T A ++ V+D S SM Sbjct: 221 KRLAVDATLRAAAPYQKLRREKDISGTRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 280 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 281 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDAAEVLLPPSRSIAMA 329 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + +++ L G S + + + + + +IV +TDG T Sbjct: 330 RNRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANIT 380 >gi|38347906|ref|NP_941155.1| tellurium resistance protein [Serratia marcescens] gi|190410248|ref|YP_001965749.1| terF [Klebsiella pneumoniae] gi|226807641|ref|YP_002791335.1| TerF [Enterobacter cloacae] gi|226809951|ref|YP_002791645.1| TerF [Enterobacter cloacae] gi|38259383|emb|CAE51608.1| tellurium resistance protein [Serratia marcescens] gi|146151041|gb|ABQ02807.1| terF [Klebsiella pneumoniae] gi|226425866|gb|ACO53959.1| TerF [Enterobacter cloacae] gi|226426177|gb|ACO54269.1| TerF [Enterobacter cloacae] Length = 413 Score = 36.7 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 126 GYSISAISRYKIPLKFCTFIPWY----TNSRHIVMPITSSVKVNSQTDARLD--MMIVLD 179 Y + P K + R + + +SV + L+ + VLD Sbjct: 160 SYGVDVAQPAPQPAKPARISLEKKLETRSPRLVSLAKKASVSLTKNKLDTLEAAVAFVLD 219 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM F ++S+ + + D + + +K L+ Sbjct: 220 ASGSMSGQFSKGN------VQSVLDRIAVLAAQFDDDGEMDVWGFGEKHKKYPNVTLDNL 273 Query: 240 VSHLQRKIKY--------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +++Q L G + N P ++ + D + + Sbjct: 274 DTYIQSIRGAGKRSAWENLPGLGGTNNEPPVMEEIVDYFKDSK------------IPVYV 321 Query: 292 VFMTDGENLSTKEDQQSL 309 VF+TDG T+ + ++ Sbjct: 322 VFITDGGISKTRAIKDAI 339 >gi|193782688|ref|NP_436104.2| hypothetical protein SMa1579 [Sinorhizobium meliloti 1021] gi|193073144|gb|AAK65516.2| conserved hypothetical protein [Sinorhizobium meliloti 1021] Length = 431 Score = 36.7 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 20/43 (46%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLH 51 R F G + ++ A+ P++ +G+ +E + + K L Sbjct: 22 RRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQ 64 >gi|307304370|ref|ZP_07584121.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|306902572|gb|EFN33166.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 431 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 20/43 (46%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLH 51 R F G + ++ A+ P++ +G+ +E + + K L Sbjct: 22 RRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQ 64 >gi|284054595|ref|ZP_06384805.1| von Willebrand factor type A [Arthrospira platensis str. Paraca] Length = 361 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 57/182 (31%), Gaps = 33/182 (18%) Query: 168 TDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFS 227 +LD++ ++D S SM +ID +S + +++ V+ GLV F Sbjct: 68 QQPKLDIVFLVDTSGSMSD-------EIDAVKRSCVSFADQIIKAGAN---VRLGLVGFD 117 Query: 228 NKIEEFFLL--EWGVSHLQRKIKYLSKFGVSTNSTPGLKY-------------AYNQIFD 272 + + V +L R + ++ A D Sbjct: 118 IGGHRGSSMGKAYKVYNLSRYTIGVWPLNTPERFKKNIQSLSLGLFGGGGCYLAQRDTVD 177 Query: 273 MQGMRQHCNTEDANYKKIIVFMTD--GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVI 330 + + + +I+V ++D G N + ++ K + +G+ Sbjct: 178 IFPHVINTFDNSSENTRILVIISDEMGGNEGLSD------IVSQLKSASITSHVLGVSGR 231 Query: 331 RS 332 Sbjct: 232 EG 233 >gi|241662158|ref|YP_002980518.1| hypothetical protein Rpic12D_0541 [Ralstonia pickettii 12D] gi|240864185|gb|ACS61846.1| putative transmembrane protein [Ralstonia pickettii 12D] Length = 344 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 12/116 (10%), Positives = 41/116 (35%), Gaps = 1/116 (0%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM-N 67 R +G + ++ + + +I + + I++ +++ ++ L + D + + A + Sbjct: 7 RTCARRQRGVVGVMAPVLMLVILSIGAIAIDLGYLYVIRNELQNAADAAALAGAAGLYPA 66 Query: 68 EGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 N + G ++ + + ++ G+ N + I Sbjct: 67 SPKPNWSNGVSQGTSAIKLNASSGATLKDGSVQAGYWNLTGSPAGLQAQTITPGAN 122 >gi|203287782|ref|YP_002222797.1| hypothetical protein BRE_330 [Borrelia recurrentis A1] gi|201085002|gb|ACH94576.1| uncharacterized conserved protein [Borrelia recurrentis A1] Length = 371 Score = 36.7 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 53/158 (33%), Gaps = 21/158 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ V+DV+ SM++ I++ + +L+ V+ + + G V + + + Sbjct: 229 DLDLVFVIDVTDSMKNH-------IEILRE---HLLDMVEPQLNQFKSYRIGFVFYKDYL 278 Query: 231 EEFF--LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E+F ++ I G + + N + Sbjct: 279 EDFLTRSFDFNSKKYLHNILESINVGGGGDYPEAVFEGVNAAVTQFDWKSDN-------- 330 Query: 289 KIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAI 325 + I+ + + + AK++ +Y I Sbjct: 331 RFIIVLGNAPPHEYPRGPIVYEDVIRAAKEKDITIYGI 368 >gi|117618784|ref|YP_857965.1| hypothetical protein AHA_3491 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117560191|gb|ABK37139.1| flagellin hook IN motif family [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 5047 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 24/135 (17%), Positives = 45/135 (33%), Gaps = 9/135 (6%) Query: 169 DARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKL--IPDVNNVVQSGLVTF 226 ++ ++D S S+ S + ++ + + +K D V LV F Sbjct: 4497 GQNYNIAFMVDSSGSIGS------SAMNTIKSQLAQVFSSLKASAGTDGAGTVNIFLVDF 4550 Query: 227 SNKIEEFFLLEWGVSHLQRKIKY-LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDA 285 + + S+ +++ L STNS G Y + Sbjct: 4551 DTLANKSVSVNLKDSNALSQLQAILDSMDGSTNSGGGTNYEDVFKTTANWFNSSTVQGNT 4610 Query: 286 NYKKIIVFMTDGENL 300 N K + F+TDG+ Sbjct: 4611 NAKNLTYFITDGQPT 4625 >gi|307318077|ref|ZP_07597513.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306896118|gb|EFN26868.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] Length = 431 Score = 36.7 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 9/43 (20%), Positives = 20/43 (46%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLH 51 R F G + ++ A+ P++ +G+ +E + + K L Sbjct: 22 RRFLTAEDGAVAVIAAVAFPVLVGAMGLGVETGYWYLEKRKLQ 64 >gi|330792171|ref|XP_003284163.1| hypothetical protein DICPUDRAFT_75136 [Dictyostelium purpureum] gi|325085860|gb|EGC39259.1| hypothetical protein DICPUDRAFT_75136 [Dictyostelium purpureum] Length = 794 Score = 36.7 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 43/167 (25%), Gaps = 21/167 (12%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 V S + + ++ D + SM T I I + + + G+ Sbjct: 268 VESSMSSDVQIVFCFDTTGSM-------ATIIQSVRTQIVQTVNRLMKDIPNIQIGIMGM 320 Query: 224 VTFSNKIE--EFFLLEWGVSHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQ 278 + + I + L + + I + L A + + Sbjct: 321 GDYCDGINVLKTLDLSSNIDEIVAFINSVPNTSGGDAPEAYEYALHKAKELSWSEHTSKA 380 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 D+N E T C++ G VY + Sbjct: 381 FVMIGDSNPH---------EPTFTNLSINWFKECDDLYDMGIKVYGV 418 >gi|291239167|ref|XP_002739504.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 603 Score = 36.7 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 68/193 (35%), Gaps = 27/193 (13%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 L +++VLDVS SM+ ++D+ +++ + V N G+V FSN Sbjct: 318 LRIVLVLDVSGSMDDN-----NRLDLLLQASTRYIG-----YTVPNATWIGIVEFSNDAT 367 Query: 232 EFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKK 289 L GV + I+ L S + + + + A Sbjct: 368 ILSELVQIVGVETRKELIEELPDDAKGATSIG------SGLLAGLSVLERGPGGAAGGII 421 Query: 290 IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRAC-ASPNSFY-- 346 ++ +DGE + + + + + +V + + + A+ + Y Sbjct: 422 FLI--SDGEENTPPYMKDVV---DLLVQEEVVVDTLALSDEADEGLAKLSDATGGTAYWY 476 Query: 347 -LVENPHSMYDAF 358 + +++DAF Sbjct: 477 SESDESTALHDAF 489 >gi|256599878|pdb|3IBS|A Chain A, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron gi|256599879|pdb|3IBS|B Chain B, Crystal Structure Of Conserved Hypothetical Protein Batb From Bacteroides Thetaiotaomicron Length = 218 Score = 36.7 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 59/172 (34%), Gaps = 26/172 (15%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 N+ ++++I LD+S S D ++++ A + I+ +++E+ + G Sbjct: 4 SNAVKRKGVEVIIALDISNS-XLAQDVQPSRLEKAKRLISRLVDELDND-------KVGX 55 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHC 280 + F+ + + ++ +S T + A +G+ Sbjct: 56 IVFAGDAFTQLPITSDYISAKXFLESISPSLISKQGTAIGEAINLATRSFTPQEGVG--- 112 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + I+ +TDGEN + + + V +G+ Sbjct: 113 --------RAIIVITDGENHEGGAVEAAKAAAEKG----IQVSVLGVGXPEG 152 >gi|303240110|ref|ZP_07326631.1| protein of unknown function DUF58 [Acetivibrio cellulolyticus CD2] gi|302592379|gb|EFL62106.1| protein of unknown function DUF58 [Acetivibrio cellulolyticus CD2] Length = 273 Score = 36.7 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 22/102 (21%), Positives = 44/102 (43%), Gaps = 13/102 (12%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + L+M +++D+SRS +S TK D+ + + D + G++ F Sbjct: 72 CEERELNMFLLIDMSRS-----NSFGTKKDLIAEVSATLAFSASKNND-----KVGVIFF 121 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYL---SKFGVSTNSTPGLKY 265 ++K+E+F G H+ I+ + T+ L+Y Sbjct: 122 TDKVEKFIPSVNGRKHILSIIENILNFQPELSGTDLAKALQY 163 >gi|296135239|ref|YP_003642481.1| von Willebrand factor type A [Thiomonas intermedia K12] gi|295795361|gb|ADG30151.1| von Willebrand factor type A [Thiomonas intermedia K12] Length = 336 Score = 36.7 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 28/213 (13%), Positives = 61/213 (28%), Gaps = 44/213 (20%) Query: 173 DMMIVLDVSRSMESF----FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN 228 D++++LD S +M +++ +A + + LV F + Sbjct: 100 DILVLLDTSLTMSLHDLTWAGKPASRLAVAQRVFADFAR-------ARQGDRFALVAFGS 152 Query: 229 KIEEFFLLEW--GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + + + + G T + Q + G++ Sbjct: 153 HAATLLPPTFDARAAGQMAGLLAVGQLGPDTALGDAIALGLRQAGALHGLKP-------- 204 Query: 287 YKKIIVFMTDG--ENLSTKEDQQSLYYCNEAKKRGAIVYAI---------------GIRV 329 +++ TDG N ++ A+ G +Y + Sbjct: 205 ---VLILYTDGGQSNTGAISPADAVAL---ARHLGVRIYTVEVGTTPDPGRPYTVPAYAG 258 Query: 330 IRSHEFLRACASPNSFYLVENPHSMYDAFSHIG 362 + L A A+ FY + + A IG Sbjct: 259 PQPDLRLIAEATGGRFYFAASAGAQQAAVRDIG 291 >gi|255544946|ref|XP_002513534.1| Magnesium-chelatase subunit chlD, chloroplast precursor, putative [Ricinus communis] gi|223547442|gb|EEF48937.1| Magnesium-chelatase subunit chlD, chloroplast precursor, putative [Ricinus communis] Length = 760 Score = 36.7 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + SR + + T A ++ V+D S SM Sbjct: 511 KRLAVDATLRAAAPYQKLRREKDTQKSRKVFVEKTDMRAKRMARKAGALVIFVVDASGSM 570 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 571 ------ALNRMQNAKGAAIKLL-----AESYTSRDQVAIIPFRGDSAEVLLPPSRSIAMA 619 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +++++ L G S + + + + + +IV +TDG Sbjct: 620 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRAN 668 >gi|310779934|ref|YP_003968266.1| hypothetical protein Ilyop_2155 [Ilyobacter polytropus DSM 2926] gi|309749257|gb|ADO83918.1| conserved hypothetical protein [Ilyobacter polytropus DSM 2926] Length = 195 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 53/134 (39%), Gaps = 22/134 (16%) Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 ++D S SMES AI N+ +E + + LV F++ Sbjct: 11 IIDRSGSMESIKSD-------AIGGFNSFIESQRKFDGEAALT---LVLFNDY------- 53 Query: 237 EWGVSHLQRKIKYLSKFGVSTNS---TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV- 292 + + + ++ I+ + +T T + A + D G R + E+ +K+IV Sbjct: 54 -YKIVYDRKDIQTVPFLNDNTYVPGGTTAMLDAIGRTIDSIGERLNKTPEEDRPEKVIVS 112 Query: 293 FMTDGENLSTKEDQ 306 +TDG +KE Sbjct: 113 ILTDGLENDSKEYT 126 >gi|203284244|ref|YP_002221984.1| hypothetical protein BDU_326 [Borrelia duttonii Ly] gi|201083687|gb|ACH93278.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 371 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 53/158 (33%), Gaps = 21/158 (13%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 LD++ V+DV+ SM++ I++ + +L+ V+ + + G V + + + Sbjct: 229 DLDLVFVIDVTDSMKNH-------IEILRE---HLLDMVEPQLNQFKSYRIGFVFYKDYL 278 Query: 231 EEFF--LLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E+F ++ I G + + N + Sbjct: 279 EDFLTRSFDFNSKEYLHNILESINVGGGGDYPEAVFEGVNAAVTQFDWKSDN-------- 330 Query: 289 KIIVFMTDGEN-LSTKEDQQSLYYCNEAKKRGAIVYAI 325 + I+ + + + AK++ +Y I Sbjct: 331 RFIIVLGNAPPHEYPRGPIVYEDVIRAAKEKDITIYGI 368 >gi|167393303|ref|XP_001740519.1| hypothetical protein [Entamoeba dispar SAW760] gi|165895350|gb|EDR23063.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 681 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 53/142 (37%), Gaps = 26/142 (18%) Query: 158 ITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNN 217 + SS +N + + ++++ + D S SM I+ + L ++ + + Sbjct: 198 VISSTFINKEKEGDINIIFICDRSGSMYGEG------INALRNMLQLFLRQL----PLKS 247 Query: 218 VVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 + ++F +K + F +E+ L+ +S+F + + + D Sbjct: 248 KFEI--ISFGSKYDFMFKEMVEYNEDTLKNASNRISEFEAN-YGGTSMDAPLKALIDNNT 304 Query: 276 MRQHCNTEDANYKKIIVFMTDG 297 + H I+ +TDG Sbjct: 305 EKCH-----------IILLTDG 315 >gi|125536452|gb|EAY82940.1| hypothetical protein OsI_38158 [Oryza sativa Indica Group] Length = 130 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 15/105 (14%), Positives = 40/105 (38%), Gaps = 12/105 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 + + + ++ LD++ V+DVS SM KI+ K++ ++ Sbjct: 37 ALTADSVTAVVEIKATSSTAVREGLDLVAVVDVSGSMRGH------KIESVKKALQFVIM 90 Query: 208 EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSK 252 ++ + + +VTF + + L + ++ + + Sbjct: 91 KLTPVD------RLSIVTFESSAKRLTKLRAMTQDFRGELDGIVE 129 >gi|169861923|ref|XP_001837595.1| hypothetical protein CC1G_08149 [Coprinopsis cinerea okayama7#130] gi|116501324|gb|EAU84219.1| hypothetical protein CC1G_08149 [Coprinopsis cinerea okayama7#130] Length = 327 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 41/126 (32%), Gaps = 7/126 (5%) Query: 173 DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 D + V+D S SM S S T+ D A +++ + D + + F+++ Sbjct: 105 DTIFVVDDSTSMTSLMPSGRTRWDEACEALADF-ASLAAQYDADG---IDIHFFNSEKSR 160 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 + + K ++ G T + M + + Sbjct: 161 TNIQSRST--IDSLFKGMAPNG-GTYIGKKISQLAEPYISALNMAVLTQEPTWPKPRNYI 217 Query: 293 FMTDGE 298 +TDG+ Sbjct: 218 IITDGK 223 >gi|298481574|ref|ZP_06999766.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] gi|298272438|gb|EFI14007.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] Length = 616 Score = 36.3 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 67/193 (34%), Gaps = 27/193 (13%) Query: 146 PWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAM 205 PW TN R + + + + ++ + +++ ++DVS SM ++D+ S+ + Sbjct: 224 PWNTNHRLVRIGLKAK-EIPTDNLPASNLVFLIDVSGSMW-----GANRLDLVKSSLKLL 277 Query: 206 LEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGL-- 263 ++ V V SG ++ ++ I L+ G + + Sbjct: 278 ENNLRDKDKVAIVTYSGSAG----VKLEATPGSDKQKIREAIDELTAGGSTAGGAGIMLA 333 Query: 264 -KYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGE-NLSTKEDQQSLYYCNEAKKRGAI 321 K A + R I+ +DG+ N+ + + +K G Sbjct: 334 YKIAKKNLISNGNNR-------------IILCSDGDFNVGVSSAEGLEQLIEKERKSGVF 380 Query: 322 VYAIGIRVIRSHE 334 + +G + + Sbjct: 381 LTVLGYGMGNYKD 393 >gi|307136080|gb|ADN33929.1| magnesium-chelatase subunit chID [Cucumis melo subsp. melo] Length = 533 Score = 36.3 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 58/177 (32%), Gaps = 20/177 (11%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + N+R + + + A ++ V+D S SM Sbjct: 282 KRLAVDATLRAAAPYQKLRKAKDVQNNRKVFVEKSDMRAKRMARKAGALVIFVVDASGSM 341 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNK-IEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F E ++ Sbjct: 342 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDCAEVLLPPSRSIAMA 390 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 +++++ L G S + + + + + +IV +TDG Sbjct: 391 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRVMIVAITDGRAN 439 >gi|282896018|ref|ZP_06304049.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281199128|gb|EFA73998.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 200 Score = 36.3 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 62/170 (36%), Gaps = 16/170 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM T+ ++A +S A+ + + V FS + + Sbjct: 5 DYTLIIDKSGSMSTPDQRGGRTRWEIAQESTIALARKCEQFDPDGITVYV----FSGRFK 60 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + E + +I + +TN L A N F + Q + I Sbjct: 61 RY---ENVTAAKVGQIFQENDPVGTTNLAGVLLDATNHYFQRKASGQAKPNGET-----I 112 Query: 292 VFMTDGENLSTKEDQQSLYYCNEA--KKRGAIVYAIGIRV-IRSHEFLRA 338 V +TDGE K + + + K + I + ++ +FL+A Sbjct: 113 VVITDGEPDDRKAVFEVVINASRKIDKDEELAISLIQVGTDPQATKFLKA 162 >gi|218441801|ref|YP_002380130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174529|gb|ACK73262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 201 Score = 36.3 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 53/138 (38%), Gaps = 13/138 (9%) Query: 173 DMMIVLDVSRSME-SFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 D +++D S SM ++ + +S A+ + + + V FS + + Sbjct: 6 DYTLIIDKSGSMSIKDQRGDKSRWAVMQESTLALAAKCEELDPDGITVYV----FSGRFK 61 Query: 232 EFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 + + S+ ++I ++ T+ L+ A FD + + + I Sbjct: 62 RY---DNVTSNKVQQIFQENEPNGRTDLAAVLQDALMNYFDRKAAGKTKPNGET-----I 113 Query: 292 VFMTDGENLSTKEDQQSL 309 + +TDGE K +++ Sbjct: 114 IVVTDGEPDDRKAVMKTI 131 >gi|313235811|emb|CBY19795.1| unnamed protein product [Oikopleura dioica] Length = 1120 Score = 36.3 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 72/221 (32%), Gaps = 36/221 (16%) Query: 156 MPITSSVKVNSQTDARL-----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + T + + + ++ D+MI++D S + D+ ++K Sbjct: 20 LEATCVCESDCSSQPQISQCHADVMILMDSSA------CHQGERWDLIRNWSFKAATQLK 73 Query: 211 LIPDVNNVV-----QSGLVTFSNKIEEFFLLEWGV---SHLQRKI-KYLSKFGVSTNSTP 261 ++++V+ + + +FS + + + + L R + + + Sbjct: 74 RQLNIDDVITMESSRVAVGSFSYRTNDVLSFDDNKLSHNDLLRIVSNEHNNLTGGSTDYS 133 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRG 319 G+ F + H + K +V +T+G++ Q + + G Sbjct: 134 GMVDNVISAFSARQSENHKDN-----DKYVVLVTNGKDAEGNIGNFDQVDELASRLRSSG 188 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 V + I + C SP ++N ++ Sbjct: 189 IRVIPVAIA--------KRC-SPEKEEDLQNICPRWEVLQR 220 >gi|313221851|emb|CBY38921.1| unnamed protein product [Oikopleura dioica] Length = 888 Score = 36.3 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 72/221 (32%), Gaps = 36/221 (16%) Query: 156 MPITSSVKVNSQTDARL-----DMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + T + + + ++ D+MI++D S + D+ ++K Sbjct: 20 LEATCVCESDCSSQPQISQCHADVMILMDSSA------CHQGERWDLIRNWSFKAATQLK 73 Query: 211 LIPDVNNVV-----QSGLVTFSNKIEEFFLLEWGV---SHLQRKI-KYLSKFGVSTNSTP 261 ++++V+ + + +FS + + + + L R + + + Sbjct: 74 RQLNIDDVITMESSRVAVGSFSYRTNDVLSFDDNKLSHNDLLRIVSNEHNNLTGGSTDYS 133 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKED--QQSLYYCNEAKKRG 319 G+ F + H + K +V +T+G++ Q + + G Sbjct: 134 GMVDNVISAFSARQSENHKDN-----DKYVVLVTNGKDAEGNIGNFDQVDELASRLRSSG 188 Query: 320 AIVYAIGIRVIRSHEFLRACASPNSFYLVENPHSMYDAFSH 360 V + I + C SP ++N ++ Sbjct: 189 IRVIPVAIA--------KRC-SPEKEEDLQNICPRWEVLQR 220 >gi|170743966|ref|YP_001772621.1| hypothetical protein M446_5903 [Methylobacterium sp. 4-46] gi|168198240|gb|ACA20187.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 417 Score = 36.3 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 25/55 (45%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQI 65 F + +GG +I+ + I + I+ ++++ K L + D + + A + Sbjct: 8 FAAHKQGGASIIGLFGMLIAVGFAAVAIDSGNLYYSKLKLQKIADAAALGAVMAL 62 >gi|118357562|ref|XP_001012030.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293797|gb|EAR91785.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 894 Score = 36.3 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 50/168 (29%), Gaps = 25/168 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM I A +++ L+ + I NVV G Sbjct: 314 FIFLLDRSGSM------IGQPIQKACEALILFLKSL-PIDSYFNVVSFGSSYEKLFQSSI 366 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVF 293 + + IK + T L+ + + +Q I Sbjct: 367 KYDTNSLEKAIKIIKEYTADLGGTEIYKPLQSVFKETKIDGYNKQ------------IFL 414 Query: 294 MTDGENLSTKEDQQSLYYCNEAKKRGA-IVYAIGIRVIRSHEFLRACA 340 +TDGE S K+ Q + K + +IG + A Sbjct: 415 LTDGEVESPKKVVQLIR-----KNNKFNRINSIGFGSGADKYLIEESA 457 >gi|162451664|ref|YP_001614031.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161162246|emb|CAN93551.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 392 Score = 36.3 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 50/185 (27%), Gaps = 21/185 (11%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM---ESFFDSSITKIDMAIKSINAMLEE-------VK 210 + + + + + + DVS SM + + K D + + + Sbjct: 74 ATQTAAASLQPVYLAFAFDVSGSMGKGDKDWHDKALKWDPVVAATKQFFSDPGSEGLTAS 133 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 L + + ++ L + I + S + G A+ Sbjct: 134 LSFFPADDDRCSSEIYATPDVPLTPLP--SAAFTEAIDAIEP-ASSDDWRGGTPTAWVMR 190 Query: 271 FDMQGMRQHCNTEDANYKKI-IVFMTDGENLSTKEDQQSL----YYCNEAKKRGAIVYAI 325 + N K IV +TDG E ++ A G Y I Sbjct: 191 GTSGFIEAQR---RQNPGKYAIVLVTDGYPQGCDEASDTIDAVVADAQAALAEGVPTYVI 247 Query: 326 GIRVI 330 G+ Sbjct: 248 GVENP 252 >gi|3913218|sp|O22437|CHLD_PEA RecName: Full=Magnesium-chelatase subunit chlD, chloroplastic; Short=Mg-chelatase subunit D; AltName: Full=Mg-protoporphyrin IX chelatase; Flags: Precursor gi|2318117|gb|AAB72194.1| Mg-chelatase subunit D [Pisum sativum] Length = 754 Score = 36.3 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 29/219 (13%), Positives = 66/219 (30%), Gaps = 34/219 (15%) Query: 125 EGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSM 184 + ++ A R P + N R + + T A ++ V+D S SM Sbjct: 503 KRLAVDATLRAAAPYQKLRREKDTENRRKVYVEKTDMRAKRMARKAGALVIFVVDASGSM 562 Query: 185 ESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHL 243 ++ ++ A + +L + Q ++ F + E ++ Sbjct: 563 ------ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDSAEVLLPPSRSIAMA 611 Query: 244 QRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG------ 297 +++++ L G S + + + + + +IV +TDG Sbjct: 612 RKRLERLPCGGGSPLAHG--------LTTAVRVGLNAEKSGDVGRIMIVAITDGRANISL 663 Query: 298 ----ENLSTKEDQQSLYYCNEAKKRGA----IVYAIGIR 328 + + E K +Y G+ Sbjct: 664 KRSNDPEAAAASDAPKPTSQELKDEIIEVAAKIYKTGMS 702 >gi|328952507|ref|YP_004369841.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] gi|328452831|gb|AEB08660.1| hypothetical protein Desac_0781 [Desulfobacca acetoxidans DSM 11109] Length = 465 Score = 36.3 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 11/85 (12%), Positives = 33/85 (38%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 + KG + ++ A+ L + VLG ++ + ++ K + ++ + + AT + Sbjct: 1 MLSDEKGSVAVVVAVCLFALVGVLGFSLDTAFLYVEKNHTQNAVEAAALAGATNLCAGDQ 60 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFR 95 + ++ + + F Sbjct: 61 EDIEAIVRNVAAENGLTGANHEVFV 85 >gi|304310158|ref|YP_003809756.1| von Willebrand factor, type A [gamma proteobacterium HdN1] gi|301795891|emb|CBL44092.1| von Willebrand factor, type A [gamma proteobacterium HdN1] Length = 683 Score = 36.3 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 65/173 (37%), Gaps = 25/173 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 ++ V+D + SM+ + I+ +++ + + ++ ++ + V+ GLV F + + Sbjct: 247 VVFVIDSTISMDPY-------IERTREAVKKIYQRIE-DENLLSQVKFGLVAFRSSTKAV 298 Query: 234 FLLEW---------GVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 LE+ V + + +S + S+ + + + + + Sbjct: 299 PGLEYVSKMYVDPATVKDGKDFLAKVSSLKQANVSSKAFQE--DSLAGVMQALNEVD-WS 355 Query: 285 ANYKKIIVFMTDG-----ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + IV +TD ++ + EA RG +Y++ ++ Sbjct: 356 QYGGRYIVLITDAGAVTPDSGLSSTGLNPEQVRLEAATRGVAIYSLHLKTPAG 408 >gi|7387491|gb|AAA86852.2| putative TerF [Serratia marcescens] Length = 408 Score = 36.3 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 126 GYSISAISRYKIPLKFCTFIPWY----TNSRHIVMPITSSVKVNSQTDARLD--MMIVLD 179 Y + P K + R + + +SV + L+ + VLD Sbjct: 160 SYGVDVAQPAPQPAKPARISLEKKLETRSPRLVSLAKKASVSLTKNKLDTLEAAVAFVLD 219 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM F ++S+ + + D + + +K L+ Sbjct: 220 ASGSMSGQFSKGN------VQSVLDRIAVLAAQFDDDGEMDVWGFGEKHKKYPNVTLDNL 273 Query: 240 VSHLQRKIKY--------LSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +++Q L G + N P ++ + D + + Sbjct: 274 DTYIQSIRGAGKRSAWENLPGLGGTNNEPPVMEEIVDYFKDSK------------IPVYV 321 Query: 292 VFMTDGENLSTKEDQQSL 309 VF+TDG T+ + ++ Sbjct: 322 VFITDGGISKTRAIKDAI 339 >gi|282897305|ref|ZP_06305307.1| hypothetical protein CRD_02229 [Raphidiopsis brookii D9] gi|281197957|gb|EFA72851.1| hypothetical protein CRD_02229 [Raphidiopsis brookii D9] Length = 468 Score = 36.3 bits (82), Expect = 7.5, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 60/175 (34%), Gaps = 21/175 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 T + N+ L + I LD S SM + T +KSI + E +P Sbjct: 36 TPAENANTSKSNNLSVSIYLDGSGSMLGYVRGGETNYVSTLKSIRNLFELSDKLPV--EY 93 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQ 278 + G + +++ +GV+T G + ++ + Sbjct: 94 YRIG-----SPMQKITS------------SEYYNYGVTTTFYDGSSNQFPEVSSP--IDA 134 Query: 279 HCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + + KK+ V +TD + S + + N YA+GI I+S Sbjct: 135 AIISPEKEQKKMTVIITDLQQNSGDVTKLNKLINNTYYNIENRDYAVGIWAIKSE 189 >gi|296121941|ref|YP_003629719.1| hypothetical protein Plim_1690 [Planctomyces limnophilus DSM 3776] gi|296014281|gb|ADG67520.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776] Length = 299 Score = 36.3 bits (82), Expect = 7.5, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 38/103 (36%), Gaps = 9/103 (8%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAI-KSINAMLEEVKLIPDVNNVVQSGLVT 225 + + + M+++D+S SM F + K + +I + + GL+T Sbjct: 79 EAETNISGMLLMDLSASMGYTFRQQLNKFEYSICLAAALAYLMIHQQDP------VGLMT 132 Query: 226 FSNKIEEFFLLEWGVSHLQRKIKYLSKF--GVSTNSTPGLKYA 266 F I S L + LS+ +T+ +K A Sbjct: 133 FDTNINASLPPRSRRSQLAELLGLLSRLSPTGTTDFGTSMKRA 175 >gi|52082206|ref|YP_080997.1| YwmD protein [Bacillus licheniformis ATCC 14580] gi|52005417|gb|AAU25359.1| YwmD [Bacillus licheniformis ATCC 14580] Length = 230 Score = 36.3 bits (82), Expect = 7.5, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 66/205 (32%), Gaps = 20/205 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ +VLD S SM + K D+A +S+ + + +V L F Sbjct: 32 NSKKDVNVAVVLDASGSMAQKVEGER-KFDIAKESVTDFADLLSSDANVM------LNVF 84 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +K + G + ++ S F N+ G+K + Sbjct: 85 GHKGNNKNSGKEVSCGTTETVYDLQPFS-FNSFENALSGIKPTGWSPIAKSLYDVKDDLA 143 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI----VYAIGIRVIRSHEFLRAC 339 D + K + +TDGE + + +K + + ++ ++ L+ Sbjct: 144 DKDGKNYVYIVTDGE---ETCGGDPVQAAKDLRKSNIKTIVNIVGLDVKTVKEKAKLKKV 200 Query: 340 AS--PNSFYLVENPHSMYDAFSHIG 362 A ++ + G Sbjct: 201 ADAGGGKLIEADSASDFKKVWKEEG 225 >gi|226312577|ref|YP_002772471.1| hypothetical protein BBR47_29900 [Brevibacillus brevis NBRC 100599] gi|226095525|dbj|BAH43967.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 262 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 58/184 (31%), Gaps = 31/184 (16%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + + +D+S SM + + K + +++ +L D N + L ++ Sbjct: 36 VALAIDISGSMHTLY-----KNGVVQRAVERLLALGVKFDD-NRSIDIFLFGKNDYEVGE 89 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYA----YNQIFDMQGMRQHCNTEDANYKK 289 +++ I + TN +K Y Q F+ + +K Sbjct: 90 LSERDFFNYVNDHIVKKYQLEGYTNYAGVMKRITKKYYPQAFEGSSTGGGFFGKLFGGQK 149 Query: 290 ------------------IIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + F+TDG+N E + + EA K G +GI + Sbjct: 150 FSIDQNAYKAHYQNSEPVYVYFLTDGDNGDKDETELVIR---EASKLGIFWQFVGIGNEK 206 Query: 332 SHEF 335 Sbjct: 207 FQFL 210 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 77/235 (32%), Gaps = 43/235 (18%) Query: 153 HIVMPITSSVKVNSQTDARLDMM--IVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVK 210 + + + + + D L+ ++LD S SM+ + + A K + Sbjct: 161 KVTVTVIADAVIPVVNDNSLETYTVLILDASGSMQ--GAPMTAQKEAAKKFCEDTIGT-- 216 Query: 211 LIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 + N+ + ++T + + + L I + +G STN + L+ A + Sbjct: 217 ---NKNSNHKFAVITLDSGSKTLTDFTNDIDELDSAIAKTTAYG-STNYSAALRNAAELL 272 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGEN---------LSTKEDQQSLYYCNEAKK---- 317 + A+ + IV +DG T D Y N A Sbjct: 273 ----------SKVSADAVRNIVLCSDGNPYGGEEKSTGKYTLSDYSDYEYANAAYDIAQE 322 Query: 318 --RGAIVYAIGI-------RVIRSHEFLRACAS-PNSFYLVENPHSMYDAFSHIG 362 + +Y +G + +L+ AS +++ V + F+ + Sbjct: 323 IKKDYEIYTLGFFHSLSGEDLDFGRTYLKDVASYDSNYAEVNKVDDLQKVFADVA 377 >gi|219111153|ref|XP_002177328.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217411863|gb|EEC51791.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 438 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 62/177 (35%), Gaps = 23/177 (12%) Query: 172 LDMMIVLDVSRSMES----------FFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQS 221 L+M + +D + S D + + A+ +I ++++ V+ Sbjct: 217 LEMCVAIDFTGSNGDPRKPGTLHYIHPDGQLNDYEKALTAIGSIVDRYDS-DHKYPVLGF 275 Query: 222 GLVTFSNKIEEFFLLE-----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 G + ++ F + G+ + + + + G+ T S P + Sbjct: 276 GA-KYGGVVQHCFQVGPSAELDGIGGILDAYRQVFRTGL-TMSGPTVFAEVLNFAAASAR 333 Query: 277 RQHCNTEDANYK--KIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIR 331 + +N + K+++ +TDG + +++ + N A V +GI Sbjct: 334 SRQDEARRSNRQAYKVLLILTDG---AVTSVEETKHALNAASDAPLSVVIVGIGNAD 387 >gi|166363797|ref|YP_001656070.1| hypothetical protein MAE_10560 [Microcystis aeruginosa NIES-843] gi|166086170|dbj|BAG00878.1| hypothetical protein MAE_10560 [Microcystis aeruginosa NIES-843] Length = 266 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 24/175 (13%), Positives = 61/175 (34%), Gaps = 24/175 (13%) Query: 167 QTDARLDMMIVLDVSRSMESFF-DSSITKIDMAI--KSINAMLEEVKLIPDVNNVVQSGL 223 + ++I++D S SM + +S+ + + I ++ ++ + + Sbjct: 7 SSAKPGFILIMIDQSASMSDKYANSNKAEFAALAVNRVIGEIITACSSGNEIKDRCFVAV 66 Query: 224 VTFSNKI-----EEFFLLEWG--VSHLQRKIK-----------YLSKFGVSTNSTPG-LK 264 V + I ++ L + L+RK+ + F T S + Sbjct: 67 VGYGASISILFLDKASELAKNPNTTTLKRKVSDGAGGLVEVDEVMRVFVKPTASNGTPMA 126 Query: 265 YAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRG 319 A+ Q + G+ + + ++ +++ +TDGE + + K Sbjct: 127 EAFKQAYT--GVEKFISNHPDSFPPVVINITDGEPNDSSAATTEAKKLAQLKTSD 179 >gi|170741047|ref|YP_001769702.1| hypothetical protein M446_2843 [Methylobacterium sp. 4-46] gi|168195321|gb|ACA17268.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 439 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 33/77 (42%) Query: 7 NIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIM 66 +++ F + G ++ + LP++ G +E + I +T L +D + + AA ++ Sbjct: 3 SLKTFLADRAGAAALILGLCLPMLVAGSGAALEYARIHKRRTELQKAVDTAALAAAGELT 62 Query: 67 NEGNGNNRKKLKGGDIL 83 G+ + L Sbjct: 63 IAGSDTYVRNLAVRTAY 79 >gi|13475416|ref|NP_106980.1| hypothetical protein mll6478 [Mesorhizobium loti MAFF303099] gi|14026168|dbj|BAB52766.1| mll6478 [Mesorhizobium loti MAFF303099] Length = 500 Score = 36.3 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 54/196 (27%), Gaps = 28/196 (14%) Query: 8 IRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMID------------ 55 IR F+++ +G IL ++ LP + + I++S + + L D Sbjct: 5 IRAFWHDQRGIALILVSVTLPALIGFSLLAIDMSRVNNLHNDLQKAADAFALAGAAELDG 64 Query: 56 -----------RSLVHAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFV 104 + + + N R L G + + Sbjct: 65 LPGSWARAERAMATLVTNQAAFSTVGTNGRFTLTSGQPGGTARCNSAGNISWCFLKTIPA 124 Query: 105 NDIDDIVRSTSLDIVVV---PQNEGYSISAISRYKIPLKFCTFIPW--YTNSRHIVMPIT 159 +D + + + + ++ + +F+ +NS ++ T Sbjct: 125 SDATPVSSANYANATQSIGEDETVFVEVTVAPTGFAAIFPASFLTGNAASNSFNVAAKAT 184 Query: 160 SSVKVNSQTDARLDMM 175 + + M Sbjct: 185 AGFTSGVCDYTPVFMC 200 >gi|72091964|ref|XP_795239.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] gi|115929603|ref|XP_001180674.1| PREDICTED: similar to parturition-related protein PRP3, partial [Strongylocentrotus purpuratus] Length = 729 Score = 36.3 bits (82), Expect = 7.8, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 60/178 (33%), Gaps = 23/178 (12%) Query: 159 TSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 TS + Q L +++VLD S SM+ K+ I ++ I N+ Sbjct: 158 TSPNFIVVQPAGSLRIVLVLDTSFSMD---GPRFNKM------IRGAKNFIQSIVPNNSY 208 Query: 219 VQSGLVTFSNKIEEFFLLEWGVSHL-QRKIKYLSKF--GVSTNSTPGLKYAYNQIFDMQG 275 V V F++ + S + ++ + L +T G+ A Sbjct: 209 VAI--VEFNSVSRVNSHMIELSSVISRKNLASLLPTVAENNTCIGCGILTAIEVAQYNGL 266 Query: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 + ++ ++DG+ ++ ++ + G V++I + + Sbjct: 267 DSRGV---------YLILLSDGQENIYEDITLVADTLDDIESSGVTVHSIALYGADTQ 315 >gi|328865688|gb|EGG14074.1| type A von Willebrand factor domain-containing protein [Dictyostelium fasciculatum] Length = 908 Score = 36.3 bits (82), Expect = 7.9, Method: Composition-based stats. Identities = 34/226 (15%), Positives = 76/226 (33%), Gaps = 25/226 (11%) Query: 124 NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLD-MMIVLDVSR 182 N+ + + + + + + + +++ + + +LD S Sbjct: 249 NKNLIVVVKPKTEEKPGYFLEYNKEDKTAALALNFYPRFQISPDEVDQKSEFVFLLDCSG 308 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSH 242 SM +ITK A +++ ++ + N+V G S E + + Sbjct: 309 SMS---GGAITK---AKRALEILMRSL-TENSKFNIVLFGSNFKSLFPESMPYDDANLEI 361 Query: 243 LQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 I+ + T P +K ++ +D Q RQ + +TDGE Sbjct: 362 ASTYIQKIQADLGGTELLPPIKSILSKPYDPQYPRQ------------VFILTDGE---V 406 Query: 303 KEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHEFLRACA-SPNSFYL 347 E Q + + + + ++ +GI E + + S +Y Sbjct: 407 SERDQLIDFVGK-EANTTRIFTLGIGSGVDRELVIGLSKSCKGYYE 451 >gi|156343014|ref|XP_001621008.1| hypothetical protein NEMVEDRAFT_v1g146334 [Nematostella vectensis] gi|156206562|gb|EDO28908.1| predicted protein [Nematostella vectensis] Length = 120 Score = 36.3 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 43/134 (32%), Gaps = 22/134 (16%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 +D+++ LD S S+ + + I + + + + + GL+ F + Sbjct: 1 PIDLVLALDSSESVGAENWMKM------INATIKFVNQFDV-----RYTRFGLIQFHSSP 49 Query: 231 EEFFLLEWGV--SHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 E +L L I + ST + + M+ + Sbjct: 50 EVPIVLSSFPDNKTLSETIAKVYYKAGSTRT---------DLAAMKVVEVFKAASHRKAS 100 Query: 289 KIIVFMTDGENLST 302 K+ VF+ G + Sbjct: 101 KVAVFILGGLSTDV 114 >gi|299532826|ref|ZP_07046213.1| membrane protein-like protein [Comamonas testosteroni S44] gi|298719050|gb|EFI60020.1| membrane protein-like protein [Comamonas testosteroni S44] Length = 628 Score = 36.3 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 15/61 (24%), Positives = 29/61 (47%), Gaps = 2/61 (3%) Query: 2 FSLNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHA 61 F+ L+ R G + L A++L I + L I++ ++F+ K L + D + + Sbjct: 4 FATGLS-RTRIRRQSGSVATLGALWLMIAVICLA-TIDIGNVFWQKRELQKIADLAALAG 61 Query: 62 A 62 A Sbjct: 62 A 62 >gi|149198097|ref|ZP_01875144.1| hypothetical protein LNTAR_15587 [Lentisphaera araneosa HTCC2155] gi|149138699|gb|EDM27105.1| hypothetical protein LNTAR_15587 [Lentisphaera araneosa HTCC2155] Length = 774 Score = 36.3 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 31/184 (16%), Positives = 69/184 (37%), Gaps = 28/184 (15%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSM-------ESFFDSSITKIDMAIKSINAMLEEVKLIP 213 +VK+++Q +R + +++DVS SM + + + K + L+ + L+ Sbjct: 82 TVKMDTQESSRTVLPVLIDVSSSMSITETKSSDDYLRQMAEACGLRKDEVSKLQRIDLVK 141 Query: 214 DVNNVVQSGLV-----TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYN 268 + N + L+ +S K+ F + + I+ G +T+ + ++Y Sbjct: 142 AILNNQELNLINKLQDKYSVKLYIFDQQTRQLKIDEELIELPQAKGEATSLSSAVRYVEQ 201 Query: 269 QIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 + + I+ TDG + Q E K R ++ +GI Sbjct: 202 SLRGLPLAD-------------IILFTDGVHNKGSSPQ---EMAAELKSRNVRLFPVGIG 245 Query: 329 VIRS 332 + + Sbjct: 246 MPEN 249 >gi|50085107|ref|YP_046617.1| tellurium resistance protein [Acinetobacter sp. ADP1] gi|49531083|emb|CAG68795.1| tellurium resistance protein [Acinetobacter sp. ADP1] Length = 212 Score = 36.3 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 13/67 (19%), Positives = 27/67 (40%), Gaps = 6/67 (8%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + ++LD S SM I+ + +L ++ P L++F ++ Sbjct: 3 RLPVYLLLDTSGSMHG------EPIEAVKNGVQILLSTLRQDPYALETAYLSLISFDSQA 56 Query: 231 EEFFLLE 237 ++ L Sbjct: 57 KQLVPLT 63 >gi|260171427|ref|ZP_05757839.1| hypothetical protein BacD2_06135 [Bacteroides sp. D2] Length = 608 Score = 36.3 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 34/278 (12%), Positives = 89/278 (32%), Gaps = 28/278 (10%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIV 119 IM + + + + +F ++ + N ++ ++ Sbjct: 128 IMYDAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDAASYSNMRRFINKGELPPVDAIRTE 187 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIV 177 + Y + P+K + + +V + ++ + +++ + Sbjct: 188 ELVNYFSYDYPKPTGSD-PVKITMEAGACPWNANHRLVRIGLKAKEIPTDNLPASNLVFL 246 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM ++D+ S+ ++ ++ V V +G ++ Sbjct: 247 IDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAGSAG----VKLEATPG 297 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 ++ I L+ G + T L AY +I + N I+ +DG Sbjct: 298 SDKQKIREAIDELTAGGSTAGGTGILL-AY-KIAKKNFISNGNNR--------IILCSDG 347 Query: 298 E-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N+ + + +K G + +G + + Sbjct: 348 DFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGNYKD 385 >gi|261855659|ref|YP_003262942.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836128|gb|ACX95895.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 625 Score = 36.3 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 35/233 (15%), Positives = 76/233 (32%), Gaps = 28/233 (12%) Query: 138 PLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDM 197 P + T +++ IT V +Q ++ +++DVS SM+ ++ Sbjct: 5 PTSPLIRMHASTICGLLILLITGLAFVPAQAATPPELHVLIDVSGSMKQTDPNN------ 58 Query: 198 AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE-------EFFLLEWGVSHLQRKIKYL 250 + +L L+P Q G+ F +K+ ++ V +KI+ Sbjct: 59 LRRPALRLLG--DLLPPSA---QIGIWFFGDKVSLMLKTAGADPKVKERVRQTAKKIRSN 113 Query: 251 SKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY 310 F + +N D + D + +K I +++ Sbjct: 114 EPFTDIPAALAAAAATWNDGTDRNILLLSDGMVDISPEKAI--------NVRAQEELLQK 165 Query: 311 YCNEAKKRGAIVYAIGIRVIRSHEFLRACA--SPNSFYLVENPHSMYDAFSHI 361 + + V+ I + + L A + F ++ ++ AF I Sbjct: 166 LVPQLRAEHIRVHTIALSKDADSKLLSQIAADTGGIFVEADSADALQRAFLKI 218 >gi|187918046|ref|YP_001883609.1| hypothetical protein BH0172 [Borrelia hermsii DAH] gi|119860894|gb|AAX16689.1| hypothetical protein BH0172 [Borrelia hermsii DAH] Length = 329 Score = 36.3 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 35/186 (18%), Positives = 61/186 (32%), Gaps = 23/186 (12%) Query: 119 VVVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVL 178 V + Y I I L FI + ++ I+ K A L + V Sbjct: 41 VNNAYIKNYYIKKI------LMIFFFILSLVSLIVSILDISWGQKATEDERANLRISFVF 94 Query: 179 DVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 D+SRSM +F + I +++ A I+ +L + L F K Sbjct: 95 DISRSMLTFDEGKSINRLESAKNFISLVLNNFENAE-------CSLTIFKGKSLLVLPFS 147 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 + L + + Y+ S+PG +D +Y ++ +TDG Sbjct: 148 KDKASLHKILNYIEP---GLISSPG------SFLGEGVFSAIRGIKDNSYYNFLIILTDG 198 Query: 298 ENLSTK 303 + Sbjct: 199 DEWGEN 204 >gi|224282727|ref|ZP_03646049.1| von Willebrand factor, type A [Bifidobacterium bifidum NCIMB 41171] gi|310287186|ref|YP_003938444.1| Conserved hypothetical membrane spanning protein with a von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|313139885|ref|ZP_07802078.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|309251122|gb|ADO52870.1| Conserved hypothetical membrane spanning protein with a von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|313132395|gb|EFR50012.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 342 Score = 36.3 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 59/190 (31%), Gaps = 25/190 (13%) Query: 162 VKVNSQTDARLDMMIVLDVSRSM-----ESFFDSSITKIDMAIKSINAMLEEVKLIPDVN 216 S+ D+M+ +DV+ SM E +I+++D A + + + Sbjct: 68 TTTTSRAINATDVMVAVDVTGSMAVKDAEYGSSGTISRLDAAKRIVKGITSTY--ADSSF 125 Query: 217 NVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGM 276 ++ G L + L+ ST++ L +Q+ Sbjct: 126 AALRFGASG-----TLDVPLTPDSIAIDGWADTLAVESTSTSAGSSLDTPLDQLM----- 175 Query: 277 RQHCNTEDANYKKIIVF--MTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + D + IIV +TDGE S + + IG+ + Sbjct: 176 LSLKSIRDQHPDDIIVLYVITDGEQTSDTARRSYSALRRYLDDS----FTIGVGSDAGGK 231 Query: 335 FLRACASPNS 344 AS + Sbjct: 232 I--PMASDGT 239 >gi|21219969|ref|NP_625748.1| hypothetical protein SCO1467 [Streptomyces coelicolor A3(2)] gi|7209227|emb|CAB76889.1| conserved hypothetical protein SCL6.24c [Streptomyces coelicolor A3(2)] Length = 491 Score = 36.3 bits (82), Expect = 8.3, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 49/152 (32%), Gaps = 20/152 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + L ++V+D S SM + K+ A ++ A + ++ Sbjct: 40 LSVRVDGTGGDGHPGGPVLAQVLVMDCSSSMTW----PVEKLHAAQRAAVAAIRKLPDGT 95 Query: 214 DVNNVVQ---SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VV+ V + + S +R ++ G T L + + Sbjct: 96 PFA-VVRGNEQAAVVYPDTPRMARASARTRSRAERAVRETVA-GGGTCIGAWLDLSRRLL 153 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + H ++ +TDG+N Sbjct: 154 TEQDAPIGH-----------VLLLTDGKNQHD 174 >gi|261368220|ref|ZP_05981103.1| putative LPXTG-motif protein cell wall anchor domain protein [Subdoligranulum variabile DSM 15176] gi|282569728|gb|EFB75263.1| putative LPXTG-motif protein cell wall anchor domain protein [Subdoligranulum variabile DSM 15176] Length = 1505 Score = 36.3 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 38/158 (24%), Positives = 63/158 (39%), Gaps = 31/158 (19%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 TN + +T S+ + +T A D++ VLD S S + ++ MLE Sbjct: 581 ATNVENFKSDVTLSIGADQKTVAA-DVVFVLDKSTSTD------------VKQAALNMLE 627 Query: 208 EVKLIPDVNNV-VQSGLVTFSN-------KIEEFFLLEWGVSHLQRKIKYLSKFGVSTNS 259 E++ N + V+ GLVTF+ +E L + V+ L+R S TN Sbjct: 628 ELQKHVSENGLQVKVGLVTFNQSADNSNFNMELSELNDATVTELKRIFN--SSLSSGTNI 685 Query: 260 TPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 G++ + D + K +V +TDG Sbjct: 686 EAGIRKGMKMLDDDTSVPASN--------KHLVLVTDG 715 >gi|281210380|gb|EFA84546.1| SAM domain-containing protein [Polysphondylium pallidum PN500] Length = 1594 Score = 36.3 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 38/268 (14%), Positives = 76/268 (28%), Gaps = 37/268 (13%) Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVP 122 TQI N + L + + +T N++ + + N + + L + P Sbjct: 100 TQIRNSLSSAVPAVLTHNIDVRKQGDTANIN-KYSVLHQTLQN-----LDTLELVLTEKP 153 Query: 123 QNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSR 182 S Y+ P + F + + I+S V++ M D + Sbjct: 154 SKSKGFSSGSDNYETPAEDTQFDQDALLNSFLETSISSDVEI----------MFCFDTTG 203 Query: 183 SMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF--SNKIEEFFLLEWGV 240 SM S I + + + + + GL + ++ L V Sbjct: 204 SMSSI-------IGKVREQVEQTVSRLMKDIPNIRIGIMGLGDYCDGTRVLTTLDLTQDV 256 Query: 241 SHLQRKIKYLSKFGVSTNSTP---GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 L + IK + L+ A + + + D Sbjct: 257 DKLVKFIKSVPSTSGGDAPEAYEWALRKAKDLTWSPHTSKAFVMIGDCEPH--------- 307 Query: 298 ENLSTKEDQQSLYYCNEAKKRGAIVYAI 325 E T + C++ G +Y + Sbjct: 308 EPSHTDLNINWFEECDDLFDMGVKIYGV 335 >gi|167570097|ref|ZP_02362971.1| hypothetical protein BoklC_09651 [Burkholderia oklahomensis C6786] Length = 602 Score = 36.3 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 42/257 (16%), Positives = 91/257 (35%), Gaps = 28/257 (10%) Query: 9 RNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNE 68 R +G ++ AI++ + LG +++ ++FF + L + D + + AA ++ ++ Sbjct: 17 RRTIARERGSFAVVAAIWMLVAIAALG-AVDIGNVFFARRDLQRVADMAALAAAQKMDDQ 75 Query: 69 ----GNGNNRKKLKGGDILCRIKNTWNMSF-RNELRDN---GFVNDIDDIVRSTSLDIVV 120 N G + NT +S R + + N + N + + + Sbjct: 76 CTQPAAAANANARSNGFDPAKSGNTLTLSCGRWDTQSNAGPSYFNTASTPLNAVQVTATQ 135 Query: 121 VPQ----NEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMI 176 ++SA S K TN + T + + L+ ++ Sbjct: 136 SVPYFFLGPSRTVSATSTAK-----------ATNIDQFTVGTTLASLQGGLVNNVLNALL 184 Query: 177 VLDVSRSMESFFDSSITKIDM----AIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++S S+ S+ + T+I + A ++ + E + Q L S Sbjct: 185 GANLSLSVLSYQALASTQIKLGDLMAAANVLTVNELLSTQVTAGQFAQLTLTALSRTQVV 244 Query: 233 FFLLEWGVSHLQRKIKY 249 L+ V+ LQ + Sbjct: 245 NANLQASVAALQTIVGA 261 >gi|83815528|ref|YP_445460.1| hypothetical protein SRU_1336 [Salinibacter ruber DSM 13855] gi|83756922|gb|ABC45035.1| Protein of unknown function family [Salinibacter ruber DSM 13855] Length = 310 Score = 36.3 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 51/140 (36%), Gaps = 15/140 (10%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + + L +VLD S SM DS ++K+D + + D +GL+ F Sbjct: 81 EEETNLRNYVVLDTSASMRYAGDSELSKLDYGSYLAAGLHNLMLKQQDA-----TGLIGF 135 Query: 227 SNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDAN 286 + + +L++ + L +F + + E Sbjct: 136 DESVHTLRPPKATQGYLRQLLVTLEQFR---------QRDATEQRTNAAAALEEVAERVE 186 Query: 287 YKKIIVFMTD-GENLSTKED 305 + +I+ +TD EN++ +D Sbjct: 187 RRSLIIVITDLFENIAAHDD 206 >gi|332830474|gb|EGK03102.1| hypothetical protein HMPREF9455_01352 [Dysgonomonas gadei ATCC BAA-286] Length = 393 Score = 36.3 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 17/130 (13%), Positives = 51/130 (39%), Gaps = 15/130 (11%) Query: 148 YTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLE 207 ++ + +P T ++ + I+LD S SM+ + + +++ + ++ + Sbjct: 31 QKHNSELSVPAT---DNKMNRGTKIQVAILLDTSSSMDGLIEQAKSRLWNIVNTLTTLRY 87 Query: 208 -------EVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNST 260 E+ L N+ +++ ++ I + L + + +K+ L+ +G Sbjct: 88 EGKAPRIEIALYEYGNDGIRT-----NDYIRQVTPLTTDLDLISQKLFALTTYGGDEYCG 142 Query: 261 PGLKYAYNQI 270 + A + Sbjct: 143 AVINKAVEVL 152 >gi|301623649|ref|XP_002941126.1| PREDICTED: von Willebrand factor A domain-containing protein 5A-like [Xenopus (Silurana) tropicalis] Length = 720 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 33/247 (13%), Positives = 81/247 (32%), Gaps = 48/247 (19%) Query: 60 HAATQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIV 119 + ++I + N + + ++ F ++ + D++ + + Sbjct: 189 YGISKIQSNCNISPVEYTDSDKTAAKVSLAQGHKFERDVELLAYYMDVNKPSVAVEAGL- 247 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLD 179 P + ++ A + P T + + + ++D Sbjct: 248 --PDAKADTLMADPAVMLQFYP-------------SFPTTQEQSNSGE------FLFLMD 286 Query: 180 VSRSMESFFDSS---ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFL- 235 S SME +S +I+ A +++ +L+ + L N ++ F ++ E FF Sbjct: 287 RSGSMECGMNSEPNAPQRIESAKETLVLLLKSLPLGCYFN------IIGFGSRFEFFFPE 340 Query: 236 -LEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII 291 +E+ + +K +++ T LK Y ++ + + Sbjct: 341 TIEYTQESMDEAVKKVTQMEANMGGTEILQPLKKIYKTAV------------RPDHPRQL 388 Query: 292 VFMTDGE 298 TDGE Sbjct: 389 FVFTDGE 395 >gi|203454770|ref|YP_002225086.1| gp207 [Mycobacterium phage Myrna] gi|197311818|gb|ACH62176.1| gp207 [Mycobacterium phage Myrna] Length = 250 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 41/132 (31%), Gaps = 19/132 (14%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEF 233 + +LD S SM S A + E + + L F ++ E Sbjct: 9 IAALLDRSGSMRSR----------ARSTEEGFNELIHGQKNEPGQAYVTLAQFDDQYEVV 58 Query: 234 FLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKII-V 292 + + I + + S L ++ G ED +I V Sbjct: 59 --------YANKPIAEVPPLSLVARSMTALNDGIGKLVTDVGAELAALKEDERPGLVICV 110 Query: 293 FMTDGENLSTKE 304 MTDG +++E Sbjct: 111 IMTDGMENASRE 122 >gi|160887255|ref|ZP_02068258.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] gi|156107666|gb|EDO09411.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] Length = 616 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 34/278 (12%), Positives = 89/278 (32%), Gaps = 28/278 (10%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIV 119 IM + + + + +F ++ + N ++ ++ Sbjct: 136 IMYDAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDAASYSNMRRFINKGELPPVDAIRTE 195 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIV 177 + Y + P+K + + +V + ++ + +++ + Sbjct: 196 ELVNYFSYDYPKPTGSD-PVKITMEAGACPWNANHRLVRIGLKAKEIPTDNLPASNLVFL 254 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM ++D+ S+ ++ ++ V V +G ++ Sbjct: 255 IDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAGSAG----VKLEATPG 305 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 ++ I L+ G + T L AY +I + N I+ +DG Sbjct: 306 SDKQKIREAIDELTAGGSTAGGTGILL-AY-KIAKKNFISNGNNR--------IILCSDG 355 Query: 298 E-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N+ + + +K G + +G + + Sbjct: 356 DFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGNYKD 393 >gi|209551753|ref|YP_002283670.1| hypothetical protein Rleg2_4182 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537509|gb|ACI57444.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 193 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 50/168 (29%), Gaps = 12/168 (7%) Query: 4 LNLNIRNFFYNYKGGMTILTAIFLPIIFLVLGMIIEVS-HIFFMKTVLHSMIDRSLVHAA 62 L L +R + KG I AI P++ ++ E++ + K + + V Sbjct: 10 LVLTVRRLAQDRKGAGAIEFAILFPVLIMLYIGAFEITIGLSVSKRATRAAGTVADVVTQ 69 Query: 63 TQIMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNG---------FVNDIDDIVRS 113 Q + + + + + + D G + ++ Sbjct: 70 QQSVTKSALAQMPSVANSIFVPYNTTSLTLKITGITIDAGANAKVLWSWAQDGTVPYAKN 129 Query: 114 TSLDIVVVPQNEGYSISAISRYKIPLKFCTFIPWYT--NSRHIVMPIT 159 T++ V S + IP F P + R I + + Sbjct: 130 TAVSNVPSDMKTANSFLVRTELSIPYTMFLFAPNFMPDGMRTITISRS 177 >gi|153833064|ref|ZP_01985731.1| calx-beta domain family [Vibrio harveyi HY01] gi|148870632|gb|EDL69539.1| calx-beta domain family [Vibrio harveyi HY01] Length = 3802 Score = 36.3 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 60/170 (35%), Gaps = 13/170 (7%) Query: 143 TFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSI 202 I +++ IV + + Q ++ VLD S SM S+ S+ +I + Sbjct: 3286 DLIKGSSDNDLIVGDVQGIEIIAGQD---YNIAFVLDTSGSMGSWVGSAKDEILDVFDEL 3342 Query: 203 NAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFG-VSTNSTP 261 A + + VN ++ + IE + ++ + +G +TN Sbjct: 3343 LASVNQGVAPGTVNIMLSEFATGATKVIEVDLSSGTARADFVEELNTIDDYGSGNTNYEA 3402 Query: 262 GLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYY 311 GL+ A ++ + I F+TDGE E + + Sbjct: 3403 GLQSAVE---------WFGGQPNSEGENITYFITDGEPNVISELKNTQEV 3443 >gi|256788931|ref|ZP_05527362.1| hypothetical protein SlivT_30978 [Streptomyces lividans TK24] Length = 477 Score = 36.3 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 49/152 (32%), Gaps = 20/152 (13%) Query: 154 IVMPITSSVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIP 213 + + + + L ++V+D S SM + K+ A ++ A + ++ Sbjct: 26 LSVRVDGTGGDGHPGGPVLAQVLVMDCSSSMTW----PVEKLHAAQRAAVAAIRKLPDGT 81 Query: 214 DVNNVVQ---SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI 270 VV+ V + + S +R ++ G T L + + Sbjct: 82 PFA-VVRGNEQAAVVYPDTPRMARASARTRSRAERAVRETVA-GGGTCIGAWLDLSRRLL 139 Query: 271 FDMQGMRQHCNTEDANYKKIIVFMTDGENLST 302 + H ++ +TDG+N Sbjct: 140 TEQDAPIGH-----------VLLLTDGKNQHD 160 >gi|254226123|ref|ZP_04919720.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] gi|125621358|gb|EAZ49695.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] Length = 1637 Score = 36.3 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 58/140 (41%), Gaps = 11/140 (7%) Query: 164 VNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGL 223 + ++ ++LD+S SM++ + +++D+ +S +LE+ + + + L Sbjct: 1018 AEPELRQGANVQLILDISGSMDTSAGNGKSRLDVMKESAKQLLEQYQAM----GQTKVQL 1073 Query: 224 VTFSNKIEEFFLLEWGVSHLQRKIKYLS--KFGVSTNSTPGLKYAYNQIFDMQGMRQHCN 281 V F ++ E +++ I Y++ +T+ ++ A + + Sbjct: 1074 VVFHSEAEVKSQGNSVWMTVEQAINYINGLSTKGTTDYDHAIELAEDNWSGLNTGGLLTG 1133 Query: 282 TEDANYKKIIVFMTDGENLS 301 + +Y F++DGE Sbjct: 1134 ATNVSY-----FLSDGEPYD 1148 >gi|52787597|ref|YP_093426.1| YwmD [Bacillus licheniformis ATCC 14580] gi|319648079|ref|ZP_08002296.1| YwmD protein [Bacillus sp. BT1B_CT2] gi|52350099|gb|AAU42733.1| YwmD [Bacillus licheniformis ATCC 14580] gi|317389714|gb|EFV70524.1| YwmD protein [Bacillus sp. BT1B_CT2] Length = 223 Score = 36.3 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 66/205 (32%), Gaps = 20/205 (9%) Query: 167 QTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF 226 + +++ +VLD S SM + K D+A +S+ + + +V L F Sbjct: 25 NSKKDVNVAVVLDASGSMAQKVEGER-KFDIAKESVTDFADLLSSDANVM------LNVF 77 Query: 227 SNK---IEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTE 283 +K + G + ++ S F N+ G+K + Sbjct: 78 GHKGNNKNSGKEVSCGTTETVYDLQPFS-FNSFENALSGIKPTGWSPIAKSLYDVKDDLA 136 Query: 284 DANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAI----VYAIGIRVIRSHEFLRAC 339 D + K + +TDGE + + +K + + ++ ++ L+ Sbjct: 137 DKDGKNYVYIVTDGE---ETCGGDPVQAAKDLRKSNIKTIVNIVGLDVKTVKEKAKLKKV 193 Query: 340 AS--PNSFYLVENPHSMYDAFSHIG 362 A ++ + G Sbjct: 194 ADAGGGKLIEADSASDFKKVWKEEG 218 >gi|167624203|ref|YP_001674497.1| hypothetical protein Shal_2279 [Shewanella halifaxensis HAW-EB4] gi|167354225|gb|ABZ76838.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4] Length = 446 Score = 36.3 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 12/108 (11%), Positives = 39/108 (36%) Query: 16 KGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRK 75 G + ++ I + + + ++ H+ K L + +D + ++AA ++ + Sbjct: 19 SGAILVMFTIGIFAVIAFAALALDGGHMLLSKGRLQNAVDAAALNAAKELQEGATLLEAR 78 Query: 76 KLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQ 123 + +L + T N + + + + ++ +P Sbjct: 79 EAAYTILLQNLSFTENGELNTSVSLSSPDFNNTQVTPRLQVEFSELPD 126 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 36.3 bits (82), Expect = 9.0, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 58/166 (34%), Gaps = 21/166 (12%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + +++ +VLD S SM T I++ ++ A+ ++KL V+ Sbjct: 212 TPEERPPMNVTLVLDTSGSMAG------TPIELLRETSRAIAAQLKLGDTVSICEWDTSN 265 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTED 284 ++ L KI + G TN GL+ Y + Q D Sbjct: 266 DWTLAGYAVTGP--NDELLLEKINDVV-HGGGTNLYGGLESGY-------ELAQMVYDPD 315 Query: 285 ANYKKIIVFMTD-GENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRV 329 A + +V ++D G N + G +Y +G+ V Sbjct: 316 AINR--LVLISDGGANAGITDLDLIAENAAYGGSDG--IYLVGVGV 357 >gi|290986713|ref|XP_002676068.1| predicted protein [Naegleria gruberi] gi|284089668|gb|EFC43324.1| predicted protein [Naegleria gruberi] Length = 413 Score = 36.3 bits (82), Expect = 9.1, Method: Composition-based stats. Identities = 46/224 (20%), Positives = 85/224 (37%), Gaps = 35/224 (15%) Query: 161 SVKVNSQTDARLDMMIVLD--VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNV 218 S K L+++IVLD S S S +K+ +A K I + I ++ + Sbjct: 35 SCKDGKNERKDLNLVIVLDISGSMSSSMSGKGSESKMKVANKVICEI------IDNLKDF 88 Query: 219 VQSGLVTFSNKIEEFFLLE----WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQ 274 + G+V F +K E F L L+ ++ +++ G STN G+K +F Sbjct: 89 ERLGIVLFDDKAETFLPLTIVQDLEKKSLKERVMKITEKG-STNFEAGMKRGI-DLFSTM 146 Query: 275 GMRQHCNTEDANYKKIIVFMTDG-ENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSH 333 N+ I+++TD N+ + L + G + IGI + + Sbjct: 147 DSSDLSNSNR------IIYLTDACPNVGGTDSLDVLTKDANSGPYGILSTFIGIGLDFNS 200 Query: 334 EF------LRACASPNSFYLVENPHSMYDAFSHIGKDIVTKRIW 371 E +R C +++ V++ F I + + Sbjct: 201 EIVEELTKVRGC----NYFSVKSSEE----FKKILNEDFNYIVT 236 >gi|315919742|ref|ZP_07915982.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693617|gb|EFS30452.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 616 Score = 35.9 bits (81), Expect = 9.3, Method: Composition-based stats. Identities = 34/278 (12%), Positives = 89/278 (32%), Gaps = 28/278 (10%) Query: 65 IMNEGNGNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVND-----IDDIVRSTSLDIV 119 IM + + + + +F ++ + N ++ ++ Sbjct: 136 IMYDAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDAASYSNMRRFINKGELPPVDAIRTE 195 Query: 120 VVPQNEGYSISAISRYKIPLKFCTFIPWYTNSRH--IVMPITSSVKVNSQTDARLDMMIV 177 + Y + P+K + + +V + ++ + +++ + Sbjct: 196 ELVNYFSYDYPKPTGSD-PVKITMEAGACPWNANHRLVRIGLKAKEIPTDNLPASNLVFL 254 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +DVS SM ++D+ S+ ++ ++ V V +G ++ Sbjct: 255 IDVSGSMW-----GANRLDLVKSSLKLLVNNLRDKDKVAIVTYAGSAG----VKLEATPG 305 Query: 238 WGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDG 297 ++ I L+ G + T L AY +I + N I+ +DG Sbjct: 306 SDKQKIREAIDELTAGGSTAGGTGILL-AY-KIAKKNFISNGNNR--------IILCSDG 355 Query: 298 E-NLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRSHE 334 + N+ + + +K G + +G + + Sbjct: 356 DFNVGVSSAEGLEQLIEKERKSGVFLTVLGYGMGNYKD 393 >gi|242037661|ref|XP_002466225.1| hypothetical protein SORBIDRAFT_01g003860 [Sorghum bicolor] gi|241920079|gb|EER93223.1| hypothetical protein SORBIDRAFT_01g003860 [Sorghum bicolor] Length = 755 Score = 35.9 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 128 SISAISRYKIPLKFCTFIPWYTNSRHIVMPITSSVKVNSQTDARLDMMIVLDVSRSMESF 187 ++ A R P + +R + + T A ++ V+D S SM Sbjct: 507 AVDATLRAAAPYQKLRREKERDKARKVFVEKTDMRAKRMARKAGALVIFVVDASGSM--- 563 Query: 188 FDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTF-SNKIEEFFLLEWGVSHLQRK 246 ++ ++ A + +L + Q ++ F + E ++ +++ Sbjct: 564 ---ALNRMQNAKGAALKLL-----AESYTSRDQVSIIPFRGDYAEVLLPPSRSIAMARKR 615 Query: 247 IKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFMTDGENL 300 ++ L G S + GL A + + + +IV +TDG Sbjct: 616 LEKLPCGGGSPLAH-GLSTAVRVGLNAE-------KSGDVGRIMIVAITDGRAN 661 >gi|322436386|ref|YP_004218598.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164113|gb|ADW69818.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 384 Score = 35.9 bits (81), Expect = 9.5, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 60/168 (35%), Gaps = 17/168 (10%) Query: 165 NSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLV 224 + D L + +++D S+S + D + L+++ V + Sbjct: 89 DRDADLPLTLGLLVDTSQSQRAALDDE-------RAASTTFLDQMLKGKQDKAFV----I 137 Query: 225 TFSNKIEEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLK----YAYNQIFDMQGMRQHC 280 F+++ E L LQ +K L ++ + ++D + Sbjct: 138 QFAHETELLQDLTDSRPKLQAALKDLDSSPQRSSDSSDSDGHRDRGGTVLYDAAFLAADE 197 Query: 281 NTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIR 328 +K I+ +TDG++ +KE + A++ VYAI + Sbjct: 198 ILSKPKGRKAIILLTDGDDRGSKE-TITKAI-EAAQRADTTVYAIYFK 243 >gi|224024928|ref|ZP_03643294.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] gi|224018164|gb|EEF76162.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] Length = 339 Score = 35.9 bits (81), Expect = 9.5, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 62/175 (35%), Gaps = 26/175 (14%) Query: 161 SVKVNSQTDARLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQ 220 K+ + ++ ++ LD+S SM + D + ++++ + K I+ ++E N+ V Sbjct: 79 GSKMETVKRQGVETVVALDISNSMLAQ-DVTPSRLEKSKKLISRLVETFN-----NDKVA 132 Query: 221 SGLVTFSNKIEEFFLLEWGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMR 277 + F+ + + + ++ +S T+ + A Sbjct: 133 M--IVFAGEAFAQLPITSDYISAKMFLESISPSLIATQGTDIRGAIDLAMKSFT------ 184 Query: 278 QHCNTEDANYKKIIVFMTDGENLSTKEDQQSLYYCNEAKKRGAIVYAIGIRVIRS 332 + + IV +TDGEN + + + V+ +G+ Sbjct: 185 -----PNEGVGRAIVLITDGENHEGGAVEAAKAAAEKG----VRVFVLGVGSPDG 230 >gi|221193689|gb|ACM07876.1| PI-2a ancillary protein 1 [Streptococcus agalactiae] Length = 896 Score = 35.9 bits (81), Expect = 9.5, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 79/234 (33%), Gaps = 27/234 (11%) Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQNEGYSIS 130 N + ++ D I + G D + + N Sbjct: 116 SNGKTTIQNSDDKKSIIEQRQEELDKQYPLTGAYEDTKESYNLEHVK--NSIPNGKLEAK 173 Query: 131 AISRYKIPLKFCTFIPWYT-------------NSRHIVMPITSSVKVNS-QTDARLDMMI 176 A++ Y + IP T N I + ++ + + D LD++ Sbjct: 174 AVNPYSSEGEHIREIPEGTLSKRISEVNDLDHNKYKIELTVSGKSIIKTINKDEPLDVVF 233 Query: 177 VLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLL 236 VLD S SM++ ++ A +++ ++++V N ++ LVT+ + I Sbjct: 234 VLDNSNSMKNNGKNNKA--KKAGEAVETIIKDVLGANVEN---RAALVTYGSDI-----F 283 Query: 237 EWGVSHLQRKIKYLSKFGVSTNSTPGLK-YAYNQIFDMQGMRQHCNTEDANYKK 289 + + + K G+ T+ T Y+Y + ++ ++A K Sbjct: 284 DGRTVKVIKGFKEDPYHGLETSFTVQTNDYSYKKFTNIAADIIKKIPKEAPEAK 337 >gi|153008591|ref|YP_001369806.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] gi|151560479|gb|ABS13977.1| TadE family protein [Ochrobactrum anthropi ATCC 49188] Length = 165 Score = 35.9 bits (81), Expect = 9.5, Method: Composition-based stats. Identities = 18/114 (15%), Positives = 35/114 (30%), Gaps = 7/114 (6%) Query: 11 FFYNYKGGMTILTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGN 70 F N G + AI P+ L+L +I + + + + T + Sbjct: 31 FKRNKSGTAAVEFAILAPVFLLILMGMIAFGLYLGVANAVQQL---AADATRTALA---- 83 Query: 71 GNNRKKLKGGDILCRIKNTWNMSFRNELRDNGFVNDIDDIVRSTSLDIVVVPQN 124 G + + KN S + + V++ ++ I QN Sbjct: 84 GIDPPERLTLATTYIQKNAAKYSLIDPAKMQVSVDNAQSDPNQFTVTIRYNAQN 137 >gi|256394713|ref|YP_003116277.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] gi|256360939|gb|ACU74436.1| von Willebrand factor type A [Catenulispora acidiphila DSM 44928] Length = 215 Score = 35.9 bits (81), Expect = 9.6, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 57/158 (36%), Gaps = 25/158 (15%) Query: 174 MMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSN---KI 230 +++VLD S SM+ + + + ++ ++ + L FS + Sbjct: 9 IVMVLDRSGSMQ----------AVKNDTEGGLTAFLEAQKEIAGDTRVSLYHFSTTYEPV 58 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKI 290 E L + L G +T + I ++ + + E+ + + Sbjct: 59 YENLALADVPEYT------LEPRG-NTALLDAIGR---TITAVKAQIKAMDVEERPGEVV 108 Query: 291 IVFMTDGENLSTKEDQ--QSLYYCNEAKKRGAIVYAIG 326 +V +TDG S++E + + + +G V +G Sbjct: 109 LVILTDGMENSSREYTLPEVKKLIEKRRAKGWQVVFLG 146 >gi|90409064|ref|ZP_01217189.1| hypothetical protein PCNPT3_10646 [Psychromonas sp. CNPT3] gi|90309821|gb|EAS37981.1| hypothetical protein PCNPT3_10646 [Psychromonas sp. CNPT3] Length = 641 Score = 35.9 bits (81), Expect = 9.6, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 60/183 (32%), Gaps = 34/183 (18%) Query: 121 VPQNEGYSISAISRY--KIPLKFCTFIPWY-TNSRHIVMPITSSVKVNSQTDARLDMMIV 177 P Y++ ++ PL T I + + I PI K +++ Sbjct: 45 SPNLVHYAVQNKMQHYFLYPLAILTLIALAGPSWQKISTPIYDIKKAQ---------VLI 95 Query: 178 LDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLE 237 +D+S SM + D ++ +++ LV ++ L Sbjct: 96 MDMSYSMYA-TDIKPDRLSQEKYKAMDLIKAWDEGEKA-------LVAYAGDAFTLSPLT 147 Query: 238 WGVSHLQRKIKYLSKF---GVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIVFM 294 + + I L+ + +N+ L+ A + + + H IVFM Sbjct: 148 RDSNAILNHIPNLTPEIMPVMGSNADAALQKAITLLTNAGYTKGH-----------IVFM 196 Query: 295 TDG 297 TDG Sbjct: 197 TDG 199 >gi|145632220|ref|ZP_01787955.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229844730|ref|ZP_04464869.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] gi|144987127|gb|EDJ93657.1| hypothetical protein CGSHi3655_07194 [Haemophilus influenzae 3655] gi|229812444|gb|EEP48134.1| hypothetical protein CGSHi6P18H1_03949 [Haemophilus influenzae 6P18H1] Length = 212 Score = 35.9 bits (81), Expect = 9.7, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 48/136 (35%), Gaps = 20/136 (14%) Query: 171 RLDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKI 230 RL + +++D S SM I+ + ++ ++ P ++TF ++ Sbjct: 3 RLPVYLLVDTSGSM------MGEAIESVRNGLQMLVSALRQDPYALETAYLSVITFDSQA 56 Query: 231 EEFFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK-- 288 ++ L + ++ + G+ T+ L + + A K Sbjct: 57 KQVTPL---TELMSFQLPTIEASGL-TSMGGALS------LLTDCINREVQKGSAEVKGD 106 Query: 289 -KIIVF-MTDGENLST 302 K +VF ++DG Sbjct: 107 WKPVVFLLSDGVPTDD 122 >gi|120603022|ref|YP_967422.1| outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] gi|120563251|gb|ABM28995.1| putative outer membrane adhesin like proteiin [Desulfovibrio vulgaris DP4] Length = 3038 Score = 35.9 bits (81), Expect = 9.9, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 58/161 (36%), Gaps = 14/161 (8%) Query: 174 MMIVLDVSRSMESFFDSS-ITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEE 232 ++IVLD S SM +++++A ++I ++ D V +V F + + Sbjct: 2244 LVIVLDTSGSMAWDSGVDGKSRMELAQEAIAKLMH----AYDDMGHVNIKIVDFYSDADA 2299 Query: 233 FFLLEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQGMRQHCNTEDANYKKIIV 292 G + + +YL+ PG Y+ +A+ ++ Sbjct: 2300 SKPWFEGNDAVAKATQYLT---TDNKFVPGGGTDYDDATAATASALQTGMPNADRT-VLY 2355 Query: 293 FMTDGENLSTKEDQQS---LYYCNEAK--KRGAIVYAIGIR 328 F +DG E + + + K I YAIGI Sbjct: 2356 FFSDGAPDPISEALNATEEKAWVDALKGVDNLDIAYAIGIG 2396 >gi|156933976|ref|YP_001437892.1| hypothetical protein ESA_01802 [Cronobacter sakazakii ATCC BAA-894] gi|156532230|gb|ABU77056.1| hypothetical protein ESA_01802 [Cronobacter sakazakii ATCC BAA-894] Length = 413 Score = 35.9 bits (81), Expect = 9.9, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 126 GYSISAISRYKIPLKFCTFIPWY----TNSRHIVMPITSSVKVNSQTDARL--DMMIVLD 179 Y + P K + R + + +SV + L + VLD Sbjct: 160 SYGVDVAQPAPQPAKPARISLEKKLETRSPRLVSLAKKASVSLTKNKLDTLKAAVAFVLD 219 Query: 180 VSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIEEFFLLEWG 239 S SM F ++S+ + + D + + +K L+ Sbjct: 220 ASGSMSGQFSKGN------VQSVLDRIAVLAAQFDDDGEMDVWGFGEKHKKYPNVTLD-- 271 Query: 240 VSHLQRKIKYLSKFGVSTNSTPGLKYAYNQI--FDMQGMRQHCNTEDANYKK------II 291 +L IK + + G + A+ + E +Y K + Sbjct: 272 --NLDTYIKSIRESGK--------RSAWENLPGLGGTNNEPPVMEEIVDYFKDSIIPVYV 321 Query: 292 VFMTDGENLSTKEDQQSL 309 VF+TDG T+ + ++ Sbjct: 322 VFITDGGISKTRAIKDAI 339 >gi|17980650|gb|AAL50672.1|AF447541_1 von Willebrand factor [Nycteris grandis] Length = 389 Score = 35.9 bits (81), Expect = 9.9, Method: Composition-based stats. Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 20/127 (15%) Query: 172 LDMMIVLDVSRSMESFFDSSITKIDMAIKSINAMLEEVKLIPDVNNVVQSGLVTFSNKIE 231 LD++ +LD S + + ++ + ++E + + ++ LV + Sbjct: 38 LDLVFLLDGSSKLSE------AEFEVLKAFVVDVMERLHI---SQKRIRVALVEYHEGSH 88 Query: 232 EFFLLEWGV--SHLQRKIKYLSKFGVS-TNSTPGLKYAYNQIFDMQGMRQHCNTEDANYK 288 + L S ++R + G + ++T LKY QIF T+ Sbjct: 89 AYIQLRDRKRPSEMRRIASQVKYAGSTVASTTEVLKYTLFQIFG--------KTDRPEAS 140 Query: 289 KIIVFMT 295 +I++ +T Sbjct: 141 RIVLLLT 147 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.312 0.126 0.295 Lambda K H 0.267 0.0380 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,202,858,655 Number of Sequences: 14124377 Number of extensions: 179161952 Number of successful extensions: 642955 Number of sequences better than 10.0: 7829 Number of HSP's better than 10.0 without gapping: 814 Number of HSP's successfully gapped in prelim test: 7268 Number of HSP's that attempted gapping in prelim test: 631902 Number of HSP's gapped (non-prelim): 10836 length of query: 374 length of database: 4,842,793,630 effective HSP length: 140 effective length of query: 234 effective length of database: 2,865,380,850 effective search space: 670499118900 effective search space used: 670499118900 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 82 (36.3 bits)